BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy2558
         (348 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
          Length = 803

 Score =  341 bits (874), Expect = 4e-91,   Method: Compositional matrix adjust.
 Identities = 176/366 (48%), Positives = 222/366 (60%), Gaps = 24/366 (6%)

Query: 3   CFYFFAGVALLSLTVSVSSFMVVGD------EKLHHLHHVKHTALFNYFLEQH------- 49
           C +   G  L  LTV   S++   D      + +H +  + H+A      EQ        
Sbjct: 442 CPFQEEGRMLCQLTVWERSWLKKIDLTSSKCDPIHTVMDISHSAELLGVDEQDKDYIKFK 501

Query: 50  ------NKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKY 103
                  ++Y T  E   R  IF  N++K   LQ TE G+  YG+  FSD+S+ EF+  Y
Sbjct: 502 FFTKKFQRSYKTTEELKKRFRIFRANMKKADYLQKTEQGTAKYGVTIFSDISSKEFKKHY 561

Query: 104 LGFKLK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           LG K + P    +   A IPNITLP  +DWR Y+AVT VK+Q MCGS WAFS TGNIEG 
Sbjct: 562 LGLKKRTPDIKFKQEMAQIPNITLPEEYDWRNYNAVTPVKNQGMCGSCWAFSVTGNIEGQ 621

Query: 163 YAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
           YA KT  LVSLSEQEL+DCD+ DDGCEGG    A+  I     GGLE E  YPY G D  
Sbjct: 622 YAIKTGNLVSLSEQELVDCDKYDDGCEGGLFETAYHAIEEL--GGLELESDYPYSGRDNT 679

Query: 223 CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCD 282
           C  N    +V I   V++S DETDMAK+LV NGP+++ INA A+QFY+ GVSHP++F CD
Sbjct: 680 CHFNSSEVRVSITSSVNISNDETDMAKWLVANGPISIGINANAMQFYLGGVSHPLKFLCD 739

Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDY 342
              + L H VLIVGYG+ RT   H+ +PYW+IKNSW   WG KGY+ LYRGDGSCG+N +
Sbjct: 740 P--KTLDHGVLIVGYGIHRTWLLHRHLPYWLIKNSWSSYWGAKGYYMLYRGDGSCGVNQW 797

Query: 343 VRSALV 348
             SA++
Sbjct: 798 PSSAVL 803


>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
          Length = 586

 Score =  340 bits (873), Expect = 5e-91,   Method: Compositional matrix adjust.
 Identities = 164/311 (52%), Positives = 220/311 (70%), Gaps = 9/311 (2%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+  HNK Y +L E   R  IF+ N++K++LLQ+ E GS +YG  +F+DL+  EF+ 
Sbjct: 280 FENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFADLTKNEFKK 339

Query: 102 KYLGFKLKPSYADRSVP-AMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
           KYLG     + + +++P A+IP + ++P  FDWR ++ VT VK+Q  CGS WAFS   NI
Sbjct: 340 KYLGLDSSMT-SKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCWAFSAIANI 398

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG- 218
           EG YA K+K+L+SLSEQELIDCD  D+GC GG ++ AF+ + +   GGLE E  YPY G 
Sbjct: 399 EGQYALKSKELLSLSEQELIDCDNLDNGCGGGLMTQAFEAVENL--GGLETESDYPYEGH 456

Query: 219 -DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
            D K C+L K   +V I+  V+VS DE D+AK+LV++GP++V +NA A+QFY+ GVSHPI
Sbjct: 457 ADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPI 516

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
              C    ++L H V IVGYGV RTK+THK +PYW+IKNSWG GWGEKGY+ LYRGDGSC
Sbjct: 517 HALCSP--KSLDHGVAIVGYGVHRTKYTHKNLPYWLIKNSWGPGWGEKGYYLLYRGDGSC 574

Query: 338 GINDYVRSALV 348
           G+N  V SA++
Sbjct: 575 GVNQMVSSAII 585


>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
          Length = 1032

 Score =  334 bits (856), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 165/327 (50%), Positives = 223/327 (68%), Gaps = 11/327 (3%)

Query: 27   DEKLHHL-HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
            +EK+  +   ++   LF  F+  +N+TYAT  E   RL IF  NL  I+LL+  E G+G 
Sbjct: 711  NEKMLRIAEDMRSERLFENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQGTGQ 770

Query: 86   YGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVP---AMIPNITLPRAFDWREYDAVTGV 141
            YG+N+F+D+ST EF A YLG  L+P    + ++P   A IP+I LP +FDWR+  AVT V
Sbjct: 771  YGVNQFADVSTEEFHAFYLG--LRPDLRTENNIPLRQAEIPDIELPNSFDWRQKGAVTPV 828

Query: 142  KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
            K+Q MCGS WAFS TGN+EG YA K  KL+SLSEQEL+DCD  D+GC GG   NA+  I 
Sbjct: 829  KNQGMCGSCWAFSVTGNVEGQYAIKHNKLLSLSEQELVDCDDLDEGCNGGLPDNAYRAI- 887

Query: 202  SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
             KL GGLE E  YPY  +++ C   K   +V++   V+++ +ET +A++LV NGP+++ I
Sbjct: 888  EKL-GGLELESDYPYEAENERCHFKKNMAKVQVGSAVNITSNETQIAQWLVANGPISIGI 946

Query: 262  NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
            NA A+QFY+ GVSHP +F C+   +NL H VLIVGYG       HK +PYWI+KNSWG+ 
Sbjct: 947  NANAMQFYMGGVSHPFKFLCNP--KNLDHGVLIVGYGTSNYPLFHKKLPYWIVKNSWGDR 1004

Query: 322  WGEKGYFRLYRGDGSCGINDYVRSALV 348
            WGE+GY+R+YRGDG+CG+N    SA+V
Sbjct: 1005 WGEQGYYRVYRGDGTCGLNTMASSAVV 1031


>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
          Length = 887

 Score =  334 bits (856), Expect = 5e-89,   Method: Compositional matrix adjust.
 Identities = 165/323 (51%), Positives = 217/323 (67%), Gaps = 10/323 (3%)

Query: 30  LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
           L     V+   LFN F+  +N+TY+T  E   RL IF  NL  IQLL+ TE G+  Y +N
Sbjct: 570 LQIAEDVRSEQLFNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERGTAHYDVN 629

Query: 90  EFSDLSTAEFQAKYLGFKLKPSY-ADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQT 145
            F+D+S  EF+++YLG  L+P   ++  +P   A IP++ LP  FDWRE   VT VKDQ 
Sbjct: 630 MFADMSPEEFRSRYLG--LRPDLRSENDIPLREAEIPDVELPPKFDWREKSVVTPVKDQG 687

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLG 205
           MCGS WAFS TGNIEG YA K  +L+SLSEQEL+DCD  D+GC GG   NA+  I  KL 
Sbjct: 688 MCGSCWAFSVTGNIEGQYAIKHGRLLSLSEQELVDCDDLDEGCNGGLPDNAYRAI-EKL- 745

Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA 265
           GGLE E  YPY  +++ C   K   +V++   V+++ +ET MA++LV+NGP+++ INA A
Sbjct: 746 GGLELESDYPYEAENEKCHFKKNLAKVQLASAVNITSNETQMAQWLVQNGPISIGINANA 805

Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
           +QFYV GVSHP +F C+   +NL H VLIVGYG       HK +PYW IKNSWG+ WGE+
Sbjct: 806 MQFYVGGVSHPFKFLCNP--KNLDHGVLIVGYGTSDYPLFHKKLPYWTIKNSWGKRWGEQ 863

Query: 326 GYFRLYRGDGSCGINDYVRSALV 348
           GY+R+YRGDG+CG+N    SA+V
Sbjct: 864 GYYRVYRGDGTCGLNTLATSAVV 886


>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
 gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
          Length = 434

 Score =  330 bits (846), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 159/308 (51%), Positives = 211/308 (68%), Gaps = 9/308 (2%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+ + NK Y +  E+  R  IF  N++KI  L   E G+  YG+ EFSDLS  EF+ 
Sbjct: 134 FKDFVLKFNKVYFSKEEFKKRFRIFRANMKKINFLNKAEKGTAQYGITEFSDLSVTEFK- 192

Query: 102 KYLGFKLKPSYADRSVP-AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            YLG K KP   +  +P A IP++ LP  FDWR Y+AVT VK+Q  CGS WAFS TGNIE
Sbjct: 193 NYLGLKKKP---ESKLPTAEIPDVKLPDNFDWRHYNAVTPVKNQGSCGSCWAFSVTGNIE 249

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G++A K  +L+SLSEQELIDCD+ D+GC GG +   ++ IM KL GGLE E  YPY  ++
Sbjct: 250 GLWAIKKHELLSLSEQELIDCDKIDNGCNGGYMPETYEAIM-KL-GGLETETDYPYEAEN 307

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           + C LNK   +VKING V++++ E D+AK+L +NGP++  +NA A+QFY+ G+SHP +  
Sbjct: 308 EKCNLNKTEIKVKINGAVNLTKSELDIAKWLYKNGPVSAGLNANAMQFYLGGISHPPKIL 367

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           C+   E   H +LIVGYG+ ++    + +PYWIIKNSWG+ WGEKGY+RLYRG G CGIN
Sbjct: 368 CNP--EEQDHGILIVGYGIHKSSILKRTIPYWIIKNSWGKHWGEKGYYRLYRGSGVCGIN 425

Query: 341 DYVRSALV 348
             V SAL+
Sbjct: 426 QMVSSALI 433


>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
           rotundata]
          Length = 884

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 162/315 (51%), Positives = 209/315 (66%), Gaps = 8/315 (2%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
           K   LF  F++ +NKTY +  E   R  +F  NL+ I+ L+  E G+ VYG+  F+DL+ 
Sbjct: 574 KDELLFEDFVKTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQGTAVYGVTMFADLTP 633

Query: 97  AEFQAKYLGFKLKPSYADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
            EF+ KYLG K   +  +  +P   A+IP+I LP  FDWREY+AVT VKDQ  CGS WAF
Sbjct: 634 EEFKTKYLGLKTNLN-QENDIPLQEAVIPDIDLPPKFDWREYNAVTPVKDQGQCGSCWAF 692

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG YA K KKL+SLSEQEL+DCD  DDGC GG + NA+ T+  KL GGLE E  
Sbjct: 693 SAIGNIEGQYAIKHKKLLSLSEQELVDCDNLDDGCGGGYMINAYKTV-EKL-GGLELETD 750

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
           YPY   ++ C   K   +V++   ++++ DE  MA++LV+NGP++V INA A+QFY  GV
Sbjct: 751 YPYDARNEKCHFLKNKAKVQVASALNITNDEKKMAQWLVKNGPISVGINANAMQFYFGGV 810

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
           SHP +F CD    NL H VLIVGY         K +PYWIIKNSWG  WGE+GY+R+YRG
Sbjct: 811 SHPFKFLCDPA--NLDHGVLIVGYATSTYPLFKKKLPYWIIKNSWGPKWGEQGYYRVYRG 868

Query: 334 DGSCGINDYVRSALV 348
           DG+CG+N    SA+V
Sbjct: 869 DGTCGVNAMASSAIV 883


>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
          Length = 774

 Score =  328 bits (840), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 157/317 (49%), Positives = 219/317 (69%), Gaps = 11/317 (3%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K   LFN F+  +N+TY++L E   R  IF  NL  I+ L++TE G+G+YG+N F+D+S
Sbjct: 464 MKAERLFNNFMTTYNRTYSSL-ERNLRFKIFRENLNFIEELRETEQGTGIYGVNMFADMS 522

Query: 96  TAEFQAKYLGFKLKPSY-ADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
             EF+ +YLG  L+P   ++  +P   A IP+I LP +FDWR+   VT VK+Q  CGS W
Sbjct: 523 QKEFRTRYLG--LRPDLQSENEIPLPKAEIPDIDLPSSFDWRQKGVVTPVKNQGQCGSCW 580

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
           AFS TGN+EG YA K  +L+SLSEQEL+DCD  D+GC GG   NA+  I  +  GGLE E
Sbjct: 581 AFSVTGNVEGQYAIKHGQLLSLSEQELVDCDHLDEGCNGGLPDNAYRAI--EQLGGLELE 638

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
             YPY  +++ C   +   +V++   V+++ +ET +A++LV+NGP+A+ INA A+QFY+ 
Sbjct: 639 SDYPYEAENEKCHFKQNLVKVELASAVNITSNETQIAQWLVQNGPIAIGINANAMQFYMG 698

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GVSHP++  C+    NL+H VLIVGYG  R    HK +PYWIIKNSWG+ WGE+GY+R+Y
Sbjct: 699 GVSHPLKILCNPN--NLNHGVLIVGYGTSRYPLFHKNLPYWIIKNSWGKSWGEQGYYRVY 756

Query: 332 RGDGSCGINDYVRSALV 348
           RGDG+CG+N    SA+V
Sbjct: 757 RGDGTCGLNTMASSAVV 773


>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
          Length = 884

 Score =  326 bits (835), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 156/319 (48%), Positives = 211/319 (66%), Gaps = 10/319 (3%)

Query: 34  HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
             +K   LF  F+++  KTY +  E   R  IF  NL+ I+ LQ  E G+  YG+  F+D
Sbjct: 571 EEIKDETLFEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERGTAEYGVTMFAD 630

Query: 94  LSTAEFQAKYLGFKLKPSYA-DRSVP---AMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
           L+  EF+A+YLG  L+P    +  +P   A IP+++LP  FDWR++  VT VKDQ  CGS
Sbjct: 631 LTPKEFKARYLG--LRPELKHENEIPLPEAEIPDVSLPLKFDWRDHSVVTPVKDQGQCGS 688

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
            WAFS TGN+EG YA K  +L+SLSEQEL+DCD  D+GC GG + NA+  I  +  GGLE
Sbjct: 689 CWAFSVTGNVEGQYAIKHNQLLSLSEQELVDCDSLDEGCNGGDMENAYKAI--ERLGGLE 746

Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
            E  YPY   D+ C   +   +V++   V+++ DE  MA++LV+NGP++V INA A+QFY
Sbjct: 747 LESDYPYDAKDEKCHFLQNKAKVQVVSAVNITSDEKRMAQWLVKNGPISVGINANAMQFY 806

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GVSHP+ F C+   +NL H VLIVGYG+ +    HK +PYWIIKNSWG  WGE+GY+R
Sbjct: 807 FGGVSHPLNFLCNP--KNLDHGVLIVGYGISKYPLFHKELPYWIIKNSWGPRWGERGYYR 864

Query: 330 LYRGDGSCGINDYVRSALV 348
           +YRGDG+CG+N    SA+V
Sbjct: 865 VYRGDGTCGVNTMATSAVV 883


>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
 gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
          Length = 2676

 Score =  323 bits (828), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 158/318 (49%), Positives = 207/318 (65%), Gaps = 7/318 (2%)

Query: 34   HHVKHTALFNYFLEQHNKTYAT-LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
            HH++   LF  FL  +   Y     +   R  IF  N+RK+  L   E G+  YG+  F+
Sbjct: 2363 HHLQAEHLFYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHERGTATYGVTRFA 2422

Query: 93   DLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
            DL+  EF  K++G K  L+     +   A+IPN+T P +FDWR++ AVTGVKDQ  CGS 
Sbjct: 2423 DLTYEEFSTKHMGMKASLRDPNQVQFRKAVIPNVTAPDSFDWRDHGAVTGVKDQGSCGSC 2482

Query: 151  WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
            WAFS TGNIEG +  KT  LVSLSEQEL+DCD+ D GC GG   NA+  I  +  GGLE 
Sbjct: 2483 WAFSVTGNIEGQWKMKTGDLVSLSEQELVDCDKLDQGCNGGLPDNAYRAI--EQLGGLES 2540

Query: 211  EKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
            E  YPY G D  C  NK   +V+I+G V+++ +ETDMAK+LV++GP+++ INA A+QFY+
Sbjct: 2541 EDDYPYEGSDDKCSFNKTLARVQISGAVNITSNETDMAKWLVKHGPISIGINANAMQFYM 2600

Query: 271  TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
             G+SHP +  C+    NL H VLIVGYG       HK +PYWIIKNSWG  WGE+GY+R+
Sbjct: 2601 GGISHPWRMLCNPS--NLDHGVLIVGYGAKDYPLFHKHLPYWIIKNSWGTSWGEQGYYRV 2658

Query: 331  YRGDGSCGINDYVRSALV 348
            YRGDG+CG+N    SA+V
Sbjct: 2659 YRGDGTCGVNQMASSAVV 2676


>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
          Length = 537

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 160/319 (50%), Positives = 212/319 (66%), Gaps = 8/319 (2%)

Query: 34  HHVKHTALFNYFLEQHNKTYAT-LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           HHV+   LF  F+  +   Y    VE   R  IF  N++KI  L   E G+GVY +  F+
Sbjct: 223 HHVQAEQLFFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHERGTGVYAVTRFT 282

Query: 93  DLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGS 149
           DL+  EF++KYLG    LK         A IP +  LP +FDWR   AVT VKDQ  CGS
Sbjct: 283 DLTYEEFKSKYLGLNPNLKKPNQIPMRQAEIPKVHQLPASFDWRPLGAVTEVKDQGACGS 342

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
            WAFS TGNIEG +  KT KL+SLSEQEL+DCD+ DDGC+GG + NA+  I  +  GGLE
Sbjct: 343 CWAFSVTGNIEGQWKLKTGKLLSLSEQELVDCDKMDDGCDGGYMDNAYRAI--EQLGGLE 400

Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
            E+ YPY  +D  C  NK  ++V+I+G V++S +ET+MAK+LV NGP+++ INA A+QFY
Sbjct: 401 TEEEYPYEAEDDKCSFNKSLSKVQISGAVNISSNETNMAKWLVHNGPISIGINANAMQFY 460

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
           V GVSHP +  C+   +N+ H VLIVGYG+      +K +PYW++KNSWG GWGE+GY+R
Sbjct: 461 VGGVSHPWKALCNP--KNIDHGVLIVGYGIKEYPLFNKQLPYWVVKNSWGPGWGEQGYYR 518

Query: 330 LYRGDGSCGINDYVRSALV 348
           ++RGDG+CG+N    SA+V
Sbjct: 519 VFRGDGTCGVNTMASSAVV 537


>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
          Length = 1036

 Score =  322 bits (826), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 160/323 (49%), Positives = 212/323 (65%), Gaps = 10/323 (3%)

Query: 30   LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
            L     +K   LF+ F+ ++ K Y    E   R  IF  NL  I+ LQ  E G+G YG+ 
Sbjct: 719  LQQSRQLKEEILFHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMGTGRYGVT 778

Query: 90   EFSDLSTAEFQAKYLGFKLKPSY-ADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQT 145
            +F+DL+ AEF+A++LG  LKP+  ++  +P   A IP+I LP  +DWR ++ VT VKDQ 
Sbjct: 779  QFTDLTKAEFKARHLG--LKPTLKSENDIPMPMATIPDIELPSDYDWRHHNVVTPVKDQG 836

Query: 146  MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLG 205
             CGS WAFS TGNIEG YA K  +L+SLSEQEL+DCD+ D GC GG    A+  I     
Sbjct: 837  SCGSCWAFSVTGNIEGQYAIKHGELLSLSEQELVDCDKLDSGCNGGLPDTAYRAIEEL-- 894

Query: 206  GGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA 265
            GGLE E  YPY  +D+ C  NK   +V I   ++++ +ET MA++LV+NGPM++ INA A
Sbjct: 895  GGLELESDYPYDAEDEKCHFNKNKVKVNIVSGLNITSNETQMAQWLVKNGPMSIGINANA 954

Query: 266  LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
            +QFY+ GVSHP +F C    ++L H VLIVGYGV       K +PYWIIKNSWG  WGE+
Sbjct: 955  MQFYMGGVSHPFKFLC--SPDSLDHGVLIVGYGVKFYPIFKKTMPYWIIKNSWGPRWGEQ 1012

Query: 326  GYFRLYRGDGSCGINDYVRSALV 348
            GY+R+YRGDG+CG+N  V SA+V
Sbjct: 1013 GYYRVYRGDGTCGVNKMVTSAVV 1035


>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
          Length = 586

 Score =  318 bits (816), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 154/311 (49%), Positives = 214/311 (68%), Gaps = 9/311 (2%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+  HNK Y +L E   R  IF+ N++K++LLQ+ E GS +YG  +F+DL+  EF+ 
Sbjct: 280 FENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQFADLTKNEFKK 339

Query: 102 KYLGFKLKPSYADRSVP-AMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
           KYLG     + + +++P A+IP + ++P  FDWR ++ VT VK+Q  CGS WAFS   NI
Sbjct: 340 KYLGLDSSMT-SKKTLPMAVIPQSASIPNEFDWRNHNVVTPVKNQGACGSCWAFSAIANI 398

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG- 218
           EG YA K+K+L+SLSEQELIDCD  D+GC GG ++ AF+ + +   GGLE E  YPY G 
Sbjct: 399 EGQYALKSKELLSLSEQELIDCDNLDNGCGGGLMTQAFEAVENL--GGLETESDYPYEGH 456

Query: 219 -DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
            D K C+L K   +V I+  V+VS DE D+AK+LV++GP++V +NA A+QFY+ GVSHPI
Sbjct: 457 ADRKGCQLKKSDVKVSISKAVNVSTDEEDIAKFLVKHGPLSVGVNANAMQFYMGGVSHPI 516

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
              C    ++L H V IVGYGV +  + +  +P+W IKNSWG+ WG +GY+ LYRGDGSC
Sbjct: 517 HALCSP--KSLDHGVAIVGYGVHKYPYLNATLPFWTIKNSWGDKWGMQGYYLLYRGDGSC 574

Query: 338 GINDYVRSALV 348
           G+N  V SA++
Sbjct: 575 GVNQMVSSAII 585


>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
          Length = 881

 Score =  312 bits (799), Expect = 2e-82,   Method: Compositional matrix adjust.
 Identities = 160/321 (49%), Positives = 218/321 (67%), Gaps = 6/321 (1%)

Query: 30  LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
           L    ++K+  LF  F+ + NKT+++  E  +R  IF  NL+ I+ LQ  E G+  YG+ 
Sbjct: 564 LKLAQNIKYETLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIIKELQTFEQGTAEYGVT 623

Query: 90  EFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
            F+DL+  EF+ +YLGF+  LK           + +I LP  FDWR+Y+AVT VKDQ +C
Sbjct: 624 MFADLTPKEFKTRYLGFRPELKQENEIPLAKIEVSDIFLPPKFDWRDYNAVTPVKDQGLC 683

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
           GS WAFS TGN+EG YA K KKL+SLSEQEL+DCD  D+GC GG + NA+  I  KL GG
Sbjct: 684 GSCWAFSVTGNVEGQYAIKYKKLLSLSEQELLDCDTLDEGCNGGYMENAYKAI-EKL-GG 741

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
           LE E  YPY G ++ C   KK  +V++ G V+++ +ET MA++L++NGP+++ INA A+Q
Sbjct: 742 LELESDYPYDGRNEKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANAMQ 801

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
           FY+ GVSHP  F C+   ++L H VLIVGYG+ +    HK +PYWIIKNSWG  WGE GY
Sbjct: 802 FYIGGVSHPFHFLCNP--KDLDHGVLIVGYGISKYPLFHKELPYWIIKNSWGSRWGENGY 859

Query: 328 FRLYRGDGSCGINDYVRSALV 348
           +R+YRGDG+CG+N    SA+V
Sbjct: 860 YRVYRGDGTCGVNAMASSAIV 880


>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
            castaneum]
          Length = 1726

 Score =  309 bits (791), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 153/313 (48%), Positives = 212/313 (67%), Gaps = 8/313 (2%)

Query: 38   HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTA 97
            H +LF  FL+++NK Y     Y  R ++F  NL +I++L   E G+  YG+  F+D++  
Sbjct: 1419 HLSLFTDFLKKYNKKYHKKE-YKYRFNVFVQNLMQIRVLNTFEQGTATYGITRFADMTQK 1477

Query: 98   EFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            EF ++ LG +  L+         A IPNI LP+ FDWR+ + VT VK+Q  CGS WAFS 
Sbjct: 1478 EF-SRSLGLRTDLRNENETPFAQAKIPNIELPKEFDWRKKNVVTEVKNQEQCGSCWAFSV 1536

Query: 156  TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
            TGN+EG YA +  KL+  SEQEL+DCD +D GC GG +  A+ +I  K+ GGLE E+ YP
Sbjct: 1537 TGNVEGQYALRHGKLLEFSEQELVDCDTDDQGCNGGLMDTAYRSI-EKI-GGLETEQDYP 1594

Query: 216  YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
            Y  +D+ C  N+   +V++ G +++S +ETDMAK+LV NGP+++AINA A+QFY+ GVSH
Sbjct: 1595 YDAEDEKCHFNRTLARVQVTGALNISHNETDMAKWLVANGPISIAINANAMQFYMGGVSH 1654

Query: 276  PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
            P +F C    +NL H VLIVGYGV       K++PYWI+KNSWG GWGE+GY+R+YRGDG
Sbjct: 1655 PFKFLC--SPKNLDHGVLIVGYGVHNYPLFKKSLPYWIVKNSWGTGWGEQGYYRVYRGDG 1712

Query: 336  SCGINDYVRSALV 348
            +CG+N    SA+V
Sbjct: 1713 TCGLNQTPSSAIV 1725


>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
          Length = 1761

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 151/313 (48%), Positives = 210/313 (67%), Gaps = 8/313 (2%)

Query: 38   HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTA 97
            H +LF  FL+++        EY  R ++F  NL +I++L   E G+  YG+  F+D++  
Sbjct: 1454 HLSLFTDFLKKY-NKKYHKKEYKYRFNVFVQNLMQIRVLNTFEQGTATYGITRFADMTQK 1512

Query: 98   EFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            EF ++ LG +  L+         A IPNI LP+ FDWR+ + VT VK+Q  CGS WAFS 
Sbjct: 1513 EF-SRSLGLRTDLRNENETPFAQAKIPNIELPKEFDWRKKNVVTEVKNQEQCGSCWAFSV 1571

Query: 156  TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
            TGN+EG YA +  KL+  SEQEL+DCD +D GC GG +  A+ +I  K+ GGLE E+ YP
Sbjct: 1572 TGNVEGQYALRHGKLLEFSEQELVDCDTDDQGCNGGLMDTAYRSI-EKI-GGLETEQDYP 1629

Query: 216  YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
            Y  +D+ C  N+   +V++ G +++S +ETDMAK+LV NGP+++AINA A+QFY+ GVSH
Sbjct: 1630 YDAEDEKCHFNRTLARVQVTGALNISHNETDMAKWLVANGPISIAINANAMQFYMGGVSH 1689

Query: 276  PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
            P +F C    +NL H VLIVGYGV       K++PYWI+KNSWG GWGE+GY+R+YRGDG
Sbjct: 1690 PFKFLC--SPKNLDHGVLIVGYGVHNYPLFKKSLPYWIVKNSWGTGWGEQGYYRVYRGDG 1747

Query: 336  SCGINDYVRSALV 348
            +CG+N    SA+V
Sbjct: 1748 TCGLNQTPSSAIV 1760


>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
          Length = 715

 Score =  308 bits (790), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 158/310 (50%), Positives = 213/310 (68%), Gaps = 14/310 (4%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F     + Y +  E  +R  IF  N+RK + LQD E G+ VYG+ +F+D+S +EF+
Sbjct: 417 VFQQFQAAFKRLYMSKQEEKTRFKIFCENMRKAKKLQDVEKGTAVYGVTKFADMSESEFK 476

Query: 101 AKYLGFKLKPSYADRSVP-AMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            +Y+G K+    A++ +  A IP + +LP +FDWRE+ AVT VK+Q  CGS WAFSTTGN
Sbjct: 477 -QYVG-KVWDQNANKGMKKAKIPEMNSLPNSFDWREHGAVTEVKNQGSCGSCWAFSTTGN 534

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IEG +A   KKLVSLSEQEL+DCD+ D+GC GG  S A+  I+    GGLE E  Y YRG
Sbjct: 535 IEGQWAISKKKLVSLSEQELVDCDKVDEGCNGGLPSQAYKEIIRL--GGLETETDYKYRG 592

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
            ++ C ++K   +VKING VS+S +ET+MA +LV+NGP+++ INA+A+QFY+ G+SHP +
Sbjct: 593 HNEKCSMDKSKIRVKINGSVSISSNETEMAAWLVKNGPISIGINAFAMQFYMGGISHPWK 652

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
            FC+   + L H VLIVGYGV  +K      PYWIIKNSWG  WGEKGY+ +YRG G CG
Sbjct: 653 IFCN--PKELDHGVLIVGYGVKGSK------PYWIIKNSWGPDWGEKGYYLVYRGAGVCG 704

Query: 339 INDYVRSALV 348
           +N    SA+V
Sbjct: 705 LNTMCTSAVV 714


>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis
           mellifera]
          Length = 881

 Score =  308 bits (789), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 159/321 (49%), Positives = 214/321 (66%), Gaps = 6/321 (1%)

Query: 30  LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
           L     +K   LF  F+ + NKT+++  E  +R  IF  NL+ I  LQ  E G+  YG+ 
Sbjct: 564 LKLAQDIKDEMLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIINELQTFEQGTAEYGVT 623

Query: 90  EFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
            F+DL+  EF+ +YLGF+  LK           + +I LP  FDWR+Y+ VT VKDQ +C
Sbjct: 624 MFADLTPKEFKTRYLGFRPELKQENEIPLAKIEVSDIFLPLKFDWRDYNVVTPVKDQGLC 683

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
           GS WAFS TGN+EG YA K KKL+SLSEQEL+DCD  D+GC GG + NA+  I  KL GG
Sbjct: 684 GSCWAFSVTGNVEGQYAIKYKKLLSLSEQELLDCDTLDEGCNGGYMENAYKAI-EKL-GG 741

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
           LE E  YPY G ++ C   KK  +V++ G V+++ +ET MA++L++NGP+++ INA A+Q
Sbjct: 742 LELESDYPYDGRNEKCHFFKKNAKVQVVGAVNITSNETKMAQWLIKNGPISIGINANAMQ 801

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
           FY+ GVSHP  F C+   ++L H VLIVGYG+ +    HK +PYWIIKNSWG  WGE GY
Sbjct: 802 FYIGGVSHPFHFLCNP--KDLDHGVLIVGYGISKYPLFHKKLPYWIIKNSWGSRWGENGY 859

Query: 328 FRLYRGDGSCGINDYVRSALV 348
           +R+YRGDG+CG+N    SA+V
Sbjct: 860 YRVYRGDGTCGVNAMASSAIV 880


>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
           pulchellus]
          Length = 475

 Score =  303 bits (777), Expect = 6e-80,   Method: Compositional matrix adjust.
 Identities = 158/349 (45%), Positives = 216/349 (61%), Gaps = 11/349 (3%)

Query: 7   FAGVALLSLTVSVSSFMVVGDEKL----HHLHHVKHTALFNYFLEQHNKTYATLVEYYSR 62
           F   A +S+   +S  +   D        HL   +  +LF+ F   +NKTY    E+ +R
Sbjct: 127 FTCEAAMSIVTRISGVLDPKDLTFAYLSKHLKLSQERSLFSVFARTYNKTYKDKEEHEAR 186

Query: 63  LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK--LKPSYADRSVPAM 120
             IF  NL++I L    E G+  YGL EFSDLS +EF+  YLG K  L    A+     +
Sbjct: 187 FMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFERHYLGLKKDLAEHKAEVKPIKV 246

Query: 121 IP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
            P N  LP  FDWR   AVT VK+Q MCGS WAFS TGN+EG +     KL+SLSEQEL+
Sbjct: 247 GPVNEPLPDLFDWRTKGAVTEVKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQELV 306

Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
           DCD  D GC+GG +  A   ++    GGLE E  YPY+G D  C  NK  ++ ++  +V 
Sbjct: 307 DCDHGDHGCKGGYMGQAMKAVIEM--GGLETESEYPYKGVDGTCEFNKTESKARVQSFVG 364

Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
           + ++ET++A +L+++GP+++ INA A+QFY  G+SHP +F C     +L H VL+VG+GV
Sbjct: 365 LPQNETELAYWLMKHGPVSIGINANAMQFYFGGISHPWKFLCS--PTDLDHGVLLVGFGV 422

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           D+  F  K VPYWI+KNSWG+ WGEKGY+R+YRGDG+CG+N    SA+V
Sbjct: 423 DKRSFRRKPVPYWIVKNSWGKYWGEKGYYRVYRGDGTCGVNQMALSAVV 471


>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
          Length = 471

 Score =  299 bits (765), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 145/318 (45%), Positives = 201/318 (63%), Gaps = 6/318 (1%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           H+L+ V+H  LF  F  +  + Y T +E   R  IF  NL+ I+ L   E GS  YG+ E
Sbjct: 157 HNLNKVEH--LFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITE 214

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
           F+D+++ E++ +   ++  P  A  +  A IPNI LP+ FDWRE  A++ VK+Q  CGS 
Sbjct: 215 FADMTSPEYKQRTGLWQRDPQKAASNPKAEIPNIDLPKEFDWREKGAISAVKNQGNCGSC 274

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS TGNIEG++A +T  L   SEQEL+DCD  D  C GG   NA++ I  K+ GGLE 
Sbjct: 275 WAFSVTGNIEGLHAVRTGVLEQYSEQELLDCDTSDSACNGGLPDNAYEAI-EKI-GGLEL 332

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           E  YPY      C  N     VK+ G+V + ++ET +A++L+ NGP+++ INA A+QFY 
Sbjct: 333 ESDYPYHARKDQCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPISIGINANAMQFYR 392

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            GVSHP    C    +NL H VLIVGYGV       K +PYWI+KNSWG+ WGE+GY+R+
Sbjct: 393 GGVSHPPHILC--SRKNLDHGVLIVGYGVSDYPMFKKTLPYWIVKNSWGKKWGEQGYYRV 450

Query: 331 YRGDGSCGINDYVRSALV 348
           YRGD +CG+++   SA++
Sbjct: 451 YRGDNTCGVSEMSSSAVL 468


>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
           kowalevskii]
          Length = 352

 Score =  298 bits (763), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 159/314 (50%), Positives = 197/314 (62%), Gaps = 13/314 (4%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
           K   LF  F++ ++K Y T  E+  R  IF  NL K + LQ TE  +G YG+ +F DLS 
Sbjct: 49  KTQDLFQDFMKTYDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQATGQYGVTKFMDLSE 108

Query: 97  AEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD--AVTGVKDQTMCGSSWAFS 154
            EF+  YL    + S       A IP  T P AFDWR+ D  AVT VK+Q  CGS WAFS
Sbjct: 109 EEFRKYYLTPVWRGSDPHMK-KAEIPKGTPPAAFDWRDADKNAVTKVKNQGTCGSCWAFS 167

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           TTGNIEG +  K   LVSLSEQEL+DCD+ D GC GG  SNA+  IM    GG+  E  Y
Sbjct: 168 TTGNIEGQWKIKKGTLVSLSEQELVDCDKLDQGCNGGLPSNAYQEIMRF--GGIMSEDDY 225

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
           PY G D+ C+LN    +V ING +++S+DE DMA +L  NGP+++ INA A+QFY  GVS
Sbjct: 226 PYTGRDQDCKLNATLNKVYINGSMNISKDEGDMASWLAANGPISIGINANAMQFYFGGVS 285

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
           HP + FC+   ENL H VLIVGYG      T    PYWIIKNSWG  WG +GY+ +YRG 
Sbjct: 286 HPWKIFCN--PENLDHGVLIVGYG------TKDGTPYWIIKNSWGRSWGVEGYYLVYRGG 337

Query: 335 GSCGINDYVRSALV 348
           G CG+N+   SA+V
Sbjct: 338 GVCGLNEMCTSAIV 351


>gi|38683931|gb|AAR27011.1| cysteine protease [Periserrula leucophryna]
          Length = 283

 Score =  297 bits (761), Expect = 5e-78,   Method: Compositional matrix adjust.
 Identities = 147/287 (51%), Positives = 190/287 (66%), Gaps = 6/287 (2%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
           R  IF  N++KI  L D E G   YG+ +FSDL+  EF+  YL  K   S+    V A I
Sbjct: 2   RFKIFRENMKKINTLNDNELGDAEYGVTQFSDLAEEEFRRYYLTPKWDLSHRPDLVRAKI 61

Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
           P++  P +FDWR+++AVT VK+Q MCGS WAFSTT NIEG +A    KLVSLSEQEL+DC
Sbjct: 62  PDVDPPASFDWRDHNAVTPVKNQGMCGSCWAFSTTENIEGQWAIHRNKLVSLSEQELVDC 121

Query: 182 DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
           D+ DDGCEGG   NA++ I+    GGLE EK YPY  +D+ C+       V IN  V++S
Sbjct: 122 DKLDDGCEGGLPVNAYEEIIRL--GGLESEKKYPYDAEDEKCKFTVGDVAVYINSSVNIS 179

Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
            +E DMA +L +NGP+++ INA+A+QFY+ GVSHP  F C    + L H VLIVGYG  +
Sbjct: 180 SNEADMAAWLYKNGPISIGINAFAMQFYMGGVSHPFSFLC--SPDELDHGVLIVGYGTKK 237

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             F+    PYWI+KNSWG  WG +GY+ +YRGDG CG+N    SA+V
Sbjct: 238 GWFSDS--PYWIVKNSWGASWGVQGYYLVYRGDGVCGLNKMPTSAIV 282


>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
          Length = 471

 Score =  296 bits (757), Expect = 1e-77,   Method: Compositional matrix adjust.
 Identities = 144/318 (45%), Positives = 200/318 (62%), Gaps = 6/318 (1%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           H+L+ V+H  LF  F  +  + Y T +E   R  IF  NL+ I+ L   E GS  YG+ E
Sbjct: 157 HNLNKVEH--LFAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITE 214

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
           F+D+++ E++ +   ++  P  A  +  A IPNI LP+ FDWRE  A++ VK+Q  CGS 
Sbjct: 215 FADMTSPEYKQRTGLWQRDPQKAASNPKAEIPNIDLPKEFDWREKGAISAVKNQGNCGSC 274

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS TGNIEG++A +T  L   SEQEL+DCD  D  C GG   NA++ I  K+ GGLE 
Sbjct: 275 WAFSVTGNIEGLHAVRTGVLEQYSEQELLDCDTSDSACNGGLPDNAYEAI-EKI-GGLEL 332

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           E  YPY      C  N     VK+ G+V + ++ET +A++L+ NGP+++ INA A+QFY 
Sbjct: 333 ESDYPYHARKDQCHFNSTKIHVKVKGHVDLPKNETAIAQWLIANGPISIGINANAMQFYR 392

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            GVSHP    C    +NL H VLIVGY V       K +PYWI+KNSWG+ WGE+GY+R+
Sbjct: 393 GGVSHPPHILC--SRKNLDHGVLIVGYRVSDYPMFKKTLPYWIVKNSWGKKWGEQGYYRV 450

Query: 331 YRGDGSCGINDYVRSALV 348
           YRGD +CG+++   SA++
Sbjct: 451 YRGDNTCGVSEMSSSAVL 468


>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
 gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
          Length = 605

 Score =  295 bits (756), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 148/319 (46%), Positives = 203/319 (63%), Gaps = 9/319 (2%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           L+ V H  LF+ F  ++ + YA  +E+  RL IF  NLR IQ L D E GS  YG+ EF+
Sbjct: 292 LNKVDH--LFHVFQIKYKRRYANSMEHQMRLRIFRQNLRTIQELNDNEQGSAKYGITEFA 349

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
           D++++E+  +   ++   +      PA++P     LP+ FDWRE +AVT VK+Q  CGS 
Sbjct: 350 DMTSSEYTQRAGLWQRSANKPTGGKPAVVPAYKGELPKEFDWREKNAVTQVKNQGSCGSC 409

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS TGNIEG+YA KT +L   SEQEL+DCD  D  C GG + NA+  I  K  GGLE 
Sbjct: 410 WAFSVTGNIEGLYAIKTGELREFSEQELLDCDSTDSACNGGLMDNAYKAI--KDIGGLEY 467

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
           E  YPY    K C  NK  + V++  +V + + +ET M ++L+ NGP+++ +NA A+QFY
Sbjct: 468 ESEYPYLAKKKQCHFNKTLSHVQVADFVDLPKGNETAMQEWLLANGPISIGLNANAMQFY 527

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GVSHP    C    +NL H VLIVGYGV      HK +PYWI+KNSWG  WGE+GY+R
Sbjct: 528 RGGVSHPWGPLC--SKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 585

Query: 330 LYRGDGSCGINDYVRSALV 348
           +YRGD +CG+++   SA++
Sbjct: 586 IYRGDNTCGVSEMATSAVL 604


>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
          Length = 1785

 Score =  294 bits (752), Expect = 5e-77,   Method: Compositional matrix adjust.
 Identities = 147/312 (47%), Positives = 205/312 (65%), Gaps = 10/312 (3%)

Query: 42   FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
            F  F   H + YA+  E+  R +IF  NL KI  L   E G+G YG+ +F+D++TAE++A
Sbjct: 1478 FEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHERGTGKYGVTKFADMTTAEYRA 1537

Query: 102  KYLGFKLKPSYAD--RSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
             + G  +   +++  R+  A +     +LP +FDWR++ AVTGVK+Q  CGS WAFS  G
Sbjct: 1538 -HTGLIVPKQHSNHIRNPIATVSTERTSLPTSFDWRDHGAVTGVKNQGNCGSCWAFSAIG 1596

Query: 158  NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
            NIEG++  KTKKL + SEQELIDCD  D+GC GG + +AF  I  KL GGLE E  YPY+
Sbjct: 1597 NIEGLHQIKTKKLEAYSEQELIDCDTVDNGCNGGYMDDAFKAI-EKL-GGLELEDEYPYQ 1654

Query: 218  GD-DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
                K C  NK  + V++ G V + ++ET +A+YL+ENGP+A+ +NA A+QFY  G+SHP
Sbjct: 1655 AKAQKTCHFNKTLSHVRVKGAVDMPKNETFIAQYLIENGPIAIGLNANAMQFYRGGISHP 1714

Query: 277  IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
                C   ++ + H VLIVGYGV      +K +PYW IKNSWG  WGE+GY+R+YRGD S
Sbjct: 1715 WHLLC--SHKQIDHGVLIVGYGVKEYPLFNKTLPYWTIKNSWGPKWGEQGYYRIYRGDNS 1772

Query: 337  CGINDYVRSALV 348
            CG+++   SA++
Sbjct: 1773 CGVSEMASSAIL 1784


>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
          Length = 361

 Score =  291 bits (746), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 153/338 (45%), Positives = 207/338 (61%), Gaps = 25/338 (7%)

Query: 32  HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
           HL   +  +LF+ F   +NKTY    E+ +R  IF  NL++I L    E G+  YGL EF
Sbjct: 24  HLKLSQERSLFSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEF 83

Query: 92  SDLSTAEFQAKYLGFK--LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCG 148
           SDLS +EF+  YLG K  L    A+     + P N  LP  FDWR   AVT VK+Q MCG
Sbjct: 84  SDLSPSEFERHYLGLKKDLAEHKAEVKPIKVGPVNEPLPDLFDWRTKGAVTEVKNQGMCG 143

Query: 149 SSWAFS------------------TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEG 190
           S WAFS                   TGN+EG +     KL+SLSEQEL+DCD  D GC+G
Sbjct: 144 SCWAFSXXTEVKNQGMCGSCWAFSVTGNVEGQWFLSRSKLLSLSEQELVDCDHGDHGCKG 203

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
           G +  A   ++    GGLE E  YPY+G D  C  NK  ++ ++  +V + ++ET++A +
Sbjct: 204 GYMGQAMKAVIEM--GGLETESEYPYKGVDGTCEFNKTESKARVQSFVGLPQNETELAYW 261

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVP 310
           L+++GP+++ INA A+QFY  G+SHP +F C     +L H VL+VG+GVD+  F  K VP
Sbjct: 262 LMKHGPVSIGINANAMQFYFGGISHPWKFLCS--PTDLDHGVLLVGFGVDKRSFRRKPVP 319

Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           YWI+KNSWG+ WGEKGY+R+YRGDG+CG+N    SA+V
Sbjct: 320 YWIVKNSWGKYWGEKGYYRVYRGDGTCGVNQMALSAVV 357


>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
 gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
          Length = 599

 Score =  291 bits (746), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 146/319 (45%), Positives = 200/319 (62%), Gaps = 9/319 (2%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           L+ V H  LF+ F  ++ + YA   E+  RL IF  +L+ IQ L   E GS  YG+ EF+
Sbjct: 286 LNKVDH--LFHKFQVKYKRRYANSAEHQMRLRIFRQSLKTIQELNANEQGSAKYGITEFA 343

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
           D+++ E+  +   ++           A++P     LP+ FDWR+ +AVT VK+Q  CGS 
Sbjct: 344 DMTSTEYAQRAGLWQRSEGKPTGGAAAVVPAYAGELPKEFDWRQKNAVTHVKNQGQCGSC 403

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS TGNIEG YA KT  L   SEQEL+DCD +D  C GG + NA+  I  K  GGLE 
Sbjct: 404 WAFSVTGNIEGAYAIKTGDLQEFSEQELLDCDSKDSACNGGLMDNAYKAI--KDIGGLEY 461

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
           E  YPY G  K C  N+  + V+++G+V + + +ET M ++L+ NGP+++ INA A+QFY
Sbjct: 462 ESEYPYEGKKKQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTNGPISIGINANAMQFY 521

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GVSHP    C    +NL H VLIVGYGV      HK +PYWI+KNSWG  WGE+GY+R
Sbjct: 522 RGGVSHPWSPLC--SKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 579

Query: 330 LYRGDGSCGINDYVRSALV 348
           +YRGD +CG+++   SAL+
Sbjct: 580 VYRGDNTCGVSEMATSALL 598


>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
          Length = 459

 Score =  291 bits (744), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 149/309 (48%), Positives = 194/309 (62%), Gaps = 15/309 (4%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F+ +H K Y +  +   R  +F  NL+ I+  Q+ E G+ VYG+ +FSDL+  EF+  YL
Sbjct: 160 FMGRHEKVYNSKHDTLKRFRVFKRNLKAIRSWQEKEEGTAVYGITQFSDLTPEEFKKIYL 219

Query: 105 GFKL-KPSYADRSVPAMIP----NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
            +   +P   +R V         N TLP +FDWR++ AVT VK+Q  CGS WAFSTTGNI
Sbjct: 220 PYIWDEPIVPNRMVDLTAEGVHLNETLPESFDWRDHGAVTDVKNQGFCGSCWAFSTTGNI 279

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           EG +    KKLVSLSEQEL+DCD+ DDGCEGG  S A+  IM    GGLE E  YPY G 
Sbjct: 280 EGQWFLAKKKLVSLSEQELVDCDKVDDGCEGGLPSQAYKEIMRM--GGLETESAYPYDGR 337

Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
            + C +N+    V IN  V +  DE  M  +LV+ GP+++ INA  LQFY  G+SHP +F
Sbjct: 338 GEECHINRTEFAVYINDSVELPHDEESMKAWLVKKGPISIGINANPLQFYRHGISHPWKF 397

Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
           FC+     L+H VL+VGYG ++ K      PYWIIKNSWG  WGE GY+RLYRG   CG+
Sbjct: 398 FCEP--YMLNHGVLLVGYGSEKNK------PYWIIKNSWGPKWGENGYYRLYRGKNVCGV 449

Query: 340 NDYVRSALV 348
           ++   SA+V
Sbjct: 450 HEMPTSAVV 458


>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
 gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
          Length = 620

 Score =  290 bits (743), Expect = 5e-76,   Method: Compositional matrix adjust.
 Identities = 145/319 (45%), Positives = 201/319 (63%), Gaps = 9/319 (2%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           L  V+H  LF+ F  +  + Y +  E   RL IF  NL+ I+ L   E GS  YG+ EF+
Sbjct: 307 LDKVEH--LFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFA 364

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
           D+++ E++ +   ++   + A    PA++P  +  LP+ FDWR  +AVTGVK+Q  CGS 
Sbjct: 365 DMTSTEYKERTGLWQRDEAKATGGSPAVVPAYSGELPKEFDWRSKNAVTGVKNQGQCGSC 424

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS TGNIEG+YA K  +L   SEQEL+DCD  D  C GG + NA+  I  K  GGLE 
Sbjct: 425 WAFSVTGNIEGLYALKYGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEY 482

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
           E  YPY    K C  NK  + V++  +V + + +ET M ++LV NGP+++ INA A+QFY
Sbjct: 483 EAEYPYEAKKKQCHFNKTMSHVQVKDFVDLPKGNETAMQEWLVSNGPISIGINANAMQFY 542

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GVSHP +  C    +NL H VL+VGYGV      HK +PYWI+KNSWG  WGE+GY+R
Sbjct: 543 RGGVSHPWKALC--SKKNLDHGVLVVGYGVSDYPNYHKTLPYWIVKNSWGPRWGEQGYYR 600

Query: 330 LYRGDGSCGINDYVRSALV 348
           +YRGD +CG+++   SA++
Sbjct: 601 VYRGDNTCGVSEMATSAVL 619


>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
 gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
          Length = 610

 Score =  290 bits (742), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 144/322 (44%), Positives = 200/322 (62%), Gaps = 10/322 (3%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           H L  V+H  LF+ F  +  + Y   VE   RL IF  NLR I+ L   E GS  YG+ E
Sbjct: 294 HSLDKVEH--LFHKFQIKFERRYVNSVERQMRLRIFRQNLRIIEQLNANEMGSAKYGITE 351

Query: 91  FSDLSTAEFQAK---YLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
           F+D+++ E++ +   +   + +P+   ++V    P   LP+ FDWR+  AV+ VK+Q  C
Sbjct: 352 FADMTSTEYKERTGLWQRTEGQPTGGQKAVVPSYPGGELPKEFDWRQKGAVSSVKNQGSC 411

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
           GS WAFST GNIEG+ A KT +L   SEQEL+DCD +D  C GG   NA+  I     GG
Sbjct: 412 GSCWAFSTIGNIEGLNAVKTGQLKEFSEQELLDCDTKDSACNGGLPDNAYKAIQEI--GG 469

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYAL 266
           LE E  YPY+   + C  NK    V++ G+V + + +ET M ++L+ NGP+++ INA A+
Sbjct: 470 LEYESEYPYKARKEQCHFNKTLAHVQVTGFVDLPKNNETAMQEWLIANGPISIGINANAM 529

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           QFY  GVSHP +  C+    NL H VLIVGYGV      HK +PYWI+KNSWG  WGE+G
Sbjct: 530 QFYRGGVSHPWKILCE--KSNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQG 587

Query: 327 YFRLYRGDGSCGINDYVRSALV 348
           Y+R+YRGD +CG+++   SA++
Sbjct: 588 YYRVYRGDNTCGVSEMASSAIL 609


>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
 gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
          Length = 1834

 Score =  287 bits (735), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 144/328 (43%), Positives = 210/328 (64%), Gaps = 15/328 (4%)

Query: 29   KLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGL 88
            K+    HV+   +F+ F   H + YA+ +E+  R +IF  NL KI+ L   E G+  YG+
Sbjct: 1513 KIDDDAHVRR--MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGV 1570

Query: 89   NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-------LPRAFDWREYDAVTGV 141
             +F+D++ AE++A + G  +        V   + +         LPR+FDWR++ AVT V
Sbjct: 1571 TKFADMTVAEYRA-HTGLVVPKHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEV 1629

Query: 142  KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
            K+Q  CGS WAFS  GN+EG++  KTKKL S SEQELIDCD+ D+GC GG + +AF  I 
Sbjct: 1630 KNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAI- 1688

Query: 202  SKLGGGLEEEKTYPYRGD-DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVA 260
             +  GGLE E  YPY     K+C  N+  + V++ G V + ++ET +AKYL++NGP+A+ 
Sbjct: 1689 -EQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIG 1747

Query: 261  INAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
            +NA A+QFY  G+SHP    C+  ++++ H VLIVGYG+      +K +PYWIIKNSWG 
Sbjct: 1748 LNANAMQFYRGGISHPWHPLCN--HKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGP 1805

Query: 321  GWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGE+GY+R+YRGD SCG+++   SA++
Sbjct: 1806 RWGEQGYYRIYRGDNSCGVSEMASSAIL 1833


>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
 gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
           Precursor
 gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
 gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
          Length = 614

 Score =  287 bits (735), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 143/321 (44%), Positives = 200/321 (62%), Gaps = 9/321 (2%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           H    V H  LF  F  +  + Y +  E   RL IF  NL+ I+ L   E GS  YG+ E
Sbjct: 299 HRFDKVDH--LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITE 356

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCG 148
           F+D++++E++ +   ++   + A     A++P     LP+ FDWR+ DAVT VK+Q  CG
Sbjct: 357 FADMTSSEYKERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCG 416

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           S WAFS TGNIEG+YA KT +L   SEQEL+DCD  D  C GG + NA+  I  K  GGL
Sbjct: 417 SCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGL 474

Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQ 267
           E E  YPY+     C  N+  + V++ G+V + + +ET M ++L+ NGP+++ INA A+Q
Sbjct: 475 EYEAEYPYKAKKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQ 534

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
           FY  GVSHP +  C    +NL H VL+VGYGV      HK +PYWI+KNSWG  WGE+GY
Sbjct: 535 FYRGGVSHPWKALC--SKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGY 592

Query: 328 FRLYRGDGSCGINDYVRSALV 348
           +R+YRGD +CG+++   SA++
Sbjct: 593 YRVYRGDNTCGVSEMATSAVL 613


>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
 gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
          Length = 1810

 Score =  287 bits (735), Expect = 4e-75,   Method: Compositional matrix adjust.
 Identities = 144/328 (43%), Positives = 210/328 (64%), Gaps = 15/328 (4%)

Query: 29   KLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGL 88
            K+    HV+   +F+ F   H + YA+ +E+  R +IF  NL KI+ L   E G+  YG+
Sbjct: 1489 KIDDDAHVRR--MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGV 1546

Query: 89   NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-------LPRAFDWREYDAVTGV 141
             +F+D++ AE++A + G  +        V   + +         LPR+FDWR++ AVT V
Sbjct: 1547 TKFADMTVAEYRA-HTGLVVPKHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEV 1605

Query: 142  KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
            K+Q  CGS WAFS  GN+EG++  KTKKL S SEQELIDCD+ D+GC GG + +AF  I 
Sbjct: 1606 KNQGSCGSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAI- 1664

Query: 202  SKLGGGLEEEKTYPYRGD-DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVA 260
             +  GGLE E  YPY     K+C  N+  + V++ G V + ++ET +AKYL++NGP+A+ 
Sbjct: 1665 -EQLGGLELENDYPYEAKAQKSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIG 1723

Query: 261  INAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
            +NA A+QFY  G+SHP    C+  ++++ H VLIVGYG+      +K +PYWIIKNSWG 
Sbjct: 1724 LNANAMQFYRGGISHPWHPLCN--HKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGP 1781

Query: 321  GWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGE+GY+R+YRGD SCG+++   SA++
Sbjct: 1782 RWGEQGYYRIYRGDNSCGVSEMASSAIL 1809


>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
 gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
 gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
          Length = 475

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 143/321 (44%), Positives = 201/321 (62%), Gaps = 9/321 (2%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           H    V H  LF  F  +  + Y +  E   RL IF  NL+ I+ L   E GS  YG+ E
Sbjct: 160 HRFDKVDH--LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITE 217

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCG 148
           F+D++++E++ +   ++   + A     A++P  +  LP+ FDWR+ DAVT VK+Q  CG
Sbjct: 218 FADMTSSEYKERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCG 277

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           S WAFS TGNIEG+YA KT +L   SEQEL+DCD  D  C GG + NA+  I  K  GGL
Sbjct: 278 SCWAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGL 335

Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQ 267
           E E  YPY+     C  N+  + V++ G+V + + +ET M ++L+ NGP+++ INA A+Q
Sbjct: 336 EYEAEYPYKAKKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLANGPISIGINANAMQ 395

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
           FY  GVSHP +  C    +NL H VL+VGYGV      HK +PYWI+KNSWG  WGE+GY
Sbjct: 396 FYRGGVSHPWKALC--SKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGY 453

Query: 328 FRLYRGDGSCGINDYVRSALV 348
           +R+YRGD +CG+++   SA++
Sbjct: 454 YRVYRGDNTCGVSEMATSAVL 474


>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
 gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
          Length = 953

 Score =  287 bits (735), Expect = 5e-75,   Method: Compositional matrix adjust.
 Identities = 143/322 (44%), Positives = 208/322 (64%), Gaps = 15/322 (4%)

Query: 35  HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
           HV+   +F+ F   H + YA+ +E+  R +IF  NL KI+ L   E G+  YG+ +F+D+
Sbjct: 638 HVRR--MFDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADM 695

Query: 95  STAEFQAKYLGFKLKPSYADRSVPAMIPNIT-------LPRAFDWREYDAVTGVKDQTMC 147
           + AE++A + G  +        V   + +         LPR+FDWR++ AVT VK+Q  C
Sbjct: 696 TVAEYRA-HTGLVVPKHDRANHVGNRVASEEDVAGVGDLPRSFDWRDHGAVTEVKNQGSC 754

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
           GS WAFS  GN+EG++  KTKKL S SEQELIDCD+ D+GC GG + +AF  I  +  GG
Sbjct: 755 GSCWAFSAVGNVEGLHQIKTKKLESYSEQELIDCDKVDNGCGGGYMDDAFKAI--EQLGG 812

Query: 208 LEEEKTYPYRGD-DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           LE E  YPY     K+C  N+  + V++ G V + ++ET +AKYL++NGP+A+ +NA A+
Sbjct: 813 LELENDYPYEAKAQKSCHFNRSLSHVQVKGAVDMPKNETYIAKYLIKNGPIAIGLNANAM 872

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           QFY  G+SHP    C+  ++++ H VLIVGYG+      +K +PYWIIKNSWG  WGE+G
Sbjct: 873 QFYRGGISHPWHPLCN--HKSIDHGVLIVGYGIKEYPMFNKTLPYWIIKNSWGPRWGEQG 930

Query: 327 YFRLYRGDGSCGINDYVRSALV 348
           Y+R+YRGD SCG+++   SA++
Sbjct: 931 YYRIYRGDNSCGVSEMASSAIL 952


>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
 gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
          Length = 617

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 141/321 (43%), Positives = 200/321 (62%), Gaps = 9/321 (2%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           H L+ ++H  LF+ F  ++ + YA   E+  RL IF  NLR I+ L   E GS  YG+ +
Sbjct: 302 HTLNKIEH--LFHKFQLKYKRQYANTAEHQMRLRIFRQNLRTIEELNANERGSAKYGITQ 359

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCG 148
           F+D+++ E++     ++           A++P     +P+ FDWR+  AVT VK+Q  CG
Sbjct: 360 FADMTSTEYKLHAGLWQRSEDKPTGGAAAVVPPYAGEMPKEFDWRQKKAVTHVKNQGQCG 419

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           S WAFS TGNIEG+YA KT +L   SEQEL+DCD  D  C GG + NA+  I  K  GGL
Sbjct: 420 SCWAFSVTGNIEGLYAIKTGELEEFSEQELLDCDSTDSACNGGLMDNAYKAI--KDIGGL 477

Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQ 267
           E E  YPY      C  N+  + V+++G+V + + +ET M ++L+ NGP+++ +NA A+Q
Sbjct: 478 EYESEYPYAAKKMQCHFNRTMSHVQLSGFVDLPKGNETAMQEWLLSNGPISIGLNANAMQ 537

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
           FY  GVSHP    C    +NL H VLIVGYGV      HK +PYWI+KNSWG  WGE+GY
Sbjct: 538 FYRGGVSHPWAPLC--SKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGY 595

Query: 328 FRLYRGDGSCGINDYVRSALV 348
           +R+YRGD +CG+++   SA++
Sbjct: 596 YRIYRGDNTCGVSEMATSAVL 616


>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
 gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
          Length = 615

 Score =  286 bits (731), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 140/311 (45%), Positives = 197/311 (63%), Gaps = 7/311 (2%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  F  +  + Y +  E   RL IF  NL+ I+ L   E GS  YG+ EF+D++++E++
Sbjct: 308 LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYK 367

Query: 101 AKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            +   ++   + A     A++P     LP+ FDWR+ DAVT VK+Q  CGS WAFS TGN
Sbjct: 368 ERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKDAVTQVKNQGSCGSCWAFSVTGN 427

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IEG+YA KT +L   SEQEL+DCD  D  C GG + NA+  I  K  GGLE E  YPY+ 
Sbjct: 428 IEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYKA 485

Query: 219 DDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
               C  N+  + V++ G+V + + +ET M ++L+ NGP+++ INA A+QFY  GVSHP 
Sbjct: 486 KKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGPISIGINANAMQFYRGGVSHPW 545

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
           +  C    +NL H VL+VGYGV      HK +PYWI+KNSWG  WGE+GY+R+YRGD +C
Sbjct: 546 KALC--SKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRGDNTC 603

Query: 338 GINDYVRSALV 348
           G+++   SA++
Sbjct: 604 GVSEMATSAVL 614


>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
 gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
          Length = 1454

 Score =  284 bits (727), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 151/344 (43%), Positives = 213/344 (61%), Gaps = 19/344 (5%)

Query: 16   TVSVSSFMVVGDEKLHHL----HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
            TV+  S     + + HH      H +H  LF+ F  +HN+TY + +E+  R  IF  NL 
Sbjct: 1118 TVAKRSLRPHPNLEAHHYSKSEDHSRH--LFDKFKTRHNRTYQSSLEHEMRFRIFKNNLF 1175

Query: 72   KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYAD-----RSVPAMI-PNIT 125
            KI+ L   E G+  YG+  F+D+++AE++A+  G  + P   D     R+  A I  ++ 
Sbjct: 1176 KIEQLNKYEQGTAKYGITHFADMTSAEYRAR-TGLVV-PREGDEVNHIRNPMAEIDEHME 1233

Query: 126  LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            LP AFDWRE  AV+ VK+Q  CGS WAFS  GNIEG++  KTKKL   SEQEL+DCD  D
Sbjct: 1234 LPDAFDWRELGAVSEVKNQGNCGSCWAFSVVGNIEGLHQVKTKKLEEYSEQELLDCDTVD 1293

Query: 186  DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRDE 244
              C GG + +A+  I  K+ GGLE E  YPY     K C  NK    V++ G V + ++E
Sbjct: 1294 SACNGGFMDDAYKAI-EKI-GGLELESEYPYLAKKQKTCHFNKTMAHVRVKGAVDLPKNE 1351

Query: 245  TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
            T +A++LV NGP+++ +NA A+QFY  G+SHP +  C    +NL H VLIVGYGV     
Sbjct: 1352 TAIAQFLVANGPVSIGLNANAMQFYRGGISHPWKPLC--SKKNLDHGVLIVGYGVKEYPM 1409

Query: 305  THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             +K +PYWI+KNSWG  WGE+GY+R++RGD +CG+++   SA++
Sbjct: 1410 FNKTLPYWIVKNSWGPKWGEQGYYRVFRGDNTCGVSEMATSAVL 1453


>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
 gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
          Length = 615

 Score =  284 bits (727), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 142/319 (44%), Positives = 200/319 (62%), Gaps = 9/319 (2%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           L  V H  LF+ F  +  + Y +  E   RL IF  NL+ I+ L   E GS  YG+ EF+
Sbjct: 302 LDKVDH--LFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFA 359

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSS 150
           DL+++E++ +   ++   + A     A++P     LP+ FDWR+ +AVT VK+Q  CGS 
Sbjct: 360 DLTSSEYKERTGLWQRDEAKATGGSAAVVPAYHGELPKEFDWRQKNAVTPVKNQGSCGSC 419

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS TGNIEG+YA KT +L   SEQEL+DCD  D  C GG + NA+  I  K  GGLE 
Sbjct: 420 WAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEY 477

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
           E  YPY+     C  N+  + V++ G+V + + +ET M ++L+  GP+++ INA A+QFY
Sbjct: 478 EAEYPYKAKKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTKGPISIGINANAMQFY 537

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GVSHP +  C    +NL H VL+VGYGV      HK +PYWI+KNSWG  WGE+GY+R
Sbjct: 538 RGGVSHPWKALC--SKKNLDHGVLVVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 595

Query: 330 LYRGDGSCGINDYVRSALV 348
           +YRGD +CG+++   SA++
Sbjct: 596 VYRGDNTCGVSEMATSAVL 614


>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
          Length = 478

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 148/309 (47%), Positives = 188/309 (60%), Gaps = 15/309 (4%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F+++H K Y    E   R  +F  N + I+ LQ  E G+ VYG  +FSD++T EF+   L
Sbjct: 179 FIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKETML 238

Query: 105 GFKL-KPSYADRS----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
            ++  +P   D++        I    LP +FDWRE+ AVT VK+Q  CGS WAFSTTGNI
Sbjct: 239 PYQWEQPVPMDQANFEKEGVTISEEDLPDSFDWREHGAVTQVKNQGSCGSCWAFSTTGNI 298

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           EG +    KKLVSLSEQEL+DCD  D GC GG  SNA+  I+    GGLE E  YPY G 
Sbjct: 299 EGAWFLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDGR 356

Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
            + C L +K   V ING V +  DE +M K+LV  GP+++ +NA  LQFY  GV HP + 
Sbjct: 357 GETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKI 416

Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
           FC+     L+H VLIVGYG D  K      PYWI+KNSWG  WGE GYF+LYRG   CG+
Sbjct: 417 FCEPF--MLNHGVLIVGYGKDGRK------PYWIVKNSWGPTWGEAGYFKLYRGKNVCGV 468

Query: 340 NDYVRSALV 348
            +   S+LV
Sbjct: 469 QEMATSSLV 477


>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
          Length = 478

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 148/309 (47%), Positives = 188/309 (60%), Gaps = 15/309 (4%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F+++H K Y    E   R  +F  N + I+ LQ  E G+ VYG  +FSD++T EF+   L
Sbjct: 179 FIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKETML 238

Query: 105 GFKL-KPSYADRS----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
            ++  +P   D++        I    LP +FDWRE+ AVT VK+Q  CGS WAFSTTGNI
Sbjct: 239 PYQWEQPVPMDQANFEKEGVTISEEDLPDSFDWREHGAVTQVKNQGSCGSCWAFSTTGNI 298

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           EG +    KKLVSLSEQEL+DCD  D GC GG  SNA+  I+    GGLE E  YPY G 
Sbjct: 299 EGAWFLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDGR 356

Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
            + C L +K   V ING V +  DE +M K+LV  GP+++ +NA  LQFY  GV HP + 
Sbjct: 357 GETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFKI 416

Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
           FC+     L+H VLIVGYG D  K      PYWI+KNSWG  WGE GYF+LYRG   CG+
Sbjct: 417 FCEPF--MLNHGVLIVGYGKDGRK------PYWIVKNSWGPTWGEAGYFKLYRGKNVCGV 468

Query: 340 NDYVRSALV 348
            +   S+LV
Sbjct: 469 QEMATSSLV 477


>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
 gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
          Length = 496

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 145/310 (46%), Positives = 192/310 (61%), Gaps = 13/310 (4%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  FL+   K Y +  E   R  IF  N++ +++LQ  E G+ VYG+  F+DL+  EF+ 
Sbjct: 196 FKEFLKTFKKWYLSEKELLKRYDIFKVNMKTVEMLQKNEQGTAVYGVTFFADLTPEEFRK 255

Query: 102 KYLGFKLKPSYADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
               F L P +    +P   A IP   +   +DWRE++AVT VK+Q MCGS WAF+T  N
Sbjct: 256 ----FYLSPQWKRDQLPQRKASIPKGKIEDRWDWREHNAVTEVKNQGMCGSCWAFATIAN 311

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EGV+A K  +LVSLSEQEL+DCD  D GC GG  SNA+  I+    GGL  E  Y Y G
Sbjct: 312 VEGVWAVKKGELVSLSEQELVDCDTLDQGCSGGYPSNAYKEIIRL--GGLTTETNYSYDG 369

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
           +   CR   +  +V IN  VS+  DET++A Y+ ENGP+AV INA+A+ FY  G++HP +
Sbjct: 370 NQGTCRFKTQNAKVYINDSVSLPEDETEIAAYIRENGPVAVGINAFAMMFYRHGIAHPWR 429

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
           F C    + L H V IVGY V++   + K  PYWIIKNSWG  WGE GY+ LYRG G CG
Sbjct: 430 FLCSP--DALDHGVAIVGYDVEKQ--SKKPKPYWIIKNSWGTHWGEGGYYMLYRGAGVCG 485

Query: 339 INDYVRSALV 348
           +N  V SA++
Sbjct: 486 VNKMVTSAII 495


>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
           [Strongylocentrotus purpuratus]
          Length = 453

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 143/330 (43%), Positives = 209/330 (63%), Gaps = 17/330 (5%)

Query: 14  SLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYAT---LVEYYSRLHIFSGNL 70
           SL++    F +  D +   +   ++  LF+ FL    + Y       EY  R  +F  N+
Sbjct: 129 SLSLKAQDFSITKDCQASDIKD-EYRDLFDKFLMTFKREYRQNDGTNEYEYRYSVFVQNM 187

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAF 130
             +++    E G+  YG  +F+D++ AEF+    G  LK +   +   A IP   +P  +
Sbjct: 188 LTVEMFNQFEQGTAKYGPTKFADMTEAEFRKLQSG-PLKKTGIKKQ--AAIPQGPVPEEY 244

Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEG 190
           DWR + AVT VK+Q MCGS WAFS  GN+EG +  K  +L+SLSEQEL+DCD+ D GCEG
Sbjct: 245 DWRTHGAVTPVKNQGMCGSCWAFSAIGNMEGQWQIKKGELISLSEQELVDCDKVDGGCEG 304

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
           G +S+A++ I+ KLGG + EEK YPYRG+++ C+ N    +VKINGYV++S++ET+MA +
Sbjct: 305 GEMSDAYEAII-KLGGAMSEEK-YPYRGENEKCKFNMTDVRVKINGYVNISKNETEMAGW 362

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVP 310
           L  +GP+++ INA  +QFY  G++HP + FC    ++L H VLIVGY V          P
Sbjct: 363 LAAHGPISIGINALMMQFYFGGIAHPWKIFCS--PDSLDHGVLIVGYSV------KDGEP 414

Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           YWI+KNSWG+ WGE+GY+ +YRGDG+CG+N
Sbjct: 415 YWIVKNSWGKDWGEEGYYLVYRGDGTCGLN 444


>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
 gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
          Length = 276

 Score =  284 bits (726), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 147/287 (51%), Positives = 186/287 (64%), Gaps = 13/287 (4%)

Query: 63  LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF-QAKYLGFKLKPSYADRSVPAMI 121
           + IF  N+RK   +Q  + G+  YG   FSDLS  EF + K +    KP Y  +   A I
Sbjct: 1   MKIFESNMRKAAKMQKMDSGTAQYGPTIFSDLSEEEFRKQKMMPGWGKPLYEMKD--AEI 58

Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
           P   +P + DWR+   VT VK+Q  CGS WAFSTTGNIEG YA KT KLVSLSEQEL+DC
Sbjct: 59  PLGDIPESVDWRDKGVVTPVKNQGSCGSCWAFSTTGNIEGQYAIKTGKLVSLSEQELVDC 118

Query: 182 DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
           D  D GCEGG  SNA+  I  KL GGLE E  YPY+G D  C+ NK   +V IN  V +S
Sbjct: 119 DTIDKGCEGGLPSNAYKQI-EKL-GGLESESDYPYKGADSKCKFNKAEVKVTINSSVVIS 176

Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           +DE ++A +L +NGP+++ INA A+QFY+ G++HP + FC+    +L+H VLIVGYGV  
Sbjct: 177 KDEKEIAAWLAKNGPISIGINANAMQFYMGGIAHPWKIFCN--PSSLNHGVLIVGYGV-- 232

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                   PYWIIKNSWG  WGEKGY+ +YRG G CG+N    SA++
Sbjct: 233 ----KNGTPYWIIKNSWGPSWGEKGYYLIYRGGGCCGLNTMCTSAVI 275


>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
 gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
          Length = 615

 Score =  284 bits (726), Expect = 6e-74,   Method: Compositional matrix adjust.
 Identities = 138/311 (44%), Positives = 197/311 (63%), Gaps = 7/311 (2%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF+ F  +  + Y +  E   RL IF  NL+ I+ L   E GS  YG+ EF+D++++E++
Sbjct: 308 LFHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGITEFADMTSSEYK 367

Query: 101 AKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            +   ++   + A     A++P     LP+ FDWR+ +AVT VK+Q  CGS WAFS TGN
Sbjct: 368 ERTGLWQRNEAKATGGSVAVVPAYHGELPKEFDWRQKNAVTQVKNQGSCGSCWAFSVTGN 427

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IEG++A KT  L   SEQEL+DCD  D  C GG + NA+  I  K  GGLE E  YPY+ 
Sbjct: 428 IEGLHAVKTGDLKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEYEAEYPYKA 485

Query: 219 DDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
               C  N+  + V++ G+V + + +ET M ++L+ NGP+++ INA A+QFY  GVSHP 
Sbjct: 486 KKNQCHFNRTLSHVQVAGFVDLPKGNETAMQEWLLTNGPISIGINANAMQFYRGGVSHPW 545

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
           +  C    +NL H VL+VGYGV      HK +PYWI+KNSWG  WGE+GY+R+YRGD +C
Sbjct: 546 KALC--SKKNLDHGVLVVGYGVSEYPNFHKTLPYWIVKNSWGPRWGEQGYYRVYRGDNTC 603

Query: 338 GINDYVRSALV 348
           G+++   SA++
Sbjct: 604 GVSEMATSAVL 614


>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
          Length = 427

 Score =  283 bits (725), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 153/341 (44%), Positives = 207/341 (60%), Gaps = 17/341 (4%)

Query: 7   FAGVALLSLTVSVSSFMVVG---DEKLHHLHHVKHTA-LFNYFLEQHNKTYATLVEYYSR 62
           F  +A+  L +S  S  +V    + +L      ++T+ LF  F  +  K+Y++  +   R
Sbjct: 88  FQRLAIEQLRISRRSIELVSLPSNIELLGFRLPQNTSRLFEEFQRKFRKSYSS--DTAKR 145

Query: 63  LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP 122
             +F  NL K+QL+Q  E G+  YG+ +FSDLS  EF+      K + S   +   A+ P
Sbjct: 146 YALFKYNLLKMQLIQRLEKGTANYGITKFSDLSAEEFRHSLANMKRRKSKGSQMETAIFP 205

Query: 123 NI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
               +LP +FDWR   AVT VKDQ MCGS WAF+TTGNIEG +  KT KL+SLSEQ+L+D
Sbjct: 206 TTIQSLPPSFDWRANGAVTEVKDQGMCGSCWAFATTGNIEGQWFRKTNKLISLSEQQLLD 265

Query: 181 CDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVS 239
           CD +D+ C GG    A+D I+    GGL  EK YPY    +++C L +      ING  +
Sbjct: 266 CDTKDEACNGGLPEWAYDEIVKM--GGLMSEKDYPYEAMKEQSCHLRRPNISAYINGSAT 323

Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
           +  DE  +A +LV+NGP++V +NA  LQFY+ G+SHP    C      L H+VL+VGYGV
Sbjct: 324 LPSDEAKLAAWLVQNGPISVGVNANFLQFYLGGISHPPHMLCS--EAGLDHAVLLVGYGV 381

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
                T    PYWI+KNSWG GWGEKGYFR+YRGDG+CGIN
Sbjct: 382 S----TFLRRPYWIVKNSWGGGWGEKGYFRMYRGDGTCGIN 418


>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
 gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
          Length = 463

 Score =  283 bits (724), Expect = 8e-74,   Method: Compositional matrix adjust.
 Identities = 142/315 (45%), Positives = 198/315 (62%), Gaps = 14/315 (4%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K   LF  F+  +NK Y+   E   RL IFS NL+K Q++Q+ + G+  YG+ ++SDL+
Sbjct: 160 LKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNLKKAQMIQEMDQGTAEYGVTKYSDLT 219

Query: 96  TAEFQAKYLGFKL--KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
             EF++ YL   L  KP Y  +   A++PN++ P  +DWR++ AVT VK+Q MCGS WAF
Sbjct: 220 EDEFRSLYLNPLLSSKPLYQMKK--AIVPNMSAPDQWDWRDHGAVTEVKNQGMCGSCWAF 277

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +  K   LVSLSEQEL+DCD  D  C GG  SNA++ I  KL GG+E E+ 
Sbjct: 278 SVIGNIEGQWFLKKGSLVSLSEQELVDCDGVDHACAGGLPSNAYEAI-EKL-GGIETEQE 335

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
           Y Y G    C  +       IN  V + +DE ++A +L +NGP+++A+NA+A+QFY  G+
Sbjct: 336 YSYEGHKNTCSFSTSKVSAYINSSVEIPKDENEIAAWLAQNGPISIALNAFAMQFYRKGI 395

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
           SHP +  C+     + H+VL+VGYG           P+W IKNSWG  WGE+GY+ LYRG
Sbjct: 396 SHPFRILCNPW--MIDHAVLLVGYG------ERNGTPFWAIKNSWGTDWGEQGYYYLYRG 447

Query: 334 DGSCGINDYVRSALV 348
            G+CG+N    SA+V
Sbjct: 448 TGACGMNTMCSSAVV 462


>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
          Length = 477

 Score =  283 bits (724), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 150/310 (48%), Positives = 188/310 (60%), Gaps = 16/310 (5%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F+++H K Y+   E   R   F  N + I+ LQ  E GS VYG  +FSD++T EF+   L
Sbjct: 177 FIDRHEKRYSNKREVLKRFRTFKKNAKVIRELQKNEQGSAVYGFTKFSDMTTMEFKQTML 236

Query: 105 GFKL-KPSY----ADRSVPAM-IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            ++  +P Y    AD     + I    LP +FDWR++ AVT VK+Q  CGS WAFSTTGN
Sbjct: 237 PYQWEQPVYPMAEADFEKEGVTISEDDLPDSFDWRDHGAVTQVKNQGNCGSCWAFSTTGN 296

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG +    KKLVSLSEQEL+DCD  D GC GG  SNA+  IM    GGLE E  YPY G
Sbjct: 297 VEGAWYLAKKKLVSLSEQELVDCDSVDQGCNGGLPSNAYKEIMRM--GGLEPEDAYPYDG 354

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
             + C + +K   V ING V +  DE  + K+LV  GP+++ +NA  LQFY  GV HP +
Sbjct: 355 KGETCHIVRKDIAVYINGSVELPHDEVKIQKWLVTKGPISIGLNANTLQFYRHGVVHPFK 414

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
            FC+     L+H VLIVGYG D  K      PYWI+KNSWG  WGE GYFRLYRG   CG
Sbjct: 415 IFCEPF--MLNHGVLIVGYGKDGRK------PYWIVKNSWGPTWGESGYFRLYRGKNVCG 466

Query: 339 INDYVRSALV 348
           + +   SALV
Sbjct: 467 VQEMATSALV 476


>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
 gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
          Length = 477

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 147/310 (47%), Positives = 186/310 (60%), Gaps = 16/310 (5%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F+++H K Y    E   R  +F  N + I+ LQ  E G+ VYG  +FSD++T EF+   L
Sbjct: 177 FVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEFKKIML 236

Query: 105 GFKL-KPSYADRSVPAMIPNIT-----LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            ++  +P Y          ++T     LP +FDWRE  AVT VK+Q  CGS WAFSTTGN
Sbjct: 237 PYQWEQPVYPMEQANFEKHDVTINEEDLPESFDWREKGAVTQVKNQGNCGSCWAFSTTGN 296

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG +     KLVSLSEQEL+DCD  D GC GG  SNA+  I+    GGLE E  YPY G
Sbjct: 297 VEGAWFIAKNKLVSLSEQELVDCDSMDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDG 354

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
             + C L +K   V ING V +  DE +M K+LV  GP+++ +NA  LQFY  GV HP +
Sbjct: 355 RGETCHLVRKDIAVYINGSVELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFK 414

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
            FC+     L+H VLIVGYG D  K      PYWI+KNSWG  WGE GYF+LYRG   CG
Sbjct: 415 IFCEPF--MLNHGVLIVGYGKDGRK------PYWIVKNSWGPNWGEAGYFKLYRGKNVCG 466

Query: 339 INDYVRSALV 348
           + +   SALV
Sbjct: 467 VQEMATSALV 476


>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
 gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
          Length = 475

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 146/308 (47%), Positives = 198/308 (64%), Gaps = 11/308 (3%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+ ++N+TY++  +   RL IF  NL+  + LQ  + G+  YG+ +FSDL+  EF+ 
Sbjct: 177 FKEFMVRYNRTYSSQEDTDRRLRIFHENLKTAEKLQSLDLGTAEYGVTKFSDLTEEEFRT 236

Query: 102 KYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            YL   L      RS+ PA +P+   P ++DWRE+ AV+ VK+Q MCGS WAFS TGNIE
Sbjct: 237 LYLNPLLSQQKLQRSMKPAAMPHGPAPPSWDWREHGAVSPVKNQGMCGSCWAFSVTGNIE 296

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G +  KT KLVSLSEQEL+DCD  D  C GG  SNA++ I  KL GG+E E  Y Y G  
Sbjct: 297 GQWFVKTGKLVSLSEQELVDCDTADQACGGGLPSNAYEAI-EKL-GGVETETDYSYTGKK 354

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           ++C          IN  V +S+DE ++A +L ENGP++VA+NA+A+QFY  GVSHP++ F
Sbjct: 355 QSCDFTTDKVTAYINSSVELSKDENEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIF 414

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           C+     + H+VL+VGYG  + K      P+W IKNSWGE +GE+GY+ LYRG   CGIN
Sbjct: 415 CNPW--MIDHAVLLVGYGERQGK------PFWAIKNSWGEDYGEQGYYYLYRGSRLCGIN 466

Query: 341 DYVRSALV 348
               SA+V
Sbjct: 467 TMCSSAIV 474


>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
          Length = 474

 Score =  283 bits (723), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 147/308 (47%), Positives = 197/308 (63%), Gaps = 11/308 (3%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+ ++N+TY++  E   RL +F  NL+  + LQ  + G+  YG+ +FSDL+  EF+ 
Sbjct: 176 FKEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQGTAEYGVTKFSDLTEEEFRT 235

Query: 102 KYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            YL   L      +S+ PA +P    P ++DWRE+ AV+ VK+Q MCGS WAFS TGNIE
Sbjct: 236 LYLNPLLSQQNLQQSMKPAAMPRGPAPPSWDWREHGAVSPVKNQGMCGSCWAFSVTGNIE 295

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G + AKT KLVSLSEQEL+DCD  D  C GG  SNA++ I  KL GGLE E  Y Y G  
Sbjct: 296 GQWFAKTGKLVSLSEQELVDCDTVDQACGGGLPSNAYEAI-EKL-GGLETETDYSYTGKK 353

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           ++C          IN  V +S DE ++A +L ENGP++VA+NA+A+QFY  GVSHP++ F
Sbjct: 354 QSCDFTTDKVIAYINSSVELSTDENEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIF 413

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           C+     + H+VL+VGYG  + K      P+W IKNSWGE +GE+GY+ LYRG   CGIN
Sbjct: 414 CNPW--MIDHAVLLVGYGERQGK------PFWAIKNSWGEDYGEQGYYYLYRGSRLCGIN 465

Query: 341 DYVRSALV 348
               SA+V
Sbjct: 466 KMCSSAIV 473


>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
 gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
          Length = 629

 Score =  282 bits (722), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 140/319 (43%), Positives = 198/319 (62%), Gaps = 9/319 (2%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           L  V H  LF+ F  +  + Y    E   RL IF  NL+ I+ L   E GS  YG+ EF+
Sbjct: 316 LDKVDH--LFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFA 373

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
           D+++ E++ +   ++          PA++P      P+ FDWR+ +AVT VK+Q  CGS 
Sbjct: 374 DMTSTEYKERTGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSC 433

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS TGNIEG+YA KT +L   SEQEL+DCD  D  C GG + NA+  I  K  GGLE 
Sbjct: 434 WAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEY 491

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
           E  YPY    + C  N+  + V+++G+V + + +ET M ++L+ +GP+++ +NA A+QFY
Sbjct: 492 EAEYPYEAKKQQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAMQFY 551

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GVSHP +  C    +NL H VLIVGYGV      HK +PYWI+KNSWG  WGE+GY+R
Sbjct: 552 RGGVSHPWKALC--SKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 609

Query: 330 LYRGDGSCGINDYVRSALV 348
           +YRGD +CG+++   SA++
Sbjct: 610 VYRGDNTCGVSEMATSAVL 628


>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
          Length = 308

 Score =  282 bits (722), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 146/312 (46%), Positives = 190/312 (60%), Gaps = 22/312 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F+ ++N+TY+   E   R  I+  NLR  ++ Q  E G+ +YG  +FSDL+ AEF+   L
Sbjct: 10  FIGRYNRTYSNKKEMLKRFRIYKRNLRAAKIWQANEQGTAIYGETQFSDLTQAEFRKIML 69

Query: 105 GFKLKPSYADRSVPAMIPNIT--------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
            +K    +    VP  + N          +P +FDWRE +AVT VK+Q  CGS WAFS T
Sbjct: 70  PYK----WETPKVPNKMANFKEFGIAQNDIPESFDWREKNAVTEVKNQGSCGSCWAFSVT 125

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIEG +A KT KLVSLSEQEL+DCD  D GC GG  SNA+  I+    GGLE E  YPY
Sbjct: 126 GNIEGAWAIKTSKLVSLSEQELVDCDIIDQGCNGGLPSNAYREIIRM--GGLEAESDYPY 183

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
            G  + C L KK   V IN  + +  DE  MA +LV  GP+++ +NA  LQFY  G++HP
Sbjct: 184 DGRGEKCHLMKKDIAVYINDSLQLPHDEEKMAAWLVAKGPISIGLNANPLQFYRHGIAHP 243

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
            + FC    ++L H VLIVGYG +  K      PYWIIKNSWG  WGE+GYFRL+RG   
Sbjct: 244 WRVFCS--PKHLDHGVLIVGYGSETDK------PYWIIKNSWGTKWGEEGYFRLFRGKNV 295

Query: 337 CGINDYVRSALV 348
           CGI +   +A++
Sbjct: 296 CGIQEMATTAII 307


>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
 gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
          Length = 627

 Score =  282 bits (721), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 140/319 (43%), Positives = 198/319 (62%), Gaps = 9/319 (2%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           L  V H  LF+ F  +  + Y    E   RL IF  NL+ I+ L   E GS  YG+ EF+
Sbjct: 314 LDKVDH--LFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFA 371

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
           D+++ E++ +   ++          PA++P      P+ FDWR+ +AVT VK+Q  CGS 
Sbjct: 372 DMTSTEYKERTGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSC 431

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS TGNIEG+YA KT +L   SEQEL+DCD  D  C GG + NA+  I  K  GGLE 
Sbjct: 432 WAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEY 489

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
           E  YPY    + C  N+  + V+++G+V + + +ET M ++L+ +GP+++ +NA A+QFY
Sbjct: 490 EAEYPYEAKKQQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAMQFY 549

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GVSHP +  C    +NL H VLIVGYGV      HK +PYWI+KNSWG  WGE+GY+R
Sbjct: 550 RGGVSHPWKALC--SKKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 607

Query: 330 LYRGDGSCGINDYVRSALV 348
           +YRGD +CG+++   SA++
Sbjct: 608 VYRGDNTCGVSEMATSAVL 626


>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
 gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
          Length = 477

 Score =  281 bits (720), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 140/319 (43%), Positives = 198/319 (62%), Gaps = 9/319 (2%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           L  V H  LF+ F  +  + Y    E   RL IF  NL+ I+ L   E GS  YG+ EF+
Sbjct: 164 LDKVDH--LFHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFA 221

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSS 150
           D+++ E++ +   ++          PA++P      P+ FDWR+ +AVT VK+Q  CGS 
Sbjct: 222 DMTSTEYKERTGLWQRDEQKPTGGAPAVVPAYEGEFPKEFDWRQKNAVTPVKNQGSCGSC 281

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS TGNIEG+YA KT +L   SEQEL+DCD  D  C GG + NA+  I  K  GGLE 
Sbjct: 282 WAFSVTGNIEGLYAVKTGELKEFSEQELLDCDTTDSACNGGLMDNAYKAI--KDIGGLEY 339

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFY 269
           E  YPY    + C  N+  + V+++G+V + + +ET M ++L+ +GP+++ +NA A+QFY
Sbjct: 340 EAEYPYEAKKQQCHFNRTLSHVQVSGFVDLPKGNETAMQEWLLTHGPISIGLNANAMQFY 399

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GVSHP +  C    +NL H VLIVGYGV      HK +PYWI+KNSWG  WGE+GY+R
Sbjct: 400 RGGVSHPWKALCS--KKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGPRWGEQGYYR 457

Query: 330 LYRGDGSCGINDYVRSALV 348
           +YRGD +CG+++   SA++
Sbjct: 458 VYRGDNTCGVSEMATSAVL 476


>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
          Length = 1165

 Score =  281 bits (718), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 141/320 (44%), Positives = 200/320 (62%), Gaps = 14/320 (4%)

Query: 35   HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
            H +H  LF  F  +H++ Y + +E+  R  IF  NL KI+ L   E G+  YG+  F+D+
Sbjct: 853  HARH--LFEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGITHFADM 910

Query: 95   STAEFQAKYLGFKLKPSYADRS-----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
            ++AE++ +  G  + P   DR+        +  N+ LP +FDWRE  AV+ VK+Q  CGS
Sbjct: 911  TSAEYRQR-TGLVI-PRDEDRNHVGNPKAEIDENMELPESFDWRELGAVSPVKNQGNCGS 968

Query: 150  SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
             WAFS  GNIEG++  KTK L   SEQEL+DCD  D  C+GG + +A+  I  K+ GGLE
Sbjct: 969  CWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCDAVDSACQGGYMDDAYKAI-EKI-GGLE 1026

Query: 210  EEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
             E  YPY     K C  N     V++ G V + ++ET MA+YLV NGP+++ +NA A+QF
Sbjct: 1027 LESEYPYLAKKQKTCHFNSTEVHVRVKGAVDLPKNETAMAQYLVANGPISIGLNANAMQF 1086

Query: 269  YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
            Y  G+SHP +  C    +NL H VLIVGYGV      +K +PYWI+KNSWG  WGE+GY+
Sbjct: 1087 YRGGISHPWKPLC--SKKNLDHGVLIVGYGVKEYPMFNKTMPYWIVKNSWGPKWGEQGYY 1144

Query: 329  RLYRGDGSCGINDYVRSALV 348
            R++RGD +CG+++   SA++
Sbjct: 1145 RIFRGDNTCGVSEMASSAVL 1164


>gi|321460289|gb|EFX71333.1| hypothetical protein DAPPUDRAFT_189155 [Daphnia pulex]
          Length = 266

 Score =  280 bits (716), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 137/269 (50%), Positives = 179/269 (66%), Gaps = 7/269 (2%)

Query: 82  GSGVYGLNEFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           G+ VYG   FSD S AE++A   GF   L+ S A R   A IP I LP  FDWR +  VT
Sbjct: 2   GTAVYGDTPFSDWSAAEYKAHLAGFNPSLRQSNA-RLRQAAIPEIDLPDEFDWRNHSVVT 60

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ  CGS WAFS TGN+EG+YA +   L+SLSEQEL+DCD+ D GC GG   NA+  
Sbjct: 61  PVKDQGSCGSCWAFSVTGNVEGIYAVRNGDLLSLSEQELVDCDKLDSGCNGGLPENAYKA 120

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I     GGLE E  YPY G +  C+ N   T+V++ G V +S +ET+MA++L++NGP+++
Sbjct: 121 IHDI--GGLETESDYPYNGHENKCKFNSNITRVQVTGGVEISTNETEMAQWLIQNGPISI 178

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
            INA A+Q+Y  GVSHP +  C  G   + H VLIVGYGV +    +K +PYWI+KNSWG
Sbjct: 179 GINANAMQYYRGGVSHPWKVLCRPG--GIDHGVLIVGYGVSQYPKFNKTLPYWIVKNSWG 236

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGE+GY+R++RGDG+CG+N    SA +
Sbjct: 237 TRWGEQGYYRVFRGDGTCGLNQMCTSATL 265


>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
 gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
          Length = 475

 Score =  280 bits (716), Expect = 8e-73,   Method: Compositional matrix adjust.
 Identities = 146/310 (47%), Positives = 185/310 (59%), Gaps = 16/310 (5%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F+++H K Y+   E   R   F  N + I+ LQ  E G+ VYG  +FSD++T EF+   L
Sbjct: 175 FIDRHEKRYSNKREVLKRFRTFKKNAKAIRELQKNEQGTAVYGFTKFSDMTTMEFKQTML 234

Query: 105 GFKL-KPSYADRSVPAMIPNIT-----LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            ++  +P Y           IT     LP +FDWR+  AVT VK+Q  CGS WAFSTTGN
Sbjct: 235 PYQWEQPVYPMDQADFEKEGITISEEDLPESFDWRDKGAVTQVKNQGNCGSCWAFSTTGN 294

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG +     KLVSLSEQEL+DCD  D GC GG  SNA+  I+    GGLE E  YPY G
Sbjct: 295 VEGAWFLAKNKLVSLSEQELVDCDGVDQGCNGGLPSNAYKEIIRM--GGLEPEDAYPYDG 352

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
             + C L +K   V ING + +  DE +M K+LV  GP+++ +NA  LQFY  GV HP +
Sbjct: 353 KGETCHLVRKDIAVYINGSIELPHDEVEMQKWLVTKGPISIGLNANTLQFYRHGVVHPFK 412

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
            FC+     L+H VLIVGYG D  K      PYWI+KNSWG  WGE GYF+LYRG   CG
Sbjct: 413 IFCEPF--MLNHGVLIVGYGKDGRK------PYWIVKNSWGPTWGESGYFKLYRGKNVCG 464

Query: 339 INDYVRSALV 348
           + +   SALV
Sbjct: 465 VQEMATSALV 474


>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
          Length = 475

 Score =  278 bits (711), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 145/308 (47%), Positives = 194/308 (62%), Gaps = 11/308 (3%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+ ++NK Y++  E   RL IF  NL+  + LQ  + GS  YG+ +FSDL+  EF++
Sbjct: 177 FKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFRS 236

Query: 102 KYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            YL   L      R + PA       P ++DWR++ AV+ VK+Q MCGS WAFS TGNIE
Sbjct: 237 TYLNPLLSQWTLHRPMKPASPAKGPAPASWDWRDHGAVSSVKNQGMCGSCWAFSVTGNIE 296

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G +  K   LVSLSEQEL+DCD  D  C GG  SNA++ I  KL GGLE E  Y Y G  
Sbjct: 297 GQWFLKNGTLVSLSEQELVDCDGLDQACNGGLPSNAYEAI-EKL-GGLETETDYSYIGKK 354

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           ++C    K     IN  V +S+DE ++A +L ENGP++VA+NA+A+QFY  GVSHP++ F
Sbjct: 355 QSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIF 414

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           C+     + H+VL+VGYG        K +P+W IKNSWGE +GE+GY+ LYRG  +CGIN
Sbjct: 415 CNPW--MIDHAVLMVGYG------ERKGIPFWAIKNSWGEDYGEQGYYNLYRGSNACGIN 466

Query: 341 DYVRSALV 348
               SA+V
Sbjct: 467 KMCSSAVV 474


>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
          Length = 451

 Score =  277 bits (709), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 150/338 (44%), Positives = 204/338 (60%), Gaps = 28/338 (8%)

Query: 26  GDEKLH------------HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI 73
           GD++LH                V+  +LF  FL  +NK+YA   E   RL IF+ NL   
Sbjct: 126 GDQRLHWTSGRQAPAPAAQEDSVQLISLFKDFLTTYNKSYANATETQRRLGIFARNLELA 185

Query: 74  QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK--PSYADRSVPAMIPNITLPRAFD 131
           + +Q+ + GS  YG+ +FSDL+  EF+  YL   L   P  A R  PA       P ++D
Sbjct: 186 RKVQELDRGSAEYGVTKFSDLTEEEFRTSYLNPLLSSLPGRALRPGPAT--RGPAPASWD 243

Query: 132 WREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGG 191
           WR++ AVTGVK+Q  CGS WAFS TGN+EG +  +   L++LSEQEL+DCD  D  C GG
Sbjct: 244 WRDHGAVTGVKNQGACGSCWAFSVTGNVEGQWFLRRGALLALSEQELVDCDTLDQACGGG 303

Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYL 251
             SNA+ T + KL GGLE EK Y Y G  + C  +    +V IN  V +SRDE ++A +L
Sbjct: 304 LPSNAY-TAIEKL-GGLETEKDYSYEGRKERCSFSPDKARVYINSSVDLSRDEEELATWL 361

Query: 252 VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKA-VP 310
            ENGP+++A+NA+A+QFY  GVSHP +  C      + H+VL+VGYG       H++ +P
Sbjct: 362 AENGPVSIALNAFAMQFYRRGVSHPFRPLCS--PWFIDHAVLLVGYG-------HRSGIP 412

Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           +W IKNSWG  WGE+GY+ LYRG  +CG+N    SA+V
Sbjct: 413 FWAIKNSWGPDWGEEGYYYLYRGARACGVNAMASSAIV 450


>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
 gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
          Length = 353

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 145/315 (46%), Positives = 200/315 (63%), Gaps = 14/315 (4%)

Query: 38  HTALF-NY--FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
           H  +F NY  F++++NK+Y  + E   R  +F+ N+ +  L Q  ++ +G YG  + SDL
Sbjct: 48  HDPMFKNYLQFIKEYNKSYNNIQELNYRYQVFTKNMARAMLFQKHDNATGRYGFTKLSDL 107

Query: 95  STAEFQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           +  E ++ Y   K  P     +  A IP + +LP++FDWR   AVT VKDQ  CG+ WAF
Sbjct: 108 TDQEVKSFY-AMKKWPQQLYPTKKANIPQLNSLPQSFDWRSKGAVTAVKDQKRCGACWAF 166

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           +TTGNIEG +     KL SLSEQEL+DCD+ D+GC+GG   NA+ +IM++L GGLE EK 
Sbjct: 167 ATTGNIEGQWYLNKGKLYSLSEQELVDCDKIDEGCKGGLPLNAYHSIMNRL-GGLETEKD 225

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
           YPY   +  C+LNK    V IN  V VS +ETD+A +LV +GP+A+ IN+  +  Y  G+
Sbjct: 226 YPYVAKNGKCKLNKSEEVVYINSSVKVSTNETDLAAWLVAHGPVAIGINSVNMLHYKGGI 285

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
           +HP    C+   + L H VLIVGYG +      K+ PYWIIKNSWG  WGEKGY+R+ RG
Sbjct: 286 AHPTNKDCNP--KLLDHGVLIVGYGEE------KSTPYWIIKNSWGTDWGEKGYYRVVRG 337

Query: 334 DGSCGINDYVRSALV 348
            G+CG+N    SA+V
Sbjct: 338 IGACGLNKSATSAIV 352


>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
          Length = 475

 Score =  275 bits (704), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 144/308 (46%), Positives = 194/308 (62%), Gaps = 11/308 (3%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+ ++NK Y++  E   RL IF  NL+  + LQ  + GS  YG+ +FSDL+  EF++
Sbjct: 177 FKEFMVKYNKVYSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFRS 236

Query: 102 KYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            YL   L      R + PA       P ++DWR++ AV+ VK+Q MCGS WAFS TGNIE
Sbjct: 237 TYLNPLLSQWTLHRPMKPASPAKGPAPASWDWRDHGAVSSVKNQGMCGSCWAFSVTGNIE 296

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G +  K   LVSLSEQEL+DCD  D  C GG  SNA++ I  KL GGLE E  Y Y G  
Sbjct: 297 GQWFLKNGTLVSLSEQELVDCDGLDQACNGGLPSNAYEAI-EKL-GGLETETDYSYIGKK 354

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           ++C    K     IN  V +S+DE ++A +L ENGP++VA+NA+A+QFY  GVSHP++ F
Sbjct: 355 QSCDFATKKVAAYINSSVELSKDEKEIAAWLAENGPVSVALNAFAMQFYRKGVSHPLKIF 414

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           C+     + H+VL+VGYG        K +P+W IKNSWGE +GE+GY+ L+RG  +CGIN
Sbjct: 415 CNPW--MIDHAVLMVGYG------ERKGIPFWAIKNSWGEDYGEQGYYYLHRGSNACGIN 466

Query: 341 DYVRSALV 348
               SA+V
Sbjct: 467 KMCSSAVV 474


>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
          Length = 567

 Score =  271 bits (693), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 146/330 (44%), Positives = 199/330 (60%), Gaps = 19/330 (5%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           SS   +GD        V+  +LF  FL  +NK+YA   E   RL IF+ NL     LQ+ 
Sbjct: 255 SSLPRMGDS-------VELISLFKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQEL 307

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAV 138
           + GS  YG+ +FSDL+  EF+  YL   L  S   R++ PA       P ++DWR++ A+
Sbjct: 308 DQGSAQYGVTKFSDLTEEEFRMFYLNPLLS-SLPGRALRPAPRARGPAPASWDWRDHGAL 366

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFD 198
           T  K+Q MCGS WAFS TGN+EG +  +   L++LSEQEL+DCD  D  C GG  SNA+ 
Sbjct: 367 TAAKNQGMCGSCWAFSVTGNVEGQWFLRRGALLTLSEQELVDCDTLDQACGGGLPSNAYT 426

Query: 199 TIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMA 258
            I +   GGLE EK Y Y G  + C  +    +  IN  V +SRDE ++A +L ENGP++
Sbjct: 427 AIETL--GGLETEKDYSYEGRKERCSFSPDKARAYINSSVDLSRDEQEIAAWLAENGPVS 484

Query: 259 VAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           +A+NA+A+QFY  GVSHP +  C      + H+VL+VGYG DR+      +P+W IKNSW
Sbjct: 485 IALNAFAMQFYRRGVSHPFRPLCS--PWFIDHAVLLVGYG-DRS-----GIPFWAIKNSW 536

Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           G  WGE+GY+ LYRG  +CG+N    SA+V
Sbjct: 537 GPDWGEEGYYYLYRGARACGMNTMASSAIV 566


>gi|443696723|gb|ELT97360.1| hypothetical protein CAPTEDRAFT_147978 [Capitella teleta]
          Length = 274

 Score =  271 bits (693), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 136/279 (48%), Positives = 175/279 (62%), Gaps = 6/279 (2%)

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRA 129
           + K + +Q+ E G   YG + F+DL+  EF+  YL      ++     PA IP  T P A
Sbjct: 1   MIKARRIQEKEQGDATYGASPFADLTAEEFRKNYLSPVWNVTHDPFLKPASIPIETPPDA 60

Query: 130 FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCE 189
           FDWR++DAVT VK+Q  CGS WAFS TGN+EG +A + KKL+SLSEQEL+DCD+ D GC 
Sbjct: 61  FDWRDHDAVTPVKNQGSCGSCWAFSVTGNVEGQWAIQKKKLLSLSEQELVDCDKVDLGCN 120

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAK 249
           GG    A+  IM    GGLE EK YPY G    C   K   +V I G V++S +E DM  
Sbjct: 121 GGLPLQAYKEIMRI--GGLETEKDYPYEGKGDKCVFEKAEVEVNITGAVNISSNEDDMKA 178

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
           +L +NGP+++ +NA A+QFY+ GVSHP  F C     +L H VLI GYG+ +   +    
Sbjct: 179 WLWKNGPISIGLNANAMQFYMGGVSHPFSFLCS--PSSLDHGVLITGYGIKQGWMSDS-- 234

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           P+W IKNSWGE WGEKGY+ LYRG G CG+N    SA V
Sbjct: 235 PFWAIKNSWGESWGEKGYYLLYRGAGVCGVNQMPTSATV 273


>gi|21218381|gb|AAM44058.1|AF510740_1 cathepsin L1 [Schistosoma japonicum]
          Length = 317

 Score =  271 bits (692), Expect = 5e-70,   Method: Compositional matrix adjust.
 Identities = 140/287 (48%), Positives = 180/287 (62%), Gaps = 9/287 (3%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
           R  IF  NL K QL Q  E GS VYG+  +SDL+T EF   +L    + S    ++P   
Sbjct: 39  RFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTTDEFSRTHLTAPWRASSKRNTIPPRR 98

Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
               +P  FDWRE  AVT VK+Q MCGS WAFSTTGNIE  +  KT KL+SLSEQ+L+DC
Sbjct: 99  EVGDIPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQLVDC 158

Query: 182 DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
           D  DDGC GG  SNA+++I+    GGL  E  YPY   ++ C L        IN  V+++
Sbjct: 159 DSLDDGCNGGLPSNAYESIIRM--GGLMLEDNYPYDAKNEKCHLKVGNVAAYINSSVNLT 216

Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           +DE+++A +L  +  ++V +NA  LQFY  G+SHP   FC      L H+VL+VGYGV  
Sbjct: 217 QDESELAIWLYHHSAISVGMNALLLQFYRHGISHPWWIFCS--KYLLDHAVLLVGYGV-- 272

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
              + K  P+WI+KNSWG  WGEKGYFR+YRGDG+CGIN    SAL+
Sbjct: 273 ---SEKNEPFWIVKNSWGVEWGEKGYFRMYRGDGTCGINTGATSALI 316


>gi|256077197|ref|XP_002574894.1| cathepsin F (C01 family) [Schistosoma mansoni]
 gi|353230780|emb|CCD77197.1| cathepsin F (C01 family) [Schistosoma mansoni]
          Length = 419

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 139/289 (48%), Positives = 184/289 (63%), Gaps = 11/289 (3%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAM 120
           R +IF  N+ K QL Q  E GS +YG+  +SDL+T EF   +L    + PS    +  ++
Sbjct: 139 RFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSL 198

Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
              +  +P+ FDWRE  AVT VK+Q MCGS WAFSTTGN+E  +  KT KL+SLSEQ+L+
Sbjct: 199 GKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLV 258

Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
           DCD  DDGC GG  SNA+++I+    GGL  E  YPY   ++ C L      V IN  V+
Sbjct: 259 DCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVN 316

Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
           +++DET++A +L  N  ++V +NA  LQFY  G+SHP   FC      L H+VL+VGYGV
Sbjct: 317 LTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPWWIFCS--KYLLDHAVLLVGYGV 374

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                + K  P+WI+KNSWG  WGE GYFR+YRGDG+CGIN    SAL+
Sbjct: 375 -----SEKNEPFWIVKNSWGVEWGENGYFRMYRGDGTCGINTVATSALI 418


>gi|256077193|ref|XP_002574892.1| cathepsin F (C01 family) [Schistosoma mansoni]
 gi|353230781|emb|CCD77198.1| cathepsin F (C01 family) [Schistosoma mansoni]
          Length = 457

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 139/289 (48%), Positives = 184/289 (63%), Gaps = 11/289 (3%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAM 120
           R +IF  N+ K QL Q  E GS +YG+  +SDL+T EF   +L    + PS    +  ++
Sbjct: 177 RFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSL 236

Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
              +  +P+ FDWRE  AVT VK+Q MCGS WAFSTTGN+E  +  KT KL+SLSEQ+L+
Sbjct: 237 GKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLV 296

Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
           DCD  DDGC GG  SNA+++I+    GGL  E  YPY   ++ C L      V IN  V+
Sbjct: 297 DCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVN 354

Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
           +++DET++A +L  N  ++V +NA  LQFY  G+SHP   FC      L H+VL+VGYGV
Sbjct: 355 LTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPWWIFC--SKYLLDHAVLLVGYGV 412

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                + K  P+WI+KNSWG  WGE GYFR+YRGDG+CGIN    SAL+
Sbjct: 413 -----SEKNEPFWIVKNSWGVEWGENGYFRMYRGDGTCGINTVATSALI 456


>gi|256077195|ref|XP_002574893.1| cathepsin F (C01 family) [Schistosoma mansoni]
 gi|353230782|emb|CCD77199.1| cathepsin F (C01 family) [Schistosoma mansoni]
          Length = 456

 Score =  269 bits (688), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 139/289 (48%), Positives = 184/289 (63%), Gaps = 11/289 (3%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAM 120
           R +IF  N+ K QL Q  E GS +YG+  +SDL+T EF   +L    + PS    +  ++
Sbjct: 176 RFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSL 235

Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
              +  +P+ FDWRE  AVT VK+Q MCGS WAFSTTGN+E  +  KT KL+SLSEQ+L+
Sbjct: 236 GKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLV 295

Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
           DCD  DDGC GG  SNA+++I+    GGL  E  YPY   ++ C L      V IN  V+
Sbjct: 296 DCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVN 353

Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
           +++DET++A +L  N  ++V +NA  LQFY  G+SHP   FC      L H+VL+VGYGV
Sbjct: 354 LTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPWWIFCS--KYLLDHAVLLVGYGV 411

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                + K  P+WI+KNSWG  WGE GYFR+YRGDG+CGIN    SAL+
Sbjct: 412 -----SEKNEPFWIVKNSWGVEWGENGYFRMYRGDGTCGINTVATSALI 455


>gi|56755191|gb|AAW25775.1| SJCHGC00511 protein [Schistosoma japonicum]
          Length = 454

 Score =  268 bits (686), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 139/287 (48%), Positives = 179/287 (62%), Gaps = 9/287 (3%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
           R  IF  NL K QL Q  E GS VYG+  +SDL+T EF   +L    + S    ++    
Sbjct: 176 RFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTTDEFSRTHLTAPWRASSKRNTISPRR 235

Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
               +P  FDWRE  AVT VK+Q MCGS WAFSTTGNIE  +  KT KL+SLSEQ+L+DC
Sbjct: 236 EVGDIPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQLVDC 295

Query: 182 DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
           D  DDGC GG  SNA+++I+    GGL  E  YPY   ++ C L        IN  V+++
Sbjct: 296 DSLDDGCNGGLPSNAYESIIRM--GGLMLEDNYPYDAKNEKCHLKVANVAAYINSSVNLT 353

Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           +DE+++A +L  +  ++V +NA  LQFY  G+SHP   FC      L H+VL+VGYGV  
Sbjct: 354 QDESELAIWLYHHSAISVGMNALLLQFYRHGISHPWWIFC--SKYLLDHAVLLVGYGV-- 409

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
              + K  P+WI+KNSWG  WGEKGYFR+YRGDG+CGIN    SAL+
Sbjct: 410 ---SEKNEPFWIVKNSWGVEWGEKGYFRMYRGDGTCGINTDATSALI 453


>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
          Length = 475

 Score =  268 bits (684), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 139/308 (45%), Positives = 190/308 (61%), Gaps = 11/308 (3%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+ ++NK Y++  E   RL IF  NL+  + LQ  + GS  YG+ +FSDL+  EF++
Sbjct: 177 FKEFMTKYNKVYSSQEEVDRRLRIFHENLKTAEKLQALDQGSAEYGVTKFSDLTEEEFRS 236

Query: 102 KYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            YL   L      + + PA       P ++DWR++ AV+ VK+Q MCGS WAFS  GNIE
Sbjct: 237 TYLNPLLSQWTLHQPMKPATPAKGPSPDSWDWRDHGAVSPVKNQGMCGSCWAFSVIGNIE 296

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G +  K   L+SLSEQEL+DCD  D  C GG  SNA++ I  KL GGLE E  Y Y G  
Sbjct: 297 GQWFLKNGTLLSLSEQELVDCDGLDQACRGGLPSNAYEAI-EKL-GGLETESDYSYTGHK 354

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           + C          IN  V + +DE ++A +L ENGP++VA+NA+A+QFY  G+SHP++ F
Sbjct: 355 QRCDFTTGKVAAYINSSVELPKDEKEIAAWLAENGPVSVALNAFAMQFYRKGISHPLKIF 414

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           C+     + H+VL+VGYG        K +P+W IKNSWGE +GE+GY+ LYRG  +CGIN
Sbjct: 415 CNPW--MIDHAVLLVGYG------ERKGIPFWAIKNSWGEDYGEQGYYYLYRGSNACGIN 466

Query: 341 DYVRSALV 348
               SA+V
Sbjct: 467 KMCSSAVV 474


>gi|226468424|emb|CAX69889.1| Temporarily Assigned Gene name [Schistosoma japonicum]
          Length = 454

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 138/287 (48%), Positives = 179/287 (62%), Gaps = 9/287 (3%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
           R  IF  NL K QL Q  E GS VYG+  +SDL+T EF   +L    + S    ++    
Sbjct: 176 RFSIFKSNLLKAQLYQVLERGSAVYGVTPYSDLTTDEFSRTHLTAPWRASSKRNTISPRR 235

Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
               +P  FDWR+  AVT VK+Q MCGS WAFSTTGNIE  +  KT KL+SLSEQ+L+DC
Sbjct: 236 EVGDIPNNFDWRKKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQLVDC 295

Query: 182 DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
           D  DDGC GG  SNA+++I+    GGL  E  YPY   ++ C L        IN  V+++
Sbjct: 296 DNLDDGCNGGLPSNAYESIIRM--GGLMLEDNYPYDAKNEKCHLKVANVAAYINSSVNLT 353

Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           +DE+++A +L  +  ++V +NA  LQFY  G+SHP   FC      L H+VL+VGYGV  
Sbjct: 354 QDESELAIWLYHHSAISVGMNALLLQFYRHGISHPWWIFC--SKYLLDHAVLLVGYGV-- 409

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
              + K  P+WI+KNSWG  WGEKGYFR+YRGDG+CGIN    SAL+
Sbjct: 410 ---SEKNEPFWIVKNSWGVEWGEKGYFRMYRGDGTCGINTDATSALI 453


>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
          Length = 371

 Score =  267 bits (682), Expect = 7e-69,   Method: Compositional matrix adjust.
 Identities = 145/318 (45%), Positives = 197/318 (61%), Gaps = 16/318 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  FL +H K Y+   E +SR   F  NL++I+     E GS  YG+ EF+DLS  EF+ 
Sbjct: 50  FENFLLEHPKMYSEQ-ESHSRFQTFWENLKRIKFHNHIEQGSAKYGVTEFADLSDFEFRR 108

Query: 102 KYLGFKLKPSYADR---------SVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            YLG K +    +R         S   +    T+   FDW E  AVT VK+Q MCGS WA
Sbjct: 109 HYLGLKPELKIPNRKKYERKSRNSSKKLKFAKTVDETFDWVEKGAVTEVKNQGMCGSCWA 168

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FSTTGNIEG +   T  LVSLSEQEL+DCDQ+D GC GG +  AF+ ++    GGLE E+
Sbjct: 169 FSTTGNIEGAWFKATGDLVSLSEQELVDCDQKDSGCNGGLMDQAFEEVIRI--GGLETEQ 226

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
            YPY G  + C   K  ++V+I+ ++ +  DE ++A+ L E+GP+++AINA+ +QFY  G
Sbjct: 227 QYPYDGVQETCNFEKSLSKVQIDDFMDIGEDEEEIAEALEEHGPLSIAINAFGMQFYRGG 286

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVD-RTKFTHK-AVPYWIIKNSWGEGWGEKGYFRL 330
           +SHP+ F C    + L H VL+VGYGV+  T + H+   PYW IKNSWG  WGE GY+R+
Sbjct: 287 ISHPLSFLC--SQDGLDHGVLMVGYGVEHHTTWRHRHPRPYWKIKNSWGPRWGEDGYYRV 344

Query: 331 YRGDGSCGINDYVRSALV 348
            RG G CG+N  V +++V
Sbjct: 345 ARGKGVCGVNKMVSTSIV 362


>gi|3023456|sp|Q26534.1|CATL_SCHMA RecName: Full=Cathepsin L; AltName: Full=SMCL1; Flags: Precursor
 gi|555663|gb|AAC46485.1| preprocathepsin L [Schistosoma mansoni]
 gi|1094710|prf||2106314A cathepsin L
          Length = 319

 Score =  266 bits (680), Expect = 9e-69,   Method: Compositional matrix adjust.
 Identities = 138/289 (47%), Positives = 183/289 (63%), Gaps = 11/289 (3%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAM 120
           R +IF  N+ K QL Q    GS +YG+  +SDL+T EF   +L    + PS    +  ++
Sbjct: 39  RFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTTDEFARTHLTASWVVPSSRSNTPTSL 98

Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
              +  +P+ FDWRE  AVT VK+Q MCGS WAFSTTGN+E  +  KT KL+SLSEQ+L+
Sbjct: 99  GKEVNNIPKNFDWREKGAVTEVKNQGMCGSCWAFSTTGNVESQWFRKTGKLLSLSEQQLV 158

Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
           DCD  DDGC GG  SNA+++I+    GGL  E  YPY   ++ C L      V IN  V+
Sbjct: 159 DCDGLDDGCNGGLPSNAYESIIKM--GGLMLEDNYPYDAKNEKCHLKTDGVAVYINSSVN 216

Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
           +++DET++A +L  N  ++V +NA  LQFY  G+SHP   FC      L H+VL+VGYGV
Sbjct: 217 LTQDETELAAWLYHNSTISVGMNALLLQFYQHGISHPWWIFCS--KYLLDHAVLLVGYGV 274

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                + K  P+WI+KNSWG  WGE GYFR+YRGDGSCGIN    SA++
Sbjct: 275 -----SEKNEPFWIVKNSWGVEWGENGYFRMYRGDGSCGINTVATSAMI 318


>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
          Length = 473

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 141/314 (44%), Positives = 195/314 (62%), Gaps = 11/314 (3%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           V+   +F  F+  +N+TY++  E   RL IF  N++  Q LQ  E GS  YG+ +FSDL+
Sbjct: 169 VELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLT 228

Query: 96  TAEFQAKYLGFKLKP-SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
             EF+  YL   L   S      PA+  +   P  +DWR++ AV+ VK+Q MCGS WAFS
Sbjct: 229 EDEFRMMYLNPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFS 288

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
            TGNIEG +  KT +L+SLSEQEL+DCD+ D  C GG  SNA++ I +   GGLE E  Y
Sbjct: 289 VTGNIEGQWFKKTGQLLSLSEQELVDCDKLDQACGGGLPSNAYEAIENL--GGLETETDY 346

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
            Y G  ++C  +       IN  V + +DE ++A +L ENGP++ A+NA+A+QFY  GVS
Sbjct: 347 SYTGHKQSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVS 406

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
           HP++ FC+     + H+VL+VG+G          VP+W IKNSWGE +GE+GY+ LYRG 
Sbjct: 407 HPLKIFCNPW--MIDHAVLLVGFG------QRNGVPFWAIKNSWGEDYGEQGYYYLYRGS 458

Query: 335 GSCGINDYVRSALV 348
           G CGI+    SA+V
Sbjct: 459 GLCGIHKMCSSAIV 472


>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
 gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
          Length = 473

 Score =  266 bits (679), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 141/314 (44%), Positives = 195/314 (62%), Gaps = 11/314 (3%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           V+   +F  F+  +N+TY++  E   RL IF  N++  Q LQ  E GS  YG+ +FSDL+
Sbjct: 169 VELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKFSDLT 228

Query: 96  TAEFQAKYLGFKLKP-SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
             EF+  YL   L   S      PA+  +   P  +DWR++ AV+ VK+Q MCGS WAFS
Sbjct: 229 EDEFRMMYLNPMLSQWSLKKEMKPAIPASAPAPDTWDWRDHGAVSPVKNQGMCGSCWAFS 288

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
            TGNIEG +  KT +L+SLSEQEL+DCD+ D  C GG  SNA++ I +   GGLE E  Y
Sbjct: 289 VTGNIEGQWFKKTGQLLSLSEQELVDCDKLDQACGGGLPSNAYEAIENL--GGLETETDY 346

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
            Y G  ++C  +       IN  V + +DE ++A +L ENGP++ A+NA+A+QFY  GVS
Sbjct: 347 SYTGHKQSCDFSTGKVAAYINSSVELPKDEKEIAAFLAENGPVSAALNAFAMQFYRKGVS 406

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
           HP++ FC+     + H+VL+VG+G          VP+W IKNSWGE +GE+GY+ LYRG 
Sbjct: 407 HPLKIFCNPW--MIDHAVLLVGFG------QRNGVPFWAIKNSWGEDYGEQGYYYLYRGS 458

Query: 335 GSCGINDYVRSALV 348
           G CGI+    SA+V
Sbjct: 459 GLCGIHKMCSSAIV 472


>gi|223049408|gb|ACM80348.1| cysteine proteinase [Solanum lycopersicum]
          Length = 368

 Score =  265 bits (678), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 148/335 (44%), Positives = 202/335 (60%), Gaps = 26/335 (7%)

Query: 24  VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS 83
           VVGDE  HH+ + +H   F  F ++  KTYA+  E++ R  +F  NLR+    Q  +  S
Sbjct: 39  VVGDED-HHMLNAEHH--FTLFKKRFGKTYASDEEHHYRFSVFKANLRRAMRHQKLD-PS 94

Query: 84  GVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
            V+G+ +FSD++  EF  K+LG   +   PS A+++   ++P   LP  FDWRE+ AVT 
Sbjct: 95  AVHGVTQFSDMTPDEFSQKFLGVNRRLRFPSDANKA--PILPTEDLPSDFDWREHGAVTP 152

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
           VK+Q  CGS W+FSTTG +EG     T KLVSLSEQ+L+DCD E         D GC GG
Sbjct: 153 VKNQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEKDSCDSGCSGG 212

Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKY 250
            +++AF+  +    GGL  E+ YPY G DKA C+ +      K+  +  VS DE  +A  
Sbjct: 213 LMNSAFEYTLK--AGGLMREEDYPYTGTDKATCKFDNTKVAAKVANFSVVSLDEEQIAAN 270

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVP 310
           LV+NGP+AVAINA  +Q YV GVS P  + C   ++ L H VL+VGYG   +    K  P
Sbjct: 271 LVKNGPLAVAINAVFMQTYVGGVSCP--YIC---SKQLDHGVLLVGYGTGFSPIRMKEKP 325

Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           YWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 326 YWIIKNSWGEKWGESGYYKIRRGRNVCGVDSMVST 360


>gi|313220237|emb|CBY31096.1| unnamed protein product [Oikopleura dioica]
          Length = 371

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 145/318 (45%), Positives = 196/318 (61%), Gaps = 16/318 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  FL +H K Y+   E +SR   F  NL++I+     E GS  YG+ EF+DLS  EF+ 
Sbjct: 50  FENFLLEHPKMYSEQ-ESHSRFQTFWENLKRIKFHNHIEQGSAKYGVTEFTDLSDFEFRR 108

Query: 102 KYLGFKLKPSYADR---------SVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            YLG K +    +R         S   +    T    FDW E  AVT VK+Q MCGS WA
Sbjct: 109 HYLGLKPELKNLNRKKYERKSRNSSKKLKFAKTADETFDWVEKGAVTEVKNQGMCGSCWA 168

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FSTTGNIEG +   T  L+SLSEQEL+DCDQ+D GC GG +  AF+ ++    GGLE E+
Sbjct: 169 FSTTGNIEGAWFKATGDLISLSEQELVDCDQKDSGCNGGLMDQAFEEVIRI--GGLETEQ 226

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
            YPY G  + C   K  ++V+I+ ++ +  DE ++A+ L E+GP+++AINA+ +QFY  G
Sbjct: 227 QYPYDGVQETCNFEKSLSKVQIDDFMDIGEDEEEIAEALEEHGPLSIAINAFGMQFYRGG 286

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVD-RTKFTHK-AVPYWIIKNSWGEGWGEKGYFRL 330
           VSHP+ F C    + L H VL+VGYGV+  T + H+   PYW IKNSWG  WGE GY+R+
Sbjct: 287 VSHPLSFLC--SPDGLDHGVLMVGYGVEHHTTWRHRHPRPYWKIKNSWGPRWGEDGYYRV 344

Query: 331 YRGDGSCGINDYVRSALV 348
            RG G CG+N  V +++V
Sbjct: 345 ARGKGVCGVNKMVSTSIV 362


>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
          Length = 325

 Score =  265 bits (676), Expect = 3e-68,   Method: Compositional matrix adjust.
 Identities = 145/349 (41%), Positives = 195/349 (55%), Gaps = 31/349 (8%)

Query: 1   MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
           +SC  F  G      TV V                     L+  F   + K+YA   +  
Sbjct: 6   VSCLTFLVGCVFAVSTVQVPD---------------SARELYEQFKRDYGKSYAN-DDDE 49

Query: 61  SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM 120
            R  IF  NL + Q  Q  E G+  YG+ +FSDL+  EF AK+L  +    + D+     
Sbjct: 50  KRFAIFKDNLVRAQNYQLQEQGTARYGVTQFSDLTPEEFAAKFLSSR----FDDQVERVQ 105

Query: 121 IPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
           + ++   P + DWRE  AV  V+DQ  CGS WAFS  GN+EG +  KT +LVSLS+Q+L+
Sbjct: 106 LNDLKAAPESVDWRELGAVAPVEDQGSCGSCWAFSVAGNVEGQWFLKTGQLVSLSKQQLV 165

Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
           DCD +D GC+GG     +  I+    GGLE ++ YPY G ++ C+L++     KIN  + 
Sbjct: 166 DCDVQDSGCDGGYPPTTYGEIIRM--GGLEAQRDYPYVGREQPCKLDESKLLAKINSSIV 223

Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
           +  +E   A Y+ E+GPM+  INA  LQFY +G+SHP +  C    + L+H VL VGYG 
Sbjct: 224 LEANEKKQAAYIAEHGPMSSGINAVTLQFYQSGISHPSKSQCQ--PDWLNHGVLSVGYG- 280

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                T   VPYWIIKNSWG GWGEKGYFRLYRGDG+CGI   V SA++
Sbjct: 281 -----TEDGVPYWIIKNSWGTGWGEKGYFRLYRGDGTCGIEKVVSSAII 324


>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
          Length = 458

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 135/309 (43%), Positives = 189/309 (61%), Gaps = 10/309 (3%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ++F  F+  +N+TY T  E   R+ +F  N+ + Q +Q  + G+  YG+ +FSDL+  EF
Sbjct: 159 SIFKEFVITYNRTYETKEEAQWRMSVFINNMMRAQKIQALDRGTARYGVTKFSDLTEEEF 218

Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
           +  YL   LK   + R   AM  +   P  +DWR   AVT VKDQ MCGS WAFS TGN+
Sbjct: 219 RTIYLNPLLKELRSKRMPLAMSVSGPAPPEWDWRNKGAVTKVKDQGMCGSCWAFSVTGNV 278

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           EG +  K   L+SLSEQEL+DCD+ D  C GG  SNA+  I  K  GGLE E  Y Y G 
Sbjct: 279 EGQWFLKRGDLLSLSEQELVDCDKLDKACLGGLPSNAYSAI--KTLGGLETEDDYGYNGH 336

Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
            + C  + +  +V IN  V +S++E  +A +L +NGP+++AINA+ +QFY  G+SHP++ 
Sbjct: 337 LQTCNFSAEKAKVYINDSVELSQNEQKLAAWLAKNGPISIAINAFGMQFYRHGISHPLRP 396

Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
            C      + H+VL+VGYG          +P+W IKNSWG  WGE+GY+ L+RG G+CG+
Sbjct: 397 LCSPW--LIDHAVLLVGYG------NRSDIPFWAIKNSWGTDWGEEGYYYLHRGSGACGV 448

Query: 340 NDYVRSALV 348
           N    SA+V
Sbjct: 449 NIMASSAVV 457


>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
 gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
          Length = 366

 Score =  264 bits (674), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 133/307 (43%), Positives = 185/307 (60%), Gaps = 7/307 (2%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+ + NK Y T      + +IF  N+   + LQ+ E G+ +YG   F+D++  EF+ 
Sbjct: 66  FKQFMVEFNKWYETEKLTAEKYNIFKSNMVIAKRLQEEEQGTAIYGPTIFADMTPEEFRK 125

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
            +L F        + + A IP   +    DWR+++AVT VKDQ  CGS WAF T  NIEG
Sbjct: 126 THLNFNPNNVKKPKRM-ANIPKSNISERMDWRKFNAVTSVKDQGNCGSCWAFCTVANIEG 184

Query: 162 VYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
            +A KT +L+SLSEQ+L+DCD+ DDGCEGG   NA+  I+    GGLE+E+ Y Y     
Sbjct: 185 AWAVKTAQLISLSEQQLVDCDRLDDGCEGGLPVNAYLEIIRL--GGLEKEEDYKYTARSG 242

Query: 222 ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFC 281
            C+ N   + V IN  V +  DE  +A+Y+ ENGP+AV +NA A+ FY +G++HP +  C
Sbjct: 243 KCKFNHTKSAVYINDTVVLPEDEDAIARYVSENGPVAVGLNADAMMFYRSGIAHPSRLMC 302

Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIND 341
               + ++H V IVGY V  + F   + PYWIIKNSWG  WGEKGY+ LYRG G CGI+ 
Sbjct: 303 SP--DGINHGVTIVGYDVKESLFW--STPYWIIKNSWGPNWGEKGYYYLYRGKGVCGIDQ 358

Query: 342 YVRSALV 348
              S ++
Sbjct: 359 MASSVVI 365


>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
          Length = 459

 Score =  263 bits (672), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 138/332 (41%), Positives = 192/332 (57%), Gaps = 10/332 (3%)

Query: 17  VSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
            S S F ++  + L     +K  +LF +F+  +N+TY T  E   R+ IF  N+ + Q +
Sbjct: 137 TSSSFFPLLNKDPLPQNFSMKMVSLFKHFIATYNRTYETEEEAQWRMSIFINNMVRAQEI 196

Query: 77  QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
           Q  + G+  YG+ +FSDL+  EF+  YL   LK     +   A   +   P  +DWR   
Sbjct: 197 QALDRGTAQYGVTKFSDLTEEEFRTFYLNPLLKEGLGKKMRLAKPVDDPAPPEWDWRNKG 256

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
           AVT VK+Q MCGS WAFS TGN+EG +  K   L+SLSEQEL+DCD  D  C GG  SNA
Sbjct: 257 AVTKVKNQGMCGSCWAFSVTGNVEGQWFLKQGDLLSLSEQELVDCDTLDKACMGGLPSNA 316

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
           +  I  K  GGLE E  Y Y G  + C    +  +V IN  V +S+DE  +A +L + GP
Sbjct: 317 YSAI--KTLGGLETEDDYSYHGHLQTCSFTAEKVKVYINDSVELSKDEQKLAAWLAKKGP 374

Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
           +++AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKN
Sbjct: 375 ISIAINAFGMQFYRRGISRPLRLLCSPW--FIDHAVLLVGYG------NRSDVPFWAIKN 426

Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           SWG  WGE+GY+ L+RG  +CG+N    SA+V
Sbjct: 427 SWGTDWGEEGYYYLHRGSRACGVNVMASSAVV 458


>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
          Length = 410

 Score =  263 bits (671), Expect = 1e-67,   Method: Compositional matrix adjust.
 Identities = 136/330 (41%), Positives = 195/330 (59%), Gaps = 11/330 (3%)

Query: 20  SSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD 78
           S  + + D + L    +++  +LF YF+  +N+TY T  E   R+ +F  N+ + Q +Q 
Sbjct: 90  SPLLPLSDRDPLPQDFYLRMASLFKYFITTYNRTYETEEEAQWRMSVFINNMIRAQKIQA 149

Query: 79  TEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
            + G+  YG+ +FSDL+  EF+  YL   LK     +           P  +DWR+  AV
Sbjct: 150 LDRGTAQYGVTKFSDLTEEEFRTMYLNPLLKEELGKKMRLVKFVGDPAPPEWDWRKKGAV 209

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFD 198
           T VK+Q MCGS WAFS TGN+EG +  K   L+SLSEQEL+DCD+ D  C GG  SNA+ 
Sbjct: 210 TKVKNQGMCGSCWAFSVTGNVEGQWFLKRGDLLSLSEQELVDCDKVDKACMGGLPSNAYS 269

Query: 199 TIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMA 258
            I  K  GGLE E  Y Y G  + C  + +  +V IN  V +S +E ++A +L +NGP++
Sbjct: 270 AI--KTLGGLETEDDYSYSGHLQTCSFSAQKAKVYINDSVELSHNEQELAAWLAKNGPIS 327

Query: 259 VAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           +AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSW
Sbjct: 328 IAINAFGMQFYRHGISRPLRPLCS--RWFIDHAVLLVGYG------NRSDVPFWAIKNSW 379

Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           G  WGE+GY+ L+RG G+CG+N    SA+V
Sbjct: 380 GTDWGEEGYYYLHRGSGACGVNVMASSAVV 409


>gi|2414683|emb|CAB16316.1| cysteine proteinase precursor [Vicia sativa]
          Length = 379

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 139/318 (43%), Positives = 190/318 (59%), Gaps = 18/318 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F++ ++K Y+T  EY  RL IF+ N+ K    Q  +  + ++G+ +FSDLS  EF+ 
Sbjct: 55  FKLFMKDYSKKYSTTEEYLLRLGIFAKNMVKAAEHQALDP-TAIHGVTQFSDLSEEEFER 113

Query: 102 KYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
            Y GFK     S A   V   +     P  FDWRE  AVTG+K Q  CGS WAF+TTG+I
Sbjct: 114 FYTGFKGGFPSSNAAGGVAPPLDVKGFPENFDWREKGAVTGIKTQGKCGSCWAFTTTGSI 173

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE--------DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           EG     T KLVSLSEQ+L+DCD +        D+GC GG ++ A+D +M    GGLEEE
Sbjct: 174 EGANFLATGKLVSLSEQQLVDCDNKCDITKTSCDNGCNGGLMTTAYDYLME--AGGLEEE 231

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            +YPY G    C+ +     V+++ + ++  DE  +A YLV +GP+A+A+NA  +Q YV 
Sbjct: 232 TSYPYTGAQGECKFDPNKVAVRVSNFTNIPADENQIAAYLVNHGPLAIAVNAVFMQTYVG 291

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH-KAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P+   C      L+H VL+VGY  +       +  PYW IKNSWGE WGEKGY++L
Sbjct: 292 GVSCPL--ICS--KRRLNHGVLLVGYNAEGFSILRLRKKPYWTIKNSWGEQWGEKGYYKL 347

Query: 331 YRGDGSCGINDYVRSALV 348
            RG G CG+N  V +A+V
Sbjct: 348 CRGHGMCGMNTMVSAAMV 365


>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
          Length = 367

 Score =  260 bits (664), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 142/324 (43%), Positives = 193/324 (59%), Gaps = 20/324 (6%)

Query: 32  HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
           HL + +H   F  F  +  K YAT  E+  R  +F  NLR+ +L    +  S V+G+ +F
Sbjct: 45  HLLNAEHH--FASFKAKFGKKYATKEEHDRRFGVFKSNLRRARLHAKLDP-SAVHGVTKF 101

Query: 92  SDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
           SDL+ AEF+ ++LGFK     A+     ++P   LP+ FDWR+  AVT VKDQ  CGS W
Sbjct: 102 SDLTPAEFRRQFLGFKPLRLPANAQKAPILPTKDLPKDFDWRDKGAVTNVKDQGACGSCW 161

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMS 202
           +FSTTG +EG +   T +LVSLSEQ+L+DCD           D GC GG ++NAF+ I+ 
Sbjct: 162 SFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQ 221

Query: 203 KLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
              GG+++EK YPY G D  C+ +K      ++ Y  VS DE  +A  LV+NGP+AV IN
Sbjct: 222 S--GGVQKEKDYPYTGRDGTCKFDKTKVAATVSNYSVVSLDEDQIAANLVKNGPLAVGIN 279

Query: 263 AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEG 321
           A  +Q Y+ GVS P  + C    ++L H VLIVGYG         K  PYWIIKNSWGE 
Sbjct: 280 AVFMQTYIGGVSCP--YIC---GKHLDHGVLIVGYGEGAYAPIRFKNKPYWIIKNSWGES 334

Query: 322 WGEKGYFRLYRGDGSCGINDYVRS 345
           WGE GY+++ RG   CG++  V +
Sbjct: 335 WGENGYYKICRGRNVCGVDSMVST 358


>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
          Length = 325

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 149/348 (42%), Positives = 193/348 (55%), Gaps = 29/348 (8%)

Query: 1   MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
           +SC  F  G A    TV V                     L+  F   + K YA   +  
Sbjct: 6   VSCLAFLVGCAFAVSTVPVPD---------------NARELYEQFKRDYGKVYAN-DDDQ 49

Query: 61  SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM 120
            R  IF  NL + Q LQ  + G+  YG+ +FSDL+  EF AKYL   +     +R  P  
Sbjct: 50  KRFAIFKDNLVRAQKLQLKDRGTARYGVTQFSDLTPEEFAAKYLSRPMNDQ-VERVRPTG 108

Query: 121 IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
           +     P   DWRE+ AV  V++Q  CGS WAFS  GN+EG +  KT +LVSLS+Q+L+D
Sbjct: 109 LK--AAPERMDWREWGAVGPVENQGSCGSCWAFSVAGNVEGQWFLKTGQLVSLSKQQLVD 166

Query: 181 CDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV 240
           CD  D GC GG  +NA+  IM    GGLE +  YPY G  + C LNK+    KI+  + +
Sbjct: 167 CDVMDYGCGGGWPTNAYMEIMRM--GGLELQSDYPYVGVQQQCYLNKEKLLAKIDDLIVL 224

Query: 241 SRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
              E + A YL E+GP++ A+NA  LQFY +G+SHP    C     +L+H+VL VGY   
Sbjct: 225 GAYEEEHAAYLAEHGPLSSALNAGYLQFYQSGISHPSYEECSPA--SLNHAVLTVGYD-- 280

Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               T   VPYWIIKNSWG GWGE GYFRLYRGDG+CGIN  + SA++
Sbjct: 281 ----TENGVPYWIIKNSWGTGWGENGYFRLYRGDGTCGINRMITSAII 324


>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
          Length = 364

 Score =  259 bits (663), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 142/331 (42%), Positives = 193/331 (58%), Gaps = 24/331 (7%)

Query: 25  VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
           V D  L+  HH      F+ F  +  KTYAT  E+  R  +F  NLR+ +L    +  S 
Sbjct: 39  VEDHLLNAEHH------FSNFKAKFGKTYATKEEHDHRFGVFKSNLRRARLHAQLDP-SA 91

Query: 85  VYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           V+G+ +FSDL+ AEFQ ++LG K     A+     ++P   LP+ FDWR+  AVT VKDQ
Sbjct: 92  VHGVTKFSDLTAAEFQRQFLGLKPLGLPANAQKAPILPTNNLPKDFDWRDKGAVTNVKDQ 151

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISN 195
             CGS W+FSTTG +EG +   T +LVSLSEQ+L+DCD           D GC GG ++N
Sbjct: 152 GACGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNN 211

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           AF+ I+    GG++ E+ YPY G D +C+ +K      +  Y  +S DE  +A  LV+NG
Sbjct: 212 AFEYILG--AGGVQREEDYPYAGRDSSCKFDKSKIAASVANYSVISLDEDQIAANLVKNG 269

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWII 314
           P+AV INA  +Q Y+ GVS P  + C    + L H V IVGYG         K  PYWII
Sbjct: 270 PLAVGINAVYMQTYIGGVSCP--YIC---AKRLDHGVQIVGYGESGYAPIRFKEKPYWII 324

Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           KNSWGE WGE GY+++ RG  +CG++  V +
Sbjct: 325 KNSWGESWGENGYYKICRGQNACGVDSMVST 355


>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
          Length = 365

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 145/333 (43%), Positives = 200/333 (60%), Gaps = 28/333 (8%)

Query: 25  VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
           V D  L+  HH      F+ F  +  KTYAT  E+  R  +F  N+R+ +L    +  S 
Sbjct: 40  VEDHLLNAEHH------FSTFKAKFGKTYATKEEHDHRFGVFKSNMRRARLHAQLD-PSA 92

Query: 85  VYGLNEFSDLSTAEFQAKYLGFK-LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
           V+G+ +FSDL+ AEF  K+LG K L+ P++A ++   ++P   LP+ FDWR+  AVT VK
Sbjct: 93  VHGVTKFSDLTPAEFHRKFLGLKPLRLPAHAQKA--PILPTNNLPKDFDWRDKGAVTNVK 150

Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
           DQ  CGS W+FSTTG +EG +   T +LVSLSEQ+L+DCD           D GC GG +
Sbjct: 151 DQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLM 210

Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
           +NAF+ ++    GG++ EK YPY G D  C+ +K      ++ Y  +S DE  +A  LV+
Sbjct: 211 NNAFEYLIGS--GGVQREKDYPYTGRDGTCKFDKSKIAASVSNYSVISLDEEQIAANLVK 268

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYW 312
           NGP+AVAINA  +Q YV GVS P  + C    ++L H VL+VGYG         K  PYW
Sbjct: 269 NGPLAVAINAVYMQTYVGGVSCP--YIC---GKHLDHGVLLVGYGEGAYAPIRFKEKPYW 323

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           IIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 324 IIKNSWGENWGENGYYKICRGRNVCGVDSMVST 356


>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
          Length = 370

 Score =  259 bits (662), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 146/331 (44%), Positives = 200/331 (60%), Gaps = 28/331 (8%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
           D  L+  HH      F  F  +  KTYAT  E+  R  +F  NLR+ +L    +  S V+
Sbjct: 47  DNLLNAEHH------FASFKAKFAKTYATKEEHDHRFGVFKSNLRRARLHAKLD-PSAVH 99

Query: 87  GLNEFSDLSTAEFQAKYLGFK-LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           G+ +FSDL+ AEF+ ++LG K L+ P++A ++   ++P   LP+ FDWR+  AVT VKDQ
Sbjct: 100 GVTKFSDLTPAEFRRQFLGLKPLRFPAHAQKA--PILPTKDLPKDFDWRDKGAVTNVKDQ 157

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISN 195
             CGS W+FSTTG +EG +   T +LVSLSEQ+L+DCD           D GC GG ++N
Sbjct: 158 GACGSCWSFSTTGALEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNN 217

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           AF+ I+    GG+++EK YPY G D  C+ +K      ++ Y  VS DE  +A  LV+NG
Sbjct: 218 AFEYILQS--GGVQKEKDYPYTGRDGTCKFDKTKVAATVSNYSVVSLDEEQIAANLVKNG 275

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWII 314
           P+AVAINA  +Q YV GVS P  + C    ++L H VL+VGYG         K  PYWII
Sbjct: 276 PLAVAINAVFMQTYVGGVSCP--YIC---GKHLDHGVLLVGYGEGAYAPIRFKNKPYWII 330

Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           KNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 331 KNSWGESWGENGYYKICRGRNVCGVDSMVST 361


>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
 gi|255639509|gb|ACU20049.1| unknown [Glycine max]
          Length = 366

 Score =  258 bits (659), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 143/334 (42%), Positives = 194/334 (58%), Gaps = 21/334 (6%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VV D + HHL + +H   F+ F  +  KTYAT  E+  R  IF  NL + +  Q  +  
Sbjct: 34  QVVPDAEDHHLLNAEHH--FSAFKTKFGKTYATQEEHDHRFRIFKNNLLRAKSHQKLD-P 90

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
           S V+G+  FSDL+ AEF+ ++LG K     +D     ++P   LP  FDWRE+ AVTGVK
Sbjct: 91  SAVHGVTRFSDLTPAEFRRQFLGLKPLRLPSDAQKAPILPTNDLPTDFDWREHGAVTGVK 150

Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
           +Q  CGS W+FS  G +EG +   T +LVSLSEQ+L+DCD E         D GC GG +
Sbjct: 151 NQGSCGSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDHECDPEERGACDSGCNGGLM 210

Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLV 252
           + AF+  +    GGL  EK YPY G D+  C+ +K      +  +  VS DE  +A  LV
Sbjct: 211 TTAFEYTLQ--AGGLMREKDYPYTGRDRGPCKFDKSKVAASVANFSVVSLDEEQIAANLV 268

Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPY 311
           +NGP+AV INA  +Q Y+ GVS P  + C    ++L H VL+VGYG         K  PY
Sbjct: 269 QNGPLAVGINAVFMQTYIGGVSCP--YIC---GKHLDHGVLLVGYGSGAYAPIRFKEKPY 323

Query: 312 WIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           WIIKNSWGE WGE+GY+++ RG   CG++  V +
Sbjct: 324 WIIKNSWGESWGEEGYYKICRGRNVCGVDSMVST 357


>gi|356576257|ref|XP_003556249.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase 15A-like
           [Glycine max]
          Length = 374

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 143/344 (41%), Positives = 201/344 (58%), Gaps = 32/344 (9%)

Query: 19  VSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD 78
           ++  + VGD +L     ++    F  F+E + ++Y+T  EY  RL IFS N+     L+ 
Sbjct: 36  IARKLKVGDNEL-----LRTEKKFKVFMENYGRSYSTREEYLRRLGIFSQNM-----LRA 85

Query: 79  TEH----GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWRE 134
            EH     + V+G+ +FSDL+  EF+  Y G     +    + P  +    LP  FDWRE
Sbjct: 86  AEHQALDPTAVHGVTQFSDLTEVEFEKLYTGXPSTNTAGGVAPPLEVEG--LPENFDWRE 143

Query: 135 YDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------D 185
             AVT VK Q  CGS WAFSTTG+IEG     T KLVSLSEQ+L+DCD +         D
Sbjct: 144 KGAVTEVKIQGRCGSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCEITEKTSCD 203

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
           +GC GG ++NA++ ++    GGLEEE +YPY G+   C+ + +   V+I  + ++  DE 
Sbjct: 204 NGCNGGLMTNAYNYLLES--GGLEEESSYPYTGERGECKFDPEKITVRITNFTNIPVDEN 261

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A YLV+NGP+A+ +NA  +Q Y+ GVS P+   C    + L+H VL+VGYG       
Sbjct: 262 QIAAYLVKNGPLAMGVNAIFMQTYIGGVSCPL--ICS--KKRLNHGVLLVGYGAKGFSIL 317

Query: 306 HKA-VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                PYWIIKNSWG+ WGE GY++L RG G CGIN  V +A+V
Sbjct: 318 RLGNKPYWIIKNSWGKKWGEDGYYKLCRGHGMCGINTMVSAAMV 361


>gi|351726954|ref|NP_001236888.1| cysteine proteinase precursor [Glycine max]
 gi|479060|emb|CAA83673.1| cysteine proteinase [Glycine max]
 gi|300507422|gb|ADK24076.1| cysteine proteinase [Glycine max]
 gi|300507425|gb|ADK24077.1| cysteine proteinase [Glycine max]
 gi|1096153|prf||2111244A Cys protease
          Length = 380

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 141/343 (41%), Positives = 202/343 (58%), Gaps = 25/343 (7%)

Query: 19  VSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD 78
           ++  + +GD +L     ++    F  F+E + ++Y+T  EY  RL IF+ N+ +    Q 
Sbjct: 36  IARKLKLGDNEL-----LRTEKKFKVFMENYGRSYSTEEEYLRRLGIFAQNMVRAAEHQA 90

Query: 79  TEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT---LPRAFDWREY 135
            +  + V+G+ +FSDL+  EF+  Y G       ++ +   + P +    LP  FDWRE 
Sbjct: 91  LDP-TAVHGVTQFSDLTEDEFEKLYTGVNGGFPSSNNAAGGIAPPLEVDGLPENFDWREK 149

Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DD 186
            AVT VK Q  CGS WAFSTTG+IEG     T KLVSLSEQ+L+DCD +         D+
Sbjct: 150 GAVTEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVSLSEQQLLDCDNKCDITEKTSCDN 209

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
           GC GG ++NA++ ++    GGLEEE +YPY G+   C+ + +   VKI  + ++  DE  
Sbjct: 210 GCNGGLMTNAYNYLLES--GGLEEESSYPYTGERGECKFDPEKIAVKITNFTNIPADENQ 267

Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
           +A YLV+NGP+A+ +NA  +Q Y+ GVS P+   C    + L+H VL+VGYG        
Sbjct: 268 IAAYLVKNGPLAMGVNAIFMQTYIGGVSCPL--ICS--KKRLNHGVLLVGYGAKGFSILR 323

Query: 307 KA-VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSWGE WGE GY++L RG G CGIN  V +A+V
Sbjct: 324 LGNKPYWIIKNSWGEKWGEDGYYKLCRGHGMCGINTMVSAAMV 366


>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
          Length = 472

 Score =  258 bits (658), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 142/325 (43%), Positives = 189/325 (58%), Gaps = 28/325 (8%)

Query: 37  KHTALFNYFLE---QHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
           K   L+N FL+   +  + Y+++ E   R   +  NL  ++ LQ  E G+ +YG+ +FSD
Sbjct: 162 KTEMLWNSFLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSD 221

Query: 94  LSTAEFQAKYLGFKLKPS-YADRSVPAMIP------NIT---LPRAFDWREYDAVTGVKD 143
           +S  EFQ   L     PS + DR V   +       N+T   LP  FDWR    VT VK+
Sbjct: 222 MSPEEFQKTML-----PSLWWDRVVSNGVEYDLKKFNLTFNNLPEQFDWRTKGVVTPVKN 276

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSK 203
           Q  CGS WAFS TGNIEG++A KT KL+SLSEQELIDCD+ D GC GG   NAF  I   
Sbjct: 277 QGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDCDRIDKGCNGGLPINAFREIQRM 336

Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
             GGLE E  YPY+  +  C L + A  V I+  V + R+ET M  ++V+ GP++V I+A
Sbjct: 337 --GGLEPEDQYPYKARNGTCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLSVGIDA 394

Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
             L +Y +G+ HP +  C      + H VLI GYGV+        +PYW IKNSWG+ WG
Sbjct: 395 KLLAYYKSGILHPSRSRCPPS--GIDHGVLITGYGVEN------GLPYWTIKNSWGDQWG 446

Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
           E GYFRL  G   CG++D V SA++
Sbjct: 447 EDGYFRLMLGKDVCGVSDLVSSAII 471


>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
          Length = 437

 Score =  257 bits (656), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 142/325 (43%), Positives = 189/325 (58%), Gaps = 28/325 (8%)

Query: 37  KHTALFNYFLE---QHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
           K   L+N FL+   +  + Y+++ E   R   +  NL  ++ LQ  E G+ +YG+ +FSD
Sbjct: 127 KTEMLWNSFLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSD 186

Query: 94  LSTAEFQAKYLGFKLKPS-YADRSVPAMIP------NIT---LPRAFDWREYDAVTGVKD 143
           +S  EFQ   L     PS + DR V   +       N+T   LP  FDWR    VT VK+
Sbjct: 187 MSPEEFQKTML-----PSLWWDRVVSNGVEYDLKKFNLTFNNLPEQFDWRTKGVVTPVKN 241

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSK 203
           Q  CGS WAFS TGNIEG++A KT KL+SLSEQELIDCD+ D GC GG   NAF  I   
Sbjct: 242 QGSCGSCWAFSVTGNIEGLWAIKTGKLISLSEQELIDCDRIDKGCNGGLPINAFREIQRM 301

Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
             GGLE E  YPY+  +  C L + A  V I+  V + R+ET M  ++V+ GP++V I+A
Sbjct: 302 --GGLEPEDQYPYKARNGTCHLIRSAIAVTIDDAVEIPRNETVMKAWIVQRGPLSVGIDA 359

Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
             L +Y +G+ HP +  C      + H VLI GYGV+        +PYW IKNSWG+ WG
Sbjct: 360 KLLAYYKSGILHPSRSRCPPS--GIDHGVLITGYGVEN------GLPYWTIKNSWGDQWG 411

Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
           E GYFRL  G   CG++D V SA++
Sbjct: 412 EDGYFRLMLGKDVCGVSDLVSSAII 436


>gi|260830531|ref|XP_002610214.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
 gi|229295578|gb|EEN66224.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
          Length = 274

 Score =  257 bits (656), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 135/289 (46%), Positives = 173/289 (59%), Gaps = 18/289 (6%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
           R  +F  NL+K + LQD+E G+  YG+ +F DL+  EF+  YL    K + A    PA I
Sbjct: 1   RYFVFQDNLKKAETLQDSERGTAKYGVTKFMDLTEEEFRRYYLTPVWK-APAKPLPPATI 59

Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
           P    P AFDWR++ AVT VKDQ  CGS WAFSTTGNIEG +A K   L  LSEQ     
Sbjct: 60  PKKDAPTAFDWRDHGAVTEVKDQGQCGSCWAFSTTGNIEGQWAIKKGNLPDLSEQHT--- 116

Query: 182 DQEDDGCEGGSISNAFDTIMSKLGG--GLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
                  E   I+         + G  GLE EK YPY   D+ C ++    QV IN  V+
Sbjct: 117 ----SKIESCHINPIVKRTKRSIDGKSGLESEKAYPYEAKDEQCHMDYSKVQVYINSSVN 172

Query: 240 VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
           +S+DE DMA +L ENGP+++ INA+ +QFY+ G+SHP + FC+   E L H VLIVGYG 
Sbjct: 173 ISKDENDMASWLAENGPISIGINAFPMQFYMGGISHPWRIFCN--PEELDHGVLIVGYG- 229

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                T    PYWIIKNSWG+ WGE+GY+ +YRG G CG+N    S++V
Sbjct: 230 -----TKDETPYWIIKNSWGKNWGEEGYYLVYRGGGVCGLNTMCTSSVV 273


>gi|335281454|ref|XP_003122543.2| PREDICTED: cathepsin F [Sus scrofa]
 gi|350579927|ref|XP_003480717.1| PREDICTED: cathepsin F-like [Sus scrofa]
          Length = 490

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 144/332 (43%), Positives = 202/332 (60%), Gaps = 11/332 (3%)

Query: 18  SVSSFM-VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
           + SSF+ ++  + L     VK  ++F  F+  +N+TY T  E   R+ +F+ N+ + Q +
Sbjct: 168 TFSSFLPLLNKDPLPQDFSVKMASIFKEFVTTYNRTYDTKEEARWRMSVFANNMVRAQKI 227

Query: 77  QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
           Q  + G+  YG+ +FSDL+  EF+  YL   L+     +   A   +   P  +DWR+  
Sbjct: 228 QALDTGTARYGVTKFSDLTEEEFRTIYLNPLLQEEPGRKMRLAKSVSSLPPPEWDWRKKG 287

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
           AVT VKDQ MCGS WAFS TGN+EG +  K   L+SLSEQEL+DCD+ D GC GG  SNA
Sbjct: 288 AVTKVKDQGMCGSCWAFSVTGNVEGQWFLKQGTLLSLSEQELLDCDKVDKGCMGGLPSNA 347

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
           +  I  K  GGLE E+ Y YRG  + C  N +  +V IN  V +S++E  +A +L E GP
Sbjct: 348 YSAI--KTLGGLETEEDYSYRGHLQTCSFNAEKAKVYINDSVELSQNEQKLAAWLAEKGP 405

Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
           ++VAINA+ +QFY  G+SHP++  C      + H+VL+VGYG         A P+W IKN
Sbjct: 406 ISVAINAFGMQFYRHGISHPLRPLCSPW--LIDHAVLLVGYG------NRSATPFWAIKN 457

Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           SWG  WGE+GY+ LYRG G+CG+N    SA+V
Sbjct: 458 SWGTDWGEEGYYYLYRGSGACGVNIMASSAVV 489


>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 365

 Score =  256 bits (655), Expect = 9e-66,   Method: Compositional matrix adjust.
 Identities = 144/333 (43%), Positives = 199/333 (59%), Gaps = 28/333 (8%)

Query: 25  VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
           V D  L+  HH      F+ F  +  KTYAT  E+  R  +F  N+R+ +L    +  S 
Sbjct: 40  VEDHLLNAEHH------FSTFKSKFGKTYATKEEHDHRFGVFKSNMRRARLHAQLDP-SA 92

Query: 85  VYGLNEFSDLSTAEFQAKYLGFK-LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
           V+G+ +FSDL+ AEF  K+LG K L+ P++A ++   ++P   LP+ FDWR+  AVT VK
Sbjct: 93  VHGVTKFSDLTPAEFHRKFLGLKPLRLPAHAQKA--PILPTNNLPKDFDWRDKGAVTNVK 150

Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
           DQ  CGS W+FSTTG +EG +   T +LVSLSEQ+L+DCD           D GC GG +
Sbjct: 151 DQGSCGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLM 210

Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
           +NAF+ ++    GG++ EK YPY G D  C+ +K      ++ Y  +S DE  +A  LV+
Sbjct: 211 NNAFEYLIGS--GGVQREKDYPYTGRDGTCKFDKSKIAASVSNYSVISLDEEQIAANLVK 268

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYW 312
           NGP+AVAINA  +Q YV GVS P  + C    ++L H VL+VGYG         K  PYW
Sbjct: 269 NGPLAVAINAVYMQTYVGGVSCP--YIC---GKHLDHGVLLVGYGEGAYAPIRFKEKPYW 323

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           IIKNSWGE WG  GY+++ RG   CG++  V +
Sbjct: 324 IIKNSWGENWGGNGYYKICRGRNVCGVDSMVST 356


>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
          Length = 364

 Score =  256 bits (653), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 135/312 (43%), Positives = 181/312 (58%), Gaps = 17/312 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+E+H+K Y    E   R  IF  NL  I+  Q+ + G+ +YG+N+F+DLS  EF+ 
Sbjct: 64  FTSFIERHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLSPEEFKK 123

Query: 102 KYLGFKLK-PSYADRSV----PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
            +L    K P + +R V      + P   LP +FDWRE+ AVT VK +  C + WAFS T
Sbjct: 124 THLPHTWKQPDHPNRIVDLAAEGVDPKEPLPESFDWREHGAVTKVKTEGHCAACWAFSVT 183

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIEG +    KKLVSLS Q+L+DCD  D+GC GG   +A+  I+    GGLE E  YPY
Sbjct: 184 GNIEGQWFLAKKKLVSLSAQQLLDCDVVDEGCNGGFPLDAYKEIVRM--GGLEPEDKYPY 241

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
               + CRL      V ING V +  DE  M  +LV+ GP+++ I    +QFY  GVS P
Sbjct: 242 EAKAEQCRLVPSDIAVYINGSVELPHDEEKMRAWLVKKGPISIGITVDDIQFYKGGVSRP 301

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
                     ++ H  L+VGYGV+      K +PYWIIKNSWG  WGE GY+R+ RG+ +
Sbjct: 302 TTCRLS----SMIHGALLVGYGVE------KNIPYWIIKNSWGPNWGEDGYYRMVRGENA 351

Query: 337 CGINDYVRSALV 348
           C IN +  SA+V
Sbjct: 352 CRINRFPTSAVV 363


>gi|115495381|ref|NP_001068884.1| cathepsin F precursor [Bos taurus]
 gi|111304901|gb|AAI20004.1| Cathepsin F [Bos taurus]
 gi|296471599|tpg|DAA13714.1| TPA: cathepsin F [Bos taurus]
          Length = 460

 Score =  255 bits (652), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 139/332 (41%), Positives = 201/332 (60%), Gaps = 11/332 (3%)

Query: 18  SVSSFM-VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
           + SSF+ ++  + L     VK  ++F  F+  +N+TY +  E   R+ +F+ N+ + Q +
Sbjct: 138 TFSSFLPLLNKDPLPQDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKI 197

Query: 77  QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
           Q  + G+  YG+ +FSDL+  EF+  YL   LK +      PA       P  +DWR   
Sbjct: 198 QALDRGTARYGVTKFSDLTEEEFRTIYLNPLLKDAPGRNMRPAQPVTDVPPPQWDWRNKG 257

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
           AVT VKDQ MCGS WAFS TGN+EG +  K   L+SLSEQEL+DCD+ D  C GG  SNA
Sbjct: 258 AVTNVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNA 317

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
           +  I  +  GGLE E  Y YRG  + C  + +  +V IN  V +S++E  +A +L +NGP
Sbjct: 318 YSAI--RTLGGLETEDDYSYRGRLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKNGP 375

Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
           +++AINA+ +QFY  G+SHP++  C      + H+VL+VGYG         A+P+W IKN
Sbjct: 376 VSIAINAFGMQFYRHGISHPLRPLCSPW--LIDHAVLLVGYG------NRSAIPFWAIKN 427

Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           SWG  WGE+GY+ L+RG G+CG+N    SA++
Sbjct: 428 SWGTDWGEEGYYYLHRGSGACGVNIMASSAVI 459


>gi|7381219|gb|AAF61440.1|AF138264_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
          Length = 368

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 143/337 (42%), Positives = 198/337 (58%), Gaps = 29/337 (8%)

Query: 24  VVGD---EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
           VVGD   + L+  HH      F  F  +  K YA+  E+  RL +F  N+R+ +  Q+ +
Sbjct: 36  VVGDGDGDLLNADHH------FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELD 89

Query: 81  HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVT 139
             + V+G+ +FSDL+  EF+ K+LG   +  + AD     ++P   LP  FDWR++ AVT
Sbjct: 90  PAA-VHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVT 148

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
            VK+Q  CGS W+FSTTG +EG     T KLVSLSEQ+L+DCD E         D GC G
Sbjct: 149 PVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNG 208

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAK 249
           G +++AF+  +    GGL  E+ YPY G+D + CR +K     K+  +  VS DE  +A 
Sbjct: 209 GLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAA 266

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKA 308
            LV+NGP+AVAINA  +Q Y+ GVS P  + C   ++ L H VL+VGYG         K 
Sbjct: 267 NLVKNGPLAVAINAVFMQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKE 321

Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 322 KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 358


>gi|7211741|gb|AAF40414.1|AF216783_1 papain-like cysteine proteinase isoform I [Ipomoea batatas]
          Length = 368

 Score =  255 bits (651), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 143/337 (42%), Positives = 198/337 (58%), Gaps = 29/337 (8%)

Query: 24  VVGD---EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
           VVGD   + L+  HH      F  F  +  K YA+  E+  RL +F  N+R+ +  Q+ +
Sbjct: 36  VVGDGDGDLLNADHH------FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELD 89

Query: 81  HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVT 139
             + V+G+ +FSDL+  EF+ K+LG   +  + AD     ++P   LP  FDWR++ AVT
Sbjct: 90  PAA-VHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVT 148

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
            VK+Q  CGS W+FSTTG +EG     T KLVSLSEQ+L+DCD E         D GC G
Sbjct: 149 PVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNG 208

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAK 249
           G +++AF+  +    GGL  E+ YPY G+D + CR +K     K+  +  VS DE  +A 
Sbjct: 209 GLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAA 266

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKA 308
            LV+NGP+AVAINA  +Q Y+ GVS P  + C   ++ L H VL+VGYG         K 
Sbjct: 267 NLVKNGPLAVAINAVFMQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKE 321

Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 322 KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 358


>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
          Length = 473

 Score =  255 bits (651), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 138/314 (43%), Positives = 189/314 (60%), Gaps = 11/314 (3%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           V+    F  F+ ++ K Y++  E   RL IF  NL+  + LQ  + GS  YG+ +FSDL+
Sbjct: 169 VQLLGQFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQGSAEYGVTKFSDLT 228

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPR-AFDWREYDAVTGVKDQTMCGSSWAFS 154
             EF++ YL   L      R +    P  T    ++DWR++ AV+ VK+Q MCGS WAFS
Sbjct: 229 EEEFRSTYLNPLLSQWTLHRGMKPAPPAKTPAPDSWDWRDHGAVSPVKNQGMCGSCWAFS 288

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
            TGNIEG +  K   L+SLSEQEL+DCD  D  C GG  SNA++ I  KL GGLE E  Y
Sbjct: 289 VTGNIEGQWFLKNGTLLSLSEQELVDCDGLDQACRGGLPSNAYEAI-EKL-GGLESETDY 346

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
            Y G  + C    +     IN  V + +DE ++A +L ENGP++VA+NA+A+QFY  GVS
Sbjct: 347 SYTGHKQKCDFTNRKVAAYINSSVELPKDEREIAAWLAENGPISVALNAFAMQFYKKGVS 406

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
           HP + FC+     + H+VL+VGYG          +P+W IKNSWGE +GE+GY+ L RG 
Sbjct: 407 HPWKIFCNPW--MIDHAVLLVGYG------ERNGIPFWAIKNSWGEDYGEQGYYYLQRGS 458

Query: 335 GSCGINDYVRSALV 348
            +CGIN    SA++
Sbjct: 459 NACGINRMGSSAVI 472


>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
          Length = 360

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 142/330 (43%), Positives = 197/330 (59%), Gaps = 24/330 (7%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           HH+ + +H   F  F  +  K+YAT  E+  R  +F  NLR+ +L    +  S  +G+ +
Sbjct: 35  HHMLNAEHH--FTTFKTKFGKSYATQEEHDYRFGVFRANLRRAKLHAKLDP-SAEHGVTK 91

Query: 91  FSDLSTAEFQAKYLGFK-LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
           FSDL+  EF+ +YLG K L+ PS A+++   ++P   LP  FDWR+  AVT VK+Q  CG
Sbjct: 92  FSDLTPEEFKRQYLGLKPLRLPSTANKA--PILPTSDLPENFDWRDKGAVTPVKNQGSCG 149

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDT 199
           S WAFSTTG +EG +   T +LVSLSEQ+L+DCD           D GC GG ++NAFD 
Sbjct: 150 SCWAFSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAGCNGGLMNNAFDY 209

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I+    GG++ EK YPY G D+ C+ +K      +  +  VS DE  +A  LV++GP+AV
Sbjct: 210 ILQ--AGGVQTEKDYPYSGRDETCKFDKSKVAATVANFSVVSLDEDQIAANLVKHGPLAV 267

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSW 318
            INA  +Q Y+ GVS P  + C    +NL H VL+VGYG         K  P+WIIKNSW
Sbjct: 268 GINAIFMQTYIGGVSCP--YIC---GKNLDHGVLLVGYGAAGYAPIRFKDKPFWIIKNSW 322

Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           GE WGE GY+++ RG   CG++  V S + 
Sbjct: 323 GESWGEDGYYKICRGKNVCGVDSMVSSVVA 352


>gi|7381221|gb|AAF61441.1|AF138265_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
          Length = 366

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 143/336 (42%), Positives = 198/336 (58%), Gaps = 28/336 (8%)

Query: 24  VVGD--EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
           VVGD  + L+  HH      F  F  +  K YA+  E+  RL +F  N+R+ +  Q+ + 
Sbjct: 35  VVGDGGDLLNADHH------FTVFKRRFGKVYASDEEHDYRLSVFKANMRRAKQHQELDP 88

Query: 82  GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVTG 140
            + V+G+ +FSDL+  EF+ K+LG   +  + AD     ++P   LP  FDWR++ AVT 
Sbjct: 89  AA-VHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVTP 147

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
           VK+Q  CGS W+FSTTG +EG     T KLVSLSEQ+L+DCD E         D GC GG
Sbjct: 148 VKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGG 207

Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAKY 250
            +++AF+  +    GGL  E+ YPY G+D + CR +K     K+  +  VS DE  +A  
Sbjct: 208 LMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAAN 265

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAV 309
           LV+NGP+AVAINA  +Q Y+ GVS P  + C   ++ L H VL+VGYG         K  
Sbjct: 266 LVKNGPLAVAINAVFVQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKEK 320

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 321 PYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 356


>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
          Length = 394

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 140/318 (44%), Positives = 186/318 (58%), Gaps = 19/318 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A F +F+++ NK Y+   E+  R  IF  NL K    Q  +    ++G+N+FSDL+  EF
Sbjct: 73  AHFAHFVKKFNKEYSGAEEHARRFSIFKKNLHKALRHQKLDR-DAIHGINKFSDLTEEEF 131

Query: 100 QAKYLGFKLKP-SYADRSVPA-MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
             +YLG    P S + R+ PA ++P   LP  FDWRE  AVT VK+Q  CGS W FSTTG
Sbjct: 132 HEQYLGLTTPPRSLSQRTQPAPILPTDDLPPDFDWRELGAVTPVKNQGACGSCWTFSTTG 191

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGL 208
            +EG    KT KL+SLSEQ+L+DCD E         D GC GG ++ A+   +    GGL
Sbjct: 192 AMEGANFMKTGKLISLSEQQLVDCDHECDSSEPDVCDSGCNGGLMTTAYQYALK--AGGL 249

Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           + E+ YPY G D +C+ +       +  + +VS DE  +A  LV+NGP+AV INA  +Q 
Sbjct: 250 QREEDYPYTGIDGSCKFDNTKVAAMVANFSTVSIDEDQIAANLVKNGPLAVGINAAFMQT 309

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
           YV GVS P  + C+   +NL H VL+VGYG         K  P+WIIKNSWG  WGE GY
Sbjct: 310 YVGGVSCP--YVCN--KQNLDHGVLLVGYGAAGYAPGRLKNKPFWIIKNSWGPDWGEDGY 365

Query: 328 FRLYRGDGSCGINDYVRS 345
           ++L RG   CGIN  V +
Sbjct: 366 YKLCRGHNVCGINTMVST 383


>gi|302754322|ref|XP_002960585.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
 gi|300171524|gb|EFJ38124.1| hypothetical protein SELMODRAFT_266583 [Selaginella moellendorffii]
          Length = 330

 Score =  254 bits (650), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 136/317 (42%), Positives = 187/317 (58%), Gaps = 22/317 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+ +  K YAT   Y  RL +F  NL +    Q  +  S V+G+ +FSDL+  EF+ 
Sbjct: 21  FKSFIARFGKAYATAEAYAHRLKVFEANLVRAVSHQALDP-SAVHGITQFSDLTEEEFKQ 79

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           ++LG ++     + +   ++P   LP  FDWRE+ AVT VK+Q  CGS WAFSTTG IEG
Sbjct: 80  QFLGLRVPSRLREANKAPVLPTNDLPEDFDWREHGAVTEVKNQGACGSCWAFSTTGAIEG 139

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +  +T KL+SLSEQ+L+DCD           D GC GG ++NA+D +M    GGLE E 
Sbjct: 140 AHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMTNAYDYVMKS--GGLETET 197

Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G+    C+ N       +  + +VS DE  +A  LV++GP+A+ INA  +Q Y+ 
Sbjct: 198 DYPYTGNSNGKCQFNANKIVASVANFSTVSLDEDQIAANLVKHGPLAIGINAVFMQTYIG 257

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           GVS PI   C     ++ H VL+VGYG       +FT K  PYWIIKNSWG  WGE+GY+
Sbjct: 258 GVSCPI--ICS--KHHIDHGVLLVGYGAKGYAPIRFTEK--PYWIIKNSWGATWGEQGYY 311

Query: 329 RLYRGDGSCGINDYVRS 345
           ++ RG G CG+N  V +
Sbjct: 312 KICRGHGMCGMNTMVST 328


>gi|354496134|ref|XP_003510182.1| PREDICTED: cathepsin F [Cricetulus griseus]
 gi|344250261|gb|EGW06365.1| Cathepsin F [Cricetulus griseus]
          Length = 462

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 143/332 (43%), Positives = 200/332 (60%), Gaps = 11/332 (3%)

Query: 18  SVSSFM-VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
           + SSF+ ++ +E L     VK T +F  F+  +N+TY +  E   RL +F+ N+ K Q +
Sbjct: 140 TFSSFLPLLNEEPLPQDFSVKMTTVFKDFMITYNRTYESREETQWRLTVFTRNMVKAQKI 199

Query: 77  QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
           +  + G+  YG+ +FSDL+  EF   YL   L+     +   A   N   P  +DWR+  
Sbjct: 200 EALDRGTAQYGITKFSDLTEEEFYTIYLNPLLQKKPGSKMSLAKSINDPAPPEWDWRKKG 259

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
           AVT VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA
Sbjct: 260 AVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACLGGMPSNA 319

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
           +  I S   GGLE E  Y Y+G  +AC  + +  +V IN  V +S++E+ MA +L + GP
Sbjct: 320 YTAIKSL--GGLETEDDYSYKGYVQACNFSAQKAKVYINDSVELSKNESKMAAWLAQKGP 377

Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
           ++VAINA+ +QFY  G++HP++  C      + H+VL+VGYG           PYW IKN
Sbjct: 378 ISVAINAFGMQFYRHGIAHPLRPLCSPW--LIDHAVLLVGYG------NRSNTPYWAIKN 429

Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           SWG  WGE+GY+ LYRG G+CG+N    SA+V
Sbjct: 430 SWGSNWGEEGYYYLYRGSGACGVNTMASSAVV 461


>gi|302771610|ref|XP_002969223.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
 gi|300162699|gb|EFJ29311.1| hypothetical protein SELMODRAFT_91274 [Selaginella moellendorffii]
          Length = 367

 Score =  254 bits (649), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 136/317 (42%), Positives = 187/317 (58%), Gaps = 22/317 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+ +  K YAT   Y  RL +F  NL +    Q  +  S V+G+ +FSDL+  EF+ 
Sbjct: 58  FKSFIARFGKAYATAEAYAHRLKVFEANLVRAVSHQALDP-SAVHGITQFSDLTEEEFKQ 116

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           ++LG ++     + +   ++P   LP  FDWRE+ AVT VK+Q  CGS WAFSTTG IEG
Sbjct: 117 QFLGLRVPSRLREANKAPVLPTNDLPEDFDWREHGAVTEVKNQGACGSCWAFSTTGAIEG 176

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +  +T KL+SLSEQ+L+DCD           D GC GG ++NA+D +M    GGLE E 
Sbjct: 177 AHFLETGKLISLSEQQLVDCDHSCDPTDKVSCDAGCNGGLMTNAYDYVMKS--GGLETET 234

Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G+    C+ N       +  + +VS DE  +A  LV++GP+A+ INA  +Q Y+ 
Sbjct: 235 DYPYTGNSNGKCQFNANKIVASVANFSTVSLDEDQIAANLVKHGPLAIGINAVFMQTYIG 294

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           GVS PI   C     ++ H VL+VGYG       +FT K  PYWIIKNSWG  WGE+GY+
Sbjct: 295 GVSCPI--ICS--KHHIDHGVLLVGYGAKGYAPIRFTEK--PYWIIKNSWGATWGEQGYY 348

Query: 329 RLYRGDGSCGINDYVRS 345
           ++ RG G CG+N  V +
Sbjct: 349 KICRGHGMCGMNTMVST 365


>gi|225458119|ref|XP_002279862.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
 gi|302142581|emb|CBI19784.3| unnamed protein product [Vitis vinifera]
          Length = 368

 Score =  254 bits (648), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 140/318 (44%), Positives = 186/318 (58%), Gaps = 19/318 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  +  KTYAT  E+  R ++F  NLR+ +  Q  +  S V+G+ +FSDL+ AEF+ 
Sbjct: 52  FEKFKARFQKTYATPEEHDYRFNVFKANLRRAKRHQLLDP-SAVHGVTQFSDLTPAEFRR 110

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
            YLG       AD     ++P   LP  FDWRE  AVT VK+Q  CGS W+FST G +EG
Sbjct: 111 DYLGLNPLRFPADAQQAPILPTDNLPTDFDWRENGAVTPVKNQGNCGSCWSFSTIGALEG 170

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +   T  L SLSEQ+L+DCD+E         DDGC GG ++NAF+ I+    GG+E EK
Sbjct: 171 AHFLATGNLESLSEQQLVDCDRECDPEEYDACDDGCNGGLMNNAFEYILKT--GGVEREK 228

Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D++ C+ N+      ++ +  VS DE  +A  LV+NGP+AV INA  +Q Y  
Sbjct: 229 DYPYTGRDRSPCKFNESKIVASVSNFSVVSIDEDQIAANLVKNGPLAVGINAVFMQTYTA 288

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P  F C G    L H VL+VGYG    +    K  PYWI+KNSW + WGE GY+R+
Sbjct: 289 GVSCP--FLCSG---ELDHGVLLVGYGSAGYSPIRFKEKPYWILKNSWSKYWGEHGYYRI 343

Query: 331 YRGDGSCGINDYVRSALV 348
            RG   CG++  V S + 
Sbjct: 344 CRGQNMCGVDSMVSSVVA 361


>gi|124484383|dbj|BAF46302.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 369

 Score =  254 bits (648), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 141/336 (41%), Positives = 193/336 (57%), Gaps = 30/336 (8%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
           D  L+  HH      F  F  ++ K+YAT  E+  RL +F  NLR+ +  Q  +  S V+
Sbjct: 38  DALLNADHH------FTLFKSKYGKSYATQEEHDYRLSVFKANLRRAKRHQLLDP-SAVH 90

Query: 87  GLNEFSDLSTAEFQAKYLGFKLKPS-------YADRSVPAMIPNITLPRAFDWREYDAVT 139
           G+ +FSDL+  EF+  +LG +   S        AD     ++P   LP  FDWR+Y AVT
Sbjct: 91  GVTKFSDLTPKEFRRTFLGIRKSSSGKRKLKLPADAHAAEILPTSDLPSDFDWRDYGAVT 150

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
           GVKDQ  CGS W+FSTTG +EG     T +LVSLSEQ+L+DCD           D GC G
Sbjct: 151 GVKDQGSCGSCWSFSTTGALEGANFLATGELVSLSEQQLVDCDHLCDPEEAGACDSGCNG 210

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
           G ++ A++ ++    GGLE+EK YPY G D  C+ +K      +  +  VS DE  +A  
Sbjct: 211 GLMTTAYEYVLQS--GGLEKEKDYPYTGKDGTCKFDKSKIAAAVANFSVVSLDEDQIAAN 268

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAV 309
           LV++GP++V INA  +Q Y+ GVS P  + C     NL H VL+VGYG         K  
Sbjct: 269 LVKHGPLSVGINAVFMQTYIGGVSCP--YIC--SKRNLDHGVLLVGYGAAGYAPIRFKDK 324

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           PYWI+KNSWGE WGE+GY+++ RG+  CGI+  V +
Sbjct: 325 PYWIVKNSWGENWGEEGYYKICRGNNICGIDSMVST 360


>gi|359492709|ref|XP_002280798.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
 gi|147841854|emb|CAN73591.1| hypothetical protein VITISV_022889 [Vitis vinifera]
 gi|302142582|emb|CBI19785.3| unnamed protein product [Vitis vinifera]
          Length = 371

 Score =  253 bits (647), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 138/315 (43%), Positives = 185/315 (58%), Gaps = 19/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  +  KTYAT  E+  R ++F  NLR+ +  Q  +  S  +G+ +FSDL+  EF+ 
Sbjct: 56  FAEFKTKFGKTYATAEEHDHRFNVFKANLRRAKRHQLLDP-SAEHGVTQFSDLTPREFRQ 114

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
            YLG K     AD     ++P   LP  FDWR++ AVT VKDQ  CGS W+FST G +EG
Sbjct: 115 NYLGLKRLQLPADAQKAPILPTKDLPTDFDWRDHGAVTAVKDQGYCGSCWSFSTIGALEG 174

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +   T  LVSLS Q+L+DCD E         DDGC GG ++NAF+ I+    GG+ +E+
Sbjct: 175 AHFLATGNLVSLSTQQLLDCDTECDPEEYDACDDGCNGGLMNNAFEYILK--AGGVAQEE 232

Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D+  CR NK      +  +  VS DE  +A  LV+NGP+AV INA  +Q Y +
Sbjct: 233 DYPYTGTDRGLCRFNKTKIAASVANFSVVSLDEDQIAANLVKNGPLAVGINAVFMQTYKS 292

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P  + C   +  L H VL+VGYG    +    K  PYWIIKNSWGE WGE+GY+++
Sbjct: 293 GVSCP--YIC---SSTLDHGVLLVGYGSAGYSPIRFKEKPYWIIKNSWGESWGEQGYYKI 347

Query: 331 YRGDGSCGINDYVRS 345
            RG   CG++  V +
Sbjct: 348 CRGHNICGVDSMVST 362


>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
 gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
          Length = 367

 Score =  253 bits (647), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 138/331 (41%), Positives = 192/331 (58%), Gaps = 25/331 (7%)

Query: 26  GDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
           GD+ L   H       F  F  +  KTY+T+ E+  R  +F  NLR+ +  Q  +  S V
Sbjct: 42  GDDLLSAEHQ------FGLFKAKFGKTYSTVEEHDYRFSVFEANLRRARRHQLLDP-SAV 94

Query: 86  YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQT 145
           +G+  FSDL+  EF+  YLG K     AD     ++P   LP  FDWR++ AVT VKDQ 
Sbjct: 95  HGVTRFSDLTPDEFRRDYLGLKPLRLPADAQKAPILPTNDLPTDFDWRDHGAVTPVKDQG 154

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNA 196
            CGS W+FS  G +EG +   T  L+S+SEQ+L+DCD E         D GC GG +++A
Sbjct: 155 SCGSCWSFSAIGALEGAHFLTTGNLISMSEQQLVDCDHECDPEEYGACDQGCNGGLMTSA 214

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           F+ I+    GG+E E+TYPY G D+ +C+ NK      ++ +  VS DE  +A  +V+NG
Sbjct: 215 FEYILK--AGGVEREETYPYIGSDRGSCKFNKSQIVASVSNFSVVSLDEDQIAANMVKNG 272

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWII 314
           P+AV INA  +Q Y+ GVS P  + C   + NL H V++VGYG         K  PYWII
Sbjct: 273 PLAVGINAVFMQTYMKGVSCP--YIC---SRNLDHGVVLVGYGSAGYAPIRFKEKPYWII 327

Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           KNSWGE WGE GY+++ RG  +CG++  V +
Sbjct: 328 KNSWGESWGEDGYYKICRGHNACGVDSMVST 358


>gi|2511695|emb|CAB17077.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 377

 Score =  253 bits (647), Expect = 8e-65,   Method: Compositional matrix adjust.
 Identities = 148/368 (40%), Positives = 210/368 (57%), Gaps = 32/368 (8%)

Query: 1   MSCFYFFAGVALLSLTVSVSSFMVVGD----EKLHHLHHVKHTALFNYFLEQHNKTYATL 56
           ++CF   + + L +LT+S +    V D     KL     ++    FN F+E + K Y+T 
Sbjct: 9   LTCFARIS-LVLFALTLSSARQTTVHDIAKKLKLQDNQLLRTEKKFNVFMENYGKKYSTR 67

Query: 57  VEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSY 112
            EY  RL IF+GN+ +    Q  +  + ++G+ +FSDL+  EFQ  Y G    F      
Sbjct: 68  EEYLQRLEIFAGNMLRAPENQALDP-TAIHGVTQFSDLTEDEFQRHYTGVNGGFPWNNGV 126

Query: 113 ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
            D + P  +    LP  FDWRE  AVT VK Q  CGS WAFSTTG+IEG     T KL++
Sbjct: 127 RDVAPPLKVDG--LPEDFDWREKGAVTEVKMQGKCGSCWAFSTTGSIEGANFIATGKLLN 184

Query: 173 LSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC 223
           LSEQ+L+DCD +         D+GC GG ++NA+  ++    GGLEEE +YPY G    C
Sbjct: 185 LSEQQLVDCDSQCDITESTTCDNGCMGGLMTNAYKYLLQS--GGLEEESSYPYTGAKGEC 242

Query: 224 RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDG 283
           + +     V+I  + ++  DE  +A YLV++GP+AV +NA  +Q Y+ GVS P+   C  
Sbjct: 243 KFDPGKVAVRITNFTNIPVDENQIAAYLVKHGPLAVGLNAIFMQTYIGGVSCPL--ICS- 299

Query: 284 GNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
             + L+H VL+VGY   G    +  +K  PYWIIKNSWG+ WG  GY++L RG G CG+N
Sbjct: 300 -KKWLNHGVLLVGYRAKGFSILRLGNK--PYWIIKNSWGKRWGVDGYYKLCRGHGMCGMN 356

Query: 341 DYVRSALV 348
             V +A+V
Sbjct: 357 TMVSTAMV 364


>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
           Full=Turgor-responsive protein 15A; Flags: Precursor
 gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
          Length = 363

 Score =  253 bits (646), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 202/336 (60%), Gaps = 25/336 (7%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VV +E+ H L+   H   F  F  + +K+YAT  E+  R  +F  NL K +L Q+ +  
Sbjct: 32  QVVDNEEDHLLNAEHH---FTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAKLHQNRD-P 87

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           +  +G+ +FSDL+ +EF+ ++LG K +   P++A ++   ++P   LP  FDWRE  AVT
Sbjct: 88  TAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKA--PILPTTNLPEDFDWREKGAVT 145

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
            VKDQ  CGS WAFSTTG +EG +   T KLVSLSEQ+L+DCD           D GC G
Sbjct: 146 PVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNG 205

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
           G ++NAF+ ++    GG+ +EK Y Y G D +C+ +K      ++ +  V+ DE  +A  
Sbjct: 206 GLMNNAFEYLLE--SGGVVQEKDYAYTGRDGSCKFDKSKVVASVSNFSVVTLDEDQIAAN 263

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
           LV+NGP+AVAINA  +Q Y++GVS P  + C      L H VL+VG+G         K  
Sbjct: 264 LVKNGPLAVAINAAWMQTYMSGVSCP--YVC--AKSRLDHGVLLVGFGKGAYAPIRLKEK 319

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           PYWIIKNSWG+ WGE+GY+++ RG   CG++  V +
Sbjct: 320 PYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVST 355


>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
          Length = 366

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 140/333 (42%), Positives = 193/333 (57%), Gaps = 21/333 (6%)

Query: 24  VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS 83
           VV D + HHL + +H   F+ F  +  KTYAT  E+  R  IF  NL + +  Q  +  S
Sbjct: 35  VVPDAEDHHLLNAEHH--FSAFKTKFAKTYATQEEHDHRFRIFKNNLLRAKSHQKLD-PS 91

Query: 84  GVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
            V+G+  FSDL+ +EF+ ++LG K     +D     ++P   LP  FDWR++ AVTGVK+
Sbjct: 92  AVHGVTRFSDLTPSEFRGQFLGLKPLRLPSDAQKAPILPTSDLPTDFDWRDHGAVTGVKN 151

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
           Q  CGS W+FS  G +EG +   T  LVSLSEQ+L+DCD E         D GC GG ++
Sbjct: 152 QGSCGSCWSFSAVGALEGAHFLSTGGLVSLSEQQLVDCDHECDPEERGACDSGCNGGLMT 211

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
            AF+  +    GGL  E+ YPY G D+  C+ +K      +  +  VS DE  +A  LV+
Sbjct: 212 TAFEYTLK--AGGLMREEDYPYTGRDRGPCKFDKSKIAASVANFSVVSLDEEQIAANLVK 269

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYW 312
           NGP+AV INA  +Q Y+ GVS P  + C    ++L H VL+VGYG         K  PYW
Sbjct: 270 NGPLAVGINAVFMQTYIGGVSCP--YIC---GKHLDHGVLLVGYGSGAYAPIRFKEKPYW 324

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           IIKNSWGE WGE+GY+++ RG   CG++  V +
Sbjct: 325 IIKNSWGESWGEEGYYKICRGRNVCGVDSMVST 357


>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
          Length = 322

 Score =  253 bits (645), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 138/308 (44%), Positives = 182/308 (59%), Gaps = 13/308 (4%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           L+  F   + K YA   +   R  IF  NL + Q LQ  + G+  YG+ +FSDL+  EF 
Sbjct: 26  LYEQFKRDYGKVYAN-EDDQKRFAIFKDNLVRAQKLQLRDQGTARYGVTQFSDLTPEEFA 84

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           AKYL   L     +R  P  +     P   DWR   AVT V++Q  CGS WAFST GN+E
Sbjct: 85  AKYLSPPLNSDQVERVQPTGLK--AAPERMDWRAKGAVTPVENQGECGSCWAFSTAGNVE 142

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G +  KT +LVSLS+Q+L+DCD   +GC GG  S+++  IM    GGLE E  YPY G +
Sbjct: 143 GQWFIKTGQLVSLSKQQLVDCDMAAEGCNGGWPSSSYLEIMDM--GGLESENDYPYVGVE 200

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           + C LNK+    KI+  V +   E +   YL E+GP++  +NA ALQ Y +G+ HP    
Sbjct: 201 QTCALNKEKLVAKIDDAVVLGASENEHVDYLAEHGPLSTLLNAVALQHYQSGILHPSHKD 260

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           C   +++L+H+VL VGY  DR       +PYWIIKNSWG  WGEKGYFRL+RGD  CGIN
Sbjct: 261 CP--DDDLNHAVLTVGY--DR----EGDMPYWIIKNSWGTDWGEKGYFRLFRGDCVCGIN 312

Query: 341 DYVRSALV 348
               SA++
Sbjct: 313 RMATSAVI 320


>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 133/316 (42%), Positives = 184/316 (58%), Gaps = 21/316 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+++  K Y T+ EY  R  +F  NL +  L       +  +G+  FSDL+  EF  
Sbjct: 56  FESFIKEFGKVYHTVEEYEHRFKVFKSNLLRA-LKHQALDPTASHGVTMFSDLTEEEFAT 114

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           +YLG K   + +       +P   LP +FDWRE  AV  VK+Q  CGS WAFSTTG +EG
Sbjct: 115 QYLGLKRPSALSTAPTAEPLPTGDLPPSFDWREKGAVGPVKNQGSCGSCWAFSTTGAVEG 174

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +   T KL+SLSEQ+L+DCD +         D GC GG ++NA+  +  +  GGLE E 
Sbjct: 175 AHFLATGKLLSLSEQQLVDCDHQCDPEEAQACDAGCGGGLMTNAYKYV--EEAGGLELES 232

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
            YPY+G D  C+ N      K++ + ++  DE  +A YL+++GP+A+ INA  +Q YV G
Sbjct: 233 DYPYKGRDGKCQFNPNKVAAKVSNFTNIPIDEDQVAAYLIKSGPLAIGINAEFMQTYVAG 292

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
           VS PI  FC+    NL H VL+VGY   G    +  +K  PYWIIKNSWG  WG+KGY++
Sbjct: 293 VSCPI--FCN--KRNLDHGVLLVGYAEHGFAPARLAYK--PYWIIKNSWGPMWGDKGYYK 346

Query: 330 LYRGDGSCGINDYVRS 345
           + RG G CG+N  V +
Sbjct: 347 ICRGHGECGLNTMVSA 362


>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
 gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
          Length = 366

 Score =  252 bits (644), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 143/337 (42%), Positives = 196/337 (58%), Gaps = 29/337 (8%)

Query: 24  VVGD---EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
           VVGD   + L+  HH      F  F  +  K YA+  E+  RL +F  N+R+ +  Q  +
Sbjct: 34  VVGDGDGDLLNADHH------FAVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQQLD 87

Query: 81  HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVT 139
             + V+G+ +FSDL+  EF+ K+LG   +  + AD     ++P   LP  FDWR+  AVT
Sbjct: 88  PAA-VHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDRGAVT 146

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
            VK+Q  CGS W+FSTTG +EG     T KLVSLSEQ+L+DCD E         D GC G
Sbjct: 147 PVKNQGTCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNG 206

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAK 249
           G +++AF+  +    GGL  E+ YPY G+D + CR +K     K+  +  VS DE  +A 
Sbjct: 207 GLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAA 264

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKA 308
            LV+NGP+AVAINA  +Q Y+ GVS P  + C   ++ L H VL+VGYG         K 
Sbjct: 265 NLVKNGPLAVAINAVFMQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKE 319

Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 320 KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 356


>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
          Length = 358

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 200/336 (59%), Gaps = 25/336 (7%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VV +E+ H L+   H   F  F  + +K+YAT  E+  R  +F  NL K +L Q  +  
Sbjct: 27  QVVDNEEDHLLNAEHH---FTSFKSKFSKSYATKEEHDYRFGVFKANLIKAKLHQKLDP- 82

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           +  +G+ +FSDL+ +EF+ ++LG   +   P++A ++   ++P   LP  FDWRE  AVT
Sbjct: 83  TAEHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKA--PILPTTNLPEDFDWREKGAVT 140

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
            VKDQ  CGS WAFSTTG +EG +   T KLVSLSEQ+L+DCD           D GC G
Sbjct: 141 PVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEEAGSCDSGCNG 200

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
           G ++NAF+ ++    GG+ +EK Y Y G D +C+ +K      ++ +  VS DE  +A  
Sbjct: 201 GLMNNAFEYLLQ--SGGVVQEKDYAYTGRDGSCKFDKSKVVASVSNFSVVSLDEEQIAAN 258

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
           LV+NGP+AVAINA  +Q Y++GVS P  + C      L H VL+VG+G         K  
Sbjct: 259 LVKNGPLAVAINAAWMQAYMSGVSCP--YVC--AKARLDHGVLLVGFGKGAYAPIRLKEK 314

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           PYWIIKNSWG+ WGE+GY+++ RG   CG++  V +
Sbjct: 315 PYWIIKNSWGQNWGEQGYYKICRGRNVCGVDSMVST 350


>gi|34761156|gb|AAQ81938.1| cysteine proteinase precursor [Ipomoea batatas]
          Length = 371

 Score =  252 bits (643), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 141/338 (41%), Positives = 195/338 (57%), Gaps = 32/338 (9%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
           D  L+  HH      F  F  ++ K+YAT  E+  RL +F  NLR+ +  Q  +  S V+
Sbjct: 38  DALLNADHH------FTLFKSKYGKSYATQEEHDYRLSVFKANLRRAKRHQMLDP-SAVH 90

Query: 87  GLNEFSDLSTAEFQAKYLGFKLKPSY---------ADRSVPAMIPNITLPRAFDWREYDA 137
           G+ +FSDL+  EF+  YLG +   S          AD     ++P   LP  F+WR+Y A
Sbjct: 91  GVTKFSDLTPKEFRRTYLGIRKSSSSKQKLKLKLPADAHAAEILPTSDLPFDFEWRDYGA 150

Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGC 188
           VTGVKDQ +CGS W+FSTTG +EG     T +L+SL+EQEL+DCD           D GC
Sbjct: 151 VTGVKDQGLCGSCWSFSTTGTLEGTNFLATGELLSLNEQELVDCDHLCDPKKAGACDAGC 210

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMA 248
            GG ++ A++ ++    GGLE+EK YPY G D  C+ +K      +  +  VS DE  +A
Sbjct: 211 NGGLMTTAYEYVLQS--GGLEKEKDYPYTGRDGTCKFDKSKIAAAVANFSVVSLDEDQIA 268

Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHK 307
             LV++GP++V IN+  +Q Y+ GVS P  + C    +NL H VLIVGYG         K
Sbjct: 269 ANLVKHGPLSVGINSIFMQTYIGGVSCP--YIC--SKKNLDHGVLIVGYGAAGYAPIRFK 324

Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
             PYWIIKNSWGE WGE+GY+++ RG+  CG++  V S
Sbjct: 325 DKPYWIIKNSWGENWGEEGYYKICRGNNICGVDSMVSS 362


>gi|301784869|ref|XP_002927853.1| PREDICTED: cathepsin F-like [Ailuropoda melanoleuca]
          Length = 394

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 138/329 (41%), Positives = 196/329 (59%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++  E L     V+  ++F  F+  +N+TY +  E   R+ +FS N+ + Q +Q  
Sbjct: 75  SVLPLLNKEPLPQDFSVRMVSIFKEFVTTYNRTYESKEEAEWRMSVFSNNVMRAQKIQAL 134

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+ +   +   A     + P  +DWR   AVT
Sbjct: 135 DRGTAQYGITKFSDLTEEEFRTIYLNPLLRENRGKKMDLAKSIGDSAPPEWDWRNKGAVT 194

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +  K   L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 195 QVKDQGMCGSCWAFSVTGNVEGQWFLKRGALLSLSEQELLDCDKVDKACLGGLPSNAYSA 254

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y YRG  + C  + K  +V IN  V +S++E  +  +L +NGP++V
Sbjct: 255 I--KTLGGLETEDDYSYRGHVQTCSFSSKKARVYINDSVELSQNEQKLVAWLAQNGPISV 312

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+SHP++  C      + H+VL+VGYG          +P+W IKNSWG
Sbjct: 313 AINAFGMQFYRRGISHPLRPLCSPW--LIDHAVLLVGYG------NRSGIPFWAIKNSWG 364

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGE+GY+ L+RG G+CG+N    SA+V
Sbjct: 365 TDWGEEGYYYLHRGSGACGVNTMASSAVV 393


>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
          Length = 363

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 143/356 (40%), Positives = 208/356 (58%), Gaps = 27/356 (7%)

Query: 6   FFAGVALLSL-TVSVSSFMV--VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSR 62
            FA VA  S    +   F++  V D +  HL + +H   F  F  + +K+Y+T  E+  R
Sbjct: 11  LFAAVATSSTDNTNTDDFIIRQVVDNEEDHLLNAEHH--FTSFKSKFSKSYSTKEEHDYR 68

Query: 63  LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPA 119
             +F  NL K +L Q  +  +  +G+ +FSDL+ +EF+ ++LG K +   P++A ++   
Sbjct: 69  FGVFKSNLIKAKLHQKLDP-TAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKA--P 125

Query: 120 MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
           ++P   LP  FDWRE  AVT VKDQ  CGS WAFSTTG +EG +   T KLVSLSEQ+L+
Sbjct: 126 ILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLV 185

Query: 180 DCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT 230
           DCD           D GC GG ++NAF+ ++    GG+ +EK Y Y G D +C+ +K   
Sbjct: 186 DCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQ--SGGVVQEKDYAYTGRDGSCKFDKSKV 243

Query: 231 QVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSH 290
              ++ +  VS DE  +A  LV+NGP+AV INA  +Q Y++GVS P  + C      L H
Sbjct: 244 VASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMSGVSCP--YVC--AKSRLDH 299

Query: 291 SVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            VL+VG+G         K  PYWI+KNSWG+ WGE+GY+++ RG   CG++  V +
Sbjct: 300 GVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKICRGRNVCGVDSMVST 355


>gi|291385469|ref|XP_002709277.1| PREDICTED: cathepsin F [Oryctolagus cuniculus]
          Length = 460

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 138/329 (41%), Positives = 195/329 (59%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S+   +  + L     VK  ++F  F+  +N+TY +  E   RL +F+ N+ + Q +Q  
Sbjct: 141 STLPALNRDSLPQDFSVKMASIFKKFVRTYNRTYESKEEAQWRLSVFASNMVRAQKIQSL 200

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+     +   A       P  +DWR   AVT
Sbjct: 201 DRGTAQYGITKFSDLTEEEFRTIYLNPLLRSEPGKKMQLAKPVEDPAPPQWDWRSKGAVT 260

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +  K   L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 261 NVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKLDKACLGGLPSNAYSA 320

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E+ Y Y+G  +AC  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 321 I--KNLGGLETEEDYTYQGHMQACNFSAQKAKVYINDSVELSQNEQKLAAWLAKRGPISV 378

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G++HP++  C      + H+VL+VGYG         A P+W IKNSWG
Sbjct: 379 AINAFGMQFYRRGIAHPLRPLCSPW--LIDHAVLLVGYG------NRSATPFWAIKNSWG 430

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGE+GY+ LYRG G CG+N    SA+V
Sbjct: 431 ADWGEEGYYYLYRGSGVCGVNTMASSAVV 459


>gi|294462776|gb|ADE76932.1| unknown [Picea sitchensis]
          Length = 403

 Score =  251 bits (642), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 144/352 (40%), Positives = 200/352 (56%), Gaps = 35/352 (9%)

Query: 17  VSVSSFMVVGDEKLH--HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
           VS   F+    EK +  HL +++   LF+ F+ +H K Y+T+ EY  RL IF  NL K  
Sbjct: 63  VSEGGFIAQVTEKFNREHLLNLRSKTLFDKFIVEHGKVYSTIEEYVRRLRIFEKNLLKAA 122

Query: 75  LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF--KLKPSYADRSVPAMIPNITLPRAFDW 132
             Q  +  + V+G+  FSDL+  EF+++Y G     +    ++    ++P   LP  FDW
Sbjct: 123 ENQALDP-TAVHGITPFSDLTEYEFESRYTGLLGVRQGLVNEKQTAEILPVDDLPANFDW 181

Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-------- 184
           RE  AVT VK Q  CGS WAFSTTG +EG     T KL++LSEQ+LIDCD +        
Sbjct: 182 REKGAVTEVKTQGNCGSCWAFSTTGVVEGANFLATGKLLNLSEQQLIDCDHKCDPLNTKA 241

Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRD 243
            D+GC GG ++NA++ +M    GG+EE K YPY G    C+ N     VK   + +V+ D
Sbjct: 242 CDNGCHGGLMTNAYNYLME--AGGIEEAKNYPYTGVQGDCKFNPDLAAVKAINFTTVNLD 299

Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           E  +A  LV++GP+AV +NA  +Q Y+ GVS P+   C      ++H VL+VGYG     
Sbjct: 300 EKQIAANLVKHGPLAVGLNAAFMQTYIGGVSCPL--ICS--KRFINHGVLLVGYG----- 350

Query: 304 FTHKAV--------PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSAL 347
             HK          PYWIIKNSWG+ WGE GY++L RG G CG+N  V + +
Sbjct: 351 --HKGFALLRLGYRPYWIIKNSWGKRWGEHGYYKLCRGHGECGMNKMVSAVI 400


>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 596

 Score =  251 bits (641), Expect = 3e-64,   Method: Compositional matrix adjust.
 Identities = 132/292 (45%), Positives = 182/292 (62%), Gaps = 22/292 (7%)

Query: 41  LFNYFLEQHNKTYATLV-EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           LF+ FLE++ +TY++   EY  R  IF  N + +Q L + E G+ VYG+ +F D+S  E+
Sbjct: 168 LFDMFLEKYPRTYSSSSDEYNERFEIFKTNYQVVQHLNEIERGTAVYGITKFMDMSEEEY 227

Query: 100 QAKYLGFKLKPSYADRSVP------AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
                   L P +    VP      A +    +P + DWR++ AVT VK+Q  CGS WAF
Sbjct: 228 HRT-----LAPGFTRPLVPIQTLNSAELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAF 282

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           STTGN+EG +  K KKL+SLSEQEL+DCD  D GC GG  SNA+ +I  KL GGLE EK 
Sbjct: 283 STTGNVEGQWFLKHKKLISLSEQELVDCDTLDSGCGGGLPSNAYKSI-EKL-GGLEPEKD 340

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
           YPY G+ + C + +   +V +N  V++ +DE  +A +L +NGP+++ INA  +QFY  G+
Sbjct: 341 YPYVGEGEKCAIKQSDFKVFVNNSVALPKDEVKLAAWLAQNGPISIGINANLMQFYWGGI 400

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
           SHP + FC+   ++L H VLIVGYG      T    P+WIIKNSWG  WGE+
Sbjct: 401 SHPWKIFCNP--KSLDHGVLIVGYG------TENGTPFWIIKNSWGPDWGEE 444



 Score =  105 bits (262), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 66/143 (46%), Positives = 80/143 (55%), Gaps = 22/143 (15%)

Query: 108 LKPSYADRSVP------AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           L P +    VP      A +    +P + DWR++ AVT VK+Q  CGS WAFSTTGN+EG
Sbjct: 451 LAPGFTRPLVPIQTLNSAELDTTNIPDSMDWRKHGAVTEVKNQGSCGSCWAFSTTGNVEG 510

Query: 162 VYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE------------ 209
            +  K KKL+SLSEQEL+DCD  D GC GG  SNA+ +I  KL  G              
Sbjct: 511 QWFLKHKKLISLSEQELVDCDTLDSGCGGGLPSNAYKSI-EKLENGTPFWIIKNSWGPDW 569

Query: 210 -EEKTYP-YRGDDKACRLNKKAT 230
            EE  Y  YRGD  +C LN  AT
Sbjct: 570 GEEGYYRIYRGDG-SCGLNNMAT 591



 Score = 70.1 bits (170), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 26/43 (60%), Positives = 34/43 (79%)

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               P+WIIKNSWG  WGE+GY+R+YRGDGSCG+N+   S++V
Sbjct: 553 ENGTPFWIIKNSWGPDWGEEGYYRIYRGDGSCGLNNMATSSIV 595


>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
          Length = 363

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 143/356 (40%), Positives = 208/356 (58%), Gaps = 27/356 (7%)

Query: 6   FFAGVALLSLT-VSVSSFMV--VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSR 62
            FA VA  S    +   F++  V D +  HL + +H   F  F  + +K+Y+T  E+  R
Sbjct: 11  LFAAVATSSTDDTNTDDFIIRQVVDNEEDHLLNAEHH--FTSFKSKFSKSYSTKEEHDYR 68

Query: 63  LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPA 119
             +F  NL K +L Q  +  +  +G+ +FSDL+ +EF+ ++LG K +   P++A ++   
Sbjct: 69  FGVFKSNLIKAKLHQKLDP-TAEHGITKFSDLTASEFRRQFLGLKKRLRLPAHAQKA--P 125

Query: 120 MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
           ++P   LP  FDWRE  AVT VKDQ  CGS WAFSTTG +EG +   T KLVSLSEQ+L+
Sbjct: 126 ILPTTNLPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLV 185

Query: 180 DCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT 230
           DCD           D GC GG ++NAF+ ++    GG+ +EK Y Y G D +C+ +K   
Sbjct: 186 DCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQ--SGGVVQEKDYAYTGRDGSCKFDKSKV 243

Query: 231 QVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSH 290
              ++ +  VS DE  +A  LV+NGP+AV INA  +Q Y++GVS P  + C      L H
Sbjct: 244 VASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMSGVSCP--YVC--AKSRLDH 299

Query: 291 SVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            VL+VG+G         K  PYWI+KNSWG+ WGE+GY+++ RG   CG++  V +
Sbjct: 300 GVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKICRGRNVCGVDSMVST 355


>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
          Length = 322

 Score =  251 bits (641), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 135/308 (43%), Positives = 181/308 (58%), Gaps = 13/308 (4%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           L+  F   + K YA   +   R  IF  NL + Q LQ  + G+  YG+ +FSDL+  EF 
Sbjct: 26  LYEQFKRDYGKVYAN-EDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFA 84

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           AKYL   +      R  P  +     P   DWR   AVT V++Q  CGS WAFST GN+E
Sbjct: 85  AKYLSAPVNNDQVKRVRPTGLK--AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVE 142

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G +  KT +LVSLS+Q+L+DCD+   GC GG  ++++  IM    GGLE E  YPY G +
Sbjct: 143 GQWFIKTGQLVSLSKQQLVDCDRAAQGCNGGWPASSYLEIMYM--GGLESESDYPYVGVE 200

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           + C LNK+    KI+  + +  +E D A YL E+GP++  +NA ALQ+Y +GV  P   F
Sbjct: 201 QTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQYYQSGVLKPT--F 258

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
            +  +  L+H+VL VGY           +PYWIIKNSWG  WGEKGYFRL+RGD +CGIN
Sbjct: 259 EECPDTELNHAVLTVGYD------KEGDMPYWIIKNSWGTDWGEKGYFRLFRGDCTCGIN 312

Query: 341 DYVRSALV 348
               SA++
Sbjct: 313 RMATSAII 320


>gi|4826565|emb|CAB42884.1| cathepsin F [Mus musculus]
          Length = 462

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)

Query: 17  VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
            + SSF+ + D + L     VK   LF  F+  +N+TY +  E   RL +F+ N+ + Q 
Sbjct: 139 ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 198

Query: 76  LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
           +Q  + G+  YG+ +FSDL+  EF   YL   L+     +  PA   N   P  +DWR+ 
Sbjct: 199 IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 258

Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
            AVT VK+Q MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SN
Sbjct: 259 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 318

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           A+  I  K  GGLE E  Y Y+G  + C  + +  +V IN  V +SR+E  +A +L + G
Sbjct: 319 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 376

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
           P++VAINA+ +QFY  G++HP +  C      + H+VL+VGYG          +PYW IK
Sbjct: 377 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 428

Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           NSWG  WGE+GY+ LYRG G+CG+N    SA+V
Sbjct: 429 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 461


>gi|11066228|gb|AAG28508.1|AF197480_1 cathepsin F [Mus musculus]
          Length = 462

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)

Query: 17  VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
            + SSF+ + D + L     VK   LF  F+  +N+TY +  E   RL +F+ N+ + Q 
Sbjct: 139 ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 198

Query: 76  LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
           +Q  + G+  YG+ +FSDL+  EF   YL   L+     +  PA   N   P  +DWR+ 
Sbjct: 199 IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 258

Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
            AVT VK+Q MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SN
Sbjct: 259 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 318

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           A+  I  K  GGLE E  Y Y+G  + C  + +  +V IN  V +SR+E  +A +L + G
Sbjct: 319 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 376

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
           P++VAINA+ +QFY  G++HP +  C      + H+VL+VGYG          +PYW IK
Sbjct: 377 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 428

Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           NSWG  WGE+GY+ LYRG G+CG+N    SA+V
Sbjct: 429 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 461


>gi|444510192|gb|ELV09527.1| Cathepsin F [Tupaia chinensis]
          Length = 597

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 141/329 (42%), Positives = 196/329 (59%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++  + L     VK  ++F  F+  +N+TY T  E   RL +F+ N+ + Q +Q  
Sbjct: 278 SGLPLLTKDPLSQDFSVKMASIFKNFVTTYNRTYQTKEEAQWRLSVFASNMVRAQKIQAL 337

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           +HG+  YG+ +FSDL+  EF+  YL   L+     +   A       P  +DWR+  AVT
Sbjct: 338 DHGTAQYGVTKFSDLTEEEFRTIYLNPLLREVPGKKMHLAKSIGDPAPPEWDWRKNGAVT 397

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 398 KVKDQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 457

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  +AC  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 458 I--KNLGGLETEDDYSYQGHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISV 515

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G++HP++  C      + H+VLIVGYG          VP+W IKNSWG
Sbjct: 516 AINAFGMQFYRHGIAHPLRPLCSPW--LIDHAVLIVGYG------NRSEVPFWAIKNSWG 567

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG GSCG+N    SA+V
Sbjct: 568 TDWGEKGYYYLHRGSGSCGVNTMASSAVV 596


>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
          Length = 325

 Score =  251 bits (640), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 137/310 (44%), Positives = 180/310 (58%), Gaps = 18/310 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           L+  F   + K YA   +   R  IF  NL + Q  Q  E G+  YG+ +FSDL+  EF+
Sbjct: 31  LYEQFKRDYGKAYANEDDQ-KRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTPEEFE 89

Query: 101 AKYLGFKLKPSYADRSVPAMIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           AKYLG ++     D  V  +  N   T P + DWRE  AV  +++Q  CGS WAFS  GN
Sbjct: 90  AKYLGLRI-----DEQVDRVQLNDLQTAPASVDWREKGAVGPIENQGSCGSCWAFSVVGN 144

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IEG +  KT  LVSLS+Q+L+DCD  D+GC GG     +  I  K  GGLE +  YPY G
Sbjct: 145 IEGQWFLKTGYLVSLSKQQLVDCDTVDNGCYGGYPPYTYKEI--KRMGGLELQSDYPYTG 202

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
               CRL++     KI+  + +  DE   A +L E+GPM+  +NA  LQFY +G+ HP +
Sbjct: 203 WGHGCRLDRSKLFAKIDDSIVLEADEEKQAAWLAEHGPMSTCLNAKYLQFYQSGILHPSK 262

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
             C    E L+H+VL VGY       T   +PYWIIKNSWG  WGE GYFR+YRGDG+CG
Sbjct: 263 AMC--SPEGLNHAVLTVGYD------TKHGIPYWIIKNSWGTSWGEDGYFRIYRGDGTCG 314

Query: 339 INDYVRSALV 348
           I+    SA++
Sbjct: 315 IDRLTTSAII 324


>gi|9845246|ref|NP_063914.1| cathepsin F precursor [Mus musculus]
 gi|12643321|sp|Q9R013.1|CATF_MOUSE RecName: Full=Cathepsin F; Flags: Precursor
 gi|6467384|gb|AAF13147.1|AF136280_1 cathepsin F precursor [Mus musculus]
 gi|7141165|gb|AAF37228.1|AF217224_1 cathepsin F [Mus musculus]
 gi|26344728|dbj|BAC36013.1| unnamed protein product [Mus musculus]
 gi|37589148|gb|AAH58758.1| Cathepsin F [Mus musculus]
 gi|148701127|gb|EDL33074.1| cathepsin F, isoform CRA_b [Mus musculus]
          Length = 462

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)

Query: 17  VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
            + SSF+ + D + L     VK   LF  F+  +N+TY +  E   RL +F+ N+ + Q 
Sbjct: 139 ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 198

Query: 76  LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
           +Q  + G+  YG+ +FSDL+  EF   YL   L+     +  PA   N   P  +DWR+ 
Sbjct: 199 IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 258

Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
            AVT VK+Q MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SN
Sbjct: 259 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 318

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           A+  I  K  GGLE E  Y Y+G  + C  + +  +V IN  V +SR+E  +A +L + G
Sbjct: 319 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 376

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
           P++VAINA+ +QFY  G++HP +  C      + H+VL+VGYG          +PYW IK
Sbjct: 377 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 428

Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           NSWG  WGE+GY+ LYRG G+CG+N    SA+V
Sbjct: 429 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 461


>gi|351629613|gb|AEQ54770.1| cysteine proteinase CP1 [Coffea canephora]
          Length = 397

 Score =  251 bits (640), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 148/353 (41%), Positives = 202/353 (57%), Gaps = 45/353 (12%)

Query: 31  HHLHH-----VKHTAL-------FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD 78
           HH HH       H  L       F  F+E++ KTY+T  EY  RL IF+ NL K    Q 
Sbjct: 51  HHRHHPGRSSANHRLLGTTTEVHFKSFVEEYEKTYSTHEEYVHRLGIFAKNLIKAAEHQA 110

Query: 79  TEHGSGVYGLNEFSDLSTAEFQAKYLGF----------KLKPSYADRSVPAMIPNIT-LP 127
            +  S ++G+ +FSDL+  EF+A Y+G           +L     D S   ++ +++ LP
Sbjct: 111 MD-PSAIHGVTQFSDLTEEEFEATYMGLKGGAGVGGTTQLGKDDGDESAAEVMMDVSDLP 169

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--- 184
            +FDWRE  AVT VK Q  CGS WAFSTTG IEG     T KL+SLSEQ+L+DCD     
Sbjct: 170 ESFDWREKGAVTEVKTQGRCGSCWAFSTTGAIEGANFIATGKLLSLSEQQLVDCDHMCDL 229

Query: 185 ------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYV 238
                 DDGC GG ++ AF+ ++    GG+EEE TYPY G    C+ N +   VK+  + 
Sbjct: 230 KEKDDCDDGCSGGLMTTAFNYLIE--AGGIEEEVTYPYTGKRGECKFNPEKVAVKVRNFA 287

Query: 239 SVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY- 297
            +  DE+ +A  +V NGP+A+ +NA  +Q Y+ GVS P+   CD   + ++H VL+VGY 
Sbjct: 288 KIPEDESQIAANVVHNGPLAIGLNAVFMQTYIGGVSCPL--ICD--KKRINHGVLLVGYG 343

Query: 298 --GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             G    +  +K  PYWIIKNSWG+ WGE GY+RL RG   CG++  V SA+V
Sbjct: 344 SRGFSILRLGYK--PYWIIKNSWGKRWGEHGYYRLCRGHNMCGMSTMV-SAVV 393


>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 369

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 133/316 (42%), Positives = 181/316 (57%), Gaps = 21/316 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F++   K Y ++ EY  R  +F  NL K  L       +  +G+  FSDL+  EF +
Sbjct: 56  FESFMKDFGKVYHSVEEYEHRFGVFKSNLLK-ALKHQALDPTASHGVTMFSDLTEEEFTS 114

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           KYLG K     +       +P   LP  FDWRE  AV  VKDQ  CGS WAFSTTG +EG
Sbjct: 115 KYLGLKRPSVLSSAPQAPPLPTEDLPPNFDWREKGAVGPVKDQGGCGSCWAFSTTGAVEG 174

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +   + KLVSLSEQ+L+DCD +         D GC GG ++NA+  +  +  GGLE E 
Sbjct: 175 AHFLNSGKLVSLSEQQLVDCDHQCDREEADACDAGCNGGFMTNAYQYV--EAAGGLELES 232

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
            YPY G D  C+ +     VK++ + ++  DE  +A YL+++GP+A+ INA  +Q Y+ G
Sbjct: 233 DYPYEGRDGKCKFDSNKVAVKVSNFTNIPVDEDQVAAYLIKSGPLAIGINAEFMQTYIAG 292

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
           VS PI  FC+    NL H VL+VGY   G    +  +K  PYWIIKNSWG  WG+ GY++
Sbjct: 293 VSCPI--FCN--KRNLDHGVLLVGYAERGFAPARLAYK--PYWIIKNSWGPNWGDNGYYK 346

Query: 330 LYRGDGSCGINDYVRS 345
           + RG G CG+N  V +
Sbjct: 347 ICRGHGECGLNTMVSA 362


>gi|338712411|ref|XP_001491536.3| PREDICTED: cathepsin F [Equus caballus]
          Length = 459

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 138/313 (44%), Positives = 191/313 (61%), Gaps = 10/313 (3%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           VK  ++F +F+  +N+TY T  E   R+ IF+ N+ + Q +Q  + G+  YG+ +FSDL+
Sbjct: 156 VKMASIFKHFVTTYNRTYETKEEAQWRMSIFASNMVRAQKIQALDRGTAQYGVTKFSDLT 215

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
             EF+  YL   LK     +   A     + P  +DWR   AVT VKDQ MCGS WAFS 
Sbjct: 216 EEEFRTIYLNPLLKEEPGVKMRRAKSVGDSAPPEWDWRSKGAVTEVKDQGMCGSCWAFSV 275

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  I  K  GGLE E  Y 
Sbjct: 276 TGNVEGQWFLNRGALLSLSEQELLDCDKVDKACMGGLPSNAYSAI--KTLGGLETEDDYS 333

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           Y G  +AC  + +  +V IN  V ++++E  +A +L + GP++VAINA+ +QFY  G+SH
Sbjct: 334 YHGHLQACSFSAEKAKVYINDSVELTKNEQKLAAWLAKKGPISVAINAFGMQFYRHGISH 393

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
           P++  C      + H+VL+VGYG         AVP+W IKNSWG  WGE+GY+ LYRG G
Sbjct: 394 PLRPLCSPW--LIDHAVLLVGYG------NRSAVPFWAIKNSWGTDWGEEGYYYLYRGSG 445

Query: 336 SCGINDYVRSALV 348
           +CG+N    SA+V
Sbjct: 446 ACGVNTMASSAVV 458


>gi|113819972|gb|AAH04054.2| Ctsf protein [Mus musculus]
          Length = 332

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)

Query: 17  VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
            + SSF+ + D + L     VK   LF  F+  +N+TY +  E   RL +F+ N+ + Q 
Sbjct: 9   ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 68

Query: 76  LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
           +Q  + G+  YG+ +FSDL+  EF   YL   L+     +  PA   N   P  +DWR+ 
Sbjct: 69  IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 128

Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
            AVT VK+Q MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SN
Sbjct: 129 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 188

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           A+  I  K  GGLE E  Y Y+G  + C  + +  +V IN  V +SR+E  +A +L + G
Sbjct: 189 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 246

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
           P++VAINA+ +QFY  G++HP +  C      + H+VL+VGYG          +PYW IK
Sbjct: 247 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 298

Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           NSWG  WGE+GY+ LYRG G+CG+N    SA+V
Sbjct: 299 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 331


>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
          Length = 325

 Score =  250 bits (639), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 134/310 (43%), Positives = 182/310 (58%), Gaps = 18/310 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           L+  F   + K YA   +   R  IF  NL + Q  Q  E G+  YG+ +FSDL+  EF 
Sbjct: 31  LYEQFKRDYGKAYANEDDQ-KRFAIFKDNLVRAQQYQTQEQGTAKYGVTQFSDLTNEEFA 89

Query: 101 AKYLGFKLKPSYADRSVPAMIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           A YLG ++     D  V  +  N   T P + DWRE  AV  V+ Q  CGS WAFS T N
Sbjct: 90  AMYLGSRI-----DERVDRVQLNDLQTAPASVDWREKGAVGPVEHQGSCGSCWAFSVTAN 144

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG +  KT +LVSLS+Q+L+DCD+ D GC GG     +  I  K  GGLE +  YPY G
Sbjct: 145 VEGQWFLKTGRLVSLSKQQLVDCDRLDHGCSGGYPPYTYKEI--KRMGGLELQSAYPYTG 202

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
            ++ACRL++     KI+  + + ++E   A +L E+GPM+  +NA  LQFY  G+ HP +
Sbjct: 203 WEQACRLDRSKLFAKIDDSIVLEKNEEKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPSE 262

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
           + C    E L+H+VL VGY       T + VPYW ++NSWG  WGE GYFR+YRGDG+CG
Sbjct: 263 YACS--PEGLNHAVLTVGYD------TERGVPYWTVRNSWGTRWGENGYFRIYRGDGTCG 314

Query: 339 INDYVRSALV 348
           I+    SA++
Sbjct: 315 IDRLTTSAII 324


>gi|77628008|ref|NP_001029282.1| cathepsin F precursor [Rattus norvegicus]
 gi|71681040|gb|AAH99780.1| Cathepsin F [Rattus norvegicus]
 gi|149062007|gb|EDM12430.1| cathepsin F, isoform CRA_a [Rattus norvegicus]
 gi|159895422|gb|ABX09995.1| cathepsin F [Rattus norvegicus]
          Length = 462

 Score =  250 bits (639), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 140/325 (43%), Positives = 192/325 (59%), Gaps = 10/325 (3%)

Query: 24  VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS 83
           ++  + L     VK   LF  F+  +N+TY +  E   RL +F+ N+ + Q +Q  + G+
Sbjct: 147 MLDKDPLPQDFSVKMATLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRGT 206

Query: 84  GVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
             YG+ +FSDL+  EF   YL   L+     +   A   N   P  +DWR+  AVT VKD
Sbjct: 207 AQYGITKFSDLTEEEFHTIYLNPLLQKESGGKMSLAKSINDLAPPEWDWRKKGAVTEVKD 266

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSK 203
           Q MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  I  K
Sbjct: 267 QGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMGGLPSNAYTAI--K 324

Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
             GGLE E  Y Y+G  +AC  + +  +V IN  V +SRDE  +A +L + GP++VAINA
Sbjct: 325 NLGGLETEDDYGYQGHVQACNFSTQMAKVYINDSVELSRDENKIAAWLAQKGPISVAINA 384

Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
           + +QFY  G++HP +  C      + H+VL+VGYG          +PYW IKNSWG  WG
Sbjct: 385 FGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIKNSWGRDWG 436

Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
           E+GY+ LYRG G+CG+N    SA+V
Sbjct: 437 EEGYYYLYRGSGACGVNTMASSAVV 461


>gi|358339356|dbj|GAA47436.1| cathepsin L [Clonorchis sinensis]
          Length = 236

 Score =  250 bits (638), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 123/238 (51%), Positives = 159/238 (66%), Gaps = 8/238 (3%)

Query: 111 SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
           S ++R      P  +LP +FDWR++  VT VKDQ MCGS WAF+ TGNIEG +  KTKKL
Sbjct: 6   SRSNRPKVTSYPTQSLPGSFDWRQHGVVTEVKDQGMCGSCWAFAVTGNIEGQWYKKTKKL 65

Query: 171 VSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT 230
           VSLSEQ+L+DCD++D+ C GG    A+++I+    GGL  EK YPY    + C L     
Sbjct: 66  VSLSEQQLLDCDKKDEACNGGFPEWAYESIVKM--GGLMSEKDYPYEAHKETCNLKPNNI 123

Query: 231 QVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSH 290
              IN  V++S+DE ++A +L ENGP++V +NA  LQFY  GVSHP    C    + L H
Sbjct: 124 SAYINDSVTLSKDEKELAAWLTENGPISVGMNANFLQFYFGGVSHPPHMLCS--EQGLDH 181

Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           +VL+VGYGV  T F  +  PYWI+KNSWG  WGEKGYFR+YRGDG+CGIN    S++V
Sbjct: 182 AVLLVGYGV--TSFWQR--PYWIVKNSWGRSWGEKGYFRIYRGDGTCGINADATSSIV 235


>gi|148701126|gb|EDL33073.1| cathepsin F, isoform CRA_a [Mus musculus]
          Length = 417

 Score =  250 bits (638), Expect = 7e-64,   Method: Compositional matrix adjust.
 Identities = 142/333 (42%), Positives = 197/333 (59%), Gaps = 11/333 (3%)

Query: 17  VSVSSFMVVGD-EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
            + SSF+ + D + L     VK   LF  F+  +N+TY +  E   RL +F+ N+ + Q 
Sbjct: 94  ATFSSFLPLLDKDPLPQDFSVKMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQK 153

Query: 76  LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
           +Q  + G+  YG+ +FSDL+  EF   YL   L+     +  PA   N   P  +DWR+ 
Sbjct: 154 IQALDRGTAQYGITKFSDLTEEEFHTIYLNPLLQKESGRKMSPAKSINDLAPPEWDWRKK 213

Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISN 195
            AVT VK+Q MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SN
Sbjct: 214 GAVTEVKNQGMCGSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKVDKACLGGLPSN 273

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           A+  I  K  GGLE E  Y Y+G  + C  + +  +V IN  V +SR+E  +A +L + G
Sbjct: 274 AYAAI--KNLGGLETEDDYGYQGHVQTCNFSAQMAKVYINDSVELSRNENKIAAWLAQKG 331

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
           P++VAINA+ +QFY  G++HP +  C      + H+VL+VGYG          +PYW IK
Sbjct: 332 PISVAINAFGMQFYRHGIAHPFRPLCSPW--FIDHAVLLVGYG------NRSNIPYWAIK 383

Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           NSWG  WGE+GY+ LYRG G+CG+N    SA+V
Sbjct: 384 NSWGSDWGEEGYYYLYRGSGACGVNTMASSAVV 416


>gi|345783063|ref|XP_533219.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Canis lupus
           familiaris]
          Length = 490

 Score =  250 bits (638), Expect = 8e-64,   Method: Compositional matrix adjust.
 Identities = 138/330 (41%), Positives = 199/330 (60%), Gaps = 11/330 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++  + L     VK  ++F  F+  +N+TY T  E   R+ +FS N+ + Q +Q  
Sbjct: 170 SVLPLLNKDPLPQDFSVKMASVFKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQAL 229

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR-SVPAMIPNITLPRAFDWREYDAV 138
           + G+  YG+ +FSDL+  EF+  YL   L+ +   +  +   I +   P  +DWR   AV
Sbjct: 230 DRGTAQYGITKFSDLTEEEFRTIYLNPLLRENRGKKMRLAKSISDHAPPPEWDWRSKGAV 289

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFD 198
           T VKDQ MCGS WAFS TGN+EG +  K   L+SLSEQEL+DCD+ D  C GG  SNA+ 
Sbjct: 290 TKVKDQGMCGSCWAFSVTGNVEGQWFLKEGTLLSLSEQELLDCDKVDKACLGGLPSNAYS 349

Query: 199 TIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMA 258
            IM+   GGLE E  Y Y+G  +AC  + K  +V IN  + +S++E  +A +L + GP++
Sbjct: 350 AIMTL--GGLETEDDYSYQGHLQACSFSAKKARVYINDSMELSQNEQKLAAWLAKKGPIS 407

Query: 259 VAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           VAINA+ +QFY  G+SHP++  C      + H+VL+VGYG          +P+W IKNSW
Sbjct: 408 VAINAFGMQFYRHGISHPLRPLCSPW--LIDHAVLLVGYG------NRSGIPFWAIKNSW 459

Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           G  WGE+GY+ L+RG G+CG+N    SA+V
Sbjct: 460 GTDWGEEGYYYLHRGSGACGVNTMASSAVV 489


>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
          Length = 461

 Score =  249 bits (636), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 136/317 (42%), Positives = 182/317 (57%), Gaps = 25/317 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+++  + Y+++ E   R  I+  N+   + LQ  E G+ +YG  +FSD++  EFQ 
Sbjct: 159 FMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQFEEKGTAIYGATKFSDMTAEEFQK 218

Query: 102 KYLGFKLKPS-YADRSVPAMIP------NIT---LPRAFDWREYDAVTGVKDQTMCGSSW 151
             L     PS + DR     I       N++   LP  FDWR    VT VKDQ  CGS W
Sbjct: 219 IML-----PSIWWDRVESNGITFNLNDFNLSIYNLPSKFDWRTEGVVTPVKDQGSCGSCW 273

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
           AFS TGNIE ++A KT KL+SLSEQELIDCD  D GC GG   NAF  I  K  GGLE E
Sbjct: 274 AFSVTGNIESLWAIKTGKLISLSEQELIDCDVIDKGCNGGLPINAFREI--KRMGGLEPE 331

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
             YPY   +  C L +    V I+  V + R+ET M  ++ + GP++V I+A  L +Y +
Sbjct: 332 DQYPYEAKNGTCHLVRAQIAVSIDDAVEIPRNETVMKAWIAQRGPLSVGIDAELLSYYKS 391

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+ HP +  C      ++H VLI GYG++        +PYW IKNSWGE WGE GYF+L 
Sbjct: 392 GILHPSKSRCPPS--KINHGVLITGYGIENN------LPYWTIKNSWGEQWGENGYFQLM 443

Query: 332 RGDGSCGINDYVRSALV 348
           RG   CG++D V SA++
Sbjct: 444 RGKNICGVSDLVSSAII 460


>gi|359492179|ref|XP_002280808.2| PREDICTED: cysteine proteinase RD19a-like [Vitis vinifera]
 gi|302142580|emb|CBI19783.3| unnamed protein product [Vitis vinifera]
          Length = 365

 Score =  249 bits (635), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 139/329 (42%), Positives = 188/329 (57%), Gaps = 24/329 (7%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
           D+ L   HH      F  F  +  KTYAT  E+  R  IF  NLR+ +  Q  +  S V+
Sbjct: 43  DDLLSAEHH------FAAFKARFRKTYATAEEHDYRFSIFKANLRRAKRNQLLDP-SAVH 95

Query: 87  GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
           G+  FSDL+ AEF+  YLG K      D     ++P   LP  FDWR++ AVT VKDQ  
Sbjct: 96  GVTRFSDLTPAEFRQNYLGLKPLRFPIDTQQAPILPTNDLPTDFDWRDHGAVTAVKDQGE 155

Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAF 197
           CGS W+FSTTG +EG +   T  LVSLSEQ+L+DCD E         D GC GG ++ AF
Sbjct: 156 CGSCWSFSTTGALEGAHFLATGNLVSLSEQQLVDCDHECDPEEYGACDRGCNGGLMNTAF 215

Query: 198 DTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPM 257
           + I+    GG+   + YPY G D  C+ +K      ++ + +VS DE  +A  LV+NGP+
Sbjct: 216 EYILK--AGGVVRGEDYPYTGTDGHCKFDKTKIAASVSNFSTVSIDEDQIAANLVKNGPL 273

Query: 258 AVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKN 316
           AV INA  +Q Y  GVS P  F C   + +L+H VL+VGYG    +    K  PYW++KN
Sbjct: 274 AVGINAIFMQSYAGGVSCP--FIC---STSLNHGVLLVGYGSAGYSPIRFKEKPYWLLKN 328

Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           SWG+ WGE GY+++ RG   CG++  V +
Sbjct: 329 SWGQNWGEHGYYKICRGHNICGVDSMVST 357


>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
          Length = 327

 Score =  248 bits (634), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 142/348 (40%), Positives = 194/348 (55%), Gaps = 28/348 (8%)

Query: 1   MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
           +SCF       ++S  ++VS+  V    +           L+  F   + K YA   +  
Sbjct: 6   VSCFAL-----IVSCAIAVSAGRVPDSAR----------ELYEQFKRGYGKVYAN-EDDQ 49

Query: 61  SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM 120
            R  IF  NL + Q LQ  + G+  YG+ +FSDL+  EF AKYL   +      R  P  
Sbjct: 50  KRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPEEFAAKYLSAPVNDDQVKRMRPTG 109

Query: 121 IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
           +     P   DWR   AVT V++Q  CGS WAFST GN+EG +  KT +LVSLS+Q+L+D
Sbjct: 110 LK--AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVEGQWFIKTGQLVSLSKQQLVD 167

Query: 181 CDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV 240
           CD+   GC GG  ++++  IM    GGLE E  YPY G ++ C LNK+    KI+  + +
Sbjct: 168 CDRAAQGCNGGWPASSYLEIMYM--GGLESESDYPYVGVEQTCALNKEKLVAKIDDSIVL 225

Query: 241 SRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
             +E D A YL E+GP++  +NA ALQ Y +GV  P   F +  +  L+H+VL VGY   
Sbjct: 226 GPEEEDHAAYLAEHGPLSTLLNAVALQHYQSGVLKPT--FDECPDTELNHAVLTVGYD-- 281

Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                   +PYWIIKNSWG  WGEKGYFRL+RGD +CGIN    SA++
Sbjct: 282 ----KEGDMPYWIIKNSWGTDWGEKGYFRLFRGDCTCGINRMATSAII 325


>gi|344295816|ref|XP_003419606.1| PREDICTED: cathepsin F [Loxodonta africana]
          Length = 473

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 136/312 (43%), Positives = 190/312 (60%), Gaps = 10/312 (3%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
           K  ++F  F+  +N+TY T  E   R+ +F+ N+ + Q LQ  + G+  YG+ +FSDL+ 
Sbjct: 171 KMASIFKNFVTTYNRTYETKEETKWRMSVFANNMIRAQKLQALDQGTAQYGITKFSDLTE 230

Query: 97  AEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
            EF+  YL   L+     +      P   +P  +DWR   AVT VKDQ MCGS WAFS T
Sbjct: 231 EEFRTIYLNPLLREDPGQKMRLGKAPKGPVPPDWDWRTKGAVTKVKDQGMCGSCWAFSVT 290

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  I  K  GGLE E+ Y Y
Sbjct: 291 GNVEGQWFLNRGTLLSLSEQELLDCDKVDKACMGGVPSNAYSAI--KTLGGLETEEDYSY 348

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
            G  +AC  + +  +V IN  V +S++E  +A +L +NGP++VAINA+ +QFY  G++HP
Sbjct: 349 HGHLQACSFSAEKAKVYINDSVELSQNEYKLAAWLAKNGPISVAINAFGMQFYRHGIAHP 408

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
           ++  C      + H+VLIVGYG          VP+W IKNSWG  WGE+GY+ L+RG G+
Sbjct: 409 LRPLCS--PWLIDHAVLIVGYG------NRSDVPFWAIKNSWGTDWGEEGYYYLHRGSGA 460

Query: 337 CGINDYVRSALV 348
           CG+N    SA+V
Sbjct: 461 CGVNTMASSAVV 472


>gi|33945877|emb|CAE45588.1| papain-like cysteine proteinase-like protein 1 [Lotus japonicus]
          Length = 359

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 142/337 (42%), Positives = 195/337 (57%), Gaps = 32/337 (9%)

Query: 24  VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTE 80
           VV DE L   HH      F  F  +  K YAT  E+  R ++F  N+   R+ QLL    
Sbjct: 33  VVDDEGLGAEHH------FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-- 84

Query: 81  HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
             S V+G+ +FSDL+  EFQ   LG +     +D     ++P   LP+ FDWRE+ AVT 
Sbjct: 85  --SAVHGVTQFSDLTPMEFQHSVLGLRGVGLPSDADSAPILPTDNLPKDFDWREHGAVTP 142

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEG 190
           VK+Q  CGS W+FS TG +EG +   T +LVSLSEQ+L+DCD +          D GC G
Sbjct: 143 VKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQQCDPEEAGSCDSGCNG 202

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAK 249
           G +++AF+ I++   GG+  E+ YPY G +   C+ +K      +  +  VSRDE  +A 
Sbjct: 203 GLMNSAFEYILNN--GGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIAA 260

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKA 308
            LV+NGP+AVAINA  +Q YV GVS P  + C   ++ L+H VL+VGYG +       K 
Sbjct: 261 NLVKNGPLAVAINAVYMQTYVGGVSCP--YVC---SKKLNHGVLLVGYGSESYAPIRMKQ 315

Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 316 KPYWIIKNSWGENWGENGYYKICRGRNICGVDSMVST 352


>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
          Length = 360

 Score =  248 bits (634), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 140/335 (41%), Positives = 192/335 (57%), Gaps = 23/335 (6%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VV D++   L    H   F+ FL ++ K+YA   E+  R  +F  NLR+ +  Q  +  
Sbjct: 29  QVVSDDQQQLLSAEAH---FSSFLSRYGKSYADEAEHAYRFSVFKSNLRRARRHQRLDP- 84

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-MIPNITLPRAFDWREYDAVTGV 141
           + V+G+  F+DL+ +EF+  YLG + +P  A  +  A ++P   LP  FDWR++ AVT V
Sbjct: 85  TAVHGVTRFADLTPSEFRRTYLGLRRRPRTAGSTHDAPILPTNELPADFDWRDHGAVTPV 144

Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGS 192
           K+Q  CGS W+FS  G +EG     T  LVSLSEQ+L+DCD E         D GC GG 
Sbjct: 145 KNQGSCGSCWSFSAAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQGCNGGL 204

Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYL 251
           ++ AF+ I+    GGLE E  YPY G D+  C+ NK       + +  VS DE  +A  L
Sbjct: 205 MTTAFEYILKS--GGLEREADYPYTGTDRGTCKFNKAKISAVASNFSVVSIDEDQIAANL 262

Query: 252 VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVP 310
           V++GP+AV INA  +Q YV GVS P  + C    ++L H VL+VGYG         K  P
Sbjct: 263 VKHGPLAVGINAVFMQTYVGGVSCP--YIC---GKHLDHGVLLVGYGSAGFAPIRFKEKP 317

Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           YWIIKNSWGE WGE GY+++ RG   CG++  V S
Sbjct: 318 YWIIKNSWGENWGENGYYKICRGRNVCGVDSMVSS 352


>gi|7211743|gb|AAF40415.1|AF216784_1 papain-like cysteine proteinase isoform II [Ipomoea batatas]
          Length = 368

 Score =  248 bits (633), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 141/337 (41%), Positives = 195/337 (57%), Gaps = 29/337 (8%)

Query: 24  VVGD---EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
           VVGD   + L+  HH      F  F  +  K YA+  E+  RL +F  N+R+ +  Q+ +
Sbjct: 36  VVGDGDGDLLNADHH------FTVFKRRFGKAYASDEEHDYRLSVFKANMRRAKRHQELD 89

Query: 81  HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVT 139
             + V+G+ +FSD +  EF+ K+LG   +  + AD     ++P   LP  FDWR+  AVT
Sbjct: 90  PAA-VHGVTQFSDSTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDRGAVT 148

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD---------GCEG 190
            VK+Q  CG  W+FSTTG +EG     T KLVSLSEQ+L+DCD E D         GC G
Sbjct: 149 PVKNQGTCGLCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDFGCNG 208

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAK 249
           G +++AF+  +    GGL  E+ YPY G+D + CR +K     K+  +  VS DE  +A 
Sbjct: 209 GLMNSAFEYTLK--AGGLMREEDYPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAA 266

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKA 308
            LV+NGP+AVAINA  +Q Y+ GVS P  + C   ++ L H VL+VGYG         K 
Sbjct: 267 NLVKNGPLAVAINAVFMQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKE 321

Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 322 KPYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 358


>gi|426252094|ref|XP_004019753.1| PREDICTED: cathepsin F isoform 1 [Ovis aries]
          Length = 460

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 138/332 (41%), Positives = 198/332 (59%), Gaps = 11/332 (3%)

Query: 18  SVSSFM-VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
           + SSF+ ++  + L     VK  ++F  F+  +N+TY +  E   R+ +F+ N+ + Q +
Sbjct: 138 TFSSFLPLLNKDPLPQDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKI 197

Query: 77  QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
           Q  + G+  YG+ +FSDL+  EF+  YL   LK +       A       P  +DWR   
Sbjct: 198 QALDRGTAQYGVTKFSDLTEEEFRTIYLNPLLKDAPGRNMRLAQPVTDVPPPQWDWRNKG 257

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
           AVT VKDQ MCGS WAFS TGN+EG +  K   L+SLSEQEL+DCD+ D  C GG  SNA
Sbjct: 258 AVTDVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNA 317

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
           +  I  +  GGLE E  Y YRG  + C  + +  +V IN  V +S++E  +A +L + GP
Sbjct: 318 YSAI--RTLGGLETEDDYSYRGHLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGP 375

Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
           ++VAINA+ +QFY  G+SHP++  C      + H+VL+VGYG         A P+W IKN
Sbjct: 376 ISVAINAFGMQFYRHGISHPLRPLCSPW--LIDHAVLLVGYG------NRSATPFWAIKN 427

Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           SWG  WGE+GY+ L+RG G+CG+N    SA++
Sbjct: 428 SWGTNWGEEGYYYLHRGSGACGVNIMASSAVI 459


>gi|4757570|gb|AAD29084.1|AF082181_1 cysteine proteinase precursor [Solanum melongena]
          Length = 363

 Score =  248 bits (632), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 139/337 (41%), Positives = 192/337 (56%), Gaps = 28/337 (8%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDT 79
            VV +   +H+ + +H   F+ F  ++ K YA+  E+  RL +F  NLR+    QLL  T
Sbjct: 30  QVVSETDDNHMLNAEHH--FSLFKSKYGKIYASQEEHDHRLKVFKANLRRARRHQLLDPT 87

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
                 +G+ +FSDL+ +EF+  YLG  K +P    +  P ++P   LP  FDWRE  AV
Sbjct: 88  AE----HGITQFSDLTPSEFRRTYLGLHKPRPKLNAQKAP-ILPTSDLPEDFDWREKGAV 142

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCE 189
           TGVK+Q  CGS W+FSTTG +EG +   T +LVSLSEQ+L+DCD E         D GC 
Sbjct: 143 TGVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEEKSECDAGCN 202

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAK 249
           GG ++ AF+  +    GGL+ EK YPY G D  C  +K      +  +  +  DE  +A 
Sbjct: 203 GGLMTTAFEYTLK--AGGLQREKDYPYTGRDGKCHFDKSKIAASVANFSVIGLDEDQIAA 260

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKA 308
            LV++GP+AV INA  +Q Y+ GVS P+  F     +   H VL+VGYG         K 
Sbjct: 261 NLVKHGPLAVGINAAWMQTYMRGVSCPLICF-----KRQDHGVLLVGYGSAGFAPIRLKE 315

Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 316 KPYWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 352


>gi|426252096|ref|XP_004019754.1| PREDICTED: cathepsin F isoform 2 [Ovis aries]
          Length = 477

 Score =  247 bits (631), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 138/332 (41%), Positives = 198/332 (59%), Gaps = 11/332 (3%)

Query: 18  SVSSFM-VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL 76
           + SSF+ ++  + L     VK  ++F  F+  +N+TY +  E   R+ +F+ N+ + Q +
Sbjct: 155 TFSSFLPLLNKDPLPQDFSVKMASIFKDFVTTYNRTYDSQEEASWRMSVFANNMVRAQKI 214

Query: 77  QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYD 136
           Q  + G+  YG+ +FSDL+  EF+  YL   LK +       A       P  +DWR   
Sbjct: 215 QALDRGTAQYGVTKFSDLTEEEFRTIYLNPLLKDAPGRNMRLAQPVTDVPPPQWDWRNKG 274

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNA 196
           AVT VKDQ MCGS WAFS TGN+EG +  K   L+SLSEQEL+DCD+ D  C GG  SNA
Sbjct: 275 AVTDVKDQGMCGSCWAFSVTGNVEGQWFLKRGTLLSLSEQELLDCDKTDKACLGGLPSNA 334

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
           +  I  +  GGLE E  Y YRG  + C  + +  +V IN  V +S++E  +A +L + GP
Sbjct: 335 YSAI--RTLGGLETEDDYSYRGHLQTCSFSAEKAKVYINDSVELSKNEQKLAAWLAKKGP 392

Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
           ++VAINA+ +QFY  G+SHP++  C      + H+VL+VGYG         A P+W IKN
Sbjct: 393 ISVAINAFGMQFYRHGISHPLRPLCSPW--LIDHAVLLVGYG------NRSATPFWAIKN 444

Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           SWG  WGE+GY+ L+RG G+CG+N    SA++
Sbjct: 445 SWGTNWGEEGYYYLHRGSGACGVNIMASSAVI 476


>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
          Length = 321

 Score =  247 bits (631), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 133/308 (43%), Positives = 179/308 (58%), Gaps = 13/308 (4%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           L+  F   + K YA   +   R  IF  NL + Q LQ  + G+  YG+ +FSDL+  EF 
Sbjct: 26  LYEQFKRDYGKVYAN-EDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFA 84

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           AKYL   +      R  P  +     P   DWR   AVT V++Q  CGS WAFST GN+E
Sbjct: 85  AKYLSAPVNNDQVKRVRPTGLK--AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVE 142

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G +  KT +LVSLS+Q+L+DCD+  DGC GG  ++++  IM    GGLE +  YPY G  
Sbjct: 143 GQWFIKTGQLVSLSKQQLVDCDRAADGCNGGWPASSYLEIMHM--GGLESQDDYPYAGVK 200

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           + C + K+    KI+  +++   E D A YL E+GP++  +NA  LQ+Y +G+ HP    
Sbjct: 201 EQCFMEKERLLAKIDDSIALGPSEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPSYEE 260

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           C     +L+H+VL VGY           +PYWIIKNSW   WGEKGYFRLYRGDG+CGIN
Sbjct: 261 C--SPVDLNHAVLTVGYD------KEGDMPYWIIKNSWNVEWGEKGYFRLYRGDGTCGIN 312

Query: 341 DYVRSALV 348
               SA++
Sbjct: 313 RMPTSAII 320


>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
          Length = 476

 Score =  247 bits (631), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 138/314 (43%), Positives = 188/314 (59%), Gaps = 11/314 (3%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           V+   LF  F+ ++NK Y++  E   RL IF  NL+  + +Q  + GS  YG+ +FSDL+
Sbjct: 172 VELLGLFKEFMTKYNKVYSSQEEADRRLQIFKENLKTAEKIQSLDEGSAEYGVTKFSDLT 231

Query: 96  TAEFQAKYLGFKLKPSYADRSV-PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
             EF+  YL   L      R + PA       P ++DWR++ AV+ VK+Q +CGS WAFS
Sbjct: 232 EEEFRLTYLNPLLSQWTLRRPMKPASPARSPAPASWDWRDHGAVSPVKNQGLCGSCWAFS 291

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
            TGNIEG +  K  KL+SLSEQEL+DCD  D  C GG  SNA++ I     GGLE E  Y
Sbjct: 292 VTGNIEGQWFLKHGKLLSLSEQELVDCDGLDHACRGGLPSNAYEAIEGL--GGLEAENDY 349

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
            Y G  + C    +     IN  V +  DE +MA +L ENGP++VA+NA+A+QFY  GVS
Sbjct: 350 TYSGHKQKCSFATEKVAAYINSSVELPSDENEMAAWLAENGPVSVALNAFAMQFYKKGVS 409

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
           HP    C+     + H+VL+VGYG          +P+W IKNSWGE +GE+GY+ LY+G 
Sbjct: 410 HPWMILCNPW--MIDHAVLLVGYG------ERNGIPFWAIKNSWGEDYGEEGYYYLYKGS 461

Query: 335 GSCGINDYVRSALV 348
            +CGIN    SA++
Sbjct: 462 NACGINKMGSSAVI 475


>gi|410974700|ref|XP_003993781.1| PREDICTED: cathepsin F [Felis catus]
          Length = 459

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 141/329 (42%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++  + L     VK  ++F  F+  +N+TY T  E   RL +FS N+ + Q +Q  
Sbjct: 140 SVLPLLNKDPLPQDFSVKMASIFKEFVTTYNRTYGTQEEAQWRLSVFSNNMVRAQKIQAL 199

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+A YL   LK +       A       P  +DWR   AVT
Sbjct: 200 DRGTAQYGITKFSDLTEEEFRAIYLNPLLKENRNKMMHLAKSIGDHAPPEWDWRTKGAVT 259

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VK+Q MCGS WAFS TGN+EG +  K   L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 260 NVKNQGMCGSCWAFSVTGNVEGQWFLKQGDLLSLSEQELLDCDKVDKACLGGLPSNAYLA 319

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y G  + C  + K  +V IN  V +S++E  +A +L + GP++V
Sbjct: 320 I--KNLGGLETEDDYSYSGHLQTCSFSAKKAKVYINDSVELSQNEQKLAAWLAKKGPISV 377

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+SHP++  C      + H+VL+VGYG          +P+W IKNSWG
Sbjct: 378 AINAFGMQFYRRGISHPLRPLCSPW--LIDHAVLLVGYG------NRSGIPFWAIKNSWG 429

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGE+GY+ LYRG G+CG+N    SA+V
Sbjct: 430 TDWGEEGYYYLYRGSGACGVNAMASSAVV 458


>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
 gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
 gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
          Length = 366

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 181/314 (57%), Gaps = 18/314 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F +F+ ++ K Y+   E+  R  +F  NL +    Q  +  +  +G+ +FSDL+  EF+ 
Sbjct: 57  FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRAS-HGVTKFSDLTQEEFRH 115

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           +YLG +  P       P ++P   LP  FDWRE  AVT VK+Q  CGS WAFSTTG +EG
Sbjct: 116 QYLGLRAPPLRDAHDAP-ILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEG 174

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
               KT +LVSLSEQ+L+DCD E         D GC GG +++A+   +    GGLE+E+
Sbjct: 175 ANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKS--GGLEKEE 232

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
            YPY G D  C  NK      ++ +  VS DE  +A  LV+NGP++V INA  +Q YV G
Sbjct: 233 DYPYTGKDGTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGG 292

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           VS P  + C     NL H VL+VGYG         K  PYW+IKNSWG  WGE GY++L 
Sbjct: 293 VSCP--YVC--SKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLC 348

Query: 332 RGDGSCGINDYVRS 345
           RG   CGIN+ V +
Sbjct: 349 RGHNVCGINNMVST 362


>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
 gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
 gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
 gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
 gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
          Length = 366

 Score =  247 bits (630), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 135/314 (42%), Positives = 181/314 (57%), Gaps = 18/314 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F +F+ ++ K Y+   E+  R  +F  NL +    Q  +  +  +G+ +FSDL+  EF+ 
Sbjct: 57  FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRAS-HGVTKFSDLTQEEFRH 115

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           +YLG +  P       P ++P   LP  FDWRE  AVT VK+Q  CGS WAFSTTG +EG
Sbjct: 116 QYLGLRAPPLRDAHDAP-ILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEG 174

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
               KT +LVSLSEQ+L+DCD E         D GC GG +++A+   +    GGLE+E+
Sbjct: 175 ANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKS--GGLEKEE 232

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
            YPY G D  C  NK      ++ +  VS DE  +A  LV+NGP++V INA  +Q YV G
Sbjct: 233 DYPYTGKDGTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGG 292

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           VS P  + C     NL H VL+VGYG         K  PYW+IKNSWG  WGE GY++L 
Sbjct: 293 VSCP--YVC--SKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLC 348

Query: 332 RGDGSCGINDYVRS 345
           RG   CGIN+ V +
Sbjct: 349 RGHNVCGINNMVST 362


>gi|171854651|dbj|BAG16515.1| putative cysteine proteinase [Capsicum chinense]
          Length = 367

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 142/334 (42%), Positives = 193/334 (57%), Gaps = 29/334 (8%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGS 83
           D+  +HL + +H   F+ F  +  K YAT  E+  RL +F  NLR+    QLL  T    
Sbjct: 37  DDNNNHLLNAEHH--FSLFKSKFGKIYATQEEHDHRLKVFKANLRRARRHQLLDPTAE-- 92

Query: 84  GVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
             +G+ +FSDL+ +EF+  YLG  K KP  +    P ++P   LP  FDWRE  AVTGVK
Sbjct: 93  --HGITKFSDLTPSEFRRTYLGLHKPKPKLSTTKAP-ILPTSDLPEDFDWREKGAVTGVK 149

Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
           +Q  CGS W+FSTTG +EG +   T +LVSLSEQ+L+DCD E         D GC GG +
Sbjct: 150 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDAEQKSECDAGCGGGLM 209

Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
           + AF+  +    GGL+ EK YPY G +  C  +K      +  Y  V  DE  +A  LV+
Sbjct: 210 TTAFEYTLK--AGGLQREKDYPYTGRNGQCHFDKSKIAASVTNYSVVGLDEDQIAANLVK 267

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYW 312
           +GP+AV IN+  +Q Y+ GVS P+  F     ++  H VL+VGYG         KA PYW
Sbjct: 268 HGPLAVGINSAWMQTYIGGVSCPLVCF-----KHQDHGVLLVGYGSAGFAPIRLKAKPYW 322

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGS-CGINDYVRS 345
           IIKNSWGE WGE GY+++ RG  + CG++  V +
Sbjct: 323 IIKNSWGEHWGEHGYYKICRGQHNICGVDAMVST 356


>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
          Length = 325

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 133/310 (42%), Positives = 179/310 (57%), Gaps = 18/310 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           L+  F   + K YA   +   R  IF  NL + Q  Q  E G+  YG+ +FSDL+  EF 
Sbjct: 31  LYEQFKRDYGKAYAN-EDDQKRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTPEEFA 89

Query: 101 AKYLGFKLKPSYADRSVPAMIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           A YLG ++     D  V  +  N   T P + DWR+  AV  V+DQ  CGS WAFS T N
Sbjct: 90  AMYLGSRI-----DERVDRVQLNDLQTAPASVDWRKKGAVGPVEDQGSCGSCWAFSVTAN 144

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG +  KT +LVSLS+Q+L+DCD+ D GC GG     +  I  K  GGLE +  YPY  
Sbjct: 145 VEGQWFLKTGRLVSLSKQQLVDCDRLDHGCSGGYPPYTYKEI--KRMGGLELQSAYPYTS 202

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
             +ACR+++     KI+  + +  DE   A +L E+GPM+  +NA  LQFY +G+ HP +
Sbjct: 203 WKQACRIDRSKLVAKIDDSIVLETDEEKQAAWLAEHGPMSTCLNAGPLQFYQSGILHPSK 262

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
             C    E L+H+VL VGY       T   VPYW ++NSWG  WGE GYFR+YRGDG+CG
Sbjct: 263 AMC--SPEGLNHAVLTVGYD------TEHGVPYWTVRNSWGTRWGENGYFRIYRGDGTCG 314

Query: 339 INDYVRSALV 348
           I+    SA++
Sbjct: 315 IDRLTTSAII 324


>gi|91992514|gb|ABE72973.1| cathepsin L [Aedes aegypti]
          Length = 265

 Score =  247 bits (630), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 123/269 (45%), Positives = 173/269 (64%), Gaps = 12/269 (4%)

Query: 86  YGLNEFSDLSTAEFQAKYLGFKLKPSYADRS-----VPAMIPNITLPRAFDWREYDAVTG 140
           YG+  F+D+++AE++ +  G  + P   DR+        +  N+ LP +FDWRE  AV+ 
Sbjct: 2   YGITHFADMTSAEYRQR-TGLVI-PRDEDRNHVGNPKAEIDENMELPESFDWRELGAVSP 59

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTI 200
           VK+Q  CGS WAFS  GNIEG++  KTK L   SEQEL+DCD  D  C+GG + +A+  I
Sbjct: 60  VKNQGNCGSCWAFSVVGNIEGLHQIKTKVLEEYSEQELLDCDAVDSACQGGYMDDAYKAI 119

Query: 201 MSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
             K+ GGLE E  YPY     K C  N     V++ G V + ++ET MA+YLV NGP+++
Sbjct: 120 -EKI-GGLELESEYPYLAKKQKTCHFNSTEVHVRVKGAVDLPKNETAMAQYLVANGPISI 177

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
            +NA A+QFY  G+SHP +  C    +NL H VLIVGYGV      +K +PYWI+KNSWG
Sbjct: 178 GLNANAMQFYRGGISHPWKPLCS--KKNLDHGVLIVGYGVKEYPMFNKTMPYWIVKNSWG 235

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGE+GY+R++RGD +CG+++   SA++
Sbjct: 236 PKWGEQGYYRIFRGDNTCGVSEMASSAVL 264


>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
 gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 138/332 (41%), Positives = 188/332 (56%), Gaps = 25/332 (7%)

Query: 25  VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
           V D  L   HH      F  F  +  K YAT  E+  R  +F  NLR+ Q  Q  +  S 
Sbjct: 40  VEDYLLSAQHH------FTAFKAKFGKNYATQEEHDYRFKVFKANLRRAQKHQLMDP-SA 92

Query: 85  VYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           V+G+ +FSDL+  EF+ +YLG K     AD     ++P   +P  FDWR++ AVT VK+Q
Sbjct: 93  VHGVTKFSDLTPREFRRQYLGLKKLRLPADAHEAPILPTDGIPEDFDWRDHGAVTNVKNQ 152

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISN 195
             CGS W+FS  G +EG +   T +LVSLSEQ+L+DCD E         D GC GG ++N
Sbjct: 153 GSCGSCWSFSAAGALEGAHFLATGELVSLSEQQLVDCDHECDPTEYGACDSGCNGGLMTN 212

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVEN 254
           AF+ I+    GGLE E+ YPY G D+  C+  +      +N +  VS DE  +A  LV+N
Sbjct: 213 AFEYILK--AGGLEREEDYPYTGSDRGPCKFERAKIAASVNNFSVVSVDEDQIAANLVQN 270

Query: 255 GPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWI 313
           GP+AV INA  +Q Y+ GVS P  + C   ++   H V++VGYG         K  P+WI
Sbjct: 271 GPLAVGINAVFMQTYIGGVSCP--YIC---SKRQDHGVVLVGYGSAGYAPVRLKDKPFWI 325

Query: 314 IKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           IKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 326 IKNSWGENWGENGYYKICRGRNVCGVDAMVST 357


>gi|355681647|gb|AER96812.1| cathepsin F [Mustela putorius furo]
          Length = 408

 Score =  247 bits (630), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 138/329 (41%), Positives = 192/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++  E L     VK  ++F  F+  +N+TY +  E   R+ +FS N+ + Q +Q  
Sbjct: 90  SVLPLLNKEPLPQDFSVKMASIFKEFVTTYNRTYESKEETQWRMSVFSNNMMRAQKIQAL 149

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+               + P  +DWR   AVT
Sbjct: 150 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREYRGKNMRLDKSTGDSAPSEWDWRRKGAVT 209

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VK+Q MCGS WAFS TGN+EG +  K   L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 210 KVKNQGMCGSCWAFSVTGNVEGQWFLKQGALLSLSEQELLDCDKVDKACLGGLPSNAYSA 269

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y YRG  + C  + K  +V IN  V +S++E  +A +L E GP++V
Sbjct: 270 I--KTLGGLETEDDYSYRGRMQTCGFSPKKARVYINDSVELSQNEETLAAWLAEKGPISV 327

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+SHP++  C      + H+VL+VGYG           P+W IKNSWG
Sbjct: 328 AINAFGMQFYRHGISHPLRPLCS--PWLIDHAVLLVGYG------NRSGTPFWAIKNSWG 379

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGE+GY+ L+RG G+CG+N    SA+V
Sbjct: 380 SDWGEEGYYYLHRGSGACGVNTMASSAVV 408


>gi|13491752|gb|AAK27969.1|AF242373_1 cysteine protease [Ipomoea batatas]
          Length = 366

 Score =  246 bits (629), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 141/336 (41%), Positives = 196/336 (58%), Gaps = 28/336 (8%)

Query: 24  VVGD--EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
           VVGD  + L+  HH      F  F  +  K YA+  E+  RL  F  N+R+ +  Q+ + 
Sbjct: 35  VVGDGGDLLNADHH------FTVFKRRFGKVYASDEEHDYRLSEFKANMRRAKQHQELDP 88

Query: 82  GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVTG 140
            + V+G+ +FSDL+  EF+ K+LG   +  + AD     ++P   LP  FDWR++ AVT 
Sbjct: 89  AA-VHGVTQFSDLTPTEFRRKFLGLNRRLKFPADAKTAPILPTDELPSDFDWRDHGAVTP 147

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
           VK+Q  CGS  +FSTTG +EG     T KLVSLSEQ+L+DCD E         D GC GG
Sbjct: 148 VKNQGTCGSCCSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGG 207

Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAKY 250
            +++AF+  +    GGL  E+ +PY G+D + CR +K     K+  +  VS DE  +A  
Sbjct: 208 LMNSAFEYTLK--AGGLMREEDHPYTGNDLQVCRFDKTKIAAKVANFSVVSLDEDQIAAN 265

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAV 309
           LV+NGP+AVAINA  +Q Y+ GVS P  + C   ++ L H VL+VGYG         K  
Sbjct: 266 LVKNGPLAVAINAVFMQTYIGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKEK 320

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 321 PYWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 356


>gi|33945878|emb|CAE45589.1| papain-like cysteine proteinase-like protein 2 [Lotus japonicus]
          Length = 361

 Score =  246 bits (629), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 142/337 (42%), Positives = 194/337 (57%), Gaps = 32/337 (9%)

Query: 24  VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTE 80
           VV DE L   HH      F  F  +  K YAT  E+  R ++F  N+   R+ QLL    
Sbjct: 33  VVDDEGLGAEHH------FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-- 84

Query: 81  HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
             S V+G+  FSDL+  EF+   LG +     +D     ++P   LP+ FDWRE+ AVT 
Sbjct: 85  --SAVHGVTRFSDLTPMEFRHSVLGLRGVGLPSDADSAPILPTDNLPKDFDWREHGAVTP 142

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEG 190
           VK+Q  CGS W+FS TG +EG +   T KLVSLSEQ+L+DCD E          D GC+G
Sbjct: 143 VKNQGSCGSCWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHEQCDPEEAGSCDSGCKG 202

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGD-DKACRLNKKATQVKINGYVSVSRDETDMAK 249
           G +++AF+ I++   GG+  E+ YPY G     C+ ++      +  +  VSRDE  +A 
Sbjct: 203 GLMNSAFEYILNN--GGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQIAA 260

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKA 308
            LV+NGP+AVAINA  +Q YV GVS P  + C   ++ L+H VL+VGYG +       K 
Sbjct: 261 NLVKNGPLAVAINAVYMQTYVGGVSCP--YVC---SKKLNHGVLLVGYGSESYAPIRMKQ 315

Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 316 KPYWIIKNSWGENWGENGYYKICRGRNVCGVDSMVST 352


>gi|290980288|ref|XP_002672864.1| predicted protein [Naegleria gruberi]
 gi|284086444|gb|EFC40120.1| predicted protein [Naegleria gruberi]
          Length = 356

 Score =  246 bits (629), Expect = 8e-63,   Method: Compositional matrix adjust.
 Identities = 146/361 (40%), Positives = 199/361 (55%), Gaps = 32/361 (8%)

Query: 12  LLSLTVSVSSFMVVGDE---KLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
           L+ + + V+SF++  +      + L   +   LF  F  +H K Y T      R  IF  
Sbjct: 4   LILVVLLVASFILAIEAAKGPFNALPESEMQQLFTQFRRKHVKLYGTKQVQDRRYQIFKQ 63

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYAD------RSVPA--- 119
           N+ + +          + G+  FSDL+  EF++ +L     P  A       R  PA   
Sbjct: 64  NVERARFENYLTERDNM-GVTRFSDLTPDEFKSMFLMKSYTPKQARELLSGMRQYPANAK 122

Query: 120 --MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
             M      P+ FDWRE++AVT VKDQ  CGS W FSTTGN+EG+YAAKT KL+SLSEQ+
Sbjct: 123 LTMKQVSDAPKEFDWREHNAVTPVKDQGNCGSCWTFSTTGNVEGMYAAKTGKLISLSEQQ 182

Query: 178 LIDCDQE----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK 227
           L+DCD            + GC GG + ++F+ I+    GGL  E++YPY   D  CR N 
Sbjct: 183 LVDCDHNCVVWEGEKTCNAGCNGGLMWSSFEHIIKT--GGLVTEESYPYEAVDNRCRFNV 240

Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN 287
               VKI+ +  VS +E +MA +L  NGP+A+AINA  LQ+Y  G+ +P +  CD   E 
Sbjct: 241 SNAVVKISNWTFVSSNEDEMAAWLANNGPIAIAINADYLQYYRKGILNPSR--CDP--EE 296

Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSAL 347
           L+H VLIVGYG ++     K   YWI+KNSW   WGEKGY R+ RG G CG+N    SAL
Sbjct: 297 LNHGVLIVGYGEEKAA-NGKVEKYWIVKNSWSASWGEKGYVRVLRGKGVCGLNAVPSSAL 355

Query: 348 V 348
           +
Sbjct: 356 I 356


>gi|255543801|ref|XP_002512963.1| cysteine protease, putative [Ricinus communis]
 gi|223547974|gb|EEF49466.1| cysteine protease, putative [Ricinus communis]
          Length = 373

 Score =  246 bits (629), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 134/305 (43%), Positives = 182/305 (59%), Gaps = 19/305 (6%)

Query: 52  TYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPS 111
           TYA+  E+  R  IF  NLR+ +  Q  +  +  +G+ +FSDL+ +EF+ ++LG +    
Sbjct: 68  TYASQEEHDYRFKIFKSNLRRAERHQKLDP-TATHGVTQFSDLTHSEFRRQFLGLRRLRL 126

Query: 112 YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
             D +   M+P   LP  FDWRE  AVT VK+Q  CGS W+FSTTG +EG     T KLV
Sbjct: 127 PKDANEAPMLPTNDLPADFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGKLV 186

Query: 172 SLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK- 221
           SLSEQ+L+DCD E         D GC GG +++AF+  +    GGL  E+ YPY G D+ 
Sbjct: 187 SLSEQQLVDCDHECDPAEEGACDSGCNGGLMNSAFEYTLK--AGGLMREEDYPYTGTDRG 244

Query: 222 ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFC 281
           AC+ +K     K+  +  VS DE  +A  LV+NGP+AVAINA  +Q Y+ GVS P  + C
Sbjct: 245 ACQFDKTKIAAKVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIGGVSCP--YIC 302

Query: 282 DGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
              ++ L H VL+VGYG         K  PYWIIKNSWGE WGE GY+++ RG   CG++
Sbjct: 303 ---SKRLDHGVLLVGYGSAGYAPIRMKEKPYWIIKNSWGENWGESGYYKICRGRNICGVD 359

Query: 341 DYVRS 345
             V +
Sbjct: 360 SMVST 364


>gi|317106675|dbj|BAJ53178.1| JHL18I08.12 [Jatropha curcas]
          Length = 368

 Score =  246 bits (629), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 144/364 (39%), Positives = 202/364 (55%), Gaps = 31/364 (8%)

Query: 3   CFYFFAGVALLSLTVSVSS----------FMVVGDEKLHHLHHVKHTALFNYFLEQHNKT 52
           C   F   ALLS T++ ++            VV D    HL + +H   F  F  +  KT
Sbjct: 5   CLISFLVYALLSFTIASTTSPDELDDPLIRQVVPDGDQDHLLNAEHH--FTTFKAKFGKT 62

Query: 53  YATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY 112
           YAT  E+  R  +F  NLR+ +  Q  +  + V+G+  FSDL+  EF+ +YLG +     
Sbjct: 63  YATQEEHDYRFKLFKANLRRARKHQMMD-PTAVHGVTMFSDLTPREFRRQYLGLRRLRLP 121

Query: 113 ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
           AD     ++P   LP  FDWR++ AVT VK+Q  CGS W+FS  G +EG +   T +LVS
Sbjct: 122 ADAHEAPILPTNDLPTDFDWRDHGAVTNVKNQGSCGSCWSFSAAGALEGAHFLATGELVS 181

Query: 173 LSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK-A 222
           LSEQ+L+DCD E         D GC GG ++ AF+  +    GGLE E+ YPY G+D+  
Sbjct: 182 LSEQQLVDCDHECDPEEYGACDSGCNGGLMTTAFEYTLK--AGGLEREEDYPYTGNDRGP 239

Query: 223 CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCD 282
           C+ ++      ++ +  VS DE  +A  LV++GP+AV INA  +Q Y+ GVS P  + C 
Sbjct: 240 CKFDRNKIVASVSNFSVVSIDEDQIAANLVKHGPLAVGINAVFMQTYMGGVSCP--YIC- 296

Query: 283 GGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIND 341
             ++   H VL+VGYG         K  P+WIIKNSWGE WGE GY+R+ RG   CG++ 
Sbjct: 297 --SKRQDHGVLLVGYGSAGYAPIRLKDKPFWIIKNSWGESWGENGYYRICRGRNICGVDA 354

Query: 342 YVRS 345
            V S
Sbjct: 355 MVSS 358


>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
          Length = 363

 Score =  246 bits (629), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 143/348 (41%), Positives = 196/348 (56%), Gaps = 25/348 (7%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           L+   V      +  D  L   HH      F  F  +  +TY T  E+  RL +F  NLR
Sbjct: 26  LIRQVVQNDETEIESDPLLDPEHH------FKLFKNKFGRTYDTEEEHEYRLTVFKSNLR 79

Query: 72  KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAF 130
           + +  Q  +  +  +G+ +FSDL+ +EF+ KYLG K K    AD +   ++P   LP+ F
Sbjct: 80  RAKRHQVLDP-TAKHGVTKFSDLTPSEFRKKYLGLKSKLKLPADANKAPILPTSNLPQDF 138

Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE------ 184
           DWR+  AVT VK+Q  CGS W+FSTTG +EG +  +T +LVSLSEQ+L+DCD E      
Sbjct: 139 DWRDKGAVTPVKNQGSCGSCWSFSTTGALEGSHFLQTGELVSLSEQQLVDCDHECDPAEY 198

Query: 185 ---DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
              D GC GG ++NAF+ I+    GGL++E  YPY G D  C+ +K      +  +  VS
Sbjct: 199 NSCDSGCNGGLMNNAFEYILK--AGGLQKEADYPYTGRDGTCKFDKSKIAASVANFSVVS 256

Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VD 300
            DE  +A  LV NGP+A+ INA  +Q Y+  VS P  + C      + H VL+VGYG   
Sbjct: 257 TDEDQIAANLVTNGPLAIGINAAWMQTYIGQVSCP--YIC--SKTKMDHGVLLVGYGSAG 312

Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                 K  PYWIIKNSWGE WGE GY++L  G  +CG++  V SA+V
Sbjct: 313 YAPLRFKEKPYWIIKNSWGEDWGEDGYYKLCSGYNACGMDTMV-SAVV 359


>gi|164605518|dbj|BAF98584.1| CM0216.500.nc [Lotus japonicus]
          Length = 360

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 141/336 (41%), Positives = 194/336 (57%), Gaps = 31/336 (9%)

Query: 24  VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTE 80
           VV DE L   HH      F  F  +  K YAT  E+  R ++F  N+   R+ QLL    
Sbjct: 33  VVDDEGLGAEHH------FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-- 84

Query: 81  HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
             S V+G+  FSDL+  EF+   LG +     +D     ++P   LP+ FDWRE+ AVT 
Sbjct: 85  --SAVHGVTRFSDLTPMEFRHSVLGLRGVGLPSDADSAPILPTDNLPKDFDWREHGAVTP 142

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
           VK+Q  CGS W+FS TG +EG +   T +LVSLSEQ+L+DCD +         D GC GG
Sbjct: 143 VKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCDSGCNGG 202

Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKY 250
            +++AF+ I++   GG+  E+ YPY G +   C+ +K      +  +  VSRDE  +A  
Sbjct: 203 LMNSAFEYILNN--GGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIAAN 260

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
           LV+NGP+AVAINA  +Q YV GVS P  + C   ++ L+H VL+VGYG +       K  
Sbjct: 261 LVKNGPLAVAINAVYMQTYVGGVSCP--YVC---SKKLNHGVLLVGYGSESYAPIRMKQK 315

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 316 PYWIIKNSWGENWGENGYYKICRGRNICGVDSMVST 351


>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
          Length = 362

 Score =  246 bits (628), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 139/331 (41%), Positives = 197/331 (59%), Gaps = 27/331 (8%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
           D+ L+  HH      F  F  + +K+YAT  E+  R  +F  NL+K +L Q  +  S  +
Sbjct: 38  DQLLNAEHH------FTTFKSKFSKSYATKEEHDYRFGVFKSNLKKAKLHQKLD-PSAEH 90

Query: 87  GLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
           G+ +FSDL+ +EF+ ++LG K +   P++A ++   ++P   LP  FDWRE  AVT VKD
Sbjct: 91  GVTKFSDLTASEFRRQFLGLKKRLRLPAHAQKA--PILPTNNLPEDFDWREKGAVTPVKD 148

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
           Q  CGS WAFSTTG +EG     T KLVSLSEQ+L+DCD           D GC GG ++
Sbjct: 149 QGSCGSCWAFSTTGALEGANYLATGKLVSLSEQQLVDCDHVCDPDEYNSCDSGCNGGLMN 208

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVEN 254
           NAF+ ++    GG+  E+ Y Y G D +C+ +K      ++ +  VS DE  +A  LV+N
Sbjct: 209 NAFEYLLQ--SGGVVREQDYSYTGRDGSCKFDKSKIAASVSNFSVVSVDEDQIAANLVKN 266

Query: 255 GPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWII 314
           GP+AVAINA  +Q Y++GVS P  + C      L H VL+VG+G        K  PYWII
Sbjct: 267 GPLAVAINAAWMQTYMSGVSCP--YIC--AKSRLDHGVLLVGFGNGFAPIRLKEKPYWII 322

Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           KNSWG+ WGE+GY+++ RG   CG++  V +
Sbjct: 323 KNSWGQNWGEEGYYKICRGRNICGVDSMVST 353


>gi|1619903|gb|AAB16996.1| thiol protease isoform B, partial [Glycine max]
          Length = 319

 Score =  246 bits (627), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 135/307 (43%), Positives = 183/307 (59%), Gaps = 23/307 (7%)

Query: 51  KTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK- 109
           + YAT  E+  R  +F  NLR+      +     V+G+ +FSDL+ AEF+ ++LG K   
Sbjct: 15  RPYATKEEHDHRFGVFKSNLRRASCTPSST--PRVHGVTKFSDLTPAEFRRQFLGLKAVR 72

Query: 110 -PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTK 168
            P++A ++   ++P   LP+ FDWR+  AVT VKDQ  CGS W+FSTTG +EG Y   T 
Sbjct: 73  FPAHAQKA--PILPTKDLPKDFDWRDKGAVTNVKDQGGCGSCWSFSTTGALEGAYYLATG 130

Query: 169 KLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           +LVSLSEQ+L+DCD           D GC GG ++NAF+ I+    GG+++EK YPY G 
Sbjct: 131 ELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQS--GGVQKEKDYPYTGR 188

Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
           D  C+ +K      ++ Y  V  DE  +A  LV+NGP+AVAINA  +Q YV GVS P  +
Sbjct: 189 DGTCKFDKTKVAATVSNYSVVCLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCP--Y 246

Query: 280 FCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
            C    ++L H VL+VGYG         K  PYWIIKNSWGE WGE GY  + RG   CG
Sbjct: 247 IC---GKHLDHGVLLVGYGEGAYAPIRFKNKPYWIIKNSWGESWGENGYDEICRGRNVCG 303

Query: 339 INDYVRS 345
           ++  V +
Sbjct: 304 VDSMVST 310


>gi|357473427|ref|XP_003606998.1| Cysteine proteinase [Medicago truncatula]
 gi|355508053|gb|AES89195.1| Cysteine proteinase [Medicago truncatula]
          Length = 362

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 134/315 (42%), Positives = 186/315 (59%), Gaps = 19/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           FN F  +  K Y++  E+  R  IF  NL + +  Q  +  S V+G+  FSDL+  EF+ 
Sbjct: 48  FNLFKHKFGKVYSSKDEHDYRFKIFKSNLNRAKRHQLMDP-SAVHGVTRFSDLTPREFRK 106

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
             LG +      D +   ++P   LP+ FDWRE  AVT VK+Q  CGS W+FSTTG +EG
Sbjct: 107 SVLGLRGVGLPKDANAAPILPTDNLPKDFDWREKGAVTAVKNQGSCGSCWSFSTTGALEG 166

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +   T KLVSLSEQ+L+DCD E         D GC GG +++AF+ I+    GG+  E+
Sbjct: 167 AHFLSTGKLVSLSEQQLVDCDHECDPEQPGSCDAGCNGGLMNSAFEYILKS--GGVMREE 224

Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D+ +C+ +KK     +  +  VS DE  +A  LV+NGP+A+A+NA  +Q YV 
Sbjct: 225 DYPYSGTDRGSCKFDKKKIAASVANFSVVSLDEDQIAANLVKNGPLAIALNAVYMQTYVG 284

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P  + C   ++ L H VL+VGYG    +    K  PYWIIKNSWGE WGE GY+++
Sbjct: 285 GVSCP--YIC---SKRLDHGVLLVGYGSGAYSPIRLKEKPYWIIKNSWGETWGENGYYKI 339

Query: 331 YRGDGSCGINDYVRS 345
            RG   CG++  V +
Sbjct: 340 CRGRNICGVDSMVST 354


>gi|146215998|gb|ABQ10201.1| cysteine protease Cp3 [Actinidia deliciosa]
          Length = 365

 Score =  245 bits (626), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 138/334 (41%), Positives = 191/334 (57%), Gaps = 25/334 (7%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
           +V GD  L   HH      F  F  +  K+YAT  ++  R  +F  NLR+ +  Q  +  
Sbjct: 37  IVDGDHPLSADHH------FRLFKRRFGKSYATQEDHDYRFSVFKTNLRRARHHQRLDP- 89

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
           S V+G+ +FSDL+ AEF+  +LG K     AD +   ++P   LP  FDWR++ AV  VK
Sbjct: 90  SAVHGVTQFSDLTPAEFRRNHLGLKRLRFPADANKAPILPTEDLPADFDWRDHGAVASVK 149

Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
           +Q  CGS W+FSTTG +EG     T KLVSLSEQ+L+DCD E         D GC GG +
Sbjct: 150 NQGSCGSCWSFSTTGALEGANFLATGKLVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLM 209

Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLV 252
           ++A +  +    GGL  E+ YPY G D+  C+ ++      +  +  VS DE  +A  LV
Sbjct: 210 NSALEYTLK--AGGLMREEDYPYSGTDRGTCKFDETKIAASVANFSVVSLDENQIAANLV 267

Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPY 311
           +NGP+AVAINA  +Q YV GVS P  + C   ++ L H VL+VGYG         K  PY
Sbjct: 268 KNGPLAVAINAVFMQTYVGGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKEKPY 322

Query: 312 WIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           WIIKNSWGE WGE G++++ +G   CG++  V +
Sbjct: 323 WIIKNSWGESWGENGFYKICQGRNVCGVDSMVST 356


>gi|225427714|ref|XP_002264345.1| PREDICTED: cysteine proteinase RD19a [Vitis vinifera]
          Length = 377

 Score =  245 bits (625), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 133/316 (42%), Positives = 189/316 (59%), Gaps = 21/316 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ F  +  K+YA+  E+  R  +F  NLR+ +  Q  +  S  +G+ +FSDL+ AEF+ 
Sbjct: 62  FSIFKRRFGKSYASQEEHDYRFKVFKANLRRARRHQQLDP-SATHGVTQFSDLTPAEFRG 120

Query: 102 KYLGFK-LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            YLG + LK  +  +  P ++P   LP  FDWR++ AVT VK+Q  CGS W+FSTTG +E
Sbjct: 121 TYLGLRPLKLPHDAQKAP-ILPTNDLPEDFDWRDHGAVTAVKNQGSCGSCWSFSTTGALE 179

Query: 161 GVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           G     T  LVSLSEQ+L++CD E         D GC GG ++ AF+  +    GGL +E
Sbjct: 180 GANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLMNTAFEYTLK--AGGLMKE 237

Query: 212 KTYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           + YPY G D+ +C+ +K      ++ +  +S DE  +A  LV+NGP+AVAINA  +Q YV
Sbjct: 238 EDYPYTGTDRGSCKFDKTKIAASVSNFSVISLDEDQIAANLVKNGPLAVAINAVFMQTYV 297

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            GVS P  + C   ++ L H VL+VGYG         K  PYWIIKNSWGE WGE G+++
Sbjct: 298 GGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGENWGENGFYK 352

Query: 330 LYRGDGSCGINDYVRS 345
           + RG   CG++  V +
Sbjct: 353 ICRGRNVCGVDSMVST 368


>gi|4678299|emb|CAB41090.1| cysteine proteinase precursor-like protein [Arabidopsis thaliana]
          Length = 363

 Score =  245 bits (625), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 135/315 (42%), Positives = 186/315 (59%), Gaps = 20/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+  + K Y+T  EY  RL IF+ N+ K    Q  +  S V+G+ +FSDL+  EF+ 
Sbjct: 51  FRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDP-SAVHGVTQFSDLTEEEFKR 109

Query: 102 KYLGFKLKPSYADRSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            Y G          +V A  P +    LP  FDWRE   VT VK+Q  CGS WAFSTTG 
Sbjct: 110 MYTGVADVGGSRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGA 169

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE-----DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
            EG +   T KL+SLSEQ+L+DCDQ      D+GC GG ++NA++ +M    GGLEEE++
Sbjct: 170 AEGAHFVSTGKLLSLSEQQLVDCDQADKKACDNGCGGGLMTNAYEYLME--AGGLEEERS 227

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
           YPY G    C+ + +   V++  + ++  DE  +A  LV +GP+AV +NA  +Q Y+ GV
Sbjct: 228 YPYTGKRGHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTYIGGV 287

Query: 274 SHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           S P+   C     N++H VL+VGY   G    + ++K  PYWIIKNSWG+ WGE GY++L
Sbjct: 288 SCPL--ICS--KRNVNHGVLLVGYGSKGFSILRLSNK--PYWIIKNSWGKKWGENGYYKL 341

Query: 331 YRGDGSCGINDYVRS 345
            RG   CGIN  V +
Sbjct: 342 CRGHDICGINSMVSA 356


>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
          Length = 484

 Score =  244 bits (624), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 137/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 165 SVLSLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 225 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 284

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 285 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 344

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y YRG  +AC  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 345 I--KNLGGLETEDDYSYRGHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISV 402

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          +P+W IKNSWG
Sbjct: 403 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDIPFWAIKNSWG 454

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 455 TDWGEKGYYYLHRGSGACGVNTMASSAVV 483


>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
          Length = 460

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 137/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 141 SVLSLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 200

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 201 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 260

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 261 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 320

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y YRG  +AC  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 321 I--KNLGGLETEDDYSYRGHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 378

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          +P+W IKNSWG
Sbjct: 379 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDIPFWAIKNSWG 430

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 431 TDWGEKGYYYLHRGSGACGVNTMASSAVV 459


>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
          Length = 322

 Score =  244 bits (623), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 134/308 (43%), Positives = 180/308 (58%), Gaps = 13/308 (4%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           L+  F   + K YA   +   R  IF  NL + Q LQ  + G+  YG+ +FSDL+  EF 
Sbjct: 26  LYEQFKRDYGKVYAN-EDDQKRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPEEFA 84

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           AKYL   +     +R  P  +     P   DWRE  AVT V++Q  CGS WAFS  GN+E
Sbjct: 85  AKYLRAAVNNDQVERVRPTGLK--AAPERMDWREKGAVTAVENQGSCGSCWAFSAAGNVE 142

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G +  KT +LVSLS+Q+L+DCD+  +GC GG   +++  I  K  GGLE E  YPY G +
Sbjct: 143 GQWFIKTGQLVSLSKQQLVDCDRVAEGCNGGWPVSSYLEI--KHMGGLESESDYPYVGAE 200

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           + C LNK+    KI+  + +   E + A YL E+GP++  +NA ALQ Y +GV +P    
Sbjct: 201 QTCALNKEKLLAKIDDLIVLGAYEEEHAAYLAEHGPLSTLLNAVALQHYQSGVLNPTYEE 260

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           C   +  L+H+VL VGY           +PYWIIKNSWG  WGEKGYFRL+RGD +CGIN
Sbjct: 261 CP--DTELNHAVLTVGYD------KEGDMPYWIIKNSWGTDWGEKGYFRLFRGDYTCGIN 312

Query: 341 DYVRSALV 348
               SA++
Sbjct: 313 RMATSAII 320


>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
          Length = 366

 Score =  244 bits (623), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 134/314 (42%), Positives = 180/314 (57%), Gaps = 18/314 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F +F+ ++ K Y+   E+  R  +F  NL +    Q  +  +  +G+ +FSDL+   F+ 
Sbjct: 57  FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDPRAS-HGVTKFSDLTQEGFRH 115

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           +YLG +  P       P ++P   LP  FDWRE  AVT VK+Q  CGS WAFSTTG +EG
Sbjct: 116 QYLGLRAPPLRDAHDAP-ILPTNDLPEDFDWREKGAVTEVKNQGSCGSCWAFSTTGALEG 174

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
               KT +LVSLSEQ+L+DCD E         D GC GG +++A+   +    GGLE+E+
Sbjct: 175 ANFLKTGELVSLSEQQLVDCDHECDPSDARSCDSGCNGGLMTSAYQYALKS--GGLEKEE 232

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
            YPY G D  C  NK      ++ +  VS DE  +A  LV+NGP++V INA  +Q YV G
Sbjct: 233 DYPYTGKDGTCSFNKNKIVAHVSNFSVVSIDEGQIAANLVKNGPLSVGINAAFMQTYVGG 292

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           VS P  + C     NL H VL+VGYG         K  PYW+IKNSWG  WGE GY++L 
Sbjct: 293 VSCP--YVC--SKRNLDHGVLLVGYGAAAFAPIRMKDKPYWVIKNSWGPNWGENGYYKLC 348

Query: 332 RGDGSCGINDYVRS 345
           RG   CGIN+ V +
Sbjct: 349 RGHNVCGINNMVST 362


>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
          Length = 381

 Score =  244 bits (622), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 137/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 62  SVLSLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 121

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 122 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 181

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 182 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 241

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y YRG  +AC  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 242 I--KNLGGLETEDDYSYRGHMQACNFSAEKAKVYINDSVELSQNEQKLAAWLAKKGPISV 299

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          +P+W IKNSWG
Sbjct: 300 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDIPFWAIKNSWG 351

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 352 TDWGEKGYYYLHRGSGACGVNTMASSAVV 380


>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
 gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
          Length = 364

 Score =  244 bits (622), Expect = 6e-62,   Method: Compositional matrix adjust.
 Identities = 138/334 (41%), Positives = 194/334 (58%), Gaps = 24/334 (7%)

Query: 25  VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
           V D    H+ + +H   F  F  + +K YAT  E+  R  +F  NL K +L Q  +  S 
Sbjct: 36  VVDTAEDHILNAEHH--FTSFKSKFSKNYATKEEHDYRFGVFKSNLIKAKLHQKLDP-SA 92

Query: 85  VYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGV 141
            +G+ +FSDL+ +EF+ ++LG   +   P++A ++   ++P   LP  FDWRE  AVT V
Sbjct: 93  QHGITKFSDLTASEFRRQFLGLNKRLRLPAHAQKA--PILPTNNLPEDFDWREKGAVTPV 150

Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGS 192
           KDQ  CGS WAFSTTG +EG     T KL SLSEQ+L+DCD           D GC GG 
Sbjct: 151 KDQGSCGSCWAFSTTGALEGANYLATGKLTSLSEQQLVDCDHVCDPEERGSCDSGCNGGL 210

Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
           ++NAF+ I+    GG+  EK Y Y G D +C+ +K      ++ +  VS DE  +A  LV
Sbjct: 211 MNNAFEYILQS--GGVVSEKDYAYTGRDGSCKFDKSKVVASVSNFSVVSLDEDQIAANLV 268

Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV-DRTKFTHKAVPY 311
           +NGP+AVAINA  +Q Y++GVS P  + C      L H VL++G+G         K  PY
Sbjct: 269 KNGPLAVAINAAWMQTYMSGVSCP--YIC--AKARLDHGVLLLGFGQGGYAPIRLKEKPY 324

Query: 312 WIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           WIIKNSWG+ WGE+GY+++ RG   CG++  V +
Sbjct: 325 WIIKNSWGQNWGEEGYYKICRGRNVCGVDSMVST 358


>gi|403293601|ref|XP_003937801.1| PREDICTED: cathepsin F [Saimiri boliviensis boliviensis]
          Length = 379

 Score =  243 bits (620), Expect = 9e-62,   Method: Compositional matrix adjust.
 Identities = 137/327 (41%), Positives = 194/327 (59%), Gaps = 10/327 (3%)

Query: 22  FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
           F ++ ++ L     VK  ++F  F+  +N+TY +  E   RL IF+ N+ + Q +Q  + 
Sbjct: 62  FSLLNEDPLPQDLTVKMASIFRNFVITYNRTYESKEEAQWRLSIFAHNMVRAQKIQALDR 121

Query: 82  GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGV 141
           G+  YG+ +FSDL+  EF+  YL   L+     +   A       P  +DWR   AVT V
Sbjct: 122 GTAQYGVTKFSDLTEEEFRTIYLNPLLREEPGKKMKQAKSVGDLAPPEWDWRSKGAVTKV 181

Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
           KDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  S+A+  I 
Sbjct: 182 KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKIDKACMGGLPSSAYSAI- 240

Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
            K  GGLE E  Y YRG  +AC  + +  +V IN  V +S++E  +A +L + GP++VAI
Sbjct: 241 -KNLGGLETEDDYSYRGHMQACSFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAI 299

Query: 262 NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
           NA+ +QFY  G+S P++  C      + H+VL+VGYG          +P+W IKNSWG  
Sbjct: 300 NAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDIPFWAIKNSWGTD 351

Query: 322 WGEKGYFRLYRGDGSCGINDYVRSALV 348
           WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 352 WGEKGYYYLHRGSGACGVNTMASSAVV 378


>gi|357162946|ref|XP_003579573.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
          Length = 376

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 145/367 (39%), Positives = 206/367 (56%), Gaps = 33/367 (8%)

Query: 1   MSCFYFFAGVALLSLTV--SVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVE 58
           ++     +GVA LS  V   +   +V GDEK  +   +   A F  F+++ NK+Y    E
Sbjct: 11  VAAVLLLSGVAALSSPVEDPLIEQVVGGDEK--NELELNAEAHFASFVQRFNKSYRDADE 68

Query: 59  YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSY----- 112
           +  RL +F+ NLR+ +  Q  +  S V+G+ +FSDL+  EF+ ++LG  K + S+     
Sbjct: 69  HAHRLSVFTANLRRARRHQRLDP-SAVHGVTKFSDLTPDEFRDRFLGLRKYRRSFLKGLS 127

Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
            +    PA+ P   LP  FDWRE+ AV  VKDQ  CGS W+FST+G +EG +   T KL 
Sbjct: 128 GSAHDAPAL-PTDGLPTEFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHYLATGKLE 186

Query: 172 SLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
            LSEQ+++DCD E         D GC GG ++ AF  +     GGLE EK YPY G   A
Sbjct: 187 VLSEQQMVDCDHECDPSEPRACDAGCNGGLMTTAFSYLAK--AGGLETEKDYPYTGRGGA 244

Query: 223 CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCD 282
           C+ +K     ++  + +V+ DE  +A  LV++GP+A+ INA  +Q Y+ GVS P  F C 
Sbjct: 245 CKFDKSKIAAQVKNFSTVAVDEDQIAANLVKHGPLAIGINAVFMQTYIGGVSCP--FIC- 301

Query: 283 GGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCG 338
               +L H VL+VGYG         K  PYWIIKNSWGE WGE GY+++ RG      CG
Sbjct: 302 --GRHLDHGVLLVGYGSAGYAPLRFKEKPYWIIKNSWGENWGESGYYKICRGAHVKNKCG 359

Query: 339 INDYVRS 345
           ++  V +
Sbjct: 360 VDSMVST 366


>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
          Length = 335

 Score =  243 bits (620), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 139/335 (41%), Positives = 202/335 (60%), Gaps = 26/335 (7%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VV D + H L+   H   F+ F  + +KTYAT  E+  R  +F  N+R+ +L    +  
Sbjct: 6   QVVDDNEDHVLNAEHH---FSTFKSKFSKTYATKEEHDYRFGVFKSNVRRAKLHAKLD-P 61

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFK-LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
           S V+G+ +FSDL+ +EF+ ++LG K L+ P +A ++   ++P   LP  FDWR+  AVT 
Sbjct: 62  SAVHGVTKFSDLTPSEFRRQFLGLKPLRLPEHAQKA--PILPTHDLPEDFDWRDKGAVTH 119

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
           VK+Q  CGS WAFSTTG +EG +   T +LVSLS+Q+L+DCD           D GC GG
Sbjct: 120 VKNQGSCGSCWAFSTTGALEGSHFLATGELVSLSDQQLVDCDHVCDPEQYGACDSGCNGG 179

Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYL 251
            ++NAF+ I+    GG++ E+ YPY G D+   ++ +A    ++ +  VS DE  ++  L
Sbjct: 180 LMNNAFEYILES--GGVQREEDYPYTGRDRGPAID-EANAASVSNFSVVSLDEDQISANL 236

Query: 252 VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVP 310
           V+NGP+A+ INA  +Q Y+ GVS P  + C    +NL H VL+VGYG         K  P
Sbjct: 237 VKNGPLAIGINAVFMQTYIGGVSCP--YIC---GKNLDHGVLLVGYGKAGYAPIRLKEKP 291

Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           YWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 292 YWIIKNSWGESWGENGYYKICRGRNVCGVDSMVST 326


>gi|240255643|ref|NP_567010.5| Papain family cysteine protease [Arabidopsis thaliana]
 gi|17979125|gb|AAL49820.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332645795|gb|AEE79316.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 367

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 135/319 (42%), Positives = 186/319 (58%), Gaps = 24/319 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+  + K Y+T  EY  RL IF+ N+ K    Q  +  S V+G+ +FSDL+  EF+ 
Sbjct: 51  FRLFMSDYGKNYSTREEYIHRLGIFAKNVLKAAEHQMMDP-SAVHGVTQFSDLTEEEFKR 109

Query: 102 KYLGFKLKPSYADRSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            Y G          +V A  P +    LP  FDWRE   VT VK+Q  CGS WAFSTTG 
Sbjct: 110 MYTGVADVGGSRGGTVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGA 169

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLE 209
            EG +   T KL+SLSEQ+L+DCDQ          D+GC GG ++NA++ +M    GGLE
Sbjct: 170 AEGAHFVSTGKLLSLSEQQLVDCDQACDPKDKKACDNGCGGGLMTNAYEYLME--AGGLE 227

Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           EE++YPY G    C+ + +   V++  + ++  DE  +A  LV +GP+AV +NA  +Q Y
Sbjct: 228 EERSYPYTGKRGHCKFDPEKVAVRVLNFTTIPLDENQIAANLVRHGPLAVGLNAVFMQTY 287

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           + GVS P+   C     N++H VL+VGY   G    + ++K  PYWIIKNSWG+ WGE G
Sbjct: 288 IGGVSCPL--ICS--KRNVNHGVLLVGYGSKGFSILRLSNK--PYWIIKNSWGKKWGENG 341

Query: 327 YFRLYRGDGSCGINDYVRS 345
           Y++L RG   CGIN  V +
Sbjct: 342 YYKLCRGHDICGINSMVSA 360


>gi|1353726|gb|AAB01769.1| cysteine proteinase homolog, partial [Naegleria fowleri]
          Length = 347

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 141/354 (39%), Positives = 200/354 (56%), Gaps = 26/354 (7%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           L++  + ++   VV  ++   L   +   LF  F  ++ K Y T  E+ +R  IF  N+ 
Sbjct: 3   LIAAVLLIACVGVVLAQEYKPLAESEMKKLFIKFSRKYAKVYGT-EEHNNRYQIFKANVE 61

Query: 72  KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI------- 124
           K +           +G+ +FSDL+  EF+  +L     P  A + + A    +       
Sbjct: 62  KSRYYNHVGKREN-FGITKFSDLTPEEFKRMFLMKTYTPEEAKKILAAPQHAVLSEKEVQ 120

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
           T P +FDWR++ AVT VK+Q  CGS W FSTTGN+EG +A K  KLVSLSEQ+L+DCD  
Sbjct: 121 TAPTSFDWRQHGAVTRVKNQGACGSCWTFSTTGNVEGQWAIKKGKLVSLSEQQLVDCDHN 180

Query: 185 ----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI 234
                     D GC GG + +AF  ++    GGL+ E +YPY G D  CR NK      I
Sbjct: 181 CVTYQNQQACDSGCNGGLMWSAFQYVIKN--GGLDTEDSYPYEGVDDTCRFNKSNVAATI 238

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
           + + S+S DE  MA +L  NGP+++AINA  LQ+Y +G+S P  +FC+   ++L H VLI
Sbjct: 239 SSWTSISSDENQMAAWLAANGPISIAINAEWLQYYTSGISDP--WFCN--PQDLDHGVLI 294

Query: 295 VGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           VGYGV ++    +   YWI+KNSWG  WGE GYFR+ RG G CG+N    S++V
Sbjct: 295 VGYGVGKSWLGSEE-NYWIVKNSWGSDWGEDGYFRIIRGKGKCGLNSVPSSSIV 347


>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
          Length = 490

 Score =  243 bits (619), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 137/329 (41%), Positives = 195/329 (59%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL IF  N+ + Q +Q  
Sbjct: 171 SVISLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSIFVNNMVRAQKIQAL 230

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+   +++   A       P  +DWR   AVT
Sbjct: 231 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREEPSNKMKQAKSVGDLAPPEWDWRSKGAVT 290

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 291 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 350

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 351 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 408

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG
Sbjct: 409 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 460

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 461 TDWGEKGYYYLHRGSGACGVNTMASSAVV 489


>gi|18399697|ref|NP_565512.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
 gi|12643282|sp|P43295.2|A494_ARATH RecName: Full=Probable cysteine proteinase A494; Flags: Precursor
 gi|4567274|gb|AAD23687.1| cysteine proteinase [Arabidopsis thaliana]
 gi|116325924|gb|ABJ98563.1| At2g21430 [Arabidopsis thaliana]
 gi|330252083|gb|AEC07177.1| putative cysteine proteinase A494 [Arabidopsis thaliana]
          Length = 361

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 145/359 (40%), Positives = 209/359 (58%), Gaps = 37/359 (10%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL--------FNYFLEQHNKTYATLVEYYS 61
           V+L+ + VSVS   V GDE +     V  T          F  F ++  K Y ++ E+Y 
Sbjct: 11  VSLIFVFVSVS---VCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHYY 67

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSV 117
           R  +F  NL +    Q  +  S  +G+ +FSDL+ +EF+ K+LG    FKL P  A+++ 
Sbjct: 68  RFSVFKANLLRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHLGVKGGFKL-PKDANQA- 124

Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
             ++P   LP  FDWR+  AVT VK+Q  CGS W+FSTTG +EG +   T KLVSLSEQ+
Sbjct: 125 -PILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQ 183

Query: 178 LIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNK 227
           L+DCD E         D GC GG +++AF+  +    GGL  EK YPY G D  +C+L++
Sbjct: 184 LVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKT--GGLMREKDYPYTGTDGGSCKLDR 241

Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN 287
                 ++ +  VS +E  +A  L++NGP+AVAINA  +Q Y+ GVS P  + C   +  
Sbjct: 242 SKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCP--YIC---SRR 296

Query: 288 LSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           L+H VL+VGYG    ++   K  PYWIIKNSWGE WGE G++++ +G   CG++  V +
Sbjct: 297 LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVST 355


>gi|224113123|ref|XP_002316398.1| predicted protein [Populus trichocarpa]
 gi|222865438|gb|EEF02569.1| predicted protein [Populus trichocarpa]
          Length = 327

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 136/324 (41%), Positives = 190/324 (58%), Gaps = 32/324 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+++HNK YAT  EY  R  IF  NL +    Q  +  + ++G+  F DL+  EF+ 
Sbjct: 14  FKMFIKEHNKEYATREEYVHRFGIFGKNLIRAVEHQALDP-TAIHGVTPFMDLTEEEFER 72

Query: 102 KYLGFKLKPSYADRSVPAMIPNIT------LPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            Y G          +VP    +++      LP +FDWRE  AVT VK Q  CGS WAFST
Sbjct: 73  MYAGV-----LGGGTVPVEKGSVSFMDASGLPDSFDWREKGAVTDVKIQGSCGSCWAFST 127

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
           TG++EG     T KL++LSEQ+L+DCD+          DDGC GG ++NA+  ++    G
Sbjct: 128 TGSVEGANFIATGKLLNLSEQQLVDCDRVCDKTDKASCDDGCGGGLMTNAYRYLIE--AG 185

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           GL+EE +YPY G    C+ + +   VK+  + S++ DE  +A  LV +GP+A+ +NA  +
Sbjct: 186 GLQEESSYPYTGKSGECKFDPEKIAVKVANFTSIAVDENQIAANLVHHGPLAIGLNAIFM 245

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFTHKAVPYWIIKNSWGEGWG 323
           Q Y+ GVS P+   C  G + L+H VL+VGYG       +F +K  PYWIIKNSWG  WG
Sbjct: 246 QTYIGGVSCPL--IC--GKKWLNHGVLLVGYGARGYSILRFGYK--PYWIIKNSWGNHWG 299

Query: 324 EKGYFRLYRGDGSCGINDYVRSAL 347
           EKGY+RL RG G CG+N  V + +
Sbjct: 300 EKGYYRLCRGHGMCGMNKMVSAVV 323


>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
 gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
 gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
 gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
 gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
 gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
 gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
 gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
 gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
 gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
          Length = 484

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 165 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 225 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 284

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 285 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 344

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 345 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 402

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG
Sbjct: 403 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 454

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 455 TDWGEKGYYYLHRGSGACGVNTMASSAVV 483


>gi|94556727|gb|ABF46642.1| papain-like cysteine proteinase [Pachysandra terminalis]
          Length = 374

 Score =  242 bits (618), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 134/315 (42%), Positives = 181/315 (57%), Gaps = 18/315 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ F ++  K Y +  E+  R  +F  NLR+ +  Q  +  S V+G+ +F DL+ AEF+ 
Sbjct: 58  FSSFKKRFGKAYTSCDEHDRRFGVFKANLRRAKRNQILDP-SAVHGVTQFFDLTPAEFRR 116

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
            YLG K     AD     ++P   LP  FDWR++ AVT VK+Q  CGS W+FS TG +EG
Sbjct: 117 TYLGLKRLRLPADTHEAPILPTNDLPADFDWRDHGAVTPVKNQGSCGSCWSFSATGALEG 176

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
                T KLVSLSEQ+L+DCD           D GC GG +++AF+  +    GGLE E+
Sbjct: 177 ANFLATGKLVSLSEQQLVDCDHVCDSEDPSSCDSGCNGGLMTSAFEYTLK--AGGLEREE 234

Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D + C+ +K    V  + +  VS DE  +A  LV NGP+A+ INA  +Q Y+ 
Sbjct: 235 DYPYTGTDHSKCKFDKTKIAVSASNFSVVSLDENQIAANLVTNGPLAIGINAMFMQTYIG 294

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P  + C      L H VL+VGYG         K  PYWIIKNSWGE WGEKGY+++
Sbjct: 295 GVSCP--YIC--SKRLLDHGVLLVGYGSAGFAPIRFKEKPYWIIKNSWGESWGEKGYYKI 350

Query: 331 YRGDGSCGINDYVRS 345
            RG   CG++  V +
Sbjct: 351 CRGRNICGMDSMVSA 365


>gi|297824991|ref|XP_002880378.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297326217|gb|EFH56637.1| hypothetical protein ARALYDRAFT_481008 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 360

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 145/359 (40%), Positives = 209/359 (58%), Gaps = 37/359 (10%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL--------FNYFLEQHNKTYATLVEYYS 61
           V+LL + VSVS   + GDE L     V             F  F ++  K Y ++ E+Y 
Sbjct: 10  VSLLFVFVSVS---ICGDEDLLIRQVVDEAEPKVLSSEDHFTLFKKKFGKDYGSIEEHYY 66

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSV 117
           R  +F  NLR+    Q  +  S  +G+ +FSDL+ +EF+ K+LG    FKL P  A+++ 
Sbjct: 67  RFSVFKANLRRAMRHQKMDP-SARHGVTQFSDLTGSEFRRKHLGVTGGFKL-PKDANQA- 123

Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
             ++P   LP  FDWR+  AVT VK+Q  CGS W+FSTTG +EG +   T KLVSLSEQ+
Sbjct: 124 -PILPTHNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQ 182

Query: 178 LIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNK 227
           L+DCD E         D GC GG +++AF+  +    GGL  E+ YPY G D  +C+L++
Sbjct: 183 LVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGLMREEDYPYTGTDGGSCKLDR 240

Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN 287
                 ++ +  VS +E  +A  LV+NGP+AVAINA  +Q Y+ GVS P  + C   +  
Sbjct: 241 SKIVASVSNFSVVSINEDQIAANLVKNGPLAVAINAAYMQTYIGGVSCP--YIC---SRR 295

Query: 288 LSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           L+H VL++GYG    ++   K  PYWIIKNSWGE WGE G++++ +G   CG++  V +
Sbjct: 296 LNHGVLLMGYGSSGYSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVST 354


>gi|218137972|gb|ACK57563.1| cysteine protease-like protein [Arachis hypogaea]
          Length = 364

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 137/334 (41%), Positives = 197/334 (58%), Gaps = 24/334 (7%)

Query: 26  GDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
           GDE   HL + +H   F+ F  + +KTYAT  E+  R  +F  NL + +  Q+ +  S +
Sbjct: 38  GDE---HLLNAEHH--FSAFKTKFSKTYATKEEHDYRFGVFKSNLLRAKSHQELD-PSAI 91

Query: 86  YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQT 145
           +G+ +FSDL+ +EF++++LG K     +D     ++P   LP+ FDWR++ AVT VK+Q 
Sbjct: 92  HGVTKFSDLTPSEFRSQFLGLKPLSLPSDAHNAPILPTDNLPKDFDWRDHGAVTNVKNQG 151

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNA 196
             GS W+FSTTG +EG +   T +LVSLSEQ+L+DCD E         D GC GG ++ A
Sbjct: 152 TGGSCWSFSTTGALEGAHFLATGELVSLSEQQLVDCDHECDPDLNDACDSGCNGGLMTTA 211

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           F    +K  GGL  E+ Y Y G D+  C+ +K      ++ +  VS DE  +A  LV+NG
Sbjct: 212 FG--YTKKAGGLVREEDYLYTGRDRGPCKFDKSKIAASVSNFSVVSLDEDQIAANLVKNG 269

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWII 314
           P++V INA  +Q Y+ GVS P  F C    ++L H VL+VGYG         K  PYWII
Sbjct: 270 PLSVGINAVYMQTYIGGVSCP--FIC---GKHLDHGVLLVGYGAGGYAPIRFKEKPYWII 324

Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           KNSWGE WGE GY+++ RG   CG++  V + + 
Sbjct: 325 KNSWGENWGENGYYKICRGPNMCGVDSMVSTVIA 358


>gi|119594953|gb|EAW74547.1| cathepsin F, isoform CRA_a [Homo sapiens]
 gi|119594954|gb|EAW74548.1| cathepsin F, isoform CRA_a [Homo sapiens]
          Length = 392

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 73  SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 132

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 133 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 192

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 193 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 252

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 253 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 310

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG
Sbjct: 311 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 362

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 363 TDWGEKGYYYLHRGSGACGVNTMASSAVV 391


>gi|3916212|gb|AAC78838.1| cathepsin F [Homo sapiens]
          Length = 338

 Score =  242 bits (617), Expect = 2e-61,   Method: Compositional matrix adjust.
 Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 19  SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 78

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 79  DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 138

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 139 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 198

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 199 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 256

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG
Sbjct: 257 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 308

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 309 TDWGEKGYYYLHRGSGACGVNTMASSAVV 337


>gi|297816790|ref|XP_002876278.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297322116|gb|EFH52537.1| hypothetical protein ARALYDRAFT_485911 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 368

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 144/368 (39%), Positives = 203/368 (55%), Gaps = 39/368 (10%)

Query: 1   MSCFYFFAGV--ALLSLTVSVSSFMVVGDEKL--HHLHHVKHTALFNYFLEQHNKTYATL 56
           ++C  FF  V  ++  LT+      V  DE+    +L      + F  F+  + K Y+T 
Sbjct: 10  ITCIIFFCHVVASVEDLTIR----QVTADERRVRPNLLGTHTESKFRVFMSDYGKNYSTR 65

Query: 57  VEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYA 113
            EY  RL IF+ N+ K    Q++  T     V+G+ +FSDL+  EF+  Y G        
Sbjct: 66  EEYIHRLGIFAKNVLKAAEHQMMDPT----AVHGVTQFSDLTEEEFKRMYTGVADVGGSR 121

Query: 114 DRSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
             +V A  P +    LP  FDWRE   VT VK+Q  CGS WAFSTTG  EG +   T KL
Sbjct: 122 GHAVGAEAPMVEVDGLPEDFDWREKGGVTEVKNQGACGSCWAFSTTGAAEGAHFVSTGKL 181

Query: 171 VSLSEQELIDCDQE----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           +SLSEQ+L+DCDQ           D+GC GG ++NA++ +M    GGLEEE++YPY G  
Sbjct: 182 LSLSEQQLVDCDQAVCDPKDKKACDNGCGGGLMTNAYEYLME--AGGLEEERSYPYTGKR 239

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
             C+ + +   V++  + ++  DE  +A  LV  GP+AV +NA  +Q Y+ GVS P+   
Sbjct: 240 GHCKFDPEKVAVRVVNFTTIPLDEDQIAANLVRQGPLAVGLNAVFMQTYIGGVSCPL--I 297

Query: 281 CDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
           C      ++H VL+VGY   G    + ++K  PYWIIKNSWG+ WGE GY++L RG   C
Sbjct: 298 CS--KRKVNHGVLLVGYGSKGFSILRLSNK--PYWIIKNSWGKKWGENGYYKLCRGHDIC 353

Query: 338 GINDYVRS 345
           GIN  V +
Sbjct: 354 GINSMVSA 361


>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
          Length = 517

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 198 SVISLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 257

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 258 DRGTAQYGVTKFSDLTEEEFRTIYLNSLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 317

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 318 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 377

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 378 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 435

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG
Sbjct: 436 AINAFGMQFYRHGISRPLRPLCS--PWLIDHAVLLVGYG------NRSDVPFWAIKNSWG 487

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 488 TDWGEKGYYYLHRGSGACGVNTMASSAVV 516


>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 373

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 145/371 (39%), Positives = 211/371 (56%), Gaps = 41/371 (11%)

Query: 4   FYFFAGVALLSLTVSVSSF-------------MVVGDEKLHHLHHVKHTALFNYFLEQHN 50
           F+F     LL++++  +                VV +E   HL + +H   F+ F  ++ 
Sbjct: 6   FFFLIAATLLAVSLGSAVISGEVNYGFVNPIRQVVPEENDEHLLNAEHH--FSLFKSKYE 63

Query: 51  KTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK- 109
           KTYAT  E+  R  +F  NLR+ +  Q  +  S V+G+ +FSDL+  EF+ K+LG K + 
Sbjct: 64  KTYATQEEHDHRFRVFKANLRRARRNQLLD-PSAVHGVTQFSDLTPKEFRRKFLGLKRRG 122

Query: 110 ---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
              P+  D     ++P   LP  FDWRE  AVT VK+Q MCGS W+FS  G +EG +   
Sbjct: 123 FRLPT--DTQTAPILPTSDLPTEFDWREQGAVTPVKNQGMCGSCWSFSAIGALEGAHFLA 180

Query: 167 TKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           TK+LVSLSEQ+L+DCD E         D GC GG ++NAF+  +    GGL +E+ YPY 
Sbjct: 181 TKELVSLSEQQLVDCDHECDPAQANSCDSGCSGGLMNNAFEYALK--AGGLMKEEDYPYT 238

Query: 218 G-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
           G D+ AC+ +K      ++ +  VS DE  +A  LV++GP+A+AINA  +Q Y+ GVS P
Sbjct: 239 GRDNTACKFDKSKIAASVSNFSVVSSDEDQIAANLVKHGPLAIAINAMWMQTYIGGVSCP 298

Query: 277 IQFFCDGGNENLSHSVLIVGYGVD-RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
             + C   +++  H VL+VG+G         K  PYWIIKNSWG  WGE GY+++ RG  
Sbjct: 299 --YVC---SKSQDHGVLLVGFGSSGYAPIRLKEKPYWIIKNSWGAMWGEHGYYKICRGPH 353

Query: 336 S-CGINDYVRS 345
           + CG++  V +
Sbjct: 354 NMCGMDTMVST 364


>gi|161778780|gb|ABX79341.1| cysteine protease [Vitis vinifera]
          Length = 377

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 132/316 (41%), Positives = 188/316 (59%), Gaps = 21/316 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ F  +  K+YA+  E+  R  +F  NLR+ +  Q  +  S  +G+ +FSDL+ AEF+ 
Sbjct: 62  FSIFKRRFGKSYASQEEHDYRFKVFKANLRRARRHQQLDP-SATHGVTQFSDLTPAEFRG 120

Query: 102 KYLGFK-LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            YLG + LK  +  +  P ++P   LP  FDWR++ AVT VK+Q  CGS W+FSTTG +E
Sbjct: 121 TYLGLRPLKLPHDAQKAP-ILPTNDLPEDFDWRDHGAVTAVKNQGSCGSCWSFSTTGALE 179

Query: 161 GVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           G     T  LVSLSEQ+L++CD E         D GC GG ++ AF+  +    GGL +E
Sbjct: 180 GANFLATGNLVSLSEQQLVECDHECDPEEMGSCDSGCNGGLMNTAFEYTLK--AGGLMKE 237

Query: 212 KTYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           + YPY G D+ +C+ +K      ++ +  +S DE  +A  LV+ GP+AVAINA  +Q YV
Sbjct: 238 EDYPYTGTDRGSCKFDKTKIAASVSNFSVISLDEDQIAANLVKIGPLAVAINAVFMQTYV 297

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            GVS P  + C   ++ L H VL+VGYG         K  PYWIIKNSWGE WGE G+++
Sbjct: 298 GGVSCP--YIC---SKRLDHGVLLVGYGSAGYAPIRMKDKPYWIIKNSWGENWGENGFYK 352

Query: 330 LYRGDGSCGINDYVRS 345
           + RG   CG++  V +
Sbjct: 353 ICRGRNVCGVDSMVST 368


>gi|19849|emb|CAA78361.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  241 bits (616), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 139/336 (41%), Positives = 190/336 (56%), Gaps = 26/336 (7%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VV +    HL + +H   F+ F  +  K YA+  E+  R  +F  NLR+ +L Q  +  
Sbjct: 30  QVVSETDDSHLLNAEHH--FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARLNQLLD-P 86

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGV 141
           S  +G+ +FSDL+ +EF+  YLG  K KP       P ++P   LP  FDWR++ AVTGV
Sbjct: 87  SAEHGITKFSDLTPSEFRRTYLGLHKPKPKLNAEKAP-ILPTSDLPADFDWRDHGAVTGV 145

Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGS 192
           K+Q  CGS W+FSTTG +EG +   T +LVSLSEQ+L+DCD E         D GC GG 
Sbjct: 146 KNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGH 205

Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
            + AF+  +    GGL+ EK YPY G D  C  +K      +  +  +  DE  +A  LV
Sbjct: 206 YATAFEYTLK--AGGLQLEKDYPYTGKDGKCHFDKSKICAAVTNFSVIGLDEDQIAANLV 263

Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAV 309
           ++GP+AV INA  +Q YV GVS P+  F     +   H VL+VGY   G    +   KA 
Sbjct: 264 KHGPLAVGINAAWMQTYVGGVSCPLICF-----KRQDHGVLLVGYGSHGFAPIRLKEKA- 317

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            YWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 318 -YWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 352


>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
 gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 377

 Score =  241 bits (615), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 131/319 (41%), Positives = 191/319 (59%), Gaps = 25/319 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ F ++  K+YA+  E+  R  +F  NL++ Q  Q  +  S  +G+ +FSDL+ +EF+ 
Sbjct: 60  FSVFKQKFGKSYASKEEHDHRFRVFKANLKRAQRHQALDP-SATHGVTQFSDLTPSEFRR 118

Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
            +LG + +    P+ A+++   ++P   LP  FDWR+  AV+ VK+Q  CGS W+FS TG
Sbjct: 119 SFLGLRSRRLGLPADANKA--PILPTDGLPTDFDWRDKGAVSEVKNQGSCGSCWSFSATG 176

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGL 208
            +EG     T KLVSLSEQ+L+DCD E         D GC GG +++AF+  +    GGL
Sbjct: 177 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEKGSCDSGCNGGLMNSAFEYTLKS--GGL 234

Query: 209 EEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
            +E+ YPY G D+  C+ +K      +  +  VS DE  +A  LV+NGP+AVAINA  +Q
Sbjct: 235 MKEQDYPYTGTDRGTCKFDKSKIAASVANFSVVSLDEEQIAANLVKNGPLAVAINAVFMQ 294

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKG 326
            Y+ GVS P  + C   +++L H VL+VGYG D       K  PYWIIKNSWG  WGE G
Sbjct: 295 TYIKGVSCP--YIC---SKHLDHGVLLVGYGSDGYAPIRLKDKPYWIIKNSWGANWGENG 349

Query: 327 YFRLYRGDGSCGINDYVRS 345
           Y+++ RG   CG++  V +
Sbjct: 350 YYKICRGRNICGVDSMVST 368


>gi|57282617|emb|CAE54306.1| putative papain-like cysteine proteinase [Gossypium hirsutum]
          Length = 373

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 132/315 (41%), Positives = 183/315 (58%), Gaps = 19/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           ++ F ++  K+Y +  E+  R  IF  NLR+    Q+ +  S  +G+ +FSDL+  EF+ 
Sbjct: 58  YSLFKKRFKKSYGSQKEHDYRFKIFQVNLRRAARHQNLDP-SATHGVTQFSDLTPGEFRK 116

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
            YLG +      D +   ++P   LP+ FDWRE  AVT VK+Q  CGS W+FSTTG +EG
Sbjct: 117 AYLGLRRLRLPKDATEAPILPTDNLPQDFDWREKGAVTPVKNQGSCGSCWSFSTTGALEG 176

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
                T KLVSLSEQ+L+DCD E         D GC GG +++AF+  +    GGL  E+
Sbjct: 177 ANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLK--AGGLMREE 234

Query: 213 TYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D+  C+ +      K+  +  VS DE  +A  L +NGP+AVAINA  +Q Y+ 
Sbjct: 235 DYPYTGTDRGTCKFDNTKVAAKVANFSVVSLDEDQIAANLFKNGPLAVAINAVFMQTYIG 294

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P  + C   ++ L H VL+VGYG         K  PYWIIKNSWGE WGE G++R+
Sbjct: 295 GVSCP--YIC---SKRLDHGVLLVGYGSAGYAPVRMKDKPYWIIKNSWGENWGENGFYRI 349

Query: 331 YRGDGSCGINDYVRS 345
            RG   CG++  V +
Sbjct: 350 CRGRNICGVDSMVST 364


>gi|290997496|ref|XP_002681317.1| cysteine protease [Naegleria gruberi]
 gi|284094941|gb|EFC48573.1| cysteine protease [Naegleria gruberi]
          Length = 350

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 135/333 (40%), Positives = 184/333 (55%), Gaps = 26/333 (7%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           L   +   LF  F ++H K Y    ++  R  IF  N+ K +           +G+++F 
Sbjct: 27  LSEAEMKKLFVKFSKKHAKLYGA-EDHGKRYQIFKSNVEKARYYNHVGKRE-TFGVSKFM 84

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-------PRAFDWREYDAVTGVKDQT 145
           DL+  EF+  +L     P  A + + A    +         P ++DWR+  AVT VK+Q 
Sbjct: 85  DLTPEEFKRMFLMKTYTPEEARKILAAPKEAVVTAQQVKDTPTSWDWRQKGAVTPVKNQG 144

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEGGSISN 195
            CGS W FSTTGN+EG++  KT KLVSLSEQ+L+DCD            D GC GG + +
Sbjct: 145 ACGSCWTFSTTGNVEGIHQIKTGKLVSLSEQQLVDCDHNCVTYQGQQACDAGCNGGLMWS 204

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           AF  ++    GGL  E +YPY G D  CR NK    V IN + S+  DE  MA +L  NG
Sbjct: 205 AFQYVIKT--GGLVTEDSYPYEGVDDTCRFNKSNVAVTINSWTSIPSDEGKMAAWLAANG 262

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIK 315
           P+++AINA  LQ Y +G+S+P  +FC+   ++L H VLIVG+G        K   YWIIK
Sbjct: 263 PISIAINAEWLQTYTSGISNP--WFCN--PQDLDHGVLIVGFGTGSNWLGEKE-DYWIIK 317

Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           NSWG  WGE GYFR+ RG G CG+N    S+L+
Sbjct: 318 NSWGADWGESGYFRIVRGKGKCGLNSVPSSSLI 350


>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 368

 Score =  241 bits (615), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 138/321 (42%), Positives = 192/321 (59%), Gaps = 30/321 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ F ++  K YA+  E+  R  +F  NLR+ +  Q  +  S  +G+ +FSDL+ +EF+ 
Sbjct: 51  FSLFKKKFGKVYASREEHDYRFSVFKSNLRRARRHQKLDP-SARHGVTQFSDLTRSEFKR 109

Query: 102 KYLG----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           K+LG    FKL P  A+++   ++P   LP  FDWRE  AVT VK+Q  CGS W+FS TG
Sbjct: 110 KHLGVKGGFKL-PKDANKA--PILPTENLPEEFDWRERGAVTPVKNQGSCGSCWSFSATG 166

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGL 208
            +EG     T KLVSLSEQ+L+DCD E         D GC GG +++AF+  +    GGL
Sbjct: 167 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGL 224

Query: 209 EEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
             E+ YPY G D A C+L+K      ++ +  +S DE  +A  LV+NGP+AVAINA  +Q
Sbjct: 225 MREEDYPYTGKDGATCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAAYMQ 284

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFTHKAVPYWIIKNSWGEGWGE 324
            Y+ GVS P  + C      L+H VL+VGYG       +F  K  PYWIIKNSWGE WGE
Sbjct: 285 TYIGGVSCP--YIC---MRRLNHGVLLVGYGSAGYAPARFKEK--PYWIIKNSWGETWGE 337

Query: 325 KGYFRLYRGDGSCGINDYVRS 345
            G++++ RG   CG++  V +
Sbjct: 338 DGFYKICRGRNVCGVDSLVST 358


>gi|356541074|ref|XP_003539008.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
          Length = 363

 Score =  241 bits (614), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 130/314 (41%), Positives = 186/314 (59%), Gaps = 23/314 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F  +  K YA+  E+  R  +F  N+R+ +  Q  +  S  +G+  FSDL+ +EF+ K L
Sbjct: 51  FKRRFGKAYASQEEHNYRFEVFKANMRRARRHQSLDP-SAAHGVTRFSDLTASEFRNKVL 109

Query: 105 GFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           G +    PS A+++   ++P   LP  FDWR++ AVT VK+Q  CGS W+FSTTG +EG 
Sbjct: 110 GLRGVRLPSNANKA--PILPTDNLPSDFDWRDHGAVTPVKNQGSCGSCWSFSTTGALEGA 167

Query: 163 YAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           +   T +LVSLSEQ+L+DCD E         D GC GG +++AF+ I+    GG+  E+ 
Sbjct: 168 HFLSTGELVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYILKS--GGVMREED 225

Query: 214 YPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
           YPY G D+  C+ +K      +  +  +S DE  +A  LV+NGP+AVAINA  +Q Y+ G
Sbjct: 226 YPYSGTDRGNCKFDKAKIAASVANFSVISLDEDQIAANLVKNGPLAVAINAAYMQTYIGG 285

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           VS P  + C   +  L H VL+VGYG         K  P+WIIKNSWGE WGE GY+++ 
Sbjct: 286 VSCP--YIC---SRRLDHGVLLVGYGSGAYAPIRMKEKPFWIIKNSWGENWGENGYYKIC 340

Query: 332 RGDGSCGINDYVRS 345
           RG   CG++  V +
Sbjct: 341 RGRNICGVDSMVST 354


>gi|297801998|ref|XP_002868883.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314719|gb|EFH45142.1| hypothetical protein ARALYDRAFT_490677 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 368

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 151/371 (40%), Positives = 208/371 (56%), Gaps = 47/371 (12%)

Query: 3   CFYFFAGVALLSLTVSVSSF-----------MVVGDEKLHHLHHVKHTALFNYFLEQHNK 51
           CF  F    L  L VSVSS             VVG  +   L    H   F+ F  +  K
Sbjct: 7   CFSVFV---LFFLIVSVSSSDVNDGDDLVIRQVVGGAEPQVLTSEDH---FSLFKSKFGK 60

Query: 52  TYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FK 107
            YA+  E+  R  +F  NLR+ +  Q  +  S  +G+ +FSDL+ +EF+ K+LG    FK
Sbjct: 61  VYASNEEHDYRFSVFKANLRRARRHQKLDP-SARHGVTQFSDLTRSEFRKKHLGVRAGFK 119

Query: 108 LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKT 167
           L P  A+++   ++P   LP  FDWR+  AVT VK+Q  CGS W+FS TG +EG     T
Sbjct: 120 L-PKDANKA--PILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATGALEGANFLAT 176

Query: 168 KKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
            KLVSLSEQ+L+DCD E         D GC GG +++AF+  +    GGL +E+ YPY G
Sbjct: 177 GKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGLMKEEDYPYTG 234

Query: 219 DD-KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
            D K C+L+K      ++ +  +S DE  +A  LV+NGP+AVAINA  +Q Y+ GVS P 
Sbjct: 235 KDGKTCKLDKSKIVASVSNFSVISIDEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCP- 293

Query: 278 QFFCDGGNENLSHSVLIVGYGV---DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
            + C      L+H VL+VGYG       +F  K  PYWIIKNSWGE WGE G++++ +G 
Sbjct: 294 -YIC---TRRLNHGVLLVGYGSAGYAPARFKEK--PYWIIKNSWGETWGENGFYKICKGR 347

Query: 335 GSCGINDYVRS 345
             CG++  V +
Sbjct: 348 NICGVDSLVST 358


>gi|296218871|ref|XP_002755611.1| PREDICTED: cathepsin F [Callithrix jacchus]
          Length = 489

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 136/327 (41%), Positives = 194/327 (59%), Gaps = 11/327 (3%)

Query: 22  FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
           F ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  + 
Sbjct: 173 FSLLNEDPLPQDLAVKMASIFRNFVITYNRTYESKEEAQWRLSVFVHNMVRAQKIQALDR 232

Query: 82  GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGV 141
           G+  YG+ +FSDL+  EF+  YL   L+           + ++  P  +DWR   AVT V
Sbjct: 233 GTAQYGVTKFSDLTEEEFRTTYLNPLLREPGKKMKQAKSVGDLAPPE-WDWRSKGAVTKV 291

Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIM 201
           KDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  S+A+  I 
Sbjct: 292 KDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKIDKACMGGLPSSAYSAI- 350

Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
            K  GGLE E  Y YRG  +AC  + +  +V IN  V +S++E  +A +L + GP++VAI
Sbjct: 351 -KNLGGLETEDDYSYRGHMQACNFSPEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAI 409

Query: 262 NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
           NA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG  
Sbjct: 410 NAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWGTD 461

Query: 322 WGEKGYFRLYRGDGSCGINDYVRSALV 348
           WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 462 WGEKGYYYLHRGSGACGVNTMASSAVV 488


>gi|5051468|emb|CAB44983.1| putative preprocysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 138/335 (41%), Positives = 190/335 (56%), Gaps = 26/335 (7%)

Query: 24  VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS 83
           VV +    HL + +H   F+ F  +  K YA+  E+  R  +F  NLR+ +  Q  +  S
Sbjct: 31  VVSETDDSHLLNAEHH--FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARRHQLLD-PS 87

Query: 84  GVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVK 142
             +G+ +FSDL+ +EF+  YLG  K KP       P ++P   LP  FDWR++ AVTGVK
Sbjct: 88  AEHGITKFSDLTPSEFRRTYLGLHKPKPKLNAEKAP-ILPTSDLPADFDWRDHGAVTGVK 146

Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSI 193
           +Q  CGS W+FSTTG +EG +   T +LVSLSEQ+L+DCD E         D GC GG +
Sbjct: 147 NQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGLM 206

Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
           + AF+  +    GGL+ EK YPY G D  C  +K      +  +  +  DE  +A  LV+
Sbjct: 207 TTAFEYTLK--AGGLQLEKDYPYTGKDGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLVK 264

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVP 310
           +GP+AV INA  +Q YV GVS P+  F     +   H VL+VGY   G    +   KA  
Sbjct: 265 HGPLAVGINAAWMQTYVGGVSCPLICF-----KRQDHGVLLVGYGSHGFAPIRLKEKA-- 317

Query: 311 YWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           YWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 318 YWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 352


>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
          Length = 379

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 136/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 60  SVISLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 119

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 120 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 179

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 180 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 239

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 240 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISV 297

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG
Sbjct: 298 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 349

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 350 TDWGEKGYYYLHRGSGACGVNTMASSAVV 378


>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
          Length = 368

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 141/340 (41%), Positives = 199/340 (58%), Gaps = 33/340 (9%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VVG  +   L    H   F+ F  +  K YA+  E+  R  +F  NLR+ +  Q  +  
Sbjct: 35  QVVGGAEPQVLTSEDH---FSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDP- 90

Query: 83  SGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
           S  +G+ +FSDL+ +EF+ K+LG    FKL P  A+++   ++P   LP  FDWR++ AV
Sbjct: 91  SATHGVTQFSDLTRSEFRKKHLGVRSGFKL-PKDANKA--PILPTENLPEDFDWRDHGAV 147

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCE 189
           T VK+Q  CGS W+FS TG +EG     T KLVSLSEQ+L+DCD E         D GC 
Sbjct: 148 TPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCN 207

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMA 248
           GG +++AF+  +    GGL +E+ YPY G D K C+L+K      ++ +  +S DE  +A
Sbjct: 208 GGLMNSAFEHTLKT--GGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIA 265

Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFT 305
             LV+NGP+AVAINA  +Q Y+ GVS P  + C      L+H VL+VGYG       +F 
Sbjct: 266 ANLVKNGPLAVAINAGYMQTYIGGVSCP--YIC---TRRLNHGVLLVGYGAAGYAPARFK 320

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            K  PYWIIKNSWGE WGE G++++ +G   CG++  V +
Sbjct: 321 EK--PYWIIKNSWGETWGENGFYKICKGRNICGVDSMVST 358


>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
          Length = 485

 Score =  241 bits (614), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 135/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 165 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 225 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 284

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 285 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 344

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  + +S++E  +A +L + GP++V
Sbjct: 345 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSMELSQNEQKLAAWLAKRGPISV 402

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG
Sbjct: 403 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 454

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 455 TDWGEKGYYYLHRGSGACGVNTMASSAVV 483


>gi|6467382|gb|AAF13146.1|AF136279_1 cathepsin F precursor [Homo sapiens]
          Length = 484

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 135/329 (41%), Positives = 194/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 165 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 225 DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 284

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN++G +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 285 KVKDQGMCGSCWAFSVTGNVKGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 344

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 345 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 402

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG
Sbjct: 403 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 454

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 455 TDWGEKGYYYLHRGSGACGVNTMASSAVV 483


>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
 gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
           Precursor
 gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
 gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
 gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
 gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
           thaliana]
 gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
          Length = 368

 Score =  240 bits (613), Expect = 6e-61,   Method: Compositional matrix adjust.
 Identities = 141/340 (41%), Positives = 199/340 (58%), Gaps = 33/340 (9%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VVG  +   L    H   F+ F  +  K YA+  E+  R  +F  NLR+ +  Q  +  
Sbjct: 35  QVVGGAEPQVLTSEDH---FSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDP- 90

Query: 83  SGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
           S  +G+ +FSDL+ +EF+ K+LG    FKL P  A+++   ++P   LP  FDWR++ AV
Sbjct: 91  SATHGVTQFSDLTRSEFRKKHLGVRSGFKL-PKDANKA--PILPTENLPEDFDWRDHGAV 147

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCE 189
           T VK+Q  CGS W+FS TG +EG     T KLVSLSEQ+L+DCD E         D GC 
Sbjct: 148 TPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEEADSCDSGCN 207

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMA 248
           GG +++AF+  +    GGL +E+ YPY G D K C+L+K      ++ +  +S DE  +A
Sbjct: 208 GGLMNSAFEYTLKT--GGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVISIDEEQIA 265

Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFT 305
             LV+NGP+AVAINA  +Q Y+ GVS P  + C      L+H VL+VGYG       +F 
Sbjct: 266 ANLVKNGPLAVAINAGYMQTYIGGVSCP--YIC---TRRLNHGVLLVGYGAAGYAPARFK 320

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            K  PYWIIKNSWGE WGE G++++ +G   CG++  V +
Sbjct: 321 EK--PYWIIKNSWGETWGENGFYKICKGRNICGVDSMVST 358


>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
 gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score =  240 bits (613), Expect = 7e-61,   Method: Compositional matrix adjust.
 Identities = 130/315 (41%), Positives = 184/315 (58%), Gaps = 19/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ F  +  K+Y +  E+  R  +F  NLR+    Q  +  +  +G+ +FSDL++AEF+ 
Sbjct: 53  FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDP-TASHGVTQFSDLTSAEFRK 111

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           + LG +      D +   ++P   LP  FDWRE  AV  VK+Q  CGS W+FSTTG +EG
Sbjct: 112 QVLGLRKLRLPKDANTAPILPTNDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEG 171

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +   T +LVSLSEQ+L+DCD E         D GC GG +++AF+  +    GGL  E+
Sbjct: 172 AHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREE 229

Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D+ AC+ +K      +  + +VS DE  +A  LV+NGP+AVAINA  +Q Y+ 
Sbjct: 230 DYPYTGMDRGACKFDKNKVAAGVANFSAVSLDEDQIAANLVKNGPLAVAINAVFMQTYIG 289

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P  + C   +  L H VL+VGYG         K  PYWIIKNSWGE WGE G++++
Sbjct: 290 GVSCP--YIC---SRRLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWGENGFYKI 344

Query: 331 YRGDGSCGINDYVRS 345
            RG   CG++  V +
Sbjct: 345 CRGRNICGVDSMVST 359


>gi|19851|emb|CAA78365.1| tobacco pre-pro-cysteine proteinase [Nicotiana tabacum]
          Length = 365

 Score =  240 bits (612), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 136/327 (41%), Positives = 188/327 (57%), Gaps = 26/327 (7%)

Query: 32  HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
           HL + +H   F+ F  +  K YA+  E+  R  +F  NLR+ +L Q  +  S  +G+ +F
Sbjct: 41  HLLNAEHH--FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARLNQLLD-PSAEHGITKF 97

Query: 92  SDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
           SDL+ +EF+  YLG  K KP       P ++P   LP  +DWR++ AVTGVK+Q  CGS 
Sbjct: 98  SDLTPSEFRRTYLGLHKPKPKVNAEKAP-ILPTSDLPADYDWRDHGAVTGVKNQGSCGSC 156

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIM 201
           W+FSTTG +EG +   T +LVSLSEQ+L+DCD E         D GC GG ++ AF+  +
Sbjct: 157 WSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDSEQQDSCDAGCGGGLMTTAFEYTL 216

Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
               GGL+ EK YPY G D  C  +K      +  +  +  DE  +A  LV++GP+AV I
Sbjct: 217 K--AGGLQLEKDYPYTGKDGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGI 274

Query: 262 NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSW 318
           NA  +Q YV GVS P+  F     +   H VL+VGY   G    +   KA  YWIIKNSW
Sbjct: 275 NAAWMQTYVGGVSCPLICF-----KRQDHGVLLVGYGSHGFAPIRLKEKA--YWIIKNSW 327

Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRS 345
           GE WGE GY+++ RG   CG++  V +
Sbjct: 328 GENWGEHGYYKICRGHNICGVDAMVST 354


>gi|351710879|gb|EHB13798.1| Cathepsin F [Heterocephalus glaber]
          Length = 482

 Score =  240 bits (612), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 135/321 (42%), Positives = 191/321 (59%), Gaps = 10/321 (3%)

Query: 28  EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG 87
           + L     +K  ++F  F+  +N+TY +  E   RL +F+ N+   Q +Q  +HG+  YG
Sbjct: 171 DPLPEEFSMKMISIFKNFVATYNRTYESKKEAQWRLSVFTRNMVLAQRIQALDHGTAQYG 230

Query: 88  LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
           + +FSDL+  EF+  YL   L+     +   A       P  +DWR+  AVT VK+Q MC
Sbjct: 231 VTKFSDLTEEEFRTIYLNPLLREEPGKKMHLAKAVRDPAPLEWDWRKKGAVTEVKNQGMC 290

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
           GS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  I S   GG
Sbjct: 291 GSCWAFSVTGNVEGQWFLNRGTLLSLSEQELLDCDKMDKACMGGFPSNAYLAIKSL--GG 348

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
           LE E  Y Y+G  KAC  + K  +V IN  V +S++E  +A +L   GP++VAINA+ +Q
Sbjct: 349 LETEDDYSYQGHMKACNFSAKKAKVYINDSVELSKNEQKLAAWLAVKGPISVAINAFGMQ 408

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
           FY  G++HP++  C      + H++L+VGYG          VP+W IKNSWG  WGE+GY
Sbjct: 409 FYRHGIAHPLRPLCS--PWFIDHAMLVVGYG------NRSNVPFWAIKNSWGTDWGEEGY 460

Query: 328 FRLYRGDGSCGINDYVRSALV 348
           + L+RG G+CG+N    SA+V
Sbjct: 461 YYLHRGSGACGVNIMASSAVV 481


>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
          Length = 328

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 130/310 (41%), Positives = 180/310 (58%), Gaps = 13/310 (4%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           + +YL  +          P+   ++T+    FDWRE+ AV  V DQ  CGS WAFS  GN
Sbjct: 89  KTRYLRMRFDGPIVSED-PSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGN 147

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG +  KT  L++LSEQ+L+DCD  D GC GG     +  I     GGLE    YPY G
Sbjct: 148 VEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEIEKM--GGLELASDYPYTG 205

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
            D  C +N+      +N    +   E   A+ L E GP++ A+NA  LQFY+ G+  PI 
Sbjct: 206 VDGICYMNQSKFVAYVNESTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPIP 265

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
           F C+     L+H+VL VGYG      T   +PYWI+KNSWG G+GEKGYFR++RG G+CG
Sbjct: 266 FLCN--PHGLNHAVLTVGYG------TEFGIPYWIVKNSWGVGFGEKGYFRIFRGAGTCG 317

Query: 339 INDYVRSALV 348
           IN  V +A++
Sbjct: 318 INLVVSTAII 327


>gi|118489556|gb|ABK96580.1| unknown [Populus trichocarpa x Populus deltoides]
          Length = 367

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 134/333 (40%), Positives = 195/333 (58%), Gaps = 30/333 (9%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
           D+ L+  HH      F  F  +  KTYAT  E+  R  +F  NLR+ +  Q  +  +  +
Sbjct: 42  DDLLNAEHH------FTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDP-TAAH 94

Query: 87  GLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
           G+ +FSDL+  EF+ ++LG K +   P+ A+++   ++P   LP  +DWR++ AVT VKD
Sbjct: 95  GVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKA--PILPTTDLPTDYDWRDHGAVTEVKD 152

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
           Q  CGS W+FS TG +EG +   T +L SLSEQ+L+DCD E         D GC+GG ++
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
           NAF+  +    GGLE E+ YPY G D   C+ +K      ++ +  VS DE  +A  LV+
Sbjct: 213 NAFEYALK--AGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSIDEDQIAANLVK 270

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYW 312
           +GP++VAINA  +Q YV GVS P  + C   ++   H VL+VGYG         K  P+W
Sbjct: 271 HGPLSVAINAAFMQTYVGGVSCP--YIC---SKRQDHGVLLVGYGSAGYAPIRFKEKPFW 325

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           IIKNSWG+ WGE GY+++ RG   CG++  V +
Sbjct: 326 IIKNSWGQNWGENGYYKICRGRNICGVDSMVST 358


>gi|348564702|ref|XP_003468143.1| PREDICTED: cathepsin F-like [Cavia porcellus]
          Length = 462

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 135/313 (43%), Positives = 190/313 (60%), Gaps = 10/313 (3%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K  +LF  F+  +N+TY +  E   RL +F+ N+   Q +Q  + G+  YG+ +FSDL+
Sbjct: 159 MKIASLFKKFVATYNRTYESKEETQWRLSVFTRNMILAQKIQALDRGTAQYGVTKFSDLT 218

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
             EF+  YL   L+   +     A I + + P  +DWR+  AVT VK+Q MCGS WAFS 
Sbjct: 219 EEEFRTIYLNPLLREHPSKTMRQAKIVHDSAPPEWDWRKKGAVTEVKNQGMCGSCWAFSV 278

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           TGN+EG +  K   L+SLSEQEL+DCD+ D  C GG   NA+  I S   GGLE E  Y 
Sbjct: 279 TGNVEGQWFLKKGTLLSLSEQELLDCDKVDKACMGGLPINAYSAIKSL--GGLETEDDYS 336

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           Y+G  +AC  + K  +V IN  V +S++E  +A +L   GP+++AINA+ +QFY  G++H
Sbjct: 337 YQGHMEACNFSAKKAKVYINDSVELSKNEQYLAAWLAVKGPISIAINAFGMQFYRHGIAH 396

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
           P+Q  C      + H++LIVGYG          VP+W IKNSWG  WGE+GY+ L+RG  
Sbjct: 397 PLQPLCSPW--FIDHAMLIVGYG------KRSGVPFWAIKNSWGTDWGEEGYYYLHRGSR 448

Query: 336 SCGINDYVRSALV 348
           SCG+N    SA+V
Sbjct: 449 SCGVNVMASSAVV 461


>gi|395851695|ref|XP_003798388.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Otolemur garnettii]
          Length = 491

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 134/313 (42%), Positives = 188/313 (60%), Gaps = 10/313 (3%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           ++  ++F  FL  +N+TY +  E   RL IF  N+ + Q +Q  + G+  YG+ +FSDL+
Sbjct: 188 MQMLSVFKNFLTTYNRTYESKEETQWRLSIFINNMVRAQKIQALDQGTARYGITKFSDLT 247

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
             EF+  YL   L+     +   A       P  +DWR   AVT VK+Q MCGS WAFS 
Sbjct: 248 EEEFRTIYLNPLLREDPGKKMRVAKPVGDPAPPEWDWRNKGAVTNVKNQGMCGSCWAFSV 307

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           TGN+EG +  K   L+SLSEQEL+DCD+ D  C GG  SNA+  I  K  GGLE E+ Y 
Sbjct: 308 TGNVEGQWFLKQGTLLSLSEQELLDCDKMDKACLGGLPSNAYSAI--KNLGGLETEEDYS 365

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           Y+G  +AC  + +  +V IN  V +S +E  +A +L + GP++VAINA+ +QFY  G+S 
Sbjct: 366 YQGQMQACNFSAEKAKVYINDSVELSHNEQKLAAWLAKKGPISVAINAFGMQFYRHGISR 425

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
           P++  C      + H+VLIVGYG          +P+W IKNSWG  WGE+GY+ L+RG G
Sbjct: 426 PLRPLCTPW--LIDHAVLIVGYG------NRSDIPFWAIKNSWGTDWGEQGYYYLHRGSG 477

Query: 336 SCGINDYVRSALV 348
           +CG+N    SA+V
Sbjct: 478 ACGVNTMASSAVV 490


>gi|18414611|ref|NP_567489.1| Papain family cysteine protease [Arabidopsis thaliana]
 gi|2244977|emb|CAB10398.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|7268368|emb|CAB78661.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|14517442|gb|AAK62611.1| AT4g16190/dl4135w [Arabidopsis thaliana]
 gi|22136546|gb|AAM91059.1| AT4g16190/dl4135w [Arabidopsis thaliana]
 gi|22530956|gb|AAM96982.1| cysteine proteinase [Arabidopsis thaliana]
 gi|23397184|gb|AAN31875.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|110740834|dbj|BAE98514.1| cysteine proteinase like protein [Arabidopsis thaliana]
 gi|332658313|gb|AEE83713.1| Papain family cysteine protease [Arabidopsis thaliana]
          Length = 373

 Score =  239 bits (611), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 141/338 (41%), Positives = 199/338 (58%), Gaps = 28/338 (8%)

Query: 24  VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS 83
           VV +E    L + +H   F  F  ++ KTYAT VE+  R  +F  NLR+ +  Q  +  S
Sbjct: 39  VVPEENDEQLLNAEHH--FTLFKSKYEKTYATQVEHDHRFRVFKANLRRARRNQLLD-PS 95

Query: 84  GVYGLNEFSDLSTAEFQAKYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREYDAVT 139
            V+G+ +FSDL+  EF+ K+LG K +    P+  D     ++P   LP  FDWRE  AVT
Sbjct: 96  AVHGVTQFSDLTPKEFRRKFLGLKRRGFRLPT--DTQTAPILPTSDLPTEFDWREQGAVT 153

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
            VK+Q MCGS W+FS  G +EG +   TK+LVSLSEQ+L+DCD E         D GC G
Sbjct: 154 PVKNQGMCGSCWSFSAIGALEGAHFLATKELVSLSEQQLVDCDHECDPAQANSCDSGCSG 213

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAK 249
           G ++NAF+  +    GGL +E+ YPY G D  AC+ +K      ++ +  VS DE  +A 
Sbjct: 214 GLMNNAFEYALK--AGGLMKEEDYPYTGRDHTACKFDKSKIVASVSNFSVVSSDEDQIAA 271

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD-RTKFTHKA 308
            LV++GP+A+AINA  +Q Y+ GVS P  + C   +++  H VL+VG+G         K 
Sbjct: 272 NLVQHGPLAIAINAMWMQTYIGGVSCP--YVC---SKSQDHGVLLVGFGSSGYAPIRLKE 326

Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGS-CGINDYVRS 345
            PYWIIKNSWG  WGE GY+++ RG  + CG++  V +
Sbjct: 327 KPYWIIKNSWGAMWGEHGYYKICRGPHNMCGMDTMVST 364


>gi|51969854|dbj|BAD43619.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 144/359 (40%), Positives = 208/359 (57%), Gaps = 37/359 (10%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL--------FNYFLEQHNKTYATLVEYYS 61
           V+L+ + VSVS   V GDE +     V  T          F  F ++  K Y ++ E+Y 
Sbjct: 11  VSLIFVFVSVS---VCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHYY 67

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSV 117
           R  +F  NL +    Q  +  S  +G+ +FSDL+ +EF+ K+LG    FKL P  A+++ 
Sbjct: 68  RFSVFKANLLRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHLGVKGGFKL-PKDANQA- 124

Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
             ++P   LP  FDWR+  AVT VK+Q  CGS W+FSTTG +EG +   T KLVSLSEQ+
Sbjct: 125 -PILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEGAHFLATGKLVSLSEQQ 183

Query: 178 LIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNK 227
           L+DCD E         D GC G  +++AF+  +    GGL  EK YPY G D  +C+L++
Sbjct: 184 LVDCDHECDPEEEGSCDSGCNGRLMNSAFEYTLKT--GGLMREKDYPYTGTDGGSCKLDR 241

Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN 287
                 ++ +  VS +E  +A  L++NGP+AVAINA  +Q Y+ GVS P  + C   +  
Sbjct: 242 SKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYIGGVSCP--YIC---SRR 296

Query: 288 LSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           L+H VL+VGYG    ++   K  PYWIIKNSWGE WGE G++++ +G   CG++  V +
Sbjct: 297 LNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYKICKGRNICGVDSLVST 355


>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
 gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
          Length = 328

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 131/311 (42%), Positives = 182/311 (58%), Gaps = 15/311 (4%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           + +YL  +   P  ++   P    ++T+    FDWRE+ AV  V DQ  CGS WAFS  G
Sbjct: 89  KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           N+EG +  KT  L++LSEQ+L+DCD  D GC GG     +  I     GGLE    YPY 
Sbjct: 147 NVEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEIEKM--GGLELASDYPYT 204

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G D  C +N+      +N    +   E   A+ L E GP++ A+NA  LQFY+ G+  PI
Sbjct: 205 GVDGICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPI 264

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
            F C+     L+H+VL VGYG      T   +PYWI+KNSWG G+GEKGYFR++RG G+C
Sbjct: 265 PFLCN--PHGLNHAVLTVGYG------TEFGIPYWIVKNSWGVGFGEKGYFRIFRGAGTC 316

Query: 338 GINDYVRSALV 348
           GIN  V +A++
Sbjct: 317 GINLVVSTAII 327


>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
          Length = 368

 Score =  239 bits (610), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 130/315 (41%), Positives = 183/315 (58%), Gaps = 19/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ F  +  K+Y +  E+  R  +F  NLR+    Q  +  +  +G+ +FSDL++AEF+ 
Sbjct: 53  FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDP-TASHGVTQFSDLTSAEFRK 111

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           + LG +      D +   ++P   LP  FDWRE  AV  VK+Q  CGS W+FSTTG +EG
Sbjct: 112 QVLGLRKLRLPKDANTAPILPTNDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEG 171

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +   T +LVSLSEQ+L+DCD E         D GC GG +++AF+  +    GGL  E+
Sbjct: 172 AHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREE 229

Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D+ AC+ +K      +  +  VS DE  +A  LV+NGP+AVAINA  +Q Y+ 
Sbjct: 230 DYPYTGMDRGACKFDKNKVAAGVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIG 289

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P  + C   +  L H VL+VGYG         K  PYWIIKNSWGE WGE G++++
Sbjct: 290 GVSCP--YIC---SRRLDHGVLLVGYGSAAYAPVRMKEKPYWIIKNSWGESWGENGFYKI 344

Query: 331 YRGDGSCGINDYVRS 345
            RG   CG++  V +
Sbjct: 345 CRGRNICGVDSMVST 359


>gi|312281839|dbj|BAJ33785.1| unnamed protein product [Thellungiella halophila]
          Length = 373

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 136/321 (42%), Positives = 191/321 (59%), Gaps = 30/321 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ F  +  K YA+  E+  RL +F  NLR+ +  Q  +  S  +G+ +FSDL+ +EF+ 
Sbjct: 56  FSLFKRKFGKVYASSEEHDYRLSVFKANLRRARRHQKLDP-SARHGVTQFSDLTRSEFRK 114

Query: 102 KYLG----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           K+LG    FKL P  A+++   ++P   LP  FDWR+  AVT VK+Q  CGS W+FS TG
Sbjct: 115 KHLGVRGGFKL-PKDANKA--PILPTENLPEDFDWRDRGAVTPVKNQGSCGSCWSFSATG 171

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGL 208
            +EG     T KLVSLSEQ+L+DCD E         D GC GG +++AF+  +    GGL
Sbjct: 172 ALEGANFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKT--GGL 229

Query: 209 EEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
             E+ YPY G D   C+L+K      ++ +  +S DE  +A  LV+NGP+AVAINA  +Q
Sbjct: 230 MREEDYPYTGKDGPTCKLDKSKIVASVSNFSVISIDEDQIAANLVKNGPLAVAINAAYMQ 289

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFTHKAVPYWIIKNSWGEGWGE 324
            Y+ GVS P  + C      L+H VL+VGYG       +F  K  PYWIIKNSWGE WGE
Sbjct: 290 TYIGGVSCP--YIC---ARRLNHGVLLVGYGSAGYAPARFKEK--PYWIIKNSWGESWGE 342

Query: 325 KGYFRLYRGDGSCGINDYVRS 345
            G++++ +G   CG++  V +
Sbjct: 343 NGFYKICKGRNICGVDSLVST 363


>gi|49456321|emb|CAG46481.1| CTSF [Homo sapiens]
          Length = 338

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 135/329 (41%), Positives = 193/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 19  SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 78

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 79  DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 138

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 139 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 198

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE    Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 199 I--KNLGGLETVDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 256

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG
Sbjct: 257 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWG 308

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 309 TDWGEKGYYYLHRGSGACGVNTMASSAVV 337


>gi|118485910|gb|ABK94801.1| unknown [Populus trichocarpa]
          Length = 367

 Score =  239 bits (609), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 134/333 (40%), Positives = 195/333 (58%), Gaps = 30/333 (9%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
           D+ L+  HH      F  F  +  KTYAT  E+  R  +F  NLR+ +  Q  +  +  +
Sbjct: 42  DDLLNAEHH------FTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDP-TAAH 94

Query: 87  GLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
           G+ +FSDL+  EF+ ++LG K +   P+ A+++   ++P   LP  +DWR++ AVT VKD
Sbjct: 95  GVTKFSDLTPKEFRRQFLGLKRRLRLPTDANKA--PILPTTDLPTDYDWRDHGAVTEVKD 152

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
           Q  CGS W+FS TG +EG +   T +L SLSEQ+L+DCD E         D GC+GG ++
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
           NAF+  +    GGLE E+ YPY G D   C+ +K      ++ +  VS DE  +A  LV+
Sbjct: 213 NAFEYALK--AGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSIDEDQIAANLVK 270

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYW 312
           +GP++VAINA  +Q YV GVS P  + C   ++   H VL+VGYG         K  P+W
Sbjct: 271 HGPLSVAINAAFMQTYVGGVSCP--YIC---SKRQDHGVLLVGYGSAGYAPIRFKEKPFW 325

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           IIKNSWG+ WGE GY+++ RG   CG++  V +
Sbjct: 326 IIKNSWGQNWGENGYYKICRGRNICGVDSMVST 358


>gi|356545108|ref|XP_003540987.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
          Length = 365

 Score =  238 bits (608), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 134/331 (40%), Positives = 188/331 (56%), Gaps = 25/331 (7%)

Query: 26  GDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
           GD +L   HH      F  F  +  K Y +  E+  R  +F  N+R+ +  Q  +  S  
Sbjct: 40  GDVRLGAEHH------FLEFKRRFGKAYDSEDEHDYRYKVFKANMRRARRHQSLDP-SAA 92

Query: 86  YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQT 145
           +G+  FSDL+ +EF+ K LG +      D +   ++P   LP  FDWR++ AVT VK+Q 
Sbjct: 93  HGVTRFSDLTPSEFRNKVLGLRGVRLPLDANKAPILPTDNLPSDFDWRDHGAVTPVKNQG 152

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNA 196
            CGS W+FSTTG +EG +   T +LVSLSEQ+L+DCD E         D GC GG +++A
Sbjct: 153 SCGSCWSFSTTGALEGAHFLSTGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSA 212

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           F+ I+    GG+  E+ YPY G D   C+ +K      +  +  VS DE  +A  LV+NG
Sbjct: 213 FEYILKS--GGVMREEDYPYSGADSGTCKFDKTKIAASVANFSVVSLDEDQIAANLVKNG 270

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWII 314
           P+AVAINA  +Q Y+ GVS P  + C   +  L+H VL+VGYG         K  P+WII
Sbjct: 271 PLAVAINAAYMQTYIGGVSCP--YVC---SRRLNHGVLLVGYGSGAYAPIRMKEKPFWII 325

Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           KNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 326 KNSWGENWGENGYYKICRGRNICGVDSMVST 356


>gi|28192375|gb|AAK07731.1| CPR2-like cysteine proteinase [Nicotiana tabacum]
          Length = 363

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 137/336 (40%), Positives = 189/336 (56%), Gaps = 26/336 (7%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VV +    HL + +H   F+ F  +  K YA+  E+  R  +F  N R+ +  Q  +  
Sbjct: 30  QVVSETDDSHLLNAEHH--FSLFKSKFGKIYASEEEHDHRFKVFKANRRRARRHQLLD-P 86

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGV 141
           S  +G+ +FSDL+ +EF+  YLG  K KP       P ++P   LP  FDWR++ AVTGV
Sbjct: 87  SAEHGITKFSDLTPSEFRRTYLGLHKPKPKLNAEKAP-ILPTSDLPADFDWRDHGAVTGV 145

Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGS 192
           K+Q  CGS W+FSTTG +EG +   T +LVSLSEQ+L+DCD E         D GC GG 
Sbjct: 146 KNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDACDAGCGGGL 205

Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
           ++ AF+  +    GGL+ EK YPY G D  C  +K      +  +  +  DE  +A  LV
Sbjct: 206 MTTAFEYTLK--AGGLQLEKDYPYTGKDGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLV 263

Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAV 309
           ++GP+AV INA  +Q YV GVS P+  F     +   H VL+VGY   G    +   KA 
Sbjct: 264 KHGPLAVGINAAWMQTYVGGVSCPLICF-----KRQDHGVLLVGYGSHGFAPIRLKEKA- 317

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            YWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 318 -YWIIKNSWGENWGEHGYYKICRGHNICGVDAMVST 352


>gi|19195|emb|CAA78403.1| pre-pro-cysteine proteinase [Solanum lycopersicum]
          Length = 361

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 138/353 (39%), Positives = 196/353 (55%), Gaps = 23/353 (6%)

Query: 4   FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
           F  F+     S    +   +V G++  H L+   H   F+ F  +  K YA+  E+  RL
Sbjct: 10  FALFSSAIAFSDDDPLIRQVVSGNDDNHMLNAEHH---FSLFKAKFGKIYASQEEHDHRL 66

Query: 64  HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIP 122
            +F  NL + +  Q  +  S  +G+ +FSDL+ +EF+  YLG  K +P+      P ++P
Sbjct: 67  KVFKANLHRAKRHQLLD-PSAEHGITQFSDLTPSEFRRTYLGLNKPRPNLNAEKAP-ILP 124

Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
              LP  FDWRE  AVT VK+Q  CGS W+FSTTG +EG +   T +LVSLSEQ+L+DCD
Sbjct: 125 TKDLPSDFDWREKGAVTDVKNQGSCGSCWSFSTTGAVEGAHFLATGELVSLSEQQLVDCD 184

Query: 183 QE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVK 233
            E         D GC GG ++ AF+  +    GGL+ EK YPY G +  C  +K      
Sbjct: 185 HECDPVEKNDCDAGCNGGLMTTAFEYTLK--AGGLQLEKDYPYTGRNGKCHFDKSRIAAS 242

Query: 234 INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           ++ +  V  DE  +A  L+++GP+AV INA  +Q YV GVS P+  F     +   H VL
Sbjct: 243 VSNFSVVGLDEDQIAANLLKHGPLAVGINAAWMQTYVRGVSCPLICF-----KRQDHGVL 297

Query: 294 IVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           +VGYG +       K  PYWIIKNSWG+ WGE GY+++ RG   CG++  V +
Sbjct: 298 LVGYGSEGFAPIRLKNKPYWIIKNSWGKTWGEHGYYKICRGHHICGVDAMVST 350


>gi|5679322|gb|AAD46920.1|AF167986_1 putative cysteine proteinase GmPM33 [Glycine max]
          Length = 363

 Score =  238 bits (607), Expect = 3e-60,   Method: Compositional matrix adjust.
 Identities = 138/340 (40%), Positives = 194/340 (57%), Gaps = 36/340 (10%)

Query: 19  VSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD 78
           ++  + +GD +L     ++    F  F+E + ++Y+T  EY  RL IF+ N+ +    Q 
Sbjct: 36  IARKLKLGDNEL-----LRTEKKFKVFMENYGRSYSTEEEYLRRLGIFAQNMVRAAEHQA 90

Query: 79  TEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
            +  + V+G+ +FS L  +   A   G    P   D           LP  FDWRE  AV
Sbjct: 91  LDP-TAVHGVTQFS-LPVSNNAA---GGIAPPLEVD----------GLPENFDWREKGAV 135

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCE 189
           T VK Q  CGS WAFSTTG+IEG     T KLVSLS+Q+L+DCD +         D+GC 
Sbjct: 136 TEVKLQGRCGSCWAFSTTGSIEGANFLATGKLVSLSDQQLLDCDNKCDITEKTSCDNGCN 195

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAK 249
           GG ++NA++ ++    GGLEEE +YPY G+   C+ + +   VKI  + ++  DE  +A 
Sbjct: 196 GGLMTNAYNYLLES--GGLEEESSYPYTGERGECKFDPEKIAVKITNFTNIPADENQIAA 253

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKA- 308
           YLV+NGP+A+ +NA  +Q Y+ GVS P+   C    + L+H VL+VGYG           
Sbjct: 254 YLVKNGPLAMGVNAIFMQTYIGGVSCPL--ICS--KKRLNHGVLLVGYGAKGFSILRLGN 309

Query: 309 VPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
            PYWIIKNSWGE WGE GY++L RG G CGIN  V +A+V
Sbjct: 310 KPYWIIKNSWGEKWGEDGYYKLCRGHGMCGINTMVSAAMV 349


>gi|224066056|ref|XP_002302004.1| predicted protein [Populus trichocarpa]
 gi|222843730|gb|EEE81277.1| predicted protein [Populus trichocarpa]
          Length = 367

 Score =  238 bits (606), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 135/333 (40%), Positives = 196/333 (58%), Gaps = 30/333 (9%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
           D+ L+  HH      F  F  +  KTYAT  E+  R  +F  NLR+ +  Q  +  +  +
Sbjct: 42  DDLLNAEHH------FTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDP-TAAH 94

Query: 87  GLNEFSDLSTAEFQAKYLGFK--LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
           G+ +FSDL+  EF+ ++LG K  L+ P+ A+++   ++P   LP  +DWR++ AVT VKD
Sbjct: 95  GITKFSDLTPKEFRRQFLGLKRWLRLPTDANKA--PILPTTDLPTDYDWRDHGAVTEVKD 152

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
           Q  CGS W+FS TG +EG +   T +L SLSEQ+L+DCD E         D GC+GG ++
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
           NAF+  +    GGLE E+ YPY G D   C+ +K      ++ +  VS DE  +A  LV+
Sbjct: 213 NAFEYALK--AGGLEREEDYPYTGTDGGTCKFDKSKVVASVSNFSVVSIDEDQIAANLVK 270

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYW 312
           +GP++VAINA  +Q YV GVS P  + C   ++   H VL+VGYG         K  P+W
Sbjct: 271 HGPLSVAINAAFMQTYVGGVSCP--YIC---SKRQDHGVLLVGYGSAGYAPIRFKEKPFW 325

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           IIKNSWG+ WGE GY+++ RG   CG++  V +
Sbjct: 326 IIKNSWGQNWGENGYYKICRGRNICGVDSMVST 358


>gi|516865|emb|CAA52403.1| putative thiol protease [Arabidopsis thaliana]
          Length = 313

 Score =  237 bits (605), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 132/316 (41%), Positives = 192/316 (60%), Gaps = 26/316 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F ++  K Y ++ E+Y R  +F  NL +    Q  +  S  +G+ +FSDL+ +EF+ K+L
Sbjct: 3   FKKKFGKVYGSIEEHYYRFSVFKANLLRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHL 61

Query: 105 G----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           G    FKL P  A+++   ++P   LP  FDWR+  AVT VK+Q  CGS W+FSTTG +E
Sbjct: 62  GVKGGFKL-PKDANQA--PILPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALE 118

Query: 161 GVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           G +   T KLVSLSEQ+L+DCD E         D GC GG +++AF+  +    GGL  E
Sbjct: 119 GAHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYTLKT--GGLMRE 176

Query: 212 KTYPYRGDD-KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           K YPY G D  +C+L++      ++ +  VS +E  +A  L++NGP+AVAINA  +Q Y+
Sbjct: 177 KDYPYTGTDGGSCKLDRSKIVASVSNFSVVSINEDQIAANLIKNGPLAVAINAAYMQTYI 236

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            GVS P  + C   +  L+H VL+VGYG    ++   K  PYWIIKNSWGE WGE G+++
Sbjct: 237 GGVSCP--YIC---SRRLNHGVLLVGYGSAGFSQARLKEKPYWIIKNSWGESWGENGFYK 291

Query: 330 LYRGDGSCGINDYVRS 345
           + +G   CG++  V +
Sbjct: 292 ICKGRNICGVDSLVST 307


>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
 gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
          Length = 368

 Score =  237 bits (605), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 126/315 (40%), Positives = 184/315 (58%), Gaps = 19/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ F  +  K+Y +  E+  R  +F  NLR+    Q+ +  +  +G+ +FSDL+ AEF+ 
Sbjct: 53  FSLFKSKFKKSYGSQEEHDYRFSVFKANLRRAARHQELDP-TASHGVTQFSDLTPAEFRK 111

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           + LG +      D +   ++P   LP  FDWR+  AV  +K+Q  CGS W+FS TG +EG
Sbjct: 112 QVLGLRRLRLPKDANEAPILPTSDLPEDFDWRDKGAVGPIKNQGSCGSCWSFSATGALEG 171

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +   T +LVSLSEQ+L+DCD E         D GC GG +++AF+  +    GGL  E+
Sbjct: 172 AHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREE 229

Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D+ AC+ +K     ++  +  VS DE  +A  LV+NGP+AVAINA  +Q Y+ 
Sbjct: 230 DYPYTGTDRDACKFDKNKVAARVANFSVVSLDEDQIAANLVKNGPLAVAINAVFMQTYIG 289

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P  + C   +  L H VL+VGYG    +    K  P+WIIKNSWGE WGE G++++
Sbjct: 290 GVSCP--YIC---SRRLDHGVLLVGYGSAGYSPVRMKEKPFWIIKNSWGEKWGENGFYKI 344

Query: 331 YRGDGSCGINDYVRS 345
            RG   CG++  V +
Sbjct: 345 CRGRNVCGVDSMVST 359


>gi|3916214|gb|AAC78839.1| cathepsin F [Homo sapiens]
          Length = 302

 Score =  237 bits (604), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 132/310 (42%), Positives = 186/310 (60%), Gaps = 10/310 (3%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  + G+  YG+ +FSDL+  E
Sbjct: 2   ASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEE 61

Query: 99  FQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           F+  YL   L+    ++   A       P  +DWR   AVT VKDQ MCGS WAFS TGN
Sbjct: 62  FRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGN 121

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  I  K  GGLE E  Y Y+G
Sbjct: 122 VEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSAI--KNLGGLETEDDYSYQG 179

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
             ++C  + +  +V IN  V +S++E  +A +L + GP++VAINA+ +QFY  G+S P++
Sbjct: 180 HMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISVAINAFGMQFYRHGISRPLR 239

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
             C      + H+VL+VGYG          VP+W IKNSWG  WGEKGY+ L+RG G+CG
Sbjct: 240 PLCSPW--LIDHAVLLVGYG------NRSDVPFWAIKNSWGTDWGEKGYYYLHRGSGACG 291

Query: 339 INDYVRSALV 348
           +N    SA+V
Sbjct: 292 VNTMASSAVV 301


>gi|118485796|gb|ABK94746.1| unknown [Populus trichocarpa]
          Length = 367

 Score =  237 bits (604), Expect = 6e-60,   Method: Compositional matrix adjust.
 Identities = 135/333 (40%), Positives = 195/333 (58%), Gaps = 30/333 (9%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
           D+ L+  HH      F  F  +  KTYAT  E+  R  +F  NLR+ +  Q  +  +  +
Sbjct: 42  DDLLNAEHH------FTSFKSKFGKTYATQEEHDYRFGVFKANLRRAKKHQMIDP-TAAH 94

Query: 87  GLNEFSDLSTAEFQAKYLGFK--LK-PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
           G+ +FSDL+  EF+ ++LG K  L+ P+ A+++   ++P   LP  +DWR++ AVT VKD
Sbjct: 95  GITKFSDLTPKEFRRQFLGLKRWLRLPTDANKA--PILPTTDLPTDYDWRDHGAVTEVKD 152

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSIS 194
           Q  CGS W+FS TG +EG +   T +L SLSEQ+L+DCD E         D GC+GG ++
Sbjct: 153 QGSCGSCWSFSATGALEGAHYLATGELASLSEQQLVDCDHECDPEEYGACDSGCDGGLMN 212

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
           NAF+  +    GGLE E  YPY G D   C+ +K      ++ +  VS DE  +A  LV+
Sbjct: 213 NAFEYALK--AGGLEREADYPYTGTDGGTCKFDKSKVVASVSNFSVVSIDEDQIAANLVK 270

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYW 312
           +GP++VAINA  +Q YV GVS P  + C   ++   H VL+VGYG         K  P+W
Sbjct: 271 HGPLSVAINAAFMQTYVGGVSCP--YIC---SKRQDHGVLLVGYGSAGYAPIRFKEKPFW 325

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           IIKNSWG+ WGE GY+++ RG   CG++  V +
Sbjct: 326 IIKNSWGQNWGENGYYKICRGRNICGVDSMVST 358


>gi|147809367|emb|CAN64491.1| hypothetical protein VITISV_015725 [Vitis vinifera]
          Length = 321

 Score =  237 bits (604), Expect = 7e-60,   Method: Compositional matrix adjust.
 Identities = 135/321 (42%), Positives = 185/321 (57%), Gaps = 24/321 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+E++ K Y++  EY  RL IF+ N+ +    Q  +  + ++G+  FSDLS  EF+ 
Sbjct: 7   FRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDPXA-LHGVTPFSDLSEEEFER 65

Query: 102 KYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
            + G   +P          A +    LP +FDWRE  AVT VK Q  CGS WAFSTTG +
Sbjct: 66  MFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTTGAV 125

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEE 210
           EG +   TKKL++LSEQ+L+DCD           D GCEGG ++NA+  ++    GGLEE
Sbjct: 126 EGAHFISTKKLLTLSEQQLVDCDHMCDIRDKXACDSGCEGGLMTNAYKYLIE--AGGLEE 183

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           E +YPY G    C+       V++  +  V  BE  +A  LV +GP+AV +NA  +Q Y+
Sbjct: 184 ESSYPYTGKHGECKFKPDRVAVRVVNFTEVPIBENQIAANLVCHGPLAVGLNAXFMQTYI 243

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDR---TKFTHKAVPYWIIKNSWGEGWGEKGY 327
            GVS P+   C      ++H VL+VGYG       +F +K  PYWIIKNSWG  WGE GY
Sbjct: 244 GGVSCPL--ICP--KRWINHGVLLVGYGAKGYSILRFGYK--PYWIIKNSWGXRWGEHGY 297

Query: 328 FRLYRGDGSCGINDYVRSALV 348
           +RL RG G CG+N  V SA+V
Sbjct: 298 YRLCRGHGMCGMNTMV-SAVV 317


>gi|71482944|gb|AAZ32411.1| cysteine proteinase glycinain type [Nicotiana benthamiana]
          Length = 355

 Score =  236 bits (603), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 133/325 (40%), Positives = 184/325 (56%), Gaps = 22/325 (6%)

Query: 32  HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
           HL + +H   F+ F  +  K YA+  E+  R  +F  NLR+ +  Q  +  S  +G+ +F
Sbjct: 41  HLLNAEHH--FSLFKSKFGKIYASEEEHDHRFKVFKANLRRARRHQLLD-PSAEHGITKF 97

Query: 92  SDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
           SDL+ +EF+  YLG  K KP       P ++P   LP  +DWR++ AVTGVK+Q  CGS 
Sbjct: 98  SDLTPSEFRRTYLGLHKPKPKLNAEKAP-ILPTSDLPADYDWRDHGAVTGVKNQGSCGSC 156

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIM 201
           W+FSTTG +EG +   T +LVSLSEQ+L+DCD E         D GC GG ++ AF+  +
Sbjct: 157 WSFSTTGAVEGAHFLATGELVSLSEQQLVDCDHECDPEQQDSCDAGCSGGLMTTAFEYTL 216

Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
               GGL+ EK YPY G    C  +K      +  +  +  DE  +A  LV++GP+AV I
Sbjct: 217 K--AGGLQREKDYPYTGKXGKCHFDKSKIAAAVTNFSVIGLDEDQIAANLVKHGPLAVGI 274

Query: 262 NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGE 320
           NA  +Q YV GVS P+  F     +   H VL+VGYG         K   YWIIKNSWGE
Sbjct: 275 NAAWMQTYVGGVSCPLICF-----KRQDHGVLLVGYGSHGFAPIRLKEKAYWIIKNSWGE 329

Query: 321 GWGEKGYFRLYRGDGSCGINDYVRS 345
            WGE GY+++ RG   CG++  V +
Sbjct: 330 NWGEHGYYKICRGHNICGVDAMVST 354


>gi|225448924|ref|XP_002266821.1| PREDICTED: cysteine proteinase 15A-like [Vitis vinifera]
          Length = 375

 Score =  236 bits (603), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 132/320 (41%), Positives = 184/320 (57%), Gaps = 23/320 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+E++ K Y++  EY  RL IF+ N+ +    Q  +  + ++G+  FSDLS  EF+ 
Sbjct: 61  FRMFMEKYGKEYSSREEYVHRLGIFAKNMVRAAEHQALDP-TALHGVTPFSDLSEEEFER 119

Query: 102 KYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
            + G   +P          A +    LP +FDWRE  AVT VK Q  CGS WAFSTTG +
Sbjct: 120 MFTGVVGRPHMKGGVAETAAALEVDGLPESFDWREKGAVTEVKMQGTCGSCWAFSTTGAV 179

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEE 210
           EG +   TKKL++LSEQ+L+DCD           D GCEGG ++NA+  ++    GGLEE
Sbjct: 180 EGAHFISTKKLLTLSEQQLVDCDHMCDIRDKTACDSGCEGGLMTNAYKYLIE--AGGLEE 237

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           E +YPY G    C+       V++  +  V  +E  +A  LV +GP+AV +NA  +Q Y+
Sbjct: 238 ESSYPYTGKHGECKFKPDRVAVRVVNFTEVPINENQIAANLVCHGPLAVGLNAIFMQTYI 297

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDR---TKFTHKAVPYWIIKNSWGEGWGEKGY 327
            GVS P+   C      ++H VL+VGYG       +F +K  PYWIIKNSWG+ WGE GY
Sbjct: 298 GGVSCPL--ICP--KRWINHGVLLVGYGAKGYSILRFGYK--PYWIIKNSWGKRWGEHGY 351

Query: 328 FRLYRGDGSCGINDYVRSAL 347
           +RL RG G CG+N  V + +
Sbjct: 352 YRLCRGHGMCGMNTMVSAVV 371


>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
          Length = 603

 Score =  236 bits (603), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 128/308 (41%), Positives = 178/308 (57%), Gaps = 11/308 (3%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           L+  F +++ KTY    + Y R  +F  NL +   LQ  E G+  YG+ +F DL++ EFQ
Sbjct: 306 LYEEFKQKYKKTYVNDDDEY-RFSVFKENLLRAHQLQTMEQGTAEYGVTQFFDLTSQEFQ 364

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            +YLGFK +       +      +    +FDWR++ AV  V DQ  CGS WAFST GNIE
Sbjct: 365 IQYLGFKYEDMQDTEEMSPSTRVVMDEDSFDWRDHGAVGPVLDQGKCGSCWAFSTIGNIE 424

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G +  KT +L+SLSEQ+LIDCD  D+GC GG     +  ++    GGLE    YPY+   
Sbjct: 425 GQWFLKTGELLSLSEQQLIDCDNVDEGCNGGYPPKTYGAVIKM--GGLELNSDYPYKALA 482

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           + C ++++  +V IN  V   R+E   A+ L   GP++ A+NA  L+FY TG+ H     
Sbjct: 483 EKCHMDRQKLKVYINDSVVFPRNEHLQAEALKLMGPLSSALNANPLKFYKTGIMHLPVAS 542

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           C      L+H+VL VGYG      T   +PYW +KNSWG  +GE GYFR+YRG G+CGIN
Sbjct: 543 C--FPRALNHAVLTVGYG------TENGLPYWTVKNSWGTAFGEDGYFRIYRGGGTCGIN 594

Query: 341 DYVRSALV 348
             V +A +
Sbjct: 595 RLVSTAAI 602



 Score =  157 bits (396), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 88/231 (38%), Positives = 123/231 (53%), Gaps = 10/231 (4%)

Query: 95  STAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           S  EF  KYLG +L     +  V            FDWR++ AV  V +Q  CGS WAFS
Sbjct: 8   SGEEFANKYLGVQLDELATEEEVDPEEDVTVADDNFDWRQHGAVGPVWNQGPCGSCWAFS 67

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIEG +  K+ +L+ LS Q+++DCD  D GC GG     +  +     GGL+ +  Y
Sbjct: 68  AVGNIEGQWFLKSGELLHLSVQQVLDCDHVDHGCNGGYPPQVYRQVNQM--GGLQLDADY 125

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
            Y+     C  ++   +  +N  V +S++E   A  L   GP+A  +NA  LQFY  G+ 
Sbjct: 126 SYKAAVGKCHTDRSKFRAYVNSSVILSQNEQFQANKLKTIGPLASTLNARTLQFYRKGIM 185

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
           HP    C+ G   L+H+VL VGYG      T + +PYWI+KNSW  G+GE+
Sbjct: 186 HPTPSACNPG--QLNHAVLTVGYG------TEQGMPYWIVKNSWSRGFGEQ 228


>gi|164605519|dbj|BAF98585.1| CM0216.510.nc [Lotus japonicus]
          Length = 360

 Score =  236 bits (601), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 137/336 (40%), Positives = 191/336 (56%), Gaps = 31/336 (9%)

Query: 24  VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTE 80
           VV  E L   HH      F  F  +  K Y +  E+  R ++F  N+   R+ QLL    
Sbjct: 33  VVDGEGLGAEHH------FLEFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRHQLLDP-- 84

Query: 81  HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
             S V+G+  FSDL+  EF+   LG +     +D     ++    LP+ FDWRE+ AVT 
Sbjct: 85  --SAVHGVTRFSDLTPMEFRHSVLGLRGVGLPSDADSAPILRTDNLPKDFDWREHGAVTP 142

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGG 191
           VK+Q  CG+ W+FS TG +EG +   T KLVSLSEQ+L+DCD E         D GC+GG
Sbjct: 143 VKNQGSCGACWSFSATGALEGAHFLSTGKLVSLSEQQLVDCDHECDPEEAGSCDSGCKGG 202

Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGD-DKACRLNKKATQVKINGYVSVSRDETDMAKY 250
            +++AF+ I++   GG+  E+ YPY G     C+ ++      +  +  VSRDE  +A  
Sbjct: 203 LMNSAFEYILNN--GGVMREEDYPYSGTAGGTCKFDQTKIAASVANFSVVSRDEDQIAAN 260

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
           LV+NGP+AVAINA  +Q YV GVS P  + C   ++ L+H VL+VGYG +       K  
Sbjct: 261 LVKNGPLAVAINAVYMQTYVGGVSCP--YVC---SKKLNHGVLLVGYGSESYAPIRMKQK 315

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 316 PYWIIKNSWGENWGENGYYKICRGRNVCGVDSMVST 351


>gi|30575716|gb|AAP33050.1| cysteine proteinase 3 [Clonorchis sinensis]
 gi|358339353|dbj|GAA47433.1| cathepsin F [Clonorchis sinensis]
          Length = 327

 Score =  236 bits (601), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 135/336 (40%), Positives = 191/336 (56%), Gaps = 14/336 (4%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           L   V++  F V+G   +    + +   L+  F  ++ K+Y+   + Y R  +F  NL +
Sbjct: 5   LCFLVALGFFGVLG-SNIPESENARQ--LYEEFKLKYKKSYSNDDDEY-RFRVFKDNLLR 60

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDW 132
           I+  Q+ E G+  YG+ +FSDL+  EF+ +YL  K      DR     I        FDW
Sbjct: 61  IKQFQNMERGTAKYGVTQFSDLTAQEFKVRYLRSKFGGVPVDREPVPFIRMDVDDDNFDW 120

Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS 192
           R + AV  V DQ  CGS WAFS  GNIEG +  KT  L+ LSEQ+L+DCD+ D+GC GG+
Sbjct: 121 RNHGAVGPVLDQGDCGSCWAFSAVGNIEGQWFRKTDNLLQLSEQQLLDCDEVDEGCNGGT 180

Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
              AF  I+    GGL+ +  YPY G +  CR+     +V ING   +  DE   A+ L 
Sbjct: 181 PQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLK 238

Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
           E GP++ A+NA  LQFY  G+ HP+   CD   ++L+H+VL VGYG          +PYW
Sbjct: 239 ETGPLSSALNALFLQFYTEGILHPLPALCDA--QSLNHAVLTVGYG------KEGRLPYW 290

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
            +KNSW   +GE GYFR+YRGDG+CGIN  V ++++
Sbjct: 291 TVKNSWSTMFGENGYFRIYRGDGTCGINTLVSTSII 326


>gi|118429521|gb|ABK91808.1| cysteine proteinase prozyme precursor [Clonorchis sinensis]
          Length = 316

 Score =  235 bits (599), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 130/308 (42%), Positives = 180/308 (58%), Gaps = 11/308 (3%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           L+  F  ++ K+Y+   + Y R  +F  NL +I+  Q+ E G+  YG+ +FSDL+  EF+
Sbjct: 19  LYEEFKLKYKKSYSNDDDEY-RFRVFKDNLLRIKQFQNMERGTAKYGVTQFSDLTAQEFK 77

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            +YL  K      DR     I        FDWR + AV  V DQ  CGS WAFS  GNIE
Sbjct: 78  VRYLRSKFGGVPVDREPVPFIRMDVDDDNFDWRNHGAVGPVLDQGDCGSCWAFSAVGNIE 137

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G +  KT  L+ LSEQ+L+DCD+ D+GC GG+   AF  I+    GGL+ +  YPY G +
Sbjct: 138 GQWFRKTDNLLQLSEQQLLDCDEVDEGCNGGTPQQAFKQILGM--GGLQLDSDYPYEGRE 195

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
             CR+     +V ING   +  DE   A+ L E GP++ A+NA  LQFY  G+ HP+   
Sbjct: 196 GQCRMVPSKVKVYINGSKILPEDEQIQAQMLKETGPLSSALNALFLQFYTEGILHPLPAL 255

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           CD   ++L+H+VL VGYG          +PYW +KNSW   +GE GYFR+YRGDG+CGIN
Sbjct: 256 CDA--QSLNHAVLTVGYG------KEGRLPYWTVKNSWSTMFGENGYFRIYRGDGTCGIN 307

Query: 341 DYVRSALV 348
             V ++++
Sbjct: 308 TLVSTSII 315


>gi|25956267|dbj|BAC41322.1| hypothetical protein [Lotus japonicus]
          Length = 358

 Score =  235 bits (599), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 140/336 (41%), Positives = 193/336 (57%), Gaps = 31/336 (9%)

Query: 24  VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTE 80
           VV DE L   HH      F  F  +  K YAT  E+  R ++F  N+   R+ QLL    
Sbjct: 33  VVDDEGLGAEHH------FLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-- 84

Query: 81  HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTG 140
             S V+G+ +FSDL+  EFQ   LG +     +D     ++P   LP+ FDWR + AVT 
Sbjct: 85  --SAVHGVTQFSDLTPMEFQHSVLGLRGVGLPSDADSAPILPTDNLPKDFDWRGHGAVTP 142

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS-------- 192
           VK+Q  CGS W+FS TG +EG +   T +LVSLSEQ+L+DCD + D  E GS        
Sbjct: 143 VKNQGSCGSCWSFSATGALEGAHFLSTGELVSLSEQQLVDCDHQCDPEEAGSCGSGCNGG 202

Query: 193 -ISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYVSVSRDETDMAKY 250
            +++AF+ I++   GG+  E+ YPY G +   C+ +K      +  +  VSRDE  +A  
Sbjct: 203 LMNSAFEYILNN--GGVMREEDYPYSGTNGGTCKFDKAKIAASVANFSVVSRDEDQIAAN 260

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
           LV+NGP+AVAINA  +Q YV GVS P  + C   ++ L+H VL+VGYG +       K  
Sbjct: 261 LVKNGPLAVAINAVYMQTYVGGVSCP--YVC---SKKLNHGVLLVGYGSESYAPIRMKQK 315

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 316 PYWIIKNSWGENWGENGYYKICRGRNICGVDSMVST 351


>gi|358339355|dbj|GAA47435.1| cathepsin F [Clonorchis sinensis]
          Length = 1157

 Score =  235 bits (599), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 123/301 (40%), Positives = 181/301 (60%), Gaps = 11/301 (3%)

Query: 35  HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
            + H  L   F  + +  Y T+   +  L     N+++ +  Q  E G+ +YG+ +FSDL
Sbjct: 620 EMNHAGLAVGFGFEQDVPYWTIKNSWGMLWGEEDNIKQAEFYQTLERGTALYGVTQFSDL 679

Query: 95  STAEFQAKYLGFKLKPSYA-DRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           +  EFQ  +LG +L   Y+  +S      ++++P  +DWR Y AV  V DQ  CGS WAF
Sbjct: 680 TGEEFQETFLGLRLDEQYSKSQSYVKKKHSVSIPENYDWRPYGAVGPVLDQGHCGSCWAF 739

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +  KT +LVSLS+Q+L+DCD+   GC GG     +D+I  +  GGLE E  
Sbjct: 740 SVIGNIEGQWFRKTGQLVSLSKQQLVDCDRSSRGCGGGYPPATYDSI--RRIGGLEIELD 797

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
           Y Y G D  C  N +     +N  V++++DE  +A++L  +GP+++A+NA  LQFYV+G+
Sbjct: 798 YRYTGRDGVCHQNPRKFVAYVNSSVALTKDENTIAEWLSYHGPISMALNARLLQFYVSGI 857

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
            HP   +C    +++SH+VL VG+G      T   VP+WI+KNSWG  WGE+GYFR+YRG
Sbjct: 858 MHPPAAYCP--VKDISHAVLSVGFG------TKGNVPFWIVKNSWGTLWGEEGYFRIYRG 909

Query: 334 D 334
           D
Sbjct: 910 D 910



 Score =  182 bits (462), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 99/229 (43%), Positives = 134/229 (58%), Gaps = 11/229 (4%)

Query: 98  EFQAKYLGFKLKPSYADRSVPAMIPNITLPR-AFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           EF+A YL         ++S       +  P+ +FDWR+Y AV  V DQ  CG+SWAFS  
Sbjct: 434 EFKALYLTAMYDHRKLNQSKTTEPETVGEPQDSFDWRDYGAVGPVLDQDRCGASWAFSAI 493

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIEG Y  +  +L+SLSEQ+L+DCD+ D GC GG+   AF+ I     GGLE E  YPY
Sbjct: 494 GNIEGQYFMRVHRLLSLSEQQLVDCDRIDQGCAGGTPYGAFEGIQQL--GGLELEADYPY 551

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
            G    C+ N     V ING V + +DE  +A+YL ++GP++V IN   LQ+Y +G+  P
Sbjct: 552 LGHQDNCQSNPLRFVVSINGSVQLPKDEDQIAQYLFDHGPLSVGINGALLQYYSSGIMQP 611

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
           +   C+    N  H+ L VG+G ++       VPYW IKNSWG  WGE+
Sbjct: 612 LWDNCNPAEMN--HAGLAVGFGFEQD------VPYWTIKNSWGMLWGEE 652



 Score =  146 bits (369), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 79/205 (38%), Positives = 113/205 (55%), Gaps = 31/205 (15%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           LP  FDWREY AV  V++Q  CGS WA S                      E++DCD  D
Sbjct: 218 LPSYFDWREYGAVGPVRNQGQCGSCWAISA---------------------EVVDCDHAD 256

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
            GC GG   +A++ +     GGLE    YPY G  + C+ + +     ING V++ +D  
Sbjct: 257 HGCSGGFPIHAYECVQRL--GGLELAVRYPYVGYQQYCQADPRYFVAYINGSVALPKDSE 314

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +AK+L   GP++V ++A  LQ+Y +G+ +P   +C+   E L+H+VL VG+G      T
Sbjct: 315 QIAKFLATFGPLSVVLDARLLQYYRSGILNPSVAYCN--PEELNHAVLSVGFG------T 366

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRL 330
            + +PYWIIKNSWGE WGE+   +L
Sbjct: 367 EQGIPYWIIKNSWGEQWGEQHLTKL 391



 Score =  134 bits (336), Expect = 8e-29,   Method: Compositional matrix adjust.
 Identities = 87/223 (39%), Positives = 117/223 (52%), Gaps = 16/223 (7%)

Query: 61   SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG--FKLKPSYADRSVP 118
            + L  +S  LR+ QL ++ +   G    NE        F   YLG  F  +PS A   V 
Sbjct: 940  TSLAEYSRELRERQLYEEFKLNYGKVYENE------GMFYFLYLGARFDREPSRAGSMVV 993

Query: 119  AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
              +  I  P  FDWRE  AV  ++DQ  CGS WAFST GNIEG +  KT +L++LSEQ+L
Sbjct: 994  DDLGEI--PERFDWRELGAVGPIQDQGDCGSCWAFSTIGNIEGQWFKKTGQLLTLSEQQL 1051

Query: 179  IDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYV 238
            IDCD  DDGC GG   + +  I+    GGLE    YPY   D  C++ +   +  +N  +
Sbjct: 1052 IDCDSVDDGCGGGYPPDTYGDIVKM--GGLELNADYPYIAADGVCKMERSKFRAYVNKSL 1109

Query: 239  SVSRDETDMAKYLVENGPMAVAINAYALQ----FYVTGVSHPI 277
             +   E   A +L +NGP++  INA  LQ    FY   V+ PI
Sbjct: 1110 VLPTKEDQQAVWLSKNGPLSAGINADYLQVVILFYERSVNGPI 1152



 Score =  114 bits (285), Expect = 6e-23,   Method: Compositional matrix adjust.
 Identities = 58/150 (38%), Positives = 90/150 (60%), Gaps = 10/150 (6%)

Query: 176 QELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKIN 235
           Q+L+DCD  D GCEGG   +AF  +     GGL+    YPY    +AC+ N K     + 
Sbjct: 23  QQLVDCDHVDRGCEGGFPLDAFMAVQRL--GGLQLSIDYPYIASRQACQFNPKQAVAFVT 80

Query: 236 GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
           G+ ++ R+E  +A+YL  NGP++V +N+  L+FY +G+ +     CD   E L+H+ L V
Sbjct: 81  GFAALPRNELLIAEYLHRNGPLSVGLNSRTLKFYNSGILNLAAEQCD--PEALNHAALAV 138

Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
           G+G D      ++ P+WIIKN++G+ WGE+
Sbjct: 139 GFGTD------ESTPFWIIKNTFGKDWGEQ 162


>gi|116242322|gb|ABJ89818.1| cysteine proteinase 3 [Clonorchis sinensis]
          Length = 327

 Score =  234 bits (597), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 135/336 (40%), Positives = 190/336 (56%), Gaps = 14/336 (4%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           L   V++  F V+G   +    + +   L+  F  ++ K+Y+   + Y R  +F  NL +
Sbjct: 5   LCFLVALGFFGVLG-SNIPESENARQ--LYEEFKLKYKKSYSNDDDEY-RFRVFKDNLLR 60

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDW 132
           I+  Q+ E G+  YG+ +FSDL+  EF+ +YL  K      DR     I        FDW
Sbjct: 61  IKQFQNMERGTAKYGVTQFSDLTAQEFKVRYLRSKFGGVPVDREPVPFIRMDVDDDNFDW 120

Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS 192
           R + AV  V DQ  CGS WAFS  GNIEG +  KT  L+ LSEQ+L+DCD  D+GC GG+
Sbjct: 121 RNHGAVGPVLDQGDCGSCWAFSAVGNIEGQWFRKTDNLLQLSEQQLLDCDGVDEGCNGGT 180

Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
              AF  I+    GGL+ +  YPY G +  CR+     +V ING   +  DE   A+ L 
Sbjct: 181 PQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLK 238

Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
           E GP++ A+NA  LQFY  G+ HP+   CD   ++L+H+VL VGYG          +PYW
Sbjct: 239 ETGPLSSALNALFLQFYTEGILHPLPALCDA--QSLNHAVLTVGYG------KEGRLPYW 290

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
            +KNSW   +GE GYFR+YRGDG+CGIN  V ++++
Sbjct: 291 TVKNSWSTMFGENGYFRIYRGDGTCGINTLVSTSII 326


>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
          Length = 374

 Score =  234 bits (597), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 128/315 (40%), Positives = 181/315 (57%), Gaps = 19/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
            + F  +  K+Y +  E+  R  +F  NLR+    Q  +  +  +G+ +FSDL++AEF+ 
Sbjct: 59  LSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDP-TASHGVTQFSDLTSAEFRK 117

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           + LG +      D +   ++P   LP  FDWRE  AV  VK+Q  CGS W+FSTTG +EG
Sbjct: 118 QVLGLRKLRLPKDANKAPILPTNDLPEDFDWREKGAVGPVKNQGSCGSCWSFSTTGALEG 177

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +   T +LVSLSEQ+L+DCD E         D GC GG +++AF+  +    GGL  E+
Sbjct: 178 AHFLATGELVSLSEQQLVDCDHECDPEEPGSCDSGCNGGLMNSAFEYTLK--AGGLMREE 235

Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D+ AC+ +K      +  +  VS DE  +A  LV+NGP+AVA NA  +Q Y+ 
Sbjct: 236 DYPYTGMDRGACKFDKDKVAAGVANFSVVSLDEDQIAANLVKNGPLAVATNAVFMQTYIG 295

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P  + C   +  L H VL+VGYG         K  PYWIIKNSWGE WGE G++++
Sbjct: 296 GVSCP--YIC---SRRLDHGVLLVGYGSAGYAPVRMKEKPYWIIKNSWGESWGENGFYKI 350

Query: 331 YRGDGSCGINDYVRS 345
            RG   CG++  V +
Sbjct: 351 CRGRNICGVDSMVST 365


>gi|351693703|gb|AEQ59229.1| cysteine protease precursor [Clonorchis sinensis]
          Length = 327

 Score =  234 bits (597), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 134/336 (39%), Positives = 190/336 (56%), Gaps = 14/336 (4%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           L   V++  F V+G   +    + +   L+  F  ++ K+Y+   + Y R  +F  NL +
Sbjct: 5   LCFLVALGFFGVLG-SNIPESENARQ--LYEEFKLKYKKSYSNDDDEY-RFRVFKDNLLR 60

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDW 132
           I+  Q+ E G+  YG+ +FSDL+  EF+ +YL  K      DR     I        FDW
Sbjct: 61  IKQFQNMERGTAKYGVTQFSDLTAQEFKVRYLRSKFGGVPVDREPVPFIRMDVDDDNFDW 120

Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS 192
           R + AV  V D+  CGS WAFS  GNIEG +  KT  L+ LSEQ+L+DCD+ D+GC GG+
Sbjct: 121 RNHGAVGPVLDKGDCGSCWAFSAVGNIEGQWFRKTDNLLQLSEQQLLDCDEVDEGCNGGT 180

Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
              AF  I+    GGL+ +  YPY G +  CR+     +V ING   +  DE   A+ L 
Sbjct: 181 PQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLK 238

Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
           E GP + A+NA +LQFY  G+ HP+   CD   ++L+H+VL VGYG          +PYW
Sbjct: 239 ETGPFSSALNALSLQFYTEGILHPLPALCDA--QSLNHAVLTVGYG------KEGRLPYW 290

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
            +KNSW   +GE GYFR+YRGDG CGIN  V ++++
Sbjct: 291 TVKNSWSTMFGENGYFRIYRGDGPCGINTLVSTSII 326


>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
          Length = 328

 Score =  234 bits (597), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 128/310 (41%), Positives = 179/310 (57%), Gaps = 13/310 (4%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           + +YL  +          P+   ++T+    FDWRE+ AV  V DQ  CGS WAFS  GN
Sbjct: 89  KTRYLRMRFDGPIVSED-PSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGN 147

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG +  KT  L++LSEQ+L+DCD  + GC GG     +  I     GGLE    YPY G
Sbjct: 148 VEGQWFRKTGDLLALSEQQLVDCDHLEKGCNGGYPPKTYGEIEKM--GGLELASDYPYTG 205

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
            D  C +N+      +N    +   E   A+ L E GP++ A+NA  LQFY+ G+  PI 
Sbjct: 206 VDGICYMNQSKFVAYVNDSTVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPIP 265

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
           F C+     L+H+VL VGYG      T   +PYWI+KNS G G+GEKGYFR++RG G+CG
Sbjct: 266 FLCN--PHGLNHAVLTVGYG------TEFGIPYWIVKNSLGVGFGEKGYFRIFRGAGTCG 317

Query: 339 INDYVRSALV 348
           IN  V +A++
Sbjct: 318 INLVVSTAII 327


>gi|322801532|gb|EFZ22193.1| hypothetical protein SINV_14496 [Solenopsis invicta]
          Length = 781

 Score =  234 bits (596), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 135/274 (49%), Positives = 179/274 (65%), Gaps = 10/274 (3%)

Query: 30  LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
           L     V+   LF+ F+  +N+TY++  E   RL IF  NL  I+LLQ TE  +G YG+N
Sbjct: 512 LQIAEDVRTERLFDDFVATYNRTYSSPDERNLRLQIFRENLGIIELLQKTEQATGRYGVN 571

Query: 90  EFSDLSTAEFQAKYLGFKLKPSY-ADRSVP---AMIPNITLPRAFDWREYDAVTGVKDQT 145
            F+D+S  EF+ +YLG  L+P   ++  +P   A  PNI LP  FDWR+   VT VK+Q 
Sbjct: 572 MFADMSREEFRTRYLG--LRPDLQSENEIPLQEAKFPNIELPPTFDWRKKGVVTPVKNQG 629

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLG 205
            CGS WAFS TGN+EG YA K  +L+SLSEQEL+DCD  DDGC GG   NA+  I  KL 
Sbjct: 630 GCGSCWAFSVTGNVEGQYAIKHGQLLSLSEQELVDCDDLDDGCGGGLPDNAYRAI-EKL- 687

Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA 265
           GGLE E  YPY  +++ C   K   +V++   V+V+ DET MA++LV+NGP+++ INA A
Sbjct: 688 GGLELESDYPYEAENEKCHFKKNLVKVELTSAVNVTSDETQMAQWLVQNGPISIGINANA 747

Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
           +QFY+ GVSHP +F C+   +NL H VLIVGYG 
Sbjct: 748 MQFYMGGVSHPFKFLCNP--KNLDHGVLIVGYGT 779


>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
 gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
          Length = 326

 Score =  233 bits (595), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 129/310 (41%), Positives = 181/310 (58%), Gaps = 15/310 (4%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ  E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFRIFKDNLERAKRLQAMEQGTAEYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLPRA-FDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           + +YL  +      +   P    ++T+  + FDWR++ AV  V DQ  CGS WAFS  GN
Sbjct: 89  KTRYLRMRFDEPIVNED-PTPQEDVTMDNSNFDWRDHGAVGPVLDQGDCGSCWAFSVIGN 147

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG +  KT  L+ LSEQ+LIDCD  D GC+GG     +  I     GGLE    YPY G
Sbjct: 148 VEGQWFRKTGDLLGLSEQQLIDCDHSDQGCDGGYPPQTYSAIEEM--GGLELRSDYPYTG 205

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
            D  C +++      +NG   +   E   AK L E GP++  +NA  LQ Y  G+  P  
Sbjct: 206 KDGICYMDQSKFVAYVNGSTRLPWCEKTQAKSLKEIGPLSSGLNAVLLQLYKRGIMRPR- 264

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
            +C+     L+H+VL VGYG++     H+ +PYWI+KNSWG+ +GEKGYFR+YRGDG+CG
Sbjct: 265 -WCNPA--ELNHAVLTVGYGME-----HR-MPYWIVKNSWGKRFGEKGYFRIYRGDGTCG 315

Query: 339 INDYVRSALV 348
           IN  V +A+V
Sbjct: 316 INRAVTTAVV 325


>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
 gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
 gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
          Length = 381

 Score =  233 bits (595), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 129/324 (39%), Positives = 184/324 (56%), Gaps = 27/324 (8%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A F  F  +  +TY    E   R+ +F+ NLR+ +  Q  +  +  +G+ +FSDL+  EF
Sbjct: 56  AHFASFERRFGRTYRDAGERAYRMSVFAANLRRARRHQRLDP-TATHGVTKFSDLTPGEF 114

Query: 100 QAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           + ++LG + +PS       +     ++P   LP  FDWRE+ AV  VKDQ  CGS W+FS
Sbjct: 115 RDRFLGLR-RPSLEGLVGGEPHEAPILPTDGLPDDFDWREHGAVGPVKDQGSCGSCWSFS 173

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLG 205
           T+G +EG +   T KL  LSEQ+++DCD E         D GC GG ++ AF  +M    
Sbjct: 174 TSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGCNGGLMTTAFSYLMKS-- 231

Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA 265
           GGL+ EK YPY G +  C+ +K     ++  +  +S +E  +A  LV++GP+A+AINA  
Sbjct: 232 GGLQSEKDYPYAGRENTCKFDKSKIVAQVKNFSVISVNEDQIAANLVKHGPLAIAINAAY 291

Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGE 324
           +Q Y+ GVS P  F C     +L H VL+VGYG         K  PYWIIKNSWGE WGE
Sbjct: 292 MQTYIGGVSCP--FIC---GRHLDHGVLLVGYGSAGYAPIRFKEKPYWIIKNSWGENWGE 346

Query: 325 KGYFRLYRG---DGSCGINDYVRS 345
           KGY+++ RG      CG++  V S
Sbjct: 347 KGYYKICRGPHDKNKCGVDSMVSS 370


>gi|281209544|gb|EFA83712.1| cysteine proteinase 1 [Polysphondylium pallidum PN500]
          Length = 465

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 135/309 (43%), Positives = 171/309 (55%), Gaps = 23/309 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD---TEHGSGVYGLNEFSDLSTAE 98
           F  F  ++NK Y T  EY  R   F  NL+ I        +   S  +G+NEF+DLS +E
Sbjct: 28  FRQFQIKYNKQY-TSSEYAERFATFKSNLKVIDEKNRDAASRKSSVRFGVNEFADLSQSE 86

Query: 99  FQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           F+A YL         + +V A +P   LP AFDWR   AVTGVK+Q  CGS W+FSTTGN
Sbjct: 87  FRATYLNSVQAVRDPNAAVAADLPVEDLPTAFDWRTKGAVTGVKNQGQCGSCWSFSTTGN 146

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEGGSISNAFDTIMSKLGGGL 208
           +EG +      L  LSEQ L+DCD E          D GC GG   NA+  I+    GG+
Sbjct: 147 VEGQWFLAGNTLTGLSEQNLVDCDHECMEYLGDNVCDQGCNGGLQPNAYTYIIK--NGGI 204

Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           + E +YPY+G D  C         KI+ +  VS +ET MA YLV NGP+A+A +A   QF
Sbjct: 205 DTEASYPYQGVDGTCSFKAANIGAKISNWTYVSSNETQMAAYLVANGPLAIAADAVEWQF 264

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y+ GV     F    GN  L H +LIVGY  + T F HK   YWI+KNSWG  WGE+GY 
Sbjct: 265 YLGGV-----FDVPCGN-TLDHGILIVGYSAENTIF-HKDKAYWIVKNSWGATWGEQGYI 317

Query: 329 RLYRGDGSC 337
            + RG+G C
Sbjct: 318 YISRGNGEC 326


>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 330

 Score =  233 bits (593), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 140/342 (40%), Positives = 188/342 (54%), Gaps = 32/342 (9%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           VALL+  V  + F  + D+ +         A F  F + +NK Y++   Y +RL IF  N
Sbjct: 7   VALLAACV-FARFSTMQDQDI--------AAAFKKFTQTYNKKYSSEEHYNARLSIFKEN 57

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRA 129
           LR+I+L    +     +G+ +F+DL+  EF   YLG+K +   +   V       T P A
Sbjct: 58  LRRIELFNKNDEAQ--HGITQFADLTHEEFADMYLGYKPQLRNSQAKVSLSSTPFTAPTA 115

Query: 130 FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK-LVSLSEQELIDCD-QEDDG 187
            DW    AVT VK+Q  CGS WAFSTTG+IEG Y  + K+ L S SEQ+L+DCD +ED G
Sbjct: 116 IDWTTKGAVTPVKNQGSCGSCWAFSTTGSIEGQYVLQLKQNLTSFSEQQLVDCDTKEDQG 175

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYV------SVS 241
           C GG + NAF  + S     LE E  YPY   D +C+ N+    V +  +V      +V+
Sbjct: 176 CNGGLMDNAFTYLES---AKLETESAYPYTAVDGSCKYNQSLGVVGVASFVDIEQGKTVA 232

Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
             E  M   L   GP++VAINA  LQFY  G+S+P+   C+     L+H VLIVG G + 
Sbjct: 233 DTENTMGVALDNIGPLSVAINANNLQFYAGGISNPL--ICNP--NGLNHGVLIVGLGSEN 288

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
            K       +W +KNSWG  WGEKGYFR+ RG G CGIN  V
Sbjct: 289 GK------DFWKVKNSWGASWGEKGYFRIVRGKGKCGINRAV 324


>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
          Length = 326

 Score =  232 bits (592), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 131/311 (42%), Positives = 179/311 (57%), Gaps = 17/311 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFTLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           + +YL  +   P  ++   P    ++T+    FDWRE+ AV  V DQ  CGS WAFS  G
Sbjct: 89  KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           N+ G +  KT  L++LSEQ+L+DCD  DDGC+GG     +  I     GGLE    YPY 
Sbjct: 147 NVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G    C ++K      +NG   +   E   A+ L   GP++ A+NA  LQ Y  G+  P 
Sbjct: 205 GVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
             +CD    N  H+VL VGYGV   K      PYWI+KNSWGE +GEKGYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEKGYFRIYRGDGTC 314

Query: 338 GINDYVRSALV 348
           GIN  V +A++
Sbjct: 315 GINSIVTTAII 325


>gi|449516391|ref|XP_004165230.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 387

 Score =  231 bits (589), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 142/375 (37%), Positives = 204/375 (54%), Gaps = 44/375 (11%)

Query: 4   FYFFAGVALLSLTVSVSSFMVV-------GDEKL-----------HHLHHVKHTALFNYF 45
           F+ FA +  ++ T+  S  +V        GD  +           HH    +H   F+ F
Sbjct: 5   FFLFAVITAVTATLCSSEPLVSQHSVEHDGDPLIRQVVENDGDFNHHALGAEHH--FSLF 62

Query: 46  LEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG 105
             +  K+YAT  E+  R  IF  N+R+ +  Q  +  S ++G+ +FSDL+  EF+  +LG
Sbjct: 63  KRRFGKSYATEEEHDRRFKIFKANMRRAERHQSFD-PSAIHGVTQFSDLTPFEFRKAFLG 121

Query: 106 FK---LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
            +   L+      + P ++P   LP  FDWR++  VT VK+Q  CGS W+FSTTG +EG 
Sbjct: 122 LRGHRLRLPVDTNAAP-ILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGA 180

Query: 163 YAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
               T +LVSLSEQ+L+DCD E         D GC GG +++AF+  +    GGL +E+ 
Sbjct: 181 NFLATGELVSLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLK--AGGLMKEQD 238

Query: 214 YPYRGDDK-ACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYALQFYVT 271
           YPY G D+  C  +K      I  +  V S DE  +A  LV+NGP+A+AINA  +Q Y+ 
Sbjct: 239 YPYAGIDRNTCNFDKSKIAASIANFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIG 298

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P  F C   ++ L H VL+VGYG         +   YWIIKNSWGE WGE GY+++
Sbjct: 299 GVSCP--FIC---SKRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGENGYYKI 353

Query: 331 YRGDGSCGINDYVRS 345
            RG   CG++  V +
Sbjct: 354 CRGRNICGVDSLVST 368


>gi|1185457|gb|AAA87848.1| cathepsin L, partial [Schistosoma japonicum]
          Length = 224

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 116/223 (52%), Positives = 149/223 (66%), Gaps = 9/223 (4%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           +P  FDWRE  AVT VK+Q MCGS WAFSTTGNIE  +  KT KL+SLSEQ+L+DCD  D
Sbjct: 10  IPNNFDWREKGAVTEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQQLVDCDSLD 69

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
           DGC GG  SNA+++I+    GGL  E  YPY   ++ C L        IN  V++++DE+
Sbjct: 70  DGCNGGLPSNAYESIIRM--GGLMLEDNYPYDAKNEKCHLKVGNVAAYINSSVNLTQDES 127

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
           ++A +L  +  ++V +NA  LQFY  G+SHP   FC      L H+VL+VGYGV     +
Sbjct: 128 ELAIWLYHHSAISVGMNALLLQFYRHGISHPWWIFCS--KYLLDHAVLLVGYGV-----S 180

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
            K  P+WI+KNSWG  WGEKGYFR+YRGDG+CGIN    SAL+
Sbjct: 181 EKNEPFWIVKNSWGVEWGEKGYFRMYRGDGTCGINTGATSALI 223


>gi|224082940|ref|XP_002306900.1| predicted protein [Populus trichocarpa]
 gi|118481986|gb|ABK92924.1| unknown [Populus trichocarpa]
 gi|222856349|gb|EEE93896.1| predicted protein [Populus trichocarpa]
          Length = 367

 Score =  231 bits (589), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 136/331 (41%), Positives = 194/331 (58%), Gaps = 32/331 (9%)

Query: 32  HLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL---RKIQLLQDTEHGSGVYGL 88
           HL + +H   F  F  +  K YAT  E+  R  +F  NL   +K Q++  T      +G+
Sbjct: 43  HLLNAEHH--FTTFKSKFGKNYATQEEHDYRFSVFKANLLRAKKHQIMDPT----AAHGV 96

Query: 89  NEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQT 145
            +FSDL+  EF+ + LG K +   P+ A+++   ++P   LP  FDWR++ AVT VKDQ 
Sbjct: 97  TKFSDLTPKEFRRQLLGLKRRLRLPTDANKA--PILPTGDLPTDFDWRDHGAVTSVKDQG 154

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNA 196
            CGS W+FS TG +EG +   T +LVSLSEQ+L+DCD E         D GC GG ++NA
Sbjct: 155 SCGSCWSFSATGALEGAHYLATGELVSLSEQQLVDCDHECDPEEYGACDSGCSGGLMNNA 214

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
           F+  +    GGLE EK YPY G+D+ AC+  K      ++ +  VS DE  +A  LV++G
Sbjct: 215 FEYALK--AGGLEREKDYPYTGNDRGACKFEKSKVAASVSNFSVVSLDEDQIAANLVKHG 272

Query: 256 PMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWII 314
           P++VAINA  +Q Y+ GVS P  + C   +++  H VL+VGYG         K  P+WII
Sbjct: 273 PLSVAINAVFMQTYIGGVSCP--YIC---SKHQDHGVLLVGYGAAGYAPIRFKEKPFWII 327

Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           KNSWGE WGE GY+++ R    CG++  V +
Sbjct: 328 KNSWGENWGENGYYKICRARNICGVDSMVST 358


>gi|1617037|emb|CAA26255.1| cysteine proteinase I precursor [Dictyostelium discoideum]
          Length = 343

 Score =  231 bits (588), Expect = 5e-58,   Method: Compositional matrix adjust.
 Identities = 137/355 (38%), Positives = 192/355 (54%), Gaps = 36/355 (10%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
            L   TV VSS  +  +E+   L           F ++ NK Y+   EY  R  IF  NL
Sbjct: 8   VLAVFTVFVSSRGIPPEEQSQFLE----------FQDKFNKKYSH-EEYLERFEIFKSNL 56

Query: 71  RKIQ---LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN---I 124
            KI+   L+         +G+N+F+DLS+ EF+  YL  K      D  V   + +    
Sbjct: 57  GKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFIN 116

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
           ++P AFDWR   AVT VK+Q  CGS W+FSTTGN+EG +     KLVSLSEQ L+DCD E
Sbjct: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176

Query: 185 ----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVK 233
                     D+GC GG   NA++ I+    GG++ E +YPY  +    C  N      K
Sbjct: 177 CMEYEGEEACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETGTQCNFNSANIGAK 234

Query: 234 INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           I+ +  + ++ET MA Y+V  GP+A+A +A   QFY+ GV     F       +L H +L
Sbjct: 235 ISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGV-----FDIPCNPNSLDHGIL 289

Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           IVGY    T F  K +PYWI+KNSWG  WGE+GY  L RG  +CG++++V ++++
Sbjct: 290 IVGYSAKNTIF-RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343


>gi|449487301|ref|XP_004157559.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score =  230 bits (587), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 132/325 (40%), Positives = 184/325 (56%), Gaps = 28/325 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+E++ K+Y T  EY  R  IF  NL +    Q  +  + V+G+ +FSDLS  EF+ 
Sbjct: 89  FVMFMEKYGKSYPTRKEYLHRFGIFVKNLIRAAEHQALDP-TAVHGVTQFSDLSEEEFER 147

Query: 102 KYLGFKLKPSYADRSVPAMIPNIT--------LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
            ++   ++       +P M   +         LP  FDWR+  AVT VK Q  CGS WAF
Sbjct: 148 MFM--GVRGGAGGEGLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAF 205

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD---------GCEGGSISNAFDTIMSKL 204
           ST G +EG     T  L++LSEQ+L+DCD   D         GC GG ++NA+  ++   
Sbjct: 206 STCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQS- 264

Query: 205 GGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAY 264
            GGLEEE +YPY G    C        VK++ + ++  DE  +A +LV +GP+AV +NA 
Sbjct: 265 -GGLEEESSYPYTGRSGQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAV 323

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWG 323
            +Q Y+ GVS P+   C  G   ++H VL+VGYG +  +    + +PYW+IKNSWGE WG
Sbjct: 324 FMQTYIGGVSCPL--IC--GKRFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWG 379

Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
           E GY+RL RG G CGIN  V SA+V
Sbjct: 380 EHGYYRLCRGHGMCGINTMV-SAVV 403


>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
          Length = 326

 Score =  230 bits (587), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 131/311 (42%), Positives = 178/311 (57%), Gaps = 17/311 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           + +YL  +   P  ++   P    ++T+    FDWRE+ AV  V DQ  CGS WAFS  G
Sbjct: 89  KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           N+EG +  KT  L++LSEQ+L+DCD  D GC+GG     +  I     GGLE    YPY 
Sbjct: 147 NVEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G    C ++K      ING   +   E   A+ L   GP++ A+NA  LQ Y  G+  P 
Sbjct: 205 GVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP- 263

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
              CD    N  H+VL VGYGV   K      PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 264 -RLCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314

Query: 338 GINDYVRSALV 348
           GIN  V +A++
Sbjct: 315 GINSIVTTAII 325


>gi|449449489|ref|XP_004142497.1| PREDICTED: cysteine proteinase 15A-like [Cucumis sativus]
          Length = 406

 Score =  230 bits (587), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 132/325 (40%), Positives = 184/325 (56%), Gaps = 28/325 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+E++ K+Y T  EY  R  IF  NL +    Q  +  + V+G+ +FSDLS  EF+ 
Sbjct: 89  FVMFMEKYGKSYPTRKEYLHRFGIFVKNLIRAAEHQALDP-TAVHGVTQFSDLSEEEFER 147

Query: 102 KYLGFKLKPSYADRSVPAMIPNIT--------LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
            ++   ++       +P M   +         LP  FDWR+  AVT VK Q  CGS WAF
Sbjct: 148 MFM--GVRGGAGGEGLPEMNQAVEVTAEEVKGLPERFDWRDKGAVTEVKMQGTCGSCWAF 205

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD---------GCEGGSISNAFDTIMSKL 204
           ST G +EG     T  L++LSEQ+L+DCD   D         GC GG ++NA+  ++   
Sbjct: 206 STCGAVEGANFIATGNLLNLSEQQLVDCDHTCDPTDKTACNNGCNGGLMTNAYKYLIQS- 264

Query: 205 GGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAY 264
            GGLEEE +YPY G    C        VK++ + ++  DE  +A +LV +GP+AV +NA 
Sbjct: 265 -GGLEEESSYPYTGRSGQCNFQSDKIAVKVSNFTTIPIDENQIAAHLVRSGPLAVGLNAV 323

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWG 323
            +Q Y+ GVS P+   C  G   ++H VL+VGYG +  +    + +PYW+IKNSWGE WG
Sbjct: 324 FMQTYIGGVSCPL--IC--GKRFVNHGVLMVGYGDEGFSILRFRKLPYWVIKNSWGERWG 379

Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
           E GY+RL RG G CGIN  V SA+V
Sbjct: 380 EHGYYRLCRGHGMCGINTMV-SAVV 403


>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
          Length = 548

 Score =  230 bits (587), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 132/329 (40%), Positives = 191/329 (58%), Gaps = 10/329 (3%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 229 SVISLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 288

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 289 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 348

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 349 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 408

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 409 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVVLSQNEQKLAAWLAKRGPISV 466

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG          VP+W IKNSWG
Sbjct: 467 AINAFGMQFYRHGISRPLRPLCS--PWLIDHAVLLVGYG------NRSDVPFWAIKNSWG 518

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+ G  +CG+N     ++V
Sbjct: 519 TDWGEKGYYYLHCGSEACGVNTMASLSVV 547


>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
          Length = 326

 Score =  230 bits (587), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 130/311 (41%), Positives = 179/311 (57%), Gaps = 17/311 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           + +YL  +   P  ++   P    ++T+    FDWRE+ AV  V DQ  CGS WAFS  G
Sbjct: 89  KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           N+ G +  KT  L++LSEQ+L+DCD  DDGC+GG     +  I     GGLE    YPY 
Sbjct: 147 NVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G    C ++K      +NG   +   E   A+ L   GP++ A+NA  LQ Y  G+  P 
Sbjct: 205 GVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
             +CD    N  H+VL VGYGV   K      PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314

Query: 338 GINDYVRSALV 348
           GIN  V +A++
Sbjct: 315 GINSIVTTAII 325


>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
          Length = 326

 Score =  230 bits (586), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 131/311 (42%), Positives = 179/311 (57%), Gaps = 17/311 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           + +YL  +   P  ++   P    ++T+    FDWRE+ AV  V DQ  CGS WAFS  G
Sbjct: 89  KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           N+EG +  KT  L++LSEQ+L+DCD  D GC+GG     +  I     GGLE    YPY 
Sbjct: 147 NVEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G    C ++K      ING   +   E   A+ L   GP++ A+NA  LQ Y  G+  P 
Sbjct: 205 GVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
             +CD    N  H+VL VGYGV   K      PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314

Query: 338 GINDYVRSALV 348
           GIN  V +A++
Sbjct: 315 GINSIVTTAII 325


>gi|66803148|ref|XP_635417.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
 gi|166201987|sp|P04988.2|CYSP1_DICDI RecName: Full=Cysteine proteinase 1; Flags: Precursor
 gi|60463731|gb|EAL61909.1| cysteine proteinase 1 [Dictyostelium discoideum AX4]
          Length = 343

 Score =  230 bits (586), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 137/355 (38%), Positives = 192/355 (54%), Gaps = 36/355 (10%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
            L   TV VSS  +  +E+   L           F ++ NK Y+   EY  R  IF  NL
Sbjct: 8   VLAVFTVFVSSRGIPLEEQSQFLE----------FQDKFNKKYSH-EEYLERFEIFKSNL 56

Query: 71  RKIQ---LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN---I 124
            KI+   L+         +G+N+F+DLS+ EF+  YL  K      D  V   + +    
Sbjct: 57  GKIEELNLIAINHKADTKFGVNKFADLSSDEFKNYYLNNKEAIFTDDLPVADYLDDEFIN 116

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
           ++P AFDWR   AVT VK+Q  CGS W+FSTTGN+EG +     KLVSLSEQ L+DCD E
Sbjct: 117 SIPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLSEQNLVDCDHE 176

Query: 185 ----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVK 233
                     D+GC GG   NA++ I+    GG++ E +YPY  +    C  N      K
Sbjct: 177 CMEYEGEQACDEGCNGGLQPNAYNYIIKN--GGIQTESSYPYTAETGTQCNFNSANIGAK 234

Query: 234 INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           I+ +  + ++ET MA Y+V  GP+A+A +A   QFY+ GV     F       +L H +L
Sbjct: 235 ISNFTMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGV-----FDIPCNPNSLDHGIL 289

Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           IVGY    T F  K +PYWI+KNSWG  WGE+GY  L RG  +CG++++V ++++
Sbjct: 290 IVGYSAKNTIF-RKNMPYWIVKNSWGADWGEQGYIYLRRGKNTCGVSNFVSTSII 343


>gi|290984408|ref|XP_002674919.1| predicted protein [Naegleria gruberi]
 gi|284088512|gb|EFC42175.1| predicted protein [Naegleria gruberi]
          Length = 353

 Score =  230 bits (586), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 138/356 (38%), Positives = 200/356 (56%), Gaps = 30/356 (8%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLE---QHNKTYATLVEYYSRLHIFSGN 69
           L+  V ++  ++  D++ +    +  TA+ ++FL+   +  + Y    EY  RL +F  N
Sbjct: 7   LTFLVILACGILAFDQETYQ--PLSETAVRDHFLDFTRKFQRFYKGPEEYEYRLKVFREN 64

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP----AMIPNI- 124
           +   + + +   G+  YG+ +FSDL++ EF+  YL  K  P    + +      M+ N  
Sbjct: 65  IETSRRM-NIREGNNNYGITKFSDLTSDEFRKFYLMEKKTPKEIQKMMRMDSNKMVSNSY 123

Query: 125 --TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
               P  +DWR + A+TGVKDQ  CGS WAFS  G+IEG YA K K+LVS SEQ+L+DCD
Sbjct: 124 AKPAPDHYDWRNHGAITGVKDQGQCGSCWAFSAIGSIEGSYAIKHKQLVSFSEQQLVDCD 183

Query: 183 QE----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV 232
                       DDGC GG   +A+  +M    GG+  EK YPY  +   C +       
Sbjct: 184 NNCVTFENQQSCDDGCNGGLQWSAYQYLMK--AGGVVTEKDYPYYAERYKCEVKPANFVA 241

Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSV 292
           K++ +  +S +ET+MA +L ENGP+AVA+NA  LQ Y  G++ P   +CD     L H V
Sbjct: 242 KLSNWTMLSTNETEMANWLAENGPIAVALNADFLQNYNNGIADPA--WCDP--TQLDHGV 297

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           LIVGYG++ T +  K  PYWI+KNSWG  +GE GYFR+ +G G CGIN    +A V
Sbjct: 298 LIVGYGLE-TFWFGKPQPYWIVKNSWGYDFGEDGYFRIVKGVGRCGINTVPSAAFV 352


>gi|242061538|ref|XP_002452058.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
 gi|241931889|gb|EES05034.1| hypothetical protein SORBIDRAFT_04g017830 [Sorghum bicolor]
          Length = 371

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 133/337 (39%), Positives = 185/337 (54%), Gaps = 30/337 (8%)

Query: 26  GDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
           GD+    L+   H   F  F+++  K+Y    E+  RL IF  NLR+ +  Q  +  S  
Sbjct: 35  GDDNELELNAESH---FLSFVQRFGKSYKDAEEHAYRLSIFKANLRRARRHQLLDP-SAE 90

Query: 86  YGLNEFSDLSTAEFQAKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           +G+ +FSDL+ AEF+  YLG +      L+      +   ++P   LP  FDWR++ AVT
Sbjct: 91  HGVTKFSDLTPAEFRRTYLGLRKSRRALLRELGKSANEAPVLPTDGLPDDFDWRDHGAVT 150

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
            VK+Q  CGS W+FST+G +EG +   T KL  LSEQ+++DCD           D GC G
Sbjct: 151 PVKNQGSCGSCWSFSTSGALEGAHYLATGKLEVLSEQQMVDCDHVCDTSEPDSCDSGCNG 210

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
           G ++NAF  +     GGLE EK YPY G D  C+ +K      +  +  VS DE  +A  
Sbjct: 211 GLMTNAFSYLQK--AGGLESEKDYPYTGSDDKCKFDKSKIVASVQNFSVVSVDEGQIAAN 268

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
           L+++GP+A+ INA  +Q Y+ GVS P  + C      L H VL+VGYG         K  
Sbjct: 269 LIKHGPLAIGINAAYMQTYIGGVSCP--YIC---GRTLDHGVLLVGYGAAGFAPIRLKDK 323

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGD---GSCGINDYV 343
           PYWIIKNSWGE WGE GY+++ RG      CG++  V
Sbjct: 324 PYWIIKNSWGENWGENGYYKICRGSNVRNKCGVDSMV 360


>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
          Length = 326

 Score =  229 bits (585), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 130/311 (41%), Positives = 178/311 (57%), Gaps = 17/311 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           + +YL  +   P  ++   P    ++T+    FDWRE+ AV  V DQ  CGS WAFS  G
Sbjct: 89  KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           N+ G +  KT  L++LSEQ+L+DCD  DDGC+GG     +  I     GGLE    YPY 
Sbjct: 147 NVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G    C ++K      +NG   +   E   A+ L   GP++ A+NA  LQ Y  G+  P 
Sbjct: 205 GVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
             +CD    N  H VL VGYGV   K      PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HGVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314

Query: 338 GINDYVRSALV 348
           GIN  V +A++
Sbjct: 315 GINSIVTTAII 325


>gi|330792958|ref|XP_003284553.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
 gi|325085467|gb|EGC38873.1| hypothetical protein DICPUDRAFT_96752 [Dictyostelium purpureum]
          Length = 346

 Score =  229 bits (583), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 134/334 (40%), Positives = 192/334 (57%), Gaps = 32/334 (9%)

Query: 34  HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTE-HGSGV-YGLNE 90
           H ++ T  F  F +++NK Y++  EY ++   F  NL  I QL Q  + H S   +G+NE
Sbjct: 22  HTIEQTQ-FVAFQQKYNKVYSS-NEYSAKFETFKANLGVIAQLNQKAKLHKSDTKFGVNE 79

Query: 91  FSDLSTAEFQAKYLGFKL-KPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMC 147
           F+DLS AEF+  YL  ++ KP  +    P +   +  T+P AFDWR   AVTGVK+Q  C
Sbjct: 80  FADLSAAEFRKYYLNAQVAKPDASLPMAPLLTEEVLETIPTAFDWRTKGAVTGVKNQGQC 139

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEGGSISNAF 197
           GS W+FSTTGNIEG +      LV LSEQ L+DCD +          D GC+GG   NA+
Sbjct: 140 GSCWSFSTTGNIEGQWYLAGNTLVGLSEQNLVDCDHQCMEYDGQKSCDAGCDGGLQPNAY 199

Query: 198 DTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVEN 254
             ++    GGL+ E +YPY    GD  +C+        KI+ +  + ++ET MA YL  +
Sbjct: 200 RYVIEN--GGLDSENSYPYLAVTGD--SCKFKSGNVAAKISNFTMIPQNETQMAGYLATH 255

Query: 255 GPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWII 314
           GP+A+A +A   QFY+ GV       C    ++L H +LIVG+  ++  F H   PYWI+
Sbjct: 256 GPLAIAADAAEWQFYIGGV---FDLPC---GQSLDHGILIVGFSAEKNIFGHLK-PYWIV 308

Query: 315 KNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           KNSWG  WGE+GY  L +G   CG++D+V ++ +
Sbjct: 309 KNSWGASWGEQGYLYLGKGKNLCGVSDFVSTSTI 342


>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 130/311 (41%), Positives = 178/311 (57%), Gaps = 17/311 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           + +YL  +   P  ++   P    ++T+    FDWRE+ AV  V DQ  CGS WAFS  G
Sbjct: 89  ETRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           N+ G +  KT  L++LSEQ+L+DCD  DDGC+GG     +  I     GGLE    YPY 
Sbjct: 147 NVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G    C ++K      +NG   +   E   A+ L   GP++ A+NA  LQ Y  G+  P 
Sbjct: 205 GVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
             +CD    N  H+VL VGYGV   K      PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314

Query: 338 GINDYVRSALV 348
           GIN  V +A +
Sbjct: 315 GINSIVTTARI 325


>gi|41019551|tpe|CAD66657.1| TPA: putative cysteine proteinase precursor [Hordeum vulgare subsp.
           vulgare]
 gi|326489967|dbj|BAJ94057.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326525847|dbj|BAJ93100.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 377

 Score =  228 bits (582), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 129/323 (39%), Positives = 181/323 (56%), Gaps = 31/323 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+++  KTY    E+  RL +F  NLR+ +  Q  +  S  +G+ +FSDL+ AEF+ 
Sbjct: 53  FVGFVQRFGKTYRDAEEHAHRLSVFKANLRRARRHQLLDP-SAEHGVTKFSDLTPAEFRR 111

Query: 102 KYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            YLG K      L+          ++P   LP  FDWR++ AV  VK+Q  CGS W+FS 
Sbjct: 112 TYLGLKTTRRSFLREMAGSAHDAPVLPTDGLPEDFDWRDHGAVGPVKNQGSCGSCWSFSA 171

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
           +G +EG     + K+  LSEQ+L+DCD E         D GC GG +++AF  ++    G
Sbjct: 172 SGALEGANYLASGKMEVLSEQQLVDCDHECDPSEPDSCDAGCNGGLMTSAFSYLLKS--G 229

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           GLE EK YPY G D  C+ +K      +  Y  V+ DE  +A  LV+ GP+A+ INA  +
Sbjct: 230 GLEREKDYPYTGKDGTCKFDKSKIAASVQNYSVVAVDEEQIAANLVKYGPLAIGINAAYM 289

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKNSWGEGWG 323
           Q Y+ GVS P  + C     +L H VL+VGYG      ++F  K  PYWIIKNSWGE WG
Sbjct: 290 QTYIGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPSRFKEK--PYWIIKNSWGENWG 342

Query: 324 EKGYFRLYRGD---GSCGINDYV 343
           +KGY+++ RG      CG++  V
Sbjct: 343 DKGYYKICRGSNVRNKCGVDSMV 365


>gi|194352746|emb|CAQ00101.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 381

 Score =  228 bits (582), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 134/340 (39%), Positives = 186/340 (54%), Gaps = 29/340 (8%)

Query: 23  MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            VVG +  + L  +   A F  F+ +  K+Y    E+  RL +F  NLR+ +  Q  +  
Sbjct: 40  QVVGGDAENELE-LNAEAHFASFVRRFGKSYRDADEHEHRLSVFRANLRRARRHQRLD-P 97

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYD 136
           S V+G+ +FSDL+  EF+ ++LG +      LK           +P   LP  FDWRE+ 
Sbjct: 98  SAVHGITKFSDLTPDEFRERFLGLRKSRRSFLKGISGSAHDAPALPTDGLPTEFDWREHG 157

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDG 187
           AV  VKDQ  CGS W+FST+G +EG     T KL  LSEQ+L+DCD E         D G
Sbjct: 158 AVGPVKDQGSCGSCWSFSTSGALEGANYLATGKLEVLSEQQLVDCDHECDPSEPRACDAG 217

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDM 247
           C GG ++ AF  +     GGLE EK YPY G + AC+ +K     ++  + +V+ DE  +
Sbjct: 218 CNGGLMTTAFSYLAK--AGGLETEKDYPYTGRNSACKFDKSKIAAQVKNFSTVAIDEDQI 275

Query: 248 AKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTH 306
           A  LV++GP+A+ INA  +Q Y+ GVS P  + C     +L H V +VGYG         
Sbjct: 276 AANLVKHGPLAIGINAVFMQTYIGGVSCP--YIC---GRHLDH-VFLVGYGSAGYAPLRF 329

Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGINDYV 343
           K  PYWIIKNSWGE WGE GY+++ RG      CG++  V
Sbjct: 330 KEKPYWIIKNSWGENWGESGYYKICRGPHVKNKCGVDSMV 369


>gi|357148994|ref|XP_003574963.1| PREDICTED: cysteine proteinase 1-like [Brachypodium distachyon]
          Length = 377

 Score =  228 bits (581), Expect = 3e-57,   Method: Compositional matrix adjust.
 Identities = 127/321 (39%), Positives = 179/321 (55%), Gaps = 27/321 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+++  KTY    E+  RL +F  NLR+ +  Q  +  S  +G+ +FSDL+ AEF+ 
Sbjct: 53  FTSFVQRFGKTYKDAEEHAHRLSVFKANLRRARRHQLLDP-SAEHGITKFSDLTPAEFRR 111

Query: 102 KYLGFKLKPSYADRSV------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            +LG K       R +        ++P   LP  FDWR++ AV  VK+Q  CGS W+FS 
Sbjct: 112 TFLGLKTSRRSFLREIGGSAHDAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSA 171

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
           +G +EG     T K+  LSEQ+ +DCD E         D GC GG +++AF  ++    G
Sbjct: 172 SGALEGANYLATGKMEVLSEQQFVDCDHECDPEEPDSCDAGCNGGLMTSAFSYLLKS--G 229

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           GLE EK YPY G D  C+ +K      +  +  VS DE  +A  LV++GP+A+ INA  +
Sbjct: 230 GLEREKDYPYTGRDGTCKFDKSKIVASVQNFSVVSVDEEQIAANLVKHGPLAIGINAAYM 289

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH-KAVPYWIIKNSWGEGWGEK 325
           Q Y+ GVS P  + C     +L H VL+VGYG      +  K  PYW+IKNSWGE WGEK
Sbjct: 290 QTYIGGVSCP--YIC---GRSLDHGVLLVGYGASGFAPSRLKNKPYWVIKNSWGENWGEK 344

Query: 326 GYFRLYRGD---GSCGINDYV 343
           GY+++ RG      CG++  V
Sbjct: 345 GYYKICRGSNVRNKCGVDSMV 365


>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
 gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
          Length = 371

 Score =  228 bits (580), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 128/315 (40%), Positives = 171/315 (54%), Gaps = 18/315 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  +  KTY T  E+  R  +F  NLRK +  Q  +    V+G+  FSDL+ +EF+ 
Sbjct: 58  FQDFKLKFGKTYTTDEEHDYRFRVFKANLRKAKRHQKLDP-DAVHGVTRFSDLTESEFRE 116

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
            ++G       AD     ++P   L   FDWR+  AVT VKDQ  CGS W+FS  G +EG
Sbjct: 117 NFVGLNRLRLPADAHQAPILPTDNLASDFDWRDQGAVTPVKDQGSCGSCWSFSAVGALEG 176

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
                T KL+SLSEQ+L+DCD E         D GC GG +++AF+ I+    GGLE E+
Sbjct: 177 ANFLSTGKLISLSEQQLVDCDHECDPEEAGACDAGCNGGLMTSAFEYIVK--AGGLEREE 234

Query: 213 TYPYRGDDK-ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D+ +C+            +  +S D   +A  LV+NGP+A+ INA  +Q Y+ 
Sbjct: 235 DYPYTGTDRGSCKFQNGKIAASAANFSVISNDADQIAANLVKNGPLAIGINAVFMQTYMK 294

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           G+S P  + C     NL H VL+VGYG         K  PYWIIKNSWGE WGE GY+ +
Sbjct: 295 GISCP--YIC--SKRNLDHGVLLVGYGAAGFAPIRLKEKPYWIIKNSWGENWGENGYYFI 350

Query: 331 YRGDGSCGINDYVRS 345
            +G   CG    V S
Sbjct: 351 CKGKNICGSESMVSS 365


>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  227 bits (579), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 130/311 (41%), Positives = 177/311 (56%), Gaps = 17/311 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           + +YL  +   P  ++   P    ++T+    FDWRE+ AV  V DQ  CGS WAFS  G
Sbjct: 89  KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           N+ G +  KT  L++LSEQ+L+DCD  D GC+GG     +  I     GGLE    YPY 
Sbjct: 147 NVVGQWFRKTGHLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G    C ++K      ING   +   E   A+ L   GP++ A+NA  LQ Y  G+  P 
Sbjct: 205 GVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP- 263

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
              CD    N  H+VL VGYGV   K      PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 264 -RLCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314

Query: 338 GINDYVRSALV 348
           GIN  V +A++
Sbjct: 315 GINSIVTTAII 325


>gi|358339045|dbj|GAA32724.2| cathepsin F, partial [Clonorchis sinensis]
          Length = 271

 Score =  227 bits (578), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 121/282 (42%), Positives = 166/282 (58%), Gaps = 14/282 (4%)

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAMIPNITLP 127
            L   + LQ+ E G+  YG+ +FSDL++ EF+ +YL  +   P  ++   P    ++T+ 
Sbjct: 1   QLAAAKRLQEMEQGTAQYGVTQFSDLTSEEFKTRYLRMRFDGPIVSEDLTPE--EDVTMD 58

Query: 128 -RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
              FDWRE+ AV  V DQ  CGS WAFS  GN+EG +  KT  L++LSEQ+L+DCD  D 
Sbjct: 59  NEKFDWREHGAVGPVLDQGKCGSCWAFSVIGNVEGQWFRKTGDLLALSEQQLVDCDHLDK 118

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
           GC GG     +  I     GGLE    YPY G D  C +N+      +N    +   E  
Sbjct: 119 GCNGGYPPKTYGEIEKM--GGLELASDYPYTGVDGICYMNQSKFVAYVNDSTVLPLSEKI 176

Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
            A+ L E GP++ A+NA  LQFY+ G+  PI F C+     L+H+VL VGYG      T 
Sbjct: 177 QAQKLKEIGPLSSALNAVLLQFYLGGIIFPIPFLCN--PHGLNHAVLTVGYG------TE 228

Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             +PYWI+KNSWG G+GEKGYFR++RG G+CGIN  V +A++
Sbjct: 229 FGIPYWIVKNSWGVGFGEKGYFRIFRGAGTCGINLVVSTAII 270


>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  226 bits (577), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 129/310 (41%), Positives = 174/310 (56%), Gaps = 15/310 (4%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           + +YL  +          P+   ++T+    FDWRE+ AV  V DQ  CGS WAFS  GN
Sbjct: 89  KTRYLRMRFDGPIVSED-PSPEEDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIGN 147

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           + G +  KT  L++LSEQ+L+DCD  D GC+GG     +  I     GGLE    YPY G
Sbjct: 148 VVGQWFRKTGHLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKM--GGLELASDYPYTG 205

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQ 278
               C ++K      ING   +   E   A+ L   GP++ A+NA  LQ Y  G+  P  
Sbjct: 206 VGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP-- 263

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG 338
             CD    N  H+VL VGYGV   K      PYWI+KNSWGE +GE+GYFR+YRGDG+CG
Sbjct: 264 RLCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTCG 315

Query: 339 INDYVRSALV 348
           IN  V +A +
Sbjct: 316 INSIVTTARI 325


>gi|255585361|ref|XP_002533377.1| cysteine protease, putative [Ricinus communis]
 gi|223526784|gb|EEF29008.1| cysteine protease, putative [Ricinus communis]
          Length = 381

 Score =  225 bits (574), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 129/321 (40%), Positives = 186/321 (57%), Gaps = 25/321 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAE 98
           F  F+ +++K Y T  EY  RL +F+ NL +    Q+L  T     V+G+  F DL+  E
Sbjct: 67  FKMFMIKYDKEYDTREEYMHRLGVFAKNLIRAAEHQVLDPT----AVHGITPFMDLTEEE 122

Query: 99  FQAKYLGFKLKPSYADRSVPA--MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F+  Y G     +     V A   +    LP +FDWR+  AVT VK Q  CGS WAFSTT
Sbjct: 123 FERMYTGVVGGGAVGAEGVTATSFLETAGLPSSFDWRKKGAVTDVKMQGACGSCWAFSTT 182

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGG 207
           G IEG     T KL++LSEQ+L+DCD+          DDGC GG ++NA+  ++    GG
Sbjct: 183 GAIEGANFIATGKLLNLSEQQLVDCDRVCDIKEKTACDDGCGGGLMTNAYRYLIE--AGG 240

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
           LE+E +YPY G    C+ ++K   V++  + S+  DE  +A +LV +GP+A+ +NA  +Q
Sbjct: 241 LEDEISYPYTGKPGKCKFDEKKIAVRVVNFTSIPIDENQIAAHLVHHGPLAIGLNAVFMQ 300

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV-PYWIIKNSWGEGWGEKG 326
            Y+ GVS P+   C  G + ++H VL+VGYG            PYWIIKNSWG+ WGE+G
Sbjct: 301 TYIGGVSCPL--IC--GKKWINHGVLLVGYGAKGFSILRLGYKPYWIIKNSWGKRWGEEG 356

Query: 327 YFRLYRGDGSCGINDYVRSAL 347
           Y+R+ +G G CG++  V + +
Sbjct: 357 YYRICKGYGMCGMDRMVSAVV 377


>gi|56682917|gb|AAW21813.1| cysteine protease [Triticum aestivum]
          Length = 377

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 127/320 (39%), Positives = 178/320 (55%), Gaps = 31/320 (9%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F+++  KTY    E+  RL +F  NLR+ +  Q  +  S  +G+ +FSDL+ AEF+  +L
Sbjct: 56  FVQRFGKTYRDAEEHAHRLSVFKANLRRARRHQMLDP-SAEHGVTKFSDLTPAEFRRTFL 114

Query: 105 GFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           G K      L+          ++P   LP  FDWR++ AV  VK+Q  C S W+FS +G 
Sbjct: 115 GLKTTRRSFLREMAGSAHDAPVLPTDGLPEDFDWRDHGAVGPVKNQGSCWSCWSFSASGA 174

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLE 209
           +EG     T K+  LSEQ+L+DCD E         D GC GG +++AF  ++    GGLE
Sbjct: 175 LEGANYLATGKMEVLSEQQLVDCDHECDPAEPDSCDAGCNGGLMTSAFSYLLKS--GGLE 232

Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
            EK YPY G D  C+  K      +  +  V+ DE  +A  LVE GP+A+ INA  +Q Y
Sbjct: 233 REKDYPYTGKDGTCKFEKSKIAASVQNFSVVAVDEEQIAANLVEYGPLAIGINAAYMQTY 292

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKNSWGEGWGEKG 326
           + GVS P  + C     +L H VL+VGYG      ++F  K  PYWIIKNSWGE WG+KG
Sbjct: 293 IGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPSRFKEK--PYWIIKNSWGENWGDKG 345

Query: 327 YFRLYRGD---GSCGINDYV 343
           Y+++ RG      CG++  V
Sbjct: 346 YYKICRGSNVRNKCGVDSMV 365


>gi|327358519|gb|AEA51106.1| cathepsin F, partial [Oryzias melastigma]
          Length = 255

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 120/260 (46%), Positives = 161/260 (61%), Gaps = 11/260 (4%)

Query: 90  EFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPR-AFDWREYDAVTGVKDQTMCG 148
           +FSDL+  EF + YL   L      R +    P  +    ++DWR++ AV+ VK+Q MCG
Sbjct: 5   KFSDLTEEEFHSAYLNPLLSQWTLHREMKPAPPAKSPAPDSWDWRDHGAVSPVKNQGMCG 64

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           S WAFS TGNIEG +  K   L+SLSEQEL+DCD  D  C GG  SNA++ I  KL GGL
Sbjct: 65  SCWAFSVTGNIEGQWFLKNGTLLSLSEQELVDCDGLDQACRGGLPSNAYEAI-EKL-GGL 122

Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           E E  Y Y G  + C    +     IN  V + +DE ++A +L ENGP++VA+NA+A+QF
Sbjct: 123 ETETDYSYTGKKQRCDFTNRKVAAYINSSVELPKDEKEIAAWLAENGPISVALNAFAMQF 182

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GVSHP + FC+     + H+VL+VGYG          +P+W IKNSWGE +GE+GY+
Sbjct: 183 YKKGVSHPWKIFCNPW--MIDHAVLLVGYG------ERNGIPFWAIKNSWGEDYGEQGYY 234

Query: 329 RLYRGDGSCGINDYVRSALV 348
            L+RG  +CGIN    SA+V
Sbjct: 235 YLHRGSNACGINKMGSSAVV 254


>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
 gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
 gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
 gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
 gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
          Length = 365

 Score =  224 bits (572), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 182/324 (56%), Gaps = 31/324 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ-----------LLQDTEHGSGVYGLNE 90
           F +FL+Q+NK+Y    EY  R ++F  NL KI               D+   S  +G+N+
Sbjct: 55  FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 114

Query: 91  FSDLSTAEFQAKYLGFKLKPS----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
           FSD +  E      GF L  S      +  +    PNI LP  +DWR+ + VT +KDQ +
Sbjct: 115 FSDKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPNIRLPDYYDWRDTNKVTPIKDQGV 174

Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
           CGS WAF   GNIE  YA +  KL+ LSEQ+L+DCD+ D GC GG +  AF  ++  L G
Sbjct: 175 CGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQELL--LMG 232

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINAYA 265
           G+E E  YPY+G ++ C L+ +   VK+N       RDE  + + +   GP+A+A++A  
Sbjct: 233 GVETEADYPYQGSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDAMD 292

Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
           +  Y  G+ +    +      +L+H+VL++G+G++        VPYWIIKNSWGE WGE 
Sbjct: 293 IINYRRGILNQCHIY------DLNHAVLLIGWGIENN------VPYWIIKNSWGEDWGEN 340

Query: 326 GYFRLYRGDGSCG-INDYVRSALV 348
           GY R+ R   +CG +N++  S+++
Sbjct: 341 GYLRVRRNVNACGLLNEFGASSVI 364


>gi|115446097|ref|NP_001046828.1| Os02g0469600 [Oryza sativa Japonica Group]
 gi|47497527|dbj|BAD19579.1| putative cysteine proteinase 1 precursor [Oryza sativa Japonica
           Group]
 gi|113536359|dbj|BAF08742.1| Os02g0469600 [Oryza sativa Japonica Group]
 gi|215701326|dbj|BAG92750.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215704370|dbj|BAG93804.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215708762|dbj|BAG94031.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|218200777|gb|EEC83204.1| hypothetical protein OsI_28465 [Oryza sativa Indica Group]
 gi|222622835|gb|EEE56967.1| hypothetical protein OsJ_06681 [Oryza sativa Japonica Group]
          Length = 373

 Score =  224 bits (571), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 130/337 (38%), Positives = 184/337 (54%), Gaps = 30/337 (8%)

Query: 26  GDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
           GD+    L+  +H   F  F+++  K+Y    E+  RL +F  NLR+ +  Q  +  S  
Sbjct: 37  GDDNELELNAERH---FASFVQRFGKSYRDADEHAYRLSVFKANLRRARRHQLLDP-SAE 92

Query: 86  YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV------PAMIPNITLPRAFDWREYDAVT 139
           +G+ +FSDL+ AEF+  YLG +       R +        ++P   LP  FDWR++ AV 
Sbjct: 93  HGVTKFSDLTPAEFRRAYLGLRTSRRAFLRGLGGSAHEAPVLPTDGLPDDFDWRDHGAVG 152

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEG 190
            VK+Q  CGS W+FS +G +EG     T K+  LSEQ+++DCD E         D GC G
Sbjct: 153 PVKNQGSCGSCWSFSASGALEGANYLATGKMDVLSEQQMVDCDHECDSSEPDSCDAGCNG 212

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY 250
           G ++NAF  ++    GGLE EK YPY G D  C+ +K      +  +  VS DE  +A  
Sbjct: 213 GLMTNAFSYLLKS--GGLESEKDYPYTGRDGTCKFDKSKIVTSVQNFSVVSVDEDQIAAN 270

Query: 251 LVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAV 309
           LV++GP+A+ INA  +Q Y+ GVS P  + C     +L H VL+VGYG         K  
Sbjct: 271 LVKHGPLAIGINAAYMQTYIGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPIRLKDK 325

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGD---GSCGINDYV 343
            YWIIKNSWGE WGE GY+++ RG      CG++  V
Sbjct: 326 AYWIIKNSWGENWGEHGYYKICRGSNVRNKCGVDSMV 362


>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
           Australia]
          Length = 367

 Score =  224 bits (571), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 182/324 (56%), Gaps = 31/324 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ-----------LLQDTEHGSGVYGLNE 90
           F +FL+Q+NK+Y    EY  R ++F  NL KI               D+   S  +G+N+
Sbjct: 57  FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116

Query: 91  FSDLSTAEFQAKYLGFKLKPS----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
           FSD +  E      GF L  S      +  +    PNI LP  +DWR+ + VT +KDQ +
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPNIRLPDYYDWRDTNKVTPIKDQGV 176

Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
           CGS WAF   GNIE  YA +  KL+ LSEQ+L+DCD+ D GC GG +  AF  ++  L G
Sbjct: 177 CGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQELL--LMG 234

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINAYA 265
           G+E E  YPY+G ++ C L+ +   VK+N       RDE  + + +   GP+A+A++A  
Sbjct: 235 GVETEADYPYQGSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDAMD 294

Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
           +  Y  G+ +    +      +L+H+VL++G+G++        VPYWIIKNSWGE WGE 
Sbjct: 295 IINYRRGILNQCHIY------DLNHAVLLIGWGIENN------VPYWIIKNSWGEDWGEN 342

Query: 326 GYFRLYRGDGSCG-INDYVRSALV 348
           GY R+ R   +CG +N++  S+++
Sbjct: 343 GYLRVRRNVNACGLLNEFGASSVI 366


>gi|85068706|gb|ABC69433.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  224 bits (571), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 128/311 (41%), Positives = 177/311 (56%), Gaps = 17/311 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           + +YL  +   P  ++   P    ++T+    FDWRE+ AV  V DQ  CGS WAFS  G
Sbjct: 89  KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           N+ G +  +T  L++LS Q+L+DCD  DDGC+GG     +  I     GGLE    YPY 
Sbjct: 147 NVVGQWFRETGHLLALSGQQLVDCDYLDDGCDGGYPPQTYTAIQKM--GGLELASDYPYT 204

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G    C ++K      +NG   +   E   A+ L   GP++ A+NA  LQ Y  G+  P 
Sbjct: 205 GVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRPK 264

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
             +CD    N  H+VL VGYGV   K      PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 265 --WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314

Query: 338 GINDYVRSALV 348
           GIN  V +A +
Sbjct: 315 GINSIVTTARI 325


>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
          Length = 326

 Score =  224 bits (571), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 131/311 (42%), Positives = 178/311 (57%), Gaps = 17/311 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           AL+  F  ++ KTY+   +   R  IF  NL + + LQ+ E G+  YG+ +FSDL++ EF
Sbjct: 30  ALYEEFKLKYKKTYSNDDDEL-RFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEF 88

Query: 100 QAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           + +YL  +   P  ++   P    ++T+    FDWRE+ AV  V DQ  CGS WAFS  G
Sbjct: 89  KTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGAVGPVLDQGKCGSCWAFSVIG 146

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           N+ G +  KT  L++LSEQ L+DCD  D GC+GG      +T + K+ GGLE    YPY 
Sbjct: 147 NVVGQWFRKTGHLLALSEQPLVDCDYLDGGCDGGYPPQT-NTAIQKM-GGLELASDYPYT 204

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G    C ++K      ING   +   E   A+ L   GP++ A+NA  LQ Y  G+  P 
Sbjct: 205 GVGGICYMDKSKFVAYINGSTILPLSEKVQAQKLRAIGPLSSALNADTLQLYKGGIMRP- 263

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
              CD    N  H+VL VGYGV   K      PYWI+KNSWGE +GE+GYFR+YRGDG+C
Sbjct: 264 -RLCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNSWGEDFGEEGYFRIYRGDGTC 314

Query: 338 GINDYVRSALV 348
           GIN  V +A +
Sbjct: 315 GINSIVTTARI 325


>gi|194705198|gb|ACF86683.1| unknown [Zea mays]
 gi|413936851|gb|AFW71402.1| cysteine protease1 [Zea mays]
          Length = 371

 Score =  223 bits (569), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 125/321 (38%), Positives = 176/321 (54%), Gaps = 27/321 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+++  K+Y    E+  RL +F  NLR+ +  Q  +  S  +G+ +FSDL+ AEF+ 
Sbjct: 48  FLSFVQRFGKSYKDADEHAYRLSVFKANLRRARRHQLLDP-SAEHGVTKFSDLTPAEFRR 106

Query: 102 KYLGFKLKPSYADRSV------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            YLG +       R +        ++P   LP  FDWR++ AV  VK+Q  CGS W+FS 
Sbjct: 107 TYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSA 166

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
           +G +EG +   T KL  LSEQ+ +DCD E         D GC GG ++ AF  +     G
Sbjct: 167 SGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQK--AG 224

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           GLE EK YPY G D  C+ +K      +  +  VS DE  ++  L+++GP+A+ INA  +
Sbjct: 225 GLESEKDYPYTGSDGKCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYM 284

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEK 325
           Q Y+ GVS P  + C     +L H VL+VGYG         K  PYWIIKNSWGE WGE 
Sbjct: 285 QTYIGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGEN 339

Query: 326 GYFRLYRGD---GSCGINDYV 343
           GY+++ RG      CG++  V
Sbjct: 340 GYYKICRGSNVRNKCGVDSMV 360


>gi|242045644|ref|XP_002460693.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
 gi|241924070|gb|EER97214.1| hypothetical protein SORBIDRAFT_02g033270 [Sorghum bicolor]
          Length = 373

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 131/332 (39%), Positives = 185/332 (55%), Gaps = 32/332 (9%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A F  F+ +H + Y+   EY  RL +F+ NL +    Q  +  +  +G+  FSDL+  EF
Sbjct: 47  AQFAAFVRRHGRRYSGPEEYARRLRVFAANLARAAAHQALDP-TARHGVTPFSDLTREEF 105

Query: 100 QAKYLGFK---------LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGS 149
           +A+  G +         L  S A  + PA    ++ LP +FDWR+  AVTGVK Q  CGS
Sbjct: 106 EARLTGVRAGAGGDVQRLVMSGAPAAPPASQEEVSRLPASFDWRDKGAVTGVKMQGACGS 165

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTI 200
            WAFSTTG +EG     T KL+ LSEQ+L+DCD           ++GC GG ++NA+  +
Sbjct: 166 CWAFSTTGAVEGANFLATGKLLELSEQQLVDCDHTCSAVAQNECNNGCAGGLMTNAYAYL 225

Query: 201 MSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAV 259
           M    GGL E++ YPY G    CR +     V++  + +V + DE  +   LV  GP+AV
Sbjct: 226 MKS--GGLMEQRAYPYTGAPGPCRFDPAKAAVRVANFTAVPAGDEAQIRAALVRRGPLAV 283

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKN 316
            +NA  +Q YV GVS P+   C      ++H VL+VGYG       +  ++  PYWIIKN
Sbjct: 284 GLNAAFMQTYVGGVSCPL--LCP--RAWVNHGVLLVGYGARGFAALRLGYR--PYWIIKN 337

Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           SWGE WGE+GY+RL RG   CG++  V +  V
Sbjct: 338 SWGERWGEQGYYRLCRGSNVCGVDSMVSAVAV 369


>gi|302774134|ref|XP_002970484.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
 gi|300162000|gb|EFJ28614.1| hypothetical protein SELMODRAFT_93661 [Selaginella moellendorffii]
          Length = 343

 Score =  223 bits (568), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 135/347 (38%), Positives = 193/347 (55%), Gaps = 31/347 (8%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLH---HVKHT-ALFNYFLEQHNKTYATLVEYYSRLHI 65
           V LL L V  SS   +   K+  +     VK     F +F+++  K Y T  EY  RL +
Sbjct: 10  VGLLILVVCCSSSNRLDIGKIRQVTDNLEVKDVEGHFKHFMQKFGKVYGTTEEYVHRLKV 69

Query: 66  FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV--PAMIPN 123
           F  NL  +  L+  +  + ++G+  F+DL+  E  +++LGF+   +Y++R V    ++P 
Sbjct: 70  FQANLAHVMSLKKQDP-TAIHGITSFADLTPEEL-SRFLGFR--KAYSNRVVNQAPLLPT 125

Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
             LP AFDWRE+ AVT VK Q  CGS W FSTTG +EG    KT KL+SLSE++LIDCD 
Sbjct: 126 DNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVVEGANFLKTGKLISLSEEQLIDCDY 185

Query: 184 EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-------RGDDKACRLNKKATQVKING 236
           +D+GCEGG + +A++ + ++   GLE E+ YPY       +     CR         I  
Sbjct: 186 KDNGCEGGDMLSAYEYVKAR---GLEAEEDYPYEELGYRHKPVRGPCRYQPSKVVATIAN 242

Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
           Y  VS DE  +A  LV+NGP+++A+    L  Y  GV+ P    C G    ++H VL+VG
Sbjct: 243 YSRVSEDEDQIAANLVKNGPLSIALRGNVLFTYEGGVACP--RICPG---EINHGVLLVG 297

Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
           YGV+        + YW  KN+W + +GE GYFRL RG G C +N  V
Sbjct: 298 YGVE------NGLRYWTFKNTWTDEFGENGYFRLCRGVGVCDMNSEV 338


>gi|162459555|ref|NP_001105685.1| cysteine proteinase 1 precursor [Zea mays]
 gi|1706260|sp|Q10716.1|CYSP1_MAIZE RecName: Full=Cysteine proteinase 1; Flags: Precursor
 gi|643597|dbj|BAA08244.1| cysteine proteinase [Zea mays]
          Length = 371

 Score =  223 bits (567), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 125/321 (38%), Positives = 176/321 (54%), Gaps = 27/321 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+++  K+Y    E+  RL +F  NLR+ +  Q  +  S  +G+ +FSDL+ AEF+ 
Sbjct: 48  FLSFVQRFGKSYKDADEHAYRLSVFKDNLRRARRHQLLDP-SAEHGVTKFSDLTPAEFRR 106

Query: 102 KYLGFKLKPSYADRSV------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            YLG +       R +        ++P   LP  FDWR++ AV  VK+Q  CGS W+FS 
Sbjct: 107 TYLGLRKSRRALLRELGESAHEAPVLPTDGLPDDFDWRDHGAVGPVKNQGSCGSCWSFSA 166

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGG 206
           +G +EG +   T KL  LSEQ+ +DCD E         D GC GG ++ AF  +     G
Sbjct: 167 SGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLMTTAFSYLQK--AG 224

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           GLE EK YPY G D  C+ +K      +  +  VS DE  ++  L+++GP+A+ INA  +
Sbjct: 225 GLESEKDYPYTGSDGKCKFDKSKIVASVQNFSVVSVDEAQISANLIKHGPLAIGINAAYM 284

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEK 325
           Q Y+ GVS P  + C     +L H VL+VGYG         K  PYWIIKNSWGE WGE 
Sbjct: 285 QTYIGGVSCP--YIC---GRHLDHGVLLVGYGASGFAPIRLKDKPYWIIKNSWGENWGEN 339

Query: 326 GYFRLYRGD---GSCGINDYV 343
           GY+++ RG      CG++  V
Sbjct: 340 GYYKICRGSNVRNKCGVDSMV 360


>gi|66803062|ref|XP_635374.1| cysteine protease [Dictyostelium discoideum AX4]
 gi|60463697|gb|EAL61879.1| cysteine protease [Dictyostelium discoideum AX4]
          Length = 352

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 134/335 (40%), Positives = 180/335 (53%), Gaps = 41/335 (12%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL--QDTEHGSGV-YGLNEFSDLSTAE 98
           F  F  ++NK Y+   EY  +   F  NL  I  L  Q T  GS   +G+N+F+DLS  E
Sbjct: 27  FIAFQNKYNKIYSA-EEYLVKFETFKSNLLNIDALNKQATTIGSDTKFGVNKFADLSKEE 85

Query: 99  FQAKYLGFKLKPSYADRSVPAMIPNIT------LPRAFDWR---------EYDAVTGVKD 143
           F+  YL    K +     +P M+PN++       P AFDWR         +   VT VK+
Sbjct: 86  FKKYYL--SSKEARLTDDLP-MLPNLSDDIISATPAAFDWRNTGGSTKFPQGTPVTAVKN 142

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEGGSI 193
           Q  CGS W+FSTTGN+EG +   T  LV LSEQ L+DCD            + GC+GG  
Sbjct: 143 QGQCGSCWSFSTTGNVEGQHYLSTGTLVGLSEQNLVDCDHTCMTYENENVCNAGCDGGLQ 202

Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
            NA++ I+    GG++ E TYPY   D  C+ N      KI+ +  V ++ET +A YL  
Sbjct: 203 PNAYNYIIKN--GGIQTEATYPYTAVDGECKFNSAQVGAKISSFTMVPQNETQIASYLFN 260

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
           NGP+A+A +A   QFY+ GV     F C    + L H +LIVGYG   T    K  PYWI
Sbjct: 261 NGPLAIAADAEEWQFYMGGV---FDFPC---GQTLDHGILIVGYGAQDT-IVGKNTPYWI 313

Query: 314 IKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           IKNSWG  WGE GY ++ R    CG+ ++V S++V
Sbjct: 314 IKNSWGADWGEAGYLKVERNTDKCGVANFVSSSIV 348


>gi|285002340|ref|YP_003422404.1| cathepsin [Pseudaletia unipuncta granulovirus]
 gi|197343600|gb|ACH69415.1| cathepsin [Pseudaletia unipuncta granulovirus]
          Length = 338

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 135/357 (37%), Positives = 193/357 (54%), Gaps = 35/357 (9%)

Query: 4   FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHH--VKHTALFNYFLEQHNKTYATLVEYYS 61
            +++  V LL+ T  VSS        +++L +       LF+ F+ ++ K YA   E  S
Sbjct: 5   IFWYGFVCLLATTPIVSS--------MNNLQYDLSNSEVLFDEFVTKYGKVYANDAERKS 56

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL--------KPSYA 113
           R  +F  NL  I   ++ +  S  +G+N +SDLS+ E   K  GFK         K  Y 
Sbjct: 57  RFDVFKANLAIINE-RNAQEESATFGINFYSDLSSNELLRKQTGFKTALHNDNEKKSKYC 115

Query: 114 DRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL 173
            R V        LP AF+WR+ DAVT VK Q  CGS WAFS   NIE  Y  K K+ V L
Sbjct: 116 TRRVITGPSTRLLPEAFNWRDSDAVTSVKQQRDCGSCWAFSAVANIESQYYIKNKQYVDL 175

Query: 174 SEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVK 233
           SEQ+++DCD  ++GC GG +S A + +M    GG++ E+ Y Y G++  C+ N  A  V+
Sbjct: 176 SEQQIVDCDPINNGCNGGLMSWAMEYVMRS--GGVQLEEDYQYVGNEGVCK-NNSANVVQ 232

Query: 234 INGYVSVS-RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSV 292
           I+G VS   R+E  + + LV NGP++VAI+   +  Y +G++             L+H+V
Sbjct: 233 ISGCVSYDLRNEERLRELLVSNGPISVAIDVMDVTNYQSGIAKHCSVA-----HGLNHAV 287

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
           L+VGYGV          PYW+ KNSWG  WGE GYFR+ R   SCG +N Y  +A++
Sbjct: 288 LLVGYGV------QNNTPYWVFKNSWGSDWGENGYFRVLRDVNSCGMLNQYAATAIL 338


>gi|449461649|ref|XP_004148554.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD19a-like
           [Cucumis sativus]
          Length = 381

 Score =  221 bits (564), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 138/375 (36%), Positives = 200/375 (53%), Gaps = 50/375 (13%)

Query: 4   FYFFAGVALLSLTVSVSSFMVV-------GDEKL-----------HHLHHVKHTALFNYF 45
           F+ FA +  ++ T+  S  +V        GD  +           HH    +H   F+ F
Sbjct: 5   FFLFAVITAVTATLCSSEPLVSQHSVEHDGDPLIRQVVENDGDFNHHALGAEHH--FSLF 62

Query: 46  LEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG 105
             +  K+YAT  E+  R  IF  N+R+ +  Q  +  S ++G+ +FSDL+  EF+  +LG
Sbjct: 63  KRRFGKSYATEEEHDRRFKIFKANMRRAERHQSFD-PSAIHGVTQFSDLTPFEFRKAFLG 121

Query: 106 FK---LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
            +   L+      + P ++P   LP  FDWR++  VT VK+Q  CGS W+FSTTG +EG 
Sbjct: 122 LRGHRLRLPVDTNAAP-ILPTENLPIDFDWRQHGGVTRVKNQGSCGSCWSFSTTGALEGA 180

Query: 163 YAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
                   + LSEQ+L+DCD E         D GC GG +++AF+  +    GGL +E+ 
Sbjct: 181 ------NFLXLSEQQLVDCDHECDPEEEDACDSGCNGGLMNSAFEYTLK--AGGLMKEQD 232

Query: 214 YPYRGDDK-ACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYALQFYVT 271
           YPY G D+  C  +K      I  +  V S DE  +A  LV+NGP+A+AINA  +Q Y+ 
Sbjct: 233 YPYAGIDRNTCNFDKSKIAASIASFSVVNSIDEDQIAANLVKNGPLAIAINAVFMQTYIG 292

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GVS P  F C   ++ L H VL+VGYG         +   YWIIKNSWGE WGE GY+++
Sbjct: 293 GVSCP--FIC---SKRLDHGVLLVGYGSAGYAPIRMRDKDYWIIKNSWGESWGENGYYKI 347

Query: 331 YRGDGSCGINDYVRS 345
            RG   CG++  V +
Sbjct: 348 CRGRNICGVDSLVST 362


>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
 gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
 gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
 gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
          Length = 367

 Score =  221 bits (563), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 120/324 (37%), Positives = 182/324 (56%), Gaps = 31/324 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ-----------LLQDTEHGSGVYGLNE 90
           F +FL+Q+NK+Y    EY  R ++F  NL KI               D+   S  +G+N+
Sbjct: 57  FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116

Query: 91  FSDLSTAEFQAKYLGFKLKPS----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
           FSD +  E      GF L  S      +  +    P+I LP  +DWR+ + VT +KDQ +
Sbjct: 117 FSDKTPDEVLHSNTGFFLNLSQHYTLCENRIVKGAPDIRLPDYYDWRDTNKVTPIKDQGV 176

Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
           CGS WAF   GNIE  YA +  KL+ LSEQ+L+DCD+ D GC GG +  AF  ++  L G
Sbjct: 177 CGSCWAFVAIGNIESQYAIRHNKLIDLSEQQLLDCDEVDLGCNGGLMHLAFQELL--LMG 234

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINAYA 265
           G+E E  YPY+G ++ C L+ +   VK+N       RDE  + + +   GP+A+A++A  
Sbjct: 235 GVETEADYPYQGSEQMCTLDNRKIAVKLNSCFKYDIRDENKLKELVYTTGPVAIAVDAMD 294

Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
           +  Y  G+ +    +      +L+H+VL++G+G++        VPYWIIKNSWGE WGE 
Sbjct: 295 IINYRRGILNQCHIY------DLNHAVLLIGWGIENN------VPYWIIKNSWGEDWGEN 342

Query: 326 GYFRLYRGDGSCG-INDYVRSALV 348
           G+ R+ R   +CG +N++  S+++
Sbjct: 343 GFLRVRRNVNACGLLNEFGASSVI 366


>gi|4972585|gb|AAD34707.1|AF071801_1 cysteine proteinase [Paragonimus westermani]
          Length = 229

 Score =  221 bits (563), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 113/222 (50%), Positives = 144/222 (64%), Gaps = 10/222 (4%)

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P   DWRE+ AV  V++Q  CGS WAFS  GN+EG +  KT +LVSLS+Q+L+DCD  D 
Sbjct: 17  PERMDWREWGAVGPVENQGSCGSCWAFSVAGNVEGQWFLKTGQLVSLSKQQLVDCDVMDY 76

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
           GC GG  +NA+  IM    GGLE +  YPY G  + C LNK+    KI+  + +   E +
Sbjct: 77  GCGGGWPTNAYMEIMRM--GGLELQSDYPYVGVQQQCYLNKEKLLAKIDDLIVLGAYEEE 134

Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
            A YL E+GP++ A+NA  LQFY +G+SHP    C     +L+H+VL VGY       T 
Sbjct: 135 HAAYLAEHGPLSSALNAGYLQFYQSGISHPSYEECS--PASLNHAVLTVGYD------TE 186

Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             VPYWIIKNSWG GWGE GYFRLYRGDG+CGIN  + SA++
Sbjct: 187 NGVPYWIIKNSWGTGWGENGYFRLYRGDGTCGINRMITSAII 228


>gi|343412462|emb|CCD21670.1| cysteine peptidase (CP), putative [Trypanosoma vivax Y486]
          Length = 367

 Score =  220 bits (561), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 123/313 (39%), Positives = 171/313 (54%), Gaps = 18/313 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  F +++ ++Y T  E   RL +F  N+R+ ++     +    +G+  FSDL+  EF+
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91

Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            +Y   +     A   V  ++  P    P A DWR   AVT VKDQ  CGS W+FS  GN
Sbjct: 92  TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIGN 151

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-- 216
           IEG +AA    L SLSEQ L+ CD +D+GC GG + NAF+ I+ +  G +  EK+YPY  
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVS 211

Query: 217 -RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
             G++  C+         I G+V +  DE  +AKYL +NGP+AVA++A     Y  GV  
Sbjct: 212 GGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV-- 269

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                    +E L+H VL+VGY  D +K      PYWIIKNSW   WGEKGY R+ +G  
Sbjct: 270 ----VTSCTSEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319

Query: 336 SCGINDYVRSALV 348
            C +     SA+V
Sbjct: 320 QCLVAQLASSAVV 332


>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
          Length = 381

 Score =  220 bits (561), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 129/329 (39%), Positives = 180/329 (54%), Gaps = 29/329 (8%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A F  F+ +H + Y+   EY  RL +F+ NL +    Q  +  +  +G+  FSDL+  EF
Sbjct: 58  AQFAAFVRRHGRRYSGPKEYARRLRVFAANLARAAAHQALDP-TARHGVTPFSDLTREEF 116

Query: 100 QAKYLGFKLKPSYAD--RSVPAMIPN-----ITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           +A+  G +           VPA  P        LP +FDWR+  AVTGVK Q  CGS WA
Sbjct: 117 EARLTGLRAGGDVQRLMSGVPAAPPASKEEVARLPASFDWRDKGAVTGVKTQGACGSCWA 176

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSK 203
           FSTTG +EG     T +LV LSEQ+L+DCD           ++GC GG ++NA+  +M  
Sbjct: 177 FSTTGAVEGANFLATGELVDLSEQQLVDCDHTCSAVAQNECNNGCAGGLMTNAYSYLMES 236

Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAIN 262
             GGL E+  YPY G    CR +     V++  + +V + DE  +   LV  GP+AV +N
Sbjct: 237 --GGLMEQSAYPYTGAAGPCRFDPTQVAVRVANFTAVPAGDEAQIRAALVRRGPLAVGLN 294

Query: 263 AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD---RTKFTHKAVPYWIIKNSWG 319
           A  +Q YV GVS P+   C      ++H VL+VGYG       +  ++  PYWIIKNSWG
Sbjct: 295 AAFMQTYVGGVSCPL--ICP--RAWVNHGVLLVGYGARGFAALRLGYR--PYWIIKNSWG 348

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           + WGE+GY+RL RG   CG++  V +  V
Sbjct: 349 KQWGEQGYYRLCRGSNVCGVDSMVSAVAV 377


>gi|340053965|emb|CCC48258.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 441

 Score =  220 bits (561), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 123/313 (39%), Positives = 171/313 (54%), Gaps = 18/313 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  F +++ ++Y T  E   RL +F  N+R+ ++     +    +G+  FSDL+  EF+
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91

Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            +Y   +     A   V  ++  P    P A DWR   AVT VKDQ  CGS W+FS  GN
Sbjct: 92  TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIGN 151

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-- 216
           IEG +AA    L SLSEQ L+ CD +D+GC GG + NAF+ I+ +  G +  EK+YPY  
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVS 211

Query: 217 -RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
             G++  C+         I G+V +  DE  +AKYL +NGP+AVA++A     Y  GV  
Sbjct: 212 GGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                    +E L+H VL+VGY  D +K      PYWIIKNSW   WGEKGY R+ +G  
Sbjct: 272 SCT------SEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319

Query: 336 SCGINDYVRSALV 348
            C +     SA+V
Sbjct: 320 QCLVAQLASSAVV 332


>gi|340053963|emb|CCC48256.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 452

 Score =  220 bits (561), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 123/313 (39%), Positives = 171/313 (54%), Gaps = 18/313 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  F +++ ++Y T  E   RL +F  N+R+ ++     +    +G+  FSDL+  EF+
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91

Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            +Y   +     A   V  ++  P    P A DWR   AVT VKDQ  CGS W+FS  GN
Sbjct: 92  TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIGN 151

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-- 216
           IEG +AA    L SLSEQ L+ CD +D+GC GG + NAF+ I+ +  G +  EK+YPY  
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDSKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVS 211

Query: 217 -RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
             G++  C+         I G+V +  DE  +AKYL +NGP+AVA++A     Y  GV  
Sbjct: 212 GGGEEPPCKPRGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                    +E L+H VL+VGY  D +K      PYWIIKNSW   WGEKGY R+ +G  
Sbjct: 272 SCT------SEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319

Query: 336 SCGINDYVRSALV 348
            C +     SA+V
Sbjct: 320 QCLVAQLASSAVV 332


>gi|412992445|emb|CCO18425.1| unknown [Bathycoccus prasinos]
          Length = 500

 Score =  220 bits (560), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 136/318 (42%), Positives = 180/318 (56%), Gaps = 32/318 (10%)

Query: 55  TLVEYYSRLHIFSGNL-RKIQLLQDTEHGSGV--YGLNEFSDLSTAEFQAKYLGF----- 106
           T  EY  R+ IF  N  R I+   D   G G   +G+ +F DLS  EF+ +YLG      
Sbjct: 188 TEEEYEKRMEIFQENWKRAIEREIDDRKGGGSAKHGVTKFFDLSEEEFREQYLGLLSTST 247

Query: 107 ---KLKPSYADRSV--PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
                K ++    +  P+      LP+ +DWR   AVT VKDQ  CGS W FSTTG IEG
Sbjct: 248 SSSASKDAFRKHQMEAPSEEDLEKLPQYYDWRARGAVTPVKDQGQCGSCWTFSTTGAIEG 307

Query: 162 VYAAKTKKLVSLSEQELIDCD---------QEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
               KT KLVSLSEQ+L+DCD           D GC GG  SNA + I+    GGL+ EK
Sbjct: 308 ANFIKTGKLVSLSEQQLLDCDVGCAPDIPNACDSGCNGGLPSNAMEYIVEH--GGLDTEK 365

Query: 213 TYPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
           +YPY+   +  CR  +      I+ Y  V ++ET MA  LV+ GP+++ INA  +Q YV 
Sbjct: 366 SYPYKAYKEDTCRAKEGKLGATISNYTFVGKNETHMAHALVKYGPLSIGINAAWMQSYVG 425

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVD--RTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
           GV+ P  + C+   + L H VLIVGYG +       HK  PYW+IKNSWG GWGE+GY+R
Sbjct: 426 GVACP--WLCN--KDALDHGVLIVGYGEEGFAPARLHKE-PYWVIKNSWGMGWGEEGYYR 480

Query: 330 LYRGDGSCGINDYVRSAL 347
           + +  G+CG+N+ V +AL
Sbjct: 481 ICKDKGNCGVNNMVVAAL 498


>gi|328870281|gb|EGG18656.1| hypothetical protein DFA_04151 [Dictyostelium fasciculatum]
          Length = 347

 Score =  220 bits (560), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 129/320 (40%), Positives = 175/320 (54%), Gaps = 27/320 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAE 98
           F  F  ++NK Y +  E+  +L  F  +L++IQ L D    + V   +G+N+F+DLS  E
Sbjct: 30  FREFQLKYNKHYESH-EFAQKLATFKNSLKRIQELNDMAKRAKVDTEFGVNKFADLSKEE 88

Query: 99  FQAKYLGF---------KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
           F   YL              P Y+D+ +        LP +FDWR   AVT VKDQ  CGS
Sbjct: 89  FANYYLNKGGMESTDSETYAPDYSDKEIS------NLPTSFDWRTQGAVTPVKDQGQCGS 142

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
            W+FSTTGN+EG +      L  LSEQ L+DC  ++DGC GG +  A+D I+     G++
Sbjct: 143 CWSFSTTGNVEGQWFLAGNDLTGLSEQNLVDCSTKNDGCNGGLMPLAYDYIVEN--NGID 200

Query: 210 EEKTYPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
            E +YPY     K C+ N      KI+GY +VS +ET M   LV NGP+++A +A   Q+
Sbjct: 201 TEASYPYLAIQQKNCQFNPANIGAKIDGYYNVSSNETQMQINLVNNGPLSIAADAAEWQY 260

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  G+   I   C    +NL H +LIVGYG   T+F  +   +WIIKNSW   WG  G+ 
Sbjct: 261 YKKGIFSGIFGIC---GKNLDHGILIVGYGQQTTEFGTEL--FWIIKNSWSTDWGLSGFM 315

Query: 329 RLYRGDGSCGINDYVRSALV 348
            + RG G CGIN  V SA V
Sbjct: 316 LIKRGTGECGINLAVTSAYV 335


>gi|13507095|gb|AAK28439.1| cysteine protease 3 precursor [Clonorchis sinensis]
          Length = 320

 Score =  220 bits (560), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 132/336 (39%), Positives = 186/336 (55%), Gaps = 21/336 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           L   V++  F V+G   +    + +   L+  F  ++ K+Y+   + Y R  +F  NL +
Sbjct: 5   LCFLVALGFFGVLG-SNIPESENARQ--LYEEFKLKYKKSYSNDDDEY-RFRVFKDNLLR 60

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDW 132
           I+  Q+ E G+  YG+ +FSDL+  EF+ +YL  K      DR     I        FDW
Sbjct: 61  IKQFQNMERGTAKYGVTQFSDLTAQEFKVRYLRSKFGGVPVDREPVPFIRMDVDDDNFDW 120

Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS 192
           R + AV  V DQ  CGS WAFS  GNIEG +  KT  L+ LSEQ+L+DCD  D+GC GG+
Sbjct: 121 RNHGAVGPVLDQGDCGSCWAFSAVGNIEGQWFRKTDNLLQLSEQQLLDCDGVDEGCNGGT 180

Query: 193 ISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLV 252
              AF  I+    GGL+ +  YPY G +  CR+     +V ING   +  DE   A+ L 
Sbjct: 181 PQQAFRQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKILPEDEQIQAQMLK 238

Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
           E GP++ A+NA  LQ       HP+   CD   ++L+H+VL VGYG          +PYW
Sbjct: 239 ETGPLSSALNALFLQ-------HPLPALCDA--QSLNHAVLTVGYG------KEGRLPYW 283

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
            +KNSW   +GE GYFR+YRGDG+CGIN  V ++++
Sbjct: 284 TVKNSWSTMFGENGYFRIYRGDGTCGINTLVSTSII 319


>gi|302793594|ref|XP_002978562.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
 gi|300153911|gb|EFJ20548.1| hypothetical protein SELMODRAFT_109056 [Selaginella moellendorffii]
          Length = 343

 Score =  220 bits (560), Expect = 9e-55,   Method: Compositional matrix adjust.
 Identities = 126/314 (40%), Positives = 180/314 (57%), Gaps = 33/314 (10%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F +F+++  K Y T  EY  RL +F  NL  +  L+  +  + ++G+  F+DL+  E  +
Sbjct: 46  FKHFMQKFGKVYGTTEEYVHRLKVFQANLVHVMSLKKQDP-TAIHGITSFADLTPEEL-S 103

Query: 102 KYLGFKLKPSYADRSV--PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
           ++LGF+   +Y++R V    ++P   LP AFDWRE+ AVT VK Q  CGS W FSTTG +
Sbjct: 104 RFLGFR--KAYSNRVVNQAPLLPTDNLPEAFDWREHGAVTPVKFQGRCGSCWTFSTTGVV 161

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY--- 216
           EG    KT KL+SLSE++LIDCD +D+GCEGG + +A++ + ++   GLE ++ YPY   
Sbjct: 162 EGANFLKTGKLISLSEEQLIDCDYKDNGCEGGDMLSAYEYVKAR---GLEADEDYPYEEL 218

Query: 217 -------RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
                  RG    CR         I  Y  VS DE  +A  LV+NGP+++A+    L  Y
Sbjct: 219 GYRHKPVRG---PCRYQPSKVVATIANYSRVSEDEDQIAANLVKNGPLSIALRGNVLFTY 275

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GV+ P    C G    ++H VL+VGYGV+        + YW  KNSW + +GE GYFR
Sbjct: 276 EGGVACP--RICPG---EINHGVLLVGYGVE------NGLRYWTFKNSWTDEFGENGYFR 324

Query: 330 LYRGDGSCGINDYV 343
           L RG G C +   V
Sbjct: 325 LCRGVGVCDMTSEV 338


>gi|40806502|gb|AAR92156.1| putative cysteine protease 3 [Iris x hollandica]
          Length = 292

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 121/288 (42%), Positives = 168/288 (58%), Gaps = 24/288 (8%)

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV--PAMIPNITLPR 128
           R+ Q L  T     V+G+ +FSDL+  EF+  YLG +    +   S     ++P   LP 
Sbjct: 5   RRHQQLDPT----AVHGVTQFSDLTPGEFKRTYLGLRKGKKHLVGSAHEAPLLPTNDLPE 60

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---- 184
            FDWR+  AVTGVK+Q  CGS W+FST+G +EG     T KL +LSEQ+++DCD E    
Sbjct: 61  DFDWRDKGAVTGVKNQGSCGSCWSFSTSGALEGANFLATGKLETLSEQQMVDCDHECDAE 120

Query: 185 -----DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKATQVKINGYV 238
                D GC GG ++ AF  +     GGLE EK YPY G D+  C+ ++   +  ++ + 
Sbjct: 121 EPDDCDQGCNGGLMNTAFQYLQKV--GGLESEKDYPYTGTDRGTCKFDESKIKASVHNFS 178

Query: 239 SVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
            VS DE  +A  LV++GP+A+AINA  +Q Y+ GVS P  + C    ++L H VL+VGYG
Sbjct: 179 VVSIDEEQIAANLVKHGPLAIAINAVFMQTYIGGVSCP--YIC---GKHLDHGVLLVGYG 233

Query: 299 -VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
                    K  PYWIIKNSWGE WGE GY+++ RG   CG++  V +
Sbjct: 234 SAGYAPIRLKEKPYWIIKNSWGETWGENGYYKICRGRNVCGVDSMVST 281


>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
 gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
          Length = 337

 Score =  219 bits (559), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 129/343 (37%), Positives = 187/343 (54%), Gaps = 23/343 (6%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
             A   +L + V  + F +     L     ++   +F  +  +H K+Y++  E   RL I
Sbjct: 1   MIASTLILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDWEKARRLMI 60

Query: 66  FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI- 124
           FS  L  I+      + +   GLN+FSDL+ AEF+A ++G   +P Y DR +PA   ++ 
Sbjct: 61  FSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDR-LPAEDEDVD 119

Query: 125 --TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
             +LP + DWR+  AVT +KDQ  CGS WAFS   +IE  +   TK+LVSLSEQ+L+DCD
Sbjct: 120 VSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCD 179

Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSVS 241
             D GC+GG +  AF  ++    GG+  E  YPY G   +C  NK   +V +I G+  V+
Sbjct: 180 TVDAGCDGGLMETAFKFVVKN--GGVTTEAAYPYTGSVGSCNANKAKNKVAEITGFKVVT 237

Query: 242 RDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
            D  D     V   P+ V+I  +    Q Y +G+   +   CD   ++L H VL++GYG 
Sbjct: 238 EDSADALMKAVSKTPVTVSICGSDENFQNYKSGI---LSGKCD---DSLDHGVLLIGYG- 290

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--GDGSCGIN 340
                T   +PYWIIKNSWG  WGE G+ ++ R  GDG CG+N
Sbjct: 291 -----TEGGMPYWIIKNSWGTSWGEDGFMKIERKDGDGMCGMN 328


>gi|432091112|gb|ELK24324.1| Cathepsin W [Myotis davidii]
          Length = 370

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 123/323 (38%), Positives = 176/323 (54%), Gaps = 19/323 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  Q+N++Y+   EY  RL IF+ NL   Q LQ+ + G+  +G+  FSDL+  EF 
Sbjct: 41  VFTLFQIQYNRSYSNPAEYAHRLDIFARNLAHAQRLQEEDLGTAEFGVTAFSDLTEEEFD 100

Query: 101 AKYLGFKL--KPSYADRSVPAMIPNITLPRAFDWREYDAV-TGVKDQTMCGSSWAFSTTG 157
             Y   +   +    DR V +     ++P   DWR+   V + VKDQ  C   WA +  G
Sbjct: 101 QLYGNQRAAGRAPNVDREVGSDEWQESVPSTCDWRKAPGVMSPVKDQKTCSCCWAMAAAG 160

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           NIE  +  KT++ V +S QEL+DC +  DGC GG + +AF T+++    GL  EK YP++
Sbjct: 161 NIEAQWGIKTRQSVEVSVQELLDCGRCGDGCSGGFVWDAFITVLNN--SGLASEKDYPFQ 218

Query: 218 GDDKA-CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
           G  +A C+  K      I  ++ +S +E  +A YL   GP+ V IN   LQ Y  GV   
Sbjct: 219 GAVRAKCQAKKHKKVAWIQDFIMLSDNEQRIAWYLATEGPITVTINKKLLQQYQNGVIKA 278

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRT-----------KFTHKAVPYWIIKNSWGEGWGEK 325
            Q  CD   +N+ H VL+VG+G  ++               ++ PYWI+KNSWG  WGEK
Sbjct: 279 TQTTCD--PQNVDHVVLLVGFGKTKSVEGRQAKGVPGHSRRRSTPYWILKNSWGANWGEK 336

Query: 326 GYFRLYRGDGSCGINDYVRSALV 348
           GYFRL+RG  +CGI  Y  +A V
Sbjct: 337 GYFRLHRGSNACGITKYPITARV 359


>gi|339244637|ref|XP_003378244.1| cathepsin F [Trichinella spiralis]
 gi|316972865|gb|EFV56511.1| cathepsin F [Trichinella spiralis]
          Length = 317

 Score =  219 bits (558), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 120/282 (42%), Positives = 163/282 (57%), Gaps = 20/282 (7%)

Query: 76  LQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREY 135
           LQ  E G+ +YG   F+D++  EF+  YL      +   +   A++  +  P  FDWR Y
Sbjct: 3   LQQQEKGTAIYGPTIFADMTQDEFRKTYLNMLETSALLPKQRIALL-KVDRPNKFDWRNY 61

Query: 136 DAVTGVKDQTM----------CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
           + VT VK Q            CGSSWAFST  NIE  +A K   L+SLSEQ++IDCD+ +
Sbjct: 62  NVVTKVKRQVWHKMQKKFLGKCGSSWAFSTIANIESAWAIKFGDLISLSEQQIIDCDKIN 121

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
            GC GG    A+  I+     G++ E  YPY G   +C+LNK+  +V IN  V + ++ET
Sbjct: 122 RGCRGGQPLKAYHEIIRM--SGVQAESDYPYTGLHGSCKLNKEKIKVYINDTVLLHKNET 179

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNEN-LSHSVLIVGYGVDRTKF 304
            +A YL E+GP+AV +NA  L  Y  G+  P +  C   N N L+H   I+GYG  +  +
Sbjct: 180 TIANYLYEHGPVAVRMNADILMLYRKGIIKPTKSSC---NPNFLNHGATIIGYG--KESW 234

Query: 305 TH-KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
            H  + PYWIIKNSWG  WGE GYFRLYRG+ +CG+N  V S
Sbjct: 235 LHWWSNPYWIIKNSWGVDWGENGYFRLYRGNEACGVNRMVTS 276


>gi|149725427|ref|XP_001494683.1| PREDICTED: cathepsin W-like [Equus caballus]
          Length = 373

 Score =  219 bits (557), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 125/325 (38%), Positives = 184/325 (56%), Gaps = 22/325 (6%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  Q+N++Y++  EY  RL IF+ NL + Q LQ+ + G+  +G++ FSDL+  EF 
Sbjct: 41  VFTLFQIQYNRSYSSPAEYAHRLDIFARNLAQAQRLQEDDLGTAEFGVSPFSDLTEEEFG 100

Query: 101 AKYLGFKLKPSYAD---RSVPAMIPNITLPRAFDWREYDAV-TGVKDQTMCGSSWAFSTT 156
             Y G +   + A    R V +     T+P+  DW++   V + VK+Q MC   WA +  
Sbjct: 101 QLY-GHRRAAAGAPHVGRKVESEKWEKTVPQTCDWQKAAGVISSVKNQEMCNCCWAMAAA 159

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIE ++A    + V +S Q+L+DCD+  +GC+GG + +AF T+++    GL  EK YP+
Sbjct: 160 GNIEALWAITYHQSVEVSIQQLLDCDRCGNGCKGGFVWDAFLTVLNN--SGLASEKDYPF 217

Query: 217 RGDDKACRLNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           RGD K  R   K  +V  I  ++ +  DE  +A+YL  +GP+ V IN   LQ Y  GV  
Sbjct: 218 RGDAKPHRCQAKKPKVAWIQDFIRLPEDEQKIAEYLATHGPITVTINMKLLQQYQKGVIK 277

Query: 276 PIQFFCDGGNENLSHSVLIVGYG------------VDRTKFTHKAVPYWIIKNSWGEGWG 323
                CD   ++L HSVL+VG+G            V       ++  YWI+KNSWG  WG
Sbjct: 278 ATPTTCD--PQHLDHSVLLVGFGGGKSVEGRRPGAVSSQSRPRRSSSYWILKNSWGAKWG 335

Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
           E+GYFRL+RG  +CGI  Y  +ALV
Sbjct: 336 EEGYFRLHRGSNTCGITKYALTALV 360


>gi|303275866|ref|XP_003057227.1| predicted protein [Micromonas pusilla CCMP1545]
 gi|226461579|gb|EEH58872.1| predicted protein [Micromonas pusilla CCMP1545]
          Length = 329

 Score =  218 bits (556), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 128/335 (38%), Positives = 183/335 (54%), Gaps = 33/335 (9%)

Query: 36  VKHTALFNYFLEQHNKTYAT-LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
            +H   F+ F+ +H KTYA+   EY  RL IF+ N+ + + +  +      YG   F+DL
Sbjct: 2   TRHERDFDAFVLEHGKTYASDAKEYAKRLEIFAENMARAKEM--SARDGAEYGATPFADL 59

Query: 95  STAEFQAKYLGF---------KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQT 145
           +  EF +  L           +LK   + R +P  +P   +P  FDWR   AVT VK+Q 
Sbjct: 60  TEDEFASSLLMREPIDAARVERLKRHESSRVLP-HLPTENIPLNFDWRALGAVTPVKNQG 118

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNA 196
           MCGS W+FS TG +EG +  K+  LVSLSEQ+L+DCD           D GC+GG  +NA
Sbjct: 119 MCGSCWSFSATGAVEGAHFVKSGALVSLSEQQLVDCDHTCDPDSGTACDSGCDGGLPANA 178

Query: 197 FDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
              ++ +  GGL+ E  YPY   RGD +            I  Y  VS DE+ +A  LV+
Sbjct: 179 MAYVVKR--GGLDAEAAYPYLGARGDGRCKSKEDGPPAATITNYSFVSADESQIAAALVK 236

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH--KAVPY 311
           +GP++V I+A  +Q Y  GV+ P  + CD     L H VLIVG+G +        +  P+
Sbjct: 237 HGPLSVGIDARWMQLYRRGVACP--WACD--KTRLDHGVLIVGFGAEGRAPARGFRREPF 292

Query: 312 WIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
           W+IKNSWG  WGE+GY+++ +  GSCG+N  V +A
Sbjct: 293 WLIKNSWGARWGEEGYYKICKDKGSCGVNTMVLAA 327


>gi|633096|dbj|BAA04664.1| prepro NTP [Paragonimus westermani]
          Length = 245

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 116/253 (45%), Positives = 154/253 (60%), Gaps = 12/253 (4%)

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
           T EF AKYL   +      R  P  +     P   DWR   AVT V++Q  CGS WAFST
Sbjct: 3   TPEFAAKYLSAPVNNDQVKRVRPTGLK--AAPERMDWRAKGAVTPVENQGECGSCWAFST 60

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
            GN+EG +  KT +LVSLS+Q+L+DCD   +GC GG  ++++  IM    GGLE E  YP
Sbjct: 61  AGNVEGQWFIKTGQLVSLSKQQLVDCDMAAEGCNGGWPASSYLEIMYM--GGLESESDYP 118

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           Y G ++ C LNK+    KI+  + +  +E D A YL E+GP++  +NA ALQ+Y +GV  
Sbjct: 119 YVGVEQTCALNKEKLVAKIDDSIVLGPEEEDHAAYLAEHGPLSTLLNAVALQYYQSGVLK 178

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
           P   F +  +  L+H+VL VGY  +        +PYWIIKNSWG  WGEKGYFRL+RGD 
Sbjct: 179 PT--FEECPDTELNHAVLTVGYDKEGD------MPYWIIKNSWGTDWGEKGYFRLFRGDC 230

Query: 336 SCGINDYVRSALV 348
           +CGIN    SA++
Sbjct: 231 TCGINRMATSAII 243


>gi|308808478|ref|XP_003081549.1| Cysteine proteinase Cathepsin F (ISS) [Ostreococcus tauri]
 gi|116060014|emb|CAL56073.1| Cysteine proteinase Cathepsin F (ISS), partial [Ostreococcus tauri]
          Length = 293

 Score =  218 bits (555), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 124/293 (42%), Positives = 166/293 (56%), Gaps = 22/293 (7%)

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG-FKLKPSYADR-----SVPAMIPN 123
           L +    Q  + GS  +G+  FSDL+  EF  +YLG  KL   + ++      V   +P 
Sbjct: 3   LIRAATQQANDRGSAKHGVTRFSDLTPEEFAERYLGHVKLSSEHREKVRARGGVIEDLPT 62

Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD- 182
             LP  FDWR   AV+ VKDQ  CGS W FSTTG IEG +   T KLV LSEQ+L+DCD 
Sbjct: 63  KHLPAEFDWRFKGAVSRVKDQGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLLDCDV 122

Query: 183 --------QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI 234
                     D GC GG  SNA + I+    GG++ EK+YPY G+   C+ ++      +
Sbjct: 123 GCDPDVPNACDSGCNGGLPSNAMEYIVEH--GGIDTEKSYPYVGEKGECKADEGTLGATL 180

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
             +  VS DE  MA  LV++GP+++ INA  +Q Y+ GV+ P  + CD  +E L H VLI
Sbjct: 181 KNFSYVSSDEKQMAAALVKHGPLSIGINAAWMQTYIGGVACP--WLCD--SEALDHGVLI 236

Query: 295 VGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
           VGYG         +  PYWI+KNSW   WGE GY+R+ +  GSCGIN+ V +A
Sbjct: 237 VGYGSSGFAPVRWQQEPYWIVKNSWSPAWGEGGYYRICKDKGSCGINNMVVAA 289


>gi|118197532|ref|YP_874244.1| cathepsin [Ectropis obliqua NPV]
 gi|113472527|gb|ABI35734.1| cathepsin [Ectropis obliqua NPV]
          Length = 299

 Score =  218 bits (554), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 123/312 (39%), Positives = 177/312 (56%), Gaps = 25/312 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F+  +NK Y   +E   R  IF  NLR I + ++  +GS VY +N+FSDLST+E   KY 
Sbjct: 2   FVANYNKMYDDDLEKTKRYSIFRDNLRDINI-KNKLNGSAVYRINKFSDLSTSEIVLKYT 60

Query: 105 GFKLKPSYADRSVPAMIPNITL-------PRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  + P+  +R        I L       P  FDWR  + VT +K+Q +CG+ WAF+T  
Sbjct: 61  GLSVPPT--ERLTTNFCKTIVLDQPPGKGPLNFDWRHQNKVTSIKNQGVCGACWAFATLA 118

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           +IE  YA K    ++LSEQ++IDCD  D GC+GG +  AF+ ++    GG++ E  YPY 
Sbjct: 119 SIESQYAIKHNVQINLSEQQMIDCDYVDMGCDGGLLHTAFEQMIEM--GGVKHEHEYPYE 176

Query: 218 GDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
           G +  CRLN     VKI G Y  +   E  +   L   GP+ +AI+A  +  Y  GV + 
Sbjct: 177 GINMNCRLNDDNFAVKIIGCYRYIVLQEEKLKDLLRAVGPIPIAIDASGIANYYQGVIN- 235

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
              +C+  N  L+H+VL+VGYGV+        +PYW IKN+WGE WGE GYFR+ +   +
Sbjct: 236 ---YCE--NHGLNHAVLLVGYGVENN------IPYWTIKNTWGEDWGENGYFRVRQNINA 284

Query: 337 CGINDYVRSALV 348
           CG+ + + S+ V
Sbjct: 285 CGMTNELASSAV 296


>gi|398010921|ref|XP_003858657.1| cathepsin L-like protease, partial [Leishmania donovani]
 gi|322496866|emb|CBZ31937.1| cathepsin L-like protease, partial [Leishmania donovani]
          Length = 345

 Score =  218 bits (554), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 131/319 (41%), Positives = 182/319 (57%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A     LVSLSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213

Query: 214 YPY---RGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C   +K     +I+GYV +  +ET MA +L ENGP+A+A++A +   Y
Sbjct: 214 YPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  + L+H VL+VGY  ++T      VPYW+IKNSWGE WGEKGY R
Sbjct: 274 QSGVLTS----CAG--DALNHGVLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VAMGRNACLLSEYPVSAHV 340


>gi|222628593|gb|EEE60725.1| hypothetical protein OsJ_14236 [Oryza sativa Japonica Group]
          Length = 364

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 117/281 (41%), Positives = 164/281 (58%), Gaps = 26/281 (9%)

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAFDWREYDA 137
           +  +G+ +FSDL+  EF+ ++LG + +PS       +     ++P   LP  FDWRE+ A
Sbjct: 81  TATHGVTKFSDLTPGEFRDRFLGLR-RPSLEGLVGGEPHEAPILPTDGLPDDFDWREHGA 139

Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGC 188
           V  VKDQ  CGS W+FST+G +EG +   T KL  LSEQ+++DCD E         D GC
Sbjct: 140 VGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGC 199

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMA 248
            GG ++ AF  +M    GGL+ EK YPY G +  C+ +K     ++  +  +S +E  +A
Sbjct: 200 NGGLMTTAFSYLMKS--GGLQSEKDYPYAGRENTCKFDKSKIVAQVKNFSVISVNEDQIA 257

Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHK 307
             LV++GP+A+AINA  +Q Y+ GVS P  F C     +L H VL+VGYG         K
Sbjct: 258 ANLVKHGPLAIAINAAYMQTYIGGVSCP--FIC---GRHLDHGVLLVGYGSAGYAPIRFK 312

Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGINDYVRS 345
             PYWIIKNSWGE WGEKGY+++ RG      CG++  V S
Sbjct: 313 EKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSS 353


>gi|33622213|ref|NP_891858.1| cathepsin [Cryptophlebia leucotreta granulovirus]
 gi|33569322|gb|AAQ21608.1| cathepsin [Cryptophlebia leucotreta granulovirus]
          Length = 332

 Score =  218 bits (554), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 133/352 (37%), Positives = 203/352 (57%), Gaps = 33/352 (9%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           +  LSL + VS+F  + +  +++L   +   LF+ F++Q+NKTY T  E   +   F  N
Sbjct: 1   MKFLSLFLLVSAFSFI-ESVIYNLE--QSEKLFDSFVKQYNKTYLTEEERMIKFDNFKNN 57

Query: 70  LRKIQLLQDTEHGS--GVYGLNEFSDLSTAEFQAKYLGFKL--KPSYADRSVPAM----- 120
           LR   ++ +   GS   V+ +N++SDL+  +      GFKL  K +Y+  +V        
Sbjct: 58  LR---IINEKNRGSKHAVFDINKYSDLNKNDLLRHTTGFKLGLKKNYSFTTVKECGVVEI 114

Query: 121 --IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
              P + LP  FDWR+   VT VK+Q +CGS WAFST GNIE +Y  K  K++ LSEQ L
Sbjct: 115 KEEPQVLLPETFDWRDKHGVTPVKNQLICGSCWAFSTIGNIESLYNIKYDKVIDLSEQHL 174

Query: 179 IDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYV 238
           I+CD  ++GC GG +  A + I+ + GGG+  E+  PY G D  C+  K   ++ I+G  
Sbjct: 175 INCDLVNNGCNGGLMHWALENILQE-GGGVVSEENDPYYGLDSVCK--KTPWELNISGCK 231

Query: 239 S-VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
             + ++E  + + LV NGP++VAI+   +  Y +G++      C+  N  L+H+VL+VGY
Sbjct: 232 RYILQNENKLKELLVVNGPISVAIDVSDVINYKSGIAD----ICENNN-GLNHAVLLVGY 286

Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
           G       +  VPYWI+KNSWG  WGE G+FR+ R   SCG +N+Y  SA++
Sbjct: 287 G------EYDEVPYWILKNSWGIEWGEDGFFRIQRNKNSCGLLNEYASSAVL 332


>gi|194352748|emb|CAQ00102.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 368

 Score =  217 bits (553), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 127/328 (38%), Positives = 178/328 (54%), Gaps = 28/328 (8%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A F  F+ +H K Y+   EY  RL +F+ N+ +    Q  + G+  +G+  FSDL+  EF
Sbjct: 48  AQFAAFVRRHGKEYSGPEEYARRLRVFAANVARAAAHQALDPGA-RHGVTPFSDLTREEF 106

Query: 100 QAKYLGFK-----LKPSYADRSVPAMIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           +A+  G       L+ +    +           LP +FDWR+  AVT VK Q +CGS WA
Sbjct: 107 EARLTGLVGAGDVLRSARRMPAAAPATEEEVAALPASFDWRDKGAVTDVKMQGVCGSCWA 166

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD---------GCEGGSISNAFDTIMSK 203
           FSTTG +EG     T KL+ LSEQ+L+DCD   D         GC GG ++NA+  +MS 
Sbjct: 167 FSTTGAVEGANFVATGKLLDLSEQQLVDCDHTCDAVAKTECNSGCSGGLMTNAYRYLMSS 226

Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
             GGL E+  YPY G    CR ++    V++  + +V  DE  M   LV  GP+AV +NA
Sbjct: 227 --GGLMEQAAYPYTGAQGPCRFDRGKVAVRVANFTAVPLDEDQMRAALVRGGPLAVGLNA 284

Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV---DRTKFTHKAVPYWIIKNSWGE 320
             +Q YV GVS P+   C      ++H VL+VGYG       +  ++  PYW+IKNSWG 
Sbjct: 285 AFMQTYVGGVSCPL--ICP--RAMVNHGVLLVGYGARGFSALRLGYR--PYWLIKNSWGA 338

Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
            WGE GY++L RG   CG++  V +  V
Sbjct: 339 QWGEGGYYKLCRGRNVCGVDSMVSAVAV 366


>gi|343417244|emb|CCD20093.1| cysteine peptidase precursor [Trypanosoma vivax Y486]
          Length = 454

 Score =  217 bits (553), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 122/313 (38%), Positives = 170/313 (54%), Gaps = 18/313 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  F +++ ++Y T  E   RL +F  N+R+ ++     +    +G+  FSDL+  EF+
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91

Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            +Y   +     A   V  ++  P    P A DW    AVT VKDQ  CGS W+FS  GN
Sbjct: 92  TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWGRKGAVTPVKDQGTCGSCWSFSAIGN 151

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-- 216
           IEG +AA    L SLSEQ L+ CD +D+GC GG + NAF+ I+ +  G +  EK+YPY  
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDTKDNGCGGGLMDNAFEWIVKENSGKVYTEKSYPYVS 211

Query: 217 -RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
             G++  C+         I G+V +  DE  +AKYL +NGP+AVA++A     Y  GV  
Sbjct: 212 GGGEEPPCKPRGHKVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                    +E L+H VL+VGY  D +K      PYWIIKNSW   WGEKGY R+ +G  
Sbjct: 272 SCT------SEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319

Query: 336 SCGINDYVRSALV 348
            C +     SA+V
Sbjct: 320 QCLVAQRASSAVV 332


>gi|378943046|gb|AFC76264.1| cathepsin L-like protease [Leishmania major]
 gi|378943056|gb|AFC76269.1| cathepsin L-like protease [Leishmania major]
 gi|394331745|gb|AFN27095.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  217 bits (553), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 129/319 (40%), Positives = 177/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213

Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +  Y  S LV
Sbjct: 322 VTMGVNACLLTGYPVSVLV 340


>gi|378943060|gb|AFC76271.1| cathepsin L-like protease [Leishmania major]
          Length = 348

 Score =  217 bits (553), Expect = 6e-54,   Method: Compositional matrix adjust.
 Identities = 129/319 (40%), Positives = 177/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213

Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y
Sbjct: 214 YPYTSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +  Y  S LV
Sbjct: 322 VTMGVNACLLTGYPVSVLV 340


>gi|125547724|gb|EAY93546.1| hypothetical protein OsI_15336 [Oryza sativa Indica Group]
          Length = 348

 Score =  217 bits (552), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 117/281 (41%), Positives = 163/281 (58%), Gaps = 26/281 (9%)

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAFDWREYDA 137
           +  +G+ +FSDL+  EF+ + LG + +PS       +     ++P   LP  FDWRE+ A
Sbjct: 65  TATHGVTKFSDLTPGEFRDRLLGLR-RPSLEGLVGGEPHEAPILPTDGLPDDFDWREHGA 123

Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGC 188
           V  VKDQ  CGS W+FST+G +EG +   T KL  LSEQ+++DCD E         D GC
Sbjct: 124 VGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHECDASESRACDSGC 183

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMA 248
            GG ++ AF  +M    GGL+ EK YPY G +  C+ +K     ++  +  +S +E  +A
Sbjct: 184 NGGLMTTAFSYLMKS--GGLQSEKDYPYAGRENTCKFDKSKIVAQVKNFSVISVNEDQIA 241

Query: 249 KYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG-VDRTKFTHK 307
             LV++GP+A+AINA  +Q Y+ GVS P  F C     +L H VL+VGYG         K
Sbjct: 242 ANLVKHGPLAIAINAAYMQTYIGGVSCP--FIC---GRHLDHGVLLVGYGSAGYAPIRFK 296

Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGINDYVRS 345
             PYWIIKNSWGE WGEKGY+++ RG      CG++  V S
Sbjct: 297 EKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSS 337


>gi|354494740|ref|XP_003509493.1| PREDICTED: cathepsin W-like [Cricetulus griseus]
 gi|344243260|gb|EGV99363.1| Cathepsin W [Cricetulus griseus]
          Length = 376

 Score =  217 bits (552), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 124/369 (33%), Positives = 191/369 (51%), Gaps = 46/369 (12%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           +ALL+ +  ++   +  D     L  ++   +F  F  ++N++YA   EY  RL+IF+ N
Sbjct: 11  LALLTASQGLNDSFLTKDTGPRPLELIE---VFKLFQIKYNRSYANPAEYARRLNIFAHN 67

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT---- 125
           L + Q LQ+ + G+  +G   FSDL+  EF            Y  +  P  IPN+     
Sbjct: 68  LAQAQRLQEEDLGTAEFGETPFSDLTEEEFGQ---------LYGQQKAPKRIPNMVKKAG 118

Query: 126 -------LPRAFDWRE-YDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
                  +P   DWR+  + ++ +K+Q  C   WA +   NIE ++  KT+  V +S QE
Sbjct: 119 SEKWGQPVPSTCDWRKATNIISSIKNQKTCRCCWAIAAADNIEALWRIKTQHFVEVSVQE 178

Query: 178 LIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG--DDKACRLNKKATQVKIN 235
           L+DC++  +GC+GG + +A+ T+++    GL  EK YP++G  +   C  N+      I 
Sbjct: 179 LLDCERCGNGCDGGFVWDAYMTVLN--NSGLASEKDYPFKGYPNPHGCLANRYKKVAWIQ 236

Query: 236 GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
            +  + RDE  +A YL  +GP+ V IN   LQ Y  GV       CD   + + HSVL+V
Sbjct: 237 DFTMLGRDEQVIAGYLATHGPITVTINMKLLQGYQKGVIKATPTTCDP--QQVDHSVLLV 294

Query: 296 GYG----------------VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
           G+G                  + +   ++VPYWI+KNSWG  WGEKGYFRLYRG+ SCGI
Sbjct: 295 GFGKGKEKEDIQSGTILSQTRKPRKPRRSVPYWILKNSWGAEWGEKGYFRLYRGNNSCGI 354

Query: 340 NDYVRSALV 348
             Y  +A +
Sbjct: 355 TKYPITACL 363


>gi|290999038|ref|XP_002682087.1| predicted protein [Naegleria gruberi]
 gi|284095713|gb|EFC49343.1| predicted protein [Naegleria gruberi]
          Length = 349

 Score =  217 bits (552), Expect = 8e-54,   Method: Compositional matrix adjust.
 Identities = 133/342 (38%), Positives = 179/342 (52%), Gaps = 47/342 (13%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV-------YGLNEFSDL 94
           F +F + + K YAT  E++ R  IF  N+  +  L      + +       YG+ +F D+
Sbjct: 15  FQHFKKLYLKRYATEEEHHRRWKIFYDNINLVNQLNIMHKPNEIAGKPVAQYGITQFMDM 74

Query: 95  STAEFQAKYLGFKLKPSYADRSV------PAMIPNI-TLPRAFDWREYDAVTGVKDQTMC 147
           S  EF       KL P    + +      P     I  LP +FDWRE+ AVT VKDQ  C
Sbjct: 75  SPNEFAR----VKLLPPTKQKDINHTPTAPKEKYQIDALPESFDWREHGAVTAVKDQASC 130

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
           GS WAFST  NIEG Y      L   S Q+L+DCD  + GC GG    A   I  +  GG
Sbjct: 131 GSCWAFSTVENIEGAYFLAGHNLTKFSPQQLVDCDNLNCGCFGGFPFIAMQYIQKR--GG 188

Query: 208 LEEEKTYPY----RGDDKACRLNKKATQ-----------------VKINGYVSVSRDETD 246
           L  E +YPY     G+   C  NK                      K+ GY +VS++E D
Sbjct: 189 LATESSYPYCIPPLGNCFPCNTNKTYCPSGEYCNRTCSVQNYQLVAKVAGYENVSQNEDD 248

Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
           +A YLV+NGP+++ +NA  LQFY +G+S P+  +C     ++ H+VL+VG+G   T +  
Sbjct: 249 IAAYLVKNGPLSICLNAMWLQFYHSGISDPM--YCP---PDIDHAVLLVGFGT-HTNWLG 302

Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           +   YWI+KNSWGE WGEKGYFRL RG   CGIN  V +A+V
Sbjct: 303 EKTNYWIVKNSWGESWGEKGYFRLIRGKDKCGINTMVANAIV 344


>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
          Length = 324

 Score =  216 bits (551), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 126/336 (37%), Positives = 179/336 (53%), Gaps = 30/336 (8%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           L SL V   S  ++ ++ +H          F  F  +H KTY    E   R  IF  NLR
Sbjct: 6   LASLLVVAVSATLLKEDGVH----------FQSFKLKHGKTYKNQAEETKRFAIFRENLR 55

Query: 72  KIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKY-LGFKLKPSYADRSVPAMIPNITL 126
           KI+   + E+  G++    G+N+F+D++ AEF+A      K KPS        +   +++
Sbjct: 56  KIEA-HNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATKTFQLADGVSV 114

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P + DWR  + VT +KDQ  CGS W+F+  G+ EG YA  T KL   SEQ+L+DC  + +
Sbjct: 115 PESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTDLN 174

Query: 187 -GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDET 245
            GC+GG + + F  I +    GLE E  YPY G D +C  +      K++ YVSV  +E 
Sbjct: 175 YGCDGGYLDDTFPYIQTN---GLELESDYPYTGYDGSCSYDSSKVVTKVSSYVSVPANEQ 231

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            + + +   GP+A+AINA  LQFY +G+      +CD   E L H VL VGY       +
Sbjct: 232 ALLEAVGTAGPVAIAINADDLQFYFSGIID--DKYCD--PEWLDHGVLAVGYN------S 281

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIND 341
              + YW+IKNSWG  WGE GYFR  RG   CG+ +
Sbjct: 282 ENGLDYWLIKNSWGADWGESGYFRFLRGQNICGVKE 317


>gi|394331805|gb|AFN27125.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  216 bits (551), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 128/319 (40%), Positives = 176/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKACADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVSTEKS 213

Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +  Y  S  V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340


>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
 gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
          Length = 343

 Score =  216 bits (551), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 129/345 (37%), Positives = 187/345 (54%), Gaps = 25/345 (7%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
             A   +L + V  + F +     L     ++   +F  +  +H K+Y++ +E   RL I
Sbjct: 5   MIASTLILLVVVGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLEKARRLMI 64

Query: 66  FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI- 124
           FS  L  I+      + +   GLN+FSDL+ AEF+A ++G   +P Y DR +PA   ++ 
Sbjct: 65  FSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDR-LPAEDEDVD 123

Query: 125 --TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
             +LP + DWR+  AVT +KDQ  CGS WAFS   +IE  +   TK+LVSLSEQ+L+DCD
Sbjct: 124 VSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCD 183

Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVK---INGYVS 239
             D GC+GG +  AF  ++    GG+  E +YPY G   +C  NK A   K   I G+  
Sbjct: 184 TVDAGCDGGLMETAFKFVVKN--GGVTTEASYPYTGSVGSCNANKVAIINKVAEITGFKV 241

Query: 240 VSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
           V+ D  D     V   P+ V+I  +    Q Y +G+   +   C    ++L H VL++GY
Sbjct: 242 VTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGI---LSGQC---GDSLDHGVLLIGY 295

Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--GDGSCGIN 340
           G      T   +PYWIIKNSWG  WGE G+ ++ R  GDG CG+N
Sbjct: 296 G------TEGGMPYWIIKNSWGTSWGEDGFMKIERKDGDGICGMN 334


>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
          Length = 442

 Score =  216 bits (551), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 123/320 (38%), Positives = 176/320 (55%), Gaps = 26/320 (8%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  F   H + YA+  E   R  IF+ N++K   L + ++    +G NEF+D+S+ EFQ
Sbjct: 24  LFRDFKTTHARNYASADEERKRFEIFAANMKKAAEL-NRKNPMATFGPNEFADMSSEEFQ 82

Query: 101 AK------YLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            +      Y     +P    ++      N  + +  DWR   AVT VK+Q  CGS W+FS
Sbjct: 83  TRHNAARHYAAVMARPPKNTKTFTEEEINAAVGQKVDWRLKGAVTPVKNQGSCGSCWSFS 142

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           TTGNIEG +A  T +LVSLSEQEL+ CD  DDGC GG + NAF  ++S   G +  E +Y
Sbjct: 143 TTGNIEGQHAIATGQLVSLSEQELVSCDTVDDGCSGGLMDNAFGWLLSAHNGQITTEASY 202

Query: 215 PY---RGDDKACRLNKKATQV--KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           PY    G   AC  N  +  V   I  +  + + E DMA ++ + GP+++ ++A + Q Y
Sbjct: 203 PYVSGNGIVPACTFNSNSNPVGATITSFHDIPKTERDMAAFVFKYGPLSIGVDASSWQSY 262

Query: 270 VTGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           + G+ SH     C   +  + H VLIVG+  D T  T    PYWIIKNSW   WGE+GY 
Sbjct: 263 IGGILSH-----CS--DVQIDHGVLIVGF--DDTAST----PYWIIKNSWSSMWGEQGYI 309

Query: 329 RLYRGDGSCGINDYVRSALV 348
           R+ +G   CG+  +  S++V
Sbjct: 310 RVAKGSNQCGLTSFPSSSVV 329


>gi|357116897|ref|XP_003560213.1| PREDICTED: probable cysteine proteinase A494-like [Brachypodium
           distachyon]
          Length = 373

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 130/332 (39%), Positives = 187/332 (56%), Gaps = 35/332 (10%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSR-LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           A F  F+ +H K Y+   E Y+R L +F+ NL +    Q  + G+  +G+  FSDL+  E
Sbjct: 52  AKFAAFVRRHGKEYSGGAEEYARRLRVFAANLARAAAHQALDPGA-RHGVTPFSDLTPEE 110

Query: 99  FQAKYLGFKLK------PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           FQA+  G + +      P+ A  +   +    TLP +FDWR   AVT VK Q MCGS WA
Sbjct: 111 FQARLTGLQQQGTNNNMPAAARATAEELA---TLPASFDWRAKGAVTEVKMQGMCGSCWA 167

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSK 203
           FSTTG +EG +   T KL++LSEQ+L+DCD           D GC GG ++NA+  ++  
Sbjct: 168 FSTTGAVEGAHFVATGKLLNLSEQQLVDCDHTCDAVAKNECDSGCSGGLMTNAYTYLIR- 226

Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKY-LVENGPMAVAIN 262
             GGL E+  YPY G    CR +     V++  + +V  D+ D  +  LV  GP+AV +N
Sbjct: 227 -AGGLMEQAAYPYTGAQGTCRFDANKVAVRVTSFTAVPPDDEDQIRASLVRAGPLAVGLN 285

Query: 263 AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWIIKNSWG 319
           A  +Q Y+ GVS P+   C    + ++H VL+VGY   G+   +  ++  PYWIIKNSWG
Sbjct: 286 AAFMQTYLGGVSCPL--LCP--RKLINHGVLLVGYGARGLAPLRLGYR--PYWIIKNSWG 339

Query: 320 EGWGEKGYFRLYRGDGS---CGINDYVRSALV 348
           + WGE GY+RL RG  +   CG++  V +  V
Sbjct: 340 KEWGEGGYYRLCRGARNRNVCGVDSMVSAVAV 371


>gi|15824691|gb|AAL09443.1| cysteine protease [Leishmania donovani]
          Length = 443

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 129/319 (40%), Positives = 180/319 (56%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A     LVSLSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWARVGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213

Query: 214 YPY---RGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C   +K     +I+GYV +  +ET MA +L ENGP+A+A++A +   Y
Sbjct: 214 YPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV            + L+H VL+VGY  ++T      VPYW+IKNSWGE WGEKGY R
Sbjct: 274 QSGV------LTSCAGDALNHGVLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VAMGKNACLLSEYPVSAHV 340


>gi|328866896|gb|EGG15279.1| cysteine protease [Dictyostelium fasciculatum]
          Length = 347

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 128/335 (38%), Positives = 176/335 (52%), Gaps = 49/335 (14%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAE 98
           F  F  ++NK Y +  E+  +   F  NL +I  L      SG    +G+NEF+DLS  E
Sbjct: 27  FRDFQVKYNKVYGSH-EFSQKFVTFKDNLNRIDTLNANAAASGSDTKFGVNEFADLSVQE 85

Query: 99  FQAKYLGFKLKPSYADRSVPAMIPN-------------ITLPRAFDWREYDAVTGVKDQT 145
           F+  Y+           +VPA +P+              ++P +FDWR   AVT VK+Q 
Sbjct: 86  FRKFYM----------NAVPASVPSDAQVAGDYSDETLASIPSSFDWRTKGAVTPVKNQG 135

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE----------DDGCEGGSISN 195
            CGS W+FSTTGN+EG +      L  LSEQ L+DCD            DDGC GG   N
Sbjct: 136 QCGSCWSFSTTGNVEGQWFLAGNTLTGLSEQNLVDCDHHCMTYDGQQSCDDGCNGGLQPN 195

Query: 196 AFDTIMSKLGGGLEEEKTYPYR--GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVE 253
           AF  I+    GG++ E +YPY     DK C+        KI+ +  +S +ET +A YL  
Sbjct: 196 AFQYIIGN--GGIDTETSYPYLAVAQDK-CQFKASNIGAKISNWQMLSTNETQIAAYLAL 252

Query: 254 NGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
           NGP+++A +A   QFY+ GV       C    + L H +LIVGY  +   F H A PYW 
Sbjct: 253 NGPVSIAADAAEWQFYIGGV---FDLPC---GKALDHGILIVGYDTETNIFGH-AKPYWW 305

Query: 314 IKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           +KNSWG  WGE+GY ++ RG G CG+N +V ++ V
Sbjct: 306 VKNSWGASWGEQGYLKVLRGAGECGLNTFVSTSCV 340


>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
 gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
          Length = 327

 Score =  216 bits (550), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 124/321 (38%), Positives = 186/321 (57%), Gaps = 30/321 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  F++++NK+Y++  E   +   F  N+R I   +++   S VY +N +SD++  E  
Sbjct: 24  LFEDFVQKYNKSYSSEEERQIKFDNFKNNIRSINE-KNSLSNSAVYDINFYSDMNKNELL 82

Query: 101 AKYLGFK-------LKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSS 150
            K  GFK       L  S+  +    +I   P + LP +FDWR+   +T VK+Q  CGS 
Sbjct: 83  RKQTGFKINLKKNNLDLSWNIKCNKKLINGNPAVLLPDSFDWRDRHVITSVKNQRDCGSC 142

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFST  NIE +YA K  KL+ LSEQ+L++CD++++GC GG +  A + I+ +  GG+  
Sbjct: 143 WAFSTIANIESLYAIKYNKLLDLSEQQLVNCDEQNNGCNGGLMHWAMEEIIRQ--GGVSN 200

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYALQFY 269
           E  +PY   D  C+  +K   V ING    +  +E  + + L+ NGP+++AI+   +  Y
Sbjct: 201 ETDFPYTASDGFCK--RKQGFVNINGCNQFILSNEDRLRELLIFNGPISIAIDVIDVIDY 258

Query: 270 VTGVSHPIQFFCDGGNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
             G+S   +      N+N L+H+VL+VGYGV         +PYWI+KNSWG  WGE GYF
Sbjct: 259 SQGISSTCR------NDNGLNHAVLLVGYGVKNN------IPYWILKNSWGSQWGENGYF 306

Query: 329 RLYRGDGSCG-INDYVRSALV 348
           R+ R   SCG INDY  SA++
Sbjct: 307 RVQRNINSCGMINDYAASAIL 327


>gi|339896953|ref|XP_003392238.1| cathepsin L-like protease [Leishmania infantum JPCM5]
 gi|14349351|gb|AAC38832.2| cysteine protease [Leishmania chagasi]
 gi|17384031|emb|CAD12393.1| cysteine proteinase [Leishmania infantum]
 gi|321398984|emb|CBZ08377.1| cathepsin L-like protease [Leishmania infantum JPCM5]
          Length = 443

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 129/319 (40%), Positives = 180/319 (56%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A     LVSLSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213

Query: 214 YPY---RGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C   +K     +I+GYV +  +ET MA +L ENGP+A+A++A +   Y
Sbjct: 214 YPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV            + L+H VL+VGY  ++T      VPYW+IKNSWGE WGEKGY R
Sbjct: 274 QSGV------LTSCAGDALNHGVLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VVMGLNACLLSEYPVSAHV 340


>gi|343471272|emb|CCD16264.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score =  216 bits (549), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 122/326 (37%), Positives = 185/326 (56%), Gaps = 24/326 (7%)

Query: 30  LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN 89
           LH    ++    F  F ++++++Y    E   R  +F  ++ + +  +   +    +G+ 
Sbjct: 31  LHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRMFKQSMERAKE-EAAANPYATFGVT 87

Query: 90  EFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQT 145
           +FSD+S  E +A YL G K   +   R  P  + N++    P A DWR+  AVT VKDQ 
Sbjct: 88  QFSDMSPEELRATYLNGAKYYAAALKR--PRKVVNVSTGKAPPAVDWRKKGAVTPVKDQR 145

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLG 205
            CGS WAFS TGNIEG +     +L SLSEQ L+ CD  DDGC+GG +  A   I+S   
Sbjct: 146 KCGSCWAFSATGNIEGQWKVAGHELTSLSEQMLVSCDNMDDGCQGGLMDRALKWIVSSNK 205

Query: 206 GGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
           G +  E++YPY    GD   C ++ K    KI+G++++ +DE  +A++L +NGP+A+A++
Sbjct: 206 GNVFTEESYPYDSTDGDVPPCNMSGKVVGAKISGHINLPKDENAIAEWLAKNGPVAIAVD 265

Query: 263 AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGW 322
           A +   Y  GV           ++ L+H VL+VGY  D +K      PYWIIKNSWG+ W
Sbjct: 266 ASSFLDYKGGV------LTSCSSDALNHDVLLVGYD-DTSK-----PPYWIIKNSWGKKW 313

Query: 323 GEKGYFRLYRGDGSCGINDYVRSALV 348
           GE+GY R+ +G   C + +Y RSA+V
Sbjct: 314 GEEGYIRVEKGTNQCLMKEYARSAVV 339


>gi|332326581|gb|AEE42614.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 122/318 (38%), Positives = 177/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  +A    +L +LSEQ+L+ CD +D GC GG ++ AF+ ++  + G +  E +Y
Sbjct: 155 AVGNIESQWAVAGHRLTALSEQQLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMXTEDSY 214

Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY    GD  AC  + +     +I+GYV++   ET MA +L ++GP+++A++A +   Y 
Sbjct: 215 PYVSSTGDVPACTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYX 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV            + L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGKXLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|402585860|gb|EJW79799.1| cysteine protease 6 [Wuchereria bancrofti]
          Length = 242

 Score =  215 bits (548), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 107/224 (47%), Positives = 138/224 (61%), Gaps = 10/224 (4%)

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
            LP  FDW     VT VK+Q  CGS WAFS TGNIE ++A KT  L+SLSEQELIDCD  
Sbjct: 28  NLPNKFDWNTKGVVTPVKNQGSCGSCWAFSVTGNIESLWAIKTGNLISLSEQELIDCDVI 87

Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDE 244
           D+GC GG   NAF  I  K  GGLE E  YPY+  +  C L +    V I+  + + R+E
Sbjct: 88  DNGCNGGLPINAFREI--KRMGGLEPEDQYPYKAKNGTCHLVRAQIAVTIDDAIEIPRNE 145

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
           T M  ++ + GP++V I+A  L +Y +G+ HP +  C      ++H VLI GYG++    
Sbjct: 146 TVMKAWIAQRGPLSVGIDAELLAYYKSGILHPSKSRCPP--SKINHGVLITGYGIE---- 199

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               +PYW IKNSWGE WGE GYFRL RG   CG++D V SA++
Sbjct: 200 --NGLPYWTIKNSWGEEWGENGYFRLMRGKDICGVSDLVSSAII 241


>gi|394331739|gb|AFN27092.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 128/319 (40%), Positives = 176/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHCRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213

Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +  Y  S  V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340


>gi|340053966|emb|CCC48259.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
           Y486]
          Length = 447

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 123/313 (39%), Positives = 170/313 (54%), Gaps = 18/313 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  F +++ ++Y T  E   RL +F  N+R+ ++     +    +G+  FSDL+  EF+
Sbjct: 25  LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 83

Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            +Y   +     A   V  ++  P    P A DWR   AVT VKDQ  CGS W+FS  GN
Sbjct: 84  TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGSCGSCWSFSAIGN 143

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IEG +AA    L SLSEQ L+ CD +D+GC GG + NAF+ I+ +  G +  EK+YPY  
Sbjct: 144 IEGQWAAAGNPLTSLSEQMLVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVS 203

Query: 219 DDKA---CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           +D +   C          I G+V +  DE  +AKYL +NGP+AVA++A     Y  GV  
Sbjct: 204 EDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 263

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                    +E L+H VL+VGY  D +K      PYWIIKNSW   WGEKGY R+ +G  
Sbjct: 264 SCT------SEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 311

Query: 336 SCGINDYVRSALV 348
            C +     SA+V
Sbjct: 312 QCLVAQLASSAVV 324


>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
 gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 131/345 (37%), Positives = 187/345 (54%), Gaps = 25/345 (7%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           F     +L + +     M         LH ++ T     ++ +H K Y    E   R  I
Sbjct: 3   FLCKGKILPIALFFVLAMCADQAASRELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQI 62

Query: 66  FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV-PAMIPNI 124
           F  N+  I+      + S + G+N+F+DL+  EF+A + G+K +P  A R + P    N+
Sbjct: 63  FKSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAFWNGYK-RPLGASRKITPFKYENV 121

Query: 125 T-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD- 182
           T LP + DWR   AVT +KDQ +CGS WAFS     EG++  +T KLVSLSEQEL+DCD 
Sbjct: 122 TALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDCDV 181

Query: 183 -QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV 240
             +D GC+GG + +AF  I  K  GG+  E  YPY+G D  C   K+A++ VKI GY +V
Sbjct: 182 KGQDKGCQGGLMVDAFKFI--KRHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAV 239

Query: 241 SRDETDMAKYLVENGPMAVAINAYAL--QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
            ++        V N P++VAI+A +L  QFY +G+      F     ++++H V  VGYG
Sbjct: 240 PKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGI------FTGICGKDINHGVAAVGYG 293

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
                 ++    YWI+KNSWG  WGEKGY R+ R     +G CGI
Sbjct: 294 R-----SNSGSKYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGI 333


>gi|53748485|emb|CAH59428.1| cysteine protease 2 [Plantago major]
          Length = 245

 Score =  215 bits (547), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 112/243 (46%), Positives = 144/243 (59%), Gaps = 17/243 (6%)

Query: 113 ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
           AD +    +P   LP  FDWRE  AVT VK+Q  CGS W+FSTTG +EG     T +L+S
Sbjct: 1   ADENKAPKLPTSNLPEEFDWREKGAVTAVKNQGSCGSCWSFSTTGALEGANYLATGELIS 60

Query: 173 LSEQELIDCDQE----------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
           LSEQ+L+DCD E          D GC GG ++NAF+  +    GGL++EK YPY G D  
Sbjct: 61  LSEQQLVDCDHECDPEEGADSCDAGCNGGLMNNAFEYALK--AGGLQKEKDYPYTGKDGT 118

Query: 223 CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCD 282
           C+ +K      ++ +  VS DE  +A  LV+ GP+AV INA  +Q Y+ GVS P  + C 
Sbjct: 119 CKFDKTKIAASVHNFSVVSIDEDQIAANLVKYGPLAVGINAAWMQTYIGGVSCP--YIC- 175

Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDY 342
              ++L H VLIVGYG        K  PYWIIKNSWGE WGE GY+++ RG   CG+   
Sbjct: 176 --GKSLDHGVLIVGYGTGYAPVRLKNKPYWIIKNSWGESWGESGYYKICRGRNVCGVESM 233

Query: 343 VRS 345
           V S
Sbjct: 234 VSS 236


>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
          Length = 324

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 122/308 (39%), Positives = 167/308 (54%), Gaps = 20/308 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLS 95
           A F  F  +H KTY    E   R  IF  NLRKI+   + E+  G++    G+N+F+D++
Sbjct: 24  AHFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEA-HNAEYKQGIHSYTQGINKFADMT 82

Query: 96  TAEFQAKY-LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            AEF+A      K KPS        +   +++P + DWR  + VT +KDQ  CGS WAF+
Sbjct: 83  RAEFKAMLATQVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWAFA 142

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKT 213
             G+ EG YA  T KL   SEQ+L+DC  + + GC+GG + + F  I +    GLE E  
Sbjct: 143 VVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPYIQTN---GLELESD 199

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
           YPY G D  C         K++ YVSV  +E  + + +   GP+A+AINA  LQFY +G+
Sbjct: 200 YPYTGYDGYCSYESSKVVTKVSSYVSVPANEQALLEAVGTAGPVAIAINADDLQFYFSGI 259

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                 +CD   E L H VL VGY  +  +       YW+IKNSWG  WGE GYFR  RG
Sbjct: 260 ID--DKYCD--PEYLDHGVLAVGYDSENGR------DYWLIKNSWGADWGESGYFRFLRG 309

Query: 334 DGSCGIND 341
              CG+ +
Sbjct: 310 QNICGVKE 317


>gi|56553473|gb|AAV97878.1| recombinant cysteine protease [Cloning vector pQ-CPB]
          Length = 335

 Score =  214 bits (546), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 131/320 (40%), Positives = 176/320 (55%), Gaps = 28/320 (8%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
           + LF  F + + + YATL E   RL  F  NL  ++  Q +  H    +G+ +F DLS  
Sbjct: 27  SVLFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHAR--FGITKFFDLSEE 84

Query: 98  EFQAKYLG----FKLKPSYAD---RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
           EF  +YL     F     +A    R V A +   T P A DWRE  AVT VKDQ MCGS 
Sbjct: 85  EFATRYLSGATHFAKAKKFASQYYRKVGADLS--TAPAAVDWREKGAVTPVKDQGMCGSC 142

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS  GNIE  +   T  L+SLSEQEL+ CD  D+GC GG +  AFD +++   G +  
Sbjct: 143 WAFSAIGNIESKWYLATHSLISLSEQELVSCDDVDEGCNGGLMGQAFDWLLNNRNGAVYT 202

Query: 211 EKTYPY-RGDDKACRLNKKATQV---KINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
             +YPY  G+      ++ +  V    I+G+V++  +E  MA +L  NGP+A+A++A A 
Sbjct: 203 GASYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAF 262

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
             Y  GV       CDG  + L+H VL+VGY +         VPYW+IKNSWGE WGEKG
Sbjct: 263 MSYTGGVLTS----CDG--KQLNHGVLLVGYNMT------GEVPYWVIKNSWGENWGEKG 310

Query: 327 YFRLYRGDGSCGINDYVRSA 346
           Y R+ +G   C I +Y  SA
Sbjct: 311 YVRVRKGTNECLIQEYPVSA 330


>gi|157864851|ref|XP_001681134.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124428|emb|CAJ02284.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|378943050|gb|AFC76266.1| cathepsin L-like protease [Leishmania major]
 gi|378943052|gb|AFC76267.1| cathepsin L-like protease [Leishmania major]
 gi|378943054|gb|AFC76268.1| cathepsin L-like protease [Leishmania major]
 gi|378943058|gb|AFC76270.1| cathepsin L-like protease [Leishmania major]
 gi|394331737|gb|AFN27091.1| cysteine protease [Leishmania major]
 gi|394331741|gb|AFN27093.1| cysteine protease [Leishmania major]
 gi|394331747|gb|AFN27096.1| cysteine protease [Leishmania major]
 gi|394331749|gb|AFN27097.1| cysteine protease [Leishmania major]
 gi|394331751|gb|AFN27098.1| cysteine protease [Leishmania major]
 gi|394331753|gb|AFN27099.1| cysteine protease [Leishmania major]
 gi|394331755|gb|AFN27100.1| cysteine protease [Leishmania major]
 gi|394331757|gb|AFN27101.1| cysteine protease [Leishmania major]
 gi|394331759|gb|AFN27102.1| cysteine protease [Leishmania major]
 gi|394331761|gb|AFN27103.1| cysteine protease [Leishmania major]
 gi|394331763|gb|AFN27104.1| cysteine protease [Leishmania major]
 gi|394331765|gb|AFN27105.1| cysteine protease [Leishmania major]
 gi|394331767|gb|AFN27106.1| cysteine protease [Leishmania major]
 gi|394331769|gb|AFN27107.1| cysteine protease [Leishmania major]
 gi|394331771|gb|AFN27108.1| cysteine protease [Leishmania major]
 gi|394331773|gb|AFN27109.1| cysteine protease [Leishmania major]
 gi|394331775|gb|AFN27110.1| cysteine protease [Leishmania major]
 gi|394331777|gb|AFN27111.1| cysteine protease [Leishmania major]
 gi|394331779|gb|AFN27112.1| cysteine protease [Leishmania major]
 gi|394331781|gb|AFN27113.1| cysteine protease [Leishmania major]
 gi|394331783|gb|AFN27114.1| cysteine protease [Leishmania major]
 gi|394331785|gb|AFN27115.1| cysteine protease [Leishmania major]
 gi|394331787|gb|AFN27116.1| cysteine protease [Leishmania major]
 gi|394331789|gb|AFN27117.1| cysteine protease [Leishmania major]
 gi|394331791|gb|AFN27118.1| cysteine protease [Leishmania major]
 gi|394331793|gb|AFN27119.1| cysteine protease [Leishmania major]
 gi|394331795|gb|AFN27120.1| cysteine protease [Leishmania major]
 gi|394331797|gb|AFN27121.1| cysteine protease [Leishmania major]
 gi|394331799|gb|AFN27122.1| cysteine protease [Leishmania major]
 gi|394331801|gb|AFN27123.1| cysteine protease [Leishmania major]
 gi|394331803|gb|AFN27124.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  214 bits (546), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 128/319 (40%), Positives = 176/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213

Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +  Y  S  V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340


>gi|394331743|gb|AFN27094.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 126/313 (40%), Positives = 174/313 (55%), Gaps = 22/313 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213

Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDY 342
           +  G  +C +  Y
Sbjct: 322 VTMGVNACLLTGY 334


>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
          Length = 443

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 189/344 (54%), Gaps = 28/344 (8%)

Query: 16  TVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL 75
           TV V++ ++V     + +       LF  F   H + YA+  E   R  IF+GN++K  +
Sbjct: 3   TVIVAALLMV----CNAMGAPTTEVLFGNFKAAHARNYASPDEERKRFEIFAGNMKKAAV 58

Query: 76  LQDTEHGSGVYGLNEFSDLSTAEFQAKY------LGFKLKPSYADRSVPAMIPNITLPRA 129
           L + ++    +G NEF+D+++ EFQ ++         K +P    ++  A      + + 
Sbjct: 59  L-NRKNPMATFGPNEFADMTSEEFQTRHNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQ 117

Query: 130 FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCE 189
            DWR   AVT VK+Q  CGS W+FSTTGNIEG +A  T +LV++SEQEL+ CD  DDGC 
Sbjct: 118 IDWRLKGAVTPVKNQGACGSCWSFSTTGNIEGQHAIATGQLVAVSEQELVSCDPIDDGCN 177

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQV--KINGYVSVSRDE 244
           GG + NAF  ++S   G +  E  YPY    G   AC  + ++  V   I+ +  ++R E
Sbjct: 178 GGLMDNAFGWLISAHKGQIATEANYPYVSGNGIVPACSSSPESKPVGATISAFQDIARTE 237

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
            DMA ++ ++GP+++ ++A   Q Y  G    I  +C    + + H VLIVG+  D T  
Sbjct: 238 EDMAAFVFKHGPLSIGVDASTWQSYAGG----IMSYCP--QDQIDHGVLIVGF--DDTAS 289

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           T    PYWIIKNSW   WGE+GY R+ +G   CG+  +  S++V
Sbjct: 290 T----PYWIIKNSWTANWGEEGYIRVAKGSNQCGLTSHPSSSVV 329


>gi|157864853|ref|XP_001681135.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|157864857|ref|XP_001681137.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124429|emb|CAJ02285.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124431|emb|CAJ02287.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 443

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 128/318 (40%), Positives = 176/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+Y
Sbjct: 155 AVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSY 214

Query: 215 PY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYH 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C +  Y  S  V
Sbjct: 323 TMGVNACLLTGYPVSVHV 340


>gi|157864845|ref|XP_001681131.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124425|emb|CAJ02281.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 348

 Score =  214 bits (545), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 127/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVSTEKS 213

Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C  + + A   +I+GYVS+   E  M  +L +NGP+++A++A +   Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMTAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +  Y  S  V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340


>gi|343477619|emb|CCD11596.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 124/343 (36%), Positives = 188/343 (54%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  ++ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRMFKQSMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+ +FSD+S  EF+A YL G K   +   R  P  +  ++    P 
Sbjct: 72  AKE-EAAANPYATFGVTQFSDMSPEEFRATYLNGAKYYAAALKR--PRKVVTVSTGKAPP 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  CGS WAFS  GNIEG +     +L SLSEQ L+ CD  DDGC
Sbjct: 129 AIDWRKKGAVTPVKDQRKCGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDNMDDGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDET 245
           +GG +  A   I+S   G +  E++YPY    GD   C  + K    KI+G +++ +DE 
Sbjct: 189 QGGLMDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A++A +   Y  GV           ++ L+H VL+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPIAIAVDASSFLDYTGGV------LTSCSSDALNHDVLLVGYD-DSSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSWG+ WGE+GY R+ +G   C + +Y RSA+V
Sbjct: 300 ---PPYWIIKNSWGKKWGEEGYIRVEKGTNQCLMKEYARSAVV 339


>gi|1848231|gb|AAB48120.1| cathepsin L-like protease [Leishmania major]
          Length = 443

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 128/318 (40%), Positives = 176/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+Y
Sbjct: 155 AVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSY 214

Query: 215 PY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYH 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C +  Y  S  V
Sbjct: 323 TMGVNACLLTGYPVSVHV 340


>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
          Length = 324

 Score =  214 bits (544), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 125/308 (40%), Positives = 176/308 (57%), Gaps = 21/308 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLST 96
           A F  F  +H KTY    E   R +IF+ N+R I+      E G   Y  G+N+F+D+S 
Sbjct: 24  AKFQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMSQ 83

Query: 97  AEFQAKY-LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            EF+    L    KP+    S   +   + +P + DWR+   VTGVKDQ  CGS WAFS 
Sbjct: 84  EEFKTMLTLSASRKPTLETTSY--VKTGVEIPSSVDWRKEGRVTGVKDQGDCGSCWAFSI 141

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           TG+ EG YA K+ KLVSLSEQ+LIDC  +   GC+GGS+ + F  +M     GL+ E++Y
Sbjct: 142 TGSTEGAYARKSGKLVSLSEQQLIDCCTDTSAGCDGGSLDDNFKYVMKD---GLQSEESY 198

Query: 215 PYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
            Y+G+D AC+ N  +   K++ Y S+ + DE  + + +   GP++V ++A  L  Y +G+
Sbjct: 199 TYKGEDGACKYNVASVVTKVSKYTSIPAEDEDALLEAVATVGPVSVGMDASYLSSYDSGI 258

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                   D     L+H++L VGYG +  K       YWIIKNSWG  WGE+GYFRL RG
Sbjct: 259 YEDQ----DCSPAGLNHAILAVGYGTENGK------DYWIIKNSWGASWGEQGYFRLARG 308

Query: 334 DGSCGIND 341
              CGI++
Sbjct: 309 KNQCGISE 316


>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
          Length = 428

 Score =  214 bits (544), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 120/319 (37%), Positives = 179/319 (56%), Gaps = 24/319 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  F   H + YA+  E   R  IF+GN++K  +L + ++    +G NEF+D+++ EFQ
Sbjct: 9   LFGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVL-NRKNPMATFGPNEFADMTSEEFQ 67

Query: 101 AKY------LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            ++         K +P    ++  A      + +  DWR   AVT VK+Q  CGS W+FS
Sbjct: 68  TRHNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQIDWRLKGAVTPVKNQGACGSCWSFS 127

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           TTGNIEG +A  T +LV++SEQEL+ CD  DDGC GG + NAF  ++S   G +  E  Y
Sbjct: 128 TTGNIEGQHAIATGQLVAVSEQELVSCDPIDDGCNGGLMDNAFGWLISAHKGQIATEANY 187

Query: 215 PY---RGDDKACRLNKKATQV--KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           PY    G   AC  + ++  V   I+ +  ++R E DMA ++ ++GP+++ ++A   Q Y
Sbjct: 188 PYVSGNGIVPACSSSPESKPVGATISAFQDIARTEEDMAAFVFKHGPLSIGVDASTWQSY 247

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             G    I  +C    + + H VLIVG+  D T  T    PYWIIKNSW   WGE+GY R
Sbjct: 248 AGG----IMSYCP--QDQIDHGVLIVGF--DDTAST----PYWIIKNSWTANWGEEGYIR 295

Query: 330 LYRGDGSCGINDYVRSALV 348
           + +G   CG+  +  S++V
Sbjct: 296 VAKGSNQCGLTSHPSSSVV 314


>gi|157864847|ref|XP_001681132.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124426|emb|CAJ02282.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 443

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 128/318 (40%), Positives = 176/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAVKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+Y
Sbjct: 155 AVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKSY 214

Query: 215 PY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSYH 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C +  Y  S  V
Sbjct: 323 TMGVNACLLTGYPVSVHV 340


>gi|154332645|ref|XP_001562139.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059587|emb|CAM37169.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 441

 Score =  213 bits (543), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 131/320 (40%), Positives = 176/320 (55%), Gaps = 28/320 (8%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
           + LF  F + + + YATL E   RL  F  NL  ++  Q +  H    +G+ +F DLS  
Sbjct: 35  SVLFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHAR--FGITKFFDLSEE 92

Query: 98  EFQAKYLG----FKLKPSYAD---RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
           EF  +YL     F     +A    R V A +   T P A DWRE  AVT VKDQ MCGS 
Sbjct: 93  EFATRYLSGATHFAKAKKFASQYYRKVGADLS--TAPAAVDWREKGAVTPVKDQGMCGSC 150

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS  GNIE  +   T  L+SLSEQEL+ CD  D+GC GG +  AFD +++   G +  
Sbjct: 151 WAFSAIGNIESKWYLATHSLISLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYT 210

Query: 211 EKTYPY-RGDDKACRLNKKATQV---KINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
             +YPY  G+      ++ +  V    I+G+V++  +E  MA +L  NGP+A+A++A A 
Sbjct: 211 GASYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAF 270

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
             Y  GV       CDG  + L+H VL+VGY +         VPYW+IKNSWGE WGEKG
Sbjct: 271 MSYTGGVLTS----CDG--KQLNHGVLLVGYNMT------GEVPYWLIKNSWGENWGEKG 318

Query: 327 YFRLYRGDGSCGINDYVRSA 346
           Y R+ +G   C I +Y  SA
Sbjct: 319 YVRVRKGTNECLIQEYPVSA 338


>gi|394331735|gb|AFN27090.1| cysteine protease [Leishmania major]
          Length = 348

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 127/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWR+  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213

Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +  Y  S  V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340


>gi|157864849|ref|XP_001681133.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124427|emb|CAJ02283.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 348

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 127/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213

Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWG+ WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGKDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +  Y  S  V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340


>gi|154332649|ref|XP_001562141.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059589|emb|CAM37171.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 441

 Score =  213 bits (542), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 128/318 (40%), Positives = 176/318 (55%), Gaps = 24/318 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
           + LF  F + + + YATL E   RL  F  NL  ++  Q +  H    +G+ +F DLS  
Sbjct: 35  SVLFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHAR--FGITKFFDLSEE 92

Query: 98  EFQAKYLG----FKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           EF  +YL     F     +A +    +  ++ T P A DWRE  AVT VKDQ MCGS WA
Sbjct: 93  EFATRYLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWA 152

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FS  GNIE  +   T  L+SLSEQEL+ CD  D+GC GG +  AFD +++   G +    
Sbjct: 153 FSAIGNIESQWYLATHSLISLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYTGV 212

Query: 213 TYPY-RGDDKACRLNKKATQVK---INGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           +YPY  G+      ++ +  V    I+G+V++  +E  MA +L  NGP+A+A++A A   
Sbjct: 213 SYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMS 272

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV       CDG  + L+H VL+VGY +         VPYW+IKNSWGE WGEKGY 
Sbjct: 273 YTGGVLTS----CDG--KQLNHGVLLVGYNMT------GEVPYWLIKNSWGENWGEKGYV 320

Query: 329 RLYRGDGSCGINDYVRSA 346
           R+ +G   C I +Y  SA
Sbjct: 321 RVRKGTNECLIQEYPVSA 338


>gi|157864855|ref|XP_001681136.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124430|emb|CAJ02286.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 348

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 127/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213

Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C  + + A   +I+GYVS+   E  M  +L +NGP+++A++A +   Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMTAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +  Y  S  V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340


>gi|115472081|ref|NP_001059639.1| Os07g0480900 [Oryza sativa Japonica Group]
 gi|27261016|dbj|BAC45132.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113611175|dbj|BAF21553.1| Os07g0480900 [Oryza sativa Japonica Group]
 gi|215693312|dbj|BAG88694.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 376

 Score =  213 bits (541), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 128/335 (38%), Positives = 178/335 (53%), Gaps = 35/335 (10%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A F  F+ +H + Y+   EY  RL +F+ NL +    Q  +  +  +G+  FSDL+  EF
Sbjct: 46  AQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQALDP-TARHGVTPFSDLTREEF 104

Query: 100 QAKYLGFK--LKPSYADRSVPAMIPNIT-----LPRAFDWREYDAVTGVKDQTMCGSSWA 152
           +A+  G    +      R +P+  P        LP +FDWR+  AVT VK Q  CGS WA
Sbjct: 105 EARLTGLAADVGDDVRRRPMPSAAPATEEEVSGLPASFDWRDRGAVTDVKMQGACGSCWA 164

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSK 203
           FSTTG +EG     T  L+ LSEQ+L+DCD           D GC GG ++NA+  +MS 
Sbjct: 165 FSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTECDSGCGGGLMTNAYAYLMSS 224

Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRD-------ETDMAKYLVENGP 256
             GGL E+  YPY G    CR +     V++  +  V+         +  M   LV +GP
Sbjct: 225 --GGLMEQSAYPYTGAQGTCRFDANRVAVRVANFTVVAPPGGNDGDGDAQMRAALVRHGP 282

Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAVPYWI 313
           +AV +NA  +Q YV GVS P+   C      ++H VL+VGY   G    +  H+  PYWI
Sbjct: 283 LAVGLNAAYMQTYVGGVSCPL--VCP--RAWVNHGVLLVGYGERGFAALRLGHR--PYWI 336

Query: 314 IKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           IKNSWG+ WGE+GY+RL RG   CG++  V +  V
Sbjct: 337 IKNSWGKAWGEQGYYRLCRGRNVCGVDTMVSAVAV 371


>gi|114679921|ref|YP_758371.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
 gi|39598652|gb|AAR28838.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
          Length = 359

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 129/328 (39%), Positives = 175/328 (53%), Gaps = 44/328 (13%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+  +N+TY   VE   R   F  NL+ I  L      S  Y +N+FSDL+  E  A
Sbjct: 54  FERFVRDYNRTYIDSVEREQRYETFVQNLKNINRLNQKSQAS--YDINKFSDLTKDEVVA 111

Query: 102 KYLGFKLKPS-----YADRS--------------VPAMIPNITLPRAFDWREYDAVTGVK 142
           ++ G  L PS     Y D +               P  +P++     +DWR    VT VK
Sbjct: 112 RFTG--LDPSLAAAAYTDNNGTQYQLCKVVVVDGTPGRVPDL-----WDWRNSQKVTSVK 164

Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMS 202
            Q +CGS WAF++  NIE  YA +  +L+ LSEQ+L+DCDQ D GC GG +  AF  I+ 
Sbjct: 165 QQGVCGSCWAFASVANIESQYAIRHDRLLDLSEQQLVDCDQIDQGCSGGLMHLAFQEILQ 224

Query: 203 KLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAI 261
              GGLE E  YPY+G D ACRLN +   VK++       RDE  + + +   GP+AVAI
Sbjct: 225 M--GGLESELVYPYQGVDYACRLNPRKFDVKLSDCHRYDLRDERKLRELVYTVGPIAVAI 282

Query: 262 NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
           +   +  Y +G+       C+  N  L+H+VL+VG+G++         PYWI+KNSWG  
Sbjct: 283 DCIDIIDYKSGIVS----MCN--NNGLNHAVLLVGFGIEFD------TPYWILKNSWGND 330

Query: 322 WGEKGYFRLYRGDGSCG-INDYVRSALV 348
           WGEKGYFRL R    CG +N+   SA V
Sbjct: 331 WGEKGYFRLKRNINGCGMMNELAASATV 358


>gi|72389847|ref|XP_845218.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389849|ref|XP_845219.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389851|ref|XP_845220.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389857|ref|XP_845223.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359926|gb|AAX80351.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359927|gb|AAX80352.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359928|gb|AAX80353.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359931|gb|AAX80356.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801753|gb|AAZ11659.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801754|gb|AAZ11660.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801755|gb|AAZ11661.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801758|gb|AAZ11664.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 450

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
           V LL++   ++S        L  LH  +   + F  F +++ K Y    E   R   F  
Sbjct: 14  VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           N+ + ++ Q   +    +G+  FSD++  EF+A+Y       + A + +   + N+T   
Sbjct: 68  NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWRE  AVT VKDQ  CGS WAFST GNIEG +      LVSLSEQ L+ CD  D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
            GC GG + NAF+ I++  GG +  E +YPY    G+   C++N       I  +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           DE  +A YL ENGP+A+A++A +   Y  G+           +E L H VL+VGY  +  
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                  PYWIIKNSW   WGE GY R+ +G   C +N  V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339


>gi|72389855|ref|XP_845222.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389865|ref|XP_845227.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389867|ref|XP_845228.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359930|gb|AAX80355.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359935|gb|AAX80360.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359936|gb|AAX80361.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801757|gb|AAZ11663.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801762|gb|AAZ11668.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801763|gb|AAZ11669.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  213 bits (541), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
           V LL++   ++S        L  LH  +   + F  F +++ K Y    E   R   F  
Sbjct: 14  VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           N+ + ++ Q   +    +G+  FSD++  EF+A+Y       + A + +   + N+T   
Sbjct: 68  NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWRE  AVT VKDQ  CGS WAFST GNIEG +      LVSLSEQ L+ CD  D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
            GC GG + NAF+ I++  GG +  E +YPY    G+   C++N       I  +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           DE  +A YL ENGP+A+A++A +   Y  G+           +E L H VL+VGY  +  
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                  PYWIIKNSW   WGE GY R+ +G   C +N  V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339


>gi|13625989|gb|AAK35220.1|AF362769_1 pre-procathepsin L [Paragonimus westermani]
          Length = 235

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 104/224 (46%), Positives = 142/224 (63%), Gaps = 10/224 (4%)

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
           T P + DWR+  AV  V+ Q  CGS WAFS T N+EG +  KT +LVSLS+Q+L+DCD+ 
Sbjct: 21  TAPASVDWRKKGAVGPVEHQGSCGSCWAFSVTANVEGQWFLKTGRLVSLSKQQLVDCDRL 80

Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDE 244
           D GC GG     +  I  K  GGLE +  YPY G ++ACRL++     KI+  + + ++E
Sbjct: 81  DHGCSGGYPPYTYKEI--KRMGGLELQSAYPYTGWEQACRLDRSKLFAKIDDSIVLEKNE 138

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
              A +L E+GPM+  +NA  LQFY  G+ HP ++ C    E L+H+VL VGY  +R   
Sbjct: 139 EKQAAWLAEHGPMSTCLNAGPLQFYRYGILHPSEYACS--PEGLNHAVLTVGYDTER--- 193

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               VPYW ++NSWG  WGE GYFR+YRGDG+CGI+    SA++
Sbjct: 194 ---GVPYWTVRNSWGTRWGENGYFRIYRGDGTCGIDRLTTSAII 234


>gi|72389861|ref|XP_845225.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|72389863|ref|XP_845226.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359933|gb|AAX80358.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|62359934|gb|AAX80359.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801760|gb|AAZ11666.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|70801761|gb|AAZ11667.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
           V LL++   ++S        L  LH  +   + F  F +++ K Y    E   R   F  
Sbjct: 14  VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           N+ + ++ Q   +    +G+  FSD++  EF+A+Y       + A + +   + N+T   
Sbjct: 68  NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWRE  AVT VKDQ  CGS WAFST GNIEG +      LVSLSEQ L+ CD  D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
            GC GG + NAF+ I++  GG +  E +YPY    G+   C++N       I  +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           DE  +A YL ENGP+A+A++A +   Y  G+           +E L H VL+VGY  +  
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                  PYWIIKNSW   WGE GY R+ +G   C +N  V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339


>gi|332326587|gb|AEE42617.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 121/318 (38%), Positives = 175/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  +A    +L +LSEQ+L+ CD +D GC GG ++ AF+ ++  + G +  E +Y
Sbjct: 155 AVGNIESQWAVAGHRLTALSEQQLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSY 214

Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY    GD   C  + +     +I+GYV++   ET MA +L ++GP+++A++A +   Y 
Sbjct: 215 PYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYE 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV            + L+H VL+VGY           VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGYNXT------GEVPYWVIKNSWGEDWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
 gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
          Length = 300

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 124/307 (40%), Positives = 172/307 (56%), Gaps = 23/307 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  +  +H K+Y++  E   RL IFS  L  I+      + +   GLN+FSDL+ AEF+
Sbjct: 1   MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60

Query: 101 AKYLGFKLKPSYADRSVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           A Y+G    P Y DR  PA   ++   +LP + DWR+  AVT +KDQ  CGS WAFS   
Sbjct: 61  ANYVGKFKPPRYQDRR-PAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           +IE  +   TK+LVSLSEQ+LIDCD  D GC+GG   +AF  ++    GG+  E+ YPY 
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVEN--GGVTTEEAYPYT 177

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
           G   +C  NK    V+I GY  V++D  D     V   P+ V I  +    Q Y +G+  
Sbjct: 178 GFAGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGI-- 234

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--G 333
            +   C    +   H+VL++GYG      T   +PYWIIKNSWG  WGE G+ R+ +  G
Sbjct: 235 -LSGHCSNSRD---HAVLVIGYG------TEGGMPYWIIKNSWGTSWGEDGFMRIKKKDG 284

Query: 334 DGSCGIN 340
           +G CG+N
Sbjct: 285 EGMCGMN 291


>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
 gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
 gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
          Length = 300

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 124/307 (40%), Positives = 172/307 (56%), Gaps = 23/307 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  +  +H K+Y++  E   RL IFS  L  I+      + +   GLN+FSDL+ AEF+
Sbjct: 1   MFEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFR 60

Query: 101 AKYLGFKLKPSYADRSVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           A Y+G    P Y DR  PA   ++   +LP + DWR+  AVT +KDQ  CGS WAFS   
Sbjct: 61  ANYVGKFKPPRYQDRR-PAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           +IE  +   TK+LVSLSEQ+LIDCD  D GC+GG   +AF  ++    GG+  E+ YPY 
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVEN--GGVTTEEAYPYT 177

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
           G   +C  NK    V+I GY  V++D  D     V   P+ V I  +    Q Y +G+  
Sbjct: 178 GFAGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGI-- 234

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--G 333
            +   C    +   H+VL++GYG      T   +PYWIIKNSWG  WGE G+ R+ +  G
Sbjct: 235 -LSGHCSNSRD---HAVLVIGYG------TEGGMPYWIIKNSWGTSWGEDGFMRIKKEDG 284

Query: 334 DGSCGIN 340
           +G CG+N
Sbjct: 285 EGMCGMN 291


>gi|72389853|ref|XP_845221.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359929|gb|AAX80354.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801756|gb|AAZ11662.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 449

 Score =  212 bits (540), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
           V LL++   ++S        L  LH  +   + F  F +++ K Y    E   R   F  
Sbjct: 14  VVLLAIAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           N+ + ++ Q   +    +G+  FSD++  EF+A+Y       + A + +   + N+T   
Sbjct: 68  NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWRE  AVT VKDQ  CGS WAFST GNIEG +      LVSLSEQ L+ CD  D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
            GC GG + NAF+ I++  GG +  E +YPY    G+   C++N       I  +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           DE  +A YL ENGP+A+A++A +   Y  G+           +E L H VL+VGY  +  
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                  PYWIIKNSW   WGE GY R+ +G   C +N  V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339


>gi|309752918|gb|ADO85436.1| cathepsin [Pieris rapae granulovirus]
          Length = 339

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 125/323 (38%), Positives = 184/323 (56%), Gaps = 33/323 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS--GVYGLNEFSDLSTAE 98
           +F  F++++NK+YAT  E   +   F  NL+   ++ D  +GS   V+ +N FSDL+  +
Sbjct: 35  IFEDFIKKYNKSYATDQERAIKYENFKNNLK---MINDKNNGSKDAVFDINAFSDLNKND 91

Query: 99  FQAKYLGFKL---KPSY--------ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
              +  GF++   K SY         +  V    P I LP +FDWR+   VT VK+Q  C
Sbjct: 92  LLRRTTGFRMGLKKNSYYTPDVSKECNVQVIKSEPQIILPESFDWRDKHGVTPVKNQLEC 151

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
           GS WAFS   NIE +Y  K  K + LSEQ LI+CD  ++GC GG +  A +TI+ +  GG
Sbjct: 152 GSCWAFSAIANIESLYNIKHNKELDLSEQHLINCDSINNGCGGGLMHWALETILQQ--GG 209

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYAL 266
           +  EK  PY G D  C+   K   V I+G    V ++E  + + L+ NGP+++A++   +
Sbjct: 210 IVSEKDEPYYGLDAVCK--PKQFNVSISGCTRYVLKNENKLRELLIANGPISMAVDIIDV 267

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
             Y  G++      C+  N  L+H+VL+VGYGV      H  +PYWI+KNSWGE WGEKG
Sbjct: 268 IDYKEGITD----ICENMN-GLNHAVLLVGYGV------HNNIPYWIMKNSWGEEWGEKG 316

Query: 327 YFRLYRGDGSCGI-NDYVRSALV 348
           Y R+ R   SCG+ N++  SA++
Sbjct: 317 YLRVQRNINSCGLMNEFASSAIL 339


>gi|340053971|emb|CCC48265.1| cysteine peptidase precursor, fragment, partial [Trypanosoma vivax
           Y486]
          Length = 389

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 122/313 (38%), Positives = 169/313 (53%), Gaps = 18/313 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  F +++ ++Y T  E   RL +F  N+R+ ++     +    +G+  FSDL+  EF+
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91

Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            +Y   +     A   V  ++  P    P A DWR   AVT VKDQ  CGS W+FS  GN
Sbjct: 92  TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGTCGSCWSFSAIGN 151

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IEG +AA    L SLSEQ L+ CD +D+GC GG + NAF+ I+ +  G +   K+YPY  
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTGKSYPYVS 211

Query: 219 DDKA---CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           +D +   C          I G+V +  DE  +AKYL +NGP+AVA++A     Y  GV  
Sbjct: 212 EDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGVVT 271

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                    +E L+H VL+VGY  D +K      PYWIIKNSW   WGEKGY R+ +G  
Sbjct: 272 SCT------SEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319

Query: 336 SCGINDYVRSALV 348
            C +     SA+V
Sbjct: 320 QCLVAQLASSAVV 332


>gi|288804650|ref|YP_003429335.1| cathepsin [Pieris rapae granulovirus]
 gi|270161225|gb|ACZ63497.1| cathepsin [Pieris rapae granulovirus]
          Length = 339

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 125/323 (38%), Positives = 184/323 (56%), Gaps = 33/323 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS--GVYGLNEFSDLSTAE 98
           +F  F++++NK+YAT  E   +   F  NL+   ++ D  +GS   V+ +N FSDL+  +
Sbjct: 35  IFEDFIKKYNKSYATDQERAIKYENFKNNLK---MINDKNNGSKYAVFDINAFSDLNKND 91

Query: 99  FQAKYLGFKL---KPSY--------ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
              +  GF++   K SY         +  V    P I LP +FDWR+   VT VK+Q  C
Sbjct: 92  LLRRTTGFRMGLKKNSYYTPDVSKECNVQVIKSEPQIILPESFDWRDKHGVTPVKNQLEC 151

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
           GS WAFS   NIE +Y  K  K + LSEQ LI+CD  ++GC GG +  A +TI+ +  GG
Sbjct: 152 GSCWAFSAIANIESLYNIKHNKELDLSEQHLINCDSINNGCGGGLMHWALETILQQ--GG 209

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYAL 266
           +  EK  PY G D  C+   K   V I+G    V ++E  + + L+ NGP+++A++   +
Sbjct: 210 IVSEKDEPYYGLDAVCK--PKQFNVSISGCTRYVLKNENKLRELLIANGPISMAVDIIDV 267

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
             Y  G++      C+  N  L+H+VL+VGYGV      H  +PYWI+KNSWGE WGEKG
Sbjct: 268 IDYKEGITD----ICENMN-GLNHAVLLVGYGV------HNNIPYWIMKNSWGEEWGEKG 316

Query: 327 YFRLYRGDGSCGI-NDYVRSALV 348
           Y R+ R   SCG+ N++  SA++
Sbjct: 317 YLRVQRNINSCGLMNEFASSAIL 339


>gi|15824693|gb|AAL09444.1| cysteine protease [Leishmania donovani]
          Length = 394

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 131/320 (40%), Positives = 179/320 (55%), Gaps = 24/320 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A     LVSLSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213

Query: 214 YPY---RGDDKACRLN--KKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           YPY    GD   C LN  K     +I+GYV +  +ET MA +L ENGP+A+ ++A +   
Sbjct: 214 YPYTSGNGDVAEC-LNSSKLVPGARIDGYVMIPSNETVMAAWLAENGPIAIGVDASSFMS 272

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +GV       C G  + L+H VL+VGY       T   VPY +IKNSWGE WGEKGY 
Sbjct: 273 YQSGVLTS----CAG--DALNHGVLLVGYN------TTGGVPYCVIKNSWGEDWGEKGYV 320

Query: 329 RLYRGDGSCGINDYVRSALV 348
           R+  G  +C +++Y  SA V
Sbjct: 321 RVAMGLNACLLSEYPVSAHV 340


>gi|113195461|ref|YP_717598.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
 gi|66968272|gb|AAY59557.1| V-CATH [Clanis bilineata nucleopolyhedrosis virus]
          Length = 325

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 122/318 (38%), Positives = 174/318 (54%), Gaps = 19/318 (5%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+  +NK Y    E   R  IF  NL +I +    E    V+ +N+FSD+S
Sbjct: 21  LKAPDYFESFVANYNKMYNDTQEKAYRYKIFKHNLEEINIKNQVED-HAVFSINKFSDMS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            +E  +KY G  L     +    A+I   P    P  FDWR+Y+AVT V+ Q  CGS WA
Sbjct: 80  KSEIISKYTGLSLPSLMQENFCRAIILDGPPNKAPINFDWRQYNAVTPVRVQGNCGSCWA 139

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FST   IE  Y+ K  K +SLS Q+L+DCD  + GC GG +  A + I++  GGG+ +E+
Sbjct: 140 FSTLAGIESQYSIKYNKQISLSVQQLVDCDTSNMGCAGGLLHTALEQIINA-GGGVLQEE 198

Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY+G DK C L      V++ G Y  +  +E  +   L   GP+ VAI+A ++  Y  
Sbjct: 199 DYPYKGVDKQCNLPHNNFAVQVLGCYRYIVMNEEKLKDVLRAVGPIPVAIDAASIVDYSR 258

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+     ++       L+H+VL+VGYGV         VPYW +KN+WG+ WGE GYFR+ 
Sbjct: 259 GIIRTCTYY------GLNHAVLLVGYGV------QDGVPYWTLKNTWGDDWGEHGYFRVR 306

Query: 332 RGDGSCG-INDYVRSALV 348
           +   SCG IND   +A++
Sbjct: 307 QNVNSCGIINDLASTAVI 324


>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
 gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
          Length = 337

 Score =  212 bits (539), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 126/348 (36%), Positives = 188/348 (54%), Gaps = 29/348 (8%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTA--LFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
           ++L +  +  +V   +   HL    H A   F  F+  +NK Y        R  IF  NL
Sbjct: 1   MTLLMIFTILLVASSQIEGHLKFDIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNL 60

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMI-------- 121
             I   ++  + S +Y +N+FSDLS  E   KY G    KPS   RS             
Sbjct: 61  EDINE-KNKLNDSAIYNINKFSDLSKNELLTKYTGLTSKKPSNMVRSTSNFCNVIHLDAP 119

Query: 122 PNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
           P++   LP+ FDWR  + +T VKDQ  CGS WA +  G +E +YA K   L++LSEQ+LI
Sbjct: 120 PDVHDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLI 179

Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
           DCD  +  C+GG +  AF+ +M+   GGL EE  YPY+G    C+++ K   + ++    
Sbjct: 180 DCDSANMACDGGLMHTAFEQLMN--AGGLMEEIDYPYQGTKGVCKIDNKKFALSVSSCKR 237

Query: 240 -VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
            + ++E ++ K L+  GP+A+AI+A ++  Y  G+ H    FC+  N  L+H+VL+VGYG
Sbjct: 238 YIFQNEENLKKELITMGPIAMAIDAASISTYSKGIIH----FCE--NLGLNHAVLLVGYG 291

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
                 T   V YW +KNSWG  WGE GYFR+ R   +CG+N+ + ++
Sbjct: 292 ------TEGGVSYWTLKNSWGSDWGEDGYFRVKRNINACGLNNQLAAS 333


>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
          Length = 336

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 123/307 (40%), Positives = 173/307 (56%), Gaps = 22/307 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGS--GVYGLNEFSDLSTAE 98
           LF  F+ ++NK Y +  E   R  IF  NL++I    D  H S   V+G+N+F+DLS  E
Sbjct: 40  LFENFIREYNKKYDSK-EKEERFKIFVNNLKRIN---DLNHKSTNAVHGINKFTDLSKEE 95

Query: 99  FQAKYLGFKLKPSYADRSV--PAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
           F+  Y GFK   S+ D ++  P+ +  NIT P AFDWR+   VT VK+Q  CGS WAFST
Sbjct: 96  FKKFYTGFKPDKSFLDDNIKKPSQLSFNITAPPAFDWRDKGVVTRVKNQGTCGSCWAFST 155

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
            GN+E V A K   LV LSEQ+L+DCD +D+ C+ G   NA   ++S    G   E++YP
Sbjct: 156 IGNVESVNAIKHGNLVELSEQQLVDCDSKDEACDSGLPDNAQQYLVSH---GAISEQSYP 212

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           Y+G    C  +     V+++ +  V   E  MA+ L    P+++ I A  L  Y  G+  
Sbjct: 213 YKGYAANCTYDSSQVVVRLSNFEKVVLSECQMAEKLYSTAPLSIVIAAEVLGTYTKGI-- 270

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
            +   C+  +++L+H+VL+VGYG            +WI+KNSWG  WGE GYFR+ RG  
Sbjct: 271 -LVNECE-QSQDLNHAVLLVGYG------NEGGTNFWILKNSWGTNWGEGGYFRIKRGVN 322

Query: 336 SCGINDY 342
              I DY
Sbjct: 323 CLMITDY 329


>gi|332326589|gb|AEE42618.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  212 bits (539), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 123/318 (38%), Positives = 178/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHHRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  +A    +L +LSEQ+L+ CD +D GC GG ++ AF+ ++  + G +  E +Y
Sbjct: 155 AVGNIESQWAVAGHRLTALSEQQLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSY 214

Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY    GD   C  + +     +I+GYV++   ET MA +L ++GP+++A++A +   Y 
Sbjct: 215 PYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYE 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV            + L+H VL+VGY  +RT      VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGY--NRT----GEVPYWVIKNSWGEDWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|343476708|emb|CCD12273.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 363

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 125/343 (36%), Positives = 186/343 (54%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  +  ++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVTVSTGKAPD 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT V+D+ +C SSWAFS  GNIEG +     +L SLSEQ L+ CD  +DGC
Sbjct: 129 AVDWRKKGAVTPVRDERLCDSSWAFSAIGNIEGQWKVAGHELTSLSEQMLLSCDTREDGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
            GG +  AF  I+S   G +  E++YPY    GD   C  + K    KI+ YV + +DE 
Sbjct: 189 GGGLMDRAFQWIVSSNKGNVFTEQSYPYASTDGDVPRCNKSGKVVGAKISDYVDLPQDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A+ A +LQ Y  GV           +E L H VL+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPVAIAVEATSLQRYTGGV------LTSCISEQLDHGVLLVGYD-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSWG+GWGE+GY R+ +G   C + +Y  SA+V
Sbjct: 300 ---PPYWIIKNSWGKGWGEEGYIRIEKGTNQCLMKNYASSAVV 339


>gi|1749812|emb|CAA90237.1| cysteine proteinase LmCPB1 [Leishmania mexicana]
          Length = 359

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 127/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLGFKLKPSYADRSVPAMIPNI-----TLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL      + A R  P   P        +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FCARYLNGAAYFAAAKRHTPQHYPKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +     +LVSLSEQ+L+ CD  +DGC+GG +  AFD ++    G L  E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDS 213

Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY   +    +    +K     +I+G+V +   E  MA +L +NGP+A+A++A +   Y
Sbjct: 214 YPYVSGNGYLPECSNSSKLVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  + ++H+VL+VGY  D T      VPYW+IKNSWG  WGE+GY R
Sbjct: 274 KSGV----LTACIG--KQVNHAVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340


>gi|209978824|ref|YP_002300567.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
 gi|192758806|gb|ACF05341.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
          Length = 337

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 126/348 (36%), Positives = 187/348 (53%), Gaps = 29/348 (8%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTA--LFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
           ++L +  +  +V   +   HL    H A   F  F+  +NK YA       R  IF  NL
Sbjct: 1   MTLLMIFTILLVASSQIEGHLKFDIHDAQHYFETFIVNYNKQYADTKTKNYRFKIFVQNL 60

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNI----- 124
             I   ++  + S +Y +N+FSDLS  E   KY G    KPS   +S       I     
Sbjct: 61  EYINE-KNKLNDSAIYNINKFSDLSKNELLTKYTGLTSRKPSNMVKSTSNFCNVIHLDAP 119

Query: 125 -----TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
                 LP+ FDWR  + +T VKDQ  CGS WA +  G +E +YA K   L++LSEQ+LI
Sbjct: 120 PDARDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLI 179

Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
           DCD  +  C+GG +  AF+ +M+   GGL EE  YPY+G    C+++ K   + ++    
Sbjct: 180 DCDSANMACDGGLMHTAFEQLMN--AGGLMEEIDYPYQGTKGICKIDNKKFALSVSSCKR 237

Query: 240 -VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
            + ++E ++ K L+  GP+A+AI+A ++  Y  G+ H    FC+  N  L+H+VL+VGYG
Sbjct: 238 YIFQNEENLKKELITTGPIAMAIDAASISTYSKGIIH----FCE--NLGLNHAVLLVGYG 291

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
                 T   V YW +KNSWG  WGE GYFR+ R   +CG+N+ + ++
Sbjct: 292 ------TEGGVSYWTLKNSWGSDWGEDGYFRVKRNINACGLNNQLAAS 333


>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
 gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
          Length = 299

 Score =  211 bits (538), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 123/307 (40%), Positives = 172/307 (56%), Gaps = 23/307 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  +  +H K+Y++  E   RL IFS  L  I+      + +   GLN+FSDL+ AEF+
Sbjct: 1   MFEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60

Query: 101 AKYLGFKLKPSYADRSVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           A Y+G    P Y DR  PA   ++   +LP + DWR+  AVT +KDQ  CGS WAFS   
Sbjct: 61  ANYVGKFKSPRYQDRR-PAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           +IE  +   TK+LVSLSEQ+LIDCD  D GC+GG   +AF  ++    GG+  E+ YPY 
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVEN--GGVTTEEAYPYT 177

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
           G   +C  NK    V+I GY  V++D  D     V   P+ V I  +    Q Y +G+  
Sbjct: 178 GFAGSCNANKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGI-- 234

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--G 333
            +   C    +   H+VL++GYG      T   +PYWIIKNSWG  WGE G+ ++ +  G
Sbjct: 235 -LSGQCSNSRD---HAVLVIGYG------TEGGMPYWIIKNSWGTSWGENGFMKIKKKDG 284

Query: 334 DGSCGIN 340
           +G CG+N
Sbjct: 285 EGMCGMN 291


>gi|10391|emb|CAA38238.1| unnamed protein product [Trypanosoma brucei]
          Length = 450

 Score =  211 bits (538), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 128/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
           V LL++   ++S        L  LH  +   + F  F +++ K Y    E   R   F  
Sbjct: 14  VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           N+ + ++ Q   +    +G+  FSD++  EF+A+Y       + A + V   + N+T   
Sbjct: 68  NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRVRKTV-NVTTGR 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWRE  AVT VKDQ  CGS WAFST GNIEG +      LVSLSEQ L+ CD  D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
            GC GG + NAF+ I++  GG +  E +YPY    G+   C++N       I  +V + +
Sbjct: 186 FGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           DE  +A YL ENGP+A+A++A +   Y  G+           +E L H VL+VGY  +  
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                  PYWIIKNSW   WGE GY R+ +G   C +N  V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339


>gi|378943048|gb|AFC76265.1| cathepsin L-like protease [Leishmania major]
          Length = 348

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 127/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS A 
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQRRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAV 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213

Query: 214 YPY---RGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C  + + A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y
Sbjct: 214 YPYVSGNGDVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +  Y  S  V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340


>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
          Length = 326

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 126/315 (40%), Positives = 175/315 (55%), Gaps = 19/315 (6%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLN 89
           +H +     +  F  ++NK+Y   +E   R  IF G+LRKI+   D  +HG   +  G+ 
Sbjct: 14  VHALSDKEEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVT 73

Query: 90  EFSDLSTAEFQAKYLGF-KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
           +F+DL+  EF +  LG  +   S   R + ++ P   LP  FDWRE  AVT VKDQ  CG
Sbjct: 74  KFADLTEKEF-SDMLGISRSTKSSRPRVIHSLTPVKDLPSKFDWREKGAVTEVKDQGSCG 132

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGG 207
           S W+FSTTG +EG Y  KT KLVSLSEQ L+DC +ED  GC GG +  A + I  +  GG
Sbjct: 133 SCWSFSTTGTVEGAYFLKTGKLVSLSEQNLVDCAKEDCYGCSGGYMDKALEYI--ETAGG 190

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA-YA 265
           +  E  YPY G D  CR +      KI+ +  + + DE D+   ++  GP++VAI+A + 
Sbjct: 191 IMSENDYPYEGIDDKCRFDSSKVAAKISNFTYIKKNDEDDLKNAVIAKGPISVAIDASFN 250

Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
            Q Y +G+      + D    +L+H VL+VGYG      T K   YWI+KNSWG  WG  
Sbjct: 251 FQLYDSGILDDSSCYSDFN--SLNHGVLVVGYG------TEKEQDYWIVKNSWGADWGMD 302

Query: 326 GYFRLYRG-DGSCGI 339
           GY  + R  +  CGI
Sbjct: 303 GYIWMSRNKNNQCGI 317


>gi|6649593|gb|AAF21470.1|U85983_1 cysteine proteinase [Clonorchis sinensis]
          Length = 259

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 117/271 (43%), Positives = 158/271 (58%), Gaps = 16/271 (5%)

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAMIPNITLP-RAFDWREYDA 137
           E G+  YG+ +FSDL++ EF+ +YL  +   P  ++   P    ++T+    FDWRE+ A
Sbjct: 2   EQGTAHYGVTQFSDLTSEEFKTRYLRMRFDGPIVSEDLTPE--EDVTMDNEKFDWREHGA 59

Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAF 197
           V  V DQ  CGS WAFS  GN+ G +  KT  L++LSEQ+L+DCD  DDGC+GG     +
Sbjct: 60  VGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTY 119

Query: 198 DTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPM 257
             I     GGLE    YPY G    C ++K      +NG   +   E   A+ L   GP+
Sbjct: 120 TAIQKM--GGLELASDYPYTGVGGICHMDKSKFVAYVNGSTILPLSEKVQAQKLRAIGPL 177

Query: 258 AVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNS 317
           + A+NA  LQ Y  G+  P   +CD    N  H+VL VGYGV   K      PYWI+KNS
Sbjct: 178 SSALNADTLQLYKGGIMRPK--WCDPAGVN--HAVLTVGYGVQNGK------PYWIVKNS 227

Query: 318 WGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           WGE +GE+GYFR+YRGDG+CGIN  V +A++
Sbjct: 228 WGEDFGEEGYFRIYRGDGTCGINSIVTTAII 258


>gi|154332647|ref|XP_001562140.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134059588|emb|CAM37170.1| cathepsin L-like protease [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 441

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 127/318 (39%), Positives = 176/318 (55%), Gaps = 24/318 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
           + LF  F + + + YATL E   RL  F  NL  ++  Q +  H    +G+ +F DLS  
Sbjct: 35  SVLFEEFKQTYQRVYATLDEEQQRLANFQRNLELMREHQANNPHAR--FGITKFFDLSEE 92

Query: 98  EFQAKYLG----FKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           EF  +YL     F     +A +    +  ++ T P A DWRE  AVT VKDQ MCGS WA
Sbjct: 93  EFATRYLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWREKGAVTPVKDQGMCGSCWA 152

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FS  GNIE  +   T  L+SLSEQEL+ CD  D+GC GG +  AFD +++   G +    
Sbjct: 153 FSAIGNIESQWYLATHSLISLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNRNGAVYTGV 212

Query: 213 TYPY-RGDDKACRLNKKATQVK---INGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           +YPY  G+      ++ +  V    I+G+V++  +E  MA +L  NGP+A+A++A A   
Sbjct: 213 SYPYVSGNGSVPECSESSDLVIGAYIDGHVTIESNEDTMAAWLAANGPIAIAVDASAFMS 272

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV       CDG  + L+H VL+VGY +         VPYW+IKNSWG+ WGEKGY 
Sbjct: 273 YTGGVLTS----CDG--KQLNHGVLLVGYNMT------GEVPYWLIKNSWGKNWGEKGYV 320

Query: 329 RLYRGDGSCGINDYVRSA 346
           R+ +G   C I +Y  SA
Sbjct: 321 RVRKGTNECLIQEYPVSA 338


>gi|90592736|ref|YP_529689.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
 gi|71559186|gb|AAZ38185.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
          Length = 343

 Score =  211 bits (537), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 121/314 (38%), Positives = 180/314 (57%), Gaps = 23/314 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F+ Q+NK Y    E   R +IF  N+ +I   +++ + S VY +N F+D++  E   
Sbjct: 45  FEQFISQYNKQYKNEAEKRHRFNIFMHNIEEINQ-KNSRNDSAVYKINRFADMTKNEVVI 103

Query: 102 KYLGF----KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           ++ G     +L  ++ +  V         P +FDWR Y+ VT VKDQ+MCG+ WAF++ G
Sbjct: 104 RHTGLASIGELNSNFCETVVVDGPGQRQRPSSFDWRTYNKVTSVKDQSMCGACWAFASLG 163

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
            +E  YA K  +L+ L+EQ+L+DCD  D GC+GG I  A++ IM    GG+E+E  YPYR
Sbjct: 164 ALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMQM--GGVEQEFDYPYR 221

Query: 218 GDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
            + + C L  +K A  V+   +  V R+E  +   L   GP+A+A++A  L  Y  G+  
Sbjct: 222 AERQPCALKPHKFAAGVR-KCFRYVLRNEERLEDLLRHVGPIAIAVDAVDLTDYYGGIVS 280

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
               FC+  N  L+H+VL+VGYGV+        VP+W +KNSWG  +GE GY R+ RG  
Sbjct: 281 ----FCE--NNGLNHAVLLVGYGVENN------VPFWTLKNSWGSDYGEDGYVRVRRGVN 328

Query: 336 SCG-INDYVRSALV 348
           SCG +N+   SA V
Sbjct: 329 SCGLVNELASSAQV 342


>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 124/343 (36%), Positives = 187/343 (54%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  ++ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRMFKQSMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+ +FSD+S  EF+A YL G K   +   R  P  + N++    P 
Sbjct: 72  AKE-EAAANPYATFGVTQFSDMSPEEFRATYLNGAKYYAAALKR--PRKVVNVSTGKAPP 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  CGS WAFS  GNIEG +     +L SLSEQ L+ CD  D GC
Sbjct: 129 AIDWRKKGAVTPVKDQGKCGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDNMDYGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDET 245
            GG +  A   I+S   G +  E++YPY    GD   C  + K    KI+G +++ +DE 
Sbjct: 189 RGGFLDRALKWIVSSNKGNVFTEESYPYDSTDGDVPPCNKSGKVVGAKISGLINLPKDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A++A +   Y  GV           ++ L+H VL+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPIAIAVDASSFLDYTGGV------LTSCSSDALNHGVLLVGYD-DSSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSWG+ WGE+GY R+ +G   C + +Y RSA+V
Sbjct: 300 ---PPYWIIKNSWGKKWGEEGYIRVEKGTNQCLMKEYARSAVV 339


>gi|332326583|gb|AEE42615.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 123/318 (38%), Positives = 177/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHHRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  +A    +L  LSEQ+L+ CD +D GC GG ++ AF+ ++  + G +  E +Y
Sbjct: 155 AVGNIESQWAVADHRLXXLSEQQLVSCDDKDSGCNGGLMTQAFEWLLRNMNGTMLTEDSY 214

Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY    GD   C  + +     +I+GYV++   ET MA +L ++GP+++A++A +   Y 
Sbjct: 215 PYVSSTGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIAVDASSFMSYE 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV            + L+H VL+VGY  +RT      VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGY--NRT----GEVPYWVIKNSWGEDWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|44844204|emb|CAF32698.1| cysteine proteinase [Leishmania infantum]
          Length = 443

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 127/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK    CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKXXGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A     LVSLSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEXLLRHMYGIVFTEKS 213

Query: 214 YPY---RGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY    GD   C   +K     +I+GYV +  +ET MA +L ENGP+A+A++A +   Y
Sbjct: 214 YPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV            + L+H VL+VGY  ++T      VPYW+IKNSWGE WGEKGY R
Sbjct: 274 QSGV------LTSCAGDALNHGVLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C + +   SA V
Sbjct: 322 VVMGXNACLLXEXPXSAHV 340


>gi|340053968|emb|CCC48262.1| cysteine peptidase, Clan CA, family C1,Cathepsin L-like, fragment,
           partial [Trypanosoma vivax Y486]
          Length = 323

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 120/302 (39%), Positives = 165/302 (54%), Gaps = 18/302 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  F +++ ++Y T  E   RL +F  N+R+ ++     +    +G+  FSDL+  EF+
Sbjct: 33  LFAAFKQKYGRSYGTAAEEAFRLRVFEDNMRRSRMYA-AANPHATFGVTPFSDLTPEEFR 91

Query: 101 AKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            +Y   +     A   V  ++  P    P A DWR   AVT VKDQ  CGS W+FS  GN
Sbjct: 92  TRYHNGERHFEAARGRVRTLVQVPPGKAPAAVDWRRKGAVTPVKDQGRCGSCWSFSAIGN 151

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IEG +AA    L SLSEQ L+ CD +D+GC GG + NAF+ I+ +  G +  EK+YPY  
Sbjct: 152 IEGQWAAAGNPLTSLSEQMLVSCDFKDNGCGGGFMDNAFEWIVKENSGKVYTEKSYPYVS 211

Query: 219 DDKA---CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           +D +   C          I G+V +  DE  +AKYL +NGP+AVA++A     Y  GV  
Sbjct: 212 EDGSKPFCIPYGHEVGATITGHVDIPHDEDAIAKYLADNGPVAVAVDATTFMSYSGGV-- 269

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                    +E L+H VL+VGY  D +K      PYWIIKNSW   WGEKGY R+ +G  
Sbjct: 270 ----VTSCTSEALNHGVLLVGYN-DSSK-----PPYWIIKNSWSSSWGEKGYIRIEKGTN 319

Query: 336 SC 337
            C
Sbjct: 320 QC 321


>gi|218199600|gb|EEC82027.1| hypothetical protein OsI_25996 [Oryza sativa Indica Group]
          Length = 709

 Score =  211 bits (536), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 128/331 (38%), Positives = 177/331 (53%), Gaps = 39/331 (11%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A F  F+ +H + Y+   EY  RL +F+ NL +    Q  +  +  +G+  FSDL+  EF
Sbjct: 46  AQFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQALDP-TARHGVTPFSDLTREEF 104

Query: 100 QAKYLGF---------KLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGS 149
           +A+  G          + +      + PA    ++ LP +FDWR+  AVTGVK Q  CGS
Sbjct: 105 EARLTGLATDVGDDDVRRRRLPMPSAAPATEEEVSGLPSSFDWRDRGAVTGVKMQGACGS 164

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTI 200
            WAFSTTG +EG     T  L+ LSEQ+L+DCD           D GC GG ++NA+  +
Sbjct: 165 CWAFSTTGAVEGANFLATGNLLDLSEQQLVDCDHTCDAEKKTECDSGCGGGLMTNAYAYL 224

Query: 201 MSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR--------DETDMAKYLV 252
           MS   GGL E+  YPY G   ACR +     V++  +  V+          +  M   LV
Sbjct: 225 MSS--GGLMEQSAYPYTGAQGACRFDANRVAVRVANFTVVAPAAGPGGNDGDAQMRAALV 282

Query: 253 ENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY---GVDRTKFTHKAV 309
            +GP+AV +NA  +Q YV GVS P+   C     N  H VL+VGY   G    +  H+  
Sbjct: 283 RHGPLAVGLNAAYMQTYVGGVSCPL--VCPRAWVN--HGVLLVGYGERGFAALRLGHR-- 336

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGIN 340
           PYWIIKNSWG+ WGE+GY+RL RG   CG++
Sbjct: 337 PYWIIKNSWGKAWGEQGYYRLCRGRNVCGVD 367


>gi|441611591|ref|XP_003273955.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Nomascus leucogenys]
          Length = 548

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 127/329 (38%), Positives = 183/329 (55%), Gaps = 16/329 (4%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 235 SVISLLNEDPLPQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 294

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 295 DRGTAQYGVTKFSDLTEEEFRTIYLNPLLREEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 354

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 355 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 414

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 415 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 472

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           AINA+ +Q  V    H   +  +  +    H       G D        +P+W IKNSWG
Sbjct: 473 AINAFGMQ--VRPXPHCSAWIINSPDSCTLHCT----PGSD--------IPFWAIKNSWG 518

Query: 320 EGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 519 TDWGEKGYYYLHRGSGACGVNTMASSAVV 547


>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
           max]
          Length = 337

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 128/336 (38%), Positives = 172/336 (51%), Gaps = 29/336 (8%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           +L L++  S  M        +LH    +     +++++ K Y    E   RL IF  N+ 
Sbjct: 14  VLLLSICTSQVMS------RYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVE 67

Query: 72  KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAF 130
            I+      +     G+N  +D +  EF A + G+K K S++    P    N+T +P A 
Sbjct: 68  FIESFNAAGNKPYKLGINHLADQTNEEFVASHNGYKHKASHSQ--TPFKYENVTGVPNAV 125

Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEG 190
           DWRE  AVT VKDQ  CGS WAFST    EG+Y   T  L+SLSEQEL+DCD  D GC+G
Sbjct: 126 DWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHGCDG 185

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT-QVKINGYVSVSRDETDMAK 249
           G +   F+ I+    GG+  E  YPY   D  C  NK+A+   +I GY +V  +  D  +
Sbjct: 186 GYMEGGFEFIIKN--GGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDALQ 243

Query: 250 YLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHK 307
             V N P++V I+A   A QFY +GV      F       L H V  VGYG      T  
Sbjct: 244 KAVANQPVSVTIDAGGSAFQFYSSGV------FTGQCGTQLDHGVTAVGYGS-----TDD 292

Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
              YWI+KNSWG  WGE+GY R+ RG    +G CGI
Sbjct: 293 GTQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGI 328


>gi|395852405|ref|XP_003798729.1| PREDICTED: cathepsin W [Otolemur garnettii]
          Length = 367

 Score =  211 bits (536), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 121/323 (37%), Positives = 176/323 (54%), Gaps = 24/323 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  Q N++Y+   E+  RL IF+ NL K Q LQ+ + G+  +G+   SDL+  EF 
Sbjct: 41  VFKLFQVQFNRSYSNPAEHSRRLDIFAHNLAKAQQLQEEDLGTAEFGMTSLSDLTEEEF- 99

Query: 101 AKYLGFKLKPSYADRSVPAMIPNI-------TLPRAFDWR-EYDAVTGVKDQTMCGSSWA 152
            K  G +     A   VP M   +       TLPR  DWR +   ++ +K+Q  C   WA
Sbjct: 100 GKIFGHQ----KAVGEVPRMGRKVGSEQQGETLPRTCDWRNKAGIISRIKNQENCKCCWA 155

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            +   NIE ++  K  + V +S QEL+DC++  DGC+GG + +AF T+++    GL  EK
Sbjct: 156 MAAADNIEALWGIKYHQSVEVSVQELLDCNRCGDGCQGGFVWDAFITVLN--NSGLASEK 213

Query: 213 TYPYRGDDKA--CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
            YP++   K   C  NK      I  ++ +  +E  +A+YL  +GP+ V IN   LQ Y 
Sbjct: 214 DYPFKASVKTHRCLANKYRKVAWIQDFIMLEDNEHKIAQYLATHGPITVTINMKLLQHYK 273

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK-----FTHKAVPYWIIKNSWGEGWGEK 325
            GV       CD   + ++HSVL+VG+G +          H++ PYWI+KNSWG  WGE+
Sbjct: 274 KGVIKAKPTTCDP--QLVNHSVLLVGFGAETVSSQSHLRPHRSTPYWILKNSWGAHWGEE 331

Query: 326 GYFRLYRGDGSCGINDYVRSALV 348
           GYFRL+RG  SCGI  Y  +A V
Sbjct: 332 GYFRLHRGSNSCGITKYPFTARV 354


>gi|161598418|gb|ABX74953.1| cysteine protease [Leishmania panamensis]
          Length = 441

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 122/318 (38%), Positives = 175/318 (55%), Gaps = 24/318 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
           + LF  F + + + YATL E   R+  F  NL  ++  Q +  H    +G+ +F DLS A
Sbjct: 35  SVLFEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANNPHAR--FGITKFFDLSEA 92

Query: 98  EFQAKYLG----FKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           EF  +YL     F     +A +    +  ++ T P A DWR+  AVT V DQ  CGS WA
Sbjct: 93  EFATRYLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWRQMGAVTPVNDQGACGSCWA 152

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FS  GNIE  +   T  L++LSEQEL+ CD  D+GC GG +  AFD +++   G +    
Sbjct: 153 FSAIGNIESQWYVTTHSLITLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNKNGAVYTGA 212

Query: 213 TYPYRGDDKACRLNKKATQVK----INGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           +YPY   + +     +++++     I+G+V++  +E  MA +L  NGP+A+A++A A   
Sbjct: 213 SYPYVSGNGSVPECSESSELVVGAYIDGHVTIESNEDTMAAWLAVNGPIAIAVDASAFMS 272

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  G    I   CDG    L+H VL+VGY +         VPYW+IKNSWGE WGEKGY 
Sbjct: 273 YTGG----ILTSCDG--RQLNHGVLLVGYNMT------GEVPYWLIKNSWGENWGEKGYV 320

Query: 329 RLYRGDGSCGINDYVRSA 346
           R+ +G   C I +Y  SA
Sbjct: 321 RVRKGTNECLIQEYPASA 338


>gi|96979798|ref|YP_611001.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
 gi|37077647|sp|Q91CL9.1|CATV_NPVAP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|16041073|dbj|BAB69773.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
 gi|94983331|gb|ABF50271.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
 gi|146229694|gb|ABQ12259.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
          Length = 324

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 119/315 (37%), Positives = 182/315 (57%), Gaps = 20/315 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K  + F  FL + NK Y++  E   R  IF  NL +I + ++    S  Y +N+FSDLS
Sbjct: 22  LKAPSYFEEFLHKFNKNYSSESEKLRRFKIFQHNLEEI-INKNQNDTSAQYEINKFSDLS 80

Query: 96  TAEFQAKYLGFKL---KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  +KY G  L   K ++ +  V    P+   P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 81  KDETISKYTGLSLPLQKQNFCEVVVLDRPPDKG-PLEFDWRRLNKVTSVKNQGMCGACWA 139

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ+LIDCD  D GC+GG +  A++ +M+   GG++ E 
Sbjct: 140 FATLGSLESQFAIKHDQLINLSEQQLIDCDFVDVGCDGGLLHTAYEAVMNM--GGIQAEN 197

Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  ++  CR+N     V++   Y  V+  E  +   L   GP+ VAI+A  +  Y  
Sbjct: 198 DYPYEANNGPCRVNAAKFVVRVKKCYRYVTLFEEKLKDLLRIVGPIPVAIDASDIVGYKR 257

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+      +C+  N  L+H+VL+VGYGV+        +P+WI+KN+WG  WGE+GYFR+ 
Sbjct: 258 GIIR----YCE--NHGLNHAVLLVGYGVE------NGIPFWILKNTWGADWGEQGYFRVQ 305

Query: 332 RGDGSCGINDYVRSA 346
           +   +CGI + + S+
Sbjct: 306 QNINACGIKNELPSS 320


>gi|261328617|emb|CBH11595.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
 gi|261328620|emb|CBH11598.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
          Length = 450

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
           V LL++   ++S        L  LH  +   + F  F +++ K Y    E   R   F  
Sbjct: 14  VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           N+ + ++ Q   +    +G+  FSD++  EF+A+Y       + A + +   + N+T   
Sbjct: 68  NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWRE  AVT VKDQ  CGS WAFST GNIEG +      LVSLSEQ L+ CD  D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
            GC GG + NAF+ I++  GG +  E +YPY    G+   C++N       I  +V + +
Sbjct: 186 FGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           DE  +A YL ENGP+A+A++A +   Y  G+           +E L H VL+VGY  +  
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                  PYWIIKNSW   WGE GY R+ +G   C +N  V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339


>gi|261328615|emb|CBH11593.1| cysteine peptidase precursor [Trypanosoma brucei gambiense DAL972]
          Length = 451

 Score =  210 bits (534), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
           V LL++   ++S        L  LH  +   + F  F +++ K Y    E   R   F  
Sbjct: 14  VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           N+ + ++ Q   +    +G+  FSD++  EF+A+Y       + A + +   + N+T   
Sbjct: 68  NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWRE  AVT VKDQ  CGS WAFST GNIEG +      LVSLSEQ L+ CD  D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
            GC GG + NAF+ I++  GG +  E +YPY    G+   C++N       I  +V + +
Sbjct: 186 FGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           DE  +A YL ENGP+A+A++A +   Y  G+           +E L H VL+VGY  +  
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                  PYWIIKNSW   WGE GY R+ +G   C +N  V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339


>gi|72389859|ref|XP_845224.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
 gi|62359932|gb|AAX80357.1| cysteine peptidase precursor [Trypanosoma brucei]
 gi|70801759|gb|AAZ11665.1| cysteine peptidase precursor [Trypanosoma brucei brucei strain
           927/4 GUTat10.1]
          Length = 450

 Score =  210 bits (534), Expect = 9e-52,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 180/346 (52%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
           V LL++   ++S        L  LH  +   + F  F +++ K Y    E   R   F  
Sbjct: 14  VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           N+ + ++ Q   +    +G+  FSD++  EF+A+Y       + A + +   + N+T   
Sbjct: 68  NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWRE  AVT VKDQ  CGS WAFST GNIEG +      LVSLSEQ L+ CD  D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
            GC GG + NAF+ I++  GG +  E +YPY    G+   C++N       I  +V + +
Sbjct: 186 FGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           DE  +A YL ENGP+A+A++A +   Y  G+           +E L H VL+VGY  +  
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDNSN 299

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                  PYWIIKNSW   WGE GY R+ +G   C +N  V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339


>gi|37732137|gb|AAR02406.1| cysteine proteinase [Anthonomus grandis]
          Length = 322

 Score =  210 bits (534), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 121/306 (39%), Positives = 174/306 (56%), Gaps = 19/306 (6%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSD 93
           KH ALF  F  ++ K+Y   VE   R +IF  N+ +I+      E G   Y   +N+F+D
Sbjct: 21  KHQALFETFKVENGKSYRNQVEEVQRFNIFRANVLEIEQHNALYEQGLVSYKKAINQFTD 80

Query: 94  LSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           L+  EF+A YLG  +KP   + ++   +  + +P + DWR    VTGVK+Q  CGS W+F
Sbjct: 81  LTQEEFKA-YLGLHVKP-VLNNTIQYELKGLEVPTSVDWRSAGQVTGVKNQGSCGSCWSF 138

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEK 212
           + TG+ EG Y  K K+LVSLSEQ+L+DC    + GC GG +   F  I      GL+ E 
Sbjct: 139 ALTGSTEGAYYRKHKQLVSLSEQQLVDCSTSINYGCNGGFLDATFPYIEQY---GLQTES 195

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
           +YPY G D +C+ +      KI+ YVS+   E+ + + +   GP+A+ ++A  L  Y +G
Sbjct: 196 SYPYTGVDGSCKYDSSKVVTKISNYVSLHGSESKVLEPVGSIGPVAITMDASYLSSYSSG 255

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           +    +  C     NL+H+VL+VGYG      +     YWI+KNSWG GWGE+GYFRL R
Sbjct: 256 IYAANK--CT--TTNLNHAVLVVGYG------SQNGQNYWIVKNSWGSGWGEQGYFRLLR 305

Query: 333 GDGSCG 338
           G   CG
Sbjct: 306 GSNECG 311


>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
 gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
 gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
          Length = 300

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 122/307 (39%), Positives = 173/307 (56%), Gaps = 23/307 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  +  +H+K+Y++  E   RL +FS  L  I+      + +   GLN+FSDL+ AEF+
Sbjct: 1   MFEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFR 60

Query: 101 AKYLGFKLKPSYADRSVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           A Y+G    P Y DR  PA   ++   +LP + DWR+  AVT +KDQ  CGS WAFS   
Sbjct: 61  ANYVGKFKPPRYQDRR-PAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           +IE  +   TK+LVSLSEQ+LIDCD  D GC+GG   +AF  ++    GG+  E+ YPY 
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPDDAFKFVVEN--GGVTTEEAYPYT 177

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
           G   +C  NK    V+I GY  V++D  D     V   P+ V I  +    Q Y +G+  
Sbjct: 178 GFAGSCNTNKNKV-VEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGI-- 234

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--G 333
            +   C    +   H+VL++GYG      T   +PYWIIKNSWG  WGE G+ ++ +  G
Sbjct: 235 -LSGQCCNSRD---HAVLVIGYG------TEGGMPYWIIKNSWGTSWGEDGFMKIKKKDG 284

Query: 334 DGSCGIN 340
           +G CG+N
Sbjct: 285 EGMCGMN 291


>gi|241062152|gb|ACS66748.1| cysteine protease [Leishmania guyanensis]
          Length = 441

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 122/318 (38%), Positives = 175/318 (55%), Gaps = 24/318 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ-DTEHGSGVYGLNEFSDLSTA 97
           + LF  F + + + YATL E   R+  F  NL  ++  Q +  H    +G+ +F DLS A
Sbjct: 35  SVLFEEFKQTYKRVYATLAEEQQRVANFQRNLELMREHQANNPHAR--FGITKFFDLSEA 92

Query: 98  EFQAKYLG----FKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           EF  +YL     F     +A +    +  ++ T P A DWR+  AVT VKDQ  CGS WA
Sbjct: 93  EFATRYLSGATHFAKAKKFASQHYRKVGADLSTAPAAVDWRQMGAVTPVKDQGACGSCWA 152

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
            S  GNIE  +   T  L++LSEQEL+ CD  D+GC GG +  AFD +++   G +    
Sbjct: 153 LSAIGNIESQWYVTTHSLITLSEQELVSCDDVDEGCNGGLMLQAFDWLLNNKNGAVYTGA 212

Query: 213 TYPYRGDDKACRLNKKATQVK----INGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           +YPY   + +     +++++     I+G+V++  +E  MA +L  NGP+A+A++A A   
Sbjct: 213 SYPYVSGNGSVPECSESSELVVGAYIDGHVTIESNEDTMAAWLAVNGPIAIAVDASAFMS 272

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  G    I   CDG    L+H VL+VGY +         VPYW+IKNSWGE WGEKGY 
Sbjct: 273 YTGG----ILTSCDG--RQLNHGVLLVGYNMT------GEVPYWLIKNSWGENWGEKGYV 320

Query: 329 RLYRGDGSCGINDYVRSA 346
           R+ +G   C I +Y  SA
Sbjct: 321 RVRKGTNECLIQEYPVSA 338


>gi|15485586|emb|CAC67416.1| cysteine protease [Trypanosoma brucei rhodesiense]
          Length = 450

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 179/346 (51%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
           V LL++   ++S        L  LH  +   + F  F +++ K Y    E   R   F  
Sbjct: 14  VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           N+ + ++ Q   +    +G+  FSD++  EF+A+Y       + A + +   + N+T   
Sbjct: 68  NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWRE  AVT VKDQ  CGS WAFST GNIEG +      LVSLSEQ L+ CD  D
Sbjct: 126 APAAVDWREKGAVTPVKDQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
            GC GG + NAF+ I++  GG +  E +YPY    G+   C++N       I  +V + +
Sbjct: 186 FGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           DE  +A YL ENGP+A+A++A +   Y  G+           +E L H VL+VGY     
Sbjct: 246 DEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTSCTSEQLDHGVLLVGYNDSSN 299

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                  PYWIIKNSW   WGE GY R+ +G   C +N  V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339


>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score =  209 bits (533), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 123/343 (35%), Positives = 184/343 (53%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  +  ++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVTVSTGKAPE 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  CGS WAFS  GNIEG +      L SLSEQ L+ CD ED GC
Sbjct: 129 AVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVTGHNLTSLSEQMLVSCDTEDLGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
            GG + NAF  I+S     +  E++YPY    G+   CR++ K    KI  +V + +DE 
Sbjct: 189 AGGLMDNAFKWIVSSNRHNVFTEESYPYASKGGNVPPCRMSGKVVGAKIRDHVDLPKDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A+++ + Q Y  GV           ++ L H VL+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPVAIAVDSTSFQSYTGGV------LTSCISKQLDHGVLLVGYD-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSW +GWGE+GY R+ +G   C + +Y  SA+V
Sbjct: 300 ---PPYWIIKNSWSKGWGEEGYIRIEKGTNQCLVKNYATSAVV 339


>gi|52546912|gb|AAU81589.1| cysteine proteinase [Petunia x hybrida]
          Length = 257

 Score =  209 bits (532), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 112/249 (44%), Positives = 145/249 (58%), Gaps = 18/249 (7%)

Query: 107 KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
           K KP  +    P ++P   LP  FDWRE  AVTGVK+Q  CGS W+FSTTG +EG +   
Sbjct: 6   KAKPKLSTDKAP-ILPTSDLPDDFDWREKGAVTGVKNQGSCGSCWSFSTTGAVEGAHFLA 64

Query: 167 TKKLVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           T +LVSLSEQ+L+DCD E         D GC GG ++ AF+  +    GGL+ EK YPY 
Sbjct: 65  TGELVSLSEQQLVDCDHECDAEQQNECDAGCGGGLMTTAFEYTLK--AGGLQREKDYPYT 122

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G D  C  +K      +  +  V  DE  +A  LV++GP+AV INA  +Q YV GVS P+
Sbjct: 123 GRDGKCHFDKSKIAASVANFSVVGLDEDQIAANLVKHGPLAVGINAAWMQTYVGGVSCPL 182

Query: 278 QFFCDGGNENLSHSVLIVGYG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
             F     +   H VL+VGYG         K  PYWIIKNSWGE WGE+GY+++ RG   
Sbjct: 183 ICF-----KRQDHGVLLVGYGSAGFAPIRLKEKPYWIIKNSWGESWGEQGYYKICRGRNI 237

Query: 337 CGINDYVRS 345
           CG++  V +
Sbjct: 238 CGVDAMVST 246


>gi|157864843|ref|XP_001681130.1| cathepsin L-like protease [Leishmania major strain Friedlin]
 gi|68124424|emb|CAJ02280.1| cathepsin L-like protease [Leishmania major strain Friedlin]
          Length = 348

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 124/319 (38%), Positives = 174/319 (54%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYQRAYGTLTEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A    KLV LSEQ+L+ CD  D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWAVAGHKLVRLSEQQLVSCDHVDNGCGGGLMLQAFEWVLRNMNGTVFTEKS 213

Query: 214 YPYRGD----DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY        +    ++ A   +I+GYVS+   E  MA +L +NGP+++A++A +   Y
Sbjct: 214 YPYTSTFGYVPECSNSSELAPGARIDGYVSMESSERVMAAWLAKNGPISIAVDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  E L+H VL+VGY +         VPYW+IKNSWG+ WGEKGY R
Sbjct: 274 HSGVLTS----CIG--EQLNHGVLLVGYNMT------GEVPYWVIKNSWGKDWGEKGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +  Y  S  V
Sbjct: 322 VTMGVNACLLTGYPVSVHV 340


>gi|68304200|ref|YP_249668.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
 gi|67973029|gb|AAY83995.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
          Length = 344

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 122/312 (39%), Positives = 174/312 (55%), Gaps = 20/312 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  ++ K YA   E   R  IF  NL  I L ++ ++ S VY +N+F+DL+  E  A
Sbjct: 47  FETFQTKYKKVYADDNERDYRYKIFKTNLEIINL-KNQQNDSAVYNINKFADLTKNEVIA 105

Query: 102 KYLGFKLK-PSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           K+ G  ++ P+  +   P ++  P+      FDWR+++ +T VKDQ  CGS WAFST   
Sbjct: 106 KFTGLGIRSPALKNSCEPVIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAG 165

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +E  YA K  + V LSEQ+L+DCD  D GC GG +  A++ IM+   GGLE E+ YPYR 
Sbjct: 166 LESQYAIKYNEHVDLSEQQLVDCDTIDMGCAGGLLHTAYEEIMAM--GGLEYEEDYPYRS 223

Query: 219 DDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
               CRL     +V + N Y  V   E  +   L E GP+AVA++A  L  Y  G+    
Sbjct: 224 VQGPCRLQSDKFEVSVDNCYRYVLYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGIITSC 283

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
           +      N  L+H+VL+VGYG++        VP+W++KNSWG  +GE G+ R+ R   SC
Sbjct: 284 K------NYGLNHAVLLVGYGIEN------GVPFWVLKNSWGSDYGENGFVRVKRNVNSC 331

Query: 338 G-INDYVRSALV 348
           G IN+   SA +
Sbjct: 332 GMINELAASARI 343


>gi|15617524|ref|NP_258322.1| cathepsin-like cysteine proteinase [Spodoptera litura NPV]
 gi|37077642|sp|Q91BH1.1|CATV_NPVST RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|15553260|gb|AAL01738.1|AF325155_50 cathepsin-like cysteine proteinase [Spodoptera litura NPV]
          Length = 337

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 120/316 (37%), Positives = 173/316 (54%), Gaps = 27/316 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           F++QHNK Y T  +  +    F  NL  +  + +  +   VYG+N+FSD+    F  ++ 
Sbjct: 36  FIKQHNKEYTTPDQRDAAFVNFKRNLADMNAMNNVSN-QAVYGINKFSDIDKITFVNEHA 94

Query: 105 GF----------KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           G              P      V    P+   P +FDWR+ + VT VK+Q +CGS WAF+
Sbjct: 95  GLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLNKVTKVKEQGVCGSCWAFA 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  YA     L+ LSEQ+L+DCD+ D GC+GG +  AF  I+    GG+E E  Y
Sbjct: 155 AIGNIESQYAIMHDSLIDLSEQQLLDCDRVDQGCDGGLMHLAFQEIIRI--GGVEHEIDY 212

Query: 215 PYRGDDKACRLNKKATQVKIN-GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
           PY+G + ACRL      V+++  Y    RDE  + + L +NGP+AVAI+   +  Y +G+
Sbjct: 213 PYQGIEYACRLAPSKLAVRLSHCYQYDLRDERKLLELLYKNGPIAVAIDCVDIIDYRSGI 272

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
           +       D G   L+H+VL+VGYG++         PYWI KNSWG  WGE GYFR  R 
Sbjct: 273 ATVCN---DNG---LNHAVLLVGYGIEND------TPYWIFKNSWGSNWGENGYFRARRN 320

Query: 334 DGSCG-INDYVRSALV 348
             +CG +N++  SA++
Sbjct: 321 INACGMLNEFAASAVL 336


>gi|355681666|gb|AER96819.1| cathepsin W [Mustela putorius furo]
          Length = 373

 Score =  209 bits (531), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 117/331 (35%), Positives = 178/331 (53%), Gaps = 22/331 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K   +F  F  Q+N++Y+   EY  RL IF+ NL + Q ++  +  +  +G+  FSDL+
Sbjct: 36  LKLEQVFELFRAQYNRSYSNPKEYAHRLEIFAHNLAQAQKMEVEDLATAEFGMTPFSDLT 95

Query: 96  TAEFQAKYLGFKLKPSYAD---RSVPAMIPNITLPRAFDWREYDAV-TGVKDQTMCGSSW 151
             EF+  +   K+ P       R V + +   ++P + DWR+   V + +K+Q  C   W
Sbjct: 96  EEEFEQLHGHQKITPGETPAVGRKVGSEVVMESVPASCDWRKLKGVKSPIKEQGNCNCCW 155

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
           A +  GNIE +++ +  + V +S QEL+DC++  DGC+GG + +AF T+++    GL  E
Sbjct: 156 AMAAAGNIEALWSIRYNQSVQVSVQELLDCNRCGDGCKGGFVWDAFVTVLN--NSGLASE 213

Query: 212 KTYPYRGDDK--ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           K YP+RG  K   C  +       I  ++ +  +E  MA YL  +GP+ V IN   LQ Y
Sbjct: 214 KDYPFRGSLKRHKCLASNYKKVAWIQDFIMLQNNEQTMANYLATHGPITVTINMKLLQQY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT------------KFTHKAVPYWIIKNS 317
             GV       CD    N  HSVL+VG+G   +               H+ +PYWI+KNS
Sbjct: 274 KKGVIKATPATCDPYLVN--HSVLLVGFGKTNSSERRRAKGGHFWPHPHRPIPYWILKNS 331

Query: 318 WGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           WG  WGE+GYFRL+RG  +CGI  Y  +A V
Sbjct: 332 WGAEWGEEGYFRLHRGSNTCGITKYPLTARV 362


>gi|145351119|ref|XP_001419933.1| predicted protein [Ostreococcus lucimarinus CCE9901]
 gi|144580166|gb|ABO98226.1| predicted protein [Ostreococcus lucimarinus CCE9901]
          Length = 272

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 122/273 (44%), Positives = 152/273 (55%), Gaps = 23/273 (8%)

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPA-------MIPNITLPRAFDWREYDAVTGVKD 143
           FSDL+  EF A+YLG     S       A        +P   LP  FDWR   AVT VKD
Sbjct: 2   FSDLTAEEFAARYLGHVRLSSEEREKRKARGGETLETLPVEHLPEEFDWRFKGAVTRVKD 61

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD---------QEDDGCEGGSIS 194
           Q  CGS W FSTTG IEG +   T KLV LSEQ+L+DCD           D GC GG  S
Sbjct: 62  QGQCGSCWTFSTTGAIEGAHFISTGKLVELSEQQLVDCDVGCDPDVPNACDSGCNGGLPS 121

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVEN 254
           NA + I+    GG++ EK+YPY G+   C+  K      +  +  VS DE  MA  LV+ 
Sbjct: 122 NAMEYIVEH--GGIDTEKSYPYVGEKGECKAKKGKLGATLKNFSFVSDDEKQMAAALVKY 179

Query: 255 GPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV-PYWI 313
           GP+++ INA  +Q Y+ GV+ P  + CD   E+L H VLIVGYG         A  PYWI
Sbjct: 180 GPLSIGINAAWMQSYIGGVACP--WLCDA--ESLDHGVLIVGYGSSGFAPVRWAPEPYWI 235

Query: 314 IKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
           +KNSW   WGE GY+R+ +  GSCGIN+ V +A
Sbjct: 236 VKNSWSPAWGEGGYYRICKDKGSCGINNMVVAA 268


>gi|332326585|gb|AEE42616.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 119/318 (37%), Positives = 175/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VKBQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKBQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  +A    +L  LSEQ+L+ CD +D GC GG ++ AF+ ++  + G +  E +Y
Sbjct: 155 AVGNIESQWAVAGHRLXXLSEQQLVSCDDKDSGCXGGLMTQAFEWLLRXMNGTMFTEDSY 214

Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY    GD   C  + +     +I+GYV +  +ET MA +L ++GP+++ ++A +   Y 
Sbjct: 215 PYVSSTGDVPECTNSSELVPGARIDGYVMIESNETVMAAWLAKSGPISIGVDASSFMSYE 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV            ++L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGKHLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLXEYPVSAHV 340


>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
          Length = 380

 Score =  208 bits (530), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 125/343 (36%), Positives = 185/343 (53%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  + N++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPE 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  CGS WAFS  GNIEG +     +L SLSEQ L+ CD  D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTNDFGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
           EGG + +AF  I+S   G +  E++YPY    G+  AC  + K    KI  +V +  DE 
Sbjct: 189 EGGLMDDAFKWIVSSNKGNVFTEQSYPYASGGGNVPACDKSGKVVGAKIRDHVDLPEDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A++A + Q Y  GV           +E+L H VL+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFQSYTGGV------LTSCISEHLDHGVLLVGYD-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSW +GWGE+GY R+ +G   C + +   SA+V
Sbjct: 300 ---PPYWIIKNSWSKGWGEEGYIRIEKGTNQCLMKNLPSSAVV 339


>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
          Length = 359

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 132/342 (38%), Positives = 174/342 (50%), Gaps = 26/342 (7%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRLHIFSG 68
           ALLS+ + + S  +               +L++ +     H+     L +   R ++F  
Sbjct: 7   ALLSVVLVLGSVALAQSIPFDEKDLASEESLWSLYEKWRAHHAVSRDLDDTDKRFNVFKE 66

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP-----AMIPN 123
           N++ I      +  +    LN+F D++  EF++ Y G K+      R V      +    
Sbjct: 67  NVKFIHEFNQKKDATYKLALNKFGDMTNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKF 126

Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
             LP + DWRE  AVTGVKDQ  CGS WAFST   +EG+   KT +LVSLSEQ+L+DCD 
Sbjct: 127 HDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDT 186

Query: 184 EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRD 243
           ++ GC GG +  AFD I  K  GGL  E +YPY  + K+C     +  V I+GY  V R+
Sbjct: 187 KNSGCNGGLMDYAFDFI--KNNGGLSSEDSYPYLAEQKSCGSEANSAVVTIDGYQDVPRN 244

Query: 244 ETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
                   V N P++VAI A  YA QFY  GV     F    G E L H V  VGYGVD 
Sbjct: 245 NEAALMKAVANQPVSVAIEASGYAFQFYSQGV-----FSGHCGTE-LDHGVAAVGYGVD- 297

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
                    YWI+KNSWGEGWGE GY R+ RG     G CGI
Sbjct: 298 ----DDGKKYWIVKNSWGEGWGESGYIRMERGIKDKRGKCGI 335


>gi|71663163|ref|XP_818578.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70883837|gb|EAN96727.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 467

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 116/315 (36%), Positives = 164/315 (52%), Gaps = 18/315 (5%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           T+ F  F ++H + Y +  E   RL +F  NL  +  L    +    +G+  FSDL+  E
Sbjct: 35  TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 93

Query: 99  FQAKYLGFKLKPSYADRS--VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F+++Y    +  + A     VP  +  +  P A DWR   AVT VKDQ  CGS WAFS  
Sbjct: 94  FRSRYHNGAVHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GN+E  +      L +LSEQ L+ CD+ D GC GG ++NAF+ I+ +  G +  E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               G    C  +       I G+V + +DE  +A +L  NGP+AVA++A +   Y  GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                      +E L H VL+VGY          AVPYWIIKNSW   WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321

Query: 334 DGSCGINDYVRSALV 348
              C + +   SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336


>gi|395545396|ref|XP_003774588.1| PREDICTED: cathepsin W [Sarcophilus harrisii]
          Length = 358

 Score =  208 bits (529), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 125/320 (39%), Positives = 178/320 (55%), Gaps = 30/320 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF-- 99
           F  F  Q+NK+Y    E   RL IF+ NL + Q L +   G   +G+  FSDL+  EF  
Sbjct: 44  FKAFQIQYNKSYPDAAEQECRLKIFADNLARAQQLTEEHQGLAQFGVTRFSDLTEEEFRR 103

Query: 100 -----QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
                Q  YLG ++K        P +    T  R+ DWR+   +T V+DQ  C S WA S
Sbjct: 104 LYQPSQPNYLGLRVKTEGG--GYPRLQRLKT--RSCDWRKARVLTPVRDQKNCNSCWAIS 159

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GN+E ++A   ++L  LS QEL+DC +   GCEGG + +A+ TI+++   GL EE+ Y
Sbjct: 160 AVGNVEALWAINYQQLFKLSVQELLDCRRCGQGCEGGFVWDAYMTILNQ--SGLAEEQDY 217

Query: 215 PYRGD-DKACRLNKKATQVKINGYVSVSRDET-----DMAKYLVENGPMAVAINAYALQF 268
           PYR    K C+  KK  +  I+ ++ + ++E      DMA+YL E GP+ V IN+  L+ 
Sbjct: 218 PYRPQLSKGCQ--KKKKRAWIHDFLMLHKEENSPSPPDMAQYLAEKGPITVTINSRLLKS 275

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y+ GV  P    CD   + + H V +VG+G     FT     YWI+KNSWG  WGEKGYF
Sbjct: 276 YIRGVIKPGN-NCDP--KYVDHVVQLVGFGQIHN-FT-----YWILKNSWGSSWGEKGYF 326

Query: 329 RLYRGDGSCGINDYVRSALV 348
           RL+RG  +CGI  +  +A++
Sbjct: 327 RLHRGRNACGITKFPLTAVL 346


>gi|332326593|gb|AEE42620.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 120/318 (37%), Positives = 174/318 (54%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYWRVYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  +A    +L +LSEQ+L+ CD +D GC GG ++ AF+ ++  + G +  E +Y
Sbjct: 155 AVGNIESQWAVAGHRLTALSEQQLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY    GD   C  + +     +I+GYV++   ET MA +L ++GP+++ ++A +   Y 
Sbjct: 215 PYVSSXGDVPECTNSSQLVPGARIDGYVTIESSETVMAAWLAKSGPISIGVDASSFMSYE 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV            + L+H VL+VGY           VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGBXLNHGVLLVGYNXT------GEVPYWVIKNSWGEDWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 AMGVNACLLTEYPVSAHV 340


>gi|146078033|ref|XP_001463431.1| cathepsin L-like protease [Leishmania infantum JPCM5]
 gi|134067516|emb|CAM65796.1| cathepsin L-like protease [Leishmania infantum JPCM5]
          Length = 381

 Score =  207 bits (528), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 129/320 (40%), Positives = 176/320 (55%), Gaps = 37/320 (11%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A     LVSLSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213

Query: 214 YPY---RGDDKACRLN--KKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           YPY    GD   C LN  K     +I+GYV +  +ET MA +L ENGP+A+A++A +   
Sbjct: 214 YPYTSGNGDVAEC-LNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMS 272

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G                   VL+VGY  ++T      VPYW+IKNSWGE WGEKGY 
Sbjct: 273 YQSG-------------------VLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYV 307

Query: 329 RLYRGDGSCGINDYVRSALV 348
           R+  G  +C +++Y  SA V
Sbjct: 308 RVAMGLNACLLSEYPVSAHV 327


>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
          Length = 463

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 136/354 (38%), Positives = 177/354 (50%), Gaps = 29/354 (8%)

Query: 1   MSCFYFFAGVALLSLTVSVS--SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVE 58
           M     FA +AL ++  S S   F ++G +            L+  +L QH K Y  L E
Sbjct: 1   MGILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGE 60

Query: 59  YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
             +R  +F  N   I    +  + S   GLN+F+DLS  EF+A YLG KL       + P
Sbjct: 61  KQNRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSNSP 120

Query: 119 AMIPNIT----LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLS 174
           +     +    LP + DWRE  AVT VKDQ  CGS WAFST   +EG+    T  L SLS
Sbjct: 121 SPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180

Query: 175 EQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQV 232
           EQEL+DCD   + GC GG +  AF  I++   GGL+ E  YPY+ +D +C    K A  V
Sbjct: 181 EQELVDCDTSYNQGCNGGLMDYAFQFIINN--GGLDSEDDYPYKANDGSCDAYRKNAHVV 238

Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSH 290
            I+ Y  V  ++    K    N P++VAI A   A QFY +GV      F       L H
Sbjct: 239 TIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGV------FTSTCGTQLDH 292

Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
            V +VGYG      +     YWI+KNSWG+ WGEKG+ RL R       G CGI
Sbjct: 293 GVTLVGYG------SESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGI 340


>gi|31981819|ref|NP_034115.2| cathepsin W preproprotein [Mus musculus]
 gi|341940311|sp|P56203.2|CATW_MOUSE RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
           Precursor
 gi|26353368|dbj|BAC40314.1| unnamed protein product [Mus musculus]
 gi|44890089|gb|AAS48498.1| cathepsin W precursor [Mus musculus]
 gi|148701190|gb|EDL33137.1| cathepsin W, isoform CRA_b [Mus musculus]
 gi|162317774|gb|AAI56226.1| Cathepsin W [synthetic construct]
 gi|162318342|gb|AAI56999.1| Cathepsin W [synthetic construct]
          Length = 371

 Score =  207 bits (528), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 122/333 (36%), Positives = 177/333 (53%), Gaps = 38/333 (11%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  + N++Y    EY  RL IF+ NL + Q LQ  + G+  +G   FSDL+  EF 
Sbjct: 39  VFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEFG 98

Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWRE-YDAVTGVKDQTMCG 148
                      Y     P   PN+T           +PR  DWR+  + ++ VK+Q  C 
Sbjct: 99  Q---------LYGQERSPERTPNMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCK 149

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
             WA +   NI+ ++  K ++ V +S QEL+DC++  +GC GG + +A+ T+++    GL
Sbjct: 150 CCWAMAAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLN--NSGL 207

Query: 209 EEEKTYPYRGDDKACR-LNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
             EK YP++GD K  R L KK  +V  I  +  +S +E  +A YL  +GP+ V IN   L
Sbjct: 208 ASEKDYPFQGDRKPHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINMKLL 267

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR------TKFTHK-----AVPYWIIK 315
           Q Y  GV       CD     + HSVL+VG+G ++      T  +H      + PYWI+K
Sbjct: 268 QHYQKGVIKATPSSCDP--RQVDHSVLLVGFGKEKEGMQTGTVLSHSRKRRHSSPYWILK 325

Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           NSWG  WGEKGYFRLYRG+ +CG+  Y  +A V
Sbjct: 326 NSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQV 358


>gi|332375406|gb|AEE62844.1| unknown [Dendroctonus ponderosae]
          Length = 320

 Score =  207 bits (527), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 124/325 (38%), Positives = 173/325 (53%), Gaps = 19/325 (5%)

Query: 22  FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-E 80
           F ++G   +  +        F  F  + NKTY T VE  +R  IF   L +I+      E
Sbjct: 3   FFILGSLFVAAVAASLEQDAFQAFKLKQNKTYKTPVEETTRYGIFQAKLLEIEEHNSRFE 62

Query: 81  HGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
            G   Y  G+N+FSD +  EF A YLG   KP+   + +P +   +++P + DWR    V
Sbjct: 63  QGLETYKKGVNKFSDWTQDEFNA-YLGLHPKPAKLGKGIPYVKTGVSVPASVDWRTEGYV 121

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNA 196
           TGVK+Q  CGS WAFS TG++EG     T KLVSLSEQ+L+DC     + GC+GG +   
Sbjct: 122 TGVKNQGDCGSCWAFSLTGSVEGALFKSTGKLVSLSEQQLVDCTYGTVNFGCDGGYLEET 181

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
           F  I      GLE E +YPY+  D  C+ +      KIN YV    DE  + +     GP
Sbjct: 182 FPYIQET---GLEAEASYPYKARDGTCKFDASKVVTKINDYVYWYGDEEALLEATATIGP 238

Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
           ++VA++A  +  Y +GV       C   +++L+H VL+VGYG      +   V YW++KN
Sbjct: 239 ISVAMDANYIDSYASGVFS--SRLCS--SDDLNHGVLVVGYG------SENGVNYWLVKN 288

Query: 317 SWGEGWGEKGYFRLYRGDGSCGIND 341
           SW E WGE GY +L RG   CGI +
Sbjct: 289 SWAEDWGESGYLKLLRGQNECGIAE 313


>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
 gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
          Length = 337

 Score =  207 bits (527), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 127/336 (37%), Positives = 171/336 (50%), Gaps = 29/336 (8%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           +L L++  S  M        +LH    +     +++++ K Y    E   RL IF  N+ 
Sbjct: 14  VLLLSICTSQVMS------RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVE 67

Query: 72  KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAF 130
            I+      +      +N  +D +  EF A + G+K K S++    P    N+T +P A 
Sbjct: 68  FIESFNAAGNRPYKLSINHLADQTNEEFVASHNGYKHKGSHSQ--TPFKYENVTGVPNAV 125

Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEG 190
           DWRE  AVT VKDQ  CGS WAFST    EG+Y   T  L+SLSEQEL+DCD  D GC+G
Sbjct: 126 DWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHGCDG 185

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT-QVKINGYVSVSRDETDMAK 249
           G +   F+ I+    GG+  E  YPY   D  C  NK+A+   +I GY +V  +  D  +
Sbjct: 186 GYMEGGFEFIIKN--GGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDALQ 243

Query: 250 YLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHK 307
             V N P++V I+A   A QFY +GV      F       L H V  VGYG      T  
Sbjct: 244 KAVANQPVSVTIDAGGSAFQFYSSGV------FTGQCGTQLDHGVTAVGYGS-----TDD 292

Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
              YWI+KNSWG  WGE+GY R+ RG    +G CGI
Sbjct: 293 GTQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGLCGI 328


>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
 gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
          Length = 340

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 137/353 (38%), Positives = 186/353 (52%), Gaps = 43/353 (12%)

Query: 5   YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
           Y FA +AL+++  +VS   V+ +E             +  F  +H K Y    E   RL 
Sbjct: 4   YIFALLALVAVAQAVSFADVIKEE-------------WQTFKLEHRKQYQDETEERFRLK 50

Query: 65  IFSGNLRKI----QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM 120
           IF+ N  KI    QL    E  S   GLN+++D+   EF     GF        R+  A 
Sbjct: 51  IFNENKHKIAKHNQLYAAGE-VSFKMGLNKYADMLHHEFHETMNGFNYTLHKQLRASDAT 109

Query: 121 IPNIT--------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
              +T        LP++ DWR   AVTGVKDQ  CGS WAFS+TG +EG +  KT  L+S
Sbjct: 110 FTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKTGTLIS 169

Query: 173 LSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT 230
           LSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ EK+YPY G D +C  NK   
Sbjct: 170 LSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEGIDDSCHFNKGTI 227

Query: 231 QVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNEN 287
                G+  + + DE  +A+ +   GP++VAI+A   + QFY TGV    Q  CD   +N
Sbjct: 228 GATDRGFTDIPQGDEKKLAQAVATIGPVSVAIDASHESFQFYSTGVYDEPQ--CD--PQN 283

Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
           L H VL+VGYG D          YW++KNSWG  WG+KG+ ++ R  D  CGI
Sbjct: 284 LDHGVLVVGYGTDEN-----GKDYWLVKNSWGTTWGDKGFIKMARNDDNQCGI 331


>gi|19747207|gb|AAL96762.1|AC104496_8 Tcc1l8.8 [Trypanosoma cruzi]
          Length = 500

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 116/315 (36%), Positives = 164/315 (52%), Gaps = 18/315 (5%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           T+ F  F ++H + Y +  E   RL +F  NL  +  L    +    +G+  FSDL+  E
Sbjct: 68  TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 126

Query: 99  FQAKYLGFKLKPSYADRS--VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F+++Y    +  + A     VP  +  +  P A DWR   AVT VKDQ  CGS WAFS  
Sbjct: 127 FRSRYHNGAVHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 186

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GN+E  +      L +LSEQ L+ CD+ D GC GG ++NAF+ I+ +  G +  E +YPY
Sbjct: 187 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 246

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               G    C  +       I G+V + +DE  +A +L  NGP+AVA++A +   Y  GV
Sbjct: 247 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 306

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                      +E L H VL+VGY          AVPYWIIKNSW   WGE+GY R+ +G
Sbjct: 307 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 354

Query: 334 DGSCGINDYVRSALV 348
              C + +   SA+V
Sbjct: 355 LNQCLVKEEASSAVV 369


>gi|393717301|gb|AFN21222.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 119/317 (37%), Positives = 176/317 (55%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y++ VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P+        ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  +   E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  D G   L+H+VL+VGYGV+        VPYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------VPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>gi|11464864|gb|AAG35357.1|AF314929_1 cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  207 bits (527), Expect = 6e-51,   Method: Compositional matrix adjust.
 Identities = 117/315 (37%), Positives = 166/315 (52%), Gaps = 18/315 (5%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           T+ F  F ++H + Y +  E   RL +F  NL  +  L    +    +G+  FSDL+  E
Sbjct: 35  TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 93

Query: 99  FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F+++Y  G     +  +R+ VP  +  +  P A DWR   AVT VKDQ  CGS WAFS  
Sbjct: 94  FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GN+E  +      L +LSEQ L+ CD+ D GC GG ++NAF+ I+ +  G +  E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               G    C  +       I G+V + +DE  +A +L  NGP+AVA++A +   Y  GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                      +E L H VL+VGY          AVPYWIIKNSW   WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321

Query: 334 DGSCGINDYVRSALV 348
              C + +   SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336


>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
           [Vitis vinifera]
          Length = 374

 Score =  207 bits (527), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 125/314 (39%), Positives = 172/314 (54%), Gaps = 30/314 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           ++ +++ +H K Y  L E   R  IF  NL+ I    + ++ +   GLN F+DL+  E++
Sbjct: 45  MYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDE-HNAQNRTYKVGLNRFADLTNEEYR 103

Query: 101 AKYLGFKLKP----SYADRSVP--AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           A YLG +  P    +    + P  A++P   LP + DWRE  AV  VKDQ  CGS WAFS
Sbjct: 104 AIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCGSCWAFS 163

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           T   +EG+    T +L+SLSEQEL+DCD E D GC GG +  AFD I+    GGL+ EK 
Sbjct: 164 TVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKN--GGLDTEKD 221

Query: 214 YPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
           YPY G D  C L+ K+++ V I+GY  V   +    +  V + P++VA+ A   ALQ YV
Sbjct: 222 YPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRALQLYV 281

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +G+      F       L H ++ VGYG      T     YWI++NSWG  WGE GY R+
Sbjct: 282 SGI------FTGECGTALDHGIVAVGYG------TENGTDYWIVRNSWGSSWGENGYIRM 329

Query: 331 YRG-----DGSCGI 339
            R       G CGI
Sbjct: 330 ERNMADAFSGKCGI 343


>gi|449139100|gb|AGE89905.1| cathepsin-like cysteine proteinase [Spodoptera littoralis NPV]
          Length = 336

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 122/318 (38%), Positives = 173/318 (54%), Gaps = 26/318 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F++QHNK Y T  +       F  NL  +  + +  +   VYG+N+FSD+    F  
Sbjct: 33  FENFIKQHNKEYTTPDQRDDAFVNFKRNLVNMNAMNNISN-HAVYGINKFSDIDKITFAN 91

Query: 102 KYLGFKLKPSYADRS---------VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            + G  L  +  D +         V    P+   P +FDWR+   VT VK+Q +CGS WA
Sbjct: 92  VHAGLVLTLNATDSNFDPYRLCEFVTVAGPSARTPESFDWRKLHKVTKVKEQGVCGSCWA 151

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+  GNIE  YA     L+ LSEQ+L+DCD+ D GC+GG +  AF  IM    GG+E E 
Sbjct: 152 FAAIGNIESQYAILHDSLIDLSEQQLLDCDRIDQGCDGGLMHLAFQEIMRI--GGVEHEI 209

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY+G + ACR       V++ + Y    RDE  + + L +NGP+AVAI+   +  Y +
Sbjct: 210 DYPYQGIEYACRSAPSKFAVRLSHCYQYDLRDERKLLELLYKNGPIAVAIDCRDIIDYRS 269

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G++       D G   L+H+VL+VGYG++         PYWI KNSWG  WGE GYFR  
Sbjct: 270 GIATVCN---DNG---LNHAVLLVGYGIEND------TPYWIFKNSWGSNWGENGYFRAR 317

Query: 332 RGDGSCG-INDYVRSALV 348
           R   +CG +N++  SA++
Sbjct: 318 RNINACGMLNEFAASAVL 335


>gi|118157|sp|P25779.1|CYSP_TRYCR RecName: Full=Cruzipain; AltName: Full=Cruzaine; AltName:
           Full=Major cysteine proteinase; Flags: Precursor
 gi|162048|gb|AAA30181.1| cruzain [Trypanosoma cruzi]
 gi|29409382|gb|AAM33131.1| cysteine proteinase precursor [Trypanosoma cruzi]
          Length = 467

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 117/315 (37%), Positives = 166/315 (52%), Gaps = 18/315 (5%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           T+ F  F ++H + Y +  E   RL +F  NL  +  L    +    +G+  FSDL+  E
Sbjct: 35  TSQFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREE 93

Query: 99  FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F+++Y  G     +  +R+ VP  +  +  P A DWR   AVT VKDQ  CGS WAFS  
Sbjct: 94  FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GN+E  +      L +LSEQ L+ CD+ D GC GG ++NAF+ I+ +  G +  E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               G    C  +       I G+V + +DE  +A +L  NGP+AVA++A +   Y  GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                      +E L H VL+VGY          AVPYWIIKNSW   WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321

Query: 334 DGSCGINDYVRSALV 348
              C + +   SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336


>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 463

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 135/311 (43%), Positives = 178/311 (57%), Gaps = 28/311 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F+ +LE H++ Y +L E + R  IF  N   I    + +  S   GLN+FSDL+  EF+
Sbjct: 48  VFHQWLETHSRVYRSLSEKHHRFQIFKENFLYIHA-HNKQQKSYWLGLNKFSDLTHQEFR 106

Query: 101 AKYLGFKLKPSYADRSVPA-MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
           A+YLG   KP    R     M  ++      DWR   AVT VKDQ  CGS WAFS  G++
Sbjct: 107 AQYLG--TKPVNRQRKEANFMYEDVEAEPKVDWRLKGAVTDVKDQGACGSCWAFSAVGSV 164

Query: 160 EGVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           EGV A KT +LVSLSEQEL+DCD +++ GC GG +  AF+ I+    GG++ EK YPY+ 
Sbjct: 165 EGVNAIKTGELVSLSEQELVDCDRKQNQGCNGGLMDYAFEFIIKN--GGIDTEKDYPYKA 222

Query: 219 DDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVS 274
            D  C   ++ ++ V I+ Y  V ++ E+ + K L +N P++VAI A     Q Y  GV 
Sbjct: 223 RDGRCDEGRRNSKVVVIDDYQDVPTQSESALMKALTKN-PVSVAIEAGGRDFQHYQGGV- 280

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-- 332
               F    G+E L H VL VGYG D        V YWI+KNSWG GWGEKGY R+ R  
Sbjct: 281 ----FTGPCGSE-LDHGVLAVGYGTD-----DDGVNYWIVKNSWGPGWGEKGYIRMERFG 330

Query: 333 ---GDGSCGIN 340
               DG CGIN
Sbjct: 331 SDSTDGKCGIN 341


>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  207 bits (526), Expect = 7e-51,   Method: Compositional matrix adjust.
 Identities = 123/343 (35%), Positives = 184/343 (53%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  +  ++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVTVSTGKAPE 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  CGS WAFS  GNIEG +     +L SLSEQ L+ CD  +  C
Sbjct: 129 AVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAGHELTSLSEQTLVSCDPTEYAC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK---ACRLNKKATQVKINGYVSVSRDET 245
           EGG + NAF  I+S   G +  E++YPY    +   AC ++ K     I+ YV + +DE 
Sbjct: 189 EGGFMDNAFRWIISSNKGKVFTEQSYPYSSGGRNVPACNMSGKVVGANISDYVDLPQDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP++V ++A + Q Y  GV           ++ L+H+VL+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPVSVIVDATSFQSYTGGV------LTSCLSKILNHAVLLVGYD-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSW E WGEKGY R+ +G   C + +Y  SALV
Sbjct: 300 ---PPYWIIKNSWSEKWGEKGYIRIEKGTNQCLVQEYASSALV 339


>gi|9630927|ref|NP_047524.1| Cystein Protease [Bombyx mori NPV]
 gi|1168798|sp|P41721.1|CATV_NPVBM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|540066|gb|AAB49542.1| cysteine protease [Bombyx mori NPV]
 gi|3745946|gb|AAC63793.1| Cystein Protease [Bombyx mori NPV]
          Length = 323

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 118/317 (37%), Positives = 176/317 (55%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y++ VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P+        ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  +   E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLPLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  D G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>gi|118156|sp|P14658.1|CYSP_TRYBB RecName: Full=Cysteine proteinase; Flags: Precursor
 gi|10393|emb|CAA34485.1| unnamed protein product [Trypanosoma brucei]
          Length = 450

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 125/346 (36%), Positives = 179/346 (51%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSG 68
           V LL++   ++S        L  LH  +   + F  F +++ K Y    E   R   F  
Sbjct: 14  VVLLAMAACLASV------ALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFEE 67

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           N+ + ++ Q   +    +G+  FSD++  EF+A+Y       + A + +   + N+T   
Sbjct: 68  NMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGASYFAAAQKRLRKTV-NVTTGR 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWRE  AVT VK Q  CGS WAFST GNIEG +      LVSLSEQ L+ CD  D
Sbjct: 126 APAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGNPLVSLSEQMLVSCDTID 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSR 242
            GC GG + NAF+ I++  GG +  E +YPY    G+   C++N       I  +V + +
Sbjct: 186 SGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQCQMNGHEIGAAITDHVDLPQ 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           DE  +A YL ENGP+A+A++A +   Y  G+           ++ L H VL+VGY  +  
Sbjct: 246 DEDAIAAYLAENGPLAIAVDAESFMDYNGGI------LTSCTSKQLDHGVLLVGYNDNSN 299

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                  PYWIIKNSW   WGE GY R+ +G   C +N  V SA+V
Sbjct: 300 P------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQAVSSAVV 339


>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
 gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
          Length = 366

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 134/354 (37%), Positives = 192/354 (54%), Gaps = 43/354 (12%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHT--------ALFNYFLEQHNKTYATLVEYYSRL 63
           LL L  ++SS   +     +H HH + +        +++N++L +H+KTY  L E   R 
Sbjct: 10  LLFLFFTLSSAWDMSILSHNHGHHHQSSWRSDNEVISMYNWWLAKHSKTYNKLGEREKRF 69

Query: 64  HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN 123
            IF  NLR I    ++++ +   GL  F+DL+  E++AK+LG K  P    R + +  P+
Sbjct: 70  EIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFLGTKSDPKR--RLMKSKNPS 127

Query: 124 I--------TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSE 175
                     LP + DWR+  AV+ +KDQ  CGS WAFST   +EGV    T +L+SLSE
Sbjct: 128 QRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLSE 187

Query: 176 QELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVK 233
           QEL+DCD+  + GC GG + NAF  I++   GG++ +K YPY+  D  C   K K   V 
Sbjct: 188 QELVDCDRSYNAGCNGGLMDNAFQFIINN--GGIDTDKDYPYQAVDGKCDTTKVKNKAVT 245

Query: 234 INGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSH 290
           I+G+  V + DE  + K  V + P++VAI A   ALQFY +GV      F       L H
Sbjct: 246 IDGFEDVMAFDEMALQK-AVAHQPVSVAIEASGMALQFYQSGV------FTGECGSALDH 298

Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
            V+IVGYG      T   + YW+++NSWG  WGE GY ++ R       G CGI
Sbjct: 299 GVVIVGYG------TEDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKCGI 346


>gi|343471318|emb|CCD16236.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 122/340 (35%), Positives = 183/340 (53%), Gaps = 24/340 (7%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  ++ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRMFKQSMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+ +FSD+S  EF+A YL G K   +  +R  P  + N++    P 
Sbjct: 72  AKE-EAAANPYATFGVTQFSDMSPEEFRATYLNGAKYYAAALER--PRKVVNVSTGKAPP 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  CGS WAF+ TGNIEG +     +L SLSEQ L+ CD  +D C
Sbjct: 129 AVDWRKKGAVTPVKDQGSCGSCWAFAATGNIEGQWKIAGHELTSLSEQMLVSCDTTEDNC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD---KACRLNKKATQVKINGYVSVSRDET 245
            GG    AF  I+S   G +  E++YPY   D     C  + K    KI+G++++ +DE 
Sbjct: 189 RGGFADRAFKWIVSSNKGNVFTEESYPYASTDGYVPPCNKSGKVVGAKISGHINLPKDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L  NGP+A+A++A     Y  GV           +E LSH VL+VGY  D +K  
Sbjct: 249 AIAEWLARNGPVAIAVDASTFLDYKGGV------LTSCSSEGLSHDVLLVGYN-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
               PYWIIKNSW + WGE+GY R+ +G   C + +Y RS
Sbjct: 300 ---PPYWIIKNSWDKEWGEEGYIRIEKGTNLCLMKEYARS 336


>gi|302794759|ref|XP_002979143.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
 gi|300152911|gb|EFJ19551.1| hypothetical protein SELMODRAFT_110288 [Selaginella moellendorffii]
          Length = 227

 Score =  207 bits (526), Expect = 8e-51,   Method: Compositional matrix adjust.
 Identities = 104/234 (44%), Positives = 151/234 (64%), Gaps = 20/234 (8%)

Query: 120 MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
           ++P   LP++FDWRE+ A+T VK+Q  CGS W FS+TG +EG +  K+++L+SL E++L+
Sbjct: 3   LLPTDNLPKSFDWREHGAMTPVKNQGSCGSCWTFSSTGAVEGAHFLKSRELISLREEQLV 62

Query: 180 DCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD--------KACRLNKKATQ 231
           DCD+ D GC+GG + NA++ I +K   GLE E+ YPY+ ++          C        
Sbjct: 63  DCDRMDGGCKGGDMLNAYEYIKAK---GLEAEEDYPYQEENYKEYMFPHHRCHFRPSKVA 119

Query: 232 VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHS 291
             I  Y +VS DE  +A  LV+NGP+++A+NA  +  Y+ GV+ P    C GG +N++H+
Sbjct: 120 ATIANYSTVSEDEDQIAANLVKNGPLSIALNANYIMDYMGGVACP--RICPGG-DNMNHA 176

Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRS 345
           VL+VGYG+D  K      PYWI+KNSW E +GE GYFRL RG G CG+N  V +
Sbjct: 177 VLLVGYGMDGDK------PYWILKNSWSENYGEDGYFRLCRGFGVCGMNTRVST 224


>gi|74229746|ref|YP_308950.1| cathepsin [Trichoplusia ni SNPV]
 gi|72259660|gb|AAZ67431.1| cathepsin [Trichoplusia ni SNPV]
          Length = 344

 Score =  206 bits (525), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 121/312 (38%), Positives = 173/312 (55%), Gaps = 20/312 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  ++ K YA   E   R  IF  NL  I L ++ ++ S VY +N+F+DL+  E  A
Sbjct: 47  FETFQTKYKKVYADDNERDYRYKIFKTNLEIINL-KNQQNDSAVYNINKFADLTKNEVIA 105

Query: 102 KYLGFKLK-PSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           K+ G  +K P+  +   P ++  P+      FDWR+++ +T VKDQ  CGS WAFST   
Sbjct: 106 KFTGLGVKSPNLKNFCDPLIVDGPSKYTQETFDWRQFNKITSVKDQGFCGSCWAFSTIAG 165

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +E  YA K  + + LSEQ+L+DCD  D GC GG +  A++ IMS   GG+E E+ YPYR 
Sbjct: 166 LESQYAIKYNEHIDLSEQQLVDCDTIDMGCAGGLLHTAYEEIMSM--GGVEYEEDYPYRS 223

Query: 219 DDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
               CR+     QV + N Y  +   E  +   L E GP+AVA++A  L  Y  G+    
Sbjct: 224 VQGPCRIENDKFQVSVDNCYRYILYSEDKLKDVLHEMGPIAVAVDAVDLTDYYGGIITSC 283

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
           +      N  L+H+VL+VGYG      T   +P+W++KNSWG  +GE G+ R+ R   SC
Sbjct: 284 K------NYGLNHAVLLVGYG------TENGIPFWVLKNSWGTDYGENGFVRVKRNVNSC 331

Query: 338 G-INDYVRSALV 348
           G IN+   SA +
Sbjct: 332 GMINELAASARI 343


>gi|393717160|gb|AFN21082.1| V-Cath [Bombyx mori NPV]
 gi|393717442|gb|AFN21362.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score =  206 bits (525), Expect = 9e-51,   Method: Compositional matrix adjust.
 Identities = 118/317 (37%), Positives = 176/317 (55%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y++ VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P+        ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  +   E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  D G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>gi|332326591|gb|AEE42619.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 123/318 (38%), Positives = 175/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYWRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VKBQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKBQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  +A     LV LSEQ+L+ CD +D GC GG ++ AF+ ++  + G +  E +Y
Sbjct: 155 AVGNIESQWAVAXHGLVRLSEQQLVSCDDKDSGCGGGLMTQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PY---RGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY    GD   C  + +     +I+GYV +   ET MA +L ++GP+++A++A     Y 
Sbjct: 215 PYVSSTGDVPECTNSSELVPGARIDGYVMIESXETVMAAWLAKSGPISIAVDASPFMSYE 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV       C G  + L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV----LTSCVG--KXLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
           heavy chain; Contains: RecName: Full=Cathepsin L light
           chain; Flags: Precursor
 gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
          Length = 339

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 130/308 (42%), Positives = 179/308 (58%), Gaps = 27/308 (8%)

Query: 48  QHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYL 104
           QH K YA  VE   R+ IF+ N  KI +  Q    G   Y  GLN+++D+   EF+    
Sbjct: 34  QHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADMLHHEFKETMN 93

Query: 105 GFK--LKPSYADRS--VPAM-IP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G+   L+    +R+  V A  IP  ++T+P++ DWRE+ AVTGVKDQ  CGS WAFS+TG
Sbjct: 94  GYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHCGSCWAFSSTG 153

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
            +EG +  K   LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ EK+YP
Sbjct: 154 ALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYP 211

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTG 272
           Y G D +C  NK        G+V +   DE  M K +   GP++VAI+A   + Q Y  G
Sbjct: 212 YEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEG 271

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V +  +  CD   +NL H VL+VGYG D +      + YW++KNSWG  WGE+GY ++ R
Sbjct: 272 VYNEPE--CD--EQNLDHGVLVVGYGTDES-----GMDYWLVKNSWGTTWGEQGYIKMAR 322

Query: 333 G-DGSCGI 339
             +  CGI
Sbjct: 323 NQNNQCGI 330


>gi|2582055|gb|AAB82455.1| lymphopain [Mus musculus]
          Length = 371

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 122/333 (36%), Positives = 176/333 (52%), Gaps = 38/333 (11%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  + N++Y    EY  RL IF+ NL + Q LQ  + G+  +G   FSDL+  EF 
Sbjct: 39  VFKLFQIRFNRSYWNPAEYTRRLSIFAHNLAQAQRLQQEDLGTAEFGETPFSDLTEEEFG 98

Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWRE-YDAVTGVKDQTMCG 148
                      Y     P   PN+T           +PR  DWR+  + ++ VK+Q  C 
Sbjct: 99  Q---------LYGQERSPERTPNMTKKVESNTWGESVPRTCDWRKAKNIISSVKNQGSCK 149

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
             WA +   NI+ ++  K ++ V +S QEL+DC++  +GC GG + +A+ T+++    GL
Sbjct: 150 CCWAMAAADNIQALWRIKHQQFVDVSVQELLDCERCGNGCNGGFVWDAYLTVLN--NSGL 207

Query: 209 EEEKTYPYRGDDKACR-LNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
             EK YP++GD K  R L KK  +V  I  +  +S +E  +A YL  +GP+ V IN   L
Sbjct: 208 ASEKDYPFQGDRKPHRCLAKKYKKVAWIQDFTMLSNNEQAIAHYLAVHGPITVTINMKLL 267

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR------TKFTHK-----AVPYWIIK 315
           Q Y  GV       CD     + HSVL+VG+G  +      T  +H      + PYWI+K
Sbjct: 268 QHYQKGVIKATPSSCDP--RQVDHSVLLVGFGKKKEGMQTGTVLSHSRKRRHSSPYWILK 325

Query: 316 NSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           NSWG  WGEKGYFRLYRG+ +CG+  Y  +A V
Sbjct: 326 NSWGAHWGEKGYFRLYRGNNTCGVTKYPFTAQV 358


>gi|55735421|gb|AAV59468.1| cathepsin [Bombyx mori NPV]
          Length = 323

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 117/317 (36%), Positives = 176/317 (55%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y++ VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P+        ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  ++  E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  D G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 533

 Score =  206 bits (525), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 125/318 (39%), Positives = 176/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 126 ALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAEF 184

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAFS
Sbjct: 185 AARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 244

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIEG +     +LVSLSEQ+L+ CD  +DGC+GG +  AFD ++    G L  E +Y
Sbjct: 245 AVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDSY 304

Query: 215 PYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY   +    +    ++     +I+G+V +   E  MA +L +NGP+A+A++A +   Y 
Sbjct: 305 PYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYK 364

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV       C G  + L+H VL+VGY  D T      VPYW+IKNSWG  WGE+GY R+
Sbjct: 365 SGV----LTACIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVRV 412

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C +++Y  SA V
Sbjct: 413 VMGVNACLLSEYPVSAHV 430


>gi|12024965|gb|AAG45727.1| cathepsin L-like cysteine protease [Leishmania chagasi]
          Length = 381

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 128/320 (40%), Positives = 176/320 (55%), Gaps = 37/320 (11%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A     LVSLSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213

Query: 214 YPY---RGDDKACRLN--KKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           YPY    GD   C LN  K     +I+GYV +  +ET MA +L ENGP+A+A++A +   
Sbjct: 214 YPYTSGNGDVAEC-LNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMS 272

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G                   VL+VGY  ++T      VPYW+IKNSWGE WGEKGY 
Sbjct: 273 YQSG-------------------VLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYV 307

Query: 329 RLYRGDGSCGINDYVRSALV 348
           R+  G  +C +++Y  SA V
Sbjct: 308 RVAMGLNACLLSEYPVSAHV 327


>gi|15320768|ref|NP_203280.1| V-CATH [Epiphyas postvittana NPV]
 gi|37077652|sp|Q91GE3.1|CATV_NPVEP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|15213236|gb|AAK85675.1| V-CATH [Epiphyas postvittana NPV]
          Length = 323

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 118/316 (37%), Positives = 175/316 (55%), Gaps = 25/316 (7%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ Q+NK Y +  E   R  IF  NL  I  +    + + VY +N+FSDLS
Sbjct: 22  LKAPNYFEEFVRQYNKQYDSEYEKLRRYKIFQHNLNDI--ITKNRNDTAVYKINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P +       ++   P    P  FDWR ++ +T VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PLHTQNFCEVVVLDRPPGKGPLEFDWRRFNKITSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  +A    +L++LSEQ++IDCD  D GCEGG +  AF+ I+S   GG++ E 
Sbjct: 139 FATLASLESQFAIAHDRLINLSEQQMIDCDSVDVGCEGGLLHTAFEAIISM--GGVQIEN 196

Query: 213 TYPYRGDDKACRLNKKATQVKI---NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
            YPY   +  CR++     V +   N Y+++   E  +   L   GP+ VAI+A  +  Y
Sbjct: 197 DYPYESSNNYCRMDPTKFVVGVKQCNRYITIY--EEKLKDVLRLAGPIPVAIDASDILNY 254

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             G+      +C   N  L+H+VL+VGYGV+        VPYWI+KNSWG  WGE+G+F+
Sbjct: 255 EQGIIK----YC--ANNGLNHAVLLVGYGVENN------VPYWILKNSWGTDWGEQGFFK 302

Query: 330 LYRGDGSCGINDYVRS 345
           + +   +CGI + + S
Sbjct: 303 IQQNVNACGIKNELAS 318


>gi|215401412|ref|YP_002332715.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
 gi|209483953|gb|ACI47386.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
          Length = 337

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 125/323 (38%), Positives = 183/323 (56%), Gaps = 23/323 (7%)

Query: 33  LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
           L+++    L F  F+ Q+NK Y T  E   R +IF  N+  I   +++ + S +Y +N F
Sbjct: 30  LYNINSAPLYFEKFIAQYNKKYKTEDEKKYRYNIFRHNMESINH-KNSRNDSAIYKINRF 88

Query: 92  SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
           +D++  E   ++ G    +L  ++ +  V         P +FDWR  + VT VKDQ MCG
Sbjct: 89  ADMTKNEVVIRHTGLASGELGANFCETIVVDGPAQRQRPTSFDWRTLNKVTSVKDQGMCG 148

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           + WAF+  G +E  YA K  +L+ L+EQ+L+DCD  D GC+GG I  A++ IM    GG+
Sbjct: 149 ACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDSVDMGCDGGLIHTAYEQIMHM--GGV 206

Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           E+E  YPYR + + C L  +K A  V+ + Y  V  +E  +   L   GP+A+A++A  L
Sbjct: 207 EQEFDYPYRAERQPCALKPHKFAAGVR-SCYRYVLLNEERLEDLLRYVGPIAIAVDAVDL 265

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
             Y  G+      FC+  N  L+H+VL+VGYGV+        VP+WIIKNSWG  +GE G
Sbjct: 266 TDYYGGIVS----FCE--NNGLNHAVLLVGYGVENN------VPFWIIKNSWGSDYGEDG 313

Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
           Y R+ RG  SCG IN+   SA V
Sbjct: 314 YVRVRRGVNSCGMINELASSAQV 336


>gi|237643659|ref|YP_002884349.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
 gi|229358205|gb|ACQ57300.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
          Length = 323

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 118/317 (37%), Positives = 176/317 (55%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y++ VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P+        +I   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PTQTQNFCKVIILDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  +   E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  + G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>gi|228861649|ref|YP_002854669.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
 gi|226425097|gb|ACO53509.1| cathepsin [Euproctis pseudoconspersa nucleopolyhedrovirus]
          Length = 334

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 121/318 (38%), Positives = 180/318 (56%), Gaps = 20/318 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+  +NK Y   +E   R HIF  NL +I   ++  + + VY +N+FSDLS
Sbjct: 31  LKAADYFELFVANYNKNYTDPLEKTKRYHIFKDNLEEINN-KNKSNDTAVYRINKFSDLS 89

Query: 96  TAEFQAKYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           T E  +KY G  +    A+  + V    P    P  FDWR+ + VT +K+Q  CG+ WAF
Sbjct: 90  TNELISKYTGLNVPGETANFCKIVVLDQPPGKGPLNFDWRQQNKVTPIKNQGACGACWAF 149

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           +T  +IE  YA +    + LSEQ++IDCD  D GC GG +  AF+ ++    GG+EEE+ 
Sbjct: 150 ATLASIESQYAIRNNVHLDLSEQQMIDCDYVDMGCYGGLLHTAFEQMIQM--GGVEEERQ 207

Query: 214 YPYRGDDKACRL-NKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
           YPY G +  CRL + +   VK+ G Y  +   E  +   L   GP+ +AI+A ++  Y  
Sbjct: 208 YPYEGVNNNCRLKSDERFVVKVKGCYRYLVMREEKLKDLLRAVGPLPMAIDASSIFNYYR 267

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV +    +C  GN  L+H+VL+VGYGV+        VP+W  KN+WG+ WGE GYFR+ 
Sbjct: 268 GVIN----YC--GNNGLNHAVLLVGYGVE------NGVPFWTFKNTWGDDWGEDGYFRVR 315

Query: 332 RGDGSCG-INDYVRSALV 348
           +   +CG +N+   SA++
Sbjct: 316 QNVDACGMLNELTSSAVI 333


>gi|401430108|ref|XP_003879535.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|356491914|emb|CBZ40911.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 359

 Score =  206 bits (524), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 125/319 (39%), Positives = 178/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGECGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S+ GNIEG +     +LVSLSEQ+L+ CD  +DGC+GG +  AFD ++    G L  E +
Sbjct: 154 SSVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDS 213

Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY   +    +    +K     +I+G+V +   E  MA +L +NGP+A+A++A +   Y
Sbjct: 214 YPYVSGNGYLPECSNSSKLVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  + ++H+VL+VGY  D T      VPYW+IKNSWG  WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQVNHAVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340


>gi|444724527|gb|ELW65130.1| Cathepsin W [Tupaia chinensis]
          Length = 491

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 114/317 (35%), Positives = 175/317 (55%), Gaps = 14/317 (4%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  Q+N++Y++  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF 
Sbjct: 167 VFALFQIQYNRSYSSPAEHARRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTDEEFS 226

Query: 101 AKYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
             Y   K+        R V ++     +P   DWR+   ++ +++Q  C   WA +   N
Sbjct: 227 QVYKQPKVPGEVPRMVRKVRSLKQGKPVPPTCDWRKARIISPIRNQKNCSCCWAMAAADN 286

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IE  +  +  + V +S QEL+DC +  DGC+GG + +AF T+++    GL  EK YPY+ 
Sbjct: 287 IEAQWGIRYNQSVKVSVQELLDCGRCGDGCKGGWVWDAFITVLNN--SGLASEKDYPYQS 344

Query: 219 --DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
             D + CR+ K+     I  ++ +  +E  +A+YL  +GP+ V IN   L+ Y  GV   
Sbjct: 345 NVDPQRCRV-KRNKVAWIQDFIMLQDNEQIIAQYLASHGPITVTINMKPLKQYRKGVFEA 403

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRT-----KFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
               CD     + HSVL+VG+G  ++       T  + PYWI+KNSWG  WGEKGYFRL+
Sbjct: 404 TPATCDPW--LVDHSVLLVGFGSSKSVKGMRAGTASSKPYWILKNSWGAKWGEKGYFRLH 461

Query: 332 RGDGSCGINDYVRSALV 348
           RG  +CGI  Y  +A V
Sbjct: 462 RGSNTCGIAKYPLTARV 478


>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 474

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 129/315 (40%), Positives = 168/315 (53%), Gaps = 33/315 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  +L +H K+Y  + E   R  IF  NL+ I      E+ S   GLN F+D++  E++
Sbjct: 49  MFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNEEYR 108

Query: 101 AKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
             YLG K      +  S +DR  P  +   +LP + DWRE  AVTGVKDQ  CGS WAFS
Sbjct: 109 TGYLGAKRDASRNMVKSKSDRYAP--VAGDSLPDSIDWREKGAVTGVKDQGSCGSCWAFS 166

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           T   +EGV    T  L+SLSEQEL+DCD++ + GC GG +  AF  I+    GG++ E+ 
Sbjct: 167 TIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFIIKN--GGIDSEED 224

Query: 214 YPYRGDDKAC---RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
           YPY G D  C   R N  A    I+GY  V  +     +  V N P++VAI A  Y  Q 
Sbjct: 225 YPYTGKDGKCDSYRQN-NAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDFQL 283

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G+      F      +L H V  VGYG      T   V YWI+KNSWG+ WGEKGY 
Sbjct: 284 YSSGI------FTGSCGTDLDHGVAAVGYG------TENGVDYWIVKNSWGDYWGEKGYV 331

Query: 329 RLYRG----DGSCGI 339
           R+ R      G CGI
Sbjct: 332 RMQRNVKAKTGLCGI 346


>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 443

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 125/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +     +LVSLSEQ+L+ CD  +DGC+GG +  AFD ++    G L  E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDS 213

Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY   +    +    ++     +I+G+V +   E  MA +L +NGP+A+A++A +   Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  + L+H VL+VGY  D T      VPYW+IKNSWG  WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340


>gi|71663165|ref|XP_818579.1| cruzipain precursor [Trypanosoma cruzi strain CL Brener]
 gi|70883838|gb|EAN96728.1| cruzipain precursor, putative [Trypanosoma cruzi]
          Length = 467

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 117/315 (37%), Positives = 166/315 (52%), Gaps = 18/315 (5%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           T+ F  F ++H + Y +  E   RL +F  NL  +  L    +    +G+  FSDL+  E
Sbjct: 35  TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 93

Query: 99  FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F+++Y  G     +  +R+ VP  +  +  P A DWR   AVT VKDQ  CGS WAFS  
Sbjct: 94  FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GN+E  +      L +LSEQ L+ CD+ D GC GG ++NAF+ I+ +  G +  E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDFGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               G    C  +       I G+V + +DE  +A +L  NGP+AVA++A +   Y  GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                      +E L H VL+VGY          AVPYWIIKNSW   WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321

Query: 334 DGSCGINDYVRSALV 348
              C + +   SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336


>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
           cysteine proteinase A-2; Flags: Precursor
 gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
          Length = 444

 Score =  206 bits (523), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 128/320 (40%), Positives = 177/320 (55%), Gaps = 23/320 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +     +LVSLSEQ+L+ CD  +DGC+GG +  AFD ++    G L  E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDS 213

Query: 214 YPY---RGDDKACRLNKKATQV--KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           YPY    G    C  + +   V  +I+G+V +   E  MA +L +NGP+A+A++A +   
Sbjct: 214 YPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMS 273

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +GV       C G  + L+H VL+VGY  D T      VPYW+IKNSWG  WGE+GY 
Sbjct: 274 YKSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYV 321

Query: 329 RLYRGDGSCGINDYVRSALV 348
           R+  G  +C +++Y  SA V
Sbjct: 322 RVVMGVNACLLSEYPVSAHV 341


>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
 gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
          Length = 443

 Score =  205 bits (522), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 125/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +     +LVSLSEQ+L+ CD  +DGC+GG +  AFD ++    G L  E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHTEDS 213

Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY   +    +    ++     +I+G+V +   E  MA +L +NGP+A+A++A +   Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  + L+H VL+VGY  D T      VPYW+IKNSWG  WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340


>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
          Length = 322

 Score =  205 bits (522), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 123/304 (40%), Positives = 169/304 (55%), Gaps = 21/304 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ---LLQDTEHGSGVYGLNEFSDLSTAE 98
           F  F  +H KTY   VE  +R +IF  NLR I+   +L +    S   G+N F+D++  E
Sbjct: 25  FQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQEE 84

Query: 99  FQA-KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           F+A   L    KP +   +   ++  + +P + DWR    VTGVKDQ  CGS WAFS TG
Sbjct: 85  FRAFLTLSSSKKPHF--NTTEHVLTGLAVPDSIDWRTKGQVTGVKDQGNCGSCWAFSVTG 142

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           + E  Y  K  KLVSLSEQ+L+DC  + + GC GG +   F  + SK   GLE E TYPY
Sbjct: 143 STEAAYYRKAGKLVSLSEQQLVDCSTDINAGCNGGYLDETFTYVKSK---GLEAESTYPY 199

Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           +G D +C+ +      K++G+ S+ S DE  +   +   GP++VAI+A  L  Y +G+  
Sbjct: 200 KGTDGSCKYSASKVVTKVSGHKSLKSEDENALLDAVGNVGPVSVAIDATYLSSYESGIYE 259

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
               +C      L+H VL+VGYG    K       YWI+KNSWG  +GE GYFRL RG  
Sbjct: 260 --DDWCS--PSELNHGVLVVGYGTSNGK------KYWIVKNSWGGSFGESGYFRLLRGKN 309

Query: 336 SCGI 339
            CG+
Sbjct: 310 ECGV 313


>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 366

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 125/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +     +LVSLSEQ+L+ CD  +DGC GG +  AFD ++    G L  E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCSGGLMLQAFDWLLQNTNGHLYTEDS 213

Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY   +    +    ++     +I+G+V +   E  MA +L +NGP+A+A++A +   Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  + L+H VL+VGY  D T      VPYW+IKNSWG  WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340


>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
          Length = 443

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 125/319 (39%), Positives = 175/319 (54%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +     +LVSLSEQ+L+ CD  D+GC GG +  AFD ++    G L  E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMDNGCSGGLMLQAFDWLLQNTNGHLHTEDS 213

Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY   +    +    ++     +I+G+V +   E  MA +L +NGP+A+A++A +   Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  + L+H VL+VGY  D T      VPYW+IKNSWG  WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340


>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
 gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
 gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
          Length = 324

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 116/315 (36%), Positives = 175/315 (55%), Gaps = 20/315 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  FL + NK+Y++  E   R  IF  NL +I + ++    +  Y +N+F+DLS
Sbjct: 22  LKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEI-INKNHNDSTAQYEINKFADLS 80

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  +KY G  L P         ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 81  KDETISKYTGLSL-PLQTQNFCEVVVLDRPPDKGPLEFDWRRLNKVTSVKNQGMCGACWA 139

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  + ++LSEQ+LIDCD  D GC+GG +  AF+ +M+   GG++ E 
Sbjct: 140 FATLGSLESQFAIKHNQFINLSEQQLIDCDFVDAGCDGGLLHTAFEAVMNM--GGIQAES 197

Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  ++  CR N     VK+   Y  ++  E  +   L   GP+ VAI+A  +  Y  
Sbjct: 198 DYPYEANNGDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVAIDASDIVNYKR 257

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+      +C   N  L+H+VL+VGY V+        VP+WI+KN+WG  WGE+GYFR+ 
Sbjct: 258 GIMK----YC--ANHGLNHAVLLVGYAVE------NGVPFWILKNTWGADWGEQGYFRVQ 305

Query: 332 RGDGSCGINDYVRSA 346
           +   +CGI + + S+
Sbjct: 306 QNINACGIQNELPSS 320


>gi|393660044|gb|AFN09033.1| V-Cath [Bombyx mori NPV]
          Length = 323

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 117/317 (36%), Positives = 176/317 (55%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y++ VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P+        ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  +   E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  + G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
          Length = 340

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 126/323 (39%), Positives = 181/323 (56%), Gaps = 23/323 (7%)

Query: 33  LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
           L+++    L F  F+ Q+NK Y +  E   R +IF  N+  I   +++ + S VY +N F
Sbjct: 33  LYNINSAPLYFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQ-KNSRNDSAVYKINRF 91

Query: 92  SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
           +D++  E   ++ G    +L  ++ +  V         P  FDWR  + VT VKDQ MCG
Sbjct: 92  ADMTKNEIVIRHTGLASGELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCG 151

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           + WAF+  G +E  YA K  +L+ L+EQ+L+DCD  D GC+GG I  A++ IM    GG+
Sbjct: 152 ACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMRM--GGV 209

Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           E+E  YPY+ + + C L  +K A  V+ N Y  V  +E  +   L   GP+A+A++A  L
Sbjct: 210 EQEFDYPYKAERQPCALKPHKFAAGVR-NCYRYVLMNEERLEDLLRYVGPIAIAVDAVDL 268

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
             Y  G+      FC   N  L+H+VL+VGYGV+        VPYWIIKNSWG  +GE G
Sbjct: 269 TDYYGGIVS----FCK--NNGLNHAVLLVGYGVENN------VPYWIIKNSWGSDYGEDG 316

Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
           Y R+ RG  SCG IN+   SA V
Sbjct: 317 YVRVRRGVNSCGMINELASSAQV 339


>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
 gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
 gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
 gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
          Length = 339

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 126/323 (39%), Positives = 181/323 (56%), Gaps = 23/323 (7%)

Query: 33  LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
           L+++    L F  F+ Q+NK Y +  E   R +IF  N+  I   +++ + S VY +N F
Sbjct: 32  LYNINSAPLYFEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQ-KNSRNDSAVYKINRF 90

Query: 92  SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
           +D++  E   ++ G    +L  ++ +  V         P  FDWR  + VT VKDQ MCG
Sbjct: 91  ADMTKNEIVIRHTGLASGELGANFCETVVVDGPAQRQRPANFDWRTLNKVTSVKDQGMCG 150

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           + WAF+  G +E  YA K  +L+ L+EQ+L+DCD  D GC+GG I  A++ IM    GG+
Sbjct: 151 ACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMRM--GGV 208

Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           E+E  YPY+ + + C L  +K A  V+ N Y  V  +E  +   L   GP+A+A++A  L
Sbjct: 209 EQEFDYPYKAERQPCALKPHKFAAGVR-NCYRYVLMNEERLEDLLRYVGPIAIAVDAVDL 267

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
             Y  G+      FC   N  L+H+VL+VGYGV+        VPYWIIKNSWG  +GE G
Sbjct: 268 TDYYGGIVS----FCK--NNGLNHAVLLVGYGVENN------VPYWIIKNSWGSDYGEDG 315

Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
           Y R+ RG  SCG IN+   SA V
Sbjct: 316 YVRVRRGVNSCGMINELASSAQV 338


>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
 gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
          Length = 324

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 122/316 (38%), Positives = 175/316 (55%), Gaps = 22/316 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI--QLLQDTEHGSGVYGLNEFSD 93
           +K  + F  FL   NK Y++  E   R  IF  NL +I  + L DT   S  Y +N+FSD
Sbjct: 22  LKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDT---SAQYEINKFSD 78

Query: 94  LSTAEFQAKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
           LS  E  +KY G  L     +     ++  P    P  FDWR  + VT VK+Q  CG+ W
Sbjct: 79  LSKDETISKYTGLSLPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGACW 138

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
           AF+T G++E  +A K  +L++LSEQ+LIDCD  D GC+GG +  A++ +M+   GG++ E
Sbjct: 139 AFATLGSLESQFAIKHDQLINLSEQQLIDCDFVDMGCDGGLLHTAYEAVMNM--GGIQAE 196

Query: 212 KTYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
             YPY  ++  CRLN     VK+   Y  V   E  +   L   GP+ VAI+A  +  Y 
Sbjct: 197 NDYPYEANNGDCRLNAAKFVVKVKKCYRYVLMFEEKLKDLLRIVGPLPVAIDASDIVNYK 256

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            GV      +C   N  L+H+VL+VGY V+        VP+WI+KN+WG  WGE+GYFR+
Sbjct: 257 RGVIR----YC--ANHGLNHAVLLVGYAVEN------GVPFWILKNTWGTDWGEQGYFRV 304

Query: 331 YRGDGSCGINDYVRSA 346
            +   +CGI + + S+
Sbjct: 305 QQNINACGIQNELPSS 320


>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
 gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
          Length = 436

 Score =  205 bits (521), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 123/308 (39%), Positives = 167/308 (54%), Gaps = 22/308 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  + EQ+ KTY++  E  SRL +F  N   +       + S    LN F+DL+  EF+
Sbjct: 28  LFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFADLTHHEFK 87

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           A  LGF    + + RSV   +  + +P A DWR+  AVTGVKDQ  CG  W+FSTTG IE
Sbjct: 88  ASRLGFSPGRAQSIRSVGTPVQELHVPPAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIE 147

Query: 161 GVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           G+    T  LVSLSEQEL+DCD+  + GCEGG +  A+  ++     G++ E  YPY G 
Sbjct: 148 GINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKNQ--GIDSEADYPYVGM 205

Query: 220 DKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSHP 276
           DK C   K K   V I+GY  +  ++      +V   P++V I  +    Q Y  GV   
Sbjct: 206 DKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGSEKTFQLYSKGV--- 262

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
              +    +  L H+VLIVGYG      T   V +WI+KNSWGE WG +GY  + R +G+
Sbjct: 263 ---YTGPCSSTLDHAVLIVGYG------TEDGVDFWIVKNSWGEHWGMRGYIHMLRNNGT 313

Query: 337 ----CGIN 340
               CGIN
Sbjct: 314 AEGICGIN 321


>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
          Length = 318

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 122/305 (40%), Positives = 171/305 (56%), Gaps = 27/305 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG----LNEFSDLSTA 97
           F  F  +H+K+Y+  VE   RL IF+ NLR I+   +  + +G+      +N+F+DL+  
Sbjct: 25  FQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEE-HNALYAAGLVSYNKSVNQFTDLTID 83

Query: 98  EFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           EF+A YL    KP+    +VP +   + +P   DWR    VTGVKDQ  CGS WAFS  G
Sbjct: 84  EFKA-YLTLHSKPTL--NTVPYVRTGLQVPTTLDWRSQGYVTGVKDQGDCGSCWAFSVVG 140

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           + EG Y   T KLVSLSEQ+LIDC    +DGC+GG +   F  +      GL  E +YPY
Sbjct: 141 STEGAYYKSTGKLVSLSEQQLIDCTTNVNDGCDGGYLEETFPYVQQT---GLVSESSYPY 197

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV--S 274
            G D  CR+++     K++ YV +   E D+ + +   GP++VA++A  +  Y +GV  S
Sbjct: 198 TGRDGNCRISESDVVTKVSKYVLLG-GEADLLEAVGSVGPVSVAMDATYIYSYASGVYES 256

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
                +      +L+H VL+VGYG      T     YW+IKNSWG  WGE+GY +L RG 
Sbjct: 257 SLCSLY------SLNHGVLVVGYG------TQDGKDYWLIKNSWGNTWGEQGYLKLLRGT 304

Query: 335 GSCGI 339
             CGI
Sbjct: 305 NECGI 309


>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 503

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 125/318 (39%), Positives = 176/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 96  ALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAEF 154

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAFS
Sbjct: 155 AARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAFS 214

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIEG +     +LVSLSEQ+L+ CD  +DGC+GG +  AFD ++    G L  E +Y
Sbjct: 215 AVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDSY 274

Query: 215 PYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY   +    +    ++     +I+G+V +   E  MA +L +NGP+A+A++A +   Y 
Sbjct: 275 PYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSYK 334

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV       C G  + L+H VL+VGY  D T      VPYW+IKNSWG  WGE+GY R+
Sbjct: 335 SGV----LTACIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVRV 382

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C +++Y  SA V
Sbjct: 383 VMGVNACLLSEYPVSAHV 400


>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
 gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
          Length = 477

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 133/355 (37%), Positives = 179/355 (50%), Gaps = 35/355 (9%)

Query: 4   FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTA-----LFNYFLEQHNKTYATLVE 58
           F F A    LS+ +++   M + D  L H    + T      L+  +L ++ K Y  L E
Sbjct: 8   FAFLATFYFLSVCLAID--MSIIDYNLKHGQVPERTEAETLRLYEMWLVKYGKAYNALGE 65

Query: 59  YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
              R  IF  NL+ +       + S   GLN+F+DLS  E++A YLG ++         P
Sbjct: 66  KERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLGTRMDGKRRLLGGP 125

Query: 119 AMIPNI-----TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL 173
                +      LP + DWRE  AV  VKDQ  CGS WAFST G +EG+    T  L SL
Sbjct: 126 KSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSL 185

Query: 174 SEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQ 231
           SEQEL+DCD+  + GC GG +  AF+ IM    GG++ E+ YPY+  D  C  N+K A  
Sbjct: 186 SEQELVDCDKVYNQGCNGGLMDYAFEFIMKN--GGIDTEEDYPYKAVDSMCDPNRKNARV 243

Query: 232 VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLS 289
           V I+GY  V +++    +  V N P++VAI A   A Q Y +GV      F       L 
Sbjct: 244 VTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGV------FTGSCGTQLD 297

Query: 290 HSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
           H V+ VGYG      T   V YW+++NSWG  WGE GY R+ R       G CGI
Sbjct: 298 HGVVAVGYG------TENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGI 346


>gi|394331816|gb|AFN27127.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 118/318 (37%), Positives = 176/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWR+  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             G+IE  +A    +L +LSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEQQLVSCDDKDNGCAGGLMLQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY            ++Q+    +I+GY+++   ET MA +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSSTGYVPECSNSSQLVPGARIDGYLTIESSETVMAAWLAKNGPISIAVDASSFMSYQ 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV            + L+H VL+VGY  +RT      VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
          Length = 471

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 124/314 (39%), Positives = 166/314 (52%), Gaps = 29/314 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           ++ ++L +H K Y  + E   R  IF  NLR +         +   GL +F+DL+  E++
Sbjct: 51  MYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYR 110

Query: 101 AKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           A YLG K      L+   + R +     +  LP   DWRE  AVT VKDQ  CGS WAFS
Sbjct: 111 AMYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFS 170

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           T G++EG+    T  L+SLSEQEL+DCD+  + GC GG +  AF+ I+    GG++ E  
Sbjct: 171 TVGSVEGINQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKN--GGIDSEAD 228

Query: 214 YPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
           YPYR  D  C  N+K A  V I+GY  V  ++ +  K  V N P++VAI A     Q Y 
Sbjct: 229 YPYRASDNMCDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQ 288

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV      F      NL H V+ VGYG      T   + YWI++NSWG  WGE GY R+
Sbjct: 289 SGV------FTGRCGTNLDHGVVAVGYG------TENGIDYWIVRNSWGPKWGESGYIRM 336

Query: 331 YRG-----DGSCGI 339
            R       G CGI
Sbjct: 337 ERNVASTDTGKCGI 350


>gi|311247276|ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]
          Length = 367

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 120/319 (37%), Positives = 177/319 (55%), Gaps = 16/319 (5%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF- 99
           +F  F  Q+N++Y+   E+  RL IF+ NL K Q LQ+ + G+  +G+  FSDL+  EF 
Sbjct: 41  VFTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFG 100

Query: 100 --QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAV-TGVKDQTMCGSSWAFSTT 156
                + G    PS   + V +     T+P++ DWR+   V + +K Q  C   WA +  
Sbjct: 101 QLHGHHWGAGKAPSMGIK-VGSEESGETVPQSCDWRKKPGVISAIKHQKDCNCCWAMAAV 159

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
            N+E  +A K  + V LS Q+++DCD+  +GC GG + +AF T+++    GL  E+ YPY
Sbjct: 160 DNVEAQWAIKYHQAVQLSVQQVLDCDRCGNGCNGGFVWDAFLTVLNT--SGLASEQDYPY 217

Query: 217 RGDDKACR-LNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
           +G  K  R L K+  +V  I  ++ +   E  +A+YL   GP+ V INA  LQ Y  GV 
Sbjct: 218 KGTVKTHRCLAKQHRKVAWIQDFLMLQFCEQSIARYLATEGPITVTINAGLLQQYKRGVI 277

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVD-----RTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
                 CD     ++HSVL+VG+G       R      ++PYWI+KNSWG  WGE+GYFR
Sbjct: 278 RATPATCD--PHLVNHSVLLVGFGKSKSVEGRRPRPGHSIPYWILKNSWGPDWGEEGYFR 335

Query: 330 LYRGDGSCGINDYVRSALV 348
           L+RG  +CGI  Y  +A V
Sbjct: 336 LHRGSNTCGITKYPVTARV 354


>gi|71666430|ref|XP_820174.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70885508|gb|EAN98323.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 467

 Score =  204 bits (520), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 120/345 (34%), Positives = 176/345 (51%), Gaps = 20/345 (5%)

Query: 9   GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
            ++L ++ V ++  +      LH    +   + F  F ++H + Y +  E   RL +F  
Sbjct: 7   ALSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYESAAEEAFRLSVFRE 64

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
           NL  +  L    +    +G+  FSDL+  EF+++Y  G     +  +R+ VP  +  +  
Sbjct: 65  NLF-LARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P A DWR   AVT VKDQ  CGS WAFS  GN+E  +      L +LSEQ L+ CD+ D 
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
           GC GG ++NAF+ I+ +  G +  E +YPY    G    C  +       I G+V + +D
Sbjct: 184 GCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243

Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           E  +A +L  NGP+AVA++A +   Y  GV           +E L H VL+VGY      
Sbjct: 244 EAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               AVPYWIIKNSW   WGE GY R+ +G   C + +   SA+V
Sbjct: 293 -DSAAVPYWIIKNSWTAQWGEDGYIRIAKGSNQCLVKEEASSAVV 336


>gi|348565006|ref|XP_003468295.1| PREDICTED: cathepsin W-like [Cavia porcellus]
          Length = 375

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 121/326 (37%), Positives = 173/326 (53%), Gaps = 22/326 (6%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  Q N++Y+   EY  RL IF  NL   Q LQ+ E G+  +G+  FSDL+  EF 
Sbjct: 41  VFKLFQIQFNRSYSNQAEYARRLDIFVHNLATAQRLQEEELGTAEFGVTPFSDLTEEEFG 100

Query: 101 AKYLGFKL--KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
             Y   ++  K     R V        + ++ DWR+   ++ VK+Q  C   WA +  GN
Sbjct: 101 QLYGNRRVARKDLRVARKVSFDKQEELMSQSCDWRKAHIISPVKNQGNCRCCWAIAAAGN 160

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IE ++  + K  V+LS QEL+DC + +DGC GG I +AF T+++    GL  EK YP+RG
Sbjct: 161 IEAMWNIRYKVSVTLSVQELLDCARCEDGCAGGYIWDAFITVLNY--SGLASEKDYPFRG 218

Query: 219 --DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
             +   C  +       I  Y+ + RDE  +A+Y+   GP+ V IN+  LQ Y  G+   
Sbjct: 219 HANIHKCLASNYRKVAWIYDYIMLPRDEQGIARYVATQGPITVIINSKILQHYKKGIIKG 278

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDR--------TKFTHK------AVPYWIIKNSWGEGW 322
               CD     + H VL+VGYG  +        T  +H       ++PYWI+KNSWG  W
Sbjct: 279 TSSKCDPW--FVDHYVLLVGYGRSKAEEEKWTETDLSHSNRPPRHSIPYWILKNSWGANW 336

Query: 323 GEKGYFRLYRGDGSCGINDYVRSALV 348
           GE+GYFRL+RG  +CGI  Y  +A V
Sbjct: 337 GEEGYFRLHRGSNTCGITKYPITARV 362


>gi|393904668|gb|EFO15826.2| hypothetical protein LOAG_12683 [Loa loa]
          Length = 202

 Score =  204 bits (519), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 102/202 (50%), Positives = 130/202 (64%), Gaps = 10/202 (4%)

Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
           CGS WAFS TGNIEG +A K  KL+SLSEQELIDCD  D GC+GG   NA+  I+    G
Sbjct: 10  CGSCWAFSVTGNIEGAWAIKKGKLISLSEQELIDCDVIDQGCKGGLPLNAYKEIIRM--G 67

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           GLE EK YPY G  + C L +K   V IN  + +  DE  +A ++ + GP+++ +NA  L
Sbjct: 68  GLESEKDYPYDGHGEKCHLVRKEIAVYINDSIQLPDDEIKIAAWVAKKGPVSIGVNAGPL 127

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           QFY  G+SHP + FC     +++H VLIVGYG +  K      PYWIIKNSWG  WGE G
Sbjct: 128 QFYRHGISHPWKAFCL--PSHINHGVLIVGYGQEANK------PYWIIKNSWGTKWGENG 179

Query: 327 YFRLYRGDGSCGINDYVRSALV 348
           Y+RLYRG   CG+ +   +A+V
Sbjct: 180 YYRLYRGKNVCGVKEMATTAIV 201


>gi|14602252|ref|NP_148795.1| ORF11 cathepsin [Cydia pomonella granulovirus]
 gi|13124000|sp|O91466.1|CATV_GVCPM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|14591773|gb|AAK70678.1| ORF11 cathepsin [Cydia pomonella granulovirus]
          Length = 333

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 189/348 (54%), Gaps = 29/348 (8%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           LL+  +  S   V      + L++     LF  F  ++NKTY +  E   +L  F  NL+
Sbjct: 4   LLNFVILASVLTVTAHALTYDLNNSDE--LFKNFAIKYNKTYVSDEERAIKLENFKNNLK 61

Query: 72  KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL----KPSYADRSVPAMI-----P 122
            I   ++      V+ +NE+SDL+      +  GF+L     PS    +  +++     P
Sbjct: 62  MINE-KNMASKYAVFDINEYSDLNKNALLRRTTGFRLGLKKNPSAFTMTECSVVVIKDEP 120

Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
              LP   DWR+   VT VK+Q  CGS WAFST  NIE +Y  K  K ++LSEQ L++CD
Sbjct: 121 QALLPETLDWRDKHGVTPVKNQMECGSCWAFSTIANIESLYNIKYDKALNLSEQHLVNCD 180

Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS-VS 241
             ++GC GG +  A ++I+ +  GG+   +  PY G D  C+  K   ++ I+G    V 
Sbjct: 181 NINNGCAGGLMHWALESILQE--GGVVSAENEPYYGFDGVCK--KSPFELSISGSRRYVL 236

Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           ++E  + + LV NGP++VAI+   L  Y  G++      C+  NE L+H+VL+VGYGV  
Sbjct: 237 QNENKLRELLVVNGPISVAIDVSDLINYKAGIAD----ICE-NNEGLNHAVLLVGYGVKN 291

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
                  VPYWI+KNSWG  WGE+GYFR+ R   SCG +N+Y  SA++
Sbjct: 292 D------VPYWILKNSWGAEWGEEGYFRVQRDKNSCGMMNEYASSAIL 333


>gi|312095086|ref|XP_003148243.1| hypothetical protein LOAG_12683 [Loa loa]
          Length = 195

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 102/202 (50%), Positives = 130/202 (64%), Gaps = 10/202 (4%)

Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGG 206
           CGS WAFS TGNIEG +A K  KL+SLSEQELIDCD  D GC+GG   NA+  I+    G
Sbjct: 3   CGSCWAFSVTGNIEGAWAIKKGKLISLSEQELIDCDVIDQGCKGGLPLNAYKEIIRM--G 60

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           GLE EK YPY G  + C L +K   V IN  + +  DE  +A ++ + GP+++ +NA  L
Sbjct: 61  GLESEKDYPYDGHGEKCHLVRKEIAVYINDSIQLPDDEIKIAAWVAKKGPVSIGVNAGPL 120

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           QFY  G+SHP + FC     +++H VLIVGYG +  K      PYWIIKNSWG  WGE G
Sbjct: 121 QFYRHGISHPWKAFCL--PSHINHGVLIVGYGQEANK------PYWIIKNSWGTKWGENG 172

Query: 327 YFRLYRGDGSCGINDYVRSALV 348
           Y+RLYRG   CG+ +   +A+V
Sbjct: 173 YYRLYRGKNVCGVKEMATTAIV 194


>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
 gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 126/320 (39%), Positives = 168/320 (52%), Gaps = 25/320 (7%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
             LH          ++ +H K Y    E   R  IF  N+  I+      + S + G+N 
Sbjct: 28  RELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEFIESSNAAGNNSYMLGINR 87

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSV-PAMIPNIT-LPRAFDWREYDAVTGVKDQTMCG 148
           F+DL+  EF+A + G+K +P  A R V P    N+T LP + DWR   AVT +KDQ  CG
Sbjct: 88  FADLTNEEFRASWNGYK-RPLDASRIVTPFKYENVTALPYSMDWRRKGAVTSIKDQRECG 146

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGG 206
           S WAFS     EGV+  +T KLVSLSEQEL+DCD   ED GC+GG + +AF  I  K  G
Sbjct: 147 SCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGEDKGCQGGLMEDAFKFI--KRNG 204

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSVSRDETDMAKYLVENGPMAVAINA-- 263
           G+  E  Y YRG D  C   K+A+ V KI GY  V  +        V + P++V+I+A  
Sbjct: 205 GITTEANYAYRGRDGKCDTKKEASHVAKITGYQVVPENSEAALLKAVAHQPVSVSIDAGS 264

Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
            + QFY +G+      +      +L+H V  VGYG      +     YWI+KNSWG  WG
Sbjct: 265 MSFQFYQSGI------YAGSCGSDLNHGVAAVGYGT-----SSSGSKYWIVKNSWGPEWG 313

Query: 324 EKGYFRLYRG----DGSCGI 339
           E+GY R+ R      G CGI
Sbjct: 314 ERGYVRMKRDITSRKGLCGI 333


>gi|66730453|ref|NP_001019413.1| cathepsin W precursor [Rattus norvegicus]
 gi|62531092|gb|AAH93401.1| Cathepsin W [Rattus norvegicus]
 gi|149062072|gb|EDM12495.1| cathepsin W [Rattus norvegicus]
          Length = 371

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 118/325 (36%), Positives = 180/325 (55%), Gaps = 22/325 (6%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  Q N++Y+   EY  RL IF+ NL + Q LQ+ + G+  +G   FSDL+  EF 
Sbjct: 39  VFKLFQIQFNRSYSNPAEYTRRLGIFAHNLAQAQRLQEEDLGTAEFGQTPFSDLTEEEFG 98

Query: 101 AKYLGFKLKPSY---ADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
             Y G +  P       + V +     ++P   DWR+  + ++ +K+Q  C   WA +  
Sbjct: 99  QLY-GHQRAPERILNMAKKVKSERWGESVPPTCDWRKVKNIISSIKNQGNCRCCWAIAAA 157

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
            NI+ ++  KT++ V +S QEL+DCD+  +GC GG + +A+ T+++    GL  E+ YP+
Sbjct: 158 DNIQTLWRIKTQQFVDVSVQELLDCDRCGNGCNGGFVWDAYITVLN--NSGLASEEDYPF 215

Query: 217 RGDDKA--CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
           +G  K   C  +K      I  +  +S +E  +A YL  +GP+ V IN   LQ+Y  GV 
Sbjct: 216 QGHQKPHRCLADKYRKVAWIQDFTMLSSNEQVIAGYLAIHGPITVTINMKLLQYYQKGVI 275

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDR------TKFTH-----KAVPYWIIKNSWGEGWG 323
                 CD     ++HSVL+VG+G ++      T  +H     ++ PYWI+KNSWG  WG
Sbjct: 276 KATPSTCDP--HLVNHSVLLVGFGKEKGGMQTGTLLSHSRKPRRSTPYWILKNSWGAEWG 333

Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
           EKGYFRLYRG+ +CGI  Y  +A V
Sbjct: 334 EKGYFRLYRGNNTCGIAKYPITARV 358


>gi|1136308|gb|AAB41119.1| cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 176/345 (51%), Gaps = 20/345 (5%)

Query: 9   GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
            ++L ++ V ++  +      LH    +   + F  F ++H + Y +  E   RL +F  
Sbjct: 7   ALSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYGSAAEEAFRLSVFRE 64

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
           NL  +  L    +    +G+  FSDL+  EF+++Y  G     +  +R+ VP  +  +  
Sbjct: 65  NL-FLARLHAAANPHATFGVTAFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P A DWR   AVT VKDQ  CGS WAFS  GN+E  +      L +LSEQ L+ CD+ D 
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
           GC GG ++NAF+ I+ +  G +  E +YPY    G    C  +       I G+V + +D
Sbjct: 184 GCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243

Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           E  +A +L  NGP+AVA++A +   Y  GV           +E L H VL+VGY      
Sbjct: 244 EAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               AVPYW+IKNSW   WGE GY R+ +G   C + +   SA+V
Sbjct: 293 -DSAAVPYWVIKNSWTTQWGEDGYIRIAKGSNQCLVKEEASSAVV 336


>gi|375073976|gb|AFA34855.1| cathepsin L-like protein [Trypanosoma cruzi]
          Length = 467

 Score =  204 bits (518), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 176/345 (51%), Gaps = 20/345 (5%)

Query: 9   GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
            ++L ++ V ++  +      LH    +   + F  F ++H + Y +  E   RL +F  
Sbjct: 7   ALSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYGSAAEEAFRLSVFRE 64

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
           NL  +  L    +    +G+  FSDL+  EF+++Y  G     +  +R+ VP  +  +  
Sbjct: 65  NL-FLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P A DWR   AVT VKDQ  CGS WAFS  GN+E  +      L +LSEQ L+ CD+ D 
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
           GC GG ++NAF+ I+ +  G +  E +YPY    G    C  +       I G+V + +D
Sbjct: 184 GCGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243

Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           E  +A +L  NGP+AVA++A +   Y  GV           +E L H VL+VGY      
Sbjct: 244 EAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               AVPYW+IKNSW   WGE GY R+ +G   C + +   SA+V
Sbjct: 293 -DSAAVPYWVIKNSWTTQWGEDGYIRIAKGSNQCLVKEEASSAVV 336


>gi|296085959|emb|CBI31400.3| unnamed protein product [Vitis vinifera]
          Length = 257

 Score =  204 bits (518), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 109/235 (46%), Positives = 143/235 (60%), Gaps = 20/235 (8%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
           LP +FDWRE  AVT VK Q  CGS WAFSTTG +EG +   TKKL++LSEQ+L+DCD   
Sbjct: 17  LPESFDWREKGAVTEVKMQGTCGSCWAFSTTGAVEGAHFISTKKLLTLSEQQLVDCDHMC 76

Query: 185 --------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
                   D GCEGG ++NA+  ++    GGLEEE +YPY G    C+       V++  
Sbjct: 77  DIRDKTACDSGCEGGLMTNAYKYLIE--AGGLEEESSYPYTGKHGECKFKPDRVAVRVVN 134

Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
           +  V  +E  +A  LV +GP+AV +NA  +Q Y+ GVS P+   C      ++H VL+VG
Sbjct: 135 FTEVPINENQIAANLVCHGPLAVGLNAIFMQTYIGGVSCPL--ICP--KRWINHGVLLVG 190

Query: 297 YGVDR---TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           YG       +F +K  PYWIIKNSWG+ WGE GY+RL RG G CG+N  V +  V
Sbjct: 191 YGAKGYSILRFGYK--PYWIIKNSWGKRWGEHGYYRLCRGHGMCGMNTMVSAVNV 243


>gi|27819101|gb|AAO23117.1| cysteine proteinase [Bombyx mori NPV]
          Length = 323

 Score =  204 bits (518), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 117/317 (36%), Positives = 175/317 (55%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y++ VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P+        ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ++I CD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIGCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  +   E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  D G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFDSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>gi|8468605|gb|AAF75546.1| cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  203 bits (517), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 120/345 (34%), Positives = 175/345 (50%), Gaps = 20/345 (5%)

Query: 9   GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
            ++L ++ V ++  +      LH    +   + F  F ++H + Y +  E   RL +F  
Sbjct: 7   ALSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYESAAEEAFRLSVFRE 64

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
           NL  +  L    +    +G+  FSDL+  EF+++Y  G     +  +R+ VP  +  +  
Sbjct: 65  NLF-LARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P A DWR   AVT VKDQ  CGS WAFS  GN+E  +      L +LSEQ L+ CD+ D 
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
           GC GG ++NAF  I+ +  G +  E +YPY    G    C  +       I G+V + +D
Sbjct: 184 GCGGGLMNNAFGWIVQENNGAVYTENSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243

Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           E  +A +L  NGP+AVA++A +   Y  GV           +E L H VL+VGY      
Sbjct: 244 EAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               AVPYWIIKNSW   WGE GY R+ +G   C + +   SA+V
Sbjct: 293 -DSAAVPYWIIKNSWTAQWGEDGYIRIAKGSNQCLVKEEASSAVV 336


>gi|209170907|ref|YP_002268053.1| agip23 [Agrotis ipsilon multiple nucleopolyhedrovirus]
 gi|208436498|gb|ACI28725.1| viral cathepsin [Agrotis ipsilon multiple nucleopolyhedrovirus]
          Length = 364

 Score =  203 bits (517), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 124/323 (38%), Positives = 182/323 (56%), Gaps = 23/323 (7%)

Query: 33  LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
           L+++    L F  F+ Q+NK Y    E   R +IF  N+  I   +++ + S VY +N F
Sbjct: 57  LYNINSAPLYFEKFISQYNKHYKNEDEKKYRYNIFRHNIESINH-KNSRNDSAVYKINRF 115

Query: 92  SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
           +D++  E   ++ G    +L  ++ +  V         P +FDWR  + VT VKDQ MCG
Sbjct: 116 ADMTKNEVVIRHTGLASGELGVNFCETIVVDGPGQRQRPTSFDWRTLNKVTSVKDQGMCG 175

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           + WAF+  G +E  YA K  +L+ LSEQ+L+DCD  D GC+GG I  A++ IM    GG+
Sbjct: 176 ACWAFAGLGALESQYAIKYDRLIDLSEQQLVDCDHVDMGCDGGLIHTAYEEIMRM--GGV 233

Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           E++  YPYR + + C L  +K A  V+ + Y  V  +E  +   L   GP+A+A++A  +
Sbjct: 234 EQDFDYPYRAERQPCALKPHKFAAGVR-SCYRYVLLNEERLEDLLRHVGPIAIAVDAVDI 292

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
             Y  G+      FC+  N  L+H+VL+VGYGV+        VPYWI+KNSWG  +GE G
Sbjct: 293 TDYYGGIVS----FCE--NNGLNHAVLLVGYGVENN------VPYWILKNSWGSDYGEDG 340

Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
           Y R+ RG  SCG IN+   SA V
Sbjct: 341 YVRVRRGVNSCGMINELASSAQV 363


>gi|71406896|ref|XP_805951.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70869552|gb|EAN84100.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 426

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 116/313 (37%), Positives = 164/313 (52%), Gaps = 18/313 (5%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           T+ F  F ++H + Y +  E   RL +F  NL  +  L    +    +G+  FSDL+  E
Sbjct: 35  TSQFAEFKQKHGRVYESAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 93

Query: 99  FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F+++Y  G     +  +R+ VP  +  +  P A DWR   AVT VKDQ  CGS WAFS  
Sbjct: 94  FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GN+E  +      L +LSEQ L+ CD+ D GC GG ++NAF+ I+ +  G +  E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               G    C  +       I G+V + +DE  +A +L  NGP+AVA++A +   Y  GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                      +E L H VL+VGY          AVPYWIIKNSW   WGE+GY R+ +G
Sbjct: 274 MTSCV------SEQLDHGVLLVGYN------DSAAVPYWIIKNSWTTQWGEEGYIRIAKG 321

Query: 334 DGSCGINDYVRSA 346
              C + +   SA
Sbjct: 322 LNQCLVKEEASSA 334


>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
          Length = 338

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 118/329 (35%), Positives = 174/329 (52%), Gaps = 24/329 (7%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
            L+  ++       +G  +L+ L       LF  F++ +NK Y    E   R  IF  NL
Sbjct: 12  VLVLFSIDQCKVRELGQRRLYSLEEA--PTLFEQFIKDYNKEYDE-SEKEERFKIFVNNL 68

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK--PSYADRSVPAMIP--NITL 126
           + I  + +    + VYG+N+FSDLS  EF   Y G K +  PS  D     +    N+T 
Sbjct: 69  KDINAMNE-RSSNAVYGINKFSDLSKEEFIKYYTGLKREESPSNEDHKKTDLPESFNVTA 127

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P  FDWR+   V+ +K+Q  CGS WAFS   N+E ++A KT KL+ +SEQ+L+DCD+ D 
Sbjct: 128 PDQFDWRKKGVVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDCDKYDS 187

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
           GC GG     +D +   +  G    K+YPY   +  CR +    ++++ GY   S+   D
Sbjct: 188 GCSGGL---PWDALRYFVANGAMSLKSYPYVAKEGKCRYDSSKVEIRLKGYKIFSKISED 244

Query: 247 MAK-YLVENGPMAVAINAYALQFYVTG-VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
             K +L   GP+++AI+   ++ YV G V       C      ++H+VL+VGYG + +  
Sbjct: 245 QIKEHLYNIGPLSIAIDVSPIKPYVGGIVMEECHEVC-----QVNHAVLLVGYGKEYS-- 297

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
               V YWI+KNSWG  WGE GYFR+ RG
Sbjct: 298 ----VEYWIVKNSWGPNWGENGYFRMERG 322


>gi|8468607|gb|AAF75547.1| cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  203 bits (517), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 120/344 (34%), Positives = 176/344 (51%), Gaps = 20/344 (5%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           ++L ++ V ++  +      LH    +   + F  F ++H + Y +  E   RL +F  N
Sbjct: 8   LSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYESAAEEAFRLSVFREN 65

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITLP 127
           L  +  L    +    +G+  FSDL+  EF ++Y  G     +  +R+ VP  +  +  P
Sbjct: 66  LF-LARLHAAANPHATFGVTPFSDLTREEFWSRYHNGAAHFAAAQERARVPVNVEVVGAP 124

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
            A DWR   AVT VKDQ  CGS WAFS  GN+E  +      L +LSEQ L+ CD+ D G
Sbjct: 125 AAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDSG 184

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDE 244
           C GG ++NAF+ I+ +  G +  E +YPY    G    C  +       I G+V + +DE
Sbjct: 185 CGGGLMNNAFEWIVQENNGAVYTEGSYPYASGEGISPPCTTSGHTVGATITGHVEIPQDE 244

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
             +A +L  NGP+AVA++A +   Y  GV           +E L H VL+VGY       
Sbjct: 245 AQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN------ 292

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
              AVPYW+IKNSW   WGE GY R+ +G   C + + V SA+V
Sbjct: 293 DSAAVPYWVIKNSWTTHWGEGGYIRIAKGSNQCLVKEGVSSAVV 336


>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
          Length = 344

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 126/339 (37%), Positives = 186/339 (54%), Gaps = 25/339 (7%)

Query: 28  EKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSR------LHIFSGNLRKIQLLQDTEH 81
           +K   L H K+ + ++ +++++NK +   V+ YS         +F  NL  I +  + E+
Sbjct: 13  DKSAALAHQKYLSAWSSWVKEYNKEH--WVDPYSSPESTRAFEVFQKNLDMI-MKHNEEY 69

Query: 82  GSGV----YGLNEFSDLSTAEFQAKYLGFK----LKPSYADRSVPAMIPNITLPRAFDWR 133
             G+     GLN F+ L+  EF A+YLG+      +P               +P + DWR
Sbjct: 70  NQGLQSYEMGLNGFAHLTFEEFSAQYLGYGGAEVEQPKTRRAGKHERKSRSEIPASVDWR 129

Query: 134 EYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGG 191
           E  AV  VK+Q  CGS WAFS    +EG +   + +L+SLSEQ+L+DC ++  + GC GG
Sbjct: 130 EKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGELISLSEQQLVDCSKKFGNHGCAGG 189

Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKY 250
            + NAF+  M+  G G + EK YPY+G D  C+ +    +  I+GY  V + +ETD+   
Sbjct: 190 YMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCKFSADGVRATISGYNDVKQGNETDLLDA 249

Query: 251 LVENGPMAVAINA-YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
           +   GP++VAI+A  ALQFY+ GV + +   C G    L+H V  VGYG    +F  K +
Sbjct: 250 VANVGPVSVAIHAGAALQFYLRGVFNGVAGTCFG---PLNHGVTAVGYGTASLRFGRK-M 305

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
            YWIIKNSWG GWGEKG+ R  RG   CG+ +     LV
Sbjct: 306 DYWIIKNSWGMGWGEKGFVRFARGKNLCGVANGASYPLV 344


>gi|20069912|ref|NP_613116.1| cathepsin [Mamestra configurata NPV-A]
 gi|37077373|sp|Q8QLK1.1|CATV_NPVMC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|20043306|gb|AAM09141.1| cathepsin [Mamestra configurata NPV-A]
 gi|33331744|gb|AAQ11052.1| putative cysteine proteinase [Mamestra configurata NPV-A]
          Length = 337

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 127/343 (37%), Positives = 183/343 (53%), Gaps = 21/343 (6%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
           LL   V  S   VV      +L+++    L F  F+ Q+NK Y++  E   R +IF  N+
Sbjct: 9   LLVSAVLTSHDQVVAVTIKPNLYNINSAPLYFEKFISQYNKQYSSEDEKKYRYNIFRHNI 68

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLP 127
             I   +++ + S VY +N F+D++  E   ++ G     +  ++ +  V         P
Sbjct: 69  ESINA-KNSRNDSAVYKINRFADMTKNEVVNRHTGLASGDIGANFCETIVVDGPGQRQRP 127

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
             FDWR Y+ VT VKDQ MCG+ WAF+  G +E  YA K  +L+ L+EQ+L+DCD  D G
Sbjct: 128 ANFDWRNYNKVTSVKDQGMCGACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMG 187

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETD 246
           C+GG I  A++ IM    GG+E+E  YPY+     C +      V + N Y  V   E  
Sbjct: 188 CDGGLIHTAYEQIMHI--GGVEQEYDYPYKAVRLPCAVKPHKFAVGVRNCYRYVLLSEER 245

Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
           +   L   GP+A+A++A  L  Y  GV      FC+  N  L+H+VL+VGYG++      
Sbjct: 246 LEDLLRHVGPIAIAVDAVDLTDYYGGVIS----FCE--NNGLNHAVLLVGYGIENN---- 295

Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCG-INDYVRSALV 348
             VPYW IKNSWG  +GE GY R+ RG  SCG IN+   SA +
Sbjct: 296 --VPYWTIKNSWGSDYGENGYVRIRRGVNSCGMINELASSAQI 336


>gi|9634237|ref|NP_037776.1| ORF16 cathepsin [Spodoptera exigua MNPV]
 gi|37077857|sp|Q9J8B9.1|CATV_NPVSE RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|6960476|gb|AAF33546.1|AF169823_16 ORF16 cathepsin [Spodoptera exigua MNPV]
          Length = 337

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 125/323 (38%), Positives = 182/323 (56%), Gaps = 23/323 (7%)

Query: 33  LHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEF 91
           L+++    L F  F+ Q+NK Y +  E   R +IF  N+  I   +++ + S VY +N F
Sbjct: 30  LYNINSAPLYFEKFITQYNKQYKSEDEKKYRYNIFRHNIESINQ-KNSRNDSAVYKINRF 88

Query: 92  SDLSTAEFQAKYLGF---KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
           +D+   E   ++ G    +L  ++ +  V         P +FDWR  + +T VKDQ MCG
Sbjct: 89  ADMPKNEIVIRHTGLASGELGLNFCETIVVDGPAQRQRPVSFDWRSMNKITSVKDQGMCG 148

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           + W F++ G +E  YA K  +L+ LSEQ+L+DCD  D GC+GG I  A++ IM    GG+
Sbjct: 149 ACWRFASLGALESQYAIKYDRLIDLSEQQLVDCDFVDMGCDGGLIHTAYEQIMKM--GGV 206

Query: 209 EEEKTYPYRGDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           E+E  Y Y+ + + C L  +K AT V+ N Y  V  +E  +   L   GP+A+A++A  L
Sbjct: 207 EQEFDYSYKAERQPCALKPHKFATGVR-NCYRYVILNEERLEDLLRYVGPIAIAVDAVDL 265

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
             Y  G+      FC+  N  L+H+VL+VGYGV+        VPYWIIKNSWG  +GE G
Sbjct: 266 TDYYGGIVS----FCE--NNGLNHAVLLVGYGVENN------VPYWIIKNSWGSDYGEDG 313

Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
           Y R+ RG  SCG IN+   SA V
Sbjct: 314 YVRVRRGVNSCGMINELASSAQV 336


>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
          Length = 324

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 119/316 (37%), Positives = 174/316 (55%), Gaps = 22/316 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI--QLLQDTEHGSGVYGLNEFSD 93
           +K  + F  FL   NK Y++  E   R  IF  NL +I  + L DT   S  Y +N+FSD
Sbjct: 22  LKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDT---SAQYEINKFSD 78

Query: 94  LSTAEFQAKYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
           LS  E  +KY G  L     +     ++  P    P  FDWR  + VT VK+Q  CG+ W
Sbjct: 79  LSKDETISKYTGLSLPLQNQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGTCGACW 138

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
           AF+T G++E  +A K  +L++LSEQ+LIDCD  D GC+GG +  A++ +M+   GG++ E
Sbjct: 139 AFATLGSLESQFAIKHDQLINLSEQQLIDCDFVDMGCDGGLLHTAYEAVMNM--GGIQAE 196

Query: 212 KTYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
             YPY  ++  CR N     VK+   Y  ++  E  +   L   GP+ VAI+A  +  Y 
Sbjct: 197 NDYPYEANNGDCRANAAKFVVKVKKCYRYITVFEEKLKDLLRSVGPIPVAIDASDIVNYK 256

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            G+      +C   N  L+H+VL+VGY V         VP+WI+KN+WG  WGE+GYFR+
Sbjct: 257 RGIMK----YC--ANHGLNHAVLLVGYAV------QNGVPFWILKNTWGADWGEQGYFRV 304

Query: 331 YRGDGSCGINDYVRSA 346
            +   +CGI + + S+
Sbjct: 305 QQNINACGIQNELPSS 320


>gi|259016196|sp|P56202.2|CATW_HUMAN RecName: Full=Cathepsin W; AltName: Full=Lymphopain; Flags:
           Precursor
          Length = 376

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 119/328 (36%), Positives = 176/328 (53%), Gaps = 27/328 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  Q N++Y +  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF  
Sbjct: 42  FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
            Y G++      PS   R + +  P  ++P + DWR+   A++ +KDQ  C   WA +  
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVASAISPIKDQKNCNCCWAMAAA 159

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIE ++       V +S QEL+DC +  DGC GG + +AF T+++    GL  EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPF 217

Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
           +G  +A R + K  Q    I  ++ +  +E  +A+YL   GP+ V IN   LQ Y  GV 
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVI 277

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
                 CD   + + HSVL+VG+G  +++                    PYWI+KNSWG 
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 335

Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
            WGEKGYFRL+RG  +CGI  +  +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363


>gi|1581746|prf||2117247B Cys protease:ISOTYPE=2
          Length = 467

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 117/315 (37%), Positives = 166/315 (52%), Gaps = 22/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL-LQDTEHGSGVYGLNEFSDLSTAEFQ 100
           F  F ++H K Y +  E   RL +F  NL   +L      H S  +G+  FSDL+  EF+
Sbjct: 38  FAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHAS--FGVTPFSDLTREEFR 95

Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT----LPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           ++Y       + A + V   +         P A DWR   AVT +KDQ  CGS WAFST 
Sbjct: 96  SRYHNAAAHFAAAQKRVRVPVEVEVEVGGAPAAVDWRARGAVTAIKDQGGCGSCWAFSTI 155

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIEG +      L  LSEQ L+ CD  D+GC+GG + +AFD I+ +  G +  E +Y Y
Sbjct: 156 GNIEGQWHLAGNPLTGLSEQMLVSCDNADNGCDGGLMDSAFDWIVGQNNGSVYTEASYSY 215

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               GD + C ++       I+G+V + +DE  MA +L  NGP+A+A++A +   Y  GV
Sbjct: 216 VSGGGDSQTCNMSSHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDATSFMSYTGGV 275

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                   +  ++ L H V++VGY            PYWIIKNSWG  WGE+GY R+ +G
Sbjct: 276 ------LTNCVSDQLDHGVVLVGYNDSSNP------PYWIIKNSWGADWGEEGYIRIQKG 323

Query: 334 DGSCGINDYVRSALV 348
              C + +Y  SA+V
Sbjct: 324 TNQCLVKNYACSAVV 338


>gi|118395092|ref|XP_001029901.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila]
 gi|89284178|gb|EAR82238.1| Papain family cysteine protease containing protein [Tetrahymena
           thermophila SB210]
          Length = 344

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 117/325 (36%), Positives = 169/325 (52%), Gaps = 30/325 (9%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A F  F  + NK Y    E++S  H +  +   I +    E+ +  +G  +FSD+S  EF
Sbjct: 31  AEFEEFKSKFNKYYHNEHEHHSSFHNYKTSREHI-VKHQMENPNAKFGHTKFSDMSPEEF 89

Query: 100 QAKYLGF---------------KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           + K L F               K +P          + N  LP +FDWR+   +T  K Q
Sbjct: 90  ENKMLNFDFSLFKKAKSQGIKLKAEPMKGYLRQGENVDNSDLPESFDWRDKGIITPAKFQ 149

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKL 204
             CGS W F+TTG IE  YA K  +L+  SEQ L+DCD  + GC GG +++A+  +    
Sbjct: 150 NTCGSCWTFATTGVIESQYALKYGELLHFSEQMLLDCDNINQGCRGGLMTDAYQFLQQ-- 207

Query: 205 GGGLEEEKTY-PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
            GG++   TY  Y+     C  +K   + K+  +  +  +E  + + LV+NGP+AV INA
Sbjct: 208 SGGIQTADTYGDYKNKKDICNFDKAKVKAKVVDWYQIPENEETIRRELVKNGPVAVGINA 267

Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
             LQFY  G+  P    CD   + ++H+VLIVGYGV+      + +PYW+IKN WG  WG
Sbjct: 268 RTLQFYEGGIVDPKN--CD---DKINHAVLIVGYGVE------EGIPYWLIKNQWGAEWG 316

Query: 324 EKGYFRLYRGDGSCGINDYVRSALV 348
            KG+F+L RG   CGI+ Y   A V
Sbjct: 317 IKGFFKLIRGKKQCGIHTYASIAYV 341


>gi|1136312|gb|AAB41118.1| cruzipain [Trypanosoma cruzi]
          Length = 383

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 118/343 (34%), Positives = 174/343 (50%), Gaps = 20/343 (5%)

Query: 9   GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
            ++L ++ V ++  +      LH    +   + F  F ++H + Y +  E   RL +F  
Sbjct: 7   ALSLAAVLVVMACLVPAATASLHAEETL--ASQFAEFKQKHGRVYGSAAEEAFRLSVFRA 64

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
           NL  +  L    +    +G+  FSDL+  EF+++Y  G     +  +R+ VP  +  +  
Sbjct: 65  NLF-LARLHAAANPHATFGVTAFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P A DWR   AVT VKDQ  CGS WAFS  GN+E  +      L +LSEQ L+ CD+ D 
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
           GC GG ++NAF+ I+ +  G +  E +YPY    G    C  +       I G+V + +D
Sbjct: 184 GCGGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243

Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           E  +A +L  NGP+AVA++A +   Y  GV           +E L H VL+VGY      
Sbjct: 244 EAQIAAWLAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSA 346
               AVPYW+IKNSW   WGE GY R+ +G   C + +   SA
Sbjct: 293 -DSAAVPYWVIKNSWTTQWGEDGYIRIAKGSNQCLVKEEASSA 334


>gi|375073984|gb|AFA34859.1| cathepsin L-like protein [Trypanosoma rangeli]
          Length = 467

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 120/315 (38%), Positives = 166/315 (52%), Gaps = 22/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD-TEHGSGVYGLNEFSDLSTAEFQ 100
           F  F ++H K Y +  E   RL +F  NL   +L      H S  +G+  FSDL+  EF+
Sbjct: 38  FAAFKQRHGKVYRSAAEEAFRLGVFKENLLLARLHAAANPHAS--FGVTPFSDLTREEFR 95

Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT----LPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           ++Y       + A +     +         P A DWR   AVT VKDQ  CGS WAFST 
Sbjct: 96  SRYHNAAAHFAAAQKRARVPVEVEVEVGGAPAAVDWRARGAVTAVKDQGECGSCWAFSTI 155

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIEG +      L SLSEQ L+ CD  D+GC+GG + NAFD I+ K  G +  E +Y Y
Sbjct: 156 GNIEGQWHLAGNPLTSLSEQMLVSCDNADNGCDGGLMDNAFDWIVGKNNGTVYTEASYSY 215

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               G+ + C ++       I+G+V + +DE  MA +L  NGP+A+A++A +   Y  GV
Sbjct: 216 VSGGGNSQKCDMSGHVVGAVISGHVDLPKDEDKMAAWLAANGPLAIAVDATSFMSYTGGV 275

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                   +  ++ L H V++VGY            PYWIIKNSWG  WGE GY R+ +G
Sbjct: 276 ------LTNCISDQLDHGVVLVGYNDSSNP------PYWIIKNSWGADWGEGGYIRIQKG 323

Query: 334 DGSCGINDYVRSALV 348
              C +N+Y  SA+V
Sbjct: 324 TNQCLVNNYACSAVV 338


>gi|375073978|gb|AFA34856.1| cathepsin L-like protein [Trypanosoma cruzi]
          Length = 467

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 175/345 (50%), Gaps = 20/345 (5%)

Query: 9   GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
            ++L ++ V ++  +      LH    +   + F  F ++H + Y +  E   RL +F  
Sbjct: 7   ALSLAAVLVVMACLVPAATASLHAEETL--ASQFAEFKQKHGRVYESAAEEAFRLSVFRE 64

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
           NL  +  L    +    +G+  FSDL+  EF+++Y  G     +  +R+ VP  +  +  
Sbjct: 65  NL-FLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVNVEVVGA 123

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P A DWR   AVT VKDQ  CGS WAFS  GN+E  +      L +LSEQ L+ CD+ D 
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDS 183

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRD 243
           GC GG ++NAF+ I+ +  G +  E +YPY    G    C  +       I G+V + +D
Sbjct: 184 GCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQD 243

Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           E  +A +L  NGP+AV ++A +   Y  GV           +E L H VL+VGY      
Sbjct: 244 EAQIAAWLAVNGPVAVGVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN----- 292

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               AVPYWIIKNSW   WGE GY R+ +G   C + +   SA+V
Sbjct: 293 -DSAAVPYWIIKNSWTTQWGEGGYIRVAKGSNQCLVKEEASSAVV 336


>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
 gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
          Length = 347

 Score =  203 bits (516), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 138/347 (39%), Positives = 191/347 (55%), Gaps = 34/347 (9%)

Query: 8   AGVALLSL-TVSVSSFMVVGDEKLHHLH-HVKHTAL---FNYFLEQHNKTYATLVEYYSR 62
           AG+ L++L T+ + S   +   ++H L      TA+   ++ +LEQ+ + Y T  EY  R
Sbjct: 10  AGLMLITLCTLWIPS---IARSEIHSLPIDSAPTAMKVRYDKWLEQYGRKYDTKDEYLLR 66

Query: 63  LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP 122
             I+  N++ I+ + ++++ S     N+F+DL+  EF + YLG++++ SY  R++  M  
Sbjct: 67  FGIYHSNIQFIEYI-NSQNLSFKLTDNKFADLTNDEFNSIYLGYQIR-SYKRRNLSHMHE 124

Query: 123 NIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
           N T LP A DWRE  AVT +KDQ  CGS WAFS    +EG+   KT  LVSLSEQEL+DC
Sbjct: 125 NSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGNLVSLSEQELVDC 184

Query: 182 DQEDD--GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYV 238
           D   D  GC GG +  AF  I S   GGL  E  YPY+G D +C   K     V I GY 
Sbjct: 185 DVNGDNKGCNGGFMEKAFTFIKSI--GGLTTENDYPYKGTDGSCEKAKTDNHAVIIGGYE 242

Query: 239 SVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
           +V  +  +  K  V   P++VAI+A  Y  Q Y  GV      +C      L+H V IVG
Sbjct: 243 TVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGV---FSGYC---GIQLNHGVTIVG 296

Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD----GSCGI 339
           YG       +    YW++KNSWG+GWGE GY R+ R      G CGI
Sbjct: 297 YG------DNNGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMCGI 337


>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
          Length = 350

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 124/336 (36%), Positives = 175/336 (52%), Gaps = 28/336 (8%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           +L L++  S  M        +LH    +     +++++ K Y    E   RL IF  N+ 
Sbjct: 14  VLLLSICTSQVMS------RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVE 67

Query: 72  KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAF 130
            I+      +      +N  +D +  EF A + G+K K S++    P    N+T +P A 
Sbjct: 68  FIESFNAAGNKPYKLSINHLADQTNEEFVASHNGYKYKGSHSQ--TPFKYGNVTDIPTAV 125

Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEG 190
           DWR+  AVT VKDQ  CGS WAFST    EG+Y   T  L+SLSEQEL+DCD  D GC+G
Sbjct: 126 DWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSVDHGCDG 185

Query: 191 GSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT-QVKINGYVSVSRDETDMAK 249
           G + + F+ I+    GG+  E  YPY   D  C  +K+A+   +I GY +V  +  +  +
Sbjct: 186 GLMEDGFEFIIKN--GGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEALQ 243

Query: 250 YLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHK 307
             V N P++V+I+A     QFY +GV      F       L H V +VGYG      TH+
Sbjct: 244 QAVANQPVSVSIDAGGSGFQFYSSGV------FTGQCGTQLDHGVTVVGYGT-TDDGTHE 296

Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
              YWI+KNSWG  WGE+GY R+ RG    +G CGI
Sbjct: 297 ---YWIVKNSWGTQWGEEGYIRMQRGIDAQEGLCGI 329


>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 120/343 (34%), Positives = 183/343 (53%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  + N++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPE 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  CGS WAFS  GNIEG +     +L SLSEQ L+ CD  D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTTDYGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
            GG +  +   I+S   G +   ++YPY    G    C  + K    KI+G++++ +DE 
Sbjct: 189 RGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A++A +   Y  GV           ++ L H VL+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFLGYKGGV------LTSCISKGLDHDVLLVGYN-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSW +GWGE+GY R+ +G   C + +Y RSA+V
Sbjct: 300 ---PPYWIIKNSWSKGWGEEGYIRIEKGTNQCLMKNYARSAVV 339


>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
 gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
          Length = 335

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 117/315 (37%), Positives = 172/315 (54%), Gaps = 23/315 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD--TEHGSGVYGLNEFSDLSTAEF 99
           F  F+E +NK Y +  E   R  IF  NL +I       T+  +  YG+N+FSDLS +E 
Sbjct: 35  FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYGINKFSDLSKSEL 94

Query: 100 QAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
            AK+ G  + P  A      ++   P    P  FDWRE + VT +K+Q  CG+ WAF+T 
Sbjct: 95  IAKFTGLSI-PQRASNFCKTIVLNQPPDKGPLHFDWREQNKVTSIKNQGACGACWAFATL 153

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
            ++E  +A +  +LV LSEQ+LIDCD  D GC GG +  AF+ I+    GG++ E  YP+
Sbjct: 154 ASVESQFAMRHNRLVDLSEQQLIDCDSVDMGCNGGLLHTAFEEIIRM--GGVQAELDYPF 211

Query: 217 RGDDKACRLNKKATQVK--INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
            G D+ C +++    V   +  Y  V  +E  +   L   GP+ +AI+A  +  Y  GV 
Sbjct: 212 VGRDRRCGVDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVI 271

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
              +      N  L+H+VL+VGYGV+        VPYW  KN+WG+ WGE GYFR+ +  
Sbjct: 272 SSCE------NNGLNHAVLLVGYGVE------NGVPYWAFKNTWGDDWGENGYFRVRQNI 319

Query: 335 GSCG-INDYVRSALV 348
            +CG +ND   +A++
Sbjct: 320 NACGMVNDLASTAVL 334


>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
 gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
          Length = 324

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 121/318 (38%), Positives = 179/318 (56%), Gaps = 21/318 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K  + F  FL + NK Y++  E   R  IF  NL +I ++++    +  Y +N+FSDLS
Sbjct: 22  LKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEI-IIKNQNDTTAQYEINKFSDLS 80

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  +KY G  L P         ++   P    P  FDWR  + VT VK+Q +CG+ WA
Sbjct: 81  KDETISKYTGLAL-PLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGACWA 139

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  +A K  +L++LSEQ+LIDCD  D GC GG +  A++ +M    GG++ E 
Sbjct: 140 FATLASLESQFAIKHNQLINLSEQQLIDCDYVDAGCNGGLLHTAYEAVMQM--GGVQAEN 197

Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D  CR++     VK+   Y  ++  E  +   L   GP+ VAI+A  +  Y  
Sbjct: 198 DYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPIPVAIDASDIVNYRR 257

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+      +C   N  L+H+VL+VGYGV+        VPYWI+KN+WGE WGE+GYFR+ 
Sbjct: 258 GIMR----YC--SNYGLNHAVLLVGYGVENN------VPYWILKNTWGEDWGEQGYFRVQ 305

Query: 332 RGDGSCGI-NDYVRSALV 348
           +   +CGI N+ + SA +
Sbjct: 306 QNINACGIRNELLASAEI 323


>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 469

 Score =  202 bits (515), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 129/333 (38%), Positives = 175/333 (52%), Gaps = 30/333 (9%)

Query: 21  SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
           S +  GD +L      +  A++  +L +H K+Y  L E   R  IF  NLR I+      
Sbjct: 34  SIISYGD-RLEKRTDAEVMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVN 92

Query: 81  HGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP-----AMIPNITLPRAFDWREY 135
               V GLN F+DL+  E++++YLG + +     R+       +      LP + DWRE 
Sbjct: 93  RTYKV-GLNRFADLTNEEYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREK 151

Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSIS 194
            AV  VKDQ  CGS WAFST   +EG+    T  L+SLSEQEL+DCD+  + GC GG + 
Sbjct: 152 GAVVPVKDQGNCGSCWAFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMD 211

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVE 253
            AF+ I++   GG++ E+ YPYR  D  C  N+K A  V I+GY  V +++    K  V 
Sbjct: 212 YAFEFIINN--GGIDSEEDYPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVA 269

Query: 254 NGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPY 311
           N P++VAI A   A Q Y +GV      F       L H V+ VGYG      T  +V Y
Sbjct: 270 NQPVSVAIEAGGRAFQLYQSGV------FTGQCGTQLDHGVVAVGYG------TENSVDY 317

Query: 312 WIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
           WI++NSWG  WGE GY +L R       G CGI
Sbjct: 318 WIVRNSWGPNWGESGYIKLERNLAGTETGKCGI 350


>gi|45822209|emb|CAE47501.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
          Length = 325

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 121/303 (39%), Positives = 169/303 (55%), Gaps = 20/303 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG----VYGLNEFSDLSTAEFQ 100
           F  ++NK+Y + VE  +R  IF  NLRKI+   + ++ +G     +G+ +F+DL+  EF 
Sbjct: 26  FKVKNNKSYKSYVEEQTRFRIFQENLRKIEN-HNEKYNNGESTFKFGVTKFTDLTEKEFL 84

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
              +  K        +   + P   LP AFDWR+  AVT VKDQ MCGS W FSTTG++E
Sbjct: 85  DLLVLSKNARPNRTHATHLLAPLRDLPSAFDWRDKGAVTEVKDQGMCGSCWTFSTTGSVE 144

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
             +  KT  LVSLSEQ L+DC ++   GC GG +  A + I     GG+  EK YPY G 
Sbjct: 145 AAHFLKTGNLVSLSEQNLVDCAKDTCYGCGGGWMDKALEYIEK---GGIMSEKDYPYEGV 201

Query: 220 DKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPI 277
           D  CR +      KI+ +  + + DE D+   +   GP++VAI+A A  Q YV+G+    
Sbjct: 202 DDNCRFDISKVAAKISNFTYIKKNDEEDLKNAVAAKGPISVAIDASATFQLYVSGILDDT 261

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
           +  C    ++L+H VL+VGYG +  K       YWIIKNSWG  WG  GY R+ R  +  
Sbjct: 262 E--CSNEFDSLNHGVLVVGYGTENGK------DYWIIKNSWGVNWGMDGYIRMSRNKNNQ 313

Query: 337 CGI 339
           CGI
Sbjct: 314 CGI 316


>gi|394331814|gb|AFN27126.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 116/318 (36%), Positives = 173/318 (54%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWR+  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPYAVDWRKKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             G+IE  +A    +L +LSEQ+L+ CD +D GC GG +  AF+ ++  + G +  E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEQQLVSCDDKDSGCGGGLMLQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY            ++Q+    +I+GY+++   ET MA +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYE 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV            + L+H VL+VGY +         VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGDTLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGENGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|332249835|ref|XP_003274061.1| PREDICTED: cathepsin W [Nomascus leucogenys]
          Length = 403

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 119/328 (36%), Positives = 174/328 (53%), Gaps = 27/328 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  Q N++Y +  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF  
Sbjct: 69  FKLFQIQFNRSYLSPEEHARRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 128

Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
            Y G++      PS   R + +  P  ++P   DWR+   A++ +KDQ  C   WA +  
Sbjct: 129 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFTCDWRKVAGAISPIKDQKNCNCCWAMAAA 186

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIE ++       V +S QEL+DC +  DGC GG + +AF T+++    GL  EK YP+
Sbjct: 187 GNIEALWRINFWDFVDVSVQELLDCSRCGDGCHGGFVWDAFITVLN--NSGLASEKDYPF 244

Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
           +G  +A R + K  Q    I  ++ +   E  +A+YL   GP+ V IN   LQ Y  GV 
Sbjct: 245 QGKVRAHRCHPKKYQKVAWIQDFIMLQNSEHRIAQYLATYGPITVTINMKPLQLYRKGVI 304

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
                 CD   + + HSVL+VG+G  +++                    PYWI+KNSWG 
Sbjct: 305 KATSTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 362

Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
            WGEKGYFRL+RG  +CGI  +  +A V
Sbjct: 363 QWGEKGYFRLHRGSNTCGITKFPLTARV 390


>gi|9627870|ref|NP_054157.1| viral cathepsin-like protein [Autographa californica
           nucleopolyhedrovirus]
 gi|114680178|ref|YP_758591.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
 gi|115751|sp|P25783.1|CATV_NPVAC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|332491|gb|AAA46752.1| viral cathepsin [Autographa californica nucleopolyhedrovirus]
 gi|559196|gb|AAA66757.1| viral cathepsin-like protein [Autographa californica
           nucleopolyhedrovirus]
 gi|113015253|gb|ABE68510.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
          Length = 323

 Score =  202 bits (515), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 116/317 (36%), Positives = 174/317 (54%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y + VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P         ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  ++  E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  + G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>gi|23110964|ref|NP_001326.2| cathepsin W preproprotein [Homo sapiens]
 gi|29476894|gb|AAH48255.1| Cathepsin W [Homo sapiens]
 gi|119594870|gb|EAW74464.1| cathepsin W (lymphopain), isoform CRA_b [Homo sapiens]
          Length = 376

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 119/328 (36%), Positives = 176/328 (53%), Gaps = 27/328 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  Q N++Y +  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF  
Sbjct: 42  FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
            Y G++      PS   R + +  P  ++P + DWR+   A++ +KDQ  C   WA +  
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAA 159

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIE ++       V +S QEL+DC +  DGC GG + +AF T+++    GL  EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPF 217

Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
           +G  +A R + K  Q    I  ++ +  +E  +A+YL   GP+ V IN   LQ Y  GV 
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVI 277

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
                 CD   + + HSVL+VG+G  +++                    PYWI+KNSWG 
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 335

Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
            WGEKGYFRL+RG  +CGI  +  +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363


>gi|44844206|emb|CAF32699.1| cathepsin L-like cysteine proteinase [Leishmania infantum]
          Length = 381

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 127/320 (39%), Positives = 174/320 (54%), Gaps = 37/320 (11%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYRRAYGTLAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK    CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKXXGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIE  +A     LVSLSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  EK+
Sbjct: 154 SAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWLLRHMYGIVFTEKS 213

Query: 214 YPY---RGDDKACRLN--KKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
           YPY    GD   C LN  K     +I+GYV +  +ET MA +L ENGP+A+A++A +   
Sbjct: 214 YPYTSGNGDVAEC-LNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGPIAIAVDASSFMS 272

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G                   VL+VGY  ++T      VPYW+IKNSWGE WGEKGY 
Sbjct: 273 YQSG-------------------VLLVGY--NKT----GGVPYWVIKNSWGEDWGEKGYV 307

Query: 329 RLYRGDGSCGINDYVRSALV 348
           R+  G  +C +++Y  SA V
Sbjct: 308 RVAMGLNACLLSEYPVSAHV 327


>gi|82659048|gb|ABB88697.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 118/318 (37%), Positives = 175/318 (55%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWR+  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             G+IE  +A     L +LSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  E +Y
Sbjct: 155 AVGSIESQWALAGHGLTALSEQQLVSCDDKDNGCGGGLMLQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY            ++Q+    +I+GY+++   ET MA +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQ 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV            + L+H VL+VGY  +RT      VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGY--NRT----GEVPYWVIKNSWGEDWGENGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
 gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
          Length = 356

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 114/314 (36%), Positives = 173/314 (55%), Gaps = 21/314 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQD--TEHGSGVYGLNEFSDLSTAEF 99
           F  F+E +NK Y +  E   R  IF  NL +I       T+  +  Y +N+FSDLS +E 
Sbjct: 56  FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSEL 115

Query: 100 QAKYLGFKLKPSYAD--RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
            AK+ G  +    ++  +++    P    P  FDWRE + VT +K+Q  CG+ WAF+T  
Sbjct: 116 IAKFTGLSIPERVSNFCKTIILNQPPDKGPLHFDWREQNKVTSIKNQGACGACWAFATLA 175

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++E  +A +  +L+ LSEQ+LIDCD  D GC GG +  AF+ IM    GG++ E  YP+ 
Sbjct: 176 SVESQFAMRHNRLIDLSEQQLIDCDSVDMGCNGGLLHTAFEEIMRM--GGVQTELDYPFV 233

Query: 218 GDDKACRLNKKATQVK--INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           G ++ C L++    V   +  Y  V  +E  +   L   GP+ +AI+A  +  Y  GV  
Sbjct: 234 GRNRRCGLDRHRPYVVSLVGCYRYVMVNEEKLKDLLRAVGPIPMAIDAADIVNYYRGVIS 293

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
             +      N  L+H+VL+VGYGV+        VPYW+ KN+WG+ WGE GYFR+ +   
Sbjct: 294 SCE------NNGLNHAVLLVGYGVE------NGVPYWVFKNTWGDDWGENGYFRVRQNVN 341

Query: 336 SCG-INDYVRSALV 348
           +CG +ND   +A++
Sbjct: 342 ACGMVNDLASTAVL 355


>gi|397516975|ref|XP_003828695.1| PREDICTED: cathepsin W [Pan paniscus]
          Length = 376

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 118/328 (35%), Positives = 177/328 (53%), Gaps = 27/328 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  Q N++Y +  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF  
Sbjct: 42  FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
            Y G++      PS   R + +  P  ++P + DWR+   A++ +KDQ  C   WA +  
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAA 159

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIE ++       V +S QEL+DC +  DGC+GG + +AF T+++    GL  EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCSRCGDGCQGGFVWDAFITVLNN--SGLASEKDYPF 217

Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
           +G  +A R + K  Q    I  ++ +  +E  +A+YL   GP+ V IN   L+ Y  GV 
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLRLYRKGVI 277

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
                 CD   + + HSVL+VG+G  +++                    PYWI+KNSWG 
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 335

Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
            WGEKGYFRL+RG  +CGI  +  +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363


>gi|114638622|ref|XP_001170363.1| PREDICTED: cathepsin W [Pan troglodytes]
          Length = 376

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 118/328 (35%), Positives = 177/328 (53%), Gaps = 27/328 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  Q N++Y +  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF  
Sbjct: 42  FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
            Y G++      PS   R + +  P  ++P + DWR+   A++ +KDQ  C   WA +  
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAA 159

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIE ++       V +S QEL+DC +  DGC+GG + +AF T+++    GL  EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCSRCGDGCQGGFVWDAFITVLN--NSGLASEKDYPF 217

Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
           +G  +A R + K  Q    I  ++ +  +E  +A+YL   GP+ V IN   L+ Y  GV 
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLRLYRKGVI 277

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
                 CD   + + HSVL+VG+G  +++                    PYWI+KNSWG 
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAERVSSQSQPQPPHPTPYWILKNSWGA 335

Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
            WGEKGYFRL+RG  +CGI  +  +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363


>gi|340380717|ref|XP_003388868.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
          Length = 337

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 118/303 (38%), Positives = 171/303 (56%), Gaps = 14/303 (4%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQ 100
           FN +++++ KTY+T+ EY  RL +++ N   I+ L + EHG    Y LN+FSDL+ AEF+
Sbjct: 35  FNMWMKKYEKTYSTMEEYNERLRVYTSNYYYIEQL-NKEHGPHTEYELNQFSDLTFAEFK 93

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
             YL      S  + +    + N   P A DWRE + +T VKDQ  CGS W FSTTG +E
Sbjct: 94  KIYLTEPQHCSATNGNFQKPV-NARDPVAVDWREKNVITPVKDQGKCGSCWTFSTTGCLE 152

Query: 161 GVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
             +A KT +L+SLSEQ+L+DC     + GC GG  S AF+ I  K  GG+E E  Y Y  
Sbjct: 153 AHHAIKTGQLISLSEQQLVDCAGAFNNHGCNGGLPSQAFEYI--KYNGGIESESNYNYTA 210

Query: 219 DDKACRLNKKATQVKINGYVSVSRD-ETDMAKYLVENGPMAVAINAY-ALQFYVTGVSHP 276
            D  CR N       ++  V++++D E D+   +   GP+++A     + Q Y  GV   
Sbjct: 211 KDGVCRFNSSLVAATVSDVVNITKDAEGDIGTAVANVGPVSIAFEVTKSFQHYKKGVYQG 270

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
               C    + ++H+VL+VGY  ++TK   +   YWI+KNSW   WG  GYF + RG  +
Sbjct: 271 EIEVCSQSPDKVNHAVLVVGY--NQTKLGEE---YWIVKNSWSASWGMDGYFWIRRGHNA 325

Query: 337 CGI 339
           CG+
Sbjct: 326 CGL 328


>gi|401416326|ref|XP_003872658.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|14348750|emb|CAC41275.1| CPB2 protein [Leishmania mexicana]
 gi|322488882|emb|CBZ24132.1| putative cathepsin L-like protease [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 359

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 123/319 (38%), Positives = 177/319 (55%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGECGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S+ GNIEG +     +LVSLSEQ+L+ CD  +DGC+GG +  AFD ++    G L  E +
Sbjct: 154 SSVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYTEDS 213

Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY   +    +    ++     +I+ +V +   E  MA +L +NGP+A+A++A +   Y
Sbjct: 214 YPYVSGNGYLPECSNSSELVVGAQIDSHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  + ++H+VL+VGY  D T      VPYW+IKNSWG  WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KEVNHAVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340


>gi|23577865|ref|NP_703114.1| viral cathepsin [Rachiplusia ou MNPV]
 gi|37077115|sp|Q8B9D5.1|CATV_NPVR1 RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|23476510|gb|AAN28057.1| viral cathepsin [Rachiplusia ou MNPV]
          Length = 323

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 116/317 (36%), Positives = 175/317 (55%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y + VE   R  IF  NL +I +    ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEIII--KNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P         ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  ++  E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNTNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  + G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE+G+FR+ 
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEEGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>gi|442736236|gb|AGC65593.1| cathepsin [Achaea janata granulovirus]
          Length = 338

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 123/329 (37%), Positives = 177/329 (53%), Gaps = 44/329 (13%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF+ F+ +++K Y + +E  ++  +F  NL  +    D +  +  + +N ++D S  E  
Sbjct: 33  LFDLFMIKYHKVYRSELERAAKFEVFKRNLATLNDKNDKDE-NATFDINAYTDRSRNELL 91

Query: 101 AKYLGFKLKPSYADRSVP-------------AMIPNITLPRAFDWREYDAVTGVKDQTMC 147
               GF+   ++A  + P             A  P   LP +FDWR+ + VT VKDQ  C
Sbjct: 92  RTQTGFQ--SNFARNASPFTQKKGMCITRVVAGTPPCLLPESFDWRDKNVVTPVKDQLEC 149

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
           GS WAF+   N E  YA K  K V  SEQ L+DCDQ + GC+GG +  AF+ I+    GG
Sbjct: 150 GSCWAFTAIANFESQYAIKHGKHVDFSEQHLLDCDQLNYGCDGGLMHWAFEEIIRM--GG 207

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-------RDETDMAKYLVENGPMAVA 260
           +  E  YPY G +  C  N       +N Y ++S       RDE  + + LV NGP+AVA
Sbjct: 208 VVLEYDYPYTGVESFCANN-------VNMYTTISGCVQYDLRDEEKLRELLVTNGPIAVA 260

Query: 261 INAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
           ++   +  Y +GV      FC G N  L+H+VL+VGYGVD+T      + YW++KNSWG 
Sbjct: 261 LDIVDIVDYKSGVVS----FC-GTNNGLNHAVLLVGYGVDKT------IEYWLLKNSWGT 309

Query: 321 GWGEKGYFRLYRGDGSCGI-NDYVRSALV 348
            WGE+GYFR+ R   SCGI N Y  S ++
Sbjct: 310 DWGEEGYFRIKRNRNSCGILNSYAASVIL 338


>gi|397133545|gb|AFO10079.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus S2]
          Length = 323

 Score =  202 bits (514), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 116/317 (36%), Positives = 174/317 (54%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y + VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI--INKDQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P         ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLASLESQFAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  ++  E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYITVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  + G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>gi|71084302|gb|AAZ23596.1| cysteine protease [Leishmania aethiopica]
          Length = 443

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 116/318 (36%), Positives = 171/318 (53%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             GNIE  +A    +L +LSEQ+L+ CD  D GC GG ++ AF+ ++  + G +  E +Y
Sbjct: 155 VVGNIESQWAVAGHRLTALSEQQLVSCDDMDSGCGGGLMTQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PYRGD----DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY        +    ++     +I+GYV +  +ET MA +L ++GP+++ ++A +   Y 
Sbjct: 215 PYVSTFGYVPECTNSSQLVPGARIDGYVMIESNETVMAAWLAKSGPISIGVDASSFMSYH 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            GV            + L+H VL+VGY +         VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 GGV------LTSCAGKQLNHGVLLVGYNMT------GEVPYWVIKNSWGENWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|2780176|emb|CAA71085.1| cystein proteinase [Leishmania mexicana]
          Length = 443

 Score =  202 bits (513), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 124/319 (38%), Positives = 175/319 (54%), Gaps = 22/319 (6%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VK+Q  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKNQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +     +LVSLSEQ+L+ CD  D+GC GG +  AFD ++    G L  E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMDNGCSGGLMLQAFDWLLQNTNGHLYTEDS 213

Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY   +    +    ++     +I+G+V +   E  MA +L +NGP+A+A++A +   Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  + L+H VL+VGY  D T      VPYW+IKNSWG  WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321

Query: 330 LYRGDGSCGINDYVRSALV 348
           +  G  +C +++Y  SA V
Sbjct: 322 VVMGVNACLLSEYPVSAHV 340


>gi|340503366|gb|EGR29962.1| hypothetical protein IMG5_145110 [Ichthyophthirius multifiliis]
          Length = 1095

 Score =  202 bits (513), Expect = 3e-49,   Method: Composition-based stats.
 Identities = 111/277 (40%), Positives = 157/277 (56%), Gaps = 24/277 (8%)

Query: 83   SGVYGLNEFSDLSTAEFQAKYLGF------KLKPSYADRSVPAMIPNITL----PRAFDW 132
            S V+G  +FSDLS  +F  K+L        ++K      + P +  +IT+    P  FDW
Sbjct: 831  SAVFGHTKFSDLSPQQFAQKHLKLNQKKLLQVKKETKKLTTP-IQQDITVEENVPEQFDW 889

Query: 133  REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGS 192
            R+ + VT  K Q  CGS W FSTTG IE  YA K +KLV  SEQ+L+DCD  +DGC GG 
Sbjct: 890  RDRNVVTEPKYQNTCGSCWTFSTTGVIESQYAIKHQKLVPFSEQQLVDCDDINDGCHGGL 949

Query: 193  ISNAFDTIMSKLGGGLEEEKTY-PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYL 251
            +++A+  +     GGLE  + Y  Y+   + C+ +    Q KI  +  +  DE  + K L
Sbjct: 950  MTDAYKYLQQS--GGLEFAEDYGDYKNKKEKCKFDLNKVQAKIKEWQQIDEDEEIIKKQL 1007

Query: 252  VENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPY 311
             +NGP+A  +NA  LQFY +G+  P +  CD    +++H++LIVGYGV++         Y
Sbjct: 1008 YQNGPIAAGVNARLLQFYKSGIFDPKE--CD---SDINHAILIVGYGVEK-----DGQKY 1057

Query: 312  WIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
            WIIKN WG+ WG  GYF+L RG   CGI+ Y   A +
Sbjct: 1058 WIIKNQWGKDWGMDGYFKLARGKKQCGIHTYASIAFI 1094


>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
 gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
           MHOM/GT/2001/U1103]
          Length = 332

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 121/308 (39%), Positives = 168/308 (54%), Gaps = 22/308 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
            ALF  F   + + Y TL E   RL  F  NL  ++  Q   +    +G+ +F DLS AE
Sbjct: 35  AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQ-ARNPHAQFGITKFFDLSEAE 93

Query: 99  FQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           F A+YL     F     +A +       +++ +P A DWRE  AVT VKDQ  CGS WAF
Sbjct: 94  FAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSCWAF 153

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S  GNIEG +     +LVSLSEQ+L+ CD  +DGC GG +  AFD ++    G L  E +
Sbjct: 154 SAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCSGGLMLQAFDWLLQNTNGHLHTEDS 213

Query: 214 YPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           YPY   +    +    ++     +I+G+V +   E  MA +L +NGP+A+A++A +   Y
Sbjct: 214 YPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSFMSY 273

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV       C G  + L+H VL+VGY  D T      VPYW+IKNSWG  WGE+GY R
Sbjct: 274 KSGVLTA----CIG--KQLNHGVLLVGY--DMT----GEVPYWVIKNSWGGDWGEQGYVR 321

Query: 330 LYRGDGSC 337
           +  G  +C
Sbjct: 322 VVMGVNAC 329


>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 447

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 120/343 (34%), Positives = 183/343 (53%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  + N++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPP 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  CGS WAFS  GNIEG +     +L SLSEQ L+ CD  D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTTDYGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
            GG +  +   I+S   G +   ++YPY    G    C  + K    KI+G++++ +DE 
Sbjct: 189 RGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A++A +   Y  GV           ++ L H VL+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFLGYKGGV------LTSCISKGLDHDVLLVGYD-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSW +GWGE+GY R+ +G   C + +Y RSA+V
Sbjct: 300 ---PPYWIIKNSWSKGWGEEGYIRIEKGTNQCLMKNYARSAVV 339


>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
          Length = 442

 Score =  202 bits (513), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 120/343 (34%), Positives = 183/343 (53%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 9   VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 66

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  + N++    P 
Sbjct: 67  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPP 123

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  CGS WAFS  GNIEG +     +L SLSEQ L+ CD  D GC
Sbjct: 124 AVDWRKKGAVTPVKDQGACGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTTDYGC 183

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
            GG +  +   I+S   G +   ++YPY    G    C  + K    KI+G++++ +DE 
Sbjct: 184 RGGLMDKSLQWIVSSNKGNVFTAQSYPYASGGGKMPPCNKSGKVVGAKISGHINLPKDEN 243

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A++A +   Y  GV           ++ L H VL+VGY  D +K  
Sbjct: 244 AIAEWLAKNGPVAIAVDATSFLGYKGGV------LTSCISKGLDHDVLLVGYD-DTSK-- 294

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSW +GWGE+GY R+ +G   C + +Y RSA+V
Sbjct: 295 ---PPYWIIKNSWSKGWGEEGYIRIEKGTNQCLMKNYARSAVV 334


>gi|11464866|gb|AAG35358.1|AF314930_1 cruzipain [Trypanosoma cruzi]
          Length = 467

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 116/315 (36%), Positives = 165/315 (52%), Gaps = 18/315 (5%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           T+ F  F ++H + Y +  E   RL +F  NL  +  L    +    +G+  FSDL+  E
Sbjct: 35  TSQFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHAAANPHATFGVTPFSDLTREE 93

Query: 99  FQAKYL-GFKLKPSYADRS-VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F+++Y  G     +  +R+ VP  +  +  P A DWR   AVT VKDQ  CGS WAFS  
Sbjct: 94  FRSRYHNGAAHFAAAQERARVPVKVEVVGAPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GN+E  +      L +LSEQ L+ CD+ D GC GG ++NAF+ I+ +  G +  E +YPY
Sbjct: 154 GNVECQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNNAFEWIVQENNGAVYTEDSYPY 213

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               G    C  +       I G+V + +DE  +A +L  NGP+AVA++A +   Y  GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVGLPQDEAQIAAWLAVNGPVAVAVDASSWMTYTGGV 273

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                      +E L H VL+VGY          AVPYWIIKNS    WGE+GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAAVPYWIIKNSRTTQWGEEGYIRIAKG 321

Query: 334 DGSCGINDYVRSALV 348
              C + +   SA+V
Sbjct: 322 SNQCLVKEEASSAVV 336


>gi|73983670|ref|XP_540846.2| PREDICTED: cathepsin W [Canis lupus familiaris]
          Length = 374

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 124/366 (33%), Positives = 188/366 (51%), Gaps = 35/366 (9%)

Query: 13  LSLTVSVSSFMVVGDEKLHH------------LHHVKHTALFNYFLEQHNKTYATLVEYY 60
           ++LT+ +S  + +    L H               ++   +F  F  Q+N++Y+   EY 
Sbjct: 1   MALTIYLSCLLALSVASLAHGIKRSLKNQDPGPQPLELKQVFALFQIQYNRSYSNPEEYA 60

Query: 61  SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK---PSYADRSV 117
            RL IF+ NL + Q L+D + G+  +G+  FSDL+  EF   Y   ++    PS   R V
Sbjct: 61  RRLDIFAHNLAQAQQLEDEDLGTAEFGVTPFSDLTEEEFGQFYGHQRMAGEAPSVG-RKV 119

Query: 118 PAMIPNITLPRAFDWREYDAV-TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
            +      +P   DWR+   + + +K Q  C   WA +  GNIE ++  +  + V +S Q
Sbjct: 120 ESEEWGEPVPPTCDWRKLPGIISPIKQQGNCRCCWAMAAAGNIEALWGIRYHQPVEVSVQ 179

Query: 177 ELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVK-I 234
           EL+DC +  DGC+GG   +AF T+++    GL   K YP+ G+ K  R L KK  +V  I
Sbjct: 180 ELLDCGRCGDGCKGGFTWDAFITVLNN--SGLASAKDYPFLGNTKPHRCLAKKYKKVAWI 237

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
             ++ +  +E  +A YL   GP+ V IN   LQ Y  GV       CD   + + HSVL+
Sbjct: 238 QDFIMLQGNEQAIAWYLATKGPITVTINMKLLQHYQKGVIQATHTTCD--PQRVDHSVLL 295

Query: 295 VGYGVDRT------------KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDY 342
           VG+G  ++               H  +PYWI+KNSWG  WGE+GYFRL+RG+ +CGI  Y
Sbjct: 296 VGFGKSKSVAGKQAEGGSSRPRPHHPIPYWILKNSWGAEWGEEGYFRLHRGNNTCGITKY 355

Query: 343 VRSALV 348
             +A V
Sbjct: 356 PVTARV 361


>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
          Length = 437

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 122/314 (38%), Positives = 170/314 (54%), Gaps = 31/314 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           ++N +L +H K+Y  L E  +R  IF  NLR I         S   GLN F+DL+  E++
Sbjct: 48  MYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADLTNEEYR 107

Query: 101 AKYLGFKLKPSY-------ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           AKYLG K + S        +DR  P  +    LP + DWRE  AV  VKDQ  CGS WAF
Sbjct: 108 AKYLGTKSRESRPKLSKGPSDRYAP--VEGEELPDSIDWREKGAVAAVKDQGSCGSCWAF 165

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           S  G +EG+    T +L++LSEQEL+DCD+  ++GCEGG +  AF+ I+    GG++ + 
Sbjct: 166 SAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMDYAFNFIIKN--GGIDSDL 223

Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL--QFY 269
            YPY G D  C  NK+ A  V I+ Y  V   +    +    N P++VAI A  +  Q Y
Sbjct: 224 DYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQLY 283

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
           V+G+      F       + H V++VGYG      + + + YWI++NSWG  WGE GY +
Sbjct: 284 VSGI------FTGKCGTAVDHGVVVVGYG------SEEGMDYWIVRNSWGAAWGEAGYLK 331

Query: 330 LYRG----DGSCGI 339
           + R      G CGI
Sbjct: 332 MQRNVGKSSGLCGI 345


>gi|71400414|ref|XP_803044.1| cysteine peptidase [Trypanosoma cruzi strain CL Brener]
 gi|70865609|gb|EAN81598.1| cysteine peptidase, putative [Trypanosoma cruzi]
          Length = 467

 Score =  201 bits (512), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 121/345 (35%), Positives = 177/345 (51%), Gaps = 20/345 (5%)

Query: 9   GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
            ++L ++ V ++  +      LH    +   + F  F ++H + Y +  E   RL +F  
Sbjct: 7   ALSLAAVLVVMACLVPAATASLHAEETLA--SQFAEFKQKHGRVYGSAAEEAFRLSVFRA 64

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITL 126
           NL  +  L    +    +G+  FSDL+  EF+++Y  G     +  +R+ VP  +  +  
Sbjct: 65  NL-FLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVDVEFVGA 123

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P A DWRE  AVT VK+Q MCGS WAF+  GNIE  +      L  LSEQ L+ CD  + 
Sbjct: 124 PAAKDWREEGAVTAVKNQGMCGSCWAFAAIGNIECQWFLAGNPLTRLSEQMLVSCDNTNS 183

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRD 243
           GC GG    AF  I+ +  G +  E++YPY    G    C  +       I GYV++ RD
Sbjct: 184 GCGGGWPLVAFKWIVDRNNGTVYTEESYPYHSCIGISPPCTTSGHTVGATITGYVTIPRD 243

Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           E  +A +L  NGP+AV ++A +  FY  GV           ++ LSH+VL+VGY    T 
Sbjct: 244 ENGIAAWLAVNGPVAVVVDASSWIFYTGGV------MTSCVSKQLSHAVLLVGYNDSAT- 296

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
                VP+WIIKNSW   WGE GY R+ +G   C + + V SA+V
Sbjct: 297 -----VPHWIIKNSWTTHWGEDGYIRIAKGSNQCLVKEGVSSAVV 336


>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
          Length = 461

 Score =  201 bits (512), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 133/319 (41%), Positives = 168/319 (52%), Gaps = 40/319 (12%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLS 95
           AL+  +L +H KTY  L E   R  IF  NLR I      EH SG +    GLN+F+DL+
Sbjct: 50  ALYESWLVKHGKTYNALGEKDRRFQIFKDNLRFID-----EHNSGDHTYKLGLNKFADLT 104

Query: 96  TAEFQAKYLGFK-------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
             E++  Y G K       L    +DR   A     +LP   DWRE  AVT VKDQ  CG
Sbjct: 105 NEEYRMTYTGIKTIDDKKKLSKMKSDRY--AYRSGDSLPEYVDWREQGAVTDVKDQGSCG 162

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGG 207
           S WAFSTTG++EGV    T  L+S+SEQEL++CD   + GC GG +  AF+ I+    GG
Sbjct: 163 SCWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKN--GG 220

Query: 208 LEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--Y 264
           ++ E+ YPY G D  C  NKK A  V I+ Y  V  ++    K  V N P+AVAI A   
Sbjct: 221 IDTEEDYPYTGKDGKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGR 280

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
             QFY +G+      F       L H VL  GYG +  K       YW++KNSWG  WGE
Sbjct: 281 DFQFYTSGI------FTGSCGTALDHGVLAAGYGTEDGK------DYWLVKNSWGAEWGE 328

Query: 325 KGYFRLYRG----DGSCGI 339
            GY ++ R      G CGI
Sbjct: 329 GGYLKMERNIADKSGKCGI 347


>gi|13124026|sp|Q9WGE0.1|CATV_NPVHC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|4884631|gb|AAD31760.1|AF120926_1 cysteine proteinase [Hyphantria cunea nucleopolyhedrovirus]
          Length = 324

 Score =  201 bits (511), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 120/318 (37%), Positives = 178/318 (55%), Gaps = 21/318 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K  + F  FL + NK Y++  E   R  IF  NL +I ++++    +  Y +N+FSDLS
Sbjct: 22  LKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEI-IIKNQNDTTAQYEINKFSDLS 80

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  +KY G  L P         ++   P    P  FDWR  + VT VK+Q +CG+ WA
Sbjct: 81  KDETISKYTGLAL-PLQTQNFCEVVVLNRPPDKGPLEFDWRRLNKVTSVKNQGICGACWA 139

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  +A K  +L++LSEQ+LIDCD  D GC GG +  A++ +M    GG++ E 
Sbjct: 140 FATLASLESQFAIKHNQLINLSEQQLIDCDYVDAGCNGGLLHTAYEAVMQM--GGVQAEN 197

Query: 213 TYPYRGDDKACRLNKKATQVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY G D  CR++     VK+   Y  ++  E  +   L   GP+ VAI+A  +  Y  
Sbjct: 198 DYPYEGSDGNCRVDVAKFVVKVKKCYRYIAVFEEKLKDLLRIVGPIPVAIDASDIVNYRR 257

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+      +C   N   +H+VL+VGYGV+        VPYWI+KN+WGE WGE+GYFR+ 
Sbjct: 258 GIMR----YC--SNYGFNHAVLLVGYGVENN------VPYWILKNTWGEDWGEQGYFRVQ 305

Query: 332 RGDGSCGI-NDYVRSALV 348
           +   +CGI N+ + SA +
Sbjct: 306 QNINACGIRNELLASAEI 323


>gi|343474734|emb|CCD13687.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 524

 Score =  201 bits (511), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 119/343 (34%), Positives = 184/343 (53%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  ++ +
Sbjct: 93  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRMFKQSMER 150

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+ +FSD+S  EF+A YL G K   +   R  P  + N++    P 
Sbjct: 151 AKE-EAAANPYATFGVTQFSDMSPEEFRATYLNGAKYYAAALKR--PRKVVNVSTGKAPP 207

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  CGS WAF+  GNIEG +     +L SLSEQ L+ CD  +D C
Sbjct: 208 AVDWRKKGAVTPVKDQGSCGSCWAFAAIGNIEGQWKIAGHELTSLSEQMLVSCDTTEDNC 267

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD---KACRLNKKATQVKINGYVSVSRDET 245
            GG    AF  I+S   G +  E++YPY   D     C  + K    KI+G++++ +DE 
Sbjct: 268 GGGFADRAFKWIVSSNKGNVFTERSYPYASIDGYVPPCNKSGKVVGAKISGHINLPKDEN 327

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L  NGP+A+A++A     Y  GV           +++++H VL+VGY  D +K  
Sbjct: 328 AIAEWLARNGPVAIAVDASTFLDYKGGV------LTSCSSKHVNHEVLLVGYN-DTSK-- 378

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSW + WGE+GY R+ +G   C + +Y RS +V
Sbjct: 379 ---PPYWIIKNSWDKEWGEEGYIRIEKGTNLCLMKEYARSVVV 418


>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 356

 Score =  201 bits (511), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 140/348 (40%), Positives = 182/348 (52%), Gaps = 32/348 (9%)

Query: 11  ALLSLTVSV-----SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           ALL L V       S F +VG  +     + +   LF  +L +H K YA+  E   R  +
Sbjct: 13  ALLLLCVGACVARNSDFSIVGYSEEDLSSNERLVELFEKWLAKHQKAYASFEEKLHRFEV 72

Query: 66  FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT 125
           F  NL+ I  + + E  S   GLNEF+DL+  EF+A YLG    P+    S      +++
Sbjct: 73  FKDNLKHIDKI-NREVTSYWLGLNEFADLTHDEFKAAYLGLDAAPARRGSSRSFRYEDVS 131

Query: 126 ---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
              LP++ DWR+  AVT VK+Q  CGS WAFST   +EG+ A  T  L +LSEQELIDC 
Sbjct: 132 ASDLPKSVDWRKKGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCS 191

Query: 183 QE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ--VKINGYVS 239
            + + GC GG +  AF  I S   GGL  E+ YPY  ++ +C   KKA    V I+GY  
Sbjct: 192 VDGNSGCNGGLMDYAFSYIASS--GGLHTEEAYPYLMEEGSCGDGKKAESEAVTISGYED 249

Query: 240 V-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
           V + DE  + K L    P++VAI A     QFY  GV      F       L H V  VG
Sbjct: 250 VPANDEQALIKALAHQ-PVSVAIEASGRHFQFYSGGV------FDGPCGAQLDHGVAAVG 302

Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGIN 340
           YG D+     K   Y I++NSWG  WGEKGY R+ R    G+G CGIN
Sbjct: 303 YGSDKG----KGHDYIIVRNSWGAQWGEKGYIRMKRGTSNGEGLCGIN 346


>gi|47779249|gb|AAT38521.1| cysteine protease [Bombyx mori NPV]
          Length = 323

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 116/317 (36%), Positives = 174/317 (54%), Gaps = 21/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y++ VE   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P+        ++   P    P  FDWR  + VT VK+Q MCG+ WA
Sbjct: 80  KDETIAKYTGLSL-PTQTQNFCKVILLDQPPGKGPLEFDWRRLNKVTSVKNQGMCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ++IDCD  D GC GG +  AF+       GG++ E 
Sbjct: 139 FATLGSLESQFAIKHNELINLSEQQMIDCDFVDAGCNGGLLHTAFEANCRM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  D+  CR+N     V++ + Y  +   E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEADNNNCRMNSNKFLVQVKDCYRYIIVYEEKLKDLLRLVGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  + G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE G+FR+ 
Sbjct: 257 GI---IKYCFNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGEDGFFRVQ 304

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 305 QNINACGMRNELASTAV 321


>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
          Length = 460

 Score =  201 bits (511), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 121/312 (38%), Positives = 166/312 (53%), Gaps = 26/312 (8%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A +  +L +H K+Y  L E   R  IF  N   I      +  S   GLN F+DL+  E+
Sbjct: 42  AAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNEEY 101

Query: 100 QAKYLGFKLKPSYADRSVP----AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
           ++KY G + K S    S      A +   +LP + DWRE+ AV  VKDQ  CGS WAFST
Sbjct: 102 RSKYTGIRTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQCGSCWAFST 161

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
              +EG+    T KL++LSEQEL+DCD+  ++GC GG + +AF  I++   GG++ +  Y
Sbjct: 162 ISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINN--GGIDSDADY 219

Query: 215 PYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVT 271
           PY G D  C +  K A  V I+ Y  V   +    +    N P++VAI A     QFY +
Sbjct: 220 PYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYDS 279

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+      F      +L H V++VGYG +  K       YWI++NSWG  WGEKGY R+ 
Sbjct: 280 GI------FTGKCGTDLDHGVVVVGYGTENGK------DYWIVRNSWGADWGEKGYLRME 327

Query: 332 RG----DGSCGI 339
           RG     G CGI
Sbjct: 328 RGISSKAGICGI 339


>gi|394331822|gb|AFN27130.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  201 bits (510), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 116/318 (36%), Positives = 172/318 (54%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWR+  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPYAVDWRKKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             G+IE  +A    +L +LSEQ+L+ CD +D GC GG +  AF+ ++  + G +  E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEQQLVSCDDKDSGCGGGLMLQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY            ++Q+    +I+GY+++   ET MA +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYE 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV              L+H VL+VGY +         VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGITLNHGVLLVGYNMT------GEVPYWVIKNSWGEDWGENGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
          Length = 333

 Score =  201 bits (510), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 124/317 (39%), Positives = 173/317 (54%), Gaps = 26/317 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
           F   H K+Y + +E   R  IFS N   +    + ++  G+     G+N+F DL   EF 
Sbjct: 30  FKATHKKSYQSNMEELLRFKIFSENSLLVAR-HNEKYARGLVSYKLGMNQFGDLLPHEFA 88

Query: 101 AKYLGFKLKPSYADRSV---PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
             + G++   +    S    PA +   +LP++ DWRE  AVT VK+Q  CGS WAFSTTG
Sbjct: 89  RMFNGYRGARTAGRGSTFLPPANVNYSSLPQSMDWREKGAVTPVKNQGQCGSCWAFSTTG 148

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           ++EG +  KT  LVSLSEQ L+DC +   + GCEGG + NAF  I  K  GG++ EK+YP
Sbjct: 149 SLEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYI--KANGGIDTEKSYP 206

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           Y  +D  CR  K+       G+V + +  E D+ K +   GP++VAI+A   + Q Y  G
Sbjct: 207 YEAEDGECRFKKQNVGATDTGFVDIEQGSEDDLKKAVATVGPVSVAIDASHSSFQLYSEG 266

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V    +  C   +E L H VL+VGYGV+  K       YW++KNSW E WG+ GY ++ R
Sbjct: 267 VYDETE--CS--SEQLDHGVLVVGYGVEDGK------KYWLVKNSWAESWGDNGYIKMSR 316

Query: 333 G-DGSCGINDYVRSALV 348
             D  CGI       LV
Sbjct: 317 DKDNQCGIASAASYPLV 333


>gi|119964630|ref|YP_950826.1| cathepsin [Maruca vitrata MNPV]
 gi|119514473|gb|ABL76048.1| cathepsin [Maruca vitrata MNPV]
          Length = 324

 Score =  201 bits (510), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 115/317 (36%), Positives = 171/317 (53%), Gaps = 20/317 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ Q NK Y + +E   R  IF  NL +I + ++    +  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVLQFNKNYGSEIEKLRRFKIFQHNLNEI-INKNQNDSAAKYEINKFSDLS 80

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P         ++   P    P  FDWR  + VT VK+Q +CG+ WA
Sbjct: 81  KDETIAKYTGLSL-PIQTQNFCKVIVLDQPPGKGPFEFDWRRLNKVTNVKNQGVCGACWA 139

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+   ++E  +A K  +L+ LSEQ++IDCD  D GC GG +  AF+ ++    GG++ EK
Sbjct: 140 FAALASLESQFAMKHNQLIDLSEQQMIDCDSVDAGCNGGLLHTAFEAVIKM--GGVQLEK 197

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY   +  CR+N     VK+ + Y  +   E  +   L   GP+ +AI+A  +  Y  
Sbjct: 198 DYPYEAANNNCRMNSNKFLVKVKDCYRYIIVYEEKLKDLLRSVGPIPMAIDAADIVNYKQ 257

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+   I++  + G   L+H+VL+VGYGV+        +PYW  KN+WG  WGE GYFRL 
Sbjct: 258 GI---IKYCLNSG---LNHAVLLVGYGVENN------IPYWTFKNTWGTDWGESGYFRLQ 305

Query: 332 RGDGSCGINDYVRSALV 348
           +   +CG+ + + S  V
Sbjct: 306 QNINACGMRNELASTAV 322


>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
          Length = 461

 Score =  201 bits (510), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 126/314 (40%), Positives = 167/314 (53%), Gaps = 31/314 (9%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A++  +L +H K+Y  + E   R  IF  NLR I    + E  +   GLN F+DL+  E+
Sbjct: 44  AMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDE-HNAESRTYKVGLNRFADLTNDEY 102

Query: 100 QAKYLGFKL-------KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           ++ YLG +            +DR VP  +   +LP + DWRE  AV GVKDQ  CGS WA
Sbjct: 103 RSMYLGARTGSRRRLSTQKRSDRYVP--VAGESLPDSVDWREKGAVVGVKDQGSCGSCWA 160

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           FST   +EG+    T  L+SLSEQEL+DCD   ++GC GG +  AF+ I+    GG++ E
Sbjct: 161 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTE 218

Query: 212 KTYPYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
           + YPY   D  C +  K A  V I+ Y  V  +     +  V N P++VAI A   A QF
Sbjct: 219 EDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEASGMAFQF 278

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +GV      F       L H V  VGYG      T  +V YWI+KNSWG  WGE GY 
Sbjct: 279 YESGV------FTGNCGTALDHGVTAVGYG------TENSVDYWIVKNSWGSSWGESGYI 326

Query: 329 RLYRGDGS---CGI 339
           R+ R  G+   CGI
Sbjct: 327 RMERNTGATGKCGI 340


>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
          Length = 367

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 129/325 (39%), Positives = 174/325 (53%), Gaps = 40/325 (12%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF+ +L +H K Y +  E   RL IF  NL+ I       + S   GLN+F+DL+  EF+
Sbjct: 42  LFDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNEEFK 101

Query: 101 AKYLGFKLKPSYADR----------------SVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
            +Y G K    + DR                +V +   + ++  + DWR+  AVTGVKDQ
Sbjct: 102 TRYFG-KNSKQWRDRRRTELEGAELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQ 160

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKL 204
             CGS WAFSTTG IEGV    T KLVSLSEQEL+ CD  + GCEGG +  AF  ++   
Sbjct: 161 AQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACDATNYGCEGGDMDYAFTWVIQN- 219

Query: 205 GGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENG--PMAVAI 261
            GG++ EK Y Y G D  C  NK+A + V I+GY  VS D++ +   L   G  P++V I
Sbjct: 220 -GGIDTEKDYSYTGVDSTCNTNKEAKKIVSIDGYTDVSPDDSAL---LCAAGSQPVSVGI 275

Query: 262 NAYAL--QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           +  A+  Q Y  G+       C G  +++ H+VL+VGY     K       YWI+KNSWG
Sbjct: 276 DGSAIDFQLYTGGI---YDGDCSGNPDDIDHAVLVVGYSAKNGK------DYWIVKNSWG 326

Query: 320 EGWGEKGYFRLYRGD----GSCGIN 340
             WG +GYF + R      G C IN
Sbjct: 327 TDWGLEGYFYILRNTELPYGVCAIN 351


>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
 gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
          Length = 358

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 122/303 (40%), Positives = 167/303 (55%), Gaps = 26/303 (8%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H+    +L E   R ++F  NL+ I  +   +    +  LN F+D++  EF   Y G K+
Sbjct: 46  HHTVSRSLAEKQERFNVFKENLKHIHKVNHKDRPYKLK-LNSFADMTNHEFLQHYGGSKV 104

Query: 109 KPSYADRS----VPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
                 R       +M  + + LP + DWR+  AVTG+KDQ  CGS WAFST   +EG+ 
Sbjct: 105 SHYRVLRGQRQGTGSMHEDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGIN 164

Query: 164 AAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC 223
             KT +L+SLSEQEL+DCD ++ GC GG + +AF+ I  K  GGL  E TYPYR  ++ C
Sbjct: 165 KIKTGELISLSEQELVDCDSDNHGCNGGLMEDAFNFI--KQIGGLTSENTYPYRAKEEPC 222

Query: 224 RLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFF 280
             NK  +  V I+GY  V  ++ +     V N P+A+A++A    LQFY   +     F 
Sbjct: 223 DSNKMNSPVVNIDGYEMVPENDENALMKAVANQPVAIAMDAGGKDLQFYSEAI-----FT 277

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGS 336
            D G E L+H V +VGYG      T     YWI+KNSWG  WGEKGY R+ RG    +G 
Sbjct: 278 GDCGTE-LNHGVALVGYGT-----TQDGTKYWIVKNSWGTDWGEKGYIRMQRGIDAEEGL 331

Query: 337 CGI 339
           CGI
Sbjct: 332 CGI 334


>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
 gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
          Length = 471

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 127/312 (40%), Positives = 169/312 (54%), Gaps = 28/312 (8%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           ++  +L +H K Y  L E   R  IF  NLR I      +    V GLN F+DL+  E++
Sbjct: 50  MYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDRSYKV-GLNRFADLTNEEYK 108

Query: 101 AKYLGFKLKPS---YADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           A +LG K++        RS   +  +   LP   DWRE  AV  VKDQ  CGS WAFST 
Sbjct: 109 AMFLGTKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSCWAFSTV 168

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           G +EG+    T +L+SLSEQEL+DCD+  + GC GG +  AF+ I++   GG++ E+ YP
Sbjct: 169 GAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINN--GGIDTEEDYP 226

Query: 216 YRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           Y+  D  C  N+K A  V I+GY  V  ++ +  K  V + P++VAI A   A Q Y +G
Sbjct: 227 YKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRAFQLYKSG 286

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V     F    G E L H V+ VGYG      T   V YWI++NSWG  WGE GY R+ R
Sbjct: 287 V-----FTGRCGTE-LDHGVVAVGYG------TENGVNYWIVRNSWGSAWGESGYIRMER 334

Query: 333 G-----DGSCGI 339
                  G CGI
Sbjct: 335 NVANTKTGKCGI 346


>gi|340381055|ref|XP_003389037.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
          Length = 329

 Score =  201 bits (510), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 118/334 (35%), Positives = 178/334 (53%), Gaps = 21/334 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L ++V++ +V            ++TA +  +   + K Y++  E   R  I+  N +K
Sbjct: 1   MKLLIAVAALIVCATA-------FEYTAEWELWKRTNGKDYSSEKEELYRQTIWEAN-KK 52

Query: 73  IQLLQDTEHGSGVYGL--NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAF 130
           I L  +       + L  N F+DL ++EF A Y G++     ++ +   +     LP   
Sbjct: 53  IVLEHNANADKWGWTLEMNAFADLESSEFAAMYNGYRRSARKSNATRYHVPTGNALPDTV 112

Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGC 188
           DWR   AVT VK+Q  CGS WAFSTTG++EG    K   L SLSEQ+L+DC  +  + GC
Sbjct: 113 DWRTKGAVTPVKNQKQCGSCWAFSTTGSLEGQTFLKKGTLPSLSEQQLVDCSDKYGNHGC 172

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMA 248
           +GG + NAF  I  +  GG++ E +YPY   +  CR  + A      GY  +  D+ D  
Sbjct: 173 QGGLMDNAFKYI--EANGGIDSEASYPYEAKNGKCRFQQSAVAATCTGYKDIPHDDIDGL 230

Query: 249 KYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
           +  V N GP++VA++A   + Q Y  GV  P+   C   +  L H VL VGYG + +   
Sbjct: 231 QDAVANVGPISVAMDASHSSFQLYAAGVYDPL--LCS--STRLDHGVLAVGYGTEPSGLF 286

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
           H+  PYW++KNSWG  WG++GYF++ R D  CGI
Sbjct: 287 HEEKPYWLVKNSWGPDWGQQGYFKIVRKDNKCGI 320


>gi|394331820|gb|AFN27129.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  200 bits (509), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 117/318 (36%), Positives = 174/318 (54%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWR+  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             G+IE  +A    +L +LSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEQQLVSCDDKDNGCRGGLMLQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY            ++Q+    +I+GY+++   ET MA +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSSTGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQ 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV              L+H VL+V Y  +RT      VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGMPLNHGVLLVWY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|2582045|gb|AAB82449.1| lymphopain [Homo sapiens]
 gi|2582181|gb|AAB82457.1| lymphopain [Homo sapiens]
 gi|3033547|gb|AAC32181.1| cathepsin W [Homo sapiens]
          Length = 376

 Score =  200 bits (509), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 118/328 (35%), Positives = 175/328 (53%), Gaps = 27/328 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  Q N++Y +  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF  
Sbjct: 42  FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
            Y G++      PS   R + +  P  ++P + DWR+   A++ +KDQ  C   WA +  
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFSCDWRKVAGAISPIKDQKNCNCCWAMAAA 159

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIE ++       V +S  EL+DC +  DGC GG + +AF T+++    GL  EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVHELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPF 217

Query: 217 RGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
           +G  +A R + K  Q    I  ++ +  +E  +A+YL   GP+ V IN   LQ Y  GV 
Sbjct: 218 QGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLQLYRKGVI 277

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
                 CD   + + HSVL+VG+G  +++                    PYWI+KNSWG 
Sbjct: 278 KATPTTCD--PQLVDHSVLLVGFGSVKSEEGIWAETVSSQSQPQPPHPTPYWILKNSWGA 335

Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
            WGEKGYFRL+RG  +CGI  +  +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363


>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
 gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
 gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
          Length = 324

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 116/318 (36%), Positives = 172/318 (54%), Gaps = 21/318 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  FL + NK Y++  E   R  IF  NL +I + ++    +  Y +N+FSDLS
Sbjct: 22  LKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEI-INKNQNDSTAQYEINKFSDLS 80

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  +KY G  L P         +I   P    P  FDWR+++ VT VK+Q +CG+ WA
Sbjct: 81  KEEAISKYTGLSL-PHQTQNFCEVVILDRPPDRGPLEFDWRQFNKVTSVKNQGVCGACWA 139

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T G++E  +A K  +L++LSEQ+ IDCD+ + GC+GG +  AF++ M    GG++ E 
Sbjct: 140 FATLGSLESQFAIKYNRLINLSEQQFIDCDRVNAGCDGGLLHTAFESAMEM--GGVQMES 197

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY   +  CR+N     V +      +   E  +   L   GP+ VAI+A  +  Y  
Sbjct: 198 DYPYETANGQCRINPNRFVVGVRSCRRYIVMFEEKLKDLLRAVGPIPVAIDASDIVNYRR 257

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           G+           N  L+H+VL+VGY V+        +PYWI+KN+WG  WGE GYFR+ 
Sbjct: 258 GIMRQC------ANHGLNHAVLLVGYAVENN------IPYWILKNTWGTDWGEDGYFRVQ 305

Query: 332 RGDGSCGI-NDYVRSALV 348
           +   +CGI N+ V SA +
Sbjct: 306 QNINACGIRNELVSSAEI 323


>gi|403293523|ref|XP_003937763.1| PREDICTED: cathepsin W [Saimiri boliviensis boliviensis]
          Length = 373

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 115/328 (35%), Positives = 177/328 (53%), Gaps = 30/328 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F +F  Q N++Y T  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF  
Sbjct: 42  FKFFQRQFNRSYLTPEEHARRLDIFAHNLAQAQQLQEEDFGTAEFGVTPFSDLTEEEFGQ 101

Query: 102 KYLGFKLKPSYADRSVPAM-------IPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAF 153
            Y G +     A   VP M        P  ++P   DWR+   A++ +++Q  C   WA 
Sbjct: 102 LY-GHR----RAAGGVPGMGRVVGPEEPEESVPHTCDWRKVAGAISSIRNQGNCNCCWAM 156

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           +  GNIE ++     K V++S QEL+DC +  +GC GG +  AF T+++    G+  E+ 
Sbjct: 157 AAAGNIEALWGINFLKFVNVSVQELLDCGRCGNGCYGGYVWEAFLTVLNN--SGVASERD 214

Query: 214 YPYRGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
           YP+R + +  R + K +     I  ++ +  +E  +A+YL   GP+ V IN   L+ Y  
Sbjct: 215 YPFRANFRPHRCHAKTSNKVAWIQDFIFLPDNEQRIAQYLATYGPITVTINMKYLKLYQK 274

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT-----------HKAVPYWIIKNSWGE 320
           GV       CD   + + HSVL+VG+G D+++              ++ PYWI+KNSWG 
Sbjct: 275 GVIKASPTTCD--PQFVDHSVLLVGFGSDKSEGMGAETVSSPSRHPRSTPYWILKNSWGA 332

Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
            WGE+GYFRL+RG  +CGI  Y  +A V
Sbjct: 333 QWGEEGYFRLHRGSNTCGITKYPVTARV 360


>gi|577617|gb|AAC37213.1| cysteine proteinase [Trypanosoma cruzi]
          Length = 467

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 118/345 (34%), Positives = 177/345 (51%), Gaps = 20/345 (5%)

Query: 9   GVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSG 68
            ++L ++ V ++  +      LH    +   + F  F ++H + Y +  E   RL +F  
Sbjct: 7   ALSLAAVLVVMACLVPAATASLHAEETL--ASQFAEFKQKHGRVYGSAAEEAFRLSVFRA 64

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS--VPAMIPNITL 126
           NL  +  L    +    +G+  FSDL+  EF+++Y       + A+    VP  +  +  
Sbjct: 65  NL-FLARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAEERARVPVDVEVVGA 123

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P A DWRE  AVT VK+Q +CGS WAF+  GNIEG +      L  LSEQ L+ CD  + 
Sbjct: 124 PAAKDWREEGAVTAVKNQGICGSCWAFAAIGNIEGQWFLAGNPLTRLSEQMLVSCDNTNS 183

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRD 243
           GC GG  S AF+ I+ +  G +  E +YPY    G    C+ + +     I G+V + +D
Sbjct: 184 GCGGGLSSKAFEWIVQENNGAVYTEDSYPYHSCIGIKLPCKDSDRTVGATITGHVELPQD 243

Query: 244 ETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           E  +A      GP++VA++A +  FY  GV        +  ++ LSH+VL+VGY      
Sbjct: 244 EAQIAASGAVKGPLSVAVDASSWFFYTGGV------LTNCVSKRLSHAVLLVGYN----- 292

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               AVPYWIIKNSW   WGE GY R+ +G   C + + V SA+V
Sbjct: 293 -DSAAVPYWIIKNSWTTHWGEGGYIRIAKGSNQCLVKEEVSSAVV 336


>gi|22549430|ref|NP_689203.1| cath gene product [Mamestra configurata NPV-B]
 gi|215401259|ref|YP_002332563.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
 gi|22476609|gb|AAM95015.1| putative cysteine proteinase [Mamestra configurata NPV-B]
 gi|198448759|gb|ACH88549.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
 gi|390165231|gb|AFL64878.1| cathepsin [Mamestra brassicae MNPV]
 gi|401665635|gb|AFP95747.1| putative cysteine proteinase [Mamestra brassicae MNPV]
          Length = 341

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 122/323 (37%), Positives = 177/323 (54%), Gaps = 21/323 (6%)

Query: 32  HLHHVKHTAL-FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           +L+++    L F  F+ Q+NK Y++  E   R +IF  N+  I   +++ + S VY +N 
Sbjct: 33  NLYNINSAPLYFEKFITQYNKQYSSEDEKKYRYNIFRHNIESINA-KNSRNDSAVYKINR 91

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP---NITLPRAFDWREYDAVTGVKDQTMC 147
           F+D++  E   ++ G     + A+     ++        P  FDWR Y+ VT VKDQ MC
Sbjct: 92  FADMTKNEVVNRHTGLASGDTGANFCETIVVDGPGQRQRPANFDWRNYNKVTSVKDQGMC 151

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGG 207
           G+ WAF+  G +E  YA K  +L+ L+EQ+L+DCD  D GC+GG I  A++ IM    GG
Sbjct: 152 GACWAFAGLGALESQYAIKYDRLIDLAEQQLVDCDFVDMGCDGGLIHTAYEQIMHI--GG 209

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
           +E+E  YPY+     C +      V + N Y  V   E  +   L   GP+A+A++A  L
Sbjct: 210 VEQEYDYPYKAVRLPCAVKPHKFAVGVRNCYRYVLLSEERLEDLLRHVGPIAIAVDAVDL 269

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
             Y  GV      FC+  N  L+H+VL+VGYGV+        VPYW IKNSWG  +GE G
Sbjct: 270 TDYYGGVIS----FCE--NNGLNHAVLLVGYGVENN------VPYWTIKNSWGPDYGENG 317

Query: 327 YFRLYRGDGSCG-INDYVRSALV 348
           Y R+ RG  SCG IN+   SA +
Sbjct: 318 YVRIRRGVNSCGMINELASSAQI 340


>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
          Length = 441

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 123/314 (39%), Positives = 169/314 (53%), Gaps = 30/314 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF  + +QH KTYA+  E   RL +F  N   +       + S    LN F+DL+  EF+
Sbjct: 29  LFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEFK 88

Query: 101 AKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           A  LG        L    ++R +P  + ++  P + DWR+  AVT VKDQ  CG+ W+FS
Sbjct: 89  ASRLGLSSAASASLNVDRSNRQIPDFVADV--PASVDWRKNGAVTQVKDQGNCGACWSFS 146

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
            TG IEG+    T  LVSLSEQEL+DCD+  ++GCEGG +  AF  ++     G++ E+ 
Sbjct: 147 ATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDN--HGIDTEED 204

Query: 214 YPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYV 270
           YPY+G D++C   K K   V I+GYV V ++        V N P++V I  +  A Q Y 
Sbjct: 205 YPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYS 264

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            G+      F    + +L H+VLIVGYG      +   V YWI+KNSWG  WG  GY  +
Sbjct: 265 KGI------FTGPCSTSLDHAVLIVGYG------SENGVDYWIVKNSWGSYWGMDGYMHM 312

Query: 331 YRGDGS----CGIN 340
            R  GS    CGIN
Sbjct: 313 QRNSGSSRGLCGIN 326


>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  200 bits (509), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 141/352 (40%), Positives = 186/352 (52%), Gaps = 37/352 (10%)

Query: 1   MSC-FYFFAGVALLSLTVSVSSFMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLV 57
           ++C F  FA +A+         F +VG   E L  +   K   LF  ++ +H K Y ++ 
Sbjct: 11  LACSFCLFASLAV------AGDFSIVGYSSEDLKSMD--KLIELFESWMSRHGKIYQSIE 62

Query: 58  EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
           E   R  IF  NL+ I           + GLNEF+DLS  EF+ KYLG K+  S    S 
Sbjct: 63  EKLHRFDIFKDNLKHIDERNKVVSNYWL-GLNEFADLSHQEFKNKYLGLKVDYSRRRESP 121

Query: 118 PAMI-PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
                 +  LP++ DWR+  AVT VK+Q  CGS WAFST   +EG+    T  L SLSEQ
Sbjct: 122 EEFTYKDFELPKSVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQ 181

Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKI 234
           ELIDCD+  ++GC GG +  AF  I+    GGL +E+ YPY  ++  C + K+ T+ V I
Sbjct: 182 ELIDCDRTYNNGCNGGLMDYAFSFIVEN--GGLHKEEDYPYIMEEGTCEMTKEETEVVTI 239

Query: 235 NGYVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHS 291
           +GY  V + +E  + K LV N P++VAI A     QFY  GV      F      +L H 
Sbjct: 240 SGYHDVPQNNEQSLLKALV-NQPLSVAIEASGRDFQFYSGGV------FDGHCGSDLDHG 292

Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           V  VGYG      T K V Y I+KNSWG  WGEKGY R+ R     +G CGI
Sbjct: 293 VAAVGYG------TSKGVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEGICGI 338


>gi|426369199|ref|XP_004051582.1| PREDICTED: cathepsin W [Gorilla gorilla gorilla]
          Length = 376

 Score =  200 bits (508), Expect = 8e-49,   Method: Compositional matrix adjust.
 Identities = 117/328 (35%), Positives = 173/328 (52%), Gaps = 27/328 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  Q N++Y +  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF  
Sbjct: 42  FKLFQIQFNRSYLSPEEHAHRLDIFAHNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query: 102 KYLGFKLK----PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTT 156
            Y G++      PS   R + +  P  ++P   DWR+   A++ +KDQ  C   WA +  
Sbjct: 102 LY-GYRRAAGGVPSMG-REIRSEEPEESVPFTCDWRKVAGAISPIKDQKNCNCCWAMAAA 159

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIE ++       V +S QEL+DC +  DGC GG + +AF T+++    GL  EK YP+
Sbjct: 160 GNIETLWRISFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKDYPF 217

Query: 217 RGDDKA--CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVS 274
           +G  +A  C   K      I  ++ +  +E  +A+YL   GP+ V IN   L+ Y  GV 
Sbjct: 218 QGKVRAHSCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKPLRLYRKGVI 277

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNSWGE 320
                 CD   + + HSVL+VG+G  +++                    PYWI+KNSWG 
Sbjct: 278 KATPITCD--PQLVDHSVLLVGFGSIKSEEGILAETVSSQSQPQPPHPTPYWILKNSWGA 335

Query: 321 GWGEKGYFRLYRGDGSCGINDYVRSALV 348
            WGEKGYFRL+RG  +CGI  +  +A V
Sbjct: 336 QWGEKGYFRLHRGSNTCGITKFPLTARV 363


>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  200 bits (508), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 127/343 (37%), Positives = 171/343 (49%), Gaps = 31/343 (9%)

Query: 7   FAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIF 66
           FA V  L L     S   + D  +H  H          ++ ++ K Y  L E   R +IF
Sbjct: 12  FALVLCLGLWAFQVSSRTLQDASMHERHE--------QWMARYGKVYKDLQEKEKRFNIF 63

Query: 67  SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYA-DRSVPAMIPNIT 125
             N++ I+   +  +     G+N+F+DL+  EF A    FK   S +  R+      N+T
Sbjct: 64  QENVKYIEASNNAGNKPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSITRTTTFKYENVT 123

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
            P   DWR+  AVT VK+Q  CG  WAFS     EG++   T  LVSLSEQEL+DCD   
Sbjct: 124 APSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSG 183

Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSVSR 242
            D GC+GG + +AF  I+    GGL  E  YPY+G D  C  N++ T V  I GY  V  
Sbjct: 184 ADQGCQGGLMDDAFKFIIQN--GGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPS 241

Query: 243 DETDMAKYLVENGPMAVAINAYALQF--YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
           +     +  V N P++VAI+A    F  Y +GV      F       L H V +VGYGV 
Sbjct: 242 NNEQALQQAVANQPISVAIDASGSDFQNYQSGV------FTGSCGTQLDHGVAVVGYGV- 294

Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
               +     YW++KNSWGE WGE+GY R+ R     +G CGI
Sbjct: 295 ----SDDGTKYWLVKNSWGEDWGEEGYIRMQRDVEAPEGLCGI 333


>gi|4581057|gb|AAD24589.1|AF139913_1 cysteine protease [Trypanosoma congolense]
          Length = 440

 Score =  200 bits (508), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 124/343 (36%), Positives = 181/343 (52%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLHAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  + N++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPP 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  C SSWAFS  GNIEG +     +L SLSEQ L+ CD  D GC
Sbjct: 129 AIDWRKKGAVTPVKDQGQCHSSWAFSAIGNIEGQWKIAGHELTSLSEQMLVSCDTNDFGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
            GG    AF  I+S   G +  E++YPY    G+   C  + K    KI   V + RDE 
Sbjct: 189 GGGFSDPAFKWIVSSNKGNVFTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L + GP+A+A++A + Q Y  GV           +E+L H VL+VGY  D +K  
Sbjct: 249 AIAEWLAKKGPVAIAVDATSFQSYTGGV------LTSCISEHLDHGVLLVGYD-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSWG+GWGE+GY R+ +G   C + +   SA+V
Sbjct: 300 ---PPYWIIKNSWGKGWGEEGYIRIEKGTNQCLMKNLPSSAVV 339


>gi|394331826|gb|AFN27132.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  200 bits (508), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 117/318 (36%), Positives = 173/318 (54%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWR+  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             G+IE  +A     L +LSEQ+L+ CD +D+GC GG +  AF+ ++  + G +  E +Y
Sbjct: 155 AVGSIESQWALAGHGLTALSEQQLVSCDDKDNGCSGGLMLQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY            ++Q+    +I GY+++   ET    +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIEGYMTIESSETVKGAWLAKNGPISIAVDASSFMSYQ 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV            + L+H VL+VGY  +RT      VPYW+IKNSWGE WGEKGY R+
Sbjct: 275 SGV------LTSCAGDALNHGVLLVGY--NRT----GEVPYWVIKNSWGEDWGEKGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
          Length = 371

 Score =  200 bits (508), Expect = 9e-49,   Method: Compositional matrix adjust.
 Identities = 126/316 (39%), Positives = 168/316 (53%), Gaps = 32/316 (10%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
            L+  ++ QH K Y  + E   R  IF  NLR I       + +   GLN+F+DL+  E+
Sbjct: 43  GLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEY 102

Query: 100 QAKYLGFKLKPSYADRSVPAMIPNI--------TLPRAFDWREYDAVTGVKDQTMCGSSW 151
           +AK+LG +  P    R + + IP+          LP + DWR++ AV+ VKDQ  CGS W
Sbjct: 103 RAKFLGTRTDPRR--RLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSPVKDQGSCGSCW 160

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEE 210
           AFST   +EG+    + +LVSLSEQEL+DCD+  D GC GG +  AF  IM    GG++ 
Sbjct: 161 AFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFIMDN--GGIDT 218

Query: 211 EKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQ 267
           EK YPY G +  C   KK A  V I+GY  V  +E  + K  V + P+++AI A   A Q
Sbjct: 219 EKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVSIAIEAGGRAFQ 277

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
            Y +GV      F       L H V+ VGYG D          YWI++NSWG  WGE GY
Sbjct: 278 LYESGV------FNGECGLALDHGVVAVGYGTD-----DNGQDYWIVRNSWGSNWGENGY 326

Query: 328 FRLYR----GDGSCGI 339
            R+ R      G CGI
Sbjct: 327 IRMERNINANTGKCGI 342


>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
          Length = 462

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 125/341 (36%), Positives = 177/341 (51%), Gaps = 25/341 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           VA  +L +S+ ++      K       +  A++  +L +H K+Y  L E   R  IF  N
Sbjct: 18  VASSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDN 77

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLP 127
           LR I      E+ S   GLN F+DL+  E+++ YLG K KP  +        P +  +LP
Sbjct: 78  LRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPKLSKVKSDRYAPRVGDSLP 137

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DD 186
            + DWR   AV  +KDQ  CGS WAFST   +EG+    T +L++LSEQEL+DCD+  ++
Sbjct: 138 ESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNE 197

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSVSRDET 245
           GC+GG +   F+ I++   GG++ +K YPY G D  C +  K A  V I+ Y  V  +  
Sbjct: 198 GCDGGLMDYGFEFIINN--GGIDTDKDYPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNE 255

Query: 246 DMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
           +  K  V + P++V I     A QFY +G+      F       L H V +VGYG     
Sbjct: 256 EALKKAVASQPVSVGIEGGGRAFQFYDSGI------FTGKCGTALDHGVNVVGYG----- 304

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
            T K   YWI++NSWG  WGE GY R+ R       G CGI
Sbjct: 305 -TEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGI 344


>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
          Length = 454

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 132/354 (37%), Positives = 177/354 (50%), Gaps = 29/354 (8%)

Query: 1   MSCFYFFAGVALLSLTVSVS--SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVE 58
           M     FA +AL ++  S S   F ++  +    +       L+  +L QH K Y  L E
Sbjct: 1   MGILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDE 60

Query: 59  YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL--KPSYADRS 116
              +  +F  N   I    +  + S   GLN+F+DLS  EF+A YLG KL  K   +   
Sbjct: 61  KQKKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKAAYLGTKLDAKKRLSRSP 120

Query: 117 VPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLS 174
            P    ++   LP + DWRE  AVT VK+Q  CGS WAFST   +EG+    T  L SLS
Sbjct: 121 SPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180

Query: 175 EQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQV 232
           EQEL+DCD   + GC GG +  AF  I+S   GGL+ E  YPY+ ++ +C    K A  V
Sbjct: 181 EQELVDCDTSYNQGCNGGLMDYAFQFIISN--GGLDSEDDYPYKANNGSCDAYRKNAHVV 238

Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSH 290
            I+ Y  V  ++    K    N P++VAI A   A QFY +GV      F       L H
Sbjct: 239 TIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGV------FTSNCGTQLDH 292

Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
            V +VGYG      +   + YW++KNSWG  WGEKG+ +L R       G CGI
Sbjct: 293 GVTLVGYG------SESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGI 340


>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 125/309 (40%), Positives = 165/309 (53%), Gaps = 20/309 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAE 98
           +  F   H KTY +LVE   R  +F  NL  IQ      E G   +   + +F+D++  E
Sbjct: 23  WQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEE 82

Query: 99  FQ--AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F    K  G    PS A         ++    A DWRE  AVT VKDQ  CGS WAFS  
Sbjct: 83  FLDLLKLQGVPALPSNAVHFDNFEDIDMEEKDAIDWREEGAVTPVKDQANCGSCWAFSAV 142

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           G IEG +  K   LVSLS QEL+DC  ED   +GC+GG +  AFD +  +   G++ E++
Sbjct: 143 GAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEES 199

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
           YPY G   +C+ + +    K+  YV    DE +MA+ +   GP+AVAI A  L FY  G+
Sbjct: 200 YPYEGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGI 257

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
               +  C    E+L+H VL+VGYG      +   V YWI+KNSWG  WGEKGYFRL + 
Sbjct: 258 VDE-RCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKD 310

Query: 334 DGSCGINDY 342
             +CGI  Y
Sbjct: 311 VKACGIGTY 319


>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
          Length = 360

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 127/307 (41%), Positives = 170/307 (55%), Gaps = 32/307 (10%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H+    +L E + R ++F  N+  +      +    +  LN+F+D++  EF+  Y G K+
Sbjct: 44  HHTVSRSLDEKHKRFNVFKANVHYVHNFNKKDKPYKL-KLNKFADMTNHEFRQHYAGSKI 102

Query: 109 KPSY----ADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           K       A R+    +      +P + DWR+  AVT VKDQ  CGS WAFST   +EG+
Sbjct: 103 KHHRTLLGASRANGTFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGI 162

Query: 163 YAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
              KTKKLVSLSEQEL+DCD  E+ GC GG +  AFD I  +  GG+  E+ YPY+ +D 
Sbjct: 163 NQIKTKKLVSLSEQELVDCDTTENQGCNGGLMDPAFDFIKKR--GGITTEERYPYKAEDD 220

Query: 222 ACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQ 278
            C + K+ T  V I+G+  V  ++ D     V N P++VAI+A     QFY  GV     
Sbjct: 221 KCDIQKRNTPVVSIDGHEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGV----- 275

Query: 279 FFCDGGNENLSHSVLIVGYG--VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG--- 333
           F  + G E L H V IVGYG  VD TK       YWI+KNSWG GWGEKGY R+ R    
Sbjct: 276 FTGECGTE-LDHGVAIVGYGTTVDGTK-------YWIVKNSWGAGWGEKGYIRMQRKVDA 327

Query: 334 -DGSCGI 339
            +G CGI
Sbjct: 328 EEGLCGI 334


>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
          Length = 338

 Score =  200 bits (508), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 122/311 (39%), Positives = 171/311 (54%), Gaps = 27/311 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  +H K Y + VE   R+ IF+ N  KI     L      S   GLN+++D+   EF+ 
Sbjct: 30  FKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADMLHHEFKE 89

Query: 102 KYLGF------KLKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
              G+      +L+       +  + P N+ +P+A DWR++ AVT VKDQ  CGS W+FS
Sbjct: 90  TMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHCGSCWSFS 149

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           +TG++EG +  K   LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ EK
Sbjct: 150 STGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGVDTEK 207

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFY 269
           +YPY G D +C  NK        G+V + + DE  M K +   GP+AVAI+A   + Q Y
Sbjct: 208 SYPYEGIDDSCHFNKATVGATDTGFVDIPQGDEEAMMKAVATMGPVAVAIDASNESFQLY 267

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GV +      D    NL H VL+VGYG D+         YW++KNSWG  WG++GY +
Sbjct: 268 SEGVYNDPNCSSD----NLDHGVLVVGYGTDK-----DGQDYWLVKNSWGTTWGDQGYIK 318

Query: 330 LYRG-DGSCGI 339
           + R  D  CGI
Sbjct: 319 MARNQDNQCGI 329


>gi|261328618|emb|CBH11596.1| cysteine peptidase precursor, (fragment) [Trypanosoma brucei
           gambiense DAL972]
          Length = 404

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 115/306 (37%), Positives = 162/306 (52%), Gaps = 20/306 (6%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           + K Y    E   R   F  N+ + ++ Q   +    +G+  FSD++  EF+A+Y     
Sbjct: 2   YGKVYKDAKEEAFRFRAFEENMEQAKI-QAAANPYATFGVTPFSDMTREEFRARYRNGAS 60

Query: 109 KPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAA 165
             + A + +   + N+T    P A DWRE  AVT +KDQ  CGS WAF + GNIEG +  
Sbjct: 61  YFAAAQKRLRKTV-NVTTGRAPAAVDWREKGAVTPMKDQGQCGSCWAFYSIGNIEGQWQV 119

Query: 166 KTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKA 222
               LVSLSEQ L+ CD  D GC GG + NAF+ I++  GG +  E +YPY    G+   
Sbjct: 120 AGNPLVSLSEQMLVSCDTIDFGCGGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNGEQPQ 179

Query: 223 CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCD 282
           C++N       I  +V + +DE  +A YL ENGP+A+A++A +   Y  G+         
Sbjct: 180 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDATSFMDYNGGI------LTS 233

Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDY 342
             +E L H VL+VGY  +         PYWIIKNSW   WGE GY R+ +G   C +N  
Sbjct: 234 CTSEQLDHGVLLVGYNDNSNP------PYWIIKNSWSNMWGEDGYIRIEKGTNQCLMNQA 287

Query: 343 VRSALV 348
           V SA+V
Sbjct: 288 VSSAVV 293


>gi|146335580|gb|ABQ23399.1| cathepsin L isotype 2 [Trypanoplasma borreli]
          Length = 443

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 169/324 (52%), Gaps = 34/324 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF+ F   H + Y +  E   R  IF+ N++K   L + ++    +G NEF+D+S+ EFQ
Sbjct: 24  LFSDFKATHARNYVSPGEERKRFEIFAANMKKAAEL-NRKNPMATFGPNEFADMSSEEFQ 82

Query: 101 AKY-----------LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
            ++              K   S+    + A        +  DWR   AVT VK+Q  CGS
Sbjct: 83  TRHNAARHYAAAKARRAKHTKSFTKEEIKAADG-----QKIDWRLKGAVTSVKNQGSCGS 137

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
            W+FSTTGNIEG  A  T  LVSLSEQEL+ CD  D+GC GG + NAF  ++S  GG + 
Sbjct: 138 CWSFSTTGNIEGQNAIATGNLVSLSEQELVSCDTTDNGCNGGLMDNAFGWLISTRGGQIA 197

Query: 210 EEKTYPY---RGDDKAC--RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAY 264
            E +YPY    G   AC   L+ K     I+ +  ++  E DMA ++   GP+++ ++A 
Sbjct: 198 TEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAFVFNYGPLSIGVDAS 257

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
             Q Y  G    I  +C   +  + H VLIVGY  D T  T    PYWIIKNSW   WGE
Sbjct: 258 TWQSYAGG----IITYCP--DVQIDHGVLIVGY--DDTAPT----PYWIIKNSWTANWGE 305

Query: 325 KGYFRLYRGDGSCGINDYVRSALV 348
            GY R+ +G   CG+     S++V
Sbjct: 306 DGYIRVAKGSNMCGLTSTPSSSVV 329


>gi|343477446|emb|CCD11725.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 120/343 (34%), Positives = 184/343 (53%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYRDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  + N++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPE 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  C SSWAF+  GNIEG +     +L SLSEQ L+ CD  D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAGHELTSLSEQMLVSCDTNDLGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
             G +  AF  I+S   G +  E++YPY    G+  AC  + K     I+ +V +  +E 
Sbjct: 189 RAGFMDTAFKWIVSPNDGNVFTEQSYPYASGGGNVPACNKSGKVVGANIDDHVHILDNEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A++A + Q Y  GV           ++ ++ + L+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFQRYTGGV------LTSCISKEVNSAALLVGYD-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSWG+GWGE+GY R+ +G   C + DYV SA+V
Sbjct: 300 ---PPYWIIKNSWGKGWGEEGYIRIEKGTNQCRMKDYVSSAVV 339


>gi|113931178|ref|NP_001039033.1| cathepsin W [Xenopus (Silurana) tropicalis]
 gi|89269052|emb|CAJ83515.1| cathepsin W [Xenopus (Silurana) tropicalis]
          Length = 303

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 112/305 (36%), Positives = 168/305 (55%), Gaps = 22/305 (7%)

Query: 48  QHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK 107
           Q+N++Y T  E+  RL IFS NL++   LQ  E G+  YG+ +FSDL+  EF   +L   
Sbjct: 3   QYNRSYKTREEFKYRLRIFSENLKEASRLQREELGTAQYGVTKFSDLTDEEFSIYHLPTN 62

Query: 108 LKPSYADRSVPAMIPN----ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
           + P+      P ++      +  P + DWR  + ++  K+Q  C S WAF+   NIE  +
Sbjct: 63  ILPT------PPILKQSEEVLPFPTSCDWRTQNVISKAKNQRTCHSCWAFAAVANIEAQW 116

Query: 164 AAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC 223
           A    + +SLSEQ++IDC+   +GC GG   +AF T++ +  GGL  EK+YPY G    C
Sbjct: 117 AI-LGQTISLSEQQVIDCNTCRNGCSGGYAWDAFMTVLQQ--GGLTSEKSYPYTGHVSNC 173

Query: 224 RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDG 283
           R   +A    I+ +  + ++ET MA ++   G + V IN   L+ Y  G+   ++  CD 
Sbjct: 174 RKGFEAVGW-IHDFEMLKKNETAMASHVAHKGTLTVTINKAPLKHYQKGIVDTLRSNCDP 232

Query: 284 GNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
               + H VLIVGY           +P WI+KNSWGE WGEKG+FR++R   +CGI  Y 
Sbjct: 233 NY--VDHVVLIVGYR------GGGKLPQWILKNSWGEDWGEKGFFRMFRDKNACGITKYP 284

Query: 344 RSALV 348
            + +V
Sbjct: 285 VTCIV 289


>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 455

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 181/348 (52%), Gaps = 29/348 (8%)

Query: 6   FFAGVALLS-LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
            FA  AL S L +S+ S+     +K       +  +L+  +L +H K Y  L E   R  
Sbjct: 3   LFALFALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQ 62

Query: 65  IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA--MIP 122
           IF  NLR I   Q+ E+ +   GLN F+DL+  E++A+YLG K+ P+      P+    P
Sbjct: 63  IFKDNLRFIDQ-QNAENRTYKLGLNRFADLTNEEYRARYLGTKIDPNRRLGRTPSNRYAP 121

Query: 123 NI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
            +  TLP + DWR+  AV  VKDQ  CGS WAFS  G +EG+    T  L+SLSEQEL+D
Sbjct: 122 RVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQELVD 181

Query: 181 CDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYV 238
           CD   + GC GG +  AF+ I+    GG++ E+ YPY+G D  C    K A  V I+GY 
Sbjct: 182 CDTGYNMGCNGGLMDYAFEFIIKN--GGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYE 239

Query: 239 SVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
            V+  +    K  V N P++VA+       Q Y +GV      F       L H V+ VG
Sbjct: 240 DVNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGV------FTGRCGTALDHGVVAVG 293

Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
           YG D          +WI++NSWG  WGE+GY RL R       G CGI
Sbjct: 294 YGTD------NGHDFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGI 335


>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 126/306 (41%), Positives = 165/306 (53%), Gaps = 24/306 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F++Q++K Y+   E+ SR + F  N+  I+L     + S   GLNEF+DLS  EF+
Sbjct: 41  MFTAFMKQYSKAYSH-AEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFK 99

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            KY G+K       RS          P + DWR  +AVT +KDQ  CGS WAFS TG+IE
Sbjct: 100 GKYFGYKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIE 159

Query: 161 GVYAAKTK-KLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           G +  + K  L SLSEQ+L+DC     D GC GG +  AF+ I++    G+  E  YPY+
Sbjct: 160 GAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANK--GICAESAYPYK 217

Query: 218 GDDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
           G    C+  K  T+ V I+GY  V S DE  +   +   GP++VAI A     QFY +GV
Sbjct: 218 GVGGLCQ--KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGV 275

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                 F      NL H VL VGYG      T  +  YWI+KNSWG  WGE GY R+ R 
Sbjct: 276 ------FSGTCGHNLDHGVLAVGYG------TTGSQDYWIVKNSWGTSWGESGYIRMIRN 323

Query: 334 DGSCGI 339
              CGI
Sbjct: 324 KNQCGI 329


>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 351

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 142/349 (40%), Positives = 182/349 (52%), Gaps = 32/349 (9%)

Query: 10  VALLSLTVSV-----SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
           VA+L L V       S F +VG  +     H +   LF  +L +H K YA+  E   R  
Sbjct: 7   VAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEEKLHRFE 66

Query: 65  IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI 124
           +F  NL+ I  + + E  S   GLNEF+DL+  EF+  YLG    P+    S      N+
Sbjct: 67  VFKDNLKLIDEI-NREVTSYWLGLNEFADLTHDEFKTTYLGLSPPPARRSSSRSFRYENV 125

Query: 125 T---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
               LP+A DWR+  AVT VK+Q  CGS WAFST   +EG+ A  T  L +LSEQELIDC
Sbjct: 126 AAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDC 185

Query: 182 DQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ--VKINGYV 238
             + + GC GG +  AF  I S   GGL  E+ YPY  ++ +C   KK+    V I+GY 
Sbjct: 186 SVDGNSGCNGGMMDYAFSYIASS--GGLHTEEAYPYLMEEGSCGDGKKSESEAVSISGYE 243

Query: 239 SV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
            V ++DE  + K L    P++VAI A     QFY  GV      F       L H V  V
Sbjct: 244 DVPTKDEQALIKALAHQ-PVSVAIEASGRHFQFYSGGV------FDGPCGAQLDHGVAAV 296

Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           GYG D+     K   Y I+KNSWG  WGEKGY R+ RG    +G CGIN
Sbjct: 297 GYGSDKG----KGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSEGLCGIN 341


>gi|319976406|gb|ADV90878.1| cysteine proteinase B [Leishmania donovani]
          Length = 332

 Score =  199 bits (507), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 117/272 (43%), Positives = 163/272 (59%), Gaps = 21/272 (7%)

Query: 86  YGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTG 140
           +G+ +F DLS AEF A+YL     F     +A +       +++ +P A DWRE  AVT 
Sbjct: 5   FGITKFFDLSEAEFAARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP 64

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTI 200
           VK+Q  CGS WAFS  GNIE  +A     LVSLSEQ+L+ CD +D+GC GG +  AF+ +
Sbjct: 65  VKNQGACGSCWAFSAVGNIESQWARAGHGLVSLSEQQLVSCDDKDNGCNGGLMLQAFEWL 124

Query: 201 MSKLGGGLEEEKTYPY---RGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
           +  + G +  EK+YPY    GD   C   +K     +I+GYV +  +ET MA +L ENGP
Sbjct: 125 LRHMYGIVFTEKSYPYTSGNGDVAECLNSSKLVPGAQIDGYVMIPSNETVMAAWLAENGP 184

Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
           +A+A++A +   Y +GV       C G  + L+H VL+VGY  ++T      VPYW+IKN
Sbjct: 185 IAIAVDASSFMSYQSGV----LTSCAG--DALNHGVLLVGY--NKT----GGVPYWVIKN 232

Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           SWGE WGEKGY R+  G  +C +++Y  SA V
Sbjct: 233 SWGEDWGEKGYVRVAMGLNACLLSEYPVSAHV 264


>gi|440907378|gb|ELR57532.1| Cathepsin W [Bos grunniens mutus]
          Length = 382

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 122/336 (36%), Positives = 175/336 (52%), Gaps = 35/336 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  Q+N++Y    EY  RL IF+ NL K Q LQ+ + G+  +G+ +FSDL+  EF 
Sbjct: 41  VFRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFV 100

Query: 101 AKY----LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
             Y     G  L  S   R V +     + PR  DWR+   ++ V+DQ  C   WA +  
Sbjct: 101 QLYGSQVAGEALGVS---RKVGSEEWGESEPRTCDWRKVGPISLVRDQRNCNCCWAMAAA 157

Query: 157 GNIEGVYAAKTKKLVSLSEQ--------ELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           GNIE ++A K +  V +S Q        EL+DCD+  +GC GG + +AF T+++    GL
Sbjct: 158 GNIEALWAIKFRHFVEVSVQRMAGGRGWELLDCDRCGNGCRGGFVWDAFLTVLNN--SGL 215

Query: 209 EEEKTYPYRGDDKACR-LNKKATQVK-INGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
             EK YP+ G  K  R L KK  +V  I  ++ +   E  MA++L   GP+ V IN   L
Sbjct: 216 ASEKDYPFDGSGKTHRCLAKKYKKVAWIQDFIILQACEQSMARHLATEGPITVTINMTLL 275

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT--------------KFTHKAVPYW 312
           Q Y  GV       CD     + HSVL+VG+G  ++                  +++ YW
Sbjct: 276 QQYQKGVIKATPTTCD--PTQVDHSVLLVGFGKTKSGEGRQGKAASFGSYARPRRSMAYW 333

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
            +KNSWG  WGE+GYFRL+RG  +CGI  +  +A V
Sbjct: 334 TLKNSWGPQWGEEGYFRLHRGSNTCGITKFPVTARV 369


>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
 gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
 gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
 gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
 gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
 gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
          Length = 365

 Score =  199 bits (506), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 130/341 (38%), Positives = 175/341 (51%), Gaps = 35/341 (10%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   +VG  +     H +   LF  F+ ++ K Y++L E   R  +F  NL  I   ++ 
Sbjct: 30  SELSIVGYSEEDLASHERLMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHID--EEN 87

Query: 80  EHGSGVY-GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM----IPNITLPRAFDWRE 134
           +  +G + GLNEF+DL+  EF+A YLG  L P+  + +        +   +LP+  DWR+
Sbjct: 88  KKITGYWLGLNEFADLTHDEFKAAYLGLTLTPARRNSNDQLFRYEEVEAASLPKEVDWRK 147

Query: 135 YDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSI 193
             AVT VK+Q  CGS WAFST   +EG+ A  T  L  LSEQELIDCD + ++GC GG +
Sbjct: 148 KGAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLM 207

Query: 194 SNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL--------NKKATQVKINGYVSVSRDET 245
             AF  I +   GGL  E++YPY  ++  CR          + A  V I+GY  V R+  
Sbjct: 208 DYAFSYIAAN--GGLHTEESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNE 265

Query: 246 DMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
                 + + P++VAI A     QFY  GV      F       L H V  VGYG     
Sbjct: 266 QALLKALAHQPVSVAIEASGRNFQFYSGGV------FDGPCGTRLDHGVTAVGYGT---- 315

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
              K   Y I+KNSWG  WGEKGY R+ RG    DG CGIN
Sbjct: 316 -ASKGHDYIIVKNSWGSHWGEKGYIRMRRGTGKHDGLCGIN 355


>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
          Length = 364

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 127/307 (41%), Positives = 170/307 (55%), Gaps = 24/307 (7%)

Query: 50  NKTYATLVEYYSR-LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK- 107
           N+ YA+  E Y R  +I+  NLR      +  H S    +  ++DLS  E+++K LG+  
Sbjct: 58  NRAYASSAEVYERRFNIWLDNLRFAHEY-NARHTSHWLSMGVYADLSQDEYRSKALGYNA 116

Query: 108 -LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
            L      R+ P +      P   DW    AVT VKDQ +CGS WAFSTTG +EG  A  
Sbjct: 117 HLHKKRPLRAAPFLYKGTVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIA 176

Query: 167 TKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL 225
           T KLVSLSEQ L+DCD+E D GC GG + +AFD I++   GG++ E  YPYR +D  C+ 
Sbjct: 177 TGKLVSLSEQMLVDCDREYDTGCRGGFMDSAFDFIVNN--GGIDTEDDYPYRAEDGICQD 234

Query: 226 NKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCD 282
           N+     V I+GY  V  ++ +     V + P++VAI A   A Q Y  GV     F  +
Sbjct: 235 NRTRRHVVTIDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGV-----FDAE 289

Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG------DGS 336
            G   L H+VL+VGYG   +  TH  +PYW++KNSWG  WGEKGY RL R       +G 
Sbjct: 290 CGTA-LDHAVLVVGYGT-ASNGTHN-LPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQ 346

Query: 337 CGINDYV 343
           CG+  Y 
Sbjct: 347 CGLAMYA 353


>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
          Length = 496

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 125/313 (39%), Positives = 164/313 (52%), Gaps = 26/313 (8%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           T LF  +L  H K+Y  L E   R  IF  NLR I      E      GLN+F+DL+  E
Sbjct: 42  TTLFESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEE 101

Query: 99  FQAKYLGFKLKPSYADRSVP----AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           +++KY G K K      S      A +   +LP + DWRE  AV  VKDQ  CGS WAFS
Sbjct: 102 YRSKYTGIKSKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGSCWAFS 161

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           T   +EG+    T KL++LSEQEL+DCD+  ++GC GG +  AF+ I++   GG++ +  
Sbjct: 162 TISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINN--GGIDTDVD 219

Query: 214 YPYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYV 270
           YPY G D  C +  K A  V I+ Y  V   +    K    N P++VAI A     QFY 
Sbjct: 220 YPYTGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYD 279

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +G+      F       L H V++VGYG +  K       YWI++NSWG  WGE GY R+
Sbjct: 280 SGI------FTGKCGIALDHGVVVVGYGTENGK------DYWIVRNSWGADWGENGYLRM 327

Query: 331 YRG----DGSCGI 339
            RG     G CGI
Sbjct: 328 ERGISSKTGICGI 340


>gi|297688135|ref|XP_002821545.1| PREDICTED: cathepsin W [Pongo abelii]
          Length = 376

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 120/331 (36%), Positives = 175/331 (52%), Gaps = 33/331 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  Q N++Y +  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF  
Sbjct: 42  FKLFQIQFNRSYLSPEEHAHRLDIFANNLAQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query: 102 KYLGFKLKPSYADRSVPAMIPNI-------TLPRAFDWREY-DAVTGVKDQTMCGSSWAF 153
            Y G++     A   VP+M   I       ++P   DWR+   A++ +KDQ  C   WA 
Sbjct: 102 LY-GYR----RAAGGVPSMGREIRSEELEESVPFTCDWRKVAGAISPIKDQKNCNCCWAM 156

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           +  GNIE ++       V +S QEL+DC +  DGC GG + +AF T+++    GL  EK 
Sbjct: 157 AAAGNIETLWRINFWDFVDVSVQELLDCGRCGDGCHGGFVWDAFITVLNN--SGLASEKD 214

Query: 214 YPYRGDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
           YP++G  +A R + K  Q    I  ++ +  +E  +A+YL   GP+ V IN   LQ Y  
Sbjct: 215 YPFQGKVRAHRCHPKKYQKVAWIQDFIMLQNNEHRIAQYLATYGPITVTINMKLLQLYRK 274

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK--------------FTHKAVPYWIIKNS 317
           GV       CD   + + HSVL+VG+G  +++                    PYWI+KNS
Sbjct: 275 GVIKATPTTCD--PQLVDHSVLLVGFGNVKSEEGIWAETVLSQSQPQPPHPTPYWILKNS 332

Query: 318 WGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           WG  WGEKGYFRL+RG  +CGI  +  +A V
Sbjct: 333 WGAQWGEKGYFRLHRGSNTCGITKFPLTARV 363


>gi|146335578|gb|ABQ23398.1| cathepsin L isotype 1 [Trypanoplasma borreli]
          Length = 443

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 169/324 (52%), Gaps = 34/324 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF+ F   H + Y +  E   R  IF+ N++K   L + ++    +G NEF+D+S+ EFQ
Sbjct: 24  LFSDFKATHARNYVSPGEERKRFEIFAANMKKAAEL-NRKNPMATFGPNEFADMSSEEFQ 82

Query: 101 AKY-----------LGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
            ++              K   S+    + A        +  DWR   AVT VK+Q  CGS
Sbjct: 83  TRHNAARHYAAAKARRAKHTKSFTKEEIKA-----ADGQKIDWRLKGAVTSVKNQGSCGS 137

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLE 209
            W+FSTTGNIEG  A  T  LVSLSEQEL+ CD  D+GC GG + NAF  ++S  GG + 
Sbjct: 138 CWSFSTTGNIEGQNAIATGNLVSLSEQELVSCDTTDNGCNGGLMDNAFGWLISTRGGQIA 197

Query: 210 EEKTYPY---RGDDKAC--RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAY 264
            E +YPY    G   AC   L+ K     I+ +  ++  E DMA ++   GP+++ ++A 
Sbjct: 198 TEASYPYVSGNGIVPACSYNLDNKPVGATISNFQDITGTEEDMAAFVFNYGPLSIGVDAS 257

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
             Q Y  G    I  +C   +  + H VLIVGY  D T  T    PYWIIKNSW   WGE
Sbjct: 258 TWQSYAGG----IITYCP--DVQIDHGVLIVGY--DDTAPT----PYWIIKNSWTANWGE 305

Query: 325 KGYFRLYRGDGSCGINDYVRSALV 348
            GY R+ +G   CG+     S++V
Sbjct: 306 DGYIRVAKGSNMCGLTSTPSSSVV 329


>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
 gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 123/305 (40%), Positives = 160/305 (52%), Gaps = 25/305 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ Q+ + Y   VE   RL+IF  N+  I+             +NEF+DL+  EFQA   
Sbjct: 7   WMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEEFQASRN 66

Query: 105 GFKLKPSYADRSV-PAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           G+K+    +  S  P    N++ +P   DWR+  AVT +KDQ  CG  WAFS     EG+
Sbjct: 67  GYKMSAHLSSSSTKPFRYENVSAVPSTMDWRKKGAVTPIKDQGQCGCCWAFSAVAATEGI 126

Query: 163 YAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
               T KL+SLSEQEL+DCD   ED GC GG + +AFD I+     GL  E  YPY+G D
Sbjct: 127 TQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNK--GLTTEANYPYQGAD 184

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQ 278
            AC   K A   KI GY  V  +        V N P++VAI+A   A QFY +GV     
Sbjct: 185 GACNSGKAA--AKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYSSGV----- 237

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
           F  D G + L H V  VGYG+     +     YW++KNSWG  WGE GY R+ R     +
Sbjct: 238 FTGDCGTD-LDHGVTAVGYGM-----SDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQE 291

Query: 335 GSCGI 339
           G CGI
Sbjct: 292 GLCGI 296


>gi|1581747|prf||2117247C Cys protease:ISOTYPE=3
          Length = 469

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 116/317 (36%), Positives = 165/317 (52%), Gaps = 24/317 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL-LQDTEHGSGVYGLNEFSDLSTAEFQ 100
           F  F ++H K Y +  E   RL +F  NL   +L      H S  +G+  FSDL+  EF+
Sbjct: 38  FAAFKQRHGKVYGSAAEETFRLGVFKENLLFARLHAAANPHAS--FGVTPFSDLTREEFR 95

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITL------PRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           ++Y       + A + V   +           P A DWR   AVT +KDQ  C S WAFS
Sbjct: 96  SRYHNAAAHFAAAQKRVRVPVEVEVEVEVGGAPAAVDWRARGAVTAIKDQGNCSSCWAFS 155

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           T GNIEG +      L  LSEQ L+ CD  D+GC+GG + +AFD I+ +  G +  E +Y
Sbjct: 156 TIGNIEGQWHLAGNPLTGLSEQMLVSCDNADNGCDGGLMDSAFDWIVEQNNGSVYTEASY 215

Query: 215 PY---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            Y    GD + C ++       I+G+V + +DE  MA +L  NGP+A+A++A +   Y  
Sbjct: 216 SYVSGGGDSQTCDMSDHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDATSFMSYTG 275

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV        +  ++ L H V++VGY            PYWIIKNSWG  WGE+GY R+ 
Sbjct: 276 GV------LTNCVSDQLDHGVVLVGYNDSSNP------PYWIIKNSWGADWGEEGYIRIQ 323

Query: 332 RGDGSCGINDYVRSALV 348
           +G   C + +Y  SA+V
Sbjct: 324 KGTNQCLVKNYACSAVV 340


>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
          Length = 364

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 131/346 (37%), Positives = 183/346 (52%), Gaps = 41/346 (11%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           LLS T S ++ M + +   + +       ++  +L +H K Y  L E   R  +F  NL 
Sbjct: 11  LLSFTFSHATAMSIINYSENEVMD-----MYEEWLVKHRKVYNGLDEKEKRFQVFKDNLG 65

Query: 72  KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP---------AMIP 122
            IQ   + ++ +   GLN+F+D++  E++A YLG +     A R V          A   
Sbjct: 66  FIQD-HNAQNNTYTLGLNKFADITNEEYRAMYLGTRTD---AKRRVMKTQNTGHRYAYNS 121

Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
              LP   DWR   AV  +KDQ  CGS WAFST   +EG+    T + VSLSEQEL+DCD
Sbjct: 122 GDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCD 181

Query: 183 QE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV 240
           +E D+GC GG +  AF  I+    GG++ E+ YPY+G D  C   KK T+ V+I+GY  V
Sbjct: 182 REYDEGCNGGLMDYAFQFIIQN--GGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYEDV 239

Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
             +  +  K  V + P++VAI A   ALQ Y +GV      F       L H V++VGYG
Sbjct: 240 PSNNENALKKAVSHQPVSVAIEASGRALQLYQSGV------FTGKCGTALDHGVVVVGYG 293

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
                 T   V YW+++NSWG GWGE GYF++ R      +G CGI
Sbjct: 294 ------TENGVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGI 333


>gi|343473977|emb|CCD14279.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 120/343 (34%), Positives = 183/343 (53%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYRDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  + N++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPE 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  C SSWAF+  GNIEG +     +L SLSEQ L+ CD  D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAGHELTSLSEQMLVSCDTNDLGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
             G +  AF  I+S   G +  E++YPY    G+  AC  + K     I  +V +  +E 
Sbjct: 189 RAGFMDTAFKWIVSPNDGNVFTEQSYPYASGGGNVPACNKSGKVVGANIRDHVHILDNEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A++A + Q Y  GV           ++ ++ + L+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFQRYTGGV------LTSCISKEVNSAALLVGYD-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSWG+GWGE+GY R+ +G   C + DYV SA+V
Sbjct: 300 ---PPYWIIKNSWGKGWGEEGYIRIEKGTNQCRMKDYVSSAVV 339


>gi|115457680|ref|NP_001052440.1| Os04g0311400 [Oryza sativa Japonica Group]
 gi|113564011|dbj|BAF14354.1| Os04g0311400, partial [Oryza sativa Japonica Group]
          Length = 384

 Score =  199 bits (506), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 105/233 (45%), Positives = 139/233 (59%), Gaps = 20/233 (8%)

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
           LP  FDWRE+ AV  VKDQ  CGS W+FST+G +EG +   T KL  LSEQ+++DCD E 
Sbjct: 148 LPDDFDWREHGAVGPVKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQQMVDCDHEC 207

Query: 185 --------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
                   D GC GG ++ AF  +M    GGL+ EK YPY G +  C+ +K     ++  
Sbjct: 208 DASESRACDSGCNGGLMTTAFSYLMKS--GGLQSEKDYPYAGRENTCKFDKSKIVAQVKN 265

Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
           +  +S +E  +A  LV++GP+A+AINA  +Q Y+ GVS P  F C     +L H VL+VG
Sbjct: 266 FSVISVNEDQIAANLVKHGPLAIAINAAYMQTYIGGVSCP--FIC---GRHLDHGVLLVG 320

Query: 297 YG-VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---DGSCGINDYVRS 345
           YG         K  PYWIIKNSWGE WGEKGY+++ RG      CG++  V S
Sbjct: 321 YGSAGYAPIRFKEKPYWIIKNSWGENWGEKGYYKICRGPHDKNKCGVDSMVSS 373


>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
 gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
          Length = 339

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 128/352 (36%), Positives = 189/352 (53%), Gaps = 43/352 (12%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
            +A +AL+++  +VS   V+ +E             ++ F  +H KTY    E   RL I
Sbjct: 4   LYALLALVAVAQAVSFADVIKEE-------------WHTFKLEHRKTYQDETEERFRLKI 50

Query: 66  FSGNLRKIQLLQDTEHGSG----VYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
           F+ N  KI    +  + +G       +N+++D+   EF+    GF        R+     
Sbjct: 51  FNENKHKI-AKHNQRYATGEVTFKMAVNKYADMLHHEFRETMNGFNYTLHKELRASDPSF 109

Query: 122 PNIT--------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL 173
             IT        LP++ DWRE  AVT VKDQ  CGS WAFS+TG +EG +  KT  LVSL
Sbjct: 110 TGITFISPAHVKLPKSVDWREKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKTGTLVSL 169

Query: 174 SEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
           SEQ L+DC  +  ++GC GG + NAF  I  K  GG++ EK+YPY G D +C  NK +  
Sbjct: 170 SEQNLVDCSAKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEGIDDSCHFNKDSVG 227

Query: 232 VKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENL 288
               G+  + + +E  MA+ +   GP++VAI+A   + QFY  G+ +  +  C+  ++NL
Sbjct: 228 ATDRGFADIPQGNEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPE--CN--SQNL 283

Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
            H VL+VGYG D +        YW++KNSWG  WG+KG+ ++ R  D  CGI
Sbjct: 284 DHGVLVVGYGTDES-----GKDYWLVKNSWGTTWGDKGFIKMARNEDNQCGI 330


>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
 gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
          Length = 343

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 113/308 (36%), Positives = 165/308 (53%), Gaps = 26/308 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSDLSTAEFQAKY 103
           ++ +H + YA   E  +R  +F  N+  I+ L + ++G +    +N+F+DL+  EF++ Y
Sbjct: 40  WMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMY 99

Query: 104 LGFKLKPSYADRSVPAM-----IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            G+K     + R+ P       + +  LP + DWR+  AVT +KDQ  CGS WAFS    
Sbjct: 100 TGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 159

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IEGV   K  KL+SLSEQEL+DCD  DDGC GG +++AF+  M+   GGL  E  YPY+ 
Sbjct: 160 IEGVAQIKKGKLISLSEQELVDCDTNDDGCMGGYMNSAFNYTMTT--GGLTSESNYPYKS 217

Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
            D  C +NK K     I G+  V  ++       V + P+++ I       QFY +GV  
Sbjct: 218 TDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGV-- 275

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD- 334
               F    + +L H V +VGYG      +     YWI+KNSWG  WGE+GY R+ +   
Sbjct: 276 ----FSGECSTHLDHGVAVVGYGK-----SSNGSKYWILKNSWGPKWGERGYMRIKKDTK 326

Query: 335 ---GSCGI 339
              G CG+
Sbjct: 327 AKHGQCGL 334


>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 496

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 126/313 (40%), Positives = 164/313 (52%), Gaps = 27/313 (8%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           +++  +L +H K Y  L E   R  IF  NLR I      E  +   GLN F+DL+  E+
Sbjct: 77  SMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEY 136

Query: 100 QAKYLGFKLKPSYADRSVPA--MIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
           +AKYLG K+ P+      P+    P +   LP + DWR+  AV  VKDQ  CGS WAFS 
Sbjct: 137 RAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSA 196

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
            G +EG+    T +L+SLSEQEL+DCD   ++GC GG +  AF+ I++   GG++ E+ Y
Sbjct: 197 IGAVEGINKIVTGELISLSEQELVDCDTGYNEGCNGGLMDYAFEFIINN--GGIDSEEDY 254

Query: 215 PYRGDDKACRL-NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVT 271
           PYRG D  C    K A  V I+ Y  V   +    K  V N P++VAI       Q YV+
Sbjct: 255 PYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVS 314

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV      F       L H V+ VGYG      T     YWI++NSWG  WGE GY RL 
Sbjct: 315 GV------FTGRCGTALDHGVVAVGYG------TANGHDYWIVRNSWGPSWGEDGYIRLE 362

Query: 332 RG-----DGSCGI 339
           R       G CGI
Sbjct: 363 RNLANSRSGKCGI 375


>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
          Length = 463

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 131/347 (37%), Positives = 174/347 (50%), Gaps = 32/347 (9%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHT-----ALFNYFLEQHNKTYATLVEYYSRLH 64
             LL L  + SS + +        H  + T     A++  +L  H K Y  + E   R  
Sbjct: 10  ACLLFLCFAFSSALDMSIISYDQTHPPQRTDAEAMAIYEKWLTTHGKAYNAIGEKERRFE 69

Query: 65  IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAM 120
           IF  NLR +    +   GS   GLN F+DL+  E+++ +LG     K + +       A 
Sbjct: 70  IFKDNLRFVDE-HNAVAGSYRVGLNRFADLTNEEYRSMFLGGNMEMKERSASTKSDRYAF 128

Query: 121 IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
                LP + DWRE  AV+ VKDQ  CGS WAFST   +EG+    T +L+SLSEQEL+D
Sbjct: 129 RAGDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELVD 188

Query: 181 CDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYV 238
           CD+  + GC GG +   F  I++   GG++ E+ YPYR  D  C +  K A  V INGY 
Sbjct: 189 CDKSYNMGCNGGLMDYGFQFIINN--GGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYE 246

Query: 239 SVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
            V  D+ +  K  V N P++VAI A   A Q Y +GV      F      NL H V+ VG
Sbjct: 247 DVPEDDENSLKKAVANQPVSVAIEAGGRAFQLYESGV------FTGHCGTNLDHGVVAVG 300

Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           YG      T   V YW ++NSWG  WGE GY +L R      G CGI
Sbjct: 301 YG------TENGVDYWTVRNSWGPKWGENGYIKLERNINATSGKCGI 341


>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
          Length = 359

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 130/353 (36%), Positives = 180/353 (50%), Gaps = 37/353 (10%)

Query: 6   FFAGVALLSLTVSVSSFMV-VGDEKLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSR 62
            F+ + + S   SV++  + + D+ L         +L+N +     H+     L E   R
Sbjct: 3   LFSLILVASFLASVAATAIDIADKDLE-----TEDSLWNLYERWRSHHTVSRDLDEKQKR 57

Query: 63  LHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS------ 116
            ++F  N R I      +       LN+F+DL+  EF++ Y G ++    + R       
Sbjct: 58  FNVFKENPRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRGSRRGGA 117

Query: 117 ----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
               +   + + +LP + DWR+  AVT VKDQ  CGS WAFST   +EG+   KTKKL+S
Sbjct: 118 TNSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLLS 177

Query: 173 LSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
           LSEQELIDCD  E++GC GG +  AFD I  K  GG+  E  YPY  +D  C   KK+  
Sbjct: 178 LSEQELIDCDTDENNGCNGGLMDYAFDFI--KKNGGISSEAEYPYAAEDSYCATEKKSHV 235

Query: 232 VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLS 289
           V I+G+  V  ++ D     V N P+++AI A  Y  QFY  GV     F    G E L 
Sbjct: 236 VSIDGHEDVPANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGV-----FTGRSGTE-LD 289

Query: 290 HSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS---CGI 339
           H V IVGYG      T +   YWI++NSWG  WGEKGY R+     S   CG+
Sbjct: 290 HGVAIVGYGK-----TQQGTKYWIVRNSWGAEWGEKGYIRISAASDSKRLCGL 337


>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
          Length = 444

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 124/343 (36%), Positives = 183/343 (53%), Gaps = 25/343 (7%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  + N++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPE 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  CGS WAFS  GNIEG +     +L SLSEQ L+ CD  D GC
Sbjct: 129 AVDWRKKGAVTPVKDQGQCGSCWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTNDFGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
           EGG + +AF  I+S   G +  E++YPY    G+   C  + K    KI  +V +  DE 
Sbjct: 189 EGGLMDDAFKWIVSSNKGNVFTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLPEDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A++A + Q Y  GV           +E+L H VL+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFQSYTGGV------LTSCISEHLDHGVLLVGYD-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSW +GWGE+GY  L R +  C + +   SA+V
Sbjct: 300 ---PPYWIIKNSWSKGWGEEGYSALRRHN-QCLMKNLPSSAVV 338


>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 476

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 131/340 (38%), Positives = 175/340 (51%), Gaps = 28/340 (8%)

Query: 14  SLTVSVSSFMVVGDEKLHHLHHVKH-TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           +L +S+ S+     +K   L   +   +++  +L +H K Y  L E   R  IF  NLR 
Sbjct: 30  ALDMSIISYDSAHADKAATLRTEEELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRF 89

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA--MIPNI--TLPR 128
           I      E  +   GLN F+DL+  E++AKYLG K+ P+      P+    P +   LP 
Sbjct: 90  IDDHNSAEDRTYKLGLNRFADLTNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPD 149

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDG 187
           + DWR+  AV  VKDQ  CGS WAFS  G +EG+    T +L+SLSEQEL+DCD   + G
Sbjct: 150 SVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNQG 209

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQVKINGYVSVSRDETD 246
           C GG +  AF+ I++   GG++ ++ YPYRG D  C    K A  V I+ Y  V   +  
Sbjct: 210 CNGGLMDYAFEFIINN--GGIDSDEDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDEL 267

Query: 247 MAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
             K  V N P++VAI       Q YV+GV      F       L H V+ VGYG      
Sbjct: 268 ALKKAVANQPVSVAIEGGGREFQLYVSGV------FTGRCGTALDHGVVAVGYG------ 315

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
           T K   YWI++NSWG  WGE GY RL R       G CGI
Sbjct: 316 TAKGHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGI 355


>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
           [Brachypodium distachyon]
          Length = 415

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 124/345 (35%), Positives = 185/345 (53%), Gaps = 37/345 (10%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           +A+L+ T +VS+              +   A    ++ ++ + Y  + E   RL +F  N
Sbjct: 84  IAILACTCAVSALAA-----RDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKAN 138

Query: 70  LRKIQLLQDTEHGSGVYGL--NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL- 126
           +  I+L+     G+  + L  N+F+D++  EF+A + G+K  P+   R+      N++L 
Sbjct: 139 VAFIELVN---AGNDKFSLEANQFADMTVDEFRAAHTGYKPVPANKGRTTQFKYANVSLD 195

Query: 127 --PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
             P + DWR   AVT +KDQ  CG  WAFST  ++EG+    T KL+SLSEQEL+DCD +
Sbjct: 196 ALPASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVD 255

Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSV- 240
             D GCEGG + NAF+ I+    GGL  E  YPY G D +C  NK++  V  I GY  V 
Sbjct: 256 GMDQGCEGGLMDNAFEFIIDN--GGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVP 313

Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
           S DET + K +    P+++A++      +FY  GV   +   C  G E L H +  VGYG
Sbjct: 314 SNDETSLLKAVAAQ-PVSIAVDGGDNLFRFYKGGV---LSGAC--GTE-LDHGIAAVGYG 366

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           +     T     +W++KNSWG  WGEKG+ R+ R     +G CG+
Sbjct: 367 I-----TSDGTKFWLMKNSWGTSWGEKGFIRMERDIADEEGLCGL 406


>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
 gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
          Length = 364

 Score =  199 bits (505), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 131/346 (37%), Positives = 183/346 (52%), Gaps = 41/346 (11%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           LLS T S ++ M + +   + +       ++  +L +H K Y  L E   R  +F  NL 
Sbjct: 11  LLSFTFSHATAMSIINYSENEVMD-----MYEEWLVKHRKVYNGLDEKEKRFQVFKDNLG 65

Query: 72  KIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP---------AMIP 122
            IQ   + ++ +   GLN+F+D++  E++A YLG +     A R V          A   
Sbjct: 66  FIQD-HNAQNNTYTLGLNKFADITNKEYRAMYLGTRTD---AKRRVMKTQNTGHRYAYNS 121

Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
              LP   DWR   AV  +KDQ  CGS WAFST   +EG+    T + VSLSEQEL+DCD
Sbjct: 122 GDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCD 181

Query: 183 QE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV 240
           +E D+GC GG +  AF  I+    GG++ E+ YPY+G D  C   KK T+ V+I+GY  V
Sbjct: 182 REYDEGCNGGLMDYAFQFIIQN--GGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYEDV 239

Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
             +  +  K  V + P++VAI A   ALQ Y +GV      F       L H V++VGYG
Sbjct: 240 PSNNENALKKAVSHQPVSVAIEASGRALQLYQSGV------FTGKCGTALDHGVVVVGYG 293

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
                 T   V YW+++NSWG GWGE GYF++ R      +G CGI
Sbjct: 294 ------TENGVDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGI 333


>gi|410493601|ref|YP_006908539.1| V-CATH [Epinotia aporema granulovirus]
 gi|354805035|gb|AER41457.1| V-CATH [Epinotia aporema granulovirus]
          Length = 329

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 120/317 (37%), Positives = 182/317 (57%), Gaps = 21/317 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF+ F+ ++NK YAT  E  ++  IF  NL  I   ++++  + +Y +N  SDL+  E 
Sbjct: 26  ALFDDFVIKYNKVYATDEERAAKYEIFRNNLVVINE-KNSKTTNALYDINRLSDLNKNEL 84

Query: 100 QAKYLGF------KLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             +  GF       L PS   +  + A  P+ +LP +FDWR  +AVT VK+Q  CGS WA
Sbjct: 85  -LRSTGFSVNLKKNLNPSKECEYVLVADAPSRSLPASFDWRANNAVTPVKNQLDCGSCWA 143

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FST  NIE +YA K    V L+EQ L++CD  ++ C GG +  A + I+    GG+ EE+
Sbjct: 144 FSTIANIESLYAIKYGVEVDLAEQYLLNCDYTNNNCNGGLMHWALENILINDNGGVVEER 203

Query: 213 TYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
             PY G+  AC   +    +      ++  + T + + L+ENGP++VAI+ + +  Y  G
Sbjct: 204 HAPYVGEVTACDKEEYLFTITNCKRFNLVNEHT-LQQLLIENGPISVAIDVFDILDYKQG 262

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           +S   +   D G   L+H+VL+VGYGV     +   +PYW+ KNSWG+ WGE+G+FR+ R
Sbjct: 263 ISDNCR--SDNG---LNHAVLLVGYGV-----SINGIPYWVFKNSWGDDWGEQGFFRVRR 312

Query: 333 GDGSCG-INDYVRSALV 348
              SCG +N Y  SA++
Sbjct: 313 DINSCGMMNAYAASAVL 329


>gi|343472974|emb|CCD15016.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 361

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 120/343 (34%), Positives = 182/343 (53%), Gaps = 24/343 (6%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLLAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  + N++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGRPPM 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
             DWR+  AVT VKDQ  C SSWAFS TGNIEG +     +L SLSEQ L+ CD +D GC
Sbjct: 129 TVDWRKKGAVTPVKDQGKCDSSWAFSATGNIEGQWKVAGHELTSLSEQMLVSCDTDDLGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
             G    AF+ I+S   G +  E++YPY    G+   C  + K    KI  +V ++RDE 
Sbjct: 189 RDGFPDIAFNWIVSSNKGNVFTEQSYPYASGGGNVPTCDKSGKVVGAKIRDHVDLARDED 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L   GP A+ ++A + Q Y  GV           ++ ++ + L+VGY  D +K  
Sbjct: 249 MIAEWLARKGPAAITVDATSFQRYTGGV------LTSCISKEMNSAALLVGYD-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
               PYWIIKNSWG+GWGE+GY R+ +G   C + +Y RSA+V
Sbjct: 300 ---PPYWIIKNSWGKGWGEEGYIRIEKGTNQCLVQEYARSAVV 339


>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
          Length = 332

 Score =  198 bits (504), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 113/308 (36%), Positives = 165/308 (53%), Gaps = 26/308 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSDLSTAEFQAKY 103
           ++ +H + YA   E  +R  +F  N+  I+ L + ++G +    +N+F+DL+  EF++ Y
Sbjct: 34  WMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMY 93

Query: 104 LGFKLKPSYADRSVPAM-----IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            G+K     + R+ P       + +  LP + DWR+  AVT +KDQ  CGS WAFS    
Sbjct: 94  TGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAA 153

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IEGV   K  KL+SLSEQEL+DCD  DDGC GG +++AF+  M+   GGL  E  YPY+ 
Sbjct: 154 IEGVAQIKKGKLISLSEQELVDCDTNDDGCMGGYMNSAFNYTMTT--GGLTSESNYPYKS 211

Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAYALQFYVTGVSH 275
            D  C +NK K     I G+  V  ++       V + P+++ I       QFY +GV  
Sbjct: 212 TDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQFYSSGV-- 269

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD- 334
               F    + +L H V +VGYG      +     YWI+KNSWG  WGE+GY R+ +   
Sbjct: 270 ----FSGECSTHLDHGVAVVGYGK-----SSNGSKYWILKNSWGPKWGERGYMRIKKDTK 320

Query: 335 ---GSCGI 339
              G CG+
Sbjct: 321 AKHGQCGL 328


>gi|375073982|gb|AFA34858.1| cathepsin L-like protein [Trypanosoma dionisii]
          Length = 467

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 113/312 (36%), Positives = 160/312 (51%), Gaps = 18/312 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F +++ + Y +  E   RL +F  NL   +L     +    +G+  FSDL+  EF++
Sbjct: 38  FADFKQRYGRVYKSAAEEAFRLSVFRKNLLDAKL-HAAANPHATFGVTPFSDLTREEFRS 96

Query: 102 KYL--GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
           ++               VP  +     P A DWR+  AVT VKDQ  CGS WAFS  GN+
Sbjct: 97  RHHSGAAHFAAGRKRARVPVDVGVGDAPAAVDWRDRGAVTPVKDQGQCGSCWAFSAIGNV 156

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           EG +      L SLSEQ L+ CD  D GC+GG +++AF+ I+    G +  E++Y Y   
Sbjct: 157 EGQWFLAGNALTSLSEQMLVSCDTMDSGCDGGLMNSAFEWIVEHHNGTVYTEESYRYASG 216

Query: 220 D---KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
           D   + CR + +     I G+V +  DE  MA +L  NGP+AVA++A +  FY  GV   
Sbjct: 217 DGIAQPCRTSGRTVGAVITGHVKLPPDEAKMATWLAANGPLAVAVDASSWMFYTGGV--- 273

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
                   +  L H VL+VGY          A PYWI+KNSWG  WGE GY R+ +G   
Sbjct: 274 ---LTSCVSNELDHGVLLVGYN------DSAAPPYWIVKNSWGTLWGEDGYVRIAKGTNQ 324

Query: 337 CGINDYVRSALV 348
           C + +   SA+V
Sbjct: 325 CLVKEEASSAVV 336


>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
          Length = 334

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 131/356 (36%), Positives = 187/356 (52%), Gaps = 38/356 (10%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           F  G  L+ L+ ++S   ++ DE             ++ F   H K Y + +E   R+ I
Sbjct: 4   FLLGAVLVQLSAALSLTNLLADE-------------WHLFKATHKKEYPSQLEEKFRMKI 50

Query: 66  FSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SYADRSVPA 119
           +  N  K+    +L +    S    +N+F DL   EF++   G++ K    S A+ +   
Sbjct: 51  YLENKHKVAKHNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTF 110

Query: 120 MIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
           M P N+T+P + DWRE  A+T VKDQ  CGS WAFS+TG +EG    KT KLVSLSEQ L
Sbjct: 111 MEPANVTVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNL 170

Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
           IDC  +  ++GC GG +  AF  I  K   G++ E TYPY  +D  CR N +       G
Sbjct: 171 IDCSGKYGNEGCNGGLMDQAFQYI--KDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRG 228

Query: 237 YVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           +V +   E D  K  V   GP++VAI+A   + QFY  GV +  +  CD  +++L H VL
Sbjct: 229 FVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYY--EPSCD--SDDLDHGVL 284

Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           +VGYG D  K       YW++KNSW E WG++GY ++ R     CG+       LV
Sbjct: 285 VVGYGSDNGK------DYWLVKNSWSEHWGDEGYIKMARNRKNHCGVASAASYPLV 334


>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
 gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
 gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
 gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
 gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
          Length = 358

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 134/338 (39%), Positives = 175/338 (51%), Gaps = 31/338 (9%)

Query: 18  SVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ 77
           S   F +VG  +     H +   LF  ++ ++ K YA+  E   R  +F  NL  I  + 
Sbjct: 27  SGGEFSIVGYSEEDLASHDRLIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDI- 85

Query: 78  DTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS-------VPAMIPNITLPRAF 130
           + +  S   GLNEF+DL+  EF+A YLG    P+ ++             + N  +P+  
Sbjct: 86  NKKVTSYWLGLNEFADLTHDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEM 145

Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCE 189
           DWR+ +AVT VK+Q  CGS WAFST   +EG+ A  T  L SLSEQELIDC  + ++GC 
Sbjct: 146 DWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCN 205

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMA 248
           GG +  AF  I S   GGL  E+ YPY  ++  C   K A  V I+GY  V + DE  + 
Sbjct: 206 GGLMDYAFSYIAST--GGLRTEEAYPYAMEEGDCDEGKGAAVVTISGYEDVPANDEQALV 263

Query: 249 KYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
           K L    P++VAI A     QFY  GV      F     E L H V  VGYG      T 
Sbjct: 264 KALAHQ-PVSVAIEASGRHFQFYSGGV------FDGPCGEQLDHGVTAVGYG------TS 310

Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGIN 340
           K   Y I+KNSWG  WGEKGY R+ R    G+G CGIN
Sbjct: 311 KGQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCGIN 348


>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
          Length = 351

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 134/348 (38%), Positives = 180/348 (51%), Gaps = 29/348 (8%)

Query: 4   FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
           F  F  +A+ + +     F +VG          K T LF  ++ +H K+Y +  E   R 
Sbjct: 10  FLLFISMAVFAYSAFARDFSIVGYSPDDLTSMDKLTDLFESWMSKHGKSYRSFEEKLHRF 69

Query: 64  HIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAM 120
            +F  NL+ I    +T      Y  GLNEF+DLS  EF+ KYLG K++ P   D      
Sbjct: 70  EVFQDNLKHID---ETNKKVSSYWLGLNEFADLSHEEFKRKYLGLKIELPKRRDSPEEFS 126

Query: 121 IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
             ++  LP++ DWR+  AV  VK+Q  CGS WAFST   +EG+    T  L +LSEQELI
Sbjct: 127 YKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQIVTGNLTALSEQELI 186

Query: 180 DCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGY 237
           DCD+  ++GC GG +  AF  I+S   GGL +E+ YPY  ++  C   K+  + V I+GY
Sbjct: 187 DCDKPFNNGCNGGLMDYAFAFIISN--GGLRKEEDYPYVMEEGTCGEKKEELEVVTISGY 244

Query: 238 VSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
             V  D        + N P++VAI A +   QFY  G+     F    G E L H V  V
Sbjct: 245 HDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGI-----FNGHCGTE-LDHGVAAV 298

Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           GYG      T K V Y  +KNSWG  WGEKGY R+ R     +G CGI
Sbjct: 299 GYG------TSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGI 340


>gi|405953314|gb|EKC21001.1| Cathepsin F [Crassostrea gigas]
          Length = 397

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 124/351 (35%), Positives = 179/351 (50%), Gaps = 50/351 (14%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
            LF  +  +HNK Y   +   S+  +F  NL+ I  L     G   +GLN+ +DLS  EF
Sbjct: 52  PLFQKWKSEHNKIYRNHMIERSKFKVFLENLKVINELNGQFQGKTTFGLNQLADLSQKEF 111

Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
               L  K +    ++   ++  + +LP +FDW     VT VKDQ   GS WAFS  GNI
Sbjct: 112 SRIVLMPKRRAPVFEKERSSL--SGSLPDSFDWTNQSKVTAVKDQGAAGSCWAFSAIGNI 169

Query: 160 EGVYAAKTKKLVSLSEQELIDCD--------QEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
           EG +A   K L + S ++++DCD        + D G  GG    A+  +M    GGLE  
Sbjct: 170 EGQWAMMGKPLTNFSVEQIVDCDGMEDVAKGEADCGVFGGWPFLAYQYVMR--AGGLETW 227

Query: 212 KTY------------------------------PYRGDDKAC----RLNKKATQVKINGY 237
           + Y                              PY    ++C     ++K    +K+  +
Sbjct: 228 EDYWYCSGLGGAAGTCEVCPAPGYNTALCGPPIPYCNMTQSCVTKLDVSKFHPGLKVMSW 287

Query: 238 VSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
            ++ ++ET +A+ L++ GP++VA+NA  LQFY  G+  P  F CD   +NL H+VL+VGY
Sbjct: 288 KAIDQNETSIAEQLIKLGPLSVALNAELLQFYHHGIFDPPSFVCD--PKNLDHAVLLVGY 345

Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           G +++ F  K   YW IKNSWG  WGEKGYFR+ RG G CGIN  V SA++
Sbjct: 346 GSEKSIFGTKD--YWKIKNSWGPKWGEKGYFRMLRGQGKCGINTAVTSAVL 394


>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
          Length = 479

 Score =  198 bits (504), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 128/322 (39%), Positives = 167/322 (51%), Gaps = 28/322 (8%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
           H     +  AL+  +L  H K Y  + E   R  IF  NLR I    + E  +   GL  
Sbjct: 51  HQRPDEEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDE-HNRESRTYKVGLTR 109

Query: 91  FSDLSTAEFQAKYLG--FKLKP--SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTM 146
           F+DL+  E++A++LG  F  KP  S A     A      LP   DWR+  AV  VKDQ  
Sbjct: 110 FADLTNEEYRARFLGGRFSRKPRLSAAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQ 169

Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLG 205
           CGS WAFS+   +EG+    T +L+ LSEQEL+DCD+  + GC GG +  AF  I+    
Sbjct: 170 CGSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGN-- 227

Query: 206 GGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA- 263
           GG++ E+ YPY+G D AC  N+K A  V I+GY  V  ++    K  V N P++VAI A 
Sbjct: 228 GGIDTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAG 287

Query: 264 -YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGW 322
             A Q Y +GV      F      +L H V+ VGYG D          YWI++NSWG+ W
Sbjct: 288 GRAFQLYQSGV------FTGRCGTDLDHGVVAVGYGTD------NGTDYWIVRNSWGKDW 335

Query: 323 GEKGYFRLYRG-----DGSCGI 339
           GE GY RL R       G CGI
Sbjct: 336 GESGYIRLERNVANITTGKCGI 357


>gi|146335576|gb|ABQ23397.1| cathepsin L [Trypanosoma carassii]
          Length = 456

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 119/313 (38%), Positives = 167/313 (53%), Gaps = 17/313 (5%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A F  F  +H K+Y +  E   R+ +F  ++ K        +    +G+ +FSDL+  EF
Sbjct: 34  AQFAAFKAEHGKSYTSAAEEGYRMRVFEESM-KAAQAHAAANPHAKFGVTKFSDLTHEEF 92

Query: 100 QAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           +  Y       + A +     +    T P  +DWR+  AVT VKDQ  CGS W FSTTGN
Sbjct: 93  KTLYANGAAHFAAAAKRARRPVSVTGTAPDEWDWRKKGAVTPVKDQGHCGSCWTFSTTGN 152

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY-- 216
           IEG +A    +L +LSEQ L+ CD  D GC GG + NAF+ I+++  G +  E++YPY  
Sbjct: 153 IEGQWAVAGNELTNLSEQMLVSCDARDYGCSGGLMDNAFEWIVNQNDGFVFTEESYPYAS 212

Query: 217 -RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
             GD   C +  +     I G+V +  DE  MA +L  NGP+++A++A + + Y  GV  
Sbjct: 213 GSGDAPLCDVGGRKVGATIKGHVGLPNDEEKMAAWLAANGPISIAVDADSFKAYKGGV-- 270

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                C+ G   L H VL+VGY     K  +   PYWIIKNSWG  WGE GY R+  G  
Sbjct: 271 --LTGCEEG--QLDHGVLLVGY----NKVANP--PYWIIKNSWGPNWGEHGYIRVGFGTN 320

Query: 336 SCGINDYVRSALV 348
            C +N Y  SA+V
Sbjct: 321 QCNLNSYACSAIV 333


>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
 gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
 gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
 gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
          Length = 307

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 118/304 (38%), Positives = 162/304 (53%), Gaps = 22/304 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ QH + Y  + E   R  IF  N+ +I+   +        G+N+F+DL+  EF+A + 
Sbjct: 8   WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMHH 67

Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
           G+K + S    S         +P + DWR+  AVT VKDQ  CG  WAFS    IEG+  
Sbjct: 68  GYKRQSSKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCCWAFSAVAAIEGIIK 127

Query: 165 AKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
            KT KL+SLSEQ+L+DCD +  D GC GG + NAF  I+    GGL  E TYPY+G D  
Sbjct: 128 LKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRN--GGLTSEATYPYQGVDGT 185

Query: 223 CRLNKKAT-QVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQF 279
           C+  K A+ + KI GY  V  +  +     V   P++VA+    Y  QFY +GV     F
Sbjct: 186 CKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQFYKSGV-----F 240

Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DG 335
             D G   L H+V  +GYG +          YW++KNSWG  WGE GY R+ RG    +G
Sbjct: 241 KGDCGTY-LDHAVTAIGYGTN-----SDGTNYWLVKNSWGTSWGESGYMRMQRGIGAREG 294

Query: 336 SCGI 339
            CG+
Sbjct: 295 LCGV 298


>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
          Length = 474

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 134/360 (37%), Positives = 184/360 (51%), Gaps = 39/360 (10%)

Query: 3   CFYFFAGVALLSLTVSVSSFMVVGDEKLHHLH------HVKHTALFNYFLEQHNKTYATL 56
           C   F  +A   +  S S   ++  ++ H L+      H +  +L+  +L +H+K Y  L
Sbjct: 15  CLVLFFSLASFLMLSSASDMSIITYDETHGLNSPPLRTHDQLLSLYESWLVKHHKNYNAL 74

Query: 57  VEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPS----- 111
            E  +R  IF  N+  +       + S   GLN+F+DL+  E+++ YL  K+        
Sbjct: 75  GEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYLSGKMMKRERKNE 134

Query: 112 ---YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTK 168
               +DR V        LP + DWR+  AV  VKDQ  CGS WAFST G +EG+    T 
Sbjct: 135 DGFRSDRFV--FEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGAVEGINKIVTG 192

Query: 169 KLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK 227
           +L+SLSEQEL+DCD   + GC GG +  AF+ I+    GG++ E  YPY+G D  C  N+
Sbjct: 193 ELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKN--GGIDTEDDYPYKGVDGLCDQNR 250

Query: 228 K-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGG 284
           K A  V INGY  V  ++    K  V + P++VAI A   A Q Y +GV     F    G
Sbjct: 251 KNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGV-----FTGQCG 305

Query: 285 NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
            E L H V+ VGYG +  K       YWI++NSWG  WGE GY RL R       G CGI
Sbjct: 306 TE-LDHGVVAVGYGSENGK------DYWIVRNSWGPDWGESGYIRLERNVASTSTGKCGI 358


>gi|165969032|ref|YP_001650932.1| peptidase [Orgyia leucostigma NPV]
 gi|164663528|gb|ABY65748.1| peptidase [Orgyia leucostigma NPV]
          Length = 328

 Score =  198 bits (503), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 121/321 (37%), Positives = 175/321 (54%), Gaps = 24/321 (7%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+  + K Y   +E   R  IF  NL +I + ++  + + VY +N+FSDLS
Sbjct: 23  LKAPDYFESFVANYQKNYNDDLEKSKRYTIFKDNLEEINV-KNRLNDTAVYRINKFSDLS 81

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  +KY G    PS        ++   P    P  FDWR+ + VT +K+Q  CG+ WA
Sbjct: 82  KTEIISKYTGLN-APSETTNFCKTIVLDQPPGKGPLNFDWRQQNKVTSIKNQGSCGACWA 140

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  +IE  YA +  + ++LSEQ+LIDCD  D GC GG +  AF+ ++    GG+++E 
Sbjct: 141 FATLASIESQYAIRNDRHINLSEQQLIDCDYVDMGCYGGLLHTAFEQMIQM--GGVKQEH 198

Query: 213 TYPYRGDDKACRLNKKATQ---VKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQF 268
            YPY G +K C LN        V+I G Y  V   E  +   L   GP+ +AI+A  +  
Sbjct: 199 EYPYAGVNKQCELNDITDDSFVVRIKGCYRYVVVREEKLKDLLRAVGPIPIAIDASGIVN 258

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV +    +C+  N  L+H+VL+VGYGVD        VPYW  KN+WG  WGE GYF
Sbjct: 259 YYKGVIN----YCE--NYGLNHAVLLVGYGVD------NGVPYWTFKNTWGVDWGENGYF 306

Query: 329 RLYRGDGSCGI-NDYVRSALV 348
           RL +   +CG+ N+   SA++
Sbjct: 307 RLRQNINACGMANELASSAVI 327


>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 125/306 (40%), Positives = 164/306 (53%), Gaps = 20/306 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
           F   H KTY +LVE   R  +F  NL  IQ      E G   +   + +F+D++  EF  
Sbjct: 26  FKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLD 85

Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
             K  G    PS A         ++    A DWRE  AVT VKDQ  CGS WAFS  G I
Sbjct: 86  LLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           EG +  K   LVSLS QEL+DC  ED   +GC+GG +  AFD +  +   G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPY 202

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
            G   +C+ + +    K+  YV    DE +MA+ +   GP+AVAI A  L FY  G+   
Sbjct: 203 EGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
            +  C    E+L+H VL+VGYG      +   V YWI+KNSWG  WGEKGYFRL +   +
Sbjct: 261 -RCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313

Query: 337 CGINDY 342
           CGI  Y
Sbjct: 314 CGIGYY 319


>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 125/306 (40%), Positives = 164/306 (53%), Gaps = 20/306 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
           F   H KTY +LVE   R  +F  NL  IQ      E G   +   + +F+D++  EF  
Sbjct: 26  FKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLD 85

Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
             K  G    PS A         ++    A DWRE  AVT VKDQ  CGS WAFS  G I
Sbjct: 86  LLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           EG +  K   LVSLS QEL+DC  ED   +GC+GG +  AFD +  +   G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPY 202

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
            G   +C+ + +    K+  YV    DE +MA+ +   GP+AVAI A  L FY  G+   
Sbjct: 203 EGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
            +  C    E+L+H VL+VGYG      +   V YWI+KNSWG  WGEKGYFRL +   +
Sbjct: 261 -RCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313

Query: 337 CGINDY 342
           CGI  Y
Sbjct: 314 CGIGYY 319


>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 124/309 (40%), Positives = 165/309 (53%), Gaps = 20/309 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAE 98
           +  F   H KTY +LVE   R  +F  NL  IQ      E G   +   + +F+D++  E
Sbjct: 23  WQQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEE 82

Query: 99  FQ--AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F    K  G    PS A     +   ++    A DWRE  AVT  KDQ  CGS WAFS  
Sbjct: 83  FLDLLKLQGVPALPSNAVHFDNSEDIDMEEKDAVDWREEGAVTPAKDQANCGSCWAFSAV 142

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           G IEG +  K   LVSLS QEL+DC  ED   +GC+GG +  AFD +  +   G++ E++
Sbjct: 143 GAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEES 199

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
           YPY G   +C+ + +    K+  YV    DE +MA+ +   GP+AVAI A  L FY  G+
Sbjct: 200 YPYEGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGI 257

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
               +  C    E+L+H VL+VGYG      +   V YWI+KNSWG  WGEKGYFRL + 
Sbjct: 258 VDE-RCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKD 310

Query: 334 DGSCGINDY 342
             +CGI  Y
Sbjct: 311 VKACGIGYY 319


>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
 gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
 gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
          Length = 338

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 125/306 (40%), Positives = 165/306 (53%), Gaps = 24/306 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F++Q++K Y+   E+ SR + F  N+  I+L     + S   GLNEF+DLS  EF+
Sbjct: 41  MFTAFMKQYSKAYSH-AEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFK 99

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            KY G+K       RS          P + DWR  +AVT +KDQ  CGS WAFS TG+IE
Sbjct: 100 GKYFGYKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSIE 159

Query: 161 GVYAAKTK-KLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           G +  + K  L SLSEQ+L+DC     + GC GG +  AF+ I++    G+  E  YPY+
Sbjct: 160 GAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANK--GICAESAYPYK 217

Query: 218 GDDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
           G    C+  K  T+ V I+GY  V S DE  +   +   GP++VAI A     QFY +GV
Sbjct: 218 GVGGLCQ--KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGV 275

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                 F      NL H VL VGYG      T  +  YWI+KNSWG  WGE GY R+ R 
Sbjct: 276 ------FSGTCGHNLDHGVLAVGYG------TTGSQDYWIVKNSWGTSWGESGYIRMIRN 323

Query: 334 DGSCGI 339
              CGI
Sbjct: 324 KNQCGI 329


>gi|33333702|gb|AAQ11969.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  198 bits (503), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 125/306 (40%), Positives = 164/306 (53%), Gaps = 20/306 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
           F   H KTY +LVE   R  +F  NL  IQ      E G   +   + +F+D++  EF  
Sbjct: 26  FKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLD 85

Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
             K  G    PS A         ++    A DWRE  AVT VKDQ  CGS WAFS  G I
Sbjct: 86  LLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           EG +  K   LVSLS QEL+DC  ED   +GC+GG +  AFD +  +   G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPY 202

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
            G   +C+ + +    K+  YV    DE +MA+ +   GP+AVAI A  L FY  G+   
Sbjct: 203 EGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
            +  C    E+L+H VL+VGYG      +   V YWI+KNSWG  WGEKGYFRL +   +
Sbjct: 261 -RCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313

Query: 337 CGINDY 342
           CGI  Y
Sbjct: 314 CGIGYY 319


>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
 gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
          Length = 341

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 128/312 (41%), Positives = 172/312 (55%), Gaps = 28/312 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
           F  +H K Y    E   RL IF+ N  KI +  Q    G   + L  N+++DL   EF+ 
Sbjct: 32  FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADLLHHEFRQ 91

Query: 102 KYLGFKLKPSYADRS-------VPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
              GF        RS       V  + P ++TLP++ DWR   AVT VKDQ  CGS WAF
Sbjct: 92  LMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           S+TG +EG +  K+  LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ E
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 209

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
           K+YPY   D +C  NK A      G+  + + DE  MA+ +   GP+AVAI+A   + QF
Sbjct: 210 KSYPYEAIDDSCHFNKGAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASHESFQF 269

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV +  Q  CD   +NL H VL+VGYG D +        YW++KNSWG  WG+KG+ 
Sbjct: 270 YSEGVYNEPQ--CDA--QNLDHGVLVVGYGTDES-----GDDYWLVKNSWGTTWGDKGFI 320

Query: 329 RLYRG-DGSCGI 339
           ++ R  D  CGI
Sbjct: 321 KMLRNKDNQCGI 332


>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
 gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
          Length = 463

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 123/314 (39%), Positives = 167/314 (53%), Gaps = 31/314 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           ++  ++  H +TY  + E   R  +F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 40  MYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA-HNAAADAGVHSFRLGLNRFADLTN 98

Query: 97  AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
            E++A YLG + +P   +R + A      N  LP + DWR   AV  VKDQ  CGS WAF
Sbjct: 99  DEYRATYLGARTRPQ-RERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAF 157

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           ST   +EG+    T  L+SLSEQEL+DCD   + GC GG +  AF+ I++   GG++ EK
Sbjct: 158 STIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEK 215

Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
            YPY+G D  C +N+K A  V I+ Y  V  ++    +  V N P++VAI A   A Q Y
Sbjct: 216 DYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLY 275

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +G+      F       L H V  VGYG +  K       YWI+KNSWG  WGE GY R
Sbjct: 276 SSGI------FTGSCGTALDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVR 323

Query: 330 LYRG----DGSCGI 339
           + R      G CGI
Sbjct: 324 MERNIKASSGKCGI 337


>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
          Length = 352

 Score =  197 bits (502), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 137/349 (39%), Positives = 182/349 (52%), Gaps = 35/349 (10%)

Query: 6   FFAGVALLSLTVSVSSFMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
           F   V++L+ +   + F ++G   E L  +H V H  LF  +L +H+K Y +L E   R 
Sbjct: 13  FLVFVSVLACSALANEFSILGYAPEDLTSIHKVIH--LFESWLAKHSKIYESLDEKLHRF 70

Query: 64  HIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAM 120
            IF  NL+ I    DT      Y  GLNEF+DL+  EF+ K+LG K + P   D S+   
Sbjct: 71  EIFMDNLKHID---DTNKKVSNYWLGLNEFADLTHEEFKNKFLGLKGELPERKDESIEEF 127

Query: 121 IPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
                + LP++ DWR+  AV  VK+Q  CGS WAFST   +EG+    T  L  LSEQEL
Sbjct: 128 SYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQEL 187

Query: 179 IDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKING 236
           IDCD   ++GC GG +  AF  +M     GL +E+ YPY   +  C   K  ++ V I+G
Sbjct: 188 IDCDTTFNNGCNGGLMDYAFAYVMR---SGLHKEEEYPYIMSEGTCDEKKDVSETVTISG 244

Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
           Y  V R+  D     + N P++VAI A     QFY  GV     F    G E L H V  
Sbjct: 245 YHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGV-----FDGHCGTE-LDHGVAA 298

Query: 295 VGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGI 339
           VGYG      T K + Y I++NSWG  WGEKGY R+ R  G     CG+
Sbjct: 299 VGYG------TTKGLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGL 341


>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
          Length = 468

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 123/314 (39%), Positives = 167/314 (53%), Gaps = 31/314 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           ++  ++  H +TY  + E   R  +F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 45  MYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA-HNAAADAGVHSFRLGLNRFADLTN 103

Query: 97  AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
            E++A YLG + +P   +R + A      N  LP + DWR   AV  VKDQ  CGS WAF
Sbjct: 104 DEYRATYLGARTRPQ-RERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWAF 162

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           ST   +EG+    T  L+SLSEQEL+DCD   + GC GG +  AF+ I++   GG++ EK
Sbjct: 163 STIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEK 220

Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
            YPY+G D  C +N+K A  V I+ Y  V  ++    +  V N P++VAI A   A Q Y
Sbjct: 221 DYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLY 280

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +G+      F       L H V  VGYG +  K       YWI+KNSWG  WGE GY R
Sbjct: 281 SSGI------FTGSCGTALDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVR 328

Query: 330 LYRG----DGSCGI 339
           + R      G CGI
Sbjct: 329 MERNIKASSGKCGI 342


>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
 gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
          Length = 340

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 131/325 (40%), Positives = 173/325 (53%), Gaps = 29/325 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAE 98
           +N +  QH K Y +  E   RL I+  N  KI +  Q  E G   + L  N+++DL   E
Sbjct: 27  WNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLHEE 86

Query: 99  FQAKYLGFK--------LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGS 149
           F     GF         LK    D  V  + P N+ +P+  DWRE  AVT VKDQ  CGS
Sbjct: 87  FVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHCGS 146

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGG 207
            W+FS TG +EG +  KT KLVSLSEQ L+DC  +  ++GC GG +  AF  I  K  GG
Sbjct: 147 CWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYI--KDNGG 204

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY-- 264
           ++ EK YPY   D  C  N KA      G+V + + DE  + K +   GP++VAI+A   
Sbjct: 205 IDTEKAYPYEAIDDTCHYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDASHE 264

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
           + QFY  GV +  Q  CD  +ENL H VL VGYG      + +   YW++KNSWG  WG+
Sbjct: 265 SFQFYSEGVYYEPQ--CD--SENLDHGVLAVGYGT-----SEEGEDYWLVKNSWGTTWGD 315

Query: 325 KGYFRLYRG-DGSCGINDYVRSALV 348
           +GY ++ R  D  CGI       LV
Sbjct: 316 QGYVKMARNRDNHCGIATAASYPLV 340


>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
          Length = 374

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 119/313 (38%), Positives = 171/313 (54%), Gaps = 30/313 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           +  +L +H + Y  L E   R  IF  NLR I+   ++ + +   GLN+F+DL+  E++ 
Sbjct: 50  YEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNEEYRT 109

Query: 102 KYLGFK--LKPSYADRSVP----AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            YLG K   +  +     P    A  PN  +P + DWR+  AV  +K+Q  CGS WAFST
Sbjct: 110 MYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFST 169

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
              +EG+    T ++++LSEQEL+DCD+ ++ GC GG +  AF+ I+S   GG++ EK Y
Sbjct: 170 VAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN--GGMDTEKHY 227

Query: 215 PYRGDDKACR-LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
           PYRG +  C  + K    V I+GY  V R+E  + K  V + P+ VAI A   A Q Y +
Sbjct: 228 PYRGVEGRCDPVRKNYKVVSIDGYEDVPRNERALQK-AVAHQPVCVAIEASGRAFQLYSS 286

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV      F     E + H V++VGYG      +   V YWI++NSWG  WGE GY ++ 
Sbjct: 287 GV------FTGECGEEVDHGVVVVGYG------SEDGVDYWIVRNSWGTKWGENGYVKME 334

Query: 332 RGD-----GSCGI 339
           R       G CGI
Sbjct: 335 RNVKKSHLGKCGI 347


>gi|9635308|ref|NP_059206.1| ORF58 [Xestia c-nigrum granulovirus]
 gi|13124001|sp|Q9PYY5.1|CATV_GVXN RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|6175702|gb|AAF05172.1|AF162221_58 ORF58 [Xestia c-nigrum granulovirus]
          Length = 346

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 121/337 (35%), Positives = 175/337 (51%), Gaps = 30/337 (8%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           VALL+L V   S++       + + + +   LFN F+ ++NK Y    E  +R  IF  N
Sbjct: 19  VALLTLNVCAVSYIA------YDMSNAQE--LFNEFVVKYNKVYKDDQEKEARFEIFKQN 70

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT---- 125
           L  I      E  S ++ +N  +D+S+ E   K  G KL     ++      P +     
Sbjct: 71  LADINARNALED-SAMFEINSRADISSNELLQKLTGLKLSLMRGEKKNSFCTPTVISGDS 129

Query: 126 ---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
              +P +FDWR+ ++VT VK Q  CGS WAFS   NIE +Y  K    + LSEQ+L+DCD
Sbjct: 130 SGKVPDSFDWRDRNSVTSVKMQKECGSCWAFSAVANIESLYHIKHNVSLDLSEQQLVDCD 189

Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR 242
           + ++GC GG +S AF+ I+    GG+  E  YPY G D  C+   +  Q+    Y    R
Sbjct: 190 KVNNGCNGGLMSWAFEGIIR--AGGISYEAPYPYTGVDGVCKNTTRYVQLS-GCYAYDLR 246

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
            E  + + L E GP++VAI+   L  Y +GV+          +  L+H VL+VGYG +  
Sbjct: 247 SEKKLRQVLHEKGPVSVAIDVVDLTNYKSGVAKHCSV-----DHGLNHGVLLVGYGQEND 301

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
                 V YW +KNSWG  WGE+G+FR+ R   SCGI
Sbjct: 302 ------VKYWTLKNSWGSDWGEQGFFRIKRDVNSCGI 332


>gi|17384029|emb|CAD12392.1| cysteine proteinase [Leishmania infantum]
          Length = 354

 Score =  197 bits (502), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 129/360 (35%), Positives = 179/360 (49%), Gaps = 41/360 (11%)

Query: 5   YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
           +FFA V  +   V   S ++     L  +  +  +A +  F ++H K +    E   R +
Sbjct: 7   FFFAIVVTIRFVVCYGSALIA-QTPLGVVDFIA-SAHYGRFKKRHGKPFGEDAEEGRRFN 64

Query: 65  IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY------------ 112
            F  N++    L      +      +F+DL+  EF   YL     P+Y            
Sbjct: 65  AFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYKEHV 120

Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
             D SV + + ++      DWRE   VT VK+Q MCGS WAF+TTGNIEG +A K   LV
Sbjct: 121 HVDDSVRSGVMSV------DWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWALKNHSLV 174

Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKK 228
           SLSEQ L+ CD  DDGC GG +  A   I++   G +  E +YPY    G    C  N  
Sbjct: 175 SLSEQVLVSCDNIDDGCNGGLMQQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDN-G 233

Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENL 288
               KI GY+S+  DE ++A Y+ +NGP+AVA++A   Q Y  GV       C G   +L
Sbjct: 234 TVGAKIKGYMSLPHDEEEIAAYVGKNGPVAVAVDATTRQLYFGGVV----TLCFG--LSL 287

Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           +H VL+VG+            PYWI+KNSWG  WGEKGY RL  G   C + +YV +A +
Sbjct: 288 NHGVLVVGFN------RQAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCLLKNYVVTATI 341


>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
 gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
          Length = 349

 Score =  197 bits (501), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 137/352 (38%), Positives = 185/352 (52%), Gaps = 34/352 (9%)

Query: 2   SCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYS 61
           S   FFA  +L   +V    F +VG    H     K   LF  ++  H K Y +L E   
Sbjct: 9   SFLTFFA--SLFVCSVLAHDFSIVGYSPEHLTSVDKLVELFESWISGHGKAYNSLEEKLH 66

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSV 117
           R  +F  NL+ I   ++ E  S   GLNEF+DLS  EF++K+LG    F  K S  D S 
Sbjct: 67  RFEVFKENLKHIDQ-RNKEVTSYWLGLNEFADLSHEEFKSKFLGLYPEFPRKKSSEDFSY 125

Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
             ++    LP++ DWR+  AVT VK+Q  CGS WAFST   +EG+       L SLSEQ+
Sbjct: 126 RDVV---DLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNLTSLSEQQ 182

Query: 178 LIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKIN 235
           LIDCD   ++GC GG +  AF+ I++   GGL +E+ YPY  ++  C   ++  + V I+
Sbjct: 183 LIDCDTSFNNGCNGGLMDYAFEFIVNN--GGLHKEEDYPYLMEEGTCDEKREEMEVVTIS 240

Query: 236 GYVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSV 292
           GY  V R DE  + K L    P++VAI+A     QFY  GV      F      +L H V
Sbjct: 241 GYHDVPRNDEQSLLKALAHQ-PLSVAIDASGRDFQFYSGGV------FSGPCGTDLDHGV 293

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
             VGYG      +   + Y I+KNSWG  WGE+GY R+ R     +G CGIN
Sbjct: 294 AAVGYG------SSSGIDYIIVKNSWGPKWGERGYLRMKRNTGKPEGLCGIN 339


>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  197 bits (501), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 121/314 (38%), Positives = 163/314 (51%), Gaps = 27/314 (8%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A  + ++  H+K Y  L E   R  IF  N+ +I+     E      G+N+FSDL+  +F
Sbjct: 40  ARHDQWIAHHDKVYKDLNEKEMRFKIFKENVERIEAFNAGEDKGYKLGVNKFSDLTNEKF 99

Query: 100 QAKYLGFKLK-PSYADRSVPAM---IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           +  + G+K   P     S P       N+T +P   DWR+  AVT +KDQ  CG  WAFS
Sbjct: 100 RVLHTGYKRSHPKVMSSSKPKTHFRYANVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFS 159

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
                EG++  KT KL+ LSEQEL+DCD   ED+GC GG +  AFD I+     GL  E 
Sbjct: 160 AVAATEGLHQLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILK--NKGLTTEA 217

Query: 213 TYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFY 269
            YPY+G+D  C   K A +  KI GY  V  +        V N P++VAI+  ++  QFY
Sbjct: 218 NYPYKGEDGVCNKKKSALSAAKIAGYEDVPANSEKALLQAVANQPVSVAIDGSSFDFQFY 277

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV      F    +  L+H+V  VGYG      T     YWIIKNSWG  WG+ GY R
Sbjct: 278 SSGV------FSGSCSTWLNHAVTAVGYGA-----TTDGTKYWIIKNSWGSKWGDSGYMR 326

Query: 330 LYRG----DGSCGI 339
           + R     +G CG+
Sbjct: 327 IKRDVHEKEGLCGL 340


>gi|255088003|ref|XP_002505924.1| cysteine endopeptidase [Micromonas sp. RCC299]
 gi|226521195|gb|ACO67182.1| cysteine endopeptidase [Micromonas sp. RCC299]
          Length = 291

 Score =  197 bits (501), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 118/289 (40%), Positives = 163/289 (56%), Gaps = 26/289 (8%)

Query: 77  QDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL--KPSYADRSVPAMIPNIT---LPRAFD 131
           Q  + GS V+G+ +FSDL+  EF + +LG KL  +   A RS    +P+     LP  FD
Sbjct: 8   QAQDRGSAVHGVTQFSDLTPTEFASTFLGTKLANEDVAAIRSGMTTLPDYPAHDLPLEFD 67

Query: 132 WREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD----- 186
           WRE  AVT VK+Q  CGS W FS TG +EG    KT +LVSLSEQ+L+DCD   D     
Sbjct: 68  WRERGAVTPVKNQGACGSCWTFSATGAVEGANFLKTGELVSLSEQQLVDCDHTCDPSAPR 127

Query: 187 ----GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSVS 241
               GC GG   NA   +      GL+ E  YPY+G D  C   +       ++ +  VS
Sbjct: 128 NCDYGCNGGLPLNAMRYVQKH---GLDTESNYPYKGVDGKCASARHGPAAASVSSFNLVS 184

Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
            +ET +A  L+++GP+++ I+A  +Q YV GV+ P  + C+     L H VLIVGYGV+ 
Sbjct: 185 TNETQIAAALLKHGPLSIGIDAAWMQTYVGGVACP--WICN--KAGLDHGVLIVGYGVNG 240

Query: 302 T---KFTHKAVPYWIIKNSWGEGWG-EKGYFRLYRGDGSCGINDYVRSA 346
           T   +  H+   YWI+KNSWG  WG E GY+ + +   +CG+N  V +A
Sbjct: 241 TAPARPWHRRQDYWIVKNSWGPNWGVEGGYYHICKDRAACGLNTMVVAA 289


>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
           Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
           Precursor
 gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
 gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
 gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
          Length = 361

 Score =  197 bits (501), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 129/309 (41%), Positives = 171/309 (55%), Gaps = 36/309 (11%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H+    +L E   R ++F  N++ I      +  S    LN+F D+++ EF+  Y G  +
Sbjct: 44  HHTVARSLEEKAKRFNVFKHNVKHIHETNKKDK-SYKLKLNKFGDMTSEEFRRTYAGSNI 102

Query: 109 K-------PSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           K          A +S   M  N+ TLP + DWR+  AVT VK+Q  CGS WAFST   +E
Sbjct: 103 KHHRMFQGEKKATKSF--MYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVE 160

Query: 161 GVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           G+   +TKKL SLSEQEL+DCD  ++ GC GG +  AF+ I  K  GGL  E  YPY+  
Sbjct: 161 GINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEK--GGLTSELVYPYKAS 218

Query: 220 DKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHP 276
           D+ C  NK+ A  V I+G+  V ++  D     V N P++VAI+A     QFY  GV   
Sbjct: 219 DETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGV--- 275

Query: 277 IQFFCDGGNENLSHSVLIVGYG--VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG- 333
             F    G E L+H V +VGYG  +D TK       YWI+KNSWGE WGEKGY R+ RG 
Sbjct: 276 --FTGRCGTE-LNHGVAVVGYGTTIDGTK-------YWIVKNSWGEEWGEKGYIRMQRGI 325

Query: 334 ---DGSCGI 339
              +G CGI
Sbjct: 326 RHKEGLCGI 334


>gi|42564161|gb|AAS20592.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 326

 Score =  197 bits (501), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 122/305 (40%), Positives = 173/305 (56%), Gaps = 19/305 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ---DTEHGSGVYGLNEFSDLSTAEFQA 101
           F + H KTY +L+E  +R  IF  NLRKI+      D    S   G+  F+DL+  EF+ 
Sbjct: 26  FKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEFKD 85

Query: 102 KYL-GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           K     K KP+  + ++      + +P + DW +  AV  VK Q  CGS WAFS TG +E
Sbjct: 86  KLRRQIKTKPN-VEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGALE 144

Query: 161 GVYAAKTKKLVSLSEQELIDCDQE--DDGCE-GGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           G  A      + LSEQ+L+DC +   +D CE GG +S AFD ++ K   G+E + +YPY+
Sbjct: 145 GQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVLDK---GIEADSSYPYK 201

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G D  C+ + K T +KI GY +VS  E ++ K +   GP++VAI+A  +Q Y  G+   +
Sbjct: 202 GIDTPCQYDAKKTVLKIKGYRNVSISEEELKKAVGTVGPVSVAIDADPIQLYSGGILDGL 261

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGS 336
             FC     NL+H VL VGYG +   F  K   +W +KNSWG+ WGE+GYFR+ R  +  
Sbjct: 262 --FC---THNLNHGVLAVGYGEEDHLFGKKK--FWKVKNSWGKDWGEQGYFRIKRDANNL 314

Query: 337 CGIND 341
           CGI D
Sbjct: 315 CGIAD 319


>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  197 bits (501), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 137/339 (40%), Positives = 181/339 (53%), Gaps = 30/339 (8%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           +LSLTV+    + VG        H +H  LF     QHNKTY    +   R  IF  N++
Sbjct: 2   ILSLTVAC---IFVGVSPAAVDAHDEHWELFK---RQHNKTYLQKQDV-GRRAIFEANIK 54

Query: 72  KIQ---LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL-- 126
           KI    LL D    S   GLN F+D++  EF+ KY G + + + A  S      N ++  
Sbjct: 55  KINAHNLLYDLGRSSYRLGLNGFADMTPDEFE-KYRGTRFEANEARVSKLQHRDNRSMHV 113

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--E 184
           P   DWR    VT VK+Q +CGS WAFSTTG +EG +  ++  LVSLSEQ L+DC     
Sbjct: 114 PDTVDWRTEGYVTPVKNQGVCGSCWAFSTTGALEGQHFRRSGDLVSLSEQMLVDCSAVYG 173

Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRD 243
           + GC GG + NAF  I  K  GGLE EK+YPY G D  C  + +    K+ G+V V SRD
Sbjct: 174 NAGCNGGLMDNAFRFI--KDAGGLETEKSYPYTGKDGTCHFDARGIGAKLTGFVDVPSRD 231

Query: 244 ETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
           E  + +     GP++VAI+A     QFY  GV   I   C   + +L H VL+VGYG   
Sbjct: 232 EEALKEAAGVVGPVSVAIDASGQNFQFYKDGVYDEIT--CS--STSLDHGVLVVGYGT-- 285

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
              T     YW++KNSWG  WG+ GY ++ R  +  CGI
Sbjct: 286 ---TRDGKDYWLVKNSWGSSWGQSGYIQMSRNKENQCGI 321


>gi|42564159|gb|AAS20591.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
          Length = 326

 Score =  197 bits (501), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 121/305 (39%), Positives = 173/305 (56%), Gaps = 19/305 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ---DTEHGSGVYGLNEFSDLSTAEFQA 101
           F + H KTY +L+E  +R  IF  NLRKI+      D    S   G+  F+DL+  EF+ 
Sbjct: 26  FKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFADLTHDEFKD 85

Query: 102 KYL-GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           +     K KP+  + ++      + +P + DW +  AV  VK Q  CGS WAFS TG +E
Sbjct: 86  ELRRQIKTKPN-VEATLAVFPEGLEVPDSIDWTQKGAVLDVKYQGGCGSCWAFSATGALE 144

Query: 161 GVYAAKTKKLVSLSEQELIDCDQE--DDGCE-GGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           G  A      + LSEQ+L+DC +   +D CE GG +S AFD ++ K   G+E + +YPY+
Sbjct: 145 GQNAIVNNVKIPLSEQQLLDCSKPYGNDDCEHGGLMSFAFDYVLDK---GIEADSSYPYK 201

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G D  C+ + K T +KI GY +VS  E ++ K +   GP++VAI+A  +Q Y  G+   +
Sbjct: 202 GIDTPCQYDAKKTVLKIKGYKNVSNSEEELKKAVGTVGPVSVAIDADPIQLYFGGILDGL 261

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGS 336
             FC     NL+H VL VGYG +   F  K   +W +KNSWG+ WGE+GYFR+ R  +  
Sbjct: 262 --FC---THNLNHGVLAVGYGEEDHLFGKKK--FWKVKNSWGKDWGEQGYFRIKRDANNL 314

Query: 337 CGIND 341
           CGI D
Sbjct: 315 CGIAD 319


>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  197 bits (501), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 126/314 (40%), Positives = 166/314 (52%), Gaps = 26/314 (8%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
           K   LF  ++ +H K Y T+ E   R  +F  NL+ I           + GLNEF+DLS 
Sbjct: 42  KLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVSNYWL-GLNEFADLSH 100

Query: 97  AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
            EF+ KYLG K+  S    S         ++ LP++ DWR+  AVT VK+Q  CGS WAF
Sbjct: 101 QEFKNKYLGLKVNLSQRRESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAF 160

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           ST   +EG+    T  L SLSEQELIDCD   ++GC GG +  AF  I+    GGL +E 
Sbjct: 161 STVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQ--NGGLHKED 218

Query: 213 TYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFY 269
            YPY  ++  C + K+ TQ V INGY  V ++        + N P++VAI A +   QFY
Sbjct: 219 DYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFY 278

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GV      F      +L H V  VGYG      T K + Y I+KNSWG  WGEKG+ R
Sbjct: 279 SGGV------FDGHCGSDLDHGVSAVGYG------TSKNLDYIIVKNSWGAKWGEKGFIR 326

Query: 330 LYRG----DGSCGI 339
           + R     +G CG+
Sbjct: 327 MKRNIGKPEGICGL 340


>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 324

 Score =  197 bits (501), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 121/316 (38%), Positives = 174/316 (55%), Gaps = 22/316 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ +   H  +YAT+ E  +R  I+  NL  I+   ++E  S    +N+F+DL+  EF A
Sbjct: 22  FDSWKATHGVSYATVGEETARRGIYRANLDFIEK-HNSEGHSYKLAVNKFADLTYPEFAA 80

Query: 102 KYLGFKLKPSYADRSVPAM--IPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           KYLG +   + A +S  A   +P  ++LP + DWR    VT +KDQ  CGS W+FSTTG+
Sbjct: 81  KYLGLRFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTGS 140

Query: 159 IEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           +EG +A KT +LVSLSEQ L+DC   Q + GC GG +  AF  I+S    G++ E +YPY
Sbjct: 141 VEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISN--NGIDTESSYPY 198

Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAY--ALQFYVTGV 273
              D  C+ N       +  Y  + S  E+D+   +   GP++VAI+A   + QFY +GV
Sbjct: 199 TAQDGTCQFNSANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGV 258

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR- 332
            +  +  C   +  L H VL VGYG      T  +  YW++KNSWG  WG+ GY  + R 
Sbjct: 259 YN--EPACS--SSQLDHGVLAVGYG------TSGSSDYWLVKNSWGTSWGQSGYIWMTRN 308

Query: 333 GDGSCGINDYVRSALV 348
            +  CGI       LV
Sbjct: 309 SNNQCGIATAASYPLV 324


>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
 gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
          Length = 339

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 129/352 (36%), Positives = 185/352 (52%), Gaps = 43/352 (12%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
            FA +AL+++  +VS   V+ +E             +  F  +H K Y    E   RL I
Sbjct: 4   LFALLALVAVAQAVSYADVIKEE-------------WQTFKLEHRKNYVDETEERFRLKI 50

Query: 66  FSGNLRKIQLLQDTEHGSG----VYGLNEFSDLSTAEFQAKYLGFKLKPSYADR-SVPAM 120
           F+ N  KI    +  + SG       +N+++D+   EF     GF        R S P+ 
Sbjct: 51  FNENKHKI-AKHNQRYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLRASDPSF 109

Query: 121 I-------PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSL 173
           +        ++ +P++ DWR   AVT VKDQ  CGS WAFS+TG +EG +  K   L+SL
Sbjct: 110 VGVTFISPEHVKIPKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISL 169

Query: 174 SEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
           SEQ L+DC  +  ++GC GG + NAF  I  K  GG++ EK+YPY G D +C  NK    
Sbjct: 170 SEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEGIDDSCHFNKATIG 227

Query: 232 VKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENL 288
               G V + + DE  MA+ +   GP++VAI+A   + QFY  G+ +  Q  CD   +NL
Sbjct: 228 ATDRGSVDIPQGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQ--CDP--QNL 283

Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGI 339
            H VL+VGYG D +        YW++KNSWG  WG+KG+ ++ R  D  CGI
Sbjct: 284 DHGVLVVGYGTDES-----GQDYWLVKNSWGTTWGDKGFIKMARNADNQCGI 330


>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 349

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 126/312 (40%), Positives = 166/312 (53%), Gaps = 24/312 (7%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
           K   LF  ++ +H K Y ++ E   R  IF  NL+ I           + GLNEF+DLS 
Sbjct: 42  KLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWL-GLNEFADLSH 100

Query: 97  AEFQAKYLGFKLKPSYADRSVPAMI-PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            EF+ KYLG K+  S    S       ++ LP++ DWR+  AV  VK+Q  CGS WAFST
Sbjct: 101 QEFKNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFST 160

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
              +EG+    T  L SLSEQELIDCD+  ++GC GG +  AF  I+    GGL +E+ Y
Sbjct: 161 VAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVEN--GGLHKEEDY 218

Query: 215 PYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVT 271
           PY  ++  C + K+ T+ V I+GY  V ++        + N P++VAI A     QFY  
Sbjct: 219 PYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 278

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV      F      +L H V  VGYG      T K V Y I+KNSWG  WGEKGY R+ 
Sbjct: 279 GV------FDGHCGSDLDHGVAAVGYG------TAKGVDYIIVKNSWGSKWGEKGYIRMR 326

Query: 332 RG----DGSCGI 339
           R     +G CGI
Sbjct: 327 RNIGKPEGICGI 338


>gi|146084829|ref|XP_001465113.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
 gi|134069209|emb|CAM67356.1| cysteine peptidase A (CPA) [Leishmania infantum JPCM5]
          Length = 354

 Score =  197 bits (500), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 128/360 (35%), Positives = 178/360 (49%), Gaps = 41/360 (11%)

Query: 5   YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
           +FFA V  +   V   S ++   +    +     +A +  F ++H K +    E   R +
Sbjct: 7   FFFAIVVTILFVVCYGSALIA--QTPLGVDDFIASAHYGRFKKRHGKPFGEDAEEGRRFN 64

Query: 65  IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY------------ 112
            F  N++    L      +      +F+DL+  EF   YL     P+Y            
Sbjct: 65  AFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYKEHV 120

Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
             D SV + + ++      DWRE   VT VK+Q MCGS WAF+TTGNIEG +A K   LV
Sbjct: 121 HVDDSVRSGVMSV------DWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWALKNHSLV 174

Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKK 228
           SLSEQ L+ CD  DDGC GG +  A   I++   G +  E +YPY    G    C  N  
Sbjct: 175 SLSEQVLVSCDNIDDGCNGGLMQQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDN-G 233

Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENL 288
               KI GY+S+  DE ++A Y+ +NGP+AVA++A   Q Y  GV       C G   +L
Sbjct: 234 TVGAKIKGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVV----TLCFG--LSL 287

Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           +H VL+VG+            PYWI+KNSWG  WGEKGY RL  G   C + +YV +A +
Sbjct: 288 NHGVLVVGFN------RQAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCLLKNYVVTATI 341


>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
           str. Neff]
          Length = 330

 Score =  197 bits (500), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 117/312 (37%), Positives = 171/312 (54%), Gaps = 18/312 (5%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  Q+ K+YA+  E+  RL IF  NL +I  L     G+  YG+N+F+DL+  EF+A
Sbjct: 32  FRQFAAQYGKSYAS-EEFGERLRIFRDNLDRIDALNSANTGA-RYGVNKFADLTPKEFKA 89

Query: 102 KYL-GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            YL G +        +   +     LP  FDWR+  AVT  KDQ  CG  WAFS T  IE
Sbjct: 90  TYLKGARSAGQKKAAATAKLDMTGPLPSQFDWRDKGAVTPTKDQGQCG--WAFSVTEAIE 147

Query: 161 GVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
             +    +KLVSL+ Q+++DCDQ   D GC+GG    A++ ++    GGL+ E++YPY  
Sbjct: 148 SQWFLSGRKLVSLAPQQIVDCDQGNGDYGCDGGDPPTAYEYVIK--AGGLDTEESYPYTA 205

Query: 219 DDKACRLNKKATQVKING--YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
           +D  C     A   KI+   Y++ +++ET+M   L   GP+++ ++A + Q+Y+ GV   
Sbjct: 206 EDGQCAFKPSAVGAKISNWTYITTTKNETEMQYGLASRGPLSICVDASSWQYYIGGV--- 262

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
           I   C+   ++L H V+I GY V +  +       W I+NSWGE WG  GY  + RG   
Sbjct: 263 ITSLCE---DSLDHCVMITGYSV-QEGWDFMKYDVWNIRNSWGEDWGYGGYLYVQRGSNL 318

Query: 337 CGINDYVRSALV 348
           CG+ D V   LV
Sbjct: 319 CGVGDEVTIPLV 330


>gi|394331824|gb|AFN27131.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  196 bits (499), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 118/318 (37%), Positives = 172/318 (54%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWR+  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             G+IE  +A    +L +LSEQ+L+ CD +D GC    +  AF+ ++  + G +  E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEQQLVSCDDKDSGCRARLMLQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY            + Q+    +I+GY+++   ET MA +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSSTGYVPECSNSIQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQ 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            GV       C G    L+H VL+VGY  +RT      VPYW+IKNSWGE WGE GY R+
Sbjct: 275 RGVVTS----CAG--MPLNHGVLLVGY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
 gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
          Length = 372

 Score =  196 bits (499), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 120/294 (40%), Positives = 159/294 (54%), Gaps = 31/294 (10%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADR-- 115
           RL +F  NLR I    + E  +G++    GL  F+DL+  E++ + LGF+ +   A R  
Sbjct: 70  RLEVFRDNLRYIDA-HNAEADAGLHTFRLGLTPFADLTLEEYRGRALGFRARRGGASRVG 128

Query: 116 SVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
           S  +  P      LP A DWRE  AVTGVK+Q  CG  WAFS    IEG+    T  LVS
Sbjct: 129 SGSSYRPRPRGGDLPDAIDWRELGAVTGVKNQEQCGGCWAFSAVAAIEGINEIVTGNLVS 188

Query: 173 LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ- 231
           LSEQE+IDCD +D GC GG + NAF  +++   GG++ E  YPY G D AC  N+   + 
Sbjct: 189 LSEQEIIDCDTQDGGCNGGEMQNAFQFVINN--GGIDTEADYPYLGTDAACDANRVNERV 246

Query: 232 VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF--YVTGVSHPIQFFCDGGNENLS 289
           V I+G+VSV+ +     +  V N P++VAI+A   +F  Y +G+      F       L 
Sbjct: 247 VTIDGFVSVATENETALQEAVANQPVSVAIDASGRKFQHYTSGI------FNGPCGTQLD 300

Query: 290 HSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           H V  VGYG +  K       YWI+KNSW   WGE GY R+ R      G CGI
Sbjct: 301 HGVTAVGYGSENGK------DYWIVKNSWSSSWGEAGYIRIRRNVAAATGKCGI 348


>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 471

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 133/331 (40%), Positives = 180/331 (54%), Gaps = 29/331 (8%)

Query: 24  VVGDEKLHHLHHVKHTA-LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
            + D + H LH       +F+ +LE+H++ Y +L E   R  IF  NL  I      E  
Sbjct: 33  AIMDYEAHELHSDDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEK- 91

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAMI-PNITLPRAFDWREYDAVTG 140
           S   GLN+FSDL+  EF+A YLG +    ++  R+    I  ++      DWR+  AV+ 
Sbjct: 92  SYWLGLNKFSDLTHDEFRALYLGIRPAGRAHGLRNGDRFIYEDVVAEEMVDWRKKGAVSD 151

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDT 199
           VKDQ  CGS WAFS  G++EGV A  T +L+SLSEQEL+DCD+ ++ GC GG +  AFD 
Sbjct: 152 VKDQGSCGSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDYAFDF 211

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ--VKINGYVSV-SRDETDMAKYLVENGP 256
           I+    GG++ E+ YPY+  D  C   +K T   V I+ Y  V ++ E+ + K + +N P
Sbjct: 212 IIKN--GGIDTEEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKN-P 268

Query: 257 MAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWII 314
           ++VAI A     Q Y  GV      F      +L H VL VGYG D        V YWI+
Sbjct: 269 VSVAIEAGGRDFQHYQGGV------FTGPCGTDLDHGVLAVGYGTD-----DDGVNYWIV 317

Query: 315 KNSWGEGWGEKGYFRLYR-----GDGSCGIN 340
           KNSWG  WGEKGY R+ R       G CGIN
Sbjct: 318 KNSWGPSWGEKGYIRMERMGSNSTSGKCGIN 348


>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
          Length = 334

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 131/351 (37%), Positives = 188/351 (53%), Gaps = 40/351 (11%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           L++L  +VS F +  DE             F  F + H K Y   +E   R  IF  N +
Sbjct: 10  LIALGQAVSFFDLSADE-------------FTLFKKFHRKEYDNELEESYRKKIFLENKK 56

Query: 72  KIQLLQDTEHGSGVYG----LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA--MIP--N 123
           +I+   ++ +  G       LN  +D+   E+   YLGF       +  + +   IP  +
Sbjct: 57  RIEK-HNSRYKQGKVSFKLKLNHLADMLIHEYSDVYLGFNKSSKANNNKLQSYTFIPPAH 115

Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
           +TL +  DWR   AVT VK+Q  CGS WAFSTTG +EG    KT KLVSLSEQ L+DC  
Sbjct: 116 VTLNKEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCSG 175

Query: 184 E--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
              ++GCEGG + NAF  I  K   G++ EK+YPY G+D+ CR  K +     +G+V ++
Sbjct: 176 SYGNNGCEGGLMDNAFQYI--KENHGIDTEKSYPYEGEDETCRFRKTSIGATDSGFVDIT 233

Query: 242 R-DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
           + DE  + + +   GP++VAI+A   + QFY  GV +  +  C   +ENL H VL+VGYG
Sbjct: 234 QGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEPE--C--SSENLDHGVLVVGYG 289

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           V+  +       YW++KNSWG  WG+ GY ++ R  D +CGI       LV
Sbjct: 290 VEDNQ------KYWLVKNSWGTQWGDGGYIKMARDQDNNCGIATQASYPLV 334


>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
          Length = 337

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 119/310 (38%), Positives = 172/310 (55%), Gaps = 26/310 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  +H K + + VE   R+ IF+ N  KI     L      S   GLN++SD+   EF+ 
Sbjct: 30  FKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDMLYHEFKE 89

Query: 102 KYLGFK--LKPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
              G+   ++     +    +I     N+ +P++ DWR++ AVT VKDQ  CGS WAFS+
Sbjct: 90  TMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHCGSCWAFSS 149

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           T  +EG +  K   LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ EK+
Sbjct: 150 TAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKS 207

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYV 270
           YPY G D +C   K        G+V + + DE  + K +   GP++VAI+A   + Q Y 
Sbjct: 208 YPYEGIDDSCHFTKSGVGATDTGFVDIPQGDEEALMKAVATMGPVSVAIDASHESFQLYS 267

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            GV +  +  CD   +NL H VL+VGYG D+T      + YW++KNSWG  WG++GY ++
Sbjct: 268 EGVYNEPE--CDA--QNLDHGVLVVGYGTDKT-----GLDYWLVKNSWGTTWGDQGYIKM 318

Query: 331 YRG-DGSCGI 339
            R  D  CGI
Sbjct: 319 ARNQDNQCGI 328


>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
 gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
 gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
          Length = 350

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 128/315 (40%), Positives = 169/315 (53%), Gaps = 29/315 (9%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDL 94
           K   LF  ++ +H K Y T+ E   R  +F  NL+ I    D       Y  GLNEF+DL
Sbjct: 42  KLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHID---DRNKVVSNYWLGLNEFADL 98

Query: 95  STAEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           S  EF+ KYLG K+  S    S        ++ LP++ DWR+  AVT VK+Q  CGS WA
Sbjct: 99  SHQEFKNKYLGLKVDLSQRRESSEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWA 158

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           FST   +EG+    T  L SLSEQELIDCD   ++GC GG +  AF  I+    GGL +E
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKN--GGLHKE 216

Query: 212 KTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQF 268
           + YPY  ++  C + K+ ++ V INGY  V ++        + N P++VAI A     QF
Sbjct: 217 EDYPYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQF 276

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV     F    G+E L H V  VGYG      T K + Y I+KNSWG  WGEKG+ 
Sbjct: 277 YSGGV-----FDGHCGSE-LDHGVSAVGYG------TSKGLDYIIVKNSWGAKWGEKGFI 324

Query: 329 RLYRG----DGSCGI 339
           R+ R     +G CG+
Sbjct: 325 RMKRNIGKSEGICGL 339


>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
          Length = 380

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 128/348 (36%), Positives = 188/348 (54%), Gaps = 28/348 (8%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHT-ALFNYFLEQHNKTYATLVEYYSRLH 64
           F +   L   T  + SF +  D K+  L       AL+  +L ++ K+Y +L E   R+ 
Sbjct: 7   FISMSLLFFSTFLIFSFAI--DAKISPLRTNDEVMALYESWLVKYGKSYNSLGEREMRIE 64

Query: 65  IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIP 122
           IF  NLR I       + S   GLN+F+DL+  E+++ YLGFK  LK   ++R +P +  
Sbjct: 65  IFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSSLKSKVSNRYMPQV-- 122

Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
              LP   DWR   AV  VK+Q +C S WAF+T   +E +    T  L+SLSEQEL+DC+
Sbjct: 123 GEVLPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSEQELVDCN 182

Query: 183 QE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVS 239
           +   ++GC+GG + +A++ I++   GG+  E+ YPY G D  C   KK    V I+ Y  
Sbjct: 183 RTPINEGCKGGFMDDAYEFIINN--GGINTEENYPYIGQDDQCDEPKKNQNYVTIDSYEQ 240

Query: 240 VSRDETDMAKYLVENGPMAVAINAYAL--QFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
           V  ++    K  V   P++VAI+AY L  +FY +G+     F        L+H+V I+GY
Sbjct: 241 VPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGI-----FTGGSCGTTLNHAVTIIGY 295

Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR---GDGSCGINDY 342
           G      T   + YWI+KNS+G  WGE GY ++ R   G+G CGI  Y
Sbjct: 296 G------TENGIDYWIVKNSYGTQWGESGYGKVQRNVGGEGRCGIASY 337


>gi|323451241|gb|EGB07119.1| hypothetical protein AURANDRAFT_54023 [Aureococcus anophagefferens]
          Length = 377

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 117/315 (37%), Positives = 160/315 (50%), Gaps = 21/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGL-NEFSDLSTAEFQ 100
           F  F+ +  KTY T+ E+  RL +F+ N  KI L  D +      GL N+F+D +  EF 
Sbjct: 65  FMTFMTKFEKTYETVEEWAHRLTVFAQNA-KIVLEHDAKAEGFALGLDNQFADWTAEEF- 122

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           A Y     +P  +       + +   P A DWR    V  +K+Q  CGS W FST  +IE
Sbjct: 123 ASYQKLHSRPKPSQAGATHEVSDKAAPTAVDWRTEGVVADIKNQGSCGSCWTFSTVVSIE 182

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDD---------GCEGGSISNAFDTIMSKLGGGLEEE 211
           G  A KT KLV+LSEQ L+DC ++D          GC GG + NAFD I+    GG++ E
Sbjct: 183 GAAARKTGKLVTLSEQNLVDCVKKDQIDGGDECCMGCSGGLMDNAFDYIIKNQDGGIDTE 242

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINAY-ALQFY 269
            +Y Y G D  C  +K      I+ +  V+  DE  +A  L   GP+++A++A    Q Y
Sbjct: 243 ASYGYTGKDGTCAFDKANVGATISNWTDVAVGDEVALADALANAGPVSIALDASKQWQLY 302

Query: 270 VTGVSHPIQFF-CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
             G+  P     C     +  H V IVGYG D        V YW I+NSWG  WGE GY 
Sbjct: 303 SGGILKPRSILGCSSDPTHADHGVAIVGYGTD------DGVDYWWIRNSWGTTWGESGYM 356

Query: 329 RLYRGDGSCGINDYV 343
           RL RG  +CG+ ++ 
Sbjct: 357 RLERGVNACGVANFA 371


>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
 gi|1582621|prf||2119193B cathepsin L-related Cys protease
          Length = 313

 Score =  196 bits (499), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 115/306 (37%), Positives = 170/306 (55%), Gaps = 21/306 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLSTAE 98
           + +F  Q+ + Y    E   R  +F  N + ++      E+G   +   +N+F D++  E
Sbjct: 12  WEHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEE 71

Query: 99  FQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           F A   G+K K S  + +         +    DWR   AVT VKDQ  CGS WAFS TG+
Sbjct: 72  FNAVMKGYK-KGSRGEPTTVFTAEGRPMAADVDWRTKGAVTPVKDQGQCGSCWAFSATGS 130

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           +EG +  K  +LVSLSEQEL+DC  E  +DGC GG +++AFD I  K  GG++ E +YPY
Sbjct: 131 LEGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYI--KDNGGIDTESSYPY 188

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVS 274
              D++CR +  +      G+V V   E  + + + + GP++VAI+A  ++ QFY +GV 
Sbjct: 189 EAQDRSCRFDANSIGATCTGFVEVQHTEEALHEAVSDIGPISVAIDASHFSFQFYSSGVY 248

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG- 333
           +  +  C     NL H VL VGYG + T+       YW++KNSWG GWG+ GY ++ R  
Sbjct: 249 YEKK--CS--PTNLDHGVLAVGYGTESTE------DYWLVKNSWGSGWGDAGYIKMSRNR 298

Query: 334 DGSCGI 339
           D +CGI
Sbjct: 299 DNNCGI 304


>gi|390470786|ref|XP_003734355.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin W [Callithrix jacchus]
          Length = 373

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 115/324 (35%), Positives = 176/324 (54%), Gaps = 22/324 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F +F  Q N++Y T  E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF  
Sbjct: 42  FKFFQIQFNRSYLTPEEHARRLDIFAHNLVQAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101

Query: 102 KYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREY-DAVTGVKDQTMCGSSWAFSTTG 157
            Y   +     PS + R V +  P  ++P   DWR+   A++ +++Q  C   WA +  G
Sbjct: 102 LYGHQRAAGGVPSMS-RVVGSEEPEESVPHTCDWRKVAGAISFIRNQGNCLCCWAMAAAG 160

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           NIE +++    K V++S QEL+DC +  DGC GG + +AF T++     G+  E  YP++
Sbjct: 161 NIEALWSINFLKFVNVSVQELLDCGRCGDGCHGGYVWDAFSTVLKN--SGVVSESDYPFQ 218

Query: 218 GDDKACRLNKKATQ--VKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
            +    R + K       I  ++ +  D   +A+YL   GP+ V INA  LQ Y  GV  
Sbjct: 219 ANFGPHRCHAKTYNKVAWIMDFIFLPDDXQRIAQYLTTYGPITVTINAKHLQLYQKGVIK 278

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFT-----------HKAVPYWIIKNSWGEGWGE 324
                CD   + + HSVL+VG+G ++++              ++ PYWI+KNSWG  WGE
Sbjct: 279 ARPTTCD--PQFVDHSVLLVGFGSEKSEGMGAKTVSSQSRHPRSTPYWILKNSWGAQWGE 336

Query: 325 KGYFRLYRGDGSCGINDYVRSALV 348
           +GYFRL+RG  +CGI  Y  +A V
Sbjct: 337 EGYFRLHRGSNTCGITKYPVTARV 360


>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
 gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
           Precursor
 gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
 gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
 gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
          Length = 355

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 129/331 (38%), Positives = 177/331 (53%), Gaps = 28/331 (8%)

Query: 22  FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
           F +VG    H  +  K   LF  ++ +H+K Y ++ E   R  +F  NL  I   ++ E 
Sbjct: 31  FSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQ-RNNEI 89

Query: 82  GSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM---IPNIT-LPRAFDWREYDA 137
            S   GLNEF+DL+  EF+ +YLG   KP ++ +  P+      +IT LP++ DWR+  A
Sbjct: 90  NSYWLGLNEFADLTHEEFKGRYLGLA-KPQFSRKRQPSANFRYRDITDLPKSVDWRKKGA 148

Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNA 196
           V  VKDQ  CGS WAFST   +EG+    T  L SLSEQELIDCD   + GC GG +  A
Sbjct: 149 VAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENG 255
           F  I+S   GGL +E  YPY  ++  C+  K+   +V I+GY  V  ++ +     + + 
Sbjct: 209 FQYIIST--GGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQ 266

Query: 256 PMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
           P++VAI A     QFY  GV      F      +L H V  VGYG      + K   Y I
Sbjct: 267 PVSVAIEASGRDFQFYKGGV------FNGKCGTDLDHGVAAVGYG------SSKGSDYVI 314

Query: 314 IKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           +KNSWG  WGEKG+ R+ R     +G CGIN
Sbjct: 315 VKNSWGPRWGEKGFIRMKRNTGKPEGLCGIN 345


>gi|308462787|ref|XP_003093674.1| hypothetical protein CRE_29187 [Caenorhabditis remanei]
 gi|308249538|gb|EFO93490.1| hypothetical protein CRE_29187 [Caenorhabditis remanei]
          Length = 392

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 119/346 (34%), Positives = 191/346 (55%), Gaps = 23/346 (6%)

Query: 4   FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
            YFF  +  L++T+ +     +  +++      ++   F  F ++  K +   VE+  R 
Sbjct: 58  LYFFTALFFLTVTLGL-----LYQKRVERQEFFENLQEFRDFNQKFQKIHKNSVEFKERF 112

Query: 64  HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL-KPSYADRSVPAM-I 121
            IF GNL+K+++L+ +      + +N+FSD+S  E +   L  KL + ++ + ++ +  +
Sbjct: 113 LIFRGNLKKLEILRSSNPDID-FSINQFSDMSENELKLILLDKKLLERNFQNSTLKSFDL 171

Query: 122 P-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELID 180
           P N+T P   DWR+   V  VK+Q  CGS WAF+T   +E  YA +   L SLSEQEL+D
Sbjct: 172 PMNLTRPERIDWRDSGKVMSVKNQGACGSCWAFATVAAVESQYAIRKGTLWSLSEQELVD 231

Query: 181 CDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR-GDDKACRLNKKATQVKINGYVS 239
           CD E  GC GG +  A   +   LG GLE E  YPY       C +N   T+V ++   S
Sbjct: 232 CDGESYGCGGGFLDKALGWV---LGNGLETEDDYPYECTQHDQCYINGGKTRVTVDEGWS 288

Query: 240 VSRDETDMAKYLVENGPMAVAIN-AYALQFYVTGVSHPIQFFCDGGNENLS-HSVLIVGY 297
           + RDE  +A ++   GP+A A++   +   Y  GV +P +  C   +E+L  H++ ++GY
Sbjct: 289 LGRDEDSIADWVASVGPVAFAMSVPNSFTAYSNGVYNPSEHECR--DESLGYHAMTLIGY 346

Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
           G +  +      PYWI+KNSWG  WG++GY RL RG+ +CG+ D+V
Sbjct: 347 GTEGNQ------PYWIVKNSWGSSWGDQGYMRLARGNNACGMRDFV 386


>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
           haemaphysaloides haemaphysaloides]
          Length = 335

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 127/323 (39%), Positives = 169/323 (52%), Gaps = 26/323 (8%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLST 96
           A ++ F   H K YA+  E Y RL I+  N  KI    +    S V     +NEF DL  
Sbjct: 25  AEWSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDLLH 84

Query: 97  AEFQAKYLGFKLKPSYADRS-----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
            EF +   GFK     + R       P    ++ LP+  DWR+  AVT VK+Q  CGS W
Sbjct: 85  HEFVSTRNGFKRNYRDSPREGSFFVEPEGFEDLQLPKTVDWRKKGAVTPVKNQGQCGSCW 144

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLE 209
           AFSTTG++EG +  KT+KLVSLSEQ L+DC +   ++GCEGG + NAF  I S    G++
Sbjct: 145 AFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNNGCEGGLMDNAFKYIKSN--KGID 202

Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--AL 266
            E +YPY   D  C  N+        G+V +   DE  + K +   GP++VAI+A   + 
Sbjct: 203 TEWSYPYNATDGVCHFNRSDVGATDTGFVDIPEGDENKLKKAVAAVGPVSVAIDASHESF 262

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           QFY  GV    +  C   +E L H VL+VGYG      T     YW++KNSWG  WG++G
Sbjct: 263 QFYSEGVYDEPE--C--SSEQLDHGVLVVGYG------TKDGQDYWLVKNSWGTTWGDEG 312

Query: 327 YFRLYRG-DGSCGINDYVRSALV 348
           Y  + R  D  CGI       LV
Sbjct: 313 YIYMTRNKDNQCGIASSASYPLV 335


>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
 gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
          Length = 480

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 122/314 (38%), Positives = 165/314 (52%), Gaps = 31/314 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           ++  ++  H +TY  +     R  +F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 43  MYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDA-HNAAADAGVHSFRLGLNRFADLTN 101

Query: 97  AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
            E+ A YLG + +P   DR + A      N  LP + DWR   AV  VKDQ  CG+ WAF
Sbjct: 102 DEYPATYLGARTRPQR-DRKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGTCWAF 160

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           ST   +EG+    T  L+SLSEQEL+DCD   + GC GG +  AF+ I++   GG++ EK
Sbjct: 161 STIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEK 218

Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
            YPY+G D  C +N+K A  V I+ Y  V  ++    +  V N P++VAI A   A Q Y
Sbjct: 219 DYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLY 278

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +G+      F       L H V  VGYG +  K       YWI+KNSWG  WGE GY R
Sbjct: 279 SSGI------FTGSCGTRLDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVR 326

Query: 330 LYRG----DGSCGI 339
           + R      G CGI
Sbjct: 327 MERNIKASSGKCGI 340


>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
          Length = 344

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 114/308 (37%), Positives = 163/308 (52%), Gaps = 26/308 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSDLSTAEFQAKY 103
           ++ +H + YA   E  +R  +F  N+ +I+ L D + G +    +N+F+DL+  EF++ Y
Sbjct: 41  WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 100

Query: 104 LGFKLKPSYADRSVPAM-----IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            GFK     + R+ P       + +  LP + DWR+  AVT +KDQ +CGS WAFS    
Sbjct: 101 TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 160

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IEGV   K  KL+SLSEQEL+DCD  D GC GG +  AF+  ++   GGL  E  YPY+ 
Sbjct: 161 IEGVAQIKKGKLISLSEQELVDCDTNDGGCMGGLMDTAFNYTITI--GGLTSESNYPYKS 218

Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSH 275
            +  C  NK K     I G+  V  ++       V + P+++ I       QFY +GV  
Sbjct: 219 TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGV-- 276

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
               F      +L H V  VGYG  R+K     + YWI+KNSWG  WGE+GY R+ +   
Sbjct: 277 ----FSGECTTHLDHGVTAVGYG--RSK---NGLKYWILKNSWGPKWGERGYMRIKKDIK 327

Query: 334 --DGSCGI 339
              G CG+
Sbjct: 328 PKHGQCGL 335


>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
          Length = 377

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 121/274 (44%), Positives = 152/274 (55%), Gaps = 37/274 (13%)

Query: 88  LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWREYD 136
           LN F D+S AEF+A + G ++  S   R  PA  P++            LPR+ DWR+  
Sbjct: 91  LNRFGDMSQAEFRATFAGSRV--SDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKG 148

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED-DGCEGGSISN 195
           AVTGVK+Q  CGS WAFST  ++EG+ A +T KLVSLSEQELIDCD  D DGCEGG + N
Sbjct: 149 AVTGVKNQGKCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDN 208

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ----VKINGYVSVSRDETDMAKYL 251
           AF+ I  K  GGL  E  YPYR  +  C+  K A      V I+G+  V  +  +     
Sbjct: 209 AFEYI--KKNGGLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKA 266

Query: 252 VENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
           V N P++V I+A   A  FY  GV     F  + G E L H V +VGYGV          
Sbjct: 267 VANQPVSVGIDASGKAFMFYSEGV-----FTGECGTE-LDHGVAVVGYGV-----AEDGK 315

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGS----CGI 339
            YW +KNSWG  WGEKGY R+ +  G+    CGI
Sbjct: 316 AYWTVKNSWGPSWGEKGYIRVEKDSGAEGGLCGI 349


>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 454

 Score =  196 bits (498), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 124/311 (39%), Positives = 168/311 (54%), Gaps = 27/311 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  +  +H K Y++L E+  R  ++  NL  IQ   +    S   GL +F+D++  EF+ 
Sbjct: 46  FGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEKNR-SYWLGLTKFADITNDEFRR 104

Query: 102 KYLGFKLKPS-YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           +Y G ++  S  + R       +   P + DWR+  AVT VKDQ  CGS WAFS  G++E
Sbjct: 105 QYTGTRIDRSKRSKRKTGFRYADSEAPESVDWRKKGAVTTVKDQGSCGSCWAFSAIGSVE 164

Query: 161 GVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           G+ A +T + VSLSEQEL+DCD E + GC GG +  AFD I+    GG++ E  YPY+G 
Sbjct: 165 GINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILEN--GGIDTENDYPYKGL 222

Query: 220 DKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHP 276
           D  C  NKK A  V I+GY  V  ++ +  K  V   P++VAI A     Q Y  GV   
Sbjct: 223 DGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGV--- 279

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD-- 334
              F      +L H VL VGYG      +  ++ YWI+KNSWGE WGE GY R+ R    
Sbjct: 280 ---FTGECGTDLDHGVLAVGYG------SEGSLDYWIVKNSWGEYWGESGYLRMQRNIKD 330

Query: 335 -----GSCGIN 340
                G CGIN
Sbjct: 331 SNHQFGLCGIN 341


>gi|394331818|gb|AFN27128.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 115/318 (36%), Positives = 172/318 (54%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWR+  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             G+IE  +A    +L +LSE  L+ C  ++ GC GG +  AF+ ++  + G +  E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSEHHLVSCHDKNSGCTGGLMLQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY            ++Q+    +I+GY+++   ET MA +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIDGYMTIESSETVMAAWLAKNGPISIAVDASSFMSYQ 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV             +L+H VL+VGY  +RT      VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGISLNHGVLLVGY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 333

 Score =  196 bits (498), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 114/308 (37%), Positives = 163/308 (52%), Gaps = 26/308 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSDLSTAEFQAKY 103
           ++ +H + YA   E  +R  +F  N+ +I+ L D + G +    +N+F+DL+  EF++ Y
Sbjct: 35  WMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMY 94

Query: 104 LGFKLKPSYADRSVPAM-----IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            GFK     + R+ P       + +  LP + DWR+  AVT +KDQ +CGS WAFS    
Sbjct: 95  TGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAA 154

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           IEGV   K  KL+SLSEQEL+DCD  D GC GG +  AF+  ++   GGL  E  YPY+ 
Sbjct: 155 IEGVAQIKKGKLISLSEQELVDCDTNDGGCMGGLMDTAFNYTITI--GGLTSESNYPYKS 212

Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSH 275
            +  C  NK K     I G+  V  ++       V + P+++ I       QFY +GV  
Sbjct: 213 TNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGV-- 270

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
               F      +L H V  VGYG  R+K     + YWI+KNSWG  WGE+GY R+ +   
Sbjct: 271 ----FSGECTTHLDHGVTAVGYG--RSK---NGLKYWILKNSWGPKWGERGYMRIKKDIK 321

Query: 334 --DGSCGI 339
              G CG+
Sbjct: 322 PKHGQCGL 329


>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
 gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
          Length = 352

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 116/307 (37%), Positives = 163/307 (53%), Gaps = 24/307 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  ++ ++ + Y    E   R  IF  N+  I+   +    S   G+N+F+D++  EF A
Sbjct: 37  FEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVA 96

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLP---RAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           +Y G   +P   ++       ++ +    ++ DWR+Y AVT VKDQ  CGS WAFS    
Sbjct: 97  QYTGGISRPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIAT 156

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG+Y   T  LVSLSEQE++DC    +GC+GG + NA+D I+S    G+  E  YPY+ 
Sbjct: 157 VEGIYKIVTGYLVSLSEQEVLDC-AVSNGCDGGFVDNAYDFIISN--NGVASEADYPYQA 213

Query: 219 DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSH 275
               C  N       I GY  V S DE+ M KY V N P+A AI+A     Q+Y  GV  
Sbjct: 214 YQGDCAANSWPNSAYITGYSYVRSNDESSM-KYAVWNQPIAAAIDASGDNFQYYNGGV-- 270

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
               F      +L+H++ I+GYG D +        YWI+KNSWG  WGE+GY R+ RG  
Sbjct: 271 ----FSGPCGTSLNHAITIIGYGQDSS-----GTQYWIVKNSWGSSWGERGYIRMARGVS 321

Query: 334 -DGSCGI 339
             G CGI
Sbjct: 322 SSGLCGI 328


>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
          Length = 338

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 129/356 (36%), Positives = 186/356 (52%), Gaps = 38/356 (10%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           F  G  L+ L+ ++S   ++ DE             ++ F   H K Y + +E   R+ I
Sbjct: 8   FLLGAVLVQLSAALSLTNLLADE-------------WHLFKATHKKEYPSQLEEKFRMKI 54

Query: 66  FSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SYADRSVPA 119
           +  N  K+    +L +    S    +N+F DL   EF++   G++ K    S A+ +   
Sbjct: 55  YLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTF 114

Query: 120 MIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
           M P N+ +P + DWRE  A+T VKDQ  CGS WAFS+TG +EG    KT KL+SLSEQ L
Sbjct: 115 MEPANVEVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNL 174

Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
           IDC  +  ++GC GG +  AF  I  K   G++ E TYPY  +D  CR N +       G
Sbjct: 175 IDCSGKYGNEGCNGGLMDQAFQYI--KDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRG 232

Query: 237 YVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           +V +   E D  K  V   GP++VAI+A   + QFY  GV +  +  CD  +++L H VL
Sbjct: 233 FVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYY--EPSCD--SDDLDHGVL 288

Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           +VGYG D  K       YW++KNSW E WG++GY ++ R     CG+       LV
Sbjct: 289 VVGYGSDNGK------DYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPLV 338


>gi|1619905|gb|AAB16997.1| thiol protease isoform A, partial [Glycine max]
          Length = 318

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 113/246 (45%), Positives = 149/246 (60%), Gaps = 24/246 (9%)

Query: 110 PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK 169
           P++A ++   ++P   LP+ FDWR+  AVT VKD   CGS W+FSTTG +E  +   T +
Sbjct: 74  PAHAQKA--PILPTKDLPKDFDWRDKGAVTNVKDLGGCGSCWSFSTTGALEVSFYLATGE 131

Query: 170 LVSLSEQELIDCDQE---------DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           LVSLSEQ+L+DCD           D GC GG ++NAF+ + S   GG+++EK  PY G D
Sbjct: 132 LVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEILQS---GGVQKEKDIPYTGRD 188

Query: 221 KACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
             C+ +K  T+V     +  VS DE  +A  LV+NGP+AVAINA  +Q YV GVS P  +
Sbjct: 189 GTCKFDK--TKVAATDLIKRVSLDEEQIAANLVKNGPLAVAINAVFMQTYVGGVSCP--Y 244

Query: 280 FCDGGNENLSHSVLIVGYGVDR-TKFTHKAVPYWIIKNSWGEGWGEK-GYFRLYRGDGSC 337
            C    ++L H VL+VGYG  R      K  PYWIIKNSWGE WGE  GY  + RG   C
Sbjct: 245 IC---GKHLDHGVLLVGYGEGRYAPIRFKNKPYWIIKNSWGESWGENDGYDEICRGRNVC 301

Query: 338 GINDYV 343
           G++  V
Sbjct: 302 GVDAMV 307


>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
          Length = 365

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 119/309 (38%), Positives = 168/309 (54%), Gaps = 31/309 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           ++  +L +H+K Y+ LVEY  R  IF  NL+ I    ++E+ +   GL  ++DL+  EFQ
Sbjct: 44  IYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDE-HNSENHTYKMGLTPYTDLTNEEFQ 102

Query: 101 AKYLG------FKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           A YLG       +LK +       A      LP   DWR+  AVT VK+Q  CGS WAFS
Sbjct: 103 AIYLGTRSDTIHRLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCWAFS 162

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           T   +E +   +T  L+SLSEQ+L+DC++++ GC+GG+   A+  I+    GG++ E  Y
Sbjct: 163 TVSTVESINQIRTGNLISLSEQQLVDCNKKNHGCKGGAFVYAYQYIID--NGGIDTEANY 220

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF--YVTG 272
           PY+     CR  KK   V+I+GY  V     +  K  V + P  VAI+A + QF  Y +G
Sbjct: 221 PYKAVQGPCRAAKKV--VRIDGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYKSG 278

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           +      F       L+H V+IVGY  D          YWI++NSWG  WGE+GY R+ R
Sbjct: 279 I------FSGPCGTKLNHGVVIVGYWKD----------YWIVRNSWGRYWGEQGYIRMKR 322

Query: 333 --GDGSCGI 339
             G G CGI
Sbjct: 323 VGGCGLCGI 331


>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
          Length = 351

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 128/316 (40%), Positives = 167/316 (52%), Gaps = 30/316 (9%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDL 94
           K   LF  ++ +H K Y T+ E   R  +F  NL+ I    D       Y  GLNEF+DL
Sbjct: 42  KLIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHID---DRNKIVSNYWLGLNEFADL 98

Query: 95  STAEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
           S  EF+ KYLG K+  S    S         ++ LP++ DWR+  AVT VK+Q  CGS W
Sbjct: 99  SHQEFKNKYLGLKVDLSQRRESSNEEEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCW 158

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEE 210
           AFST   +EG+    T  L SLSEQELIDCD   ++GC GG +  AF  I     GGL +
Sbjct: 159 AFSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQ--NGGLHK 216

Query: 211 EKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQ 267
           E+ YPY  ++  C + K+ TQ V INGY  V ++        + N P++VAI A +   Q
Sbjct: 217 EEDYPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQ 276

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
           FY  GV      F      +L H V  VGYG      T K + Y I+KNSWG  WGEKG+
Sbjct: 277 FYSGGV------FDGHCGSDLDHGVSAVGYG------TSKNLDYIIVKNSWGAKWGEKGF 324

Query: 328 FRLYRG----DGSCGI 339
            R+ R     +G CG+
Sbjct: 325 IRMKRDIGKPEGICGL 340


>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
           [Glycine max]
          Length = 466

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 124/321 (38%), Positives = 166/321 (51%), Gaps = 32/321 (9%)

Query: 35  HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
           H +H  ++  +L +H K Y  L E   R  IF  NLR I+        S   GLN+F+DL
Sbjct: 43  HTRH--VYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADL 100

Query: 95  STAEFQAKYLGFKLKPSYADRSVPAMIPNI-------TLPRAFDWREYDAVTGVKDQTMC 147
           +  E++A +LG + +      +V A   +         LP   DWRE  AVT +KDQ  C
Sbjct: 101 TNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQGQC 160

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGG 206
           GS WAFST G +EG+    T  L SLSEQEL+DCD+  + GC GG +  AF+ I+    G
Sbjct: 161 GSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQN--G 218

Query: 207 GLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA 265
           G++ E+ YPY   D  C  N+K A  V I+GY  V  ++       V N P++VAI A  
Sbjct: 219 GIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAGG 278

Query: 266 LQF--YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
           ++F  Y +GV      F      NL H V+ VGYG      T     YW+++NSWG  WG
Sbjct: 279 MEFQLYQSGV------FTGRCGTNLDHGVVAVGYG------TENGTDYWLVRNSWGSAWG 326

Query: 324 EKGYFRLYRG-----DGSCGI 339
           E GY +L R       G CGI
Sbjct: 327 ENGYIKLERNVQNTETGKCGI 347


>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 461

 Score =  196 bits (497), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 129/327 (39%), Positives = 169/327 (51%), Gaps = 32/327 (9%)

Query: 30  LHHLHHVKHTAL----FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
           LH    ++H  L    F  +  +H K Y    +   R  ++  NL  I+  +     S  
Sbjct: 38  LHMTTDLEHENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHSETNRTYS-- 95

Query: 86  YGLNEFSDLSTAEFQAKYLGFKLKPSY-ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
            GL +F+DL+  EF+  Y G ++  S  A R       +   P + DWR+  AVT VKDQ
Sbjct: 96  LGLTKFADLTNEEFRRMYTGTRIDRSRRAKRRTGFRYADSEAPESVDWRKNGAVTSVKDQ 155

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSK 203
             CGS WAFS  G++EG+ A +  + VSLSEQEL+DCD E + GC GG +  AFD I+  
Sbjct: 156 GSCGSCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQN 215

Query: 204 LGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
             GG++ EK YPY+G D  C  +KK A  V I+GY  V  ++ +  K  V   P++VAI 
Sbjct: 216 --GGIDTEKDYPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIE 273

Query: 263 AYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
           A     Q Y  GV      F      +L H VL VGYG      T   V YWI+KNSWGE
Sbjct: 274 AGGRDFQLYAQGV------FSGECGTDLDHGVLAVGYG------TEDGVDYWIVKNSWGE 321

Query: 321 GWGEKGYFRLYR-------GDGSCGIN 340
            WGE GY R+ R       G G CGIN
Sbjct: 322 YWGESGYLRMKRNMKDSNDGPGLCGIN 348


>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
 gi|255640677|gb|ACU20623.1| unknown [Glycine max]
          Length = 366

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 124/317 (39%), Positives = 169/317 (53%), Gaps = 34/317 (10%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A++  +L +H K Y  L +   R  +F  NL  IQ   +  + +   GLN+F+D++  E+
Sbjct: 36  AMYEEWLVRHQKGYNELGKKDKRFQVFKDNLGFIQEHNNNLNNTYKLGLNKFADMTNEEY 95

Query: 100 QAKYLGFKLKPSYADRSVP---------AMIPNITLPRAFDWREYDAVTGVKDQTMCGSS 150
           +A YLG K   S A R +          A      LP   DWR   AV  +KDQ  CGS 
Sbjct: 96  RAMYLGTK---SNAKRRLMKTKSTGHRYAFSARDRLPVHVDWRMKGAVAPIKDQGSCGSC 152

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLE 209
           WAFST   +E +    T K VSLSEQEL+DCD+  ++GC GG +  AF+ I+    GG++
Sbjct: 153 WAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNEGCNGGLMDYAFEFIIQN--GGID 210

Query: 210 EEKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YAL 266
            +K YPYRG D  C   KK A  V I+GY  V   + +  K  V + P++VAI A   AL
Sbjct: 211 TDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVPPYDENALKKAVAHQPVSVAIEASGRAL 270

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           Q Y +GV      F      +L H V++VGYG      +   V YW+++NSWG GWGE G
Sbjct: 271 QLYQSGV------FTGKCGTSLDHGVVVVGYG------SENGVDYWLVRNSWGTGWGEDG 318

Query: 327 YFRLYRG----DGSCGI 339
           YF++ R      G CGI
Sbjct: 319 YFKMQRNVRTSTGKCGI 335


>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 129/351 (36%), Positives = 185/351 (52%), Gaps = 40/351 (11%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
           A++++TV+ SS  ++  +             +  F   H KTY + +E   R  IF+ N 
Sbjct: 9   AIVAVTVAASSQEILRTQ-------------WEAFKTTHKKTYQSHMEELLRFKIFTEN- 54

Query: 71  RKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQAKYLGF--KLKPSYADRSVPAMIPNI 124
             I    + ++  G+     G+N+F DL   EF   + G+    K   +    PA + + 
Sbjct: 55  SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYHGSRKSGGSTFLPPANVNDS 114

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
           +LP+A DWR+  AVT VKDQ  CGS WAFSTTG++EG +  K  +LVSLSEQ L+DC Q 
Sbjct: 115 SLPKAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQS 174

Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR 242
             ++GCEGG + +AF  I  K   G++ EK+YPY   D  CR  K+       GYV +  
Sbjct: 175 FGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKA 232

Query: 243 D-ETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYG 298
             E D+ K +   GP++VAI+A   + Q Y  GV   P     +  +E+L H VL+VGYG
Sbjct: 233 GCEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP-----ECSSEDLDHGVLVVGYG 287

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
           V   K       YW++KNSW E WG++GY  + R  +  CGI       LV
Sbjct: 288 VKGGK------KYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332


>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 357

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 139/359 (38%), Positives = 187/359 (52%), Gaps = 43/359 (11%)

Query: 3   CFYFFAGVALLSLTVSVS-SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYS 61
           CF      A LSL+V+ S  + +VG        H K   LF  ++    K Y T+ E   
Sbjct: 11  CFPLALSAATLSLSVAASHDYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKLL 70

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKL-------KPSY 112
           R  +F  NL+ I    +T      Y  GLNEF+DLS  EF+  YLG K        + SY
Sbjct: 71  RFEVFKDNLKHID---ETNKKVKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSY 127

Query: 113 AD---RSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK 169
           A+   R V A      +P++ DWR+  AV  VK+Q  CGS WAFST   +EG+    T  
Sbjct: 128 AEFAYRDVEA------VPKSVDWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGN 181

Query: 170 LVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK 228
           L +LSEQELIDCD   ++GC GG +  AF+ I+    GGL +E+ YPY  ++  C + K 
Sbjct: 182 LTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVK--NGGLRKEEDYPYSMEEGTCEMQKD 239

Query: 229 ATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVAINAYALQF-YVTGVSHPIQFFCDGGN 285
            ++ V I+G+  V + DE  + K L    P++VAI+A   +F + +GVS     F     
Sbjct: 240 ESETVTIDGHQDVPTNDEKSLLKALAHQ-PLSVAIDASGREFQFYSGVS----VFDGRCG 294

Query: 286 ENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
            +L H V  VGYG      + K   Y I+KNSWG  WGEKGY RL R     +G CGIN
Sbjct: 295 VDLDHGVAAVGYG------SSKGSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCGIN 347


>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
 gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
          Length = 345

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 131/355 (36%), Positives = 185/355 (52%), Gaps = 44/355 (12%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           FF  + +LS+  +VS + +V +E             +  F  +H K Y   VE   R+ I
Sbjct: 5   FFIALTVLSIN-AVSFYDLVMEE-------------WQLFKAEHKKNYNNDVEEKFRMKI 50

Query: 66  FSGNLRKIQLLQDTEHGSG----VYGLNEFSDLSTAEFQAKYLGFK---LKPSYADRSVP 118
           F  N +KI    +T++  G      GLN++SD+   EF   + GF    + P     +  
Sbjct: 51  FMDNKQKI-TKHNTKYQRGEVGYKLGLNKYSDMLHHEFINTFNGFNKSIIPPHLRSNNGK 109

Query: 119 A------MIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
                   IP  N+ LP+  DW +  AVT VKDQ  CGS WAFS TG +EG++  KTK L
Sbjct: 110 THLKGSFFIPPANVKLPKHVDWVKLGAVTPVKDQGHCGSCWAFSATGALEGLHFRKTKVL 169

Query: 171 VSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK 228
           VSLSEQ LIDC  E+  +GC GG +  AF  +  ++ GG++ E++YPY G++  CR   +
Sbjct: 170 VSLSEQNLIDCSTEEGNNGCNGGLMDQAFQYV--RINGGIDTERSYPYEGNNDVCRYEPE 227

Query: 229 ATQVKINGYVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGN 285
            +     GY  V   + D  K  V   GP++VAI+A   + Q Y +GV    +  C    
Sbjct: 228 NSGAIDTGYTDVPLGDEDALKSAVATVGPVSVAIDASQESFQLYSSGVY--FEPNCKNEP 285

Query: 286 ENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGI 339
           E+L H VL+VGYG D          YW++KNSWG+ WGE GY ++ R  D  CGI
Sbjct: 286 ESLDHGVLVVGYGTDE----ETQQDYWLVKNSWGDSWGENGYIKMARNADNQCGI 336


>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
          Length = 467

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 130/354 (36%), Positives = 179/354 (50%), Gaps = 30/354 (8%)

Query: 1   MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
           +S F      A  ++ +S+ S+     +K       +  A++  +L +H K Y  L E  
Sbjct: 8   LSLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKE 67

Query: 61  SRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP-- 118
            R  IF  NLR I    ++++ +   GLN F+DL+  E+++ YLG K   +   R V   
Sbjct: 68  KRFGIFKDNLRFIDE-HNSQNLTYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVSRK 126

Query: 119 ----AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLS 174
               A      LP   DWR+  AV GVKDQ  CGS WAFST   +EG+    T  L+SLS
Sbjct: 127 SDRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLS 186

Query: 175 EQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQV 232
           EQEL+DCD   ++GC GG +  AF+ I++   GG++ E+ YPYR  D+ C +  K A  V
Sbjct: 187 EQELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDSEEDYPYRAADQKCDQYRKNANVV 244

Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSH 290
            I+GY  V  ++    K  V   P++VAI A   A Q Y +GV      F      +L H
Sbjct: 245 SIDGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGV------FTGKCGTSLDH 298

Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
            V  VGYG      T     YWI+ NSWG+ WGE GY R+ R       G CGI
Sbjct: 299 GVAAVGYG------TENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGI 346


>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 361

 Score =  195 bits (496), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 130/308 (42%), Positives = 174/308 (56%), Gaps = 34/308 (11%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H+    +L E   R ++F  N++ I      E+ S    LN+F D+++ EF+  Y G  +
Sbjct: 44  HHTIARSLEEKAKRFNVFKHNVKHIHETNKKEN-SYKLKLNKFGDMTSEEFRRTYAGSNI 102

Query: 109 KPS---YADRSVPA--MIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           K       +R      M  N+ TLP + DWR+  AVT VK+Q  CGS WAFST   +EG+
Sbjct: 103 KHHRMFQGERQTTKSFMYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGI 162

Query: 163 YAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
              +TKKL SLSEQEL+DCD  ++ GC GG +  AF+ I  K  GGL  E  YPY+  D+
Sbjct: 163 NQIRTKKLTSLSEQELVDCDTNKNQGCNGGLMDLAFEFIKEK--GGLTSELVYPYKASDE 220

Query: 222 ACRLNKK-ATQVKINGYVSVSRD-ETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPI 277
            C  NK+ A  V I+G+  V ++ E D+ K  V + P++VAI+A     QFY  GV    
Sbjct: 221 TCDTNKENAPVVSIDGHEDVPKNSEVDLMK-AVAHQPVSVAIDAGGSDFQFYSEGV---- 275

Query: 278 QFFCDGGNENLSHSVLIVGYG--VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
            F    G E L+H V +VGYG  +D TK       YWI+KNSWGE WGEKGY R+ RG  
Sbjct: 276 -FTGRCGTE-LNHGVAVVGYGTTIDGTK-------YWIVKNSWGEEWGEKGYIRMQRGIR 326

Query: 334 --DGSCGI 339
             +G CGI
Sbjct: 327 HKEGLCGI 334


>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 131/329 (39%), Positives = 170/329 (51%), Gaps = 28/329 (8%)

Query: 22  FMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           F +VG   E L  +   K   LF  ++ +H K Y  + E   R  IF  NL+ I      
Sbjct: 28  FSIVGYSSEDLKSMD--KLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKV 85

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI-PNITLPRAFDWREYDAV 138
                + GLNEF+DLS  EF  KYLG K+  S    S       ++ LP++ DWR+  AV
Sbjct: 86  VSNYWL-GLNEFADLSHREFNNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAV 144

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAF 197
             VK+Q  CGS WAFST   +EG+    T  L SLSEQELIDCD+  ++GC GG +  AF
Sbjct: 145 APVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAF 204

Query: 198 DTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGP 256
             I+    GGL +E+ YPY  ++  C + K+ TQ V I+GY  V ++        + N P
Sbjct: 205 SFIVEN--GGLHKEEDYPYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQP 262

Query: 257 MAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWII 314
           ++VAI A     QFY  GV      F      +L H V  VGYG      T K V Y  +
Sbjct: 263 LSVAIEASGRDFQFYSGGV------FDGHCGSDLDHGVAAVGYG------TAKGVDYITV 310

Query: 315 KNSWGEGWGEKGYFRLYRG----DGSCGI 339
           KNSWG  WGEKGY R+ R     +G CGI
Sbjct: 311 KNSWGSKWGEKGYIRMRRNIGKPEGICGI 339


>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
 gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
          Length = 339

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 128/315 (40%), Positives = 171/315 (54%), Gaps = 28/315 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAE 98
           +N F  QH K Y +  E   RL I+  N  KI +  Q  + G   Y L  N+++DL   E
Sbjct: 27  WNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEE 86

Query: 99  FQAKYLGFK-------LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSS 150
           F     GF        LK    +  V  + P N+ +P   DWR+  AVT VKDQ  CGS 
Sbjct: 87  FVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSC 146

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGL 208
           W+FS TG +EG +  KT KLVSLSEQ L+DC  +  ++GC GG +  AF  I  K  GG+
Sbjct: 147 WSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYI--KDNGGI 204

Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--A 265
           + EK+YPY   D  C  N KA      GYV + + DE  + K L   GP+++AI+A   +
Sbjct: 205 DTEKSYPYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHES 264

Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
            QFY  GV +  Q  CD  +ENL H VL VGYG      + +   YW++KNSWG  WG++
Sbjct: 265 FQFYSEGVYYEPQ--CD--SENLDHGVLAVGYGT-----SEEGEDYWLVKNSWGTTWGDQ 315

Query: 326 GYFRLYRG-DGSCGI 339
           GY ++ R  D  CG+
Sbjct: 316 GYVKMARNRDNHCGV 330


>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 123/306 (40%), Positives = 163/306 (53%), Gaps = 20/306 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
           F   H KTY ++VE   R  +F  NL  IQ      E G   +   + +F+D++  EF  
Sbjct: 26  FKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLD 85

Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
             K  G    PS A         ++    A DWRE  AVT VKDQ  CGS WAFS  G I
Sbjct: 86  LLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           EG +  K   LVSLS QEL+DC  E+   +GC GG +  AFD +  +   G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPY 202

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
            G   +C+ +      K+  YV    DE +MA+ +   GP+AVAI A  L FY  G+   
Sbjct: 203 EGRRSSCKKSGDYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
            +  C    E+L+H VL+VGYG      +   V YWI+KNSWG  WGEKGYFRL +   +
Sbjct: 261 -KCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313

Query: 337 CGINDY 342
           CGI+ Y
Sbjct: 314 CGIDYY 319


>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
           distachyon]
          Length = 377

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 114/263 (43%), Positives = 153/263 (58%), Gaps = 22/263 (8%)

Query: 88  LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQT 145
           LN F D+  AEF++ + G   + +   +S+P  I +    +P+A DWR+  AVTGVKDQ 
Sbjct: 96  LNRFGDMDQAEFRSTFAGPLHRHTRPAQSIPGFIYDTVKDIPQAVDWRQKGAVTGVKDQG 155

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSK 203
            CGS WAFS   ++EG+ A +T  LVSLSEQELIDCD   +D+GC+GG + +AF+ I + 
Sbjct: 156 KCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEFI-AH 214

Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKAT-QVKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
             GGL  E  YPY   +  C  N+ ++  V+I+G+ SV     +     V + P++VAI+
Sbjct: 215 SAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQPVSVAID 274

Query: 263 A--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
           A   A QFY  GV     F  D G+E L H V +VGYGV           YWI+KNSWG 
Sbjct: 275 AGGQAFQFYSEGV-----FTGDCGSE-LDHGVAVVGYGVAE----EDGKEYWIVKNSWGP 324

Query: 321 GWGEKGYFRLYRGDGS----CGI 339
           GWGE GY R+ R  G     CGI
Sbjct: 325 GWGEHGYVRMQRDSGVDGGLCGI 347


>gi|33333708|gb|AAQ11972.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 124/306 (40%), Positives = 161/306 (52%), Gaps = 20/306 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
           F   H KTY +LVE   R  +F  NL  IQ      E G   +   + +F+D++  EF  
Sbjct: 26  FKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLD 85

Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
             K  G    PS A         ++    A DWRE  AVT VKDQ  CGS WAFS  G I
Sbjct: 86  LLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           EG +  K   LVSLS QEL+DC  E+   +GC GG +  AFD +  +   G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPY 202

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
            G   +C+ +      K+  YV    DE +MA+ +   GP+AVAI A  L FY  G+   
Sbjct: 203 EGRRSSCKKSGDYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
               C    E+L+H VL+VGYG      +   V YWI+KNSWG  WGEKGYFRL +   +
Sbjct: 261 T-CRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313

Query: 337 CGINDY 342
           CGI  Y
Sbjct: 314 CGIGYY 319


>gi|33333696|gb|AAQ11966.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 124/306 (40%), Positives = 161/306 (52%), Gaps = 20/306 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
           F   H KTY +LVE   R  +F  NL  IQ      E G   +   + +F+D++  EF  
Sbjct: 26  FKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLD 85

Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
             K  G    PS A         ++    A DWRE  AVT VKDQ  CGS WAFS  G I
Sbjct: 86  LLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           EG +  K   LVSLS QEL+DC  E+   +GC GG +  AFD +  +   G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPY 202

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
            G   +C+ +      K+  YV    DE +MA+ +   GP+AVAI A  L FY  G+   
Sbjct: 203 EGRRSSCKKSGDYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
               C    E+L+H VL+VGYG      +   V YWI+KNSWG  WGEKGYFRL +   +
Sbjct: 261 T-CRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313

Query: 337 CGINDY 342
           CGI  Y
Sbjct: 314 CGIGYY 319


>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
          Length = 340

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 120/312 (38%), Positives = 167/312 (53%), Gaps = 35/312 (11%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  ++ ++ + Y    E   R  IF  N+  I+   +    S   G+N+F+D++  EF  
Sbjct: 37  FEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVT 96

Query: 102 KYLG------FKLKP--SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           +Y G      FK +P  S+ D ++ A      + ++ DWR+Y AVT VKDQ  CGS WAF
Sbjct: 97  QYTGVSLPLNFKREPVVSFDDVNISA------VGQSIDWRDYGAVTEVKDQNPCGSCWAF 150

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           S    +EG+Y   T  LVSLSEQE++DC    +GC+GG + NA+D I+S    G+  E  
Sbjct: 151 SAIATVEGIYKIVTGYLVSLSEQEVLDC-AVSNGCDGGFVDNAYDFIISN--NGVASEAD 207

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYV 270
           YPY+  +  C  N       I GY  V S DE+ M KY V N P+A AI+A     Q+Y 
Sbjct: 208 YPYQAYEGDCTANSWPNSAYITGYSYVRSNDESSM-KYAVWNQPIAAAIDASGDNFQYYN 266

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            GV      F      +L+H++ I+GYG D +        YWI+KNSWG  WGE+GY R+
Sbjct: 267 GGV------FSGPCGTSLNHAITIIGYGQDSS-----GTQYWIVKNSWGSSWGERGYVRM 315

Query: 331 YRG---DGSCGI 339
            RG    G CGI
Sbjct: 316 ARGVSSSGLCGI 327


>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
          Length = 372

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 126/331 (38%), Positives = 182/331 (54%), Gaps = 33/331 (9%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYA-TLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV 85
           DE+L     ++   L++ +  QH  T +    E+  R  IF  N++ I  + + + G   
Sbjct: 32  DEELESDESLR--GLYDKWALQHRSTRSLDSDEHARRFEIFKENVKHIDSV-NKKDGPYK 88

Query: 86  YGLNEFSDLSTAEFQAKYLGFKL---KPSYADRSVPA---MIPNIT-LPRAFDWREYDAV 138
            GLN+F+DLS  EF+A ++  K+   K    DR V +   M  N   LP + DWR+  AV
Sbjct: 89  LGLNKFADLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAV 148

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFD 198
           T VK+Q  CGS WAFST  ++EG+   KT KLVSLSEQ+L+DC +E+ GC GG + NAF 
Sbjct: 149 TPVKNQGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDCSKENAGCNGGLMDNAFQ 208

Query: 199 TIMSKLGGGLEEEKTYPYRGDDKAC---RLNKKATQVKINGYVSVSRDETDMAKYLVENG 255
            I+    GG+  E  YPY  +   C   ++  K+    I+G+  V  +     K  V + 
Sbjct: 209 YIIDN--GGIVTEDEYPYTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQ 266

Query: 256 PMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
           P+++AI A  +  QFY TGV     F    G E L H V++VGYG      + + + YWI
Sbjct: 267 PVSIAIEASGHDFQFYSTGV-----FTGKCGTE-LDHGVVVVGYGK-----SPEGINYWI 315

Query: 314 IKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           ++NSWG  WGE+GY R+ RG    +G CGI+
Sbjct: 316 VRNSWGPEWGEQGYIRMQRGIEATEGKCGIS 346


>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
 gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
          Length = 456

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 131/351 (37%), Positives = 177/351 (50%), Gaps = 38/351 (10%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHT--------ALFNYFLEQHNKTYATLVEYYS 61
           + LL L  ++SS   +     H  H  K +        A++  +L +H K Y  L E   
Sbjct: 2   LMLLFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEK 61

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF-----KLKPSYADRS 116
           R  IF  NL  I    ++E+ +   GLN F+DL+  EF++ YLG      K  P  +DR 
Sbjct: 62  RFEIFKDNLMFIDQ-HNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDRY 120

Query: 117 VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
            P +    +LP + DWR+  AV  VKDQ  CGS WAFST   +EG+    T  L++LSEQ
Sbjct: 121 APRV--GDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQ 178

Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQVKI 234
           EL+DCD   ++GC GG +  AF+ I++   GG++ E  YPY G D  C    K A  V I
Sbjct: 179 ELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDTEDDYPYLGRDGRCDTYRKNAKVVSI 236

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSV 292
           + Y  V  ++    K  V N P++VAI       Q Y +GV      F      +L H V
Sbjct: 237 DSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGV------FTGECGTSLDHGV 290

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
             VGYG      T K   YWI++NSWG+ WGE GY R+ R      G CGI
Sbjct: 291 AAVGYG------TEKGKDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGI 335


>gi|334347644|ref|XP_001379528.2| PREDICTED: cathepsin W-like [Monodelphis domestica]
          Length = 619

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 116/315 (36%), Positives = 167/315 (53%), Gaps = 26/315 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  F  Q+NK+YA   E   R  IF+ NL   Q L +   G   +G+ +FSDL+  EF  
Sbjct: 266 FKAFQIQYNKSYADPAEQERRFEIFADNLAWAQQLTEKHGGMAQFGVTQFSDLTEEEFHQ 325

Query: 102 KYLGFK---LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
            Y   +    +PS   R  P +     L R+ DWR+   +T V+ Q  C S WA +  GN
Sbjct: 326 HYQPAQSSYKEPSLKTRKHPRL--QRPLIRSCDWRKAGVLTPVRKQKKCRSCWAIAAVGN 383

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +E ++A   ++   LS QE++DCD+    C+GG + +AF TI+ +   GL  E+ YPY+ 
Sbjct: 384 VEALWAIHYEQHFELSVQEVLDCDRCGKACKGGFVWDAFLTILRQR--GLARERDYPYQD 441

Query: 219 DDKACRLNKKATQVK------INGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTG 272
                +L++K  Q K      I  ++ + ++E  MA++L   GP+ V IN   L+ Y  G
Sbjct: 442 -----QLSRKGCQKKQNRTGWIQDFLMLPKEENAMAEHLALKGPITVTINQALLKTYRKG 496

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V  P     D     + HSVL+VG+G +      K   YWI+KNSWG  WGE+GYFRL R
Sbjct: 497 VIRPKD---DCDPNQVDHSVLLVGFGQN-----TKDGAYWILKNSWGSDWGEEGYFRLRR 548

Query: 333 GDGSCGINDYVRSAL 347
           G  +CGI  Y  +AL
Sbjct: 549 GTNACGITKYPVTAL 563


>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
          Length = 374

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 118/313 (37%), Positives = 170/313 (54%), Gaps = 30/313 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           +  +L +H + Y  L E   R  IF  NLR I+   ++ + +   GLN+F+DL+  E++ 
Sbjct: 50  YEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFADLTNEEYRT 109

Query: 102 KYLGFK--LKPSYADRSVP----AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            YLG K   +  +     P    A  PN  +P + DWR+  AV  +K+Q  CGS WAFST
Sbjct: 110 MYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQGSCGSCWAFST 169

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
              + G+    T ++++LSEQEL+DCD+ ++ GC GG +  AF+ I+S   GG++ EK Y
Sbjct: 170 VAAVGGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN--GGMDTEKHY 227

Query: 215 PYRGDDKACR-LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
           PYRG +  C  + K    V I+GY  V R+E  + K  V + P+ VAI A   A Q Y +
Sbjct: 228 PYRGVEGRCDPVRKNYKVVSIDGYEDVPRNERALQK-AVAHQPVCVAIEASGRAFQLYSS 286

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV      F     E + H V++VGYG      +   V YWI++NSWG  WGE GY ++ 
Sbjct: 287 GV------FTGECGEEVDHGVVVVGYG------SEDGVDYWIVRNSWGTKWGENGYVKME 334

Query: 332 RGD-----GSCGI 339
           R       G CGI
Sbjct: 335 RNVKKSHLGKCGI 347


>gi|334265690|ref|YP_004376219.1| cathepsin [Clostera anachoreta granulovirus]
 gi|315451014|gb|ADU24593.1| cathepsin [Clostera anachoreta granulovirus]
 gi|327553705|gb|AEB00299.1| cathepsin [Clostera anachoreta granulovirus]
          Length = 332

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 122/322 (37%), Positives = 172/322 (53%), Gaps = 29/322 (9%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
            LF  F+   NKTY++  E   R  IF  NL  I   ++ E     + +N +SDL   + 
Sbjct: 27  TLFEEFVTNFNKTYSSQDEKLIRYEIFKKNLALINN-KNMESKHATFDINIYSDLHKNDL 85

Query: 100 QAK----YLGFKLKPSYAD---RSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCG 148
             +     +G K  P +     R     +    P+  LP  FDWR  + VT VKDQ  CG
Sbjct: 86  LHRTTGLRIGLKKNPLFKAITFRECGVQVIGDEPHALLPETFDWRLRNGVTSVKDQLQCG 145

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
           + WAFS  GNIE ++  K    + LSEQ L++CD  ++GC+GG +  A + I+ +  GGL
Sbjct: 146 ACWAFSALGNIESLHKIKYGVELDLSEQHLVNCDPLNNGCDGGLMHWALENILYE--GGL 203

Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYALQ 267
             E+  PY G D  C+   K     I+G    V ++E  + + LV NGP++VAI+   + 
Sbjct: 204 VAERDEPYFGYDAVCK--PKRLSSTISGCTRFVLQNENRLRELLVVNGPVSVAIDVIDVI 261

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
            Y  G++      C   N  L+H+VL+VGYGVD        VPYWI+KNSWGE WGE G+
Sbjct: 262 DYKEGIAD----MCHNKN-GLNHAVLLVGYGVDND------VPYWILKNSWGENWGENGF 310

Query: 328 FRLYRGDGSCGI-NDYVRSALV 348
           FR+ R   SCGI N+Y  SA++
Sbjct: 311 FRVQRNVNSCGIMNEYASSAIL 332


>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
          Length = 466

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 120/307 (39%), Positives = 168/307 (54%), Gaps = 20/307 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+++++   + YA+  EY  R  ++  NLR +    +  H S    +  ++DLS  E+++
Sbjct: 40  FDFWVQTLKRAYASAEEYERRFDVWLDNLRFVHEY-NAGHTSHWLSMGVYADLSQDEYRS 98

Query: 102 KYLGFK--LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
           K LG+   L      R+ P +      P+  DW    AVT VK+Q +CGS WAFSTTG +
Sbjct: 99  KALGYNADLHEERPLRAAPFLYEGTVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAV 158

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           EG  A  T KL SLSEQ L+DCD+E D+GC GG +  AF+ IM    GG++ E  YPY  
Sbjct: 159 EGASAIATGKLASLSEQMLVDCDRERDNGCHGGLMDFAFEFIMKN--GGIDTEDDYPYTA 216

Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSH 275
           ++  C+ NK +   V I+ Y  V  ++       V N P++VAI A   A Q Y  GV  
Sbjct: 217 EEGMCQDNKMRRHVVTIDDYQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGV-- 274

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
              F  + G   L H VL+VGYG       H  +PYW++KNSWG  WG+KGY RL R   
Sbjct: 275 ---FDAECGTA-LDHGVLVVGYGTASNGTHH--LPYWLVKNSWGAEWGDKGYIRLLRNLG 328

Query: 334 -DGSCGI 339
            +G CG+
Sbjct: 329 EEGQCGV 335


>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
          Length = 339

 Score =  195 bits (495), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 128/315 (40%), Positives = 171/315 (54%), Gaps = 28/315 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAE 98
           +N F  QH K Y +  E   RL I+  N  KI +  Q  + G   Y L  N+++DL   E
Sbjct: 27  WNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLHEE 86

Query: 99  FQAKYLGFK-------LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSS 150
           F     GF        LK    +  V  + P N+ +P   DWR+  AVT VKDQ  CGS 
Sbjct: 87  FVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCGSC 146

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGL 208
           W+FS TG +EG +  KT KLVSLSEQ L+DC  +  ++GC GG +  AF  I  K  GG+
Sbjct: 147 WSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYI--KDNGGI 204

Query: 209 EEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--A 265
           + EK+YPY   D  C  N KA      GYV + + DE  + K L   GP+++AI+A   +
Sbjct: 205 DTEKSYPYEAIDDTCHFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHES 264

Query: 266 LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEK 325
            QFY  GV +  Q  CD  +ENL H VL VGYG      + +   YW++KNSWG  WG++
Sbjct: 265 FQFYSEGVYYEPQ--CD--SENLDHGVLAVGYGT-----SEEGEDYWLVKNSWGTTWGDQ 315

Query: 326 GYFRLYRG-DGSCGI 339
           GY ++ R  D  CG+
Sbjct: 316 GYVKMARNHDNHCGV 330


>gi|33333710|gb|AAQ11973.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 124/306 (40%), Positives = 163/306 (53%), Gaps = 20/306 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
           F   H KTY +LVE   R  +F  NL  IQ      E G   +   + +F+D++  EF  
Sbjct: 26  FKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEFLD 85

Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
             K  G    PS A         ++    A DWRE  AVT VKDQ  CGS WAFS  G I
Sbjct: 86  LLKLQGVPALPSNAVHFDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           EG +  K   LVSLS QEL+DC  ED   +GC+GG +  AFD +  +   G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFVQDE---GIQTEESYPY 202

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
            G   +C+ + +    K+  YV    DE +MA+ +   GP+AVAI A  L FY  G+   
Sbjct: 203 EGRRSSCKKSGEYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
            +  C    E+L+  VL+VGYG      +   V YWI+KNSWG  WGEKGYFRL +   +
Sbjct: 261 -RCRCSNKREDLNPGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313

Query: 337 CGINDY 342
           CGI  Y
Sbjct: 314 CGIGYY 319


>gi|14349349|gb|AAC38833.2| cysteine protease [Leishmania chagasi]
          Length = 353

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 127/360 (35%), Positives = 177/360 (49%), Gaps = 41/360 (11%)

Query: 5   YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
           +FFA V  +   V   S ++   +    +     +A +  F ++H K +    E   R +
Sbjct: 6   FFFAIVVTILFVVCYGSALIA--QTPLGVDDFIASAHYGRFKKRHGKPFGEDAEEGRRFN 63

Query: 65  IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY------------ 112
            F  N++    L      +      +F+DL+  EF   YL     P+Y            
Sbjct: 64  AFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYKEHV 119

Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
             D SV + + ++      DWRE   VT VK+Q MCGS WAF+TTGNIEG +A K   LV
Sbjct: 120 HVDDSVRSGVMSV------DWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWALKNHSLV 173

Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKK 228
           SLSEQ L+ CD  DDGC GG +  A   I++   G +  E +YPY    G    C  N  
Sbjct: 174 SLSEQVLVSCDNIDDGCNGGLMQQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDN-G 232

Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENL 288
               KI GY+S+  DE ++A Y+ +NGP+AVA++A   Q Y  GV       C G   +L
Sbjct: 233 TVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVV----TLCFG--LSL 286

Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           +H VL+VG+            PYWI+KNSWG  WGEKGY RL  G   C + +Y  +A +
Sbjct: 287 NHGVLVVGFN------RQAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCLLKNYAVTATI 340


>gi|398014254|ref|XP_003860318.1| cysteine peptidase A (CBA) [Leishmania donovani]
 gi|13518086|gb|AAK27384.1| cysteine proteinase-like protein [Leishmania donovani]
 gi|322498538|emb|CBZ33611.1| cysteine peptidase A (CBA) [Leishmania donovani]
          Length = 354

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 127/360 (35%), Positives = 177/360 (49%), Gaps = 41/360 (11%)

Query: 5   YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
           +FFA V  +   V   S ++   +    +     +A +  F ++H K +    E   R +
Sbjct: 7   FFFAIVVTILFVVCYGSALIA--QTPLGVDDFIASAHYGRFKKRHGKPFGEDAEEGRRFN 64

Query: 65  IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY------------ 112
            F  N++    L      +      +F+DL+  EF   YL     P+Y            
Sbjct: 65  AFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYKEHV 120

Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
             D SV + + ++      DWRE   VT VK+Q MCGS WAF+TTGNIEG +A K   LV
Sbjct: 121 HVDDSVRSGVMSV------DWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWALKNHSLV 174

Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKK 228
           SLSEQ L+ CD  DDGC GG +  A   I++   G +  E +YPY    G    C  N  
Sbjct: 175 SLSEQVLVSCDNIDDGCNGGLMEQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDN-G 233

Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENL 288
               KI GY+S+  DE ++A Y+ +NGP+AVA++A   Q Y  GV       C G   +L
Sbjct: 234 TVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVV----TLCFG--LSL 287

Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           +H VL+VG+            PYWI+KNSWG  WGEKGY RL  G   C + +Y  +A +
Sbjct: 288 NHGVLVVGFN------RQAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCLLKNYAVTATI 341


>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
 gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
          Length = 455

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 123/313 (39%), Positives = 160/313 (51%), Gaps = 28/313 (8%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           ++  +L +H + Y  L E   R  IF  NL+ I       + S   GLN+F+DLS  E++
Sbjct: 24  IYEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYR 83

Query: 101 AKYLGFKLKPSYADRSVPA-----MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
           + YLG ++         P            LP   DWRE  AV  VKDQ  CGS WAFST
Sbjct: 84  SVYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGSCWAFST 143

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
            G +EG+    T  L SLSEQEL+DCD+  + GC GG +  AFD I+    GG++ E+ Y
Sbjct: 144 VGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIEN--GGIDTEEDY 201

Query: 215 PYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
           PY+  D  C  N+K A  V I+GY  V +++    K  V N P++VAI A     Q Y +
Sbjct: 202 PYKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQLYQS 261

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV      F       L H V+ VGYG      T   V YWI++NSWG  WGE GY R+ 
Sbjct: 262 GV------FTGSCGTQLDHGVVTVGYG------TEHGVDYWIVRNSWGPAWGENGYIRME 309

Query: 332 RG-----DGSCGI 339
           R       G CGI
Sbjct: 310 RDVASTETGKCGI 322


>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
          Length = 350

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 131/329 (39%), Positives = 171/329 (51%), Gaps = 28/329 (8%)

Query: 22  FMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           F +VG   E L  +   K   LF  ++ +H K Y  + E   R  IF  NL+ I      
Sbjct: 28  FSIVGYSSEDLKSMD--KLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKV 85

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI-PNITLPRAFDWREYDAV 138
                + GL+EF+DLS  EF  KYLG K+  S    S       ++ LP++ DWR+  AV
Sbjct: 86  VSNYWL-GLSEFADLSHREFNNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAV 144

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAF 197
             VK+Q  CGS WAFST   +EG+    T  L SLSEQELIDCD+  ++GC GG +  AF
Sbjct: 145 APVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAF 204

Query: 198 DTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGP 256
             I+    GGL +E+ YPY  ++ AC + K+ TQ V I+GY  V ++        + N P
Sbjct: 205 SFIVEN--GGLHKEEDYPYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQP 262

Query: 257 MAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWII 314
           ++VAI A     QFY  GV      F      +L H V  VGYG      T K V Y  +
Sbjct: 263 LSVAIEASGRDFQFYSGGV------FDGHCGSDLDHGVAAVGYG------TAKGVDYITV 310

Query: 315 KNSWGEGWGEKGYFRLYRG----DGSCGI 339
           KNSWG  WGEKGY R+ R     +G CGI
Sbjct: 311 KNSWGSKWGEKGYIRMRRNIGKPEGICGI 339


>gi|15824704|gb|AAL09448.1| cysteine protease [Leishmania donovani]
          Length = 353

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 127/360 (35%), Positives = 177/360 (49%), Gaps = 41/360 (11%)

Query: 5   YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
           +FFA V  +   V   S ++   +    +     +A +  F ++H K +    E   R +
Sbjct: 6   FFFAIVVTILFVVCYGSALIA--QTPLGVDDFIASAHYGRFKKRHGKPFGEDAEEGRRFN 63

Query: 65  IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSY------------ 112
            F  N++    L      +      +F+DL+  EF   YL     P+Y            
Sbjct: 64  AFKQNMQTAYFLNAHNPHAHYDVSGKFADLTPQEFAKLYL----NPNYYARHGKDYKEHV 119

Query: 113 -ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
             D SV + + ++      DWRE   VT VK+Q MCGS WAF+TTGNIEG +A K   LV
Sbjct: 120 HVDDSVRSGVMSV------DWREKGVVTPVKNQGMCGSCWAFATTGNIEGQWALKNHSLV 173

Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKK 228
           SLSEQ L+ CD  DDGC GG +  A   I++   G +  E +YPY    G    C  N  
Sbjct: 174 SLSEQVLVSCDNIDDGCNGGLMEQAMQWIINDHNGTVPTEDSYPYTSAGGTRPPCHDN-G 232

Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENL 288
               KI GY+S+  DE ++A Y+ +NGP+AVA++A   Q Y  GV       C G   +L
Sbjct: 233 TVGAKIAGYMSLPHDEEEIAAYVGKNGPVAVAVDATTWQLYFGGVV----TLCFG--LSL 286

Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           +H VL+VG+            PYWI+KNSWG  WGEKGY RL  G   C + +Y  +A +
Sbjct: 287 NHGVLVVGFN------RQAKPPYWIVKNSWGSSWGEKGYIRLAMGSNQCLLKNYAVTATI 340


>gi|343472970|emb|CCD15012.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 382

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 119/336 (35%), Positives = 177/336 (52%), Gaps = 24/336 (7%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + L    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N+ +
Sbjct: 14  VGLHAVAACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYRDATEEAFRFRVFKQNMER 71

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PR 128
            +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  + N++    P 
Sbjct: 72  AKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGKAPP 128

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           A DWR+  AVT VKDQ  C SSWAFS  GNIEG +     +L SLSEQ L+ CD  D GC
Sbjct: 129 AIDWRKKGAVTPVKDQGQCDSSWAFSAIGNIEGQWKVAGHELTSLSEQMLVSCDTNDFGC 188

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDET 245
            GG    AF  I+S   G +  E++YPY    G+   C  + K    KI   V + RDE 
Sbjct: 189 GGGFSDPAFKWIVSSNKGNVFTEQSYPYASGGGNVPTCDKSGKVVGAKIRDRVDLPRDEN 248

Query: 246 DMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            +A++L +NGP+A+A++A + Q Y  GV           ++ ++ +VL+VGY  D +K  
Sbjct: 249 AIAEWLAKNGPVAIAVDATSFQSYTGGV------LTSCISKEMNSAVLLVGYD-DTSK-- 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGIND 341
               PYWIIKNSW +GWGEKGY R+ +G   C + +
Sbjct: 300 ---PPYWIIKNSWSKGWGEKGYIRIEKGTNQCLVKN 332


>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
 gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
          Length = 381

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 122/311 (39%), Positives = 164/311 (52%), Gaps = 30/311 (9%)

Query: 47  EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYGLNEFSDLSTAEFQAKYLG 105
           + H++ +    E   R   F  N+R I       +  S    LN F D+   EF++ +  
Sbjct: 50  QTHHRVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGDMGPEEFRSTFAD 109

Query: 106 FKL-------KPSYADRSVPAMIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
            ++       + S A  +VP  + +    +PR+ DWR++ AVT VK+Q  CGS WAFST 
Sbjct: 110 SRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHGAVTAVKNQGRCGSCWAFSTV 169

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
             +EG+ A +T  LVSLSEQEL+DCD  ++GC+GG + NAFD I S   GG+  E  YPY
Sbjct: 170 VAVEGINAIRTGSLVSLSEQELVDCDTAENGCQGGLMENAFDFIKSY--GGITTESAYPY 227

Query: 217 RGDDKAC---RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
           R  +  C   R  +    V I+G+  V     D     V   P++VAI+A   A QFY  
Sbjct: 228 RASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVSVAIDAGGQAFQFYSE 287

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV     F  D G + L H V +VGYGV     T    PYWI+KNSWG  WGE GY R+ 
Sbjct: 288 GV-----FTGDCGTD-LDHGVAVVGYGVSDVDGT----PYWIVKNSWGPSWGEGGYIRMQ 337

Query: 332 RGDGS---CGI 339
           RG G+   CGI
Sbjct: 338 RGAGNGGLCGI 348


>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
          Length = 337

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 109/294 (37%), Positives = 157/294 (53%), Gaps = 17/294 (5%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
           G  +  SY   S    + +  +P   DWRE  AVT VK+Q  CG  WAFS  G++EG Y 
Sbjct: 102 GLNIPNSYLSPSPINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYK 161

Query: 165 AKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
             T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y G    CR
Sbjct: 162 IATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYLGQQYTCR 219

Query: 225 LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDG 283
             +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G         DG
Sbjct: 220 SQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-------DG 271

Query: 284 GNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
              N ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G+
Sbjct: 272 SCANRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGEDGFMKIIRDSGN 320


>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
          Length = 337

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 109/294 (37%), Positives = 157/294 (53%), Gaps = 17/294 (5%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
           G  +  SY   S    + +  +P   DWRE  AVT VK+Q  CG  WAFS  G++EG Y 
Sbjct: 102 GLNIPNSYLSPSPINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYK 161

Query: 165 AKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
             T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y G    CR
Sbjct: 162 IATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYLGQQYTCR 219

Query: 225 LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDG 283
             +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G         DG
Sbjct: 220 SQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-------DG 271

Query: 284 GNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
              N ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G+
Sbjct: 272 SCANRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGEDGFMKIIRDSGN 320


>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
           [Tribolium castaneum]
 gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
          Length = 337

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 125/344 (36%), Positives = 184/344 (53%), Gaps = 30/344 (8%)

Query: 21  SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTE 80
           +  VVG + +     V+    +  F   H K Y +  E   R+ IF  N  K+    +  
Sbjct: 8   ALCVVGSQAVSFFDLVQEQ--WGAFKVTHKKQYESETEERFRMKIFMENAHKV-AKHNKL 64

Query: 81  HGSGV----YGLNEFSDLSTAEFQAKYLGFK-----LKPSYADRSVPAMIP-NITLPRAF 130
           +  G+     G+N++SD+   EF     G+      L+    D S+  + P N+ LP+  
Sbjct: 65  YAQGLVSFKLGVNKYSDMLNHEFVHTLNGYNRSKTPLRSGELDESITFIPPANVELPKQI 124

Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGC 188
           DWR+  AVT VKDQ  CGS W+FSTTG++EG +  K+KKLVSLSEQ LIDC ++  ++GC
Sbjct: 125 DWRKLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCSEKYGNNGC 184

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDM 247
            GG + NAF  I  K  GG++ E++YPY+ +D+ C    +       G+V + S DE  +
Sbjct: 185 NGGLMDNAFRYI--KDNGGIDTEQSYPYKAEDEKCHYKPRNKGATDRGFVDIESGDEEKL 242

Query: 248 AKYLVENGPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
              +   GP++VAI+A     Q Y  GV +  +  C   +E L H VL+VGYG D     
Sbjct: 243 KAAVATVGPISVAIDASHPTFQQYSEGVYYEPE--C--SSEQLDHGVLVVGYGTDED--- 295

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
                YW++KNSWG+ WG++GY ++ R  D +CGI       LV
Sbjct: 296 --GNDYWLVKNSWGDSWGDQGYIKMARNRDNNCGIATQASYPLV 337


>gi|13124011|sp|Q9YWK4.1|CATV_NPVBS RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
           proteinase; Short=CP; Flags: Precursor
 gi|3882976|gb|AAC77812.1| cathepsin [Buzura suppressaria NPV]
          Length = 331

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 116/311 (37%), Positives = 169/311 (54%), Gaps = 19/311 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  FL  +NK Y    E   R  IF   L +I   ++  + S VY +N+F+DLS  E  +
Sbjct: 31  FETFLANYNKMYNDTSEKERRFSIFQQTLEEINY-KNRLNDSAVYQINKFADLSKNEIIS 89

Query: 102 KYLGFKLKPSYADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
           KY G  +     +     +I  P    P  FDWR+ + VT +K+Q  CG+ WAF+T  +I
Sbjct: 90  KYTGLNMPVQTTNFCKTIVIDQPPGKGPLNFDWRQQNKVTSIKNQKACGACWAFATLASI 149

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           E  YA K    + LSEQ++IDCD  D GC+GG +  AF+ ++    G L +E  YPY G 
Sbjct: 150 ESQYAIKNNVHIDLSEQQMIDCDYVDMGCDGGLLHTAFEQMIQM--GELVQEHEYPYAGV 207

Query: 220 DKACRLNKKAT-QVKING-YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           +K C L    T  VK+ G Y  V   E  +   L   GP+ +AI+A  +  Y  G+ H  
Sbjct: 208 NKPCELRGDETGVVKVKGCYRYVVFREEKLKDLLRAVGPIPMAIDASGIVNYHHGIIH-- 265

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
             +C+  N  L+H+VL+VGYGV+        VP+W  KN+WG+ WGE+GYFR+ +   +C
Sbjct: 266 --YCE--NYGLNHAVLLVGYGVENN------VPFWTFKNTWGKDWGEEGYFRVRQNVDAC 315

Query: 338 GINDYVRSALV 348
           G+ + + S+ V
Sbjct: 316 GMTNELASSAV 326


>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 445

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 125/317 (39%), Positives = 167/317 (52%), Gaps = 26/317 (8%)

Query: 34  HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
            + +   +F  +L +++K Y  L E   R  IF  NL+ +Q      + S   GL  F+D
Sbjct: 29  RNPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFAD 88

Query: 94  LSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSW 151
           L+  EF+A YL  K++ +         + N+   LP   DWR   AV  VKDQ  CGS W
Sbjct: 89  LTNEEFRAIYLRSKMERTRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKDQGSCGSCW 148

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEE 210
           AFS  G +EG+   KT +LVSLSEQEL+DCD   ++GC GG +  AF  I+S   GG++ 
Sbjct: 149 AFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFIISN--GGIDT 206

Query: 211 EKTYPYRG-DDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YAL 266
           E+ YPY   DD  C  +KK T+ V I+GY  V  +E  + K L  N P++VAI A     
Sbjct: 207 EEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPENENSLKKALA-NQPISVAIEAGGRGF 265

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           Q Y +GV      F       L H V+ VGYG      T +   YWII+NSWG  WGE G
Sbjct: 266 QLYKSGV------FTGTCGTALDHGVVAVGYG------TSEGQDYWIIRNSWGSNWGESG 313

Query: 327 YFRLYRG----DGSCGI 339
           Y +L R      G CG+
Sbjct: 314 YIKLQRNIKDSSGKCGV 330


>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
 gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
          Length = 463

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 132/326 (40%), Positives = 174/326 (53%), Gaps = 37/326 (11%)

Query: 38  HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG--------LN 89
           + ALF+ +  +H K YAT  E  +RL +F+ N   +       + +G  G        LN
Sbjct: 37  YEALFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALN 96

Query: 90  EFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI------TLPRAFDWREYDAVTGVKD 143
            F+DL+  EF+A  LG     + A RS  A +          +P A DWRE  AVT VKD
Sbjct: 97  AFADLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKD 156

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMS 202
           Q  CG+ W+FS TG +EG+   KT  LVSLSEQELIDCD+  + GC GG +  A+  ++ 
Sbjct: 157 QGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVK 216

Query: 203 KLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI 261
              GG++ E+ YPYR  D  C  NK K   V I+GY  V  ++ D+    V   P++V I
Sbjct: 217 N--GGIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGI 274

Query: 262 --NAYALQFYVTGVSHPIQFFCDGGNE-NLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
             +A A Q Y        Q   DG    +L H+VLIVGYG +  K       YWI+KNSW
Sbjct: 275 CGSARAFQLYSQ------QGIFDGPCPTSLDHAVLIVGYGSEGGK------DYWIVKNSW 322

Query: 319 GEGWGEKGYFRLYR--GD--GSCGIN 340
           GE WG KGY  ++R  GD  G CGIN
Sbjct: 323 GESWGMKGYMHMHRNTGDSKGVCGIN 348


>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
          Length = 340

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 116/305 (38%), Positives = 157/305 (51%), Gaps = 27/305 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ QH + Y    E   R  IF  N+ +I+      H   + G+N+F+DL+  EF+ +  
Sbjct: 44  WMAQHGRVYKNAAEKAHRFEIFRANVERIESFNAENHKFKL-GVNQFADLTNEEFKTRNT 102

Query: 105 GFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
              LKPS    +      N+T +P   DWR   AVT +KDQ  CGS WAFS     EG+ 
Sbjct: 103 ---LKPSKMASTKSFKYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGIT 159

Query: 164 AAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
              T KL+SLSEQE++DCD   +D GC GG + +AF+ I+     G+  E  YPY+  D 
Sbjct: 160 KLSTGKLISLSEQEVVDCDVTSDDQGCNGGEMDDAFEYIIKN--KGITTEANYPYKAADG 217

Query: 222 ACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQ 278
            C   K A+    I GY  V+ +          N P+AVAI+A  +A Q Y +GV     
Sbjct: 218 TCNTKKAASHAASITGYEDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGV----- 272

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
           F  D G + L H V +VGYG      T     YW++KNSWG  WGE GY R+ R     +
Sbjct: 273 FTGDCGTD-LDHGVTLVGYGA-----TSDGTKYWLVKNSWGTSWGEDGYIRMERDVDAKE 326

Query: 335 GSCGI 339
           G CGI
Sbjct: 327 GLCGI 331


>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
 gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
          Length = 341

 Score =  194 bits (494), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 123/320 (38%), Positives = 159/320 (49%), Gaps = 25/320 (7%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
             LH          ++ ++ + Y    E   R  IF  N+  I+      +      +NE
Sbjct: 27  RSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEFIESFNKPGNRPYKLDINE 86

Query: 91  FSDLSTAEFQAKYLGFKLKPSYA-DRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCG 148
           F+DL+  EF+A   G+K   +            N+T +P + DWR+  AVT +KDQ  CG
Sbjct: 87  FADLTNEEFKASRNGYKRSSNVGLSEKSSFRYGNVTAVPTSMDWRQKGAVTPIKDQGQCG 146

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGG 206
             WAFS    +EG+    T KL+SLSEQEL+DCD   ED GCEGG + +AF+ I  K  G
Sbjct: 147 CCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFI--KQNG 204

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA-- 263
           GL  E  YPY+G D  C  NK      KI GY  V  +  D     V + P++VAI+A  
Sbjct: 205 GLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVASQPVSVAIDASG 264

Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
            A QFY  GV     F  D G E L H V  VGYG      T     YW++KNSWG  WG
Sbjct: 265 SAFQFYSGGV-----FTGDCGTE-LDHGVTAVGYG------TSDGTKYWLVKNSWGTSWG 312

Query: 324 EKGYFRLYRG----DGSCGI 339
           E GY R+ R     +G CGI
Sbjct: 313 EDGYIRMERDIEAKEGLCGI 332


>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
          Length = 422

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 124/317 (39%), Positives = 169/317 (53%), Gaps = 34/317 (10%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           +L+  +L +H K Y  L E   R  IF  NLR I    + ++ +   GLN F+DL+  E+
Sbjct: 2   SLYEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDD-HNADNRTYKLGLNRFADLTNEEY 60

Query: 100 QAKYLGFKLKP--------SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
           +A+YLG ++ P        + ++R  P +  N  LP + DWR   AV  VKDQ  CGS W
Sbjct: 61  RARYLGTRIDPNRRFVKTKTQSNRYAPRVGDN--LPESVDWRNESAVLPVKDQGNCGSCW 118

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEE 210
           AFST G +EG+    T  L+SLSEQEL+DCD   + GC GG +  A++ I++   GG++ 
Sbjct: 119 AFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINN--GGIDS 176

Query: 211 EKTYPYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQ 267
           E+ YPYR  D  C +  K A  V I+ Y  V  ++    K  V N P++VAI       Q
Sbjct: 177 EEDYPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQ 236

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
            YV+GV      F       L H V+ VGYG      + K   YWI++NSWG  WGE+GY
Sbjct: 237 LYVSGV------FTGRCGTALDHGVVAVGYG------SVKGHDYWIVRNSWGASWGEEGY 284

Query: 328 FRLYRG-----DGSCGI 339
            RL R       G CGI
Sbjct: 285 VRLERNLAKSRSGKCGI 301


>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 326

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 123/306 (40%), Positives = 162/306 (52%), Gaps = 20/306 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQ- 100
           F   H KTY ++VE   R  +F  NL  IQ      E G   +   + +F+D++  EF  
Sbjct: 26  FKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEFLD 85

Query: 101 -AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
             K  G    PS A         ++    A DWRE  AVT VKDQ  CGS WAFS  G I
Sbjct: 86  LLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCWAFSAVGAI 145

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQED---DGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           EG +  K   LVSLS QEL+DC  E+   +GC GG +  AFD +  +   G++ E++YPY
Sbjct: 146 EGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFVQDE---GIQTEESYPY 202

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHP 276
            G   +C+ +      K+  YV    DE +MA+ +   GP+AVAI A  L FY  G+   
Sbjct: 203 EGRRSSCKKSGDYV-TKVKTYV-FPLDEQEMARTVAAKGPVAVAIEASQLSFYDKGIVDE 260

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
            +  C    E+L+H VL+VGYG      +   V YWI+KNSWG  WGEKGYFRL +   +
Sbjct: 261 -KCRCSNKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKA 313

Query: 337 CGINDY 342
           CGI  Y
Sbjct: 314 CGIGYY 319


>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
          Length = 349

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 121/321 (37%), Positives = 164/321 (51%), Gaps = 41/321 (12%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           A  + ++  H K Y  L E   R  IF  N+ +I+     E      G N+FSDL+  EF
Sbjct: 40  ARHDQWIVHHEKVYKDLNEKEVRFQIFKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEF 99

Query: 100 QAKYLGFKLKPSYADRSVPAMIP-----------NIT-LPRAFDWREYDAVTGVKDQTMC 147
           +  + G+K       RS P ++            N+T +P   DWR+  AVT +KDQ  C
Sbjct: 100 RVLHTGYK-------RSHPKVMTSSKGKTHFRYTNVTDIPPTMDWRKKGAVTPIKDQKEC 152

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLG 205
           G  WAFS    +EG++  KT +L+ LSEQEL+DCD   ED+GC GG +  AFD I+    
Sbjct: 153 GCCWAFSAVAAMEGLHQLKTGELIPLSEQELVDCDVEGEDEGCSGGLLDTAFDFILKN-- 210

Query: 206 GGLEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAIN-- 262
            GL  E  YPY+G+D  C   K A +  KI GY  V  +        V N P++VAI+  
Sbjct: 211 KGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYEDVPANSEKALLQAVANQPVSVAIDGS 270

Query: 263 AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGW 322
           ++  QFY +GV      F    +  L+H+V  VGYG      T     YWIIKNSWG  W
Sbjct: 271 SFDFQFYSSGV------FSGSCSTWLNHAVTAVGYGA-----TTDGTKYWIIKNSWGSKW 319

Query: 323 GEKGYFRLYRG----DGSCGI 339
           G+ GY R+ R     +G CG+
Sbjct: 320 GDSGYMRIKRDVHEKEGLCGL 340


>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
 gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
          Length = 325

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 120/307 (39%), Positives = 168/307 (54%), Gaps = 22/307 (7%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H KTY T  E   R  I++ NL  ++   + E+ S    +N F+DL+  EF+ +++G++ 
Sbjct: 34  HGKTY-TGEEEDLRRAIWNDNLEIVKK-HNAENHSYKLDMNHFADLTVTEFKQRFMGYRA 91

Query: 109 KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTK 168
             +    S    + N+ LP   DWR+   VT VK+Q  CGS WAFS+TG++EG +  KT 
Sbjct: 92  ASNSTGGSTFLPLSNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTG 151

Query: 169 KLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLN 226
           KLVSLSEQ L+DC ++  ++GCEGG +  AF  I  K   G++ E++YPY   D  C   
Sbjct: 152 KLVSLSEQNLVDCSKKYGNNGCEGGLMDYAFKYI--KNNDGIDTEQSYPYTARDGQCHFK 209

Query: 227 KKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCD 282
             +    + GY  V R  E D+   +   GP++VAI+A   + Q Y TGV S P     D
Sbjct: 210 PGSVGATVTGYTDVQRGSEGDLQSAVATVGPISVAIDAGHSSFQLYKTGVYSEP-----D 264

Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIND 341
             +  L H VL VGYG +  K       YW++KNSWGEGWG  GY ++ R  D  CGI  
Sbjct: 265 CSSTQLDHGVLAVGYGAEDGK------DYWLVKNSWGEGWGMNGYIKMSRNKDNQCGIAT 318

Query: 342 YVRSALV 348
                LV
Sbjct: 319 QASYPLV 325


>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
          Length = 469

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 132/358 (36%), Positives = 181/358 (50%), Gaps = 38/358 (10%)

Query: 1   MSCFYFFAGVALLSL-TVSVSSFMVVGDEKLHHLHHVKHT-----ALFNYFLEQHNKTYA 54
           M+ F F     LL L + S     ++G ++ H       T     A++  +L +H K+Y 
Sbjct: 10  MAVFLFL----LLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYN 65

Query: 55  TLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKP 110
            L E   R  IF  NLR I    + E+ +   GLN F+DL+  E+++ YLG     K + 
Sbjct: 66  ALGEKERRFQIFKDNLRFIDE-HNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRRS 124

Query: 111 SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
           S       A     +LP + DWR+  AV  VKDQ  CGS WAFST   +EG+    T  L
Sbjct: 125 SNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGL 184

Query: 171 VSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKK 228
           +SLSEQEL+DCD   ++GC GG +  AF+ I++   GG++ E+ YPY+  D  C +  K 
Sbjct: 185 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDSEEDYPYKASDGRCDQYRKN 242

Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNE 286
           A  V I+GY  V  ++    +  V N P++VAI A     Q Y +G+      F      
Sbjct: 243 AXVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGI------FTGRCGT 296

Query: 287 NLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-----GDGSCGI 339
            L H V  VGYG      T   V YWI+KNSWG  WGE+GY R+ R       G CGI
Sbjct: 297 ALDHGVTAVGYG------TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGI 348


>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
 gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
          Length = 305

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 114/302 (37%), Positives = 159/302 (52%), Gaps = 20/302 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ QH + Y  + E   R  IF  N+ +I+   +        G+N+F+DL+  EF+A Y 
Sbjct: 8   WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYH 67

Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
           G+K + S    S         +P + DWR   AVT VKDQ  CG  WAFST   IEG+  
Sbjct: 68  GYKRQSSKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIK 127

Query: 165 AKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
            +T  L+SLSEQ+L+DC   + GC+GG +  AF  I+    GGL  E  YPY+G D  C 
Sbjct: 128 LQTGNLISLSEQQLVDCTAGNKGCQGGLMDTAFQYIIRN--GGLTSEDNYPYQGVDGTCS 185

Query: 225 LNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFC 281
             K A T+ +I GY  V ++  +     V   P++VA++      +FY +GV     F  
Sbjct: 186 SEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFYKSGV-----FEG 240

Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSC 337
           D G  NL+H V  +GYG D          YW++KNSWG  WGE GY R+ RG    +G C
Sbjct: 241 DCGT-NLNHGVTAIGYGTD-----SDGTDYWLVKNSWGTSWGESGYTRMQRGIGASEGLC 294

Query: 338 GI 339
           G+
Sbjct: 295 GV 296


>gi|261289811|ref|XP_002611767.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
 gi|229297139|gb|EEN67777.1| hypothetical protein BRAFLDRAFT_284308 [Branchiostoma floridae]
          Length = 336

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 129/333 (38%), Positives = 176/333 (52%), Gaps = 34/333 (10%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
           + H   +  +  QH K Y T  E YSR   F  N  KI      EH         S    
Sbjct: 18  LPHNKEWEMWKLQHGKQYETEAEEYSRRFTFEKNTIKI-----AEHNIRASLGMHSYTLA 72

Query: 88  LNEFSDLSTAEFQAKYLGFKLKPSYADR-----SVPAMIPNITLPRAFDWREYDAVTGVK 142
           +N+F D+   EF  + +G  LK    ++      V     N TLP++ DWR    V+ VK
Sbjct: 73  MNKFGDMHHEEFHQRIMGGCLKIVKVNKPLLGSEVGDNDDNGTLPKSVDWRNSAMVSEVK 132

Query: 143 DQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTI 200
           DQ  CGS WAFSTTG++EG +A KT KLV LSEQ+L+DC ++  + GC GG +  AF  I
Sbjct: 133 DQGECGSCWAFSTTGSLEGQHANKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI 192

Query: 201 MSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMA 258
             K  GGL+ E++YPY   DDK C+ +  +    + GY  V S +E  + + +   GP++
Sbjct: 193 --KANGGLDTEESYPYTATDDKPCKFDNSSVGATLIGYKDVKSGNEHALKRAVATVGPIS 250

Query: 259 VAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
           VAI+A   + QFY +GV    Q  C   +E L H VL+VGYG      +H+A  +WI+KN
Sbjct: 251 VAIDAGHESFQFYSSGVYDEPQ--CS--SEQLDHGVLVVGYGA-MNDNSHQA--FWIVKN 303

Query: 317 SWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           SWG  WG++GY  + R  D  CGI       LV
Sbjct: 304 SWGPNWGDQGYIMMSRNKDNQCGIATSASYPLV 336


>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 355

 Score =  194 bits (493), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 131/354 (37%), Positives = 183/354 (51%), Gaps = 30/354 (8%)

Query: 1   MSCFYFFAGVALLSLTVSV--SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVE 58
           ++ F     ++  +L  S     F +VG          K   LF  ++ +H+K Y ++ E
Sbjct: 8   LTKFSLLVAISASALLCSALARDFSIVGYTPEQLTSTEKLLELFESWMSEHSKVYKSVEE 67

Query: 59  YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
              R  +F  NL  I   ++ E  S   GLNEF+DL+  EF+ +YLG   KP ++ +  P
Sbjct: 68  KVHRFEVFRENLMHIDQ-RNNEINSYWLGLNEFADLTHEEFKGRYLGLA-KPQFSRKRQP 125

Query: 119 AM---IPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLS 174
           +      +IT LP++ DWR+  AV  VKDQ  CGS WAFST   +EG+    T  L SLS
Sbjct: 126 SANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLS 185

Query: 175 EQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQV 232
           EQELIDCD   + GC GG +  AF  I+S   GGL +E  YPY  ++  C+  K+   +V
Sbjct: 186 EQELIDCDTTFNSGCNGGLMDYAFQYIIST--GGLHKEDDYPYLMEEGICQEQKEDVERV 243

Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSH 290
            I+GY  V  ++ +     + + P++VAI A     QFY  GV      F      +L H
Sbjct: 244 TISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGV------FNGQCGTDLDH 297

Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
            V  VGYG      + K   Y I+KNSWG  WGEKG+ R+ R     +G CGIN
Sbjct: 298 GVAAVGYG------SSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGIN 345


>gi|432910512|ref|XP_004078392.1| PREDICTED: cathepsin K-like [Oryzias latipes]
          Length = 331

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 130/337 (38%), Positives = 177/337 (52%), Gaps = 31/337 (9%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLR 71
           LL L+ SV S M   DE     H       +  +   H K Y T+ E   R  I+  NLR
Sbjct: 8   LLLLSASVMSQM---DETTLDAH-------WEEWKMTHTKEYITVEEEGIRRAIWEKNLR 57

Query: 72  KIQLL-QDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN--ITL 126
            I+   Q+   G   Y  G+N+F D++  E   +  G ++ P   +  VP       I L
Sbjct: 58  MIEAHNQEAALGMHTYTLGMNQFGDMTQEEVVERMTGLQM-PLNPEPRVPMETDGSLIKL 116

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P++ D+R+   VT VK+Q  CGS WAFS+ G +EG  A KT  LV LS Q L+DC  E+D
Sbjct: 117 PKSVDYRKKGMVTSVKNQGSCGSCWAFSSVGALEGQLAKKTGNLVDLSPQNLVDCVTEND 176

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DET 245
           GC GG ++NAF  +     GG++ E  YPY G+D+ CR N      +I GY  V   DE 
Sbjct: 177 GCGGGYMTNAFKYVQEN--GGIDSEAAYPYMGEDQPCRYNVSGLAAQIKGYKEVPEGDEH 234

Query: 246 DMAKYLVENGPMAVAINAYALQF--YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
            +A  L + GP++V I+A    F  Y  G    I F  +   E+++H+VL VGYGV+   
Sbjct: 235 ALAVALFKAGPVSVGIDASQNSFLYYQKG----IYFDRNCNKEDINHAVLAVGYGVNA-- 288

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS-CGI 339
              K   +WI+KNSWGE WG KGY  + R  G+ CGI
Sbjct: 289 ---KGKKFWIVKNSWGETWGNKGYVLMARNRGNVCGI 322


>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
 gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
          Length = 341

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 124/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
           F  +H K Y    E   RL IF+ N  KI +  Q    G   + L  N+++DL   EF+ 
Sbjct: 32  FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91

Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
              GF      +   AD S   +      ++TLP++ DWR   AVT VKDQ  CGS WAF
Sbjct: 92  LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           S+TG +EG +  K+  LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ E
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 209

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
           K+YPY   D +C  NK        G+  + + DE  MA+ +   GP+AVAI+A   + QF
Sbjct: 210 KSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASHESFQF 269

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV +  Q  CD   +NL H VL+VG+G D +        YW++KNSWG  WG+KG+ 
Sbjct: 270 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFI 320

Query: 329 RLYRG-DGSCGI 339
           ++ R  +  CGI
Sbjct: 321 KMLRNKENQCGI 332


>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
 gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
           Contains: RecName: Full=Cathepsin L heavy chain;
           Contains: RecName: Full=Cathepsin L light chain; Flags:
           Precursor
 gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
          Length = 371

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 123/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
           F  +H K Y    E   RL IF+ N  KI +  Q    G   + L  N+++DL   EF+ 
Sbjct: 62  FKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 121

Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
              GF      +   AD S   +      ++TLP++ DWR   AVT VKDQ  CGS WAF
Sbjct: 122 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 181

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           S+TG +EG +  K+  LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ E
Sbjct: 182 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 239

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
           K+YPY   D +C  NK        G+  + + DE  MA+ +   GP++VAI+A   + QF
Sbjct: 240 KSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 299

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV +  Q  CD   +NL H VL+VG+G D +        YW++KNSWG  WG+KG+ 
Sbjct: 300 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFI 350

Query: 329 RLYRG-DGSCGI 339
           ++ R  +  CGI
Sbjct: 351 KMLRNKENQCGI 362


>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
 gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
          Length = 498

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 137/354 (38%), Positives = 185/354 (52%), Gaps = 38/354 (10%)

Query: 7   FAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYAT-LVEYYSRLHI 65
           F  +AL  L     +  ++    +  L  V+    F  +  QH +TY+    EY  RL +
Sbjct: 5   FLALALAGLVGLSCAHALLSSADMLALAQVEPERAFGLWATQHARTYSEGSPEYTRRLGV 64

Query: 66  FSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFK-----LKPSYADRSVPA 119
           F+ N+R I   +     +G+   LNE++D +  EF AK LG K     LK   A  S  +
Sbjct: 65  FADNVRAIA--EQNRRNTGITLALNEYADETWEEFAAKRLGLKISQEQLKAREARSSSSS 122

Query: 120 MI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSE 175
                   +  P A DWR  +AVT VK+Q  CGS WAFS  G+IEG  A  T +LV+LSE
Sbjct: 123 SSSWRYAQVQTPAAVDWRAKNAVTQVKNQGQCGSCWAFSAVGSIEGANALATGQLVALSE 182

Query: 176 QELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQ 231
           Q+L+DCD   + GC GG + +AF  ++    GG++ E+ Y Y    G    C   K+  +
Sbjct: 183 QQLVDCDTASNMGCSGGLMDDAFKYVLDN--GGIDTEEDYSYWSGYGFGFWCNKRKQTDR 240

Query: 232 --VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDGGNENL 288
             V I+GY  V   E  + K  V   P+AVAI A A +QFY +GV   I   C+G    L
Sbjct: 241 PAVSIDGYEDVPTSEPALLK-AVAGQPVAVAICASANMQFYSSGV---INSCCEG----L 292

Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS---CGI 339
           +H VL VGY       + KA PYWI+KNSWG  WGE+GYFRL  G+G    CGI
Sbjct: 293 NHGVLAVGYDT-----SDKAQPYWIVKNSWGGSWGEQGYFRLKMGEGPKGLCGI 341


>gi|1581745|prf||2117247A Cys protease:ISOTYPE=1
          Length = 467

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 114/315 (36%), Positives = 163/315 (51%), Gaps = 22/315 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL-LQDTEHGSGVYGLNEFSDLSTAEFQ 100
           F  F ++H K Y +  E   RL +F  NL   +L      H S  + +  FSDL+  EF+
Sbjct: 38  FAAFKQRHGKVYGSAAEEAFRLGVFKENLLFARLHAAANPHAS--FAVTPFSDLTREEFR 95

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPR----AFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           ++Y       + A + V   +           A DWR   AVT +KDQ  C S WAFST 
Sbjct: 96  SRYHNAAAHFAAAQKRVRVPVEVEVEVGGPPAAVDWRARGAVTAIKDQGNCSSCWAFSTI 155

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GNIEG +      L  LSEQ L+ CD  D+GC+GG + +AFD I+ +  G +  E +Y Y
Sbjct: 156 GNIEGQWHLAGNPLTGLSEQMLVSCDNADNGCDGGLMDSAFDWIVEQNNGSVYTEASYSY 215

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               GD + C ++       I+G+V + +DE  MA +L  NGP+A+A++A +   Y  GV
Sbjct: 216 VSGGGDSQTCDMSDHVVGAVISGHVDLPQDEDKMAAWLAVNGPLAIAVDATSFMSYTGGV 275

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                   +  ++ L H V++VGY            PYWIIKNSWG  WGE+GY R+ +G
Sbjct: 276 ------LTNCVSDQLDHGVVLVGYNDSSNP------PYWIIKNSWGADWGEEGYIRIQKG 323

Query: 334 DGSCGINDYVRSALV 348
              C + +Y  SA+V
Sbjct: 324 TNQCLVKNYACSAVV 338


>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
          Length = 501

 Score =  194 bits (493), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 130/352 (36%), Positives = 191/352 (54%), Gaps = 33/352 (9%)

Query: 4   FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
            + +A +A LS ++  + F + G+E       V+   LF+ + E+H + Y    E   R 
Sbjct: 12  LFIWASLACLSSSLP-TEFYITGEE-FASEERVRE--LFHLWKERHKRVYKHAEETAKRF 67

Query: 64  HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPS-----YADRSVP 118
            IF  NL+ + + ++++      G+N+F+D+S  EF+ KYL    KP      Y  RS+ 
Sbjct: 68  EIFKENLKYV-IERNSKGHRHTLGMNKFADMSNEEFKEKYLSKIKKPINKKNNYLRRSMQ 126

Query: 119 AM--IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
                 +   P + DWR+   VTG+KDQ  CGS WAFS+TG +EG+ A  T  L+SLSEQ
Sbjct: 127 QKKGTASCEAPSSLDWRKKGVVTGIKDQGDCGSCWAFSSTGAMEGINAIVTGDLISLSEQ 186

Query: 177 ELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKIN 235
           EL+DCD  + GCEGG +  AF+ ++S   GG++ E  YPY G D  C   K+ T+ V I+
Sbjct: 187 ELVDCDTTNYGCEGGYMDYAFEWVISN--GGIDSESDYPYTGTDGTCNTTKEDTKVVSID 244

Query: 236 GYVSVSRDETDMAKYLVE-NGPMAVAINAYAL--QFYVTGVSHPIQFFCDGGNENLSHSV 292
           GY  V  DE+D A      N P++V ++  AL  Q Y +G+            +++ H+V
Sbjct: 245 GYKDV--DESDSALLCAAVNQPISVGMDGSALDFQLYTSGIYAGDCSD---DPDDIDHAV 299

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD----GSCGIN 340
           LIVGYG + ++       YWI KNSWG  WG +GYF + R      G C IN
Sbjct: 300 LIVGYGSEDSE------DYWICKNSWGTSWGMEGYFYIKRNTDLPYGECAIN 345


>gi|375073980|gb|AFA34857.1| cathepsin L-like protein [Trypanosoma cruzi marinkellei]
          Length = 467

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 112/314 (35%), Positives = 161/314 (51%), Gaps = 18/314 (5%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           T+ F  F ++H + Y +  E   RL +F  NL  +  L    +    +G+  FSDL+  E
Sbjct: 35  TSQFAEFKQKHGRVYKSAAEEAFRLSVFRENLF-LARLHAAANPHATFGVTPFSDLTREE 93

Query: 99  FQAKYLGFKLKPSYADRSV--PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F+++Y       + A      P  +  + +P A DWR   AVT VKDQ  CGS WAFS  
Sbjct: 94  FRSRYHNGAAHFAAAQERARVPVNVEVVGVPAAVDWRARGAVTAVKDQGQCGSCWAFSAI 153

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           GN+E  +      L +LSEQ L+ CD+ D GC GG +++AF+ I+ +  G +  E++YPY
Sbjct: 154 GNVESQWFLAGHPLTNLSEQMLVSCDKTDSGCSGGLMNDAFEWIVQENDGAVYTEESYPY 213

Query: 217 ---RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGV 273
               G    C  +       I G+V + +DE  +A +L  NGP+AVA++A +   Y  GV
Sbjct: 214 ASGEGISPPCTTSGHTVGATITGHVELPQDEAQIAAWLAANGPVAVAVDATSWMTYTGGV 273

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                      +E L H VL+VGY           VPYWIIKNSW   WGE GY R+ +G
Sbjct: 274 ------MTSCVSEQLDHGVLLVGYN------DSAPVPYWIIKNSWTTLWGEDGYIRIAKG 321

Query: 334 DGSCGINDYVRSAL 347
              C + +   SA+
Sbjct: 322 SNQCLVKEEASSAV 335


>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
          Length = 339

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 125/321 (38%), Positives = 168/321 (52%), Gaps = 28/321 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYGL--NEFSDLSTAEFQA 101
           F  QH K Y +  E   R+ IF  N  K+       E G   Y L  N+++D+   EF  
Sbjct: 30  FKLQHKKQYKSDTEEKFRMKIFMENSHKVAKXNKLYEMGLVSYKLKINKYADMLHHEFVH 89

Query: 102 KYLGFK-------LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
              GF        L  S  ++    + P N+  P   DWRE+ AVT VKDQ  CGS W+F
Sbjct: 90  TVNGFNRTKNTPLLGTSEDEQGATFIAPANVKFPENVDWREHGAVTXVKDQGHCGSCWSF 149

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           S TG +EG +  KT KLVSLSEQ L+DC  +  +DGC GG + NAF  +  K   G++ E
Sbjct: 150 SATGALEGQHFRKTNKLVSLSEQNLVDCSTKFGNDGCNGGLMDNAFKYV--KYNHGIDTE 207

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAY--ALQF 268
            +YPY  DD+ C  N K +     G+V + + DE  +   +   GP++VAI+A   + Q 
Sbjct: 208 ASYPYHADDEKCHYNPKTSGATDRGFVDIPTGDEEKLMAAVATVGPVSVAIDASHESFQL 267

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV +  +  C   +E L H VL+VGYG D          YWI+KNSWGE WGE+GY 
Sbjct: 268 YSEGVYYDPE--C--SSEELDHGVLVVGYGTDEN-----GQDYWIVKNSWGESWGEQGYI 318

Query: 329 RLYRG-DGSCGINDYVRSALV 348
           ++ R  D +CGI       LV
Sbjct: 319 KMARNRDNNCGIATQASYPLV 339


>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  194 bits (492), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 128/351 (36%), Positives = 185/351 (52%), Gaps = 40/351 (11%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
           A++++TV+ SS  ++  +             +  F   H KTY + +E   R  IF+ N 
Sbjct: 9   AIVAVTVAASSQEILRTQ-------------WEAFKTTHKKTYQSHMEELLRFKIFTEN- 54

Query: 71  RKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIPNI 124
             I    + ++  G+     G+N+F DL   EF   + G +   K   +    PA + + 
Sbjct: 55  SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHRGTRKTGGSTFLPPANVNDS 114

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
           +LP+A DWR+  AVT VKDQ  CGS WAFS TG++EG +  K  +LVSLSEQ L+DC Q 
Sbjct: 115 SLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQS 174

Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
             ++GCEGG + +AF  I  K   G++ EK+YPY   D  CR  K+       GYV + +
Sbjct: 175 FGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKA 232

Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYG 298
             E D+ K +   GP++VAI+A   + Q Y  GV   P     +  +E+L H VL+VGYG
Sbjct: 233 GSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP-----ECSSEDLDHGVLVVGYG 287

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
           V   K       YW++KNSW E WG++GY  + R  +  CGI       LV
Sbjct: 288 VKGGK------KYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332


>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
          Length = 312

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 115/303 (37%), Positives = 161/303 (53%), Gaps = 24/303 (7%)

Query: 46  LEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG 105
           + ++ + Y    E   R  IF  N+  I+   +    S   G+N+F+D++  EF A+Y G
Sbjct: 1   MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60

Query: 106 FKLKPSYADRSVPAMIPNITLP---RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
              +P   ++       ++ +    ++ DWR+Y AVT VKDQ  CGS WAFS    +EG+
Sbjct: 61  GISRPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGI 120

Query: 163 YAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
           Y   T  LVSLSEQE++DC    +GC+GG + NA+D I+S    G+  E  YPY+     
Sbjct: 121 YKIVTGYLVSLSEQEVLDC-AVSNGCDGGFVDNAYDFIISN--NGVASEADYPYQAYQGD 177

Query: 223 CRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQF 279
           C  N       I GY  V S DE+ M KY V N P+A AI+A     Q+Y  GV      
Sbjct: 178 CAANSWPNSAYITGYSYVRSNDESSM-KYAVWNQPIAAAIDASGDNFQYYNGGV------ 230

Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---DGS 336
           F      +L+H++ I+GYG D +        YWI+KNSWG  WGE+GY R+ RG    G 
Sbjct: 231 FSGPCGTSLNHAITIIGYGQDSS-----GTQYWIVKNSWGSSWGERGYIRMARGVSSSGL 285

Query: 337 CGI 339
           CGI
Sbjct: 286 CGI 288


>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
          Length = 389

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 117/315 (37%), Positives = 172/315 (54%), Gaps = 28/315 (8%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           +F  + E+H K Y    E   R   F GNL+ I L ++ +  +  +    GLN+F+D+S 
Sbjct: 48  IFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYI-LERNAKRKANKWEHHVGLNKFADMSN 106

Query: 97  AEFQAKYLGFKLKPSYA----DRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            EF+  YL    KP        R++   + +   P + DWR Y  VT VKDQ  CGS WA
Sbjct: 107 EEFRKAYLSKVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKDQGSCGSCWA 166

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FS+TG +EG+ A  T  L+SLSEQEL++CD  + GCEGG +  AF+ +++   GG++ E 
Sbjct: 167 FSSTGAMEGINALVTGDLISLSEQELVECDTSNYGCEGGYMDYAFEWVINN--GGIDSES 224

Query: 213 TYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL--QFY 269
            YPY G D  C   K+ T+ V I+GY  V + ++ +   + +  P++V I+  A+  Q Y
Sbjct: 225 DYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSDSALLCAVAQQ-PVSVGIDGSAIDFQLY 283

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             G+       C    +++ H+VLIVGYG + ++       YWI+KNSWG  WG  GYF 
Sbjct: 284 TGGI---YDGSCSDDPDDIDHAVLIVGYGSEDSE------EYWIVKNSWGTSWGIDGYFY 334

Query: 330 LYRGD----GSCGIN 340
           L R      G C +N
Sbjct: 335 LKRDTDLPYGVCAVN 349


>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
          Length = 469

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 119/313 (38%), Positives = 164/313 (52%), Gaps = 29/313 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           ++  ++  H +TY  + E   R  +F  NLR +    +    +GV+    GLN F+DL+ 
Sbjct: 45  MYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDA-HNAAADAGVHSFRLGLNRFADLTN 103

Query: 97  AEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            E++A YLG + +P    R     +   N  LP + DWR   AV  VKDQ  CGS WAFS
Sbjct: 104 DEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFS 163

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           T   +EG+    T  ++SLSEQEL+DCD   + GC GG +  AF+ I++   GG++ E+ 
Sbjct: 164 TIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEED 221

Query: 214 YPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
           YPY+G D  C +N+K A  V I+ Y  V  +     +  V N P++VAI A   A Q Y 
Sbjct: 222 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYN 281

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +G+      F       L H V  VGYG +  K       YWI+KNSWG  WGE GY R+
Sbjct: 282 SGI------FTGTCGTALDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVRM 329

Query: 331 YRG----DGSCGI 339
            R      G CGI
Sbjct: 330 ERNIKASSGKCGI 342


>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 325

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 119/312 (38%), Positives = 165/312 (52%), Gaps = 26/312 (8%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           HN+ YA+  E   R  I+  NL  I         S   G+NEF DL+  EF AKYLG + 
Sbjct: 28  HNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAAKYLGVRF 87

Query: 109 KPSYADRS------VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
               A +S      +P M+   +LP + DWR    VT VK+Q  CGS W+FSTTG++EG 
Sbjct: 88  NGVNATKSFASSTYLPRMV---SLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGSVEGQ 144

Query: 163 YAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           +A KT  LVSLSEQ L+DC  ++  +GC GG + +AF+ I+    GG++ E +YPY    
Sbjct: 145 HARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKN--GGIDTEASYPYTATT 202

Query: 221 KACRLNKKATQVKINGYVS-VSRDETDMAKYLVENGPMAVAINAYAL--QFYVTGVSHPI 277
             C+ N       +  Y   ++  E+D+   +   GP++VAI+A  +  QFY TGV +  
Sbjct: 203 GTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVYNEK 262

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGS 336
           +  C      L H VL VGYG      + +   YW++KNSWG  WG+ GY  + R  D  
Sbjct: 263 K--CS--TTQLDHGVLAVGYGT-----STEGKDYWLVKNSWGATWGKAGYIWMSRNADNQ 313

Query: 337 CGINDYVRSALV 348
           CGI       LV
Sbjct: 314 CGIATSASYPLV 325


>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
 gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
 gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
 gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
 gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
 gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
          Length = 341

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 123/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
           F  +H K Y    E   RL IF+ N  KI +  Q    G   + L  N+++DL   EF+ 
Sbjct: 32  FKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91

Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
              GF      +   AD S   +      ++TLP++ DWR   AVT VKDQ  CGS WAF
Sbjct: 92  LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           S+TG +EG +  K+  LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ E
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 209

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
           K+YPY   D +C  NK        G+  + + DE  MA+ +   GP++VAI+A   + QF
Sbjct: 210 KSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 269

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV +  Q  CD   +NL H VL+VG+G D +        YW++KNSWG  WG+KG+ 
Sbjct: 270 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFI 320

Query: 329 RLYRG-DGSCGI 339
           ++ R  +  CGI
Sbjct: 321 KMLRNKENQCGI 332


>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
          Length = 375

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 123/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
           F  +H K Y    E   RL IF+ N  KI +  Q    G   + L  N+++DL   EF+ 
Sbjct: 66  FKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 125

Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
              GF      +   AD S   +      ++TLP++ DWR   AVT VKDQ  CGS WAF
Sbjct: 126 LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 185

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           S+TG +EG +  K+  LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ E
Sbjct: 186 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 243

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
           K+YPY   D +C  NK        G+  + + DE  MA+ +   GP++VAI+A   + QF
Sbjct: 244 KSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 303

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV +  Q  CD   +NL H VL+VG+G D +        YW++KNSWG  WG+KG+ 
Sbjct: 304 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFI 354

Query: 329 RLYRG-DGSCGI 339
           ++ R  +  CGI
Sbjct: 355 KMLRNKENQCGI 366


>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
          Length = 342

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 125/323 (38%), Positives = 172/323 (53%), Gaps = 31/323 (9%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH-GSGVY--GLNEFSDLSTAEFQA 101
           F  +H+K Y +  E   R+ IF+ N +KI       H GS  Y  G+N++ D+   EF  
Sbjct: 32  FKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDMLHHEFVN 91

Query: 102 KYLGFKLKPS----YADRSVPAM-----IPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
              GF+   S     A+R            ++ +P++ DWRE  AVT VKDQ  CGS WA
Sbjct: 92  MMNGFRANTSGAGYKANRGFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQGSCGSCWA 151

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEE 210
           FS TG +EG +  +T  LVSLSEQ L+DC  +  ++GC GG + NAF  I  K+ GG++ 
Sbjct: 152 FSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYI--KVNGGIDT 209

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQ 267
           EK+YPY  +D+ CR N         G+V V   +E  + K +   GP++VAI+A   + Q
Sbjct: 210 EKSYPYEAEDEPCRYNPANAGADDRGFVDVREGNENALKKAIATIGPVSVAIDASQDSFQ 269

Query: 268 FYVTGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           FY  GV S P     D   ENL H VL VGYG      T     YW++KNSW + WG++G
Sbjct: 270 FYQHGVYSDP-----DCSAENLDHGVLAVGYGT-----TEDGQDYWLVKNSWSKSWGDQG 319

Query: 327 YFRLYRGDGS-CGINDYVRSALV 348
           Y ++ R   + CGI       LV
Sbjct: 320 YIKIARNQNNMCGIASAASYPLV 342


>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
 gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
          Length = 337

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 125/312 (40%), Positives = 172/312 (55%), Gaps = 22/312 (7%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
           H+K Y    E + R+ I+  NL+KI++  + EH  G++    G+N F D++  EF+    
Sbjct: 36  HSKKYHATEEGWRRV-IWEKNLKKIEM-HNLEHSMGIHTYRLGMNHFGDMTHEEFRQVMN 93

Query: 105 GFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
           GFK K     R    M PN I +P   DWRE   VT VKDQ  CGS WAFSTTG +EG  
Sbjct: 94  GFKHKKDRRFRGSLFMEPNFIEVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQM 153

Query: 164 AAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DD 220
             KT KLVSLSEQ L+DC + +  +GC GG +  AF  +  +   GL+ E++YPY G DD
Sbjct: 154 FRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDQ--NGLDSEESYPYLGTDD 211

Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
           + C  + K +     G+V + S  E  + K +   GP++VAI+A   + QFY +G+ +  
Sbjct: 212 QPCHFDPKNSAANDTGFVDIPSGKERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEK 271

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
           +  C   +E L H VL VGYG +      K   YWI+KNSW E WG+KGY  + +     
Sbjct: 272 E--C--SSEELDHGVLAVGYGFEGEDVDGKK--YWIVKNSWSENWGDKGYIYMAKDRHNH 325

Query: 337 CGINDYVRSALV 348
           CGI       LV
Sbjct: 326 CGIATAASYPLV 337


>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
          Length = 350

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 125/312 (40%), Positives = 165/312 (52%), Gaps = 24/312 (7%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
           K   LF  ++ +H K Y ++ E   R  IF  NL+ I           + GLNEF+DLS 
Sbjct: 43  KLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVSNYWL-GLNEFADLSH 101

Query: 97  AEFQAKYLGFKLKPSYADRSVPAMI-PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            EF+ KYLG K+  S    S       ++ LP++ DWR+  AVT VK+Q  CGS WAFST
Sbjct: 102 QEFKNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVTQVKNQGSCGSCWAFST 161

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
              +EG+    T  L SLSEQELIDCD+  ++GC GG +  AF  I+     GL +E+ Y
Sbjct: 162 VAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVEN--DGLHKEEDY 219

Query: 215 PYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVT 271
           PY  ++  C + K+ T+ V I+GY  V ++        + N P++VAI A     QFY  
Sbjct: 220 PYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 279

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV      F      +L H V  VGYG      T K V Y  +KNSWG  WGEKGY R+ 
Sbjct: 280 GV------FDGHCGSDLDHGVAAVGYG------TAKGVDYITVKNSWGSKWGEKGYIRMR 327

Query: 332 RG----DGSCGI 339
           R     +G CGI
Sbjct: 328 RNIGKPEGICGI 339


>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
          Length = 338

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 129/356 (36%), Positives = 185/356 (51%), Gaps = 38/356 (10%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           F  G  L+ L+ ++S   ++ DE             ++ F   H K Y + +E   R+ I
Sbjct: 8   FLLGAVLVQLSAALSLTNLLADE-------------WHLFKATHKKEYPSQLEEKFRMKI 54

Query: 66  FSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SYADRSVPA 119
           +  N  K+    +L +    S    +N+F DL   EF++   G++ K    S A+ +   
Sbjct: 55  YLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTF 114

Query: 120 MIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
           M P N+ +P + DWR   A+T VKDQ  CGS WAFS+TG +EG    KT KL+SLSEQ L
Sbjct: 115 MEPANVEVPESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNL 174

Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
           IDC  +  ++GC GG +  AF  I  K   G++ E TYPY  +D  CR N +       G
Sbjct: 175 IDCSGKYGNEGCNGGLMDQAFQYI--KDNKGIDTENTYPYEAEDNVCRYNPRNRGAIDRG 232

Query: 237 YVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           +V +   E D  K  V   GP++VAI+A   + QFY  GV +  +  CD  +++L H VL
Sbjct: 233 FVHIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYY--EPSCD--SDDLDHGVL 288

Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           +VGYG D  K       YW++KNSW E WG++GY ++ R     CGI       LV
Sbjct: 289 VVGYGSDNGK------DYWLVKNSWSEHWGDEGYIKIARNRKNHCGIATAASYPLV 338


>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
          Length = 335

 Score =  194 bits (492), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 124/325 (38%), Positives = 172/325 (52%), Gaps = 30/325 (9%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLST 96
           A ++ F  +H K+Y +  E   RL I+  N  KI    +    G   Y   +NEF D+  
Sbjct: 25  AEWSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLH 84

Query: 97  AEFQAKYLGFKLKPSYADRSV-------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
            EF +   GFK   +Y D+         P  I + +LP+  DWR   AVT VK+Q  CGS
Sbjct: 85  HEFVSTRNGFKR--NYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGS 142

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGG 207
            WAFS TG++EG +  K+  +VSLSEQ L+DC  +  ++GCEGG + NAF  I  +   G
Sbjct: 143 CWAFSATGSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYI--RANKG 200

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY-- 264
           ++ EK+YPY G D  C   K       +G+V +    ET + K +   GP++VAI+A   
Sbjct: 201 IDTEKSYPYNGTDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHE 260

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
           + QFY  GV    +  CD  +E+L H VL+VGYG      T     YW++KNSWG  WG+
Sbjct: 261 SFQFYSDGVYDEPE--CD--SESLDHGVLVVGYG------TLNGTDYWLVKNSWGTTWGD 310

Query: 325 KGYFRLYRG-DGSCGINDYVRSALV 348
           +GY R+ R     CGI       LV
Sbjct: 311 EGYIRMSRNKKNQCGIASSASYPLV 335


>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
 gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
          Length = 360

 Score =  194 bits (492), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 130/351 (37%), Positives = 181/351 (51%), Gaps = 37/351 (10%)

Query: 7   FAGVALLSLT-VSVSSFMVVGDEKLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRL 63
           F  +AL++L+ +S++  +   ++ L         +L+N +     H+     L E   R 
Sbjct: 6   FIALALVALSFLSIAQSIPFTEKDL-----ASEDSLWNLYEKWRTHHTVARDLDEKNRRF 60

Query: 64  HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA---- 119
           ++F  N++ I      +       LN+F D++  EF++KY G K++   + R +      
Sbjct: 61  NVFKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGS 120

Query: 120 -MIPNI-TLPRA-FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
            M  N+ +LP A  DWR   AVTGVKDQ  CGS WAFST  ++EG+   KT +LVSLSEQ
Sbjct: 121 FMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQ 180

Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLN-KKATQVKI 234
           EL+DCD   ++GC GG +  AF+ I      G+  E +YPY   D  C  N   +  V I
Sbjct: 181 ELVDCDTSYNEGCNGGLMDYAFEFIQKN---GITTEDSYPYAEQDGTCASNLLNSPVVSI 237

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
           +G+  V  +  +     V N P++V+I A  Y  QFY  GV     F    G E L H V
Sbjct: 238 DGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGV-----FTGRCGTE-LDHGV 291

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
            IVGYG      T     YWI+KNSWGE WGE GY R+ RG     G CGI
Sbjct: 292 AIVGYGA-----TRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGI 337


>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
 gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
          Length = 469

 Score =  193 bits (491), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 118/313 (37%), Positives = 164/313 (52%), Gaps = 29/313 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           ++  ++  H +TY  + E   R  +F  NLR +    +    +GV+    GLN F+DL+ 
Sbjct: 45  MYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDA-HNAAADAGVHSFRLGLNRFADLTN 103

Query: 97  AEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            E++A YLG + +P    R     +   N  LP + DWR   AV  +KDQ  CGS WAFS
Sbjct: 104 DEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEIKDQGSCGSCWAFS 163

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           T   +EG+    T  ++SLSEQEL+DCD   + GC GG +  AF+ I++   GG++ E+ 
Sbjct: 164 TIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEED 221

Query: 214 YPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
           YPY+G D  C +N+K A  V I+ Y  V  +     +  V N P++VAI A   A Q Y 
Sbjct: 222 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLYN 281

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +G+      F       L H V  VGYG +  K       YWI+KNSWG  WGE GY R+
Sbjct: 282 SGI------FTGTCGTALDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVRM 329

Query: 331 YRG----DGSCGI 339
            R      G CGI
Sbjct: 330 ERNIKASSGKCGI 342


>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
 gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
 gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
           thaliana]
 gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
          Length = 346

 Score =  193 bits (491), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 124/356 (34%), Positives = 180/356 (50%), Gaps = 41/356 (11%)

Query: 1   MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
           M  F F A  +    ++++S    + +E +    H++       ++ +H + YA + E  
Sbjct: 6   MQIFLFVAIFSSFCFSITLSR--PLDNELIMQKRHIE-------WMTKHGRVYADVKEEN 56

Query: 61  SRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA 119
           +R  +F  N+ +I+ L     G      +N+F+DL+  EF++ Y GFK   + + +S   
Sbjct: 57  NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSALSSQSQTK 116

Query: 120 MIP----NIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
           M P    N++   LP + DWR+  AVT +K+Q  CG  WAFS    IEG    K  KL+S
Sbjct: 117 MSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLIS 176

Query: 173 LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC---RLNKKA 229
           LSEQ+L+DCD  D GCEGG +  AF+ I  K  GGL  E  YPY+G+D  C   + N KA
Sbjct: 177 LSEQQLVDCDTNDFGCEGGLMDTAFEHI--KATGGLTTESNYPYKGEDATCNSKKTNPKA 234

Query: 230 TQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNEN 287
           T   I GY  V  ++       V + P++V I    +  QFY +GV      F       
Sbjct: 235 TS--ITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGV------FTGECTTY 286

Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           L H+V  +GYG      +     YWIIKNSWG  WGE GY R+ +      G CG+
Sbjct: 287 LDHAVTAIGYGE-----STNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGL 337


>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
          Length = 376

 Score =  193 bits (491), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 121/316 (38%), Positives = 169/316 (53%), Gaps = 32/316 (10%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
            ++  +L +H K Y  + E   R  IF  NL+ +    ++E+ S   GLN F+DL+  E+
Sbjct: 45  GIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDE-HNSENRSYKVGLNRFADLTNEEY 103

Query: 100 QAKYLGFK-------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           ++ +LG K       +K   A R   A+  +  LP + DWRE  AV  +KDQ  CGS WA
Sbjct: 104 RSMFLGTKTDSKRRFMKSKSASRRY-AVQDSDMLPESVDWRESGAVAPIKDQGSCGSCWA 162

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           FST   +EGV    T +++ LSEQEL+DCD+  D GC GG +  AF+ I++   GG++ E
Sbjct: 163 FSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCNGGLMDYAFEFIINN--GGIDTE 220

Query: 212 KTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
           + YPYRG D  C   +K T+ V IN Y  V   +    K  V + P++VAI A   A Q 
Sbjct: 221 EDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEASGRAFQL 280

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y++GV      F       L H V++VGYG D          +WI++NSWG  WGE GY 
Sbjct: 281 YLSGV------FTGECGRALDHGVVVVGYGTD------NGADHWIVRNSWGTSWGENGYI 328

Query: 329 RLYRG-----DGSCGI 339
           R+ R       G CGI
Sbjct: 329 RMERNVVDNFGGKCGI 344


>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
          Length = 365

 Score =  193 bits (491), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 125/325 (38%), Positives = 162/325 (49%), Gaps = 40/325 (12%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           L+    T   + ++ ++ + Y T  E   R  IF  NL+ IQ      +     G+NEF+
Sbjct: 30  LNEASMTETHDQWMARYGRVYKTANEKNRRSTIFQENLKYIQTFNKANNKPYKLGVNEFA 89

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI-------TLPRAFDWREYDAVTGVKDQT 145
           DL+  EF      FK         V A + N+        +P   DWR+  AVT +K+Q 
Sbjct: 90  DLTNEEFTTSRNKFK-------SHVCATVTNVFRYENVTAVPATMDWRKKGAVTPIKNQG 142

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSK 203
            CG  WAFS    +EG+   KT KL+SLSEQEL+DCD   ED GCEGG +  AFD I   
Sbjct: 143 QCGCCWAFSAVAAMEGITQLKTGKLISLSEQELVDCDTNGEDQGCEGGLMDYAFDFIQQN 202

Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
              GL  E  YPY G D  C  NK+A     I G+  V  +        V N P++VAI+
Sbjct: 203 H--GLSTETNYPYSGTDGTCNANKEANHAATITGHEDVPANSESALLKAVANQPISVAID 260

Query: 263 AYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV--DRTKFTHKAVPYWIIKNSW 318
           A     QFY +GV     F  + G E L H V  VGYG   D TK       YW++KNSW
Sbjct: 261 ASGSDFQFYSSGV-----FTGECGTE-LDHGVTAVGYGTAADGTK-------YWLVKNSW 307

Query: 319 GEGWGEKGYFRLYRG----DGSCGI 339
           G  WGE+GY ++ RG    +G CGI
Sbjct: 308 GTSWGEEGYIQMQRGVAAAEGLCGI 332


>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 335

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 125/320 (39%), Positives = 167/320 (52%), Gaps = 25/320 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLR---KIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           +N +  +H K Y +  E  SR  I+  NL    K  L  D  H +   G+N+F+DL   E
Sbjct: 28  WNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLKNEE 87

Query: 99  FQAKYLGFKLKPSYADRSVPAMIP--NI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
           F A   GF++  +         +P  NI  LP+  DWR    VT VKDQ  CGS WAFST
Sbjct: 88  FVAMMTGFRVNGTSKAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCWAFST 147

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           TG++EG +   T KLVSLSEQ L+DC   + ++GC+GG +  AF  I+    GG++ E++
Sbjct: 148 TGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYIIK--AGGIDTEES 205

Query: 214 YPYRGDDKACRLNKKATQVKINGYVSVSRD-ETDMAKYLVENGPMAVAINA--YALQFYV 270
           YPY+  D  C   K      + GY  V+ D ET + K +   GP++VAI+A   + Q Y 
Sbjct: 206 YPYKAVDGECHFKKANIGATVTGYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQLYK 265

Query: 271 TGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
           +GV + P     D  +  L H VL VGYG      T     YWI+KNSW E WG  GY  
Sbjct: 266 SGVYNEP-----DCSSTLLDHGVLAVGYGT-----TSDGTDYWIVKNSWAETWGMNGYLW 315

Query: 330 LYRG-DGSCGINDYVRSALV 348
           + R  D  CGI       LV
Sbjct: 316 MSRNKDNQCGIATQASYPLV 335


>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
 gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
          Length = 356

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 124/342 (36%), Positives = 183/342 (53%), Gaps = 42/342 (12%)

Query: 23  MVVGDEKLHHLHHVKHT--------ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
           M + +   +HL H + +        +++ ++L++H K Y  L E   R  IF  NLR I 
Sbjct: 1   MSIFNHDDNHLSHDQSSWRSDDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFID 60

Query: 75  LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN--------ITL 126
              ++++ +   GL +F+DL+  E++A +LG +  P    R + +  P+          L
Sbjct: 61  E-HNSQNRTYKVGLTKFADLTNQEYRAMFLGTRSDPKR--RLMKSKNPSERYAYKAGDKL 117

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ-ED 185
           P + DWR   AV  +KDQ  CGS WAFST   +EG+    T +L+SLSEQEL+DCD+  +
Sbjct: 118 PESVDWRGKGAVNPIKDQGSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYN 177

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDE 244
            GC GG +  AF  I++   GGL+ EK YPY G+D  C  +K  T+ V I+G+  V   +
Sbjct: 178 AGCNGGLMDYAFQFIINN--GGLDTEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFD 235

Query: 245 TDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
               +  V + P++VAI A   ALQFY +GV      F       L H V++VGYG    
Sbjct: 236 EKALQKAVAHQPVSVAIEASGMALQFYQSGV------FTGECGTALDHGVVVVGYG---- 285

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
             T K + YW+++NSWG  WGE GY ++ R       G CGI
Sbjct: 286 --TEKGLDYWLVRNSWGTEWGEHGYIKMQRNVRDTYTGRCGI 325


>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
 gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
          Length = 350

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 111/301 (36%), Positives = 168/301 (55%), Gaps = 21/301 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ ++ +TY    E   R  IF  NL  I+   +  + S   GLN +SDL++ EF A + 
Sbjct: 36  WMMKYERTYTNSSEMEKRKKIFKENLEYIENFNNVGNKSYKLGLNRYSDLTSEEFIASHT 95

Query: 105 GFKLKPSYADRSVPAM-IP---NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           GFK+    +D  + ++ IP   N  +P  FDWRE   VT VK+Q  CG  WAF+    +E
Sbjct: 96  GFKVSDQLSDSKMRSVAIPFNLNDDVPTNFDWREKGVVTDVKNQRQCGCCWAFTAVAAVE 155

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G+   K   L+SLSEQ+L+DCD++  GC GG    AFD+I+     G+ +E  YPY+ +D
Sbjct: 156 GIVKIKNGNLISLSEQQLVDCDRQSSGCGGGDFVLAFDSIIKSR--GIVKEDDYPYKAND 213

Query: 221 -KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAIN-AYALQFYVTGVSHPI 277
            + C+L +     +INGY  V + DE  + + +++  P++VAI+ +Y    Y+ GV    
Sbjct: 214 VQTCQLGQIPGAAQINGYFKVPANDEQQLLRAVLQQ-PVSVAISTSYDFHHYMGGV---- 268

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
             +       L+H+V I+GYGV     +     YW+IKNSWGE WGEKGY ++ R   + 
Sbjct: 269 --YEGSCGPKLNHAVTIIGYGV-----SEAGKKYWLIKNSWGETWGEKGYMKVLRESSAT 321

Query: 338 G 338
           G
Sbjct: 322 G 322


>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
          Length = 352

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 109/306 (35%), Positives = 160/306 (52%), Gaps = 22/306 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  ++ ++ + Y    E   R  IF  N+  I+        S   G+N+F+D++ +EF A
Sbjct: 37  FEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSHNGNSYTLGINQFTDMTKSEFVA 96

Query: 102 KYLGFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           +Y G   +P   +R       ++ +   P++ DWR+Y AV  VK+Q  CGS WAF+    
Sbjct: 97  QYTGGISRPLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIAT 156

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG+Y  KT  LVSLSEQE++DC     GC+GG ++ A+D I+S    G+  E+ YPY+ 
Sbjct: 157 VEGIYKIKTGYLVSLSEQEVLDC-AVSYGCKGGWVNKAYDFIISN--NGVTTEENYPYQA 213

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPI 277
               C  N       I GY  V R++     Y V N P+A  I+A    Q+Y  GV    
Sbjct: 214 YQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGV---- 269

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
             F      +L+H++ I+GYG D +        YWI++NSWG  WGE GY R+ RG    
Sbjct: 270 --FSGPCGTSLNHAITIIGYGQDSS-----GTKYWIVRNSWGSSWGEGGYVRMARGVSSS 322

Query: 334 DGSCGI 339
            G+CGI
Sbjct: 323 SGACGI 328


>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
          Length = 328

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 129/323 (39%), Positives = 174/323 (53%), Gaps = 35/323 (10%)

Query: 36  VKHTALFNYFLE-------QHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYG 87
           + HTAL +YF E       Q  K+Y    E   R++++  N RKI +  +  E+G   Y 
Sbjct: 13  ISHTALHDYFPEEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYK 72

Query: 88  L--NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKD 143
           L  N F DL   EF+A     KLK S   ++   +       LP   DWR+  AVT VKD
Sbjct: 73  LKMNHFGDLMQHEFKALN---KLKRSAKQQNSGEVFRATGGKLPAKVDWRQKGAVTPVKD 129

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIM 201
              CGS WAFS+TG++ G    K KKLVSLSEQ+L+DC     +DGC+GG +  AF  I 
Sbjct: 130 PGQCGSCWAFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYI- 188

Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVA 260
            K  GG++ E +YPY  +D  CR   K+      GYV +++ DE  + + + E GP++VA
Sbjct: 189 -KGNGGIDTEGSYPYEAEDDKCRYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVA 247

Query: 261 INA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNS 317
           I+A   + QFY  G+   P   FC   N  L H VL+VGYG      T     YW++KNS
Sbjct: 248 IDAGNLSFQFYSEGIYDEP---FCS--NTELDHGVLVVGYG------TENGQDYWLVKNS 296

Query: 318 WGEGWGEKGYFRLYRG-DGSCGI 339
           WG  WGE GY ++ R  +  CGI
Sbjct: 297 WGPSWGENGYIKIARNHNNHCGI 319


>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
          Length = 354

 Score =  193 bits (491), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 122/344 (35%), Positives = 187/344 (54%), Gaps = 27/344 (7%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           +FA + + ++ VS S+F    +     +  ++    +  +L QH + Y    E+     I
Sbjct: 7   YFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKR--YERWLVQHGRRYKNRDEWQRHFGI 64

Query: 66  FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL-KPSYADRSVPAMIPNI 124
           +  N+R I  + + ++ S     N+F+D++  E++A Y+G    + S  ++S      + 
Sbjct: 65  YQSNVRFINYI-NAQNFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSSFKRERSK 123

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
            LP + DWR+  AVT V++Q  CGS WAFST   +EG+   +T KLVSLSEQEL+DCD +
Sbjct: 124 VLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDCDID 183

Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVS 241
             ++GC GG + NAF  I  K  GG+   + YPY G+   C  +K A   VKI+GY +V 
Sbjct: 184 SGNEGCNGGYMVNAFKFI--KQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYETVP 241

Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
            +   + +  V   P++VAI+A  Y  Q Y  G+      FC    + L+H+V ++GYG 
Sbjct: 242 PNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI---FNGFC---GKQLNHAVTVIGYGE 295

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           D  K       YW++KNSWG GWGE GY R+ R     +G CGI
Sbjct: 296 DNGK------KYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGI 333


>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
 gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
          Length = 417

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 120/309 (38%), Positives = 172/309 (55%), Gaps = 28/309 (9%)

Query: 48  QHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           +H K Y    E   RL IF+ N  KI     L  +   S    +N+++D+   EF+    
Sbjct: 111 EHRKNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMN 170

Query: 105 GFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           GF      +   AD S   +      ++TLP++ DWR+  AVTGVKDQ  CGS WAFS+T
Sbjct: 171 GFNYTLHKELRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSST 230

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           G +EG +  K+  LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ EK+Y
Sbjct: 231 GALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSY 288

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVT 271
           PY   D +C  NK        G+V + + +E  +A+ +   GP++VAI+A   + QFY  
Sbjct: 289 PYEALDDSCHFNKGTIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSE 348

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV   ++  CD   +NL H VL+VG+G D +        YW++KNSWG  WG+KG+ ++ 
Sbjct: 349 GVY--VEPACDA--QNLDHGVLVVGFGTDES-----GQDYWLVKNSWGTTWGDKGFIKML 399

Query: 332 RG-DGSCGI 339
           R  D  CGI
Sbjct: 400 RNKDNQCGI 408


>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
           occidentalis]
          Length = 469

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 120/308 (38%), Positives = 169/308 (54%), Gaps = 23/308 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEF 99
           F +F E   KTY    E+  R  IF  NL  I+     +  S  Y  G+ +F+D+STAEF
Sbjct: 166 FEHFKEHFGKTYEG-DEHALRQGIFQRNLAHIEKFNAEKAASRGYTLGITQFADMSTAEF 224

Query: 100 QAKYLGFKLKPSYADR----SVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
           +  YLG ++  S   +        +  +  LP A DWR+  AV+ VKDQ  CGS WAFST
Sbjct: 225 RQTYLGLRMNASTIAKLRKLQREVVADDRDLPEAVDWRDKGAVSPVKDQGQCGSCWAFST 284

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           +G IEG +  K  +L+SLSEQ+++DC   D GC GG    A + +  +  GGLE E  YP
Sbjct: 285 SGAIEGQHFLKNGELLSLSEQQMVDCSWLDFGCNGGQPMLAMEYV--RFNGGLELETAYP 342

Query: 216 YRGDDKACRLNKKATQVKINGY-VSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTG 272
           Y+G   +C  +KK+   KI G+ ++    E+ + K + + GP++V ++A     Q Y +G
Sbjct: 343 YKGVGGSCHSDKKSAAAKITGFWMAGFYSESALQKAVAKVGPISVGMDASGEDFQHYKSG 402

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           + +P      G    L H+VL VGYG      T     YW++KNSW   WGEKGYF+L R
Sbjct: 403 IYNPESCSSIG----LDHAVLAVGYG------TSDDGDYWLVKNSWNTSWGEKGYFKLPR 452

Query: 333 GDGS-CGI 339
             G+ CGI
Sbjct: 453 NKGNKCGI 460


>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
          Length = 324

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 110/306 (35%), Positives = 161/306 (52%), Gaps = 23/306 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  ++ ++ + Y    E   R  IF  N++ I+        S   G+N+F+D++ +EF A
Sbjct: 10  FEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDMTKSEFVA 69

Query: 102 KYLGFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           +Y G  L P   +R       ++ +   P++ DWR+Y AV  VK+Q  CGS WAF+    
Sbjct: 70  QYTGVSL-PLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIAT 128

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG+Y  KT  LVSLSEQE++DC     GC+GG ++ A+D I+S    G+  E+ YPY+ 
Sbjct: 129 VEGIYKIKTGYLVSLSEQEVLDC-AVSYGCKGGWVNKAYDFIISN--NGVTTEENYPYQA 185

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPI 277
               C  N       I GY  V R++     Y V N P+A  I+A    Q+Y  GV    
Sbjct: 186 YQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGV---- 241

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
             F      +L+H++ I+GYG D +        YWI++NSWG  WGE GY R+ RG    
Sbjct: 242 --FSGPCGTSLNHAITIIGYGQDSS-----GTKYWIVRNSWGSSWGEGGYVRMARGVSSS 294

Query: 334 DGSCGI 339
            G+CGI
Sbjct: 295 SGACGI 300


>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
 gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
          Length = 341

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 124/312 (39%), Positives = 174/312 (55%), Gaps = 28/312 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
           F  +H K Y    E   RL IF+ N  KI +  Q    G   + L  N+++DL   EF+ 
Sbjct: 32  FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91

Query: 102 KYLGF------KLKPSYAD-RSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
              GF      +L+ +    + V  + P ++TLP++ DWR   AVT VKDQ  CGS WAF
Sbjct: 92  LMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGHCGSCWAF 151

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           S+TG +EG +  K+  LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ E
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 209

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
           K+YPY   D +C  NK        G+  + + DE  MA+ +   GP++VAI+A   + QF
Sbjct: 210 KSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 269

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV +  Q  CD   +NL H VL+VG+G D +        YW++KNSWG  WG+KG+ 
Sbjct: 270 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GDDYWLVKNSWGTTWGDKGFI 320

Query: 329 RLYRG-DGSCGI 339
           ++ R  D  CGI
Sbjct: 321 KMLRNKDNQCGI 332


>gi|157868354|ref|XP_001682730.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
 gi|68126185|emb|CAJ07238.1| cysteine peptidase A (CPA) [Leishmania major strain Friedlin]
          Length = 354

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 124/321 (38%), Positives = 169/321 (52%), Gaps = 29/321 (9%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
           +A +  F E+H K++    +   R + F  N++    L +T +    Y ++ +F+DL+  
Sbjct: 39  SAHYGRFKERHGKSFGEDADEGHRFNAFKQNMQTAYFL-NTHNPHAHYDVSGKFADLTPQ 97

Query: 98  EFQAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAF--DWREYDAVTGVKDQTMCGSS 150
           EF   YL     P Y      D      + +  L  A   DWRE  AVT VK+Q MCGS 
Sbjct: 98  EFAKLYL----NPDYYAHRGKDYKEHVHVDDSVLSGAMSVDWREKGAVTPVKNQGMCGSC 153

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS  GNIE  +A K   LVSLSEQ L+ CD  DDGC GG +  A + I+    G +  
Sbjct: 154 WAFSAIGNIESQWALKNHSLVSLSEQMLVSCDDIDDGCNGGLMDQAMEWIIQHHNGTVPT 213

Query: 211 EKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
           EK+YPY    G    C  +K     +I+GY+S+  DE  +A Y+ + GP+AVA++A   Q
Sbjct: 214 EKSYPYASAGGTSPPCH-DKGEFGARISGYMSLPHDEKAIAAYVEKKGPVAVAVDATTWQ 272

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
            Y  GV       C G   +L+H VL+VG+   R K      PYWI+KNSWG  WGEKGY
Sbjct: 273 LYFGGVV----TLCFG--LSLNHGVLVVGFN-KRAK-----PPYWIVKNSWGTSWGEKGY 320

Query: 328 FRLYRGDGSCGINDYVRSALV 348
            RL  G   C + +Y  +A V
Sbjct: 321 IRLAMGSNQCLLKNYPVTATV 341


>gi|154336052|ref|XP_001564262.1| cysteine peptidase A (CPA) [Leishmania braziliensis
           MHOM/BR/75/M2904]
 gi|134061296|emb|CAM38321.1| cysteine peptidase A (CPA) [Leishmania braziliensis
           MHOM/BR/75/M2904]
          Length = 479

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 124/352 (35%), Positives = 171/352 (48%), Gaps = 27/352 (7%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
            FA VA +   +   S ++     LH +     +A F +F +QH K++        R + 
Sbjct: 8   LFAMVATVLFALCYCSTVIA--RTLHGIDDEVASAHFMHFKKQHGKSFGEEAVEGHRFNA 65

Query: 66  FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT 125
           F  N++    L      +      +F+ L+  EF  +YL     P Y  R + A      
Sbjct: 66  FKENMQTAVYLNAQNPHAHYDVSGKFAALTPQEFAKQYL----NPDYYTRQLKAHKERAH 121

Query: 126 L-------PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
           +         A DWRE  AVT VKDQ +CGS WAFS  GNIEG +A     LVSLSEQ L
Sbjct: 122 VYEGVRGGLSAVDWREKGAVTEVKDQGLCGSCWAFSAIGNIEGQWALSGNTLVSLSEQML 181

Query: 179 IDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD--KACRLNKKATQVKING 236
           + CD  D GC GG +  A+  I+    G +  E +YPY   D   A  L+      +I+G
Sbjct: 182 VSCDTVDMGCNGGLMDQAWAWIIKNHSGAVYTEVSYPYTSGDGSTASCLSTGKVGARISG 241

Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
            VS+ +DE  +  +L +NGP+++A++A   Q Y  GV      +      NL+H VL+VG
Sbjct: 242 QVSLPQDEDAIEAWLEKNGPISIAVDATTWQLYFGGVVSNCFAY------NLNHGVLLVG 295

Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           Y            PYWI+KNSWG  WGE GY RL +G   C + DY  SA V
Sbjct: 296 YNNSANP------PYWIVKNSWGTSWGEHGYIRLAKGSNQCMMKDYAMSATV 341


>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
          Length = 358

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 122/344 (35%), Positives = 187/344 (54%), Gaps = 27/344 (7%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           +FA + + ++ VS S+F    +     +  ++    +  +L QH + Y    E+     I
Sbjct: 11  YFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKR--YERWLVQHGRRYKNRDEWQRHFGI 68

Query: 66  FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL-KPSYADRSVPAMIPNI 124
           +  N+R I  + + ++ S     N+F+D++  E++A Y+G    + S  ++S      + 
Sbjct: 69  YQSNVRFINYI-NAQNFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSSFKRERSK 127

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
            LP + DWR+  AVT V++Q  CGS WAFST   +EG+   +T KLVSLSEQEL+DCD +
Sbjct: 128 VLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDCDID 187

Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVS 241
             ++GC GG + NAF  I  K  GG+   + YPY G+   C  +K A   VKI+GY +V 
Sbjct: 188 SGNEGCNGGYMVNAFKFI--KQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYETVP 245

Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
            +   + +  V   P++VAI+A  Y  Q Y  G+      FC    + L+H+V ++GYG 
Sbjct: 246 PNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGI---FNGFC---GKQLNHAVTVIGYGE 299

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           D  K       YW++KNSWG GWGE GY R+ R     +G CGI
Sbjct: 300 DNGK------KYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGI 337


>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 128/331 (38%), Positives = 174/331 (52%), Gaps = 32/331 (9%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
           + H   +  +  QH K Y T  E YSR  IF  N  KI      EH         S    
Sbjct: 18  LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72

Query: 88  LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           +N+F D+   EF  + +G  LK          V     N TLP++ DWR    V+ VKDQ
Sbjct: 73  MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
             CGS WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++  + GC GG +  AF  I  
Sbjct: 133 GECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI-- 190

Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
           K  GGL+ E++YPY   DDK C+ +  +    + GY  V S +E  + + +   GP++VA
Sbjct: 191 KANGGLDTEESYPYTATDDKPCKFDNSSVGATLIGYKDVKSSNEHALKRAVATVGPVSVA 250

Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           I+A   + QFY +GV    Q  C    E L H VL+VGYG      +H+A  +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLVVGYGA-MNDNSHQA--FWIVKNSW 303

Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           G  WG++GY  + R  +  CGI       LV
Sbjct: 304 GPNWGDQGYIMMSRNKNNQCGIATSASYPLV 334


>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
          Length = 336

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 126/319 (39%), Positives = 176/319 (55%), Gaps = 22/319 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTA 97
           +N + + H+K Y    E + R+ ++  NL+KI+L  + EH  G +    G+N F D++  
Sbjct: 28  WNLWKDWHSKKYHEKEEGWRRM-VWEKNLKKIEL-HNLEHSMGKHTYSLGMNHFGDMTHE 85

Query: 98  EFQAKYLGFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           EF+    G+KLK     R    M PN +  PR+ DWR+   VT VKDQ  CGS WAFSTT
Sbjct: 86  EFRQIMNGYKLKSQRKLRGSLFMEPNFLEAPRSVDWRDKGYVTPVKDQGQCGSCWAFSTT 145

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           G +EG +  KT  LVSLSEQ L+DC + +  +GC GG +  AF  I  K  GGL+ E++Y
Sbjct: 146 GAMEGQHFRKTGTLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI--KDNGGLDSEESY 203

Query: 215 PYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYV 270
           PY G D+  C  +         G+V V S  E  + K +   GP++VAI+A   + QFY 
Sbjct: 204 PYLGTDEGPCHYDPSYNSANDTGFVDVPSGSERALMKAVASVGPVSVAIDAGHESFQFYH 263

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +G    I +  +  +E L H VL+VGYG +      K   YWI+KNSW E WG+KGY  +
Sbjct: 264 SG----IYYDKECSSEELDHGVLVVGYGFEGKDVDGKK--YWIVKNSWSENWGDKGYIYM 317

Query: 331 YRGDGS-CGINDYVRSALV 348
            +   + CGI       LV
Sbjct: 318 AKDKKNHCGIATAASYPLV 336


>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
          Length = 341

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 126/344 (36%), Positives = 170/344 (49%), Gaps = 27/344 (7%)

Query: 5   YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
           Y +  +ALL +  + +S       K  +LH          ++ Q+ + Y    E   R  
Sbjct: 7   YRYICLALLFVLAAWASHA-----KARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYK 61

Query: 65  IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI 124
           IF  N+ +I+      + S    +NEF+DL+  EF+A    FK      + +        
Sbjct: 62  IFKDNVARIESFNKAMNKSYKLSINEFADLTNEEFRASRNRFKAHICSTEATSFKYEHVX 121

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ- 183
            +P   DWR+  AVT +KDQ  CGS WAFS    +EG+    T KL+SLSEQEL+DCD  
Sbjct: 122 AVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTS 181

Query: 184 -EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSVS 241
            ED GC GG + +AF  I  +   GL  E  YPY G D  C   K A    KINGY  V 
Sbjct: 182 GEDQGCSGGLMDDAFKFI--EQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVP 239

Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
            +     +  V + P+AVAI+A  +  QFY +GV     F    G E L H V  VGYG 
Sbjct: 240 ANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGV-----FTGQCGTE-LDHGVSAVGYGT 293

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
                +   + YW++KNSWG GWGE+GY R+ R     +G CGI
Sbjct: 294 -----SDDGMKYWLVKNSWGTGWGEEGYIRMQRDVTEKEGLCGI 332


>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
           distachyon]
          Length = 457

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 134/335 (40%), Positives = 177/335 (52%), Gaps = 29/335 (8%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S F +VG  +     + +   LF  +L +H K YA+  E   R  +F  NL+ I  + + 
Sbjct: 128 SDFSIVGYSEEDLSSNDRIIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKV-NR 186

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM----IPNITLPRAFDWREY 135
           E  S   GLNEF+DL+  EF+A YLG    P+ A  S  +     +    LP++ DWR  
Sbjct: 187 EVTSYWLGLNEFADLTHEEFKATYLGLA-PPAPARESRGSFKYEDVSADDLPKSVDWRTK 245

Query: 136 DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSIS 194
            AVT VK+Q  CGS WAFST   +EG+ A  T  L +LSEQELIDC  + ++GC GG + 
Sbjct: 246 GAVTEVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMD 305

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ--VKINGYVSV-SRDETDMAKYL 251
            AF  I S   GGL  E+ YPY  ++ +C   KK+    V I+GY  V + +E  + K L
Sbjct: 306 YAFSYIASS--GGLHTEEAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKAL 363

Query: 252 VENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
               P++VAI A     QFY  GV      F       L H V  VGYG D+     K  
Sbjct: 364 AHQ-PVSVAIEASGRHFQFYSGGV------FDGPCGTQLDHGVAAVGYGSDKG----KGH 412

Query: 310 PYWIIKNSWGEGWGEKGYFRLYR----GDGSCGIN 340
            Y I++NSWG  WGEKGY R+ R    G+G CGIN
Sbjct: 413 DYIIVRNSWGAKWGEKGYIRMKRGTGKGEGLCGIN 447


>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
 gi|194706024|gb|ACF87096.1| unknown [Zea mays]
 gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
          Length = 460

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 129/327 (39%), Positives = 171/327 (52%), Gaps = 41/327 (12%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-------------SGVY 86
           A F+ +  +H K YAT  E  +RL +F+ N   +                      S   
Sbjct: 34  AQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSYTL 93

Query: 87  GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI-----PNITLPRAFDWREYDAVTGV 141
            LN F+DL+  EF+A  LG ++ P  A RS  A +         +P A DWR+  AVT V
Sbjct: 94  ALNAFADLTHEEFRAARLG-RIAPGAALRSRAAPVYWGLGGGAAVPDALDWRKSGAVTKV 152

Query: 142 KDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTI 200
           KDQ  CG+ W+FS TG +EG+   KT  LVSLSEQELIDCD+  + GC GG +  A+  +
Sbjct: 153 KDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFV 212

Query: 201 MSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           +    GG++ E+ YPYR  D  C  NK K   V I+GY  V  ++ D+    V   P++V
Sbjct: 213 IKN--GGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPVSV 270

Query: 260 AI--NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNS 317
            I  +A A Q Y  G+      F      +L H+VLIVGYG +  K       YWI+KNS
Sbjct: 271 GICGSARAFQLYYQGI------FDGPCPTSLDHAVLIVGYGSEGGK------DYWIVKNS 318

Query: 318 WGEGWGEKGYFRLYR--GD--GSCGIN 340
           WGE WG KGY  ++R  GD  G CGIN
Sbjct: 319 WGESWGMKGYMHMHRNTGDSKGVCGIN 345


>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
 gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
 gi|223946183|gb|ACN27175.1| unknown [Zea mays]
 gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
          Length = 385

 Score =  193 bits (490), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 136/371 (36%), Positives = 175/371 (47%), Gaps = 53/371 (14%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
             AG + L+L      F +VG  +     H     LF  +L +H + YA+L E   R  +
Sbjct: 23  LLAGSSCLALARPSGDFSIVGYSEEDLSSHESLAELFERWLSRHRRAYASLEEKLRRFQV 82

Query: 66  FSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFK-----------LKPSY 112
           F  NL  I    +T      Y  GLNEF+DL+  EF+A YLG +                
Sbjct: 83  FKDNLHHID---ETNRKVSSYWLGLNEFADLTHDEFKATYLGLRSSVGDGGSGIDDDDEP 139

Query: 113 ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
            +      +   +LP++ DWR   AVTGVK+Q  CGS WAFST   +EG+    T  L +
Sbjct: 140 EEEEGYEGVDGASLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTA 199

Query: 173 LSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR------- 224
           LSEQELIDCD + ++GC GG +  AF  I     GGL  E+ YPY  ++  C+       
Sbjct: 200 LSEQELIDCDTDGNNGCNGGLMDYAFSYIAHN--GGLHTEEAYPYLMEEGTCQRSSSSEK 257

Query: 225 --------LNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGV 273
                    N  A  V I+GY  V R +E  + K L +  P++VAI A     QFY  GV
Sbjct: 258 KWPGSSEDANDDAAVVTISGYEDVPRNNEQALLKALAQQ-PVSVAIEASGRNFQFYSGGV 316

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                 F       L H V  VGYG        K   Y I+KNSWG  WGEKGY R+ RG
Sbjct: 317 ------FDGPCGTQLDHGVAAVGYGT-----AAKGHDYIIVKNSWGPSWGEKGYIRMRRG 365

Query: 334 DGS----CGIN 340
            G     CGIN
Sbjct: 366 TGKRQGLCGIN 376


>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
 gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
          Length = 338

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 119/309 (38%), Positives = 170/309 (55%), Gaps = 30/309 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGL--NEFSDLSTAEF 99
           +  +L+++ + Y    E+  R  I+  N++ I+      +    Y L  N F+D++  EF
Sbjct: 39  YETWLKRYGRHYRDREEWEVRFDIYQSNVQYIEFYNSQNYS---YKLIDNRFADITNEEF 95

Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
           ++ YLG+   P +  ++      +  LP++ DWR+  AVT VKDQ  CGS WAFS    +
Sbjct: 96  KSTYLGYL--PRFRVQTEFRYHKHGELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAV 153

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           EG+   KT+ LVSLSEQ+LIDCD +  ++GCEGG +  AF+ I  K  GG+   K YPY+
Sbjct: 154 EGINKIKTENLVSLSEQQLIDCDIKSGNEGCEGGDMYIAFNYI--KKHGGIATAKEYPYK 211

Query: 218 GDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVS 274
           G D  C  +K K   V I+GY SV      M K  V + P+++A +A  YA QFY  G+ 
Sbjct: 212 GRDGNCNKSKAKNNAVTISGYESVPARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGI- 270

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG- 333
                F     +NL+H + IVGYG +          YWI+KNSW   WGE GY R+ R  
Sbjct: 271 -----FSGSCGKNLNHGMTIVGYGEE------NGDKYWIVKNSWANDWGESGYVRMKRDT 319

Query: 334 ---DGSCGI 339
              DG+CGI
Sbjct: 320 KDKDGTCGI 328


>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
          Length = 467

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 132/357 (36%), Positives = 180/357 (50%), Gaps = 38/357 (10%)

Query: 1   MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHT-----ALFNYFLEQHNKTYAT 55
           M+ F F     LL L  S     ++G ++ H       T     A++  +L +H K+Y  
Sbjct: 10  MAVFLFL----LLGL-ASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNA 64

Query: 56  LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLG----FKLKPS 111
           L E   R  IF  NLR I    + E+ +   GLN F+DL+  E+++ YLG     K + S
Sbjct: 65  LGEKERRFQIFKDNLRFIDE-HNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKRRSS 123

Query: 112 YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
                  A     +LP + DWR+  AV  VKDQ  CGS WAFST   +EG+    T  L+
Sbjct: 124 NKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLI 183

Query: 172 SLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKA 229
           SLSEQEL+DCD   ++GC GG +  AF+ I++   GG++ E+ YPY+  D  C +  K A
Sbjct: 184 SLSEQELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDSEEDYPYKASDGRCDQYRKNA 241

Query: 230 TQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNEN 287
             V I+GY  V  ++    +  V N P++VAI A     Q Y +G+      F       
Sbjct: 242 KVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGI------FTGRCGTA 295

Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-----GDGSCGI 339
           L H V  VGYG      T   V YWI+KNSWG  WGE+GY R+ R       G CGI
Sbjct: 296 LDHGVTAVGYG------TENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGI 346


>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
 gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
          Length = 341

 Score =  192 bits (489), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 123/312 (39%), Positives = 171/312 (54%), Gaps = 28/312 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQA 101
           F  +H K Y    E   RL IF+ N  KI +  Q    G   + L  N+++DL   EF+ 
Sbjct: 32  FKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLHHEFRQ 91

Query: 102 KYLGFKL----KPSYADRSVPAMI----PNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
              GF      +   AD S   +      ++TLP++ DWR   AVT VKDQ  CGS WAF
Sbjct: 92  LMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCGSCWAF 151

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           S+TG +EG +  K+  LVSLSEQ L+DC  +  ++GC GG + NAF  I  K  GG++ E
Sbjct: 152 SSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTE 209

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
           K+YPY   D +C  NK        G+  + + DE  MA+ +   GP++VAI+A   + QF
Sbjct: 210 KSYPYEAIDDSCHFNKGTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHESFQF 269

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  GV +  Q  CD   +NL H VL+VG+G D +        YW++KNSWG  WG+KG+ 
Sbjct: 270 YSEGVYNEPQ--CDA--QNLDHGVLVVGFGTDES-----GDDYWLVKNSWGTTWGDKGFI 320

Query: 329 RLYRG-DGSCGI 339
           ++ R  +  CGI
Sbjct: 321 KMLRNKENQCGI 332


>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
 gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
          Length = 479

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 135/323 (41%), Positives = 173/323 (53%), Gaps = 39/323 (12%)

Query: 40  ALFNYFLEQHNKTYATLV--------EYYSRLHIFSGNLRKIQLLQDTEHGSGVY-GLNE 90
           ALF+ ++ QH K+YA           E  +R  IF  NLR I    + E   G + GLN 
Sbjct: 55  ALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIH--GENEKNQGYFLGLNA 112

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAM----IPNITLPRAFDWREYDAVTGVKDQTM 146
           F+DL+  EF+A+  G +   S    S        +    LP + DWRE  AV GVKDQ  
Sbjct: 113 FADLTNEEFRAQRHGGRFDRSRERTSYEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQGS 172

Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDTIMSKLG 205
           CGS WAFS    IEGV    T +LVSLSEQEL+DCD+ ED+GC GG +  AF  ++    
Sbjct: 173 CGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKN-- 230

Query: 206 GGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINA 263
           GGL+ E  YPY+G    C  +K  A  V I+GY  V   DET + K  V + P++VAI+A
Sbjct: 231 GGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLK-AVAHQPVSVAIDA 289

Query: 264 --YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
              ++QFY +G+      F      +L H V  VGYG +  K       YWIIKNSWG  
Sbjct: 290 GGSSMQFYRSGI------FTGRCGTDLDHGVTNVGYGKEDGK------AYWIIKNSWGSN 337

Query: 322 WGEKGYFRLYRGD----GSCGIN 340
           WGEKGY ++ R      G CGIN
Sbjct: 338 WGEKGYIKMARNTGLAAGLCGIN 360


>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
          Length = 330

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/353 (35%), Positives = 173/353 (49%), Gaps = 36/353 (10%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
               VA+++L+ +  SF +  +E             ++ F   H KTY    E   R+ I
Sbjct: 4   LLVAVAIIALSYAHPSFDIYPEE-------------WHVFKAMHGKTYKNQFEEMFRMKI 50

Query: 66  FSGNLRKIQLLQDT-EHGSGVYGL--NEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP 122
           F  N +KI+      E G   Y +  N F DL   EF+A   GFK+ P            
Sbjct: 51  FMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHEFKALMNGFKMSPDTKRNGELYFPS 110

Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
           N  LP+  DWR+  AVT VKDQ  CGS W+FS TG++EG    KT KLVSLSEQ L+DC 
Sbjct: 111 NSNLPKTVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVDCS 170

Query: 183 QE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV 240
               ++GCEGG +  AF  +      G++ E +YPY   +  CR  K        G+V +
Sbjct: 171 TSYGNNGCEGGLMDQAFQYVSDN--KGIDTEASYPYEARENTCRFKKNKVGGTDKGHVDI 228

Query: 241 -SRDETDMAKYLVENGPMAVAINAY--ALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVG 296
            + DE  +   L   GP++VAI+A   + QFY  GV + P     +  + +L H VL VG
Sbjct: 229 PAGDEKALQNALATVGPISVAIDANHGSFQFYSKGVYNEP-----NCSSYDLDHGVLAVG 283

Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS-CGINDYVRSALV 348
           YG      T     YW++KNSWG  WGE GY ++ R   + CGI       LV
Sbjct: 284 YG------TENGQDYWLVKNSWGPSWGENGYIKIARNHSNHCGIASMASYPLV 330


>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
 gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
          Length = 340

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 113/302 (37%), Positives = 158/302 (52%), Gaps = 20/302 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ QH + Y  + E   R  IF  N+ +I+   +        G+N+F+DL+  EF+A Y 
Sbjct: 43  WMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYH 102

Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
           G+K + S    S         +P + DWR   AVT VKDQ  CG  WAFST   IEG+  
Sbjct: 103 GYKRQSSKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIK 162

Query: 165 AKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
            +T  L+SLSEQ+L+DC   + GC+GG +  AF  I+    GGL  E  YPY+G D  C 
Sbjct: 163 LQTGNLISLSEQQLVDCTAGNKGCQGGLMDTAFQYIIRN--GGLTSEDNYPYQGVDGTCS 220

Query: 225 LNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFC 281
             K A T+ +I GY  V ++  +     V   P++V ++      QFY +GV     F  
Sbjct: 221 SEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVGVDGGGNDFQFYKSGV-----FNG 275

Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----C 337
           D G +  +H+V  +GYG D          YW++KNSWG  WGE GY R+ RG GS    C
Sbjct: 276 DCGTQQ-NHAVTAIGYGTD-----IDGTDYWLVKNSWGTSWGENGYMRMRRGIGSSEGLC 329

Query: 338 GI 339
           G+
Sbjct: 330 GV 331


>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
          Length = 344

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 121/311 (38%), Positives = 166/311 (53%), Gaps = 31/311 (9%)

Query: 45  FLEQHNKTYATLVEYYS--RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAK 102
           ++ QH + YA   E +   R ++F  N+ +I+   D +  +    +N+F+DL+  EF+A 
Sbjct: 40  WMSQHGRVYADEQEDHKNKRFNVFKENVERIEEFNDGK--TFKLAINQFADLTNEEFRAS 97

Query: 103 YLGFK---LKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           Y GFK   +  S   +  P    N++  LP + DWR+  AVT VK+Q  CG  WAFS   
Sbjct: 98  YNGFKGPMVLSSQITKPTPFRYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVA 157

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
            IEG+    T KL+SLSEQEL+DCD +  D GCEGG +  AF+ I++   GGL  E  YP
Sbjct: 158 AIEGITQISTGKLISLSEQELVDCDTKGIDHGCEGGLMDTAFEFIINN--GGLTTESNYP 215

Query: 216 YRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTG 272
           Y+G+D  C  NK     V I GY  V  ++       V + P++VAI A     QFY +G
Sbjct: 216 YKGEDGTCNFNKTNPIAVSITGYEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSG 275

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V     F  + G E L H+V  VGYG      +     YWI+KNSWG  WGE GY  + +
Sbjct: 276 V-----FTGECGTE-LDHAVTAVGYGE-----SEDGSKYWIVKNSWGTKWGESGYIEMQK 324

Query: 333 G----DGSCGI 339
                 G CGI
Sbjct: 325 DIKVKQGLCGI 335


>gi|343470212|emb|CCD17026.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 445

 Score =  192 bits (489), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 118/346 (34%), Positives = 184/346 (53%), Gaps = 27/346 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           V LL++    + F+ V    LH    ++    F  F ++++++Y    E   R  +F  N
Sbjct: 14  VGLLAV---AACFVPVALGVLHAEQSLQQQ--FAAFKQKYSRSYKDATEEAFRFRVFKQN 68

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL-- 126
           + + +  +   +    +G+  FSD+S  EF+A Y  G +   +   R  P  + N++   
Sbjct: 69  MERAKE-EAAANPYATFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGK 125

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQED 185
            P A DWR+  AVT VKDQ  C SSWAF+  GNIEG +     +L SLSEQ L+ CD  D
Sbjct: 126 APEAVDWRKKGAVTPVKDQGKCDSSWAFTVIGNIEGQWKIAGHELTSLSEQMLVSCDTND 185

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSR 242
            GC  G +  AF  I+S   G +  E++YPY    G+   C  + K     I+ +V +  
Sbjct: 186 LGCRAGFMDTAFKWIVSSNNGNVFTEQSYPYASGGGNVPTCNKSGKVVGANIDDHVHILD 245

Query: 243 DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
           +E  +A++L + GP+A+A++A + Q Y  GV           ++ ++ + L+VGY  D +
Sbjct: 246 NENAIAEWLAKKGPVAIAVDATSFQSYTGGV------LTSCISKEVNSAALLVGYD-DTS 298

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           K      PYWIIKNSW +GWGE+GY R+ +G   C + +YV SA+V
Sbjct: 299 K-----PPYWIIKNSWSKGWGEEGYIRIEKGTNQCRMKEYVSSAVV 339


>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
          Length = 459

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 122/315 (38%), Positives = 167/315 (53%), Gaps = 33/315 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           L+  +  +H K+Y  + E   R   F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 40  LYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 98

Query: 97  AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            E++  YLG + KP      +DR + A   N  LP + DWR   AV  +KDQ  CGS WA
Sbjct: 99  EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQGGCGSCWA 156

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           FS    +EG+    T  L+SLSEQEL+DCD   ++GC GG +  AFD I++   GG++ E
Sbjct: 157 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 214

Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
             YPY+G D+ C +N+K A  V I+ Y  V+ +     +  V N P++VAI A   A Q 
Sbjct: 215 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQL 274

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G+      F       L H V  VGYG +  K       YWI++NSWG+ WGE GY 
Sbjct: 275 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 322

Query: 329 RLYRG----DGSCGI 339
           R+ R      G CGI
Sbjct: 323 RMERNIKASSGKCGI 337


>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
          Length = 368

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 117/306 (38%), Positives = 158/306 (51%), Gaps = 26/306 (8%)

Query: 47  EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF 106
           + H++ +    E   R   F  N R I              LN F D+   EF++ +   
Sbjct: 46  QTHHRVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDMGREEFRSGFADS 105

Query: 107 KLKPSYADRSVPAMIPNIT------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           ++     + +    +P         LPR+ DWR+  AVT VK+Q  CGS WAFST   +E
Sbjct: 106 RINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTAVKNQGRCGSCWAFSTVVAVE 165

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G+ A +T  LVSLSEQELIDCD +++GC+GG + NAF+ I S   GG+  E  YPY   +
Sbjct: 166 GINAIRTGSLVSLSEQELIDCDTDENGCQGGLMENAFEFIKSH--GGITTESAYPYHASN 223

Query: 221 KAC--RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHP 276
             C     ++   V I+G+ +V     D     V + P++VAI+A   ALQFY  GV   
Sbjct: 224 GTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQALQFYSEGV--- 280

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
             F  D G + L H V  VGYGV     +    PYWI+KNSWG  WGE GY R+ RG G+
Sbjct: 281 --FTGDCGTD-LDHGVAAVGYGV-----SDDGTPYWIVKNSWGPSWGEGGYIRMQRGTGN 332

Query: 337 ---CGI 339
              CGI
Sbjct: 333 GGLCGI 338


>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
           [Zea mays]
 gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
           mays]
          Length = 465

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 122/314 (38%), Positives = 166/314 (52%), Gaps = 31/314 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           ++  ++  H +TY  + E   R  +F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 43  MYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDA-HNAAADAGVHSFRLGLNRFADLTN 101

Query: 97  AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
            E++A YLG + +P   +R + A      N  LP + DWR   AV  VKDQ   GS WAF
Sbjct: 102 DEYRATYLGARTRPQ-RERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSYGSCWAF 160

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           ST   +EG+    T  L+SLSEQEL+DCD   + GC GG +  AF+ I++   GG++ EK
Sbjct: 161 STIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDTEK 218

Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQF--Y 269
            YPY+G D  C +N+K A  V I+ Y  V  ++    +  V N P++VAI A   QF  Y
Sbjct: 219 DYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTQFQLY 278

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +G+      F       L H V  VGYG +  K       YWI+KNSWG  WGE GY R
Sbjct: 279 SSGI------FTGSCGTALDHGVTAVGYGTENGK------DYWIVKNSWGSSWGESGYVR 326

Query: 330 LYRG----DGSCGI 339
           + R      G CGI
Sbjct: 327 MERNIKASSGKCGI 340


>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
          Length = 366

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 119/313 (38%), Positives = 164/313 (52%), Gaps = 28/313 (8%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           ++  +L +H K Y  L E   R  +F  NL  IQ   + ++ +   GLN+F+D++  E++
Sbjct: 39  MYEEWLVKHQKVYNGLGEKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYR 98

Query: 101 AKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
             Y G K      L  + +     A      LP   DWR   AV  +KDQ  CGS WAFS
Sbjct: 99  VMYFGTKSDAKRRLMKTKSTGHRYAYSAGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFS 158

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           T   +E +    T K VSLSEQEL+DCD+  + GC GG +  AF+ I+    GG++ +K 
Sbjct: 159 TVATVEAINKIVTGKFVSLSEQELVDCDRAYNQGCNGGLMDYAFEFIIQN--GGIDTDKD 216

Query: 214 YPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
           YPYRG D  C   KK A  V I+GY  V   + +  K  V   P+++AI A   ALQ Y 
Sbjct: 217 YPYRGFDGICDPTKKNAKAVNIDGYEDVPPYDENALKKAVARQPVSIAIEASGRALQLYQ 276

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV      F      +L H V++VGYG      +   V YW+++NSWG GWGE GYF++
Sbjct: 277 SGV------FTGECGTSLDHGVVVVGYG------SENGVDYWLVRNSWGTGWGEDGYFKM 324

Query: 331 YRG----DGSCGI 339
            R      G CGI
Sbjct: 325 QRNVRTPTGKCGI 337


>gi|6649595|gb|AAF21471.1|U85984_1 cysteine proteinase [Clonorchis sinensis]
          Length = 217

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 103/219 (47%), Positives = 136/219 (62%), Gaps = 10/219 (4%)

Query: 130 FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCE 189
           FDWR + AV  V DQ  CGS WAFS  GNIEG +  KT  L+ LSEQ+L+DCD  D+GC 
Sbjct: 8   FDWRNHGAVGPVLDQGDCGSCWAFSAVGNIEGQWFRKTDNLLQLSEQQLLDCDGVDEGCN 67

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAK 249
           GG+   AF  I+    GGL+ +  YPY G +  CR+     +V ING   +  DE   A+
Sbjct: 68  GGTPQQAFKQILGM--GGLQLDSDYPYEGREGQCRMVPSKVKVYINGSKILPEDEQIQAQ 125

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
            L E GP++ A+NA  LQFY  G+ HP+   CD   ++L+H+VL VGYG          +
Sbjct: 126 MLKETGPLSSALNALFLQFYTEGILHPLPALCDA--QSLNHAVLTVGYG------KEGRL 177

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           PYW +KNSW   +GE GYFR+YRGDG+CGIN  V ++++
Sbjct: 178 PYWTVKNSWSTMFGENGYFRIYRGDGTCGINTLVSTSII 216


>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 122/317 (38%), Positives = 169/317 (53%), Gaps = 27/317 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
           F   H KTY + +E   R  IF+ N   I    + ++  G+     G+N+F DL   EF 
Sbjct: 30  FKTTHKKTYQSHMEELLRFKIFTEN-SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFA 88

Query: 101 AKYLGFKLKPSYADRSV--PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
             + G          S   PA + + +LP+  DWR+  AVT VKDQ  CGS WAFS TG+
Sbjct: 89  RIFNGHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGS 148

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           +EG +  K  +LVSLSEQ L+DC Q   ++GCEGG + +AF  I  K   G++ EK+YPY
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPY 206

Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
           +  D  CR  K+       GYV + +  E D+ K +   GP++VAI+A   + Q Y  GV
Sbjct: 207 KAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGV 266

Query: 274 -SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
              P     +  +E+L H VL+VGYGV   K       YW++KNSW E WG++GY  + R
Sbjct: 267 YDEP-----ECSSEDLDHGVLVVGYGVKGGK------KYWLVKNSWAESWGDQGYILMSR 315

Query: 333 -GDGSCGINDYVRSALV 348
             +  CGI       LV
Sbjct: 316 DNNNQCGIASQASYPLV 332


>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
 gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
          Length = 457

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 123/314 (39%), Positives = 167/314 (53%), Gaps = 30/314 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  +  +H K Y+   E   R  ++  NL  IQ     ++ S   GL +F+DL+  EF+ 
Sbjct: 45  FAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQR-HSEKNLSYWLGLTKFADLTNEEFRR 103

Query: 102 KYLGFKLKPSY---ADRSVPAMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           +Y G ++  S      R+        N   P++ DWRE  AVT VKDQ  CGS WAFS  
Sbjct: 104 QYTGTRIDRSRRLKKGRNATGSFRYANSEAPKSIDWREKGAVTSVKDQGSCGSCWAFSAV 163

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           G++EG+ A +T   +SLS QEL+DCD++ + GC GG +  AFD ++    GG++ EK YP
Sbjct: 164 GSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQN--GGIDTEKDYP 221

Query: 216 YRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTG 272
           Y+G D  C +NK  A  V I+ Y  V  ++ +  K  V   P++VAI A     Q Y  G
Sbjct: 222 YQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGG 281

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V      F      +L H VL VGYG      + K + YWI+KNSWGE WGE GY R+ R
Sbjct: 282 V------FTGRCGTDLDHGVLAVGYG------SEKGLDYWIVKNSWGEYWGESGYLRMQR 329

Query: 333 ------GDGSCGIN 340
                 G G CGIN
Sbjct: 330 NLKDDNGYGLCGIN 343


>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
          Length = 365

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 125/323 (38%), Positives = 170/323 (52%), Gaps = 33/323 (10%)

Query: 34  HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
             V++T  +  +L +H KTY  L E  SR  IF+ NL+ I     + + S   GLN+F+D
Sbjct: 30  EEVRNT--YELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGLNQFAD 87

Query: 94  LSTAEFQAKYLGFKLKPSYADRSVP--------AMIPNITLPRAFDWREYDAVTGVKDQT 145
           L+  E+++ YLG K+ P      +         A+  N   P   DWRE  AV+ VK+Q 
Sbjct: 88  LTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAVSPVKNQG 147

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKL 204
            CGS WAFST  ++EG+    T  L+SLSEQEL+DCD + + GC GGS+  AF  I+S  
Sbjct: 148 GCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYAFQFIVSN- 206

Query: 205 GGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA 263
            GG++ E  YPY+G    C  +  KA  V I+GY  V           V + P++V I A
Sbjct: 207 -GGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPVSVGIEA 265

Query: 264 --YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
              A Q Y +GV   +   C     NL H V++VGYG +  K       YWI++NSWG  
Sbjct: 266 SGRAFQLYTSGV---LTGSC---GTNLDHGVVVVGYGSENGK------DYWIVRNSWGPE 313

Query: 322 WGEKGYFRLYRGD-----GSCGI 339
           WGE GY R+ R       G CGI
Sbjct: 314 WGEDGYIRMERNMVDTPVGMCGI 336


>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
          Length = 359

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 125/298 (41%), Positives = 167/298 (56%), Gaps = 28/298 (9%)

Query: 56  LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SY 112
           L E   R ++F  N + +  +   +    +  LN+F+D++  EF++ Y G K+K      
Sbjct: 53  LEEKNKRFNVFKENTKHVHKVNQMDKPYKLK-LNKFADMTNHEFRSSYGGSKVKHYRMLR 111

Query: 113 ADRSVPA--MIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK 169
            DR      M    T LP + DWR+  AVTG+KDQ  CGS WAFST   +EG+   KTK+
Sbjct: 112 GDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKE 171

Query: 170 LVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK- 227
           L+SLSEQ+LIDCD+ DD GC GG + +AF+ I  K  GG+  E  YPY+  D+ C + K 
Sbjct: 172 LLSLSEQQLIDCDRSDDHGCNGGLMESAFEFI--KKNGGITTENNYPYKAKDERCDMLKM 229

Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGN 285
            A  V I+G+ SV  ++       V + P++VAI+A    LQFY  GV     F  + G 
Sbjct: 230 NAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGV-----FDGECGT 284

Query: 286 ENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           E L H V IVGYG      T     YWI+KNSWG  WGEKGY R+ RG    +G CGI
Sbjct: 285 E-LDHGVAIVGYGT-----TLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGI 336


>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
 gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
          Length = 341

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 186/348 (53%), Gaps = 43/348 (12%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           +AL+++  +VS   V+ +E             ++ F  +H K Y    E   RL IF+ N
Sbjct: 10  LALVAVAQAVSYAEVIQEE-------------WHTFKLEHRKNYQDETEERFRLKIFNEN 56

Query: 70  LRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL----KPSYADRSVPAMI- 121
             KI     L  T   S    +N+++D+   EF +   GF      +   AD S   +  
Sbjct: 57  KHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTF 116

Query: 122 ---PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
               ++TLP+  DWR   AVT VKDQ  CGS WAFS+TG +EG +  K+  LVSLSEQ L
Sbjct: 117 ISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNL 176

Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
           +DC  +  ++GC GG + NAF  I  K  GG++ EK+YPY   D +C  NK +      G
Sbjct: 177 VDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEAIDDSCHFNKGSIGATDRG 234

Query: 237 YVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGV-SHPIQFFCDGGNENLSHSV 292
           +V + + +E  MA+ +   GP+AVAI+A   + QFY  GV + P    CD   +NL H V
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPA---CDA--QNLDHGV 289

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
           L+VG+G D +        YW++KNSWG  WG+KG+ ++ R  +  CGI
Sbjct: 290 LVVGFGTDES-----GEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGI 332


>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
          Length = 465

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 126/330 (38%), Positives = 168/330 (50%), Gaps = 38/330 (11%)

Query: 31  HHLHHVKHT--------ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG 82
           H  H  K +        A++  +L +H K Y  L E   R  IF  NL  I    ++E+ 
Sbjct: 32  HQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQ-HNSENR 90

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGF-----KLKPSYADRSVPAMIPNITLPRAFDWREYDA 137
           +   GLN F+DL+  EF++ YLG      K  P  +DR  P +    +LP + DWR+  A
Sbjct: 91  TYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDRYAPRV--GDSLPDSVDWRKEGA 148

Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNA 196
           V  VKDQ  CGS WAFST   +EG+    T  L++LSEQEL+DCD   ++GC GG +  A
Sbjct: 149 VAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGLMDYA 208

Query: 197 FDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQVKINGYVSVSRDETDMAKYLVENG 255
           F+ I++   GG++ E  YPY G D  C    K A  V I+ Y  V  ++    K  V N 
Sbjct: 209 FEFIINN--GGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQ 266

Query: 256 PMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
           P++VAI       Q Y +GV      F      +L H V  VGYG      T K   YWI
Sbjct: 267 PVSVAIEGGGRNFQLYNSGV------FTGECGTSLDHGVAAVGYG------TEKGKDYWI 314

Query: 314 IKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           ++NSWG+ WGE GY R+ R      G CGI
Sbjct: 315 VRNSWGKSWGESGYIRMERNIASPTGKCGI 344


>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
          Length = 423

 Score =  192 bits (488), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 125/312 (40%), Positives = 168/312 (53%), Gaps = 39/312 (12%)

Query: 46  LEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV-----YGLNEFSDLSTAEFQ 100
           L +H+K Y  L     R  IF  NLR I      EH  GV      GLN+F+DLS  E++
Sbjct: 11  LVKHHKNYNALGAKEKRFEIFKDNLRFID-----EHNKGVNQSFKLGLNKFADLSNEEYK 65

Query: 101 AKYLGFKL----KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           + +LG ++    K   +DR    +     LP++ DWRE  AV  VKDQ  CGS WAFST 
Sbjct: 66  SMFLGGRMVRDRKGFESDRFKYGV--GDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTV 123

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
             +EG+    T  L+SLSEQEL+DCD+  + GC GG +  AF+ I+    GG++ E  YP
Sbjct: 124 AAVEGINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKN--GGIDTEDDYP 181

Query: 216 YRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           Y+G D  C  N+K A  V ING+  V +++    K  V + P++VAI A   A Q Y +G
Sbjct: 182 YKGVDGQCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESG 241

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           + + +         +L H V+ VGYG +  K       YWI++NSWG  WGE GY RL R
Sbjct: 242 IFNGL------CGTDLDHGVVAVGYGTEDGK------DYWIVRNSWGPNWGENGYIRLER 289

Query: 333 -----GDGSCGI 339
                  G CGI
Sbjct: 290 NVASTNTGKCGI 301


>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
 gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
          Length = 342

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 121/320 (37%), Positives = 158/320 (49%), Gaps = 24/320 (7%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
             LH          ++ ++ + Y    E   R  IF  N+  I+      +      +NE
Sbjct: 27  RSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEFIESFNKLGNRPYKLDINE 86

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAM-IPNIT-LPRAFDWREYDAVTGVKDQTMCG 148
           F+DL+  EF+    G+K           +    N+T +P + DWR+  AVT +KDQ  CG
Sbjct: 87  FADLTNEEFKVSKNGYKRSSGVGLTEKSSFRYANVTAVPTSMDWRQNGAVTPIKDQGQCG 146

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGG 206
             WAFS    +EG+    T KL+SLSEQEL+DCD   ED GCEGG + +AF+ I  K  G
Sbjct: 147 CCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFI--KQNG 204

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA-- 263
           GL  E  YPY+G D  C  NK      KI GY  V  +  D     V + P++VAI+A  
Sbjct: 205 GLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALLKAVASQPVSVAIDASG 264

Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
            A QFY  GV     F  D G E L H V  VGYG      +     YW++KNSWG  WG
Sbjct: 265 SAFQFYSGGV-----FTGDCGTE-LDHGVTAVGYGT-----SDDGTKYWLVKNSWGTSWG 313

Query: 324 EKGYFRLYRG----DGSCGI 339
           E GY R+ R     +G CGI
Sbjct: 314 EDGYIRMERDIEAKEGLCGI 333


>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 123/317 (38%), Positives = 170/317 (53%), Gaps = 27/317 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
           F   H KTY + +E   R  IF+ N   I    + ++  G+     G+N+F DL   EF 
Sbjct: 30  FKTTHKKTYQSHMEELLRFKIFTEN-SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFA 88

Query: 101 AKYLGF--KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
             + G     K   +    PA + + +LP+A DWR+  AVT VKDQ  CGS WAFS TG+
Sbjct: 89  RIFNGHHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGS 148

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           +EG +  K  +LVSLSEQ L+DC Q   ++GCEGG + +AF  I  K   G++ EK+YPY
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPY 206

Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
              D  CR  K+       GYV + +  E D+ K +   GP++VAI+A   + Q Y  GV
Sbjct: 207 EAVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGV 266

Query: 274 -SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
              P     +  +E+L H VL+VGYGV   K       YW++KNSW E WG++GY  + R
Sbjct: 267 YDEP-----ECSSEDLDHGVLVVGYGVKGGK------KYWLVKNSWAESWGDQGYILMSR 315

Query: 333 -GDGSCGINDYVRSALV 348
             +  CGI       LV
Sbjct: 316 DNNNQCGIASQASYPLV 332


>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
          Length = 466

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 121/348 (34%), Positives = 184/348 (52%), Gaps = 34/348 (9%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKH---TALFNYFLEQHNKTYATLVEYYSRLHIF 66
           + L+  T+S +S M +      H+HH      +AL+  +L +H K+Y  L E   R  IF
Sbjct: 14  LMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNALGEKDKRFQIF 73

Query: 67  SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-------LKPSYADRSVPA 119
             NL+ I       + S   GL +F+DL+  E+++ YLG K       L  + +DR +P 
Sbjct: 74  KDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLSKNKSDRYLPK 133

Query: 120 MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
           +    +LP + DWR+   + GVKDQ  CGS WAFS    +E + A  T  L+SLSEQEL+
Sbjct: 134 V--GDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELV 191

Query: 180 DCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGY 237
           DCD+  ++GC+GG +  AF+ +++   GG++ E+ YPY+  +  C +  K A  VKI+ Y
Sbjct: 192 DCDKSYNEGCDGGLMDYAFEFVINN--GGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSY 249

Query: 238 VSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
             V  +     +  V + P+++AI A    LQ Y +G+      F       + H V+  
Sbjct: 250 EDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGI------FTGKCGTAVDHGVVAA 303

Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           GYG      +   + YWI++NSWG  WGEKGY R+ R      G CG+
Sbjct: 304 GYG------SENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGL 345


>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
 gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
          Length = 458

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 122/315 (38%), Positives = 167/315 (53%), Gaps = 33/315 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           L+  +  +H K+Y  + E   R   F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 39  LYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 97

Query: 97  AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            E++  YLG + KP      +DR + A   N  LP + DWR   AV  +KDQ  CGS WA
Sbjct: 98  EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           FS    +EG+    T  L+SLSEQEL+DCD   ++GC GG +  AFD I++   GG++ E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 213

Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
             YPY+G D+ C +N+K A  V I+ Y  V+ +     +  V N P++VAI A   A Q 
Sbjct: 214 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQL 273

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G+      F       L H V  VGYG +  K       YWI++NSWG+ WGE GY 
Sbjct: 274 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 321

Query: 329 RLYRG----DGSCGI 339
           R+ R      G CGI
Sbjct: 322 RMERNIKASSGKCGI 336


>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
          Length = 360

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 135/339 (39%), Positives = 175/339 (51%), Gaps = 35/339 (10%)

Query: 31  HHLHHVKHTALFNY---------FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
             LH +K  A  NY         F   H+KTY  L E   R  IF  N++KI+      H
Sbjct: 36  EQLHILKAKAGINYQPYEQAWKEFKILHDKTYDALEEESRRFEIFRENVQKIEEHNKLYH 95

Query: 82  -GSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIP--NITLPRAFDWREYD 136
            G   Y  G+N+FSDL   EF  KY G K K S  D    + +   N+  P + DWR+  
Sbjct: 96  LGKKSYYLGVNQFSDLKHEEF-VKYNGLK-KTSLKDGGCSSYLAANNLVEPDSVDWRKKG 153

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSIS 194
            VT VK+Q  CGS W+FSTTG++EG +  K+ KLVSLSE +L+DC Q   ++GC GG + 
Sbjct: 154 YVTDVKNQGQCGSCWSFSTTGSLEGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMD 213

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVE 253
           NAF  I S   GGLE E+ YPY+     C+ +         G V V S  E+ + K + E
Sbjct: 214 NAFKYIKSV--GGLESEEDYPYKPKQGTCKFDDTKVAATDTGCVDVESGSESALKKAVSE 271

Query: 254 NGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVP 310
            GP++VAI+A   + Q Y  GV   P     +  +E L H VL VGYG D      +   
Sbjct: 272 VGPVSVAIDASHSSFQSYAGGVYDEP-----ECSSEQLDHGVLCVGYGTD-----DQGQD 321

Query: 311 YWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           YWI+KNSWG  WGE GY ++ R     CGI       LV
Sbjct: 322 YWIVKNSWGAEWGEDGYVKMSRNKKNQCGIATQASYPLV 360


>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
          Length = 365

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 120/301 (39%), Positives = 162/301 (53%), Gaps = 45/301 (14%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPS---------- 111
           R ++F  N+R I      +    +  LN+F+D++T EF+  Y G +++            
Sbjct: 62  RFNVFKENVRYIHEANKKDRPFRL-ALNKFADMTTDEFRRTYAGSRVRHHRSLSGGRRQG 120

Query: 112 -----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
                YAD           LP A DWR+  AVT +KDQ  CGS WAFST   +EG+   +
Sbjct: 121 GGSFMYADAE--------NLPAAVDWRQKGAVTPIKDQGQCGSCWAFSTIVAVEGINKIR 172

Query: 167 TKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL 225
           T +LVSLSEQEL+DC+  E+DGC GG +  AF  I     GG+  E +YPY+G+  +C  
Sbjct: 173 TGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQFIQQN--GGITTEASYPYQGEQNSCDQ 230

Query: 226 NKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCD 282
           +K+ +  V I+GY  V  ++    +  V N P++VAI+A     QFY  GV     F  D
Sbjct: 231 SKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAIDASGNDFQFYSEGV-----FTTD 285

Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCG 338
           GG + L H V  VGYG      T     YWI+KNSWGE WGEKGY R+ RG    +G CG
Sbjct: 286 GGTD-LDHGVAAVGYGT-----TRDGTKYWIVKNSWGEDWGEKGYIRMQRGVKQAEGLCG 339

Query: 339 I 339
           I
Sbjct: 340 I 340


>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
          Length = 509

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/356 (35%), Positives = 182/356 (51%), Gaps = 32/356 (8%)

Query: 4   FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
           F  +A +  L  +   S F +VG      +   +   LF  + E+H K Y    E   + 
Sbjct: 14  FLVWASLTSLISSSLPSEFSIVG-RPGESIAEERVVELFKKWTEKHGKVYKHGQEVEKKF 72

Query: 64  HIFSGNLRKIQLLQDTEHGSG--VYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI 121
             F  NLR +         SG  + GLN+F+D+S  EF+  Y+    KP+    ++    
Sbjct: 73  QNFRDNLRYVMEKNGERGASGGHLVGLNKFADMSNEEFREVYVSKVKKPTSKRMAIERRR 132

Query: 122 PNITL----------PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
                          P + DWR+Y  VTGVKDQ  CGS WAFS+TG IEG+ A     L+
Sbjct: 133 QGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDCGSCWAFSSTGAIEGINALANGDLI 192

Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
           SLSEQEL+DCD  +DGCEGG +  AF+ +MS   GG++ E  YPY G+D  C   K+ T+
Sbjct: 193 SLSEQELVDCDSTNDGCEGGYMDYAFEWVMSN--GGIDTETDYPYTGEDGTCNTTKEETK 250

Query: 232 -VKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL--QFYVTGVSHPIQFFCDGGNENL 288
            V I+GY  V+ +E+ +   +++  P++V I+  A+  Q Y  G+            +++
Sbjct: 251 AVSIDGYEDVAEEESALFCAVLKQ-PISVGIDGGAIDFQLYTGGIYDGDCSD---DPDDI 306

Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD----GSCGIN 340
            H+VL+VGYG +  +       YWIIKNSWG  WG KGY  + R      G C IN
Sbjct: 307 DHAVLVVGYGAESGE------EYWIIKNSWGTDWGMKGYAYIKRNTSKDYGVCAIN 356


>gi|33242872|gb|AAQ01140.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 128/331 (38%), Positives = 174/331 (52%), Gaps = 32/331 (9%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
           + H   +  +  QH K Y T  E YSR  IF  N  KI      EH         S    
Sbjct: 18  LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72

Query: 88  LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           +N+F D+   EF  + +G  LK          V     N TLP++ DWR    V+ VKDQ
Sbjct: 73  MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
             CGS WAFSTTG++EG +++KT KLV LSEQ+L+DC ++  + GC GG +  AF  I  
Sbjct: 133 GECGSCWAFSTTGSLEGQHSSKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI-- 190

Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
           K  GGL+ E++YPY   DDK C+ +  +    + GY  V S +E  + + +   GP++VA
Sbjct: 191 KANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250

Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           I+A   + QFY +GV    Q  C    E L H VL VGYG      +H+A  +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303

Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           G  WG++GY  + R  +  CGI       LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334


>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
          Length = 394

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 125/319 (39%), Positives = 170/319 (53%), Gaps = 29/319 (9%)

Query: 34  HHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSD 93
           HH +  + F  F   HNK YAT  E   R  IF  NL  I    + +  S V  +N+F D
Sbjct: 83  HHFQ--SQFYQFQRDHNKFYATEEERLKRYAIFKNNLTYIHN-HNMQGYSYVLKMNKFGD 139

Query: 94  LSTAEFQAKYLGFK-----LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCG 148
           L+  EF+ +YLG+K       P   D ++ ++  N  +P   DWR+   VT VKDQ  CG
Sbjct: 140 LTLEEFRQRYLGYKKPDLRTPPREVDTTLESVEDN-DIPTHVDWRQRGCVTSVKDQGDCG 198

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGG 206
           S WAFS TG +EGVY AKT KLV+LS+Q+L+DC +   + GC+GG +  AF+ ++    G
Sbjct: 199 SCWAFSATGAMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVEN--G 256

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAI--NA 263
           G+   + YPY   D  C+ ++  +   I GY SV  R E  M   L    P++VAI  N 
Sbjct: 257 GICSGENYPYMRKDGVCKSSQCTSVATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQ 316

Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
            A QFY  G+      F      NL H VL+VGY  +    T     YWI+KNSWG  WG
Sbjct: 317 AAFQFYYDGI------FDAPCGTNLDHGVLLVGYSAE----TAGQGDYWIMKNSWGAAWG 366

Query: 324 EKGY--FRLYRGD-GSCGI 339
           + GY    +++G  G CG+
Sbjct: 367 KGGYMLMAMHKGPAGQCGV 385


>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
 gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
          Length = 452

 Score =  192 bits (487), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/346 (36%), Positives = 171/346 (49%), Gaps = 38/346 (10%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
           + ++ + S + F ++  + L     +    L+  +L +H + Y  L E   R  +F  N 
Sbjct: 13  SAMAGSASRADFSIISSKDLREDDAIME--LYELWLAEHKRAYNGLDEKQKRFSVFKDNF 70

Query: 71  RKIQLLQDTEHGSG----VYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT- 125
             I      EH  G      GLN+F+DLS  EF+A YLG KL         P+     + 
Sbjct: 71  LYIH-----EHNQGNRSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSRPPSRRYQYSD 125

Query: 126 ---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
              LP + DWRE  AVT VKDQ  CGS WAFST   +EG+    T  L+SLSEQEL+DCD
Sbjct: 126 GEDLPESIDWREKGAVTSVKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCD 185

Query: 183 QE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSV 240
              + GC GG +  AF+ I++   GGL+ E+ YPY   D +C    K A  V I+ Y  V
Sbjct: 186 TSYNQGCNGGLMDYAFEFIINN--GGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDV 243

Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
             ++    K    N P++VAI A     QFY +GV      F       L H V +VGYG
Sbjct: 244 PENDEKSLKKAAANQPISVAIEASGREFQFYDSGV------FTSTCGTQLDHGVTLVGYG 297

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
                 +     YW +KNSWG+ WGE+G+ RL R       G CGI
Sbjct: 298 ------SESGTDYWTVKNSWGKSWGEEGFIRLQRNIEVASTGMCGI 337


>gi|332374900|gb|AEE62591.1| unknown [Dendroctonus ponderosae]
          Length = 359

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 124/316 (39%), Positives = 164/316 (51%), Gaps = 39/316 (12%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYGLNE 90
           T  F  F +++ K Y    E   R  IF  NL KI+     EH         S   GLN+
Sbjct: 21  TETFVTFQQKYGKVYQNDSELSVREEIFKENLAKIE-----EHNKQFQQNLVSYELGLNQ 75

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAM------IPNITLPRAFDWREYDAVTGVKDQ 144
           FSDL+ AEFQA      + P   D+    M          T P + +W E   VT VK+Q
Sbjct: 76  FSDLTEAEFQAL---LTMSP-LTDQLTKQMEKYNSEFDIKTAPVSVNWAEKGVVTPVKNQ 131

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKL 204
             CGS W F+TTG IE   A KT  LVSLSEQ+L+DC++ + GC+GG +S A   + S  
Sbjct: 132 GNCGSCWTFTTTGTIESRLALKTGSLVSLSEQQLLDCNRVNAGCDGGVLSYALQYVES-- 189

Query: 205 GGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA 263
             GL  E  YPY+  +  C    K       GY  + +R E+D+ K + E GP+AVA+NA
Sbjct: 190 -AGLTTEDEYPYKAWNGTCNSTHKPVAAYTKGYTLIYTRSESDLMKAVAE-GPVAVALNA 247

Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
             LQ+Y  G+ +P        +  ++H  L+VGY  + T      +PYWIIKNSWG  WG
Sbjct: 248 DLLQYYSKGIFNP-----SACSSTVNHGGLVVGYEENAT------LPYWIIKNSWGATWG 296

Query: 324 EKGYFRLYRGDGSCGI 339
           E GYFR+ +G   CGI
Sbjct: 297 ENGYFRMAKGYNLCGI 312


>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
          Length = 357

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 125/298 (41%), Positives = 167/298 (56%), Gaps = 28/298 (9%)

Query: 56  LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SY 112
           L E   R ++F  N + +  +   +    +  LN+F+D++  EF++ Y G K+K      
Sbjct: 51  LEEKNKRFNVFKENTKHVHKVNQMDKPYKLK-LNKFADMTNHEFRSSYGGSKVKHYRMLR 109

Query: 113 ADRSVPA--MIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK 169
            DR      M    T LP + DWR+  AVTG+KDQ  CGS WAFST   +EG+   KTK+
Sbjct: 110 GDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKE 169

Query: 170 LVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK- 227
           L+SLSEQ+LIDCD+ DD GC GG + +AF+ I  K  GG+  E  YPY+  D+ C + K 
Sbjct: 170 LLSLSEQQLIDCDRSDDHGCNGGLMESAFEFI--KKNGGITTENNYPYKAKDERCDMLKM 227

Query: 228 KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGN 285
            A  V I+G+ SV  ++       V + P++VAI+A    LQFY  GV     F  + G 
Sbjct: 228 NAPVVTIDGHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGV-----FDGECGT 282

Query: 286 ENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           E L H V IVGYG      T     YWI+KNSWG  WGEKGY R+ RG    +G CGI
Sbjct: 283 E-LDHGVAIVGYGT-----TLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGI 334


>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
 gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
          Length = 341

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 185/348 (53%), Gaps = 43/348 (12%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           +AL+++  +VS   V+ +E             ++ F  +H K Y    E   RL IF+ N
Sbjct: 10  LALVAVAQAVSYAEVIQEE-------------WHTFKLEHRKNYQDETEERFRLKIFNEN 56

Query: 70  LRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL----KPSYADRSVPAMI- 121
             KI     L  T   S    +N+++D+   EF +   GF      +   AD S   +  
Sbjct: 57  KHKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTF 116

Query: 122 ---PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
               ++TLP+  DWR   AVT VKDQ  CGS WAFS+TG +EG +  K+  LVSLSEQ L
Sbjct: 117 ISPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNL 176

Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
           +DC  +  ++GC GG + NAF  I  K  GG++ EK+YPY   D +C  NK        G
Sbjct: 177 VDCSTKYGNNGCNGGLMDNAFRYI--KDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRG 234

Query: 237 YVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGV-SHPIQFFCDGGNENLSHSV 292
           +V + + +E  MA+ +   GP+AVAI+A   + QFY  GV + P    CD   +NL H V
Sbjct: 235 FVDIPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPA---CDA--QNLDHGV 289

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
           L+VG+G D +        YW++KNSWG  WG+KG+ ++ R  +  CGI
Sbjct: 290 LVVGFGTDES-----GQDYWLVKNSWGTTWGDKGFIKMLRNKENQCGI 332


>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
 gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
          Length = 331

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 125/313 (39%), Positives = 163/313 (52%), Gaps = 30/313 (9%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H K Y    E   R  I+  NL+KI    + +H S    +N   D+++ E     LG KL
Sbjct: 36  HGKEYPNKNEETMRNFIWQNNLKKIVTHNEGKH-SFKLAMNHLGDMTSLEISQTLLGLKL 94

Query: 109 K------PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           K      P  A    PA   N+ +  + DWR    VT VK+Q  CGS WAFSTTG +EG 
Sbjct: 95  KKHAESQPKGATFLPPA---NVKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTTGALEGQ 151

Query: 163 YAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           +  KT KLVSLSEQ L+DC  +  ++GCEGG + NAF  I  K  GG++ EK+YPY   D
Sbjct: 152 HFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYI--KENGGIDTEKSYPYLAKD 209

Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHP 276
             C  NK A   K  G+V + + DE  + + L   GP+++AI+A      FY  GV   P
Sbjct: 210 GVCHYNKSAIGAKDTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQGVYDDP 269

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD-G 335
                D  +  L H VL VGYG D  K       YW++KNSWG  WGE+GY ++ R D  
Sbjct: 270 -----DCSSTRLDHGVLAVGYGTDDGK------DYWLVKNSWGPSWGEEGYIKIARNDHD 318

Query: 336 SCGINDYVRSALV 348
            CG+       LV
Sbjct: 319 KCGVASKASYPLV 331


>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
          Length = 372

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 122/316 (38%), Positives = 167/316 (52%), Gaps = 32/316 (10%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
            L+  ++ QH K Y  + E   R  IF  NLR I       + +   GLN+F+DL+  E+
Sbjct: 44  GLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEY 103

Query: 100 QAKYLGFKLKPSYADRSVPAMIPNI--------TLPRAFDWREYDAVTGVKDQTMCGSSW 151
           +AK+LG +  P    R + + IP+          LP + +WR++ AV+ VKDQ  CGS W
Sbjct: 104 RAKFLGTRTDPRR--RLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSCGSCW 161

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEE 210
           AFS    +EG+    + +L+SLSEQEL+DCD+  D GC GG +  AF  I+    GG++ 
Sbjct: 162 AFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDN--GGIDT 219

Query: 211 EKTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQ 267
           EK YPY G +  C   KK A  V I+GY  V  +E  + K  V + P+++AI A   A Q
Sbjct: 220 EKDYPYLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKK-AVAHQPVSIAIEAGGRAFQ 278

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
            Y +GV      F       L H V+ VGYG D          YWI++NSWG  WGE GY
Sbjct: 279 LYESGV------FNGECGLALDHGVVAVGYGSD-----DNGQDYWIVRNSWGGNWGENGY 327

Query: 328 FRLYR----GDGSCGI 339
            R+ R      G CGI
Sbjct: 328 IRMERNINANTGKCGI 343


>gi|34811401|pdb|1M6D|A Chain A, Crystal Structure Of Human Cathepsin F
 gi|34811402|pdb|1M6D|B Chain B, Crystal Structure Of Human Cathepsin F
          Length = 214

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 106/222 (47%), Positives = 142/222 (63%), Gaps = 10/222 (4%)

Query: 127 PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD 186
           P  +DWR   AVT VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D 
Sbjct: 2   PPEWDWRSKGAVTKVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDK 61

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETD 246
            C GG  SNA+  I  K  GGLE E  Y Y+G  ++C+ + +  +V I   V +S++E  
Sbjct: 62  ACMGGLPSNAYSAI--KNLGGLETEDDYSYQGHMQSCQFSAEKAKVYIQDSVELSQNEQK 119

Query: 247 MAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
           +A +L + GP++VAINA+ +QFY  G+S P++  C      + H+VL+VGYG        
Sbjct: 120 LAAWLAKRGPISVAINAFGMQFYRHGISRPLRPLCS--PWLIDHAVLLVGYG------QR 171

Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
             VP+W IKNSWG  WGEKGY+ L+RG G+CG+N    SA+V
Sbjct: 172 SDVPFWAIKNSWGTDWGEKGYYYLHRGSGACGVNTMASSAVV 213


>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
 gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 127/351 (36%), Positives = 182/351 (51%), Gaps = 40/351 (11%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
           A++++TV+ SS  ++  +             +  F   H KTY + +E   R  IF+ N 
Sbjct: 9   AIVAVTVAASSQEILRTQ-------------WEAFKTTHKKTYQSHMEELLRFKIFTEN- 54

Query: 71  RKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV--PAMIPNI 124
             I    + ++  G+     G+N+F DL   EF   + G          S   PA + + 
Sbjct: 55  SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSSFLPPANVNDS 114

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
           +LP+  DWR+  AVT VKDQ  CGS WAFS TG++EG +  K  +LVSLSEQ L+DC Q 
Sbjct: 115 SLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQS 174

Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
             ++GCEGG + +AF  I  K   G++ EK+YPY   D  CR  K+       GYV + +
Sbjct: 175 FGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKA 232

Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYG 298
             E D+ K +   GP++VAI+A   + Q Y  GV   P     +  +E+L H VL+VGYG
Sbjct: 233 GSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP-----ECSSEDLDHGVLVVGYG 287

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
           V   K       YW++KNSW E WG++GY  + R  +  CGI       LV
Sbjct: 288 VKGGK------KYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332


>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 123/343 (35%), Positives = 167/343 (48%), Gaps = 31/343 (9%)

Query: 7   FAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIF 66
           FA V  L L     S   + D  +   H          ++ ++ + Y  L E   R  IF
Sbjct: 12  FALVLCLGLWAFQVSSRTLQDASMQERHE--------QWMARYGRVYKDLQEKEKRFSIF 63

Query: 67  SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYA-DRSVPAMIPNIT 125
             N+  I+   +        G+N+F+DL+  EF A    FK   S +  R+      N+T
Sbjct: 64  KENVNYIEASNNAGDKPYKLGVNQFADLTNEEFIATRNKFKGHMSSSITRTTTFKYENVT 123

Query: 126 LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE- 184
            P   DWR+  AVT VK+Q  CG  WAFS     EG++   T  LVSLSEQEL+DCD   
Sbjct: 124 APSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSG 183

Query: 185 -DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQV-KINGYVSVSR 242
            D GC+GG + +AF  I+    GGL  E  YPY+G D  C  N++AT V  I GY  V  
Sbjct: 184 ADQGCQGGLMDDAFKFIIQN--GGLNTEAQYPYQGVDGTCNTNEEATHVATITGYEDVPS 241

Query: 243 DETDMAKYLVENGPMAVAINAYALQF--YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
           +     +  V N P+++AI+A    F  Y +GV      F       L H V +VGYGV 
Sbjct: 242 NNEQALQQAVANQPISIAIDASGSDFQNYQSGV------FTGSCGTQLDHGVAVVGYGV- 294

Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
               +     YW++KNSWG  WGE+GY R+ R     +G CG+
Sbjct: 295 ----SDDGTKYWLVKNSWGADWGEEGYIRMQRDVDAPEGLCGL 333


>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
          Length = 343

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 127/313 (40%), Positives = 172/313 (54%), Gaps = 31/313 (9%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ---DTEHGSGVYGL--NEFSDLSTAEF 99
           F  +H K Y    E   R+ I+  N  K+Q+ Q   D E     Y L  N++ D+   EF
Sbjct: 31  FKMEHKKCYKHEAEERLRMKIYMKN--KLQIAQHNCDYELKKVTYRLKINKYGDMLNHEF 88

Query: 100 QAKYLGFKLKPSYADRS--VP---AMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           +    G+    ++  R+  +P   A I   N+ LP+  DWR+  AVT VKDQ  CGS WA
Sbjct: 89  KNMLNGYNRTINHTLRNERLPVGAAFIEPCNVELPKMVDWRKCGAVTEVKDQGHCGSCWA 148

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEE 210
           FS TG++EG +  +T  LVSLSEQ LIDC     ++GC GG +  AF  I  K   GL+ 
Sbjct: 149 FSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSYGNNGCNGGLMDQAFSYI--KDNKGLDT 206

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINA--YALQ 267
           EKTYPY G+D  CR +K+++     G+V +   DE  +   +   GP++VAI+A   + Q
Sbjct: 207 EKTYPYEGEDDKCRYDKRSSGASDVGFVDIPVGDEQKLKAAVATVGPVSVAIDASHQSFQ 266

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
           FY  G    I F  +  + NL H VL+VGYG D      +   YWI+KNSWGE WGEKGY
Sbjct: 267 FYSDG----IYFEPECSSTNLDHGVLVVGYGTDE-----EGRDYWIVKNSWGESWGEKGY 317

Query: 328 FRLYRG-DGSCGI 339
            ++ R  D  CGI
Sbjct: 318 IKMARNIDNHCGI 330


>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
           pulchellus]
          Length = 331

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 126/323 (39%), Positives = 166/323 (51%), Gaps = 26/323 (8%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLST 96
           A ++ F   H K Y +  E Y RL I+  N  KI    +    S V     +NEF D+  
Sbjct: 21  AEWSAFKALHGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNEFGDMLH 80

Query: 97  AEFQAKYLGFKLKPSYADRS-----VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSW 151
            EF +   GFK       R       P  + +  LP+  DWR+  AVT VK+Q  CGS W
Sbjct: 81  HEFVSTRNGFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVKNQGQCGSCW 140

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLE 209
           +FSTTG++EG +  K  KLVSLSEQ LIDC +   ++GCEGG +  AF  I  K   G++
Sbjct: 141 SFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYI--KANKGID 198

Query: 210 EEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--AL 266
            E++YPY   D  C  NK A      G+V +   DE  + K +   GP++VAI+A   + 
Sbjct: 199 TEQSYPYNATDGVCHFNKSAVGATDTGFVDIPEGDENKLKKAVATVGPVSVAIDASHESF 258

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           QFY  GV    +  CD  +E L H VL+VGYG      T     YW++KNSWG  WG+ G
Sbjct: 259 QFYSEGVYDEPE--CD--SEQLDHGVLVVGYG------TKDGQDYWLVKNSWGTTWGDGG 308

Query: 327 YFRLYRG-DGSCGINDYVRSALV 348
           Y  + R  D  CGI       LV
Sbjct: 309 YIYMSRNKDNQCGIASAASYPLV 331


>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
          Length = 431

 Score =  192 bits (487), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 130/352 (36%), Positives = 184/352 (52%), Gaps = 39/352 (11%)

Query: 8   AGVALLSLTVSVSSFM---VVGDEKLHHL----HHVKHTALFNYFLEQHNKTYATLVEYY 60
           A V L    + VSS M   ++  +K HH       V+ + L+  ++ +H K   +L E  
Sbjct: 1   ATVILFLAMIVVSSAMDMSIISYDKNHHTVSSRSDVEVSRLYEEWVVKHGKAQNSLTEKD 60

Query: 61  SRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
            R  IF  NLR I    D  +G  +    GL +F+DL+  E+++ YLG +LK      S+
Sbjct: 61  RRFEIFKDNLRFI----DEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLKRKATKTSL 116

Query: 118 --PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSE 175
              A + +  +P + DWR+  AV  VKDQ  CGS WAFST G +EG+    T  L+SLSE
Sbjct: 117 RYEARVGD-AIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSE 175

Query: 176 QELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVK 233
           QEL+DCD   ++GC GG +  AF+ I+    GG++ E+ YPY+G D  C +  K A  V 
Sbjct: 176 QELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTEEDYPYKGVDGRCDQTRKNAKVVT 233

Query: 234 INGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHS 291
           I+ Y  V  +  +  K  + + P++VAI     A Q Y +G+   I         +L H 
Sbjct: 234 IDSYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGI------CGTDLDHG 287

Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           V+ VGYG +  K       YWI+KNSWG  WGE GY R+ R      G CGI
Sbjct: 288 VVAVGYGTENGK------DYWIVKNSWGTSWGESGYIRMERNIASSAGKCGI 333


>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
 gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
          Length = 479

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 135/323 (41%), Positives = 173/323 (53%), Gaps = 39/323 (12%)

Query: 40  ALFNYFLEQHNKTYATLV--------EYYSRLHIFSGNLRKIQLLQDTEHGSGVY-GLNE 90
           ALF+ ++ QH K+YA           E  +R  IF  NLR I    + E   G + GLN 
Sbjct: 55  ALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIH--GENEKNQGYFLGLNA 112

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAM----IPNITLPRAFDWREYDAVTGVKDQTM 146
           F+DL+  EF+A+  G +   S    S        +    LP + DWRE  AV GVKDQ  
Sbjct: 113 FADLTNEEFRAQRHGGRFDRSRERTSHEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQGS 172

Query: 147 CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDTIMSKLG 205
           CGS WAFS    IEGV    T +LVSLSEQEL+DCD+ ED+GC GG +  AF  ++    
Sbjct: 173 CGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKN-- 230

Query: 206 GGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINA 263
           GGL+ E  YPY+G    C  +K  A  V I+GY  V   DET + K  V + P++VAI+A
Sbjct: 231 GGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLK-AVAHQPVSVAIDA 289

Query: 264 --YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
              ++QFY +G+      F      +L H V  VGYG +  K       YWIIKNSWG  
Sbjct: 290 GGSSMQFYRSGI------FTGRCGTDLDHGVTNVGYGKEDGK------AYWIIKNSWGSN 337

Query: 322 WGEKGYFRLYRGD----GSCGIN 340
           WGEKGY ++ R      G CGIN
Sbjct: 338 WGEKGYVKMARNTGLAAGLCGIN 360


>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
 gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
          Length = 346

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 123/356 (34%), Positives = 178/356 (50%), Gaps = 41/356 (11%)

Query: 1   MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
           M  F F A  +    ++S+S    + +E +    H++       ++ +H + YA + E  
Sbjct: 6   MQIFLFVAIFSSFYFSISLSR--PLDNELIMQKRHIE-------WMTKHGRVYADVKEKS 56

Query: 61  SRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA 119
           +R  +F  N+ +I+ L +   G      +N+F+DL+  EF++ Y GFK   S + +S   
Sbjct: 57  NRYVVFKSNVERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGFKGVSSLSSQSQTK 116

Query: 120 M-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
                   + +  LP + DWR   AVT +K+Q  CG  WAFS    IEG    K  KL+S
Sbjct: 117 TTSFRYQNVSSGALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLIS 176

Query: 173 LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC---RLNKKA 229
           LSEQ+L+DCD  D GCEGG +  AF+ IM+   GGL  E  YPY+G+D  C   + N KA
Sbjct: 177 LSEQQLVDCDTNDFGCEGGLMDTAFEHIMAT--GGLTTESNYPYKGEDATCNSKKTNPKA 234

Query: 230 TQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNEN 287
           T   I GY  V  ++       V + P++V I    +  QFY +GV      F       
Sbjct: 235 TS--ITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGV------FTGECTTY 286

Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           L H+V  +GYG      +     YWIIKNSWG  WGE GY R+ +      G CG+
Sbjct: 287 LDHAVTAIGYGQ-----STNGSKYWIIKNSWGTKWGESGYMRIQKDIKDKQGLCGL 337


>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
          Length = 458

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 122/315 (38%), Positives = 166/315 (52%), Gaps = 33/315 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           L+  +  +H K Y  + E   R   F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 39  LYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 97

Query: 97  AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            E++  YLG + KP      +DR + A   N  LP + DWR   AV  +KDQ  CGS WA
Sbjct: 98  EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           FS    +EG+    T  L+SLSEQEL+DCD   ++GC GG +  AFD I++   GG++ E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 213

Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
             YPY+G D+ C +N+K A  V I+ Y  V+ +     +  V N P++VAI A   A Q 
Sbjct: 214 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQL 273

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G+      F       L H V  VGYG +  K       YWI++NSWG+ WGE GY 
Sbjct: 274 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 321

Query: 329 RLYRG----DGSCGI 339
           R+ R      G CGI
Sbjct: 322 RMERNIKASSGKCGI 336


>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 135/357 (37%), Positives = 184/357 (51%), Gaps = 35/357 (9%)

Query: 1   MSCFYFFAGVALLSLTVSVS-----SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYAT 55
           M+   F     +LS T+ ++      F +VG    H     K   LF  ++ +H+KTY +
Sbjct: 1   MALSTFSKATLILSATLFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRS 60

Query: 56  LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYA 113
           + E   R  IF  NL+ I    +T      Y  GLNEF+DLS  EF++KYLG +++    
Sbjct: 61  IEEKLHRFEIFLDNLKHID---ETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRK 117

Query: 114 DRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
             S      ++  LP + DWR   AVT VK+Q  CGS WAFST   +EG+    T  L S
Sbjct: 118 RSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 177

Query: 173 LSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKAT 230
           LSEQELIDCD+  ++GC GG +  AF  IMS    GL +E+ YPY  ++  C R  ++  
Sbjct: 178 LSEQELIDCDRSFNNGCYGGLMDYAFQYIMSN--SGLRKEEDYPYLMEEGRCIREKEQFE 235

Query: 231 QVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNEN 287
            V I+GY  V + DE  + K L    P++VAI A +   QFY  G+      F       
Sbjct: 236 VVTISGYEDVPANDEQSLLKALSHQ-PVSVAIEASSRNFQFYKGGI------FTGRCGTQ 288

Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           + H V  VGYG      + +   Y I+KNSWG  WGE GY R+ R     +G CGIN
Sbjct: 289 MDHGVTAVGYG------SSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGIN 339


>gi|341888721|gb|EGT44656.1| hypothetical protein CAEBREN_22029 [Caenorhabditis brenneri]
          Length = 396

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 123/342 (35%), Positives = 182/342 (53%), Gaps = 28/342 (8%)

Query: 15  LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
           +T+ ++S   +  EKL      +    FN   ++ +KT   L EY  R  IF  NLR I+
Sbjct: 64  MTILMASIFRIRAEKLKFFGLQQQFKDFNAKFQREHKT---LEEYKMRFEIFQKNLRDIE 120

Query: 75  LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK----------LKPSYADRSVPAMIPNI 124
            L + ++ S  YG+N+FSD + +E +   +  K          LK   + R+   +I N+
Sbjct: 121 EL-NLKNPSVQYGINKFSDKTESELKNLLMDKKFLDSSLSNSTLKTLSSYRNPRNIIKNV 179

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
             P   DWR    V  VKDQ  CGS WAF+T   +E  YA +   L SLSEQEL+DCD  
Sbjct: 180 QRPDYIDWRNDGKVMSVKDQGQCGSCWAFATVAAVESQYAIRKGTLWSLSEQELVDCDGA 239

Query: 185 DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQVKINGYVSVSRD 243
             GC GG +++A   I   LG GLE E  YPY       C +N   T+V I+    ++  
Sbjct: 240 SYGCGGGFLTSALGFI---LGNGLETEDDYPYSATRHDQCWINGDKTRVWIDEGYQLTMS 296

Query: 244 ETDMAKYLVENGPMAVAIN-AYALQFYVTGVSHPIQFFCDGGNENLS-HSVLIVGYGVDR 301
           E D+A+++   GP++ A++   +  +Y  G+  P +  C   +E+L  H++ I+GYG + 
Sbjct: 297 EDDVAEWVANVGPVSFAMSVPKSFPYYHDGIYSPSEHECK--DESLGYHAMAIIGYGQEG 354

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYV 343
            +       YWI+KNSWG  WG++GY RL RG  +CG+NDYV
Sbjct: 355 GQ------NYWIVKNSWGGSWGDQGYMRLARGVNACGMNDYV 390


>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
 gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
          Length = 344

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 119/320 (37%), Positives = 163/320 (50%), Gaps = 26/320 (8%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           LH          ++ Q+ + Y   VE   R  IF  N+  I+   +  +     G+N F+
Sbjct: 29  LHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENVEFIESFNNNGNKPYKLGINAFT 88

Query: 93  DLSTAEFQAKYLGFKLKPSYAD---RSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCG 148
           DL+  EF+A + G+ +  S      R+      N+T +P + DWR   AVT +KDQ  CG
Sbjct: 89  DLTNEEFRASHNGYTMSMSSHQSSYRTKSFRYENVTAVPPSLDWRTKGAVTHIKDQGQCG 148

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGG 206
             WAFS    +EG+    T  L+SLSEQEL+DCD    D GCEGG + +AF+ I+     
Sbjct: 149 CCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGMDQGCEGGLMDDAFEFIIEN--N 206

Query: 207 GLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA-- 263
           GL  E  YPY G D +C   K A    KI GY +V   + +  +  V N P++VAI+A  
Sbjct: 207 GLTTEANYPYEGVDGSCNTRKAANHAAKITGYENVPAYDEEALRKAVANQPVSVAIDAGE 266

Query: 264 YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG 323
            A Q Y +G+     F  D G E L H V +VGYG      +     YW++KNSWG  WG
Sbjct: 267 SAFQHYSSGI-----FTGDCGTE-LDHGVTVVGYGT-----SDDGTKYWLVKNSWGTSWG 315

Query: 324 EKGYFRLYRG----DGSCGI 339
           E GY R+ R     +G CGI
Sbjct: 316 EDGYIRMERDIDAKEGLCGI 335


>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
          Length = 369

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 125/340 (36%), Positives = 181/340 (53%), Gaps = 32/340 (9%)

Query: 15  LTVSVSSFMV-VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYS-RLHIFSGNLRK 72
           L+ S S F     DE L     ++  +L++ +  QH  + +   E ++ R  IF  N++ 
Sbjct: 20  LSASASDFTPGFTDEDLESEKSLR--SLYDNWALQHRSSRSLDSEEHAERFEIFKENVKY 77

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA---MIPNI-TLPR 128
           I  +   +    + GLN+F+DLS  EF+A Y+G K+     DR V +   M  N   LP 
Sbjct: 78  IDSVNKKDSPYKL-GLNKFADLSNEEFKAIYMGTKMDLR-GDREVQSGSFMYQNSEPLPA 135

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGC 188
           + DWR+  AV  VK+Q  CGS WAFST  ++EG+    T  LVSLSEQ+L+DC  E+ GC
Sbjct: 136 SIDWRQKGAVAAVKNQGHCGSCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTENSGC 195

Query: 189 EGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC---RLNKKATQVKINGYVSVSRDET 245
            GG +  AF  I++   GG+  E  YPY  +   C   ++N + T+V I+G+  V  +  
Sbjct: 196 NGGLMDTAFQYIINN--GGIVTEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNE 253

Query: 246 DMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
              K  V + P++VAI A     QFY TGV      F       L H V+ VGYG     
Sbjct: 254 QALKEAVAHQPVSVAIEASGQDFQFYSTGV------FTGKCGTALDHGVVAVGYGT---- 303

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
            + + + YWI++NSWG  WGE+GY R+ +G    +G CGI
Sbjct: 304 -SPEGINYWIVRNSWGPKWGEEGYIRMQQGIEAAEGKCGI 342


>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
 gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
          Length = 362

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 129/345 (37%), Positives = 178/345 (51%), Gaps = 31/345 (8%)

Query: 12  LLSLTVSVSSFMVVGDE-KLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRLHIFSG 68
           LL + +S++  +VV +    H        +L++ +     H+     L E   R ++F  
Sbjct: 6   LLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVSRNLNEKQKRFNVFKS 65

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-----MIPN 123
           N+  +      +    +  LN+F+D++  EF+  Y G K+      R  P      M  N
Sbjct: 66  NVMHVHNTNKMDKPYKL-KLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFMYEN 124

Query: 124 IT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
            T  P + DWR+  AVT VKDQ  CGS WAFST   +EG+   KT +LV LSEQELIDCD
Sbjct: 125 FTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCD 184

Query: 183 -QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSV 240
            QE+ GC GG +  AF+ I  K  GG+  E  YPY  +D +C   K+    V I+G+ +V
Sbjct: 185 NQENQGCNGGLMEYAFEYIKQK--GGITTESYYPYTANDGSCDATKENVPAVSIDGHETV 242

Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
             ++ D     V N P++VAI+A     QFY  GV     F  D G E L+H V IVGYG
Sbjct: 243 PANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV-----FTGDCGKE-LNHGVAIVGYG 296

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
                 T     YWI++NSWG  WGE+GY R+ R     +G CGI
Sbjct: 297 T-----TVDGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGI 336


>gi|340375899|ref|XP_003386471.1| PREDICTED: probable cysteine proteinase A494-like [Amphimedon
           queenslandica]
          Length = 373

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 124/352 (35%), Positives = 178/352 (50%), Gaps = 57/352 (16%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQ 100
           F  + + H+K+Y T+ E   R  ++  N   +Q L +      V + LN F+DLS  EF+
Sbjct: 33  FTDWCKLHSKSYRTITEAKERESVYKSNADLVQQLNNEYRERNVTFSLNHFADLSIEEFK 92

Query: 101 AKYLGFKLKPS------YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
              L    KP       Y   S+P   PN      FDWR+   VT VK+Q   G+ WAFS
Sbjct: 93  KLVLMSPQKPQPLPKQRYHSFSLPQDPPN-----TFDWRDKHVVTSVKNQGSAGTCWAFS 147

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE--------DDGCEGGSISNAFDTIMSKLGG 206
           T GN+EG +A     L SLS ++L+DCD          D G  GG    A++ I ++  G
Sbjct: 148 TVGNVEGQWALGGHNLTSLSTEQLVDCDDTYDHNNLHMDCGVFGGWPYLAYEYIKNE--G 205

Query: 207 GLEEEKTYPYRGDDKAC----------------------------RLNK-KATQ-VKING 236
           G+E E+ YPY      C                            +L+K K  Q + I  
Sbjct: 206 GIEREEDYPYCSGQGTCFPCVPSGWNKTRCGPPPLYCNDTFSCTHKLDKSKFVQGLSIKS 265

Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
           ++++ +DE +M   L++ GP++V INA  LQFY +GV  PI   C+   + L H+VL+VG
Sbjct: 266 WIAIQKDEVEMQAALIKQGPLSVLINALLLQFYRSGVWDPI-LKCNP--QELDHAVLLVG 322

Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           YG ++     K  PYW+IKNSWG  WG  GYF++ RG G CG++  V SA++
Sbjct: 323 YGTEKGLLEDK--PYWLIKNSWGIKWGMDGYFKMIRGKGKCGVDQQVTSAVL 372


>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
          Length = 357

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 122/306 (39%), Positives = 156/306 (50%), Gaps = 30/306 (9%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK- 107
           H+     L +   R ++F  N++ I      +  +    LN+F D++  EF+AKY G K 
Sbjct: 44  HHAVSRDLDQKQKRFNVFKENVKFIHEFNKNKDVTFKLALNKFGDMTNQEFRAKYAGSKV 103

Query: 108 -----LKPSYADRSVPA--MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
                +K S       A  M  N   P + DWRE  AV  VK+Q  CGS WAFS    +E
Sbjct: 104 HHHRTMKGSRHGSGSGAKFMYENAVAPPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVE 163

Query: 161 GVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           G+    TK+LV LSEQELIDCD  ++ GC GG +  AF+ I  K  GG+  E  YPY+ +
Sbjct: 164 GINQIVTKELVPLSEQELIDCDTDQNQGCSGGLMDYAFEFI--KNNGGITTEDVYPYQAE 221

Query: 220 DKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
           D  C+ N  A  V I+GY  V  ++ D     V N P+AVAI A  Y  QFY  GV    
Sbjct: 222 DATCKKNSPA--VVIDGYEDVPTNDEDALMKAVANQPVAVAIEASGYVFQFYSEGV---- 275

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
            F    G E L H V +VGYG      T     YW ++NSWG  WGE GY R+ RG    
Sbjct: 276 -FTGRCGTE-LDHGVAVVGYGT-----TQDGTKYWTVRNSWGADWGESGYVRMQRGIKAT 328

Query: 334 DGSCGI 339
            G CGI
Sbjct: 329 HGLCGI 334


>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 140/350 (40%), Positives = 182/350 (52%), Gaps = 37/350 (10%)

Query: 6   FFAGVALLSLTVSVSSFMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
            F  V++L+ +     F ++G   E L  +H V H  LF  +L +H+K Y +L E   R 
Sbjct: 13  LFLFVSILACSALAHEFSILGYAPEDLTSIHKVIH--LFESWLVKHSKFYESLDEKLHRF 70

Query: 64  HIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAM 120
            IF  NL+ I    +T      Y  GLNEF+DL+  EF+ K+LGFK +     D S    
Sbjct: 71  EIFMDNLKHID---ETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERKDESSKEF 127

Query: 121 --IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
                + LP++ DWR+  AV  VK+Q  CGS WAFST   +EG+    T  L  LSEQEL
Sbjct: 128 GYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQEL 187

Query: 179 IDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKING 236
           IDCD   ++GC GG +  AF  +M     GL +E+ YPY   +  C   K  ++ V I+G
Sbjct: 188 IDCDTTFNNGCNGGLMDYAFAYVMR---SGLHKEEEYPYIMSEGTCDEKKDVSEKVTISG 244

Query: 237 YVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           Y  V R DE    K L  N P++VAI A     QFY  GV     F    G E L H V 
Sbjct: 245 YHDVPRNDEASFLKALA-NQPISVAIEASGRDFQFYSGGV-----FDGHCGTE-LDHGVA 297

Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGI 339
            VGYG      T K + Y I++NSWG  WGEKGY R+ RG G     CG+
Sbjct: 298 AVGYG------TTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGL 341


>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
          Length = 301

 Score =  191 bits (486), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 125/311 (40%), Positives = 170/311 (54%), Gaps = 22/311 (7%)

Query: 50  NKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLG 105
           +K Y    E + R+ ++  NL+KI++  + EH  G +    G+N F D++  EF+    G
Sbjct: 1   SKKYHEKEEGWRRM-VWEKNLKKIEM-HNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNG 58

Query: 106 FKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
           +K KP         M PN +  PRA DWR+   VT VKDQ  CGS WAFSTTG +EG + 
Sbjct: 59  YKRKPQRKFTGSLFMEPNFLEAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHF 118

Query: 165 AKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDK 221
            KT KLVSLSEQ L+DC + +  +GC GG +  AF  I  K   GL+ E +YPY G DD+
Sbjct: 119 RKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI--KDNQGLDSEDSYPYLGTDDQ 176

Query: 222 ACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQ 278
            C  + K       G+V + S  E  + K +   GP++VAI+A   + QFY +G    I 
Sbjct: 177 PCHYDPKYNSANDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSG----IY 232

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSC 337
           +  D  +E L H VL+VGYG +      K   YWI+KNSW E WG+KGY  + +     C
Sbjct: 233 YEKDCSSEELDHGVLVVGYGFEGEDVDGKK--YWIVKNSWSEKWGDKGYIYMAKDRKNHC 290

Query: 338 GINDYVRSALV 348
           GI       LV
Sbjct: 291 GIATAASYPLV 301


>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
          Length = 346

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 124/356 (34%), Positives = 179/356 (50%), Gaps = 41/356 (11%)

Query: 1   MSCFYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYY 60
           M  F F A  +    ++++S    + +E +    H++       ++ +H + YA + E  
Sbjct: 6   MQIFLFVAIFSSFCFSITLSR--PLDNELIMQKRHIE-------WMTKHGRVYADVKEEN 56

Query: 61  SRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA 119
           +R  +F  N+ +I+ L     G      +N+F+DL+  EF + Y GFK   + + +S   
Sbjct: 57  NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGFKGVSALSSQSQTK 116

Query: 120 MIP----NIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
           M P    N++   LP + DWR+  AVT +K+Q  CG  WAFS    IEG    K  KL+S
Sbjct: 117 MSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLIS 176

Query: 173 LSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC---RLNKKA 229
           LSEQ+L+DCD  D GCEGG +  AF+ I  K  GGL  E  YPY+G+D  C   + N KA
Sbjct: 177 LSEQQLVDCDTNDFGCEGGLMDTAFEHI--KATGGLTTESDYPYKGEDATCNSKKTNPKA 234

Query: 230 TQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNEN 287
           T   I GY  V  ++       V + P++V I    +  QFY +GV      F       
Sbjct: 235 TS--ITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGV------FTGECTTY 286

Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           L H+V  +GYG      +     YWIIKNSWG  WGE GY R+ +      G CG+
Sbjct: 287 LDHAVTAIGYGE-----STNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGL 337


>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
          Length = 323

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 114/299 (38%), Positives = 162/299 (54%), Gaps = 19/299 (6%)

Query: 48  QHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG-VYGLNEFSDLSTAEFQAKYLGF 106
           +HNK Y+  +E  +R  I+ GN + I++        G   G+N+F DL + EF   + G+
Sbjct: 28  EHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLESHEFAEMFNGY 87

Query: 107 KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
            ++       V    PN       DWR   AVTGVK+Q  CGS WAFSTTG++EG +  K
Sbjct: 88  MMQARSNSTKVFVADPNYKADPTVDWRTKGAVTGVKNQGQCGSCWAFSTTGSLEGQHFLK 147

Query: 167 TKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
           T KLVSLSEQ L+DC   + ++GC GG +  AF+ I  K  GG++ E +YPY+  D+ CR
Sbjct: 148 TGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYI--KKNGGIDTEASYPYQAHDERCR 205

Query: 225 LNKKATQVKINGYVSVSRDETDMAKYLVEN-GPMAVAINA--YALQFYVTGVSHPIQFFC 281
                      GYV + R++ +     VE  GP++VAI+A   + Q Y +GV +  +  C
Sbjct: 206 FKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSFQLYRSGVYYERE--C 263

Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
                 L H VL +GYG      T     YW++KNSWG  WG +GY  + R  + +CGI
Sbjct: 264 S--QTALDHGVLAIGYG------TEGGSDYWLVKNSWGTDWGMEGYIMMSRNRNNNCGI 314


>gi|33242878|gb|AAQ01143.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  191 bits (486), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 128/331 (38%), Positives = 173/331 (52%), Gaps = 32/331 (9%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
           + H   +  +  QH K Y T  E YSR  IF  N  KI      EH         S    
Sbjct: 18  LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72

Query: 88  LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           +N+F D+   EF  + +G  LK          V     N TLP++ DWR    V+ VKDQ
Sbjct: 73  MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSEVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
             CGS WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++  + GC GG +  AF  I  
Sbjct: 133 GECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI-- 190

Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
           K  GGL+ E++YPY   DDK C+ +  +    + GY  V S +E  + + +   GP++VA
Sbjct: 191 KANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250

Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           I+A   + QFY +GV    Q  C    E L H VL VGYG      +H+A  +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303

Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           G  WG++GY  + R  +  CGI       LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334


>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 122/317 (38%), Positives = 168/317 (52%), Gaps = 27/317 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
           F   H KTY + +E   R  IF+ N   I    + ++  G+     G+N+F DL   EF 
Sbjct: 30  FKTTHKKTYQSHMEELLRFKIFTEN-SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFA 88

Query: 101 AKYLGFKLKPSYADRSV--PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
             + G          S   PA + + +LP+  DWR+  AVT VKDQ  CGS WAFS TG+
Sbjct: 89  RIFNGHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGS 148

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           +EG +  K  +LVSLSEQ L+DC Q   ++GCEGG + +AF  I  K   G++ EK+YPY
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPY 206

Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
              D  CR  K+       GYV + +  E D+ K +   GP++VAI+A   + Q Y  GV
Sbjct: 207 EAVDGECRFKKEDVGATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGV 266

Query: 274 -SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
              P     +  +E+L H VL+VGYGV   K       YW++KNSW E WG++GY  + R
Sbjct: 267 YDEP-----ECSSEDLDHGVLVVGYGVKGGK------KYWLVKNSWAESWGDQGYILMSR 315

Query: 333 -GDGSCGINDYVRSALV 348
             +  CGI       LV
Sbjct: 316 DNNNQCGIASQASYPLV 332


>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
 gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
          Length = 326

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 113/307 (36%), Positives = 166/307 (54%), Gaps = 21/307 (6%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H K+Y+ + E  +R+ I+  NL KI+     +H S    +N   DL+  EF+  YLG + 
Sbjct: 34  HGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDH-SYKMAMNHLGDLTEDEFRYFYLGVRA 92

Query: 109 KPSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
             +   R     +P  N+ +P + DW +   VTGVK+Q  CGS WAFSTTG++EG +  K
Sbjct: 93  HHNSTKRGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRK 152

Query: 167 TKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
           T  LVSLSEQ LIDC     ++GC+GG + NAF  I S   GG++ E +YPY G   +C 
Sbjct: 153 TGSLVSLSEQNLIDCSGSYGNNGCQGGLMDNAFRYIESN--GGIDTESSYPYLGQQGSCH 210

Query: 225 LNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFYVTGV-SHPIQFFCD 282
            +      ++ GY  + +  E  +   +   GP++VA++A   QFY +GV  +P   +C 
Sbjct: 211 FSSSHVGARVTGYQDIPQGSEQALQSAVATVGPVSVAVDASQWQFYSSGVYDNP---YCS 267

Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGIND 341
             +  L H VL++GYG       +    YW++KNSWG  WG +GY  + R  +  CGI  
Sbjct: 268 --STQLDHGVLVIGYG------NYNGQDYWLVKNSWGYSWGVEGYIMMSRNKNNQCGIAS 319

Query: 342 YVRSALV 348
                LV
Sbjct: 320 SASYPLV 326


>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
          Length = 333

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 121/318 (38%), Positives = 166/318 (52%), Gaps = 28/318 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
           F  QHNK Y++ VE   R  IF+ N   +    + ++  G+      +N+F DL   EF 
Sbjct: 30  FKSQHNKAYSSHVEELLRFKIFTENTLLV-AKHNAKYAKGLVSYKLAMNKFGDLLPHEFA 88

Query: 101 AKYLGFKLKPSYADRSV---PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
               G++ K +   R     PA + + +LP   DWR+  AVT VK+Q  CGS WAFSTTG
Sbjct: 89  KMVNGYRGKQNKEQRPTFIPPANLNDSSLPTTVDWRKKGAVTPVKNQGQCGSCWAFSTTG 148

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           ++EG +  KT KLVSLSEQ L+DC  +  + GC GG + N F  I  K  GG++ E+++P
Sbjct: 149 SLEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYI--KANGGIDTEESHP 206

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTG 272
           Y   D  C+  K        G+V + +  E D+ K +   GP++VAI+A   + Q Y  G
Sbjct: 207 YTAQDGDCKFKKADVGATDAGFVDIQQGSEDDLKKAVATVGPVSVAIDASHGSFQLYSQG 266

Query: 273 V-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           V   P     D  +  L H VL VGYGV   K       YW++KNSWG  WG+ GY  + 
Sbjct: 267 VYDEP-----DCSSSQLDHGVLTVGYGVKNGK------KYWLVKNSWGGDWGDNGYILMS 315

Query: 332 RG-DGSCGINDYVRSALV 348
           R  D  CGI       LV
Sbjct: 316 RDKDNQCGIASSASYPLV 333


>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
          Length = 441

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 129/351 (36%), Positives = 182/351 (51%), Gaps = 37/351 (10%)

Query: 8   AGVALLSLTVSVSSFM---VVGDEKLHHLHHVKHTA----LFNYFLEQHNKTYATLVEYY 60
           A V L    + VSS M   ++  +K HH    +  A    L+  +L +H K   +L E  
Sbjct: 1   ATVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKD 60

Query: 61  SRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
            R  IF  NLR I    D  +G  +    GL +F+DL+  E+++ YLG +LK      S+
Sbjct: 61  RRFEIFKDNLRFI----DEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLKRKATKSSL 116

Query: 118 PAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
              +     +P + DWR+  AV  VKDQ  CGS WAFST G +EG+    T  L++LSEQ
Sbjct: 117 RYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQ 176

Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
           EL+DCD   ++GC GG +  AF+ I++   GG++ E+ YPY+G D  C +  K A  V I
Sbjct: 177 ELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDTEEDYPYKGVDGRCDQTRKNAKVVTI 234

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSV 292
           + Y  V  +  +  K  + + P++VAI     A Q Y +G+   I         +L H V
Sbjct: 235 DLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGI------CGTDLDHGV 288

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           + VGYG +  K       YWI+KNSWG  WGE GY R+ R      G CGI
Sbjct: 289 VAVGYGTENGK------DYWIVKNSWGTSWGESGYIRMERNIASSAGKCGI 333


>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
 gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
          Length = 349

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 138/353 (39%), Positives = 183/353 (51%), Gaps = 39/353 (11%)

Query: 4   FYFFAGVALLSLTVS-VSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSR 62
           F FF  V+L  L  S  +   +VG        + K   LF  ++ +  + Y +  E   R
Sbjct: 8   FLFFLAVSLSFLAYSGFARDSIVGYAPEDLTSNDKLIDLFESWISRFGRVYESAEEKLER 67

Query: 63  LHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM 120
             IF  NL  I    DT      Y  GLNEF+DLS  EF+ KYLG  LKP  + R   A 
Sbjct: 68  FEIFKDNLFHID---DTNKKVRNYWLGLNEFADLSHEEFKNKYLG--LKPDLSKR---AQ 119

Query: 121 IP------NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLS 174
            P      ++ +P++ DWR+  AVT VK+Q  CGS WAFST   +EG+    T  L SLS
Sbjct: 120 CPEEFTYKDVAIPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 179

Query: 175 EQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-V 232
           EQELIDCD   ++GC GG +  AF  I++   GGL +E+ YPY  ++  C + K+ +  V
Sbjct: 180 EQELIDCDTTYNNGCNGGLMDYAFAYIVAN--GGLHKEEDYPYIMEEGTCDMRKEESDAV 237

Query: 233 KINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSH 290
            I+GY  V ++  +     + N P+++AI A     QFY  GV     F    G E L H
Sbjct: 238 TISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYSGGV-----FDGHCGTE-LDH 291

Query: 291 SVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
            V  VGYG      T K + Y I+KNSWG  WGEKGY R+ R     +G CGI
Sbjct: 292 GVAAVGYG------TSKGLDYIIVKNSWGPKWGEKGYIRMKRKTSKPEGICGI 338


>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
          Length = 485

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 129/351 (36%), Positives = 182/351 (51%), Gaps = 37/351 (10%)

Query: 8   AGVALLSLTVSVSSFM---VVGDEKLHHLHHVKHTA----LFNYFLEQHNKTYATLVEYY 60
           A V L    + VSS M   ++  +K HH    +  A    L+  +L +H K   +L E  
Sbjct: 7   ATVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSLTEKD 66

Query: 61  SRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
            R  IF  NLR I    D  +G  +    GL +F+DL+  E+++ YLG +LK      S+
Sbjct: 67  RRFEIFKDNLRFI----DEHNGKNLSYRLGLTKFADLTNDEYRSMYLGSRLKRKATKSSL 122

Query: 118 PAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
              +     +P + DWR+  AV  VKDQ  CGS WAFST G +EG+    T  L++LSEQ
Sbjct: 123 RYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQ 182

Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
           EL+DCD   ++GC GG +  AF+ I++   GG++ E+ YPY+G D  C +  K A  V I
Sbjct: 183 ELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDTEEDYPYKGVDGRCDQTRKNAKVVTI 240

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFFCDGGNENLSHSV 292
           + Y  V  +  +  K  + + P++VAI     A Q Y +G+   I         +L H V
Sbjct: 241 DLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGI------CGTDLDHGV 294

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           + VGYG +  K       YWI+KNSWG  WGE GY R+ R      G CGI
Sbjct: 295 VAVGYGTENGK------DYWIVKNSWGTSWGESGYIRMERNIASSAGKCGI 339


>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
 gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 119/315 (37%), Positives = 162/315 (51%), Gaps = 29/315 (9%)

Query: 35  HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDL 94
           H KH      F     + Y+   E   R  IF  N+++I+        S   G+N+F+DL
Sbjct: 36  HEKHEEWMTRF----KRVYSDAKEKEIRYKIFKENVQRIESFNKASEKSYKLGINQFADL 91

Query: 95  STAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           +  EF+     FK     + ++ P    NIT +P + DWR+  AVT +KDQ  CGS WAF
Sbjct: 92  TNEEFKTSRNRFKGHMC-SSQAGPFRYENITAVPSSMDWRKEGAVTAIKDQGQCGSCWAF 150

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEE 211
           S    +EG+    T KL+SLSEQEL+DCD   ED GC+GG + +AF  I  +   GL  E
Sbjct: 151 SAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFI--EQNQGLTTE 208

Query: 212 KTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
             YPY G D  C   ++A    KING+  V  +        V   P++VAI+A  +  QF
Sbjct: 209 ANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGALMKAVAKQPVSVAIDAGGFEFQF 268

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G+     F  D G E L H V  VGYG          + YW++KNSWG  WGE+GY 
Sbjct: 269 YSSGI-----FTGDCGTE-LDHGVAAVGYG------ESNGMNYWLVKNSWGTQWGEEGYI 316

Query: 329 RLYRG----DGSCGI 339
           R+ +     +G CGI
Sbjct: 317 RMQKDIDAKEGLCGI 331


>gi|343472975|emb|CCD15017.1| unnamed protein product [Trypanosoma congolense IL3000]
          Length = 293

 Score =  191 bits (485), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 110/271 (40%), Positives = 156/271 (57%), Gaps = 21/271 (7%)

Query: 85  VYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTG 140
            +G+  FSD+S  EF+A Y  G +   +   R  P  + N++    P   DWR+  AVT 
Sbjct: 15  TFGVTRFSDMSPEEFRATYHNGAEYYAAALKR--PRKVVNVSTGRPPMTVDWRKKGAVTP 72

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTI 200
           VKD+  C S WAFS  GNIEG +     +L SLS Q L+ CD+++ GCEGG +  AF  I
Sbjct: 73  VKDEGKCDSFWAFSAIGNIEGQWKIAGHELTSLSGQMLVSCDKKNYGCEGGLMDRAFQWI 132

Query: 201 MSKLGGGLEEEKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPM 257
           +S   G +  E++YPY    GD  AC ++ K    KI+ YV + +DE  +A++L +NGP+
Sbjct: 133 VSSNKGNVFTEQSYPYDSSWGDVPACNMSGKVVGAKISSYVDLPQDENAIAEWLAKNGPV 192

Query: 258 AVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNS 317
           A+A++A + + Y  GV           +  L H VL+VGY  D +K      PYWIIKNS
Sbjct: 193 AIAVDATSFRSYTGGV------LTSCISRRLDHGVLLVGYD-DTSK-----PPYWIIKNS 240

Query: 318 WGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           WG+GWGE GY R+ +G   C + +Y  SA+V
Sbjct: 241 WGKGWGEWGYIRIEKGTNQCLVQEYASSAVV 271


>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
          Length = 368

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 122/307 (39%), Positives = 159/307 (51%), Gaps = 29/307 (9%)

Query: 47  EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF 106
           ++H+       E + R   F  N+R I   +  +   G   LN F D+   EF+A + G 
Sbjct: 50  QEHHHVPRHHGEKHRRFGAFKDNVRYIH--EHNKRAPGYAPLNRFGDMGREEFRATFAGS 107

Query: 107 KLKPSYADRSVPAMIPNIT------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
                  D      +P         LPRA DWR   AVTGVKDQ  CGS WAFST  ++E
Sbjct: 108 HANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGSCWAFSTVVSVE 167

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           G+ A +T +LVSLSEQELIDCD  D+ GC+GG + NAF+ I  K  GG+  E  YPYR  
Sbjct: 168 GINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYI--KHSGGITTESAYPYRAA 225

Query: 220 DKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHP 276
           +  C  +  +   V I+G+ +V  +        V N P++VAI+A   + QFY  GV   
Sbjct: 226 NGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSFQFYSDGV--- 282

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
             F  D G + L H V +VGYG      T+    YWI+KNSWG  WGE GY R+ R  G 
Sbjct: 283 --FAGDCGTD-LDHGVAVVGYGE-----TNDGTEYWIVKNSWGTAWGEGGYIRMQRDSGY 334

Query: 337 ----CGI 339
               CGI
Sbjct: 335 DGGLCGI 341


>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
          Length = 330

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 121/306 (39%), Positives = 162/306 (52%), Gaps = 20/306 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEF 99
           +N F   H K+Y    E   R  IF  NL  I+           +  G+NEF+D++  EF
Sbjct: 28  WNAFKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEF 87

Query: 100 QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
               LG   +   A  SV        LP   DW +   VT VK+Q  CGS WAFSTTG++
Sbjct: 88  SNMLLGLGGRNKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSL 147

Query: 160 EGVYAAKTKKLVSLSEQELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           EG    KT KLVSLSEQ L+DC   + + GC GG +  AF  I  K  GG++ E  YPY 
Sbjct: 148 EGQVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYI--KKNGGIDTEAAYPYT 205

Query: 218 GDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYAL--QFYVTGVS 274
           G D  CR  +      ++G+V V S DE  + + +   GP++VAI+A ++  QFY  GV 
Sbjct: 206 GSDGTCRFLENKVGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVY 265

Query: 275 HPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
           +P  +FC   +  L H VL+VGYG +  K       YW++KNSWG  WG KGY ++ R  
Sbjct: 266 NP--WFCS--STELDHGVLVVGYGTEGGK------DYWLVKNSWGSSWGLKGYIKMVRNK 315

Query: 335 GS-CGI 339
            + CGI
Sbjct: 316 KNRCGI 321


>gi|33242874|gb|AAQ01141.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 128/331 (38%), Positives = 173/331 (52%), Gaps = 32/331 (9%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
           + H   +  +  QH K Y T  E YSR  IF  N  KI      EH         S    
Sbjct: 18  LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72

Query: 88  LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           +N+F D+   EF  + +G  LK          V     N TLP++ DWR    V+ VKDQ
Sbjct: 73  MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
             CGS WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++  + GC GG +  AF  I  
Sbjct: 133 GECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI-- 190

Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
           K  GGL+ E++YPY   DDK C+ +  +    + GY  V S +E  + + +   GP++VA
Sbjct: 191 KANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250

Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           I+A   + QFY +GV    Q  C    E L H VL VGYG      +H+A  +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303

Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           G  WG++GY  + R  +  CGI       LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334


>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 351

 Score =  191 bits (485), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 135/352 (38%), Positives = 180/352 (51%), Gaps = 33/352 (9%)

Query: 4   FYFFAGVALLSLTVSV--SSFMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEY 59
           FYFF  + +    V+     F +VG   E L  +  +    LF  ++  H K Y T+ E 
Sbjct: 8   FYFFLAMCMSFFVVTSFGKDFSIVGYWPEDLTSMDRL--IELFEEWISNHGKIYETIEEK 65

Query: 60  YSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA 119
           + R  +F  NL+ I    + +  S   G+NEF+DL+  EF+  YLG K++ S   +S   
Sbjct: 66  WHRFEVFKDNLKHIDE-TNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSRTRQSPEE 124

Query: 120 MIPN--ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
                 + LP++ DWR+  AVT VK+Q  CGS WAFST   +EG+       L SLSEQE
Sbjct: 125 FTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTSLSEQE 184

Query: 178 LIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKIN 235
           LIDCD+  ++GC GG +  AF  I+S   GGL +E+ YPY   +  C   K   + V I+
Sbjct: 185 LIDCDRPYNNGCHGGLMDYAFSFIVSS--GGLHKEEDYPYLEVESTCDNKKGELEVVTIS 242

Query: 236 GYVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSV 292
           GY  V   +E  + K L    P++VAI A     QFY  GV      F       L H V
Sbjct: 243 GYKDVPENNEASLIKALAHQ-PLSVAIEASGRDFQFYSGGV------FDGPCGTQLDHGV 295

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGIN 340
             VGYG      + K V Y I+KNSWG  WGEKGY R+ R  G     CGIN
Sbjct: 296 TAVGYG------SSKGVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAGLCGIN 341


>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
          Length = 334

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 127/356 (35%), Positives = 182/356 (51%), Gaps = 38/356 (10%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           F  G   + L+ ++S   ++ DE             ++ F   H K Y + +E   R+ I
Sbjct: 4   FLLGAVFVQLSAALSLTNLLADE-------------WHLFKATHKKEYPSQLEEKFRMKI 50

Query: 66  FSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SYADRSVPA 119
           +  N  K+    +L +    S    +N+F DL   EF++   G++ K    S A+ +   
Sbjct: 51  YLENKHKVAKHNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTF 110

Query: 120 MIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
           M P N+ +P + DWRE  A+T VKDQ  CG  WAFS+TG +EG    KT KLVSL EQ L
Sbjct: 111 MEPANVEVPESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNL 170

Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
           IDC  +  ++GC GG +  AF  I  K   G++ E TYPY  +D  CR N +       G
Sbjct: 171 IDCSGKYGNEGCNGGLMDQAFQYI--KDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRG 228

Query: 237 YVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           +V +   E D  K  V   GP++VAI+A   + QFY  GV +     CD  +++L H VL
Sbjct: 229 FVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPS--CD--SDDLDHGVL 284

Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           +VGYG D  K       YW++KNSW E WG++GY ++ R     CG+       LV
Sbjct: 285 VVGYGSDNGK------DYWLVKNSWSEHWGDQGYIKIARNRKNHCGVATAASYPLV 334


>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
          Length = 358

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 124/359 (34%), Positives = 191/359 (53%), Gaps = 43/359 (11%)

Query: 12  LLSLTVSVSSFMVVG---DEKLHHLH-HVKHTALFNY--------FLEQHNKTYATLVEY 59
           ++ +T+ + S  ++G    E++  +  H ++  L N+        F  +H K+Y T  E 
Sbjct: 1   MIRITLLLHSIFLLGFVNSEQISQIQEHPRNNLLINHPYYPVWTNFKLKHAKSYKTKDEE 60

Query: 60  YSRLHIFSGNLRKIQLLQDTEHGSGVYG----LNEFSDLSTAEFQAKYLGFKL------- 108
             R  +F+ N + I+   + E+ +G +     LN+F+D++ AEF+ +  GFKL       
Sbjct: 61  LLRFQVFASNHKVIEQ-HNIEYEAGQHSFALSLNKFADMTNAEFRQRMNGFKLPAKRKLA 119

Query: 109 --KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
             +P   D  +  M  N+T+P + DWR+   VT VKDQ  CGS WAFS TG++EG +  +
Sbjct: 120 KSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCWAFSATGSLEGQHYKQ 179

Query: 167 TKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
           T KLVSLSEQ L+DCD   +D+GC GG +  AF  +  +   G++ E +YPY+G D  CR
Sbjct: 180 TGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYV--ETNKGIDTEASYPYKGRDGRCR 237

Query: 225 LNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFC 281
              +       G+V +   +ET +   +   GP++VAI+A  +  QFY    SH + +  
Sbjct: 238 FKSEDVGATDTGFVDIPEGNETLLEAAIATVGPVSVAIDAASFKFQFY----SHGVYYDR 293

Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL-YRGDGSCGI 339
               E L H VL VGY       T     Y+I+KNSW E WG+ GY  +  R + +CGI
Sbjct: 294 SCSPEYLDHGVLAVGYNS-----TKDGKQYYIVKNSWSEDWGDDGYILMSRRKNNNCGI 347


>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
 gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
 gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
 gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
 gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
 gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
          Length = 422

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 123/310 (39%), Positives = 160/310 (51%), Gaps = 25/310 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ F   + K+YAT  E   R  IF  NL  I    + +  S    +N F DLS  EF+ 
Sbjct: 117 FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHT-HNQQGYSYSLKMNHFGDLSRDEFRR 175

Query: 102 KYLGFKLKPSYADR--SVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           KYLGFK   +       V   + N+    LP   DWR    VT VKDQ  CGS WAFSTT
Sbjct: 176 KYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTT 235

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           G +EG + AKT KLVSLSEQEL+DC + +    C GG +++AF  ++    GG+  E  Y
Sbjct: 236 GALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLD--SGGICSEDAY 293

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           PY   D+ CR       VKI G+  V R      K  +   P+++AI A     QFY  G
Sbjct: 294 PYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEG 353

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG--YFRL 330
           V      F      +L H VL+VGYG D+         +WI+KNSWG GWG  G  Y  +
Sbjct: 354 V------FDASCGTDLDHGVLLVGYGTDK----ESKKDFWIMKNSWGTGWGRDGYMYMAM 403

Query: 331 YRG-DGSCGI 339
           ++G +G CG+
Sbjct: 404 HKGEEGQCGL 413


>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
          Length = 368

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 122/307 (39%), Positives = 159/307 (51%), Gaps = 29/307 (9%)

Query: 47  EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF 106
           ++H+       E + R   F  N+R I   +  +   G   LN F D+   EF+A + G 
Sbjct: 50  QEHHHVPRHHGEKHRRFGAFKDNVRYIH--EHNKRAPGYPPLNRFGDMGREEFRATFAGS 107

Query: 107 KLKPSYADRSVPAMIPNIT------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
                  D      +P         LPRA DWR   AVTGVKDQ  CGS WAFST  ++E
Sbjct: 108 HANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGSCWAFSTVVSVE 167

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           G+ A +T +LVSLSEQELIDCD  D+ GC+GG + NAF+ I  K  GG+  E  YPYR  
Sbjct: 168 GINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYI--KHSGGITTESAYPYRAA 225

Query: 220 DKAC-RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHP 276
           +  C  +  +   V I+G+ +V  +        V N P++VAI+A   + QFY  GV   
Sbjct: 226 NGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSFQFYSDGV--- 282

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS 336
             F  D G + L H V +VGYG      T+    YWI+KNSWG  WGE GY R+ R  G 
Sbjct: 283 --FAGDCGTD-LDHGVAVVGYGE-----TNDGTEYWIVKNSWGTAWGEGGYIRMQRDSGY 334

Query: 337 ----CGI 339
               CGI
Sbjct: 335 DGGLCGI 341


>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
          Length = 338

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 127/356 (35%), Positives = 184/356 (51%), Gaps = 38/356 (10%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           F     L+ L+ ++S   ++ DE             ++ F   H K Y + +E   R+ I
Sbjct: 8   FLLAAVLVQLSAALSLTNLLADE-------------WHLFKATHKKEYPSQLEEKLRMKI 54

Query: 66  FSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---SYADRSVPA 119
           +  N  K+    +L +    S    +N+F DL   EF++   G++ K    S A+ +   
Sbjct: 55  YLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTF 114

Query: 120 MIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
           M P N+ +P + DWRE  A+T VKDQ  CGS WAFS+TG +EG    KT KLVSLSEQ L
Sbjct: 115 MEPANVEVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNL 174

Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
           IDC  +  ++GC GG +  AF  I  K   G++ E TYPY  +D  CR N +       G
Sbjct: 175 IDCSGKYGNEGCNGGLMDQAFQYI--KDNKGIDTENTYPYEAEDGVCRYNPRNRGAVDRG 232

Query: 237 YVSVSRDETDMAKYLVEN-GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           +V +   E D  K  V   GP++VAI+A   + QFY  G  +  +  CD  +++L H VL
Sbjct: 233 FVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGXYY--EPSCD--SDDLDHGVL 288

Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           +VGYG D  +       YW++KNSW E WG++GY ++ R     CG+       LV
Sbjct: 289 VVGYGSDNGE------DYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPLV 338


>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
 gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
          Length = 336

 Score =  191 bits (484), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 125/312 (40%), Positives = 172/312 (55%), Gaps = 22/312 (7%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
           H+K Y    E + RL ++  NLRKI+L  + EH  G +    G+N F D++  EF+    
Sbjct: 35  HSKNYHEKEEGWRRL-VWEKNLRKIEL-HNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMN 92

Query: 105 GFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
           G+K +          M PN +  PRA DWR+   VT VKDQ  CGS WAFSTTG +EG  
Sbjct: 93  GYKRREQRKYSGSLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQ 152

Query: 164 AAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DD 220
             KT KLVSLSEQ L+DC + +  +GC GG +  AF  +  K   GL+ E  YPY+G DD
Sbjct: 153 FRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV--KDNQGLDSEDFYPYKGTDD 210

Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
           + C+ N + + V   G+V + S  E  + K +   GP++VAI+A   + QFY +G    I
Sbjct: 211 QPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDAGHESFQFYQSG----I 266

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
            F  +  ++ L H VL+VGYG +      K   YWI+KNSW E WG+KG+  + +     
Sbjct: 267 YFEKECSSDELDHGVLVVGYGFEGEDVDGKK--YWIVKNSWSEKWGDKGFIYMAKDRHNH 324

Query: 337 CGINDYVRSALV 348
           CGI       LV
Sbjct: 325 CGIATAASYPLV 336


>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
          Length = 366

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 165/314 (52%), Gaps = 28/314 (8%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
            ++  +L +H K Y  L E   R  +F  NL  IQ   + ++ +   GLN+F+D++  E+
Sbjct: 38  TMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTNEEY 97

Query: 100 QAKYLGFK------LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           +  Y G K      L  + +     A      LP   DWR   AV  +KDQ  CGS WAF
Sbjct: 98  RVMYFGTKSDAKRRLMKTKSTGHRYAYSAGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAF 157

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           ST   +E +    T K VSLSEQEL+DCD+  ++GC GG +  AF+ I+    GG++ +K
Sbjct: 158 STVATVEAINKIVTGKFVSLSEQELVDCDRAYNEGCNGGLMDYAFEFIIQN--GGIDTDK 215

Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFY 269
            YPYRG D  C   KK A  V I+G+  V   + +  K  V + P+++AI A    LQ Y
Sbjct: 216 DYPYRGFDGICDPTKKNAKVVNIDGFEDVPPYDENALKKAVAHQPVSIAIEASGRDLQLY 275

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +GV      F      +L H V++VGYG      +   V YW+++NSWG GWGE GYF+
Sbjct: 276 QSGV------FTGKCGTSLDHGVVVVGYG------SENGVDYWLVRNSWGTGWGEDGYFK 323

Query: 330 LYRG----DGSCGI 339
           + R      G CGI
Sbjct: 324 MQRNVRTPTGKCGI 337


>gi|5881566|dbj|BAA84280.1| Cysteine proteinase [Clonorchis sinensis]
          Length = 232

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 103/219 (47%), Positives = 130/219 (59%), Gaps = 12/219 (5%)

Query: 130 FDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCE 189
           FDWRE+ AV  V DQ  CGS WAFS  GN+ G +  KT  L++LSEQ+L+DCD  DDGC+
Sbjct: 25  FDWREHGAVGPVLDQGKCGSCWAFSVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCD 84

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAK 249
           GG     +  I     GGLE    YPY G    C ++K      ING   +   E   A+
Sbjct: 85  GGYPPQTYTAIQKM--GGLELASDYPYTGVGGICHMDKSKFVAYINGSTILPLSEKVQAQ 142

Query: 250 YLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAV 309
            L   GP++ A+NA  LQ Y  G+  P   +CD    N  H+VL VGYGV   K      
Sbjct: 143 KLRAIGPLSSALNADTLQLYKGGIMRPK--WCDPAGVN--HAVLTVGYGVQNGK------ 192

Query: 310 PYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           PYWI+KNSWGE +GE+GYFR+YRGDG+CGIN  V +A++
Sbjct: 193 PYWIVKNSWGEDFGEEGYFRIYRGDGTCGINSIVTTAII 231


>gi|394331828|gb|AFN27133.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 116/318 (36%), Positives = 172/318 (54%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWR+  AVT VKDQ  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             G+IE  +A    +L +LS+  L+ C  +D+G   G +  AF+ ++  + G +  E +Y
Sbjct: 155 AVGSIESQWALAGHRLTALSDHHLVSCHDKDNGRPAGLMLQAFEWLLRNMNGTMFTEDSY 214

Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY            ++Q+    +I+GYV++   ET MA +L +NGP+++A++A +   Y 
Sbjct: 215 PYVSSSGYVPECSNSSQLVPGARIDGYVTIESSETVMAAWLAKNGPISIALDASSFMSYQ 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV       C G    L+H VL+VGY  +RT      VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGVV----TSCAG--MPLNHGVLLVGY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
          Length = 333

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 123/318 (38%), Positives = 169/318 (53%), Gaps = 28/318 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEF- 99
           F   H KTY + VE   R  IF+ N   I    + ++  G+     G+N+F+DL   EF 
Sbjct: 30  FKSTHKKTYKSNVEELLRFKIFTENSLFIAK-HNVKYAKGLVSYKLGINQFADLLPHEFV 88

Query: 100 --QAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
                Y G +L    +    PA + + +LP+  DWR+  AVT VKDQ  CGS WAFS+TG
Sbjct: 89  KMMNGYQGKRLAGRGSTYLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSSTG 148

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           ++EG +  KT KLVSLSEQ L+DC     + GC GG + N+F+ I  K  GG++ E +YP
Sbjct: 149 SLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYI--KANGGIDTEDSYP 206

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           Y  +D  CR  K+       G+V +    E D+ K +   GP++VAI+A   + Q Y  G
Sbjct: 207 YEAEDGDCRYKKEDVGATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSEG 266

Query: 273 V-SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           V   P     +  +E+L H VL VGYGV   K       YW++KNSW E WG+ GY  + 
Sbjct: 267 VYDEP-----NCSSESLDHGVLAVGYGVKNGK------KYWLVKNSWAETWGQDGYILMS 315

Query: 332 RG-DGSCGINDYVRSALV 348
           R  +  CGI       LV
Sbjct: 316 RDKNNQCGIASSASYPLV 333


>gi|237651947|gb|ACR08662.1| cathepsin F, partial [Drosophila silvestris]
          Length = 186

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 90/189 (47%), Positives = 125/189 (66%), Gaps = 5/189 (2%)

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G+YA +T +L   SEQEL+DCD  D  C GG + NA+  I  K  GGLE E  YPY    
Sbjct: 1   GLYAIRTGELQEFSEQELLDCDSTDSACNGGLMDNAYKAI--KDIGGLEYESEYPYAAKK 58

Query: 221 KACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQF 279
             C  N+  + V+I+G+V + + +ET M ++L+ NGP+++ +NA A+QFY  GVSHP   
Sbjct: 59  MQCHFNRTLSHVQISGFVDLPKGNETAMQEWLLSNGPISIGLNANAMQFYRGGVSHPWAP 118

Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
            C    +NL H VLIVGYGV      HK +PYWI+KNSWG+ WGE+GY+R+YRGD +CG+
Sbjct: 119 LCS--KKNLDHGVLIVGYGVSDYPNFHKTLPYWIVKNSWGQRWGEQGYYRIYRGDNTCGV 176

Query: 340 NDYVRSALV 348
           ++   SA++
Sbjct: 177 SEMATSAVL 185


>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
          Length = 707

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 131/315 (41%), Positives = 168/315 (53%), Gaps = 27/315 (8%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
           K  A F  ++ +H K Y ++ E   R  +F  NL  I   ++ E  S   GLNEF+DLS 
Sbjct: 399 KLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDE-RNKEVSSYWLGLNEFADLSH 457

Query: 97  AEFQAKYLGFKLK-PSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            EF++KYLG + + P   D S      ++  LP + DWR+  AVT VK+Q  CGS WAFS
Sbjct: 458 EEFKSKYLGLRAEFPRSRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCWAFS 517

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           T   +EG+    T  L +LSEQELIDCD   + GC GG +  AF  I S   GGL +E  
Sbjct: 518 TVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASN--GGLHKEDD 575

Query: 214 YPYRGDDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFY 269
           YPY  ++  C   K+    V I+GY  V  +DE  + K L    P++VAI A     QFY
Sbjct: 576 YPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQ-PLSVAIEASGRDFQFY 634

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GV     F    G E L H V  VGYG      + K + Y I+KNSWG  WGEKGY R
Sbjct: 635 SGGV-----FNGPCGTE-LDHGVAAVGYG------SSKGLDYIIVKNSWGPKWGEKGYIR 682

Query: 330 LYRG----DGSCGIN 340
           + R     +G CGIN
Sbjct: 683 MKRNTGKTEGLCGIN 697


>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
          Length = 345

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 109/304 (35%), Positives = 159/304 (52%), Gaps = 30/304 (9%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAMIPNIT----------LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           G  +  SY     P+ +P+            +P   DWRE  AVT VK+Q  CG  WAFS
Sbjct: 102 GLNIPNSYLS---PSPMPSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFS 158

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             G++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y
Sbjct: 159 AVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDY 216

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA-YALQFYVTGV 273
            Y G    CR   K   V+I+ Y  V   ET + + + +  P+++ I A + LQFY  G 
Sbjct: 217 EYLGQQYTCRSQGKTAAVQISNYQVVPEGETSLLQAVTKQ-PVSIGIAASHDLQFYAGGT 275

Query: 274 SHPIQFFCDGGNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
                   DG   N ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R
Sbjct: 276 Y-------DGSCANRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIR 323

Query: 333 GDGS 336
             G+
Sbjct: 324 DSGN 327


>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
          Length = 333

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 123/308 (39%), Positives = 162/308 (52%), Gaps = 21/308 (6%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLR---KIQLLQDTEHGSGVYGLNEFSDLSTAE 98
           +N +  +H K Y +  E  SR  I+  NL    K  L  D  H +   G+N+F+DL   E
Sbjct: 28  WNEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGINQFTDLQNEE 87

Query: 99  FQAKYLGFKLK-PSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
           F A   GF++   S A +    + PN    LP+  DWR    VT VKDQ  CGS WAFST
Sbjct: 88  FVAMMTGFRVSGTSKAAKGSTFLPPNNVGELPKTVDWRTKGYVTPVKDQGQCGSCWAFST 147

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           TG++EG +   T KLVSLSEQ L+DC   D GC+GG +  AF  I+    GG++ E +YP
Sbjct: 148 TGSVEGQHFKATGKLVSLSEQNLVDCSGRDAGCDGGFMDRAFQYIID--AGGIDTEASYP 205

Query: 216 YRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           Y+  D  C   K      + GY  V S  E  + K +   GP++VAI+A   + Q Y +G
Sbjct: 206 YKAVDGKCHFKKANVGATVTGYTDVTSGSEKALQKAVAHVGPISVAIDASHMSFQHYKSG 265

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V +  +  CD  +  L H VL VGYG      +     YWI+KNSW E WG  GY  + R
Sbjct: 266 VYN--EPGCD--STVLDHGVLAVGYGT-----SSDGTDYWIVKNSWAETWGMNGYVWMSR 316

Query: 333 G-DGSCGI 339
             D  CGI
Sbjct: 317 NKDNQCGI 324


>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
          Length = 343

 Score =  191 bits (484), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 118/306 (38%), Positives = 159/306 (51%), Gaps = 24/306 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ Q+ K Y    E  +R  IF+ N+  ++     +  S   G+N+F+DL+  EF A   
Sbjct: 42  WMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLGINQFADLTNEEFVASRN 101

Query: 105 GFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
            FK    S   R+      N++ +P   DWR+  AVT VK+Q  CG  WAFS     EG+
Sbjct: 102 KFKGHMCSSITRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGI 161

Query: 163 YAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           +   T KL+SLSEQEL+DCD +  D GCEGG + +AF  I+     GL  E  YPY G D
Sbjct: 162 HKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNH--GLSTEAQYPYEGVD 219

Query: 221 KACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPI 277
             C  NK + Q V I GY  V  +     +  V N P++VAI+A     QFY +GV    
Sbjct: 220 GTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGSDFQFYKSGV---- 275

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
             F       L H V  VGYGV     ++    YW++KNSWG  WGE+GY  + RG    
Sbjct: 276 --FTGSCGTELDHGVTAVGYGV-----SNDGTKYWLVKNSWGTDWGEEGYIMMQRGVEAA 328

Query: 334 DGSCGI 339
           +G CGI
Sbjct: 329 EGLCGI 334


>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
           Precursor
 gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
          Length = 351

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 109/306 (35%), Positives = 159/306 (51%), Gaps = 23/306 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  ++ ++ + Y    E   R  IF  N++ I+        S   G+N+F+D++ +EF A
Sbjct: 37  FEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVA 96

Query: 102 KYLGFKLKPSYADRSVPAMIPNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           +Y G  L P   +R       ++ +   P++ DWR+Y AV  VK+Q  CGS W+F+    
Sbjct: 97  QYTGVSL-PLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIAT 155

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           +EG+Y  KT  LVSLSEQE++DC     GC+GG ++ A+D I+S    G+  E+ YPY  
Sbjct: 156 VEGIYKIKTGYLVSLSEQEVLDC-AVSYGCKGGWVNKAYDFIISN--NGVTTEENYPYLA 212

Query: 219 DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPI 277
               C  N       I GY  V R++     Y V N P+A  I+A    Q+Y  GV    
Sbjct: 213 YQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGV---- 268

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
             F      +L+H++ I+GYG D +        YWI++NSWG  WGE GY R+ RG    
Sbjct: 269 --FSGPCGTSLNHAITIIGYGQDSS-----GTKYWIVRNSWGSSWGEGGYVRMARGVSSS 321

Query: 334 DGSCGI 339
            G CGI
Sbjct: 322 SGVCGI 327


>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
          Length = 361

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 118/306 (38%), Positives = 161/306 (52%), Gaps = 24/306 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ Q+ + Y    E   R  IF  N++ I+        S    +NEF+D +  EFQA   
Sbjct: 60  WMIQYGRVYKDEAEKSVRFQIFMDNVKFIEEFNKDGRQSYKLAVNEFADQTNEEFQASRN 119

Query: 105 GFKLK-PSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           G+K+   S   ++      N+T +P + DWR+  AVT VKDQ  CGS WAFST    EG+
Sbjct: 120 GYKMAVSSRPSQTTLFRYENVTAVPSSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGI 179

Query: 163 YAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
              KT KL+SLSEQEL+DCD+  ED GCEGG + + F+ I+   G  L  E +YPY   D
Sbjct: 180 TKLKTGKLISLSEQELVDCDKTGEDQGCEGGYMEDGFEFIVKNKGIAL--EASYPYTAAD 237

Query: 221 KACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
             C   ++A++  KI+GY  V  +        V N P++V+I+A   A QFY +GV    
Sbjct: 238 GTCNSKEEASRAAKISGYEKVPANSETALLKAVANQPVSVSIDASGVAFQFYSSGV---- 293

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
             F      +L H V  VGYG      T     YW++KNSWG  WG+ GY  + RG    
Sbjct: 294 --FTGECGTDLDHGVTAVGYGK-----TSDGTKYWLVKNSWGASWGDSGYIMMQRGVAAK 346

Query: 334 DGSCGI 339
            G CGI
Sbjct: 347 GGLCGI 352


>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
 gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
 gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
          Length = 466

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 121/348 (34%), Positives = 181/348 (52%), Gaps = 34/348 (9%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKH---TALFNYFLEQHNKTYATLVEYYSRLHIF 66
           + L+  T+S +S M +      H+H       +AL+  +L +H K+Y  L E   R  IF
Sbjct: 14  LMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNALGEKDKRFQIF 73

Query: 67  SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-------LKPSYADRSVPA 119
             NLR I       + S   GL +F+DL+  E+++ YLG K       L  + +DR +P 
Sbjct: 74  KDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLSKNKSDRYLPK 133

Query: 120 MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELI 179
           +    +LP + DWRE   + GVKDQ  CGS WAFS    +E + A  T  L+SLSEQEL+
Sbjct: 134 V--GDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELV 191

Query: 180 DCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGY 237
           DCD+  ++GC+GG +  AF+ ++    GG++ E+ YPY+  +  C +  K A  VKI+ Y
Sbjct: 192 DCDRSYNEGCDGGLMDYAFEFVIKN--GGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSY 249

Query: 238 VSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
             V  +     +  V + P+++A+ A     Q Y +G+      F       + H V+I 
Sbjct: 250 EDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGI------FTGKCGTAVDHGVVIA 303

Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           GYG      T   + YWI++NSWG  WGE GY R+ R      G CG+
Sbjct: 304 GYG------TENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGL 345


>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
          Length = 337

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 126/312 (40%), Positives = 167/312 (53%), Gaps = 22/312 (7%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
           H K Y    E + R+ I+  NLRKIQ   + EH  G++    G+N F D++  EF+    
Sbjct: 36  HGKNYHEKEEGWRRM-IWEKNLRKIQF-HNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMN 93

Query: 105 GFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
           G+K K     +    M PN + +P   DWRE   VT VKDQ  CGS WAFSTTG +EG  
Sbjct: 94  GYKHKTERKFKGSLFMEPNFLEVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQM 153

Query: 164 AAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DD 220
             K  KLVSLSEQ L+DC + +  +GC GG +  AF  I  K   GL+ E+ YPY G DD
Sbjct: 154 FRKQGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYI--KDNNGLDSEEAYPYLGTDD 211

Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
           + C  + K       G+V + S  E  + K +   GP++VAI+A   + QFY +G    I
Sbjct: 212 QPCHYDPKYNAANDTGFVDIPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSG----I 267

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
            F  +  +E L H VL+VGYG +      K   YWI+KNSW E WG+KGY  + +     
Sbjct: 268 YFEKECSSEELDHGVLVVGYGFEGEDVDGKK--YWIVKNSWSESWGDKGYIYMAKDRKNH 325

Query: 337 CGINDYVRSALV 348
           CGI       LV
Sbjct: 326 CGIATAASYPLV 337


>gi|401419663|ref|XP_003874321.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|1706259|sp|P35591.2|CYSP1_LEIPI RecName: Full=Cysteine proteinase 1; AltName: Full=Amastigote
           cysteine proteinase A-1; Flags: Precursor
 gi|1220383|gb|AAA91859.1| cysteine proteinase [Leishmania pifanoi]
 gi|322490556|emb|CBZ25817.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 354

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 119/319 (37%), Positives = 168/319 (52%), Gaps = 25/319 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
           +A +  F ++H K +    E   R + F  N++    L +T++    Y ++ +F+DL+  
Sbjct: 39  SAHYGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFL-NTQNPHAHYDVSGKFADLTPQ 97

Query: 98  EFQAKYL-----GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           EF   YL        LK    D  V    P+  +  + DWR+  AVT VK+Q +CGS WA
Sbjct: 98  EFAKLYLNPDYYARHLKDHKEDVHVDDSAPSGVM--SVDWRDKGAVTPVKNQGLCGSCWA 155

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FS  GNIEG +AA    LVSLSEQ L+ CD  D+GC GG +  A + IM    G +  E 
Sbjct: 156 FSAIGNIEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEA 215

Query: 213 TYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           +YPY    G    C  ++     KI G++S+  DE  +A+++ + GP+AVA++A   Q Y
Sbjct: 216 SYPYTSGGGTRPPCH-DEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLY 274

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GV       C     +L+H VLIVG+  +         PYWI+KNSWG  WGEKGY R
Sbjct: 275 FGGVVS----LCLA--WSLNHGVLIVGFNKNAKP------PYWIVKNSWGSSWGEKGYIR 322

Query: 330 LYRGDGSCGINDYVRSALV 348
           L  G   C + +Y  SA V
Sbjct: 323 LAMGSNQCMLKNYPVSATV 341


>gi|126021|sp|P25775.1|LMCPA_LEIME RecName: Full=Cysteine proteinase A; Flags: Precursor
 gi|9573|emb|CAA44094.1| cysteine proteinase [Leishmania mexicana]
          Length = 354

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 119/319 (37%), Positives = 168/319 (52%), Gaps = 25/319 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
           +A +  F ++H K +    E   R + F  N++    L +T++    Y ++ +F+DL+  
Sbjct: 39  SAHYGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTAYFL-NTQNPHAHYDVSGKFADLTPQ 97

Query: 98  EFQAKYL-----GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           EF   YL        LK    D  V    P+  +  + DWR+  AVT VK+Q +CGS WA
Sbjct: 98  EFAKLYLNPDYYARHLKNHKEDVHVDDSAPSGVM--SVDWRDKGAVTPVKNQGLCGSCWA 155

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FS  GNIEG +AA    LVSLSEQ L+ CD  D+GC GG +  A + IM    G +  E 
Sbjct: 156 FSAIGNIEGQWAASGHSLVSLSEQMLVSCDNIDEGCNGGLMDQAMNWIMQSHNGSVFTEA 215

Query: 213 TYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           +YPY    G    C  ++     KI G++S+  DE  +A+++ + GP+AVA++A   Q Y
Sbjct: 216 SYPYTSGGGTRPPCH-DEGEVGAKITGFLSLPHDEERIAEWVEKRGPVAVAVDATTWQLY 274

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GV       C     +L+H VLIVG+  +         PYWI+KNSWG  WGEKGY R
Sbjct: 275 FGGVVS----LCLA--WSLNHGVLIVGFNKNAKP------PYWIVKNSWGSSWGEKGYIR 322

Query: 330 LYRGDGSCGINDYVRSALV 348
           L  G   C + +Y  SA V
Sbjct: 323 LAMGSNQCMLKNYPVSATV 341


>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
          Length = 421

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 123/310 (39%), Positives = 160/310 (51%), Gaps = 25/310 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F+ F   + K+YAT  E   R  IF  NL  I    + +  S    +N F DLS  EF+ 
Sbjct: 116 FSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHT-HNQQGYSYSLKMNHFGDLSRDEFRR 174

Query: 102 KYLGFKLKPSYADR--SVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           KYLGFK   +       V   + N+    LP   DWR    VT VKDQ  CGS WAFSTT
Sbjct: 175 KYLGFKKSRNLKSHHLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGSCWAFSTT 234

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           G +EG + AKT KLVSLSEQEL+DC + +    C GG +++AF  ++    GG+  E  Y
Sbjct: 235 GALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLD--SGGICSEDAY 292

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           PY   D+ CR       VKI G+  V R      K  +   P+++AI A     QFY  G
Sbjct: 293 PYLARDEECRAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEG 352

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG--YFRL 330
           V      F      +L H VL+VGYG D+         +WI+KNSWG GWG  G  Y  +
Sbjct: 353 V------FDASCGTDLDHGVLLVGYGTDK----ESKKDFWIMKNSWGTGWGRDGYMYMAM 402

Query: 331 YRG-DGSCGI 339
           ++G +G CG+
Sbjct: 403 HKGEEGQCGL 412


>gi|30142040|gb|AAN34825.1| cysteine proteinase [Leishmania amazonensis]
 gi|30142042|gb|AAN34826.1| cysteine proteinase [Leishmania amazonensis]
 gi|30142572|gb|AAP21894.1| cysteine proteinase [Leishmania amazonensis]
          Length = 354

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 118/319 (36%), Positives = 168/319 (52%), Gaps = 25/319 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
           +A +  F ++H+K +    E   R + F  N++    L +T++    Y ++ +F+DL+  
Sbjct: 39  SAHYGSFKKRHSKAFGGDAEEGHRFNAFKQNMQTAYFL-NTQNPHAHYDVSGKFADLTPQ 97

Query: 98  EFQAKYLG-----FKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           EF   YL        LK    D  V    P+  +  + DWR+  AVT VK+Q +CGS WA
Sbjct: 98  EFAKLYLNPDYYTSHLKDHKEDVHVDDSAPSGVM--SVDWRDKGAVTPVKNQGLCGSCWA 155

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           FS  GNIEG +AA    LVSLSEQ L+ CD  D+GC GG +  A + IM    G +  E 
Sbjct: 156 FSAIGNIEGQWAASGHSLVSLSEQMLVSCDNVDEGCNGGLMDQAMNWIMQSHNGSVFTEA 215

Query: 213 TYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFY 269
           +YPY    G    C  ++     KI G++S+  DE  +A ++ + GP+AVA++A   Q Y
Sbjct: 216 SYPYTSGGGTRPPCH-DEGEVGAKITGFLSLPHDEERIADWVEKRGPVAVAVDATTWQLY 274

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
             GV      +      +L+H VLIVG+  +         PYWI+KNSWG  WGEKGY R
Sbjct: 275 FGGVVSLCLAW------SLNHGVLIVGFNKNAKP------PYWIVKNSWGSSWGEKGYIR 322

Query: 330 LYRGDGSCGINDYVRSALV 348
           L  G   C + +Y  SA V
Sbjct: 323 LAMGSNQCMLKNYPVSATV 341


>gi|71084306|gb|AAZ23598.1| cysteine protease [Leishmania major]
          Length = 327

 Score =  190 bits (483), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 123/321 (38%), Positives = 169/321 (52%), Gaps = 29/321 (9%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLN-EFSDLSTA 97
           +A +  F E+H K++    +   R + F  N++    L +T +    Y ++ +F+DL+  
Sbjct: 12  SAHYGRFKERHGKSFGEDADEGHRFNAFKQNMQTAYFL-NTHNPHAHYDVSGKFADLTPQ 70

Query: 98  EFQAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAF--DWREYDAVTGVKDQTMCGSS 150
           EF   YL     P Y      D      + +  L  A   DWRE  AVT VK+Q MCGS 
Sbjct: 71  EFAKLYL----NPDYYARRGKDYKEHVHVDDSVLSGAMSVDWREKVAVTPVKNQGMCGSC 126

Query: 151 WAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEE 210
           WAFS  GNIE  +A K   LVSLSEQ L+ CD  DDGC GG +  A + I+    G +  
Sbjct: 127 WAFSAIGNIESQWALKNHSLVSLSEQMLVSCDDIDDGCNGGLMDQAMEWIIQHHNGTVPT 186

Query: 211 EKTYPYR---GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQ 267
           E++YPY    G    C  +K     +I+GY+S+  DE  +A Y+ + GP+AVA++A   Q
Sbjct: 187 EESYPYASAGGTSPPCH-DKGEFGARISGYMSLPHDEKAIAAYVEKKGPVAVAVDATTWQ 245

Query: 268 FYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGY 327
            Y  GV       C G   +L+H VL+VG+   R K      PYWI+KNSWG  WGEKGY
Sbjct: 246 LYFGGVV----TLCFGW--SLNHGVLVVGFN-KRAK-----PPYWIVKNSWGTSWGEKGY 293

Query: 328 FRLYRGDGSCGINDYVRSALV 348
            RL  G   C + +Y  +A V
Sbjct: 294 IRLAMGSNQCLLKNYPVTATV 314


>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
          Length = 359

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 125/346 (36%), Positives = 182/346 (52%), Gaps = 35/346 (10%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVK-HTALFNYF--LEQHNKTYATLVEYYSRLHIFSG 68
           LL +++S++    V +    + H ++   +L+N +     H+     L E ++R ++F  
Sbjct: 6   LLFISLSLALIFTVANTFDFNEHDLESEKSLWNLYERWRSHHTVTRNLDEKHNRFNVFKA 65

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-----MIPN 123
           N+  +      +    +  LN+F D++  EF+  Y   K+      R +       M  N
Sbjct: 66  NVMHVHNTNKLDKPYKL-KLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHENGTFMYEN 124

Query: 124 -ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
            + +P + DWR   AVTGVKDQ  CGS WAFST   +EG+   KT+KLVSLSEQ+L+DCD
Sbjct: 125 AVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSLSEQQLVDCD 184

Query: 183 -QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS 241
            +E++GC GG +  AF+ I      G+  E  YPY   D  C + K+   V I+G+ +V 
Sbjct: 185 TEENEGCNGGLMEYAFEFIKQ---NGITTESNYPYAAKDGTCDVEKEDKAVSIDGHENVP 241

Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
            +            P++VAI+A  Y  QFY  GV      F    + +L+H V IVGYGV
Sbjct: 242 INNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGV------FTGHCDTDLNHGVAIVGYGV 295

Query: 300 --DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
             DRTK       YWI+KNSWG  WGE+GY R+ RG    +G CGI
Sbjct: 296 TQDRTK-------YWIMKNSWGSEWGEQGYIRMQRGISSREGLCGI 334


>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
          Length = 458

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 121/315 (38%), Positives = 167/315 (53%), Gaps = 33/315 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           L+  +  +H K+Y  + E   R   F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 39  LYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 97

Query: 97  AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            E++  YLG + KP      +DR + A   N  LP + DWR   AV  +KDQ + GS WA
Sbjct: 98  EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQEVAGSCWA 155

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           FS    +EG+    T  L+SLSEQEL+DCD   ++GC GG +  AFD I++   GG++ E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 213

Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
             YPY+G D+ C +N+K A  V I+ Y  V+ +     +  V N P++VAI A   A Q 
Sbjct: 214 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQL 273

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G+      F       L H V  VGYG +  K       YWI++NSWG+ WGE GY 
Sbjct: 274 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 321

Query: 329 RLYRG----DGSCGI 339
           R+ R      G CGI
Sbjct: 322 RMERNIKASSGKCGI 336


>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
          Length = 339

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 123/337 (36%), Positives = 172/337 (51%), Gaps = 28/337 (8%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           L+L  + S+++      L  L  V+H      ++ Q+ + Y   VE   R +IF  N+  
Sbjct: 12  LALVFATSAYLATSRTLLDSLMAVRH----EQWMAQYGRVYKNEVEKTKRYNIFKENVEY 67

Query: 73  IQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFD 131
           I+            G+N F+DL+  EF A   G+ L P     + P    N++ +P   D
Sbjct: 68  IESFNKAGTKPYKLGINAFADLTNKEFIASRNGYIL-PHECSSNTPFRYENVSAVPTTVD 126

Query: 132 WREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCE 189
           WR+  AVT VKDQ  CG  WAFS    +EG+    T  L+SLSEQEL+DCD +  D GCE
Sbjct: 127 WRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCE 186

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSVSRDETDMA 248
           GG + +AF  I++    GL  E  YPY+G D +C +     +  KI+GY  V  +     
Sbjct: 187 GGLMDDAFTFIINN--KGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESAL 244

Query: 249 KYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
           +  V N P++VAI+A     QFY +GV     F  + G E L H V  VGYG+       
Sbjct: 245 EKAVANQPVSVAIDAGGSDFQFYSSGV-----FTGECGTE-LDHGVTAVGYGI-----AE 293

Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
               YW++KNSWG  WGEKGY R+ +     +G CGI
Sbjct: 294 DGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGI 330


>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
           CEP1 [Vitis vinifera]
          Length = 341

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 122/319 (38%), Positives = 159/319 (49%), Gaps = 24/319 (7%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
            +LH          ++ Q+ + Y    E   R  IF  N+ +I+        S    +NE
Sbjct: 28  RNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINE 87

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGS 149
           F+DL+  EF      FK     +  +      N+T +P   DWR+  AVT +KDQ  CGS
Sbjct: 88  FADLTNEEFGTSRNRFKAHIC-STEATSFKYENVTAVPSTIDWRKKGAVTPIKDQGQCGS 146

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGG 207
            WAFS    +EG+    T KL+SLSEQEL+DCD   ED GC GG + +AF  I  K   G
Sbjct: 147 CWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFI--KQNHG 204

Query: 208 LEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--Y 264
           L  E  YPY G D  C   K A    KINGY  V  +     +  V + P+AVAI+A  +
Sbjct: 205 LTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGF 264

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
             QFY +GV     F    G E L H V  VGYG      +   + YW++KNSWG GWGE
Sbjct: 265 EFQFYSSGV-----FTGQCGTE-LDHGVAAVGYGT-----SDDGMKYWLVKNSWGTGWGE 313

Query: 325 KGYFRLYRG----DGSCGI 339
           +GY R+ R     +G CGI
Sbjct: 314 EGYIRMQRDVTAKEGLCGI 332


>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 122/317 (38%), Positives = 170/317 (53%), Gaps = 27/317 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQ 100
           F   H KTY + +E   R  IF+ +   I    + ++  G+     G+N+F DL   EF 
Sbjct: 30  FKTTHKKTYQSHMEELLRFKIFTES-SLIIARHNAKYAKGLVSYKLGMNQFGDLLAHEFA 88

Query: 101 AKYLGF--KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
             + G     K   +    PA + + +LP+A DWR+  AVT VKDQ  CGS WAFS TG+
Sbjct: 89  RIFNGHHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGS 148

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           +EG +  K  +LVSLSEQ L+DC Q   ++GCEGG + +AF  I  K   G++ EK+YPY
Sbjct: 149 LEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPY 206

Query: 217 RGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
              D  CR  K+       GYV + +  E D+ K +   GP++VAI+A   + Q Y  GV
Sbjct: 207 EAVDGECRFKKEDVGATDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGV 266

Query: 274 -SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
              P     +  +E+L H VL+VGYGV   K       YW++KNSW E WG++GY  + R
Sbjct: 267 YDEP-----ECSSEDLDHGVLVVGYGVKGGK------KYWLVKNSWAESWGDQGYILMSR 315

Query: 333 -GDGSCGINDYVRSALV 348
             +  CGI       LV
Sbjct: 316 DNNNQCGIASQASYPLV 332


>gi|33333714|gb|AAQ11975.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
           maculatus]
          Length = 323

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 118/305 (38%), Positives = 167/305 (54%), Gaps = 21/305 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVYG--LNEFSDLSTAEFQA 101
           F   H KTY ++VE   R  +F  NL  IQ      E G   +   + +F+D++  EF  
Sbjct: 26  FKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEF-L 84

Query: 102 KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
             L  +  P+    +V     +I    A DWR+  AVT VK+Q  CGS WAFS  G IEG
Sbjct: 85  DLLKLQGVPALPSDAVYFEETDIEEKDAVDWRKEGAVTPVKNQGHCGSCWAFSAVGAIEG 144

Query: 162 VYAAKTKKLVSLSEQELIDCDQE---DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
            +  K   LVSLS QEL+DC  E   ++GC GG +  AFD +  +   G++ E++YPY+ 
Sbjct: 145 QFFKKNGTLVSLSAQELVDCATEYYGNEGCNGGLMGQAFDFVEDE---GIQTEESYPYKA 201

Query: 219 DDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
               C++N +  T+VK      +  +E ++A+ +   GP+AVAI+A  L FY  G+    
Sbjct: 202 KRSICQMNGEYVTKVKT---YHLLLNEQEIARAVSAKGPVAVAIDASQLSFYDQGIVDE- 257

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
           +  C    E+L+H VL+VGYG      +   V YWI+KNSWG  WGEKGYFRL +   +C
Sbjct: 258 KCKCSKKREDLNHGVLVVGYG------SENGVDYWIVKNSWGADWGEKGYFRLKKDVKAC 311

Query: 338 GINDY 342
           GI +Y
Sbjct: 312 GIGNY 316


>gi|438000427|ref|YP_007250532.1| v-cath protein [Thysanoplusia orichalcea NPV]
 gi|429842964|gb|AGA16276.1| v-cath protein [Thysanoplusia orichalcea NPV]
          Length = 323

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 117/314 (37%), Positives = 172/314 (54%), Gaps = 21/314 (6%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +K    F  F+ + NK Y++  E   R  IF  NL +I  +   ++ S  Y +N+FSDLS
Sbjct: 22  LKAPNYFEEFVHRFNKNYSSETEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79

Query: 96  TAEFQAKYLGFKLKPSYADRSVPAMI---PNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
             E  AKY G  L P+        +I   P    P  FDWR  + VT VK+Q  CG+ WA
Sbjct: 80  KDETIAKYTGLSL-PTQTQNFCKVIILDQPPGKGPLDFDWRRLNKVTNVKNQGTCGACWA 138

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           F+T  ++E  YA K  +L++LSEQ++IDCD  D GC GG +  AF+ I+    GG++ E 
Sbjct: 139 FATLASLESQYAIKHNQLINLSEQQMIDCDFVDAGCNGGLLHTAFEAIIKM--GGVQLES 196

Query: 213 TYPYRGDDKACRLNKKATQVKI-NGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVT 271
            YPY  ++  CR+N     V++ + Y  V+  E  +   L   GP+ +AI+A  +  Y  
Sbjct: 197 DYPYEANNNNCRMNGNKFAVRVKDCYRYVTVYEEKLKDLLRVAGPIPMAIDAADIVNYKQ 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV      +C   N  L+H+VL+VGYGV+        +P+WI KN+WG  WGE GYFR+ 
Sbjct: 257 GVIR----YC--FNSGLNHAVLLVGYGVENN------IPFWIFKNTWGTDWGEDGYFRVQ 304

Query: 332 RGDGSCGINDYVRS 345
           +   +CG+ + + S
Sbjct: 305 QNINACGMRNELAS 318


>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
 gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
           Group]
 gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
           Japonica Group]
 gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
 gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
 gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
          Length = 378

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 120/303 (39%), Positives = 154/303 (50%), Gaps = 37/303 (12%)

Query: 58  EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
           E   R ++F  N R I              LN+F+D++T EF+  Y G + +     RS+
Sbjct: 66  EARRRFNVFVENARYIHEANRRGGRPFRLALNKFADMTTDEFRRTYAGSRARHH---RSL 122

Query: 118 PAMIPNI------------TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAA 165
                               LP A DWRE  AVTG+KDQ  CGS WAFST   +EGV   
Sbjct: 123 SGGRGGEGGSFRYGGDDEDNLPPAVDWRERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKI 182

Query: 166 KTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
           KT +LV+LSEQEL+DCD  D+ GC+GG +  AF  I  K  GG+  E  YPYR +   C 
Sbjct: 183 KTGRLVTLSEQELVDCDTGDNQGCDGGLMDYAFQFI--KRNGGITTESNYPYRAEQGRCN 240

Query: 225 LNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFC 281
             K ++  V I+GY  V  ++    +  V N P+AVA+ A     QFY  GV      F 
Sbjct: 241 KAKASSHDVTIDGYEDVPANDESALQKAVANQPVAVAVEASGQDFQFYSEGV------FT 294

Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGS 336
                +L H V  VGYG+     T     YWI+KNSWGE WGE+GY R+ RG     +G 
Sbjct: 295 GECGTDLDHGVAAVGYGI-----TRDGTKYWIVKNSWGEDWGERGYIRMQRGVSSDSNGL 349

Query: 337 CGI 339
           CGI
Sbjct: 350 CGI 352


>gi|300175452|emb|CBK20763.2| unnamed protein product [Blastocystis hominis]
          Length = 313

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 117/311 (37%), Positives = 160/311 (51%), Gaps = 28/311 (9%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           +++   FN F  ++ K Y    E   R  +F+ N+   Q +   +H   V G   F+D++
Sbjct: 17  LRYENTFNSFEARYGKNYINAAERAFRQKVFAYNMEWAQKINSEDHPYTV-GATPFADMT 75

Query: 96  TAEFQ-AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
             EF  +K  G  LKP     + P M P      A DWRE  AVT VK+Q  CGS WAFS
Sbjct: 76  NTEFAVSKLCGCMLKPKMTKPATPIMEP---AAEAVDWREKGAVTPVKNQASCGSCWAFS 132

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
            TG +EG       +L+SLSEQ+L+DCD +  GC GG ++ AF+    K   G+ +E+ Y
Sbjct: 133 ATGAMEGRNFVANGELISLSEQQLVDCDHQSSGCGGGLMTYAFEYAKKK---GMCKEEDY 189

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL--QFYVTG 272
           PY   D+ C+ +K    V   GY  V R +    K  V  GP++VA+ A ++  Q Y  G
Sbjct: 190 PYHAVDEDCKDDKCTPVVFPKGYEEVPRFDGAALKQAVSQGPVSVAVEADSIVFQMYTGG 249

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY- 331
           V             +L+H VL VGYG D          YWI+KNSWGE WG+KGY ++  
Sbjct: 250 VIDS-----SACGTSLNHGVLAVGYGAD----------YWIVKNSWGESWGDKGYLKIKY 294

Query: 332 --RGDGSCGIN 340
              G G CGIN
Sbjct: 295 TESGAGICGIN 305


>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  190 bits (483), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 108/301 (35%), Positives = 159/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKKNMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T KL+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGKLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAEGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|401430127|ref|XP_003886478.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
 gi|356491231|emb|CBZ41048.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
          Length = 375

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 111/272 (40%), Positives = 157/272 (57%), Gaps = 21/272 (7%)

Query: 86  YGLNEFSDLSTAEFQAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTG 140
           +G+ +F DLS AEF A+YL     F     +A +       +++ +P A DWRE  AVT 
Sbjct: 13  FGITKFFDLSEAEFAARYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTP 72

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTI 200
           VKDQ  CGS WAFS  GNIEG +     +LVSLSEQ+L+ CD  +DGC+GG +  AFD +
Sbjct: 73  VKDQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWL 132

Query: 201 MSKLGGGLEEEKTYPYRGDD----KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGP 256
           +    G L  E +YPY   +    +    ++     +I+G+V +   E  MA +L +NGP
Sbjct: 133 LQNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGP 192

Query: 257 MAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKN 316
           +A+A++A +   Y +GV       C G  + L+H VL+VGY  D T      VPYW+IKN
Sbjct: 193 IAIALDASSFMSYKSGV----LTACIG--KQLNHGVLLVGY--DMT----GEVPYWVIKN 240

Query: 317 SWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           SWG  WGE+GY R+  G  +C +++Y  SA V
Sbjct: 241 SWGGDWGEQGYVRVVMGVNACLLSEYPVSAHV 272


>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 361

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 122/341 (35%), Positives = 169/341 (49%), Gaps = 35/341 (10%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           +A L+  V+  S     D  ++  H          ++ ++ K Y    E   R  IF  N
Sbjct: 36  MAFLAFQVTCRSLQ---DASMYERHE--------QWMTRYGKVYKDPQEREKRFRIFKEN 84

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAMIPNIT-LP 127
           +  I+   +  +      +N+F+DL+  EF A    FK    S   R+      N+T +P
Sbjct: 85  VNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRTTTFKYENVTAVP 144

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--D 185
              DWR+  AVT +KDQ  CG  WAFS     EG++A  + KL+SLSEQEL+DCD +  D
Sbjct: 145 STVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVD 204

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDE 244
            GCEGG + +AF  ++     GL  E  YPY+G D  C  N+ A   V I GY  V  + 
Sbjct: 205 QGCEGGLMDDAFKFVIQN--HGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANN 262

Query: 245 TDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
               +  V N P++VAI+A     QFY +GV      F       L H V  VGYGV   
Sbjct: 263 EKALQKAVANQPVSVAIDASGSDFQFYKSGV------FTGSCGTELDHGVTAVGYGV--- 313

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
             ++    YW++KNSWG  WGE+GY R+ RG    +G CGI
Sbjct: 314 --SNDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGI 352


>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 118/308 (38%), Positives = 168/308 (54%), Gaps = 23/308 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ---LLQDTEHGSGVYGLNEFSDLSTAE 98
           +  +L+ H K Y    E   R+ I+ GNL  I+   L  D    S   G+NE+ D++  E
Sbjct: 27  WQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEE 85

Query: 99  FQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           F++   G+K++   +  S+     NI  LP   DWR    VT +K+Q  CGS W+FS TG
Sbjct: 86  FRSTMNGYKMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATG 145

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           ++EG    KT KL SLSEQ L+DC Q+  + GC+GG + +AF  I  K   G++ E +YP
Sbjct: 146 SLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYI--KDNNGIDTESSYP 203

Query: 216 YRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           Y   +  CR N        +G+  + S+ E+D+   +   GP+AVAI+A   + Q Y +G
Sbjct: 204 YEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKSG 263

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V H  +FFC      L H VL VGYG +  K       YW++KNSWGE WG+KGY  + R
Sbjct: 264 VYH--EFFCS--ETRLDHGVLAVGYGTESGK------DYWLVKNSWGESWGQKGYIMMSR 313

Query: 333 GD-GSCGI 339
               +CGI
Sbjct: 314 NKRNNCGI 321


>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
 gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 121/311 (38%), Positives = 160/311 (51%), Gaps = 29/311 (9%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++E+H K Y    E   R  IF  NL  I+             +N+F D +  EF+A YL
Sbjct: 38  WMEEHGKFYKDAAEKEQRFQIFKENLEFIESFNAAGDNGFNLSINQFGDQTNDEFKANYL 97

Query: 105 GFKLKP-------SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
             K KP       +  + SV        +P   DWRE  AVT +K Q +CGS WAF+T  
Sbjct: 98  NGKKKPLIGVGIAAIEEESVFRYENVTEVPATMDWRERGAVTPIKHQHLCGSCWAFATVA 157

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
            IEG++   T +LVSLSEQEL+DC + +  DGC GG + +A D I+ K  GG+  E  YP
Sbjct: 158 AIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDACDFIVKK--GGITSETNYP 215

Query: 216 YRGDDKACRLNKKATQV-KINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           Y   D  C + K    V KI GY  V  +        V N P+AV I A   A QFY +G
Sbjct: 216 YTRVDGKCNVRKGTYNVAKIKGYEHVPANNEKALLKAVANQPIAVYIAATKRAFQFYSSG 275

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           +   ++  C     +L H+V IVGYG      +   V YW++KNSWG  WGEKGY ++ R
Sbjct: 276 I---LKGKC---GIDLDHTVTIVGYGT-----SDDGVKYWLVKNSWGTKWGEKGYIKIKR 324

Query: 333 G----DGSCGI 339
                +GSCGI
Sbjct: 325 DVHAKEGSCGI 335


>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
 gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
 gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
 gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
 gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
           thaliana]
          Length = 452

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 123/336 (36%), Positives = 174/336 (51%), Gaps = 27/336 (8%)

Query: 15  LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
           L +S+S   V   E   +    +   ++  +L ++ K Y  L E   R  IF  NL+ ++
Sbjct: 18  LLISLSLGSVTATETTRNEAEARR--MYERWLVENRKNYNGLGEKERRFEIFKDNLKFVE 75

Query: 75  LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI--TLPRAFDW 132
                 + +   GL  F+DL+  EF+A YL  K++ +         +  +  +LP A DW
Sbjct: 76  EHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGEKYLYKVGDSLPDAIDW 135

Query: 133 REYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGG 191
           R   AV  VKDQ  CGS WAFS  G +EG+   KT +L+SLSEQEL+DCD   +DGC GG
Sbjct: 136 RAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGG 195

Query: 192 SISNAFDTIMSKLGGGLEEEKTYPYRGDD-KACRLNKKATQ-VKINGYVSVSRDETDMAK 249
            +  AF  I+    GG++ E+ YPY   D   C  +KK T+ V I+GY  V +++    K
Sbjct: 196 LMDYAFKFIIEN--GGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVPQNDEKSLK 253

Query: 250 YLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHK 307
             + N P++VAI A   A Q Y +GV      F      +L H V+ VGYG      +  
Sbjct: 254 KALANQPISVAIEAGGRAFQLYTSGV------FTGTCGTSLDHGVVAVGYG------SEG 301

Query: 308 AVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
              YWI++NSWG  WGE GYF+L R      G CG+
Sbjct: 302 GQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGV 337


>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
          Length = 336

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 127/307 (41%), Positives = 162/307 (52%), Gaps = 31/307 (10%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           + K YA+  E   R  +F  NL  I  + + +  S   GLNEF+DL+  EF+A YLG   
Sbjct: 36  YRKAYASFEEKVRRFEVFKDNLNHIDDI-NKKVTSYWLGLNEFADLTHDEFKATYLGLTP 94

Query: 109 KPSYADRS-------VPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
            P+ ++             + N  +P+  DWR+ +AVT VK+Q  CGS WAFST   +EG
Sbjct: 95  PPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEG 154

Query: 162 VYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           + A  T  L SLSEQELIDC  + ++GC GG +  AF  I S   GGL  E+ YPY  ++
Sbjct: 155 INAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIAST--GGLRTEEAYPYAMEE 212

Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPI 277
             C   K A  V I+GY  V + DE  + K L    P++VAI A     QFY  GV    
Sbjct: 213 GDCDEGKGAAVVTISGYEDVPANDEQALVKALAHQ-PVSVAIEASGRHFQFYSGGV---- 267

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----G 333
             F     E L H V  VGYG      T K   Y I+KNSWG  WGEKGY R+ R    G
Sbjct: 268 --FDGPCGEQLDHGVTAVGYG------TSKGQDYIIVKNSWGPHWGEKGYIRMKRGTGKG 319

Query: 334 DGSCGIN 340
           +G CGIN
Sbjct: 320 EGLCGIN 326


>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
 gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
           [Oryza sativa Japonica Group]
 gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
          Length = 350

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 120/310 (38%), Positives = 157/310 (50%), Gaps = 27/310 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG----LNEFSDLSTAEFQ 100
           ++ +H KTY    E   RL +F  N + I          G  G     N F+DL+  EF+
Sbjct: 45  WMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDEFR 104

Query: 101 AKYLGFKLKPSYADRSVPAMI-PNITL---PRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           A   G++  P+    +    +  N +L   P++ DWR   AVTGVKDQ  CG  WAFS  
Sbjct: 105 AARTGYQRPPAAVAGAGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWAFSAV 164

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             +EG+   +T +LVSLSEQEL+DCD   ED GCEGG +  AF  I  +  GGL  E +Y
Sbjct: 165 AAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARR--GGLAAESSY 222

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTG 272
           PYRG D ACR         I G+  V  ++       V   P++VAIN   Y  +FY  G
Sbjct: 223 PYRGVDGACRAAAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDRG 282

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V         G    L+H+V  VGYG            YW++KNSWG  WGE GY R+ R
Sbjct: 283 V-----LGGAGCGTELNHAVTAVGYGT-----ASDGTGYWLMKNSWGASWGEGGYVRIRR 332

Query: 333 G---DGSCGI 339
           G   +G+CGI
Sbjct: 333 GVGREGACGI 342


>gi|6649577|gb|AAF21462.1|U69121_1 cysteine proteinase PWCP2 [Paragonimus westermani]
          Length = 260

 Score =  190 bits (482), Expect = 9e-46,   Method: Compositional matrix adjust.
 Identities = 104/257 (40%), Positives = 146/257 (56%), Gaps = 7/257 (2%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           L+  F   + K YA   +   R  IF  NL + Q LQ  + G+  YG+ +FSDL+  EF 
Sbjct: 5   LYEQFKRXYGKVYAN-EDDQKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFA 63

Query: 101 AKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           AKYL   +      R  P  +     P   DWR   AVT V++Q  CGS WAFST GN+E
Sbjct: 64  AKYLSAPVNNDQVKRVRPTGLK--AAPERIDWRAKGAVTAVENQGSCGSCWAFSTAGNVE 121

Query: 161 GVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           G +  KT +LVSLS+Q+L+DCD+  DGC GG  ++++  IM    GGLE +  YPY G  
Sbjct: 122 GQWFIKTGQLVSLSKQQLVDCDRAADGCNGGWPASSYLEIMHM--GGLESQDDYPYAGVK 179

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFF 280
           + C + K+    KI+  +++   E D A YL E+GP++  +NA  LQ+Y +G+ HP    
Sbjct: 180 EQCFMEKERLLAKIDDSIALXPSEDDNAAYLAEHGPLSTLLNAITLQYYQSGIIHPSYXX 239

Query: 281 CDGGNENLSHSVLIVGY 297
           C     +L+H+VL VGY
Sbjct: 240 C--SPVDLNHAVLTVGY 254


>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 436

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 163/314 (51%), Gaps = 31/314 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           ++  ++ +H  TY  + E   R   F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 42  MYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQ-HNAAADAGVHSFRLGLNRFADLTN 100

Query: 97  AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
            E+++ YLG + KP   +R + A      N  LP + DWR+  AV  VKDQ  CGS WAF
Sbjct: 101 EEYRSTYLGARTKPDR-ERKLSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAF 159

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           S    +EG+    T  ++ LSEQEL+DCD   + GC GG +  AF+ I++   GG++ E+
Sbjct: 160 SAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDSEE 217

Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
            YPY+  D  C  NKK A  V I+GY  V  +     +  V N P++VAI A   A Q Y
Sbjct: 218 DYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLY 277

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +G+      F       L H V  VGYG +  K       YW+++NSWG  WGE GY R
Sbjct: 278 KSGI------FTGTCGTALDHGVAAVGYGTENGK------DYWLVRNSWGSVWGEDGYIR 325

Query: 330 LYR----GDGSCGI 339
           + R      G CGI
Sbjct: 326 MERNIKASSGKCGI 339


>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
 gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
          Length = 349

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 134/357 (37%), Positives = 183/357 (51%), Gaps = 35/357 (9%)

Query: 1   MSCFYFFAGVALLSLTVSVS-----SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYAT 55
           M+   F     +LS T+ ++      F +VG    H     K   LF  ++ +H+K Y +
Sbjct: 1   MALSTFSKATLILSATLFITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYRS 60

Query: 56  LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYA 113
           + E   R  IF  NL+ I    +T      Y  GLNEF+DLS  EF++KYLG +++    
Sbjct: 61  IEEKLHRFEIFLDNLKHID---ETNKKVSSYWLGLNEFADLSHEEFKSKYLGLRVEFPRK 117

Query: 114 DRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
             S      ++  LP + DWR   AVT VK+Q  CGS WAFST   +EG+    T  L S
Sbjct: 118 RSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTS 177

Query: 173 LSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKAT 230
           LSEQELIDCD+  ++GC GG +  AF  IMS    GL +E+ YPY  ++  C R  ++  
Sbjct: 178 LSEQELIDCDRSFNNGCYGGLMDYAFQYIMSN--SGLRKEEDYPYLMEEGRCIREKEQFE 235

Query: 231 QVKINGYVSV-SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNEN 287
            V I+GY  V + DE  + K L    P++VAI A +   QFY  G+      F       
Sbjct: 236 VVTISGYEDVPANDEQSLLKALSHQ-PVSVAIEASSRNFQFYKGGI------FTGRCGTQ 288

Query: 288 LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           + H V  VGYG      + +   Y I+KNSWG  WGE GY R+ R     +G CGIN
Sbjct: 289 MDHGVTAVGYG------SSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGIN 339


>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
 gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
           sativus]
          Length = 365

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 129/350 (36%), Positives = 183/350 (52%), Gaps = 45/350 (12%)

Query: 10  VALLS---LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIF 66
           +ALLS   L++S S+     D ++  +        ++ +L +H K Y  + E   R  IF
Sbjct: 8   LALLSFFFLSISASALSRRSDGEVREI--------YDLWLAKHGKAYNGIDEREKRFQIF 59

Query: 67  SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA------- 119
             NL+ I    ++E+ +   GLN F+DL+  E++A YLG +  P  A R + A       
Sbjct: 60  KENLKFIDD-HNSENRTYKVGLNMFADLTNEEYRALYLGTRSPP--ARRVMKAKTASRRY 116

Query: 120 MIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
            + N+  LP + DWR   AV  VK+Q  CGS WAFST   +EG+    T +L+SLSEQEL
Sbjct: 117 AVNNLDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQEL 176

Query: 179 IDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKING 236
           + CD++ + GC GG +  AF  I+    GGL+ E+ YPY   D  C   +K A  V I+ 
Sbjct: 177 VSCDKKYNSGCNGGLMDYAFQFIIDN--GGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDA 234

Query: 237 YVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
           Y  V  ++ +  K  V + P++VAI A   ALQ Y +GV      F       L H V+ 
Sbjct: 235 YEDVPANDEESLKKAVAHQPVSVAIEASGLALQLYQSGV------FTGKCGSALDHGVVA 288

Query: 295 VGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
           VGYG          V YW+++NSWG  WGE GYF+L R      +G CGI
Sbjct: 289 VGYG------KENGVDYWLVRNSWGTSWGEDGYFKLERNVKHITEGKCGI 332


>gi|11359985|pir||T46294 hypothetical protein DKFZp434F0610.1 - human (fragment)
 gi|6808322|emb|CAB70900.1| hypothetical protein [Homo sapiens]
          Length = 308

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 111/279 (39%), Positives = 163/279 (58%), Gaps = 4/279 (1%)

Query: 20  SSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           S   ++ ++ L     VK  ++F  F+  +N+TY +  E   RL +F  N+ + Q +Q  
Sbjct: 5   SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 64

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVT 139
           + G+  YG+ +FSDL+  EF+  YL   L+    ++   A       P  +DWR   AVT
Sbjct: 65  DRGTAQYGVTKFSDLTEEEFRTIYLNTLLRKEPGNKMKQAKSVGDLAPPEWDWRSKGAVT 124

Query: 140 GVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDT 199
            VKDQ MCGS WAFS TGN+EG +      L+SLSEQEL+DCD+ D  C GG  SNA+  
Sbjct: 125 KVKDQGMCGSCWAFSVTGNVEGQWFLNQGTLLSLSEQELLDCDKMDKACMGGLPSNAYSA 184

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAV 259
           I  K  GGLE E  Y Y+G  ++C  + +  +V IN  V +S++E  +A +L + GP++V
Sbjct: 185 I--KNLGGLETEDDYSYQGHMQSCNFSAEKAKVYINDSVELSQNEQKLAAWLAKRGPISV 242

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
           AINA+ +QFY  G+S P++  C      + H+VL+VGYG
Sbjct: 243 AINAFGMQFYRHGISRPLRPLCSPW--LIDHAVLLVGYG 279


>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
          Length = 458

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 121/315 (38%), Positives = 166/315 (52%), Gaps = 33/315 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           L+  +  +H K+Y  + E   R   F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 39  LYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE-HNAAADAGVHSFRLGLNRFADLTN 97

Query: 97  AEFQAKYLGFKLKP----SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            E++  YLG + KP      +DR + A   N  LP + DWR   AV  +KDQ  CGS WA
Sbjct: 98  EEYRDTYLGLRNKPRRERKVSDRYLAA--DNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEE 211
           FS    +E +    T  L+SLSEQEL+DCD   ++GC GG +  AFD I++   GG++ E
Sbjct: 156 FSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINN--GGIDTE 213

Query: 212 KTYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQF 268
             YPY+G D+ C +N+K A  V I+ Y  V+ +     +  V N P++VAI A   A Q 
Sbjct: 214 DDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQL 273

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G+      F       L H V  VGYG +  K       YWI++NSWG+ WGE GY 
Sbjct: 274 YSSGI------FTGKCGTALDHGVAAVGYGTENGK------DYWIVRNSWGKSWGESGYV 321

Query: 329 RLYRG----DGSCGI 339
           R+ R      G CGI
Sbjct: 322 RMERNIKASSGKCGI 336


>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
          Length = 332

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 126/351 (35%), Positives = 183/351 (52%), Gaps = 40/351 (11%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
           A++++TV+ SS  ++  +             +  F   H K+Y + +E   R  IF+ N 
Sbjct: 9   AIVAVTVAASSQEILRTQ-------------WEAFKTTHKKSYQSHMEELLRFKIFTEN- 54

Query: 71  RKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQAKYLGF--KLKPSYADRSVPAMIPNI 124
             I    + ++  G+     G+N+F DL   EF   + G     K   +    PA + + 
Sbjct: 55  SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSTFLPPANVNDS 114

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
           +LP+  DWR+  AVT VKDQ  CGS WAFS TG++EG +  K  +LVSLSEQ L+DC Q 
Sbjct: 115 SLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQS 174

Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
             ++GCEGG + +AF  I  K   G++ EK+YPY   D  CR  K+       GYV + +
Sbjct: 175 FGNNGCEGGLMEDAFKYI--KANDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKA 232

Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYG 298
             E D+ K +   GP++VAI+A   + Q Y  GV   P     +  +E+L H VL+VGYG
Sbjct: 233 GSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP-----ECSSEDLDHGVLVVGYG 287

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
           V   K       YW++KNSW E WG++GY  + R  +  CGI       LV
Sbjct: 288 VKGGK------KYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332


>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
          Length = 343

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 124/345 (35%), Positives = 174/345 (50%), Gaps = 35/345 (10%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
            F  + LL++ V+  +     D+ +   H          ++  + K Y    E   RL I
Sbjct: 14  LFFCLGLLAIQVTSRTLQ---DDSIFERHE--------QWMTHYGKVYKNPQEREKRLRI 62

Query: 66  FSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAMIPN 123
           F+ NL+ I+   +  +      G+N+F+DL+  EF A    FK    S   R+      N
Sbjct: 63  FTENLKYIEASNNAGNNKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRTTTFKYEN 122

Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
            ++P   DWR+  AVT VK+Q  CG  WAFS     EG++   T KLVSLSEQEL+DCD 
Sbjct: 123 TSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDT 182

Query: 184 E--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV 240
              D GCEGG + +AF  I+     G+  E  YPY+G D  C+ N+ +T    I GY  V
Sbjct: 183 NGVDQGCEGGLMDDAFKFIIQN--NGISTEAGYPYQGVDGTCKANEASTSAATITGYEDV 240

Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
             +  +  +  V N P++VAI+A     QFY +GV     F    G E L H V  VGYG
Sbjct: 241 PANNENALQKAVANQPISVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHGVTAVGYG 294

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           +     ++    YW++KNSWG  WGE+GY R+ R     +G CGI
Sbjct: 295 I-----SNDGTKYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGI 334


>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
          Length = 344

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 109/301 (36%), Positives = 158/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK+Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYVSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G    CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GQQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGGNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   N ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCANRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGEDGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  190 bits (482), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 118/323 (36%), Positives = 161/323 (49%), Gaps = 32/323 (9%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
           D  +H  H          ++ Q+ K Y    E   R  IF  N+++I+   +  + S   
Sbjct: 32  DASMHERHE--------QWMAQYGKVYKDSYEKELRSKIFKENVQRIEAFNNAGNKSYKL 83

Query: 87  GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQT 145
           G+N+F+DL+  EF+A+        S + R+      ++T +P + DWR+  AVT +KDQ 
Sbjct: 84  GINQFADLTNEEFKARNRFKGHMCSNSTRTPTFKYEHVTSVPASLDWRQKGAVTPIKDQG 143

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSK 203
            CG  WAFS     EG+    T KL+SLSEQEL+DCD +  D GCEGG + +AF  IM  
Sbjct: 144 QCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQN 203

Query: 204 LGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAIN 262
              GL  E  YPY+G D  C  N +A     I G+  V  +        V N P++VAI+
Sbjct: 204 --KGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAID 261

Query: 263 AYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
           A     QFY +GV      F       L H V  VGYG D          YW++KNSWGE
Sbjct: 262 ASGSEFQFYSSGV------FTGSCGTELDHGVTAVGYGSD------GGTKYWLVKNSWGE 309

Query: 321 GWGEKGYFRLYRG----DGSCGI 339
            WGE+GY R+ R     +G CG 
Sbjct: 310 QWGEQGYIRMQRDVAAEEGLCGF 332


>gi|426252044|ref|XP_004019728.1| PREDICTED: cathepsin W [Ovis aries]
          Length = 375

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 111/327 (33%), Positives = 169/327 (51%), Gaps = 24/327 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  Q+N++Y    E+  RL IF+ NL K Q LQ+ + G+  +G+ +FSDL+  EF 
Sbjct: 41  VFRLFQMQYNRSYPNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFV 100

Query: 101 AKYLGFKLKPSY--ADRSVPAMIPNITLPRAFDWR-EYDAVTGVKDQTMCGSSWAFSTTG 157
             Y G ++        R V +     + P   DWR + + ++ V++Q  C   WA +  G
Sbjct: 101 QLY-GSRVAGEALGVSRKVGSEEWGESQPPTCDWRNKPNTISPVRNQRHCNCCWAMAAAG 159

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           NIE ++A K  + V     EL+DCD+  +GC+GG + +AF T++     GL  E  YP+ 
Sbjct: 160 NIEALWAIKFNRSVEERGGELLDCDRCGNGCKGGFVWDAFLTVLKNR--GLASETDYPFD 217

Query: 218 GDDKA--CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           G  K   C   K      I  ++ +   E  +A++L   GP+ V IN   LQ Y  GV  
Sbjct: 218 GSGKTHRCLAEKHKKVAWIQDFIMLQACEQSIARHLATQGPITVTINVKLLQQYQKGVIK 277

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRT--------------KFTHKAVPYWIIKNSWGEG 321
                CD    ++ HSVL+VG+G  ++                  +++ YW +KNSWG  
Sbjct: 278 ATPTTCD--PRHVDHSVLLVGFGKTKSVEGRQGKAASFRSYTRPRRSMAYWTLKNSWGPH 335

Query: 322 WGEKGYFRLYRGDGSCGINDYVRSALV 348
           WGE+GYFRL+RG  +CGI  Y  +A+V
Sbjct: 336 WGEEGYFRLHRGSNTCGITKYPVTAIV 362


>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
 gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
           vulgaris gb|U52970 and is a member of the papain
           cysteine protease family PF|00112 [Arabidopsis thaliana]
 gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
          Length = 343

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 123/340 (36%), Positives = 175/340 (51%), Gaps = 27/340 (7%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           +A+L   V ++S +   D  ++  H       F  +L+ H+K Y    E+  R  I+  N
Sbjct: 12  LAVLICFVLIASKLCSVDSSVYDPHKTLKQR-FEKWLKTHSKLYGGRDEWMLRFGIYQSN 70

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP-SYADRSVPAMIPNITLPR 128
           ++ I  + ++ H       N F+D++ +EF+A +LG          +  P   P   +P 
Sbjct: 71  VQLIDYI-NSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRPVCDPAGNVPD 129

Query: 129 AFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDD 186
           A DWR   AVT +++Q  CG  WAFS    IEG+   KT  LVSLSEQ+LIDCD    + 
Sbjct: 130 AVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNK 189

Query: 187 GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDET 245
           GC GG +  AF+ I  K  GGL  E  YPY G +  C   K K   V I GY  V+++E 
Sbjct: 190 GCSGGLMETAFEFI--KTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQKVAQNEA 247

Query: 246 DMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTK 303
            + +      P++V I+A  +  Q Y +GV      F +    NL+H V +VGYGV+  +
Sbjct: 248 SL-QIAAAQQPVSVGIDAGGFIFQLYSSGV------FTNYCGTNLNHGVTVVGYGVEGDQ 300

Query: 304 FTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
                  YWI+KNSWG GWGE+GY R+ RG     G CGI
Sbjct: 301 ------KYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGI 334


>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
          Length = 332

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 127/351 (36%), Positives = 183/351 (52%), Gaps = 40/351 (11%)

Query: 11  ALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNL 70
           A++++TV+ SS  ++  +             +  F   H KTY + +E   R  IF+ N 
Sbjct: 9   AIVAVTVAASSQEILRTQ-------------WEAFKTTHKKTYQSHMEELLRFKIFTEN- 54

Query: 71  RKIQLLQDTEHGSGV----YGLNEFSDLSTAEFQAKYLGF--KLKPSYADRSVPAMIPNI 124
             I    + ++  G+     G+N+F DL   EF   + G     K   +    PA + + 
Sbjct: 55  SLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSTFLPPANVNDS 114

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
           +LP+  DWR+  AVT VKDQ  CGS WAFS TG++EG +  K  +LVSLSEQ L+DC Q 
Sbjct: 115 SLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQS 174

Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV-S 241
             ++GCEGG + +AF  I  K   G++ EK+YPY   D  CR  K+       GYV + +
Sbjct: 175 FGNNGCEGGLMEDAFKYI--KENDGIDTEKSYPYEAVDGECRFKKEDVGATDTGYVEIKA 232

Query: 242 RDETDMAKYLVENGPMAVAINA--YALQFYVTGV-SHPIQFFCDGGNENLSHSVLIVGYG 298
             E D+ K +   GP++VAI+A   + Q Y  GV   P     +  +E+L H VL+VGYG
Sbjct: 233 GSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEP-----ECSSEDLDHGVLVVGYG 287

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
           V   K       YW++KNSW E WG++GY  + R  +  CGI       LV
Sbjct: 288 VKGGK------KYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPLV 332


>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 131/351 (37%), Positives = 187/351 (53%), Gaps = 30/351 (8%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           VA+L++ +S +      D +L      +H  L+  +   H K Y    E + R+ ++  N
Sbjct: 4   VAVLAVCLSAALSAPSLDPQLD-----EHWDLWKSW---HTKKYHEKEEGWRRM-VWEKN 54

Query: 70  LRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN-I 124
           L+KI+L  + EH  G +    G+N F D++  EF+    G+K K     +    M PN +
Sbjct: 55  LKKIEL-HNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYKRKSERKFKGSLFMEPNFL 113

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
             PR+ DWR+   VT VKDQ  CGS WAFSTTG +EG +  KT KLVSLSEQ L+DC + 
Sbjct: 114 EAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRP 173

Query: 185 D--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV- 240
           +  +GC GG +  AF  I  K   GL+ E +YPY G DD+ C  + K       G++ + 
Sbjct: 174 EGNEGCNGGLMDQAFQYI--KDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIP 231

Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
           S  E  + K +   GP++VAI+A   + QFY +G    I +  +  +E L H VL+VGYG
Sbjct: 232 SGKERALMKAVAAVGPVSVAIDAGHESFQFYQSG----IYYEKECSSEELDHGVLVVGYG 287

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
            +      K   YWI+KNSW E WG+KGY  + +     CGI       LV
Sbjct: 288 FEGEDVDGKK--YWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336


>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 118/304 (38%), Positives = 154/304 (50%), Gaps = 22/304 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ Q+ + Y    E   R  IF  N+ +I+        S    +NEF+DL+  EF+A   
Sbjct: 42  WMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRASRN 101

Query: 105 GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYA 164
            FK      + +         +P   DWR+  AVT +KDQ  CGS WAFS    +EG+  
Sbjct: 102 RFKAHICSTEATSFKYEHVAAVPSTVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQ 161

Query: 165 AKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
             T KL+SLSEQEL+DCD   ED GC GG + +AF  I  +   GL  E  YPY G D  
Sbjct: 162 LSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFI--EQNHGLATEANYPYAGTDGT 219

Query: 223 CRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQF 279
           C   K A    KINGY  V  +     +  V + P+AVAI+A  +  QFY +GV     F
Sbjct: 220 CNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQFYSSGV-----F 274

Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DG 335
               G E L H V  VGYG      +   + YW++KNSWG GWGE GY R+ R     +G
Sbjct: 275 TGQCGTE-LDHGVAAVGYGT-----SDDGMKYWLVKNSWGTGWGEVGYIRMQRDVTAKEG 328

Query: 336 SCGI 339
            CGI
Sbjct: 329 LCGI 332


>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
          Length = 890

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 116/306 (37%), Positives = 158/306 (51%), Gaps = 24/306 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ ++ K Y    E   R  IF  N+  I+   +  +      +N+F+DL+  EF A   
Sbjct: 589 WMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRN 648

Query: 105 GFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
            FK    S   R+      N+T +P   DWR+  AVT +KDQ  CG  WAFS     EG+
Sbjct: 649 RFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGI 708

Query: 163 YAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           +A  + KL+SLSEQEL+DCD +  D GCEGG + +AF  ++     GL  E  YPY+G D
Sbjct: 709 HALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQ--NHGLNTEANYPYKGVD 766

Query: 221 KACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPI 277
             C  N+ A   V I GY  V  +     +  V N P++VAI+A     QFY +GV    
Sbjct: 767 GKCNANEAANDVVTITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKSGV---- 822

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
             F       L H V  VGYGV     ++    YW++KNSWG  WGE+GY R+ RG    
Sbjct: 823 --FTGSCGTELDHGVTAVGYGV-----SNDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSE 875

Query: 334 DGSCGI 339
           +G CGI
Sbjct: 876 EGLCGI 881


>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
 gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
          Length = 340

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 122/343 (35%), Positives = 170/343 (49%), Gaps = 33/343 (9%)

Query: 7   FAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIF 66
            A + LL   VS +    + D  +H  H          ++ +  + Y    E   R  IF
Sbjct: 12  LALIFLLGALVSQAMARTLQDASMHEKHE--------EWMSRFGRVYNDGNEKEIRYKIF 63

Query: 67  SGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITL 126
             N+++I+        S   G+N+F+DL+  EF+     FK     + ++ P    N+T 
Sbjct: 64  KENVQRIESFNKASGKSYKLGINQFADLTNEEFKTSRNRFKGHMC-SSQAGPFRYENLTA 122

Query: 127 -PRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ-- 183
            P + DWR+  AVT +KDQ  CGS WAFS    +EG+    T KL+SLSEQEL+DCD   
Sbjct: 123 APSSMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKG 182

Query: 184 EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSR 242
           ED GC+GG + +AF  I      GL  E  YPY G D  C   ++A    KING+  V  
Sbjct: 183 EDQGCQGGLMDDAFKFIEQNQ--GLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPA 240

Query: 243 DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
           +        V   P++VAI+A  +  QFY +G+     F  D G E L H V  VGYG  
Sbjct: 241 NNEGALMKAVAKQPVSVAIDAGGFGFQFYSSGI-----FTGDCGTE-LDHGVAAVGYG-- 292

Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
                   + YW++KNSWG  WGE+GY R+ +     +G CGI
Sbjct: 293 ----ESNGMNYWLVKNSWGTQWGEEGYIRMQKDIDAKEGLCGI 331


>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
 gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
          Length = 461

 Score =  189 bits (481), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 117/313 (37%), Positives = 162/313 (51%), Gaps = 29/313 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTA 97
           ++  ++ +H +TY  + E   R  +F  NLR I       D    S   GLN F+DL+  
Sbjct: 40  MYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHSFRLGLNRFADLTNE 99

Query: 98  EFQAKYLGFKLKPSYADRSVPAMIP---NITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           E+++ YLG + KP   +R + A      N  LP   DWR+  AV  +KDQ  CGS WAFS
Sbjct: 100 EYRSTYLGARTKPDR-ERKLSARYQADDNEELPETVDWRKKGAVAAIKDQGGCGSCWAFS 158

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
               +EG+    T  ++ LSEQEL+DCD   ++GC GG +  AF+ I++   GG++ E+ 
Sbjct: 159 AIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMDYAFEFIINN--GGIDSEED 216

Query: 214 YPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYV 270
           YPY+  D  C  NKK A  V I+GY  V  +     +  V N P++VAI A   A Q Y 
Sbjct: 217 YPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLYK 276

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +G+      F       L H V  VGYG +  K       YW+++NSWG  WGE GY R+
Sbjct: 277 SGI------FTGTCGTALDHGVAAVGYGTENGK------DYWLVRNSWGTVWGEDGYIRM 324

Query: 331 YRG----DGSCGI 339
            R      G CGI
Sbjct: 325 ERNIKASSGKCGI 337


>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
          Length = 343

 Score =  189 bits (480), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 124/345 (35%), Positives = 174/345 (50%), Gaps = 35/345 (10%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
            F  + LL++ V+  +     D+ +   H          ++  + K Y    E   RL I
Sbjct: 14  LFFCLGLLAIQVTSRTLQ---DDSIFERHE--------QWMTHYGKVYKNPQEREKRLRI 62

Query: 66  FSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAMIPN 123
           F+ NL+ I+   +  +      G+N+F+DL+  EF A    FK    S   R+      N
Sbjct: 63  FTENLKYIEASNNAGNKKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRTTTFKYEN 122

Query: 124 ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
            ++P   DWR+  AVT VK+Q  CG  WAFS     EG++   T KLVSLSEQEL+DCD 
Sbjct: 123 TSVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDT 182

Query: 184 E--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV 240
              D GCEGG + +AF  I+     G+  E  YPY+G D  C+ N+ +T    I GY  V
Sbjct: 183 NGVDQGCEGGLMDDAFKFIIQN--NGISTEAGYPYQGVDGTCKANEASTSAATITGYEDV 240

Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
             +  +  +  V N P++VAI+A     QFY +GV     F    G E L H V  VGYG
Sbjct: 241 PANNENALQKAVANQPISVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHGVTAVGYG 294

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           +     ++    YW++KNSWG  WGE+GY R+ R     +G CGI
Sbjct: 295 I-----SNDGTKYWLVKNSWGTDWGEEGYIRMQRSIDAAEGLCGI 334


>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
 gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
          Length = 336

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 126/314 (40%), Positives = 169/314 (53%), Gaps = 33/314 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAE 98
           LF  ++ +H K Y ++ E + R  IF  NL  I    +T      Y  GLNEFSDLS  E
Sbjct: 32  LFESWISKHGKIYESIEEKWLRFEIFKDNLFHID---ETNKKVVNYWLGLNEFSDLSHEE 88

Query: 99  FQAKYLGFKLKPSYADRSVPAMIPN----ITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           F+ KYLG K+  S  +R   +   N    +++P++ DWR+  AVT VK+Q  CGS WAFS
Sbjct: 89  FKNKYLGLKVDMS--ERRECSQEFNYKDVMSIPKSVDWRKKGAVTDVKNQGSCGSCWAFS 146

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKT 213
           T   +EG+    T  L SLSEQEL+DCD  ++ GC GG +  AF  I+S   GGL +E  
Sbjct: 147 TVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAFSYIISN--GGLHKEVD 204

Query: 214 YPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYV 270
           YPY  ++  C + K+ ++ V I+GY  V ++  +     + N P++VAI A     QFY 
Sbjct: 205 YPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRDFQFYS 264

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            GV      F       L H V  VGYG      +   + Y I+KNSWG  WGEKGY R+
Sbjct: 265 GGV------FDGHCGTQLDHGVAAVGYG------STNGLDYIIVKNSWGSKWGEKGYIRM 312

Query: 331 YRGDGS----CGIN 340
            R  G     CGIN
Sbjct: 313 KRNTGKPAGLCGIN 326


>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
          Length = 359

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 118/306 (38%), Positives = 169/306 (55%), Gaps = 31/306 (10%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H+     L E   R ++F  N+  I  +   +    +  LN F+D++  EF+ ++   K+
Sbjct: 46  HHTVSRDLSEKRKRFNVFKANVHHIHKVNQKDKPYKLK-LNSFADMTNHEFR-EFYSSKV 103

Query: 109 KP---SYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
           K     +  R+    +   T  LP + DWR+  AVTGVK+Q  CGS WAFST   +EG+ 
Sbjct: 104 KHYRMLHGSRANTGFMHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGIN 163

Query: 164 AAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC 223
             KT +LVSLSEQEL+DC+ +++GC GG + NA++ I  K  GG+  E+ YPY+  D +C
Sbjct: 164 KIKTGQLVSLSEQELVDCETDNEGCNGGLMENAYEFI--KKSGGITTERLYPYKARDGSC 221

Query: 224 RLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFF 280
             +K  A  V I+G+  V  ++ +     V N P++VAI+A    +QFY  GV     + 
Sbjct: 222 DSSKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGV-----YA 276

Query: 281 CDGGNENLSHSVLIVGYG--VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----- 333
            D     L H V +VGYG  +D TK       YWI+KNSWG GWGE+GY R+ RG     
Sbjct: 277 GDSCGNELDHGVAVVGYGTALDGTK-------YWIVKNSWGTGWGEQGYIRMQRGVDAAE 329

Query: 334 DGSCGI 339
            G CGI
Sbjct: 330 GGVCGI 335


>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
          Length = 363

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 120/315 (38%), Positives = 169/315 (53%), Gaps = 23/315 (7%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           L   +H   F  F  ++ K+Y +  E   R  IFS +L +++   + +  S   G+N +S
Sbjct: 53  LGRSRHALRFARFAVRYGKSYESAAEVQRRFRIFSESLEEVRST-NQKGLSYRLGINRYS 111

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
           D+S  EFQA  LG     S   R    M     LP   DWRE   V+ VKDQ+ CGS W 
Sbjct: 112 DMSWEEFQASRLGAAQTCSATLRGNHRMQDANALPETKDWREDGIVSPVKDQSHCGSCWT 171

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEE 210
           FSTTG +E  Y   T K +SLSEQ+L+DC     + GC GG  S AF+ I  K  GGL+ 
Sbjct: 172 FSTTGALEAAYTQATGKNISLSEQQLVDCAGAYNNFGCNGGLPSQAFEYI--KYNGGLDT 229

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVS---RDETDMAKYLVENGPMAVA---INAY 264
           E++YPY+G +  C    +   V++   V+++    DE   A  LV   P++VA   IN +
Sbjct: 230 EESYPYKGVNGVCHYKPENAAVQVLDSVNITLNAEDELQNAVGLVR--PVSVAFEVINGF 287

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
             + Y +GV       C    ++++H+VL VGYGV+         PYW+IKNSWGE WG+
Sbjct: 288 --RQYKSGVY--TSDHCGTTPDDVNHAVLAVGYGVE------NGTPYWLIKNSWGESWGD 337

Query: 325 KGYFRLYRGDGSCGI 339
           KGYF++ RG   C +
Sbjct: 338 KGYFKMERGKNMCAV 352


>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 452

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 124/338 (36%), Positives = 177/338 (52%), Gaps = 31/338 (9%)

Query: 15  LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
           L +S+S   V   +   +    +   ++  +L ++ K Y  L E  +R  IF+ NL+ I+
Sbjct: 18  LLISLSLGSVTAADTTRNEAEARR--MYEQWLVENRKNYNGLGEKETRFEIFTDNLKYIE 75

Query: 75  LLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK----PSYADRSVPAMIPNITLPRAF 130
                 + +   GL  F+DL+  EF+A YL  K++    P   +R +  +    TLP   
Sbjct: 76  EHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGERYLYKV--GDTLPDQI 133

Query: 131 DWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCE 189
           DWR   AV  VKDQ  CGS WAFS  G +EG+   KT +L+SLSEQEL+DCD   + GC 
Sbjct: 134 DWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCDTSYNGGCG 193

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQ-VKINGYVSVSRDETDM 247
           GG +  AF  I+    GG++ E+ YPY   DD  C  +KK ++ V I+GY  V +++   
Sbjct: 194 GGLMDYAFKFIIEN--GGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVPQNDEKS 251

Query: 248 AKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFT 305
            K  + N P++VAI A   A Q Y +GV      F      +L H V+ VGYG      +
Sbjct: 252 LKKALANQPISVAIEAGGRAFQLYKSGV------FTGTCGTSLDHGVVAVGYG------S 299

Query: 306 HKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
                YWI++NSWG  WGE GYF+L R      G CG+
Sbjct: 300 EGGQDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGV 337


>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
          Length = 328

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 123/317 (38%), Positives = 166/317 (52%), Gaps = 28/317 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT----EHGSGVYGL--NEFSDLSTAE 98
           F  +H + YA++ E   RL +F  N    Q + D     E+G   + L  N+F D+++ E
Sbjct: 27  FKAEHGRRYASVQEERYRLSVFEQNQ---QFIDDHNARFENGEVTFTLQMNQFGDMTSEE 83

Query: 99  FQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           F A   GF   PS    ++    P+ TLP+  DWR   AVT VKDQ  CGS WAFSTTG+
Sbjct: 84  FTATMNGFLNVPSRRPTAILRADPDETLPKEVDWRTKGAVTPVKDQKQCGSCWAFSTTGS 143

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           +EG +  K  KLVSLSEQ L+DC  +  + GC GG +  AF  I  K   G++ E +YPY
Sbjct: 144 LEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYI--KANKGIDTEDSYPY 201

Query: 217 RGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGV 273
              D  CR +         GYV V    E+ + K +   GP++VAI+A   + QFY  GV
Sbjct: 202 EAQDGKCRFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVAIDASQPSFQFYHDGV 261

Query: 274 SHPIQFFCDGGNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
                ++ +G +   L H VL VGYG      T K   YW++KNSW   WG KGY ++ R
Sbjct: 262 -----YYEEGCSSTMLDHGVLAVGYGE-----TEKGEAYWLVKNSWNTSWGNKGYIQMSR 311

Query: 333 G-DGSCGINDYVRSALV 348
               +CGI       LV
Sbjct: 312 DKKNNCGIASQASYPLV 328


>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
          Length = 341

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 119/331 (35%), Positives = 171/331 (51%), Gaps = 28/331 (8%)

Query: 22  FMVVGDEKL--HHLHHVKHTALFNY--FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQ 77
            + VG  ++    LH  + + +  +  ++ +++K Y    E   R  IF  N+  I+   
Sbjct: 17  LLAVGISRVISRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVEFIESFN 76

Query: 78  DTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYD 136
              +     G+N  +DL+  EF+A   G K    Y   +      N+T +P + DWR+  
Sbjct: 77  AAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTSFKYENVTAIPASVDWRKKG 136

Query: 137 AVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSIS 194
           AVT +KDQ  CGS WAFST    EG++   T KLVSLSEQEL+DCD++  D GCEGG + 
Sbjct: 137 AVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGTDQGCEGGYME 196

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVEN 254
           + F+ I+    GG+  E  YPY+  D +C+ N  A   +I GY  V  +        V N
Sbjct: 197 DGFEFIIKN--GGITTEANYPYKAVDGSCK-NATAPAAQIKGYEKVPVNSEKALLKAVAN 253

Query: 255 GPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
            P++V+I+A   +  FY +G+     F  + G E L H V  VGYG            YW
Sbjct: 254 QPVSVSIDAADGSFMFYSSGI-----FTGECGTE-LDHGVTAVGYG------RANGTDYW 301

Query: 313 IIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           I+KNSWG  WGE+GY R+ RG    +G CGI
Sbjct: 302 IVKNSWGTVWGEQGYIRMQRGIAAKEGLCGI 332


>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
 gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
          Length = 474

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 126/350 (36%), Positives = 180/350 (51%), Gaps = 37/350 (10%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATL--VEYYSRLHIFS 67
            A  +L +S+ S+     +K       +   ++  +  +H K    +   E   R  IF 
Sbjct: 21  TATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNIDGSEKDKRFEIFK 80

Query: 68  GNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKP---------SYADRSVP 118
            NL+ I    + E+ +   GLN F+DLS  E++++YLG K+ P         + ++R  P
Sbjct: 81  DNLKFIDE-HNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMMARTKTRSNRYAP 139

Query: 119 AMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
           ++     LP++ DWR   AV  VKDQ  CGS WAFST   +EG+    T +LVSLSEQEL
Sbjct: 140 SV--GDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVTGELVSLSEQEL 197

Query: 179 IDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKING 236
           +DCD+  + GC+GG +  AF+ I++   GG++ ++ YPYRG D  C +  K A  V I+ 
Sbjct: 198 VDCDRTVNAGCDGGLMEYAFEFIINN--GGIDSDEDYPYRGVDGKCDQYKKNARVVSIDD 255

Query: 237 YVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLI 294
           Y  V   +    K  V N P++VAI A     Q YV+G+      F       L H V  
Sbjct: 256 YEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGI------FTGKCGTALDHGVTA 309

Query: 295 VGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSCGI 339
           VGYG      T   V YWI++NSWG+ WGE GY R+ R       G CGI
Sbjct: 310 VGYG------TENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGI 353


>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
          Length = 341

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 122/319 (38%), Positives = 160/319 (50%), Gaps = 24/319 (7%)

Query: 31  HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
            +LH          ++ Q+ + Y    E   R  IF  N+ +I+        S    +NE
Sbjct: 28  RNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINE 87

Query: 91  FSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGS 149
           F+DL+  EF+A    FK     +  +      N+T +P   DWR+  AVT +KDQ  CGS
Sbjct: 88  FADLTNEEFRASRNRFKAHIC-STEATSFKYENVTAVPSTVDWRKKGAVTPIKDQGQCGS 146

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGG 207
            WAFS    +EG+    T KL+SLSEQEL+DCD   ED GC GG + +AF  I  +   G
Sbjct: 147 CWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFI--EQNHG 204

Query: 208 LEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--Y 264
           L  E  YPY G D  C   K A    KINGY  V  +     +  V + P+AVAI+A   
Sbjct: 205 LTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGS 264

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
             QFY +GV     F    G E L H V  VGYG      +   + YW++KNSWG GWGE
Sbjct: 265 EFQFYSSGV-----FTGQCGTE-LDHGVSAVGYGT-----SDDGMKYWLVKNSWGTGWGE 313

Query: 325 KGYFRLYRG----DGSCGI 339
           +GY R+ R     +G CGI
Sbjct: 314 EGYIRMQRDVTAKEGLCGI 332


>gi|151547430|gb|ABS12459.1| cysteine protease Cp [Citrus sinensis]
          Length = 361

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 120/318 (37%), Positives = 167/318 (52%), Gaps = 24/318 (7%)

Query: 30  LHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--G 87
           L  +   +H   F  F  ++ K Y ++ E   R   FS NL    L++ T      Y  G
Sbjct: 50  LQVIGQARHALSFARFARRYGKIYESVEEMKLRFATFSKNL---DLIRSTNCKGLSYRLG 106

Query: 88  LNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMC 147
           LN+F+D S  EFQ   LG     S   +    +  ++ LP   DWRE   V+ VKDQ  C
Sbjct: 107 LNKFADWSWEEFQRHRLGAAQNCSATTKGNHKLTADV-LPETKDWRESGIVSPVKDQGHC 165

Query: 148 GSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLG 205
           GS W FSTTG++E  Y     K +SLSEQ+L+DC Q   + GC GG  S AF+ I  K  
Sbjct: 166 GSCWTFSTTGSLEAAYHQAFGKGISLSEQQLVDCAQAFNNQGCNGGLPSQAFEYI--KYN 223

Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS---RDETDMAKYLVENGPMAVAIN 262
           GGL+ E+ YPY G D  C+ + +   V++   V+++    DE   A  LV   P++VA  
Sbjct: 224 GGLDTEEAYPYTGKDGVCKFSSENVGVQVLDSVNITLGAEDELQHAVGLVR--PVSVAFE 281

Query: 263 AY-ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEG 321
                +FY +GV    +  C     +++H+V+ VGYGV+        VPYW+IKNSWGE 
Sbjct: 282 VVDGFRFYKSGVYSSTK--CGNTPMDVNHAVVAVGYGVE------DGVPYWLIKNSWGEN 333

Query: 322 WGEKGYFRLYRGDGSCGI 339
           WG+ GYF++  G   CGI
Sbjct: 334 WGDHGYFKIKMGKNMCGI 351


>gi|426345827|ref|XP_004040600.1| PREDICTED: cathepsin O [Gorilla gorilla gorilla]
          Length = 321

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
           R +  L  +E+ +  YG+N+FS L   EF+A YL  + KPS   R    V   IPN++LP
Sbjct: 52  RYLNSLFPSENSTAFYGINQFSHLFPEEFKAIYL--RSKPSKFPRYSAEVHMSIPNVSLP 109

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
             FDWR+   VT V++Q MCG  WAFS  G +E  YA K K L  LS Q++IDC   + G
Sbjct: 110 LRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 169

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
           C GGS  NA +  ++K+   L ++  YP++  +  C   +   +   I GY +   S  E
Sbjct: 170 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAHDFSNQE 228

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
            +MAK L+  GP+ V ++A + Q Y+ G+   IQ  C  G  N  H+VLI G+  D+T  
Sbjct: 229 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 281

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           T    PYWI++NSWG  WG  GY  +  G   CGI D V S  V
Sbjct: 282 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321


>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
          Length = 352

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 139/350 (39%), Positives = 182/350 (52%), Gaps = 37/350 (10%)

Query: 6   FFAGVALLSLTVSVSSFMVVG--DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
            F  V++L+ +     F ++G   E L  +H V H  LF  +L +H+K Y +L E   R 
Sbjct: 13  LFLFVSILACSPLAHEFSILGYAPEDLTSIHKVIH--LFESWLVKHSKFYESLDEKLHRF 70

Query: 64  HIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLK-PSYADRSVPAM 120
            IF  NL+ I    +T      Y  GLNEF+DL+  EF+ K+LGFK +     D S    
Sbjct: 71  EIFMDNLKHID---ETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERKDESSKEF 127

Query: 121 --IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
                + LP++ DWR+  AV  VK+Q  CG+ WAFST   +EG+    T  L  LSEQEL
Sbjct: 128 GYRDFVDLPKSVDWRKKGAVAPVKNQGQCGNCWAFSTVAAVEGINQIVTGNLTMLSEQEL 187

Query: 179 IDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKING 236
           IDCD   ++GC GG +  AF  +M     GL +E+ YPY   +  C   K  ++ V I+G
Sbjct: 188 IDCDTTFNNGCNGGLMDYAFAYVMR---SGLHKEEEYPYIMSEGTCDEKKDVSEKVTISG 244

Query: 237 YVSVSR-DETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           Y  V R DE    K L  N P++VAI A     QFY  GV     F    G E L H V 
Sbjct: 245 YHDVPRNDEASFLKALA-NQPISVAIEASGRDFQFYSGGV-----FDGHCGTE-LDHGVA 297

Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGI 339
            VGYG      T K + Y I++NSWG  WGEKGY R+ RG G     CG+
Sbjct: 298 AVGYG------TTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGL 341


>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
          Length = 343

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 121/341 (35%), Positives = 169/341 (49%), Gaps = 35/341 (10%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           +A L+  V+  S     D  ++  H          ++ ++ K Y    E   R  IF  N
Sbjct: 18  MAFLAFQVTCRSLQ---DASMYERHE--------QWMTRYGKVYKDPQEREKRFRIFKEN 66

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAMIPNIT-LP 127
           +  I+   +  +      +N+F+DL+  EF A    FK    S   R+      N+T +P
Sbjct: 67  VNYIEAFNNAANKRYKLAINQFADLTNEEFIAPRNRFKGHMCSSIIRTTTFKYENVTAVP 126

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--D 185
              DWR+  AVT +KDQ  CG  WAFS     EG++A  + KL+SLSEQEL+DCD +  D
Sbjct: 127 STVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVD 186

Query: 186 DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVK-INGYVSVSRDE 244
            GCEGG + +AF  ++     GL  E  YPY+G D  C +N+ A     I GY  V  + 
Sbjct: 187 QGCEGGLMDDAFKFVIQN--HGLNTEANYPYKGVDGKCNVNEAANDAATITGYEDVPANN 244

Query: 245 TDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT 302
               +  V N P++VAI+A     QFY +GV      F       L H V  VGYGV   
Sbjct: 245 EKALQKAVANQPVSVAIDASGSDFQFYKSGV------FTGSCGTELDHGVTAVGYGV--- 295

Query: 303 KFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
             ++    YW++KNSWG  WGE+GY R+ RG    +G CGI
Sbjct: 296 --SNDGTEYWLVKNSWGTEWGEEGYIRMQRGVNSEEGLCGI 334


>gi|344295866|ref|XP_003419631.1| PREDICTED: cathepsin W-like [Loxodonta africana]
          Length = 376

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 117/336 (34%), Positives = 176/336 (52%), Gaps = 41/336 (12%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           +F  F  Q+N++Y+   E+  RL IF+ NL + Q LQ+ + G+  +G+  FSDL+  EF+
Sbjct: 41  VFALFQLQYNRSYSNPAEHARRLDIFARNLAQAQQLQEEDLGTAKFGVTPFSDLTEEEFR 100

Query: 101 AKYLGFKLKPSYADRSVPAMIPNIT-----------LPRAFDWREY-DAVTGVKDQTMCG 148
                      Y  +  P   PN++           +P   DWR+  + +  V++Q  C 
Sbjct: 101 Q---------VYGQQKAPGRAPNVSRKAGPKEWGRPVPATCDWRKMANVIKPVRNQKNCK 151

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGL 208
             WA +  GNIE ++  K  + V +S QEL+DC +  DGC GG + +AF T+++    GL
Sbjct: 152 CCWAMAVAGNIEALWGIKYSQSVEVSVQELLDCGRCGDGCGGGFVWDAFITVLN--NSGL 209

Query: 209 EEEKTYPYRGDDKA--CRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL 266
             EK YP++G+ KA  C+  K      I  ++ +  DE  +A YL   GP+ V IN   L
Sbjct: 210 ASEKDYPFQGNVKAHKCQAKKHTNVAWIQDFIMLQDDEQIIAGYLATQGPITVTINMKLL 269

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRT--------------KFTHKAVPYW 312
           Q Y  GV       CD     ++HSVL+VG+G  ++                  +++PYW
Sbjct: 270 QHYQKGVIRAKSNDCDP--HRVNHSVLLVGFGKGKSVARMPAETPQGGAPAHPSRSIPYW 327

Query: 313 IIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           I+KNSWG  WGE+GYFRL+RG  +CGI  Y  +A V
Sbjct: 328 ILKNSWGSNWGEEGYFRLHRGSNTCGITKYPLTARV 363


>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
          Length = 334

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 114/312 (36%), Positives = 171/312 (54%), Gaps = 30/312 (9%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTA 97
           F  +     K+Y+  VE  +R  ++  N    ++L D  +G+G++    G+N F+DL+  
Sbjct: 30  FEAWKRTFGKSYSDAVEEINRRAVWEAN----KMLVDAHNGAGIHSYTLGMNIFADLTHE 85

Query: 98  EFQAKYLGFKL---KPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
           EF+  YLG K+   +P     S      N+  LP + DWR    VT VKDQ  CGS W+F
Sbjct: 86  EFKRFYLGTKVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSF 145

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEE 211
           STTG++EG +A KT +LVSLSEQ L+DC   Q + GC GG + +AF  I++    G++ E
Sbjct: 146 STTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNK--GIDTE 203

Query: 212 KTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQF 268
            +YPY   D  C+ N       ++ +  ++R  E+D+   +   GP++VAI+A   + Q 
Sbjct: 204 ASYPYTAKDGTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQL 263

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +GV +  +  C   + +L H VL  GYG      T    PYW++KNSWG  WG+ GY 
Sbjct: 264 YTSGVYNEKK--CS--STSLDHGVLAAGYG------TSNGTPYWLVKNSWGSSWGQAGYI 313

Query: 329 RLYR-GDGSCGI 339
            + R  +  CGI
Sbjct: 314 WMSRNANNQCGI 325


>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
          Length = 345

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 108/302 (35%), Positives = 159/302 (52%), Gaps = 25/302 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM--------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           G  +  SY   S  +         + +  +P   DWRE  AVT VK Q  CG  WAFS  
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAV 161

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           G++EG Y   T KL+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y Y
Sbjct: 162 GSLEGAYKIATGKLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEY 219

Query: 217 RGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSH 275
            G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G   
Sbjct: 220 LGEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY- 277

Query: 276 PIQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD 334
                 DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  
Sbjct: 278 ------DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDS 326

Query: 335 GS 336
           G+
Sbjct: 327 GN 328


>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
          Length = 347

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 112/310 (36%), Positives = 160/310 (51%), Gaps = 28/310 (9%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV-YGLNEFSDLSTAEFQAKY 103
           ++ +H + YA + E  +R  +F  N+ +I+ L +   G      +N+F+DL+  EF++ Y
Sbjct: 42  WMAKHGRVYADMKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMY 101

Query: 104 LGFK----LKPSYADRSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
            G+K    L      ++      N++   LP + DWR+  AVT +K+Q  CG  WAFS  
Sbjct: 102 TGYKGGSVLSSQSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAV 161

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
             IEG    K  KL+SLSEQ+L+DCD  D GC GG +  AF+ IM+   GGL  E  YPY
Sbjct: 162 AAIEGATKIKKGKLISLSEQQLVDCDTNDFGCSGGLMDTAFEHIMAT--GGLTTESNYPY 219

Query: 217 RGDDKACRL-NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGV 273
           +G D  C++ N K T   I GY  V  ++       V + P+++ I    +  QFY +GV
Sbjct: 220 KGKDATCKIKNTKPTATSITGYEDVPVNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGV 279

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                 F       L H+V  VGYG      +     YWIIKNSWG  WGE GY R+ + 
Sbjct: 280 ------FTGECTTYLDHAVTAVGYGQ-----SSNGSKYWIIKNSWGTKWGESGYMRIKKD 328

Query: 334 ----DGSCGI 339
                G CG+
Sbjct: 329 VKDKKGLCGL 338


>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
 gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
           Precursor
 gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
 gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
 gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
 gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
 gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
          Length = 356

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 130/326 (39%), Positives = 173/326 (53%), Gaps = 43/326 (13%)

Query: 35  HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY-GLNEFSD 93
           H K   LF  ++    K Y T+ E + R  +F  NL+ I   +  + G   + GLNEF+D
Sbjct: 44  HDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHID--ETNKKGKSYWLGLNEFAD 101

Query: 94  LSTAEFQAKYLGFKL-------KPSYAD---RSVPAMIPNITLPRAFDWREYDAVTGVKD 143
           LS  EF+  YLG K        + SYA+   R V A      +P++ DWR+  AV  VK+
Sbjct: 102 LSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA------VPKSVDWRKKGAVAEVKN 155

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMS 202
           Q  CGS WAFST   +EG+    T  L +LSEQELIDCD   ++GC GG +  AF+ I+ 
Sbjct: 156 QGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVK 215

Query: 203 KLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSV-SRDETDMAKYLVENGPMAVA 260
              GGL +E+ YPY  ++  C + K  ++ V ING+  V + DE  + K L    P++VA
Sbjct: 216 N--GGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQ-PLSVA 272

Query: 261 INAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           I+A     QFY  GV      F      +L H V  VGYG      + K   Y I+KNSW
Sbjct: 273 IDASGREFQFYSGGV------FDGRCGVDLDHGVAAVGYG------SSKGSDYIIVKNSW 320

Query: 319 GEGWGEKGYFRLYRG----DGSCGIN 340
           G  WGEKGY RL R     +G CGIN
Sbjct: 321 GPKWGEKGYIRLKRNTGKPEGLCGIN 346


>gi|33242876|gb|AAQ01142.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  189 bits (480), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 127/331 (38%), Positives = 172/331 (51%), Gaps = 32/331 (9%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
           + H   +  +  QH K Y T  E YSR  I   N  KI      EH         S    
Sbjct: 18  LPHNKEWEMWKLQHGKQYETEAEEYSRRFILEKNTVKI-----AEHNIRASLGMHSYTLA 72

Query: 88  LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           +N+F D+   EF  + +G  LK          V     N TLP++ DWR    V+ VKDQ
Sbjct: 73  MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
             CGS WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++  + GC GG +  AF  I  
Sbjct: 133 GECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYI-- 190

Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
           K  GGL+ E++YPY   DDK C+ +  +    + GY  V S +E  + + +   GP++VA
Sbjct: 191 KANGGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250

Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           I+A   + QFY +GV    Q  C    E L H VL VGYG      +H+A  +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303

Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           G  WG++GY  + R  +  CGI       LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334


>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
 gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
          Length = 345

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 118/354 (33%), Positives = 177/354 (50%), Gaps = 33/354 (9%)

Query: 1   MSCFYFFAGVALLSLTVSVSSF--MVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVE 58
           M+  +F   + ++ L  S+ S    +V    L  L  ++       ++  H + Y   +E
Sbjct: 1   MASNFFLKNITVVLLLFSILSLYPFIVTSRNLKELSMLER---HENWMVHHGRVYKDDIE 57

Query: 59  YYSRLHIFSGNLRKIQLLQDTEHGSGVYGL--NEFSDLSTAEFQAKYLGFKLKPSYADRS 116
              R   F  N+  I+     ++G+  Y L  N+++DL+T EF   ++G          S
Sbjct: 58  KEHRFKTFKENVEFIESFN--KNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQES 115

Query: 117 VPAMIP----NIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLV 171
                     ++T +P + DWR+  +VTGVKDQ +CG  WAFS    IEG Y     +L+
Sbjct: 116 TATTTSFKYDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELI 175

Query: 172 SLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
           SLSEQ+L+DC  ++ GCEGG ++ A+D ++   GGG+  E  YPY      C+  + A  
Sbjct: 176 SLSEQQLLDCSTQNKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKTEQPAA- 234

Query: 232 VKINGYVSVSRDETDMAKYLVENGPMAVAINAY-ALQFYVTGVSHPIQFFCDGG-NENLS 289
           V INGY  V  DE+ + K +V N P++V I A      Y +G+        DG  N  L+
Sbjct: 235 VTINGYEVVPSDESSLLKAVV-NQPISVGIAANDEFHMYGSGIY-------DGSCNSRLN 286

Query: 290 HSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           H+V ++GYG            YWI+KNSWG  WGE+GY R+ R      G CGI
Sbjct: 287 HAVTVIGYGTSE----EDGTKYWIVKNSWGSDWGEEGYMRIARDVGVDGGHCGI 336


>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
          Length = 336

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 131/351 (37%), Positives = 187/351 (53%), Gaps = 30/351 (8%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           VA+L++ +S +      D +L      +H  L+  +   H K Y    E + R+ ++  N
Sbjct: 4   VAVLAVCLSAALSAPSLDPQLD-----EHWDLWKSW---HTKKYHEKEEGWRRM-VWEKN 54

Query: 70  LRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN-I 124
           L+KI+L  + EH  G +    G+N F D++  EF+    G+K K     +    M PN +
Sbjct: 55  LKKIEL-HNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYKRKSERKFKGSLFMEPNFL 113

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
             PR+ DWR+   VT VKDQ  CGS WAFSTTG +EG +  KT KLVSLSEQ L+DC + 
Sbjct: 114 EAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRP 173

Query: 185 D--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV- 240
           +  +GC GG +  AF  I  K   GL+ E +YPY G DD+ C  + K       G++ + 
Sbjct: 174 EGNEGCNGGLMDQAFQYI--KDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIP 231

Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
           S  E  + K +   GP++VAI+A   + QFY +G    I +  +  +E L H VL+VGYG
Sbjct: 232 SGKERALMKAVAAVGPVSVAIDAGHESFQFYQSG----IYYEKECSSEELDHGVLVVGYG 287

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
            +      K   YWI+KNSW E WG+KGY  + +     CGI       LV
Sbjct: 288 FEGEDVDGKK--YWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336


>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
 gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
 gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
 gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
 gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
 gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
          Length = 371

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 124/310 (40%), Positives = 161/310 (51%), Gaps = 32/310 (10%)

Query: 47  EQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGL--NEFSDLSTAEFQAKYL 104
           ++H+       E + R   F  N+R I   +  + G   Y L  N F D+   EF+A + 
Sbjct: 50  QEHHHVPRHHGEKHRRFGAFKDNVRYIH--EHNKRGGRGYRLRLNRFGDMGREEFRATFA 107

Query: 105 GFKLKPSYADRSVPAMIPNIT------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
           G        D      +P         LPRA DWR   AVTGVKDQ  CGS WAFST  +
Sbjct: 108 GSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGSCWAFSTVVS 167

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           +EG+ A +T +LVSLSEQELIDCD  D+ GC+GG + NAF+ I  K  GG+  E  YPYR
Sbjct: 168 VEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYI--KHSGGITTESAYPYR 225

Query: 218 GDDKACRL--NKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGV 273
             +  C     ++A  V I+G+ +V  +        V N P++VAI+A   + QFY  GV
Sbjct: 226 AANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSFQFYSDGV 285

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                F  D G + L H V +VGYG      T+    YWI+KNSWG  WGE GY R+ R 
Sbjct: 286 -----FAGDCGTD-LDHGVAVVGYGE-----TNDGTEYWIVKNSWGTAWGEGGYIRMQRD 334

Query: 334 DGS----CGI 339
            G     CGI
Sbjct: 335 SGYDGGLCGI 344


>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
          Length = 330

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 117/308 (37%), Positives = 168/308 (54%), Gaps = 23/308 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ---LLQDTEHGSGVYGLNEFSDLSTAE 98
           +  +L+ H K Y    E   R+ I+ GNL  I+   L  D    S   G+NE+ D++  E
Sbjct: 27  WQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEE 85

Query: 99  FQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           F++   G+K++   +  S+     NI  LP   DWR    VT +K+Q  CGS W+FS TG
Sbjct: 86  FRSTMNGYKMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATG 145

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           ++EG    KT KL SLSEQ L+DC Q+  + GC+GG + +AF  I  K   G++ E +YP
Sbjct: 146 SLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYI--KDNSGIDTESSYP 203

Query: 216 YRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           Y   +  CR N        +G+  + S+ E+D+   +   GP++VAI+A   + Q Y +G
Sbjct: 204 YEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPISVAIDASHMSFQLYRSG 263

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V H  +FFC      L H VL VGYG +  K       YW++KNSWGE WG+KGY  + R
Sbjct: 264 VYH--EFFCS--ETRLDHGVLAVGYGTESGK------DYWLVKNSWGESWGQKGYIMMSR 313

Query: 333 GD-GSCGI 339
               +CGI
Sbjct: 314 NKRNNCGI 321


>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
 gi|223948637|gb|ACN28402.1| unknown [Zea mays]
 gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
          Length = 354

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 110/304 (36%), Positives = 150/304 (49%), Gaps = 21/304 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAK 102
           ++ +H +TY    E   R  +F  N   +            Y   LNEF+D++  EF A 
Sbjct: 54  WMAEHGRTYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAM 113

Query: 103 YLGFKLKPSYADRSVPAMIPNITLPRA------FDWREYDAVTGVKDQTMCGSSWAFSTT 156
           Y G +  P+ A +       N+TL  A       DWR+  AVTG+K+Q  CG  WAF+  
Sbjct: 114 YTGLRPVPAGAKKMAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAV 173

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
             +EG++   T  LVSLSEQ+++DCD E ++GC GG I NAF  I     GGL  E  YP
Sbjct: 174 AAVEGIHQITTGNLVSLSEQQVLDCDTEGNNGCNGGYIDNAFQYIAGN--GGLATEDAYP 231

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           Y      C+  +      I+GY  V   +       V N P++VAI+A+  Q Y  GV  
Sbjct: 232 YTAAQAMCQSVQPV--AAISGYQDVPSGDEAALAAAVANQPVSVAIDAHNFQLYGGGVMT 289

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                      NL+H+V  VGYG           PYW++KN WG+ WGE GY RL RG  
Sbjct: 290 AASCSTP---PNLNHAVTAVGYGT-----AEDGTPYWLLKNQWGQNWGEGGYLRLERGAN 341

Query: 336 SCGI 339
           +CG+
Sbjct: 342 ACGV 345


>gi|357619726|gb|EHJ72185.1| cathepsin [Danaus plexippus]
          Length = 1118

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 117/311 (37%), Positives = 171/311 (54%), Gaps = 25/311 (8%)

Query: 32   HLHHVKHT-ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNE 90
            HL+ ++    LF  F++ +NK Y    E   R  IF  NL+ I  + +    + VYG+N+
Sbjct: 808  HLYSLEEAPTLFEQFIKDYNKEYDE-SEKEERFKIFVNNLKDINAMNE-RSSNAVYGINK 865

Query: 91   FSDLSTAEFQAKYLGFKLK--PSYADRSVPAMIP--NITLPRAFDWREYDAVTGVKDQTM 146
            FSDLS  EF   Y G K +  PS  D     +    N+T P  FDWR+   V+ VK Q  
Sbjct: 866  FSDLSKDEFVKFYTGLKREESPSNEDHKKTDLPKSFNVTAPDQFDWRKKGVVSSVKFQGH 925

Query: 147  CGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGG-SISNAFDTIMSKLG 205
            C S WAFS  GN+E + A KT KL+ +SEQ+L+DCD+ + GC GG + S +  +   K G
Sbjct: 926  CVSCWAFSVAGNVESINAIKTGKLIDVSEQQLVDCDEWNFGCSGGIACSKSHFSYFHKKG 985

Query: 206  GGLEEEKTYPYRGDDKACRLNKKATQVKINGY---VSVSRDETDMAKYLVENGPMAVAIN 262
                E  +YPY G +  CR N     +++  Y   +++S DE  + +YL   GP+++ I+
Sbjct: 986  AMSLE--SYPYVGKEGQCRYNSSKVVIRLKDYQYFIALSEDE--IKEYLYNIGPLSIDID 1041

Query: 263  AYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGW 322
            +  +  Y  G+   +   C    +  +H+VL+VGYG +        V YWI+KNSWG+ W
Sbjct: 1042 SSQIHHYKGGI---VIKECQEVKKT-NHAVLLVGYGKEN------GVEYWIVKNSWGQNW 1091

Query: 323  GEKGYFRLYRG 333
            GEKGYFR+ RG
Sbjct: 1092 GEKGYFRIQRG 1102



 Score =  175 bits (444), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 107/306 (34%), Positives = 162/306 (52%), Gaps = 22/306 (7%)

Query: 25  VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
           +G  +L+ L       LF  F++ +NK Y    E   R  IF  NL+ I  + +    + 
Sbjct: 504 LGQRRLYSLEEA--PTLFEQFIKDYNKEYDE-SEKEERFKIFVNNLKDINAMNE-RSSNA 559

Query: 85  VYGLNEFSDLSTAEFQAKYLGFKLK--PSYADRSVPAMIP--NITLPRAFDWREYDAVTG 140
           VYG+N+FSDLS  EF   Y G K +  PS  D     +    N+T P  FDWR+   V+ 
Sbjct: 560 VYGINKFSDLSKEEFIKYYTGLKREESPSNEDHKKTDLPESFNVTAPDQFDWRKKGVVSS 619

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTI 200
           +K+Q  CGS WAFS  GN+E ++A KT KLV +SEQ+L+DCD +D GC GG   NA    
Sbjct: 620 IKNQKHCGSCWAFSAAGNVESIHAIKTGKLVHVSEQQLVDCDSQDSGCSGGLTWNAMRYF 679

Query: 201 MSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAV 259
            +     L   K+YPY   ++ CR +     +++  Y  +++  E  + ++L   G +++
Sbjct: 680 RTNGAVSL---KSYPYVAQNENCRYDSNKVVIRLKDYKHITQLSEDQIKEHLYNIGLLSI 736

Query: 260 AINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
            I +  L +Y  G+   +   C   +  + H+VL+V YG + +      V YWI+KNSWG
Sbjct: 737 DITSTQLTWYEGGI---LIEECRRSDL-VDHAVLLVEYGKENS------VEYWIVKNSWG 786

Query: 320 EGWGEK 325
           +  GEK
Sbjct: 787 QNGGEK 792



 Score =  175 bits (443), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 96/260 (36%), Positives = 141/260 (54%), Gaps = 26/260 (10%)

Query: 83  SGVYGLNEFSDLSTAEFQAKYLGFKLK--PSYADRSVPAMIP--NITLPRAFDWREYDAV 138
           + VYG+N+FSDLS  EF   Y G K +  PS  D     +    N+T P  FDWR+   V
Sbjct: 7   NAVYGINKFSDLSKEEFVKYYTGLKREESPSNEDHKKTDLPESFNVTAPDQFDWRKKGVV 66

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFD 198
           + +K+Q  CGS WAFS   N+E ++A KT KL+ +SEQ+L+DCD+ D GC GG     +D
Sbjct: 67  SSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDCDKYDSGCSGGL---PWD 123

Query: 199 TIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPM 257
            +   +  G    K+YPY   +  CR +    ++++  Y    +  E  + ++L   GP+
Sbjct: 124 ALRYFVANGAMSLKSYPYVAKEGKCRYDSSKVEIRLKEYKHKEKLSEDQIKEHLYNIGPL 183

Query: 258 AVAINAYALQFYVTGV----SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWI 313
           ++AI +  L  Y  G+     H            ++H+VL+VGYG +        V YWI
Sbjct: 184 SIAITSSPLASYNGGILIEECHRSYL--------INHAVLLVGYGKEN------GVKYWI 229

Query: 314 IKNSWGEGWGEKGYFRLYRG 333
           +KNSWG+ WGE GYFR+  G
Sbjct: 230 VKNSWGQNWGENGYFRMKMG 249



 Score =  125 bits (314), Expect = 3e-26,   Method: Compositional matrix adjust.
 Identities = 69/171 (40%), Positives = 96/171 (56%), Gaps = 8/171 (4%)

Query: 25  VGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSG 84
           +G  +L+ L       LF  F++ +NK Y    E   R  IF  NL+ I  + +    + 
Sbjct: 287 LGQRRLYSLEEA--PTLFEQFIKDYNKEYDE-SEKEERFKIFVNNLKDINAMNE-RSSNA 342

Query: 85  VYGLNEFSDLSTAEFQAKYLGFKL-KPSYADRSVPAMIP---NITLPRAFDWREYDAVTG 140
           VYG+N+FSDLS  EF   Y G K  + +  +      +P   NIT P  FDWR+   V+ 
Sbjct: 343 VYGINKFSDLSKEEFIKYYTGLKRDRCTTTEHHKSTDLPKSFNITAPDQFDWRKKGVVSS 402

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGG 191
           VK+Q  CGS WAFS   N+E ++A KT KL+ +SEQ+L+DCD+ D GC GG
Sbjct: 403 VKNQRHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDCDKYDSGCSGG 453


>gi|354472953|ref|XP_003498701.1| PREDICTED: cathepsin K [Cricetulus griseus]
          Length = 329

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 116/303 (38%), Positives = 166/303 (54%), Gaps = 23/303 (7%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
           H K Y + V+  SR  I+  NL+ I +  + E   GV+     +N   D+++ E   K  
Sbjct: 33  HRKQYNSKVDEISRRLIWEKNLKHISI-HNLEASLGVHTYELAMNHLGDMTSEEVVQKMT 91

Query: 105 GFKLKPSYADRSVPAMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           G KL PS++  +    IP      P A D+R+   VT VK+Q  CGS WAFS+ G +EG 
Sbjct: 92  GLKLPPSHSHSNDTLYIPEWEGRAPDAIDYRKKGYVTPVKNQGECGSCWAFSSAGALEGQ 151

Query: 163 YAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA 222
              KT KL++LS Q L+DC  E+ GC GG ++ AF  + +   GG++ E  YPY G D++
Sbjct: 152 LKKKTGKLLNLSPQNLVDCVSENYGCGGGYMTTAFRYVQTN--GGIDSEDAYPYVGQDQS 209

Query: 223 CRLNKKATQVKINGYVSVS-RDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQF 279
           C  N  A   K  GY  +    E  + + +   GP++V+I+A   + QFY  GV +    
Sbjct: 210 CMYNPTAKAAKCRGYREIPVGSEKALKRAVARVGPISVSIDASLTSFQFYSRGVYYDEN- 268

Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCG 338
            CDG  +N++H+VL+VGYG        K   +WIIKNSWGE WG KGY  L R  + +CG
Sbjct: 269 -CDG--DNVNHAVLVVGYGA------QKGNKHWIIKNSWGESWGNKGYVLLARNRNNACG 319

Query: 339 IND 341
           I +
Sbjct: 320 ITN 322


>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
          Length = 337

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 124/312 (39%), Positives = 171/312 (54%), Gaps = 22/312 (7%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
           H K Y    E + R+ ++  NL+KI+L  + EH  G +    G+N F D++  EF+    
Sbjct: 36  HGKKYHEKEEGWRRM-VWEKNLQKIEL-HNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMN 93

Query: 105 GFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
           G+K K     R    M PN + +P + DWRE   VT VKDQ  CGS WAFSTTG +EG  
Sbjct: 94  GYKHKKERRFRGSLFMEPNFLEVPNSLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQM 153

Query: 164 AAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DD 220
             KT KLVSLSEQ L+DC + +  +GC GG +  AF  I  +   GL+ E++YPY G DD
Sbjct: 154 FRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDQ--NGLDSEESYPYVGTDD 211

Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
           + C  + K +     G+V + S  E  + K +   GP++VAI+A   + QFY +G+ +  
Sbjct: 212 QPCHYDPKYSAANDTGFVDIPSGKEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEK 271

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
           +  C   +E L H VL VGYG +      K   YWI+KNSW E WG+KGY  + +     
Sbjct: 272 E--CS--SEELDHGVLAVGYGFEGEDVDGKK--YWIVKNSWSENWGDKGYVYMAKDRHNH 325

Query: 337 CGINDYVRSALV 348
           CGI       LV
Sbjct: 326 CGIATAASYPLV 337


>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
 gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
          Length = 336

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 134/351 (38%), Positives = 182/351 (51%), Gaps = 31/351 (8%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           +A+L++ +S  S     D +L           +  + E HNK Y    E + R+ ++  N
Sbjct: 5   LAVLAVCLSTVSAAPTVDREL--------DGHWQQWKEWHNKDYHEKEEGWRRM-VWEKN 55

Query: 70  LRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN-I 124
           L+KI+L  + EH  G +     +N F D+   EF+    G+K K     R    M PN +
Sbjct: 56  LKKIEL-HNLEHSLGKHSYRLAMNHFGDMPHEEFRQVMNGYKHKVRKI-RGSLFMEPNFL 113

Query: 125 TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
             P   DWRE   VT VKDQ  CGS WAFSTTG +EG    KT KLVSLSEQ L+DC + 
Sbjct: 114 EAPSKLDWREKGYVTPVKDQGQCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRP 173

Query: 185 D--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV- 240
           +  +GC GG +  AF  I  K  GGL+ EK YPY G DD+ C  +   +     G+V + 
Sbjct: 174 EGNEGCNGGLMDQAFQYI--KDNGGLDTEKFYPYLGTDDQPCHYDPSYSAANDTGFVDIP 231

Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
           S  E  + K +   GP++VAI+A   + QFY +G    I +  D  +E+L H VL+VGYG
Sbjct: 232 SGKEHALMKAVTAVGPVSVAIDAGHESFQFYQSG----IYYEADCSSEDLDHGVLVVGYG 287

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
            +      K   YWI+KNSW E WG KGY  + +     CGI       LV
Sbjct: 288 YEGENVDGKK--YWIVKNSWSEQWGNKGYIYMAKDRHNHCGIATAASYPLV 336


>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
          Length = 336

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 136/353 (38%), Positives = 188/353 (53%), Gaps = 32/353 (9%)

Query: 10  VALLSLTVSVSSFM--VVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFS 67
           + LL LT  +SS +   V D +L+     +H  L+  +   H+K Y    E + R+ ++ 
Sbjct: 2   LPLLVLTACLSSVLSAPVLDAQLN-----EHWDLWKSW---HSKKYHEKEEGWRRM-VWE 52

Query: 68  GNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN 123
            NL+KI+L  + EH  G +    G+N F D++  EF+    G+KLK          M PN
Sbjct: 53  KNLQKIEL-HNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLKTQRKFTGSLFMEPN 111

Query: 124 -ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
            +T P A DWRE   VT VKDQ  CGS WAFSTTG +EG    KT KLVSLSEQ L+DC 
Sbjct: 112 FMTAPSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCS 171

Query: 183 QED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVS 239
           + +  +GC GG +  AF  +      GL+ E +YPY G DD+ C  +         G+V 
Sbjct: 172 RPEGNEGCGGGLMDQAFQYVTDNQ--GLDSEDSYPYTGTDDQPCHYDPLYNSANDTGFVD 229

Query: 240 V-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
           V S  E  + K +   GP++VAI+A   + QFY +G+ +  +  C   +E L H VL VG
Sbjct: 230 VPSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKE--CS--SEELDHGVLAVG 285

Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           YG +      K   +WI+KNSWGE WG+KGY  + +     CGI       LV
Sbjct: 286 YGFEGEDKMGKK--FWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPLV 336


>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
           Full=Cysteine proteinase EP-C1; Flags: Precursor
 gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
          Length = 362

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 116/305 (38%), Positives = 166/305 (54%), Gaps = 28/305 (9%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H+    +L E + R ++F  NL  +      +    +  LN+F+D++  EF++ Y G K+
Sbjct: 46  HHTVSRSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLK-LNKFADMTNHEFRSTYAGSKV 104

Query: 109 KPSYADRSVP----AMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
                 R  P    A +    +++P + DWR+  AVT VKDQ  CGS WAFST   +EG+
Sbjct: 105 NHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGI 164

Query: 163 YAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
              KT KLV+LSEQEL+DCD+E++ GC GG + +AF+ I  K  GG+  E  YPY+  + 
Sbjct: 165 NQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQK--GGITTESNYPYKAQEG 222

Query: 222 ACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQ 278
            C  +K     V I+G+ +V  ++ D     V N P++VAI+A     QFY  GV     
Sbjct: 223 TCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV----- 277

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
            F    + +L+H V IVGYG      T     YWI++NSWG  WGE GY R+ R     +
Sbjct: 278 -FTGDCSTDLNHGVAIVGYGT-----TVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKE 331

Query: 335 GSCGI 339
           G CGI
Sbjct: 332 GLCGI 336


>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
          Length = 339

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 123/345 (35%), Positives = 169/345 (48%), Gaps = 31/345 (8%)

Query: 5   YFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLH 64
           Y +  +ALL +  + +S           LH          ++ ++ + Y    E   R  
Sbjct: 7   YQYVSMALLFILAAWAS-----QATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFK 61

Query: 65  IFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNI 124
           IF  N+ +I+        +    +NEF+DL+  EF++  L  + K      +      N+
Sbjct: 62  IFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRS--LRNRFKAHICSEATTFKYENV 119

Query: 125 T-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ 183
           T +P   DWR+  AVT +KDQ  CG  WAFS     EG+    T KL+SLSEQEL+DCD 
Sbjct: 120 TAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDT 179

Query: 184 --EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKA-TQVKINGYVSV 240
             E+ GC GG + +AF  I      GL  E TYPY GDD  C   K+A    KI GY  V
Sbjct: 180 GGENQGCSGGLMDDAFRFIKIH---GLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDV 236

Query: 241 SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
             +     +  V + P+AVAI+A  +  QFY +GV     F    G E L H V  VGYG
Sbjct: 237 PANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGV-----FTGQCGTE-LDHGVAAVGYG 290

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           +         + YW++KNSWG GWGE+GY R+ R     +G CGI
Sbjct: 291 IG-----DDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGI 330


>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
          Length = 368

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 126/362 (34%), Positives = 187/362 (51%), Gaps = 52/362 (14%)

Query: 1   MSCFYFFAGVALLSLTVSVSSFMVVG---DEKLHHLHHVKHTALFNYFLEQHNKTYATLV 57
           M+   FF   +L++ ++++   +  G   DE +          ++  +L +H K Y  L 
Sbjct: 4   MTILPFFLFFSLITFSLALDIQLPTGRSNDEVM---------TMYEEWLVKHQKVYNGLR 54

Query: 58  EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
           E   R  IF  NL  I    + ++ + + GLN+F+D++  E++  YLG +     +D   
Sbjct: 55  EKDQRFQIFKDNLNFIDE-HNAQNYTYIVGLNKFADMTNEEYRDMYLGTR-----SDIKR 108

Query: 118 PAMIPNIT-----------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
             M   IT           LP   DWR   A+T +KDQ  CGS WAFST   +E +    
Sbjct: 109 RIMKNKITGHRYAYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIV 168

Query: 167 TKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR- 224
           T KLVSLSEQEL+DCD+  ++GC GG +  AF+ I+    GG++ ++ YPY+G +  C  
Sbjct: 169 TGKLVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIGN--GGIDTDQHYPYKGFEGRCDP 226

Query: 225 LNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCD 282
             KKA  V I+GY  V  +  +  K  V + P++VAI A   ALQ Y +GV      F  
Sbjct: 227 TRKKAKIVSIDGYEDVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGV------FTG 280

Query: 283 GGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGSC 337
               +L H+V+IVGYG      +   + YW+++NSWG  WGE GYF++ R       G C
Sbjct: 281 KCGTSLDHAVVIVGYG------SENGLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKC 334

Query: 338 GI 339
           GI
Sbjct: 335 GI 336


>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 108/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|340380715|ref|XP_003388867.1| PREDICTED: pro-cathepsin H-like [Amphimedon queenslandica]
          Length = 347

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 110/311 (35%), Positives = 169/311 (54%), Gaps = 16/311 (5%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLS 95
           V+  A F  +  +H KTYAT  EY  RL +++ N   ++ L +    +  + LN+F+DL+
Sbjct: 37  VQRAAEFERWTIKHKKTYATAEEYNWRLRVYTANHYYVKRLNEGHGPATEFELNQFADLT 96

Query: 96  TAEFQAKYLGFK---LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWA 152
            AEF+  YL       + +  +  +P    N+  P A DWR+ + +T V+DQ  CGS WA
Sbjct: 97  FAEFKRIYLSSSSQHCRATTGNFQMPVKKNNVEDPVAIDWRKRNVITPVRDQGSCGSCWA 156

Query: 153 FSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEE 210
           FS T  +    A KT +L+SLS+Q+L+DC +   + GC+GG  S AF+ I  +  GG+E 
Sbjct: 157 FSATSCLSAHLALKTGQLISLSKQQLLDCSRSFNNRGCKGGLPSQAFEYI--RYNGGIES 214

Query: 211 EKTYPYRGDDKACRLNKKATQVKINGYVSVSRD-ETDMAKYLVENGPMAVAINAY-ALQF 268
           E+ YPY+  ++ C          + G V+ ++  E D+A  L   GP+++ I++  +   
Sbjct: 215 ERDYPYKDREEKCHFKPSLVAATVTGVVNFTQGAEDDIAVALANIGPVSIGIHSTKSFAT 274

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y  G+       C      ++H+VLIVGY  D+T    K   YWI KNSWG  WG  GYF
Sbjct: 275 YKKGIYQGK--LCSKNPRKINHAVLIVGY--DQTASGEK---YWIGKNSWGTNWGMNGYF 327

Query: 329 RLYRGDGSCGI 339
            + RG  +CG+
Sbjct: 328 WIRRGHNACGL 338


>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
 gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
          Length = 335

 Score =  189 bits (479), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 116/310 (37%), Positives = 171/310 (55%), Gaps = 23/310 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTA 97
           +N +  QH K+Y   VE   R+ I+  NLRKI+   + E+  G +    G+N+F D++  
Sbjct: 28  WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQ-HNFEYSLGNHTFKMGMNQFGDMTNE 85

Query: 98  EFQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           EF+    G+K  P+   +    M P+    P+  DWR+   VT VKDQ  CGS W+FS+T
Sbjct: 86  EFRQAMNGYKQDPNRTSKGALFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSST 145

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           G +EG    KT KL+S+SEQ L+DC   Q + GC GG +  AF  +  K   GL+ E++Y
Sbjct: 146 GALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYV--KENKGLDSEQSY 203

Query: 215 PYRG-DDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYV 270
           PY   DD  CR + +    KI G+V + R +E  +   +   GP++VAI+A   +LQFY 
Sbjct: 204 PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQ 263

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +G+     ++       L H+VL+VGYG            YWI+KNSW + WG+KGY  +
Sbjct: 264 SGI-----YYERACTSRLDHAVLVVGYGYQGADVAGNR--YWIVKNSWSDKWGDKGYIYM 316

Query: 331 YRG-DGSCGI 339
            +  +  CGI
Sbjct: 317 AKDKNNHCGI 326


>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
 gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
           nagariensis]
          Length = 489

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 130/325 (40%), Positives = 173/325 (53%), Gaps = 25/325 (7%)

Query: 28  EKLHHLHHVKHTALFNYFLEQHNKTYAT-LVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY 86
           EKL         A F  ++ Q+ K YA  + E  +R  ++  NL  I L  +    S   
Sbjct: 31  EKLLLDAKANPMAAFQQWMMQYTKAYANDIKELETRFSVWLENLNYI-LAYNARTTSHWL 89

Query: 87  GLNEFSDLSTAEFQAKYLGFKLKPSYAD---RSVPAMIPNI---TLPRAFDWREYDAVTG 140
            LN F+DL+T EF+ + LG+  K   A    +S P +  N+    LP   DWR+  AVT 
Sbjct: 90  HLNAFADLTTDEFRNR-LGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDWRKKGAVTE 148

Query: 141 VKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDT 199
           VK+Q  CGS WAF+TTG++EG+ A  T +L SLSEQEL+DCD  ED GC GG +  A+  
Sbjct: 149 VKNQGQCGSCWAFATTGSVEGINAIVTGELASLSEQELVDCDTDEDRGCSGGLMDYAYQW 208

Query: 200 IMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMA 258
           I+    GGL+ E  YPY  +D  C   KK  + V I+GYV +  ++    K    + P+A
Sbjct: 209 IIKN--GGLDTEDDYPYTAEDGVCVAAKKNRRVVTIDGYVDIPENDEVALKKAAAHQPIA 266

Query: 259 VAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           VAI A A  F + G        C     +L+H VL+VGYG D   F +    YWI+KNSW
Sbjct: 267 VAIEADAKSFQLYGGGVYDDPTC---GTSLNHGVLVVGYGKD-PHFGN----YWIVKNSW 318

Query: 319 GEGWGEKGYFRLYRG----DGSCGI 339
           G  WG+ GY RL  G     G CGI
Sbjct: 319 GPEWGDNGYIRLRMGAEDVQGMCGI 343


>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
          Length = 361

 Score =  188 bits (478), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 116/305 (38%), Positives = 166/305 (54%), Gaps = 28/305 (9%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           H+    +L E + R ++F  NL  +      +    +  LN+F+D++  EF++ Y G K+
Sbjct: 45  HHTVSRSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLK-LNKFADMTNHEFRSTYAGSKV 103

Query: 109 KPSYADRSVP----AMI--PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
                 R  P    A +    +++P + DWR+  AVT VKDQ  CGS WAFST   +EG+
Sbjct: 104 NHHRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGI 163

Query: 163 YAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
              KT KLV+LSEQEL+DCD+E++ GC GG + +AF+ I  K  GG+  E  YPY+  + 
Sbjct: 164 NQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQK--GGITTESNYPYKAQEG 221

Query: 222 ACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQ 278
            C  +K     V I+G+ +V  ++ D     V N P++VAI+A     QFY  GV     
Sbjct: 222 TCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV----- 276

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
            F    + +L+H V IVGYG      T     YWI++NSWG  WGE GY R+ R     +
Sbjct: 277 -FTGDCSTDLNHGVAIVGYGT-----TVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKE 330

Query: 335 GSCGI 339
           G CGI
Sbjct: 331 GLCGI 335


>gi|323454466|gb|EGB10336.1| hypothetical protein AURANDRAFT_22962 [Aureococcus anophagefferens]
          Length = 416

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 124/330 (37%), Positives = 183/330 (55%), Gaps = 30/330 (9%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           +LF+ F+++++K+Y T  EY  R  IFS NL  I  L +T++   ++GLN F+D +  E 
Sbjct: 87  SLFDQFIDEYSKSYDTTHEYNDRFTIFSKNLNYIDAL-NTQNPHALFGLNVFADQTEEER 145

Query: 100 QAKYLGFKLKPSY--------ADRSVPAMIPNI------TLPRAFDWREYDAVTGVKDQT 145
             + +      +Y        +D +   + P         LP  FDWRE  AVT VK+Q 
Sbjct: 146 SKRRMTDPSITNYTRVGWASGSDCAACNLYPAFGEYDMGNLPDDFDWRELGAVTRVKNQA 205

Query: 146 MCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLG 205
            CGS W+FST  ++EG +   T  L S + Q+L++C+  + GC+GG    A    +S  G
Sbjct: 206 YCGSCWSFSTAADLEGTHYLATGDLESYAPQQLVECNTMNLGCDGGYPFAAM-QYLSHFG 264

Query: 206 GGLEEEKTYPYRGDDKACRLNKKATQ---VKINGY--VSVSRD-ETDMAKYLVENGPMAV 259
           G +  E T PY+   K   LN+K        I+G+  V++  D E+ M   LV+NGP+++
Sbjct: 265 GMVTWE-TMPYK---KIELLNEKLEDGDVAHISGWQMVAMGADYESLMRVTLVKNGPLSI 320

Query: 260 AINAYALQFYVTGVSHPIQFF-CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           A NA  + +YV GV      F CD    +L H+VL+VGYGV  T    K VPYW+IKNSW
Sbjct: 321 AFNANGMDYYVHGVDGDGDMFTCD--PTSLDHAVLVVGYGVQHTDGNGK-VPYWVIKNSW 377

Query: 319 GEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
            + WGE GY+RL RG  +CG+ + V  ++V
Sbjct: 378 DDVWGEDGYYRLVRGSNACGVANMVVHSIV 407


>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
 gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
          Length = 344

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 123/317 (38%), Positives = 165/317 (52%), Gaps = 29/317 (9%)

Query: 35  HVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSD 93
           H +H    N++     K Y    E   R  IF+ N++ I+   + ++  S   G+N+F+D
Sbjct: 36  HERHERWMNHY----GKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFAD 91

Query: 94  LSTAEFQAKYLGFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSW 151
           L+  EF A    FK    S   R+      N++ +P   DWR+  AVT VK+Q  CG  W
Sbjct: 92  LTNEEFVASRNKFKGHMCSSIIRTTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCW 151

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLE 209
           AFS     EG++   T KLVSLSEQEL+DCD +  D GCEGG + +AF  I+     GL 
Sbjct: 152 AFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNH--GLN 209

Query: 210 EEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--L 266
            E  YPY+G D  C  NK + Q   I GY  V  +     +  V N P++VAI+A     
Sbjct: 210 TEAQYPYQGVDGTCNANKASIQATTITGYEDVPANNEQALQKAVANQPISVAIDASGSDF 269

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           QFY +GV     F    G E L H V  VGYGV     ++    YW++KNSWG  WGE+G
Sbjct: 270 QFYKSGV-----FTGSCGTE-LDHGVTAVGYGV-----SNDGTKYWLVKNSWGTDWGEEG 318

Query: 327 YFRLYRG----DGSCGI 339
           Y  + RG    +G CGI
Sbjct: 319 YIMMQRGVEAAEGLCGI 335


>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
          Length = 333

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 124/349 (35%), Positives = 186/349 (53%), Gaps = 40/349 (11%)

Query: 4   FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
           FY       L L  +   F    D + H             +  QH +TYA   + + R 
Sbjct: 3   FYLCLASLCLGLVAATPEFDQTLDSQWHQ------------WKAQHRRTYAANEDGWRRA 50

Query: 64  HIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA 119
             +  NL+ I++  + E+ +G +    G+N+F D++T EF+    G+    S   R+  +
Sbjct: 51  -TWEKNLKMIEM-HNLEYSAGKHSFQLGMNKFGDMTTEEFKQVMNGYNSNGS-QKRTKGS 107

Query: 120 MIPN---ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
           +        LP++ DWRE   VT VK+Q  CGS WAFS TG++EG +  KTKKLVSLSEQ
Sbjct: 108 LYREPLLAQLPKSVDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQ 167

Query: 177 ELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKI 234
            L+DC   + ++GC GG + NAF+ +  K  GG++ E+ YPY G D  C+   + +   +
Sbjct: 168 NLVDCSTSEGNNGCSGGLMDNAFEYV--KNNGGIDTEQAYPYLGQDNECKYRAECSGANV 225

Query: 235 NGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHS 291
            G+V + S +E  + K +   GP++VAI+A   + QFY +GV +  Q  C   +  L H 
Sbjct: 226 TGFVDIPSMNERALMKAVANVGPISVAIDAGNPSFQFYESGVYYEPQ--CS--SSQLDHG 281

Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGI 339
           VL+VGYG      +     YWI+KNSWGE WG+KGY  + +  +  CGI
Sbjct: 282 VLVVGYG------SIGKDEYWIVKNSWGEEWGKKGYVLMAKFRNNHCGI 324


>gi|33242870|gb|AAQ01139.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 126/331 (38%), Positives = 173/331 (52%), Gaps = 32/331 (9%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
           + H   +  +  QH K Y T  E YSR  IF  N  KI      EH         S    
Sbjct: 18  LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72

Query: 88  LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           +N+F D+   EF  + +G  LK          V     N TLP++ DWR    V+ VKDQ
Sbjct: 73  MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSDVGDNDDNGTLPKSVDWRNSHMVSEVKDQ 132

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
             CGS WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++  + GC GG +  AF  I +
Sbjct: 133 GECGSCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYITA 192

Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
              GGL+ E++YPY   DD+ C+ +  +    + GY  V S +E  + + +   GP++VA
Sbjct: 193 N--GGLDTEESYPYTATDDEPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250

Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           I+A   + QFY +GV    Q  C    E L H VL VGYG      +H+A  +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303

Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           G  WG++GY  + R  +  CGI       LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334


>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
          Length = 330

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 118/309 (38%), Positives = 167/309 (54%), Gaps = 25/309 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKI---QLLQDTEHGSGVYGLNEFSDLSTAE 98
           +N F +Q+NK Y    E   RL ++  NL  I    L  D    +   G+NE+ D++  E
Sbjct: 27  WNIFKKQYNKLYQNEEEARRRL-VWESNLDFITLHNLAADRGEHTFWVGMNEYGDMTNEE 85

Query: 99  FQAKYLGFKLKPSYADRSVPAMIPNIT--LPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           F     G++++   ++  V  M PN    LP   DWR    VT +K+Q  CGS W+FS T
Sbjct: 86  FTKTMNGYRMRNKTSNAPV-FMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSAT 144

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           G++EG    KT KLVSLSEQ L+DC   Q + GCEGG + +AF  I  K   G++ E +Y
Sbjct: 145 GSLEGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYI--KANNGIDTEASY 202

Query: 215 PYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
           PY+  D  C            G+V + ++DE  + + +   GP++VAI+A   + Q Y T
Sbjct: 203 PYKARDGKCEFKSADVGATDTGFVDIKTKDEEALKQAVATVGPISVAIDASHMSFQLYRT 262

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV H   +FC      L H VL VGYG + +K       YW++KNSWGE WG+KGY ++ 
Sbjct: 263 GVYH--DWFC--SQTKLDHGVLAVGYGTEDSK------DYWLVKNSWGESWGQKGYIQMS 312

Query: 332 RG-DGSCGI 339
           R    +CGI
Sbjct: 313 RNRRNNCGI 321


>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
 gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
          Length = 369

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 121/296 (40%), Positives = 157/296 (53%), Gaps = 27/296 (9%)

Query: 58  EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLK-----PSY 112
           E   R   F  N+R I              LN F D+   EF++ +   ++       S 
Sbjct: 57  EKGRRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDMGREEFRSTFADSRINDLRRAESP 116

Query: 113 ADRSVPA-MIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKL 170
           A  +VP  M   +T LP + DWR+  AVT VKDQ  CGS WAFST  ++EG+ A +T  L
Sbjct: 117 AAPAVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRTGSL 176

Query: 171 VSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR--LNKK 228
           VSLSEQELIDCD +++GC+GG + NAF+ I S   GG+  E  YPYR  +  C    +++
Sbjct: 177 VSLSEQELIDCDTDENGCQGGLMENAFEFIKSY--GGVTTESAYPYRASNGTCDSVRSRR 234

Query: 229 ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNE 286
              V I+G+  V     D     V N P++VAI+A   A QFY  GV     F  D G  
Sbjct: 235 GQIVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGV-----FTGDCGT- 288

Query: 287 NLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS---CGI 339
           +L H V  VGYGV     +     YWI+KNSWG  WGE GY R+ RG G+   CGI
Sbjct: 289 DLDHGVAAVGYGV-----SDDGTAYWIVKNSWGPSWGEGGYIRMQRGAGNGGLCGI 339


>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 126/324 (38%), Positives = 167/324 (51%), Gaps = 37/324 (11%)

Query: 38  HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG------LNEF 91
           + A F  +  +H K YAT  E  +RL  F+ N   +    D    SG  G      LN F
Sbjct: 35  YEAQFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAF 94

Query: 92  SDLSTAEFQAKYLGFKL-------KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           +DL+  EF+A  LG           PS +D      +  +  P A DWR+  AVT VKDQ
Sbjct: 95  ADLTHDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAV--PDALDWRQSGAVTKVKDQ 152

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSK 203
             CG+ W+FS TG +EG+    T  L+SLSEQELIDCD+  + GC GG ++ A+  ++  
Sbjct: 153 GSCGACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKN 212

Query: 204 LGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI- 261
             GG++ E  YP+R  D  C  NK K   V I+GY  V   + D+    V   P++V I 
Sbjct: 213 --GGIDTEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGIC 270

Query: 262 -NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
            +A A Q Y  G+      F      +L H+VLIVGYG +  K       YWI+KNSWGE
Sbjct: 271 GSARAFQLYSQGI------FDGPCPTSLDHAVLIVGYGSEGGK------DYWIVKNSWGE 318

Query: 321 GWGEKGYFRLYRGDGS----CGIN 340
            WG KGY  ++R  GS    CGIN
Sbjct: 319 RWGMKGYMHMHRNTGSSSGICGIN 342


>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
          Length = 351

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 123/337 (36%), Positives = 171/337 (50%), Gaps = 29/337 (8%)

Query: 27  DEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQL---LQDTEHGS 83
           D  L   + V    L+  F   H + Y    E   R  +F  NL+KI++   L      S
Sbjct: 29  DTILRFPNQVPFEKLWQDFKTVHERNYGE-TEEMQRKEVFRNNLKKIEMHNYLHSQGKSS 87

Query: 84  GVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS------VPAMIPNITLPRAFDWREYDA 137
              G+N+F+D+   EF +   GF++      R       +   IP ++LP   DWR+   
Sbjct: 88  YRMGINQFADMEVKEFASVVNGFRMNNRTKVRDHLHSHYISPAIP-VSLPAEVDWRKEGY 146

Query: 138 VTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISN 195
           VT +KDQ  CGS W+FSTTG +EG +  KT KLVSLSEQ LIDC     ++GC GG +  
Sbjct: 147 VTPIKDQGHCGSCWSFSTTGALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDY 206

Query: 196 AFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVEN 254
           AF  I  K   G + E +YPY   D  CR  K+       GY  + + DE  M + +   
Sbjct: 207 AFQYI--KDNDGDDTEDSYPYEAADGPCRFKKEYVGATDTGYTDLPKGDEEKMKEAVAMV 264

Query: 255 GPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
           GP++VAI+A   + Q Y +GV   ++  CD   E L H VL+VGYG      T     YW
Sbjct: 265 GPVSVAIDASHTSFQMYQSGVYDEVE--CDP--EGLDHGVLVVGYG------TELGQDYW 314

Query: 313 IIKNSWGEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           ++KNSWG  WG++GY ++ R  +  CGI+      LV
Sbjct: 315 LVKNSWGTKWGDEGYIKMSRNKNNQCGISSMASYPLV 351


>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
          Length = 343

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 109/303 (35%), Positives = 160/303 (52%), Gaps = 25/303 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF  K+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  + PSY   S  +        + +  +P   DWRE  AVT VK+Q  CG  WAFS  G
Sbjct: 102 GINI-PSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVG 160

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y+
Sbjct: 161 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISSESDYEYQ 218

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G    CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 219 GQQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 275

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 276 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 325

Query: 336 SCG 338
           + G
Sbjct: 326 NPG 328


>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 457

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 126/324 (38%), Positives = 167/324 (51%), Gaps = 37/324 (11%)

Query: 38  HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYG------LNEF 91
           + A F  +  +H K YAT  E  +RL  F+ N   +    D    SG  G      LN F
Sbjct: 35  YEAQFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAF 94

Query: 92  SDLSTAEFQAKYLGFKL-------KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           +DL+  EF+A  LG           PS +D      +  +  P A DWR+  AVT VKDQ
Sbjct: 95  ADLTHDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAV--PDALDWRQSGAVTKVKDQ 152

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSK 203
             CG+ W+FS TG +EG+    T  L+SLSEQELIDCD+  + GC GG ++ A+  ++  
Sbjct: 153 GSCGACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKN 212

Query: 204 LGGGLEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI- 261
             GG++ E  YP+R  D  C  NK K   V I+GY  V   + D+    V   P++V I 
Sbjct: 213 --GGIDTEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGIC 270

Query: 262 -NAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGE 320
            +A A Q Y  G+      F      +L H+VLIVGYG +  K       YWI+KNSWGE
Sbjct: 271 GSARAFQLYSQGI------FDGPCPTSLDHAVLIVGYGSEGGK------DYWIVKNSWGE 318

Query: 321 GWGEKGYFRLYRGDGS----CGIN 340
            WG KGY  ++R  GS    CGIN
Sbjct: 319 RWGMKGYMHMHRNTGSSSGICGIN 342


>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 107/301 (35%), Positives = 159/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y Y+
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYQ 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
          Length = 339

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 127/344 (36%), Positives = 175/344 (50%), Gaps = 40/344 (11%)

Query: 7   FAGVALLSLTVSVS-SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           FA  A L+ + ++S S MVV  E+               ++ Q+ + Y T  E   R +I
Sbjct: 16  FATSAYLATSRTLSDSLMVVRHEQ---------------WMAQYGRVYKTEAEKTKRFNI 60

Query: 66  FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT 125
           F  N+  I+            G+N F+DL+  EF+A   G+KL P     + P    N++
Sbjct: 61  FKENVEYIESFNKAGTKPYKLGINAFADLTNQEFKASRNGYKL-PHDCSSNTPFRYENVS 119

Query: 126 -LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
            +P   DWR   AVT VKDQ  CG  WAFS    +EG+    T  L+SLSEQEL+DCD +
Sbjct: 120 SVPTTVDWRTKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVK 179

Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSVS 241
             D GCEGG + +AF  I++    GL  E  YPY+G D +C +     +  KI+GY  V 
Sbjct: 180 GTDQGCEGGLMDDAFSFIINNK--GLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVP 237

Query: 242 RDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
            +     +  V N P++VAI+A     QFY +GV     F  + G E L H V  VGYG+
Sbjct: 238 ANSESALEKAVANQPVSVAIDAGGSDFQFYSSGV-----FTGECGTE-LDHGVTAVGYGI 291

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
                      YW++KNSWG  WGEKGY R+ +     +G CGI
Sbjct: 292 -----AEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGI 330


>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
          Length = 335

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 122/325 (37%), Positives = 170/325 (52%), Gaps = 30/325 (9%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLST 96
           A ++ F  +H K+Y +  E   RL I+  N  KI    +    G   Y   +NEF D+  
Sbjct: 25  AEWSAFKAKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLH 84

Query: 97  AEFQAKYLGFKLKPSYADRSV-------PAMIPNITLPRAFDWREYDAVTGVKDQTMCGS 149
            EF +   GFK   +Y D+         P  I + +LP+  DWR   AVT VK+Q  CGS
Sbjct: 85  HEFVSTRNGFKR--NYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGS 142

Query: 150 SWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGG 207
            WAFS TG++EG +  K+  +VSLSEQ L+ C  +  ++GCEGG + +AF  I  +   G
Sbjct: 143 CWAFSATGSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYI--RANKG 200

Query: 208 LEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY-- 264
           ++ EK+YPY G D  C   K       +G+V +    ET + K +   GP++VAI+A   
Sbjct: 201 IDTEKSYPYNGTDGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHE 260

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
           + QFY  GV    +  CD  +E+L H VL+VGYG      T     YW +KNSWG  WG+
Sbjct: 261 SFQFYSDGVYDEPE--CD--SESLDHGVLVVGYG------TLNGTDYWFVKNSWGTTWGD 310

Query: 325 KGYFRLYRG-DGSCGINDYVRSALV 348
           +GY R+ R     CGI       LV
Sbjct: 311 EGYIRMSRNKKNQCGIASSASIPLV 335


>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
          Length = 378

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 177/346 (51%), Gaps = 34/346 (9%)

Query: 4   FYFFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRL 63
             FF+ + +LSL + + + +   ++++         A++  +L +  K+Y +L E   R 
Sbjct: 12  LLFFSTLLILSLALDIENSVQRTNDQV--------MAMYESWLVEQGKSYNSLDEKEMRF 63

Query: 64  HIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN 123
            IF  NLR I       + S   GLN F+DL+  E+++ YLG K+ P   D S   M P 
Sbjct: 64  EIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPK-TDVSNEYM-PK 121

Query: 124 I--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
           +   LP   DWR   AV GVK+Q +C S WAFS    +EG+    T  L+SLSEQEL+DC
Sbjct: 122 VGEALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDC 181

Query: 182 --DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYV 238
              Q   GC  G +++AF  I++   GG+  E  YPY   D  C L+ K  + V I+ Y 
Sbjct: 182 GRTQRTKGCNRGLMTDAFQFIINN--GGINTEDNYPYTAKDGQCNLSLKNQKYVTIDNYK 239

Query: 239 SVSRDETDMAKYLVENGPMAVAINAYALQF--YVTGVSHPIQFFCDGGNENLSHSVLIVG 296
           +V  +     K  V   P++V + +   +F  Y +G+      FC      + H V IVG
Sbjct: 240 NVPSNNEMALKKAVAYQPVSVGVESEGGKFKLYTSGI---FTGFCGTA---VDHGVTIVG 293

Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR---GDGSCGI 339
           YG +R       + YWI+KNSWG  WGE GY R+ R   G G CGI
Sbjct: 294 YGTER------GMDYWIVKNSWGTNWGENGYIRIQRNIGGAGKCGI 333


>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
          Length = 325

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/306 (38%), Positives = 163/306 (53%), Gaps = 20/306 (6%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKL 108
           HNK Y+   E   R  I+  N+ +I    +++  + +  +N F D++  EF+AK  G  L
Sbjct: 34  HNKAYSHESEENVRYAIWKDNMNRITEY-NSKSKNVILRMNHFGDMTNTEFRAKMNGLLL 92

Query: 109 KPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTK 168
              + + S   +  +   P A DWR    VT VK+Q  CGS WAFS+TG +EG +  KT 
Sbjct: 93  H-KHQNGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQHFKKTG 151

Query: 169 KLVSLSEQELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLN 226
           +LVSLSEQ L+DC  D  ++GC GG + NAF  I  K  GG++ E  YPY G D  CR +
Sbjct: 152 RLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYI--KANGGIDTETGYPYEGQDGTCRYS 209

Query: 227 KKATQVKINGYVSVSRDETDMAKYLVEN-GPMAVAINA--YALQFYVTGVSHPIQFFCDG 283
           K +      G+V +   + D  K  V   GP++VAI+A   + QFY +GV    Q  C  
Sbjct: 210 KSSIGADDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQ--CS- 266

Query: 284 GNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD-GSCGINDY 342
               L H VL+VGYG D  K       YW++KNSWG GWG +GY  + R +   CGI   
Sbjct: 267 -PSALDHGVLVVGYGTDNGK------DYWLVKNSWGTGWGTEGYIYMSRNNQNQCGIASK 319

Query: 343 VRSALV 348
               LV
Sbjct: 320 ASYPLV 325


>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 108/301 (35%), Positives = 159/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC+GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFI--KENGGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
          Length = 344

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 109/301 (36%), Positives = 158/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        I +  +P   DWRE  AVT VK+Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I  +  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--RENGGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G    CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GQQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGGNEN-LSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   N ++H+V  +GYG D          YW++KNSWG  WGEKG+ ++ R  G
Sbjct: 277 -----DGSCANRINHAVTAIGYGTD-----ENGQKYWLLKNSWGTSWGEKGFMKIIRDYG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
 gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
          Length = 462

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 125/351 (35%), Positives = 186/351 (52%), Gaps = 40/351 (11%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHT---------ALFNYFLEQHNKTYA--TLVE 58
           +A+++++ +V   ++  DEK    H V  T         +++  +L +H K  +  +LVE
Sbjct: 13  LAMVTVSSAVDMSIISYDEK----HGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVE 68

Query: 59  YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
              R  IF  NLR +    + ++ S   GL  F+DL+  E+++KYLG K++     R+  
Sbjct: 69  KDRRFEIFKDNLRFVDE-HNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSL 127

Query: 119 AMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
                +   LP + DWR+  AV  VKDQ  CGS WAFST G +EG+    T  L++LSEQ
Sbjct: 128 RYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQ 187

Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
           EL+DCD   ++GC GG +  AF+ I+    GG++ +K YPY+G D  C ++ K A  V I
Sbjct: 188 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
           + Y  V     +  K  V + P+++AI A   A Q Y +G+      F       L H V
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI------FDGSCGTQLDHGV 299

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           + VGYG +  K       YWI++NSWG+ WGE GY R+ R      G CGI
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGKSWGESGYLRMARNIASSSGKCGI 344


>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
          Length = 344

 Score =  188 bits (478), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 119/307 (38%), Positives = 159/307 (51%), Gaps = 25/307 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH-GSGVYGLNEFSDLSTAEFQAKY 103
           ++ Q+ K Y    E  +R  IF  N+  I+   + +   S   G+N+F+DL+  EF A  
Sbjct: 42  WMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKLGINQFADLTNEEFIASR 101

Query: 104 LGFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
             FK    S   R+      N++ +P   DWR+  AVT VK+Q  CG  WAFS     EG
Sbjct: 102 NKFKGHMCSSIMRTTSFKYENVSGIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEG 161

Query: 162 VYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           ++   T KL+SLSEQEL+DCD +  D GCEGG + +AF  I+     GL  E  YPY G 
Sbjct: 162 IHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNH--GLSTEAQYPYEGV 219

Query: 220 DKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHP 276
           D  C  NK + Q V I GY  V  +     +  V N P++VAI+A     QFY +GV   
Sbjct: 220 DGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGSDFQFYKSGV--- 276

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG--- 333
              F       L H V  VGYGV     ++    YW++KNSWG  WGE+GY  + RG   
Sbjct: 277 ---FTGACGTELDHGVTAVGYGV-----SNDGTKYWLVKNSWGTDWGEEGYIMMQRGIEA 328

Query: 334 -DGSCGI 339
            +G CGI
Sbjct: 329 AEGICGI 335


>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 110/310 (35%), Positives = 163/310 (52%), Gaps = 25/310 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENIKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC+GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFI--KENGGISSESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S-CGINDYVR 344
           +  G+ D  +
Sbjct: 327 NPAGLCDIAK 336


>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
          Length = 339

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 116/303 (38%), Positives = 154/303 (50%), Gaps = 23/303 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           + +++ K Y    E   RL IF  N+  I+      +      +N  +D +  EF A + 
Sbjct: 43  WTKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLTDQTNEEFVASHN 102

Query: 105 GFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
           G+K K S++    P    NIT +P A DWRE  AV  +KDQ  CG+ WAFST    EG+Y
Sbjct: 103 GYKHKGSHS--QTPFKYENITGVPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIY 160

Query: 164 AAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC 223
              T  L+SLSEQEL+DCD  D GC+GG +   F+ I     GG+  E  YPY   D   
Sbjct: 161 QITTSMLMSLSEQELVDCDSVDHGCDGGYMEGGFEFIXKN--GGISSEANYPYTAVDGTY 218

Query: 224 RLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAIN--AYALQFYVTGVSHPIQFF 280
             NK+A+   +I GY +V  +  D  +  V N P++V I+    A QF  +GV      F
Sbjct: 219 DANKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDVGGSAFQFNSSGV------F 272

Query: 281 CDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGS 336
                  L H V  VGYG      T     YWI+KNSWG  WGE+GY R+ RG    +G 
Sbjct: 273 TGQCGTQLDHGVTAVGYGS-----TDDGTQYWIVKNSWGTQWGEEGYIRMQRGTDAQEGL 327

Query: 337 CGI 339
           CGI
Sbjct: 328 CGI 330


>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
          Length = 462

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 164/314 (52%), Gaps = 31/314 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           ++  ++ +H+ TY  + E   R   F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 41  MYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQ-HNAAADAGVHSFRLGLNRFADLTN 99

Query: 97  AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
            E+++ YLG + KP   +R + A      N  LP + DWR+  AV  VKDQ  CGS WAF
Sbjct: 100 EEYRSTYLGARTKPDR-ERKLSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAF 158

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           S    +EG+    T  ++ LSEQEL+DCD   + GC GG +  AF+ I++   GG++ E+
Sbjct: 159 SAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDSEE 216

Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
            YPY+  D  C  NKK A  V I+GY  V  +     +  V N P++VAI A   A Q Y
Sbjct: 217 DYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLY 276

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +G+      F       L H V  VGYG +  K       YW+++NSWG  WGE GY R
Sbjct: 277 KSGI------FTGTCGTALDHGVAAVGYGTENGK------DYWLVRNSWGSVWGENGYIR 324

Query: 330 LYRG----DGSCGI 339
           + R      G CGI
Sbjct: 325 MERNIKASSGKCGI 338


>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
          Length = 340

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 123/353 (34%), Positives = 176/353 (49%), Gaps = 30/353 (8%)

Query: 13  LSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRK 72
           + +TV ++  +  G      L+  +H +LF  +     K Y T+ E   ++  +  N  K
Sbjct: 1   MKVTVLLAVVLFAGCCSAMQLNQ-QHVSLFQTWKNLWKKVYQTVEEEEQKMATWFNNWNK 59

Query: 73  I---QLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMI-------- 121
           I    +    +  S    +NE+ DL++ EF +   G++       +S             
Sbjct: 60  ISEHNMQYSLKQKSYRLEMNEYGDLTSEEFSSMMNGYRNDIRLKRKSTGGSTYLNLLSFG 119

Query: 122 PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
             I LP   DWR++  VT VK+Q  CGS W+FS TG++EG +  KT KLVSLSEQ LIDC
Sbjct: 120 SQIQLPTLVDWRKHGLVTPVKNQGQCGSCWSFSATGSLEGQHKKKTGKLVSLSEQNLIDC 179

Query: 182 D--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVS 239
              + +DGC GG +  AF  I  K+ GG++ E  YPY   D  CR N   +     G+V 
Sbjct: 180 STPEGNDGCNGGLMDQAFKYI--KIQGGIDTEAYYPYEAKDDTCRFNITDSGATDTGFVD 237

Query: 240 VSRDETDMAKYLVEN-GPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVG 296
           +   + +M K      GP++VAI+A   + QFY  GV    +  C   +  L H VL+VG
Sbjct: 238 IKSGDEEMLKEAAATVGPISVAIDASHTSFQFYSNGVYS--ETACS--STMLDHGVLVVG 293

Query: 297 YGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR-GDGSCGINDYVRSALV 348
           YG +  K       YW++KNSWGEGWGE GY ++ R  D  CGI       LV
Sbjct: 294 YGTENGK------DYWLVKNSWGEGWGEAGYIKMSRNADNQCGIATQASYPLV 340


>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
          Length = 362

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 128/345 (37%), Positives = 177/345 (51%), Gaps = 31/345 (8%)

Query: 12  LLSLTVSVSSFMVVGDE-KLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRLHIFSG 68
           LL + +S++  +VV +    H        +L++ +     H+     L E   R ++F  
Sbjct: 6   LLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVSRNLNEKQKRFNVFKS 65

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-----MIPN 123
           N+  +      +    +  LN+F+D++  EF+  Y G K+      R  P      M  N
Sbjct: 66  NVMHVHNTNKMDKPYKLK-LNKFADMTNHEFKTTYAGTKVNHHRMFRGTPRVSGTFMYEN 124

Query: 124 IT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
            T  P + DWR+  AVT VKDQ  CGS WAFST   +EG+   KT +LV LSEQELIDCD
Sbjct: 125 FTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCD 184

Query: 183 -QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSV 240
            QE+ GC GG +  AF+ I  K  GG+  E  YPY  +D +C   K+    V I+G+ +V
Sbjct: 185 NQENQGCNGGLMEYAFEYIKQK--GGVTTESYYPYTANDGSCDATKENVPTVSIDGHETV 242

Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
             ++ D     V N P++VAI+A     QFY  GV     F  D G E L+H V IVGYG
Sbjct: 243 PANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV-----FTGDCGKE-LNHGVAIVGYG 296

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
                 T     YWI++NSWG  WGE+G  R+ R     +G CGI
Sbjct: 297 T-----TVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGI 336


>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
 gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
          Length = 345

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 115/308 (37%), Positives = 162/308 (52%), Gaps = 23/308 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           ++  ++++H K Y +  EY  R  IF  N+  I       + S   GLN+F+DL+ +EF+
Sbjct: 37  VYQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEFR 96

Query: 101 AKYLGFKLKPS-YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNI 159
             Y+G   +P+ + +    A++ +     + DWR+   VT +KDQ  CGS WAFS    +
Sbjct: 97  GLYVGRLQRPAPFHEVGDIALVADTAT--SVDWRKKGGVTEIKDQGDCGSCWAFSAVAAV 154

Query: 160 EGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           EG+    T  LVSLSEQEL+DCD   + GC+GG +  AF  ++    GG+  +  YPYR 
Sbjct: 155 EGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRN--GGITSQSNYPYRA 212

Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSH 275
              AC  +K K     ING+ ++     ++    V N P++VAI A     Q Y +GV  
Sbjct: 213 LRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYSSGV-- 270

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR--- 332
               F      NL H V IVGYG D          YW++KNSWG GWGE GY R+ R   
Sbjct: 271 ----FTGECGSNLDHGVAIVGYGTDA-----GGRQYWLVKNSWGSGWGESGYVRMERQGP 321

Query: 333 GDGSCGIN 340
           G G CGIN
Sbjct: 322 GAGVCGIN 329


>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
 gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
           Precursor
 gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
 gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
 gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
 gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
          Length = 462

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 125/351 (35%), Positives = 186/351 (52%), Gaps = 40/351 (11%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHT---------ALFNYFLEQHNKTYA--TLVE 58
           +A+++++ +V   ++  DEK    H V  T         +++  +L +H K  +  +LVE
Sbjct: 13  LAMVAVSSAVDMSIISYDEK----HGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVE 68

Query: 59  YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
              R  IF  NLR +    + ++ S   GL  F+DL+  E+++KYLG K++     R+  
Sbjct: 69  KDRRFEIFKDNLRFVDE-HNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSL 127

Query: 119 AMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
                +   LP + DWR+  AV  VKDQ  CGS WAFST G +EG+    T  L++LSEQ
Sbjct: 128 RYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQ 187

Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
           EL+DCD   ++GC GG +  AF+ I+    GG++ +K YPY+G D  C ++ K A  V I
Sbjct: 188 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
           + Y  V     +  K  V + P+++AI A   A Q Y +G+      F       L H V
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI------FDGSCGTQLDHGV 299

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           + VGYG +  K       YWI++NSWG+ WGE GY R+ R      G CGI
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGKSWGESGYLRMARNIASSSGKCGI 344


>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 109/301 (36%), Positives = 160/301 (53%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY       S   +I +++   +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|2352469|gb|AAC00067.1| cysteine protease [Trypanosoma cruzi]
          Length = 471

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 118/344 (34%), Positives = 173/344 (50%), Gaps = 22/344 (6%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           V L ++ V ++  +      LH    +  T+ F  F ++H + Y +       L +F  N
Sbjct: 8   VLLAAVLVVMACLVPAATASLHAEETL--TSQFAEFKQKHGRVYESAARRLP-LSVFREN 64

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL-GFKLKPSYADRS-VPAMIPNITLP 127
           L  +  L    +    +G+  FSDL+  EF+++Y  G     +  +R+ VP  +  +  P
Sbjct: 65  LF-LARLHAAANPHATFGVTPFSDLTREEFRSRYHNGAAHFAAAQERARVPVKVEVVGAP 123

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
            A DWR   AVT VKDQ  CGS WAFS  GN+E  +      L +LSEQ L+ CD+ D G
Sbjct: 124 AAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSCDKTDFG 183

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPY---RGDDKACRLNKKATQVKINGYVSVSRDE 244
           C GG ++NAF+ I+ +  G +  E +YPY    G    C  +       I G+V + +DE
Sbjct: 184 CSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQDE 243

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
             +A  +  NGP+AVA++A +   Y  GV           +E L H VL+VGY       
Sbjct: 244 AQIAACVAVNGPVAVAVDASSWMTYTGGV------MTSCVSEQLDHGVLLVGYN------ 291

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
              AVPYWIIKNSW    GE+GY R+ +G   C + +   SA+V
Sbjct: 292 DSAAVPYWIIKNSWTTQ-GEEGYIRIAKGSNQCLVKEEASSAVV 334


>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
 gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  188 bits (477), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 117/305 (38%), Positives = 160/305 (52%), Gaps = 26/305 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ ++ K Y   +E   R  IF  N+  I+     ++      +N  +DL+  EF+A   
Sbjct: 43  WMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLDEFKASRN 102

Query: 105 GFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           G+K +   +A  S      N+T +P A DWR   AVT +KDQ  CGS WAFST   IEG+
Sbjct: 103 GYKKIDREFATTSFK--YENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWAFSTVAAIEGI 160

Query: 163 YAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
               T KL+SLSEQEL+DCD   ED GCEGG + + F+ I+    GG+  E  YPY+  D
Sbjct: 161 NQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKN--GGITSETNYPYKAAD 218

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQ 278
            +C     A   KI GY  V  +        V N P++V+I+A   +  FY +G+     
Sbjct: 219 GSCNTATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFYSSGI----- 273

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
           +  + G E L H V  VGYG      +     YWI+KNSWG  WGEKGY R+ RG    +
Sbjct: 274 YTGECGTE-LDHGVTAVGYG------SANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKE 326

Query: 335 GSCGI 339
           G CGI
Sbjct: 327 GLCGI 331


>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
          Length = 433

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 125/351 (35%), Positives = 186/351 (52%), Gaps = 40/351 (11%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHT---------ALFNYFLEQHNKTYA--TLVE 58
           +A+++++ +V   ++  DEK    H V  T         +++  +L +H K  +  +LVE
Sbjct: 13  LAMVAVSSAVDMSIISYDEK----HGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVE 68

Query: 59  YYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVP 118
              R  IF  NLR +    + ++ S   GL  F+DL+  E+++KYLG K++     R+  
Sbjct: 69  KDRRFEIFKDNLRFVDE-HNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSL 127

Query: 119 AMIPNI--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQ 176
                +   LP + DWR+  AV  VKDQ  CGS WAFST G +EG+    T  L++LSEQ
Sbjct: 128 RYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQ 187

Query: 177 ELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKI 234
           EL+DCD   ++GC GG +  AF+ I+    GG++ +K YPY+G D  C ++ K A  V I
Sbjct: 188 ELVDCDTSYNEGCNGGLMDYAFEFIIKN--GGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245

Query: 235 NGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSV 292
           + Y  V     +  K  V + P+++AI A   A Q Y +G+      F       L H V
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGI------FDGSCGTQLDHGV 299

Query: 293 LIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR----GDGSCGI 339
           + VGYG +  K       YWI++NSWG+ WGE GY R+ R      G CGI
Sbjct: 300 VAVGYGTENGK------DYWIVRNSWGKSWGESGYLRMARNIASSSGKCGI 344


>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
          Length = 367

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 111/291 (38%), Positives = 154/291 (52%), Gaps = 24/291 (8%)

Query: 58  EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
           E  +R H+F  N++ I  +   +    +  LN+F DL+ +EF   Y   K+     + S 
Sbjct: 59  EKQNRFHVFKENVKYINEVNKMDKPYKLR-LNQFGDLTPSEFARTYANSKIIEGTRNESG 117

Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
             M  N+ +PR+ DWR   AVT VK+Q  CG  WAFS    +EG+    T +L+SLSEQ+
Sbjct: 118 GFMYENVEVPRSIDWRVKGAVTPVKNQGRCGGCWAFSAAAAVEGINQITTGQLISLSEQQ 177

Query: 178 LIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK-KATQVKING 236
           LIDCD ++ GC GG++  AF+ I  +  GG+  E  YPY+     C+ N  +   V I+G
Sbjct: 178 LIDCDTQNSGCRGGTMGRAFEYIKQR--GGITSEANYPYKAQAGMCKNNLIQRPTVSIDG 235

Query: 237 YVSVSRDETDMAKYLVENGPMAVAINAYALQ-----FYVTGVSHPIQFFCDGGNENLSHS 291
           Y ++ R E  + K L    P++VA++A         FY  GV      F       L+H 
Sbjct: 236 YYNIRRSEDAVLKILAHQ-PVSVAVDATTWSSLDWMFYFQGV------FTGPCGTKLNHG 288

Query: 292 VLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD---GSCGI 339
           V  VGYG      T+    YWIIKNSWGE WGE+GY R+ RG    G CGI
Sbjct: 289 VTAVGYGT-----TNDGYDYWIIKNSWGETWGERGYMRMLRGVSPYGLCGI 334


>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
 gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
 gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
          Length = 498

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/314 (37%), Positives = 169/314 (53%), Gaps = 25/314 (7%)

Query: 39  TALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGV---YGLNEFSDLS 95
           T +F  + E+H K Y    E   R+  F  NL+ I + ++ +  SG+    GLN+F+DLS
Sbjct: 47  TEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYI-IEKNGKRKSGLEHKVGLNKFADLS 105

Query: 96  TAEFQAKYLGFKLKP-SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
             EF+  YL    KP +  ++     +     P + DWR    VT VKDQ  CGS W+FS
Sbjct: 106 NEEFREMYLSKVKKPITIEEKRKHRHLQTCDAPSSLDWRNKGVVTAVKDQGDCGSCWSFS 165

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKT 213
           TTG IE + A  T  L+SLSEQEL+DCD  ++ GCEGG + +AF  ++    GG++ E  
Sbjct: 166 TTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGN--GGIDTEAD 223

Query: 214 YPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYAL--QFYV 270
           YPY G D  C   K+  + V I GYV V   ++ +    V+  P++V ++  AL  Q Y 
Sbjct: 224 YPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSDSALLCATVQQ-PISVGMDGSALDFQLYT 282

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
            G+       C G   ++ H++LIVGYG +  +       YWI+KNSWG  WG +GYF +
Sbjct: 283 GGI---YDGDCSGDPNDIDHAILIVGYGSENDE------DYWIVKNSWGTEWGMEGYFYI 333

Query: 331 YRGD----GSCGIN 340
            R      G C IN
Sbjct: 334 RRNTSKPYGVCAIN 347


>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
 gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
          Length = 362

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 128/345 (37%), Positives = 177/345 (51%), Gaps = 31/345 (8%)

Query: 12  LLSLTVSVSSFMVVGDE-KLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRLHIFSG 68
           LL + +S++  +VV +    H        +L++ +     H+     L E   R ++F  
Sbjct: 6   LLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVSRNLNEKQKRFNVFKS 65

Query: 69  NLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-----MIPN 123
           N+  +      +    +  LN+F+D++  EF+  Y G K+      R  P      M  N
Sbjct: 66  NVMHVHNTNKMDKPYKLK-LNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFMYEN 124

Query: 124 IT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
            T  P + DWR+  AVT VKDQ  CGS WAFST   +EG+   KT +LV LSEQELIDCD
Sbjct: 125 FTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELIDCD 184

Query: 183 -QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKK-ATQVKINGYVSV 240
            QE+ GC GG +  AF+ I  K  GG+  E  YPY  +D +C   K+    V I+G+ +V
Sbjct: 185 NQENQGCNGGLMEYAFEYIKQK--GGVTTESYYPYTANDGSCDATKENVPTVSIDGHETV 242

Query: 241 SRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYG 298
             ++ D     V N P++VAI+A     QFY  GV     F  D G E L+H V IVGYG
Sbjct: 243 PANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGV-----FTGDCGKE-LNHGVAIVGYG 296

Query: 299 VDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
                 T     YWI++NSWG  WGE+G  R+ R     +G CGI
Sbjct: 297 T-----TVDGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGI 336


>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
 gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
 gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
 gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
          Length = 345

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 125/337 (37%), Positives = 171/337 (50%), Gaps = 34/337 (10%)

Query: 15  LTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ 74
             + V+S  +  D  ++  H          ++  + K Y  L E  +RL IF  N+  I+
Sbjct: 22  FAIQVTSRTLQDDSNIYEKHE--------QWMVHYGKVYKDLQERENRLKIFKENVNYIE 73

Query: 75  LLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFK-LKPSYADRSVPAMIPNITLPRAFD 131
              +  +   +Y  G+N+F+DL+  EF A    FK    S   ++      N ++P   D
Sbjct: 74  ASNNAGNNK-LYKLGINQFADLTNEEFIASRNKFKGHMCSSITKTSTFKYENASVPSTVD 132

Query: 132 WREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCE 189
           WR+  AVT VK+Q  CG  WAFS     EG++   T KLVSLSEQEL+DCD +  D GCE
Sbjct: 133 WRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCE 192

Query: 190 GGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMA 248
           GG + +AF  I+     GL  E  YPY+G D  C  NK +   V I GY  V  +     
Sbjct: 193 GGLMDDAFKFIIQNH--GLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQAL 250

Query: 249 KYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTH 306
           +  V N P++VAI+A     QFY +GV     F    G E L H V  VGYGV      +
Sbjct: 251 QKAVANQPISVAIDASGSDFQFYKSGV-----FTGSCGTE-LDHGVTAVGYGVG-----N 299

Query: 307 KAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
               YW++KNSWG  WGE+GY ++ RG    +G CGI
Sbjct: 300 DGTKYWLVKNSWGTDWGEEGYIKMQRGVDAAEGLCGI 336


>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
 gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
          Length = 341

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 127/344 (36%), Positives = 175/344 (50%), Gaps = 40/344 (11%)

Query: 7   FAGVALLSLTVSVS-SFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           FA  A L+ + ++S S MVV  E+               ++ Q+ + Y   VE   R +I
Sbjct: 18  FATSAYLATSRTLSDSLMVVRHEQ---------------WMAQYGRVYENEVEKTKRFNI 62

Query: 66  FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT 125
           F  N+  I+            G+N F+DL+  EF+A   G+KL P     + P    N++
Sbjct: 63  FKENVEYIESFNKAGTKPYKLGINAFADLTNQEFKASRNGYKL-PHDCSSNTPFRYENVS 121

Query: 126 -LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE 184
            +P   DWR   AVT VKDQ  CG  WAFS    +EG+    T  L+SLSEQEL+DCD +
Sbjct: 122 SVPTTVDWRTKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVK 181

Query: 185 --DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKAC-RLNKKATQVKINGYVSVS 241
             D GCEGG + +AF  I++    GL  E  YPY+G D +C +     +  KI+GY  V 
Sbjct: 182 GIDQGCEGGLMDDAFSFIINNK--GLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVP 239

Query: 242 RDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGV 299
            +     +  V N P++VAI+A     QFY +GV     F  + G E L H V  VGYG+
Sbjct: 240 ANSESALEKAVANQPVSVAIDAGGSDFQFYSSGV-----FTGECGTE-LDHGVTAVGYGI 293

Query: 300 DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
                      YW++KNSWG  WGEKGY R+ +     +G CGI
Sbjct: 294 -----AEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGI 332


>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
          Length = 345

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 120/299 (40%), Positives = 158/299 (52%), Gaps = 20/299 (6%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
           K   LF  ++ +H K Y ++ E   R  IF  NL+ I           + GLNEF+DLS 
Sbjct: 42  KLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWL-GLNEFADLSH 100

Query: 97  AEFQAKYLGFKLKPSYADRSVPAMI-PNITLPRAFDWREYDAVTGVKDQTMCGSSWAFST 155
            EF+ KYLG K+  S    S       ++ LP++ DWR+  AV  VK+Q  CGS WAFST
Sbjct: 101 QEFKNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSCWAFST 160

Query: 156 TGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
              +EG+    T  L SLSEQELIDCD+   +GC GG +  AF  I+    GGL +E+ Y
Sbjct: 161 VAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVEN--GGLHKEEDY 218

Query: 215 PYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVT 271
           PY  ++  C + K+ T+ V I+GY  V ++        + N  ++VAI A     QFY  
Sbjct: 219 PYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYSG 278

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           GV      F      +L H V  VGYG      T K V Y I+KNSWG  WGEKGY R+
Sbjct: 279 GV------FDGHCGSDLDHGVAAVGYG------TAKGVDYIIVKNSWGSKWGEKGYIRM 325


>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
 gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
          Length = 341

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 126/347 (36%), Positives = 180/347 (51%), Gaps = 41/347 (11%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           +AL+++T +VS   +V +E             +N F  +H K YA   E   R+ IF+ N
Sbjct: 10  IALVAMTQAVSYSELVREE-------------WNTFKLEHRKNYADSTEETFRMKIFNEN 56

Query: 70  LRKI-QLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT- 125
              I +  Q    G   Y   LN+++D+   EF+    GF        RS       +T 
Sbjct: 57  KHHIAKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTF 116

Query: 126 -------LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQEL 178
                  LP A DWR   AVT VKDQ  CGS WAFS+TG IEG +  K+  LVSLSEQ L
Sbjct: 117 ISPEHVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNL 176

Query: 179 IDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKING 236
           +DC  +  ++GC GG + NAF  +  K  GG++ EK+Y Y G D +C  +K +      G
Sbjct: 177 VDCSTKYGNNGCNGGLMDNAFRYV--KDNGGIDTEKSYAYEGIDDSCHFDKNSIGATDRG 234

Query: 237 YVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVL 293
           +  + + +E  +A+ +   GP++VAI+A   + QFY  GV       C    ENL H VL
Sbjct: 235 FADIPQGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPN--CSA--ENLDHGVL 290

Query: 294 IVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGI 339
           +VGYG ++         YW++KNSWG  WG+KG+ ++ R  +  CGI
Sbjct: 291 VVGYGTEK-----DGSDYWLVKNSWGTTWGDKGFIKMSRNKENQCGI 332


>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
          Length = 336

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 120/317 (37%), Positives = 161/317 (50%), Gaps = 28/317 (8%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           LH          ++ ++ K Y    E   R  IF  N+  I+      +     G+N  +
Sbjct: 29  LHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLGVNHLA 88

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSW 151
           DL+  EF+A   GFK    ++  +      N+T +P A DWR   AVT +KDQ  CGS W
Sbjct: 89  DLTVEEFKASRNGFKRPHEFSTTTF--KYENVTAIPAAIDWRTKGAVTPIKDQGQCGSCW 146

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLE 209
           AFST    EG++   T KLVSLSEQEL+DCD +  D GCEGG + + F+ I+    GG+ 
Sbjct: 147 AFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKN--GGIT 204

Query: 210 EEKTYPYRGDDKACRLNKKATQV-KINGYVSVSRDETDMAKYLVENGPMAVAINA--YAL 266
            E  YPY+  D  C  NK  + V +I GY  V  +     +  V N P++V+I+A     
Sbjct: 205 SETNYPYKAVDGKC--NKATSPVAQIKGYEKVPPNSETALQKAVANQPVSVSIDADGAGF 262

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
            FY +G+     +  + G E L H V  VGYG      T     YWI+KNSWG  WGEKG
Sbjct: 263 MFYSSGI-----YNGECGTE-LDHGVTAVGYG------TANGTDYWIVKNSWGTQWGEKG 310

Query: 327 YFRLYRG----DGSCGI 339
           Y R+ RG     G CGI
Sbjct: 311 YVRMQRGIAAKHGLCGI 327


>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
          Length = 342

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 115/317 (36%), Positives = 161/317 (50%), Gaps = 23/317 (7%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           L  V        ++ Q+ K Y    E   R +IF  N+++I+   +  +     G+N+F+
Sbjct: 30  LEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQRIEAFNNAGNKPYKLGINQFA 89

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSW 151
           DL+  EF+A+        S + R+      +++ +P + DWR+  AVT +KDQ  CG  W
Sbjct: 90  DLTNEEFKARNRFKGHMCSNSTRTPTFKYEDVSSVPASLDWRQKGAVTPIKDQGQCGCCW 149

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLE 209
           AFS     EG+    T KL+SLSEQEL+DCD +  D GCEGG + +AF  IM     GL 
Sbjct: 150 AFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQN--KGLN 207

Query: 210 EEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--L 266
            E  YPY+G D  C  N +A     I G+  V  +        V N P++VAI+A     
Sbjct: 208 TEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSEF 267

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           QFY +G+      F       L H V  VGYGV     +     YW++KNSWGE WGE+G
Sbjct: 268 QFYSSGL------FTGSCGTELDHGVTAVGYGV-----SDDGTKYWLVKNSWGEQWGEEG 316

Query: 327 YFRLYRG----DGSCGI 339
           Y R+ R     +G CGI
Sbjct: 317 YIRMQRDVAAEEGLCGI 333


>gi|33242882|gb|AAQ01145.1| cathepsin [Branchiostoma lanceolatum]
          Length = 334

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 126/331 (38%), Positives = 172/331 (51%), Gaps = 32/331 (9%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYG 87
           + H   +  +  QH K Y T  E YSR  IF  N  KI      EH         S    
Sbjct: 18  LPHNKEWEMWKLQHGKQYETEAEEYSRRFIFEKNTVKI-----AEHNIRASLGMHSYTLA 72

Query: 88  LNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           +N+F D+   EF  + +G  LK          V     N TLP++ DWR    V+ VKDQ
Sbjct: 73  MNKFGDMHHEEFHQRIMGGCLKIVKKPLLGSEVGDSDDNGTLPKSVDWRNSHMVSEVKDQ 132

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMS 202
             CG  WAFSTTG++EG ++ KT KLV LSEQ+L+DC ++  + GC GG +  AF  I +
Sbjct: 133 GECGPCWAFSTTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIPA 192

Query: 203 KLGGGLEEEKTYPYRG-DDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVA 260
              GGL+ E++YPY   DDK C+ +  +    + GY  V S +E  + + +   GP++VA
Sbjct: 193 N--GGLDTEESYPYTATDDKPCKFDNSSVGATLVGYKDVKSGNEHALKRAVATVGPVSVA 250

Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           I+A   + QFY +GV    Q  C    E L H VL VGYG      +H+A  +WI+KNSW
Sbjct: 251 IDAGHESFQFYSSGVYDEPQ--CS--TEQLDHGVLAVGYGA-MNDNSHQA--FWIVKNSW 303

Query: 319 GEGWGEKGYFRLYRG-DGSCGINDYVRSALV 348
           G  WG++GY  + R  +  CGI       LV
Sbjct: 304 GPSWGDQGYIMMSRNKNNQCGIATSASYPLV 334


>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
          Length = 336

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 122/312 (39%), Positives = 170/312 (54%), Gaps = 22/312 (7%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
           H+K Y    E + R+ ++  NL+KI+L  + EH  G +    G+N F D++  EF+    
Sbjct: 35  HSKKYHEKEEGWRRM-VWEKNLKKIEL-HNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMN 92

Query: 105 GFKLKPSYADRSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
           G+K K     R    + PN +  P++ DWR+   VT VKDQ  CGS WAFSTTG +EG +
Sbjct: 93  GYKRKAETKARGSLFLEPNFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQH 152

Query: 164 AAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-DD 220
             KT KLVSLSEQ L+DC + +  +GC GG +  AF  +  K   GL+ E +YPY G DD
Sbjct: 153 FRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYV--KDNQGLDSEDSYPYLGTDD 210

Query: 221 KACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPI 277
           + C  +     V   G+V + S  E  + K +   GP++VAI+A   + QFY +G    I
Sbjct: 211 QPCHYDPTYNSVNDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSG----I 266

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGS 336
            +  +  +E L H VL+VGYG        K   YWI+KNSW E WG+KGY  + +     
Sbjct: 267 YYEKECSSEELDHGVLVVGYGFQGEDVDGKK--YWIVKNSWSEKWGDKGYIYMAKDRKNH 324

Query: 337 CGINDYVRSALV 348
           CGI       LV
Sbjct: 325 CGIATAASYPLV 336


>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
          Length = 337

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 125/316 (39%), Positives = 166/316 (52%), Gaps = 22/316 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKI-QLLQDTEHGSGVY--GLNEFSDLSTAEFQA 101
           F   HNK Y + VE   R+ I+  N RKI +  +  E     Y  G+N++ D+   EF  
Sbjct: 32  FKLHHNKVYKSPVEEGYRMKIYMDNKRKIAEHNRKYELNEVTYKLGMNKYGDMLHHEFVN 91

Query: 102 KYLGFK--LKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
              GF   +        V  + P N+ LP   DW +  AVT VKDQ  CGS WAFS+TG 
Sbjct: 92  TLNGFNKSVTAGIETEGVTFISPANVKLPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGA 151

Query: 159 IEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPY 216
           +EG +   T  LVSLSEQ LIDC  +  ++GC GG +  AF  I  K   GL+ EKTYPY
Sbjct: 152 LEGQHFRSTGYLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFQYI--KDNKGLDTEKTYPY 209

Query: 217 RGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGV 273
             ++  CR N + +     GYV + + DE  +   +   GP++VAI+A   + Q Y  GV
Sbjct: 210 EAENDRCRYNPRNSGATDKGYVDIPQGDEEKLKAAVATIGPISVAIDASHESFQLYSEGV 269

Query: 274 SHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
                +  D   ENL H VLIVGYG D T        YW++KNSWG+ WG+KGY ++ R 
Sbjct: 270 ----YYDPDCSAENLDHGVLIVGYGTDET----SGHDYWLVKNSWGKTWGQKGYIKMARN 321

Query: 334 -DGSCGINDYVRSALV 348
            +  CGI       LV
Sbjct: 322 KNNHCGIASSASYPLV 337


>gi|166235890|ref|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus]
 gi|341940309|sp|P49935.2|CATH_MOUSE RecName: Full=Pro-cathepsin H; AltName: Full=Cathepsin B3; AltName:
           Full=Cathepsin BA; Contains: RecName: Full=Cathepsin H
           mini chain; Contains: RecName: Full=Cathepsin H;
           Contains: RecName: Full=Cathepsin H heavy chain;
           Contains: RecName: Full=Cathepsin H light chain; Flags:
           Precursor
 gi|74151776|dbj|BAE29677.1| unnamed protein product [Mus musculus]
 gi|74181999|dbj|BAE34071.1| unnamed protein product [Mus musculus]
 gi|74211659|dbj|BAE29188.1| unnamed protein product [Mus musculus]
 gi|74213518|dbj|BAE35569.1| unnamed protein product [Mus musculus]
 gi|148688954|gb|EDL20901.1| cathepsin H, isoform CRA_b [Mus musculus]
          Length = 333

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 123/339 (36%), Positives = 178/339 (52%), Gaps = 33/339 (9%)

Query: 8   AGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFS 67
           AG  LLS T + +   V   EK H          F  +++QH KTY++ VEY  RL +F+
Sbjct: 10  AGAWLLS-TGATAELTVNAIEKFH----------FKSWMKQHQKTYSS-VEYNHRLQMFA 57

Query: 68  GNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRS--VPAMIPNIT 125
            N RKIQ      H   +  LN+FSD+S AE + K+L  + +   A +S  +    P   
Sbjct: 58  NNWRKIQAHNQRNHTFKM-ALNQFSDMSFAEIKHKFLWSEPQNCSATKSNYLRGTGP--- 113

Query: 126 LPRAFDWREY-DAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ- 183
            P + DWR+  + V+ VK+Q  CGS W FSTTG +E   A  + K++SL+EQ+L+DC Q 
Sbjct: 114 YPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMLSLAEQQLVDCAQA 173

Query: 184 -EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVS- 241
             + GC+GG  S AF+ I+     G+ EE +YPY G D +CR N +     +   V+++ 
Sbjct: 174 FNNHGCKGGLPSQAFEYIL--YNKGIMEEDSYPYIGKDSSCRFNPQKAVAFVKNVVNITL 231

Query: 242 RDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
            DE  M + +    P++ A         Y +GV       C    + ++H+VL VGYG  
Sbjct: 232 NDEAAMVEAVALYNPVSFAFEVTEDFLMYKSGVYSSKS--CHKTPDKVNHAVLAVGYG-- 287

Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGI 339
                   + YWI+KNSWG  WGE GYF + RG   CG+
Sbjct: 288 ----EQNGLLYWIVKNSWGSQWGENGYFLIERGKNMCGL 322


>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 109/300 (36%), Positives = 159/300 (53%), Gaps = 24/300 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY       S   +I +++   +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326


>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
 gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
          Length = 326

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 114/309 (36%), Positives = 179/309 (57%), Gaps = 24/309 (7%)

Query: 37  KHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLST 96
           ++   F  ++ +H K+Y T  E+ SR  +F  N+  I    + +  + + GLN  +DL+ 
Sbjct: 27  QYQTAFQNWMVKHQKSY-TNDEFGSRYSVFQDNM-DIVAKWNQKGSNTILGLNVMADLTN 84

Query: 97  AEFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
            EF+  YLG K   +Y  +++  +     LP + DWR   AVT VK+Q  CG  +AFSTT
Sbjct: 85  EEFKKLYLGTKANVTYKKKTLVGVSG---LPASVDWRANGAVTAVKNQGQCGGCYAFSTT 141

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           G++EG++   +++LV LSEQ+++DC   + ++GC+GG ++N+F+ I++   GGL+ E +Y
Sbjct: 142 GSVEGIHEITSQQLVPLSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAV--GGLDTEASY 199

Query: 215 PYRGDDKACRLNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVT 271
           PY G+   C+ NKK     I GY +V S  E+D+ +  V   P++VAI+A   + Q Y +
Sbjct: 200 PYTGEVGKCKFNKKNIGATITGYKNVESGSESDL-QTAVAAQPVSVAIDASQSSFQLYAS 258

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV +  +  C   +  L H VL VGYG      +     YWI+KNSWG  WGE G+  + 
Sbjct: 259 GVYYEPE--CS--STQLDHGVLAVGYG------SQSGQDYWIVKNSWGADWGENGFILMA 308

Query: 332 RG-DGSCGI 339
           R  D +CGI
Sbjct: 309 RNKDNNCGI 317


>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYKVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|394331830|gb|AFN27134.1| cysteine protease [Leishmania tropica]
          Length = 443

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 112/318 (35%), Positives = 170/318 (53%), Gaps = 22/318 (6%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           ALF  F   + + Y T+ E   RL  F  NL  ++  Q   +    +G+ +F DLS AEF
Sbjct: 36  ALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQ-ARNPHARFGITKFFDLSEAEF 94

Query: 100 QAKYLG----FKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
            A+YL     F     +A +       +++ +P A DWR+  A+T VK+Q  CGS WAFS
Sbjct: 95  AARYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGALTPVKNQGACGSCWAFS 154

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
             G+I+  +A    +L +LSEQ+L+ C  +D+GC G  +  AF  ++  + G +  E +Y
Sbjct: 155 AVGSIQSQWALAGHRLTALSEQQLVSCHDKDNGCPGRLMLQAFVGVLQNMNGTMFTEDSY 214

Query: 215 PYRGDDKACRLNKKATQV----KINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYV 270
           PY            ++Q+    +I+GY+++    T MA  L +NGP+++A++A +   Y 
Sbjct: 215 PYVSSTGYVPECSNSSQLVPGARIDGYMTMESSGTVMAACLAKNGPISIAVDASSFMSYQ 274

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +GV              L+H VL+VGY  +RT      VPYW+IKNSWGE WGE GY R+
Sbjct: 275 SGV------LTSCAGMPLNHGVLLVGY--NRT----GEVPYWVIKNSWGENWGENGYVRV 322

Query: 331 YRGDGSCGINDYVRSALV 348
             G  +C + +Y  SA V
Sbjct: 323 TMGVNACLLTEYPVSAHV 340


>gi|114596533|ref|XP_517502.2| PREDICTED: cathepsin O [Pan troglodytes]
 gi|410212082|gb|JAA03260.1| cathepsin O [Pan troglodytes]
 gi|410330245|gb|JAA34069.1| cathepsin O [Pan troglodytes]
          Length = 318

 Score =  188 bits (477), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
           R +  L  +E+ +  YG+N+FS L   EF+A YL  + KPS   R    V   IPN++LP
Sbjct: 49  RYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL--RSKPSKFPRYSAEVHMSIPNVSLP 106

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
             FDWR+   VT V++Q MCG  WAFS  G +E  YA K K L  LS Q++IDC   + G
Sbjct: 107 LRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 166

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
           C GGS  NA +  ++K+   L ++  YP++  +  C   +   +   I GY +   S  E
Sbjct: 167 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSNQE 225

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
            +MAK L+  GP+ V ++A + Q Y+ G+   IQ  C  G  N  H+VLI G+  D+T  
Sbjct: 226 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 278

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           T    PYWI++NSWG  WG  GY  +  G   CGI D V S  V
Sbjct: 279 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 318


>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
          Length = 340

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 117/305 (38%), Positives = 160/305 (52%), Gaps = 26/305 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ ++ K Y   +E   R  IF  N+  I+     ++      +N  +DL+  EF+A   
Sbjct: 43  WMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLDEFKASRN 102

Query: 105 GFK-LKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           G+K +   +A  S      N+T +P A DWR   AVT +KDQ  CGS WAFST   IEG+
Sbjct: 103 GYKKIDREFATTSFK--YENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWAFSTVAAIEGI 160

Query: 163 YAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
               T KL+SLSEQEL+DCD   ED GCEGG + + F+ I+    GG+  E  YPY+  D
Sbjct: 161 NQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKN--GGITSETNYPYKAAD 218

Query: 221 KACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQ 278
            +C     A   KI GY  V  +        V N P++V+I+A   +  FY +G+     
Sbjct: 219 GSCSAATTAPVAKITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFYSSGI----- 273

Query: 279 FFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----D 334
           +  + G E L H V  VGYG      +     YWI+KNSWG  WGEKGY R+ RG    +
Sbjct: 274 YTGECGTE-LDHGVTAVGYG------SANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKE 326

Query: 335 GSCGI 339
           G CGI
Sbjct: 327 GLCGI 331


>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 108/301 (35%), Positives = 160/301 (53%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY       S   +I +++   +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYKVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
 gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
 gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 109/300 (36%), Positives = 159/300 (53%), Gaps = 24/300 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY       S   +I +++   +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326


>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 108/301 (35%), Positives = 160/301 (53%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY       S   +I +++   +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
 gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
 gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 463

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 118/314 (37%), Positives = 163/314 (51%), Gaps = 31/314 (9%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLST 96
           ++  ++ +H  TY  + E   R   F  NLR I    +    +GV+    GLN F+DL+ 
Sbjct: 42  MYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQ-HNAAADAGVHSFRLGLNRFADLTN 100

Query: 97  AEFQAKYLGFKLKPSYADRSVPA---MIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
            E+++ YLG + KP   +R + A      N  LP + DWR+  AV  VKDQ  CGS WAF
Sbjct: 101 EEYRSTYLGARTKPDR-ERKLSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWAF 159

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEK 212
           S    +EG+    T  ++ LSEQEL+DCD   + GC GG +  AF+ I++   GG++ E+
Sbjct: 160 SAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINN--GGIDSEE 217

Query: 213 TYPYRGDDKACRLNKK-ATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFY 269
            YPY+  D  C  NKK A  V I+GY  V  +     +  V N P++VAI A   A Q Y
Sbjct: 218 DYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQLY 277

Query: 270 VTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFR 329
            +G+      F       L H V  VGYG +  K       YW+++NSWG  WGE GY R
Sbjct: 278 KSGI------FTGTCGTALDHGVAAVGYGTENGK------DYWLVRNSWGSVWGEDGYIR 325

Query: 330 LYRG----DGSCGI 339
           + R      G CGI
Sbjct: 326 MERNIKASSGKCGI 339


>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
 gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
          Length = 425

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 115/296 (38%), Positives = 157/296 (53%), Gaps = 33/296 (11%)

Query: 62  RLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAM- 120
           R   F  N R I+        S   GLN+FSDL++ EF+ ++LG  L+P   D  V  M 
Sbjct: 34  RFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRFLG--LRPDLIDSPVLKMP 91

Query: 121 --------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVS 172
                     N+ LP + DWR++ AVT  KDQ  CG  WAF+TTG IEG+    T +LVS
Sbjct: 92  RDSDIEEGFQNVDLPASVDWRQHGAVTAPKDQGSCGGCWAFATTGAIEGINQIVTGQLVS 151

Query: 173 LSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ 231
           LSEQELIDCD++ D GC+GG + NA+  I+    GGL+ E  YPY   +  C + K  ++
Sbjct: 152 LSEQELIDCDKKADKGCDGGLMENAYQFIVEN--GGLDTETDYPYHASESHCNMKKLNSR 209

Query: 232 -VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENL 288
            V I+GY ++   +       V   P++VAI   +   Q Y +GV      F     E +
Sbjct: 210 VVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGASKDFQHYASGV------FTGHCGEEI 263

Query: 289 SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS----CGIN 340
           +H VLIVGYG      T   + YWI+KNSW   WG+ G+ ++ R  G     C IN
Sbjct: 264 NHGVLIVGYG------TEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCSIN 313


>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
          Length = 378

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 119/303 (39%), Positives = 153/303 (50%), Gaps = 37/303 (12%)

Query: 58  EYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
           E   R ++F  N R I              LN+F+D++T EF+  Y G + +     RS+
Sbjct: 66  EARRRFNVFVENARYIHEANRRGGRPFRLALNKFADMTTDEFRRTYAGSRARHH---RSL 122

Query: 118 PAMIPNI------------TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAA 165
                               LP A DWRE  AVTG+KDQ  CGS WAFS    +EGV   
Sbjct: 123 RGGRGGEGGSFRYGGDDEDNLPPAVDWRERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKI 182

Query: 166 KTKKLVSLSEQELIDCDQEDD-GCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
           KT +LV+LSEQEL+DCD  D+ GC+GG +  AF  I  K  GG+  E  YPYR +   C 
Sbjct: 183 KTGRLVTLSEQELVDCDTGDNQGCDGGLMDYAFQFI--KRNGGITTESNYPYRAEQGRCN 240

Query: 225 LNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPIQFFC 281
             K ++  V I+GY  V  ++    +  V N P+AVA+ A     QFY  GV      F 
Sbjct: 241 KAKASSHDVTIDGYEDVPANDESALQKAVANQPVAVAVEASGQDFQFYSEGV------FT 294

Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-----DGS 336
                +L H V  VGYG+     T     YWI+KNSWGE WGE+GY R+ RG     +G 
Sbjct: 295 GECGTDLDHGVAAVGYGI-----TRDGTKYWIVKNSWGEDWGERGYIRMQRGVSSDSNGL 349

Query: 337 CGI 339
           CGI
Sbjct: 350 CGI 352


>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
 gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
          Length = 330

 Score =  187 bits (476), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 113/308 (36%), Positives = 171/308 (55%), Gaps = 26/308 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVY--GLNEFSDLSTAEFQA 101
           F  +H+K Y+   EY  RL IF  NL+ I+   Q+ + G   Y  G+N+F+D++ AE+  
Sbjct: 27  FKIKHDKVYSEKEEYARRL-IFQDNLKTIESHNQEADTGKHSYWLGVNQFADMTHAEYLN 85

Query: 102 KYLGFKLKPS----YADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           + +G  L  S       R+    +PN+ +    DWR+   VT +KDQ  CGS WAFSTTG
Sbjct: 86  QVIGGCLITSNLTKTGSRATYRYMPNMQVNDTVDWRDKGLVTDIKDQGQCGSCWAFSTTG 145

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
           ++EG +A  T  LVSLSEQ L+DC +++   GCEGG +   F  I+     G++ E+ YP
Sbjct: 146 SLEGQHAKATGTLVSLSEQNLVDCSRQEGNKGCEGGDMDQGFQYIIQN--KGIDTEQCYP 203

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVEN-GPMAVAINA--YALQFYVTG 272
           Y+  +  C+ +       ++ +  V+  + D  K    N GP++V I+A   + QFY +G
Sbjct: 204 YKAKNHRCKFDNSCIGATMSSFTDVTSGDEDALKQACANIGPISVGIDASHQSFQFYSSG 263

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V +  +F C   +  L H VL+VGYG      T+ +  YW++KNSWG  WG +GY  + R
Sbjct: 264 VYN--EFEC--SSTKLDHGVLVVGYG------TYGSKDYWLVKNSWGTVWGNEGYIMMSR 313

Query: 333 G-DGSCGI 339
             D  CG+
Sbjct: 314 NKDNQCGV 321


>gi|356560855|ref|XP_003548702.1| PREDICTED: P34 probable thiol protease-like [Glycine max]
          Length = 357

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 122/357 (34%), Positives = 178/357 (49%), Gaps = 24/357 (6%)

Query: 4   FYFFAGVALLSLTVSVSSFMV---VGDEKLHHLHHVKHT-ALFNYFLEQHNKTYATLVEY 59
            +FF  + L+  + S S+F V   +    L  L     T  LF  + ++H   Y  L E 
Sbjct: 11  IFFFICITLICFSSS-SNFPVQYSILGPNLDKLPSQDETIQLFQLWRKEHGLVYKDLKEM 69

Query: 60  YSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKYLGFKLKPSYADRSV 117
             R  IF  NL  I            Y  GLN F+D S +EFQ  YL     P+ +   +
Sbjct: 70  AKRFEIFLSNLNYIIEFNAKRSSPSGYLLGLNNFADWSPSEFQEIYLHSLDMPTDSAPKL 129

Query: 118 PAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQE 177
              + +   P + DWR   AVT +K+Q  CGS WAFS  G IEG++A  T +L+SLSEQE
Sbjct: 130 NGPLLSCIAPASLDWRNKVAVTAIKNQGSCGSCWAFSAAGAIEGIHAITTGELISLSEQE 189

Query: 178 LIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKA-CRLNKKAT-QVKIN 235
           L++CD+   GC GG ++ AFD ++S   GG+  E  YPY G D   C  +K+   +  I+
Sbjct: 190 LVNCDRVSKGCNGGWVNKAFDWVISN--GGITLEAEYPYTGKDGGNCNSDKQVPIKATID 247

Query: 236 GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIV 295
           GY  V + +  +   +V+  P+++ +NA   Q Y +G+    Q  C   ++  +H VLIV
Sbjct: 248 GYEQVEQSDNGLLCSIVKQ-PISICLNATDFQLYESGIFDGQQ--CSSSSKYTNHCVLIV 304

Query: 296 GYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD----GSCGINDYVRSALV 348
           GY       +     YWI+KNSWG  WG  GY  + R      G CG+N +  +  +
Sbjct: 305 GYD------SSNGEDYWIVKNSWGTKWGINGYIWIKRNTGLPYGVCGMNAWAYNPTI 355


>gi|397504019|ref|XP_003822607.1| PREDICTED: cathepsin O [Pan paniscus]
          Length = 321

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
           R +  L  +E+ +  YG+N+FS L   EF+A YL  + KPS   R    V   IPN++LP
Sbjct: 52  RYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL--RSKPSKFPRYSAEVHMSIPNVSLP 109

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
             FDWR+   VT V++Q MCG  WAFS  G +E  YA K K L  LS Q++IDC   + G
Sbjct: 110 LRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 169

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
           C GGS  NA +  ++K+   L ++  YP++  +  C   +   +   I GY +   S  E
Sbjct: 170 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSNQE 228

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
            +MAK L+  GP+ V ++A + Q Y+ G+   IQ  C  G  N  H+VLI G+  D+T  
Sbjct: 229 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 281

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           T    PYWI++NSWG  WG  GY  +  G   CGI D V S  V
Sbjct: 282 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321


>gi|395735444|ref|XP_002815290.2| PREDICTED: cathepsin O [Pongo abelii]
          Length = 318

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
           R +  L  +E+ +  YG+N+FS L   EF+A YL  + KPS   R    V   IPN++LP
Sbjct: 49  RYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL--RSKPSKFPRYSAEVRMSIPNVSLP 106

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
             FDWR+   VT V++Q MCG  WAFS  G +E  YA K K L  LS Q++IDC   + G
Sbjct: 107 LRFDWRDKHVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 166

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
           C GGS  NA +  ++K+   L ++  YP++  +  C   +   +   I GY +   S  E
Sbjct: 167 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSNQE 225

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
            +MAK L+  GP+ V ++A + Q Y+ G+   IQ  C  G  N  H+VLI G+  D+T  
Sbjct: 226 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 278

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           T    PYWI++NSWG  WG  GY  +  G   CGI D V S  V
Sbjct: 279 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 318


>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
 gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
 gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 108/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
          Length = 319

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 122/313 (38%), Positives = 173/313 (55%), Gaps = 23/313 (7%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
           HNK Y    E + R+ ++  NL+ I+L  + +H  G +    G+N+F D++T EF+    
Sbjct: 17  HNKDYHEREESWRRV-VWEKNLKMIEL-HNLDHTLGKHSYKLGMNQFGDMTTEEFRQLMN 74

Query: 105 GFKLKPSYAD-RSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           G+  K S    R    + P+ +  PR+ DWRE   VT VKDQ  CGS WAFSTTG +EG 
Sbjct: 75  GYAHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQ 134

Query: 163 YAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-D 219
           +  KT KLVSLSEQ L+DC + +   GC GG +  AF  +     GG++ E++YPY   D
Sbjct: 135 HFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDN--GGIDSEESYPYTAKD 192

Query: 220 DKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHP 276
           D+ CR   +       G+V + +  E  + K +   GP++VAI+A   + QFY +G    
Sbjct: 193 DEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSG---- 248

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DG 335
           I +  D  +E+L H VL+VGYG +      K   YWI+KNSWGE WG+KGY  + +    
Sbjct: 249 IYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKK--YWIVKNSWGEKWGDKGYIYMAKDRKN 306

Query: 336 SCGINDYVRSALV 348
            CGI       LV
Sbjct: 307 HCGIATAASYPLV 319


>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
 gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
           tropicalis]
 gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
 gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
          Length = 335

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 116/310 (37%), Positives = 171/310 (55%), Gaps = 23/310 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTA 97
           +N +  QH K+Y   VE   R+ I+  NLRKI+   + E+  G +    G+N+F D++  
Sbjct: 28  WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQ-HNFEYSYGNHTFKMGMNQFGDMTNE 85

Query: 98  EFQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           EF+    G+K  P+   +    M P+    P+  DWR+   VT VKDQ  CGS W+FS+T
Sbjct: 86  EFRQAMNGYKHDPNRTSQGPLFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSST 145

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           G +EG    KT KL+S+SEQ L+DC   Q + GC GG +  AF  +  K   GL+ E++Y
Sbjct: 146 GALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYV--KENKGLDSEQSY 203

Query: 215 PYRG-DDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYV 270
           PY   DD  CR + +    KI G+V + R +E  +   +   GP++VAI+A   +LQFY 
Sbjct: 204 PYLARDDLPCRYDPRFNVAKITGFVDIPRGNELALMNAVAAVGPVSVAIDASHQSLQFYQ 263

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +G+     ++       L H+VL+VGYG            YWI+KNSW + WG+KGY  +
Sbjct: 264 SGI-----YYERACTSRLDHAVLVVGYGYQGADVAGNR--YWIVKNSWSDKWGDKGYIYM 316

Query: 331 YRG-DGSCGI 339
            +  +  CGI
Sbjct: 317 AKDKNNHCGI 326


>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
          Length = 448

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 122/321 (38%), Positives = 168/321 (52%), Gaps = 45/321 (14%)

Query: 36  VKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEF 91
           V    LF+ F  + NK Y +  E   R  +FS N+  I    + E   GV+     +N+F
Sbjct: 24  VNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINR-HNAEAARGVHTHTVDVNQF 82

Query: 92  SDLSTAEFQAKYL--------GFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKD 143
           +DL+  E++  YL        G + +  + D       PN     + DWR+  AVT +K+
Sbjct: 83  ADLTNEEYRQLYLRPYPTELLGRERQEVWLDG------PNAG---SVDWRQKGAVTPIKN 133

Query: 144 QTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIM 201
           Q  CGS W+FSTTG++EG +A  T  LVSLSEQ+L+DC     + GC GG + NAF  I+
Sbjct: 134 QGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYII 193

Query: 202 SKLGGGLEEEKTYPYRGDDKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVA 260
           S   GGL+ E+ YPY   D  C  +K++   V I+GY  V ++  D     VE GP++VA
Sbjct: 194 SN--GGLDTEQDYPYTARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVA 251

Query: 261 INA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSW 318
           I A   + Q Y +GV      F      NL H VL+VGY  D          YWI+KNSW
Sbjct: 252 IEADQQSFQMYSSGV------FSGPCGTNLDHGVLVVGYTSD----------YWIVKNSW 295

Query: 319 GEGWGEKGYFRLYRGDGSCGI 339
           G  WG++GY  + RG  S GI
Sbjct: 296 GASWGDQGYIMMKRGVSSAGI 316


>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
          Length = 388

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 123/308 (39%), Positives = 167/308 (54%), Gaps = 27/308 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLSTAEFQA 101
           F EQH K Y         +  F  NL +I+      + G   +  G N  +DL   E++ 
Sbjct: 86  FKEQHGKNYEDEETENDHMLAFLSNLEEIRKHNARYQRGESSFEMGTNHITDLPFEEYR- 144

Query: 102 KYLGFKLKPSYAD---RSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           K  G+K  P Y D        ++P NI +P  +DWR++  VT VK+Q MCGS WAFS TG
Sbjct: 145 KLNGYK--PRYDDSHRNGTKFLVPFNINVPGHWDWRDHGYVTEVKNQGMCGSCWAFSATG 202

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
            +EG +  K   LVSLSEQ L+DC ++  ++GC GG +  AF+ I  K   G++ E +YP
Sbjct: 203 ALEGQHKRKIGSLVSLSEQNLVDCSRKYGNNGCNGGLMDYAFEYI--KDNHGVDTEASYP 260

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTG 272
           Y+G +  C  NKK    +  GYV +   DE  +   +   GP++VAI+A   + Q Y  G
Sbjct: 261 YKGKEMKCHFNKKTVGAEDEGYVDLPEGDEEKLKIAVATQGPISVAIDAGHPSFQMYRKG 320

Query: 273 VSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYR 332
           V +  Q  C   +E+L H VL+VGYG D          YWI+KNSWG GWGEKGY R+ R
Sbjct: 321 VYYEPQ--C--SSESLDHGVLVVGYGTDEIDGD-----YWIVKNSWGPGWGEKGYVRIAR 371

Query: 333 G-DGSCGI 339
             D  CGI
Sbjct: 372 NRDNHCGI 379


>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
 gi|194689328|gb|ACF78748.1| unknown [Zea mays]
 gi|219886279|gb|ACL53514.1| unknown [Zea mays]
 gi|238010470|gb|ACR36270.1| unknown [Zea mays]
 gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
          Length = 354

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 109/304 (35%), Positives = 151/304 (49%), Gaps = 21/304 (6%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAK 102
           ++ +H +TY    E   R  +F  N   +            Y   LNEF+D++  EF A 
Sbjct: 54  WMAEHGRTYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAM 113

Query: 103 YLGFKLKPSYADRSVPAMIPNITLPRA------FDWREYDAVTGVKDQTMCGSSWAFSTT 156
           Y G +  P+ A +       N+TL  A       DWR+  AVTG+K+Q  CG  WAF+  
Sbjct: 114 YTGLRPVPAGAKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAV 173

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYP 215
             +EG++   T  LVSLSEQ+++DCD + ++GC GG I NAF  I+    GGL  E  YP
Sbjct: 174 AAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIVGN--GGLGTEDAYP 231

Query: 216 YRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
           Y      C+  +      I+GY  V   +       V N P++VAI+A+  Q Y  GV  
Sbjct: 232 YTAAQAMCQSVQPV--AAISGYQDVPSGDEAALAAAVANQPVSVAIDAHNFQLYGGGVMT 289

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                      NL+H+V  VGYG           PYW++KN WG+ WGE GY RL RG  
Sbjct: 290 AASCSTP---PNLNHAVTAVGYGT-----AEDGTPYWLLKNQWGQNWGEGGYLRLERGAN 341

Query: 336 SCGI 339
           +CG+
Sbjct: 342 ACGV 345


>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
 gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
 gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
 gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
          Length = 342

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 122/310 (39%), Positives = 165/310 (53%), Gaps = 31/310 (10%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ Q  K+Y    E   R  IF  N+  I+L     +      +N F+DL+  EF+A   
Sbjct: 40  WMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFNAVGNKPFNLSINHFADLTNEEFKASLN 99

Query: 105 GFKL---KPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
           G K    K    + +      N+T +P + DWR+  AVT +K+Q  CGS WAFST  +IE
Sbjct: 100 GNKKLHDKFDILNETTSFRYHNVTSVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIE 159

Query: 161 GVYAAKTKKLVSLSEQELIDCDQ-EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           G++   T +LVSLSEQELIDC +    GC GG + +AF  I  K  GG+  E  YPY+  
Sbjct: 160 GIHQITTGELVSLSEQELIDCVRGNSSGCSGGYLEDAFKFIAKK--GGMASETNYPYKET 217

Query: 220 DKACRLNKKATQV-KINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSH 275
           D+ C+  K++  V +I GY  V S  E D+ K  V N P++V ++A  Y  QFY  G+  
Sbjct: 218 DEKCKFKKESKHVAEIKGYEKVPSNSENDLLK-AVANQPVSVYVDAGDYVFQFYSGGI-- 274

Query: 276 PIQFFCDGGNENLSHSVLIVGYGV--DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG 333
               F      +  H V IVGYGV  D T+       YW++KNSWG GWGEKGY +L R 
Sbjct: 275 ----FTGKCGTDTDHVVTIVGYGVSLDYTE-------YWLVKNSWGTGWGEKGYMKLKRN 323

Query: 334 ----DGSCGI 339
                G CGI
Sbjct: 324 VDSKKGLCGI 333


>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
          Length = 359

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 123/343 (35%), Positives = 170/343 (49%), Gaps = 29/343 (8%)

Query: 12  LLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYF--LEQHNKTYATLVEYYSRLHIFSGN 69
           LL+L V+++   V      +        +L+  +     H+     L E   R ++F  N
Sbjct: 7   LLALVVALAFVGVARTIPFNEKDLASEESLWGLYERWRSHHTVSRDLSEKNKRFNVFKEN 66

Query: 70  LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA-----MIPNI 124
            + I      +    + GLN+F+D++  EF++ Y G K+      R  P      M  N+
Sbjct: 67  AKFIHEFNKKDAPYKL-GLNKFADMTNQEFRSTYAGSKIHHHRTQRGTPRATGSFMYENV 125

Query: 125 -TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD- 182
            ++P + DWR   AV  VKDQ  CGS WAFST  ++EG+   KT +LV LS Q+L+DCD 
Sbjct: 126 HSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKTNQLVPLSGQQLVDCDT 185

Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR 242
            +++GC GG +  AF+ I S   GG+  E  YPY  +  +C     A  V I+GY  V  
Sbjct: 186 DQNEGCNGGLMDYAFEFIKSN--GGITSESAYPYTAEQGSCASESSAPVVTIDGYEDVPA 243

Query: 243 DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVD 300
           +        V N  ++VAI A   A QFY  GV     F    GNE L H V +VGYG  
Sbjct: 244 NNEAALMKAVANQVVSVAIEASGMAFQFYSEGV-----FTGSCGNE-LDHGVAVVGYGAT 297

Query: 301 RTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           R         YWI++NSWG  WGEKGY R+ RG     G CGI
Sbjct: 298 R-----DGTKYWIVRNSWGAEWGEKGYIRMQRGIRARHGLCGI 335


>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
          Length = 353

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 121/313 (38%), Positives = 173/313 (55%), Gaps = 23/313 (7%)

Query: 49  HNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYL 104
           H+K Y    E + R+ ++  NL+ I+L  + +H  G +    G+N+F D++  EF+    
Sbjct: 51  HSKDYHEREESWRRV-VWEKNLKMIEL-HNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMN 108

Query: 105 GFKLKPSYAD-RSVPAMIPN-ITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGV 162
           G+K K S    R    + P+ +  PR+ DWRE   VT VKDQ  CGS WAFSTTG +EG 
Sbjct: 109 GYKHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQ 168

Query: 163 YAAKTKKLVSLSEQELIDCDQED--DGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG-D 219
           +  KT KLVSLSEQ L+DC + +   GC GG +  AF  +     GG++ E++YPY   D
Sbjct: 169 HFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDN--GGIDSEESYPYTAKD 226

Query: 220 DKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVTGVSHP 276
           D+ CR   +       G+V + +  E  + K +   GP++VAI+A   + QFY +G    
Sbjct: 227 DEDCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSG---- 282

Query: 277 IQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DG 335
           I +  D  +E+L H VL+VGYG +      K   YWI+KNSWGE WG+KGY  + +    
Sbjct: 283 IYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKK--YWIVKNSWGEKWGDKGYIYMAKDRKN 340

Query: 336 SCGINDYVRSALV 348
            CGI       LV
Sbjct: 341 HCGIATAASYPLV 353


>gi|45822201|emb|CAE47497.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
          Length = 315

 Score =  187 bits (476), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 118/303 (38%), Positives = 162/303 (53%), Gaps = 29/303 (9%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT-EHGSGVY--GLNEFSDLSTAEFQA 101
           F   HNK+Y  ++E   R  +F  NL+KI+      E G   Y   +N+F+D S+AEFQA
Sbjct: 27  FKATHNKSY-NVIEDKLRFAVFQDNLKKIEEHNAKYESGEETYYLAVNKFADWSSAEFQA 85

Query: 102 ---KYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGN 158
              + +  K K S+  + V    PN+      DWR+  AV GVKDQ  CGS WAFSTTG+
Sbjct: 86  MLARQMANKPKQSFIAKHVAD--PNVQAVEEVDWRD-SAVLGVKDQGQCGSCWAFSTTGS 142

Query: 159 IEGVYAAKTKKLVSLSEQELIDCD-QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           +EG  A    + V LSEQEL+DCD   + GC GG +++AF+ +      GL  E  Y Y 
Sbjct: 143 LEGQLAIHKNQRVPLSEQELVDCDTSRNAGCNGGLMTDAFNYVKRH---GLSSESQYAYT 199

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPI 277
           G D  C+  +      I+GYV +   E  +A  +   GP+++A++A   Q Y  G+    
Sbjct: 200 GRDDRCKNVENKPLSSISGYVELETTEDALASAVASVGPVSIAVDADTWQLYGGGL---- 255

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSC 337
            F       NL+H VL VGY  D           +I+KNSWG  WGE+GY R+ RG+  C
Sbjct: 256 -FNNKNCRTNLNHGVLAVGYTKDA----------FIVKNSWGTSWGEQGYIRVARGENLC 304

Query: 338 GIN 340
           GIN
Sbjct: 305 GIN 307


>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
 gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
          Length = 335

 Score =  187 bits (476), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 115/310 (37%), Positives = 171/310 (55%), Gaps = 23/310 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY----GLNEFSDLSTA 97
           +N +  QH K+Y   VE   R+ I+  NLRKI+   + E+  G +    G+N+F D++  
Sbjct: 28  WNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIEQ-HNFEYSYGNHTFKMGMNQFGDMTNE 85

Query: 98  EFQAKYLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           EF+    G+K  P+   +    M P+    P+  DWR+   VT VKDQ  CGS W+FS+T
Sbjct: 86  EFRQAMNGYKQDPNRTSKGALFMEPSFFAAPQQVDWRQRGYVTPVKDQKQCGSCWSFSST 145

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           G +EG    KT KL+S+SEQ L+DC   Q + GC GG +  AF  +  K   GL+ E++Y
Sbjct: 146 GALEGQLFRKTGKLISMSEQNLVDCSRPQGNQGCNGGIMDQAFQYV--KENKGLDSEQSY 203

Query: 215 PYRG-DDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYV 270
           PY   DD  CR + +    KI G+V + + +E  +   +   GP++VAI+A   +LQFY 
Sbjct: 204 PYLARDDLPCRYDPRFNVAKITGFVDIPKGNELALMNAVAAVGPVSVAIDASHQSLQFYQ 263

Query: 271 TGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRL 330
           +G+     ++       L H+VL+VGYG            YWI+KNSW + WG+KGY  +
Sbjct: 264 SGI-----YYERACTSRLDHAVLVVGYGYQGADVAGNR--YWIVKNSWSDKWGDKGYIYM 316

Query: 331 YRG-DGSCGI 339
            +  +  CGI
Sbjct: 317 AKDKNNHCGI 326


>gi|328876826|gb|EGG25189.1| hypothetical protein DFA_03437 [Dictyostelium fasciculatum]
          Length = 341

 Score =  187 bits (476), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 111/308 (36%), Positives = 155/308 (50%), Gaps = 15/308 (4%)

Query: 38  HTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTA 97
           +T  F  ++ +HNK Y    E+Y RL  F  N+  I+ +      +  +GLN+FSDLS  
Sbjct: 28  YTTRFKTWMVEHNKMYHEEEEFYLRLSNFIRNIHSIEKMNRQYGRTATFGLNKFSDLSLD 87

Query: 98  EFQAKYLGFKLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           EF+  YL    KP           P+  +P   DWR    VT VK+Q MCGS WAFS T 
Sbjct: 88  EFKKHYLMPNYKPKARVTKETFNYPS-NIPATLDWRTKGYVTPVKNQLMCGSCWAFSATE 146

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
            IE        ++  LSEQ+++DCD  D GC GG    A+  + +   GGL    TYPY 
Sbjct: 147 QIETANIMAGGQVEYLSEQQIVDCDPYDGGCGGGDPYTAYQYVQNN--GGLTLNVTYPYT 204

Query: 218 GDDKACRLNKKATQVKIN--GYVSVSRDETDMAKYLVENGPMAVAINAYALQFYVTGVSH 275
             + AC  N  A  V++   GY S   +ET + + +   GP+++ +NA     Y +G+  
Sbjct: 205 AANGACYANSTAPAVQVTAFGYASSQGNETQLREAMAARGPLSICVNAEPWMSYQSGI-- 262

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
               F    +++L H V IVGY  D T  T    PY+I++NSWG  WG  GY  +  G  
Sbjct: 263 ----FSSTCSDDLDHCVQIVGYDTDATSKT----PYFIVRNSWGTDWGLLGYIYIQAGSN 314

Query: 336 SCGINDYV 343
            CGI + V
Sbjct: 315 LCGITNEV 322


>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
          Length = 344

 Score =  187 bits (476), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 133/365 (36%), Positives = 182/365 (49%), Gaps = 46/365 (12%)

Query: 6   FFAGVALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHI 65
           F   V+ L+   +VS F +V +E             +N F  QH K Y +  E   R+ I
Sbjct: 4   FLLLVSFLAAANAVSIFNLVKEE-------------WNAFKLQHRKKYDSESEERIRMKI 50

Query: 66  FSGNLRKI-QLLQDTEHGSGVYGL--NEFSDLSTAEFQAKYLGFKLKPSYADR------- 115
           +  N  KI +  Q  + G   + L  N+++DL   EF     GF    +   +       
Sbjct: 51  YVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSAAAGSKLLGREQL 110

Query: 116 -----SVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKK 169
                 +  + P N+ +P   DWRE  AVT VKDQ  CGS W+FS TG +EG +  KT K
Sbjct: 111 MTIEEPITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGK 170

Query: 170 LVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNK 227
           LVSLSEQ L+DC  +  ++GC GG + NAF  +  K   G++ EK YPY   D  C  N 
Sbjct: 171 LVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYV--KDNKGIDTEKAYPYEAIDDECHYNP 228

Query: 228 KATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINAY--ALQFYVTGVSHPIQFFCDGG 284
           KA      G+V + + DE  + K L   GP++VAI+A   + QFY  GV +  Q  CD  
Sbjct: 229 KAIGATDKGFVDIPQGDEKALKKALATVGPVSVAIDASHESFQFYSEGVYYEPQ--CD-- 284

Query: 285 NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-DGSCGINDYV 343
           +E L H VL VGYG      T     YW++KNSWG  WG++GY ++ R  +  CGI    
Sbjct: 285 SEQLDHGVLAVGYGT-----TEDGEDYWLVKNSWGTTWGDQGYVKMARNRENHCGIATTA 339

Query: 344 RSALV 348
              LV
Sbjct: 340 SYPLV 344


>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
 gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
          Length = 371

 Score =  187 bits (476), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 129/332 (38%), Positives = 174/332 (52%), Gaps = 29/332 (8%)

Query: 22  FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
           F +VG      +HH +   LF  ++ ++ K YA+  E   R  +F  NL  I    + + 
Sbjct: 46  FSIVGYSPEDLVHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEA-NKKV 104

Query: 82  GSGVYGLNEFSDLSTAEFQAKYLGFK---LKPSYADRSVPAMIPNITLPRAFDWREYDAV 138
            +   GLN F+DL+  EF+A YLG +    K +   R     + +  +P + DWR+  AV
Sbjct: 105 TTYWLGLNAFADLTHDEFKATYLGLRQPETKKTTDSRFRYGGVADDDVPASVDWRKKGAV 164

Query: 139 TGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAF 197
           T VK+Q  CGS WAFST   +EG+    T  L SLSEQEL+DC  + ++GC GG + NAF
Sbjct: 165 TDVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAF 224

Query: 198 DTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQ--VKINGYVSV-SRDETDMAKYLVEN 254
             I S   GGL  E+ YPY  ++  C    +  +  V I+GY  V + DE  + K L   
Sbjct: 225 SYIASS--GGLRTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQ 282

Query: 255 GPMAVAINAYA--LQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYW 312
            P++VAI A     QFY  GV     F    G+E L H V  VGYG      + K   Y 
Sbjct: 283 -PLSVAIEASGRHFQFYSGGV-----FNGPCGSE-LDHGVAAVGYG------SSKGQDYI 329

Query: 313 IIKNSWGEGWGEKGYFRLYRG----DGSCGIN 340
           I+KNSWG  WGEKGY R+ RG    +G CGIN
Sbjct: 330 IVKNSWGSHWGEKGYIRMKRGTGKPEGLCGIN 361


>gi|213512938|ref|NP_001133871.1| Cathepsin K precursor [Salmo salar]
 gi|209155648|gb|ACI34056.1| Cathepsin K precursor [Salmo salar]
 gi|223647252|gb|ACN10384.1| Cathepsin K precursor [Salmo salar]
 gi|223673129|gb|ACN12746.1| Cathepsin K precursor [Salmo salar]
          Length = 331

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 116/329 (35%), Positives = 178/329 (54%), Gaps = 25/329 (7%)

Query: 22  FMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEH 81
            + +G    H L+ +   A ++ +   H + Y  L E   R  I+  N+R I+   + E 
Sbjct: 8   LLFLGSVLAHPLNEMSLDAQWDSWKTTHLREYNGLGEEVIRRTIWEKNMRLIEA-HNEEA 66

Query: 82  GSGVY----GLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPAMIPN---ITLPRAFDWRE 134
             G++    G+N   D+++ E   K  G ++ P   DRS    IP+   + +PR+ D+R+
Sbjct: 67  ALGIHSYELGMNHLGDMTSEEIAEKLTGLQV-PMNRDRS-NTWIPDNNVVKIPRSIDYRK 124

Query: 135 YDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSIS 194
              VT VK+Q  CGS WAFS+ G +EG  A  T KL+ LS Q L+DC  E++GC GG ++
Sbjct: 125 KGMVTPVKNQLSCGSCWAFSSAGALEGQLAKTTGKLIDLSPQNLVDCVTENNGCGGGYMT 184

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVE 253
           NAF+ +     GG++ E+ YPY G D  C  N      +  G+  +   DE  + K +V+
Sbjct: 185 NAFEYVEEN--GGIDTEEAYPYLGQDGQCAYNASGMGAQCRGFKEIPEGDEWALTKAVVK 242

Query: 254 NGPMAVAINAY--ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPY 311
            GP+AV I+A     QFY  GV +     C+   ++++H+VL VGYG      T K + +
Sbjct: 243 VGPVAVGIDATLSTFQFYQRGVYYDPN--CN--KDDINHAVLAVGYGQ-----TAKGMKF 293

Query: 312 WIIKNSWGEGWGEKGYFRLYRGDG-SCGI 339
           WI+KNSW E WG++GY  + R  G +CGI
Sbjct: 294 WIVKNSWSESWGKQGYIMMARNRGNACGI 322


>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
           cycling base population CrGC5, Peptide, 328 aa]
          Length = 328

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 122/324 (37%), Positives = 171/324 (52%), Gaps = 42/324 (12%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLL-QDTEHGSGVYGLNEF 91
           L H K  +  N  + Q ++          R +IF  NLR I L  ++ ++ +   GL  F
Sbjct: 9   LEHGKSNSNSNGIINQQDE----------RFNIFKDNLRFIDLHNENNKNATYKLGLTIF 58

Query: 92  SDLSTAEFQAKYLGFKLKP-------SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQ 144
           ++L+  E+++ YLG + +P          +    A + ++ +P   DWR+  AV  +KDQ
Sbjct: 59  ANLTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQ 118

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSK 203
             CGS WAFST   +EG+    T +LVSLSEQEL+DCD+  + GC GG +  AF  IM  
Sbjct: 119 GTCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKN 178

Query: 204 LGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAI 261
             GGL  EK YPY G +  C  L K +  V I+GY  V S+DET + K  V   P++VAI
Sbjct: 179 --GGLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETAL-KRAVSYQPVSVAI 235

Query: 262 NA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWG 319
           +A   A Q Y +G+      F      N+ H+V+ VGYG      +   V YWI++NSWG
Sbjct: 236 DAGGRAFQHYQSGI------FTGKCGTNMDHAVVAVGYG------SENGVDYWIVRNSWG 283

Query: 320 EGWGEKGYFRLYRG----DGSCGI 339
             WGE GY R+ R      G CGI
Sbjct: 284 TRWGEDGYIRMERNVASKSGKCGI 307


>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
 gi|255636658|gb|ACU18666.1| unknown [Glycine max]
          Length = 367

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 120/316 (37%), Positives = 166/316 (52%), Gaps = 35/316 (11%)

Query: 40  ALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEF 99
           +++  +L +H K Y  + E   R  IF  NL  I+          V GLN FSDLS  E+
Sbjct: 50  SIYEEWLVKHGKVYNAVEEKEKRFQIFKDNLNFIEEHNAVNRTYKV-GLNRFSDLSNEEY 108

Query: 100 QAKYLGFKLKPSY-----ADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFS 154
           ++KYLG K+ PS      + R  P +  N  LP + DWR+  AV  VK+Q+ C   WAFS
Sbjct: 109 RSKYLGTKIDPSRMMARPSRRYSPRVADN--LPESVDWRKEGAVVRVKNQSECEGCWAFS 166

Query: 155 TTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
               +EG+    T  L +LSEQEL+DCD+  + GC GG +  AF+ I++   GG++ E+ 
Sbjct: 167 AIAAVEGINKIVTGNLTALSEQELLDCDRTVNAGCSGGLVDYAFEFIINN--GGIDTEED 224

Query: 214 YPYRGDDKAC---RLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQF 268
           YP++G D  C   ++N +A  V I+GY  V   +    K  V N P++VAI AY    Q 
Sbjct: 225 YPFQGADGICDQYKINARA--VTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQL 282

Query: 269 YVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYF 328
           Y +G+      F      ++ H V  VGYG      T   + YWI+KNSWGE WGE GY 
Sbjct: 283 YESGI------FTGTCGTSIDHGVTAVGYG------TENGIDYWIVKNSWGENWGEAGYV 330

Query: 329 RLYRG-----DGSCGI 339
            + R       G CGI
Sbjct: 331 GMERNIAEDTAGKCGI 346


>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 108/300 (36%), Positives = 157/300 (52%), Gaps = 24/300 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326


>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
 gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 108/300 (36%), Positives = 157/300 (52%), Gaps = 24/300 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326


>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
          Length = 467

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 125/346 (36%), Positives = 179/346 (51%), Gaps = 33/346 (9%)

Query: 12  LLSLTVSVSSFM-VVGDEKLHHLHHVKHT-----ALFNYFLEQHNKTYATLVEYYSRLHI 65
            LS T+S +S M ++  ++ H       T     A++  +L +  K Y  L E   R  +
Sbjct: 16  FLSFTLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGEREKRFQV 75

Query: 66  FSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFK--LKPSYADRSVPAMIPN 123
           F  NLR I    ++E+ +   GLN F+DL+  E+++ YLG +  +K +   ++     P 
Sbjct: 76  FKDNLRFIDE-HNSENRTYKLGLNGFADLTNEEYRSTYLGARGGMKRNRLRKTSDRYAPR 134

Query: 124 I--TLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDC 181
           +  +LP + DWR+  AV  VKDQ  CGS WAFST   +EG+    T  L+SLSEQEL+DC
Sbjct: 135 VGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDC 194

Query: 182 DQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRL-NKKATQVKINGYVS 239
           D   ++GC GG +  AF+ I++   GG++ E+ YPY   D  C    K A  V I+ Y  
Sbjct: 195 DTSYNEGCNGGLMDYAFEFIINN--GGIDTEEDYPYLARDGRCDTYRKNAKVVTIDDYED 252

Query: 240 VSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
           V  +     +  V N P++VAI A     QFY +G+      F       L H V  VGY
Sbjct: 253 VPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGI------FSGRCGTQLDHGVAAVGY 306

Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           G +  K       YWI++NSWG+ WGE GY R+ R      G CGI
Sbjct: 307 GTENGK------DYWIVRNSWGKSWGENGYLRMARSINSPTGICGI 346


>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
          Length = 341

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 121/317 (38%), Positives = 158/317 (49%), Gaps = 24/317 (7%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           LH          ++ Q+ + Y    E   R  IF  N+ +I+        S    +NEF+
Sbjct: 30  LHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFA 89

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSW 151
           DL+  EF+A    FK     +  +      N+T +P   DWR+  AVT +KDQ  CGS W
Sbjct: 90  DLTNEEFRASRNRFKAHIC-STEATSFKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCW 148

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLE 209
           AFS    +EG+    T KL+SLSEQEL+DCD   ED GC GG + +AF  I  +   GL 
Sbjct: 149 AFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFI--EQNHGLT 206

Query: 210 EEKTYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--L 266
            E  YPY G D  C   K A    KINGY  V  +     +  V + P+AVAI+A     
Sbjct: 207 TEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEF 266

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           QFY +GV     F    G E L H V  VGYG      +   + YW++KNSW  GWGE+G
Sbjct: 267 QFYSSGV-----FTGQCGTE-LDHGVAAVGYGT-----SDDGMKYWLVKNSWSTGWGEEG 315

Query: 327 YFRLYRG----DGSCGI 339
           Y R+ R     +G CGI
Sbjct: 316 YIRMQRDVTAKEGLCGI 332


>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPVSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|340504799|gb|EGR31212.1| papain family cysteine protease, putative [Ichthyophthirius
           multifiliis]
          Length = 250

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 99/227 (43%), Positives = 132/227 (58%), Gaps = 14/227 (6%)

Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
           N  LP  FDWRE   +T VK Q  CG  W F+TTG IE  YA K  KLV+ SEQ+LIDCD
Sbjct: 36  NQVLPSYFDWREQGIITPVKYQDTCGGCWTFATTGVIESQYALKYNKLVNFSEQQLIDCD 95

Query: 183 QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY-PYRGDDKACRLNKKATQVKINGYVSVS 241
             +DGC GG +++A+  I     GGLE  + Y  Y      C+++      K+  +  +S
Sbjct: 96  SINDGCRGGLMTDAYKAIQEM--GGLETSEDYGEYLNSKGQCKIDSNKVSAKVINWYQIS 153

Query: 242 RDETDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDR 301
            DE  + + LV+NGP+AV +NA  LQFY  G+  P    CD   ++++H+VLIVGYG + 
Sbjct: 154 EDEEAIRRELVQNGPIAVGVNARFLQFYQGGILDPK--LCD---DSINHAVLIVGYGEEN 208

Query: 302 TKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
            K       YWIIKN WG+ WG  GYF+L RG   CG++ Y   A +
Sbjct: 209 GK------KYWIIKNQWGKSWGINGYFKLVRGKKQCGVHTYASIAFI 249


>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 108/300 (36%), Positives = 157/300 (52%), Gaps = 24/300 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFI--KENGGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326


>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC+GG ++NAFD I+    GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFIIEN--GGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G    CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GQQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
          Length = 344

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
 gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
 gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
 gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
          Length = 340

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 116/304 (38%), Positives = 156/304 (51%), Gaps = 24/304 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H K Y   +E   R  IF  N+  I+     ++      +N  +DL+  EF+A   
Sbjct: 43  WMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHLADLTLDEFKASRN 102

Query: 105 GFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVY 163
           G+K K      +      N+T +P A DWR   AVT +KDQ  CGS WAFST    EG+ 
Sbjct: 103 GYK-KIDREFTTTSFKYENVTAIPAAVDWRVKGAVTPIKDQGQCGSCWAFSTVAATEGIN 161

Query: 164 AAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDK 221
              T KLVSLSEQEL+DCD   ED GCEGG + + F+ I+    GG+  E  YPY+  D 
Sbjct: 162 QITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKN--GGITSETNYPYKAADG 219

Query: 222 ACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQF 279
           +C         KI GY  V  +        V N P++V+I+A   +  FY +G+     +
Sbjct: 220 SCNTATTTPVAKITGYEKVPVNSEKSLLKAVANQPISVSIDASDSSFMFYSSGI-----Y 274

Query: 280 FCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG----DG 335
             + G E L H V  VGYG      +     YWI+KNSWG  WGEKGY R+ RG    +G
Sbjct: 275 TGECGTE-LDHGVTAVGYG------SANGTDYWIVKNSWGTVWGEKGYIRMQRGIAAKEG 327

Query: 336 SCGI 339
            CGI
Sbjct: 328 LCGI 331


>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
          Length = 333

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 125/343 (36%), Positives = 180/343 (52%), Gaps = 35/343 (10%)

Query: 10  VALLSLTVSVSSFMVVGDEKLHHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGN 69
           +++L++     S  +  D KL+     +H  L+    E +NK Y+   E+  R   + GN
Sbjct: 4   ISVLAVLALAFSCTLAFDAKLN-----QHWKLWK---EANNKRYSDAEEHVRRA-TWEGN 54

Query: 70  LRKIQLLQDTEHGSGVY----GLNEFSDLSTAEFQAKYLGFKLK---PSYADRSVPAMIP 122
           L+K+Q   + +   GV+    G+N+++D++  EF     G+          DR   +   
Sbjct: 55  LQKVQE-HNLQADLGVHTYWLGMNKYADMTVTEFVKVMNGYNATMRGQRTQDRHTFSFNS 113

Query: 123 NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCD 182
            I LP   DWR+   VT VKDQ  CGS WAFSTTG +EG +  +T KLVSLSEQ L+DC 
Sbjct: 114 KIALPDTVDWRDKGYVTDVKDQGQCGSCWAFSTTGALEGQHFKQTGKLVSLSEQNLVDCS 173

Query: 183 --QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSV 240
             Q + GC GG +  AF+ I  K   G++ E +YPY   D  CR           G+  +
Sbjct: 174 GKQGNMGCNGGLMDQAFEYI--KENNGIDTEDSYPYEAVDNQCRFKAANVGATDTGFTDI 231

Query: 241 -SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGY 297
            S+DE+ + + +   GP++VAI+A   + Q Y  GV +  + FC      L H VL VGY
Sbjct: 232 TSKDESALQQAVATVGPISVAIDAGHTSFQLYKHGVYN--EPFC--SQTRLDHGVLAVGY 287

Query: 298 GVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGD-GSCGI 339
           G D  K       YW++KNSWGEGWG+KGY ++ R     CGI
Sbjct: 288 GTDSGK------DYWLVKNSWGEGWGDKGYIKMTRNKRNQCGI 324


>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
          Length = 341

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 121/317 (38%), Positives = 158/317 (49%), Gaps = 24/317 (7%)

Query: 33  LHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFS 92
           LH          ++ Q+ + Y    E   R  IF  N+ +I+        S    +NEF+
Sbjct: 30  LHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFA 89

Query: 93  DLSTAEFQAKYLGFKLKPSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSW 151
           DL+  EF+A    FK     +  +      N+T +P   DWR+  AVT +KDQ  CGS W
Sbjct: 90  DLTNEEFRASRNRFKAHIC-STEATSFKYENVTAVPSTVDWRKKGAVTPIKDQGQCGSCW 148

Query: 152 AFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQ--EDDGCEGGSISNAFDTIMSKLGGGLE 209
           AFS    +EG+    T KL+SLSEQEL+DCD   ED GC GG + +AF  I  +   GL 
Sbjct: 149 AFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFI--EQNHGLT 206

Query: 210 EEKTYPYRGDDKACRLNKKA-TQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--L 266
            E  YPY G D  C   K A    KINGY  V  +     +  V + P+AVAI+A     
Sbjct: 207 TEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEF 266

Query: 267 QFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKG 326
           QFY +GV     F    G E L H V  VGYG      +   + YW++KNSW  GWGE+G
Sbjct: 267 QFYSSGV-----FTGQCGTE-LDHGVAAVGYGT-----SDDGMKYWLVKNSWSTGWGEEG 315

Query: 327 YFRLYRG----DGSCGI 339
           Y R+ R     +G CGI
Sbjct: 316 YIRMQRDVTVKEGLCGI 332


>gi|4557501|ref|NP_001325.1| cathepsin O preproprotein [Homo sapiens]
 gi|1168795|sp|P43234.1|CATO_HUMAN RecName: Full=Cathepsin O; Flags: Precursor
 gi|574804|emb|CAA54562.1| cathepsin O [Homo sapiens]
 gi|29351630|gb|AAH49206.1| Cathepsin O [Homo sapiens]
 gi|312153238|gb|ADQ33131.1| cathepsin O [synthetic construct]
          Length = 321

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 113/284 (39%), Positives = 156/284 (54%), Gaps = 20/284 (7%)

Query: 71  RKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADR---SVPAMIPNITLP 127
           R +  L  +E+ +  YG+N+FS L   EF+A YL  + KPS   R    V   IPN++LP
Sbjct: 52  RYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYL--RSKPSKFPRYSAEVHMSIPNVSLP 109

Query: 128 RAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDG 187
             FDWR+   VT V++Q MCG  WAFS  G +E  YA K K L  LS Q++IDC   + G
Sbjct: 110 LRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAIKGKPLEDLSVQQVIDCSYNNYG 169

Query: 188 CEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR-LNKKATQVKINGYVS--VSRDE 244
           C GGS  NA +  ++K+   L ++  YP++  +  C   +   +   I GY +   S  E
Sbjct: 170 CNGGSTLNALN-WLNKMQVKLVKDSEYPFKAQNGLCHYFSGSHSGFSIKGYSAYDFSDQE 228

Query: 245 TDMAKYLVENGPMAVAINAYALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKF 304
            +MAK L+  GP+ V ++A + Q Y+ G+   IQ  C  G  N  H+VLI G+  D+T  
Sbjct: 229 DEMAKALLTFGPLVVIVDAVSWQDYLGGI---IQHHCSSGEAN--HAVLITGF--DKTGS 281

Query: 305 THKAVPYWIIKNSWGEGWGEKGYFRLYRGDGSCGINDYVRSALV 348
           T    PYWI++NSWG  WG  GY  +  G   CGI D V S  V
Sbjct: 282 T----PYWIVRNSWGSSWGVDGYAHVKMGSNVCGIADSVSSIFV 321


>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
          Length = 344

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 108/300 (36%), Positives = 157/300 (52%), Gaps = 24/300 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC+GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFI--KENGGISSESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G    CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GQQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326


>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
 gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
           lyrata]
          Length = 343

 Score =  187 bits (475), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 117/308 (37%), Positives = 161/308 (52%), Gaps = 26/308 (8%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQA 101
           F  +L+ H+K Y    E+  R  I+  N++ I  + ++ H       N F+D++ +EF+A
Sbjct: 43  FEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYI-NSLHLPFKLTDNRFADMTNSEFKA 101

Query: 102 KYLGFKLKP-SYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIE 160
            +LG          +  P   P   +P A DWR   AVT +++Q  CG  WAFS    IE
Sbjct: 102 HFLGLNTSSLRLHKKQRPVCDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIE 161

Query: 161 GVYAAKTKKLVSLSEQELIDCD--QEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRG 218
           G+   KT  LVSLSEQ+LIDCD    + GC GG +  AF+ I S   GGL  E  YPY G
Sbjct: 162 GINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMETAFEFIKSN--GGLTTETDYPYTG 219

Query: 219 DDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSH 275
            +  C   K K   V I GY  V+++E  + +      P++V I+A  +  Q Y +GV  
Sbjct: 220 IEGTCDQEKAKNKVVTIQGYQKVAQNEASL-QIAAAQQPVSVGIDAGGFIFQLYSSGV-- 276

Query: 276 PIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG-- 333
               F      NL+H V +VGYGV+  +       YWI+KNSWG GWGE+GY R+ RG  
Sbjct: 277 ----FTSYCGTNLNHGVTVVGYGVEGDQ------KYWIVKNSWGTGWGEEGYIRMERGIS 326

Query: 334 --DGSCGI 339
              G CGI
Sbjct: 327 EDTGKCGI 334


>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
          Length = 324

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 121/309 (39%), Positives = 171/309 (55%), Gaps = 26/309 (8%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQ---LLQDTEHGSGVYGLNEFSDLSTA 97
           ++  F   H+KTYAT  E   R  I+  +L  I    +  D    +   G+NE+ DL+  
Sbjct: 23  MWTLFKTTHSKTYATEAEDMRRF-IWERHLNMINQHNIEADLGKHTFSLGMNEYGDLTQH 81

Query: 98  EFQAKYLGFKLKPSYADRSVPAMIP-NITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTT 156
           E+ A   G+K+  S    S   + P N+ +P+  DWRE   VT VK+Q  CGS WAFS+T
Sbjct: 82  EY-AAMSGYKMAKSSVGSSF--LEPENLQVPKTVDWREKGYVTPVKNQGQCGSCWAFSST 138

Query: 157 GNIEGVYAAKTKKLVSLSEQELIDC--DQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTY 214
           G++EG    KT +L S+SEQ L+DC  D+ + GC GG + NAF  I   +  G++ EK+Y
Sbjct: 139 GSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGCSGGLMDNAFTYIKKNM--GIDSEKSY 196

Query: 215 PYRGDDKACRLNKKATQVKINGYVSVSR-DETDMAKYLVENGPMAVAINA--YALQFYVT 271
           PY   D  CR  K  +    +G+V +   DET +   +   GP++VAI+A   + QFY T
Sbjct: 197 PYEAVDGECRYKKSDSVTTDSGFVDIPHGDETALRTAVASVGPVSVAIDASHTSFQFYKT 256

Query: 272 GVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLY 331
           GV    +  C   +  L H VL+VGYGV+  +       YW++KNSWG  WGE GY +L 
Sbjct: 257 GVY--TEANCS--STQLDHGVLVVGYGVENGQ------DYWLVKNSWGASWGEAGYIKLA 306

Query: 332 RGDGS-CGI 339
           R  G+ CGI
Sbjct: 307 RNHGNQCGI 315


>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 107/301 (35%), Positives = 158/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYKVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 110/310 (35%), Positives = 162/310 (52%), Gaps = 25/310 (8%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H   Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGHVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYADRSVPAM-------IPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY   S  +        + +  +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC+GG ++NAFD I  K  GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFI--KENGGISSESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G+   CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GEQYTCRSQEKTAAVQISSYQVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S-CGINDYVR 344
           +  G+ D  +
Sbjct: 327 NPAGLCDIAK 336


>gi|340368362|ref|XP_003382721.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
          Length = 329

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 115/299 (38%), Positives = 171/299 (57%), Gaps = 21/299 (7%)

Query: 49  HNKTYATLVEYYSRLHIFSGN--LRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYLGF 106
           + K+Y TL E   R   +  N  L K       +HG  +  +N F DL++AEF + Y G+
Sbjct: 34  YGKSYLTLEEEKYRRDTWEENSLLIKTHNTDSDKHGYTLE-MNSFGDLTSAEFSSLYNGY 92

Query: 107 KLKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEGVYAAK 166
           +     +     + + N  +P + DWR+   VT VK+Q  CGS WAFSTTG++EG++A K
Sbjct: 93  RQNLETSGSVFSSSLRN-AMPSSLDWRDKKVVTDVKNQGKCGSCWAFSTTGSLEGLHALK 151

Query: 167 TKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDDKACR 224
           T  LVSLSEQ+L+DC  +  ++GC+GG++ +AF  I  K  GG + E++YPY   +++CR
Sbjct: 152 TGHLVSLSEQQLMDCSVKYGNNGCDGGNMRSAFQYI--KDAGGDDTEESYPYTAKNESCR 209

Query: 225 LNKKATQVKINGYVSV-SRDETDMAKYLVENGPMAVAINA--YALQFYVTGVSHPIQFFC 281
            + K       GYV + S DE  +   L E GP++VA++A     QFY  G+     + C
Sbjct: 210 FDPKKVGATDEGYVRIPSGDEVSLMHALYEVGPISVAMDAGLKTFQFYKKGIYS--DYLC 267

Query: 282 DGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDGS-CGI 339
              N +L+H V ++GYG      +    PYW++KNSWG+ WG  GYF L R  G+ CG+
Sbjct: 268 S--NTHLNHGVTLIGYGE-----SSDGSPYWLVKNSWGKDWGIDGYFMLARYVGNMCGV 319


>gi|330842703|ref|XP_003293312.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
 gi|325076376|gb|EGC30167.1| hypothetical protein DICPUDRAFT_41833 [Dictyostelium purpureum]
          Length = 352

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 111/315 (35%), Positives = 161/315 (51%), Gaps = 25/315 (7%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQ 100
           LF+++ +Q+ K Y T  E+  R   F  NL+KI+ L +   G   +G+N++SDLS  EF 
Sbjct: 38  LFHHWTKQNGKIYETSEEFEKRFSNFKTNLKKIENLNNLHKGKASFGMNKYSDLSEEEFS 97

Query: 101 AKYL--GFKLKPSYA-DRSVPAMIPNITLPRAF-------------DWREYDAVTGVKDQ 144
             YL   FK KP    D       P+  L   +             DWR    VT VKDQ
Sbjct: 98  NFYLMKNFKGKPEEERDYIKKPENPSSNLIGGYLNTDDGLKAMYQVDWRNKGLVTPVKDQ 157

Query: 145 TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKL 204
             CGS + FS T  IE  Y     K + LSEQ+ +DCD  D GC GG  +N ++ I+S  
Sbjct: 158 GQCGSCYIFSATEQIESEYIRAGHKAILLSEQQSVDCDTMDGGCGGGDPANVYNYIIS-- 215

Query: 205 GGGLEEEKTYPYRGDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAY 264
            GG+  EK YPY   D  C    +A  +    YV+ + DE  +   +  +GP+++ ++A 
Sbjct: 216 AGGVSTEKDYPYTAQDGTCFNTTRAVSITGFQYVTQNSDEDTLITTIANHGPVSICVDAS 275

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
             Q Y  G+         G  +N+ H V +VG  +D+T  ++  +PY+II+NSWG  WG+
Sbjct: 276 TWQSYTGGI------ITTGCEQNIDHCVQVVGLDIDKTDPSN-PIPYYIIRNSWGTSWGD 328

Query: 325 KGYFRLYRGDGSCGI 339
           KGY  + +G   CGI
Sbjct: 329 KGYIYVAQGSNLCGI 343


>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
 gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
 gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
          Length = 345

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 124/309 (40%), Positives = 164/309 (53%), Gaps = 30/309 (9%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAK 102
           ++  + K Y  L E  +RL IF  N+  I+   +  +   +Y  G+N+F+DL+  EF A 
Sbjct: 44  WMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNK-LYKLGINQFADLTNEEFIAS 102

Query: 103 YLGFK-LKPSYADRSVPAMIPNITLPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
              FK    S   ++      N ++P   DWR+  AVT VK+Q  CG  WAFS     EG
Sbjct: 103 RNKFKGHMCSSITKTSTFKYENASVPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEG 162

Query: 162 VYAAKTKKLVSLSEQELIDCDQE--DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGD 219
           ++   T KLVSLSEQEL+DCD +  D GCEGG + +AF  I+     GL  E  YPY+G 
Sbjct: 163 IHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNH--GLNTEAQYPYQGV 220

Query: 220 DKACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHP 276
           D  C  NK +   V I GY  V  +     +  V N P++VAI+A     QFY +GV   
Sbjct: 221 DGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGV--- 277

Query: 277 IQFFCDGGNENLSHSVLIVGYGV--DRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG- 333
             F    G E L H V  VGYGV  D TK       YW++KNSWG  WGE+GY ++ RG 
Sbjct: 278 --FTGSCGTE-LDHGVTAVGYGVGNDGTK-------YWLVKNSWGTDWGEEGYIKMQRGV 327

Query: 334 ---DGSCGI 339
              +G CGI
Sbjct: 328 DAAEGLCGI 336


>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
          Length = 300

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 124/306 (40%), Positives = 164/306 (53%), Gaps = 29/306 (9%)

Query: 46  LEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVY--GLNEFSDLSTAEFQAKY 103
           + +H K+Y +  E   R  +F  NL+ I    +T      Y  GLNEF+DLS  EF+ KY
Sbjct: 1   MSKHGKSYRSFEEKLHRFEVFQDNLKHID---ETNKKVSSYWLGLNEFADLSHEEFKRKY 57

Query: 104 LGFKLK-PSYADRSVPAMIPNIT-LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTGNIEG 161
           LG K++ P   D        ++  LP++ DWR+  AV  VK+Q  CGS WAFST   +EG
Sbjct: 58  LGLKIELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEG 117

Query: 162 VYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYRGDD 220
           +    T  L +LSEQELIDCD+  ++GC GG +  AF  I+S   GGL +E+ YPY  ++
Sbjct: 118 INQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISN--GGLRKEEDYPYVMEE 175

Query: 221 KACRLNKKATQ-VKINGYVSVSRDETDMAKYLVENGPMAVAINAYA--LQFYVTGVSHPI 277
             C   K+  + V I+GY  V  D        + N P++VAI A +   QFY  G+    
Sbjct: 176 GTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGI---- 231

Query: 278 QFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRG---- 333
            F    G E L H V  VGYG      T K V Y  +KNSWG  WGEKGY R+ R     
Sbjct: 232 -FNGHCGTE-LDHGVAAVGYG------TSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKP 283

Query: 334 DGSCGI 339
           +G CGI
Sbjct: 284 EGICGI 289


>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
          Length = 344

 Score =  187 bits (475), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 108/301 (35%), Positives = 159/301 (52%), Gaps = 24/301 (7%)

Query: 45  FLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHGSGVYGLNEFSDLSTAEFQAKYL 104
           ++ +H + Y   VE   R  IF  N++ I+ +    + S   G+NEF+D+++ EF AK+ 
Sbjct: 42  WMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFT 101

Query: 105 GFKLKPSYAD----RSVPAMIPNIT---LPRAFDWREYDAVTGVKDQTMCGSSWAFSTTG 157
           G  +  SY       S   +I +++   +P   DWRE  AVT VK Q  CG  WAFS  G
Sbjct: 102 GLNIPNSYLSPSPMSSTEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVG 161

Query: 158 NIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKTYPYR 217
           ++EG Y   T  L+  SEQEL+DC   + GC GG ++NAFD I+    GG+  E  Y Y 
Sbjct: 162 SLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIEN--GGISRESDYEYL 219

Query: 218 GDDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVTGVSHP 276
           G    CR  +K   V+I+ Y  V   ET + + + +  P+++ I A   LQFY  G    
Sbjct: 220 GQQYTCRSQEKTAAVQISSYKVVPEGETSLLQAVTKQ-PVSIGIAASQDLQFYAGGTY-- 276

Query: 277 IQFFCDGG-NENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGEKGYFRLYRGDG 335
                DG   + ++H+V  +GYG D      K   YW++KNSWG  WGE G+ ++ R  G
Sbjct: 277 -----DGSCADRINHAVTAIGYGTD-----EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 326

Query: 336 S 336
           +
Sbjct: 327 N 327


>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
          Length = 565

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 126/320 (39%), Positives = 166/320 (51%), Gaps = 34/320 (10%)

Query: 41  LFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG--------SGVYGLNEFS 92
           LF  +  +H K YA+  E  +RL  F+ N   +        G        S    LN F+
Sbjct: 41  LFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFA 100

Query: 93  DLSTAEFQAKYLG-FKLKPSYADRSVPAMIPNI---TLPRAFDWREYDAVTGVKDQTMCG 148
           DL+ AEF+A  LG   +  + A  S      ++    +P A DWR+  AVT VKDQ  CG
Sbjct: 101 DLTHAEFRAARLGRLAVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGSCG 160

Query: 149 SSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQE-DDGCEGGSISNAFDTIMSKLGGG 207
           + W+FS TG IEG+   KT  L+SLSEQELIDCD+  + GC GG +  A+  ++    GG
Sbjct: 161 ACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKN--GG 218

Query: 208 LEEEKTYPYRGDDKACRLNK-KATQVKINGYVSVSRDETDMAKYLVENGPMAVAI--NAY 264
           ++ E  YPYR  D  C  NK K   V I+GY  V  ++ D     V   P++V I  +A 
Sbjct: 219 IDTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSAR 278

Query: 265 ALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWGE 324
           A Q Y  G+      F      +L H+VLIVGYG +  K       YWI+KNSWGE WG 
Sbjct: 279 AFQLYSQGI------FDGPCPTSLDHAVLIVGYGSEGGK------DYWIVKNSWGERWGM 326

Query: 325 KGYFRLYRGDGS----CGIN 340
           KGY  ++R  GS    CGIN
Sbjct: 327 KGYMHMHRNTGSSSGICGIN 346


>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
          Length = 344

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 119/332 (35%), Positives = 167/332 (50%), Gaps = 26/332 (7%)

Query: 22  FMVVGDEKL--HHLHHVKHTALFNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDT 79
           F+ VG  ++    LH          ++ ++ K Y    E   R  IF  N+  I+     
Sbjct: 16  FLAVGISQVMPRKLHQTALRERHENWMAEYGKIYKDAAEKEKRFQIFKDNVEFIESFNAA 75

Query: 80  EHGSGVYGLNEFSDLSTAEFQAKYLGFKLKPSYADRSVPA---MIPNIT-LPRAFDWREY 135
            +     G+N  +DL+  EF+    G K    ++  +         N+T +P A DWR  
Sbjct: 76  GNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENVTDIPEAIDWRVK 135

Query: 136 DAVTGVKDQ-TMCGSSWAFSTTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSIS 194
            AVT +KDQ   CGS WAFST    EG+Y   T  L+SLSEQEL+DCD  D GC+GG + 
Sbjct: 136 GAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSVDHGCDGGLME 195

Query: 195 NAFDTIMSKLGGGLEEEKTYPYRGDDKACRLNKKAT-QVKINGYVSVSRDETDMAKYLVE 253
           + F+ I+    GG+  E  YPY   D  C  +K+A+   +I GY +V  +  +  +  V 
Sbjct: 196 DGFEFIIKN--GGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEALQQAVA 253

Query: 254 NGPMAVAINA--YALQFYVTGVSHPIQFFCDGGNENLSHSVLIVGYGVDRTKFTHKAVPY 311
           N P++V+I+A     QFY +GV      F       L H V +VGYG      TH+   Y
Sbjct: 254 NQPVSVSIDAGGSGFQFYSSGV------FTGQCGTQLDHGVTVVGYGTTDDG-THE---Y 303

Query: 312 WIIKNSWGEGWGEKGYFRLYRG----DGSCGI 339
           WI+KNSWG  WGE+GY R+ RG    +G CGI
Sbjct: 304 WIVKNSWGTQWGEEGYIRMQRGIDALEGLCGI 335


>gi|324513891|gb|ADY45690.1| Cysteine proteinase [Ascaris suum]
          Length = 398

 Score =  187 bits (474), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 114/312 (36%), Positives = 170/312 (54%), Gaps = 23/312 (7%)

Query: 42  FNYFLEQHNKTYATLVEYYSRLHIFSGNLRKIQLLQDTEHG-SGVYGLNEFSDLSTAEFQ 100
           F  F+ +++K Y    ++  R  I+  N+  I  L +  +G S +YG N+F+D S  EF+
Sbjct: 91  FMEFMHKYDKVYVDSAQFVKRFRIYVNNMANIDALNERNYGRSIIYGENQFADWSEDEFR 150

Query: 101 AK------YLGFKLKPSYADRSVPAMIPNI-TLPRAFDWREYDAVTGVKDQTMCGSSWAF 153
                   Y  F  +  + D+    M+P    +P  FDWR Y+ VT VK Q  CGS WAF
Sbjct: 151 QILLPRGFYKNFHKRAIFIDQPDEIMMPRKEIIPEHFDWRPYNVVTPVKAQLNCGSCWAF 210

Query: 154 STTGNIEGVYAAKTKKLVSLSEQELIDCDQEDDGCEGGSISNAFDTIMSKLGGGLEEEKT 213
           +TTG +E  YA  T +L SLSEQ+L+DC+ E++ C+GG I  A   +  +   GL  E  
Sbjct: 211 ATTGTVESAYAIGTGELKSLSEQQLLDCNVENNACDGGDIDKALRYVYEE---GLMTEYD 267

Query: 214 YPYRG-DDKACRLNKKATQVKINGYVSVSRDETDMAKYLVENGPMAVAINAYA-LQFYVT 271
           YPY     + C L  + T++K    V + +DE  +  +L+ NGP+ V +N  A ++ Y  
Sbjct: 268 YPYVAHRQETCYLRGETTRIK--AAVFLHQDEASIIDWLIHNGPVNVGVNVTADMKAYKG 325

Query: 272 GVSHPIQFFCDGGNENL-SHSVLIVGYGVDRTKFTHKAVPYWIIKNSWGEGWG-EKGYFR 329
           GV  P ++ C+  N+ + +H++ IVGYG     +      YWI+KNSWG+ +G E GY  
Sbjct: 326 GVYTPNKWECE--NKIIGTHAMNIVGYGT----WNKTNEKYWIVKNSWGQSYGVENGYVY 379

Query: 330 LYRGDGSCGIND 341
             RG  SCGI D
Sbjct: 380 FARGINSCGIED 391


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.319    0.136    0.414 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,683,889,489
Number of Sequences: 23463169
Number of extensions: 244309735
Number of successful extensions: 527360
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6328
Number of HSP's successfully gapped in prelim test: 1064
Number of HSP's that attempted gapping in prelim test: 499062
Number of HSP's gapped (non-prelim): 8631
length of query: 348
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 205
effective length of database: 9,003,962,200
effective search space: 1845812251000
effective search space used: 1845812251000
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 77 (34.3 bits)