BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 023507
         (281 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
          Length = 345

 Score =  221 bits (562), Expect = 6e-57,   Method: Compositional matrix adjust.
 Identities = 120/330 (36%), Positives = 168/330 (50%), Gaps = 71/330 (21%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L V  AS   +S       +++  E+WMA++GR YKD  EK +R +IFK N+ +
Sbjct: 8   VFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N     +Y LG NQF+D+TN+EF A YTG  +P    R    S   + ++ ++ VP
Sbjct: 68  IETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVS---FDDVDISSVP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD------- 190
            S+DWRD GAVT +KNQ  CG CWAFA++A VE I KI+ GNL+ LSEQQ+LD       
Sbjct: 125 QSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAVSYGC 184

Query: 191 --------------------------------CSTNGNNGCLGGSR--------EKAFAY 210
                                           C TNG       +R        E+   Y
Sbjct: 185 KGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNSAYITRYTYVQRNNERNMMY 244

Query: 211 IIQNQ-----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
            + NQ                 G+F G CGT+L+HA+ I+G+G    G  +W+++NSWG 
Sbjct: 245 AVSNQPIAAALDASGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNSWGA 304

Query: 254 TWGDAGYMKIVRDE----GLCGIGTRSSYP 279
            WG+ GY+++ RD     GLCGI     YP
Sbjct: 305 GWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334


>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
          Length = 351

 Score =  220 bits (561), Expect = 7e-57,   Method: Compositional matrix adjust.
 Identities = 117/330 (35%), Positives = 173/330 (52%), Gaps = 71/330 (21%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           +F+ + L    AS   +SR      +++  E+WMA++GR YKD+ EK  R +IFK N+++
Sbjct: 8   VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           IE  N     +Y LG NQF+D+T  EF A YTG  +P    R    S   + +++++ VP
Sbjct: 68  IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVS---FDDVNISAVP 124

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN--- 194
            S+DWRD GAV  +KNQ  CG CW+FAA+A VEGI KI++G L+ LSEQ++LDC+ +   
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSYGC 184

Query: 195 -------------GNNG---------------CLGGS----------------REKAFAY 210
                         NNG               C   S                 E++  Y
Sbjct: 185 KGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAYITGYSYVRRNDERSMMY 244

Query: 211 IIQNQ-----------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
            + NQ                 G+F+G CGT L+HA+TI+G+G    G  YW+++NSWG+
Sbjct: 245 AVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGS 304

Query: 254 TWGDAGYMKIVR----DEGLCGIGTRSSYP 279
           +WG+ GY+++ R      G+CGI     +P
Sbjct: 305 SWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
           SV=1
          Length = 355

 Score =  205 bits (521), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 116/313 (37%), Positives = 169/313 (53%), Gaps = 74/313 (23%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T+   ++E+ E WM++H ++YK   EK  R ++F+ENL +I++ N E N +Y LG N+F+
Sbjct: 42  TNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLT++EF+  Y G   P  S +   S+ F+Y+++  TD+P S+DWR KGAV P+K+Q +C
Sbjct: 101 DLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDI--TDLPKSVDWRKKGAVAPVKDQGQC 158

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ VAAVEGI +I +GNL  LSEQ+L+DC T  N+GC GG  + AF YII   G+
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGL 218

Query: 218 FN----------GVCGTQLDHA--VTIVGF------------------------------ 235
                       G+C  Q +    VTI G+                              
Sbjct: 219 HKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRD 278

Query: 236 -------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD---- 266
                        GT  D            G++Y ++KNSWG  WG+ G++++ R+    
Sbjct: 279 FQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKP 338

Query: 267 EGLCGIGTRSSYP 279
           EGLCGI   +SYP
Sbjct: 339 EGLCGINKMASYP 351


>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
          Length = 345

 Score =  196 bits (497), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 126/336 (37%), Positives = 174/336 (51%), Gaps = 82/336 (24%)

Query: 18  MFIIITLLVSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENL 75
           +F+ + L     S V  S++  T  + ++++ E WM +H + YK+  EK  R +IFK+NL
Sbjct: 17  LFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNL 76

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD 135
           +YI++ NK+ N +Y LG N F+D++NDEF+  YTG    + ++ +T  S  +  N    +
Sbjct: 77  KYIDETNKK-NNSYWLGLNVFADMSNDEFKEKYTG--SIAGNYTTTELSYEEVLNDGDVN 133

Query: 136 VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG 195
           +P  +DWR KGAVTP+KNQ  CG CWAF+AV  +EGI KIR+GNL + SEQ+LLDC    
Sbjct: 134 IPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR- 192

Query: 196 NNGCLGG------------------------------SREK------------------- 206
           + GC GG                              SREK                   
Sbjct: 193 SYGCNGGYPWSALQLVAQYGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEG 252

Query: 207 AFAYIIQNQ------------------GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIK 248
           A  Y I NQ                  GIF G CG ++DHAV  VG+     G NY LIK
Sbjct: 253 ALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY-----GPNYILIK 307

Query: 249 NSWGNTWGDAGYMKIVR----DEGLCGIGTRSSYPL 280
           NSWG  WG+ GY++I R      G+CG+ T S YP+
Sbjct: 308 NSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 343


>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
           GN=GCP1 PE=2 SV=2
          Length = 376

 Score =  180 bits (457), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 111/321 (34%), Positives = 165/321 (51%), Gaps = 81/321 (25%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTN 94
           ++ V  I+ +W A+HG++  +      +++ R  IFK+NL +I+  N++  N TYKLG  
Sbjct: 42  DEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLT 101

Query: 95  QFSDLTNDEFRALYTGYKMPSPSHRSTTSSTF--KYQN-LSMTDVPTSLDWRDKGAVTPI 151
           +F+DLTNDE+R LY G +   P+ R   +     KY   ++  +VP ++DWR KGAV PI
Sbjct: 102 KFTDLTNDEYRKLYLGART-EPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPI 160

Query: 152 KNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYI 211
           K+Q  CG CWAF+  AAVEGI KI +G LI LSEQ+L+DC  + N GC GG  + AF +I
Sbjct: 161 KDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFI 220

Query: 212 IQNQGI----------FNGVCGTQLDHA--VTIVGF------------------------ 235
           ++N G+          F G C + L ++  V+I G+                        
Sbjct: 221 MKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAI 280

Query: 236 -------------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIV 264
                              GT  D            G +YW+++NSWG  WG+ GY+++ 
Sbjct: 281 EAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRME 340

Query: 265 RD-----EGLCGIGTRSSYPL 280
           R+      G CGI   +SYP+
Sbjct: 341 RNLAASKSGKCGIAVEASYPV 361


>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
          Length = 328

 Score =  174 bits (442), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 109/316 (34%), Positives = 161/316 (50%), Gaps = 80/316 (25%)

Query: 44  VEIHEKWMAQHGRSYKDEL----EKEMRLKIFKENLEYIEKANKEG-NRTYKLGTNQFSD 98
           + I+ +W  +HG+S  +      +++ R  IFK+NL +I+  N+   N TYKLG   F++
Sbjct: 1   MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSST--FKYQN-LSMTDVPTSLDWRDKGAVTPIKNQK 155
           LTNDE+R+LY G +   P  R T +     KY   +++ +VP ++DWR KGAV  IK+Q 
Sbjct: 61  LTNDEYRSLYLGART-EPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQG 119

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQ 215
            CG CWAF+  AAVEGI KI +G L+ LSEQ+L+DC  + N GC GG  + AF +I++N 
Sbjct: 120 TCGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNG 179

Query: 216 GI----------FNGVCGTQLDHA--VTIVGF---------------------------- 235
           G+           NG C + L ++  VTI G+                            
Sbjct: 180 GLNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGG 239

Query: 236 ---------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD-- 266
                          GT  D            G +YW+++NSWG  WG+ GY+++ R+  
Sbjct: 240 RAFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVA 299

Query: 267 --EGLCGIGTRSSYPL 280
              G CGI   +SYP+
Sbjct: 300 SKSGKCGIAIEASYPV 315


>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
           PE=1 SV=2
          Length = 466

 Score =  174 bits (441), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 106/316 (33%), Positives = 167/316 (52%), Gaps = 80/316 (25%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDEL--EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQ 95
           E      ++ W+A++G    + L  E E R  +F +NL++++  N   +    ++LG N+
Sbjct: 45  EAEARAAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNR 104

Query: 96  FSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQK 155
           F+DLTN+EFRA + G K+   + RS  +   +Y++  + ++P S+DWR+KGAV P+KNQ 
Sbjct: 105 FADLTNEEFRATFLGAKV---AERSRAAGE-RYRHDGVEELPESVDWREKGAVAPVKNQG 160

Query: 156 ECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQN 214
           +CG CWAF+AV+ VE I ++ +G +I LSEQ+L++CSTNG N+GC GG  + AF +II+N
Sbjct: 161 QCGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKN 220

Query: 215 QGI----------FNGVCGTQLDHA--VTIVGF--------------------------- 235
            GI           +G C    ++A  V+I GF                           
Sbjct: 221 GGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAG 280

Query: 236 ----------------GTTED------------GANYWLIKNSWGNTWGDAGYMKIVRD- 266
                           GT+ D            G +YW+++NSWG  WG++GY+++ R+ 
Sbjct: 281 GREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNI 340

Query: 267 ---EGLCGIGTRSSYP 279
               G CGI   +SYP
Sbjct: 341 NVTTGKCGIAMMASYP 356


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
           SV=1
          Length = 462

 Score =  173 bits (438), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 92/216 (42%), Positives = 134/216 (62%), Gaps = 15/216 (6%)

Query: 40  EQSVVEIHEKWMAQHGR--SYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           E  V+ I+E W+ +HG+  S    +EK+ R +IFK+NL ++++ N E N +Y+LG  +F+
Sbjct: 43  EAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFA 101

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DLTNDE+R+ Y G KM     R T+    +Y+     ++P S+DWR KGAV  +K+Q  C
Sbjct: 102 DLTNDEYRSKYLGAKMEKKGERRTS---LRYEARVGDELPESIDWRKKGAVAEVKDQGGC 158

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           G CWAF+ + AVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF +II+N GI
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218

Query: 218 -------FNGVCGT--QLDHAVTIVGFGTTEDGANY 244
                  + GV GT  Q+     +V   + ED   Y
Sbjct: 219 DTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTY 254



 Score = 89.4 bits (220), Expect = 3e-17,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 55/76 (72%), Gaps = 5/76 (6%)

Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
           A+ + + GIF+G CGTQLDH V  VG+GT E+G +YW+++NSWG +WG++GY+++ R+  
Sbjct: 278 AFQLYDSGIFDGSCGTQLDHGVVAVGYGT-ENGKDYWIVRNSWGKSWGESGYLRMARNIA 336

Query: 267 --EGLCGIGTRSSYPL 280
              G CGI    SYP+
Sbjct: 337 SSSGKCGIAIEPSYPI 352


>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
           GN=At4g11320 PE=2 SV=1
          Length = 371

 Score =  171 bits (434), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 114/352 (32%), Positives = 169/352 (48%), Gaps = 91/352 (25%)

Query: 18  MFIIITLLVSCAS----QVVSSRSTHE--------QSVVE-----IHEKWMAQHGRSYKD 60
           +F++  ++ SCA+     VVSS   H         Q + +     + E WM +HG+ Y  
Sbjct: 10  IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDS 69

Query: 61  ELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
             EKE RL IF++NL +I   N E N +Y+LG N+F+DL+  E+  +  G     P +  
Sbjct: 70  VAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV 128

Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNL 180
             +S+ +Y+      +P S+DWR++GAVT +K+Q  C  CWAF+ V AVEG+ KI +G L
Sbjct: 129 FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGEL 188

Query: 181 IQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQL---D 227
           + LSEQ L++C+   NNGC GG  E A+ +I+ N G+           NGVC  +L   +
Sbjct: 189 VTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDN 247

Query: 228 HAVTIVGF----------------------------------------GTTEDGANYWLI 247
             V I G+                                        GT     N+ ++
Sbjct: 248 KNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVV 307

Query: 248 KNSWG---------------NTWGDAGYMKIVRD----EGLCGIGTRSSYPL 280
              +G               +TWG+AGYMK+ R+     GLCGI  R+SYPL
Sbjct: 308 VVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359


>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
           GN=At3g43960 PE=2 SV=1
          Length = 376

 Score =  169 bits (429), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 89/210 (42%), Positives = 128/210 (60%), Gaps = 5/210 (2%)

Query: 10  SFKINTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLK 69
           SF+        ++ + +S      +    +E  V+ ++E+W+ ++G++Y    EKE R K
Sbjct: 4   SFRTLALLTLSVLLISISLGVVTATESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFK 63

Query: 70  IFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQ 129
           IFK+NL+ IE+ N + NR+Y+ G N+FSDLT DEF+A Y G KM     +S +    +YQ
Sbjct: 64  IFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQASYLGGKM---EKKSLSDVAERYQ 120

Query: 130 NLSMTDVPTSLDWRDKGAVTP-IKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQL 188
                 +P  +DWR++GAV P +K Q ECG CWAFAA  AVEGI +I +G L+ LSEQ+L
Sbjct: 121 YKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFAATGAVEGINQITTGELVSLSEQEL 180

Query: 189 LDCST-NGNNGCLGGSREKAFAYIIQNQGI 217
           +DC   N N GC GG    AF +I +N GI
Sbjct: 181 IDCDRGNDNFGCAGGGAVWAFEFIKENGGI 210



 Score = 68.2 bits (165), Expect = 7e-11,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 42/70 (60%), Gaps = 5/70 (7%)

Query: 216 GIFNGVCGTQL-DHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
           G++ G C     DH V IVG+GT+ D  +YWLI+NSWG  WG+ GY+++ R+     G C
Sbjct: 277 GVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPTGKC 336

Query: 271 GIGTRSSYPL 280
            +     YP+
Sbjct: 337 AVAVAPVYPI 346


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
           SV=2
          Length = 356

 Score =  168 bits (426), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 87/206 (42%), Positives = 132/206 (64%), Gaps = 15/206 (7%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           +H++ ++E+ E W++   ++Y+   EK +R ++FK+NL++I++ NK+G ++Y LG N+F+
Sbjct: 43  SHDK-LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFA 100

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTS-STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
           DL+++EF+ +Y G K          S + F Y+++    VP S+DWR KGAV  +KNQ  
Sbjct: 101 DLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAVAEVKNQGS 158

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQG 216
           CG CWAF+ VAAVEGI KI +GNL  LSEQ+L+DC T  NNGC GG  + AF YI++N G
Sbjct: 159 CGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGG 218

Query: 217 IFN----------GVCGTQLDHAVTI 232
           +            G C  Q D + T+
Sbjct: 219 LRKEEDYPYSMEEGTCEMQKDESETV 244



 Score = 76.3 bits (186), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 32/68 (47%), Positives = 49/68 (72%), Gaps = 5/68 (7%)

Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLCG 271
           G+F+G CG  LDH V  VG+G+++ G++Y ++KNSWG  WG+ GY+++ R+    EGLCG
Sbjct: 286 GVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPEGLCG 344

Query: 272 IGTRSSYP 279
           I   +S+P
Sbjct: 345 INKMASFP 352


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score =  168 bits (426), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 106/317 (33%), Positives = 154/317 (48%), Gaps = 83/317 (26%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           + E+W     +H ++Y+DE E+  RLKIF EN   I K N+   EG  ++KL  N+++DL
Sbjct: 55  VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFK---YQNLSMTDVPTSLDWRDKGAVTPIKNQKE 156
            + EFR L  G+             +FK   + + +   +P S+DWR KGAVT +K+Q  
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174

Query: 157 CGCCWAFAAVAAVEGITKIRSGNLIQLSEQ------------------------------ 186
           CG CWAF++  A+EG    +SG L+ LSEQ                              
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234

Query: 187 ----------QLLDCSTNGNNGCLGGSREKAFAYIIQ----------------------- 213
                     + +D S + N G +G + ++ F  I Q                       
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGAT-DRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 293

Query: 214 -------NQGIFN-GVCGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIV 264
                  ++G++N   C  Q LDH V +VGFGT E G +YWL+KNSWG TWGD G++K++
Sbjct: 294 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKML 353

Query: 265 RD-EGLCGIGTRSSYPL 280
           R+ E  CGI + SSYPL
Sbjct: 354 RNKENQCGIASASSYPL 370


>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
           GN=CEP2 PE=2 SV=1
          Length = 361

 Score =  164 bits (416), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 89/232 (38%), Positives = 133/232 (57%), Gaps = 23/232 (9%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHG--RSYKDELEKEMRLKIFKENL 75
           +F ++ L  +C           E+ +  ++++W + H   RS     E+E R  +F+ N+
Sbjct: 9   LFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPRSLN---EREKRFNVFRHNV 65

Query: 76  EYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQN 130
            ++   NK+ NR+YKL  N+F+DLT +EF+  YTG     ++M     R +    + ++N
Sbjct: 66  MHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHEN 124

Query: 131 LSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLD 190
           LS   +P+S+DWR KGAVT IKNQ +CG CWAF+ VAAVEGI KI++  L+ LSEQ+L+D
Sbjct: 125 LSK--LPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182

Query: 191 CSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQLDHAVTI 232
           C T  N GC GG  E AF +I +N GI           +G C    D+ V +
Sbjct: 183 CDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLV 234



 Score = 82.8 bits (203), Expect = 3e-15,   Method: Compositional matrix adjust.
 Identities = 35/70 (50%), Positives = 49/70 (70%), Gaps = 5/70 (7%)

Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
           +G+F G CGT+L+H V  VG+G+ E G  YW+++NSWG  WG+ GY+KI R+    EG C
Sbjct: 275 EGVFTGSCGTELNHGVAAVGYGS-ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRC 333

Query: 271 GIGTRSSYPL 280
           GI   +SYP+
Sbjct: 334 GIAMEASYPI 343


>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
          Length = 373

 Score =  164 bits (416), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 77/180 (42%), Positives = 114/180 (63%), Gaps = 3/180 (1%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++ +++E+W + H R  +   EK  R   FK N  +I   NK G+  Y+L  N+F D+
Sbjct: 39  EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97

Query: 100 TNDEFRALYTG-YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
              EFRA + G  +  +PS +  +   F Y  L+++D+P S+DWR KGAVT +K+Q +CG
Sbjct: 98  DQAEFRATFVGDLRRDTPS-KPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCG 156

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF 218
            CWAF+ V +VEGI  IR+G+L+ LSEQ+L+DC T  N+GC GG  + AF YI  N G+ 
Sbjct: 157 SCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216



 Score = 95.5 bits (236), Expect = 4e-19,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 54/76 (71%), Gaps = 4/76 (5%)

Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 267
           A++  ++G+F G CGT+LDH V +VG+G  EDG  YW +KNSWG +WG+ GY+++ +D  
Sbjct: 278 AFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337

Query: 268 ---GLCGIGTRSSYPL 280
              GLCGI   +SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353


>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
          Length = 371

 Score =  164 bits (415), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 75/179 (41%), Positives = 110/179 (61%), Gaps = 1/179 (0%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+++ +++E+W + H R  +   EK  R   FK N  +I   NK G+  Y+L  N+F D+
Sbjct: 39  EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97

Query: 100 TNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGC 159
              EFRA + G        +  +   F Y  L+++D+P S+DWR KGAVT +K+Q +CG 
Sbjct: 98  DQAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGS 157

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIF 218
           CWAF+ V +VEGI  IR+G+L+ LSEQ+L+DC T  N+GC GG  + AF YI  N G+ 
Sbjct: 158 CWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLI 216



 Score = 94.7 bits (234), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 54/76 (71%), Gaps = 4/76 (5%)

Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE- 267
           A++  ++G+F G CGT+LDH V +VG+G  EDG  YW +KNSWG +WG+ GY+++ +D  
Sbjct: 278 AFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337

Query: 268 ---GLCGIGTRSSYPL 280
              GLCGI   +SYP+
Sbjct: 338 ASGGLCGIAMEASYPV 353


>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
          Length = 380

 Score =  164 bits (414), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 86/206 (41%), Positives = 127/206 (61%), Gaps = 5/206 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++ + +F    L++S A    +        V  ++E W+ ++G+SY    E E R +IFK
Sbjct: 8   VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y G+   S S+++  S+  +Y+   
Sbjct: 68  ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT--SGSNKTKVSN--RYEPRV 123

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P+ +DWR  GAV  IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC 
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
            T    GC GG     F +II N GI
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGI 209



 Score = 90.1 bits (222), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 41/68 (60%), Positives = 51/68 (75%), Gaps = 4/68 (5%)

Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
           GIF G CGT +DHAVTIVG+GT E G +YW++KNSW  TWG+ GYM+I+R+    G CGI
Sbjct: 276 GIFTGPCGTAIDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGI 334

Query: 273 GTRSSYPL 280
            T  SYP+
Sbjct: 335 ATMPSYPV 342


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
           PE=1 SV=2
          Length = 458

 Score =  162 bits (411), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 82/189 (43%), Positives = 117/189 (61%), Gaps = 6/189 (3%)

Query: 32  VVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKAN---KEGNRT 88
           +VS     E+    ++ +W A+HG+SY    E+E R   F++NL YI++ N     G  +
Sbjct: 25  IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           ++LG N+F+DLTN+E+R  Y G +      R  +       N ++   P S+DWR KGAV
Sbjct: 85  FRLGLNRFADLTNEEYRDTYLGLRNKPRRERKVSDRYLAADNEAL---PESVDWRTKGAV 141

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAF 208
             IK+Q  CG CWAF+A+AAVEGI +I +G+LI LSEQ+L+DC T+ N GC GG  + AF
Sbjct: 142 AEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAF 201

Query: 209 AYIIQNQGI 217
            +II N GI
Sbjct: 202 DFIINNGGI 210



 Score = 85.1 bits (209), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 37/76 (48%), Positives = 53/76 (69%), Gaps = 5/76 (6%)

Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
           A+ + + GIF G CGT LDH V  VG+GT E+G +YW+++NSWG +WG++GY+++ R+  
Sbjct: 270 AFQLYSSGIFTGKCGTALDHGVAAVGYGT-ENGKDYWIVRNSWGKSWGESGYVRMERNIK 328

Query: 267 --EGLCGIGTRSSYPL 280
              G CGI    SYPL
Sbjct: 329 ASSGKCGIAVEPSYPL 344


>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
          Length = 380

 Score =  161 bits (408), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 85/206 (41%), Positives = 126/206 (61%), Gaps = 5/206 (2%)

Query: 13  INTTPMFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           ++ + +F    L++S A    +        V  ++E W+ ++G+SY    E E R +IFK
Sbjct: 8   VSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFK 67

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           E L +I++ N + NR+YK+G NQF+DLT++EFR+ Y   +  S S+++  S+  +Y+   
Sbjct: 68  ETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYL--RFTSGSNKTKVSN--RYEPRV 123

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              +P+ +DWR  GAV  IK+Q ECG CWAF+A+A VEGI KI +G LI LSEQ+L+DC 
Sbjct: 124 GQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCG 183

Query: 193 -TNGNNGCLGGSREKAFAYIIQNQGI 217
            T    GC GG     F +II N GI
Sbjct: 184 RTQNTRGCNGGYITDGFQFIINNGGI 209



 Score = 90.1 bits (222), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 41/68 (60%), Positives = 51/68 (75%), Gaps = 4/68 (5%)

Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD---EGLCGI 272
           GIF G CGT +DHAVTIVG+GT E G +YW++KNSW  TWG+ GYM+I+R+    G CGI
Sbjct: 276 GIFTGPCGTAVDHAVTIVGYGT-EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGI 334

Query: 273 GTRSSYPL 280
            T  SYP+
Sbjct: 335 ATMPSYPV 342


>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
           GN=CEP3 PE=2 SV=1
          Length = 364

 Score =  161 bits (408), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 83/183 (45%), Positives = 119/183 (65%), Gaps = 11/183 (6%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E++V +++E+W   H  S +   E   R  +F+ N+ ++ + NK+ N+ YKL  N+F+D+
Sbjct: 31  EENVWKLYERWRGHHSVS-RASHEAIKRFNVFRHNVLHVHRTNKK-NKPYKLKINRFADI 88

Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           T+ EFR+ Y G     ++M     R   S  F Y+N+  T VP+S+DWR+KGAVT +KNQ
Sbjct: 89  THHEFRSSYAGSNVKHHRMLRGPKRG--SGGFMYENV--TRVPSSVDWREKGAVTEVKNQ 144

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
           ++CG CWAF+ VAAVEGI KIR+  L+ LSEQ+L+DC T  N GC GG  E AF +I  N
Sbjct: 145 QDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNN 204

Query: 215 QGI 217
            GI
Sbjct: 205 GGI 207



 Score = 88.6 bits (218), Expect = 4e-17,   Method: Compositional matrix adjust.
 Identities = 36/74 (48%), Positives = 53/74 (71%), Gaps = 4/74 (5%)

Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR---- 265
           + + ++G+F G CGTQL+H V IVG+G T++G  YW+++NSWG  WG+ GY++I R    
Sbjct: 269 FQLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISE 328

Query: 266 DEGLCGIGTRSSYP 279
           +EG CGI   +SYP
Sbjct: 329 NEGRCGIAMEASYP 342


>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
           GN=CEP1 PE=2 SV=1
          Length = 361

 Score =  158 bits (399), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 85/210 (40%), Positives = 127/210 (60%), Gaps = 17/210 (8%)

Query: 19  FIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFK 72
           FI++ L +    +       H      E S+ E++E+W + H  +   E EK  R  +FK
Sbjct: 4   FIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFNVFK 62

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTG-----YKMPSPSHRSTTSSTFK 127
            N+++I + NK+ +++YKL  N+F D+T++EFR  Y G     ++M     ++T S  F 
Sbjct: 63  HNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKS--FM 119

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           Y N++   +PTS+DWR  GAVTP+KNQ +CG CWAF+ V AVEGI +IR+  L  LSEQ+
Sbjct: 120 YANVNT--LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           L+DC TN N GC GG  + AF +I +  G+
Sbjct: 178 LVDCDTNQNQGCNGGLMDLAFEFIKEKGGL 207



 Score = 94.7 bits (234), Expect = 7e-19,   Method: Compositional matrix adjust.
 Identities = 54/151 (35%), Positives = 82/151 (54%), Gaps = 23/151 (15%)

Query: 143 RDKGAVT-----PIKNQKE-CGCCWAFAAVAAVEG---ITKIRSGNLIQLSEQQLLDCST 193
           ++KG +T     P K   E C      A V +++G   + K    +L++    Q +  + 
Sbjct: 202 KEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAI 261

Query: 194 NGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGN 253
           +      GGS      +   ++G+F G CGT+L+H V +VG+GTT DG  YW++KNSWG 
Sbjct: 262 DA-----GGS-----DFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGE 311

Query: 254 TWGDAGYMKIVR----DEGLCGIGTRSSYPL 280
            WG+ GY+++ R     EGLCGI   +SYPL
Sbjct: 312 EWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342


>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
          Length = 360

 Score =  157 bits (396), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 84/209 (40%), Positives = 127/209 (60%), Gaps = 15/209 (7%)

Query: 17  PMFIIITLL----VSCASQVVSSRS--THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKI 70
           P FI + L+    +S A  +  +      E S+  ++EKW   H  + +D  EK  R  +
Sbjct: 4   PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVA-RDLDEKNRRFNV 62

Query: 71  FKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS-----TTSST 125
           FKEN+++I + N++ +  YKL  N+F D+TN EFR+ Y G K+    HRS       + +
Sbjct: 63  FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQH--HRSQRGIQKNTGS 120

Query: 126 FKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSE 185
           F Y+N+       S+DWR KGAVT +K+Q +CG CWAF+ +A+VEGI +I++G L+ LSE
Sbjct: 121 FMYENVGSLPA-ASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSE 179

Query: 186 QQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
           Q+L+DC T+ N GC GG  + AF +I +N
Sbjct: 180 QELVDCDTSYNEGCNGGLMDYAFEFIQKN 208



 Score = 91.7 bits (226), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 52/77 (67%), Gaps = 4/77 (5%)

Query: 208 FAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-- 265
           + +   ++G+F G CGT+LDH V IVG+G T DG  YW++KNSWG  WG++GY+++ R  
Sbjct: 269 YGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGI 328

Query: 266 --DEGLCGIGTRSSYPL 280
               G CGI   +SYP+
Sbjct: 329 SDKRGKCGIAMEASYPI 345


>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
          Length = 333

 Score =  157 bits (396), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 108/339 (31%), Positives = 154/339 (45%), Gaps = 85/339 (25%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHE----KWMAQHGRSYKDELEKEMRLKIFKE 73
           M+  + LL + A  ++S+ +T E +V  I +     WM QH ++Y    E   RL++F  
Sbjct: 1   MWTALPLLCAGA-WLLSAGATAELTVNAIEKFHFTSWMKQHQKTYSSR-EYSHRLQVFAN 58

Query: 74  NLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           N   I+  N+  N T+K+G NQFSD++   F  +   Y    P + S T S +       
Sbjct: 59  NWRKIQAHNQR-NHTFKMGLNQFSDMS---FAEIKHKYLWSEPQNCSATKSNYL---RGT 111

Query: 134 TDVPTSLDWRDKG-AVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
              P+S+DWR KG  V+P+KNQ  CG CW F+   A+E    I SG ++ L+EQQL+DC+
Sbjct: 112 GPYPSSMDWRKKGNVVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQQLVDCA 171

Query: 193 TNGNN-GCLGGSREKAFAYIIQNQGIF----------NGVCGTQLDHAVTIV-------- 233
            N NN GC GG   +AF YI+ N+GI           NG C    + AV  V        
Sbjct: 172 QNFNNHGCQGGLPSQAFEYILYNKGIMGEDSYPYIGKNGQCKFNPEKAVAFVKNVVNITL 231

Query: 234 ------------------GFGTTED----------------------------------G 241
                              F  TED                                  G
Sbjct: 232 NDEAAMVEAVALYNPVSFAFEVTEDFMMYKSGVYSSNSCHKTPDKVNHAVLAVGYGEQNG 291

Query: 242 ANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSSYPL 280
             YW++KNSWG+ WG+ GY  I R + +CG+   +SYP+
Sbjct: 292 LLYWIVKNSWGSNWGNNGYFLIERGKNMCGLAACASYPI 330


>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
          Length = 348

 Score =  156 bits (395), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 78/177 (44%), Positives = 114/177 (64%), Gaps = 5/177 (2%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++   WM +H ++YK+  EK  R +IFK+NL+YI++ NK  N  Y LG N+FS
Sbjct: 39  TSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMIN-GYWLGLNEFS 97

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+NDEF+  Y G     P   +      ++ N  + D+P S+DWR KGAVTP+K+Q  C
Sbjct: 98  DLSNDEFKEKYVG---SLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYC 154

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             CWAF+ VA VEGI KI++GNL++LSEQ+L+DC    + GC  G +  +  Y+ QN
Sbjct: 155 ESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQN 210



 Score = 60.8 bits (146), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 34/69 (49%), Positives = 44/69 (63%), Gaps = 5/69 (7%)

Query: 216 GIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR----DEGLCG 271
           GIF G CGT++DHAVT VG+G +       LIKNSWG  WG+ GY++I R      G+CG
Sbjct: 279 GIFEGSCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGNSPGVCG 337

Query: 272 IGTRSSYPL 280
           +   S YP+
Sbjct: 338 VYRSSYYPI 346


>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
          Length = 348

 Score =  156 bits (395), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 76/177 (42%), Positives = 115/177 (64%), Gaps = 5/177 (2%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++   WM  H + Y++  EK  R +IFK+NL YI++ NK+ N +Y LG N+F+
Sbjct: 39  TSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFA 97

Query: 98  DLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
           DL+NDEF   Y G  + +   +S      ++ N    ++P ++DWR KGAVTP+++Q  C
Sbjct: 98  DLSNDEFNEKYVGSLIDATIEQSYDE---EFINEDTVNLPENVDWRKKGAVTPVRHQGSC 154

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
           G CWAF+AVA VEGI KIR+G L++LSEQ+L+DC    ++GC GG    A  Y+ +N
Sbjct: 155 GSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKN 210



 Score = 61.6 bits (148), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 35/78 (44%), Positives = 46/78 (58%), Gaps = 5/78 (6%)

Query: 206 KAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
           K   + +   GIF G CGT++DHAVT VG+G +       LIKNSWG  WG+ GY++I R
Sbjct: 269 KGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKR 327

Query: 266 ----DEGLCGIGTRSSYP 279
                 G+CG+   S YP
Sbjct: 328 APGNSPGVCGLYKSSYYP 345


>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
          Length = 352

 Score =  155 bits (393), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 77/180 (42%), Positives = 115/180 (63%), Gaps = 9/180 (5%)

Query: 38  THEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFS 97
           T  + ++++ + WM +H + Y+   EK  R +IF++NL YI++ NK+ N +Y LG N F+
Sbjct: 39  TSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFA 97

Query: 98  DLTNDEFRALYTGY---KMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           DL+NDEF+  Y G+         H      T+K+    +T+ P S+DWR KGAVTP+KNQ
Sbjct: 98  DLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKH----VTNYPQSIDWRAKGAVTPVKNQ 153

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
             CG CWAF+ +A VEGI KI +GNL++LSEQ+L+DC  + + GC GG +  +  Y+  N
Sbjct: 154 GACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANN 212



 Score = 83.2 bits (204), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 38/73 (52%), Positives = 50/73 (68%), Gaps = 5/73 (6%)

Query: 212 IQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----E 267
           +   G+F+G CGT+LDHAVT VG+GT+ DG NY +IKNSWG  WG+ GYM++ R     +
Sbjct: 277 LYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ 335

Query: 268 GLCGIGTRSSYPL 280
           G CG+   S YP 
Sbjct: 336 GTCGVYKSSYYPF 348


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
          Length = 362

 Score =  155 bits (393), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 84/210 (40%), Positives = 119/210 (56%), Gaps = 9/210 (4%)

Query: 15  TTPMFIIITLLVSCASQVVSSRSTH------EQSVVEIHEKWMAQHGRSYKDELEKEMRL 68
            T   + + L  S    V +S   H      E+S+ +++E+W + H  S +   EK  R 
Sbjct: 2   ATKKLLWVVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVS-RSLGEKHKRF 60

Query: 69  KIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPS-HRSTTSSTFK 127
            +FK NL ++   NK  ++ YKL  N+F+D+TN EFR+ Y G K+  P   R T      
Sbjct: 61  NVFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGA 119

Query: 128 YQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQ 187
           +    +  VP S+DWR KGAVT +K+Q +CG CWAF+ V AVEGI +I++  L+ LSEQ+
Sbjct: 120 FMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQE 179

Query: 188 LLDCSTNGNNGCLGGSREKAFAYIIQNQGI 217
           L+DC    N GC GG  E AF +I Q  GI
Sbjct: 180 LVDCDKEENQGCNGGLMESAFEFIKQKGGI 209



 Score = 90.1 bits (222), Expect = 2e-17,   Method: Compositional matrix adjust.
 Identities = 37/70 (52%), Positives = 50/70 (71%), Gaps = 4/70 (5%)

Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
           +G+F G C T L+H V IVG+GTT DG NYW+++NSWG  WG+ GY+++ R+    EGLC
Sbjct: 275 EGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLC 334

Query: 271 GIGTRSSYPL 280
           GI    SYP+
Sbjct: 335 GIAMLPSYPI 344


>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
           GN=At4g11310 PE=2 SV=1
          Length = 364

 Score =  155 bits (392), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 87/233 (37%), Positives = 134/233 (57%), Gaps = 24/233 (10%)

Query: 16  TPMFIIITLLV--SCASQVVSSRSTHE-----QSVVE-----IHEKWMAQHGRSYKDELE 63
           + M I++  +V  SCA+ +  S  +++      SV +     I E WM +HG+ Y    E
Sbjct: 6   SAMLILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLIFESWMVKHGKVYGSVAE 65

Query: 64  KEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTS 123
           KE RL IF++NL +I   N E N +Y+LG   F+DL+  E++ +  G     P +    +
Sbjct: 66  KERRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHVFMT 124

Query: 124 STFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQL 183
           S+ +Y+  +   +P S+DWR++GAVT +K+Q  C  CWAF+ V AVEG+ KI +G L+ L
Sbjct: 125 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 184

Query: 184 SEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI----------FNGVCGTQL 226
           SEQ L++C+   NNGC GG  E A+ +I++N G+           NGVC  +L
Sbjct: 185 SEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRL 236



 Score = 91.7 bits (226), Expect = 6e-18,   Method: Compositional matrix adjust.
 Identities = 41/75 (54%), Positives = 55/75 (73%), Gaps = 5/75 (6%)

Query: 210 YIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD--- 266
           + +   G+F+G CGT L+H V +VG+GT E+G +YWL+KNS G TWG+AGYMK+ R+   
Sbjct: 279 FQLYESGVFDGSCGTNLNHGVVVVGYGT-ENGRDYWLVKNSRGITWGEAGYMKMARNIAN 337

Query: 267 -EGLCGIGTRSSYPL 280
             GLCGI  R+SYPL
Sbjct: 338 PRGLCGIAMRASYPL 352


>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
          Length = 360

 Score =  154 bits (389), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 98/303 (32%), Positives = 143/303 (47%), Gaps = 76/303 (25%)

Query: 49  KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
           ++  ++G+SY+   E   R +IF E+L+ +   N++G  +Y+LG N+F+D++ +EFRA  
Sbjct: 61  RFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKG-LSYRLGINRFADMSWEEFRAT- 118

Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
              ++ +  + S T +       +   +P + DWR+ G V+P+KNQ  CG CW F+   A
Sbjct: 119 ---RLGAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSPVKNQGHCGSCWTFSTTGA 175

Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
           +E      +G  I LSEQQL+DC    NN GC GG   +AF YI  N G+          
Sbjct: 176 LEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGGLDTEESYPYQG 235

Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
            NG+C                       +L  AV +V           GF          
Sbjct: 236 VNGICKFKNENVGVKVLDSVNITLGAEDELKDAVGLVRPVSVAFEVITGFRLYKSGVYTS 295

Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
              GTT               EDG  YWLIKNSWG  WGD GY K+   + +CG+ T +S
Sbjct: 296 DHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEGYFKMEMGKNMCGVATCAS 355

Query: 278 YPL 280
           YP+
Sbjct: 356 YPI 358


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
           GN=At3g19400 PE=2 SV=1
          Length = 362

 Score =  154 bits (389), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 77/180 (42%), Positives = 117/180 (65%), Gaps = 4/180 (2%)

Query: 39  HEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSD 98
           +E  V  ++E+W+ ++ ++Y    EKE R KIFK+NL+++++ N   +RT+++G  +F+D
Sbjct: 36  NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95

Query: 99  LTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECG 158
           LTN+EFRA+Y   KM   +  S  +  + Y+   +  +P  +DWR  GAV  +K+Q  CG
Sbjct: 96  LTNEEFRAIYLRKKMER-TKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCG 152

Query: 159 CCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
            CWAF+AV AVEGI +I +G LI LSEQ+L+DC     N GC GG    AF +I++N GI
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212



 Score = 79.0 bits (193), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 35/75 (46%), Positives = 48/75 (64%), Gaps = 5/75 (6%)

Query: 209 AYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD-- 266
           A+ +   G+  G CG  LDH V +VG+G+T  G +YW+I+NSWG  WGD+GY+K+ R+  
Sbjct: 274 AFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNID 332

Query: 267 --EGLCGIGTRSSYP 279
              G CGI    SYP
Sbjct: 333 DPFGKCGIAMMPSYP 347


>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
          Length = 362

 Score =  152 bits (384), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 105/306 (34%), Positives = 143/306 (46%), Gaps = 83/306 (27%)

Query: 49  KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
           ++  ++G+SY+   E   R +IF E+LE +   N++G   Y+LG N+FSD++ +EF+A  
Sbjct: 63  RFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRKG-LPYRLGINRFSDMSWEEFQATR 121

Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTD---VPTSLDWRDKGAVTPIKNQKECGCCWAFAA 165
            G          T S+T    +L M D   +P + DWR+ G V+P+KNQ  CG CW F+ 
Sbjct: 122 LGAAQ-------TCSATLAGNHL-MRDAAALPETKDWREDGIVSPVKNQAHCGSCWTFST 173

Query: 166 VAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI------- 217
             A+E      +G  I LSEQQL+DC+   NN GC GG   +AF YI  N GI       
Sbjct: 174 TGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIKYNGGIDTEESYP 233

Query: 218 ---FNGVCG---------------------TQLDHAVTIV-----------GF------- 235
               NGVC                       +L +AV +V           GF       
Sbjct: 234 YKGVNGVCHYKAENAAVQVLDSVNITLNAEDELKNAVGLVRPVSVAFQVIDGFRQYKSGV 293

Query: 236 ------GTTEDGAN---------------YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGT 274
                 GTT D  N               YWLIKNSWG  WGD GY K+   + +C I T
Sbjct: 294 YTSDHCGTTPDDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCAIAT 353

Query: 275 RSSYPL 280
            +SYP+
Sbjct: 354 CASYPV 359


>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
           PE=2 SV=2
          Length = 362

 Score =  151 bits (381), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 95/303 (31%), Positives = 142/303 (46%), Gaps = 77/303 (25%)

Query: 49  KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
           ++  +HG+ Y D  E + R +IF E+LE +   N+ G   Y+LG N+F+D++ +EF+A  
Sbjct: 64  RFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRG-LPYRLGINRFADMSWEEFQASR 122

Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAA 168
            G      +   + +    ++      +P + DWR+ G V+P+K+Q  CG CW F+   +
Sbjct: 123 LG-----AAQNCSATLAGNHRMRDAAALPETKDWREDGIVSPVKDQGHCGSCWTFSTTGS 177

Query: 169 VEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI---------- 217
           +E      +G  + LSEQQL+DC+T  NN GC GG   +AF YI  N G+          
Sbjct: 178 LEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGGLDTEEAYPYTG 237

Query: 218 FNGVCG---------------------TQLDHAVTIV-----------GF---------- 235
            NG+C                       +L +AV +V           GF          
Sbjct: 238 VNGICHYKPENVGVKVLDSVNITLGAEDELKNAVGLVRPVSVAFQVINGFRMYKSGVYTS 297

Query: 236 ---GTT---------------EDGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTRSS 277
              GT+               E+G  YWLIKNSWG  WGD GY K+   + +CGI T +S
Sbjct: 298 DHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKMEMGKNMCGIATCAS 357

Query: 278 YPL 280
           YP+
Sbjct: 358 YPI 360


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
          Length = 362

 Score =  150 bits (379), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 79/183 (43%), Positives = 113/183 (61%), Gaps = 11/183 (6%)

Query: 40  EQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDL 99
           E+S+ +++E+W + H  S +   EK  R  +FK N+ ++   NK  ++ YKL  N+F+D+
Sbjct: 33  EESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADM 90

Query: 100 TNDEFRALYTG-----YKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQ 154
           TN EFR+ Y G     +KM   S     S TF Y+ +    VP S+DWR KGAVT +K+Q
Sbjct: 91  TNHEFRSTYAGSKVNHHKMFRGSQHG--SGTFMYEKVG--SVPASVDWRKKGAVTDVKDQ 146

Query: 155 KECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQN 214
            +CG CWAF+ + AVEGI +I++  L+ LSEQ+L+DC    N GC GG  E AF +I Q 
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206

Query: 215 QGI 217
            GI
Sbjct: 207 GGI 209



 Score = 90.9 bits (224), Expect = 1e-17,   Method: Compositional matrix adjust.
 Identities = 37/70 (52%), Positives = 51/70 (72%), Gaps = 4/70 (5%)

Query: 215 QGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD----EGLC 270
           +G+F G C T L+H V IVG+GTT DG NYW+++NSWG  WG+ GY+++ R+    EGLC
Sbjct: 275 EGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEGLC 334

Query: 271 GIGTRSSYPL 280
           GI   +SYP+
Sbjct: 335 GIAMMASYPI 344


>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
          Length = 360

 Score =  150 bits (379), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 80/202 (39%), Positives = 119/202 (58%), Gaps = 21/202 (10%)

Query: 46  IHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFR 105
           ++E+W + H  S +   EK+ R  +FK N  ++  ANK  ++ YKL  N+F+D+TN EFR
Sbjct: 37  LYERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFR 94

Query: 106 ALYTGYKMPSPSHR-----STTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCC 160
             Y+G K+    HR        + TF Y+ +    VP S+DWR KGAVT +K+Q +CG C
Sbjct: 95  NTYSGSKVKH--HRMFRGGPRGNGTFMYEKVDT--VPASVDWRKKGAVTSVKDQGQCGSC 150

Query: 161 WAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGI--- 217
           WAF+ + AVEGI +I++  L+ LSEQ+L+DC T+ N GC GG  + AF +I Q  GI   
Sbjct: 151 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTE 210

Query: 218 -------FNGVCGTQLDHAVTI 232
                  ++G C    ++A  +
Sbjct: 211 ANYPYEAYDGTCDVSKENAPAV 232


>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
           SV=2
          Length = 490

 Score =  147 bits (372), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 75/159 (47%), Positives = 108/159 (67%), Gaps = 8/159 (5%)

Query: 63  EKEMRLKIFKENLEYIEKANKEGNRT--YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRS 120
           E E R ++F +NL++++  N   +    ++LG N+F+DLTN EFRA Y G   P+   R 
Sbjct: 84  EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142

Query: 121 TTSSTFKYQNLSMTDVPTSLDWRDKGAV-TPIKNQKECGCCWAFAAVAAVEGITKIRSGN 179
              +   Y++  +  +P S+DWRDKGAV  P+KNQ +CG CWAF+AVAAVEGI KI +G 
Sbjct: 143 VGEA---YRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199

Query: 180 LIQLSEQQLLDCSTNG-NNGCLGGSREKAFAYIIQNQGI 217
           L+ LSEQ+L++C+ NG N+GC GG  + AFA+I +N G+
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGL 238



 Score = 76.3 bits (186), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 37/84 (44%), Positives = 51/84 (60%), Gaps = 9/84 (10%)

Query: 202 GSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGT-TEDGANYWLIKNSWGNTWGDAGY 260
           G RE    + + + G+F G CGT LDH V  VG+GT    GA YW ++NSWG  WG+ GY
Sbjct: 295 GGRE----FQLYDSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGY 350

Query: 261 MKIVRD----EGLCGIGTRSSYPL 280
           +++ R+     G CGI   +SYP+
Sbjct: 351 IRMERNVTARTGKCGIAMMASYPI 374


>sp|Q94503|CYSP6_DICDI Cysteine proteinase 6 OS=Dictyostelium discoideum GN=cprF PE=2 SV=1
          Length = 434

 Score =  143 bits (361), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 87/203 (42%), Positives = 111/203 (54%), Gaps = 10/203 (4%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M ++  L V   S   + +   E         WM  H R Y  E E   R  IFK N++Y
Sbjct: 1   MKVLSALCVLLVSVATAKQQLSELQYRNAFTNWMIAHQRHYSSE-EFNGRFNIFKANMDY 59

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           I + N +G+ T  LG N F+D+TN+E+RA Y G    + S   T S     + +      
Sbjct: 60  INEWNTKGSETV-LGLNVFADITNEEYRATYLGTPFDASSLEMTPS-----EKVFGGVQA 113

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG--NLIQLSEQQLLDCS-TN 194
            S+DWR KGAVTPIKNQ ECG CW+F+A  A EG   I +G  +L  +SEQQL+DCS + 
Sbjct: 114 NSVDWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGDSDLTSVSEQQLIDCSGSY 173

Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
           GNNGC GG    AF YII N GI
Sbjct: 174 GNNGCEGGLMTLAFEYIINNGGI 196



 Score = 45.8 bits (107), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 27/40 (67%), Gaps = 1/40 (2%)

Query: 243 NYWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPLA 281
           NYW++KNSWG  WG  GY+ + +D +  CGI T +S P A
Sbjct: 388 NYWIVKNSWGLDWGINGYILMSKDKDNQCGIATMASIPQA 427


>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
          Length = 356

 Score =  143 bits (360), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 101/305 (33%), Positives = 144/305 (47%), Gaps = 82/305 (26%)

Query: 49  KWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALY 108
           ++  +H + Y    E + R +IF +NL+ I   N++G  +YKLG N+F+DLT DEFR   
Sbjct: 59  RFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKG-LSYKLGINEFTDLTWDEFRK-- 115

Query: 109 TGYKMPSPSHRSTTSSTFKYQNLSMTDV--PTSLDWRDKGAVTPIKNQKECGCCWAFAAV 166
             +K+ +  + S T+      NL +T+V  P + DWR  G V+P+K Q +CG CW F+  
Sbjct: 116 --HKLGASQNCSATTKG----NLKLTNVVLPETKDWRKDGIVSPVKAQGKCGSCWTFSTT 169

Query: 167 AAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGI-------- 217
            A+E       G  I LSEQQL+DC+   NN GC GG   +AF YI  N G+        
Sbjct: 170 GALEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPY 229

Query: 218 --FNGVCG---------------------TQLDHAVTIV-----------GF-------- 235
              NG+C                       +L +AV +V           GF        
Sbjct: 230 TGKNGICKFSQANIGVKVISSVNITLGAEYELKYAVALVRPVSVAFEVVKGFKQYKSGVY 289

Query: 236 GTTE--------------------DGANYWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTR 275
            +TE                    +G  YWLIKNSWG  WG+ GY K+   + +CG+ T 
Sbjct: 290 ASTECGDTPMDVNHAVLAVGYGVENGTPYWLIKNSWGADWGEDGYFKMEMGKNMCGVATC 349

Query: 276 SSYPL 280
           +SYP+
Sbjct: 350 ASYPI 354


>sp|P09668|CATH_HUMAN Pro-cathepsin H OS=Homo sapiens GN=CTSH PE=1 SV=4
          Length = 335

 Score =  143 bits (360), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 98/312 (31%), Positives = 138/312 (44%), Gaps = 81/312 (25%)

Query: 42  SVVEIHEK-WMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKEGNRTYKLGTNQFSDLT 100
           S+ + H K WM++H ++Y  E E   RL+ F  N   I  A+  GN T+K+  NQFSD++
Sbjct: 29  SLEKFHFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN-AHNNGNHTFKMALNQFSDMS 86

Query: 101 NDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGA-VTPIKNQKECGC 159
             E +  Y       P + S T S +          P S+DWR KG  V+P+KNQ  CG 
Sbjct: 87  FAEIKHKYL---WSEPQNCSATKSNYL---RGTGPYPPSVDWRKKGNFVSPVKNQGACGS 140

Query: 160 CWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTNGNN-GCLGGSREKAFAYIIQNQGIF 218
           CW F+   A+E    I +G ++ L+EQQL+DC+ + NN GC GG   +AF YI+ N+GI 
Sbjct: 141 CWTFSTTGALESAIAIATGKMLSLAEQQLVDCAQDFNNHGCQGGLPSQAFEYILYNKGIM 200

Query: 219 ----------NGVCGTQLDHAVTIV--------------------------GFGTTED-- 240
                     +G C  Q   A+  V                           F  T+D  
Sbjct: 201 GEDTYPYQGKDGYCKFQPGKAIGFVKDVANITIYDEEAMVEAVALYNPVSFAFEVTQDFM 260

Query: 241 --------------------------------GANYWLIKNSWGNTWGDAGYMKIVRDEG 268
                                           G  YW++KNSWG  WG  GY  I R + 
Sbjct: 261 MYRTGIYSSTSCHKTPDKVNHAVLAVGYGEKNGIPYWIVKNSWGPQWGMNGYFLIERGKN 320

Query: 269 LCGIGTRSSYPL 280
           +CG+   +SYP+
Sbjct: 321 MCGLAACASYPI 332


>sp|Q5E998|CATL2_BOVIN Cathepsin L2 OS=Bos taurus GN=CTSL2 PE=2 SV=1
          Length = 334

 Score =  142 bits (359), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 102/342 (29%), Positives = 155/342 (45%), Gaps = 91/342 (26%)

Query: 17  PMFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENL 75
           P F +  L +      V+S +      ++ H  +W A H R Y    E+E R  ++++N 
Sbjct: 3   PSFFLTVLCLG-----VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNK 56

Query: 76  EYIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
           + I+  N+E   G   +++  N F D+TN+EFR +  G++  +  H+        +    
Sbjct: 57  KIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQ--NQKHKKGKL----FHEPL 110

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
           + DVP S+DW  KG VTP+KNQ +CG CWAF+A  A+EG    ++G L+ LSEQ L+DCS
Sbjct: 111 LVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170

Query: 193 -------TNG-----------NNGCLGGS------------------------------- 203
                   NG           +NGCL                                  
Sbjct: 171 RAQGNQGCNGGLMDNAFQYIKDNGCLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIP 230

Query: 204 -REKAFAYIIQNQG--------------------IFNGVCGTQ-LDHAVTIVGFG---TT 238
            REKA    +   G                     ++  C ++ LDH V +VG+G   T 
Sbjct: 231 QREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290

Query: 239 EDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
            +   +W++KNSWG  WG  GY+K+ +D+   CGI T +SYP
Sbjct: 291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYP 332


>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
           SV=2
          Length = 322

 Score =  142 bits (359), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 75/176 (42%), Positives = 106/176 (60%), Gaps = 14/176 (7%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           E++  + GR Y D  E+  RL +F +NL+YIE+ NK+   G  TY L  NQFSD+TN++F
Sbjct: 21  EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80

Query: 105 RALYTGYKM-PSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAF 163
            A+  GYK  P P+   T++              T +DWR KGAVTP+K+Q +CG CWAF
Sbjct: 81  NAVMKGYKKGPRPAAVFTSTDA--------APESTEVDWRTKGAVTPVKDQGQCGSCWAF 132

Query: 164 AAVAAVEGITKIRSGNLIQLSEQQLLDCSTNG--NNGCLGGSREKAFAYIIQNQGI 217
           +    +EG   +++G L+ LSEQQL+DC+     N GC GG  E+A  Y+  N G+
Sbjct: 133 STTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGV 188



 Score = 66.6 bits (161), Expect = 2e-10,   Method: Compositional matrix adjust.
 Identities = 29/57 (50%), Positives = 42/57 (73%), Gaps = 2/57 (3%)

Query: 224 TQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYP 279
           +QLDHAV  VG+G+ E G ++WL+KNSW  +WG++GY+K+ R+    CGI T + YP
Sbjct: 265 SQLDHAVLAVGYGS-EGGQDFWLVKNSWATSWGESGYIKMARNRNNNCGIATDACYP 320


>sp|Q94504|CYSP7_DICDI Cysteine proteinase 7 OS=Dictyostelium discoideum GN=cprG PE=1 SV=1
          Length = 460

 Score =  142 bits (358), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 82/203 (40%), Positives = 110/203 (54%), Gaps = 12/203 (5%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           M ++  L V   S   + +   E         WM  H R Y  E E   R  IFK N++Y
Sbjct: 1   MKVLSALCVLLVSVATAKQQLSEVEYRNAFTNWMIAHQRHYSSE-EFNGRYNIFKANMDY 59

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVP 137
           + + N +G+ T  LG N F+D++N+E+RA Y G    + S   T S         + D  
Sbjct: 60  VNEWNTKGSETV-LGLNVFADISNEEYRATYLGTPFDASSLEMTESD-------KIFDAS 111

Query: 138 TSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG--NLIQLSEQQLLDCS-TN 194
             +DWR +GAVTPIKNQ +CG CW+F+   A EG   + +G  NL+ LSEQ L+DCS + 
Sbjct: 112 AQVDWRTQGAVTPIKNQGQCGGCWSFSTTGATEGAQYLANGKKNLVSLSEQNLIDCSGSY 171

Query: 195 GNNGCLGGSREKAFAYIIQNQGI 217
           GNNGC GG    AF YII N+GI
Sbjct: 172 GNNGCEGGLMTLAFEYIINNKGI 194



 Score = 44.7 bits (104), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 18/40 (45%), Positives = 27/40 (67%), Gaps = 1/40 (2%)

Query: 243 NYWLIKNSWGNTWGDAGYMKIVR-DEGLCGIGTRSSYPLA 281
           +YW++KNSWG +WG  GY+ + + +   CGI T +S P A
Sbjct: 417 DYWIVKNSWGTSWGMDGYILMTKGNNNQCGIATMASRPTA 456


>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus GN=VCATH
           PE=3 SV=1
          Length = 337

 Score =  142 bits (358), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 100/332 (30%), Positives = 144/332 (43%), Gaps = 79/332 (23%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEY 77
           + +I T+L+  +SQ+               E ++  + + Y D   K  R KIFK+NLE 
Sbjct: 3   LLMIFTILLVASSQIEGHLKFDIHDAQHYFETFIINYNKQYPDTKTKNYRFKIFKQNLED 62

Query: 78  IEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTD-- 135
           I + NK  N +     N+FSDL+ +E    YTG     PS+   ++S F   N+   D  
Sbjct: 63  INEKNKL-NDSAIYNINKFSDLSKNELLTKYTGLTSKKPSNMVRSTSNF--CNVIHLDAP 119

Query: 136 ------VPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLL 189
                 +P + DWR    +T +K+Q  CG CWA AAV  +E +  I+   LI LSEQQL+
Sbjct: 120 PDVHDELPQNFDWRVNNKMTSVKDQGACGSCWAHAAVGTLETLYAIKHNYLINLSEQQLI 179

Query: 190 DCSTNGNNGCLGGSRE-------------------------------KAFA--------Y 210
           DC +  N  C GG                                  K FA        Y
Sbjct: 180 DCDS-ANMACDGGLMHTAFEQLMNAGGLMEEIDYPYQGTKGVCKIDNKKFALSVSSCKRY 238

Query: 211 IIQNQ---------------------------GIFNGVCGTQLDHAVTIVGFGTTEDGAN 243
           I QN+                           GI +      L+HAV +VG+GT E G +
Sbjct: 239 IFQNEENLKKELITMGPIAMAIDAASISTYSKGIIHFCENLGLNHAVLLVGYGT-EGGVS 297

Query: 244 YWLIKNSWGNTWGDAGYMKIVRDEGLCGIGTR 275
           YW +KNSWG+ WG+ GY ++ R+   CG+  +
Sbjct: 298 YWTLKNSWGSDWGEDGYFRVKRNINACGLNNQ 329


>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
          Length = 331

 Score =  141 bits (356), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 78/206 (37%), Positives = 121/206 (58%), Gaps = 14/206 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++ +L+ C+S V      H+   ++ H   W   +G+ YK++ E+ +R  I+++NL+
Sbjct: 1   MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           ++   N E   G  +Y LG N   D+T++E  +L +  ++PS   R+ T     Y++   
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPSQWQRNIT-----YKSNPN 112

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
              GN GC GG    AF YII N+GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGI 198



 Score = 62.8 bits (151), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 50/78 (64%), Gaps = 3/78 (3%)

Query: 203 SREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMK 262
           +R  +F ++ ++   +   C   ++H V +VG+G   +G  YWL+KNSWG+ +G+ GY++
Sbjct: 254 ARHPSF-FLYRSGVYYEPSCTQNVNHGVLVVGYGDL-NGKEYWLVKNSWGHNFGEEGYIR 311

Query: 263 IVRDEG-LCGIGTRSSYP 279
           + R++G  CGI +  SYP
Sbjct: 312 MARNKGNHCGIASFPSYP 329


>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
           SV=1
          Length = 321

 Score =  140 bits (354), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 75/174 (43%), Positives = 106/174 (60%), Gaps = 10/174 (5%)

Query: 48  EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRTYKLGTNQFSDLTNDEF 104
           + +  Q+GR Y D  E+  R ++F++N + IE  NK+   G  T+K+  NQF D+TN+EF
Sbjct: 21  DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 80

Query: 105 RALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKECGCCWAFA 164
            A+  GYK  S   R    + F  +   M      +DWR K  VTP+K+Q++CG CWAF+
Sbjct: 81  NAVMKGYKKGS---RGEPKAVFTAEAGPMA---ADVDWRTKALVTPVKDQEQCGSCWAFS 134

Query: 165 AVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQGI 217
           A  A+EG   +++  L+ LSEQQL+DCST+ GN+GC GG    AF YI  N GI
Sbjct: 135 ATGALEGQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI 188



 Score = 70.1 bits (170), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 34/75 (45%), Positives = 49/75 (65%), Gaps = 4/75 (5%)

Query: 208 FAYIIQNQGIF--NGVCGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR 265
           F++   + G++       T LDH V  VG+GT E   +YWL+KNSWG++WGDAGY+K+ R
Sbjct: 246 FSFQFYSSGVYYEQNCSPTFLDHGVLAVGYGT-ESTKDYWLVKNSWGSSWGDAGYIKMSR 304

Query: 266 D-EGLCGIGTRSSYP 279
           + +  CGI +  SYP
Sbjct: 305 NRDNNCGIASEPSYP 319


>sp|P54639|CYSP4_DICDI Cysteine proteinase 4 OS=Dictyostelium discoideum GN=cprD PE=2 SV=2
          Length = 442

 Score =  138 bits (348), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 86/202 (42%), Positives = 114/202 (56%), Gaps = 15/202 (7%)

Query: 20  IIITLLVSCASQVVSSRSTHEQSVVEIHEKWMAQHGRSYKDELEKEMRLKIFKENLEYIE 79
            +  LLVS AS   + +   E         WM  H R+Y  E E   R +IFK N++Y+ 
Sbjct: 6   FLCLLLVSYAS---AKQQFSELQYRNAFTNWMQAHQRTYSSE-EFNARYQIFKSNMDYVH 61

Query: 80  KANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTS 139
           + N +G  T  LG N F+D+TN E+R  Y G    +P   S    T + + +  T  PT 
Sbjct: 62  QWNSKGGETV-LGLNVFADITNQEYRTTYLG----TPFDGSALIGT-EEEKIFSTPAPT- 114

Query: 140 LDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSG---NLIQLSEQQLLDCSTN-G 195
           +DWR +GAVTPIKNQ +CG CW+F+   + EG   I SG   +L+ LSEQ L+DCS + G
Sbjct: 115 VDWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSEQNLIDCSKSYG 174

Query: 196 NNGCLGGSREKAFAYIIQNQGI 217
           NNGC GG    AF YII N+GI
Sbjct: 175 NNGCEGGLMTLAFEYIINNKGI 196



 Score = 48.9 bits (115), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 20/40 (50%), Positives = 28/40 (70%), Gaps = 1/40 (2%)

Query: 243 NYWLIKNSWGNTWGDAGYMKIVRDE-GLCGIGTRSSYPLA 281
           NYW++KNSWG +WG  GY+ + +D    CGI T +S+P A
Sbjct: 400 NYWIVKNSWGTSWGMDGYIFMSKDRNNNCGIATMASFPTA 439


>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
          Length = 330

 Score =  138 bits (347), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 77/205 (37%), Positives = 119/205 (58%), Gaps = 13/205 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++ +L  C+S V      H+   ++ H   W   +G+ YK++ E+ +R  I+++NL+
Sbjct: 1   MKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           ++   N E   G  +Y LG N   D+T++E  +L +  ++P+   R+ T  +   Q L  
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPNQWQRNITYKSNPNQML-- 115

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCS 
Sbjct: 116 ---PDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSE 172

Query: 194 N-GNNGCLGGSREKAFAYIIQNQGI 217
             GN GC GG   +AF YII N+GI
Sbjct: 173 KYGNKGCNGGFMTEAFQYIIDNKGI 197



 Score = 65.9 bits (159), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 30/96 (31%), Positives = 55/96 (57%), Gaps = 2/96 (2%)

Query: 185 EQQLLDCSTNGNNGCLGGSREKAFAYIIQNQGIFNGVCGTQLDHAVTIVGFGTTEDGANY 244
           E  L +   N    C+G        ++ ++   ++  C  +++H V ++G+G   +G  Y
Sbjct: 234 EDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYGDL-NGKEY 292

Query: 245 WLIKNSWGNTWGDAGYMKIVRDEG-LCGIGTRSSYP 279
           WL+KNSWG+ +G+ GY+++ R++G  CGI +  SYP
Sbjct: 293 WLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 328


>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
          Length = 339

 Score =  138 bits (347), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 76/181 (41%), Positives = 106/181 (58%), Gaps = 9/181 (4%)

Query: 46  IHEKWMA---QHGRSYKDELEKEMRLKIFKENLEYIEKANK---EGNRTYKLGTNQFSDL 99
           I E+W     QH ++Y +E+E+  R+KIF EN   I K N+   +G  +YKLG N+++D+
Sbjct: 24  IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83

Query: 100 TNDEFRALYTGYK--MPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAVTPIKNQKEC 157
            + EF+    GY   +       T      Y   +   VP S+DWR+ GAVT +K+Q  C
Sbjct: 84  LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143

Query: 158 GCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN-GNNGCLGGSREKAFAYIIQNQG 216
           G CWAF++  A+EG    ++G L+ LSEQ L+DCST  GNNGC GG  + AF YI  N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203

Query: 217 I 217
           I
Sbjct: 204 I 204



 Score = 80.9 bits (198), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 37/74 (50%), Positives = 53/74 (71%), Gaps = 3/74 (4%)

Query: 209 AYIIQNQGIFNGV-CGTQ-LDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVRD 266
           ++ + ++G++N   C  Q LDH V +VG+GT E G +YWL+KNSWG TWG+ GY+K+ R+
Sbjct: 264 SFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMARN 323

Query: 267 E-GLCGIGTRSSYP 279
           +   CGI T SSYP
Sbjct: 324 QNNQCGIATASSYP 337


>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
          Length = 340

 Score =  137 bits (346), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 75/192 (39%), Positives = 111/192 (57%), Gaps = 12/192 (6%)

Query: 33  VSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLEYIEKANKE---GNRT 88
           V+         ++ H + W   H + YKD+ E+E+R  I+++NL++I   N E   G  T
Sbjct: 21  VAMEQLQRDPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHT 80

Query: 89  YKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSMTDVPTSLDWRDKGAV 148
           Y++G N   D+TN+E        ++P  S ++ T     +++ S   +P ++DWR+KG V
Sbjct: 81  YQVGMNDMGDMTNEEILCRMGALRIPRQSPKTVT-----FRSYSNRTLPDTVDWREKGCV 135

Query: 149 TPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCSTN---GNNGCLGGSRE 205
           T +K Q  CG CWAF+AV A+EG  K+++G LI LS Q L+DCS     GN GC GG   
Sbjct: 136 TEVKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMT 195

Query: 206 KAFAYIIQNQGI 217
           +AF YII N GI
Sbjct: 196 EAFQYIIDNGGI 207



 Score = 63.9 bits (154), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 47/73 (64%), Gaps = 3/73 (4%)

Query: 209 AYIIQNQGIFNGV-CGTQLDHAVTIVGFGTTEDGANYWLIKNSWGNTWGDAGYMKIVR-D 266
           ++     G+++   C   ++H V +VG+GT  DG +YWL+KNSWG  +GD GY+++ R +
Sbjct: 267 SFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNN 325

Query: 267 EGLCGIGTRSSYP 279
           +  CGI +  SYP
Sbjct: 326 KNHCGIASYCSYP 338


>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
          Length = 331

 Score =  137 bits (346), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 79/206 (38%), Positives = 115/206 (55%), Gaps = 14/206 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++ LL  C+  V      H+   ++ H   W   + + YK+E E+  R  I+++NL+
Sbjct: 1   MKWLVGLLPLCSYAVAQ---VHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
           ++   N E   G  +Y LG N   D+T +E  +L    ++PS   R+ T     Y++ S 
Sbjct: 58  FVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVT-----YRSNSN 112

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
             +P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 113 QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
              GN GC GG    AF YII N GI
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNNGI 198


>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
          Length = 344

 Score =  137 bits (346), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 82/205 (40%), Positives = 112/205 (54%), Gaps = 21/205 (10%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIHEK-----WMAQHGRSYKDELEKEMRLKIFK 72
           +  +  LLVS A        T +Q   E+  +     WM  H +SY  E E   R  IFK
Sbjct: 4   LSFLCVLLVSVA--------TAKQQFSELQYRNAFTDWMITHQKSYTSE-EFGARYNIFK 54

Query: 73  ENLEYIEKANKEGNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLS 132
            N++Y+++ N +G+ T  LG N F+D+TN+E+R  Y G K  + S   T     + + + 
Sbjct: 55  ANMDYVQQWNSKGSETV-LGLNNFADITNEEYRNTYLGTKFDASSLIGT-----QEEKVF 108

Query: 133 MTDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCS 192
            T    S DWR +GAVTP+KNQ +CG CW+F+   + EG      G L+ LSEQ L+DCS
Sbjct: 109 TTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCS 168

Query: 193 TNGNNGCLGGSREKAFAYIIQNQGI 217
           T  N+GC GG    AF YII N GI
Sbjct: 169 TE-NSGCDGGLMTYAFEYIINNNGI 192



 Score = 45.8 bits (107), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 17/38 (44%), Positives = 28/38 (73%), Gaps = 1/38 (2%)

Query: 244 YWLIKNSWGNTWGDAGYMKIVRD-EGLCGIGTRSSYPL 280
           YW++KNSWG +WG  GY+ + R+ +  CGI + +S+P+
Sbjct: 306 YWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFPV 343


>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
          Length = 331

 Score =  137 bits (345), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 78/206 (37%), Positives = 118/206 (57%), Gaps = 14/206 (6%)

Query: 18  MFIIITLLVSCASQVVSSRSTHEQSVVEIH-EKWMAQHGRSYKDELEKEMRLKIFKENLE 76
           M  ++  L+ C+S +      H    ++ H + W   +G+ YK++ E+  R  I+++NL+
Sbjct: 1   MNWLVWALLLCSSAMAH---VHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57

Query: 77  YIEKANKE---GNRTYKLGTNQFSDLTNDEFRALYTGYKMPSPSHRSTTSSTFKYQNLSM 133
            +   N E   G  +Y+LG N   D+T++E  +L +  ++PS   R+ T  +   Q L  
Sbjct: 58  TVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKL-- 115

Query: 134 TDVPTSLDWRDKGAVTPIKNQKECGCCWAFAAVAAVEGITKIRSGNLIQLSEQQLLDCST 193
              P S+DWR+KG VT +K Q  CG CWAF+AV A+E   K+++G L+ LS Q L+DCST
Sbjct: 116 ---PDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCST 172

Query: 194 N--GNNGCLGGSREKAFAYIIQNQGI 217
              GN GC GG   +AF YII N GI
Sbjct: 173 AKYGNKGCNGGFMTEAFQYIIDNNGI 198


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.317    0.132    0.402 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 102,919,239
Number of Sequences: 539616
Number of extensions: 4214831
Number of successful extensions: 12963
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 224
Number of HSP's successfully gapped in prelim test: 22
Number of HSP's that attempted gapping in prelim test: 11927
Number of HSP's gapped (non-prelim): 536
length of query: 281
length of database: 191,569,459
effective HSP length: 116
effective length of query: 165
effective length of database: 128,974,003
effective search space: 21280710495
effective search space used: 21280710495
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 60 (27.7 bits)