BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 047400
         (88 letters)

Database: swissprot 
           539,616 sequences; 191,569,459 total letters

Searching..................................................done



>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
           GN=GCP1 PE=2 SV=2
          Length = 376

 Score = 83.6 bits (205), Expect = 4e-16,   Method: Compositional matrix adjust.
 Identities = 40/85 (47%), Positives = 55/85 (64%), Gaps = 7/85 (8%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
           GSCW F+   A+EGI+KIVT  L+ +S Q+LVDCD   NQG    C GG ++  +Q++++
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQG----CNGGLMDYAFQFIMK 222

Query: 62  NRGINTERDYPNVGVMDNCKVFQFN 86
           N G+NTE+DYP  G    C  F  N
Sbjct: 223 NGGLNTEKDYPYRGFGGKCNSFLKN 247


>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
           SV=1
          Length = 462

 Score = 82.8 bits (203), Expect = 5e-16,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 57/82 (69%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +GA+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  ++++I+N G
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 217

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+T++DYP  GV   C   + N
Sbjct: 218 IDTDKDYPYKGVDGTCDQIRKN 239


>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
           GN=CEP2 PE=2 SV=1
          Length = 361

 Score = 82.0 bits (201), Expect = 1e-15,   Method: Composition-based stats.
 Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD + ++  C GG +E  ++++ +N G
Sbjct: 150 GSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTK-QNEGCNGGLMEIAFEFIKKNGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE  YP  G+   C   + N
Sbjct: 209 ITTEDSYPYEGIDGKCDASKDN 230


>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
          Length = 328

 Score = 81.6 bits (200), Expect = 1e-15,   Method: Compositional matrix adjust.
 Identities = 36/76 (47%), Positives = 53/76 (69%), Gaps = 1/76 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+   A+EGI+KIVT  LV +S Q+LVDCD +  ++ C GG ++  +Q++++N G
Sbjct: 122 GSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGG 180

Query: 65  INTERDYPNVGVMDNC 80
           +NTE+DYP  G    C
Sbjct: 181 LNTEKDYPYHGTNGKC 196


>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
           GN=CEP3 PE=2 SV=1
          Length = 364

 Score = 81.3 bits (199), Expect = 2e-15,   Method: Composition-based stats.
 Identities = 35/68 (51%), Positives = 48/68 (70%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD + E++ C GG +E  ++++  N G
Sbjct: 148 GSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGG 206

Query: 65  INTERDYP 72
           I TE  YP
Sbjct: 207 IKTEETYP 214


>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
          Length = 215

 Score = 80.1 bits (196), Expect = 4e-15,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 53/81 (65%), Gaps = 2/81 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+E I+KI T  L+ +S Q+LVDCD    S  C GG++   +QY+I N G
Sbjct: 23  GSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDT--ASHGCNGGWMNNAFQYIITNGG 80

Query: 65  INTERDYPNVGVMDNCKVFQF 85
           I+T+++YP   V  +CK ++ 
Sbjct: 81  IDTQQNYPYSAVQGSCKPYRL 101


>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
          Length = 323

 Score = 79.7 bits (195), Expect = 4e-15,   Method: Composition-based stats.
 Identities = 36/77 (46%), Positives = 49/77 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EG   + T +LV +S Q LVDC +   ++ C GG+    YQY+I NRG
Sbjct: 128 GSCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRG 187

Query: 65  INTERDYPNVGVMDNCK 81
           I+TE  YP   + DNC+
Sbjct: 188 IDTESSYPYKAIDDNCR 204


>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
           SV=2
          Length = 490

 Score = 79.7 bits (195), Expect = 5e-15,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 51/78 (65%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT  LV +S Q+LV+C   G++  C GG ++  + ++ +N G
Sbjct: 178 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGG 237

Query: 65  INTERDYPNVGVMDNCKV 82
           ++TE DYP   +   C +
Sbjct: 238 LDTEEDYPYTAMDGKCNL 255


>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
           PE=1 SV=2
          Length = 458

 Score = 79.3 bits (194), Expect = 7e-15,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD    +  C GG ++  + ++I N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP  G  + C V + N
Sbjct: 210 IDTEDDYPYKGKDERCDVNRKN 231


>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
          Length = 362

 Score = 79.0 bits (193), Expect = 8e-15,   Method: Composition-based stats.
 Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q  G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP       C   + N
Sbjct: 209 ITTESNYPYKAQEGTCDASKVN 230


>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
           SV=2
          Length = 356

 Score = 78.6 bits (192), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI+KIVT NL  +S Q+L+DCD    +  C GG ++  ++Y+++N G
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGG 218

Query: 65  INTERDYP 72
           +  E DYP
Sbjct: 219 LRKEEDYP 226


>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
          Length = 362

 Score = 78.2 bits (191), Expect = 1e-14,   Method: Composition-based stats.
 Identities = 37/81 (45%), Positives = 54/81 (66%), Gaps = 4/81 (4%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q  G
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208

Query: 65  INTERDYPNV---GVMDNCKV 82
           I TE +YP     G  D  KV
Sbjct: 209 ITTESNYPYTAQEGTCDESKV 229


>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
          Length = 380

 Score = 76.6 bits (187), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 48/78 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ +  +EGI+KIVT  L+ +S Q+L+DC     +R C GG+I   +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C V
Sbjct: 209 INTEENYPYTAQDGECNV 226


>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
           GN=At4g11310 PE=2 SV=1
          Length = 364

 Score = 76.6 bits (187), Expect = 4e-14,   Method: Composition-based stats.
 Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 2/75 (2%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ VGA+EG++KIVT  LV +S Q L++C+   E+  C GG +ET Y+++++N G+
Sbjct: 160 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKLETAYEFIMKNGGL 217

Query: 66  NTERDYPNVGVMDNC 80
            T+ DYP   V   C
Sbjct: 218 GTDNDYPYKAVNGVC 232


>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
           PE=1 SV=2
          Length = 466

 Score = 75.9 bits (185), Expect = 6e-14,   Method: Composition-based stats.
 Identities = 31/82 (37%), Positives = 52/82 (63%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V  +E I+++VT  ++ +S Q+LV+C   G++  C GG ++  + ++I+N G
Sbjct: 163 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 222

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I+TE DYP   V   C + + N
Sbjct: 223 IDTEDDYPYKAVDGKCDINREN 244


>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
          Length = 380

 Score = 75.9 bits (185), Expect = 6e-14,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 48/78 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G CW F+ +  +EGI+KIVT  L+ +S Q+L+DC     +R C GG+I   +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208

Query: 65  INTERDYPNVGVMDNCKV 82
           INTE +YP       C +
Sbjct: 209 INTEENYPYTAQDGECNL 226


>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
          Length = 330

 Score = 75.9 bits (185), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  +NC
Sbjct: 196 IDSEDAYPYVGQDENC 211


>sp|Q5E968|CATK_BOVIN Cathepsin K OS=Bos taurus GN=CTSK PE=2 SV=2
          Length = 329

 Score = 75.9 bits (185), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  +NC
Sbjct: 195 IDSEDAYPYVGQDENC 210


>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
           GN=At4g11320 PE=2 SV=1
          Length = 371

 Score = 75.9 bits (185), Expect = 7e-14,   Method: Composition-based stats.
 Identities = 32/67 (47%), Positives = 48/67 (71%), Gaps = 2/67 (2%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ VGA+EG++KIVT  LV +S Q L++C+   E+  C GG +ET Y++++ N G+
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMNNGGL 224

Query: 66  NTERDYP 72
            T+ DYP
Sbjct: 225 GTDNDYP 231


>sp|P84347|MEX2_JACME Chymomexicain OS=Jacaratia mexicana PE=1 SV=1
          Length = 215

 Score = 75.5 bits (184), Expect = 9e-14,   Method: Compositional matrix adjust.
 Identities = 34/81 (41%), Positives = 49/81 (60%), Gaps = 2/81 (2%)

Query: 2  HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
          +P GSCW F+ V  +EGI+KI T  L+ +S Q+L+DCD +  S  C GG+     QYV  
Sbjct: 20 NPCGSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRR--SHGCKGGYQTGSIQYVAD 77

Query: 62 NRGINTERDYPNVGVMDNCKV 82
          N G++TE++YP       C+ 
Sbjct: 78 NGGVHTEKEYPYEKKQGKCRA 98


>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
          Length = 330

 Score = 75.1 bits (183), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 32/77 (41%), Positives = 47/77 (61%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +  ++ C GGF+   +QY+I N+G
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKG 196

Query: 65  INTERDYPNVGVMDNCK 81
           I++E  YP       C+
Sbjct: 197 IDSEASYPYKATDQKCQ 213


>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
           lycopersicum PE=2 SV=1
          Length = 346

 Score = 75.1 bits (183), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 34/68 (50%), Positives = 49/68 (72%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD +  +  C GG ++  +++VI+N G
Sbjct: 40  GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-RSYNEGCDGGLMDYAFEFVIKNGG 98

Query: 65  INTERDYP 72
           I+TE DYP
Sbjct: 99  IDTEEDYP 106


>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
           SV=1
          Length = 355

 Score = 75.1 bits (183), Expect = 1e-13,   Method: Composition-based stats.
 Identities = 32/68 (47%), Positives = 45/68 (66%), Gaps = 1/68 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V A+EGI++I T NL  +S Q+L+DCD    S  C GG ++  +QY+I   G
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNS-GCNGGLMDYAFQYIISTGG 217

Query: 65  INTERDYP 72
           ++ E DYP
Sbjct: 218 LHKEDDYP 225


>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
          Length = 221

 Score = 74.7 bits (182), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 49/80 (61%), Gaps = 2/80 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F  + A+EGI++IVT +L+ +S QQLVDC  +  +  C GG+    +QY+I N G
Sbjct: 25  GSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTR--NHGCEGGWPYRAFQYIINNGG 82

Query: 65  INTERDYPNVGVMDNCKVFQ 84
           IN+E  YP  G    C   +
Sbjct: 83  INSEEHYPYTGTNGTCDTKE 102


>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
          Length = 329

 Score = 74.3 bits (181), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
          Length = 329

 Score = 74.3 bits (181), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
          Length = 329

 Score = 74.3 bits (181), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210


>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
           GN=At3g19400 PE=2 SV=1
          Length = 362

 Score = 74.3 bits (181), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 48/68 (70%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EGI++I T  L+ +S Q+LVDCD    +  C GG +   ++++++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211

Query: 65  INTERDYP 72
           I T++DYP
Sbjct: 212 IETDQDYP 219


>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
          Length = 330

 Score = 74.3 bits (181), Expect = 2e-13,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211


>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
          Length = 360

 Score = 73.9 bits (180), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ + A+EGI++I TN LV +S Q+LVDCD   +++ C GG ++  ++++ Q  G
Sbjct: 148 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGG 206

Query: 65  INTERDYPNVGVMDNCKVFQFN 86
           I TE +YP       C V + N
Sbjct: 207 ITTEANYPYEAYDGTCDVSKEN 228


>sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus GN=CTSK PE=1 SV=1
          Length = 329

 Score = 73.9 bits (180), Expect = 3e-13,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ VGA+EG  K  T  L+++S Q LVDC    E+  C GG++   +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGGYMTNAFQYVQRNRG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210


>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
          Length = 331

 Score = 73.6 bits (179), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI+++  YP   +   C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214


>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
          Length = 331

 Score = 73.6 bits (179), Expect = 4e-13,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
           G+CW F+ VGA+E   K+ T  LV +S Q LVDC  +   ++ C GGF+ T +QY+I N 
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNN 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +   C+
Sbjct: 197 GIDSEASYPYKAMNGKCR 214


>sp|P84346|MEX1_JACME Mexicain OS=Jacaratia mexicana PE=1 SV=1
          Length = 214

 Score = 73.2 bits (178), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 51/81 (62%), Gaps = 3/81 (3%)

Query: 2  HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
          +P GSCW F+ V  IEGI+KI+T  L+ +S Q+L+DC+ +  S  C GG+     QYV+ 
Sbjct: 20 NPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYR--SHGCDGGYQTPSLQYVVD 77

Query: 62 NRGINTERDYPNVGVMDNCKV 82
          N G++TER+YP       C+ 
Sbjct: 78 N-GVHTEREYPYEKKQGRCRA 97


>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
          Length = 208

 Score = 72.8 bits (177), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 34/78 (43%), Positives = 49/78 (62%), Gaps = 2/78 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V  +E I++I T NL+ +S Q+LVDCD   ++  C+GG     YQY+I N G
Sbjct: 23 GSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDK--KNHGCLGGAFVFAYQYIINNGG 80

Query: 65 INTERDYPNVGVMDNCKV 82
          I+T+ +YP   V   C+ 
Sbjct: 81 IDTQANYPYKAVQGPCQA 98


>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
          Length = 221

 Score = 72.4 bits (176), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 33/71 (46%), Positives = 48/71 (67%), Gaps = 2/71 (2%)

Query: 5  GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
          GSCW F+ V A+EGI++IVT +L+ +S QQLVDC     +  C GG++   +Q+++ N G
Sbjct: 25 GSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGG 82

Query: 65 INTERDYPNVG 75
          IN+E  YP  G
Sbjct: 83 INSEETYPYRG 93


>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
          Length = 334

 Score = 72.4 bits (176), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 34/77 (44%), Positives = 46/77 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     T  LV +S Q LVDC     ++ C GGF+   +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195

Query: 65  INTERDYPNVGVMDNCK 81
           +++E  YP V V + CK
Sbjct: 196 LDSEESYPYVAVDEICK 212


>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
          Length = 352

 Score = 72.4 bits (176), Expect = 8e-13,   Method: Composition-based stats.
 Identities = 35/77 (45%), Positives = 48/77 (62%), Gaps = 3/77 (3%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ +  +EGI+KIVT NL+++S Q+LVDCD    S  C GG+  T  QYV  N G
Sbjct: 157 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYV-ANNG 213

Query: 65  INTERDYPNVGVMDNCK 81
           ++T + YP       C+
Sbjct: 214 VHTSKVYPYQAKQYKCR 230


>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
          Length = 351

 Score = 71.6 bits (174), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 3/84 (3%)

Query: 2   HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
           +P GSCW FA +  +EGI KI T  LV +S Q+++DC     S  C GG++   Y ++I 
Sbjct: 142 NPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDC---AVSYGCKGGWVNKAYDFIIS 198

Query: 62  NRGINTERDYPNVGVMDNCKVFQF 85
           N G+ TE +YP +     C    F
Sbjct: 199 NNGVTTEENYPYLAYQGTCNANSF 222


>sp|P55097|CATK_MOUSE Cathepsin K OS=Mus musculus GN=Ctsk PE=2 SV=2
          Length = 329

 Score = 71.6 bits (174), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 48/76 (63%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG  K  T  L+ +S Q LVDC    E+  C GG++ T +QYV QN G
Sbjct: 137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVT--ENYGCGGGYMTTAFQYVQQNGG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210


>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
          Length = 329

 Score = 71.6 bits (174), Expect = 1e-12,   Method: Composition-based stats.
 Identities = 36/76 (47%), Positives = 48/76 (63%), Gaps = 2/76 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG  K  T  L+ +S Q LVDC    E+  C GG++ T +QYV QN G
Sbjct: 137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDC--VSENYGCGGGYMTTAFQYVQQNGG 194

Query: 65  INTERDYPNVGVMDNC 80
           I++E  YP VG  ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210


>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
          Length = 331

 Score = 71.2 bits (173), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
           GSCW F+ VGA+E   K+ T  LV +S Q LVDC   +  ++ C GGF+   +QY+I N 
Sbjct: 137 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 196

Query: 64  GINTERDYPNVGVMDNCK 81
           GI++E  YP   +   C+
Sbjct: 197 GIDSEASYPYKAMDGKCQ 214


>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
          Length = 337

 Score = 70.9 bits (172), Expect = 2e-12,   Method: Composition-based stats.
 Identities = 30/78 (38%), Positives = 49/78 (62%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSC+ F+  G++EG++ I T  LV +S Q ++DC +   +  C GG +   ++Y+I+N G
Sbjct: 143 GSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNG 202

Query: 65  INTERDYP-NVGVMDNCK 81
           +N+E  YP  + V D CK
Sbjct: 203 LNSEEQYPYEMKVNDECK 220


>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
          Length = 340

 Score = 70.5 bits (171), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
           G+CW F+ VGA+EG  K+ T  L+ +S Q LVDC N+ +  ++ C GG++   +QY+I N
Sbjct: 145 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 204

Query: 63  RGINTERDYPNVGVMDNC 80
            GI  +  YP     + C
Sbjct: 205 GGIEADASYPYKATDEKC 222


>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
          Length = 373

 Score = 70.5 bits (171), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++EGI+ I T +LV +S Q+L+DCD   ++  C GG ++  ++Y+  N G
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGG 214

Query: 65  INTERDYPNVGVMDNCKV 82
           + TE  YP       C V
Sbjct: 215 LITEAAYPYRAARGTCNV 232


>sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium discoideum GN=cprB PE=2 SV=1
          Length = 376

 Score = 70.5 bits (171), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 30/68 (44%), Positives = 41/68 (60%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  G+ EG   + T  LV +S Q LVDC    E+  C GG +   + Y+I+N+G
Sbjct: 145 GSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKG 204

Query: 65  INTERDYP 72
           I+TE  YP
Sbjct: 205 IDTESSYP 212


>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
          Length = 371

 Score = 70.5 bits (171), Expect = 3e-12,   Method: Composition-based stats.
 Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 1/78 (1%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+ V ++EGI+ I T +LV +S Q+L+DCD   ++  C GG ++  ++Y+  N G
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGG 214

Query: 65  INTERDYPNVGVMDNCKV 82
           + TE  YP       C V
Sbjct: 215 LITEAAYPYRAARGTCNV 232


>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
          Length = 348

 Score = 70.1 bits (170), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 37/79 (46%), Positives = 47/79 (59%), Gaps = 3/79 (3%)

Query: 6   SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
           SCW F+ V  +EGI+KI T NLV++S Q+LVDCD Q  S  C  G+  T  QYV QN GI
Sbjct: 156 SCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ--SYGCNRGYQSTSLQYVAQN-GI 212

Query: 66  NTERDYPNVGVMDNCKVFQ 84
           +    YP +     C+  Q
Sbjct: 213 HLRAKYPYIAKQQTCRANQ 231


>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
          Length = 339

 Score = 69.3 bits (168), Expect = 6e-12,   Method: Composition-based stats.
 Identities = 31/76 (40%), Positives = 45/76 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG        LV +S Q LVDC  +  +  C GG ++  ++Y+  N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+ YP  G+ D+C
Sbjct: 204 IDTEKSYPYEGIDDSC 219


>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
          Length = 371

 Score = 68.6 bits (166), Expect = 1e-11,   Method: Composition-based stats.
 Identities = 30/76 (39%), Positives = 45/76 (59%)

Query: 5   GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
           GSCW F+  GA+EG     +  LV +S Q LVDC  +  +  C GG ++  ++Y+  N G
Sbjct: 176 GSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 235

Query: 65  INTERDYPNVGVMDNC 80
           I+TE+ YP   + D+C
Sbjct: 236 IDTEKSYPYEAIDDSC 251


  Database: swissprot
    Posted date:  Mar 23, 2013  2:32 AM
  Number of letters in database: 191,569,459
  Number of sequences in database:  539,616
  
Lambda     K      H
   0.323    0.140    0.458 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 33,496,859
Number of Sequences: 539616
Number of extensions: 1153627
Number of successful extensions: 2457
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 162
Number of HSP's successfully gapped in prelim test: 26
Number of HSP's that attempted gapping in prelim test: 2195
Number of HSP's gapped (non-prelim): 189
length of query: 88
length of database: 191,569,459
effective HSP length: 58
effective length of query: 30
effective length of database: 160,271,731
effective search space: 4808151930
effective search space used: 4808151930
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 55 (25.8 bits)