BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047400
(88 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 83.6 bits (205), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 40/85 (47%), Positives = 55/85 (64%), Gaps = 7/85 (8%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCD---NQGESRSCVGGFIETIYQYVIQ 61
GSCW F+ A+EGI+KIVT L+ +S Q+LVDCD NQG C GG ++ +Q++++
Sbjct: 167 GSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQG----CNGGLMDYAFQFIMK 222
Query: 62 NRGINTERDYPNVGVMDNCKVFQFN 86
N G+NTE+DYP G C F N
Sbjct: 223 NGGLNTEKDYPYRGFGGKCNSFLKN 247
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 82.8 bits (203), Expect = 5e-16, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 57/82 (69%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ +GA+EGI++IVT +L+ +S Q+LVDCD + C GG ++ ++++I+N G
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTS-YNEGCNGGLMDYAFEFIIKNGG 217
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+T++DYP GV C + N
Sbjct: 218 IDTDKDYPYKGVDGTCDQIRKN 239
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 82.0 bits (201), Expect = 1e-15, Method: Composition-based stats.
Identities = 37/82 (45%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD + ++ C GG +E ++++ +N G
Sbjct: 150 GSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTK-QNEGCNGGLMEIAFEFIKKNGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE YP G+ C + N
Sbjct: 209 ITTEDSYPYEGIDGKCDASKDN 230
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 81.6 bits (200), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 36/76 (47%), Positives = 53/76 (69%), Gaps = 1/76 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ A+EGI+KIVT LV +S Q+LVDCD + ++ C GG ++ +Q++++N G
Sbjct: 122 GSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGG 180
Query: 65 INTERDYPNVGVMDNC 80
+NTE+DYP G C
Sbjct: 181 LNTEKDYPYHGTNGKC 196
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 81.3 bits (199), Expect = 2e-15, Method: Composition-based stats.
Identities = 35/68 (51%), Positives = 48/68 (70%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KI TN LV +S Q+LVDCD + E++ C GG +E ++++ N G
Sbjct: 148 GSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTE-ENQGCAGGLMEPAFEFIKNNGG 206
Query: 65 INTERDYP 72
I TE YP
Sbjct: 207 IKTEETYP 214
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
Length = 215
Score = 80.1 bits (196), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 53/81 (65%), Gaps = 2/81 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+KI T L+ +S Q+LVDCD S C GG++ +QY+I N G
Sbjct: 23 GSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDT--ASHGCNGGWMNNAFQYIITNGG 80
Query: 65 INTERDYPNVGVMDNCKVFQF 85
I+T+++YP V +CK ++
Sbjct: 81 IDTQQNYPYSAVQGSCKPYRL 101
>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
Length = 323
Score = 79.7 bits (195), Expect = 4e-15, Method: Composition-based stats.
Identities = 36/77 (46%), Positives = 49/77 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EG + T +LV +S Q LVDC + ++ C GG+ YQY+I NRG
Sbjct: 128 GSCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRG 187
Query: 65 INTERDYPNVGVMDNCK 81
I+TE YP + DNC+
Sbjct: 188 IDTESSYPYKAIDDNCR 204
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 79.7 bits (195), Expect = 5e-15, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 51/78 (65%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT LV +S Q+LV+C G++ C GG ++ + ++ +N G
Sbjct: 178 GSCWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGG 237
Query: 65 INTERDYPNVGVMDNCKV 82
++TE DYP + C +
Sbjct: 238 LDTEEDYPYTAMDGKCNL 255
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 79.3 bits (194), Expect = 7e-15, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 54/82 (65%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++IVT +L+ +S Q+LVDCD + C GG ++ + ++I N G
Sbjct: 151 GSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTS-YNEGCNGGLMDYAFDFIINNGG 209
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP G + C V + N
Sbjct: 210 IDTEDDYPYKGKDERCDVNRKN 231
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 79.0 bits (193), Expect = 8e-15, Method: Composition-based stats.
Identities = 36/82 (43%), Positives = 53/82 (64%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q G
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP C + N
Sbjct: 209 ITTESNYPYKAQEGTCDASKVN 230
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 78.6 bits (192), Expect = 1e-14, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 47/68 (69%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI+KIVT NL +S Q+L+DCD + C GG ++ ++Y+++N G
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTT-YNNGCNGGLMDYAFEYIVKNGG 218
Query: 65 INTERDYP 72
+ E DYP
Sbjct: 219 LRKEEDYP 226
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 78.2 bits (191), Expect = 1e-14, Method: Composition-based stats.
Identities = 37/81 (45%), Positives = 54/81 (66%), Gaps = 4/81 (4%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++I TN LV +S Q+LVDCD + E++ C GG +E+ ++++ Q G
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKE-ENQGCNGGLMESAFEFIKQKGG 208
Query: 65 INTERDYPNV---GVMDNCKV 82
I TE +YP G D KV
Sbjct: 209 ITTESNYPYTAQEGTCDESKV 229
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 76.6 bits (187), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 48/78 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + +EGI+KIVT L+ +S Q+L+DC +R C GG+I +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C V
Sbjct: 209 INTEENYPYTAQDGECNV 226
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 76.6 bits (187), Expect = 4e-14, Method: Composition-based stats.
Identities = 34/75 (45%), Positives = 51/75 (68%), Gaps = 2/75 (2%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ VGA+EG++KIVT LV +S Q L++C+ E+ C GG +ET Y+++++N G+
Sbjct: 160 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKLETAYEFIMKNGGL 217
Query: 66 NTERDYPNVGVMDNC 80
T+ DYP V C
Sbjct: 218 GTDNDYPYKAVNGVC 232
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 75.9 bits (185), Expect = 6e-14, Method: Composition-based stats.
Identities = 31/82 (37%), Positives = 52/82 (63%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I+++VT ++ +S Q+LV+C G++ C GG ++ + ++I+N G
Sbjct: 163 GSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGG 222
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I+TE DYP V C + + N
Sbjct: 223 IDTEDDYPYKAVDGKCDINREN 244
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 75.9 bits (185), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 48/78 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G CW F+ + +EGI+KIVT L+ +S Q+L+DC +R C GG+I +Q++I N G
Sbjct: 149 GGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGG 208
Query: 65 INTERDYPNVGVMDNCKV 82
INTE +YP C +
Sbjct: 209 INTEENYPYTAQDGECNL 226
>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
Length = 330
Score = 75.9 bits (185), Expect = 7e-14, Method: Composition-based stats.
Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG +NC
Sbjct: 196 IDSEDAYPYVGQDENC 211
>sp|Q5E968|CATK_BOVIN Cathepsin K OS=Bos taurus GN=CTSK PE=2 SV=2
Length = 329
Score = 75.9 bits (185), Expect = 7e-14, Method: Composition-based stats.
Identities = 37/76 (48%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG +NC
Sbjct: 195 IDSEDAYPYVGQDENC 210
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 75.9 bits (185), Expect = 7e-14, Method: Composition-based stats.
Identities = 32/67 (47%), Positives = 48/67 (71%), Gaps = 2/67 (2%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ VGA+EG++KIVT LV +S Q L++C+ E+ C GG +ET Y++++ N G+
Sbjct: 167 SCWAFSTVGAVEGLNKIVTGELVTLSEQDLINCNK--ENNGCGGGKVETAYEFIMNNGGL 224
Query: 66 NTERDYP 72
T+ DYP
Sbjct: 225 GTDNDYP 231
>sp|P84347|MEX2_JACME Chymomexicain OS=Jacaratia mexicana PE=1 SV=1
Length = 215
Score = 75.5 bits (184), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 49/81 (60%), Gaps = 2/81 (2%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ V +EGI+KI T L+ +S Q+L+DCD + S C GG+ QYV
Sbjct: 20 NPCGSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRR--SHGCKGGYQTGSIQYVAD 77
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G++TE++YP C+
Sbjct: 78 NGGVHTEKEYPYEKKQGKCRA 98
>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
Length = 330
Score = 75.1 bits (183), Expect = 1e-13, Method: Composition-based stats.
Identities = 32/77 (41%), Positives = 47/77 (61%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N+G
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKG 196
Query: 65 INTERDYPNVGVMDNCK 81
I++E YP C+
Sbjct: 197 IDSEASYPYKATDQKCQ 213
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 75.1 bits (183), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 34/68 (50%), Positives = 49/68 (72%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+E I+ IVT NL+ +S Q+LVDCD + + C GG ++ +++VI+N G
Sbjct: 40 GSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-RSYNEGCDGGLMDYAFEFVIKNGG 98
Query: 65 INTERDYP 72
I+TE DYP
Sbjct: 99 IDTEEDYP 106
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 75.1 bits (183), Expect = 1e-13, Method: Composition-based stats.
Identities = 32/68 (47%), Positives = 45/68 (66%), Gaps = 1/68 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++I T NL +S Q+L+DCD S C GG ++ +QY+I G
Sbjct: 159 GSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNS-GCNGGLMDYAFQYIISTGG 217
Query: 65 INTERDYP 72
++ E DYP
Sbjct: 218 LHKEDDYP 225
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 74.7 bits (182), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 49/80 (61%), Gaps = 2/80 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F + A+EGI++IVT +L+ +S QQLVDC + + C GG+ +QY+I N G
Sbjct: 25 GSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTR--NHGCEGGWPYRAFQYIINNGG 82
Query: 65 INTERDYPNVGVMDNCKVFQ 84
IN+E YP G C +
Sbjct: 83 INSEEHYPYTGTNGTCDTKE 102
>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
Length = 329
Score = 74.3 bits (181), Expect = 2e-13, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
Length = 329
Score = 74.3 bits (181), Expect = 2e-13, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
Length = 329
Score = 74.3 bits (181), Expect = 2e-13, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQEESC 210
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 74.3 bits (181), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 48/68 (70%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EGI++I T L+ +S Q+LVDCD + C GG + ++++++N G
Sbjct: 152 GSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGG 211
Query: 65 INTERDYP 72
I T++DYP
Sbjct: 212 IETDQDYP 219
>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
Length = 330
Score = 74.3 bits (181), Expect = 2e-13, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 138 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENDGCGGGYMTNAFQYVQKNRG 195
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 196 IDSEDAYPYVGQDESC 211
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 73.9 bits (180), Expect = 3e-13, Method: Composition-based stats.
Identities = 34/82 (41%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + A+EGI++I TN LV +S Q+LVDCD +++ C GG ++ ++++ Q G
Sbjct: 148 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGG 206
Query: 65 INTERDYPNVGVMDNCKVFQFN 86
I TE +YP C V + N
Sbjct: 207 ITTEANYPYEAYDGTCDVSKEN 228
>sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus GN=CTSK PE=1 SV=1
Length = 329
Score = 73.9 bits (180), Expect = 3e-13, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 50/76 (65%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ VGA+EG K T L+++S Q LVDC E+ C GG++ +QYV +NRG
Sbjct: 137 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC--VSENYGCGGGYMTNAFQYVQRNRG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210
>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
Length = 331
Score = 73.6 bits (179), Expect = 4e-13, Method: Composition-based stats.
Identities = 32/78 (41%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N+
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNK 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI+++ YP + C+
Sbjct: 197 GIDSDASYPYKAMDQKCQ 214
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 73.6 bits (179), Expect = 4e-13, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 48/78 (61%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE-SRSCVGGFIETIYQYVIQNR 63
G+CW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ T +QY+I N
Sbjct: 137 GACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNN 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 197 GIDSEASYPYKAMNGKCR 214
>sp|P84346|MEX1_JACME Mexicain OS=Jacaratia mexicana PE=1 SV=1
Length = 214
Score = 73.2 bits (178), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 51/81 (62%), Gaps = 3/81 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW F+ V IEGI+KI+T L+ +S Q+L+DC+ + S C GG+ QYV+
Sbjct: 20 NPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYR--SHGCDGGYQTPSLQYVVD 77
Query: 62 NRGINTERDYPNVGVMDNCKV 82
N G++TER+YP C+
Sbjct: 78 N-GVHTEREYPYEKKQGRCRA 97
>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
Length = 208
Score = 72.8 bits (177), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 34/78 (43%), Positives = 49/78 (62%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V +E I++I T NL+ +S Q+LVDCD ++ C+GG YQY+I N G
Sbjct: 23 GSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDK--KNHGCLGGAFVFAYQYIINNGG 80
Query: 65 INTERDYPNVGVMDNCKV 82
I+T+ +YP V C+
Sbjct: 81 IDTQANYPYKAVQGPCQA 98
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 72.4 bits (176), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 33/71 (46%), Positives = 48/71 (67%), Gaps = 2/71 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V A+EGI++IVT +L+ +S QQLVDC + C GG++ +Q+++ N G
Sbjct: 25 GSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA--NHGCRGGWMNPAFQFIVNNGG 82
Query: 65 INTERDYPNVG 75
IN+E YP G
Sbjct: 83 INSEETYPYRG 93
>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
Length = 334
Score = 72.4 bits (176), Expect = 8e-13, Method: Composition-based stats.
Identities = 34/77 (44%), Positives = 46/77 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG T LV +S Q LVDC ++ C GGF+ +QYV +N G
Sbjct: 136 GSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQYVKENGG 195
Query: 65 INTERDYPNVGVMDNCK 81
+++E YP V V + CK
Sbjct: 196 LDSEESYPYVAVDEICK 212
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 72.4 bits (176), Expect = 8e-13, Method: Composition-based stats.
Identities = 35/77 (45%), Positives = 48/77 (62%), Gaps = 3/77 (3%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ + +EGI+KIVT NL+++S Q+LVDCD S C GG+ T QYV N G
Sbjct: 157 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH--SYGCKGGYQTTSLQYV-ANNG 213
Query: 65 INTERDYPNVGVMDNCK 81
++T + YP C+
Sbjct: 214 VHTSKVYPYQAKQYKCR 230
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 71.6 bits (174), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 33/84 (39%), Positives = 47/84 (55%), Gaps = 3/84 (3%)
Query: 2 HPLGSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQ 61
+P GSCW FA + +EGI KI T LV +S Q+++DC S C GG++ Y ++I
Sbjct: 142 NPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDC---AVSYGCKGGWVNKAYDFIIS 198
Query: 62 NRGINTERDYPNVGVMDNCKVFQF 85
N G+ TE +YP + C F
Sbjct: 199 NNGVTTEENYPYLAYQGTCNANSF 222
>sp|P55097|CATK_MOUSE Cathepsin K OS=Mus musculus GN=Ctsk PE=2 SV=2
Length = 329
Score = 71.6 bits (174), Expect = 1e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG K T L+ +S Q LVDC E+ C GG++ T +QYV QN G
Sbjct: 137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVT--ENYGCGGGYMTTAFQYVQQNGG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210
>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
Length = 329
Score = 71.6 bits (174), Expect = 1e-12, Method: Composition-based stats.
Identities = 36/76 (47%), Positives = 48/76 (63%), Gaps = 2/76 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG K T L+ +S Q LVDC E+ C GG++ T +QYV QN G
Sbjct: 137 GSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDC--VSENYGCGGGYMTTAFQYVQQNGG 194
Query: 65 INTERDYPNVGVMDNC 80
I++E YP VG ++C
Sbjct: 195 IDSEDAYPYVGQDESC 210
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 71.2 bits (173), Expect = 2e-12, Method: Composition-based stats.
Identities = 33/78 (42%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDN-QGESRSCVGGFIETIYQYVIQNR 63
GSCW F+ VGA+E K+ T LV +S Q LVDC + ++ C GGF+ +QY+I N
Sbjct: 137 GSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNN 196
Query: 64 GINTERDYPNVGVMDNCK 81
GI++E YP + C+
Sbjct: 197 GIDSEASYPYKAMDGKCQ 214
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 70.9 bits (172), Expect = 2e-12, Method: Composition-based stats.
Identities = 30/78 (38%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSC+ F+ G++EG++ I T LV +S Q ++DC + + C GG + ++Y+I+N G
Sbjct: 143 GSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNG 202
Query: 65 INTERDYP-NVGVMDNCK 81
+N+E YP + V D CK
Sbjct: 203 LNSEEQYPYEMKVNDECK 220
>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
Length = 340
Score = 70.5 bits (171), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 2/78 (2%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGE--SRSCVGGFIETIYQYVIQN 62
G+CW F+ VGA+EG K+ T L+ +S Q LVDC N+ + ++ C GG++ +QY+I N
Sbjct: 145 GACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDN 204
Query: 63 RGINTERDYPNVGVMDNC 80
GI + YP + C
Sbjct: 205 GGIEADASYPYKATDEKC 222
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 70.5 bits (171), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++EGI+ I T +LV +S Q+L+DCD ++ C GG ++ ++Y+ N G
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGG 214
Query: 65 INTERDYPNVGVMDNCKV 82
+ TE YP C V
Sbjct: 215 LITEAAYPYRAARGTCNV 232
>sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium discoideum GN=cprB PE=2 SV=1
Length = 376
Score = 70.5 bits (171), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 41/68 (60%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ G+ EG + T LV +S Q LVDC E+ C GG + + Y+I+N+G
Sbjct: 145 GSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAFDYIIKNKG 204
Query: 65 INTERDYP 72
I+TE YP
Sbjct: 205 IDTESSYP 212
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 70.5 bits (171), Expect = 3e-12, Method: Composition-based stats.
Identities = 31/78 (39%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ V ++EGI+ I T +LV +S Q+L+DCD ++ C GG ++ ++Y+ N G
Sbjct: 156 GSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGG 214
Query: 65 INTERDYPNVGVMDNCKV 82
+ TE YP C V
Sbjct: 215 LITEAAYPYRAARGTCNV 232
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 70.1 bits (170), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 37/79 (46%), Positives = 47/79 (59%), Gaps = 3/79 (3%)
Query: 6 SCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRGI 65
SCW F+ V +EGI+KI T NLV++S Q+LVDCD Q S C G+ T QYV QN GI
Sbjct: 156 SCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ--SYGCNRGYQSTSLQYVAQN-GI 212
Query: 66 NTERDYPNVGVMDNCKVFQ 84
+ YP + C+ Q
Sbjct: 213 HLRAKYPYIAKQQTCRANQ 231
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 69.3 bits (168), Expect = 6e-12, Method: Composition-based stats.
Identities = 31/76 (40%), Positives = 45/76 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG LV +S Q LVDC + + C GG ++ ++Y+ N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 65 INTERDYPNVGVMDNC 80
I+TE+ YP G+ D+C
Sbjct: 204 IDTEKSYPYEGIDDSC 219
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 68.6 bits (166), Expect = 1e-11, Method: Composition-based stats.
Identities = 30/76 (39%), Positives = 45/76 (59%)
Query: 5 GSCWIFAVVGAIEGISKIVTNNLVDISTQQLVDCDNQGESRSCVGGFIETIYQYVIQNRG 64
GSCW F+ GA+EG + LV +S Q LVDC + + C GG ++ ++Y+ N G
Sbjct: 176 GSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 235
Query: 65 INTERDYPNVGVMDNC 80
I+TE+ YP + D+C
Sbjct: 236 IDTEKSYPYEAIDDSC 251
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.323 0.140 0.458
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 33,496,859
Number of Sequences: 539616
Number of extensions: 1153627
Number of successful extensions: 2457
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 162
Number of HSP's successfully gapped in prelim test: 26
Number of HSP's that attempted gapping in prelim test: 2195
Number of HSP's gapped (non-prelim): 189
length of query: 88
length of database: 191,569,459
effective HSP length: 58
effective length of query: 30
effective length of database: 160,271,731
effective search space: 4808151930
effective search space used: 4808151930
T: 11
A: 40
X1: 16 ( 7.5 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.9 bits)
S2: 55 (25.8 bits)