BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 047264
(149 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 133 bits (335), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 67/130 (51%), Positives = 92/130 (70%), Gaps = 15/130 (11%)
Query: 20 LLRLSLCVSLLMIICIRCSSSRTLQGHEYDGEFSIVGYSPEELTSTDKLVELFESWMLKH 79
L + SL V++ + C+ +R +FSIVGY+PE LT+TDKL+ELFESWM +H
Sbjct: 8 LSKFSLLVAISASALLCCAFAR---------DFSIVGYTPEHLTNTDKLLELFESWMSEH 58
Query: 80 GKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEFKNKYLTGL 139
K+Y+S EEK+HR E+F++NL HID RN E+ +SYWLGLNEF+D++HEEFK +YL
Sbjct: 59 SKAYKSVEEKVHRFEVFRENLMHIDQRNNEI---NSYWLGLNEFADLTHEEFKGRYLGLA 115
Query: 140 KPDDDEFRRR 149
KP +F R+
Sbjct: 116 KP---QFSRK 122
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 123 bits (308), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 59/102 (57%), Positives = 81/102 (79%), Gaps = 9/102 (8%)
Query: 51 EFSIVGYSPEELTSTDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNREL 110
+FSIVGYS ++LTST++L++LF SWMLKH K+Y++ +EKL+R EIFKDNLK+ID RN+
Sbjct: 27 DFSIVGYSQDDLTSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNK-- 84
Query: 111 QITSSYWLGLNEFSDMSHEEFKNKYLTGL------KPDDDEF 146
+ + YWLGLNEFSD+S++EFK KY+ L +P D+EF
Sbjct: 85 -MINGYWLGLNEFSDLSNDEFKEKYVGSLPEDYTNQPYDEEF 125
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 120 bits (300), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 62/126 (49%), Positives = 87/126 (69%), Gaps = 18/126 (14%)
Query: 10 MKMLSSNCKLLLRLSLCVSLLMIICIRCSSSRTLQGHEYDGEFSIVGYSPEELTSTDKLV 69
M M+ S KLL +++C+ + M + G+FSIVGYS +LTST++L+
Sbjct: 1 MAMIPSISKLLF-VAICLFVYMGLSF--------------GDFSIVGYSQNDLTSTERLI 45
Query: 70 ELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHE 129
+LFESWMLKH K Y++ +EK++R EIFKDNLK+ID N++ +SYWLGLN F+DMS++
Sbjct: 46 QLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKK---NNSYWLGLNVFADMSND 102
Query: 130 EFKNKY 135
EFK KY
Sbjct: 103 EFKEKY 108
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 118 bits (295), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 61/130 (46%), Positives = 87/130 (66%), Gaps = 18/130 (13%)
Query: 10 MKMLSSNCKLLLRLSLCVSLLMIICIRCSSSRTLQGHEYDGEFSIVGYSPEELTSTDKLV 69
M M+ S KLL +++C+ + M + G+FSIVGYS ++LTST++L+
Sbjct: 1 MAMIPSISKLLF-VAICLFVHMSVSF--------------GDFSIVGYSQDDLTSTERLI 45
Query: 70 ELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHE 129
+LF SWML H K YE+ +EKL+R EIFKDNL +ID N++ +SYWLGLNEF+D+S++
Sbjct: 46 QLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK---NNSYWLGLNEFADLSND 102
Query: 130 EFKNKYLTGL 139
EF KY+ L
Sbjct: 103 EFNEKYVGSL 112
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 116 bits (290), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 58/92 (63%), Positives = 72/92 (78%), Gaps = 4/92 (4%)
Query: 51 EFSIVGYSPEELTSTDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNREL 110
++SIVGYSPE+L S DKL+ELFE+W+ K+YE+ EEK R E+FKDNLKHID N++
Sbjct: 30 DYSIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG 89
Query: 111 QITSSYWLGLNEFSDMSHEEFKNKYLTGLKPD 142
+ SYWLGLNEF+D+SHEEFK YL GLK D
Sbjct: 90 K---SYWLGLNEFADLSHEEFKKMYL-GLKTD 117
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 108 bits (270), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 63/143 (44%), Positives = 93/143 (65%), Gaps = 25/143 (17%)
Query: 10 MKMLSSNCKLLLRLSLCVSLLMIICIRCSSSRTLQGHEYDGEFSIVGYSPEELTSTDKLV 69
M +SS K++ L+ C +II + SS+ +F VGYS ++LTS ++L+
Sbjct: 1 MATMSSISKIIF-LATC----LIIHMGLSSA----------DFYTVGYSQDDLTSIERLI 45
Query: 70 ELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHE 129
+LF+SWMLKH K YES +EK++R EIF+DNL +ID N++ +SYWLGLN F+D+S++
Sbjct: 46 QLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK---NNSYWLGLNGFADLSND 102
Query: 130 EFKNKYL-------TGLKPDDDE 145
EFK KY+ TGL+ D+E
Sbjct: 103 EFKKKYVGFVAEDFTGLEHFDNE 125
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 68.2 bits (165), Expect = 2e-11, Method: Composition-based stats.
Identities = 40/106 (37%), Positives = 63/106 (59%), Gaps = 12/106 (11%)
Query: 51 EFSIVGYSPEELTSTD------KLVELFESWMLKHGK--SYESTEEKLHRLEIFKDNLKH 102
+ SI+ Y + ST +++ ++E+W++KHGK S S EK R EIFKDNL+
Sbjct: 23 DMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRF 82
Query: 103 IDARNRELQITSSYWLGLNEFSDMSHEEFKNKYLTGLKPDDDEFRR 148
+D N + SY LGL F+D++++E+++KYL G K + RR
Sbjct: 83 VDEHNEK---NLSYRLGLTRFADLTNDEYRSKYL-GAKMEKKGERR 124
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 67.8 bits (164), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 50/73 (68%), Gaps = 2/73 (2%)
Query: 64 STDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEF 123
+ D++ ++ESW++K+GKSY S E R EIFK+ L+ ID N + SY +GLN+F
Sbjct: 34 TNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTN--RSYKVGLNQF 91
Query: 124 SDMSHEEFKNKYL 136
+D++ EEF++ YL
Sbjct: 92 ADLTDEEFRSTYL 104
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 67.8 bits (164), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 50/73 (68%), Gaps = 2/73 (2%)
Query: 64 STDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEF 123
+ D++ ++ESW++K+GKSY S E R EIFK+ L+ ID N + SY +GLN+F
Sbjct: 34 TNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTN--RSYKVGLNQF 91
Query: 124 SDMSHEEFKNKYL 136
+D++ EEF++ YL
Sbjct: 92 ADLTDEEFRSTYL 104
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 67.8 bits (164), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 53/75 (70%), Gaps = 3/75 (4%)
Query: 65 TDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFS 124
+D +++ FE WM ++G+ Y+ +EK+ R +IFK+N+ HI+ N + +SY LG+N+F+
Sbjct: 30 SDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNN--RNGNSYTLGINQFT 87
Query: 125 DMSHEEFKNKYLTGL 139
DM++ EF +Y TGL
Sbjct: 88 DMTNNEFVAQY-TGL 101
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 67.4 bits (163), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 31/74 (41%), Positives = 52/74 (70%), Gaps = 3/74 (4%)
Query: 66 DKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSD 125
D +++ FE WM ++G+ Y+ +EK+ R +IFK+N+KHI+ N + +SY LG+N+F+D
Sbjct: 31 DPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNE--NSYTLGINQFTD 88
Query: 126 MSHEEFKNKYLTGL 139
M+ EF +Y TG+
Sbjct: 89 MTKSEFVAQY-TGV 101
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 67.0 bits (162), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/61 (55%), Positives = 42/61 (68%), Gaps = 3/61 (4%)
Query: 71 LFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEE 130
+FESWM+KHGK Y+S EK RL IF+DNL+ I RN E SY LGLN F+D+S E
Sbjct: 55 MFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAE---NLSYRLGLNRFADLSLHE 111
Query: 131 F 131
+
Sbjct: 112 Y 112
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 66.2 bits (160), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 34/62 (54%), Positives = 42/62 (67%), Gaps = 3/62 (4%)
Query: 71 LFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEE 130
+FESWM+KHGK Y S EK RL IF+DNL+ I+ RN E SY LGL F+D+S E
Sbjct: 48 IFESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAE---NLSYRLGLTGFADLSLHE 104
Query: 131 FK 132
+K
Sbjct: 105 YK 106
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 63.2 bits (152), Expect = 6e-10, Method: Composition-based stats.
Identities = 34/91 (37%), Positives = 53/91 (58%), Gaps = 5/91 (5%)
Query: 51 EFSIVGYSPEELTSTDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNREL 110
+ SIV Y S ++ L+ W +HGKSY + E+ R F+DNL++ID N
Sbjct: 22 DMSIVSYGER---SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAA 78
Query: 111 QI-TSSYWLGLNEFSDMSHEEFKNKYLTGLK 140
S+ LGLN F+D+++EE+++ YL GL+
Sbjct: 79 DAGVHSFRLGLNRFADLTNEEYRDTYL-GLR 108
>sp|Q3T0I2|CATH_BOVIN Pro-cathepsin H OS=Bos taurus GN=CTSH PE=2 SV=1
Length = 335
Score = 62.8 bits (151), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 44/65 (67%), Gaps = 4/65 (6%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
F+SWM++H K Y S+EE HRL+ F NL+ I+A N ++ +GLN+FSDMS +E
Sbjct: 35 FQSWMVQHQKKY-SSEEYYHRLQAFASNLREINAHNAR---NHTFKMGLNQFSDMSFDEL 90
Query: 132 KNKYL 136
K KYL
Sbjct: 91 KRKYL 95
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 62.4 bits (150), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 48/72 (66%), Gaps = 2/72 (2%)
Query: 67 KLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDM 126
+++ ++E W++++GK+Y EK R +IFKDNLK I+ N + SY GLN+FSD+
Sbjct: 36 EVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPN--RSYERGLNKFSDL 93
Query: 127 SHEEFKNKYLTG 138
+ +EF+ YL G
Sbjct: 94 TADEFQASYLGG 105
>sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max PE=1 SV=1
Length = 379
Score = 61.6 bits (148), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 49/80 (61%)
Query: 61 ELTSTDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGL 120
+ T+ ++ LF+ W +HG+ Y + EE+ RLEIFK+N +I N + S+ LGL
Sbjct: 33 KFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKSPHSHRLGL 92
Query: 121 NEFSDMSHEEFKNKYLTGLK 140
N+F+D++ +EF KYL K
Sbjct: 93 NKFADITPQEFSKKYLQAPK 112
>sp|O46427|CATH_PIG Pro-cathepsin H OS=Sus scrofa GN=CTSH PE=1 SV=1
Length = 335
Score = 58.9 bits (141), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 52/81 (64%), Gaps = 6/81 (7%)
Query: 56 GYSPEELTSTDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSS 115
G S ++S +KL F+SWM++H K Y S EE HRL++F N + I+A N +
Sbjct: 21 GASNLAVSSFEKL--HFKSWMVQHQKKY-SLEEYHHRLQVFVSNWRKINAHNAG---NHT 74
Query: 116 YWLGLNEFSDMSHEEFKNKYL 136
+ LGLN+FSDMS +E ++KYL
Sbjct: 75 FKLGLNQFSDMSFDEIRHKYL 95
>sp|P00786|CATH_RAT Pro-cathepsin H OS=Rattus norvegicus GN=Ctsh PE=1 SV=1
Length = 333
Score = 58.5 bits (140), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 43/65 (66%), Gaps = 4/65 (6%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
F SWM +H K+Y S+ E HRL++F +N + I A N Q ++ +GLN+FSDMS E
Sbjct: 33 FTSWMKQHQKTY-SSREYSHRLQVFANNWRKIQAHN---QRNHTFKMGLNQFSDMSFAEI 88
Query: 132 KNKYL 136
K+KYL
Sbjct: 89 KHKYL 93
>sp|P09668|CATH_HUMAN Pro-cathepsin H OS=Homo sapiens GN=CTSH PE=1 SV=4
Length = 335
Score = 58.2 bits (139), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 42/65 (64%), Gaps = 4/65 (6%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
F+SWM KH K+Y STEE HRL+ F N + I+A N ++ + LN+FSDMS E
Sbjct: 35 FKSWMSKHRKTY-STEEYHHRLQTFASNWRKINAHNNG---NHTFKMALNQFSDMSFAEI 90
Query: 132 KNKYL 136
K+KYL
Sbjct: 91 KHKYL 95
>sp|A0E358|CATL2_PARTE Cathepsin L 2 OS=Paramecium tetraurelia GN=GSPATT00022898001 PE=3
SV=2
Length = 314
Score = 56.2 bits (134), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 49/78 (62%), Gaps = 1/78 (1%)
Query: 60 EELTSTDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLG 119
+E++ L+ +W +K+ + Y S ++++R ++F DNL +I A + +++Y L
Sbjct: 14 QEVSDEIDTANLYANWKMKYNRRYTSQRDEMYRFKVFSDNLNYIRAFQDSTE-SATYTLE 72
Query: 120 LNEFSDMSHEEFKNKYLT 137
LN+F+DMS +EF + YL+
Sbjct: 73 LNQFADMSQQEFASTYLS 90
>sp|P49935|CATH_MOUSE Pro-cathepsin H OS=Mus musculus GN=Ctsh PE=2 SV=2
Length = 333
Score = 55.1 bits (131), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 42/65 (64%), Gaps = 4/65 (6%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
F+SWM +H K+Y S E HRL++F +N + I A N Q ++ + LN+FSDMS E
Sbjct: 33 FKSWMKQHQKTYSSVEYN-HRLQMFANNWRKIQAHN---QRNHTFKMALNQFSDMSFAEI 88
Query: 132 KNKYL 136
K+K+L
Sbjct: 89 KHKFL 93
>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
Length = 356
Score = 53.9 bits (128), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 40/61 (65%), Gaps = 3/61 (4%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
F + ++H K Y+S EE R EIF DNLK I + NR+ SY LG+NEF+D++ +EF
Sbjct: 57 FARFAIRHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRK---GLSYKLGINEFTDLTWDEF 113
Query: 132 K 132
+
Sbjct: 114 R 114
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 53.5 bits (127), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 42/66 (63%), Gaps = 2/66 (3%)
Query: 71 LFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEE 130
++E W++++ K+Y EK R +IFKDNLK +D N T + +GL F+D+++EE
Sbjct: 43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRT--FEVGLTRFADLTNEE 100
Query: 131 FKNKYL 136
F+ YL
Sbjct: 101 FRAIYL 106
>sp|Q24940|CATLL_FASHE Cathepsin L-like proteinase OS=Fasciola hepatica GN=Cat-1 PE=1 SV=1
Length = 326
Score = 53.1 bits (126), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 44/70 (62%), Gaps = 4/70 (5%)
Query: 70 ELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARN--RELQITSSYWLGLNEFSDMS 127
+L+ W + K Y +++ HR I++ N+KHI N +L + + Y LGLN+F+DM+
Sbjct: 19 DLWHQWKRMYNKEYNGADDQ-HRRNIWEKNVKHIQEHNLRHDLGLVT-YTLGLNQFTDMT 76
Query: 128 HEEFKNKYLT 137
EEFK KYLT
Sbjct: 77 FEEFKAKYLT 86
>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Drosophila melanogaster
GN=CG12163 PE=2 SV=2
Length = 614
Score = 52.8 bits (125), Expect = 7e-07, Method: Composition-based stats.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 4/79 (5%)
Query: 66 DKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSD 125
DK+ LF + ++ G+ Y ST E+ RL IF+ NLK I+ N ++ Y G+ EF+D
Sbjct: 302 DKVDHLFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKY--GITEFAD 359
Query: 126 MSHEEFKNKYLTGLKPDDD 144
M+ E+K + TGL D+
Sbjct: 360 MTSSEYKER--TGLWQRDE 376
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 52.8 bits (125), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 31/71 (43%), Positives = 42/71 (59%), Gaps = 5/71 (7%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
F WM+ H KSY S EE R IFK N+ ++ N + T LGLN F+D+++EE+
Sbjct: 30 FTDWMITHQKSYTS-EEFGARYNIFKANMDYVQQWNSKGSETV---LGLNNFADITNEEY 85
Query: 132 KNKYLTGLKPD 142
+N YL G K D
Sbjct: 86 RNTYL-GTKFD 95
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 52.4 bits (124), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 59/118 (50%), Gaps = 22/118 (18%)
Query: 18 KLLLRLSLCVSLLMIICIRCSSSRTLQGHEYDGEFSIVGYSPEELTSTDKLVELFESWML 77
K + L+LC +LM++ +++ L H D E S + L EL+E W
Sbjct: 2 KRFIVLALC--MLMVL----ETTKGLDFHNKDVE------------SENSLWELYERWRS 43
Query: 78 KHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEFKNKY 135
H + S EEK R +FK N+KHI N++ SY L LN+F DM+ EEF+ Y
Sbjct: 44 HHTVA-RSLEEKAKRFNVFKHNVKHIHETNKK---DKSYKLKLNKFGDMTSEEFRRTY 97
>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
Length = 323
Score = 51.6 bits (122), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 26/60 (43%), Positives = 39/60 (65%), Gaps = 1/60 (1%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARN-RELQITSSYWLGLNEFSDMSHEE 130
+E++ K GK Y ++EE+ HR+ +F D LK I N R + +YWL +N FSD++HEE
Sbjct: 20 WENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHEE 79
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 51.2 bits (121), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 28/81 (34%), Positives = 49/81 (60%), Gaps = 3/81 (3%)
Query: 55 VGYSPEELTSTDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITS 114
+ ++ ++L S D L L+E W H + + +EK R +FK+N+K I N++ +
Sbjct: 23 IPFTEKDLASEDSLWNLYEKWRTHHTVARD-LDEKNRRFNVFKENVKFIHEFNQKKD--A 79
Query: 115 SYWLGLNEFSDMSHEEFKNKY 135
Y L LN+F DM+++EF++KY
Sbjct: 80 PYKLALNKFGDMTNQEFRSKY 100
>sp|Q94714|CATL1_PARTE Cathepsin L 1 OS=Paramecium tetraurelia GN=GSPATT00020990001 PE=1
SV=1
Length = 314
Score = 50.8 bits (120), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 47/78 (60%), Gaps = 1/78 (1%)
Query: 60 EELTSTDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLG 119
+E++ L+ +W +K+ + Y + ++++R ++F DNL +I A E +++ L
Sbjct: 14 QEVSDEIDTANLYANWKMKYNRRYTNQRDEMYRYKVFTDNLNYIRAF-YESPEEATFTLE 72
Query: 120 LNEFSDMSHEEFKNKYLT 137
LN+F+DMS +EF YL+
Sbjct: 73 LNQFADMSQQEFAQTYLS 90
>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
Length = 360
Score = 50.8 bits (120), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 39/61 (63%), Gaps = 3/61 (4%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
F + +++GKSYES E R IF ++L+ + + NR+ SY LG+N F+DMS EEF
Sbjct: 59 FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRK---GLSYRLGINRFADMSWEEF 115
Query: 132 K 132
+
Sbjct: 116 R 116
>sp|Q80LP4|CATV_NPVAH Viral cathepsin OS=Adoxophyes honmai nucleopolyhedrovirus GN=VCATH
PE=3 SV=1
Length = 337
Score = 50.4 bits (119), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 60/119 (50%), Gaps = 19/119 (15%)
Query: 27 VSLLMIICIRCSSSRTLQGHEYDGEFSIVGYSPEELTSTDKLVELFESWMLKHGKSYEST 86
++LLMI I +S ++GH +F I FE++++ + K Y T
Sbjct: 1 MTLLMIFTILLVASSQIEGHL---KFDI-----------HDAQHYFETFIINYNKQYPDT 46
Query: 87 EEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEFKNKY--LTGLKPDD 143
+ K +R +IFK NL+ I+ +N+ + S +N+FSD+S E KY LT KP +
Sbjct: 47 KTKNYRFKIFKQNLEDINEKNK---LNDSAIYNINKFSDLSKNELLTKYTGLTSKKPSN 102
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 50.1 bits (118), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 26/61 (42%), Positives = 38/61 (62%), Gaps = 3/61 (4%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
F + +++GKSYES E R IF ++L+ + + NR+ Y LG+N FSDMS EEF
Sbjct: 61 FARFAVRYGKSYESAAEVRRRFRIFSESLEEVRSTNRK---GLPYRLGINRFSDMSWEEF 117
Query: 132 K 132
+
Sbjct: 118 Q 118
>sp|Q9UBX1|CATF_HUMAN Cathepsin F OS=Homo sapiens GN=CTSF PE=1 SV=1
Length = 484
Score = 49.7 bits (117), Expect = 7e-06, Method: Composition-based stats.
Identities = 30/93 (32%), Positives = 54/93 (58%), Gaps = 11/93 (11%)
Query: 53 SIVGYSPEELTSTD---KLVELFESWMLKHGKSYESTEEKLHRLEIFKDNL---KHIDAR 106
S++ E+ S D K+ +F+++++ + ++YES EE RL +F +N+ + I A
Sbjct: 165 SVISLLNEDPLSQDLPVKMASIFKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQAL 224
Query: 107 NRELQITSSYWLGLNEFSDMSHEEFKNKYLTGL 139
+R T+ Y G+ +FSD++ EEF+ YL L
Sbjct: 225 DRG---TAQY--GVTKFSDLTEEEFRTIYLNTL 252
>sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata multicapsid polyhedrosis
virus GN=VCATH PE=3 SV=1
Length = 324
Score = 49.3 bits (116), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 32/73 (43%), Positives = 44/73 (60%), Gaps = 4/73 (5%)
Query: 67 KLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDM 126
K FE ++ K K+Y S EKLHR +IF+ NL+ I +N Q S+ +N+FSD+
Sbjct: 23 KAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKN---QNDSTAQYEINKFSDL 79
Query: 127 SHEEFKNKYLTGL 139
S EE +KY TGL
Sbjct: 80 SKEEAISKY-TGL 91
>sp|Q94715|CATL3_PARTE Putative cathepsin L 3 OS=Paramecium tetraurelia
GN=GSPATT00022199001 PE=2 SV=2
Length = 308
Score = 49.3 bits (116), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 28/65 (43%), Positives = 41/65 (63%), Gaps = 3/65 (4%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
FE W LK+ K Y + EKL+R+EI+ N + I+ N+ +T Y +G N+F +SHEEF
Sbjct: 29 FERWALKNNKFY-TESEKLYRMEIYNSNKRMIEEHNQREDVT--YQMGENQFMTLSHEEF 85
Query: 132 KNKYL 136
+ YL
Sbjct: 86 VDLYL 90
>sp|Q94504|CYSP7_DICDI Cysteine proteinase 7 OS=Dictyostelium discoideum GN=cprG PE=1 SV=1
Length = 460
Score = 48.9 bits (115), Expect = 1e-05, Method: Composition-based stats.
Identities = 26/65 (40%), Positives = 40/65 (61%), Gaps = 4/65 (6%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
F +WM+ H + Y S+EE R IFK N+ +++ N + S LGLN F+D+S+EE+
Sbjct: 30 FTNWMIAHQRHY-SSEEFNGRYNIFKANMDYVNEWNTK---GSETVLGLNVFADISNEEY 85
Query: 132 KNKYL 136
+ YL
Sbjct: 86 RATYL 90
>sp|P12399|CTL2A_MOUSE Protein CTLA-2-alpha OS=Mus musculus GN=Ctla2a PE=2 SV=2
Length = 137
Score = 48.5 bits (114), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 35/122 (28%), Positives = 59/122 (48%), Gaps = 23/122 (18%)
Query: 12 MLSSNCKLLLRLSLCVSLLMIICIRCSSSRTLQGHEYDGEFSIVGYSPEELTSTDKLVEL 71
M+ S C+ L+ L+I+C+ S+ D E+
Sbjct: 1 MMVSICEQKLQ-HFSAVFLLILCLGMMSAAPPPDPSLDNEW------------------- 40
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNREL-QITSSYWLGLNEFSDMSHEE 130
+ W K K+Y EE+ HR ++++N K I+A N + Q +S+++GLN+FSD++ EE
Sbjct: 41 -KEWKTKFAKAYNLNEER-HRRLVWEENKKKIEAHNADYEQGKTSFYMGLNQFSDLTPEE 98
Query: 131 FK 132
FK
Sbjct: 99 FK 100
>sp|Q94503|CYSP6_DICDI Cysteine proteinase 6 OS=Dictyostelium discoideum GN=cprF PE=2 SV=1
Length = 434
Score = 48.5 bits (114), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 46/79 (58%), Gaps = 9/79 (11%)
Query: 63 TSTDKLVEL-----FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYW 117
T+ +L EL F +WM+ H + Y S+EE R IFK N+ +I+ N + T
Sbjct: 16 TAKQQLSELQYRNAFTNWMIAHQRHY-SSEEFNGRFNIFKANMDYINEWNTKGSETV--- 71
Query: 118 LGLNEFSDMSHEEFKNKYL 136
LGLN F+D+++EE++ YL
Sbjct: 72 LGLNVFADITNEEYRATYL 90
>sp|Q9R013|CATF_MOUSE Cathepsin F OS=Mus musculus GN=Ctsf PE=2 SV=1
Length = 462
Score = 47.8 bits (112), Expect = 3e-05, Method: Composition-based stats.
Identities = 28/76 (36%), Positives = 43/76 (56%), Gaps = 8/76 (10%)
Query: 67 KLVELFESWMLKHGKSYESTEEKLHRLEIFKDNL---KHIDARNRELQITSSYWLGLNEF 123
K+ LF+ +M + ++YES EE RL +F N+ + I A +R T+ Y G+ +F
Sbjct: 160 KMAPLFKDFMTTYNRTYESREEAQWRLTVFARNMIRAQKIQALDRG---TAQY--GITKF 214
Query: 124 SDMSHEEFKNKYLTGL 139
SD++ EEF YL L
Sbjct: 215 SDLTEEEFHTIYLNPL 230
>sp|Q9YMP9|CATV_NPVLD Viral cathepsin OS=Lymantria dispar multicapsid nuclear
polyhedrosis virus GN=VCATH PE=3 SV=1
Length = 356
Score = 47.8 bits (112), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 40/70 (57%), Gaps = 1/70 (1%)
Query: 70 ELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHE 129
+ FES++ + K+Y S EK R IFKDNL I+A+N + +N+FSD+S
Sbjct: 54 DYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKS 113
Query: 130 EFKNKYLTGL 139
E K+ TGL
Sbjct: 114 ELIAKF-TGL 122
>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1
Length = 371
Score = 47.0 bits (110), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 41/65 (63%), Gaps = 3/65 (4%)
Query: 72 FESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
F S++ + GKSY+ +E +RL +FKDNL+ AR +L + S G+ +FSD++ EF
Sbjct: 48 FLSFVQRFGKSYKDADEHAYRLSVFKDNLRR--ARRHQL-LDPSAEHGVTKFSDLTPAEF 104
Query: 132 KNKYL 136
+ YL
Sbjct: 105 RRTYL 109
>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
SV=1
Length = 368
Score = 47.0 bits (110), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/83 (34%), Positives = 48/83 (57%), Gaps = 7/83 (8%)
Query: 54 IVGYSPEELTSTDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQIT 113
+ G P+ LTS D F + K GK Y S EE +R +FK NL+ R+++L +
Sbjct: 37 VGGAEPQVLTSEDH----FSLFKRKFGKVYASNEEHDYRFSVFKANLRRA-RRHQKLDPS 91
Query: 114 SSYWLGLNEFSDMSHEEFKNKYL 136
+++ G+ +FSD++ EF+ K+L
Sbjct: 92 ATH--GVTQFSDLTRSEFRKKHL 112
>sp|Q54TR1|CFAD_DICDI Counting factor associated protein D OS=Dictyostelium discoideum
GN=cfaD PE=1 SV=1
Length = 531
Score = 46.6 bits (109), Expect = 5e-05, Method: Composition-based stats.
Identities = 31/101 (30%), Positives = 52/101 (51%), Gaps = 7/101 (6%)
Query: 33 ICIRCSSSRTLQGHE--YDGEFSIVGYSPEELTSTDKLVELFESWMLKHGKSYESTEEKL 90
I + C++++ E +DG FS +G L ++ LF+ + ++ K Y S +E
Sbjct: 186 IPVLCNNAKEAPAKENQFDGLFSSIG--DNLLAKEEQASNLFKEYKAQYNKEYSSQDEHD 243
Query: 91 HRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHEEF 131
R FK K I N + SSY LG+N ++D+S++EF
Sbjct: 244 ERFINFKAARKIIATHNAK---ESSYKLGMNHYADLSNKEF 281
>sp|Q9PYY5|CATV_GVXN Viral cathepsin OS=Xestia c-nigrum granulosis virus GN=VCATH PE=3
SV=1
Length = 346
Score = 46.6 bits (109), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 41/71 (57%), Gaps = 4/71 (5%)
Query: 70 ELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDMSHE 129
ELF +++K+ K Y+ +EK R EIFK NL I+ARN + S +N +D+S
Sbjct: 41 ELFNEFVVKYNKVYKDDQEKEARFEIFKQNLADINARN---ALEDSAMFEINSRADISSN 97
Query: 130 EFKNKYLTGLK 140
E K LTGLK
Sbjct: 98 ELLQK-LTGLK 107
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 46.6 bits (109), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 44/70 (62%), Gaps = 5/70 (7%)
Query: 71 LFESWMLKHGKSYESTEEKLH----RLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDM 126
++ W L+HGKS ++ ++ R IFKDNL+ ID N E ++Y LGL F+++
Sbjct: 3 IYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHN-ENNKNATYKLGLTIFANL 61
Query: 127 SHEEFKNKYL 136
+++E+++ YL
Sbjct: 62 TNDEYRSLYL 71
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 46.6 bits (109), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 43/76 (56%), Gaps = 4/76 (5%)
Query: 60 EELTSTDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLG 119
++L S + L +L+E W H S S EK R +FK NL H+ N+ + Y L
Sbjct: 28 KDLASEESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANLMHVHNTNK---MDKPYKLK 83
Query: 120 LNEFSDMSHEEFKNKY 135
LN+F+DM++ EF++ Y
Sbjct: 84 LNKFADMTNHEFRSTY 99
>sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1
Length = 467
Score = 46.2 bits (108), Expect = 6e-05, Method: Composition-based stats.
Identities = 27/89 (30%), Positives = 43/89 (48%), Gaps = 11/89 (12%)
Query: 54 IVGYSPEELTSTDKLVELFESWMLKHGKSYESTEEKLHRLEIFKDNL----KHIDARNRE 109
+V + L + + L F + KHG+ YES E+ RL +F++NL H A
Sbjct: 20 LVPAATASLHAEETLTSQFAEFKQKHGRVYESAAEEAFRLSVFRENLFLARLHAAANPHA 79
Query: 110 LQITSSYWLGLNEFSDMSHEEFKNKYLTG 138
G+ FSD++ EEF+++Y G
Sbjct: 80 T-------FGVTPFSDLTREEFRSRYHNG 101
>sp|Q6VTL7|CATV_NPVCD Viral cathepsin OS=Choristoneura fumiferana defective polyhedrosis
virus GN=Vcath PE=3 SV=1
Length = 324
Score = 46.2 bits (108), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 45/73 (61%), Gaps = 4/73 (5%)
Query: 67 KLVELFESWMLKHGKSYESTEEKLHRLEIFKDNLKHIDARNRELQITSSYWLGLNEFSDM 126
K FE ++ K+Y S EKLHR +IF+ NL+ I N+ L TS+ + +N+FSD+
Sbjct: 23 KAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEI--INKNLNDTSAQY-EINKFSDL 79
Query: 127 SHEEFKNKYLTGL 139
S +E +KY TGL
Sbjct: 80 SKDETISKY-TGL 91
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.134 0.392
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 55,603,662
Number of Sequences: 539616
Number of extensions: 2122819
Number of successful extensions: 6332
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 88
Number of HSP's successfully gapped in prelim test: 80
Number of HSP's that attempted gapping in prelim test: 6143
Number of HSP's gapped (non-prelim): 171
length of query: 149
length of database: 191,569,459
effective HSP length: 107
effective length of query: 42
effective length of database: 133,830,547
effective search space: 5620882974
effective search space used: 5620882974
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 55 (25.8 bits)
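
Note on the statistics above: the effective lengths and the bit values in parentheses can be rechecked from the other numbers in this footer. The sketch below is only an illustration, not part of the BLAST output. It assumes the standard Karlin-Altschul relations (effective length = length minus effective HSP length, bit score = (lambda * S - ln K) / ln 2, drop-off in bits = lambda * X / ln 2) and assumes that X1 and S1 use the first (ungapped) Lambda/K pair while X2, X3 and S2 use the second (gapped) pair. The function names are hypothetical, and the rounded Lambda and K printed in the report limit the precision.

```python
import math

# Values copied from the statistics block of this report.
QUERY_LEN = 149
DB_LETTERS = 191_569_459
DB_SEQS = 539_616
HSP_LEN = 107                                  # "effective HSP length"
LAMBDA_UNGAPPED, K_UNGAPPED = 0.318, 0.134     # first Lambda/K pair
LAMBDA_GAPPED, K_GAPPED = 0.267, 0.0410        # second Lambda/K pair (assumed gapped)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective database length, search space)."""
    eff_query = query_len - hsp_len
    # Every database sequence is shortened by the effective HSP length.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw score to bits: (lambda * S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X drop-off value to bits: lambda * X / ln 2."""
    return lam * raw_dropoff / math.log(2)


eff_query, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN
)
print("effective length of query:   ", eff_query)
print("effective length of database:", eff_db)
print("effective search space:      ", space)
print("S1 bits:", round(bit_score(41, LAMBDA_UNGAPPED, K_UNGAPPED), 1))
print("S2 bits:", round(bit_score(55, LAMBDA_GAPPED, K_GAPPED), 1))
print("X1 bits:", round(dropoff_bits(16, LAMBDA_UNGAPPED), 1))
print("X2 bits:", round(dropoff_bits(38, LAMBDA_GAPPED), 1))
print("X3 bits:", round(dropoff_bits(64, LAMBDA_GAPPED), 1))
```

The integer results should match the "effective length" and "effective search space" lines exactly. The bit values should agree with the figures in parentheses to within rounding. Per-hit bit scores in the alignments are not guaranteed to reproduce from these rounded parameters.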