BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy12185
(317 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P43234|CATO_HUMAN Cathepsin O OS=Homo sapiens GN=CTSO PE=2 SV=1
Length = 321
Score = 167 bits (422), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 99/258 (38%), Positives = 139/258 (53%), Gaps = 21/258 (8%)
Query: 60 FEKSLDIIEELNKNRQSPES-ARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDH 118
F +SL+ LN S S A YGI +FS L EEFK +LR +K S H
Sbjct: 44 FRESLNRHRYLNSLFPSENSTAFYGINQFSYLFPEEFKAIYLRSKPSKFPRYSAEVH--- 100
Query: 119 HHNHVKKRSITTGITIP-TGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHAL 177
++IP +P++ DWR+ ++ +VRNQQ CG CWAFS V ES +A+
Sbjct: 101 -------------MSIPNVSLPLRFDWRDKQVVTQVRNQQMCGGCWAFSVVGAVESAYAI 147
Query: 178 KNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
K L LSVQ+VIDC+ N N GC+GG L+W++ +V L +SEYP ++ C
Sbjct: 148 KGKPLEDLSVQQVIDCSYN-NYGCNGGSTLNALNWLNKMQVKLVKDSEYPFKAQNGLCHY 206
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
+ S +G IK Y+ E + + T GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 207 FSGSHSGFSIKGYSAYDFSDQEDEMAKALLTFGPLVVIVDAVSWQDYLGGIIQHHC--SS 264
Query: 298 ANINHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 265 GEANHAVLITGFDKTGST 282
>sp|Q8BM88|CATO_MOUSE Cathepsin O OS=Mus musculus GN=Ctso PE=2 SV=1
Length = 312
Score = 150 bits (379), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 86/256 (33%), Positives = 130/256 (50%), Gaps = 18/256 (7%)
Query: 60 FEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHH 119
+SL LN +A YG+ +FS L EEFK +L +K+ +
Sbjct: 36 LRESLHRHRYLNSFPHENSTAFYGVNQFSYLFPEEFKALYLG---SKYAWAPRYPAEG-- 90
Query: 120 HNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKN 179
+R I +P++ DWR+ ++ VRNQ+ CG CWAFS V ES A++
Sbjct: 91 -----QRPIPN-----VSLPLRFDWRDKHVVNPVRNQEMCGGCWAFSVVSAIESARAIQG 140
Query: 180 GTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
+L LSVQ+VIDC+ N N GC GG L W++ ++ L +S+YP + C+
Sbjct: 141 KSLDYLSVQQVIDCSFN-NSGCLGGSPLCALRWLNETQLKLVADSQYPFKAVNGQCRHFP 199
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLAN 299
S GV +K ++ E + + + GP++ V+A++WQ YLGG+IQ++C S
Sbjct: 200 QSQAGVSVKDFSAYNFRGQEDEMARALLSFGPLVVIVDAMSWQDYLGGIIQHHC--SSGE 257
Query: 300 INHAVQIVGYDNYSRT 315
NHAV I G+D T
Sbjct: 258 ANHAVLITGFDRTGNT 273
>sp|Q54TR1|CFAD_DICDI Counting factor associated protein D OS=Dictyostelium discoideum
GN=cfaD PE=1 SV=1
Length = 531
Score = 143 bits (361), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 96/290 (33%), Positives = 143/290 (49%), Gaps = 31/290 (10%)
Query: 31 EQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
EQ LF ++ +Y K YS + EHD RF NF+ + II N S + G+ ++D
Sbjct: 219 EQASNLFKEYKAQYNKEYSSQDEHDERFINFKAARKIIATHNAKE---SSYKLGMNHYAD 275
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
LS +EF T V V D H+ RSI P DWR
Sbjct: 276 LSNKEFNTL-----VKPKVARPSVTGADSVHDDESLRSI----------PSTVDWRNQNC 320
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCA 208
+ V++Q CG+CW F + + E + + NG L LS Q+++DCA G+ GC GG +
Sbjct: 321 VTPVKDQGICGSCWTFGSTGSLEGTNCVTNGELVSLSEQQLVDCAILTGSQGCGGGFASS 380
Query: 209 LLDW-MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
+ M++ L ES YP L+++ C+ + +P+GV I Y + SES++ IA
Sbjct: 381 AFQYVMEIGS--LATESNYPYLMQNGLCRDRTVTPSGVSITGYV-NVTSGSESALQNAIA 437
Query: 268 THGPVIAAVNALT--WQYYLGGVIQYN---CDGSLANINHAVQIVGYDNY 312
T GPV A++A ++YY+ GV YN C L +++H V +GY Y
Sbjct: 438 TTGPVAIAIDASVDDFRYYMSGV--YNNPACKNGLDDLDHEVLAIGYGTY 485
>sp|Q9R014|CATJ_MOUSE Cathepsin J OS=Mus musculus GN=Ctsj PE=2 SV=2
Length = 334
Score = 134 bits (336), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 99/304 (32%), Positives = 154/304 (50%), Gaps = 30/304 (9%)
Query: 11 VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
V L+ LCF +A + P L+ + + ++ +Y KSYS E +R +E+++ +I+
Sbjct: 5 VLLLILCFGVASGAQAHDPKLDAE---WKDWKTKYAKSYSPKEEALRRAVWEENMRMIKL 61
Query: 70 LNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
NK N + + +F D + EEF R S++ ++ + H NHV
Sbjct: 62 HNKENSLGKNNFTMKMNKFGDQTSEEF-----RKSID-NIPIPAAMTDPHAQNHVS---- 111
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
G+P KDWRE G + VRNQ CG+CWAF+ E K G L+ LSVQ
Sbjct: 112 -------IGLPDYKDWREEGYVTPVRNQGKCGSCWAFAAAGAIEGQMFWKTGNLTPLSVQ 164
Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++DC+ GN GC G +++ NK LE E+ YP KD C+ ++ + + I
Sbjct: 165 NLLDCSKTVGNKGCQSGTAHQAFEYVLKNK-GLEAEATYPYEGKDGPCRYRSENAS-ANI 222
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQ 305
Y L P+E + +A+ GPV AA++A ++++Y GG I Y + S +NHAV
Sbjct: 223 TDYV--NLPPNELYLWVAVASIGPVSAAIDASHDSFRFYNGG-IYYEPNCSSYFVNHAVL 279
Query: 306 IVGY 309
+VGY
Sbjct: 280 VVGY 283
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 133 bits (334), Expect = 2e-30, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 139/287 (48%), Gaps = 31/287 (10%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
N ++ LELF S+ + K+Y E + RF+ F ++L I++ N S G+ EF
Sbjct: 43 NTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS---YWLGLNEF 99
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DL+ EEFK R+L + + + R IT +P DWR+
Sbjct: 100 ADLTHEEFKGRYL------GLAKPQFSRKRQPSANFRYRDITD-------LPKSVDWRKK 146
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V++Q CG+CWAFSTV E ++ + G LS LS QE+IDC N GC+GG
Sbjct: 147 GAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGG--- 203
Query: 208 ALLDWMD---VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILT 264
L+D+ ++ L E +YP L+++ C+ + V I Y + + ++ L
Sbjct: 204 -LMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGY--EDVPENDDESLV 260
Query: 265 DIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H PV A+ A +Q+Y GGV C +++H V VGY
Sbjct: 261 KALAHQPVSVAIEASGRDFQFYKGGVFNGKCG---TDLDHGVAAVGY 304
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 132 bits (332), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 87/279 (31%), Positives = 142/279 (50%), Gaps = 24/279 (8%)
Query: 34 LELFSSFQQRYKKSYSKSEHD-IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSE 92
+ELF ++ ++K+Y E +RF+ F+ +L I+E NK +S G+ EF+DLS
Sbjct: 48 IELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKS---YWLGLNEFADLSH 104
Query: 93 EEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGK 152
EEFK +L + ++ + + + R + +P DWR+ G + +
Sbjct: 105 EEFKKMYL--GLKTDIV---RRDEERSYAEFAYRDVEA-------VPKSVDWRKKGAVAE 152
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
V+NQ +CG+CWAFSTV E ++ + G L+ LS QE+IDC N GC+GG ++
Sbjct: 153 VKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEY 212
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPV 272
+ V L E +YP +++ C+ + V I + D E S+L +A H P+
Sbjct: 213 I-VKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQ-DVPTNDEKSLLKALA-HQPL 269
Query: 273 IAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
A++A +Q+Y GGV C +++H V VGY
Sbjct: 270 SVAIDASGREFQFYSGGVFDGRCG---VDLDHGVAAVGY 305
>sp|O97397|CATLL_PHACE Cathepsin L-like proteinase OS=Phaedon cochleariae PE=2 SV=1
Length = 324
Score = 131 bits (330), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 95/310 (30%), Positives = 158/310 (50%), Gaps = 33/310 (10%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELN 71
+IAL L + + N EL++ F++ + ++Y S E +RF F+ +L I E N
Sbjct: 4 IIALAALIVVI-----NAASDQELWADFKKTHARTYKSLREEKLRFNIFQDTLRQIAEHN 58
Query: 72 KNRQSPESARY-GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
++ ES Y I +FSD+++EEF+ +++ ++ L ++ +T
Sbjct: 59 VKYENGESTYYLAINKFSDITDEEFRDMLMKNEASRPNL-----------EGLEVADLTV 107
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
G P DWR G++ VRNQ CG+CWA ST ES A+K+G+ LS Q++
Sbjct: 108 GAA-----PESIDWRSKGVVLPVRNQGECGSCWALSTAAAIESQSAIKSGSKVPLSPQQL 162
Query: 191 IDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
+DC+ + GN GC+GG +++ N LE +++YP K+ CK S + V++
Sbjct: 163 VDCSTSYGNHGCNGGFAVNGFEYVKDNG--LESDADYPYSGKEDKCKANDKSRSVVELTG 220
Query: 250 YTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVI-QYNCDGSLANINHAVQIVG 308
Y + SE+S+ + T GP+ A V + Y GG+ +C G N++H V +VG
Sbjct: 221 YK--KVTASETSLKEAVGTIGPISAVVFGKPMKSYGGGIFDDSSCLGD--NLHHGVNVVG 276
Query: 309 Y--DNYSRTW 316
Y +N + W
Sbjct: 277 YGIENGQKYW 286
>sp|P25804|CYSP_PEA Cysteine proteinase 15A OS=Pisum sativum PE=2 SV=1
Length = 363
Score = 131 bits (330), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 91/283 (32%), Positives = 147/283 (51%), Gaps = 34/283 (12%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+SF+ ++ KSY +K EHD RF F+ +L I +L++NR +A +GIT+FSDL+ EF
Sbjct: 48 FTSFKSKFSKSYATKEEHDYRFGVFKSNL-IKAKLHQNRDP--TAEHGITKFSDLTASEF 104
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ + L + K + + H I T +P DWRE G + V++
Sbjct: 105 RRQFL--GLKKRLRLPAHAQK-------------APILPTTNLPEDFDWREKGAVTPVKD 149
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFC 207
Q +CG+CWAFST E H L G L LS Q+++DC AG+ + GC+GG
Sbjct: 150 QGSCGSCWAFSTTGALEGAHYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMN 209
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
+++ + V++ E +Y +D +CK S + +++ TL E I ++
Sbjct: 210 NAFEYLLESGGVVQ-EKDYAYTGRDGSCKFD-KSKVVASVSNFSVVTL--DEDQIAANLV 265
Query: 268 THGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
+GP+ A+NA Q Y+ GV Y C + + ++H V +VG+
Sbjct: 266 KNGPLAVAINAAWMQTYMSGVSCPYVC--AKSRLDHGVLLVGF 306
>sp|P04988|CYSP1_DICDI Cysteine proteinase 1 OS=Dictyostelium discoideum GN=cprA PE=1 SV=2
Length = 343
Score = 127 bits (320), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 90/313 (28%), Positives = 152/313 (48%), Gaps = 35/313 (11%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
++ L L + V LE++ + F FQ ++ K YS E+ RF+ F+ +L IEE
Sbjct: 3 VILLFVLAVFTVFVSSRGIPLEEQSQ-FLEFQDKFNKKYSHEEYLERFEIFKSNLGKIEE 61
Query: 70 LNK---NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
LN N ++ ++G+ +F+DLS +EFK +L NK + + +++
Sbjct: 62 LNLIAINHKA--DTKFGVNKFADLSSDEFKNYYLN---NKEAIFTDDLPV---ADYLDDE 113
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
I + IP DWR G + V+NQ CG+CW+FST E H + L LS
Sbjct: 114 FINS-------IPTAFDWRTRGAVTPVKNQGQCGSCWSFSTTGNVEGQHFISQNKLVSLS 166
Query: 187 VQEVIDC---------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKR 237
Q ++DC + GC+GG +++ N + + ES YP +
Sbjct: 167 EQNLVDCDHECMEYEGEQACDEGCNGGLQPNAYNYIIKNGGI-QTESSYPYTAETGTQCN 225
Query: 238 KATSPNGVKIKSYTCDTLIPSESSILTD-IATHGPVIAAVNALTWQYYLGGVIQYNCDGS 296
++ G KI ++ T+IP +++ I + GP+ A +A+ WQ+Y+GGV C+ +
Sbjct: 226 FNSANIGAKISNF---TMIPKNETVMAGYIVSTGPLAIAADAVEWQFYIGGVFDIPCNPN 282
Query: 297 LANINHAVQIVGY 309
+++H + IVGY
Sbjct: 283 --SLDHGILIVGY 293
>sp|Q63088|CATJ_RAT Cathepsin J OS=Rattus norvegicus GN=Ctsj PE=2 SV=2
Length = 334
Score = 127 bits (318), Expect = 1e-28, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 156/305 (51%), Gaps = 32/305 (10%)
Query: 11 VALIALCF-LAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
V L+ LCF +A PNL+ + + ++ +Y KSYS E +++ +E++L +I+
Sbjct: 5 VFLVILCFGVASGAPARDPNLDAEWQ---DWKTKYAKSYSPVEEELKRAVWEENLKMIQL 61
Query: 70 LNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
NK N + F+D + EEF R S++ ++ + V S
Sbjct: 62 HNKENGLGKNGFTMEMNAFADTTGEEF-----RKSLSDILIPAA----------VTNPSA 106
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
++I G+P KDWR+ G + VRNQ CG+CWAF+ V E K G L+ LSVQ
Sbjct: 107 QKQVSI--GLPNFKDWRKEGYVTPVRNQGKCGSCWAFAAVGAIEGQMFSKTGNLTPLSVQ 164
Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++DC+ GN GC G +++ NK LE E+ YP KD C+ + + + I
Sbjct: 165 NLLDCSKSEGNNGCRWGTAHQAFNYVLKNK-GLEAEATYPYEGKDGPCRYHSENAS-ANI 222
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHAV 304
+ L P+E + +A+ GPV AA++A ++++Y GGV + NC + +NHAV
Sbjct: 223 TGFV--NLPPNELYLWVAVASIGPVSAAIDASHDSFRFYSGGVYHEPNCSSYV--VNHAV 278
Query: 305 QIVGY 309
+VGY
Sbjct: 279 LVVGY 283
>sp|Q9VN93|CPR1_DROME Putative cysteine proteinase CG12163 OS=Drosophila melanogaster
GN=CG12163 PE=2 SV=2
Length = 614
Score = 126 bits (316), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 89/284 (31%), Positives = 142/284 (50%), Gaps = 38/284 (13%)
Query: 36 LFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF FQ R+ + Y S +E +R + F ++L IEELN N SA+YGITEF+D++ E
Sbjct: 307 LFYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMG--SAKYGITEFADMTSSE 364
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT---GIPVKKDWREAGIIG 151
+K R ++ K + + +P +P + DWR+ +
Sbjct: 365 YKERTGLWQRDE-----------------AKATGGSAAVVPAYHGELPKEFDWRQKDAVT 407
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLD 211
+V+NQ +CG+CWAFS E ++A+K G L S QE++DC + C+GG L+D
Sbjct: 408 QVKNQGSCGSCWAFSVTGNIEGLYAVKTGELKEFSEQELLDC-DTTDSACNGG----LMD 462
Query: 212 WMDVNKVV-----LEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
+ K + LE E+EYP K C T + V++ + D +E+++ +
Sbjct: 463 --NAYKAIKDIGGLEYEAEYPYKAKKNQCHFNRTLSH-VQVAGFV-DLPKGNETAMQEWL 518
Query: 267 ATHGPVIAAVNALTWQYYLGGVIQ-YNCDGSLANINHAVQIVGY 309
+GP+ +NA Q+Y GGV + S N++H V +VGY
Sbjct: 519 LANGPISIGINANAMQFYRGGVSHPWKALCSKKNLDHGVLVVGY 562
>sp|Q10716|CYSP1_MAIZE Cysteine proteinase 1 OS=Zea mays GN=CCP1 PE=2 SV=1
Length = 371
Score = 124 bits (312), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 86/284 (30%), Positives = 136/284 (47%), Gaps = 32/284 (11%)
Query: 37 FSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F SF QR+ KSY + EH R F+ D + +++ SA +G+T+FSDL+ EF
Sbjct: 48 FLSFVQRFGKSYKDADEHAYRLSVFK---DNLRRARRHQLLDPSAEHGVTKFSDLTPAEF 104
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVR 154
+ +L ++ L+ H +PT G+P DWR+ G +G V+
Sbjct: 105 RRTYLGLRKSRRALLRELGESAHE-----------APVLPTDGLPDDFDWRDHGAVGPVK 153
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDF 206
NQ +CG+CW+FS E H L G L +LS Q+ +DC + + GC+GG
Sbjct: 154 NQGSCGSCWSFSASGALEGAHYLATGKLEVLSEQQFVDCDHECDSSEPDSCDSGCNGGLM 213
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
++ LE E +YP D CK S +++++ ++ E+ I ++
Sbjct: 214 TTAFSYLQ-KAGGLESEKDYPYTGSDGKCKFD-KSKIVASVQNFSVVSV--DEAQISANL 269
Query: 267 ATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
HGP+ +NA Q Y+GGV Y C +++H V +VGY
Sbjct: 270 IKHGPLAIGINAAYMQTYIGGVSCPYICG---RHLDHGVLLVGY 310
>sp|P36400|LMCPB_LEIME Cysteine proteinase B OS=Leishmania mexicana GN=LMCPB PE=2 SV=2
Length = 443
Score = 124 bits (310), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 87/280 (31%), Positives = 135/280 (48%), Gaps = 25/280 (8%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V E L L LS Q+++ C + N GC GG DW+
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDA---ACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
N L E YP + + C + G +I + + SE ++ +A +G
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSELVVGAQIDGHV--LIGSSEKAMAAWLAKNG 259
Query: 271 PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
P+ A++A ++ Y GV+ C G +NH V +VGYD
Sbjct: 260 PIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 296
>sp|Q05094|CYSP2_LEIPI Cysteine proteinase 2 OS=Leishmania pifanoi GN=CYS2 PE=1 SV=1
Length = 444
Score = 122 bits (307), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 87/281 (30%), Positives = 135/281 (48%), Gaps = 26/281 (9%)
Query: 36 LFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEE 94
LF F++ Y ++Y +E R NFE++L+++ E ++P A++GIT+F DLSE E
Sbjct: 37 LFEEFKRTYGRAYETLAEEQQRLANFERNLELMRE--HQARNPH-AQFGITKFFDLSEAE 93
Query: 95 FKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
F R+L N + K H H + ++ +P DWRE G + V+
Sbjct: 94 FAARYL----NGAAYFAAAKRHAAQHYRKARADLSA-------VPDAVDWREKGAVTPVK 142
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMD 214
+Q CG+CWAFS V E L L LS Q+++ C + N GC GG DW+
Sbjct: 143 DQGACGSCWAFSAVGNIEGQWYLAGHELVSLSEQQLVSC-DDMNDGCDGGLMLQAFDWLL 201
Query: 215 VNKVV-LEPESEYPLLLKDAACKRKATSPN----GVKIKSYTCDTLIPSESSILTDIATH 269
N L E YP + + + S G +I + + SE ++ +A +
Sbjct: 202 QNTNGHLHTEDSYPYVSGNGYVPECSNSSEELVVGAQIDGHV--LIGSSEKAMAAWLAKN 259
Query: 270 GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYD 310
GP+ A++A ++ Y GV+ C G +NH V +VGYD
Sbjct: 260 GPIAIALDASSFMSYKSGVLT-ACIGK--QLNHGVLLVGYD 297
>sp|P43295|A494_ARATH Probable cysteine proteinase A494 OS=Arabidopsis thaliana
GN=At2g21430 PE=2 SV=2
Length = 361
Score = 122 bits (307), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 92/325 (28%), Positives = 158/325 (48%), Gaps = 42/325 (12%)
Query: 1 MFDVKNVLFIVALIALC-----FLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHD 54
+F V +++F+ +++C + V ++P + + F+ F++++ K Y S EH
Sbjct: 8 LFSV-SLIFVFVSVSVCGDEDVLIRQVVDETEPKVLSSEDHFTLFKKKFGKVYGSIEEHY 66
Query: 55 IRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHK 114
RF F+ +L + + + P SAR+G+T+FSDL+ EF+ +HL V
Sbjct: 67 YRFSVFKANL--LRAMRHQKMDP-SARHGVTQFSDLTRSEFRRKHL------GVKGGFKL 117
Query: 115 HHDHHHNHVKKRSITTGITIPT-GIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAES 173
D + + +PT +P + DWR+ G + V+NQ +CG+CW+FST E
Sbjct: 118 PKDANQAPI----------LPTQNLPEEFDWRDRGAVTPVKNQGSCGSCWSFSTTGALEG 167
Query: 174 MHALKNGTLSLLSVQEVIDC--------AGNGNMGCSGGDFCALLDWMDVNKVVLEPESE 225
H L G L LS Q+++DC G+ + GC+GG + ++ + L E +
Sbjct: 168 AHFLATGKLVSLSEQQLVDCDHECDPEEEGSCDSGCNGGLMNSAFEYT-LKTGGLMREKD 226
Query: 226 YPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYL 285
YP D + S + +++ ++ +E I ++ +GP+ A+NA Q Y+
Sbjct: 227 YPYTGTDGGSCKLDRSKIVASVSNFSVVSI--NEDQIAANLIKNGPLAVAINAAYMQTYI 284
Query: 286 GGV-IQYNCDGSLANINHAVQIVGY 309
GGV Y C L NH V +VGY
Sbjct: 285 GGVSCPYICSRRL---NHGVLLVGY 306
>sp|Q8RWQ9|ALEUL_ARATH Thiol protease aleurain-like OS=Arabidopsis thaliana GN=At3g45310
PE=2 SV=1
Length = 358
Score = 119 bits (298), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 90/278 (32%), Positives = 133/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
FS F RY K Y S E +RF F+++LD+I NK S + + +F+DL+ +EF
Sbjct: 59 FSRFTHRYGKKYQSVEEMKLRFSVFKENLDLIRSTNKKGLS---YKLSLNQFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK IT T +P KDWRE GI+ V+
Sbjct: 116 QRYKLGAAQNCSATLKGSHK-----------------ITEAT-VPDTKDWREDGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
Q CG+CW FST E+ + G LS Q+++DCAG N GC GG +++
Sbjct: 158 EQGHCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGTFNNFGCHGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP KD CK A + GV+++ + + + +E + + PV
Sbjct: 218 KYNG-GLDTEEAYPYTGKDGGCKFSAKNI-GVQVRD-SVNITLGAEDELKHAVGLVRPVS 274
Query: 274 AAVNAL-TWQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + +++Y GV N C + ++NHAV VGY
Sbjct: 275 VAFEVVHEFRFYKKGVFTSNTCGNTPMDVNHAVLAVGY 312
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 119 bits (298), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 142/301 (47%), Gaps = 29/301 (9%)
Query: 13 LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
L LC L + V +K Q F+ + ++KSY+ E R+ F+ ++D +++
Sbjct: 4 LSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSEEFGARYNIFKANMDYVQQW 63
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
N + E+ G+ F+D++ EE++ +L + L+ +
Sbjct: 64 NS--KGSETVL-GLNNFADITNEEYRNTYLGTKFDASSLIGTQEEK-------------- 106
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
T KDWR G + V+NQ CG CW+FST + E H G L LS Q +
Sbjct: 107 --VFTTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNL 164
Query: 191 IDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY 250
IDC+ N GC GG +++ +N ++ ES YP ++ C+ K+ + +G + SY
Sbjct: 165 IDCSTE-NSGCDGGLMTYAFEYI-INNNGIDTESSYPYKAENGKCEYKSEN-SGATLSSY 221
Query: 251 TCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
T SESS+ + + + PV A++A ++Q Y G I Y + S N++H V VG
Sbjct: 222 KTVT-AGSESSLESAVNVN-PVSVAIDASHQSFQLYTSG-IYYEPECSSENLDHGVLAVG 278
Query: 309 Y 309
Y
Sbjct: 279 Y 279
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 119 bits (298), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
F ++LF L+ L +++ ++ ++ S+ +Y KSY S E + RF+ F
Sbjct: 7 FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+++L I+E N + S + G+ +F+DL++EEF++ +LR + + +++
Sbjct: 67 KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLRFTSGSNKTKVSNRYEPR-- 122
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
G +P+ + DWR AG + +++Q CG CWAFS + T E ++ + G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169
Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS QE+IDC N GC+GG ++ +N + E YP +D C
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNVDL 228
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
+ V I +Y + + + L T+ PV A++A ++ Y G+ C +
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA- 285
Query: 298 ANINHAVQIVGY 309
++HAV IVGY
Sbjct: 286 --VDHAVTIVGY 295
>sp|P04989|CYSP2_DICDI Cysteine proteinase 2 OS=Dictyostelium discoideum GN=cprB PE=2 SV=1
Length = 376
Score = 119 bits (298), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 86/281 (30%), Positives = 133/281 (47%), Gaps = 21/281 (7%)
Query: 32 QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLS 91
Q F+ + ++ + YS SE R+ F+ ++D ++ N N + G+ F+D++
Sbjct: 31 QYRTAFTEWTLKFNRQYSSSEFSNRYSIFKSNMDYVD--NWNSKGDSQTVLGLNNFADIT 88
Query: 92 EEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIG 151
EE++ +L VN H +N R + + T P DWR +
Sbjct: 89 NEEYRKTYLGTRVNAH-----------SYNGYDGREVLNVEDLQTN-PKSIDWRTKNAVT 136
Query: 152 KVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAG-NGNMGCSGGDFCALL 210
+++Q CG+CW+FST + E HALK L LS Q ++DC+G N GC GG
Sbjct: 137 PIKDQGQCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFGCDGGLMNNAF 196
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
D++ NK + + ES YP + + S G IK Y + SE S L + A HG
Sbjct: 197 DYIIKNKGI-DTESSYPYTAETGSTCLFNKSDIGATIKGY-VNITAGSEIS-LENGAQHG 253
Query: 271 PVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A++A ++Q Y G I Y S ++H V +VGY
Sbjct: 254 PVSVAIDASHNSFQLYTSG-IYYEPKCSPTELDHGVLVVGY 293
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 119 bits (298), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 91/285 (31%), Positives = 133/285 (46%), Gaps = 28/285 (9%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
++E+ ++LF S+ ++ K Y + I RF+ F +L I+E NK S G+ F
Sbjct: 40 SIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS---YWLGLNGF 96
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
+DLS +EFK +++ + H + D + HV T P DWR
Sbjct: 97 ADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKHV------------TNYPQSIDWRAK 144
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFC 207
G + V+NQ CG+CWAFST+ T E ++ + G L LS QE++DC + + GC GG
Sbjct: 145 GAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQT 203
Query: 208 ALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPS--ESSILTD 265
L ++ N V YP K C +AT G K+K T +PS E+S L
Sbjct: 204 TSLQYVANNGV--HTSKVYPYQAKQYKC--RATDKPGPKVK-ITGYKRVPSNCETSFLGA 258
Query: 266 IATHG-PVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+A V+ +Q Y GV C L +HAV VGY
Sbjct: 259 LANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKL---DHAVTAVGY 300
>sp|Q9JIA9|CATR_MOUSE Cathepsin R OS=Mus musculus GN=Ctsr PE=2 SV=1
Length = 334
Score = 119 bits (297), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 147/306 (48%), Gaps = 28/306 (9%)
Query: 10 IVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIE 68
+ A++ + FL + V P L+ L+ + ++ +Y KSYS E ++ +E+ L +I+
Sbjct: 1 MAAVVFIAFLYLGVASGVPVLDSSLDAEWQDWKIKYNKSYSLKEEKLKRVVWEEKLKMIK 60
Query: 69 ELNK-NRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRS 127
N+ N + EF D ++EEF+ + SV H + KR
Sbjct: 61 LHNRENSLGKNGFTMKMNEFGDQTDEEFRKMMIEISVWTH----------REGKSIMKRE 110
Query: 128 ITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV 187
G +P + DWR+ G + VR Q C ACWAF+ E+ + G L+ LSV
Sbjct: 111 --AGSILPKFV----DWRKKGYVTPVRRQGDCDACWAFAVTGAIEAQAIWQTGKLTPLSV 164
Query: 188 QEVIDCAG-NGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVK 246
Q ++DC+ GN GC GGD ++ ++ LE E+ YP KD C+ +P K
Sbjct: 165 QNLVDCSKPQGNNGCLGGDTYNAFQYV-LHNGGLESEATYPYEGKDGPCR---YNPKNSK 220
Query: 247 IKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVI-QYNCDGSLANINHA 303
+ +L SE ++ +AT GP+ A ++A +++ Y GG+ + NC S + H
Sbjct: 221 AEITGFVSLPQSEDILMAAVATIGPITAGIDASHESFKNYKGGIYHEPNC--SSDTVTHG 278
Query: 304 VQIVGY 309
V +VGY
Sbjct: 279 VLVVGY 284
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 118 bits (295), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 83/315 (26%), Positives = 145/315 (46%), Gaps = 29/315 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKK-------SYSKSEHDIR 56
K +AL+AL FL+I + P E+ L S Y+K + E + R
Sbjct: 2 AKPKFIALALVALSFLSIAQSI--PFTEKDLASEDSLWNLYEKWRTHHTVARDLDEKNRR 59
Query: 57 FKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHH 116
F F++++ I E N+ + +P + + +F D++ +EF++++ + HH+
Sbjct: 60 FNVFKENVKFIHEFNQKKDAP--YKLALNKFGDMTNQEFRSKYAGSKIQ------HHRSQ 111
Query: 117 DHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHA 176
+ ++P DWR G + V++Q CG+CWAFST+ + E ++
Sbjct: 112 RGIQKNTGSFMYENVGSLPA---ASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQ 168
Query: 177 LKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACK 236
+K G L LS QE++DC + N GC+GG +++ N + E YP +D C
Sbjct: 169 IKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEFIQKNGITT--EDSYPYAEQDGTCA 226
Query: 237 RKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCD 294
+ V I + D +E++++ +A P+ ++ A +Q+Y GV C
Sbjct: 227 SNLLNSPVVSIDGHQ-DVPANNENALMQAVANQ-PISVSIEASGYGFQFYSEGVFTGRCG 284
Query: 295 GSLANINHAVQIVGY 309
L +H V IVGY
Sbjct: 285 TEL---DHGVAIVGY 296
>sp|P43296|RD19A_ARATH Cysteine proteinase RD19a OS=Arabidopsis thaliana GN=RD19A PE=2
SV=1
Length = 368
Score = 118 bits (295), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 143/297 (48%), Gaps = 34/297 (11%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESAR 81
V ++P + + FS F++++ K Y S EHD RF F+ +L ++++ SA
Sbjct: 37 VGGAEPQVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANL---RRARRHQKLDPSAT 93
Query: 82 YGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVK 141
+G+T+FSDL+ EF+ +HL + S K + K + I +P
Sbjct: 94 HGVTQFSDLTRSEFRKKHLG-------VRSGFK--------LPKDANKAPILPTENLPED 138
Query: 142 KDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDC-------- 193
DWR+ G + V+NQ +CG+CW+FS E + L G L LS Q+++DC
Sbjct: 139 FDWRDHGAVTPVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQQLVDCDHECDPEE 198
Query: 194 AGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCD 253
A + + GC+GG + ++ + L E +YP KD + S + +++
Sbjct: 199 ADSCDSGCNGGLMNSAFEYT-LKTGGLMKEEDYPYTGKDGKTCKLDKSKIVASVSNFSVI 257
Query: 254 TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
++ E I ++ +GP+ A+NA Q Y+GGV Y C +NH V +VGY
Sbjct: 258 SI--DEEQIAANLVKNGPLAVAINAGYMQTYIGGVSCPYIC---TRRLNHGVLLVGY 309
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 117 bits (294), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 150/312 (48%), Gaps = 27/312 (8%)
Query: 2 FDVKNVLFIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSY-SKSEHDIRFKNF 60
F ++LF L+ L +++ ++ ++ S+ +Y KSY S E + RF+ F
Sbjct: 7 FVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIF 66
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+++L I+E N + S + G+ +F+DL++EEF++ +L + + +++
Sbjct: 67 KETLRFIDEHNADTN--RSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPR-- 122
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
G +P+ + DWR AG + +++Q CG CWAFS + T E ++ + G
Sbjct: 123 ---------VGQVLPSYV----DWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTG 169
Query: 181 TLSLLSVQEVIDCAGNGNM-GCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKA 239
L LS QE+IDC N GC+GG ++ +N + E YP +D C
Sbjct: 170 VLISLSEQELIDCGRTQNTRGCNGGYITDGFQFI-INNGGINTEENYPYTAQDGECNLDL 228
Query: 240 TSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSL 297
+ V I +Y + + + L T+ PV A++A +++Y G+ C +
Sbjct: 229 QNEKYVTIDTY--ENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTA- 285
Query: 298 ANINHAVQIVGY 309
I+HAV IVGY
Sbjct: 286 --IDHAVTIVGY 295
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 117 bits (293), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 85/271 (31%), Positives = 125/271 (46%), Gaps = 26/271 (9%)
Query: 51 SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLM 110
EH+ RF F +L ++ N R G+ F+DL+ EEF+ L V +
Sbjct: 69 GEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAKVAERSRA 128
Query: 111 SHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVET 170
+ ++ H+ V++ +P DWRE G + V+NQ CG+CWAFS V T
Sbjct: 129 AGERYR---HDGVEE------------LPESVDWREKGAVAPVKNQGQCGSCWAFSAVST 173
Query: 171 AESMHALKNGTLSLLSVQEVIDCAGNG-NMGCSGGDFCALLDWMDVNKVVLEPESEYPLL 229
ES++ L G + LS QE+++C+ NG N GC+GG D++ + ++ E +YP
Sbjct: 174 VESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFI-IKNGGIDTEDDYPYK 232
Query: 230 LKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGG 287
D C + V I + D E S+ +A H PV A+ A +Q Y G
Sbjct: 233 AVDGKCDINRENAKVVSIDGFE-DVPQNDEKSLQKAVA-HQPVSVAIEAGGREFQLYHSG 290
Query: 288 VIQYNCDGSLANINHAVQIVGY--DNYSRTW 316
V C SL +H V VGY DN W
Sbjct: 291 VFSGRCGTSL---DHGVVAVGYGTDNGKDYW 318
>sp|P14658|CYSP_TRYBB Cysteine proteinase OS=Trypanosoma brucei brucei PE=1 SV=1
Length = 450
Score = 116 bits (291), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 157/316 (49%), Gaps = 30/316 (9%)
Query: 4 VKNVLFIVALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKS-EHDIRFKNFE 61
V+ V V L+A+ V + ++E+ LE+ F++F+++Y K Y + E RF+ FE
Sbjct: 7 VRFVRLPVVLLAMAACLASVALGSLHVEESLEMRFAAFKKKYGKVYKDAKEEAFRFRAFE 66
Query: 62 KSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHN 121
+++ E+ + A +G+T FSD++ EEF+ R+ ++ +
Sbjct: 67 ENM---EQAKIQAAANPYATFGVTPFSDMTREEFRARY--------------RNGASYFA 109
Query: 122 HVKKRSITTGITIPTG-IPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
+KR T + + TG P DWRE G + V+ Q CG+CWAFST+ E +
Sbjct: 110 AAQKRLRKT-VNVTTGRAPAAVDWREKGAVTPVKVQGQCGSCWAFSTIGNIEGQWQVAGN 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKA 239
L LS Q ++ C + GC+GG +W+ + N + E+ YP + + ++
Sbjct: 169 PLVSLSEQMLVSC-DTIDSGCNGGLMDNAFNWIVNSNGGNVFTEASYPYVSGNG--EQPQ 225
Query: 240 TSPNGVKIKSYTCD--TLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSL 297
NG +I + D L E +I +A +GP+ AV+A ++ Y GG++ +C +
Sbjct: 226 CQMNGHEIGAAITDHVDLPQDEDAIAAYLAENGPLAIAVDAESFMDYNGGILT-SC--TS 282
Query: 298 ANINHAVQIVGYDNYS 313
++H V +VGY++ S
Sbjct: 283 KQLDHGVLLVGYNDNS 298
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 116 bits (290), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 89/308 (28%), Positives = 146/308 (47%), Gaps = 31/308 (10%)
Query: 7 VLFIVALIALCFL-AIPVKVSK--PNLEQKLELFSSFQQRYKKSYSKSEHDIR-FKNFEK 62
V + + LC + A P S+ PN + ++ F + Y + Y + +R F+ F+
Sbjct: 5 VQLVFLFLFLCAMWASPSAASRDEPN-DPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKN 63
Query: 63 SLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
++ IE N ++ S GI +F+D+++ EF ++ S+ ++ D
Sbjct: 64 NVKHIETFNSRNEN--SYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSFDD---- 117
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
+ I + +P DWR+ G + +V+NQ CG+CW+F+ + T E ++ +K G L
Sbjct: 118 ---------VNI-SAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYL 167
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSP 242
LS QEV+DCA + GC GG D++ N V E YP L C + P
Sbjct: 168 VSLSEQEVLDCA--VSYGCKGGWVNKAYDFIISNNGVTT-EENYPYLAYQGTCNANSF-P 223
Query: 243 NGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL-TWQYYLGGVIQYNCDGSLANIN 301
N I Y+ E S++ ++ P+ A ++A +QYY GGV C SL N
Sbjct: 224 NSAYITGYSY-VRRNDERSMMYAVSNQ-PIAALIDASENFQYYNGGVFSGPCGTSL---N 278
Query: 302 HAVQIVGY 309
HA+ I+GY
Sbjct: 279 HAITIIGY 286
>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
Length = 360
Score = 115 bits (289), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 86/278 (30%), Positives = 129/278 (46%), Gaps = 27/278 (9%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY KSY S +E RF+ F +SL ++ N+ S R GI F+D+S EEF
Sbjct: 59 FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLS---YRLGINRFADMSWEEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +H+ +P KDWRE GI+ V+
Sbjct: 116 RATRLGAAQNCSATLTGNHRMR----------------AAAVALPETKDWREDGIVSPVK 159
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
NQ CG+CW FST E+ + G LS Q+++DC N GC+GG +++
Sbjct: 160 NQGHCGSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYI 219
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP + CK K + GVK+ + + + +E + + PV
Sbjct: 220 KYNG-GLDTEESYPYQGVNGICKFKNENV-GVKVLD-SVNITLGAEDELKDAVGLVRPVS 276
Query: 274 AAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A +T ++ Y GV + C + ++NHAV VGY
Sbjct: 277 VAFEVITGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGY 314
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
SV=1
Length = 321
Score = 115 bits (289), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 83/305 (27%), Positives = 153/305 (50%), Gaps = 38/305 (12%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEE 69
VA + LC LA+ + P+ + F+ +Y + Y ++ ++ R + F+++ +IE+
Sbjct: 3 VAALFLCGLALAT--ASPSWDH-------FKTQYGRKYGDAKEELYRQRVFQQNEQLIED 53
Query: 70 LNKNRQSPE-SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
NK ++ E + + + +F D++ EEF + +M +K + + +++
Sbjct: 54 FNKKFENGEVTFKVAMNQFGDMTNEEF-----------NAVMKGYKKG----SRGEPKAV 98
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
T P V DWR ++ V++Q+ CG+CWAFS E H LKN L LS Q
Sbjct: 99 FTAEAGPMAADV--DWRTKALVTPVKDQEQCGSCWAFSATGALEGQHFLKNDELVSLSEQ 156
Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
+++DC+ + GN GC GG + D++ N + + ES YP +D +C+ A S +
Sbjct: 157 QLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGI-DTESSYPYEAEDRSCRFDANSIGAICT 215
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGV-IQYNCDGSLANINHAV 304
S + +E ++ ++ GP+ A++A ++Q+Y GV + NC + ++H V
Sbjct: 216 GSV---EVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQNCSPTF--LDHGV 270
Query: 305 QIVGY 309
VGY
Sbjct: 271 LAVGY 275
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 115 bits (288), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 90/311 (28%), Positives = 139/311 (44%), Gaps = 31/311 (9%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQK--------LELFSSFQQRYKKSYSKSEHDIRFKNF 60
FIV +ALC L + + K EL+ ++ + + S E RF F
Sbjct: 4 FIV--LALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLEEKAKRFNVF 61
Query: 61 EKSLDIIEELNKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHH 120
+ ++ I E NK +S + + +F D++ EEF+ + ++ HH+
Sbjct: 62 KHNVKHIHETNK---KDKSYKLKLNKFGDMTSEEFRRTYAGSNI------KHHRMFQGEK 112
Query: 121 NHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNG 180
K T+PT + DWR+ G + V+NQ CG+CWAFSTV E ++ ++
Sbjct: 113 KATKSFMYANVNTLPTSV----DWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTK 168
Query: 181 TLSLLSVQEVIDCAGNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKAT 240
L+ LS QE++DC N N GC+GG +++ K L E YP D C
Sbjct: 169 KLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIK-EKGGLTSELVYPYKASDETCDTNKE 227
Query: 241 SPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLA 298
+ V I + D SE ++ +A PV A++A +Q+Y GV C L
Sbjct: 228 NAPVVSIDGHE-DVPKNSEDDLMKAVANQ-PVSVAIDAGGSDFQFYSEGVFTGRCGTEL- 284
Query: 299 NINHAVQIVGY 309
NH V +VGY
Sbjct: 285 --NHGVAVVGY 293
>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
Length = 358
Score = 114 bits (286), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 85/278 (30%), Positives = 132/278 (47%), Gaps = 29/278 (10%)
Query: 37 FSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY K Y E +RF F+++LD+I NK S + G+ +F+DL+ +EF
Sbjct: 59 FARFTHRYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLS---YKLGVNQFADLTWQEF 115
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L HK +P KDWRE GI+ V+
Sbjct: 116 QRTKLGAAQNCSATLKGSHK------------------VTEAALPETKDWREDGIVSPVK 157
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWM 213
+Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 158 DQGGCGSCWTFSTTGALEAAYHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYI 217
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVI 273
N L+ E YP KD CK A + GV++ + + + + +E + + PV
Sbjct: 218 KSNG-GLDTEKAYPYTGKDETCKFSAENV-GVQVLN-SVNITLGAEDELKHAVGLVRPVS 274
Query: 274 AAVNAL-TWQYYLGGV-IQYNCDGSLANINHAVQIVGY 309
A + +++ Y GV +C + ++NHAV VGY
Sbjct: 275 IAFEVIHSFRLYKSGVYTDSHCGSTPMDVNHAVLAVGY 312
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 114 bits (285), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 82/291 (28%), Positives = 135/291 (46%), Gaps = 24/291 (8%)
Query: 23 VKVSKPNLEQKLELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSP-ESA 80
V + + E+ L++ ++ + KSY+ E + R+ F +L I+E N + S
Sbjct: 26 VSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSF 85
Query: 81 RYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPV 140
R G+ F+DL+ EE++ +L ++ V R + +P
Sbjct: 86 RLGLNRFADLTNEEYRDTYL-----------GLRNKPRRERKVSDRYLAAD---NEALPE 131
Query: 141 KKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMG 200
DWR G + ++++Q CG+CWAFS + E ++ + G L LS QE++DC + N G
Sbjct: 132 SVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEG 191
Query: 201 CSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSES 260
C+GG D++ +N ++ E +YP KD C + V I SY D SE+
Sbjct: 192 CNGGLMDYAFDFI-INNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYE-DVTPNSET 249
Query: 261 SILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
S+ +A PV A+ A +Q Y G+ C +L +H V VGY
Sbjct: 250 SLQKAVANQ-PVSVAIEAGGRAFQLYSSGIFTGKCGTAL---DHGVAAVGY 296
>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
PE=2 SV=2
Length = 362
Score = 114 bits (284), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 85/279 (30%), Positives = 129/279 (46%), Gaps = 30/279 (10%)
Query: 37 FSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F R+ K Y +E RF+ F +SL+++ N+ R P R GI F+D+S EEF
Sbjct: 62 FARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNR-RGLPY--RLGINRFADMSWEEF 118
Query: 96 KTRHLRHSVN-KHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVR 154
+ L + N L +H+ D +P KDWRE GI+ V+
Sbjct: 119 QASRLGAAQNCSATLAGNHRMRD-----------------AAALPETKDWREDGIVSPVK 161
Query: 155 NQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGCSGGDFCALLDWM 213
+Q CG+CW FST + E+ + G LS Q+++DCA N GCSGG +++
Sbjct: 162 DQGHCGSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYI 221
Query: 214 DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSY-TCDTLIPSESSILTDIATHGPV 272
N L+ E YP + C K P V +K + + + +E + + PV
Sbjct: 222 KYNG-GLDTEEAYPYTGVNGICHYK---PENVGVKVLDSVNITLGAEDELKNAVGLVRPV 277
Query: 273 IAAVNALT-WQYYLGGVIQYN-CDGSLANINHAVQIVGY 309
A + ++ Y GV + C S ++NHAV VGY
Sbjct: 278 SVAFQVINGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGY 316
>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 114 bits (284), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 86/291 (29%), Positives = 137/291 (47%), Gaps = 29/291 (9%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESA-RY 82
++ P +Q + ++ +++ Y +E + R +EK++ +I+ N + +
Sbjct: 16 LATPKFDQTFSAEWHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSM 75
Query: 83 GITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKK 142
+ F D++ EEF R VN + H H K R + + IP
Sbjct: 76 EMNAFGDMTNEEF-----RQVVNGY----------RHQKHKKGRLFQEPLMLK--IPKSV 118
Query: 143 DWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGNMGC 201
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN GC
Sbjct: 119 DWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGC 178
Query: 202 SGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP-SES 260
+GG ++ N L+ E YP KD +CK +A + + T IP E
Sbjct: 179 NGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----FAVANDTGFVDIPQQEK 233
Query: 261 SILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+++ +AT GP+ A++A + Q+Y G I Y + S N++H V +VGY
Sbjct: 234 ALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKNLDHGVLLVGY 283
>sp|P54639|CYSP4_DICDI Cysteine proteinase 4 OS=Dictyostelium discoideum GN=cprD PE=2 SV=2
Length = 442
Score = 114 bits (284), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 90/306 (29%), Positives = 151/306 (49%), Gaps = 34/306 (11%)
Query: 13 LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
L LC L + +K Q F+++ Q ++++YS E + R++ F+ ++D + +
Sbjct: 4 LSFLCLLLVSYASAKQQFSELQYRNAFTNWMQAHQRTYSSEEFNARYQIFKSNMDYVHQW 63
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
N + E+ G+ F+D++ +E++T +L + L+ + +K T
Sbjct: 64 NS--KGGETV-LGLNVFADITNQEYRTTYLGTPFDGSALIGTEE---------EKIFSTP 111
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGT---LSLLSV 187
T+ DWR G + ++NQ CG CW+FST + E H + +GT L LS
Sbjct: 112 APTV--------DWRAQGAVTPIKNQGQCGGCWSFSTTGSTEGAHFIASGTKKDLVSLSE 163
Query: 188 QEVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGV 245
Q +IDC+ + GN GC GG +++ +N ++ ES YP +D CK K TS G
Sbjct: 164 QNLIDCSKSYGNNGCEGGLMTLAFEYI-INNKGIDTESSYPYTAEDGKECKFK-TSNIGA 221
Query: 246 KIKSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDGSLANINHA 303
+I SY + SE+S L + + PV A++A ++Q Y G I Y S ++H
Sbjct: 222 QIVSYQ-NVTSGSEAS-LQSASNNAPVSVAIDASNESFQLYESG-IYYEPACSPTQLDHG 278
Query: 304 VQIVGY 309
V +VGY
Sbjct: 279 VLVVGY 284
>sp|Q94503|CYSP6_DICDI Cysteine proteinase 6 OS=Dictyostelium discoideum GN=cprF PE=2 SV=1
Length = 434
Score = 113 bits (283), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 87/305 (28%), Positives = 143/305 (46%), Gaps = 33/305 (10%)
Query: 13 LIALCFLAIPVKVSKPNLE--QKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEEL 70
L ALC L + V +K L Q F+++ +++ YS E + RF F+ ++D I E
Sbjct: 4 LSALCVLLVSVATAKQQLSELQYRNAFTNWMIAHQRHYSSEEFNGRFNIFKANMDYINEW 63
Query: 71 NKNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
N + E+ G+ F+D++ EE++ +L + L + V+ S+
Sbjct: 64 NT--KGSETV-LGLNVFADITNEEYRATYLGTPFDASSL--EMTPSEKVFGGVQANSV-- 116
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSV--Q 188
DWR G + ++NQ CG CW+FS E + NG L SV Q
Sbjct: 117 ------------DWRAKGAVTPIKNQGECGGCWSFSATGATEGAQYIANGDSDLTSVSEQ 164
Query: 189 EVIDCAGN-GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++IDC+G+ GN GC GG +++ +N ++ ES YP CK ++ G ++
Sbjct: 165 QLIDCSGSYGNNGCEGGLMTLAFEYI-INNGGIDTESSYPFTANTEKCKYNPSNI-GAEL 222
Query: 248 KSYTCDTLIPSESSILTDIATHGPVIAAVNAL--TWQYYLGGVIQYNCDG-SLANINHAV 304
SY + SES + + T GP A++A ++Q+Y G+ YN S ++H V
Sbjct: 223 SSYV-NVTSGSESDLAAKV-TQGPTSVAIDASQPSFQFYSSGI--YNEPACSSTQLDHGV 278
Query: 305 QIVGY 309
VG+
Sbjct: 279 LAVGF 283
>sp|O10364|CATV_NPVOP Viral cathepsin OS=Orgyia pseudotsugata multicapsid polyhedrosis
virus GN=VCATH PE=3 SV=1
Length = 324
Score = 113 bits (283), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/301 (32%), Positives = 149/301 (49%), Gaps = 32/301 (10%)
Query: 14 IALCFLAIPV-KVSKPNLEQKLELFSSFQQRYKKSYS-KSEHDIRFKNFEKSLDIIEELN 71
I LC L V + +L + F F ++ K+YS +SE RFK F+ +L+ E +N
Sbjct: 4 IMLCLLVCGVVHAATYDLLKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLE--EIIN 61
Query: 72 KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMS-HHKHHDHHHNHVKKRSITT 130
KN Q+ +A+Y I +FSDLS+EE +++K+ +S H+ + + R
Sbjct: 62 KN-QNDSTAQYEINKFSDLSKEE--------AISKYTGLSLPHQTQNFCEVVILDRPPDR 112
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
G P++ DWR+ + V+NQ CGACWAF+T+ + ES A+K L LS Q+
Sbjct: 113 G-------PLEFDWRQFNKVTSVKNQGVCGACWAFATLGSLESQFAIKYNRLINLSEQQF 165
Query: 191 IDCAGNGNMGCSGGDF-CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKS 249
IDC N GC GG A M++ V + ES+YP + C+ +PN +
Sbjct: 166 IDC-DRVNAGCDGGLLHTAFESAMEMGGVQM--ESDYPYETANGQCR---INPNRFVVGV 219
Query: 250 YTCDTLIPSESSILTDIATH-GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVG 308
+C I L D+ GP+ A++A Y G+++ + L NHAV +VG
Sbjct: 220 RSCRRYIVMFEEKLKDLLRAVGPIPVAIDASDIVNYRRGIMRQCANHGL---NHAVLLVG 276
Query: 309 Y 309
Y
Sbjct: 277 Y 277
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 113 bits (282), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 79/280 (28%), Positives = 134/280 (47%), Gaps = 28/280 (10%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDIIEELNKNRQSPESARYGITEFSD 89
E+ ++LF+S+ + K Y + + RF+ F+ +L+ I+E NK S G+ EF+D
Sbjct: 42 ERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNS---YWLGLNEFAD 98
Query: 90 LSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGI 149
LS +EF +++ ++ + S+ + + +P DWR+ G
Sbjct: 99 LSNDEFNEKYVGSLIDATIEQSYDEEFINEDT--------------VNLPENVDWRKKGA 144
Query: 150 IGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCAL 209
+ VR+Q +CG+CWAFS V T E ++ ++ G L LS QE++DC + GC GG
Sbjct: 145 VTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSH-GCKGGYPPYA 203
Query: 210 LDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH 269
L+++ N + L S+YP K C+ K G +K+ + P+ L +
Sbjct: 204 LEYVAKNGIHL--RSKYPYKAKQGTCRAKQVG--GPIVKTSGVGRVQPNNEGNLLNAIAK 259
Query: 270 GPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIV 307
PV V + +Q Y GG+ + C ++HAV V
Sbjct: 260 QPVSVVVESKGRPFQLYKGGIFEGPCG---TKVDHAVTAV 296
>sp|P25775|LMCPA_LEIME Cysteine proteinase A OS=Leishmania mexicana GN=LMCPA PE=2 SV=1
Length = 354
Score = 113 bits (282), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 149/316 (47%), Gaps = 32/316 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYS-KSEHDIRFKNFEKS 63
+ + L +C+ + + + P ++ + + SF++R+ K++ +E RF F+++
Sbjct: 10 AIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAFKQN 69
Query: 64 LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
+ LN Q+P A Y ++ +F+DL+ +EF +L N H K+H
Sbjct: 70 MQTAYFLNT--QNPH-AHYDVSGKFADLTPQEFAKLYL----NPDYYARHLKNH------ 116
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
K + + P+G+ + DWR+ G + V+NQ CG+CWAFS + E A +L
Sbjct: 117 --KEDVHVDDSAPSGV-MSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSL 173
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAA---CKRK 238
LS Q ++ C N + GC+GG ++W M + + E+ YP C +
Sbjct: 174 VSLSEQMLVSC-DNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDE 232
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
G KI + +L E I + GPV AV+A TWQ Y GGV+ SL
Sbjct: 233 GEV--GAKITGFL--SLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSLCLAWSL- 287
Query: 299 NINHAVQIVGYDNYSR 314
NH V IVG++ ++
Sbjct: 288 --NHGVLIVGFNKNAK 301
>sp|Q9YWK4|CATV_NPVBS Viral cathepsin OS=Buzura suppressaria nuclear polyhedrosis virus
GN=VCATH PE=3 SV=1
Length = 331
Score = 112 bits (281), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 90/278 (32%), Positives = 137/278 (49%), Gaps = 28/278 (10%)
Query: 35 ELFSSFQQRYKKSYSK-SEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEE 93
+ F +F Y K Y+ SE + RF F+++L EE+N + +SA Y I +F+DLS+
Sbjct: 29 DYFETFLANYNKMYNDTSEKERRFSIFQQTL---EEINYKNRLNDSAVYQINKFADLSKN 85
Query: 94 EFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI-PVKKDWREAGIIGK 152
E +++ +N V + N K T I P G P+ DWR+ +
Sbjct: 86 EIISKYT--GLNMPVQTT---------NFCK----TIVIDQPPGKGPLNFDWRQQNKVTS 130
Query: 153 VRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALLDW 212
++NQ+ CGACWAF+T+ + ES +A+KN LS Q++IDC +MGC GG +
Sbjct: 131 IKNQKACGACWAFATLASIESQYAIKNNVHIDLSEQQMIDC-DYVDMGCDGGLLHTAFEQ 189
Query: 213 MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATH-GP 271
M + L E EYP + C+ + VK+K C + L D+ GP
Sbjct: 190 M-IQMGELVQEHEYPYAGVNKPCELRGDETGVVKVKG--CYRYVVFREEKLKDLLRAVGP 246
Query: 272 VIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
+ A++A Y G+I Y C+ +NHAV +VGY
Sbjct: 247 IPMAIDASGIVNYHHGIIHY-CENY--GLNHAVLLVGY 281
>sp|P35591|CYSP1_LEIPI Cysteine proteinase 1 OS=Leishmania pifanoi GN=CYS1 PE=2 SV=2
Length = 354
Score = 112 bits (281), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 148/316 (46%), Gaps = 32/316 (10%)
Query: 7 VLFIVALIALCFLAIPVKVSKPNLEQKLEL--FSSFQQRYKKSYS-KSEHDIRFKNFEKS 63
+ + L +C+ + + + P ++ + + SF++R+ K++ +E RF F+++
Sbjct: 10 AIVVTILFVVCYGSALIAQTPPPVDNFVASAHYGSFKKRHGKAFGGDAEEGHRFNAFKQN 69
Query: 64 LDIIEELNKNRQSPESARYGIT-EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNH 122
+ LN Q+P A Y ++ +F+DL+ +EF +L N H K H
Sbjct: 70 MQTAYFLNT--QNPH-AHYDVSGKFADLTPQEFAKLYL----NPDYYARHLKDH------ 116
Query: 123 VKKRSITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTL 182
K + + P+G+ + DWR+ G + V+NQ CG+CWAFS + E A +L
Sbjct: 117 --KEDVHVDDSAPSGV-MSVDWRDKGAVTPVKNQGLCGSCWAFSAIGNIEGQWAASGHSL 173
Query: 183 SLLSVQEVIDCAGNGNMGCSGGDFCALLDW-MDVNKVVLEPESEYPLLLKDAA---CKRK 238
LS Q ++ C N + GC+GG ++W M + + E+ YP C +
Sbjct: 174 VSLSEQMLVSC-DNIDEGCNGGLMDQAMNWIMQSHNGSVFTEASYPYTSGGGTRPPCHDE 232
Query: 239 ATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLA 298
G KI + +L E I + GPV AV+A TWQ Y GGV+ SL
Sbjct: 233 GEV--GAKITGFL--SLPHDEERIAEWVEKRGPVAVAVDATTWQLYFGGVVSLCLAWSL- 287
Query: 299 NINHAVQIVGYDNYSR 314
NH V IVG++ ++
Sbjct: 288 --NHGVLIVGFNKNAK 301
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 112 bits (280), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 76/281 (27%), Positives = 134/281 (47%), Gaps = 21/281 (7%)
Query: 31 EQKLELFSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
E +L+ ++ + S S E RF F+ ++ + NK + + + +F+D+
Sbjct: 34 ESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANVMHVHNTNK---MDKPYKLKLNKFADM 90
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+ EF++ + VN H + +H + K S+ P DWR+ G +
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSV----------PASVDWRKKGAV 140
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
V++Q CG+CWAFST+ E ++ +K L LS QE++DC N GC+GG +
Sbjct: 141 TDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAF 200
Query: 211 DWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHG 270
+++ K + ES YP ++ C + V I + + + E+++L +A
Sbjct: 201 EFIK-QKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHE-NVPVNDENALLKAVANQ- 257
Query: 271 PVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
PV A++A +Q+Y GV +C+ ++NH V IVGY
Sbjct: 258 PVSVAIDAGGSDFQFYSEGVFTGDCN---TDLNHGVAIVGY 295
>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 112 bits (280), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 87/294 (29%), Positives = 139/294 (47%), Gaps = 35/294 (11%)
Query: 25 VSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYG 83
++ P +Q + ++ +++ Y +E + R +EK++ +I+ N + ++G
Sbjct: 16 LATPKFDQTFNAQWHQWKSTHRRLYGTNEEEWRRAVWEKNMRMIQLHNGEY---SNGKHG 72
Query: 84 IT----EFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIP 139
T F D++ EEF R VN + H H K R + + IP
Sbjct: 73 FTMEMNAFGDMTNEEF-----RQIVNGY----------RHQKHKKGRLFQEPLMLQ--IP 115
Query: 140 VKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCA-GNGN 198
DWRE G + V+NQ CG+CWAFS E LK G L LS Q ++DC+ GN
Sbjct: 116 KTVDWREKGCVTPVKNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGN 175
Query: 199 MGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP- 257
GC+GG ++ N L+ E YP KD +CK +A + + T IP
Sbjct: 176 QGCNGGLMDFAFQYIKENG-GLDSEESYPYEAKDGSCKYRAE----YAVANDTGFVDIPQ 230
Query: 258 SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
E +++ +AT GP+ A++A + Q+Y G I Y + S +++H V +VGY
Sbjct: 231 QEKALMKAVATVGPISVAMDASHPSLQFYSSG-IYYEPNCSSKDLDHGVLVVGY 283
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 112 bits (279), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 141/284 (49%), Gaps = 34/284 (11%)
Query: 34 LELFSSFQQRYKKSYSKS---EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDL 90
+ ++ ++ ++ K+ S++ E D RF+ F+ +L ++E N+ S R G+T F+DL
Sbjct: 47 MSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLS---YRLGLTRFADL 103
Query: 91 SEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGII 150
+ +E+++++L + K ++ S+ + +P DWR+ G +
Sbjct: 104 TNDEYRSKYLGAKMEKK--------------GERRTSLRYEARVGDELPESIDWRKKGAV 149
Query: 151 GKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDFCALL 210
+V++Q CG+CWAFST+ E ++ + G L LS QE++DC + N GC+GG L+
Sbjct: 150 AEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGG----LM 205
Query: 211 DW---MDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIA 267
D+ + ++ + +YP D C + + V I SY D SE S+ +A
Sbjct: 206 DYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYE-DVPTYSEESLKKAVA 264
Query: 268 THGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
H P+ A+ A +Q Y G+ +C L +H V VGY
Sbjct: 265 -HQPISIAIEAGGRAFQLYDSGIFDGSCGTQL---DHGVVAVGY 304
>sp|P05167|ALEU_HORVU Thiol protease aleurain OS=Hordeum vulgare PE=2 SV=1
Length = 362
Score = 112 bits (279), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 84/277 (30%), Positives = 128/277 (46%), Gaps = 26/277 (9%)
Query: 37 FSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEFSDLSEEEF 95
F+ F RY KSY S +E RF+ F +SL EE+ + R GI FSD+S EEF
Sbjct: 61 FARFAVRYGKSYESAAEVRRRFRIFSESL---EEVRSTNRKGLPYRLGINRFSDMSWEEF 117
Query: 96 KTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREAGIIGKVRN 155
+ L + ++ NH+ + + +P KDWRE GI+ V+N
Sbjct: 118 QATRLGAAQTCSATLAG--------NHLMRDA--------AALPETKDWREDGIVSPVKN 161
Query: 156 QQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGN-GNMGCSGGDFCALLDWMD 214
Q CG+CW FST E+ + G LS Q+++DCAG N GC+GG +++
Sbjct: 162 QAHCGSCWTFSTTGALEAAYTQATGKNISLSEQQLVDCAGGFNNFGCNGGLPSQAFEYIK 221
Query: 215 VNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDIATHGPVIA 274
N + + E YP + C KA + V++ + + + +E + + PV
Sbjct: 222 YNGGI-DTEESYPYKGVNGVCHYKAENA-AVQVLD-SVNITLNAEDELKNAVGLVRPVSV 278
Query: 275 AVNALTW--QYYLGGVIQYNCDGSLANINHAVQIVGY 309
A + QY G +C + ++NHAV VGY
Sbjct: 279 AFQVIDGFRQYKSGVYTSDHCGTTPDDVNHAVLAVGY 315
>sp|Q9YMP9|CATV_NPVLD Viral cathepsin OS=Lymantria dispar multicapsid nuclear
polyhedrosis virus GN=VCATH PE=3 SV=1
Length = 356
Score = 112 bits (279), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 87/284 (30%), Positives = 137/284 (48%), Gaps = 25/284 (8%)
Query: 29 NLEQKLELFSSFQQRYKKSYSKS-EHDIRFKNFEKSLDIIEELNKNRQSPESARYGITEF 87
NL++ + F SF + Y K+Y+ E + R+ F+ +L I N N +A Y I +F
Sbjct: 48 NLQRAPDYFESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKF 107
Query: 88 SDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGIPVKKDWREA 147
SDLS+ E + S+ + V N K + P P+ DWRE
Sbjct: 108 SDLSKSELIAKFTGLSIPERV-----------SNFCKTIILNQP---PDKGPLHFDWREQ 153
Query: 148 GIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGNMGCSGGDF- 206
+ ++NQ CGACWAF+T+ + ES A+++ L LS Q++IDC + +MGC+GG
Sbjct: 154 NKVTSIKNQGACGACWAFATLASVESQFAMRHNRLIDLSEQQLIDC-DSVDMGCNGGLLH 212
Query: 207 CALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIPSESSILTDI 266
A + M + V + E +YP + ++ C P V + C + L D+
Sbjct: 213 TAFEEIMRMGGV--QTELDYPFVGRNRRCGLDRHRPYVVSLVG--CYRYVMVNEEKLKDL 268
Query: 267 ATH-GPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGY 309
GP+ A++A Y GVI +C+ + +NHAV +VGY
Sbjct: 269 LRAVGPIPMAIDAADIVNYYRGVIS-SCENN--GLNHAVLLVGY 309
>sp|P25779|CYSP_TRYCR Cruzipain OS=Trypanosoma cruzi PE=1 SV=1
Length = 467
Score = 110 bits (276), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 82/294 (27%), Positives = 134/294 (45%), Gaps = 24/294 (8%)
Query: 21 IPVKVSKPNLEQKL-ELFSSFQQRYKKSY-SKSEHDIRFKNFEKSLDIIEELNKNRQSPE 78
+P + + E+ L F+ F+Q++ + Y S +E R F ++L + L+ +
Sbjct: 21 VPAATASLHAEETLTSQFAEFKQKHGRVYESAAEEAFRLSVFRENL-FLARLHA--AANP 77
Query: 79 SARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITTGITIPTGI 138
A +G+T FSDL+ EEF++R+ H H ++ + + + G
Sbjct: 78 HATFGVTPFSDLTREEFRSRY-------------HNGAAHFAAAQERARVPVKVEV-VGA 123
Query: 139 PVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEVIDCAGNGN 198
P DWR G + V++Q CG+CWAFS + E L L+ LS Q ++ C +
Sbjct: 124 PAAVDWRARGAVTAVKDQGQCGSCWAFSAIGNVECQWFLAGHPLTNLSEQMLVSC-DKTD 182
Query: 199 MGCSGGDFCALLDWM-DVNKVVLEPESEYPLLLKDAACKRKATSPNGVKIKSYTCDTLIP 257
GCSGG +W+ N + E YP + TS + V L
Sbjct: 183 SGCSGGLMNNAFEWIVQENNGAVYTEDSYPYASGEGISPPCTTSGHTVGATITGHVELPQ 242
Query: 258 SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANINHAVQIVGYDN 311
E+ I +A +GPV AV+A +W Y GGV+ +C ++H V +VGY++
Sbjct: 243 DEAQIAAWLAVNGPVAVAVDASSWMTYTGGVMT-SCVSE--QLDHGVLLVGYND 293
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 110 bits (275), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 87/304 (28%), Positives = 144/304 (47%), Gaps = 33/304 (10%)
Query: 13 LIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEELN 71
L ALC + + + P L+Q L+ + ++ + + Y +E R +EK++ +IE N
Sbjct: 7 LTALC---LGIASAAPKLDQNLDADWYKWKATHGRLYGMNEEGWRRAVWEKNMKMIELHN 63
Query: 72 KN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSITT 130
+ Q + F D++ EEF+ +M+ ++ H V S+
Sbjct: 64 QEYSQGKHGFSMAMNAFGDMTNEEFRQ-----------VMNGFQNQKHKKGKVFHESLV- 111
Query: 131 GITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQEV 190
+ +P + DWRE G + V+NQ CG+CWAFS E K G L LS Q +
Sbjct: 112 -LEVPKSV----DWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNL 166
Query: 191 IDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDA-ACKRKATSPNGVKIK 248
+DC+ GN GC+GG ++ N L+ E YP L ++ +C K
Sbjct: 167 VDCSRPQGNQGCNGGLMDNAFQYVKDNG-GLDTEESYPYLGRETNSCTYKPE----CSAA 221
Query: 249 SYTCDTLIPS-ESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHAVQ 305
+ T IP E +++ +AT GP+ A++A ++Q+Y G I Y+ D S +++H V
Sbjct: 222 NDTGFVDIPQREKALMKAVATVGPISVAIDAGHSSFQFYKSG-IYYDPDCSSKDLDHGVL 280
Query: 306 IVGY 309
+VGY
Sbjct: 281 VVGY 284
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 110 bits (274), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 86/311 (27%), Positives = 142/311 (45%), Gaps = 34/311 (10%)
Query: 9 FIVALIALCFLAIPVKVSKPNLEQKLELFSSFQQRYKKSYSKSEHDI-RFKNFEKSLDII 67
++V L+ LC A+ P L+ L+ ++ Y K Y + ++ R +EK+L +
Sbjct: 3 WLVGLLPLCSYAVAQVHKDPTLDHHWNLW---KKTYSKQYKEENEEVARRLIWEKNLKFV 59
Query: 68 EELN-KNRQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKR 126
N ++ S G+ D++ EE + LM + +V R
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTGEEVIS-----------LMGSLRVPSQWQRNVTYR 108
Query: 127 SITTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLS 186
S + +P DWRE G + +V+ Q +CGACWAFS V E+ LK G L LS
Sbjct: 109 SNSN-----QKLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLS 163
Query: 187 VQEVIDCAGN--GNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNG 244
Q ++DC+ GN GC+GG ++ ++ ++ E+ YP + C+ +
Sbjct: 164 AQNLVDCSTEKYGNKGCNGGFMTTAFQYI-IDNNGIDSEASYPYKAMNGKCRYDS----- 217
Query: 245 VKIKSYTCD--TLIP--SESSILTDIATHGPVIAAVNALTWQYYLGGVIQYNCDGSLANI 300
K ++ TC T +P SE ++ +A GPV A++A + ++L Y N+
Sbjct: 218 -KKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNV 276
Query: 301 NHAVQIVGYDN 311
NH V +VGY N
Sbjct: 277 NHGVLVVGYGN 287
>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
Length = 334
Score = 110 bits (274), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/319 (27%), Positives = 147/319 (46%), Gaps = 39/319 (12%)
Query: 11 VALIALCFLAIPVKVSKPNLEQKLEL-FSSFQQRYKKSYSKSEHDIRFKNFEKSLDIIEE 69
+ L A C L I V P +Q L+ + ++ +++ Y +E R +EK++ +IE
Sbjct: 5 LVLAAFC-LGIASAV--PKFDQNLDTKWYQWKATHRRLYGANEEGWRRAVWEKNMKMIEL 61
Query: 70 LNKN-RQSPESARYGITEFSDLSEEEFKTRHLRHSVNKHVLMSHHKHHDHHHNHVKKRSI 128
N Q + F D++ EEF+ +M ++ V + +
Sbjct: 62 HNGEYSQGKHGFTMAMNAFGDMTNEEFRQ-----------MMGCFRNQKFRKGKVFREPL 110
Query: 129 TTGITIPTGIPVKKDWREAGIIGKVRNQQTCGACWAFSTVETAESMHALKNGTLSLLSVQ 188
+ +P + DWR+ G + V+NQ+ CG+CWAFS E K G L LS Q
Sbjct: 111 F--LDLPKSV----DWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQ 164
Query: 189 EVIDCA-GNGNMGCSGGDFCALLDWMDVNKVVLEPESEYPLLLKDAACKRKATSPNGVKI 247
++DC+ GN GC+GG ++ N L+ E YP + D CK + + +
Sbjct: 165 NLVDCSRPQGNQGCNGGFMARAFQYVKENG-GLDSEESYPYVAVDEICKYRPEN----SV 219
Query: 248 KSYTCDTLIP--SESSILTDIATHGPVIAAVNA--LTWQYYLGGVIQYNCDGSLANINHA 303
+ T T++ E +++ +AT GP+ A++A ++Q+Y G I + D S N++H
Sbjct: 220 ANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKSG-IYFEPDCSSKNLDHG 278
Query: 304 VQIVGY------DNYSRTW 316
V +VGY N S+ W
Sbjct: 279 VLVVGYGFEGANSNNSKYW 297
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.319 0.133 0.406
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 117,397,920
Number of Sequences: 539616
Number of extensions: 4778598
Number of successful extensions: 20595
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 228
Number of HSP's successfully gapped in prelim test: 88
Number of HSP's that attempted gapping in prelim test: 19037
Number of HSP's gapped (non-prelim): 971
length of query: 317
length of database: 191,569,459
effective HSP length: 117
effective length of query: 200
effective length of database: 128,434,387
effective search space: 25686877400
effective search space used: 25686877400
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 61 (28.1 bits)