BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 014761
(419 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 404 bits (1039), Expect = e-112, Method: Compositional matrix adjust.
Identities = 204/405 (50%), Positives = 265/405 (65%), Gaps = 14/405 (3%)
Query: 23 SDINELFETWCKQHGKAYSSEQ--EKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFAD 80
+++ ++E W +HGKA S EK +R +IF+DN FV +HN N S+ L L FAD
Sbjct: 44 AEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEK-NLSYRLGLTRFAD 102
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKKGAVTEVKDQASCG 139
LT+ E+++ +LG A ++ R S++ + D +P SIDWRKKGAV EVKDQ CG
Sbjct: 103 LTNDEYRSKYLG---AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCG 159
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS GA+EGIN+IVTG L++LSEQEL+DCD SYN GC GGLMDYA++F+IKN GID
Sbjct: 160 SCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGID 219
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
T+KDYPY+G G C++ + N +VTID Y+DVP +E+ L +AV QP+S+ I RAF
Sbjct: 220 TDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAF 279
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLY SGIF G C T LDH V+ VGY +ENG DYWI++NSWG+SWG +GY+ M RN +S
Sbjct: 280 QLYDSGIFDGSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSS 339
Query: 320 GICGINMLASYPTKTGQ------NPPPSPPPGPTRCSLLTYCAAGETCCCGSSILGICLS 373
G CGI + SYP K G+ PPSP PT+C C TCCC C +
Sbjct: 340 GKCGIAIEPSYPIKNGENPPNPGPSPPSPIKPPTQCDSYYTCPESNTCCCLFEYGKYCFA 399
Query: 374 WKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVSLKFSFTVK 418
W CC +A CC D+ CCP YP+CD + CL +S F+VK
Sbjct: 400 WGCCPLEAATCCDDNYSCCPHEYPVCDLDQGTCL-LSKNSPFSVK 443
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 392 bits (1007), Expect = e-108, Method: Compositional matrix adjust.
Identities = 194/393 (49%), Positives = 250/393 (63%), Gaps = 11/393 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFAD 80
+ L+ W +HGK+Y++ E+++R F DN ++ +HN + G SF L LN FAD
Sbjct: 35 EARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFAD 94
Query: 81 LTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGA 140
LT++E++ ++LG + R+ + + +P S+DWR KGAV E+KDQ CG+
Sbjct: 95 LTNEEYRDTYLGLR--NKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGS 152
Query: 141 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDT 200
CWAFSA A+EGIN+IVTG L+SLSEQEL+DCD SYN GC GGLMDYA+ F+I N GIDT
Sbjct: 153 CWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDT 212
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
E DYPY+G+ +C+ + N +VTID Y+DV N+E L +AV QPVSV I RAFQ
Sbjct: 213 EDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQ 272
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LYSSGIFTG C T+LDH V VGY +ENG DYWI++NSWG+SWG +GY+ M+RN S G
Sbjct: 273 LYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMERNIKASSG 332
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGPTR------CSLLTYCAAGETCCCGSSILGICLSW 374
CGI + SYP K G+NPP P P+ C C TCCC C +W
Sbjct: 333 KCGIAVEPSYPLKKGENPPNPGPTPPSPTPPPTVCDNYYTCPDSTTCCCIYEYGKYCYAW 392
Query: 375 KCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCL 407
CC A CC DH CCP YPIC+ + CL
Sbjct: 393 GCCPLEGATCCDDHYSCCPHEYPICNVQQGTCL 425
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 368 bits (945), Expect = e-101, Method: Compositional matrix adjust.
Identities = 186/396 (46%), Positives = 249/396 (62%), Gaps = 22/396 (5%)
Query: 29 FETWCKQHGKAYSSE--QEKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLNAFADLTHQ 84
++ W ++G + E ++R +F DN FV HN + F L +N FADLT++
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111
Query: 85 EFKASFLGFSAASIDHDRRRNASVQ-SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWA 143
EF+A+FLG A +R R A + + ++P S+DWR+KGAV VK+Q CG+CWA
Sbjct: 112 EFRATFLGAKVA----ERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167
Query: 144 FSATGAIEGINKIVTGSLVSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEK 202
FSA +E IN++VTG +++LSEQEL++C + NSGC GGLMD A+ F+IKN GIDTE
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
DYPY+ G+C+ + N +V+IDG++DVP+N+EK L +AV QPVSV I R FQLY
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGIC 322
SG+F+G C TSLDH V+ VGY ++NG DYWI++NSWG WG +GY+ M+RN + G C
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKC 347
Query: 323 GINMLASYPTKTGQNPPPSPPPGPTR------------CSLLTYCAAGETCCCGSSILGI 370
GI M+ASYPTK+G NPP P PT C C AG TCCC +
Sbjct: 348 GIAMMASYPTKSGANPPKPSPTPPTPPTPPPPSAPDHVCDDNFSCPAGSTCCCAFGFRNL 407
Query: 371 CLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQC 406
CL W CC A CC DH CCP +YP+C++ C
Sbjct: 408 CLVWGCCPVEGATCCKDHASCCPPDYPVCNTRAGTC 443
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 358 bits (919), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 187/372 (50%), Positives = 238/372 (63%), Gaps = 13/372 (3%)
Query: 45 EKQQRLKIFEDNYAFVTQHNNMGNS--SFTLSLNAFADLTHQEFKASFLGFSAASIDHDR 102
E ++R ++F DN FV HN + F L +N FADLT+ EF+A++LG + A R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPAG--RGR 141
Query: 103 RRNASVQSPGNLRDVPASIDWRKKGAVTE-VKDQASCGACWAFSATGAIEGINKIVTGSL 161
R + + G + +P S+DWR KGAV VK+Q CG+CWAFSA A+EGINKIVTG L
Sbjct: 142 RVGEAYRHDG-VEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGEL 200
Query: 162 VSLSEQELIDCDRS-YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNR 220
VSLSEQEL++C R+ NSGC GG+MD A+ F+ +N G+DTE+DYPY G+CN K +R
Sbjct: 201 VSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSR 260
Query: 221 HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVL 280
+V+IDG++DVPEN+E L +AV QPVSV I R FQLY SG+FTG C T+LDH V+
Sbjct: 261 KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVV 320
Query: 281 IVGY--DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
VGY D+ G YW ++NSWG WG NGY+ M+RN G CGI M+ASYP K G NP
Sbjct: 321 AVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPIKKGPNP 380
Query: 339 PPSPPPGPT----RCSLLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPS 394
PSPP +C + C AG TCCC I C+ W CC A CC DH CCP
Sbjct: 381 KPSPPSPAPSPPQQCDRYSKCPAGTTCCCNYGIRNHCIVWGCCPVEGATCCKDHSTCCPK 440
Query: 395 NYPICDSVRHQC 406
YP+C++ C
Sbjct: 441 EYPVCNAKARTC 452
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 354 bits (908), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 169/326 (51%), Positives = 218/326 (66%), Gaps = 2/326 (0%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGN 68
SI+ + L + ELFE+W +H KAY S +EK R ++F +N + Q NN N
Sbjct: 31 FSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN 90
Query: 69 SSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGA 128
S + L LN FADLTH+EFK +LG + R+ +A+ + ++ D+P S+DWRKKGA
Sbjct: 91 S-YWLGLNEFADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYR-DITDLPKSVDWRKKGA 148
Query: 129 VTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 188
V VKDQ CG+CWAFS A+EGIN+I TG+L SLSEQELIDCD ++NSGC GGLMDYA
Sbjct: 149 VAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYA 208
Query: 189 YQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPV 248
+Q++I G+ E DYPY + G C +QK + VTI GY+DVPEN+++ L++A+ QPV
Sbjct: 209 FQYIISTGGLHKEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPV 268
Query: 249 SVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGY 308
SV I S R FQ Y G+F G C T LDH V VGY S G DY I+KNSWG WG G+
Sbjct: 269 SVAIEASGRDFQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGF 328
Query: 309 MHMQRNTGNSLGICGINMLASYPTKT 334
+ M+RNTG G+CGIN +ASYPTKT
Sbjct: 329 IRMKRNTGKPEGLCGINKMASYPTKT 354
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 353 bits (907), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 167/319 (52%), Positives = 219/319 (68%), Gaps = 8/319 (2%)
Query: 28 LFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAFADLT 82
++ W +HGK+ S+ ++ +R IF+DN F+ HN N N+++ L L FA+LT
Sbjct: 3 IYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLT 62
Query: 83 HQEFKASFLGFSAA---SIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCG 139
+ E+++ +LG I + N + N+ +VP ++DWR+KGAV +KDQ +CG
Sbjct: 63 NDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCG 122
Query: 140 ACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGID 199
+CWAFS A+EGINKIVTG LVSLSEQEL+DCD+SYN GC GGLMDYA+QF++KN G++
Sbjct: 123 SCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLN 182
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
TEKDYPY G G+CN N +VTIDGY+DVP +E L +AV QPVSV I RAF
Sbjct: 183 TEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAF 242
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
Q Y SGIFTG C T++DHAV+ VGY SENGVDYWI++NSWG WG +GY+ M+RN +
Sbjct: 243 QHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASKS 302
Query: 320 GICGINMLASYPTKTGQNP 338
G CGI + ASYP K NP
Sbjct: 303 GKCGIAIEASYPVKYSPNP 321
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 352 bits (902), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 167/325 (51%), Positives = 217/325 (66%), Gaps = 1/325 (0%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
SI+ S L + ELFE W KAY + +EK R ++F+DN + + N G S
Sbjct: 32 SIVGYSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKS 91
Query: 70 SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAV 129
+ L LN FADL+H+EFK +LG + D R+ + + ++ VP S+DWRKKGAV
Sbjct: 92 -YWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAV 150
Query: 130 TEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY 189
EVK+Q SCG+CWAFS A+EGINKIVTG+L +LSEQELIDCD +YN+GC GGLMDYA+
Sbjct: 151 AEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAF 210
Query: 190 QFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVS 249
++++KN G+ E+DYPY + G C QK VTI+G++DVP N+EK LL+A+ QP+S
Sbjct: 211 EYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLS 270
Query: 250 VGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYM 309
V I S R FQ YS G+F G C LDH V VGY S G DY I+KNSWG WG GY+
Sbjct: 271 VAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYI 330
Query: 310 HMQRNTGNSLGICGINMLASYPTKT 334
++RNTG G+CGIN +AS+PTKT
Sbjct: 331 RLKRNTGKPEGLCGINKMASFPTKT 355
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 345 bits (885), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 166/324 (51%), Positives = 219/324 (67%), Gaps = 9/324 (2%)
Query: 24 DINELFETWCKQHGKAYSSEQ----EKQQRLKIFEDNYAFVTQHN-NMGNSSFTLSLNAF 78
++ ++ W +HGK ++ ++ +R IF+DN F+ HN + N+++ L L F
Sbjct: 44 EVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKF 103
Query: 79 ADLTHQEFKASFLGFS---AASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQ 135
DLT+ E++ +LG A I + N + N ++VP ++DWR+KGAV +KDQ
Sbjct: 104 TDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQ 163
Query: 136 ASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKN 195
+CG+CWAFS T A+EGINKIVTG L+SLSEQEL+DCD+SYN GC GGLMDYA+QF++KN
Sbjct: 164 GTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKN 223
Query: 196 HGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGS 255
G++TEKDYPYRG G+CN N +V+IDGY+DVP +E L +A+ QPVSV I
Sbjct: 224 GGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAG 283
Query: 256 ERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNT 315
R FQ Y SGIFTG C T+LDHAV+ VGY SENGVDYWI++NSWG WG GY+ M+RN
Sbjct: 284 GRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVDYWIVRNSWGPRWGEEGYIRMERNL 343
Query: 316 GNSL-GICGINMLASYPTKTGQNP 338
S G CGI + ASYP K NP
Sbjct: 344 AASKSGKCGIAVEASYPVKYSPNP 367
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 338 bits (867), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 158/315 (50%), Positives = 220/315 (69%), Gaps = 5/315 (1%)
Query: 23 SDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLT 82
+++ ++E W ++ K Y+ EK++R KIF+DN FV +HN++ + +F + L FADLT
Sbjct: 38 TEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLT 97
Query: 83 HQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++EF+A +L + + G++ +P +DWR GAV VKDQ +CG+CW
Sbjct: 98 NEEFRAIYLRKKMERTKDSVKTERYLYKEGDV--LPDEVDWRANGAVVSVKDQGNCGSCW 155
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDTE 201
AFSA GA+EGIN+I TG L+SLSEQEL+DCDR + N+GC GG+M+YA++F++KN GI+T+
Sbjct: 156 AFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETD 215
Query: 202 KDYPYRG-QAGQCNKQKLNR-HIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAF 259
+DYPY G CN K N +VTIDGY+DVP ++EK L +AV QPVSV I S +AF
Sbjct: 216 QDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAF 275
Query: 260 QLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
QLY SG+ TG C SLDH V++VGY S +G DYWII+NSWG +WG +GY+ +QRN +
Sbjct: 276 QLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDPF 335
Query: 320 GICGINMLASYPTKT 334
G CGI M+ SYPTK+
Sbjct: 336 GKCGIAMMPSYPTKS 350
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 337 bits (865), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 169/308 (54%), Positives = 209/308 (67%), Gaps = 7/308 (2%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P SIDWR+KG + VKDQ SCG+CWAFSA A+E IN IVTG+L+SLSEQEL+DCDRSY
Sbjct: 18 LPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSY 77
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GGLMDYA++FVIKN GIDTE+DYPY+ + G C++ + N +V ID Y+DVP NNE
Sbjct: 78 NEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNE 137
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
K L +AV QPVS+ + R FQ Y SGIFTG C T++DH V+I GY +ENG+DYWI++
Sbjct: 138 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGTENGMDYWIVR 197
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTG------QNPPPSPPPGPTRCS 350
NSWG + NGY+ +QRN +S G+CG+ + SYP KTG PPSP PT C
Sbjct: 198 NSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPVKTGPNPPKPAPSPPSPVKPPTECD 257
Query: 351 LLTYCAAGETCCCGSSILGICLSWKCCGFSSAVCCSDHRYCCPSNYPICDSVRHQCLTVS 410
+ CA G TCCC C SW CC A CC DH CCP +YPIC+ VR ++S
Sbjct: 258 EYSQCAVGTTCCCILQFRRSCFSWGCCPLEGATCCEDHYSCCPHDYPICN-VRQGTCSMS 316
Query: 411 LKFSFTVK 418
VK
Sbjct: 317 KGNPLGVK 324
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 333 bits (854), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 162/348 (46%), Positives = 227/348 (65%), Gaps = 12/348 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCS-------DINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
S++ S LL+ SL N + ++ ++E+W ++GK+Y+S E ++R +IF++
Sbjct: 9 SMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKE 68
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
F+ +HN N S+ + LN FADLT +EF++++LGF++ S ++ + ++ P +
Sbjct: 69 TLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS---NKTKVSNRYEPRVGQ 125
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P+ +DWR GAV ++K Q CG CWAFSA +EGINKIVTG L+SLSEQELIDC R+
Sbjct: 126 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 176 YNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
N+ GC GG + +QF+I N GI+TE++YPY Q G+CN N VTID Y++VP N
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYN 245
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
NE L AV QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
+KNSW +WG GYM + RN G + G CGI + SYP K P P
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKP 352
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 330 bits (847), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 172/352 (48%), Positives = 220/352 (62%), Gaps = 18/352 (5%)
Query: 3 SLAFFLLSIL-LLSSLPLNYCSDINE-----LFETWCKQHGKAYSSEQEKQQRLKIFEDN 56
+LA LS L + S+P +E L+E W H A + EK +R +F++N
Sbjct: 8 ALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVARDLD-EKNRRFNVFKEN 66
Query: 57 YAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG---- 112
F+ + N ++ + L+LN F D+T+QEF++ + G + I H R + ++ G
Sbjct: 67 VKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAG---SKIQHHRSQRGIQKNTGSFMY 123
Query: 113 -NLRDVPA-SIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELI 170
N+ +PA SIDWR KGAVT VKDQ CG+CWAFS ++EGIN+I TG LVSLSEQEL+
Sbjct: 124 ENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELV 183
Query: 171 DCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKD 230
DCD SYN GC GGLMDYA++F+ KN GI TE YPY Q G C LN +V+IDG++D
Sbjct: 184 DCDTSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQD 242
Query: 231 VPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENG 289
VP NNE L+QAV QP+SV I S FQ YS G+FTG C T LDH V IVGY + +G
Sbjct: 243 VPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDG 302
Query: 290 VDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPS 341
YWI+KNSWG WG +GY+ MQR + G CGI M ASYP KT NP S
Sbjct: 303 TKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPIKTSANPKNS 354
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 330 bits (846), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 161/348 (46%), Positives = 226/348 (64%), Gaps = 12/348 (3%)
Query: 3 SLAFFLLSILLLSSLPLNYCS-------DINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
S++ S LL+ SL N + ++ ++E+W ++GK+Y+S E ++R +IF++
Sbjct: 9 SMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKE 68
Query: 56 NYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLR 115
F+ +HN N S+ + LN FADLT +EF++++L F++ S ++ + ++ P +
Sbjct: 69 TLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGS---NKTKVSNRYEPRVGQ 125
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
+P+ +DWR GAV ++K Q CG CWAFSA +EGINKIVTG L+SLSEQELIDC R+
Sbjct: 126 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 176 YNS-GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPEN 234
N+ GC GG + +QF+I N GI+TE++YPY Q G+CN N VTID Y++VP N
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYN 245
Query: 235 NEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWI 294
NE L AV QPVSV + + AF+ YSSGIFTGPC T++DHAV IVGY +E G+DYWI
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWI 305
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
+KNSW +WG GYM + RN G + G CGI + SYP K P P
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA-GTCGIATMPSYPVKYNNQNHPKP 352
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 317 bits (811), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 159/348 (45%), Positives = 213/348 (61%), Gaps = 22/348 (6%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG 67
L S + + L + +L+E W H + EK +R F+ N F+ HN G
Sbjct: 25 LCSAIPMEDKDLESEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRG 83
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG---------NLRDVP 118
+ + L LN F D+ EF+A+F+G D RR+ + P N+ D+P
Sbjct: 84 DHPYRLHLNRFGDMDQAEFRATFVG--------DLRRDTPSKPPSVPGFMYAALNVSDLP 135
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
S+DWR+KGAVT VKDQ CG+CWAFS ++EGIN I TGSLVSLSEQELIDCD + N
Sbjct: 136 PSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADND 195
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRH---IVTIDGYKDVPENN 235
GC GGLMD A++++ N G+ TE YPYR G CN + ++ +V IDG++DVP N+
Sbjct: 196 GCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANS 255
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWI 294
E+ L +AV QPVSV + S +AF YS G+FTG C T LDH V +VGY +E+G YW
Sbjct: 256 EEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWT 315
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSP 342
+KNSWG SWG GY+ +++++G S G+CGI M ASYP KT P P+P
Sbjct: 316 VKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYSKPKPTP 363
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 315 bits (807), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 168/364 (46%), Positives = 217/364 (59%), Gaps = 23/364 (6%)
Query: 4 LAFFLLSILLLSSL--------PLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFED 55
L FL S+++L + + ++ L++ W + H S E+++R +F
Sbjct: 5 LLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRW-RSHHSVPRSLNEREKRFNVFRH 63
Query: 56 NYAFVTQHN-NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRR----RNASVQ- 109
N V HN N N S+ L LN FADLT EFK ++ G ++I H R + S Q
Sbjct: 64 NVMHV--HNTNKKNRSYKLKLNKFADLTINEFKNAYTG---SNIKHHRMLQGPKRGSKQF 118
Query: 110 --SPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQ 167
NL +P+S+DWRKKGAVTE+K+Q CG+CWAFS A+EGINKI T LVSLSEQ
Sbjct: 119 MYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQ 178
Query: 168 ELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDG 227
EL+DCD N GC GGLM+ A++F+ KN GI TE YPY G G+C+ K N +VTIDG
Sbjct: 179 ELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDG 238
Query: 228 YKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE 287
++DVPEN+E LL+AV QPVSV I FQ YS G+FTG C T L+H V VGY SE
Sbjct: 239 HEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGSE 298
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPPSPPPGPT 347
G YWI++NSWG WG GY+ ++R G CGI M ASYP K + P+P G
Sbjct: 299 RGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIKL-SSSNPTPKDGDV 357
Query: 348 RCSL 351
+ L
Sbjct: 358 KDEL 361
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 313 bits (802), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 155/308 (50%), Positives = 206/308 (66%), Gaps = 4/308 (1%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE+W +HGK Y S EK++RL IFEDN F+ N N S+ L L FADL+ E+K
Sbjct: 48 IFESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRN-AENLSYRLGLTGFADLSLHEYK 106
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDV-PASIDWRKKGAVTEVKDQASCGACWAFSA 146
G + +S + + DV P S+DWR +GAVTEVKDQ C +CWAFS
Sbjct: 107 EVCHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFST 166
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
GA+EG+NKIVTG LV+LSEQ+LI+C++ N+GCGGG ++ AY+F++KN G+ T+ DYPY
Sbjct: 167 VGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPY 225
Query: 207 RGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSG 265
+ G C+ + K N V IDGY+++P N+E L++AV QPV+ I S R FQLY SG
Sbjct: 226 KAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESG 285
Query: 266 IFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGIN 325
+F G C T+L+H V++VGY +ENG DYW++KNS G +WG GYM M RN N G+CGI
Sbjct: 286 VFDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIA 345
Query: 326 MLASYPTK 333
M ASYP K
Sbjct: 346 MRASYPLK 353
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 312 bits (800), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 158/346 (45%), Positives = 211/346 (60%), Gaps = 22/346 (6%)
Query: 8 LLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMG 67
L S + + L + +L+E W H + EK +R F+ N F+ HN G
Sbjct: 25 LCSAIPMEDKDLESEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRG 83
Query: 68 NSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG---------NLRDVP 118
+ + L LN F D+ EF+A+F+G D RR+ + P N+ D+P
Sbjct: 84 DHPYRLHLNRFGDMDQAEFRATFVG--------DLRRDTPAKPPSVPGFMYAALNVSDLP 135
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
S+DWR+KGAVT VKDQ CG+CWAFS ++EGIN I TGSLVSLSEQELIDCD + N
Sbjct: 136 PSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADND 195
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRH---IVTIDGYKDVPENN 235
GC GGLMD A++++ N G+ TE YPYR G CN + ++ +V IDG++DVP N+
Sbjct: 196 GCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANS 255
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYD-SENGVDYWI 294
E+ L +AV QPVSV + S +AF YS G+FTG C T LDH V +VGY +E+G YW
Sbjct: 256 EEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWT 315
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNPPP 340
+KNSWG SWG GY+ +++++G S G+CGI M ASYP KT P P
Sbjct: 316 VKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYNKPMP 361
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 311 bits (797), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 153/326 (46%), Positives = 205/326 (62%), Gaps = 11/326 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H S EK +R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLR-----DVPASIDWRKKGAVTEVKDQASCGAC 141
++++ G + ++H + S G VPAS+DWRKKGAVT+VKDQ CG+C
Sbjct: 96 RSTYAG---SKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LVSLSEQEL+DCD+ N GC GGLM+ A++F+ + GI TE
Sbjct: 153 WAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY Q G C++ K+N V+IDG+++VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG C+T L+H V IVGY + +G +YWI++NSWG WG GY+ MQRN G
Sbjct: 273 YSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRNISKKEG 332
Query: 321 ICGINMLASYPTKTGQNPPPSPPPGP 346
+CGI M+ASYP K + P P
Sbjct: 333 LCGIAMMASYPIKNSSDNPTGSLSSP 358
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 309 bits (792), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 151/319 (47%), Positives = 204/319 (63%), Gaps = 11/319 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+L+E W + H S EK +R +F+ N V N M + + L LN FAD+T+ EF
Sbjct: 38 DLYERW-RSHHTVSRSLGEKHKRFNVFKANLMHVHNTNKM-DKPYKLKLNKFADMTNHEF 95
Query: 87 KASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGAC 141
++++ G + ++H R + G + VP S+DWRKKGAVT+VKDQ CG+C
Sbjct: 96 RSTYAG---SKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSC 152
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T LV+LSEQEL+DCD+ N GC GGLM+ A++F+ + GI TE
Sbjct: 153 WAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTE 212
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
+YPY+ Q G C+ K+N V+IDG+++VP N+E LL+AV QPVSV I FQ
Sbjct: 213 SNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQF 272
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG CST L+H V IVGY + +G +YWI++NSWG WG +GY+ MQRN G
Sbjct: 273 YSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEG 332
Query: 321 ICGINMLASYPTKTGQNPP 339
+CGI ML SYP K + P
Sbjct: 333 LCGIAMLPSYPIKNSSDNP 351
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 308 bits (790), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 155/313 (49%), Positives = 204/313 (65%), Gaps = 14/313 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
+FE+W +HGK Y S EK++RL IFEDN F+T N N S+ L LN FADL+ E+
Sbjct: 55 MFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRN-AENLSYRLGLNRFADLSLHEY- 112
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRD------VPASIDWRKKGAVTEVKDQASCGAC 141
G D RN + N +P S+DWR +GAVTEVKDQ C +C
Sbjct: 113 ----GEICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSC 168
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS GA+EG+NKIVTG LV+LSEQ+LI+C++ N+GCGGG ++ AY+F++ N G+ T+
Sbjct: 169 WAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTD 227
Query: 202 KDYPYRGQAGQCN-KQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQ 260
DYPY+ G C + K + V IDGY+++P N+E L++AV QPV+ + S R FQ
Sbjct: 228 NDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQ 287
Query: 261 LYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
LY SG+F G C T+L+H V++VGY +ENG DYWI+KNS G +WG GYM M RN N G
Sbjct: 288 LYESGVFDGTCGTNLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRG 347
Query: 321 ICGINMLASYPTK 333
+CGI M ASYP K
Sbjct: 348 LCGIAMRASYPLK 360
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 307 bits (786), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 156/325 (48%), Positives = 197/325 (60%), Gaps = 11/325 (3%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
L+E W + H S EKQ+R +F+ N V N M + + L LN FAD+T+ EF+
Sbjct: 37 LYERW-RSHHTVSRSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFR 94
Query: 88 ASFLGFSAASIDHDRRRNASVQSPG-----NLRDVPASIDWRKKGAVTEVKDQASCGACW 142
++ S + + H R + G + VPAS+DWRKKGAVT VKDQ CG+CW
Sbjct: 95 NTY---SGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCW 151
Query: 143 AFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEK 202
AFS A+EGIN+I T LVSLSEQEL+DCD N GC GGLMDYA++F+ + GI TE
Sbjct: 152 AFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEA 211
Query: 203 DYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLY 262
+YPY G C+ K N V+IDG+++VPEN+E LL+AV QPVSV I FQ Y
Sbjct: 212 NYPYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFY 271
Query: 263 SSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGI 321
S G+FTG C T LDH V IVGY + +G YW +KNSWG WG GY+ M+R + G+
Sbjct: 272 SEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGL 331
Query: 322 CGINMLASYPTKTGQNPPPSPPPGP 346
CGI M ASYP K N P P
Sbjct: 332 CGIAMEASYPIKKSSNNPSGIKSSP 356
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 305 bits (780), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 159/351 (45%), Positives = 210/351 (59%), Gaps = 23/351 (6%)
Query: 6 FFLLSILLLSSLPLNYCSDINE-----------LFETWCKQHGKAYSSEQEKQQRLKIFE 54
FF++ I LS L + D +E L+E W H + +S E +R +F
Sbjct: 4 FFIVLISFLSLLQASKGFDFDEKELETEENVWKLYERWRGHHSVSRAS-HEAIKRFNVFR 62
Query: 55 DNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPG-- 112
N V N N + L +N FAD+TH EF++S+ G +++ H R + G
Sbjct: 63 HNVLHV-HRTNKKNKPYKLKINRFADITHHEFRSSYAG---SNVKHHRMLRGPKRGSGGF 118
Query: 113 ---NLRDVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQEL 169
N+ VP+S+DWR+KGAVTEVK+Q CG+CWAFS A+EGINKI T LVSLSEQEL
Sbjct: 119 MYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQEL 178
Query: 170 IDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQ-CNKQKLNRHIVTIDGY 228
+DCD N GC GGLM+ A++F+ N GI TE+ YPY Q C + VTIDG+
Sbjct: 179 VDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGH 238
Query: 229 KDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGY-DSE 287
+ VPEN+E++LL+AV QPVSV I FQLYS G+F G C T L+H V+IVGY +++
Sbjct: 239 EHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETK 298
Query: 288 NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQNP 338
NG YWI++NSWG WG GY+ ++R + G CGI M ASYPTK P
Sbjct: 299 NGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKLSSTP 349
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 293 bits (751), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 195/319 (61%), Gaps = 11/319 (3%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
EL+E W H A S E EK +R +F+ N + N + S+ L LN F D+T +EF
Sbjct: 36 ELYERWRSHHTVARSLE-EKAKRFNVFKHNVKHI-HETNKKDKSYKLKLNKFGDMTSEEF 93
Query: 87 KASFLGFSAASIDHDRRRNASVQSP-----GNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
+ ++ G ++I H R ++ N+ +P S+DWRK GAVT VK+Q CG+C
Sbjct: 94 RRTYAG---SNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSC 150
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTE 201
WAFS A+EGIN+I T L SLSEQEL+DCD + N GC GGLMD A++F+ + G+ +E
Sbjct: 151 WAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSE 210
Query: 202 KDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
YPY+ C+ K N +V+IDG++DVP+N+E L++AV QPVSV I FQ
Sbjct: 211 LVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQF 270
Query: 262 YSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLG 320
YS G+FTG C T L+H V +VGY + +G YWI+KNSWG WG GY+ MQR + G
Sbjct: 271 YSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEG 330
Query: 321 ICGINMLASYPTKTGQNPP 339
+CGI M ASYP K P
Sbjct: 331 LCGIAMEASYPLKNSNTNP 349
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 280 bits (716), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 142/307 (46%), Positives = 192/307 (62%), Gaps = 3/307 (0%)
Query: 27 ELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEF 86
+LF++W +H K Y S EK R +IF DN ++ + N N+S+ L LN FADL++ EF
Sbjct: 46 QLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEF 104
Query: 87 KASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
K ++GF A + + ++ + P SIDWR KGAVT VK+Q +CG+CWAFS
Sbjct: 105 KKKYVGFVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFST 164
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
+EGINKIVTG+L+ LSEQEL+DCD+ ++ GC GG + Q+V N+G+ T K YPY
Sbjct: 165 IATVEGINKIVTGNLLELSEQELVDCDK-HSYGCKGGYQTTSLQYV-ANNGVHTSKVYPY 222
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ + +C V I GYK VP N E L A+ QP+SV + + FQLY SG+
Sbjct: 223 QAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGV 282
Query: 267 FTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINM 326
F GPC T LDHAV VGY + +G +Y IIKNSWG +WG GYM ++R +GNS G CG+
Sbjct: 283 FDGPCGTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYK 342
Query: 327 LASYPTK 333
+ YP K
Sbjct: 343 SSYYPFK 349
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 272 bits (696), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 143/341 (41%), Positives = 203/341 (59%), Gaps = 13/341 (3%)
Query: 4 LAFFLLSILLLSSLPLNYCSD-----INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
L F L + + + P D + + FE W ++G+ Y + EK +R +IF++N
Sbjct: 7 LVFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVK 66
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
+ N+ +S+TL +N F D+T EF A + G S ++ +R S N+ VP
Sbjct: 67 HIETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLP-LNIEREPVVSFDDV-NISAVP 124
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNS 178
SIDWR GAV EVK+Q CG+CW+F+A +EGI KI TG LVSLSEQE++DC SY
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSY-- 182
Query: 179 GCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQ 238
GC GG ++ AY F+I N+G+ TE++YPY G CN I GY V N+E+
Sbjct: 183 GCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNS-AYITGYSYVRRNDERS 241
Query: 239 LLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIKN 297
++ AV QP++ I SE FQ Y+ G+F+GPC TSL+HA+ I+GY + +G YWI++N
Sbjct: 242 MMYAVSNQPIAALIDASEN-FQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRN 300
Query: 298 SWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT-KTGQN 337
SWG SWG GY+ M R +S G+CGI M +PT ++G N
Sbjct: 301 SWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFPTLQSGAN 341
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 268 bits (684), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 146/321 (45%), Positives = 199/321 (61%), Gaps = 19/321 (5%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNAFADL 81
I E + T+ QH K Y++E E++ R+KIF +N + +HN + G S+ L LN +AD+
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQS---PGNLRDVPASIDWRKKGAVTEVKDQASC 138
H EFK + G++ R R V + P VP S+DWR+ GAVT VKDQ C
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143
Query: 139 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHG 197
G+CWAFS+TGA+EG + G LVSLSEQ L+DC Y N+GC GGLMD A++++ N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 198 IDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAVVAQ-PVSVGICG 254
IDTEK YPY G C+ N+ + T G+ D+PE +E+++ +AV PVSV I
Sbjct: 204 IDTEKSYPYEGIDDSCH---FNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDA 260
Query: 255 SERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGMNGYMHM 311
S +FQLYS G++ P +LDH VL+VGY + E+G+DYW++KNSWG +WG GY+ M
Sbjct: 261 SHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKM 320
Query: 312 QRNTGNSLGICGINMLASYPT 332
RN N CGI +SYPT
Sbjct: 321 ARNQNNQ---CGIATASSYPT 338
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 266 bits (680), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 149/332 (44%), Positives = 196/332 (59%), Gaps = 7/332 (2%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
S++F SI+ S L + +LF +W H K Y + EK R +IF+DN ++ +
Sbjct: 22 SVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDE 81
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLG-FSAASIDHDRRRNASVQSPGNLRDVPASI 121
N N+S+ L LN FADL++ EF ++G A+I+ + NL P ++
Sbjct: 82 TNKK-NNSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEEFINEDTVNL---PENV 137
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCG 181
DWRKKGAVT V+ Q SCG+CWAFSA +EGINKI TG LV LSEQEL+DC+R + GC
Sbjct: 138 DWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERR-SHGCK 196
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GG YA ++V KN GI YPY+ + G C +++ IV G V NNE LL
Sbjct: 197 GGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLN 255
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGR 301
A+ QPVSV + R FQLY GIF GPC T +DHAV VGY G Y +IKNSWG
Sbjct: 256 AIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGT 315
Query: 302 SWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
+WG GY+ ++R GNS G+CG+ + YPTK
Sbjct: 316 AWGEKGYIRIKRAPGNSPGVCGLYKSSYYPTK 347
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 261 bits (668), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 123/222 (55%), Positives = 162/222 (72%), Gaps = 2/222 (0%)
Query: 116 DVPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRS 175
D+P SIDWR+ GAV VK+Q CG+CWAFS A+EGIN+IVTG L+SLSEQ+L+DC +
Sbjct: 2 DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-TT 60
Query: 176 YNSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENN 235
N GC GG M+ A+QF++ N GI++E+ YPYRGQ G CN +N +V+ID Y++VP +N
Sbjct: 61 ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNS-TVNAPVVSIDSYENVPSHN 119
Query: 236 EKQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWII 295
E+ L +AV QPVSV + + R FQLY SGIFTG C+ S +HA+ +VGY +EN D+WI+
Sbjct: 120 EQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIV 179
Query: 296 KNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKTGQN 337
KNSWG++WG +GY+ +RN N G CGI ASYP K G N
Sbjct: 180 KNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPVKKGTN 221
>sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max PE=1 SV=1
Length = 379
Score = 261 bits (667), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 200/337 (59%), Gaps = 17/337 (5%)
Query: 10 SILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNS 69
SIL L ++ LF+ W +HG+ Y + +E+ +RL+IF++N ++ N S
Sbjct: 25 SILDLDLTKFTTQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKS 84
Query: 70 --SFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRD-VPASIDWRKK 126
S L LN FAD+T QEF +L + N ++ D PAS DWRKK
Sbjct: 85 PHSHRLGLNKFADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKK 144
Query: 127 GAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMD 186
G +T+VK Q CG WAFSATGAIE + I TG LVSLSEQEL+DC + G G
Sbjct: 145 GVITQVKYQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNGWQY 203
Query: 187 YAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDV-------PENNEKQL 239
++++V+++ GI T+ DYPYR + G+C K+ + VTIDGY+ + E+
Sbjct: 204 QSFEWVLEHGGIATDDDYPYRAKEGRCKANKI-QDKVTIDGYETLIMSDESTESETEQAF 262
Query: 240 LQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTS---LDHAVLIVGYDSENGVDYWIIK 296
L A++ QP+SV I + F LY+ GI+ G TS ++H VL+VGY S +GVDYWI K
Sbjct: 263 LSAILEQPISVSI--DAKDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAK 320
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG WG +GY+ +QRNTGN LG+CG+N ASYPTK
Sbjct: 321 NSWGFDWGEDGYIWIQRNTGNLLGVCGMNYFASYPTK 357
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 258 bits (660), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 142/330 (43%), Positives = 193/330 (58%), Gaps = 8/330 (2%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
L+F SI+ S L + +LFE+W +H K Y + EK R +IF+DN ++ +
Sbjct: 23 LSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDET 82
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDW 123
N N+S+ L LN FAD+++ EFK + G A + V + G++ ++P +DW
Sbjct: 83 NKK-NNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDV-NIPEYVDW 140
Query: 124 RKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGG 183
R+KGAVT VK+Q SCG+CWAFSA IEGI KI TG+L SEQEL+DCDR + GC GG
Sbjct: 141 RQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR-SYGCNGG 199
Query: 184 LMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV 243
A Q V + +GI YPY G C ++ + DG + V NE LL ++
Sbjct: 200 YPWSALQLVAQ-YGIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSI 258
Query: 244 VAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSW 303
QPVSV + + + FQLY GIF GPC +DHAV VGY G +Y +IKNSWG W
Sbjct: 259 ANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGY----GPNYILIKNSWGTGW 314
Query: 304 GMNGYMHMQRNTGNSLGICGINMLASYPTK 333
G NGY+ ++R TGNS G+CG+ + YP K
Sbjct: 315 GENGYIRIKRGTGNSYGVCGLYTSSFYPVK 344
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 256 bits (653), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 142/326 (43%), Positives = 198/326 (60%), Gaps = 22/326 (6%)
Query: 21 YCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM---GNSSFTLSLNA 77
+ + E + T+ +H K Y E E++ RLKIF +N + +HN G SF L++N
Sbjct: 51 FADVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNK 110
Query: 78 FADLTHQEFKASFLGFSAASIDHDRRRNASVQ-----SPGNLRDVPASIDWRKKGAVTEV 132
+ADL H EF+ GF+ R + S + SP ++ +P S+DWR KGAVT V
Sbjct: 111 YADLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVT-LPKSVDWRTKGAVTAV 169
Query: 133 KDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQF 191
KDQ CG+CWAFS+TGA+EG + +G LVSLSEQ L+DC Y N+GC GGLMD A+++
Sbjct: 170 KDQGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRY 229
Query: 192 VIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIV--TIDGYKDVPENNEKQLLQAV-VAQPV 248
+ N GIDTEK YPY C+ N+ V T G+ D+P+ +EK++ +AV PV
Sbjct: 230 IKDNGGIDTEKSYPYEAIDDSCH---FNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPV 286
Query: 249 SVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDS-ENGVDYWIIKNSWGRSWGM 305
SV I S +FQ YS G++ P + +LDH VL+VG+ + E+G DYW++KNSWG +WG
Sbjct: 287 SVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGD 346
Query: 306 NGYMHMQRNTGNSLGICGINMLASYP 331
G++ M RN N CGI +SYP
Sbjct: 347 KGFIKMLRNKENQ---CGIASASSYP 369
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
Length = 215
Score = 253 bits (645), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 116/217 (53%), Positives = 158/217 (72%), Gaps = 3/217 (1%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P+ +DWR KGAV +K+Q CG+CWAFSA A+E INKI TG L+SLSEQEL+DCD +
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA- 59
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
+ GC GG M+ A+Q++I N GIDT+++YPY G C +L +V+I+G++ V NNE
Sbjct: 60 SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYRL--RVVSINGFQRVTRNNE 117
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
L AV +QPVSV + + FQ YSSGIFTGPC T+ +H V+IVGY +++G +YWI++
Sbjct: 118 SALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVR 177
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG++WG GY+ M+RN +S G+CGI L SYPTK
Sbjct: 178 NSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYPTK 214
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 252 bits (643), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 131/336 (38%), Positives = 192/336 (57%), Gaps = 14/336 (4%)
Query: 4 LAFFLLSILLLSSLPLNYCSD-----INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYA 58
L F L + ++ + P D + + FE W ++G+ Y EK R +IF++N
Sbjct: 7 LVFLFLFLCVMWASPSAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVN 66
Query: 59 FVTQHNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQ-SPGNLRDV 117
+ NN +S+TL +N F D+T+ EF A + G S + + +R V ++ V
Sbjct: 67 HIETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLS---LPLNIKREPVVSFDDVDISSV 123
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P SIDWR GAVT VK+Q CG+CWAF++ +E I KI G+LVSLSEQ+++DC SY
Sbjct: 124 PQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAVSY- 182
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GC GG ++ AY F+I N G+ + YPY+ G C + I Y V NNE+
Sbjct: 183 -GCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNS-AYITRYTYVQRNNER 240
Query: 238 QLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSE-NGVDYWIIK 296
++ AV QP++ + S FQ Y G+FTGPC T L+HA++I+GY + +G +WI++
Sbjct: 241 NMMYAVSNQPIAAALDASGN-FQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVR 299
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
NSWG WG GY+ + R+ +S G+CGI M YPT
Sbjct: 300 NSWGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYPT 335
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 249 bits (637), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 196/315 (62%), Gaps = 14/315 (4%)
Query: 28 LFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFK 87
++E W ++GK Y+ EK++R KIF+DN + +HN+ N S+ LN F+DLT EF+
Sbjct: 40 MYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQ 99
Query: 88 ASFLGFSAASIDHDRRRNASVQSPGNLRDV-PASIDWRKKGAVT-EVKDQASCGACWAFS 145
AS+LG ++ + + + DV P +DWR++GAV VK Q CG+CWAF+
Sbjct: 100 ASYLG---GKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWAFA 156
Query: 146 ATGAIEGINKIVTGSLVSLSEQELIDCDR-SYNSGCGGGLMDYAYQFVIKNHGIDTEKDY 204
ATGA+EGIN+I TG LVSLSEQELIDCDR + N GC GG +A++F+ +N GI +++ Y
Sbjct: 157 ATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVY 216
Query: 205 PYRGQ---AGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQL 261
Y G+ A + + K R +VTI+G++ VP N+E L +AV QP+SV I +
Sbjct: 217 GYTGEDTAACKAIEMKTTR-VVTINGHEVVPVNDEMSLKKAVAYQPISVMISAAN--MSD 273
Query: 262 YSSGIFTGPCSTSL-DHAVLIVGY-DSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNSL 319
Y SG++ G CS DH VLIVGY S + DYW+I+NSWG WG GY+ +QRN
Sbjct: 274 YKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQRNFHEPT 333
Query: 320 GICGINMLASYPTKT 334
G C + + YP K+
Sbjct: 334 GKCAVAVAPVYPIKS 348
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 249 bits (636), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 201/332 (60%), Gaps = 9/332 (2%)
Query: 4 LAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
+LSI +S+ + + F W + + KAY+ +E R + F+ N +V
Sbjct: 9 FTLIVLSISFISAGNVFSHKQYQDSFIDWMRSNNKAYT-HKEFMPRYEEFKKNMDYVHNW 67
Query: 64 NNMGNSSFTLSLNAFADLTHQEFKASFLGFSA-ASIDHDRRRNASVQSPGNLRDVPASID 122
N+ G S L LN ADL+++E++ ++LG A ++ +RN ++ P ++D
Sbjct: 68 NSKG-SKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLNRPQFKQPLNVD 126
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCG 181
WR+K AVT VKDQ CG+C++FS TG++EG+ I TG LVSLSEQ ++DC S+ N GC
Sbjct: 127 WREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCN 186
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GGLM A++++IKN+G+++E+ YPY + K + I YK++ +E L
Sbjct: 187 GGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVAAKITSYKEIEAGDENDLQN 246
Query: 242 AVVAQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWIIKNSW 299
A++ PVSV I S +FQLY++G++ P S LDH VL VG ++NG DY+I+KNSW
Sbjct: 247 ALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGTDNGEDYYIVKNSW 306
Query: 300 GRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
G SWG+NGY+HM RN N+ CGI+ +ASYP
Sbjct: 307 GPSWGLNGYIHMARNKDNN---CGISTMASYP 335
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 243 bits (620), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 189/314 (60%), Gaps = 15/314 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
++ + W K + K Y E E+ R I+E N FV HN +MG S+ L +N D+
Sbjct: 24 LDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDM 83
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E S +G + + +RN + +S N + +P S+DWR+KG VTEVK Q SCGAC
Sbjct: 84 TGEEV-ISLMG--SLRVPSQWQRNVTYRSNSN-QKLPDSVDWREKGCVTEVKYQGSCGAC 139
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNSGCGGGLMDYAYQFVIKNHGID 199
WAFSA GA+E K+ TG LVSLS Q L+DC ++ N GC GG M A+Q++I N+GID
Sbjct: 140 WAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGID 199
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERA 258
+E YPY+ G+C R T Y ++P +E L +AV + PVSV I S +
Sbjct: 200 SEASYPYKAMNGKCRYDSKKR-AATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYS 258
Query: 259 FQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
F LY SG++ P C+ +++H VL+VGY + NG DYW++KNSWG ++G GY+ M RN+GN
Sbjct: 259 FFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMARNSGN 318
Query: 318 SLGICGINMLASYP 331
CGI SYP
Sbjct: 319 H---CGIASYPSYP 329
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 242 bits (617), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 138/337 (40%), Positives = 201/337 (59%), Gaps = 17/337 (5%)
Query: 4 LAFFLLSILLLSSLPLNYCSD--INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVT 61
+ + + ++LL SS + D ++ ++ W K +GK Y + E+ R I+E N VT
Sbjct: 1 MNWLVWALLLCSSAMAHVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTVT 60
Query: 62 QHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVP 118
HN +MG S+ L +N D+T +E + S+ + RN + +S N + +P
Sbjct: 61 LHNLEHSMGMHSYELGMNHLGDMTSEEVISLM---SSLRVPSQWPRNVTYKSDPNQK-LP 116
Query: 119 ASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-- 176
S+DWR+KG VTEVK Q +CG+CWAFSA GA+E K+ TG LVSLS Q L+DC +
Sbjct: 117 DSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYG 176
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GG M A+Q++I N+GID+E YPY+ G+C NR T Y ++P +E
Sbjct: 177 NKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYDVKNR-AATCSRYIELPFGSE 235
Query: 237 KQLLQAVVAQ-PVSVGICGSERAFQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWI 294
+ L +AV + PVSVGI S +F LY +G++ P C+ +++H VL+VGY + +G DYW+
Sbjct: 236 EALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWL 295
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
+KNSWG +G GY+ M RN+GN CGI SYP
Sbjct: 296 VKNSWGLHFGDQGYIRMARNSGNH---CGIANYPSYP 329
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 242 bits (617), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 138/331 (41%), Positives = 187/331 (56%), Gaps = 5/331 (1%)
Query: 3 SLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQ 62
SL++ SI+ S L + +LF +W +H K Y + EK R +IF+DN ++ +
Sbjct: 22 SLSYCDFSIVGYSQDDLTSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDE 81
Query: 63 HNNMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASID 122
N M N + L LN F+DL++ EFK ++G + V ++ D+P S+D
Sbjct: 82 RNKMING-YWLGLNEFSDLSNDEFKEKYVGSLPEDYTNQPYDEEFVNE--DIVDLPESVD 138
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGG 182
WR KGAVT VK Q C +CWAFS +EGINKI TG+LV LSEQEL+DCD+ + GC
Sbjct: 139 WRAKGAVTPVKHQGYCESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQ-SYGCNR 197
Query: 183 GLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQA 242
G + Q+V +N GI YPY + C ++ V +G V NNE LL A
Sbjct: 198 GYQSTSLQYVAQN-GIHLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGSLLNA 256
Query: 243 VVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIKNSWGRS 302
+ QPVSV + + R FQ Y GIF G C T +DHAV VGY G Y +IKNSWG
Sbjct: 257 IAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYILIKNSWGPG 316
Query: 303 WGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
WG NGY+ ++R +GNS G+CG+ + YP K
Sbjct: 317 WGENGYIRIRRASGNSPGVCGVYRSSYYPIK 347
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 241 bits (615), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 115/217 (52%), Positives = 154/217 (70%), Gaps = 2/217 (0%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P SIDWR+KGAV VK+Q CG+CWAF A A+EGIN+IVTG L+SLSEQ+L+DC +
Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCS-TR 61
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GG A+Q++I N GI++E+ YPY G G C+ K N H+V+ID Y++VP N+E
Sbjct: 62 NHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCDT-KENAHVVSIDSYRNVPSNDE 120
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
K L +AV QPVSV + + R FQLY +GIFTG C+ S +H + G ++EN DYW +K
Sbjct: 121 KSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGGRETENDKDYWTVK 180
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTK 333
NSWG++WG +GY+ ++RN S G CGI + SYP K
Sbjct: 181 NSWGKNWGESGYIRVERNIAESSGKCGIAISPSYPIK 217
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 241 bits (615), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 144/326 (44%), Positives = 187/326 (57%), Gaps = 36/326 (11%)
Query: 29 FETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNMGNSSFTLSLNAFADLTHQEFKA 88
F W H K+Y+SE E R IF+ N +V Q N+ G S L LN FAD+T++E++
Sbjct: 30 FTDWMITHQKSYTSE-EFGARYNIFKANMDYVQQWNSKG-SETVLGLNNFADITNEEYRN 87
Query: 89 SFLG--FSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGACWAFSA 146
++LG F A+S+ + S AS DWR +GAVT VK+Q CG CW+FS
Sbjct: 88 TYLGTKFDASSLIGTQEEKVFTTSS------AASKDWRSEGAVTPVKNQGQCGGCWSFST 141
Query: 147 TGAIEGINKIVTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYQFVIKNHGIDTEKDYPY 206
TG+ EG + G LVSLSEQ LIDC NSGC GGLM YA++++I N+GIDTE YPY
Sbjct: 142 TGSTEGAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNGIDTESSYPY 200
Query: 207 RGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQPVSVGICGSERAFQLYSSGI 266
+ + G+C + N T+ YK V +E L AV PVSV I S ++FQLY+SGI
Sbjct: 201 KAENGKCEYKSENSG-ATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYTSGI 259
Query: 267 FTGP--CSTSLDHAVLIVGYDSENGV-------------------DYWIIKNSWGRSWGM 305
+ P S +LDH VL VGY S +G +YWI+KNSWG SWG+
Sbjct: 260 YYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTSWGI 319
Query: 306 NGYMHMQRNTGNSLGICGINMLASYP 331
GY+ M RN N+ CGI AS+P
Sbjct: 320 EGYILMSRNRDNN---CGIASSASFP 342
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 240 bits (613), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/340 (40%), Positives = 193/340 (56%), Gaps = 22/340 (6%)
Query: 5 AFFLLSILL-LSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQH 63
+ FL ++ L ++S +++ + W HG+ Y +E +R ++E N + H
Sbjct: 4 SLFLTALCLGIASAAPKLDQNLDADWYKWKATHGRLYGMNEEGWRRA-VWEKNMKMIELH 62
Query: 64 N---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPAS 120
N + G F++++NAF D+T++EF+ GF + + + V + +VP S
Sbjct: 63 NQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQ-----NQKHKKGKVFHESLVLEVPKS 117
Query: 121 IDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSG 179
+DWR+KG VT VK+Q CG+CWAFSATGA+EG TG LVSLSEQ L+DC R N G
Sbjct: 118 VDWREKGYVTAVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQG 177
Query: 180 CGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQL 239
C GGLMD A+Q+V N G+DTE+ YPY G+ K G+ D+P+ EK L
Sbjct: 178 CNGGLMDNAFQYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDIPQ-REKAL 236
Query: 240 LQAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDY 292
++AV P+SV I +FQ Y SGI+ P S LDH VL+VGY E N +
Sbjct: 237 MKAVATVGPISVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKF 296
Query: 293 WIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
WI+KNSWG WG NGY+ M ++ N CGI+ ASYPT
Sbjct: 297 WIVKNSWGPEWGWNGYVKMAKDQNNH---CGISTAASYPT 333
>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
Length = 333
Score = 240 bits (612), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/343 (40%), Positives = 193/343 (56%), Gaps = 22/343 (6%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
MN F L ++S + +N + W H + Y +E +R ++E N +
Sbjct: 1 MNPSLFLTALCLGIASAAPKFDQSLNAQWYQWKATHRRLYGMNEEGWRRA-VWEKNMKMI 59
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
HN + G FT+++NAF D+T++EF+ GF + ++ Q P ++
Sbjct: 60 ELHNREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQ----NQKHKKGKMFQEP-LFAEI 114
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY- 176
P S+DWR+KG VT VK+Q CG+CWAFSATGA+EG TG LVSLSEQ L+DC R+
Sbjct: 115 PKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQG 174
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GGLMD A+++V N G+D+E+ YPY G+ + K G+ D+P+ E
Sbjct: 175 NEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQ-RE 233
Query: 237 KQLLQAVVA-QPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSENGVD-- 291
K L++AV P+SV I ++FQ Y SGI+ P S LDH VL+VGY E G D
Sbjct: 234 KALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFE-GTDSN 292
Query: 292 --YWIIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
+WI+KNSWG WG NGY+ M ++ N CGI ASYPT
Sbjct: 293 NKFWIVKNSWGPEWGWNGYVKMAKDQNNH---CGIATAASYPT 332
>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
Length = 330
Score = 239 bits (609), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 132/313 (42%), Positives = 183/313 (58%), Gaps = 14/313 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
++ + W K +GK Y + E+ R I+E N FV HN +MG S+ L +N D+
Sbjct: 24 LDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDM 83
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E + S+ + + +RN + +S N + +P S+DWR+KG VTEVK Q SCGAC
Sbjct: 84 TSEEVMSLM---SSLRVPNQWQRNITYKSNPN-QMLPDSVDWREKGCVTEVKYQGSCGAC 139
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSA GA+E K+ TG LVSLS Q L+DC Y N GC GG M A+Q++I N GID+
Sbjct: 140 WAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKGIDS 199
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERAF 259
E YPY+ +C R T Y ++P E L +AV + PV VG+ S +F
Sbjct: 200 EASYPYKATDQKCQYDSKYR-AATCSKYTELPYGREDVLKEAVANKGPVCVGVDASHPSF 258
Query: 260 QLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGNS 318
LY SG++ P C+ ++H VL++GY NG +YW++KNSWG ++G GY+ M RN GN
Sbjct: 259 FLYRSGVYYDPACTQKVNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGYIRMARNKGNH 318
Query: 319 LGICGINMLASYP 331
CGI SYP
Sbjct: 319 ---CGIASYPSYP 328
>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
Length = 331
Score = 238 bits (608), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 184/314 (58%), Gaps = 15/314 (4%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
++ + W K +GK Y + E+ R I+E N FV HN +MG S+ L +N D+
Sbjct: 24 LDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDM 83
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T +E + S+ + +RN + +S N R +P S+DWR+KG VTEVK Q SCGAC
Sbjct: 84 TSEEVMSLM---SSLRVPSQWQRNITYKSNPN-RILPDSVDWREKGCVTEVKYQGSCGAC 139
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDC--DRSYNSGCGGGLMDYAYQFVIKNHGID 199
WAFSA GA+E K+ TG LVSLS Q L+DC ++ N GC GG M A+Q++I N GID
Sbjct: 140 WAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGID 199
Query: 200 TEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAVVAQ-PVSVGICGSERA 258
++ YPY+ +C R T Y ++P E L +AV + PVSVG+ +
Sbjct: 200 SDASYPYKAMDQKCQYDSKYR-AATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPS 258
Query: 259 FQLYSSGIFTGP-CSTSLDHAVLIVGYDSENGVDYWIIKNSWGRSWGMNGYMHMQRNTGN 317
F LY SG++ P C+ +++H VL+VGY NG +YW++KNSWG ++G GY+ M RN GN
Sbjct: 259 FFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKEYWLVKNSWGHNFGEEGYIRMARNKGN 318
Query: 318 SLGICGINMLASYP 331
CGI SYP
Sbjct: 319 H---CGIASFPSYP 329
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 237 bits (605), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 142/333 (42%), Positives = 194/333 (58%), Gaps = 24/333 (7%)
Query: 9 LSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHNNM-- 66
+++L L + L S E F+ ++G+ Y +E R IFE N ++ + N
Sbjct: 3 VAVLFLCGVALAAASPSWEHFK---GKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYE 59
Query: 67 -GNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNA--SVQSPGNLRDVPAS-ID 122
G +F L++N F D+T +EF A G + RR+A SV P A+ +D
Sbjct: 60 NGEVTFNLAMNKFGDMTLEEFNAVMKG-------NIPRRSAPVSVFYPKKETGPQATEVD 112
Query: 123 WRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN-SGCG 181
WR KGAVT VKDQ CG+CWAFS TG++EG + + TGSL+SL+EQ+L+DC R Y GC
Sbjct: 113 WRTKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCN 172
Query: 182 GGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQ 241
GG M+ A+ ++ N+GIDTE YPY + G C + N T G+ ++ +E L Q
Sbjct: 173 GGWMNDAFDYIKANNGIDTEAAYPYEARDGSC-RFDSNSVAATCSGHTNIASGSETGLQQ 231
Query: 242 AVV-AQPVSVGICGSERAFQLYSSGIFTGP-CSTS-LDHAVLIVGYDSENGVDYWIIKNS 298
AV P+SV I + +FQ YSSG++ P CS S LDHAVL VGY SE G D+W++KNS
Sbjct: 232 AVRDIGPISVTIDAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNS 291
Query: 299 WGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
W SWG GY+ M RN N+ CGI +ASYP
Sbjct: 292 WATSWGDAGYIKMSRNRNNN---CGIATVASYP 321
>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
Length = 329
Score = 237 bits (604), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/337 (40%), Positives = 192/337 (56%), Gaps = 16/337 (4%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M L LL ++ + P ++ +E W K H K Y+S+ ++ R I+E N ++
Sbjct: 1 MWGLKVLLLPVMSFALYPEEI---LDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYI 57
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
+ HN ++G ++ L++N D+T++E G + R N ++ P
Sbjct: 58 SIHNLEASLGVHTYELAMNHLGDMTNEEVVQKMTGLKVPA--SHSRSNDTLYIPDWEGRA 115
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P S+D+RKKG VT VK+Q CG+CWAFS+ GA+EG K TG L++LS Q L+DC S N
Sbjct: 116 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSEN 174
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GCGGG M A+Q+V KN GID+E YPY GQ C + GY+++PE NEK
Sbjct: 175 DGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGK-AAKCRGYREIPEGNEK 233
Query: 238 QLLQAVV-AQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWI 294
L +AV PVSV I S +FQ YS G++ S +L+HAVL VGY + G +WI
Sbjct: 234 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWI 293
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
IKNSWG +WG GY+ M RN N+ CGI LAS+P
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNA---CGIANLASFP 327
>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
Length = 329
Score = 237 bits (604), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/337 (40%), Positives = 192/337 (56%), Gaps = 16/337 (4%)
Query: 1 MNSLAFFLLSILLLSSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFV 60
M L LL ++ + P ++ +E W K H K Y+S+ ++ R I+E N ++
Sbjct: 1 MWGLKVLLLPVMSFALYPEEI---LDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYI 57
Query: 61 TQHN---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDV 117
+ HN ++G ++ L++N D+T++E G + R N ++ P
Sbjct: 58 SIHNLEASLGVHTYELAMNHLGDMTNEEVVQKMTGLKVPA--SHSRSNDTLYIPDWEGRA 115
Query: 118 PASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSYN 177
P S+D+RKKG VT VK+Q CG+CWAFS+ GA+EG K TG L++LS Q L+DC S N
Sbjct: 116 PDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDC-VSEN 174
Query: 178 SGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEK 237
GCGGG M A+Q+V KN GID+E YPY GQ C + GY+++PE NEK
Sbjct: 175 DGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTGK-AAKCRGYREIPEGNEK 233
Query: 238 QLLQAVV-AQPVSVGICGSERAFQLYSSGIFTGPC--STSLDHAVLIVGYDSENGVDYWI 294
L +AV PVSV I S +FQ YS G++ S +L+HAVL VGY + G +WI
Sbjct: 234 ALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWI 293
Query: 295 IKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYP 331
IKNSWG +WG GY+ M RN N+ CGI LAS+P
Sbjct: 294 IKNSWGENWGNKGYILMARNKNNA---CGIANLASFP 327
>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
Length = 333
Score = 236 bits (603), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 136/339 (40%), Positives = 186/339 (54%), Gaps = 24/339 (7%)
Query: 7 FLLSILLL--SSLPLNYCSDINELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN 64
F+L+ L L +S L + + + W H + Y +E +R ++E N + HN
Sbjct: 5 FILAALCLGIASATLTFNHSLEAQWTKWKAMHNRLYGMNEEGWRR-AVWEKNMKMIELHN 63
Query: 65 ---NMGNSSFTLSLNAFADLTHQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASI 121
+ G SFT+++N F D+T +EF+ GF + R+ Q P + P S+
Sbjct: 64 QEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ----NRKPRKGKVFQEP-LFYEAPRSV 118
Query: 122 DWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGC 180
DWR+KG VT VK+Q CG+CWAFSATGA+EG TG LVSLSEQ L+DC N GC
Sbjct: 119 DWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNEGC 178
Query: 181 GGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLL 240
GGLMDYA+Q+V N G+D+E+ YPY C K + G+ D+P+ EK L+
Sbjct: 179 NGGLMDYAFQYVADNGGLDSEESYPYEATEESC-KYNPEYSVANDTGFVDIPK-QEKALM 236
Query: 241 QAV-VAQPVSVGICGSERAFQLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYW 293
+AV P+SV I +F Y GI+ P S +DH VL+VGY E + YW
Sbjct: 237 KAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNSKYW 296
Query: 294 IIKNSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPT 332
++KNSWG WGM GY+ M ++ N CGI ASYPT
Sbjct: 297 LVKNSWGEEWGMGGYIKMAKDRRNH---CGIASAASYPT 332
>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
Length = 208
Score = 235 bits (600), Expect = 4e-61, Method: Compositional matrix adjust.
Identities = 119/218 (54%), Positives = 146/218 (66%), Gaps = 10/218 (4%)
Query: 117 VPASIDWRKKGAVTEVKDQASCGACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY 176
+P IDWRKKGAVT VK+Q SCG+CWAFS +E IN+I TG+L+SLSEQEL+DCD+
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59
Query: 177 NSGCGGGLMDYAYQFVIKNHGIDTEKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNE 236
N GC GG +AYQ++I N GIDT+ +YPY+ G C + +V+IDGY VP NE
Sbjct: 60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPC---QAASKVVSIDGYNGVPFCNE 116
Query: 237 KQLLQAVVAQPVSVGICGSERAFQLYSSGIFTGPCSTSLDHAVLIVGYDSENGVDYWIIK 296
L QAV QP +V I S FQ YSSGIF+GPC T L+H V IVGY + +YWI++
Sbjct: 117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQA----NYWIVR 172
Query: 297 NSWGRSWGMNGYMHMQRNTGNSLGICGINMLASYPTKT 334
NSWGR WG GY+ M R G G+CGI L YPTK
Sbjct: 173 NSWGRYWGEKGYIRMLRVGG--CGLCGIARLPYYPTKA 208
>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 235 bits (599), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 124/318 (38%), Positives = 186/318 (58%), Gaps = 22/318 (6%)
Query: 25 INELFETWCKQHGKAYSSEQEKQQRLKIFEDNYAFVTQHN---NMGNSSFTLSLNAFADL 81
N + W H + Y + +E+ +R ++E N + HN + G FT+ +NAF D+
Sbjct: 25 FNAQWHQWKSTHRRLYGTNEEEWRRA-VWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDM 83
Query: 82 THQEFKASFLGFSAASIDHDRRRNASVQSPGNLRDVPASIDWRKKGAVTEVKDQASCGAC 141
T++EF+ G+ H + + + + +P ++DWR+KG VT VK+Q CG+C
Sbjct: 84 TNEEFRQIVNGYR-----HQKHKKGRLFQEPLMLQIPKTVDWREKGCVTPVKNQGQCGSC 138
Query: 142 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDRSY-NSGCGGGLMDYAYQFVIKNHGIDT 200
WAFSA+G +EG + TG L+SLSEQ L+DC N GC GGLMD+A+Q++ +N G+D+
Sbjct: 139 WAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDS 198
Query: 201 EKDYPYRGQAGQCNKQKLNRHIVTIDGYKDVPENNEKQLLQAV-VAQPVSVGICGSERAF 259
E+ YPY + G C K + + G+ D+P+ EK L++AV P+SV + S +
Sbjct: 199 EESYPYEAKDGSC-KYRAEYAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSL 256
Query: 260 QLYSSGIFTGP--CSTSLDHAVLIVGYDSE----NGVDYWIIKNSWGRSWGMNGYMHMQR 313
Q YSSGI+ P S LDH VL+VGY E N YW++KNSWG+ WGM+GY+ + +
Sbjct: 257 QFYSSGIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAK 316
Query: 314 NTGNSLGICGINMLASYP 331
+ N CG+ ASYP
Sbjct: 317 DRNNH---CGLATAASYP 331
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.320 0.134 0.430
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 162,185,548
Number of Sequences: 539616
Number of extensions: 7129906
Number of successful extensions: 26828
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 230
Number of HSP's successfully gapped in prelim test: 42
Number of HSP's that attempted gapping in prelim test: 25704
Number of HSP's gapped (non-prelim): 404
length of query: 419
length of database: 191,569,459
effective HSP length: 120
effective length of query: 299
effective length of database: 126,815,539
effective search space: 37917846161
effective search space used: 37917846161
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 63 (28.9 bits)
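Note on the statistics above: the effective lengths and search space follow from the reported query length, database size, and effective HSP length by simple arithmetic. The sketch below reproduces that arithmetic and also estimates a bit score and E-value from a raw score using the standard Karlin-Altschul conversion. It assumes that the second "Lambda K H" row (0.267, 0.0410) holds the gapped parameters. The function names are illustrative, not part of BLAST. Because this report uses compositional matrix adjustment, the estimated bit score and E-value should only be expected to land near the printed values, not match them exactly.

```python
import math

# Values copied from the statistics footer of this report.
query_len = 419
db_letters = 191_569_459
db_seqs = 539_616
eff_hsp_len = 120

# Assumption: the second "Lambda K H" row holds the gapped parameters.
lam, K = 0.267, 0.0410

# Each sequence (query and every database entry) is shortened by the
# effective HSP length to correct for edge effects.
eff_query = query_len - eff_hsp_len
eff_db = db_letters - db_seqs * eff_hsp_len
search_space = eff_query * eff_db


def bit_score(raw):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw - math.log(K)) / math.log(2)


def expect(raw):
    """Estimate the E-value as search_space * 2^(-bit score)."""
    return search_space * 2.0 ** (-bit_score(raw))


print(eff_query, eff_db, search_space)  # 299 126815539 37917846161
print(bit_score(604), expect(604))      # compare with the raw-score-604 hits
```

The three integers agree with the "effective length of query", "effective length of database", and "effective search space" lines; the last line can be compared against the Cathepsin K hits with raw score 604.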