BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017419
(372 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 474 bits (1220), Expect = e-133, Method: Compositional matrix adjust.
Identities = 221/352 (62%), Positives = 280/352 (79%), Gaps = 8/352 (2%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSW-RTDDEVMTIYQTWLAKHGK--TSNGMGHN 69
++FL ++ SSA DMSIISYD H S++ R++ EVM+IY+ WL KHGK + N +
Sbjct: 10 ILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRSEAEVMSIYEAWLVKHGKAQSQNSLVEK 69
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
++RF+IFKDNLRF+DEHN N +Y++GL +FADLTN+EYR+ YLG + + K +
Sbjct: 70 DRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKK----GERRT 125
Query: 130 SQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLS 189
S RY + GDELPES+DWR+KGAV VKDQG CGSCWAFST+ AVEGIN+IVTG+LI+LS
Sbjct: 126 SLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLS 185
Query: 190 EQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSI 249
EQELVDCD N GCNGGLMDYAF+FII+NGG+D+++DYPY G + CD R+NAKVV+I
Sbjct: 186 EQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTI 245
Query: 250 DGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG 309
D YEDV + E SLKKAVA QP+S+AIEAGGRAFQ Y+SG+F G CG+ LDHGVVAVGYG
Sbjct: 246 DSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG 305
Query: 310 TENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQ 361
TENG DYW+VRNSWG WGE+GY+++ RN+ +++GKCGIA+E SYP+KN +
Sbjct: 306 TENGKDYWIVRNSWGKSWGESGYLRMARNIA-SSSGKCGIAIEPSYPIKNGE 356
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 472 bits (1214), Expect = e-132, Method: Compositional matrix adjust.
Identities = 227/356 (63%), Positives = 277/356 (77%), Gaps = 7/356 (1%)
Query: 23 SAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGH----NEKRFQIFKD 78
++ D SII+ WRTD+EV +IY W A+HGKT+N +KRF IFKD
Sbjct: 20 ASGDESIINDHLQLPSDGKWRTDEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKD 79
Query: 79 NLRFIDEHNSLNR--TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRY-AC 135
NLRFID HN N+ TYK+GL KF DLTN+EYR +YLG R++ RR+ K+K +Q+Y A
Sbjct: 80 NLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAA 139
Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
G E+PE+VDWR+KGAVNP+KDQG+CGSCWAFST AAVEGINKIVTGELISLSEQELVD
Sbjct: 140 VNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVD 199
Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDV 255
CD+ N GCNGGLMDYAFQFI++NGG+++E+DYPY G KC+ +N++VVSIDGYEDV
Sbjct: 200 CDKSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDV 259
Query: 256 SPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVD 315
DE +LKKA++ QPVSVAIEAGGR FQHY+SG+FTG CG+ LDH VVAVGYG+ENGVD
Sbjct: 260 PTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGSENGVD 319
Query: 316 YWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
YW+VRNSWG WGE GY++++RNL + +GKCGIA+EASYPVK S N + SS
Sbjct: 320 YWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVKYSPNPVRGNTISS 375
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 443 bits (1140), Expect = e-124, Method: Compositional matrix adjust.
Identities = 211/341 (61%), Positives = 261/341 (76%), Gaps = 17/341 (4%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
DMSI+SY R+++E +Y W A+HGK+ N +G E+R+ F+DNLR+IDE
Sbjct: 22 DMSIVSYGE--------RSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE 73
Query: 86 HNSLN----RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDEL 141
HN+ ++++GLN+FADLTNEEYR YLG R+ +R + S RY + L
Sbjct: 74 HNAAADAGVHSFRLGLNRFADLTNEEYRDTYLGLRNKPRRE----RKVSDRYLAADNEAL 129
Query: 142 PESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKIN 201
PESVDWR KGAV +KDQG CGSCWAFS +AAVEGIN+IVTG+LISLSEQELVDCD N
Sbjct: 130 PESVDWRTKGAVAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYN 189
Query: 202 AGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEM 261
GCNGGLMDYAF FII NGG+D+E DYPY G + +CD +R+NAKVV+ID YEDV+P E
Sbjct: 190 EGCNGGLMDYAFDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSET 249
Query: 262 SLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRN 321
SL+KAVA+QPVSVAIEAGGRAFQ Y SG+FTG+CG+ALDHGV AVGYGTENG DYW+VRN
Sbjct: 250 SLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRN 309
Query: 322 SWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
SWG WGE+GYV+++RN + ++GKCGIA+E SYP+K +N
Sbjct: 310 SWGKSWGESGYVRMERN-IKASSGKCGIAVEPSYPLKKGEN 349
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 442 bits (1138), Expect = e-123, Method: Compositional matrix adjust.
Identities = 206/324 (63%), Positives = 265/324 (81%), Gaps = 8/324 (2%)
Query: 49 MTIYQTWLAKHGKT---SNGM-GHNEKRFQIFKDNLRFIDEHNSLNR--TYKVGLNKFAD 102
M+IY W +HGK+ SNG+ ++RF IFKDNLRFID HN N+ TYK+GL FA+
Sbjct: 1 MSIYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFAN 60
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAG-DELPESVDWREKGAVNPVKDQGS 161
LTN+EYR++YLG R++ RR+ K+K + +Y+ DE+P +VDWR+KGAVN +KDQG+
Sbjct: 61 LTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGT 120
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFST AAVEGINKIVTGEL+SLSEQELVDCD+ N GCNGGLMDYAFQFI++NGG
Sbjct: 121 CGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 180
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGR 281
+++E+DYPY G KC+ +N++VV+IDGYEDV DE +LK+AV+ QPVSVAI+AGGR
Sbjct: 181 LNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGR 240
Query: 282 AFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLD 341
AFQHY+SG+FTG+CG+ +DH VVAVGYG+ENGVDYW+VRNSWG+ WGE+GY++++RN+
Sbjct: 241 AFQHYQSGIFTGKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVA- 299
Query: 342 TNTGKCGIAMEASYPVKNSQNSAK 365
+ +GKCGIA+EASYPVK S N +
Sbjct: 300 SKSGKCGIAIEASYPVKYSPNPVR 323
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 412 bits (1058), Expect = e-114, Method: Compositional matrix adjust.
Identities = 207/351 (58%), Positives = 261/351 (74%), Gaps = 14/351 (3%)
Query: 20 SSSSAADMSIISYDNNHDHSS--SWRTDDEVMTIYQTWLAKHGKTS-NGMG-HNEKRFQI 75
++++A DMSIISY+ H T+ E Y WLA++G S N +G +E+RF +
Sbjct: 18 AATAAPDMSIISYNAEHGARGLEEGPTEAEARAAYDLWLAENGGGSPNALGGEHERRFLV 77
Query: 76 FKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
F DNL+F+D HN+ +++G+N+FADLTNEE+RA +LG + +S+ A +R
Sbjct: 78 FWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAKV-----AERSRAAGER 132
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
Y +ELPESVDWREKGAV PVK+QG CGSCWAFS V+ VE IN++VTGE+I+LSEQE
Sbjct: 133 YRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITLSEQE 192
Query: 193 LVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
LV+C N+GCNGGLMD AF FII+NGG+D+E DYPY + KCD +R NAKVVSIDG
Sbjct: 193 LVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDG 252
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
+EDV DE SL+KAVA QPVSVAIEAGGR FQ Y SGVF+G CG++LDHGVVAVGYGT+
Sbjct: 253 FEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTD 312
Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
NG DYW+VRNSWG WGE+GYV+++RN ++ TGKCGIAM ASYP K+ N
Sbjct: 313 NGKDYWIVRNSWGPKWGESGYVRMERN-INVTTGKCGIAMMASYPTKSGAN 362
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 404 bits (1039), Expect = e-112, Method: Compositional matrix adjust.
Identities = 200/322 (62%), Positives = 251/322 (77%), Gaps = 9/322 (2%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFA 101
R + EV +Y+ WL ++ K NG+G E+RF+IFKDNL+F+DEHNS+ +RT++VGL +FA
Sbjct: 35 RNETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFA 94
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
DLTNEE+RA+YL R +R K V ++RY K GD LP+ VDWR GAV VKDQG+
Sbjct: 95 DLTNEEFRAIYL--RKKMER--TKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGN 150
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNG 220
CGSCWAFS V AVEGIN+I TGELISLSEQELVDCDR +NAGC+GG+M+YAF+FI++NG
Sbjct: 151 CGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNG 210
Query: 221 GMDSEQDYPYLGAE-NKCDPSRRN-AKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
G++++QDYPY + C+ + N +VV+IDGYEDV DE SLKKAVA QPVSVAIEA
Sbjct: 211 GIETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEA 270
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
+AFQ Y+SGV TG CG +LDHGVV VGYG+ +G DYW++RNSWG +WG++GYVKLQRN
Sbjct: 271 SSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRN 330
Query: 339 LLDTNTGKCGIAMEASYPVKNS 360
+D GKCGIAM SYP K+S
Sbjct: 331 -IDDPFGKCGIAMMPSYPTKSS 351
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 388 bits (996), Expect = e-107, Method: Compositional matrix adjust.
Identities = 203/373 (54%), Positives = 261/373 (69%), Gaps = 26/373 (6%)
Query: 5 SMFLAISTLVFLFFISSSSAAD-------MSIISYDNNHDHSSSWRTDDEVMTIYQTWLA 57
S+ A++ FL +++ + MSII Y+ H RT+ E Y WLA
Sbjct: 8 SVAAALAMACFLLILAAFAPPAAAAPPDIMSIIRYNAEHGVRGLERTEAEARAAYDLWLA 67
Query: 58 KHGKTSNG------MGHNEKRFQIFKDNLRFIDEHNSL---NRTYKVGLNKFADLTNEEY 108
+H + G +G +E+RF++F DNL+F+D HN+ +++G+N+FADLTN E+
Sbjct: 68 RHRRGGGGGSRNGFIGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEF 127
Query: 109 RAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAV-NPVKDQGSCGSCWA 167
RA YLGT + R + + Y + LP+SVDWR+KGAV PVK+QG CGSCWA
Sbjct: 128 RATYLGTTPAGRGRRV-----GEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWA 182
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRK-INAGCNGGLMDYAFQFIIQNGGMDSEQ 226
FS VAAVEGINKIVTGEL+SLSEQELV+C R N+GCNGG+MD AF FI +NGG+D+E+
Sbjct: 183 FSAVAAVEGINKIVTGELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEE 242
Query: 227 DYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHY 286
DYPY + KC+ ++R+ KVVSIDG+EDV DE+SL+KAVA QPVSVAI+AGGR FQ Y
Sbjct: 243 DYPYTAMDGKCNLAKRSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLY 302
Query: 287 ESGVFTGECGSALDHGVVAVGYGTE--NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNT 344
+SGVFTG CG+ LDHGVVAVGYGT+ G YW VRNSWG DWGENGY++++RN+ T
Sbjct: 303 DSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVT-ART 361
Query: 345 GKCGIAMEASYPV 357
GKCGIAM ASYP+
Sbjct: 362 GKCGIAMMASYPI 374
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 384 bits (986), Expect = e-106, Method: Compositional matrix adjust.
Identities = 197/374 (52%), Positives = 258/374 (68%), Gaps = 22/374 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M F+++S L F + + I+S N + + RT+DEV +Y++WL K+G
Sbjct: 1 MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G E+RF+IFK+ LRFIDEHN+ NR+YKVGLN+FADLT+EE+R+ YLG S +
Sbjct: 51 KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGS 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K+KV S RY + G LP VDWR GAV +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165
Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG LISLSEQEL+DC R N GCNGG + FQFII NGG+++E++YPY + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+N K V+ID YE+V +E +L+ AV QPVSVA++A G AF+HY SG+FTG CG+A
Sbjct: 226 LDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTA 285
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DH V VGYGTE G+DYW+V+NSW + WGE GY+++ RN+ G CGIA SYPVK
Sbjct: 286 IDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343
Query: 359 -NSQNSAKPKPHSS 371
N+QN PKP+SS
Sbjct: 344 YNNQN--HPKPYSS 355
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 378 bits (970), Expect = e-104, Method: Compositional matrix adjust.
Identities = 195/374 (52%), Positives = 256/374 (68%), Gaps = 22/374 (5%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
M F+++S L F + + I+S N + + RT+DEV +Y++WL K+G
Sbjct: 1 MGLPKSFVSMSLLFF---------STLLILSLAFNAKNLTQ-RTNDEVKAMYESWLIKYG 50
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGLNKFADLTNEEYRAMYLGTRSDA 119
K+ N +G E+RF+IFK+ LRFIDEHN+ NR+YKVGLN+FADLT+EE+R+ YL S +
Sbjct: 51 KSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGS 110
Query: 120 KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINK 179
K+KV S RY + G LP VDWR GAV +K QG CG CWAFS +A VEGINK
Sbjct: 111 N----KTKV-SNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINK 165
Query: 180 IVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCD 238
IVTG LISLSEQEL+DC R N GCNGG + FQFII NGG+++E++YPY + +C+
Sbjct: 166 IVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECN 225
Query: 239 PSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA 298
+N K V+ID YE+V +E +L+ AV QPVSVA++A G AF+ Y SG+FTG CG+A
Sbjct: 226 VDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTA 285
Query: 299 LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
+DH V VGYGTE G+DYW+V+NSW + WGE GY+++ RN+ G CGIA SYPVK
Sbjct: 286 VDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNV--GGAGTCGIATMPSYPVK 343
Query: 359 -NSQNSAKPKPHSS 371
N+QN PKP+SS
Sbjct: 344 YNNQN--HPKPYSS 355
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 366 bits (940), Expect = e-100, Method: Compositional matrix adjust.
Identities = 184/360 (51%), Positives = 239/360 (66%), Gaps = 9/360 (2%)
Query: 10 ISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHN 69
+ L+ +F S YD+ S ++ + T+Y W + H +
Sbjct: 1 MKKLLLIFLFSLVILQTACGFDYDDKEIES-----EEGLSTLYDRWRSHHS-VPRSLNER 54
Query: 70 EKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVA 129
EKRF +F+ N+ + N NR+YK+ LNKFADLT E++ Y G+ R L K
Sbjct: 55 EKRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRG 114
Query: 130 SQR--YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
S++ Y + +LP SVDWR+KGAV +K+QG CGSCWAFSTVAAVEGINKI T +L+S
Sbjct: 115 SKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVS 174
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVV 247
LSEQELVDCD K N GCNGGLM+ AF+FI +NGG+ +E YPY G + KCD S+ N +V
Sbjct: 175 LSEQELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLV 234
Query: 248 SIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVG 307
+IDG+EDV DE +L KAVA+QPVSVAI+AG FQ Y GVFTG CG+ L+HGV AVG
Sbjct: 235 TIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVG 294
Query: 308 YGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPK 367
YG+E G YW+VRNSWG++WGE GY+K++R +D G+CGIAMEASYP+K S ++ PK
Sbjct: 295 YGSERGKKYWIVRNSWGAEWGEGGYIKIERE-IDEPEGRCGIAMEASYPIKLSSSNPTPK 353
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 362 bits (928), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 179/359 (49%), Positives = 240/359 (66%), Gaps = 12/359 (3%)
Query: 1 MATASMFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
++ S+ +AIS L + A D SI+ Y H ++ D+++ ++++W+++H
Sbjct: 8 LSKFSLLVAISASALL---CCAFARDFSIVGYTPEHLTNT-----DKLLELFESWMSEHS 59
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF++F++NL ID+ N+ +Y +GLN+FADLT+EE++ YLG AK
Sbjct: 60 KAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGL---AK 116
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+ + + S + + +LP+SVDWR+KGAV PVKDQG CGSCWAFSTVAAVEGIN+I
Sbjct: 117 PQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQI 176
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG L SLSEQEL+DCD N+GCNGGLMDYAFQ+II GG+ E DYPYL E C
Sbjct: 177 TTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQ 236
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ + + V+I GYEDV D+ SL KA+A QPVSVAIEA GR FQ Y+ GVF G+CG+ LD
Sbjct: 237 KEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGKCGTDLD 296
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
HGV AVGYG+ G DY +V+NSWG WGE G+++++RN G CGI ASYP K
Sbjct: 297 HGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRN-TGKPEGLCGINKMASYPTKT 354
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 360 bits (925), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 181/354 (51%), Positives = 243/354 (68%), Gaps = 6/354 (1%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LV + S ++A DMS++SYD+N+ S + D E I+++W+ KHGK + E+R
Sbjct: 12 LVAMVIASCATAIDMSVVSYDDNNRLHSVF--DAEASLIFESWMVKHGKVYGSVAEKERR 69
Query: 73 FQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQR 132
IF+DNLRFI+ N+ N +Y++GL FADL+ EY+ + G R + +S R
Sbjct: 70 LTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVCHGADPRPPRNHV-FMTSSDR 128
Query: 133 YACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQE 192
Y A D LP+SVDWR +GAV VKDQG C SCWAFSTV AVEG+NKIVTGEL++LSEQ+
Sbjct: 129 YKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQD 188
Query: 193 LVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKVVSIDG 251
L++C+++ N GC GG ++ A++FI++NGG+ ++ DYPY CD + N K V IDG
Sbjct: 189 LINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVMIDG 247
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
YE++ DE +L KAVA QPV+ I++ R FQ YESGVF G CG+ L+HGVV VGYGTE
Sbjct: 248 YENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYGTE 307
Query: 312 NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
NG DYWLV+NS G WGE GY+K+ RN+ + G CGIAM ASYP+KNS ++ K
Sbjct: 308 NGRDYWLVKNSRGITWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 360
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 356 bits (914), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 169/327 (51%), Positives = 227/327 (69%), Gaps = 3/327 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H +++ + +Y+ W + H S +G KRF +FK N+ + N +++ YK+ L
Sbjct: 26 HEKDLESEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANVMHVHNTNKMDKPYKLKL 84
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R+ Y G++ + + S+ S + + +P SVDWR+KGAV VK
Sbjct: 85 NKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVK 144
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFST+ AVEGIN+I T +L+SLSEQELVDCD++ N GCNGGLM+ AF+FI
Sbjct: 145 DQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIK 204
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
Q GG+ +E +YPY E CD S+ N VSIDG+E+V DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAID 264
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVFTG+C + L+HGV VGYGT +G +YW+VRNSWG +WGE GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQ 324
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
RN + G CGIAM ASYP+KNS ++
Sbjct: 325 RN-ISKKEGLCGIAMMASYPIKNSSDN 350
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 354 bits (909), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 171/324 (52%), Positives = 221/324 (68%), Gaps = 3/324 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H+ +++ + +Y+ W + H + + KRF +FK N++ I E N +++YK+ L
Sbjct: 24 HNKDVESENSLWELYERWRSHH-TVARSLEEKAKRFNVFKHNVKHIHETNKKDKSYKLKL 82
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKF D+T+EE+R Y G+ R K A++ + + LP SVDWR+ GAV PVK
Sbjct: 83 NKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVK 142
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
+QG CGSCWAFSTV AVEGIN+I T +L SLSEQELVDCD N GCNGGLMD AF+FI
Sbjct: 143 NQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIK 202
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
+ GG+ SE YPY ++ CD ++ NA VVSIDG+EDV E L KAVA+QPVSVAI+
Sbjct: 203 EKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAID 262
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVFTG CG+ L+HGV VGYGT +G YW+V+NSWG +WGE GY+++Q
Sbjct: 263 AGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQ 322
Query: 337 RNLLDTNTGKCGIAMEASYPVKNS 360
R + G CGIAMEASYP+KNS
Sbjct: 323 RGIRHKE-GLCGIAMEASYPLKNS 345
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 353 bits (907), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 170/314 (54%), Positives = 216/314 (68%), Gaps = 3/314 (0%)
Query: 51 IYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRA 110
+Y+ W + H S + +KRF +FK N + N +++ YK+ LNKFAD+TN E+R
Sbjct: 37 LYERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRN 95
Query: 111 MYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFST 170
Y G++ R + + + D +P SVDWR+KGAV VKDQG CGSCWAFST
Sbjct: 96 TYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFST 155
Query: 171 VAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPY 230
+ AVEGIN+I T +L+SLSEQELVDCD N GCNGGLMDYAF+FI Q GG+ +E +YPY
Sbjct: 156 IVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215
Query: 231 LGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGV 290
+ CD S+ NA VSIDG+E+V DE +L KAVA+QPVSVAI+AGG FQ Y GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275
Query: 291 FTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGI 349
FTG CG+ LDHGV VGYGT +G YW V+NSWG +WGE GY++++R + D G CGI
Sbjct: 276 FTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKE-GLCGI 334
Query: 350 AMEASYPVKNSQNS 363
AMEASYP+K S N+
Sbjct: 335 AMEASYPIKKSSNN 348
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 351 bits (901), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 168/327 (51%), Positives = 226/327 (69%), Gaps = 3/327 (0%)
Query: 38 HSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGL 97
H +++ + +Y+ W + H S +G KRF +FK NL + N +++ YK+ L
Sbjct: 26 HDKDLASEESLWDLYERWRSHH-TVSRSLGEKHKRFNVFKANLMHVHNTNKMDKPYKLKL 84
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
NKFAD+TN E+R+ Y G++ + R + + + + +P SVDWR+KGAV VK
Sbjct: 85 NKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVK 144
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
DQG CGSCWAFSTV AVEGIN+I T +L++LSEQELVDCD++ N GCNGGLM+ AF+FI
Sbjct: 145 DQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIK 204
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIE 277
Q GG+ +E +YPY E CD S+ N VSIDG+E+V DE +L KAVA+QPVSVAI+
Sbjct: 205 QKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAID 264
Query: 278 AGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE-NGVDYWLVRNSWGSDWGENGYVKLQ 336
AGG FQ Y GVFTG+C + L+HGV VGYGT +G +YW+VRNSWG +WGE+GY+++Q
Sbjct: 265 AGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQ 324
Query: 337 RNLLDTNTGKCGIAMEASYPVKNSQNS 363
RN + G CGIAM SYP+KNS ++
Sbjct: 325 RN-ISKKEGLCGIAMLPSYPIKNSSDN 350
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 350 bits (899), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 171/333 (51%), Positives = 229/333 (68%), Gaps = 8/333 (2%)
Query: 26 DMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDE 85
D SI+ Y + D+++ +++ W++ K + RF++FKDNL+ IDE
Sbjct: 30 DYSIVGYS-----PEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDE 84
Query: 86 HNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESV 145
N ++Y +GLN+FADL++EE++ MYLG ++D RR + A +A + + +P+SV
Sbjct: 85 TNKKGKSYWLGLNEFADLSHEEFKKMYLGLKTDIVRRDEERSYA--EFAYRDVEAVPKSV 142
Query: 146 DWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCN 205
DWR+KGAV VK+QGSCGSCWAFSTVAAVEGINKIVTG L +LSEQEL+DCD N GCN
Sbjct: 143 DWRKKGAVAEVKNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCN 202
Query: 206 GGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKK 265
GGLMDYAF++I++NGG+ E+DYPY E C+ + ++ V+I+G++DV DE SL K
Sbjct: 203 GGLMDYAFEYIVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLK 262
Query: 266 AVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVRNSWGS 325
A+A QP+SVAI+A GR FQ Y GVF G CG LDHGV AVGYG+ G DY +V+NSWG
Sbjct: 263 ALAHQPLSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGP 322
Query: 326 DWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
WGE GY++L+RN G CGI AS+P K
Sbjct: 323 KWGEKGYIRLKRN-TGKPEGLCGINKMASFPTK 354
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 349 bits (896), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 175/359 (48%), Positives = 242/359 (67%), Gaps = 9/359 (2%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRT-----DDEVMTIYQTWLAKHGKTSNGMG 67
L+ L S ++A DMS++S ++NH ++ D E ++++W+ KHGK + +
Sbjct: 12 LLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESWMVKHGKVYDSVA 71
Query: 68 HNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSK 127
E+R IF+DNLRFI N+ N +Y++GLN+FADL+ EY + G R +
Sbjct: 72 EKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGEICHGADPRPPRNHV-FM 130
Query: 128 VASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELIS 187
+S RY GD LP+SVDWR +GAV VKDQG C SCWAFSTV AVEG+NKIVTGEL++
Sbjct: 131 TSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVEGLNKIVTGELVT 190
Query: 188 LSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS-RRNAKV 246
LSEQ+L++C+++ N GC GG ++ A++FI+ NGG+ ++ DYPY C+ + + K
Sbjct: 191 LSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGVCEGRLKEDNKN 249
Query: 247 VSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAV 306
V IDGYE++ DE +L KAVA QPV+ +++ R FQ YESGVF G CG+ L+HGVV V
Sbjct: 250 VMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVVVV 309
Query: 307 GYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
GYGTENG DYW+V+NS G WGE GY+K+ RN+ + G CGIAM ASYP+KNS ++ K
Sbjct: 310 GYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPR-GLCGIAMRASYPLKNSFSTDK 367
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 343 bits (880), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 172/332 (51%), Positives = 225/332 (67%), Gaps = 11/332 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
++D + +Y+ W H + + +RF +FK+N++FI E N + YK+ LNKF D
Sbjct: 32 SEDSLWNLYEKWRTHH-TVARDLDEKNRRFNVFKENVKFIHEFNQKKDAPYKLALNKFGD 90
Query: 103 LTNEEYRAMYLGTRSDAKR--RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQG 160
+TN+E+R+ Y G++ R R ++ S Y G S+DWR KGAV VKDQG
Sbjct: 91 MTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYE-NVGSLPAASIDWRAKGAVTGVKDQG 149
Query: 161 SCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNG 220
CGSCWAFST+A+VEGIN+I TGEL+SLSEQELVDCD N GCNGGLMDYAF+F IQ
Sbjct: 150 QCGSCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEF-IQKN 208
Query: 221 GMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGG 280
G+ +E YPY + C + N+ VVSIDG++DV +E +L +AVA+QP+SV+IEA G
Sbjct: 209 GITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDVPANNENALMQAVANQPISVSIEASG 268
Query: 281 RAFQHYESGVFTGECGSALDHGVVAVGYG-TENGVDYWLVRNSWGSDWGENGYVKLQRNL 339
FQ Y GVFTG CG+ LDHGV VGYG T +G YW+V+NSWG +WGE+GY+++QR +
Sbjct: 269 YGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGI 328
Query: 340 LDTNTGKCGIAMEASYPVKNSQNSAKPKPHSS 371
D GKCGIAMEASYP+K S N PK S+
Sbjct: 329 SDKR-GKCGIAMEASYPIKTSAN---PKNSST 356
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 336 bits (861), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 158/233 (67%), Positives = 192/233 (82%), Gaps = 1/233 (0%)
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
SK S RY K GD LPES+DWREKG + VKDQGSCGSCWAFS VAA+E IN IVTG L
Sbjct: 3 SKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNL 62
Query: 186 ISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAK 245
ISLSEQELVDCDR N GC+GGLMDYAF+F+I+NGG+D+E+DYPY CD R+NAK
Sbjct: 63 ISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAK 122
Query: 246 VVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVA 305
VV ID YEDV +E +L+KAVA QPVS+A+EAGGR FQHY+SG+FTG+CG+A+DHGVV
Sbjct: 123 VVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVI 182
Query: 306 VGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
GYGTENG+DYW+VRNSWG++ ENGY+++QRN + +++G CG+A+E SYPVK
Sbjct: 183 AGYGTENGMDYWIVRNSWGANCRENGYLRVQRN-VSSSSGLCGLAIEPSYPVK 234
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 331 bits (848), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 176/359 (49%), Positives = 227/359 (63%), Gaps = 8/359 (2%)
Query: 16 LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQI 75
LFFI S + S + D T++ V +Y+ W H S KRF +
Sbjct: 3 LFFIVLISFLSLLQASKGFDFD-EKELETEENVWKLYERWRGHH-SVSRASHEAIKRFNV 60
Query: 76 FKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYAC 135
F+ N+ + N N+ YK+ +N+FAD+T+ E+R+ Y G+ R L K S +
Sbjct: 61 FRHNVLHVHRTNKKNKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGFMY 120
Query: 136 KAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVD 195
+ +P SVDWREKGAV VK+Q CGSCWAFSTVAAVEGINKI T +L+SLSEQELVD
Sbjct: 121 ENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVD 180
Query: 196 CDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK-CDPSRRNAKVVSIDGYED 254
CD + N GC GGLM+ AF+FI NGG+ +E+ YPY ++ + C + + V+IDG+E
Sbjct: 181 CDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGHEH 240
Query: 255 VSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYG-TENG 313
V DE L KAVA QPVSVAI+AG FQ Y GVF GECG+ L+HGVV VGYG T+NG
Sbjct: 241 VPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNG 300
Query: 314 VDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKPHSSA 372
YW+VRNSWG +WGE GYV+++R + + N G+CGIAMEASYP K S+ P H S
Sbjct: 301 TKYWIVRNSWGPEWGEGGYVRIERGISE-NEGRCGIAMEASYPTK---LSSTPSTHESV 355
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 327 bits (838), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 177/337 (52%), Positives = 233/337 (69%), Gaps = 14/337 (4%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNS-LNRTYKVGL 97
+ S R + EV+T+Y+ WL ++GK NG+G E+RF+IFKDNL+ I+EHNS NR+Y+ GL
Sbjct: 28 TESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGL 87
Query: 98 NKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNP-V 156
NKF+DLT +E++A YLG + + K S VA +RY K GD LP+ VDWRE+GAV P V
Sbjct: 88 NKFSDLTADEFQASYLGGKMEKKSL---SDVA-ERYQYKEGDVLPDEVDWRERGAVVPRV 143
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
K QG CGSCWAF+ AVEGIN+I TGEL+SLSEQEL+DCDR N GC GG +AF+F
Sbjct: 144 KRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEF 203
Query: 216 IIQNGGMDSEQDYPYLGAEN-KCDP-SRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVS 273
I +NGG+ S++ Y Y G + C + +VV+I+G+E V DEMSLKKAVA QP+S
Sbjct: 204 IKENGGIVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPIS 263
Query: 274 VAIEAGGRAFQHYESGVFTGECGSAL-DHGVVAVGYGTENGV-DYWLVRNSWGSDWGENG 331
V I A Y+SGV+ G C + DH V+ VGYGT + DYWL+RNSWG +WGE G
Sbjct: 264 VMISAAN--MSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGG 321
Query: 332 YVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
Y++LQRN + TGKC +A+ YP+K++ +S P
Sbjct: 322 YLRLQRNFHEP-TGKCAVAVAPVYPIKSNSSSHLLSP 357
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 316 bits (810), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 164/331 (49%), Positives = 211/331 (63%), Gaps = 13/331 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
+++ + +Y+ W + H + +RF FK N FI HN + Y++ LN+F D
Sbjct: 38 SEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGD 96
Query: 103 LTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
+ E+RA ++G R D + V YA +LP SVDWR+KGAV VKDQG
Sbjct: 97 MDQAEFRATFVGDLRRDTPSK--PPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFSTV +VEGIN I TG L+SLSEQEL+DCD N GC GGLMD AF++I NGG
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 222 MDSEQDYPYLGAENKCDPSR---RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
+ +E YPY A C+ +R + VV IDG++DV E L +AVA+QPVSVA+EA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQR 337
G+AF Y GVFTGECG+ LDHGV VGYG E+G YW V+NSWG WGE GY+++++
Sbjct: 275 SGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEK 334
Query: 338 NLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
+ + G CGIAMEASYPVK +KPKP
Sbjct: 335 D-SGASGGLCGIAMEASYPVK---TYSKPKP 361
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 311 bits (798), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 162/331 (48%), Positives = 209/331 (63%), Gaps = 13/331 (3%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL-NRTYKVGLNKFAD 102
+++ + +Y+ W + H + +RF FK N FI HN + Y++ LN+F D
Sbjct: 38 SEEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGD 96
Query: 103 LTNEEYRAMYLG-TRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGS 161
+ E+RA ++G R D + V YA +LP SVDWR+KGAV VKDQG
Sbjct: 97 MDQAEFRATFVGDLRRDTPAK--PPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 162 CGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGG 221
CGSCWAFSTV +VEGIN I TG L+SLSEQEL+DCD N GC GGLMD AF++I NGG
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 222 MDSEQDYPYLGAENKCDPSR---RNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEA 278
+ +E YPY A C+ +R + VV IDG++DV E L +AVA+QPVSVA+EA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 279 GGRAFQHYESGVFTGECGSALDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYVKLQR 337
G+AF Y GVFTG+CG+ LDHGV VGYG E+G YW V+NSWG WGE GY+++++
Sbjct: 275 SGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEK 334
Query: 338 NLLDTNTGKCGIAMEASYPVKNSQNSAKPKP 368
+ + G CGIAMEASYPVK KP P
Sbjct: 335 D-SGASGGLCGIAMEASYPVKTYN---KPMP 361
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 298 bits (764), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 158/362 (43%), Positives = 225/362 (62%), Gaps = 17/362 (4%)
Query: 1 MATASMFLAISTLVFL----FFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWL 56
MAT S +IS ++FL S+AD + Y + D +S R ++ ++ +W+
Sbjct: 1 MATMS---SISKIIFLATCLIIHMGLSSADFYTVGYSQD-DLTSIER----LIQLFDSWM 52
Query: 57 AKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTR 116
KH K + RF+IF+DNL +IDE N N +Y +GLN FADL+N+E++ Y+G
Sbjct: 53 LKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFV 112
Query: 117 SDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEG 176
++ L ++ + K P+S+DWR KGAV PVK+QG+CGSCWAFST+A VEG
Sbjct: 113 AEDFTGL--EHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEG 170
Query: 177 INKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
INKIVTG L+ LSEQELVDCD+ + GC GG + Q++ NG + + + YPY + K
Sbjct: 171 INKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQYVANNG-VHTSKVYPYQAKQYK 228
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECG 296
C + + V I GY+ V E S A+A+QP+SV +EAGG+ FQ Y+SGVF G CG
Sbjct: 229 CRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCG 288
Query: 297 SALDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYP 356
+ LDH V AVGYGT +G +Y +++NSWG +WGE GY++L+R ++ G CG+ + YP
Sbjct: 289 TKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYP 347
Query: 357 VK 358
K
Sbjct: 348 FK 349
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 290 bits (743), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 152/359 (42%), Positives = 217/359 (60%), Gaps = 19/359 (5%)
Query: 5 SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
+M +IS L+F LF S D SI+ Y N D +S+ R ++ ++++W+ KH
Sbjct: 2 AMIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQN-DLTSTER----LIQLFESWMLKHN 56
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF+IFKDNL++IDE N N +Y +GLN FAD++N+E++ Y G+ +
Sbjct: 57 KIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAG-- 114
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
++++ + +PE VDWR+KGAV PVK+QGSCGSCWAFS V +EGI KI
Sbjct: 115 -NYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKI 173
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG L SEQEL+DCDR+ + GCNGG A Q + Q G+ YPY G + C
Sbjct: 174 RTGNLNEYSEQELLDCDRR-SYGCNGGYPWSALQLVAQY-GIHYRNTYPYEGVQRYCRSR 231
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ DG V P++E +L ++A+QPVSV +EA G+ FQ Y G+F G CG+ +D
Sbjct: 232 EKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVD 291
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
H V AVGYG +Y L++NSWG+ WGENGY++++R ++ G CG+ + YPVKN
Sbjct: 292 HAVAAVGYGP----NYILIKNSWGTGWGENGYIRIKRGTGNS-YGVCGLYTSSFYPVKN 345
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 286 bits (733), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 138/224 (61%), Positives = 173/224 (77%), Gaps = 3/224 (1%)
Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
D+LP+S+DWRE GAV PVK+QG CGSCWAFSTVAAVEGIN+IVTG+LISLSEQ+LVDC
Sbjct: 1 DDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC-T 59
Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
N GC GG M+ AFQFI+ NGG++SE+ YPY G + C+ S NA VVSID YE+V
Sbjct: 60 TANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICN-STVNAPVVSIDSYENVPSH 118
Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
+E SL+KAVA+QPVSV ++A GR FQ Y SG+FTG C + +H + VGYGTEN D+W+
Sbjct: 119 NEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWI 178
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN 362
V+NSWG +WGE+GY++ +RN+ + + GKCGI ASYPVK N
Sbjct: 179 VKNSWGKNWGESGYIRAERNIENPD-GKCGITRFASYPVKKGTN 221
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 285 bits (730), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 156/359 (43%), Positives = 219/359 (61%), Gaps = 16/359 (4%)
Query: 5 SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
+M +IS L+F LF S S D SI+ Y + D +S+ R ++ ++ +W+ H
Sbjct: 2 AMIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQD-DLTSTER----LIQLFNSWMLNHN 56
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF+IFKDNL +IDE N N +Y +GLN+FADL+N+E+ Y+G+ DA
Sbjct: 57 KFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKYVGSLIDA- 115
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
+ + + + LPE+VDWR+KGAV PV+ QGSCGSCWAFS VA VEGINKI
Sbjct: 116 ---TIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKI 172
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG+L+ LSEQELVDC+R+ + GC GG YA +++ +NG + YPY + C
Sbjct: 173 RTGKLVELSEQELVDCERR-SHGCKGGYPPYALEYVAKNG-IHLRSKYPYKAKQGTCRAK 230
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ +V G V P +E +L A+A QPVSV +E+ GR FQ Y+ G+F G CG+ +D
Sbjct: 231 QVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVD 290
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
H V AVGYG G Y L++NSWG+ WGE GY++++R + G CG+ + YP KN
Sbjct: 291 HAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRIKR-APGNSPGVCGLYKSSYYPTKN 348
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
Length = 215
Score = 279 bits (713), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 131/218 (60%), Positives = 164/218 (75%), Gaps = 4/218 (1%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LP VDWR KGAVN +K+Q CGSCWAFS VAAVE INKI TG+LISLSEQELVDCD
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTA- 59
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
+ GCNGG M+ AFQ+II NGG+D++Q+YPY + C P R +VVSI+G++ V+ +E
Sbjct: 60 SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYR--LRVVSINGFQRVTRNNE 117
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
+L+ AVA QPVSV +EA G FQHY SG+FTG CG+A +HGVV VGYGT++G +YW+VR
Sbjct: 118 SALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGTQSGKNYWIVR 177
Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NSWG +WG GY+ ++RN+ + G CGIA SYP K
Sbjct: 178 NSWGQNWGNQGYIWMERNVASS-AGLCGIAQLPSYPTK 214
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 278 bits (711), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 153/359 (42%), Positives = 220/359 (61%), Gaps = 16/359 (4%)
Query: 5 SMFLAISTLVF----LFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHG 60
++ + S L+F LF S S D SI+ Y + D +S+ R ++ ++ +W+ KH
Sbjct: 2 AIICSFSKLLFVAICLFGHMSLSYCDFSIVGYSQD-DLTSTER----LIQLFNSWMLKHN 56
Query: 61 KTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAK 120
K + RF+IFKDNL++IDE N + Y +GLN+F+DL+N+E++ Y+G+ +
Sbjct: 57 KNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYWLGLNEFSDLSNDEFKEKYVGSLPED- 115
Query: 121 RRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKI 180
++ + + + +LPESVDWR KGAV PVK QG C SCWAFSTVA VEGINKI
Sbjct: 116 ---YTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTVATVEGINKI 172
Query: 181 VTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPS 240
TG L+ LSEQELVDCD++ + GCN G + Q++ QNG + YPY+ + C +
Sbjct: 173 KTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQNG-IHLRAKYPYIAKQQTCRAN 230
Query: 241 RRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALD 300
+ V +G V +E SL A+A QPVSV +E+ GR FQ+Y+ G+F G CG+ +D
Sbjct: 231 QVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVD 290
Query: 301 HGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
H V AVGYG G Y L++NSWG WGENGY++++R + G CG+ + YP+KN
Sbjct: 291 HAVTAVGYGKSGGKGYILIKNSWGPGWGENGYIRIRR-ASGNSPGVCGVYRSSYYPIKN 348
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 271 bits (693), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 130/220 (59%), Positives = 166/220 (75%), Gaps = 3/220 (1%)
Query: 139 DELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR 198
D LP+S+DWREKGAV PVK+QG CGSCWAF +AAVEGIN+IVTG+LISLSEQ+LVDC
Sbjct: 1 DVLPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCST 60
Query: 199 KINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPF 258
+ N GC GG AFQ+II NGG++SE+ YPY G CD ++ NA VVSID Y +V
Sbjct: 61 R-NHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCD-TKENAHVVSIDSYRNVPSN 118
Query: 259 DEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWL 318
DE SL+KAVA+QPVSV ++A GR FQ Y +G+FTG C + +H G TEN DYW
Sbjct: 119 DEKSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGGRETENDKDYWT 178
Query: 319 VRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
V+NSWG +WGE+GY++++RN+ ++ +GKCGIA+ SYP+K
Sbjct: 179 VKNSWGKNWGESGYIRVERNIAES-SGKCGIAISPSYPIK 217
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 266 bits (680), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 142/355 (40%), Positives = 213/355 (60%), Gaps = 20/355 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LVFLF + A S S D +D +M ++ W+A++G+ +R
Sbjct: 7 LVFLFLFLCAMWASPSAASRD---------EPNDPMMKRFEEWMAEYGRVYKDDDEKMRR 57
Query: 73 FQIFKDNLRFIDEHNSLNR-TYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
FQIFK+N++ I+ NS N +Y +G+N+F D+T E+ A Y G + + V S
Sbjct: 58 FQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVAQYTGV--SLPLNIEREPVVS- 114
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
+ +P+S+DWR+ GAVN VK+Q CGSCW+F+ +A VEGI KI TG L+SLSEQ
Sbjct: 115 -FDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQ 173
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
E++DC ++ GC GG ++ A+ FII N G+ +E++YPYL + C+ + I G
Sbjct: 174 EVLDC--AVSYGCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCN-ANSFPNSAYITG 230
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
Y V DE S+ AV++QP++ I+A FQ+Y GVF+G CG++L+H + +GYG +
Sbjct: 231 YSYVRRNDERSMMYAVSNQPIAALIDA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQD 289
Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
+G YW+VRNSWGS WGE GYV++ R + +++G CGIAM +P S +A+
Sbjct: 290 SSGTKYWIVRNSWGSSWGEGGYVRMARG-VSSSSGVCGIAMAPLFPTLQSGANAE 343
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 256 bits (655), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 148/324 (45%), Positives = 199/324 (61%), Gaps = 19/324 (5%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
D + + T+ +H K R +IF +N I +HN L +YK+GLNK+A
Sbjct: 22 DLIKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYA 81
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSK--VASQRYACKAGDELPESVDWREKGAVNPVKDQ 159
D+ + E++ G + R+LM+ + + Y A +P+SVDWRE GAV VKDQ
Sbjct: 82 DMLHHEFKETMNG-YNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQ 140
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
G CGSCWAFS+ A+EG + G L+SLSEQ LVDC K N GCNGGLMD AF++I
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 200
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVADQ-PVSVAI 276
NGG+D+E+ YPY G ++ C ++ A + + D G+ D+ DE +KKAVA PVSVAI
Sbjct: 201 NGGIDTEKSYPYEGIDDSCHFNK--ATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAI 258
Query: 277 EAGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
+A +FQ Y GV+ EC LDHGV+ VGYGT E+G+DYWLV+NSWG+ WGE GY+
Sbjct: 259 DASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYI 318
Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
K+ RN +CGIA +SYP
Sbjct: 319 KMARN----QNNQCGIATASSYPT 338
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 254 bits (648), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 140/355 (39%), Positives = 207/355 (58%), Gaps = 20/355 (5%)
Query: 13 LVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKR 72
LVFLF A S S D D +M ++ W+A++G+ R
Sbjct: 7 LVFLFLFLCVMWASPSAASCD---------EPSDPMMKQFEEWMAEYGRVYKDNDEKMLR 57
Query: 73 FQIFKDNLRFIDEHNSLN-RTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQ 131
FQIFK+N+ I+ N+ N +Y +G+N+F D+TN E+ A Y G + + V S
Sbjct: 58 FQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGL--SLPLNIKREPVVS- 114
Query: 132 RYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQ 191
+ +P+S+DWR+ GAV VK+QG CGSCWAF+++A VE I KI G L+SLSEQ
Sbjct: 115 -FDDVDISSVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQ 173
Query: 192 ELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDG 251
+++DC ++ GC GG ++ A+ FII N G+ S YPY A+ C + I
Sbjct: 174 QVLDC--AVSYGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCK-TNGVPNSAYITR 230
Query: 252 YEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTE 311
Y V +E ++ AV++QP++ A++A G FQHY+ GVFTG CG+ L+H +V +GYG +
Sbjct: 231 YTYVQRNNERNMMYAVSNQPIAAALDASGN-FQHYKRGVFTGPCGTRLNHAIVIIGYGQD 289
Query: 312 -NGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQNSAK 365
+G +W+VRNSWG+ WGE GY++L R+ + ++ G CGIAM+ YP S S +
Sbjct: 290 SSGKKFWIVRNSWGAGWGEGGYIRLARD-VSSSFGLCGIAMDPLYPTLQSGPSVE 343
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 253 bits (647), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 143/355 (40%), Positives = 204/355 (57%), Gaps = 22/355 (6%)
Query: 6 MFLAISTLVFLFFISSSSAADMSIISYDNNHDHSSSWRTDDEVMTIYQTWLAKHGKTSNG 65
M L+I+ + L +S S + ++ S+ D W + ++ ++
Sbjct: 1 MRLSITLIFTLIVLSISFISAGNVFSHKQYQDSFIDWMRSNNKAYTHKEFMP-------- 52
Query: 66 MGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRSDAKRRLMK 125
R++ FK N+ ++ NS +GLN+ ADL+NEEYR YLGTR+ K
Sbjct: 53 ------RYEEFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYH 106
Query: 126 SKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGEL 185
+ R + + P +VDWREK AV PVKDQG CGSC++FST +VEG+ I TG+L
Sbjct: 107 KRNLGLRLN-RPQFKQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKL 165
Query: 186 ISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNA 244
+SLSEQ ++DC N GCNGGLM AF++II+N G++SE+ YPY N + +
Sbjct: 166 VSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGS 225
Query: 245 KVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSA--LDHG 302
I Y+++ DE L+ A+ PVSVAI+A +FQ Y +GV+ S+ LDHG
Sbjct: 226 VAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHG 285
Query: 303 VVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
V+AVG GT+NG DY++V+NSWG WG NGY+ + RN D N CGI+ ASYP+
Sbjct: 286 VLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARN-KDNN---CGISTMASYPI 336
>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
Length = 208
Score = 248 bits (633), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 123/218 (56%), Positives = 153/218 (70%), Gaps = 11/218 (5%)
Query: 141 LPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI 200
LPE +DWR+KGAV PVK+QGSCGSCWAFSTV+ VE IN+I TG LISLSEQELVDCD+K
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59
Query: 201 NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDE 260
N GC GG +A+Q+II NGG+D++ +YPY + C + +KVVSIDGY V +E
Sbjct: 60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAA---SKVVSIDGYNGVPFCNE 116
Query: 261 MSLKKAVADQPVSVAIEAGGRAFQHYESGVFTGECGSALDHGVVAVGYGTENGVDYWLVR 320
+LK+AVA QP +VAI+A FQ Y SG+F+G CG+ L+HGV VGY +YW+VR
Sbjct: 117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQA----NYWIVR 172
Query: 321 NSWGSDWGENGYVKLQRNLLDTNTGKCGIAMEASYPVK 358
NSWG WGE GY+++ R G CGIA YP K
Sbjct: 173 NSWGRYWGEKGYIRMLR---VGGCGLCGIARLPYYPTK 207
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 246 bits (629), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/331 (43%), Positives = 193/331 (58%), Gaps = 42/331 (12%)
Query: 52 YQTWLAKHGK--TSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYR 109
+ W+ H K TS G R+ IFK N+ ++ + NS +GLN FAD+TNEEYR
Sbjct: 30 FTDWMITHQKSYTSEEFG---ARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYR 86
Query: 110 AMYLGTRSDAKRRL--MKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWA 167
YLGT+ DA + + KV + A S DWR +GAV PVK+QG CG CW+
Sbjct: 87 NTYLGTKFDASSLIGTQEEKVFTTSSAA--------SKDWRSEGAVTPVKNQGQCGGCWS 138
Query: 168 FSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFIIQNGGMDSEQD 227
FST + EG + GEL+SLSEQ L+DC + N+GC+GGLM YAF++II N G+D+E
Sbjct: 139 FSTTGSTEGAHFQSKGELVSLSEQNLIDCSTE-NSGCDGGLMTYAFEYIINNNGIDTESS 197
Query: 228 YPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQPVSVAIEAGGRAFQHYE 287
YPY KC+ N+ ++ Y+ V+ E SL+ AV PVSVAI+A ++FQ Y
Sbjct: 198 YPYKAENGKCEYKSENSG-ATLSSYKTVTAGSESSLESAVNVNPVSVAIDASHQSFQLYT 256
Query: 288 SGV-FTGECGSA-LDHGVVAVGYGTENGV-------------------DYWLVRNSWGSD 326
SG+ + EC S LDHGV+AVGYG+ +G +YW+V+NSWG+
Sbjct: 257 SGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNEYWIVKNSWGTS 316
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG GY+ + RN D N CGIA AS+PV
Sbjct: 317 WGIEGYILMSRN-RDNN---CGIASSASFPV 343
>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
Length = 334
Score = 244 bits (623), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 198/328 (60%), Gaps = 29/328 (8%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEK--RFQIFKDNLRFIDEHNSL----NRTYKVG 96
+ D + T + W A H + G NE+ R +++ N++ I+ HN + +
Sbjct: 20 KFDQNLDTKWYQWKATHRRL---YGANEEGWRRAVWEKNMKMIELHNGEYSQGKHGFTMA 76
Query: 97 LNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
+N F D+TNEE+R M +G + K R K KV + +LP+SVDWR+KG V PV
Sbjct: 77 MNAFGDMTNEEFRQM-MGCFRNQKFR--KGKVFREPLFL----DLPKSVDWRKKGYVTPV 129
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
K+Q CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGG M AFQ+
Sbjct: 130 KNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGFMARAFQY 189
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
+ +NGG+DSE+ YPY+ + C N+ V + G+ V+P E +L KAVA P+SV
Sbjct: 190 VKENGGLDSEESYPYVAVDEICKYRPENS-VANDTGFTVVAPGKEKALMKAVATVGPISV 248
Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
A++AG +FQ Y+SG+ F +C S LDHGV+ VGYG E N YWLV+NSWG +WG
Sbjct: 249 AMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWG 308
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYP 356
NGYVK+ + D N CGIA ASYP
Sbjct: 309 SNGYVKIAK---DKNN-HCGIATAASYP 332
>sp|Q28944|CATL1_PIG Cathepsin L1 OS=Sus scrofa GN=CTSL1 PE=2 SV=1
Length = 334
Score = 243 bits (620), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 143/331 (43%), Positives = 197/331 (59%), Gaps = 25/331 (7%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
S++ + D + + W A HG+ GM R +++ N++ I+ HN +
Sbjct: 16 SAAPKLDQNLDADWYKWKATHGRLY-GMNEEGWRRAVWEKNMKMIELHNQEYSQGKHGFS 74
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
+ +N F D+TNEE+R + G ++ + K KV + E+P+SVDWREKG V
Sbjct: 75 MAMNAFGDMTNEEFRQVMNGFQNQKHK---KGKVFHESLVL----EVPKSVDWREKGYVT 127
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
VK+QG CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AF
Sbjct: 128 AVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAF 187
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
Q++ NGG+D+E+ YPYLG E + + G+ D+ P E +L KAVA P+
Sbjct: 188 QYVKDNGGLDTEESYPYLGRETNSCTYKPECSAANDTGFVDI-PQREKALMKAVATVGPI 246
Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
SVAI+AG +FQ Y+SG+ + +C S LDHGV+ VGYG E N +W+V+NSWG +
Sbjct: 247 SVAIDAGHSSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPE 306
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG NGYVK+ + D N CGI+ ASYP
Sbjct: 307 WGWNGYVKMAK---DQNN-HCGISTAASYPT 333
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 242 bits (618), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 188/319 (58%), Gaps = 30/319 (9%)
Query: 52 YQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNKFADLTNEE 107
++ + K G+ + R +F DNL++I+E N TY + +N+F+D+TNE+
Sbjct: 20 WEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEK 79
Query: 108 YRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPES--VDWREKGAVNPVKDQGSCGSC 165
+ A+ G + + + + D PES VDWR KGAV PVKDQG CGSC
Sbjct: 80 FNAVMKGYKKGPRPAAVFTST----------DAAPESTEVDWRTKGAVTPVKDQGQCGSC 129
Query: 166 WAFSTVAAVEGINKIVTGELISLSEQELVDC--DRKINAGCNGGLMDYAFQFIIQNGGMD 223
WAFST +EG + + TG L+SLSEQ+LVDC N GCNGG ++ A ++ NGG+D
Sbjct: 130 WAFSTTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVD 189
Query: 224 SEQDYPYLGAENKCDPSRRNAKVV--SIDGYEDVSPFDEMSLKKAVAD-QPVSVAIEAGG 280
+E YPY +N C R N+ + + GY ++ E +LK A D P+SVAI+A
Sbjct: 190 TESSYPYEARDNTC---RFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASH 246
Query: 281 RAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRN 338
R+FQ Y +GV + C S+ LDH V+AVGYG+E G D+WLV+NSW + WGE+GY+K+ RN
Sbjct: 247 RSFQSYYTGVYYEPSCSSSQLDHAVLAVGYGSEGGQDFWLVKNSWATSWGESGYIKMARN 306
Query: 339 LLDTNTGKCGIAMEASYPV 357
CGIA +A YP
Sbjct: 307 ----RNNNCGIATDACYPT 321
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 242 bits (618), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 139/324 (42%), Positives = 196/324 (60%), Gaps = 18/324 (5%)
Query: 46 DEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLNKFA 101
D VM + T+ +H K R +IF +N I +HN ++K+ +NK+A
Sbjct: 53 DVVMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYA 112
Query: 102 DLTNEEYRAMYLGTRSDAKRRLMKSKVASQ--RYACKAGDELPESVDWREKGAVNPVKDQ 159
DL + E+R + G ++L + + + + A LP+SVDWR KGAV VKDQ
Sbjct: 113 DLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQ 172
Query: 160 GSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQ 218
G CGSCWAFS+ A+EG + +G L+SLSEQ LVDC K N GCNGGLMD AF++I
Sbjct: 173 GHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 232
Query: 219 NGGMDSEQDYPYLGAENKCDPSRRNAKVVSID-GYEDVSPFDEMSLKKAVAD-QPVSVAI 276
NGG+D+E+ YPY ++ C ++ V + D G+ D+ DE + +AVA PVSVAI
Sbjct: 233 NGGIDTEKSYPYEAIDDSCHFNK--GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAI 290
Query: 277 EAGGRAFQHYESGVFT-GECGSA-LDHGVVAVGYGT-ENGVDYWLVRNSWGSDWGENGYV 333
+A +FQ Y GV+ +C + LDHGV+ VG+GT E+G DYWLV+NSWG+ WG+ G++
Sbjct: 291 DASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFI 350
Query: 334 KLQRNLLDTNTGKCGIAMEASYPV 357
K+ RN +CGIA +SYP+
Sbjct: 351 KMLRN----KENQCGIASASSYPL 370
>sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max PE=1 SV=1
Length = 379
Score = 242 bits (617), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 135/345 (39%), Positives = 207/345 (60%), Gaps = 28/345 (8%)
Query: 44 TDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYKVGLNK 99
T +V +++Q W ++HG+ + KR +IFK+N +I + N+ NR ++++GLNK
Sbjct: 36 TQKQVSSLFQLWKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNA-NRKSPHSHRLGLNK 94
Query: 100 FADLTNEEYRAMYLGTRSDAKR--RLMKSKVASQRYACKAGDELPESVDWREKGAVNPVK 157
FAD+T +E+ YL D + ++ K+ ++Y+C D P S DWR+KG + VK
Sbjct: 95 FADITPQEFSKKYLQAPKDVSQQIKMANKKMKKEQYSC---DHPPASWDWRKKGVITQVK 151
Query: 158 DQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINAGCNGGLMDYAFQFII 217
QG CG WAFS A+E + I TG+L+SLSEQELVDC + + G G +F++++
Sbjct: 152 YQGGCGRGWAFSATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNGWQYQSFEWVL 210
Query: 218 QNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFD-------EMSLKKAVADQ 270
++GG+ ++ DYPY E +C ++ K V+IDGYE + D E + A+ +Q
Sbjct: 211 EHGGIATDDDYPYRAKEGRCKANKIQDK-VTIDGYETLIMSDESTESETEQAFLSAILEQ 269
Query: 271 PVSVAIEAGGRAFQHYESGVFTGE-CGS--ALDHGVVAVGYGTENGVDYWLVRNSWGSDW 327
P+SV+I+A + F Y G++ GE C S ++H V+ VGYG+ +GVDYW+ +NSWG DW
Sbjct: 270 PISVSIDA--KDFHLYTGGIYDGENCTSPYGINHFVLLVGYGSADGVDYWIAKNSWGFDW 327
Query: 328 GENGYVKLQRNLLDTNTGKCGIAMEASYPVKNSQN---SAKPKPH 369
GE+GY+ +QRN + G CG+ ASYP K SA+ K H
Sbjct: 328 GEDGYIWIQRNTGNL-LGVCGMNYFASYPTKEESETLVSARVKGH 371
>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
Length = 333
Score = 242 bits (617), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 144/331 (43%), Positives = 200/331 (60%), Gaps = 26/331 (7%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
S++ + D + + W A H + GM R +++ N++ I+ HN +
Sbjct: 16 SAAPKFDQSLNAQWYQWKATHRRLY-GMNEEGWRRAVWEKNMKMIELHNREYSQGKHGFT 74
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
+ +N F D+TNEE+R + G ++ + K K+ + E+P+SVDWREKG V
Sbjct: 75 MAMNAFGDMTNEEFRQVMNGFQNQKHK---KGKMFQEPLFA----EIPKSVDWREKGYVT 127
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
PVK+QG CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNEGCNGGLMDNAF 187
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVADQ-PV 272
+++ NGG+DSE+ YPYLG + + + + G+ D+ P E +L KAVA P+
Sbjct: 188 RYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDL-PQREKALMKAVATLGPI 246
Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTENGVD----YWLVRNSWGSD 326
SVAI+AG ++FQ Y+SG+ F +C S LDHGV+ VGYG E G D +W+V+NSWG +
Sbjct: 247 SVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFE-GTDSNNKFWIVKNSWGPE 305
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG NGYVK+ + D N CGIA ASYP
Sbjct: 306 WGWNGYVKMAK---DQNN-HCGIATAASYPT 332
>sp|Q10717|CYSP2_MAIZE Cysteine proteinase 2 OS=Zea mays GN=CCP2 PE=2 SV=1
Length = 360
Score = 241 bits (614), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 136/321 (42%), Positives = 190/321 (59%), Gaps = 20/321 (6%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
RT D + + + ++GK+ KRF+IF ++L+ + N +Y++G+N+FAD
Sbjct: 52 RTRDALR--FARFAVRYGKSYESAAEVHKRFRIFSESLQLVRSTNRKGLSYRLGINRFAD 109
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
++ EE+RA LG + L + R A A LPE+ DWRE G V+PVK+QG C
Sbjct: 110 MSWEEFRATRLGAAQNCSATLTGNH--RMRAAAVA---LPETKDWREDGIVSPVKNQGHC 164
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGG 221
GSCW FST A+E TG+ ISLSEQ+LVDC N GCNGGL AF++I NGG
Sbjct: 165 GSCWTFSTTGALEAAYTQATGKPISLSEQQLVDCGFAFNNFGCNGGLPSQAFEYIKYNGG 224
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGG 280
+D+E+ YPY G C N V +D +++ E LK AV +PVSVA E
Sbjct: 225 LDTEESYPYQGVNGICKFKNENVGVKVLDSV-NITLGAEDELKDAVGLVRPVSVAFEV-I 282
Query: 281 RAFQHYESGVFTGE-CGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
F+ Y+SGV+T + CG+ ++H V+AVGYG E+GV YWL++NSWG+DWG+ GY K++
Sbjct: 283 TGFRLYKSGVYTSDHCGTTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDEGYFKME 342
Query: 337 RNLLDTNTGKCGIAMEASYPV 357
CG+A ASYP+
Sbjct: 343 -----MGKNMCGVATCASYPI 358
>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
Length = 334
Score = 239 bits (611), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 196/331 (59%), Gaps = 25/331 (7%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYK 94
S++ + D + + W A H + GM E R +++ N + ID HN ++
Sbjct: 16 SAAPKLDPNLDAHWHQWKATHRRLY-GMNEEEWRRAVWEKNKKIIDLHNQEYSEGKHGFR 74
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
+ +N F D+TNEE+R + G ++ + K K+ + ++P+SVDW +KG V
Sbjct: 75 MAMNAFGDMTNEEFRQVMNGFQNQKHK---KGKLFHEPLLV----DVPKSVDWTKKGYVT 127
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAF 213
PVK+QG CGSCWAFS A+EG TG+L+SLSEQ LVDC R + N GCNGGLMD AF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNQGCNGGLMDNAF 187
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
Q+I NGG+DSE+ YPYL + + + G+ D+ P E +L KAVA P+
Sbjct: 188 QYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI-PQREKALMKAVATVGPI 246
Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
SVAI+AG +FQ Y+SG+ + +C S LDHGV+ VGYG E N +W+V+NSWG +
Sbjct: 247 SVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNNKFWIVKNSWGPE 306
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG NGYVK+ + D N CGIA ASYP
Sbjct: 307 WGWNGYVKMAK---DQNN-HCGIATAASYPT 333
>sp|P25778|ORYC_ORYSJ Oryzain gamma chain OS=Oryza sativa subsp. japonica GN=Os09g0442300
PE=2 SV=2
Length = 362
Score = 239 bits (610), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 189/321 (58%), Gaps = 21/321 (6%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFAD 102
RT D + + + +HGK ++RF+IF ++L + N Y++G+N+FAD
Sbjct: 55 RTRDALR--FARFAVRHGKRYGDAAEVQRRFRIFSESLELVRSTNRRGLPYRLGINRFAD 112
Query: 103 LTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSC 162
++ EE++A LG + S + + + LPE+ DWRE G V+PVKDQG C
Sbjct: 113 MSWEEFQASRLGAAQNC------SATLAGNHRMRDAAALPETKDWREDGIVSPVKDQGHC 166
Query: 163 GSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDRKINA-GCNGGLMDYAFQFIIQNGG 221
GSCW FST ++E TG+ +SLSEQ+LVDC N GC+GGL AF++I NGG
Sbjct: 167 GSCWTFSTTGSLEAAYTQATGKPVSLSEQQLVDCATAYNNFGCSGGLPSQAFEYIKYNGG 226
Query: 222 MDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGG 280
+D+E+ YPY G C N V +D +++ E LK AV +PVSVA +
Sbjct: 227 LDTEEAYPYTGVNGICHYKPENVGVKVLDSV-NITLGAEDELKNAVGLVRPVSVAFQV-I 284
Query: 281 RAFQHYESGVFTGE-CGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ 336
F+ Y+SGV+T + CG++ ++H V+AVGYG ENGV YWL++NSWG+DWG+NGY K++
Sbjct: 285 NGFRMYKSGVYTSDHCGTSPMDVNHAVLAVGYGVENGVPYWLIKNSWGADWGDNGYFKME 344
Query: 337 RNLLDTNTGKCGIAMEASYPV 357
CGIA ASYP+
Sbjct: 345 -----MGKNMCGIATCASYPI 360
>sp|P07711|CATL1_HUMAN Cathepsin L1 OS=Homo sapiens GN=CTSL1 PE=1 SV=2
Length = 333
Score = 239 bits (609), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 143/331 (43%), Positives = 196/331 (59%), Gaps = 26/331 (7%)
Query: 39 SSSWRTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNR----TYK 94
S++ D + + W A H + GM R +++ N++ I+ HN R ++
Sbjct: 16 SATLTFDHSLEAQWTKWKAMHNRLY-GMNEEGWRRAVWEKNMKMIELHNQEYREGKHSFT 74
Query: 95 VGLNKFADLTNEEYRAMYLGTRSDAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVN 154
+ +N F D+T+EE+R + G ++ R+ K KV + +A P SVDWREKG V
Sbjct: 75 MAMNAFGDMTSEEFRQVMNGFQN---RKPRKGKVFQEPLFYEA----PRSVDWREKGYVT 127
Query: 155 PVKDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCD-RKINAGCNGGLMDYAF 213
PVK+QG CGSCWAFS A+EG TG LISLSEQ LVDC + N GCNGGLMDYAF
Sbjct: 128 PVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGPQGNEGCNGGLMDYAF 187
Query: 214 QFIIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPV 272
Q++ NGG+DSE+ YPY E C + + + V + G+ D+ P E +L KAVA P+
Sbjct: 188 QYVQDNGGLDSEESYPYEATEESCKYNPKYS-VANDTGFVDI-PKQEKALMKAVATVGPI 245
Query: 273 SVAIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSD 326
SVAI+AG +F Y+ G+ F +C S +DHGV+ VGYG E + YWLV+NSWG +
Sbjct: 246 SVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTESDNNKYWLVKNSWGEE 305
Query: 327 WGENGYVKLQRNLLDTNTGKCGIAMEASYPV 357
WG GYVK+ ++ + CGIA ASYP
Sbjct: 306 WGMGGYVKMAKDRRN----HCGIASAASYPT 332
>sp|Q8H166|ALEU_ARATH Thiol protease aleurain OS=Arabidopsis thaliana GN=ALEU PE=1 SV=2
Length = 358
Score = 238 bits (608), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 136/308 (44%), Positives = 185/308 (60%), Gaps = 24/308 (7%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
++GK + + RF IFK+NL I N +YK+G+N+FADLT +E++ LG
Sbjct: 65 RYGKKYQNVEEMKLRFSIFKENLDLIRSTNKKGLSYKLGVNQFADLTWQEFQRTKLGAAQ 124
Query: 118 DAKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAAVEGI 177
+ L S ++ LPE+ DWRE G V+PVKDQG CGSCW FST A+E
Sbjct: 125 NCSATLKGSHKVTEA-------ALPETKDWREDGIVSPVKDQGGCGSCWTFSTTGALEAA 177
Query: 178 NKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLGAENK 236
G+ ISLSEQ+LVDC N GCNGGL AF++I NGG+D+E+ YPY G +
Sbjct: 178 YHQAFGKGISLSEQQLVDCAGAFNNYGCNGGLPSQAFEYIKSNGGLDTEKAYPYTGKDET 237
Query: 237 CDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVFT-GE 294
C S N V ++ +++ E LK AV +PVS+A E +F+ Y+SGV+T
Sbjct: 238 CKFSAENVGVQVLNSV-NITLGAEDELKHAVGLVRPVSIAFEV-IHSFRLYKSGVYTDSH 295
Query: 295 CGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQ--RNLLDTNTGKCGI 349
CGS ++H V+AVGYG E+GV YWL++NSWG+DWG+ GY K++ +N+ CGI
Sbjct: 296 CGSTPMDVNHAVLAVGYGVEDGVPYWLIKNSWGADWGDKGYFKMEMGKNM-------CGI 348
Query: 350 AMEASYPV 357
A ASYPV
Sbjct: 349 ATCASYPV 356
>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 238 bits (608), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 189/331 (57%), Gaps = 30/331 (9%)
Query: 43 RTDDEVMTIYQTWLAKHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSL----NRTYKVGLN 98
+ D + W + H + G E R I++ N+R I HN + + +N
Sbjct: 20 KFDQTFSAEWHQWKSTHRRLY-GTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMN 78
Query: 99 KFADLTNEEYRAMYLGTRSDA--KRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPV 156
F D+TNEE+R + G R K RL + + + +P+SVDWREKG V PV
Sbjct: 79 AFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLK---------IPKSVDWREKGCVTPV 129
Query: 157 KDQGSCGSCWAFSTVAAVEGINKIVTGELISLSEQELVDCDR-KINAGCNGGLMDYAFQF 215
K+QG CGSCWAFS +EG + TG+LISLSEQ LVDC + N GCNGGLMD+AFQ+
Sbjct: 130 KNQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQY 189
Query: 216 IIQNGGMDSEQDYPYLGAENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVAD-QPVSV 274
I +NGG+DSE+ YPY + C R V + G+ D+ P E +L KAVA P+SV
Sbjct: 190 IKENGGLDSEESYPYEAKDGSCK-YRAEFAVANDTGFVDI-PQQEKALMKAVATVGPISV 247
Query: 275 AIEAGGRAFQHYESGV-FTGECGSA-LDHGVVAVGYGTE----NGVDYWLVRNSWGSDWG 328
A++A + Q Y SG+ + C S LDHGV+ VGYG E N YWLV+NSWGS+WG
Sbjct: 248 AMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWG 307
Query: 329 ENGYVKLQRNLLDTNTGKCGIAMEASYPVKN 359
GY+K+ ++ CG+A ASYPV N
Sbjct: 308 MEGYIKIAKD----RDNHCGLATAASYPVVN 334
>sp|Q40143|CYSP3_SOLLC Cysteine proteinase 3 OS=Solanum lycopersicum GN=CYP-3 PE=2 SV=1
Length = 356
Score = 238 bits (607), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 136/310 (43%), Positives = 183/310 (59%), Gaps = 28/310 (9%)
Query: 58 KHGKTSNGMGHNEKRFQIFKDNLRFIDEHNSLNRTYKVGLNKFADLTNEEYRAMYLGTRS 117
+H K + + ++RF+IF DNL+ I HN +YK+G+N+F DLT +E+R LG
Sbjct: 63 RHRKRYDSVEEIKQRFEIFLDNLKMIRSHNRKGLSYKLGINEFTDLTWDEFRKHKLGASQ 122
Query: 118 D----AKRRLMKSKVASQRYACKAGDELPESVDWREKGAVNPVKDQGSCGSCWAFSTVAA 173
+ K L + V LPE+ DWR+ G V+PVK QG CGSCW FST A
Sbjct: 123 NCSATTKGNLKLTNVV-----------LPETKDWRKDGIVSPVKAQGKCGSCWTFSTTGA 171
Query: 174 VEGINKIVTGELISLSEQELVDCDRKI-NAGCNGGLMDYAFQFIIQNGGMDSEQDYPYLG 232
+E G+ ISLSEQ+LVDC N GCNGGL AF++I NGG+D+E+ YPY G
Sbjct: 172 LEAAYAQAFGKGISLSEQQLVDCAGAFNNFGCNGGLPSQAFEYIKFNGGLDTEEAYPYTG 231
Query: 233 AENKCDPSRRNAKVVSIDGYEDVSPFDEMSLKKAVA-DQPVSVAIEAGGRAFQHYESGVF 291
C S+ N V I +++ E LK AVA +PVSVA E + F+ Y+SGV+
Sbjct: 232 KNGICKFSQANIGVKVISSV-NITLGAEYELKYAVALVRPVSVAFEV-VKGFKQYKSGVY 289
Query: 292 -TGECGSA---LDHGVVAVGYGTENGVDYWLVRNSWGSDWGENGYVKLQRNLLDTNTGKC 347
+ ECG ++H V+AVGYG ENG YWL++NSWG+DWGE+GY K ++ C
Sbjct: 290 ASTECGDTPMDVNHAVLAVGYGVENGTPYWLIKNSWGADWGEDGYFK-----MEMGKNMC 344
Query: 348 GIAMEASYPV 357
G+A ASYP+
Sbjct: 345 GVATCASYPI 354
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.315 0.131 0.393
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 141,107,492
Number of Sequences: 539616
Number of extensions: 6102443
Number of successful extensions: 14248
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 219
Number of HSP's successfully gapped in prelim test: 13
Number of HSP's that attempted gapping in prelim test: 13190
Number of HSP's gapped (non-prelim): 278
length of query: 372
length of database: 191,569,459
effective HSP length: 119
effective length of query: 253
effective length of database: 127,355,155
effective search space: 32220854215
effective search space used: 32220854215
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 62 (28.5 bits)