BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 048002
(351 letters)
Database: swissprot
539,616 sequences; 191,569,459 total letters
Searching..................................................done
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 423 bits (1087), Expect = e-117, Method: Compositional matrix adjust.
Identities = 220/371 (59%), Positives = 258/371 (69%), Gaps = 41/371 (11%)
Query: 5 VGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLK 64
V LS LV GVA SFD+ + DLASEE LWDLYERWRSHHTVSR L EK RFNVFK NL
Sbjct: 9 VVLSFSLVLGVANSFDFHDKDLASEESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANLM 68
Query: 65 RIHKVNQMDKPYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHG-PRRQTGFMHGKTQDL 122
+H N+MDKPYKL+LN+FADMTNHEF S+ + SKV+H RM G P FM+ K +
Sbjct: 69 HVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSV 128
Query: 123 PPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDK-DN 181
PPSVDWRK+GAVT VKDQG+CGSCWAFSTVV+VEGIN+IKT +L +LSEQELVDCDK +N
Sbjct: 129 PPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEEN 188
Query: 182 HGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNA 241
GC+GGLME A FI + G+TTE +YPY A++G+C+ S V N
Sbjct: 189 QGCNGGLMESAFEFIKQKGGITTESNYPYKAQEGTCD--ASKV---------------ND 231
Query: 242 PEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE---------------- 285
V +DG+E VP +DE+AL+KAVANQPV+VAIDAGG DFQFYSE
Sbjct: 232 LAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLNHGVA 291
Query: 286 --GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKL---HP 340
GYG T DGT YWIV+NSWG +W E GYIRM R I +EGLCGI + SYP+K +P
Sbjct: 292 IVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPIKNSSDNP 351
Query: 341 ENSRHPRKDEL 351
S KDEL
Sbjct: 352 TGSFSSPKDEL 362
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 417 bits (1071), Expect = e-116, Method: Compositional matrix adjust.
Identities = 212/360 (58%), Positives = 253/360 (70%), Gaps = 41/360 (11%)
Query: 16 AESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKP 75
A SFD+ E DL SEE LWDLYERWRSHHTVSR L EK RFNVFK N+ +H N+MDKP
Sbjct: 20 ANSFDFHEKDLESEESLWDLYERWRSHHTVSRSLGEKHKRFNVFKANVMHVHNTNKMDKP 79
Query: 76 YKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHGPRRQTG-FMHGKTQDLPPSVDWRKQGA 133
YKL+LN+FADMTNHEF S+ + SKV+HH+M G + +G FM+ K +P SVDWRK+GA
Sbjct: 80 YKLKLNKFADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGA 139
Query: 134 VTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDK-DNHGCDGGLMEQA 192
VT VKDQG+CGSCWAFST+V+VEGIN+IKT +L SLSEQELVDCDK +N GC+GGLME A
Sbjct: 140 VTDVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESA 199
Query: 193 LNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMV 252
FI + G+TTE +YPYTA++G+C+ N V +DG+E V
Sbjct: 200 FEFIKQKGGITTESNYPYTAQEGTCD-----------------ESKVNDLAVSIDGHENV 242
Query: 253 PESDENALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGT 294
P +DENAL+KAVANQPV+VAIDAGG DFQFYSE GYG T DGT
Sbjct: 243 PVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGT 302
Query: 295 KYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKL---HPENSRHPRKDEL 351
YWIV+NSWG +W E+GYIRM R I +EGLCGI + ASYP+K +P S KDEL
Sbjct: 303 NYWIVRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMMASYPIKNSSDNPTGSLSSPKDEL 362
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 416 bits (1069), Expect = e-115, Method: Compositional matrix adjust.
Identities = 218/361 (60%), Positives = 255/361 (70%), Gaps = 41/361 (11%)
Query: 15 VAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQMDK 74
+ ESFD+ E +L SEE LW LYERWRSHHTVSR L EKQ RFNVFK N +H N+MDK
Sbjct: 17 ITESFDFHEKELESEESLWGLYERWRSHHTVSRSLHEKQKRFNVFKHNAMHVHNANKMDK 76
Query: 75 PYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLH-GPRRQTGFMHGKTQDLPPSVDWRKQG 132
PYKL+LN+FADMTNHEF ++ S SKV HHRM GPR FM+ K +P SVDWRK+G
Sbjct: 77 PYKLKLNKFADMTNHEFRNTYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKG 136
Query: 133 AVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQ 191
AVT VKDQG+CGSCWAFST+V+VEGIN+IKT +L SLSEQELVDCD D N GC+GGLM+
Sbjct: 137 AVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDY 196
Query: 192 ALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEM 251
A FI + G+TTE +YPY A DG+C++ +NAP V +DG+E
Sbjct: 197 AFEFIKQRGGITTEANYPYEAYDGTCDVSK-----------------ENAPAVSIDGHEN 239
Query: 252 VPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDG 293
VPE+DENAL+KAVANQPV+VAIDAGG DFQFYSE GYG T DG
Sbjct: 240 VPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDG 299
Query: 294 TKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKL---HPENSRHPRKDE 350
TKYW VKNSWG +W EKGYIRM RGI +EGLCGI +EASYP+K +P + KDE
Sbjct: 300 TKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPIKKSSNNPSGIKSSPKDE 359
Query: 351 L 351
L
Sbjct: 360 L 360
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 396 bits (1017), Expect = e-109, Method: Compositional matrix adjust.
Identities = 203/375 (54%), Positives = 250/375 (66%), Gaps = 42/375 (11%)
Query: 2 FFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQ 61
F ++ L +++V + D+ D+ SE LW+LYERWRSHHTV+R L+EK RFNVFK
Sbjct: 4 FIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLEEKAKRFNVFKH 63
Query: 62 NLKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHGPRRQT-GFMHGKT 119
N+K IH+ N+ DK YKL+LN+F DMT+ EF + + S + HHRM G ++ T FM+
Sbjct: 64 NVKHIHETNKKDKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANV 123
Query: 120 QDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDK 179
LP SVDWRK GAVT VK+QG+CGSCWAFSTVV+VEGIN+I+T +L SLSEQELVDCD
Sbjct: 124 NTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDT 183
Query: 180 D-NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGD 238
+ N GC+GGLM+ A FI + GLT+E YPY A D +C+
Sbjct: 184 NQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCD-----------------TNK 226
Query: 239 KNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------------- 285
+NAP V +DG+E VP++ E+ LMKAVANQPV+VAIDAGG DFQFYSE
Sbjct: 227 ENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNH 286
Query: 286 -----GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLHP 340
GYG T DGTKYWIVKNSWG +W EKGYIRM RGI +EGLCGI +EASYP+K
Sbjct: 287 GVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPLKNSN 346
Query: 341 EN----SRHPRKDEL 351
N S KDEL
Sbjct: 347 TNPSRLSLDSLKDEL 361
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 392 bits (1008), Expect = e-108, Method: Compositional matrix adjust.
Identities = 202/373 (54%), Positives = 253/373 (67%), Gaps = 41/373 (10%)
Query: 2 FFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQ 61
F + L + +A+S + E DLASE+ LW+LYE+WR+HHTV+RDL EK RFNVFK+
Sbjct: 6 FIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHHTVARDLDEKNRRFNVFKE 65
Query: 62 NLKRIHKVNQM-DKPYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHGPRRQTG-FMHGK 118
N+K IH+ NQ D PYKL LN+F DMTN EF S + SK+ HHR G ++ TG FM+
Sbjct: 66 NVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYEN 125
Query: 119 TQDLPP-SVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDC 177
LP S+DWR +GAVTGVKDQG+CGSCWAFST+ SVEGIN+IKTGEL SLSEQELVDC
Sbjct: 126 VGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELVDC 185
Query: 178 DKD-NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWN 236
D N GC+GGLM+ A FI K+ G+TTE SYPY +DG+C ++++
Sbjct: 186 DTSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTC--ASNLL------------ 230
Query: 237 GDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE----------- 285
N+P V +DG++ VP ++ENALM+AVANQP++V+I+A G FQFYSE
Sbjct: 231 ---NSPVVSIDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTEL 287
Query: 286 -------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKL 338
GYGAT+DGTKYWIVKNSWG +W E GYIRM RGI + G CGI +EASYP+K
Sbjct: 288 DHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPIKT 347
Query: 339 HPENSRHPRKDEL 351
+DEL
Sbjct: 348 SANPKNSSTRDEL 360
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 375 bits (962), Expect = e-103, Method: Compositional matrix adjust.
Identities = 194/377 (51%), Positives = 246/377 (65%), Gaps = 43/377 (11%)
Query: 2 FFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQ 61
FF+V +S + + ++ FD+ E +L +EE +W LYERWR HH+VSR E RFNVF+
Sbjct: 4 FFIVLISFLSLLQASKGFDFDEKELETEENVWKLYERWRGHHSVSRASHEAIKRFNVFRH 63
Query: 62 NLKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHGPRRQT-GFMHGKT 119
N+ +H+ N+ +KPYKL++NRFAD+T+HEF SS + S V HHRML GP+R + GFM+
Sbjct: 64 NVLHVHRTNKKNKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGFMYENV 123
Query: 120 QDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCD- 178
+P SVDWR++GAVT VK+Q CGSCWAFSTV +VEGINKI+T +L SLSEQELVDCD
Sbjct: 124 TRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDT 183
Query: 179 KDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGD 238
++N GC GGLME A FI + G+ TE++YPY + D V C N
Sbjct: 184 EENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSD---------------VQFCRAN-S 227
Query: 239 KNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------------- 285
V +DG+E VPE+DE L+KAVA+QPV+VAIDAG DFQ YSE
Sbjct: 228 IGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNH 287
Query: 286 -----GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLHP 340
GYG T++GTKYWIV+NSWG +W E GY+R+ RGI EG CGI +EASYP KL
Sbjct: 288 GVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKLSS 347
Query: 341 ENSRHPR------KDEL 351
S H KDEL
Sbjct: 348 TPSTHESVVRDDVKDEL 364
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 370 bits (951), Expect = e-102, Method: Compositional matrix adjust.
Identities = 195/377 (51%), Positives = 246/377 (65%), Gaps = 46/377 (12%)
Query: 2 FFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQ 61
L+ L +++ A FDY + ++ SEE L LY+RWRSHH+V R L E++ RFNVF+
Sbjct: 4 LLLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPRSLNEREKRFNVFRH 63
Query: 62 NLKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRS-SKVSHHRMLHGPRR---QTGFMHG 117
N+ +H N+ ++ YKL+LN+FAD+T +EF ++ + S + HHRML GP+R Q + H
Sbjct: 64 NVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHE 123
Query: 118 KTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDC 177
LP SVDWRK+GAVT +K+QG+CGSCWAFSTV +VEGINKIKT +L SLSEQELVDC
Sbjct: 124 NLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDC 183
Query: 178 D-KDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWN 236
D K N GC+GGLME A FI K+ G+TTE SYPY DG C+
Sbjct: 184 DTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKD-------------- 229
Query: 237 GDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE----------- 285
N V +DG+E VPE+DENAL+KAVANQPV+VAIDAG DFQFYSE
Sbjct: 230 ---NGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTEL 286
Query: 286 -------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKL 338
GYG ++ G KYWIV+NSWG +W E GYI++ R ID EG CGI +EASYP+KL
Sbjct: 287 NHGVAAVGYG-SERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIKL 345
Query: 339 HPENSRHPR----KDEL 351
N P+ KDEL
Sbjct: 346 SSSNPT-PKDGDVKDEL 361
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 325 bits (834), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 176/352 (50%), Positives = 215/352 (61%), Gaps = 43/352 (12%)
Query: 22 QESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQM-DKPYKLRL 80
++ DL SEE LWDLYERW+S H V R EK RF FK N IH N+ D PY+L L
Sbjct: 32 EDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHL 91
Query: 81 NRFADMTNHEFMSSRSSKVSHHR--MLHGPRRQTGFMHG--KTQDLPPSVDWRKQGAVTG 136
NRF DM EF R++ V R P GFM+ DLPPSVDWR++GAVTG
Sbjct: 92 NRFGDMDQAEF---RATFVGDLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTG 148
Query: 137 VKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCD-KDNHGCDGGLMEQALNF 195
VKDQG+CGSCWAFSTVVSVEGIN I+TG L SLSEQEL+DCD DN GC GGLM+ A +
Sbjct: 149 VKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEY 208
Query: 196 IAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVI-LDGYEMVPE 254
I + GL TE +YPY A G+C + + +N+P V+ +DG++ VP
Sbjct: 209 IKNNGGLITEAAYPYRAARGTCNVARAA---------------QNSPVVVHIDGHQDVPA 253
Query: 255 SDENALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKY 296
+ E L +AVANQPV+VA++A GK F FYSE GYG +DG Y
Sbjct: 254 NSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAY 313
Query: 297 WIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLHPENSRHPRK 348
W VKNSWG W E+GYIR+ + A GLCGI +EASYPVK + + PR+
Sbjct: 314 WTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTYSKPKPTPRR 365
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 322 bits (826), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 177/352 (50%), Positives = 215/352 (61%), Gaps = 45/352 (12%)
Query: 22 QESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQM-DKPYKLRL 80
++ DL SEE LWDLYERW+S H V R EK RF FK N IH N+ D PY+L L
Sbjct: 32 EDKDLESEEALWDLYERWQSAHRVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHL 91
Query: 81 NRFADMTNHEFMSSRSSKVSHHR--MLHGPRRQTGFMHG--KTQDLPPSVDWRKQGAVTG 136
NRF DM EF R++ V R P GFM+ DLPPSVDWR++GAVTG
Sbjct: 92 NRFGDMDQAEF---RATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTG 148
Query: 137 VKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCD-KDNHGCDGGLMEQALNF 195
VKDQG+CGSCWAFSTVVSVEGIN I+TG L SLSEQEL+DCD DN GC GGLM+ A +
Sbjct: 149 VKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEY 208
Query: 196 IAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVI-LDGYEMVPE 254
I + GL TE +YPY A G+C + + +N+P V+ +DG++ VP
Sbjct: 209 IKNNGGLITEAAYPYRAARGTCNVARAA---------------QNSPVVVHIDGHQDVPA 253
Query: 255 SDENALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKY 296
+ E L +AVANQPV+VA++A GK F FYSE GYG +DG Y
Sbjct: 254 NSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAY 313
Query: 297 WIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLHPENSRHPRK 348
W VKNSWG W E+GYIR+ + A GLCGI +EASYPVK + N PR+
Sbjct: 314 WTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPVKTY--NKPMPRR 363
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 275 bits (703), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 157/341 (46%), Positives = 197/341 (57%), Gaps = 51/341 (14%)
Query: 35 LYERWRSHHTVSRD-----LKEKQIRFNVFKQNLKRI--HKVNQMDKPYKLRLNRFADMT 87
+Y RW H S + ++ RFN+FK NL+ I H N + YKL L FA++T
Sbjct: 3 IYLRWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLT 62
Query: 88 NHEFMS----SRSSKVSHHRMLHGPRRQTGFMHGKTQD-LPPSVDWRKQGAVTGVKDQGR 142
N E+ S +R+ V R+ + D +P +VDWR++GAV +KDQG
Sbjct: 63 NDEYRSLYLGARTEPV--RRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGT 120
Query: 143 CGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQALNFIAKSEG 201
CGSCWAFST +VEGINKI TGEL SLSEQELVDCDK N GC+GGLM+ A FI K+ G
Sbjct: 121 CGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 180
Query: 202 LTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALM 261
L TEK YPY +G C S++ KN+ V +DGYE VP DE AL
Sbjct: 181 LNTEKDYPYHGTNGKCN------SLL-----------KNSRVVTIDGYEDVPSKDETALK 223
Query: 262 KAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSW 303
+AV+ QPV+VAIDAGG+ FQ Y GYG +++G YWIV+NSW
Sbjct: 224 RAVSYQPVSVAIDAGGRAFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSW 282
Query: 304 GTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVKLHPENSR 344
GT W E GYIRM R + ++ G CGI +EASYPVK P R
Sbjct: 283 GTRWGEDGYIRMERNVASKSGKCGIAIEASYPVKYSPNPVR 323
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 269 bits (687), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 150/333 (45%), Positives = 195/333 (58%), Gaps = 44/333 (13%)
Query: 28 SEECLWDLYERWRSHHTVSRD---LKEKQIRFNVFKQNLKRIHKVNQMDKPYKLRLNRFA 84
SE + +YE W H ++ L EK RF +FK NL+ + + N+ + Y+L L RFA
Sbjct: 42 SEAEVMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFA 101
Query: 85 DMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQD-LPPSVDWRKQGAVTGVKDQGRC 143
D+TN E+ RS + G RR + + D LP S+DWRK+GAV VKDQG C
Sbjct: 102 DLTNDEY---RSKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGC 158
Query: 144 GSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQALNFIAKSEGL 202
GSCWAFST+ +VEGIN+I TG+L +LSEQELVDCD N GC+GGLM+ A FI K+ G+
Sbjct: 159 GSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGI 218
Query: 203 TTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMK 262
T+K YPY DG+C+ KNA V +D YE VP E +L K
Sbjct: 219 DTDKDYPYKGVDGTCDQIR-----------------KNAKVVTIDSYEDVPTYSEESLKK 261
Query: 263 AVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWG 304
AVA+QP+++AI+AGG+ FQ Y GYG T++G YWIV+NSWG
Sbjct: 262 AVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWG 320
Query: 305 TDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
W E GY+RM R I + G CGI +E SYP+K
Sbjct: 321 KSWGESGYLRMARNIASSSGKCGIAIEPSYPIK 353
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 267 bits (683), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 148/337 (43%), Positives = 202/337 (59%), Gaps = 38/337 (11%)
Query: 21 YQESDLASEECLWDLYERWRSHHTVSRD-LKEKQIRFNVFKQNLKRIHKVNQMDKPYKLR 79
Y DL S + L +L+E W S+ + + ++EK +RF VFK NLK I + N+ K Y L
Sbjct: 36 YSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLG 95
Query: 80 LNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKD 139
LN FAD+++ EF + R F + + +P SVDWRK+GAV VK+
Sbjct: 96 LNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKN 155
Query: 140 QGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQALNFIAK 198
QG CGSCWAFSTV +VEGINKI TG L +LSEQEL+DCD N+GC+GGLM+ A +I K
Sbjct: 156 QGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVK 215
Query: 199 SEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDEN 258
+ GL E+ YPY+ ++G+CE+ + V ++G++ VP +DE
Sbjct: 216 NGGLRKEEDYPYSMEEGTCEMQKD-----------------ESETVTINGHQDVPTNDEK 258
Query: 259 ALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVK 300
+L+KA+A+QP++VAIDA G++FQFYS GYG+++ G+ Y IVK
Sbjct: 259 SLLKALAHQPLSVAIDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVK 317
Query: 301 NSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
NSWG W EKGYIR+ R EGLCGI AS+P K
Sbjct: 318 NSWGPKWGEKGYIRLKRNTGKPEGLCGINKMASFPTK 354
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 266 bits (681), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 153/356 (42%), Positives = 204/356 (57%), Gaps = 42/356 (11%)
Query: 5 VGLSLVLVFGVAESFD---YQESDLASEECLWDLYERWRSHHTVS-RDLKEKQIRFNVFK 60
+ S +L A F Y L + + L +L+E W S H+ + + ++EK RF VF+
Sbjct: 17 ISASALLCCAFARDFSIVGYTPEHLTNTDKLLELFESWMSEHSKAYKSVEEKVHRFEVFR 76
Query: 61 QNLKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQ 120
+NL I + N Y L LN FAD+T+ EF R ++ + + F +
Sbjct: 77 ENLMHIDQRNNEINSYWLGLNEFADLTHEEF-KGRYLGLAKPQFSRKRQPSANFRYRDIT 135
Query: 121 DLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD 180
DLP SVDWRK+GAV VKDQG+CGSCWAFSTV +VEGIN+I TG L SLSEQEL+DCD
Sbjct: 136 DLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTT 195
Query: 181 -NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDK 239
N GC+GGLM+ A +I + GL E YPY ++G C+ +
Sbjct: 196 FNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQ-----------------EQKE 238
Query: 240 NAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFY---------------- 283
+ V + GYE VPE+D+ +L+KA+A+QPV+VAI+A G+DFQFY
Sbjct: 239 DVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGKCGTDLDHG 298
Query: 284 --SEGYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
+ GYG+++ G+ Y IVKNSWG W EKG+IRM R EGLCGI ASYP K
Sbjct: 299 VAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEGLCGINKMASYPTK 353
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 262 bits (669), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 148/338 (43%), Positives = 196/338 (57%), Gaps = 52/338 (15%)
Query: 28 SEECLWDLYERWRSHHTVSRD-LKEKQIRFNVFKQNLKRIHKVNQMDKP----YKLRLNR 82
SEE LY W++ H S + + E++ R+ F+ NL+ I + N ++L LNR
Sbjct: 32 SEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNR 91
Query: 83 FADMTNHEFMSSRSSKVSHHRMLHGPRRQTG----FMHGKTQDLPPSVDWRKQGAVTGVK 138
FAD+TN E+ + ++ + + PRR+ ++ + LP SVDWR +GAV +K
Sbjct: 92 FADLTNEEY------RDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIK 145
Query: 139 DQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQALNFIA 197
DQG CGSCWAFS + +VEGIN+I TG+L SLSEQELVDCD N GC+GGLM+ A +FI
Sbjct: 146 DQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFII 205
Query: 198 KSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDE 257
+ G+ TE YPY KD C++ KNA V +D YE V + E
Sbjct: 206 NNGGIDTEDDYPYKGKDERCDV-----------------NRKNAKVVTIDSYEDVTPNSE 248
Query: 258 NALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIV 299
+L KAVANQPV+VAI+AGG+ FQ YS GYG T++G YWIV
Sbjct: 249 TSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYG-TENGKDYWIV 307
Query: 300 KNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
+NSWG W E GY+RM R I A G CGI +E SYP+K
Sbjct: 308 RNSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPLK 345
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 260 bits (665), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 152/351 (43%), Positives = 209/351 (59%), Gaps = 56/351 (15%)
Query: 28 SEECLWDLYERWRSHH-----TVSRDLKEKQIRFNVFKQNLKRI--HKVNQMDKPYKLRL 80
++E + +Y +W + H + + ++ RFN+FK NL+ I H + + YKL L
Sbjct: 41 TDEEVRSIYLQWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGL 100
Query: 81 NRFADMTNHEF----MSSRSS---KVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGA 133
+F D+TN E+ + +R+ +++ + ++ ++ + ++GK ++P +VDWR++GA
Sbjct: 101 TKFTDLTNDEYRKLYLGARTEPARRIAKAKNVN--QKYSAAVNGK--EVPETVDWRQKGA 156
Query: 134 VTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD-NHGCDGGLMEQA 192
V +KDQG CGSCWAFST +VEGINKI TGEL SLSEQELVDCDK N GC+GGLM+ A
Sbjct: 157 VNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYA 216
Query: 193 LNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMV 252
FI K+ GL TEK YPY G C S + KN+ V +DGYE V
Sbjct: 217 FQFIMKNGGLNTEKDYPYRGFGGKCN------SFL-----------KNSRVVSIDGYEDV 259
Query: 253 PESDENALMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGT 294
P DE AL KA++ QPV+VAI+AGG+ FQ Y GYG +++G
Sbjct: 260 PTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGV 318
Query: 295 KYWIVKNSWGTDWEEKGYIRMLRGIDA-EEGLCGITLEASYPVKLHPENSR 344
YWIV+NSWG W E+GYIRM R + A + G CGI +EASYPVK P R
Sbjct: 319 DYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVKYSPNPVR 369
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 258 bits (658), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 145/313 (46%), Positives = 183/313 (58%), Gaps = 45/313 (14%)
Query: 49 LKEKQIRFNVFKQNLKRIHKVNQMDKP---YKLRLNRFADMTNHEFMSSRSSKVSHHRML 105
+ E + RF VF NLK + N ++L +NRFAD+TN EF R++ +
Sbjct: 82 IGEHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEF---RATYLGTTPAG 138
Query: 106 HGPRRQTGFMHGKTQDLPPSVDWRKQGAVTG-VKDQGRCGSCWAFSTVVSVEGINKIKTG 164
G R + H + LP SVDWR +GAV VK+QG+CGSCWAFS V +VEGINKI TG
Sbjct: 139 RGRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTG 198
Query: 165 ELWSLSEQELVDCDKD--NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTS 222
EL SLSEQELV+C ++ N GC+GG+M+ A FIA++ GL TE+ YPYTA DG C L
Sbjct: 199 ELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKR 258
Query: 223 MVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQF 282
+ V +DG+E VPE+DE +L KAVA+QPV+VAIDAGG++FQ
Sbjct: 259 SRKV-----------------VSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQL 301
Query: 283 YSE------------------GYGA-TQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEE 323
Y GYG G YW V+NSWG DW E GYIRM R + A
Sbjct: 302 YDSGVFTGRCGTNLDHGVVAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTART 361
Query: 324 GLCGITLEASYPV 336
G CGI + ASYP+
Sbjct: 362 GKCGIAMMASYPI 374
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 253 bits (646), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 188/311 (60%), Gaps = 46/311 (14%)
Query: 51 EKQIRFNVFKQNLKRIHKVNQMDKP---YKLRLNRFADMTNHEFMSS-RSSKVSHHRMLH 106
E + RF VF NLK + N ++L +NRFAD+TN EF ++ +KV+
Sbjct: 70 EHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGAKVAERSRAA 129
Query: 107 GPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGEL 166
G R + H ++LP SVDWR++GAV VK+QG+CGSCWAFS V +VE IN++ TGE+
Sbjct: 130 GER----YRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEM 185
Query: 167 WSLSEQELVDCDKD--NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMV 224
+LSEQELV+C + N GC+GGLM+ A +FI K+ G+ TE YPY A DG C++
Sbjct: 186 ITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDI----- 240
Query: 225 SIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFY- 283
+NA V +DG+E VP++DE +L KAVA+QPV+VAI+AGG++FQ Y
Sbjct: 241 ------------NRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYH 288
Query: 284 -----------------SEGYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLC 326
+ GYG T +G YWIV+NSWG W E GY+RM R I+ G C
Sbjct: 289 SGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKC 347
Query: 327 GITLEASYPVK 337
GI + ASYP K
Sbjct: 348 GIAMMASYPTK 358
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 252 bits (644), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 140/325 (43%), Positives = 190/325 (58%), Gaps = 40/325 (12%)
Query: 35 LYERWRSHHTVSRD-LKEKQIRFNVFKQNLKRIHKVNQM-DKPYKLRLNRFADMTNHEFM 92
+YE+W + + + L EK+ RF +FK NLK + + N + D+ +++ L RFAD+TN EF
Sbjct: 43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102
Query: 93 SSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTV 152
+ K R + + +++ + LP VDWR GAV VKDQG CGSCWAFS V
Sbjct: 103 AIYLRK-KMERTKDSVKTER-YLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAV 160
Query: 153 VSVEGINKIKTGELWSLSEQELVDCDKD--NHGCDGGLMEQALNFIAKSEGLTTEKSYPY 210
+VEGIN+I TGEL SLSEQELVDCD+ N GCDGG+M A FI K+ G+ T++ YPY
Sbjct: 161 GAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPY 220
Query: 211 TAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVA 270
A D + +C+ + + N V +DGYE VP DE +L KAVA+QPV+
Sbjct: 221 NAND---------------LGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVS 265
Query: 271 VAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWGTDWEEKGY 312
VAI+A + FQ Y GYG+T G YWI++NSWG +W + GY
Sbjct: 266 VAIEASSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGY 324
Query: 313 IRMLRGIDAEEGLCGITLEASYPVK 337
+++ R ID G CGI + SYP K
Sbjct: 325 VKLQRNIDDPFGKCGIAMMPSYPTK 349
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 252 bits (643), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 143/329 (43%), Positives = 188/329 (57%), Gaps = 49/329 (14%)
Query: 35 LYERWR-SHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLRLNRFADMTNHEFMS 93
++E W H V + EK+ R +F+ NL+ I N + Y+L LNRFAD++ HE+
Sbjct: 55 MFESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEY-- 112
Query: 94 SRSSKVSHHRMLHGPRRQTGFMHG----KTQD---LPPSVDWRKQGAVTGVKDQGRCGSC 146
++ H PR FM KT D LP SVDWR +GAVT VKDQG C SC
Sbjct: 113 ---GEICHGADPRPPRNHV-FMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSC 168
Query: 147 WAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEK 206
WAFSTV +VEG+NKI TGEL +LSEQ+L++C+K+N+GC GG +E A FI + GL T+
Sbjct: 169 WAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDN 228
Query: 207 SYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVAN 266
YPY A +G CE ++ V++DGYE +P +DE ALMKAVA+
Sbjct: 229 DYPYKALNGVCEGRLK----------------EDNKNVMIDGYENLPANDEAALMKAVAH 272
Query: 267 QPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWGTDWE 308
QPV +D+ ++FQ Y GYG T++G YWIVKNS G W
Sbjct: 273 QPVTAVVDSSSREFQLYESGVFDGTCGTNLNHGVVVVGYG-TENGRDYWIVKNSRGDTWG 331
Query: 309 EKGYIRMLRGIDAEEGLCGITLEASYPVK 337
E GY++M R I GLCGI + ASYP+K
Sbjct: 332 EAGYMKMARNIANPRGLCGIAMRASYPLK 360
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 250 bits (638), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 141/329 (42%), Positives = 189/329 (57%), Gaps = 49/329 (14%)
Query: 35 LYERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLRLNRFADMTNHEFMS 93
++E W H V + EK+ R +F+ NL+ I+ N + Y+L L FAD++ HE+
Sbjct: 48 IFESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEY-- 105
Query: 94 SRSSKVSHHRMLHGPRRQTGFM-----HGKTQD--LPPSVDWRKQGAVTGVKDQGRCGSC 146
+V H PR FM + + D LP SVDWR +GAVT VKDQG C SC
Sbjct: 106 ---KEVCHGADPRPPRNHV-FMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSC 161
Query: 147 WAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEK 206
WAFSTV +VEG+NKI TGEL +LSEQ+L++C+K+N+GC GG +E A FI K+ GL T+
Sbjct: 162 WAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDN 221
Query: 207 SYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVAN 266
YPY A +G +C +N V++DGYE +P +DE+ALMKAVA+
Sbjct: 222 DYPYKAVNG----------------VCDGRLKENNKNVMIDGYENLPANDESALMKAVAH 265
Query: 267 QPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWGTDWE 308
QPV ID+ ++FQ Y GYG T++G YW+VKNS G W
Sbjct: 266 QPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGYG-TENGRDYWLVKNSRGITWG 324
Query: 309 EKGYIRMLRGIDAEEGLCGITLEASYPVK 337
E GY++M R I GLCGI + ASYP+K
Sbjct: 325 EAGYMKMARNIANPRGLCGIAMRASYPLK 353
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 246 bits (627), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 136/336 (40%), Positives = 182/336 (54%), Gaps = 38/336 (11%)
Query: 21 YQESDLASEECLWDLYERWR-SHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLR 79
Y + DL S E L L++ W H+ + + EK RF +F+ NL I + N+ + Y L
Sbjct: 33 YSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLG 92
Query: 80 LNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKD 139
LN FAD++N EF V+ F + + P S+DWR +GAVT VK+
Sbjct: 93 LNGFADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKN 152
Query: 140 QGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKS 199
QG CGSCWAFST+ +VEGINKI TG L LSEQELVDCDK ++GC GG +L ++A +
Sbjct: 153 QGACGSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYVA-N 211
Query: 200 EGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENA 259
G+ T K YPY AK C DK P+V + GY+ VP + E +
Sbjct: 212 NGVHTSKVYPYQAKQYKCRAT-----------------DKPGPKVKITGYKRVPSNCETS 254
Query: 260 LMKAVANQPVAVAIDAGGKDFQFYSE------------------GYGATQDGTKYWIVKN 301
+ A+ANQP++V ++AGGK FQ Y GYG T DG Y I+KN
Sbjct: 255 FLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYG-TSDGKNYIIIKN 313
Query: 302 SWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
SWG +W EKGY+R+ R +G CG+ + YP K
Sbjct: 314 SWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYPFK 349
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 236 bits (601), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 148/374 (39%), Positives = 202/374 (54%), Gaps = 64/374 (17%)
Query: 4 LVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRD-LKEKQIRFNVFKQN 62
L+ S +L+ +A F+ + + + + +YE W + S + L E + RF +FK+
Sbjct: 12 LLFFSTLLILSLA--FNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69
Query: 63 LKRIHKVN-QMDKPYKLRLNRFADMTNHEFMS--------SRSSKVSHHRMLHGPRRQTG 113
L+ I + N ++ YK+ LN+FAD+T+ EF S S +KVS+ + PR
Sbjct: 70 LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNR---YEPRVG-- 124
Query: 114 FMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQE 173
Q LP VDWR GAV +K QG CG CWAFS + +VEGINKI TG L SLSEQE
Sbjct: 125 ------QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQE 178
Query: 174 LVDCDK--DNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVH 231
L+DC + + GC+GG + FI + G+ TE++YPYTA+DG C L
Sbjct: 179 LIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDL---------- 228
Query: 232 ICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------ 285
+N V +D YE VP ++E AL AV QPV+VA+DA G F+ YS
Sbjct: 229 -------QNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGP 281
Query: 286 ------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEAS 333
GYG T+ G YWIVKNSW T W E+GY+R+LR + G CGI S
Sbjct: 282 CGTAIDHAVTIVGYG-TEGGIDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPS 339
Query: 334 YPVKLHPENSRHPR 347
YPVK + +N HP+
Sbjct: 340 YPVKYNNQN--HPK 351
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 234 bits (597), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 147/374 (39%), Positives = 202/374 (54%), Gaps = 64/374 (17%)
Query: 4 LVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRD-LKEKQIRFNVFKQN 62
L+ S +L+ +A F+ + + + + +YE W + S + L E + RF +FK+
Sbjct: 12 LLFFSTLLILSLA--FNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69
Query: 63 LKRIHKVN-QMDKPYKLRLNRFADMTNHEFMS--------SRSSKVSHHRMLHGPRRQTG 113
L+ I + N ++ YK+ LN+FAD+T+ EF S S +KVS+ + PR
Sbjct: 70 LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGSNKTKVSNR---YEPRVG-- 124
Query: 114 FMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQE 173
Q LP VDWR GAV +K QG CG CWAFS + +VEGINKI TG L SLSEQE
Sbjct: 125 ------QVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQE 178
Query: 174 LVDCDK--DNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVH 231
L+DC + + GC+GG + FI + G+ TE++YPYTA+DG C +
Sbjct: 179 LIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDL---------- 228
Query: 232 ICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE------ 285
+N V +D YE VP ++E AL AV QPV+VA+DA G F+ YS
Sbjct: 229 -------QNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGP 281
Query: 286 ------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEAS 333
GYG T+ G YWIVKNSW T W E+GY+R+LR + G CGI S
Sbjct: 282 CGTAVDHAVTIVGYG-TEGGIDYWIVKNSWDTTWGEEGYMRILRNVGG-AGTCGIATMPS 339
Query: 334 YPVKLHPENSRHPR 347
YPVK + +N HP+
Sbjct: 340 YPVKYNNQN--HPK 351
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 233 bits (595), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 135/350 (38%), Positives = 186/350 (53%), Gaps = 37/350 (10%)
Query: 2 FFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWR-SHHTVSRDLKEKQIRFNVFK 60
F +GLS FG Y ++DL S E L L+E W H+ + +++ EK RF +FK
Sbjct: 18 FVYMGLS----FGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEIFK 73
Query: 61 QNLKRIHKVNQMDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQ 120
NLK I + N+ + Y L LN FADM+N EF + ++ + + G
Sbjct: 74 DNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGDV- 132
Query: 121 DLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD 180
++P VDWR++GAVT VK+QG CGSCWAFS VV++EGI KI+TG L SEQEL+DCD+
Sbjct: 133 NIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR 192
Query: 181 NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKN 240
++GC+GG AL +A+ G+ +YPY C + +K
Sbjct: 193 SYGCNGGYPWSALQLVAQY-GIHYRNTYPYEGVQRYCR-----------------SREKG 234
Query: 241 APEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGY------------- 287
DG V +E AL+ ++ANQPV+V ++A GKDFQ Y G
Sbjct: 235 PYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAV 294
Query: 288 GATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
A G Y ++KNSWGT W E GYIR+ RG G+CG+ + YPVK
Sbjct: 295 AAVGYGPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPVK 344
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 228 bits (581), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 132/343 (38%), Positives = 179/343 (52%), Gaps = 38/343 (11%)
Query: 13 FGVAESFDYQESDLASEECLWDLYERWR-SHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQ 71
FG Y + DL S E L L+ W +H+ ++ EK RF +FK NL I + N+
Sbjct: 25 FGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNK 84
Query: 72 MDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQ 131
+ Y L LN FAD++N EF + + + F++ T +LP +VDWRK+
Sbjct: 85 KNNSYWLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDEE--FINEDTVNLPENVDWRKK 142
Query: 132 GAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQ 191
GAVT V+ QG CGSCWAFS V +VEGINKI+TG+L LSEQELVDC++ +HGC GG
Sbjct: 143 GAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPY 202
Query: 192 ALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEM 251
AL ++AK+ G+ YPY AK G+C P V G
Sbjct: 203 ALEYVAKN-GIHLRSKYPYKAKQGTCRAKQV-----------------GGPIVKTSGVGR 244
Query: 252 VPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGYGATQDGTKY--------------- 296
V ++E L+ A+A QPV+V +++ G+ FQ Y G GTK
Sbjct: 245 VQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGK 304
Query: 297 --WIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
++KNSWGT W EKGYIR+ R G+CG+ + YP K
Sbjct: 305 GYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPTK 347
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 227 bits (579), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 119/235 (50%), Positives = 144/235 (61%), Gaps = 37/235 (15%)
Query: 121 DLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD 180
DLP S+DWR+ GAV VK+QG CGSCWAFSTV +VEGIN+I TG+L SLSEQ+LVDC
Sbjct: 2 DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA 61
Query: 181 NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKN 240
NHGC GG M A FI + G+ +E++YPY +DG C N N
Sbjct: 62 NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGIC------------------NSTVN 103
Query: 241 APEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE--------------- 285
AP V +D YE VP +E +L KAVANQPV+V +DA G+DFQ Y
Sbjct: 104 APVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHAL 163
Query: 286 ---GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
GYG T++ +WIVKNSWG +W E GYIR R I+ +G CGIT ASYPVK
Sbjct: 164 TVVGYG-TENDKDFWIVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPVK 217
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 226 bits (575), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 123/320 (38%), Positives = 175/320 (54%), Gaps = 42/320 (13%)
Query: 36 YERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQMDK-PYKLRLNRFADMTNHEFMS 93
+E W + + V +D EK RF +FK N+K I N ++ Y L +N+F DMT EF++
Sbjct: 37 FEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVA 96
Query: 94 SRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVV 153
+ + P F +P S+DWR GAV VK+Q CGSCW+F+ +
Sbjct: 97 QYTGVSLPLNIEREP--VVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIA 154
Query: 154 SVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAK 213
+VEGI KIKTG L SLSEQE++DC ++GC GG + +A +FI + G+TTE++YPY A
Sbjct: 155 TVEGIYKIKTGYLVSLSEQEVLDC-AVSYGCKGGWVNKAYDFIISNNGVTTEENYPYLAY 213
Query: 214 DGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAI 273
G+C N + + GY V +DE ++M AV+NQP+A I
Sbjct: 214 QGTC------------------NANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALI 255
Query: 274 DAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRM 315
DA ++FQ+Y+ GYG GTKYWIV+NSWG+ W E GY+RM
Sbjct: 256 DA-SENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRM 314
Query: 316 LRGIDAEEGLCGITLEASYP 335
RG+ + G+CGI + +P
Sbjct: 315 ARGVSSSSGVCGIAMAPLFP 334
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 224 bits (570), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 130/335 (38%), Positives = 179/335 (53%), Gaps = 38/335 (11%)
Query: 21 YQESDLASEECLWDLYERWRSHHTVS-RDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLR 79
Y + DL S E L L+ W H + +++ EK RF +FK NLK I + N+M Y L
Sbjct: 33 YSQDDLTSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYWLG 92
Query: 80 LNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKD 139
LN F+D++N EF + + P + F++ DLP SVDWR +GAVT VK
Sbjct: 93 LNEFSDLSNDEFKEKYVGSLPED-YTNQPYDEE-FVNEDIVDLPESVDWRAKGAVTPVKH 150
Query: 140 QGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKS 199
QG C SCWAFSTV +VEGINKIKTG L LSEQELVDCDK ++GC+ G +L ++A++
Sbjct: 151 QGYCESCWAFSTVATVEGINKIKTGNLVELSEQELVDCDKQSYGCNRGYQSTSLQYVAQN 210
Query: 200 EGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENA 259
G+ YPY AK +C P+V +G V ++E +
Sbjct: 211 -GIHLRAKYPYIAKQQTCRA-----------------NQVGGPKVKTNGVGRVQSNNEGS 252
Query: 260 LMKAVANQPVAVAIDAGGKDFQFYSEGYGATQDGTKY-----------------WIVKNS 302
L+ A+A+QPV+V +++ G+DFQ Y G GTK ++KNS
Sbjct: 253 LLNAIAHQPVSVVVESAGRDFQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYILIKNS 312
Query: 303 WGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
WG W E GYIR+ R G+CG+ + YP+K
Sbjct: 313 WGPGWGENGYIRIRRASGNSPGVCGVYRSSYYPIK 347
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 224 bits (570), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 116/235 (49%), Positives = 150/235 (63%), Gaps = 37/235 (15%)
Query: 122 LPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD- 180
LP S+DWR++G + GVKDQG CGSCWAFS V ++E IN I TG L SLSEQELVDCD+
Sbjct: 18 LPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSY 77
Query: 181 NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKN 240
N GCDGGLM+ A F+ K+ G+ TE+ YPY ++G C+ YR KN
Sbjct: 78 NEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQ--------YR---------KN 120
Query: 241 APEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE--------------- 285
A V +D YE VP ++E AL KAVA+QPV++A++AGG+DFQ Y
Sbjct: 121 AKVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGV 180
Query: 286 ---GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
GYG T++G YWIV+NSWG + E GY+R+ R + + GLCG+ +E SYPVK
Sbjct: 181 VIAGYG-TENGMDYWIVRNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPVK 234
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 219 bits (557), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 121/320 (37%), Positives = 173/320 (54%), Gaps = 42/320 (13%)
Query: 36 YERWRSHH-TVSRDLKEKQIRFNVFKQNLKRIHKVNQMD-KPYKLRLNRFADMTNHEFMS 93
+E W + + V +D EK +RF +FK N+ I N + Y L +N+F DMTN+EF++
Sbjct: 37 FEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNEFVA 96
Query: 94 SRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVV 153
+ + P F +P S+DWR GAVT VK+QGRCGSCWAF+++
Sbjct: 97 QYTGLSLPLNIKREP--VVSFDDVDISSVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIA 154
Query: 154 SVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAK 213
+VE I KIK G L SLSEQ+++DC ++GC GG + +A +FI ++G+ + YPY A
Sbjct: 155 TVESIYKIKRGNLVSLSEQQVLDC-AVSYGCKGGWINKAYSFIISNKGVASAAIYPYKAA 213
Query: 214 DGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAI 273
G+C+ NG N+ + Y V ++E +M AV+NQP+A A+
Sbjct: 214 KGTCKT----------------NGVPNSAYITR--YTYVQRNNERNMMYAVSNQPIAAAL 255
Query: 274 DAGGKDFQFYSE------------------GYGATQDGTKYWIVKNSWGTDWEEKGYIRM 315
DA G +FQ Y GYG G K+WIV+NSWG W E GYIR+
Sbjct: 256 DASG-NFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNSWGAGWGEGGYIRL 314
Query: 316 LRGIDAEEGLCGITLEASYP 335
R + + GLCGI ++ YP
Sbjct: 315 ARDVSSSFGLCGIAMDPLYP 334
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 217 bits (552), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 137/345 (39%), Positives = 180/345 (52%), Gaps = 61/345 (17%)
Query: 25 DLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQM----DKPYKLRL 80
DL EE W Y+ H + E++ R +F +N +I K NQ+ YKL L
Sbjct: 22 DLIKEE--WHTYKL--QHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGL 77
Query: 81 NRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQ------DLPPSVDWRKQGAV 134
N++ADM +HEF + + +H + R +TG + G T +P SVDWR+ GAV
Sbjct: 78 NKYADMLHHEFKETMNG--YNHTLRQLMRERTGLV-GATYIPPAHVTVPKSVDWREHGAV 134
Query: 135 TGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD--NHGCDGGLMEQA 192
TGVKDQG CGSCWAFS+ ++EG + K G L SLSEQ LVDC N+GC+GGLM+ A
Sbjct: 135 TGVKDQGHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNA 194
Query: 193 LNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMV 252
+I + G+ TEKSYPY D SC + + G+ +
Sbjct: 195 FRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIG------------------ATDTGFVDI 236
Query: 253 PESDENALMKAVANQ-PVAVAIDAGGKDFQFYSE--------------------GYGATQ 291
PE DE + KAVA PV+VAIDA + FQ YSE GYG +
Sbjct: 237 PEGDEEKMKKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDE 296
Query: 292 DGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPV 336
G YW+VKNSWGT W E+GYI+M R + + CGI +SYP
Sbjct: 297 SGMDYWLVKNSWGTTWGEQGYIKMARNQNNQ---CGIATASSYPT 338
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 215 bits (547), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 130/337 (38%), Positives = 177/337 (52%), Gaps = 56/337 (16%)
Query: 35 LYERWRS----HHTVSRDLKEKQIRFNVFKQNLKRIHKVNQM----DKPYKLRLNRFADM 86
+ E W + H +D E++ R +F +N +I K NQ +KL +N++AD+
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 87 TNHEFMSSRSS-KVSHHRMLHGPR---RQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGR 142
+HEF + + H+ L + F+ LP SVDWR +GAVT VKDQG
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 143 CGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD--NHGCDGGLMEQALNFIAKSE 200
CGSCWAFS+ ++EG + K+G L SLSEQ LVDC N+GC+GGLM+ A +I +
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 201 GLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENAL 260
G+ TEKSYPY A D SC V R G+ +P+ DE +
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGATDR------------------GFTDIPQGDEKKM 276
Query: 261 MKAVAN-QPVAVAIDAGGKDFQFYSE--------------------GYGATQDGTKYWIV 299
+AVA PV+VAIDA + FQFYSE G+G + G YW+V
Sbjct: 277 AEAVATVGPVSVAIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLV 336
Query: 300 KNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPV 336
KNSWGT W +KG+I+MLR +E CGI +SYP+
Sbjct: 337 KNSWGTTWGDKGFIKMLRN---KENQCGIASASSYPL 370
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 211 bits (536), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 106/233 (45%), Positives = 138/233 (59%), Gaps = 35/233 (15%)
Query: 122 LPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDN 181
LP S+DWR++GAV VK+QG CGSCWAF + +VEGIN+I TG+L SLSEQ+LVDC N
Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTRN 62
Query: 182 HGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNA 241
HGC+GG +A +I + G+ +E+ YPYT +G+C+ +NA
Sbjct: 63 HGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCDT------------------KENA 104
Query: 242 PEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGYGA------------ 289
V +D Y VP +DE +L KAVANQPV+V +DA G+DFQ Y G
Sbjct: 105 HVVSIDSYRNVPSNDEKSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRT 164
Query: 290 -----TQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
T++ YW VKNSWG +W E GYIR+ R I G CGI + SYP+K
Sbjct: 165 VGGRETENDKDYWTVKNSWGKNWGESGYIRVERNIAESSGKCGIAISPSYPIK 217
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
Length = 215
Score = 209 bits (533), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 107/234 (45%), Positives = 142/234 (60%), Gaps = 38/234 (16%)
Query: 122 LPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDN 181
LP VDWR +GAV +K+Q +CGSCWAFS V +VE INKI+TG+L SLSEQELVDCD +
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDTAS 60
Query: 182 HGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNA 241
HGC+GG M A +I + G+ T+++YPY+A GSC+ YR+ + S
Sbjct: 61 HGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKP--------YRLRVVS------- 105
Query: 242 PEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSE---------------- 285
++G++ V ++E+AL AVA+QPV+V ++A G FQ YS
Sbjct: 106 ----INGFQRVTRNNESALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVV 161
Query: 286 --GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
GYG TQ G YWIV+NSWG +W +GYI M R + + GLCGI SYP K
Sbjct: 162 IVGYG-TQSGKNYWIVRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYPTK 214
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 206 bits (524), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/325 (37%), Positives = 173/325 (53%), Gaps = 39/325 (12%)
Query: 35 LYERWRSHHTVSRD-LKEKQIRFNVFKQNLKRIHKVNQ-MDKPYKLRLNRFADMTNHEFM 92
+YE+W + + + L EK+ RF +FK NLKRI + N ++ Y+ LN+F+D+T EF
Sbjct: 40 MYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQ 99
Query: 93 SSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTG-VKDQGRCGSCWAFST 151
+S + L + + G LP VDWR++GAV VK QG CGSCWAF+
Sbjct: 100 ASYLGGKMEKKSLSDVAERYQYKEGDV--LPDEVDWRERGAVVPRVKRQGECGSCWAFAA 157
Query: 152 VVSVEGINKIKTGELWSLSEQELVDCDK--DNHGCDGGLMEQALNFIAKSEGLTTEKSYP 209
+VEGIN+I TGEL SLSEQEL+DCD+ DN GC GG A FI ++ G+ +++ Y
Sbjct: 158 TGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSDEVYG 217
Query: 210 YTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPV 269
YT +D + M K V ++G+E+VP +DE +L KAVA QP+
Sbjct: 218 YTGEDTAACKAIEM---------------KTTRVVTINGHEVVPVNDEMSLKKAVAYQPI 262
Query: 270 AVAIDAGGK-----------------DFQFYSEGYGATQDGTKYWIVKNSWGTDWEEKGY 312
+V I A D GYG + D YW+++NSWG +W E GY
Sbjct: 263 SVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGY 322
Query: 313 IRMLRGIDAEEGLCGITLEASYPVK 337
+R+ R G C + + YP+K
Sbjct: 323 LRLQRNFHEPTGKCAVAVAPVYPIK 347
>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
Length = 208
Score = 205 bits (521), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 115/229 (50%), Positives = 133/229 (58%), Gaps = 35/229 (15%)
Query: 122 LPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDN 181
LP +DWRK+GAVT VK+QG CGSCWAFSTV +VE IN+I+TG L SLSEQELVDCDK N
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKN 60
Query: 182 HGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNA 241
HGC GG A +I + G+ T+ +YPY A G C+ + +VSI
Sbjct: 61 HGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPCQAASKVVSI--------------- 105
Query: 242 PEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGYGATQDGTK------ 295
DGY VP +E AL +AVA QP VAIDA FQ YS G + GTK
Sbjct: 106 -----DGYNGVPFCNEXALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVT 160
Query: 296 -------YWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
YWIV+NSWG W EKGYIRMLR GLCGI YP K
Sbjct: 161 IVGYQANYWIVRNSWGRYWGEKGYIRMLR--VGGCGLCGIARLPYYPTK 207
>sp|P84346|MEX1_JACME Mexicain OS=Jacaratia mexicana PE=1 SV=1
Length = 214
Score = 201 bits (512), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 103/228 (45%), Positives = 139/228 (60%), Gaps = 31/228 (13%)
Query: 123 PPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNH 182
P S+DWR++GAVT VK+Q CGSCWAFSTV ++EGINKI TG+L SLSEQEL+DC+ +H
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYRSH 61
Query: 183 GCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAP 242
GCDGG +L ++ + G+ TE+ YPY K G C DK P
Sbjct: 62 GCDGGYQTPSLQYVVDN-GVHTEREYPYEKKQGRCRAK-----------------DKKGP 103
Query: 243 EVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGY-----GATQD----- 292
+V + GY+ VP +DE +L++A+ANQPV+V D+ G+ FQFY G G D
Sbjct: 104 KVYITGYKYVPANDEISLIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTA 163
Query: 293 ---GTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
G Y ++KNSWG +W EKGYIR+ R +G CG+ + +P+K
Sbjct: 164 VGYGKTYLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFFPIK 211
>sp|P54640|CYSP5_DICDI Cysteine proteinase 5 OS=Dictyostelium discoideum GN=cprE PE=2 SV=2
Length = 344
Score = 196 bits (498), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 122/341 (35%), Positives = 161/341 (47%), Gaps = 63/341 (18%)
Query: 34 DLYERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVNQMDKPYKLRLNRFADMTNHEFMS 93
+ + W H S +E R+N+FK N+ + + N L LN FAD+TN E+ +
Sbjct: 28 NAFTDWMITHQKSYTSEEFGARYNIFKANMDYVQQWNSKGSETVLGLNNFADITNEEYRN 87
Query: 94 SRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVV 153
+ L G + + F T S DWR +GAVT VK+QG+CG CW+FST
Sbjct: 88 TYLGTKFDASSLIGTQEEKVF----TTSSAASKDWRSEGAVTPVKNQGQCGGCWSFSTTG 143
Query: 154 SVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAK 213
S EG + GEL SLSEQ L+DC +N GCDGGLM A +I + G+ TE SYPY A+
Sbjct: 144 STEGAHFQSKGELVSLSEQNLIDCSTENSGCDGGLMTYAFEYIINNNGIDTESSYPYKAE 203
Query: 214 DGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVANQPVAVAI 273
+G CE + +N+ L Y+ V E++L AV PV+VAI
Sbjct: 204 NGKCEYKS-----------------ENSG-ATLSSYKTVTAGSESSLESAVNVNPVSVAI 245
Query: 274 DAGGKDFQFYSEG-------------YGATQDG-------------------------TK 295
DA + FQ Y+ G +G G +
Sbjct: 246 DASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSSGQSSGNLSASSSNE 305
Query: 296 YWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPV 336
YWIVKNSWGT W +GYI M R D CGI AS+PV
Sbjct: 306 YWIVKNSWGTSWGIEGYILMSRNRDNN---CGIASSASFPV 343
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 194 bits (494), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 118/317 (37%), Positives = 162/317 (51%), Gaps = 56/317 (17%)
Query: 48 DLKEKQIRFNVFKQNLKRIHKVNQM----DKPYKLRLNRFADMTNHEFMSSRSSKVSHHR 103
DL+E++ R NVF NL+ I + N+ + Y L +N+F+DMTN +F +
Sbjct: 33 DLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKFNAVMKG------ 86
Query: 104 MLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKT 163
GPR F VDWR +GAVT VKDQG+CGSCWAFST +EG + +KT
Sbjct: 87 YKKGPRPAAVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEGQHFLKT 146
Query: 164 GELWSLSEQELVDCDKD---NHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELP 220
G L SLSEQ+LVDC N GC+GG +E+A+ ++ + G+ TE SYPY A+D +C
Sbjct: 147 GRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEARDNTCRF- 205
Query: 221 TSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVAN-QPVAVAIDAGGKD 279
+ N GY + + E+AL A + P++VAIDA +
Sbjct: 206 -----------------NSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRS 248
Query: 280 FQFY--------------------SEGYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGI 319
FQ Y + GYG ++ G +W+VKNSW T W E GYI+M R
Sbjct: 249 FQSYYTGVYYEPSCSSSQLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGESGYIKMARNR 307
Query: 320 DAEEGLCGITLEASYPV 336
+ CGI +A YP
Sbjct: 308 NNN---CGIATDACYPT 321
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 193 bits (490), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 124/355 (34%), Positives = 177/355 (49%), Gaps = 46/355 (12%)
Query: 7 LSLVLVFG-VAESFDY-QESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQNLK 64
LS+ L+F + S + ++ S + D + W + + KE R+ FK+N+
Sbjct: 3 LSITLIFTLIVLSISFISAGNVFSHKQYQDSFIDWMRSNNKAYTHKEFMPRYEEFKKNMD 62
Query: 65 RIHKVNQMDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHGKTQ-DLP 123
+H N L LN+ AD++N E+ + +H ++ +R G + Q P
Sbjct: 63 YVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLNRPQFKQP 122
Query: 124 PSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKD--N 181
+VDWR++ AVT VKDQG+CGSC++FST SVEG+ IKTG+L SLSEQ ++DC N
Sbjct: 123 LNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGN 182
Query: 182 HGCDGGLMEQALNFIAKSEGLTTEKSYPYTAK-DGSCELPTSMVSIIYRVHICSWNGDKN 240
GC+GGLM A +I K+ GL +E+ YPY K + C+ V+
Sbjct: 183 EGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGSVA--------------- 227
Query: 241 APEVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEG-------------Y 287
+ Y+ + DEN L A+ PV+VAIDA FQ Y+ G +
Sbjct: 228 ---AKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDH 284
Query: 288 G------ATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPV 336
G T +G Y+IVKNSWG W GYI M R D CGI+ ASYP+
Sbjct: 285 GVLAVGMGTDNGEDYYIVKNSWGPSWGLNGYIHMARNKDNN---CGISTMASYPI 336
>sp|O60911|CATL2_HUMAN Cathepsin L2 OS=Homo sapiens GN=CTSL2 PE=1 SV=2
Length = 334
Score = 192 bits (488), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 133/373 (35%), Positives = 184/373 (49%), Gaps = 83/373 (22%)
Query: 5 VGLSLVLV---FGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFKQ 61
+ LSLVL G+A + + +L ++ + +W++ H E+ R V+++
Sbjct: 1 MNLSLVLAAFCLGIASAVPKFDQNLDTK------WYQWKATHRRLYGANEEGWRRAVWEK 54
Query: 62 NLKRIHKVN----QMDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMHG 117
N+K I N Q + + +N F DMTN EF R + G R F G
Sbjct: 55 NMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEF-----------RQMMGCFRNQKFRKG 103
Query: 118 KT------QDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSE 171
K DLP SVDWRK+G VT VK+Q +CGSCWAFS ++EG KTG+L SLSE
Sbjct: 104 KVFREPLFLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSE 163
Query: 172 QELVDCDK--DNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYR 229
Q LVDC + N GC+GG M +A ++ ++ GL +E+SYPY A D C+ YR
Sbjct: 164 QNLVDCSRPQGNQGCNGGFMARAFQYVKENGGLDSEESYPYVAVDEICK---------YR 214
Query: 230 VHICSWNGDKNAPEVIL---DGYEMVPESDENALMKAVAN-QPVAVAIDAGGKDFQFYSE 285
PE + G+ +V E ALMKAVA P++VA+DAG FQFY
Sbjct: 215 ------------PENSVANDTGFTVVAPGKEKALMKAVATVGPISVAMDAGHSSFQFYKS 262
Query: 286 GY-----------------------GATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAE 322
G GA + +KYW+VKNSWG +W GY+++ + +
Sbjct: 263 GIYFEPDCSSKNLDHGVLVVGYGFEGANSNNSKYWLVKNSWGPEWGSNGYVKIAKDKNNH 322
Query: 323 EGLCGITLEASYP 335
CGI ASYP
Sbjct: 323 ---CGIATAASYP 332
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 191 bits (486), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 120/323 (37%), Positives = 165/323 (51%), Gaps = 67/323 (20%)
Query: 48 DLKEKQIRFNVFKQNLKRIHKVNQM----DKPYKLRLNRFADMTNHEF-------MSSRS 96
D +E R +F+QN K I + N+ + + L +N+F DMT EF + RS
Sbjct: 33 DAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEFNAVMKGNIPRRS 92
Query: 97 SKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVE 156
+ VS + P+++TG VDWR +GAVT VKDQG+CGSCWAFST S+E
Sbjct: 93 APVS----VFYPKKETG-------PQATEVDWRTKGAVTPVKDQGQCGSCWAFSTTGSLE 141
Query: 157 GINKIKTGELWSLSEQELVDCDK--DNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKD 214
G + +KTG L SL+EQ+LVDC + GC+GG M A ++I + G+ TE +YPY A+D
Sbjct: 142 GQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPYEARD 201
Query: 215 GSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVAN-QPVAVAI 273
GSC D N+ G+ + E L +AV + P++V I
Sbjct: 202 GSCRF------------------DSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTI 243
Query: 274 DAGGKDFQFYSE--------------------GYGATQDGTKYWIVKNSWGTDWEEKGYI 313
DA FQFYS GYG ++ G +W+VKNSW T W + GYI
Sbjct: 244 DAAHSSFQFYSSGVYYEPSCSPSYLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGDAGYI 302
Query: 314 RMLRGIDAEEGLCGITLEASYPV 336
+M R + CGI ASYP+
Sbjct: 303 KMSRNRNNN---CGIATVASYPL 322
>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 191 bits (485), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 123/331 (37%), Positives = 171/331 (51%), Gaps = 57/331 (17%)
Query: 36 YERWRSHHTVSRDLKEKQIRFNVFKQNLKRIHKVN----QMDKPYKLRLNRFADMTNHEF 91
+ +W+S H E++ R ++++N++ I N + + +N F DMTN EF
Sbjct: 29 WHQWKSTHRRLYGTNEEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEF 88
Query: 92 MSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFST 151
+ H + G Q M +P SVDWR++G VT VK+QG+CGSCWAFS
Sbjct: 89 RQVVNG-YRHQKHKKGRLFQEPLM----LKIPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 152 VVSVEGINKIKTGELWSLSEQELVDCD--KDNHGCDGGLMEQALNFIAKSEGLTTEKSYP 209
+EG +KTG+L SLSEQ LVDC + N GC+GGLM+ A +I ++ GL +E+SYP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203
Query: 210 YTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDENALMKAVAN-QP 268
Y AKDGSC+ YR N G+ +P+ E ALMKAVA P
Sbjct: 204 YEAKDGSCK---------YRAEFAVAND---------TGFVDIPQ-QEKALMKAVATVGP 244
Query: 269 VAVAIDAGGKDFQFYSEGY-----------------------GATQDGTKYWIVKNSWGT 305
++VA+DA QFYS G G + KYW+VKNSWG+
Sbjct: 245 ISVAMDASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGS 304
Query: 306 DWEEKGYIRMLRGIDAEEGLCGITLEASYPV 336
+W +GYI++ + D CG+ ASYPV
Sbjct: 305 EWGMEGYIKIAKDRDNH---CGLATAASYPV 332
>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
Length = 330
Score = 191 bits (485), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 129/360 (35%), Positives = 187/360 (51%), Gaps = 64/360 (17%)
Query: 6 GLSLVLVFGVAESFDYQESDLASEECLWDLYER-WRSHHTVSRDLKEKQIRFNVFKQNLK 64
GL ++L+ +A Y E L ++ WDL+++ +R + D +++ ++++NLK
Sbjct: 3 GLEVLLLLPMASFALYPEEILDTQ---WDLWKKTYRKQYNSKVDELSRRL---IWEKNLK 56
Query: 65 RIHKVNQMDKP-----YKLRLNRFADMTNHEF---MSSRSSKVSHHRMLHGPRRQTGFMH 116
I ++ ++ Y+L +N DMT+ E M+ SH R T ++
Sbjct: 57 HI-SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSHSR-----SNDTLYIP 110
Query: 117 GKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVD 176
P SVD+RK+G VT VK+QG+CGSCWAFS+V ++EG K KTG+L +LS Q LVD
Sbjct: 111 DWESRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVD 170
Query: 177 CDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWN 236
C +N GC GG M A ++ K+ G+ +E +YPY +D SC M + + C
Sbjct: 171 CVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDESC-----MYNPTGKAAKCR-- 223
Query: 237 GDKNAPEVILDGYEMVPESDENALMKAVAN-QPVAVAIDAGGKDFQFYSE---------- 285
GY +PE +E AL +AVA P++VAIDA FQFYS+
Sbjct: 224 -----------GYREIPEGNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDENCNS 272
Query: 286 ----------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYP 335
GYG Q G K+WI+KNSWG +W KGYI M R + CGI AS+P
Sbjct: 273 DNLNHAVLAVGYG-IQKGNKHWIIKNSWGENWGNKGYILMARN---KNNACGIANLASFP 328
>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
Length = 329
Score = 190 bits (483), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 124/338 (36%), Positives = 177/338 (52%), Gaps = 54/338 (15%)
Query: 26 LASEECLWDLYERWRSHHTVSRDLKEKQI-RFNVFKQNLKRIHKVNQMDKP-----YKLR 79
L+ EE L +E W+ H + K +I R ++++NLK+I V+ ++ Y+L
Sbjct: 16 LSPEETLDTQWELWKKTHGKQYNSKVDEISRRLIWEKNLKKI-SVHNLEASLGAHTYELA 74
Query: 80 LNRFADMTNHEFMSSRSS-KVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTGVK 138
+N DMT+ E + + +V R T G+ +P S+D+RK+G VT VK
Sbjct: 75 MNHLGDMTSEEVVQKMTGLRVPPSRSFSNDTLYTPEWEGR---VPDSIDYRKKGYVTPVK 131
Query: 139 DQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFIAK 198
+QG+CGSCWAFS+ ++EG K KTG+L +LS Q LVDC +N+GC GG M A ++ +
Sbjct: 132 NQGQCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCVSENYGCGGGYMTTAFQYVQQ 191
Query: 199 SEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESDEN 258
+ G+ +E +YPY +D SC M + + C GY +P +E
Sbjct: 192 NGGIDSEDAYPYVGQDESC-----MYNATAKAAKCR-------------GYREIPVGNEK 233
Query: 259 ALMKAVAN-QPVAVAIDAGGKDFQFYSE--------------------GYGATQDGTKYW 297
AL +AVA PV+V+IDA FQFYS GYG TQ G KYW
Sbjct: 234 ALKRAVARVGPVSVSIDASLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYG-TQKGNKYW 292
Query: 298 IVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYP 335
I+KNSWG W KGY+ + R + CGIT AS+P
Sbjct: 293 IIKNSWGESWGNKGYVLLARN---KNNACGITNLASFP 327
>sp|Q9GKL8|CATL1_CHLAE Cathepsin L1 OS=Chlorocebus aethiops GN=CTSL1 PE=1 SV=1
Length = 333
Score = 190 bits (482), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 133/369 (36%), Positives = 183/369 (49%), Gaps = 73/369 (19%)
Query: 1 TFFLVGLSLVLVFGVAESFDYQESDLASEECLWDLYERWRSHHTVSRDLKEKQIRFNVFK 60
TF L L L G+A + L L + +W++ H + E+ R V++
Sbjct: 4 TFILAALCL----GIASA------TLTFNHSLEAQWTKWKAMHNRLYGMNEEGWRRAVWE 53
Query: 61 QNLKRI----HKVNQMDKPYKLRLNRFADMTNHEFMSSRSSKVSHHRMLHGPRRQTGFMH 116
+N+K I + +Q + + +N F DMT+ EF +V + PR+ F
Sbjct: 54 KNMKMIELHNQEYSQGKHSFTMAMNTFGDMTSEEF-----RQVMNGFQNRKPRKGKVFQE 108
Query: 117 GKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVD 176
+ P SVDWR++G VT VK+QG+CGSCWAFS ++EG KTG+L SLSEQ LVD
Sbjct: 109 PLFYEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVD 168
Query: 177 CD--KDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICS 234
C + N GC+GGLM+ A ++A + GL +E+SYPY A + SC
Sbjct: 169 CSGPQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESC----------------- 211
Query: 235 WNGDKNAPEVIL---DGYEMVPESDENALMKAVAN-QPVAVAIDAGGKDFQFYSE----- 285
K PE + G+ +P+ E ALMKAVA P++VAIDAG + F FY E
Sbjct: 212 ----KYNPEYSVANDTGFVDIPK-QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFE 266
Query: 286 ---------------GYG---ATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCG 327
GYG D +KYW+VKNSWG +W GYI+M + CG
Sbjct: 267 PDCSSEDMDHGVLVVGYGFESTESDNSKYWLVKNSWGEEWGMGGYIKMAKD---RRNHCG 323
Query: 328 ITLEASYPV 336
I ASYP
Sbjct: 324 IASAASYPT 332
>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
Length = 330
Score = 190 bits (482), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 129/360 (35%), Positives = 187/360 (51%), Gaps = 64/360 (17%)
Query: 6 GLSLVLVFGVAESFDYQESDLASEECLWDLYER-WRSHHTVSRDLKEKQIRFNVFKQNLK 64
GL +VL+ V S Y E L ++ W+L+++ +R + D +++ ++++NLK
Sbjct: 3 GLKVVLLLPVMSSALYPEEILDTQ---WELWKKTYRKQYNSKVDEISRRL---IWEKNLK 56
Query: 65 RIHKVNQMDKP-----YKLRLNRFADMTNHEF---MSSRSSKVSHHRMLHGPRRQTGFMH 116
I ++ ++ Y+L +N DMT+ E M+ SH R T ++
Sbjct: 57 HI-SIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPPSHSR-----SNDTLYIP 110
Query: 117 GKTQDLPPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVD 176
P S+D+RK+G VT VK+QG+CGSCWAFS+V ++EG K KTG+L +LS Q LVD
Sbjct: 111 DWEGRTPDSIDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVD 170
Query: 177 CDKDNHGCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWN 236
C +N GC GG M A ++ K+ G+ +E +YPY +D +C M + + C
Sbjct: 171 CVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQDENC-----MYNPTGKAAKCR-- 223
Query: 237 GDKNAPEVILDGYEMVPESDENALMKAVAN-QPVAVAIDAGGKDFQFYSE---------- 285
GY +PE +E AL +AVA PV+VAIDA FQFYS+
Sbjct: 224 -----------GYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDENCNS 272
Query: 286 ----------GYGATQDGTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYP 335
GYG Q G K+WI+KNSWG +W KGYI M R + CGI AS+P
Sbjct: 273 DNLNHAVLAVGYG-IQKGKKHWIIKNSWGENWGNKGYILMARN---KNNACGIANLASFP 328
>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
Length = 329
Score = 189 bits (481), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 128/340 (37%), Positives = 175/340 (51%), Gaps = 58/340 (17%)
Query: 26 LASEECLWDLYERWRSHHTVSRDLKEKQI-RFNVFKQNLKRIHKVNQMDKP-----YKLR 79
L EE L +E W+ H + K +I R ++++NLK I ++ ++ Y+L
Sbjct: 16 LYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYI-SIHNLEASLGVHTYELA 74
Query: 80 LNRFADMTNHEF---MSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTG 136
+N DMTN E M+ SH R T ++ P SVD+RK+G VT
Sbjct: 75 MNHLGDMTNEEVVQKMTGLKVPASHSR-----SNDTLYIPDWEGRAPDSVDYRKKGYVTP 129
Query: 137 VKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFI 196
VK+QG+CGSCWAFS+V ++EG K KTG+L +LS Q LVDC +N GC GG M A ++
Sbjct: 130 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYV 189
Query: 197 AKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESD 256
K+ G+ +E +YPY ++ SC M + + C GY +PE +
Sbjct: 190 QKNRGIDSEDAYPYVGQEESC-----MYNPTGKAAKCR-------------GYREIPEGN 231
Query: 257 ENALMKAVAN-QPVAVAIDAGGKDFQFYSE--------------------GYGATQDGTK 295
E AL +AVA PV+VAIDA FQFYS+ GYG Q G K
Sbjct: 232 EKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQKGNK 290
Query: 296 YWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYP 335
+WI+KNSWG +W KGYI M R + CGI AS+P
Sbjct: 291 HWIIKNSWGENWGNKGYILMARN---KNNACGIANLASFP 327
>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
Length = 329
Score = 189 bits (481), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 128/340 (37%), Positives = 175/340 (51%), Gaps = 58/340 (17%)
Query: 26 LASEECLWDLYERWRSHHTVSRDLKEKQI-RFNVFKQNLKRIHKVNQMDKP-----YKLR 79
L EE L +E W+ H + K +I R ++++NLK I ++ ++ Y+L
Sbjct: 16 LYPEEILDTHWELWKKTHRKQYNSKVDEISRRLIWEKNLKYI-SIHNLEASLGVHTYELA 74
Query: 80 LNRFADMTNHEF---MSSRSSKVSHHRMLHGPRRQTGFMHGKTQDLPPSVDWRKQGAVTG 136
+N DMTN E M+ SH R T ++ P SVD+RK+G VT
Sbjct: 75 MNHLGDMTNEEVVQKMTGLKVPASHSR-----SNDTLYIPDWEGRAPDSVDYRKKGYVTP 129
Query: 137 VKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNHGCDGGLMEQALNFI 196
VK+QG+CGSCWAFS+V ++EG K KTG+L +LS Q LVDC +N GC GG M A ++
Sbjct: 130 VKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYV 189
Query: 197 AKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAPEVILDGYEMVPESD 256
K+ G+ +E +YPY ++ SC M + + C GY +PE +
Sbjct: 190 QKNRGIDSEDAYPYVGQEESC-----MYNPTGKAAKCR-------------GYREIPEGN 231
Query: 257 ENALMKAVAN-QPVAVAIDAGGKDFQFYSE--------------------GYGATQDGTK 295
E AL +AVA PV+VAIDA FQFYS+ GYG Q G K
Sbjct: 232 EKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYG-IQKGNK 290
Query: 296 YWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYP 335
+WI+KNSWG +W KGYI M R + CGI AS+P
Sbjct: 291 HWIIKNSWGENWGNKGYILMARN---KNNACGIANLASFP 327
>sp|P84347|MEX2_JACME Chymomexicain OS=Jacaratia mexicana PE=1 SV=1
Length = 215
Score = 189 bits (479), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 98/228 (42%), Positives = 134/228 (58%), Gaps = 30/228 (13%)
Query: 123 PPSVDWRKQGAVTGVKDQGRCGSCWAFSTVVSVEGINKIKTGELWSLSEQELVDCDKDNH 182
P S+DWR +GAVT VK+Q CGSCWAFSTV +VEGINKI+TG+L SLSEQEL+DCD+ +H
Sbjct: 2 PESIDWRDKGAVTPVKNQNPCGSCWAFSTVATVEGINKIRTGKLISLSEQELLDCDRRSH 61
Query: 183 GCDGGLMEQALNFIAKSEGLTTEKSYPYTAKDGSCELPTSMVSIIYRVHICSWNGDKNAP 242
GC GG ++ ++A + G+ TEK YPY K G C +K
Sbjct: 62 GCKGGYQTGSIQYVADNGGVHTEKEYPYEKKQGKCRAK-----------------EKKGT 104
Query: 243 EVILDGYEMVPESDENALMKAVANQPVAVAIDAGGKDFQFYSEGY-----GATQD----- 292
+V + GY+ VP +DE +L++ + NQPV+V ++ G+ FQ Y G G D
Sbjct: 105 KVQITGYKRVPANDEISLIQGIGNQPVSVLHESKGRAFQLYKGGIFNGPCGYKNDHAVTA 164
Query: 293 ---GTKYWIVKNSWGTDWEEKGYIRMLRGIDAEEGLCGITLEASYPVK 337
G + KNSWG +W EKGYI++ R EG CG+ + +P+K
Sbjct: 165 IGYGKAQLLDKNSWGPNWGEKGYIKIKRASGKSEGTCGVYKSSYFPIK 212
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.317 0.134 0.416
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 137,431,730
Number of Sequences: 539616
Number of extensions: 5804850
Number of successful extensions: 13455
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 216
Number of HSP's successfully gapped in prelim test: 4
Number of HSP's that attempted gapping in prelim test: 12555
Number of HSP's gapped (non-prelim): 292
length of query: 351
length of database: 191,569,459
effective HSP length: 118
effective length of query: 233
effective length of database: 127,894,771
effective search space: 29799481643
effective search space used: 29799481643
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 62 (28.5 bits)
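
The figures in the statistics footer above can be cross-checked with a few lines of arithmetic. The sketch below is not part of the BLAST output. It assumes the usual Karlin-Altschul relations (effective length = length minus effective HSP length, bit score = (Lambda * S - ln K) / ln 2) and assumes that the first Lambda/K pair is the ungapped one and the second the gapped one, which is inferred from the printed bit values rather than stated in the report. The function names are hypothetical.

```python
import math

# Values copied from the statistics footer of this report.
QUERY_LEN = 351            # "length of query"
DB_LETTERS = 191_569_459   # "length of database"
DB_SEQS = 539_616          # "Number of sequences in database"
HSP_LEN = 118              # "effective HSP length"
UNGAPPED = (0.317, 0.134)  # first Lambda, K pair (assumed ungapped)
GAPPED = (0.267, 0.0410)   # second Lambda, K pair (assumed gapped)


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # Each database sequence is shortened by the effective HSP length.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits: (lambda*S - ln K) / ln 2."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def dropoff_bits(raw_dropoff, lam):
    """Convert an X-dropoff value to bits: lambda*X / ln 2 (no K term)."""
    return lam * raw_dropoff / math.log(2)


if __name__ == "__main__":
    print(effective_search_space(QUERY_LEN, DB_LETTERS, DB_SEQS, HSP_LEN))
    print(round(bit_score(482, *GAPPED), 1))       # CATK_PIG raw score
    print(round(dropoff_bits(16, UNGAPPED[0]), 1))  # X1
    print(round(bit_score(62, *GAPPED), 1))        # S2
```

Because the footer prints Lambda and K rounded to three significant figures, a recomputed bit score can differ from the printed one in the last digit.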