BLASTP 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 041120
(340 letters)

Database: swissprot
539,616 sequences; 191,569,459 total letters

Searching..................................................done

>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 289 bits (739), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 156/322 (48%), Positives = 204/322 (63%), Gaps = 38/322 (11%)
Query: 53 YDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W ++ + Y + E +RR+ + N++YID N+ SF+L
Sbjct: 28 YGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRL 87
Query: 106 TDNKFADLSNEEFISTYLGY-NKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKD 159
N+FADL+NEE+ TYLG NKP E R S +YL LP SVDWR +GAV +KD
Sbjct: 88 GLNRFADLTNEEYRDTYLGLRNKPRRE-RKVSDRYLAADNEALPESVDWRTKGAVAEIKD 146
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QG CGSCWAFSA+AAVEGIN++ TG L+SLSEQELVDCD S N+GCNGG M+ AF+FI
Sbjct: 147 QGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT-SYNEGCNGGLMDYAFDFII 205
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------- 260
GG+ TEDDYPY+GK++RC ++ VTI YE +
Sbjct: 206 NNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIE 265
Query: 261 ---YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
AFQLYS G+F CG L+HGV VGYG ++G+ YW+V+NSWG SWGE+GY+RM R
Sbjct: 266 AGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMER 325
Query: 318 NSPSSNIGICGILMQASYPVKR 339
N +S+ G CGI ++ SYP+K+
Sbjct: 326 NIKASS-GKCGIAVEPSYPLKK 346
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 288 bits (737), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 159/344 (46%), Positives = 203/344 (59%), Gaps = 38/344 (11%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKY--DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYS 87
L L L+ L I A Y K + + ++ W +S S +E ++RF ++
Sbjct: 4 LLLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPR-SLNEREKRFNVFR 62
Query: 88 SNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNE----PRWPSVQYL--- 140
NV ++ N +N S+KL NKFADL+ EF + Y G N ++ P+ S Q++
Sbjct: 63 HNVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDH 122
Query: 141 ----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
LP+SVDWRK+GAVT +K+QG+CGSCWAFS VAAVEGINK+KT KLVSLSEQELVD
Sbjct: 123 ENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182
Query: 197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
CD +N+GCNGG ME AFEFI K GG+TTED YPY G + +C K VTI G+E
Sbjct: 183 CDT-KQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHED 241
Query: 257 IP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE 294
+P FQ YS GVF CG +LNHGV VGYG + G+
Sbjct: 242 VPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGSERGK 301
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
KYW+V+NSWG WGE GYI++ R G CGI M+ASYP+K
Sbjct: 302 KYWIVRNSWGAEWGEGGYIKIEREIDEPE-GRCGIAMEASYPIK 344
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 286 bits (732), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 147/306 (48%), Positives = 193/306 (63%), Gaps = 31/306 (10%)
Query: 62 FENWLKQYSREYGSED--EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+E WL ++ + E RRF I+ N++++D N +NLS++L +FADL+N+E+
Sbjct: 50 YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR 109
Query: 120 STYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
S YLG R S++Y LP S+DWRK+GAV VKDQG CGSCWAFS + A
Sbjct: 110 SKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGA 169
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGIN++ TG L++LSEQELVDCD S N+GCNGG M+ AFEFI K GG+ T+ DYPY+G
Sbjct: 170 VEGINQIVTGDLITLSEQELVDCDT-SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKG 228
Query: 235 KNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFD 272
+ C + VTI YE +P AFQLY G+FD
Sbjct: 229 VDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFD 288
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG QL+HGV VGYG ++G+ YW+V+NSWG SWGE+GY+RMARN SS+ G CGI ++
Sbjct: 289 GSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSS-GKCGIAIE 347
Query: 333 ASYPVK 338
SYP+K
Sbjct: 348 PSYPIK 353
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 284 bits (726), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 161/348 (46%), Positives = 208/348 (59%), Gaps = 38/348 (10%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEWQRR 82
M R VL+L +L VL G + + + + + S+ E +E W ++ S +E +R
Sbjct: 1 MKRFIVLALCMLMVLETTKGL--DFHNKDVESENSLWELYERWRSHHTVAR-SLEEKAKR 57
Query: 83 FGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN------EPRWPS 136
F ++ NV++I N ++ S+KL NKF D+++EEF TY G N ++ + S
Sbjct: 58 FNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKS 117
Query: 137 VQYLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
Y LP SVDWRK GAVTPVK+QGQCGSCWAFS V AVEGIN+++T KL SLSEQE
Sbjct: 118 FMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
LVDCD N +NQGCNGG M+ AFEFI + GG+T+E YPY+ ++ C T+K V+I G
Sbjct: 178 LVDCDTN-QNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDG 236
Query: 254 YEAIPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGED 291
+E +P FQ YS GVF CG +LNHGV VVGYG
Sbjct: 237 HEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTT 296
Query: 292 -HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G KYW+VKNSWG WGE GYIRM R G+CGI M+ASYP+K
Sbjct: 297 IDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKE-GLCGIAMEASYPLK 343
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 280 bits (717), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 153/317 (48%), Positives = 191/317 (60%), Gaps = 39/317 (12%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S+ + +E W + SR G E +RF ++ +NV ++ N + +KL NKFAD+
Sbjct: 34 ESLWDLYERWRSHHTVSRSLG---EKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADM 90
Query: 114 SNEEFISTYLG----YNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCG 164
+N EF STY G ++K + + S ++ +PASVDWRK+GAVT VKDQGQCG
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCG 150
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS + AVEGIN++KT KLVSLSEQELVDCD ENQGCNGG ME AFEFI + GG+
Sbjct: 151 SCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGI 209
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE +YPY + C K AV+I G+E +P
Sbjct: 210 TTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF C LNHGV +VGYG G YW+V+NSWG WGE GYIRM RN S
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN-IS 328
Query: 322 SNIGICGILMQASYPVK 338
G+CGI M ASYP+K
Sbjct: 329 KKEGLCGIAMMASYPIK 345
>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 280 bits (715), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 149/307 (48%), Positives = 187/307 (60%), Gaps = 30/307 (9%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
E FE+W+ ++S+ Y S +E RF ++ N+ +ID N++ S+ L N+FADL++EEF
Sbjct: 49 ELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFK 108
Query: 120 STYLGYNKP-YNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
YLG KP ++ R PS + LP SVDWRK+GAV PVKDQGQCGSCWAFS VA
Sbjct: 109 GRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVA 168
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEGIN++ TG L SLSEQEL+DCD + N GCNGG M+ AF++I GG+ EDDYPY
Sbjct: 169 AVEGINQITTGNLSSLSEQELIDCDT-TFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYL 227
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVF 271
+ CQ K VTI+GYE +P + FQ Y GVF
Sbjct: 228 MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVF 287
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
+ CG L+HGV VGYG G Y +VKNSWG WGE G+IRM RN+ G+CGI
Sbjct: 288 NGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPE-GLCGINK 346
Query: 332 QASYPVK 338
ASYP K
Sbjct: 347 MASYPTK 353
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 278 bits (710), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 188/317 (59%), Gaps = 39/317 (12%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S+ + +E W + SR G E +RF ++ +N+ ++ N + +KL NKFAD+
Sbjct: 34 ESLWDLYERWRSHHTVSRSLG---EKHKRFNVFKANLMHVHNTNKMDKPYKLKLNKFADM 90
Query: 114 SNEEFISTYLGYN---------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
+N EF STY G P+ + + + +P SVDWRK+GAVT VKDQGQCG
Sbjct: 91 TNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCG 150
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS V AVEGIN++KT KLV+LSEQELVDCD ENQGCNGG ME AFEFI + GG+
Sbjct: 151 SCWAFSTVVAVEGINQIKTNKLVALSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGI 209
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE +YPY+ + C K AV+I G+E +PA
Sbjct: 210 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 269
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF C LNHGV +VGYG G YW+V+NSWG WGE GYIRM RN S
Sbjct: 270 FQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNI-S 328
Query: 322 SNIGICGILMQASYPVK 338
G+CGI M SYP+K
Sbjct: 329 KKEGLCGIAMLPSYPIK 345
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 273 bits (697), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 190/308 (61%), Gaps = 34/308 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE+W+ ++ + YGS E +RR I+ N+++I+ N++NLS++L FADLS E+
Sbjct: 49 FESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEV 108
Query: 122 YLGYNK--PYN------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
G + P N R+ + LP SVDWR EGAVT VKDQG C SCWAFS V
Sbjct: 109 CHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVG 168
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEG+NK+ TG+LV+LSEQ+L++C N EN GC GG +E A+EFI K GG+ T++DYPY+
Sbjct: 169 AVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYK 226
Query: 234 GKNDRCQTD-KTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
N C K + V I GYE +PA FQLY GV
Sbjct: 227 AVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGV 286
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
FD CG LNHGV VVGYG ++G YWLVKNS G +WGEAGY++MARN + G+CGI
Sbjct: 287 FDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPR-GLCGIA 345
Query: 331 MQASYPVK 338
M+ASYP+K
Sbjct: 346 MRASYPLK 353
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 272 bits (695), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 196/340 (57%), Gaps = 37/340 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L I + A++ + ++ +E+WL +Y + Y S EW+RRF I+
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN--------EPRWPSVQYL 140
+++ID N+ N S+K+ N+FADL++EEF STYLG+ N EPR V
Sbjct: 70 LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPRVGQV--- 126
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP+ VDWR GAV +K QG+CG CWAFSA+A VEGINK+ TG L+SLSEQEL+DC
Sbjct: 127 -LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-- 258
+GCNGGY+ F+FI GG+ TE++YPY ++ C D VTI YE +P
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYN 245
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
A AF+ YS G+F CG ++H VT+VGYG + G YW+
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSW T+WGE GY+R+ RN + G CGI SYPVK
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA--GTCGIATMPSYPVK 343
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 271 bits (694), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 148/310 (47%), Positives = 186/310 (60%), Gaps = 35/310 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E W ++ S E Q+RF ++ N ++ N + +KL NKFAD++N EF +T
Sbjct: 38 YERWRSHHTVSR-SLHEKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRNT 96
Query: 122 YLGYNKPYNE-----PRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
Y G ++ PR + +PASVDWRK+GAVT VKDQGQCGSCWAFS +
Sbjct: 97 YSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFSTI 156
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AVEGIN++KT KLVSLSEQELVDCD + +NQGCNGG M+ AFEFI + GG+TTE +YPY
Sbjct: 157 VAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
+ C K AV+I G+E +P FQ YS GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275
Query: 271 FDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
F CG +L+HGV +VGYG G KYW VKNSWG WGE GYIRM R S G+CGI
Sbjct: 276 FTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMER-GISDKEGLCGI 334
Query: 330 LMQASYPVKR 339
M+ASYP+K+
Sbjct: 335 AMEASYPIKK 344
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 271 bits (693), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 138/291 (47%), Positives = 183/291 (62%), Gaps = 32/291 (10%)
Query: 78 EWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRW 134
E +RRF ++ N++++D N+ + F+L N+FADL+NEEF +T+LG K R
Sbjct: 70 EHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGA-KVAERSRA 128
Query: 135 PSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+Y LP SVDWR++GAV PVK+QGQCGSCWAFSAV+ VE IN+L TG++++L
Sbjct: 129 AGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITL 188
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQELV+C N +N GCNGG M+ AF+FI K GG+ TEDDYPY+ + +C ++ V
Sbjct: 189 SEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVV 248
Query: 250 TITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVG 287
+I G+E +P FQLY GVF CG L+HGV VG
Sbjct: 249 SIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVG 308
Query: 288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YG D+G+ YW+V+NSWG WGE+GY+RM RN + G CGI M ASYP K
Sbjct: 309 YGTDNGKDYWIVRNSWGPKWGESGYVRMERNI-NVTTGKCGIAMMASYPTK 358
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 269 bits (688), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 145/340 (42%), Positives = 195/340 (57%), Gaps = 37/340 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L I + A++ + ++ +E+WL +Y + Y S EW+RRF I+
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN--------EPRWPSVQYL 140
+++ID N+ N S+K+ N+FADL++EEF STYL + N EPR V
Sbjct: 70 LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGSNKTKVSNRYEPRVGQV--- 126
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP+ VDWR GAV +K QG+CG CWAFSA+A VEGINK+ TG L+SLSEQEL+DC
Sbjct: 127 -LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-- 258
+GCNGGY+ F+FI GG+ TE++YPY ++ C D VTI YE +P
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYN 245
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
A AF+ YS G+F CG ++H VT+VGYG + G YW+
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWI 305
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSW T+WGE GY+R+ RN + G CGI SYPVK
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA--GTCGIATMPSYPVK 343
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 268 bits (685), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 190/308 (61%), Gaps = 34/308 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE+W+ ++ + Y S E +RR I+ N+++I N++NLS++L N+FADLS E+
Sbjct: 56 FESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGEI 115
Query: 122 YLGYNK--PYN------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
G + P N R+ + LP SVDWR EGAVT VKDQG C SCWAFS V
Sbjct: 116 CHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVG 175
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEG+NK+ TG+LV+LSEQ+L++C N EN GC GG +E A+EFI GG+ T++DYPY+
Sbjct: 176 AVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYK 233
Query: 234 GKNDRCQTD-KTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
N C+ K + V I GYE +PA FQLY GV
Sbjct: 234 ALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGV 293
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
FD CG LNHGV VVGYG ++G YW+VKNS G +WGEAGY++MARN + G+CGI
Sbjct: 294 FDGTCGTNLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPR-GLCGIA 352
Query: 331 MQASYPVK 338
M+ASYP+K
Sbjct: 353 MRASYPLK 360
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 268 bits (684), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 187/323 (57%), Gaps = 41/323 (12%)
Query: 53 YDPQSME------ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLT 106
Y P+ +E E FENW+ + + Y + +E RF ++ N+++ID N + S+ L
Sbjct: 36 YSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLG 95
Query: 107 DNKFADLSNEEFISTYLGYN---------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPV 157
N+FADLS+EEF YLG + Y E + V+ +P SVDWRK+GAV V
Sbjct: 96 LNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVE--AVPKSVDWRKKGAVAEV 153
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
K+QG CGSCWAFS VAAVEGINK+ TG L +LSEQEL+DCD + N GCNGG M+ AFE+
Sbjct: 154 KNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT-TYNNGCNGGLMDYAFEY 212
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------- 260
I K GG+ E+DYPY + C+ K + VTI G++ +P
Sbjct: 213 IVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVA 272
Query: 261 -----YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
FQ YS GVFD CG L+HGV VGYG G Y +VKNSWG WGE GYIR+
Sbjct: 273 IDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIRL 332
Query: 316 ARNSPSSNIGICGILMQASYPVK 338
RN+ G+CGI AS+P K
Sbjct: 333 KRNTGKPE-GLCGINKMASFPTK 354
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 267 bits (683), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 139/292 (47%), Positives = 178/292 (60%), Gaps = 35/292 (11%)
Query: 81 RRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYLGYN----------KP 128
+RF I+ N+++ID N ++N ++KL KF DL+N+E+ YLG K
Sbjct: 72 KRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKN 131
Query: 129 YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
N+ +V +P +VDWR++GAV P+KDQG CGSCWAFS AAVEGINK+ TG+L+S
Sbjct: 132 VNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELIS 191
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
LSEQELVDCD S NQGCNGG M+ AF+FI K GG+ TE DYPYRG +C +
Sbjct: 192 LSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRV 250
Query: 249 VTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
V+I GYE +P + FQ Y G+F CG L+H V V
Sbjct: 251 VSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAV 310
Query: 287 GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GYG ++G YW+V+NSWG WGE GYIRM RN +S G CGI ++ASYPVK
Sbjct: 311 GYGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVK 362
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 267 bits (682), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 144/306 (47%), Positives = 184/306 (60%), Gaps = 36/306 (11%)
Query: 67 KQYSREYGSEDEWQRRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYLG 124
K S G ++ RF I+ N+++ID N ++N ++KL FA+L+N+E+ S YLG
Sbjct: 13 KSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLG 72
Query: 125 YN----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
K N +V +P +VDWR++GAV +KDQG CGSCWAFS AA
Sbjct: 73 ARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAA 132
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGINK+ TG+LVSLSEQELVDCD S NQGCNGG M+ AF+FI K GG+ TE DYPY G
Sbjct: 133 VEGINKIVTGELVSLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHG 191
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD 272
N +C + VTI GYE +P++ AFQ Y G+F
Sbjct: 192 TNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFT 251
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG ++H V VGYG ++G YW+V+NSWGT WGE GYIRM RN S + G CGI ++
Sbjct: 252 GKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASKS-GKCGIAIE 310
Query: 333 ASYPVK 338
ASYPVK
Sbjct: 311 ASYPVK 316
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 266 bits (679), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 145/296 (48%), Positives = 184/296 (62%), Gaps = 37/296 (12%)
Query: 77 DEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP 135
DE RRF ++ NV++I N ++ +KL NKF D++N+EF S Y G ++ +
Sbjct: 54 DEKNRRFNVFKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRG 113
Query: 136 SVQYLG---------LPA-SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGK 185
+ G LPA S+DWR +GAVT VKDQGQCGSCWAFS +A+VEGIN++KTG+
Sbjct: 114 IQKNTGSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGE 173
Query: 186 LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK 245
LVSLSEQELVDCD S N+GCNGG M+ AFEFI K G+TTED YPY ++ C ++
Sbjct: 174 LVSLSEQELVDCDT-SYNEGCNGGLMDYAFEFIQK-NGITTEDSYPYAEQDGTCASNLLN 231
Query: 246 HHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGV 283
V+I G++ +PA Y FQ YS GVF CG +L+HGV
Sbjct: 232 SPVVSIDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGV 291
Query: 284 TVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+VGYG G KYW+VKNSWG WGE+GYIRM R S G CGI M+ASYP+K
Sbjct: 292 AIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQR-GISDKRGKCGIAMEASYPIK 346
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 265 bits (678), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 188/310 (60%), Gaps = 36/310 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E W +S S E +RF ++ NV ++ N +N +KL N+FAD+++ EF S+
Sbjct: 38 YERWRGHHSVSRASH-EAIKRFNVFRHNVLHVHRTNKKNKPYKLKINRFADITHHEFRSS 96
Query: 122 YLGYNKPYNE----PRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
Y G N ++ P+ S ++ +P+SVDWR++GAVT VK+Q CGSCWAFS V
Sbjct: 97 YAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTV 156
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AAVEGINK++T KLVSLSEQELVDCD ENQGC GG ME AFEFI GG+ TE+ YPY
Sbjct: 157 AAVEGINKIRTNKLVSLSEQELVDCDT-EENQGCAGGLMEPAFEFIKNNGGIKTEETYPY 215
Query: 233 RGKNDR-CQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHG 269
+ + C+ + VTI G+E +P FQLYS G
Sbjct: 216 DSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEG 275
Query: 270 VFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
VF CG QLNHGV +VGYGE +G KYW+V+NSWG WGE GY+R+ R S N G CG
Sbjct: 276 VFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIER-GISENEGRCG 334
Query: 329 ILMQASYPVK 338
I M+ASYP K
Sbjct: 335 IAMEASYPTK 344
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 265 bits (676), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 150/345 (43%), Positives = 202/345 (58%), Gaps = 38/345 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M ++ L LFL + P+ A S P DP M +RFE W+ +Y R Y +DE
Sbjct: 1 MASKVQLVFLFLFLCAMWASPSAA-SRDEPN--DP--MMKRFEEWMAEYGRVYKDDDEKM 55
Query: 81 RRFGIYSSNVQYIDYINSQNL-SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY 139
RRF I+ +NV++I+ NS+N S+ L N+F D++ EF++ Y G + P N R P V +
Sbjct: 56 RRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSF 115
Query: 140 -----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+P S+DWR GAV VK+Q CGSCW+F+A+A VEGI K+KTG LVSLSEQE+
Sbjct: 116 DDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEV 175
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
+DC V + GC GG++ KA++FI GVTTE++YPY C + + A ITGY
Sbjct: 176 LDCAV---SYGCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAY-ITGY 231
Query: 255 E---------------------AIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-H 292
I A FQ Y+ GVF CG LNH +T++GYG+D
Sbjct: 232 SYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSS 291
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G KYW+V+NSWG+SWGE GY+RMAR SS+ G+CGI M +P
Sbjct: 292 GTKYWIVRNSWGSSWGEGGYVRMARGVSSSS-GVCGIAMAPLFPT 335
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 262 bits (669), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 180/307 (58%), Gaps = 31/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFIS 120
+E WL + + Y E +RRF I+ N++++D NS + +F++ +FADL+NEEF +
Sbjct: 44 YEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRA 103
Query: 121 TYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
YL + + +YL LP VDWR GAV VKDQG CGSCWAFSAV AV
Sbjct: 104 IYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAV 163
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGIN++ TG+L+SLSEQELVDCD N GC+GG M AFEFI K GG+ T+ DYPY
Sbjct: 164 EGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 236 N-DRCQTDKTKH-HAVTITGYEAIP----------------------ARYAFQLYSHGVF 271
+ C DK + VTI GYE +P + AFQLY GV
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVM 283
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG L+HGV VVGYG GE YW+++NSWG +WG++GY+++ RN G CGI M
Sbjct: 284 TGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDP-FGKCGIAM 342
Query: 332 QASYPVK 338
SYP K
Sbjct: 343 MPSYPTK 349
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 261 bits (668), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 143/293 (48%), Positives = 176/293 (60%), Gaps = 35/293 (11%)
Query: 78 EWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRW 134
E +RRF ++ N++++D N+ + F+L N+FADL+N EF +TYLG P R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 135 PSVQYL-----GLPASVDWRKEGAVT-PVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
Y LP SVDWR +GAV PVK+QGQCGSCWAFSAVAAVEGINK+ TG+LVS
Sbjct: 143 VGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVS 202
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
LSEQELV+C N +N GCNGG M+ AF FI + GG+ TE+DYPY + +C K
Sbjct: 203 LSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSRKV 262
Query: 249 VTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
V+I G+E +P FQLY GVF CG L+HGV V
Sbjct: 263 VSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVVAV 322
Query: 287 GYGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GYG D G YW V+NSWG WGE GYIRM RN ++ G CGI M ASYP+
Sbjct: 323 GYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNV-TARTGKCGIAMMASYPI 374
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 254 bits (649), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 186/318 (58%), Gaps = 38/318 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLS 114
+++ + +E W + R E RRFG + SN +I N + + ++L N+F D+
Sbjct: 40 EALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDMD 98
Query: 115 NEEFISTYLG---YNKPYNEPRWPSVQYLGL-----PASVDWRKEGAVTPVKDQGQCGSC 166
EF +T++G + P P P Y L P SVDWR++GAVT VKDQG+CGSC
Sbjct: 99 QAEFRATFVGDLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSC 158
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V +VEGIN ++TG LVSLSEQEL+DCD ++N GC GG M+ AFE+I GG+ T
Sbjct: 159 WAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGGLIT 217
Query: 227 EDDYPYRGKNDRCQTDKTKHHA---VTITGYEAIPARY---------------------- 261
E YPYR C + ++ V I G++ +PA
Sbjct: 218 EAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGK 277
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AF YS GVF CG +L+HGV VVGYG + G+ YW VKNSWG SWGE GYIR+ ++S
Sbjct: 278 AFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 321 SSNIGICGILMQASYPVK 338
+S G+CGI M+ASYPVK
Sbjct: 338 ASG-GLCGIAMEASYPVK 354
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 254 bits (648), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 186/318 (58%), Gaps = 38/318 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLS 114
+++ + +E W + R E RRFG + SN +I N + + ++L N+F D+
Sbjct: 40 EALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDMD 98
Query: 115 NEEFISTYLG---YNKPYNEPRWPSVQYLGL-----PASVDWRKEGAVTPVKDQGQCGSC 166
EF +T++G + P P P Y L P SVDWR++GAVT VKDQG+CGSC
Sbjct: 99 QAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSC 158
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V +VEGIN ++TG LVSLSEQEL+DCD ++N GC GG M+ AFE+I GG+ T
Sbjct: 159 WAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGGLIT 217
Query: 227 EDDYPYRGKNDRCQTDKTKHHA---VTITGYEAIPARY---------------------- 261
E YPYR C + ++ V I G++ +PA
Sbjct: 218 EAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGK 277
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AF YS GVF CG +L+HGV VVGYG + G+ YW VKNSWG SWGE GYIR+ ++S
Sbjct: 278 AFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 321 SSNIGICGILMQASYPVK 338
+S G+CGI M+ASYPVK
Sbjct: 338 ASG-GLCGIAMEASYPVK 354
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 251 bits (641), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 140/345 (40%), Positives = 198/345 (57%), Gaps = 38/345 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M ++ L LFL + P+ A + + DP M ++FE W+ +Y R Y DE
Sbjct: 1 MTSKVQLVFLFLFLCVMWASPSAASCD---EPSDP--MMKQFEEWMAEYGRVYKDNDEKM 55
Query: 81 RRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY 139
RF I+ +NV +I+ N++N S+ L N+F D++N EF++ Y G + P N R P V +
Sbjct: 56 LRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVSF 115
Query: 140 -----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+P S+DWR GAVT VK+QG+CGSCWAF+++A VE I K+K G LVSLSEQ++
Sbjct: 116 DDVDISSVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQV 175
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
+DC V + GC GG++ KA+ FI GV + YPY+ C+T+ + A IT Y
Sbjct: 176 LDCAV---SYGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNSAY-ITRY 231
Query: 255 ---------------------EAIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-H 292
A+ A FQ Y GVF CG +LNH + ++GYG+D
Sbjct: 232 TYVQRNNERNMMYAVSNQPIAAALDASGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSS 291
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G+K+W+V+NSWG WGE GYIR+AR+ SS+ G+CGI M YP
Sbjct: 292 GKKFWIVRNSWGAGWGEGGYIRLARDV-SSSFGLCGIAMDPLYPT 335
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 243 bits (621), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 141/351 (40%), Positives = 197/351 (56%), Gaps = 38/351 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIP-AGAWSEGYPQKYDPQSME---ERFENWLKQYSREYGSE 76
M + + L+ L+ +G+ A ++ GY Q D S+E + F++W+ ++++ Y S
Sbjct: 4 MSSISKIIFLATCLIIHMGLSSADFYTVGYSQD-DLTSIERLIQLFDSWMLKHNKIYESI 62
Query: 77 DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYN-------KPY 129
DE RF I+ N+ YID N +N S+ L N FADLSN+EF Y+G+ + +
Sbjct: 63 DEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHF 122
Query: 130 NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+ + P S+DWR +GAVTPVK+QG CGSCWAFS +A VEGINK+ TG L+ L
Sbjct: 123 DNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLEL 182
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQELVDCD +S GC GGY + +++ GV T YPY+ K +C+ V
Sbjct: 183 SEQELVDCDKHS--YGCKGGYQTTSLQYVAN-NGVHTSKVYPYQAKQYKCRATDKPGPKV 239
Query: 250 TITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVG 287
ITGY+ +P+ FQLY GVFD CG +L+H VT VG
Sbjct: 240 KITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVG 299
Query: 288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YG G+ Y ++KNSWG +WGE GY+R+ R S +S G CG+ + YP K
Sbjct: 300 YGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYPFK 349
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 225 bits (574), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 188/323 (58%), Gaps = 39/323 (12%)
Query: 48 GYPQKYDPQSMEER----FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSF 103
GY Q D + ER F +W+ +++ Y + DE RF I+ N+ YID N +N S+
Sbjct: 32 GYSQ--DDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSY 89
Query: 104 KLTDNKFADLSNEEFISTYLG------YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPV 157
L N+FADLSN+EF Y+G + Y+E + + + LP +VDWRK+GAVTPV
Sbjct: 90 WLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDE-EFINEDTVNLPENVDWRKKGAVTPV 148
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
+ QG CGSCWAFSAVA VEGINK++TGKLV LSEQELVDC+ S GC GGY A E+
Sbjct: 149 RHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRS--HGCKGGYPPYALEY 206
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG------------YEAIPAR----- 260
+ K G+ YPY+ K C+ + V +G AI +
Sbjct: 207 VAK-NGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVV 265
Query: 261 -----YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
FQLY G+F+ CG +++H VT VGYG+ G+ Y L+KNSWGT+WGE GYIR+
Sbjct: 266 VESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRI 325
Query: 316 ARNSPSSNIGICGILMQASYPVK 338
R +P ++ G+CG+ + YP K
Sbjct: 326 KR-APGNSPGVCGLYKSSYYPTK 347
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 224 bits (571), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 137/320 (42%), Positives = 174/320 (54%), Gaps = 45/320 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS----QNLSFKLTDNKFADL 113
++E + + Q+ + Y +E E + R I++ N I N +S+KL NK+AD+
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQYLG----------LPASVDWRKEGAVTPVKDQGQC 163
+ EF T GYN + +G +P SVDWR+ GAVT VKDQG C
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFS+ A+EG + K G LVSLSEQ LVDC N GCNGG M+ AF +I GG
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------AR 260
+ TE YPY G +D C +K A T TG+ IP +
Sbjct: 204 IDTEKSYPYEGIDDSCHFNKATIGA-TDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASH 262
Query: 261 YAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQLYS GV++E C Q L+HGV VVGYG D G YWLVKNSWGT+WGE GYI+MAR
Sbjct: 263 ESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMAR 322
Query: 318 NSPSSNIGICGILMQASYPV 337
N + CGI +SYP
Sbjct: 323 NQNNQ----CGIATASSYPT 338
>sp|P13277|CYSP1_HOMAM Digestive cysteine proteinase 1 OS=Homarus americanus GN=LCP1 PE=1
SV=2
Length = 322
Score = 221 bits (564), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 127/306 (41%), Positives = 174/306 (56%), Gaps = 36/306 (11%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEF 118
E + ++ R+Y +E + R ++ N+QYI+ N + +++ L N+F+D++NE+F
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80
Query: 119 ISTYLGYNK-PYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEG 177
+ GY K P + S VDWR +GAVTPVKDQGQCGSCWAFS +EG
Sbjct: 81 NAVMKGYKKGPRPAAVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEG 140
Query: 178 INKLKTGKLVSLSEQELVDCDVNS-ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
+ LKTG+LVSLSEQ+LVDC S NQGCNGG++E+A ++ GGV TE YPY ++
Sbjct: 141 QHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEARD 200
Query: 237 DRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQLYSHGVFDE 273
+ C+ + A T TGY I + +FQ Y GV+ E
Sbjct: 201 NTCRFNSNTIGA-TCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYYE 259
Query: 274 --YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
QL+H V VGYG + G+ +WLVKNSW TSWGE+GYI+MARN ++ CGI
Sbjct: 260 PSCSSSQLDHAVLAVGYGSEGGQDFWLVKNSWATSWGESGYIKMARNRNNN----CGIAT 315
Query: 332 QASYPV 337
A YP
Sbjct: 316 DACYPT 321
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 221 bits (562), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 138/347 (39%), Positives = 192/347 (55%), Gaps = 45/347 (12%)
Query: 25 LRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFG 84
+R ++ +F L VL I + + K ++ F +W++ ++ Y + E+ R+
Sbjct: 1 MRLSITLIFTLIVLSISFISAGNVFSHK----QYQDSFIDWMRSNNKAY-THKEFMPRYE 55
Query: 85 IYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG---------YNKPYNEPRWP 135
+ N+ Y+ NS+ L N+ ADLSNEE+ YLG Y+K R
Sbjct: 56 EFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLN 115
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
Q+ P +VDWR++ AVTPVKDQGQCGSC++FS +VEG+ +KTGKLVSLSEQ ++
Sbjct: 116 RPQF-KQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNIL 174
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK-NDRCQTDKTKHHAVTITGY 254
DC + N+GCNGG M AFE+I K G+ +E+ YPY K ND C+ + A IT Y
Sbjct: 175 DCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGS-VAAKITSY 233
Query: 255 EAIPA----------------------RYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGE 290
+ I A +FQLY+ GV+ E C + L+HGV VG G
Sbjct: 234 KEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGT 293
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
D+GE Y++VKNSWG SWG GYI MARN ++ CGI ASYP+
Sbjct: 294 DNGEDYYIVKNSWGPSWGLNGYIHMARNKDNN----CGISTMASYPI 336
>sp|P25784|CYSP3_HOMAM Digestive cysteine proteinase 3 OS=Homarus americanus GN=LCP3 PE=2
SV=1
Length = 321
Score = 220 bits (561), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 127/304 (41%), Positives = 173/304 (56%), Gaps = 33/304 (10%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEF 118
+++ QY R+YG E R ++ N Q I+ N + ++FK+ N+F D++NEEF
Sbjct: 21 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 80
Query: 119 ISTYLGYNK-PYNEPRWPSVQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
+ GY K EP+ G + A VDWR + VTPVKDQ QCGSCWAFSA A+E
Sbjct: 81 NAVMKGYKKGSRGEPKAVFTAEAGPMAADVDWRTKALVTPVKDQEQCGSCWAFSATGALE 140
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
G + LK +LVSLSEQ+LVDC + N GC GG+M AF++I GG+ TE YPY ++
Sbjct: 141 GQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAED 200
Query: 237 DRCQTDKTKHHAVTITGYE--------------------AIPA-RYAFQLYSHGV-FDEY 274
C+ D A+ E AI A ++FQ YS GV +++
Sbjct: 201 RSCRFDANSIGAICTGSVEVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQN 260
Query: 275 CGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
C L+HGV VGYG + + YWLVKNSWG+SWG+AGYI+M+RN ++ CGI +
Sbjct: 261 CSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNN----CGIASEP 316
Query: 334 SYPV 337
SYP
Sbjct: 317 SYPT 320
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 218 bits (554), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 173/319 (54%), Gaps = 46/319 (14%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
E + + ++ + Y E E + R I++ N I N + +SFKL NK+ADL +
Sbjct: 57 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 116
Query: 116 EEFISTYLGYN-----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
EF G+N + + + S ++ LP SVDWR +GAVT VKDQG CG
Sbjct: 117 HEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCG 176
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS+ A+EG + K+G LVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 177 SCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 236
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARY 261
TE YPY +D C +K A T G+ IP +
Sbjct: 237 DTEKSYPYEAIDDSCHFNKGTVGA-TDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHE 295
Query: 262 AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQ YS GV++E C Q L+HGV VVG+G D GE YWLVKNSWGT+WG+ G+I+M RN
Sbjct: 296 SFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRN 355
Query: 319 SPSSNIGICGILMQASYPV 337
+ CGI +SYP+
Sbjct: 356 KENQ----CGIASASSYPL 370
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 216 bits (551), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 186/336 (55%), Gaps = 40/336 (11%)
Query: 34 LLWVLGI-PAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
+ W++G+ P +++ K DP +++ + W K YS++Y E+E R I+ N+++
Sbjct: 1 MKWLVGLLPLCSYAVAQVHK-DP-TLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKF 58
Query: 93 IDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPAS 145
+ N ++ S+ L N D++ EE IS P R + S LP S
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVTYRSNSNQKLPDS 118
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-ENQ 204
VDWR++G VT VK QG CG+CWAFSAV A+E KLKTGKLVSLS Q LVDC N+
Sbjct: 119 VDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNK 178
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------ 258
GCNGG+M AF++I G+ +E YPY+ N +C+ D +K A T + Y +P
Sbjct: 179 GCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYD-SKKRAATCSKYTELPFGSEDA 237
Query: 259 -----------------ARYAFQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVK 300
+ Y+F LY GV+ E C +NHGV VVGYG +G+ YWLVK
Sbjct: 238 LKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVK 297
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
NSWG ++G+ GYIRMARNS + CGI SYP
Sbjct: 298 NSWGLNFGDQGYIRMARNSGNH----CGIASYPSYP 329
>sp|P61277|CATK_MACMU Cathepsin K OS=Macaca mulatta GN=CTSK PE=1 SV=1
Length = 329
Score = 216 bits (550), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y S+ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYNSKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D++NEE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTNEEVVQKMTGLKVPASHSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>sp|P61276|CATK_MACFA Cathepsin K OS=Macaca fascicularis GN=CTSK PE=2 SV=1
Length = 329
Score = 216 bits (550), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y S+ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYNSKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D++NEE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTNEEVVQKMTGLKVPASHSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
Length = 330
Score = 216 bits (549), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 126/322 (39%), Positives = 180/322 (55%), Gaps = 36/322 (11%)
Query: 46 SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL---- 101
S Q + +++ + W K Y ++Y ++E R I+ N++++ N ++
Sbjct: 12 SSAVTQLHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMH 71
Query: 102 SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPASVDWRKEGAVTPVK 158
S+ L N D+++EE +S P R + S LP SVDWR++G VT VK
Sbjct: 72 SYDLGMNHLGDMTSEEVMSLMSSLRVPNQWQRNITYKSNPNQMLPDSVDWREKGCVTEVK 131
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
QG CG+CWAFSAV A+E KLKTGKLVSLS Q LVDC N+GCNGG+M +AF++I
Sbjct: 132 YQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYI 191
Query: 219 TKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-------------------- 258
G+ +E YPY+ + +CQ D +K+ A T + Y +P
Sbjct: 192 IDNKGIDSEASYPYKATDQKCQYD-SKYRAATCSKYTELPYGREDVLKEAVANKGPVCVG 250
Query: 259 ---ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIR 314
+ +F LY GV +D C ++NHGV V+GYG+ +G++YWLVKNSWG+++GE GYIR
Sbjct: 251 VDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGYIR 310
Query: 315 MARNSPSSNIGICGILMQASYP 336
MARN + CGI SYP
Sbjct: 311 MARNKGNH----CGIASYPSYP 328
>sp|Q9GLE3|CATK_PIG Cathepsin K OS=Sus scrofa GN=CTSK PE=2 SV=1
Length = 330
Score = 215 bits (548), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 180/319 (56%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 18 YPEEILDTQWELWKKTYRKQYNSKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 76
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P S+D+RK+G VTPVK+QGQ
Sbjct: 77 NHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQ 136
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 137 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 194
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 195 GIDSEDAYPYVGQDENCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 253
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G+K+W++KNSWG +WG GYI MAR
Sbjct: 254 LTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNKGYILMAR 313
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 314 NKNNA----CGIANLASFP 328
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 214 bits (545), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 135/341 (39%), Positives = 188/341 (55%), Gaps = 41/341 (12%)
Query: 30 LSLFLLWVLGIPAGAWS-EGYPQKYDPQSME---ERFENWLKQYSREYGSEDEWQRRFGI 85
+++ L +G+ G +S GY Q D S E + FE+W+ ++++ Y + DE RF I
Sbjct: 13 VAICLFVYMGLSFGDFSIVGYSQN-DLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEI 71
Query: 86 YSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGY---NKPYNEPRWPSVQYLG- 141
+ N++YID N +N S+ L N FAD+SN+EF Y G N E + V G
Sbjct: 72 FKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGD 131
Query: 142 --LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDV 199
+P VDWR++GAVTPVK+QG CGSCWAFSAV +EGI K++TG L SEQEL+DCD
Sbjct: 132 VNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDR 191
Query: 200 NSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI-- 257
S GCNGGY A + + + G+ + YPY G C++ + +A G +
Sbjct: 192 RS--YGCNGGYPWSALQLVAQY-GIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQP 248
Query: 258 --------------------PARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYW 297
A FQLY G+F CG++++H V VGYG + Y
Sbjct: 249 YNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGPN----YI 304
Query: 298 LVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
L+KNSWGT WGE GYIR+ R + +S G+CG+ + YPVK
Sbjct: 305 LIKNSWGTGWGENGYIRIKRGTGNS-YGVCGLYTSSFYPVK 344
>sp|Q3ZKN1|CATK_CANFA Cathepsin K OS=Canis familiaris GN=CTSK PE=2 SV=1
Length = 330
Score = 214 bits (545), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 179/319 (56%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +++ W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 18 YPEEILDTQWDLWKKTYRKQYNSKVDELSRRL-IWEKNLKHISIHNLEASLGVHTYELAM 76
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 77 NHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWESRAPDSVDYRKKGYVTPVKNQGQ 136
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 137 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 194
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 195 GIDSEDAYPYVGQDESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDAS 253
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 254 LTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 313
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 314 NKNNA----CGIANLASFP 328
>sp|Q9GL24|CATL1_CANFA Cathepsin L1 OS=Canis familiaris GN=CTSL1 PE=2 SV=1
Length = 333
Score = 214 bits (545), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 136/346 (39%), Positives = 184/346 (53%), Gaps = 56/346 (16%)
Query: 31 SLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
SLFL + LGI + A K+D QS+ ++ W + R YG +E RR ++ N
Sbjct: 4 SLFLTALCLGIASAA------PKFD-QSLNAQWYQWKATHRRLYGMNEEGWRR-AVWEKN 55
Query: 90 VQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSVQ 138
++ I+ N + F + N F D++NEEF G+ K + EP + +
Sbjct: 56 MKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKMFQEPLFAEI- 114
Query: 139 YLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
P SVDWR++G VTPVK+QGQCGSCWAFSA A+EG KTGKLVSLSEQ LVDC
Sbjct: 115 ----PKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
N+GCNGG M+ AF ++ GG+ +E+ YPY G++ K + A TG+ +P
Sbjct: 171 RAQGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLP 230
Query: 259 AR----------------------YAFQLYSHGV-FDEYCGHQ-LNHGVTVVGY---GED 291
R +FQ Y G+ FD C + L+HGV VVGY G D
Sbjct: 231 QREKALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTD 290
Query: 292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
K+W+VKNSWG WG GY++MA++ + CGI ASYP
Sbjct: 291 SNNKFWIVKNSWGPEWGWNGYVKMAKDQNNH----CGIATAASYPT 332
>sp|P43235|CATK_HUMAN Cathepsin K OS=Homo sapiens GN=CTSK PE=1 SV=1
Length = 329
Score = 213 bits (543), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y ++ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>sp|Q5E968|CATK_BOVIN Cathepsin K OS=Bos taurus GN=CTSK PE=2 SV=2
Length = 329
Score = 213 bits (543), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 178/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 17 YPEEILDTQWELWKKTYRKQYNSKGDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPASRSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQDENCMYNPT-GKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ Y GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 213 bits (543), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 133/336 (39%), Positives = 185/336 (55%), Gaps = 41/336 (12%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
+L+W L + + A + + DP +++ ++ W K Y ++Y ++E R I+ N++
Sbjct: 3 WLVWALLLCSSAMAHVH---RDP-TLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 58
Query: 93 IDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPAS 145
+ N ++ S++L N D+++EE IS P PR + S LP S
Sbjct: 59 VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKLPDS 118
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-ENQ 204
+DWR++G VT VK QG CGSCWAFSAV A+E KLKTGKLVSLS Q LVDC N+
Sbjct: 119 MDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNK 178
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------ 258
GCNGG+M +AF++I G+ +E YPY+ + +CQ D K+ A T + Y +P
Sbjct: 179 GCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYD-VKNRAATCSRYIELPFGSEEA 237
Query: 259 -----------------ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVK 300
+ +F LY GV +D C +NHGV VVGYG G+ YWLVK
Sbjct: 238 LKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVK 297
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
NSWG +G+ GYIRMARNS + CGI SYP
Sbjct: 298 NSWGLHFGDQGYIRMARNSGNH----CGIANYPSYP 329
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 212 bits (539), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 172/312 (55%), Gaps = 47/312 (15%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEF 118
E++ +Y R+Y +E R I+ N +YI+ N + ++F L NKF D++ EEF
Sbjct: 21 EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80
Query: 119 ISTYLGYNKPYNEPR--------WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+ G N PR +P + VDWR +GAVTPVKDQGQCGSCWAFS
Sbjct: 81 NAVMKG-----NIPRRSAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFS 135
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
++EG + LKTG L+SL+EQ+LVDC QGCNGG+M AF++I G+ TE Y
Sbjct: 136 TTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAY 195
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQLYS 267
PY ++ C+ D + A T +G+ I A +FQ YS
Sbjct: 196 PYEARDGSCRFD-SNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYS 254
Query: 268 HGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
GV+ E C L+H V VGYG + G+ +WLVKNSW TSWG+AGYI+M+RN ++
Sbjct: 255 SGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNN--- 311
Query: 326 ICGILMQASYPV 337
CGI ASYP+
Sbjct: 312 -CGIATVASYPL 322
>sp|O35186|CATK_RAT Cathepsin K OS=Rattus norvegicus GN=Ctsk PE=2 SV=1
Length = 329
Score = 211 bits (538), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 179/320 (55%), Gaps = 51/320 (15%)
Query: 56 QSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKF 110
++++ ++E W K + ++Y S+ DE RR I+ N++ I N + +++L N
Sbjct: 20 ETLDTQWELWKKTHGKQYNSKVDEISRRL-IWEKNLKKISVHNLEASLGAHTYELAMNHL 78
Query: 111 ADLSNEEFISTYLGYNKPYNE---------PRWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
D+++EE + G P + P W +P S+D+RK+G VTPVK+QG
Sbjct: 79 GDMTSEEVVQKMTGLRVPPSRSFSNDTLYTPEWEGR----VPDSIDYRKKGYVTPVKNQG 134
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
QCGSCWAFS+ A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ +
Sbjct: 135 QCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCV--SENYGCGGGYMTTAFQYVQQN 192
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY--------- 261
GG+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 193 GGIDSEDAYPYVGQDESCMYNATA-KAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDA 251
Query: 262 ---AFQLYSHGV-FDEYCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMA 316
+FQ YS GV +DE C +NH V VVGYG G KYW++KNSWG SWG GY+ +A
Sbjct: 252 SLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGESWGNKGYVLLA 311
Query: 317 RNSPSSNIGICGILMQASYP 336
RN ++ CGI AS+P
Sbjct: 312 RNKNNA----CGITNLASFP 327
>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 211 bits (537), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 132/326 (40%), Positives = 182/326 (55%), Gaps = 53/326 (16%)
Query: 52 KYDPQSMEERFENWLKQYSREYGS-EDEWQRRFGIYSSNVQYI-----DYINSQNLSFKL 105
K+D Q+ + W + R YG+ E+EW+R I+ N++ I +Y N Q+ F +
Sbjct: 20 KFD-QTFSAEWHQWKSTHRRLYGTNEEEWRR--AIWEKNMRMIQLHNGEYSNGQH-GFSM 75
Query: 106 TDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVK 158
N F D++NEEF GY + + EP L +P SVDWR++G VTPVK
Sbjct: 76 EMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPL-----MLKIPKSVDWREKGCVTPVK 130
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
+QGQCGSCWAFSA +EG LKTGKL+SLSEQ LVDC NQGCNGG M+ AF++I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190
Query: 219 TKIGGVTTEDDYPYRGKNDRC------------------QTDKTKHHAVTITGYEAI--- 257
+ GG+ +E+ YPY K+ C Q +K AV G ++
Sbjct: 191 KENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 258 PARYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAG 311
+ + Q YS G++ E C + L+HGV +VGYG + + KYWLVKNSWG+ WG G
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEG 310
Query: 312 YIRMARNSPSSNIGICGILMQASYPV 337
YI++A++ + CG+ ASYPV
Sbjct: 311 YIKIAKDRDNH----CGLATAASYPV 332
>sp|O97397|CATLL_PHACE Cathepsin L-like proteinase OS=Phaedon cochleariae PE=2 SV=1
Length = 324
Score = 211 bits (537), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 116/319 (36%), Positives = 178/319 (55%), Gaps = 46/319 (14%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-----DYINSQNLSFKLTDN 108
+ S +E + ++ K ++R Y S E + RF I+ ++ I Y N ++ ++ L N
Sbjct: 15 NAASDQELWADFKKTHARTYKSLREEKLRFNIFQDTLRQIAEHNVKYENGES-TYYLAIN 73
Query: 109 KFADLSNEEFISTYLGYNKPYNEPRWPSVQYL--------GLPASVDWRKEGAVTPVKDQ 160
KF+D+++EEF + NE P+++ L P S+DWR +G V PV++Q
Sbjct: 74 KFSDITDEEFRDMLM-----KNEASRPNLEGLEVADLTVGAAPESIDWRSKGVVLPVRNQ 128
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G+CGSCWA S AA+E + +K+G V LS Q+LVDC + N GCNGG+ FE++ K
Sbjct: 129 GECGSCWALSTAAAIESQSAIKSGSKVPLSPQQLVDCSTSYGNHGCNGGFAVNGFEYV-K 187
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA------------------ 262
G+ ++ DYPY GK D+C+ + V +TGY+ + A
Sbjct: 188 DNGLESDADYPYSGKEDKCKANDKSRSVVELTGYKKVTASETSLKEAVGTIGPISAVVFG 247
Query: 263 --FQLYSHGVFDEYC--GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
+ Y G+FD+ G L+HGV VVGYG ++G+KYW++KN+WG WGE+GYIR+ R+
Sbjct: 248 KPMKSYGGGIFDDSSCLGDNLHHGVNVVGYGIENGQKYWIIKNTWGADWGESGYIRLIRD 307
Query: 319 SPSSNIGICGILMQASYPV 337
+ S CG+ ASYP+
Sbjct: 308 TDHS----CGVEKMASYPI 322
>sp|P25975|CATL1_BOVIN Cathepsin L1 OS=Bos taurus GN=CTSL1 PE=1 SV=3
Length = 334
Score = 211 bits (537), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 135/348 (38%), Positives = 185/348 (53%), Gaps = 59/348 (16%)
Query: 31 SLFL-LWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYG-SEDEWQRRFGIYSS 88
S FL + LG+ + A K DP +++ + W + R YG +E+EW+R ++
Sbjct: 4 SFFLTVLCLGVASAA------PKLDP-NLDAHWHQWKATHRRLYGMNEEEWRR--AVWEK 54
Query: 89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSV 137
N + ID N + F++ N F D++NEEF G+ K ++EP
Sbjct: 55 NKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKLFHEPL---- 110
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+ +P SVDW K+G VTPVK+QGQCGSCWAFSA A+EG KTGKLVSLSEQ LVDC
Sbjct: 111 -LVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
NQGCNGG M+ AF++I GG+ +E+ YPY + K + A TG+ I
Sbjct: 170 SRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI 229
Query: 258 PAR----------------------YAFQLYSHGV-FDEYCGHQ-LNHGVTVVGYG---- 289
P R +FQ Y G+ +D C + L+HGV VVGYG
Sbjct: 230 PQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGT 289
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+ + K+W+VKNSWG WG GY++MA++ + CGI ASYP
Sbjct: 290 DSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNH----CGIATAASYPT 333
>sp|P43236|CATK_RABIT Cathepsin K OS=Oryctolagus cuniculus GN=CTSK PE=1 SV=1
Length = 329
Score = 211 bits (537), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 129/314 (41%), Positives = 177/314 (56%), Gaps = 43/314 (13%)
Query: 58 MEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFAD 112
++ ++E W K YS++Y S+ DE RR I+ N+++I N + +++L N D
Sbjct: 22 LDTQWELWKKTYSKQYNSKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAMNHLGD 80
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCW 167
+++EE + G P + Y+ P S+D+RK+G VTPVK+QGQCGSCW
Sbjct: 81 MTSEEVVQKMTGLKVPPSRSHSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCW 140
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ + G+ +E
Sbjct: 141 AFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENYGCGGGYMTNAFQYVQRNRGIDSE 198
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY------------AFQ 264
D YPY G+++ C + T A GY IP AR +FQ
Sbjct: 199 DAYPYVGQDESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQ 257
Query: 265 LYSHGV-FDEYCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
YS GV +DE C +NH V VGYG G K+W++KNSWG SWG GYI MARN ++
Sbjct: 258 FYSKGVYYDENCSSDNVNHAVLAVGYGIQKGNKHWIIKNSWGESWGNKGYILMARNKNNA 317
Query: 323 NIGICGILMQASYP 336
CGI AS+P
Sbjct: 318 ----CGIANLASFP 327
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 211 bits (537), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 108/220 (49%), Positives = 138/220 (62%), Gaps = 26/220 (11%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP S+DWR+ GAV PVK+QG CGSCWAFS VAAVEGIN++ TG L+SLSEQ+LVDC +
Sbjct: 3 LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC--TT 60
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP--- 258
N GC GG+M AF+FI GG+ +E+ YPYRG++ C + V+I YE +P
Sbjct: 61 ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNS-TVNAPVVSIDSYENVPSHN 119
Query: 259 -------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
A FQLY G+F C NH +TVVGYG ++ + +W+V
Sbjct: 120 EQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIV 179
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
KNSWG +WGE+GYIR RN + + G CGI ASYPVK+
Sbjct: 180 KNSWGKNWGESGYIRAERNIENPD-GKCGITRFASYPVKK 218
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 210 bits (535), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 132/323 (40%), Positives = 181/323 (56%), Gaps = 39/323 (12%)
Query: 48 GYPQKYDPQSMEER----FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSF 103
GY Q D + ER F +W+ ++++ Y + DE RF I+ N++YID N +
Sbjct: 32 GYSQ--DDLTSTERLIQLFNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMINGY 89
Query: 104 KLTDNKFADLSNEEFISTYLG------YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPV 157
L N+F+DLSN+EF Y+G N+PY+E + + + LP SVDWR +GAVTPV
Sbjct: 90 WLGLNEFSDLSNDEFKEKYVGSLPEDYTNQPYDE-EFVNEDIVDLPESVDWRAKGAVTPV 148
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
K QG C SCWAFS VA VEGINK+KTG LV LSEQELVDCD ++ GCN GY + ++
Sbjct: 149 KHQGYCESCWAFSTVATVEGINKIKTGNLVELSEQELVDCD--KQSYGCNRGYQSTSLQY 206
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI-------------------- 257
+ + G+ YPY K C+ ++ V G +
Sbjct: 207 VAQ-NGIHLRAKYPYIAKQQTCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVV 265
Query: 258 --PARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
A FQ Y G+F+ CG +++H VT VGYG+ G+ Y L+KNSWG WGE GYIR+
Sbjct: 266 VESAGRDFQNYKGGIFEGSCGTKVDHAVTAVGYGKSGGKGYILIKNSWGPGWGENGYIRI 325
Query: 316 ARNSPSSNIGICGILMQASYPVK 338
R S +S G+CG+ + YP+K
Sbjct: 326 RRASGNSP-GVCGVYRSSYYPIK 347
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.318 0.135 0.430
Gapped
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 137,085,032
Number of Sequences: 539616
Number of extensions: 6117674
Number of successful extensions: 14003
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 217
Number of HSP's successfully gapped in prelim test: 11
Number of HSP's that attempted gapping in prelim test: 13095
Number of HSP's gapped (non-prelim): 303
length of query: 340
length of database: 191,569,459
effective HSP length: 118
effective length of query: 222
effective length of database: 127,894,771
effective search space: 28392639162
effective search space used: 28392639162
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 61 (28.1 bits)
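
The trailer figures above can be cross-checked with a little arithmetic. The following is a minimal sketch, not part of the BLAST output. It assumes the second Lambda/K pair (0.267, 0.0410) holds the gapped parameters and applies the standard Karlin-Altschul relations: bit score = (lambda * S - ln K) / ln 2 and E = search space * 2^(-bit score). The function names are illustrative only. Because the hits were scored with compositional matrix adjustment, the recomputed values should be expected to agree with the reported ones only approximately.

```python
import math

# Values copied from the report trailer.
QUERY_LEN = 340
DB_LETTERS = 191_569_459
DB_SEQS = 539_616
EFF_HSP_LEN = 118
# Assumption: the second Lambda/K pair in the trailer is the gapped one.
LAMBDA_GAPPED = 0.267
K_GAPPED = 0.0410


def effective_search_space(query_len, db_letters, db_seqs, hsp_len):
    """Return (effective query length, effective db length, search space)."""
    eff_query = query_len - hsp_len
    # The length adjustment is subtracted once per database sequence.
    eff_db = db_letters - db_seqs * hsp_len
    return eff_query, eff_db, eff_query * eff_db


def bit_score(raw_score, lam, k):
    """Convert a raw alignment score to bits (Karlin-Altschul)."""
    return (lam * raw_score - math.log(k)) / math.log(2)


def expect_value(bits, search_space):
    """Expected number of chance hits scoring at least `bits`."""
    return search_space * 2.0 ** (-bits)


eff_q, eff_db, space = effective_search_space(
    QUERY_LEN, DB_LETTERS, DB_SEQS, EFF_HSP_LEN
)
# Raw score 537 is the Cathepsin K hit (CATK_RABIT), reported as 211 bits.
bits = bit_score(537, LAMBDA_GAPPED, K_GAPPED)
evalue = expect_value(bits, space)

print(f"effective query length: {eff_q}")
print(f"effective db length:    {eff_db}")
print(f"effective search space: {space}")
print(f"bits for raw score 537: {bits:.1f}")
print(f"E-value:                {evalue:.1e}")
```

The three effective-length figures reproduce the trailer exactly, while the recomputed bit score and E-value land close to the 211 bits and 6e-54 reported for that hit.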