BLASTP 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 041011
         (313 letters)

Database: swissprot
           539,616 sequences; 191,569,459 total letters

Searching..................................................done

>sp|O65493|XCP1_ARATH Xylem cysteine proteinase 1 OS=Arabidopsis thaliana GN=XCP1 PE=1
SV=1
Length = 355
Score = 311 bits (797), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 150/313 (47%), Positives = 213/313 (68%), Gaps = 18/313 (5%)
Query: 9 IAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQF 68
+ E E WM+EH ++YK EK RF++F++NL +ID+ NN NS Y LG N+F
Sbjct: 47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINS-------YWLGLNEF 99
Query: 69 SDLTNAEFRASYAGNSMAITSQH----SSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCA 124
+DLT+ EF+ Y G + S+ ++F+Y+++T +P S+DWR+KGAV +K+QG C
Sbjct: 100 ADLTHEEFKGRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCG 159
Query: 125 ACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIA 184
+CWAFS VAAVEGI QI++GNL LSEQ+L+DC + NSGC G D AF+YII G+
Sbjct: 160 SCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLH 219
Query: 185 TEADYPYHQVQGSC--GREHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDF 242
E DYPY +G C +E IS YE +P D+++L+KA++ QPVS+ IE +G+DF
Sbjct: 220 KEDDYPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDF 279
Query: 243 KNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD---- 298
+ YKGG+FNG CGT LDH V +G+G+++ G+ Y ++KNSWG WGE G++R++R+
Sbjct: 280 QFYKGGVFNGKCGTDLDHGVAAVGYGSSK-GSDYVIVKNSWGPRWGEKGFIRMKRNTGKP 338
Query: 299 EGLCGIGTQAAYP 311
EGLCGI A+YP
Sbjct: 339 EGLCGINKMASYP 351
>sp|P43297|RD21A_ARATH Cysteine proteinase RD21a OS=Arabidopsis thaliana GN=RD21A PE=1
SV=1
Length = 462
Score = 304 bits (779), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 153/322 (47%), Positives = 215/322 (66%), Gaps = 22/322 (6%)
Query: 2 NEAASISIAEKHEKWMAEHGR--SYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINR 59
+EA +SI +E W+ +HG+ S +EKD RF+IFK NL ++D+ N N S
Sbjct: 42 SEAEVMSI---YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLS------ 92
Query: 60 TYQLGTNQFSDLTNAEFRASYAGNSMAITSQH-SSFKYQNLT--QVPTSMDWREKGAVTS 116
Y+LG +F+DLTN E+R+ Y G M + +S +Y+ ++P S+DWR+KGAV
Sbjct: 93 -YRLGLTRFADLTNDEYRSKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAE 151
Query: 117 IKNQGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKY 176
+K+QGGC +CWAFS + AVEGI QI +G+LI LSEQ+L+DC ++ N GC G D AF++
Sbjct: 152 VKDQGGCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEF 211
Query: 177 IIKNQGIATEADYPYHQVQGSCG--REHAAAAKISSYEVLPSGDEQALLKAVSMQPVSIN 234
IIKN GI T+ DYPY V G+C R++A I SYE +P+ E++L KAV+ QP+SI
Sbjct: 212 IIKNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIA 271
Query: 235 IEGTGQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMR 294
IE G+ F+ Y GIF+G CGTQLDH V +G+G TE+G YW+++NSWG +WGE+GY+R
Sbjct: 272 IEAGGRAFQLYDSGIFDGSCGTQLDHGVVAVGYG-TENGKDYWIVRNSWGKSWGESGYLR 330
Query: 295 IQRD----EGLCGIGTQAAYPI 312
+ R+ G CGI + +YPI
Sbjct: 331 MARNIASSSGKCGIAIEPSYPI 352
>sp|O65039|CYSEP_RICCO Vignain OS=Ricinus communis GN=CYSEP PE=1 SV=1
Length = 360
Score = 303 bits (776), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 146/313 (46%), Positives = 203/313 (64%), Gaps = 21/313 (6%)
Query: 13 HEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLT 72
+E+W + H S + EK RF +FK N ++ N +++ Y+L N+F+D+T
Sbjct: 38 YERWRSHHTVS-RSLHEKQKRFNVFKHNAMHVHNANK-------MDKPYKLKLNKFADMT 89
Query: 73 NAEFRASYAGNSMAITSQ-------HSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAA 125
N EFR +Y+G+ + + +F Y+ + VP S+DWR+KGAVTS+K+QG C +
Sbjct: 90 NHEFRNTYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGS 149
Query: 126 CWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIAT 185
CWAFS + AVEGI QI + L+ LSEQ+L+DC ++ N GC G D AF++I + GI T
Sbjct: 150 CWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITT 209
Query: 186 EADYPYHQVQGSC--GREHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFK 243
EA+YPY G+C +E+A A I +E +P DE ALLKAV+ QPVS+ I+ G DF+
Sbjct: 210 EANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQ 269
Query: 244 NYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQR----DE 299
Y G+F G CGT+LDH V I+G+GTT DGTKYW +KNSWG WGE GY+R++R E
Sbjct: 270 FYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMERGISDKE 329
Query: 300 GLCGIGTQAAYPI 312
GLCGI +A+YPI
Sbjct: 330 GLCGIAMEASYPI 342
>sp|Q9FGR9|CEP1_ARATH KDEL-tailed cysteine endopeptidase CEP1 OS=Arabidopsis thaliana
GN=CEP1 PE=2 SV=1
Length = 361
Score = 295 bits (755), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 143/318 (44%), Positives = 202/318 (63%), Gaps = 21/318 (6%)
Query: 8 SIAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQ 67
S+ E +E+W + H + E EK RF +FK N+++I + N + S Y+L N+
Sbjct: 33 SLWELYERWRSHHTVARSLE-EKAKRFNVFKHNVKHIHETNKKDKS-------YKLKLNK 84
Query: 68 FSDLTNAEFRASYAGNSMAITSQH-------SSFKYQNLTQVPTSMDWREKGAVTSIKNQ 120
F D+T+ EFR +YAG+++ SF Y N+ +PTS+DWR+ GAVT +KNQ
Sbjct: 85 FGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSFMYANVNTLPTSVDWRKNGAVTPVKNQ 144
Query: 121 GGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKN 180
G C +CWAFS V AVEGI QI + L LSEQ+L+DC +N N GC G D+AF++I +
Sbjct: 145 GQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEK 204
Query: 181 QGIATEADYPYHQVQGSC--GREHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGT 238
G+ +E YPY +C +E+A I +E +P E L+KAV+ QPVS+ I+
Sbjct: 205 GGLTSELVYPYKASDETCDTNKENAPVVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAG 264
Query: 239 GQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQR- 297
G DF+ Y G+F G CGT+L+H V ++G+GTT DGTKYW++KNSWG+ WGE GY+R+QR
Sbjct: 265 GSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTIDGTKYWIVKNSWGEEWGEKGYIRMQRG 324
Query: 298 ---DEGLCGIGTQAAYPI 312
EGLCGI +A+YP+
Sbjct: 325 IRHKEGLCGIAMEASYPL 342
>sp|P43156|CYSP_HEMSP Thiol protease SEN102 OS=Hemerocallis sp. GN=SEN102 PE=2 SV=1
Length = 360
Score = 295 bits (754), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 144/323 (44%), Positives = 210/323 (65%), Gaps = 22/323 (6%)
Query: 4 AASISIAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQL 63
A+ S+ +EKW H + +D EK+ RF +FK+N+++I + N ++ Y+L
Sbjct: 31 ASEDSLWNLYEKWRTHHTVA-RDLDEKNRRFNVFKENVKFIHEFNQKKDA------PYKL 83
Query: 64 GTNQFSDLTNAEFRASYAGNSM-------AITSQHSSFKYQNLTQVPT-SMDWREKGAVT 115
N+F D+TN EFR+ YAG+ + I SF Y+N+ +P S+DWR KGAVT
Sbjct: 84 ALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTGSFMYENVGSLPAASIDWRAKGAVT 143
Query: 116 SIKNQGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFK 175
+K+QG C +CWAFS +A+VEGI QI +G L+ LSEQ+L+DC ++ N GC G D AF+
Sbjct: 144 GVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFE 203
Query: 176 YIIKNQGIATEADYPYHQVQGSCGRE--HAAAAKISSYEVLPSGDEQALLKAVSMQPVSI 233
+I KN GI TE YPY + G+C ++ I ++ +P+ +E AL++AV+ QP+S+
Sbjct: 204 FIQKN-GITTEDSYPYAEQDGTCASNLLNSPVVSIDGHQDVPANNENALMQAVANQPISV 262
Query: 234 NIEGTGQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYM 293
+IE +G F+ Y G+F G CGT+LDH V I+G+G T DGTKYW++KNSWG+ WGE+GY+
Sbjct: 263 SIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDGTKYWIVKNSWGEEWGESGYI 322
Query: 294 RIQR----DEGLCGIGTQAAYPI 312
R+QR G CGI +A+YPI
Sbjct: 323 RMQRGISDKRGKCGIAMEASYPI 345
>sp|P12412|CYSEP_VIGMU Vignain OS=Vigna mungo PE=1 SV=1
Length = 362
Score = 293 bits (750), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 204/318 (64%), Gaps = 21/318 (6%)
Query: 8 SIAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQ 67
S+ + +E+W + H S + EK RF +FK N+ ++ N +++ Y+L N+
Sbjct: 35 SLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANVMHVHNTNK-------MDKPYKLKLNK 86
Query: 68 FSDLTNAEFRASYAG-----NSMAITSQHSS--FKYQNLTQVPTSMDWREKGAVTSIKNQ 120
F+D+TN EFR++YAG + M SQH S F Y+ + VP S+DWR+KGAVT +K+Q
Sbjct: 87 FADMTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQ 146
Query: 121 GGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKN 180
G C +CWAFS + AVEGI QI + L+ LSEQ+L+DC N GC G + AF++I +
Sbjct: 147 GQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQK 206
Query: 181 QGIATEADYPYHQVQGSC--GREHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGT 238
GI TE++YPY +G+C + + A I +E +P DE ALLKAV+ QPVS+ I+
Sbjct: 207 GGITTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAG 266
Query: 239 GQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD 298
G DF+ Y G+F G C T L+H V I+G+GTT DGT YW+++NSWG WGE GY+R+QR+
Sbjct: 267 GSDFQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN 326
Query: 299 ----EGLCGIGTQAAYPI 312
EGLCGI A+YPI
Sbjct: 327 ISKKEGLCGIAMMASYPI 344
>sp|P25803|CYSEP_PHAVU Vignain OS=Phaseolus vulgaris PE=2 SV=2
Length = 362
Score = 293 bits (750), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 142/322 (44%), Positives = 205/322 (63%), Gaps = 21/322 (6%)
Query: 4 AASISIAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQL 63
A+ S+ + +E+W + H S + EK RF +FK NL ++ N +++ Y+L
Sbjct: 31 ASEESLWDLYERWRSHHTVS-RSLGEKHKRFNVFKANLMHVHNTNK-------MDKPYKL 82
Query: 64 GTNQFSDLTNAEFRASYAGNSM-------AITSQHSSFKYQNLTQVPTSMDWREKGAVTS 116
N+F+D+TN EFR++YAG+ + ++ +F Y+ + VP S+DWR+KGAVT
Sbjct: 83 KLNKFADMTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTD 142
Query: 117 IKNQGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKY 176
+K+QG C +CWAFS V AVEGI QI + L+ LSEQ+L+DC N GC G + AF++
Sbjct: 143 VKDQGQCGSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEF 202
Query: 177 IIKNQGIATEADYPYHQVQGSC--GREHAAAAKISSYEVLPSGDEQALLKAVSMQPVSIN 234
I + GI TE++YPY +G+C + + A I +E +P+ DE ALLKAV+ QPVS+
Sbjct: 203 IKQKGGITTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVA 262
Query: 235 IEGTGQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMR 294
I+ G DF+ Y G+F G C T L+H V I+G+GTT DGT YW+++NSWG WGE GY+R
Sbjct: 263 IDAGGSDFQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIR 322
Query: 295 IQRD----EGLCGIGTQAAYPI 312
+QR+ EGLCGI +YPI
Sbjct: 323 MQRNISKKEGLCGIAMLPSYPI 344
>sp|P25777|ORYB_ORYSJ Oryzain beta chain OS=Oryza sativa subsp. japonica GN=Os04g0670200
PE=1 SV=2
Length = 466
Score = 292 bits (747), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 145/310 (46%), Positives = 208/310 (67%), Gaps = 16/310 (5%)
Query: 13 HEKWMAEHGRSYKDEL--EKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSD 70
++ W+AE+G + L E + RF +F NL+++D N + G ++LG N+F+D
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGG----FRLGMNRFAD 107
Query: 71 LTNAEFRASYAGNSMAITSQHSSFKYQN--LTQVPTSMDWREKGAVTSIKNQGGCAACWA 128
LTN EFRA++ G +A S+ + +Y++ + ++P S+DWREKGAV +KNQG C +CWA
Sbjct: 108 LTNEEFRATFLGAKVAERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167
Query: 129 FSAVAAVEGITQISSGNLIRLSEQQLLDCSSNG-NSGCVAGKSDIAFKYIIKNQGIATEA 187
FSAV+ VE I Q+ +G +I LSEQ+L++CS+NG NSGC G D AF +IIKN GI TE
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227
Query: 188 DYPYHQVQGSC--GREHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFKNY 245
DYPY V G C RE+A I +E +P DE++L KAV+ QPVS+ IE G++F+ Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 246 KGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD----EGL 301
G+F+G CGT LDH V +G+G T++G YW+++NSWG WGE+GY+R++R+ G
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYG-TDNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGK 346
Query: 302 CGIGTQAAYP 311
CGI A+YP
Sbjct: 347 CGIAMMASYP 356
>sp|Q9LM66|XCP2_ARATH Xylem cysteine proteinase 2 OS=Arabidopsis thaliana GN=XCP2 PE=1
SV=2
Length = 356
Score = 291 bits (745), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 210/314 (66%), Gaps = 19/314 (6%)
Query: 9 IAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQF 68
+ E E W++ ++Y+ EK +RF++FK NL++ID+ N ++Y LG N+F
Sbjct: 47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-------KSYWLGLNEF 99
Query: 69 SDLTNAEFRASYAGNSMAITSQ-----HSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGC 123
+DL++ EF+ Y G I + ++ F Y+++ VP S+DWR+KGAV +KNQG C
Sbjct: 100 ADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSC 159
Query: 124 AACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGI 183
+CWAFS VAAVEGI +I +GNL LSEQ+L+DC + N+GC G D AF+YI+KN G+
Sbjct: 160 GSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGL 219
Query: 184 ATEADYPYHQVQGSC--GREHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQD 241
E DYPY +G+C ++ + I+ ++ +P+ DE++LLKA++ QP+S+ I+ +G++
Sbjct: 220 RKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGRE 279
Query: 242 FKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD--- 298
F+ Y GG+F+G CG LDH V +G+G+++ G+ Y ++KNSWG WGE GY+R++R+
Sbjct: 280 FQFYSGGVFDGRCGVDLDHGVAAVGYGSSK-GSDYIIVKNSWGPKWGEKGYIRLKRNTGK 338
Query: 299 -EGLCGIGTQAAYP 311
EGLCGI A++P
Sbjct: 339 PEGLCGINKMASFP 352
>sp|P25251|CYSP4_BRANA Cysteine proteinase COT44 (Fragment) OS=Brassica napus PE=2 SV=1
Length = 328
Score = 289 bits (739), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 203/316 (64%), Gaps = 24/316 (7%)
Query: 15 KWMAEHGRSYKDEL----EKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSD 70
+W EHG+S + ++D RF IFK NL +ID N NN N TY+LG F++
Sbjct: 6 RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNK-----NATYKLGLTIFAN 60
Query: 71 LTNAEFRASYAGNSMA-----ITSQHSSFKYQ---NLTQVPTSMDWREKGAVTSIKNQGG 122
LTN E+R+ Y G +++ + KY N+ +VP ++DWR+KGAV +IK+QG
Sbjct: 61 LTNDEYRSLYLGARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGT 120
Query: 123 CAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQG 182
C +CWAFS AAVEGI +I +G L+ LSEQ+L+DC + N GC G D AF++I+KN G
Sbjct: 121 CGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 180
Query: 183 IATEADYPYHQVQGSCGR--EHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQ 240
+ TE DYPYH G C +++ I YE +PS DE AL +AVS QPVS+ I+ G+
Sbjct: 181 LNTEKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGR 240
Query: 241 DFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD-- 298
F++Y+ GIF G CGT +DHAV +G+G +E+G YW+++NSWG WGE GY+R++R+
Sbjct: 241 AFQHYQSGIFTGKCGTNMDHAVVAVGYG-SENGVDYWIVRNSWGTRWGEDGYIRMERNVA 299
Query: 299 --EGLCGIGTQAAYPI 312
G CGI +A+YP+
Sbjct: 300 SKSGKCGIAIEASYPV 315
>sp|Q94B08|GCP1_ARATH Germination-specific cysteine protease 1 OS=Arabidopsis thaliana
GN=GCP1 PE=2 SV=2
Length = 376
Score = 285 bits (730), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 145/317 (45%), Positives = 201/317 (63%), Gaps = 25/317 (7%)
Query: 15 KWMAEHGRSYKDEL----EKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSD 70
+W AEHG++ + ++D RF IFK NL +ID N +N N TY+LG +F+D
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNK-----NATYKLGLTKFTD 105
Query: 71 LTNAEFRASYAGNSMAIT-----SQHSSFKYQ---NLTQVPTSMDWREKGAVTSIKNQGG 122
LTN E+R Y G +++ + KY N +VP ++DWR+KGAV IK+QG
Sbjct: 106 LTNDEYRKLYLGARTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGT 165
Query: 123 CAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQG 182
C +CWAFS AAVEGI +I +G LI LSEQ+L+DC + N GC G D AF++I+KN G
Sbjct: 166 CGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 225
Query: 183 IATEADYPYHQVQGSCGR--EHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQ 240
+ TE DYPY G C +++ I YE +P+ DE AL KA+S QPVS+ IE G+
Sbjct: 226 LNTEKDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGR 285
Query: 241 DFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD-- 298
F++Y+ GIF G CGT LDHAV +G+G +E+G YW+++NSWG WGE GY+R++R+
Sbjct: 286 IFQHYQSGIFTGSCGTNLDHAVVAVGYG-SENGVDYWIVRNSWGPRWGEEGYIRMERNLA 344
Query: 299 ---EGLCGIGTQAAYPI 312
G CGI +A+YP+
Sbjct: 345 ASKSGKCGIAVEASYPV 361
>sp|P80884|ANAN_ANACO Ananain OS=Ananas comosus GN=AN1 PE=1 SV=2
Length = 345
Score = 285 bits (729), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 139/317 (43%), Positives = 206/317 (64%), Gaps = 17/317 (5%)
Query: 3 EAASISIAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQ 62
+ S + ++ E+WMAE+GR YKD EK +RF+IFK N+ +I+ NN N + +Y
Sbjct: 27 DEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGN------SYT 80
Query: 63 LGTNQFSDLTNAEFRASYAGNSMAITSQHS---SFKYQNLTQVPTSMDWREKGAVTSIKN 119
LG NQF+D+TN EF A Y G S+ + + SF +++ VP S+DWR+ GAVTS+KN
Sbjct: 81 LGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVSFDDVDISSVPQSIDWRDSGAVTSVKN 140
Query: 120 QGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIK 179
QG C +CWAF+++A VE I +I GNL+ LSEQQ+LDC+ + GC G + A+ +II
Sbjct: 141 QGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAV--SYGCKGGWINKAYSFIIS 198
Query: 180 NQGIATEADYPYHQVQGSCGREHAA-AAKISSYEVLPSGDEQALLKAVSMQPVSINIEGT 238
N+G+A+ A YPY +G+C +A I+ Y + +E+ ++ AVS QP++ ++ +
Sbjct: 199 NKGVASAAIYPYKAAKGTCKTNGVPNSAYITRYTYVQRNNERNMMYAVSNQPIAAALDAS 258
Query: 239 GQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD 298
G +F++YK G+F G CGT+L+HA+ IIG+G G K+W+++NSWG WGE GY+R+ RD
Sbjct: 259 G-NFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSSGKKFWIVRNSWGAGWGEGGYIRLARD 317
Query: 299 E----GLCGIGTQAAYP 311
GLCGI YP
Sbjct: 318 VSSSFGLCGIAMDPLYP 334
>sp|P25776|ORYA_ORYSJ Oryzain alpha chain OS=Oryza sativa subsp. japonica GN=Os04g0650000
PE=1 SV=2
Length = 458
Score = 283 bits (725), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 144/309 (46%), Positives = 203/309 (65%), Gaps = 13/309 (4%)
Query: 13 HEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLT 72
+ +W AEHG+SY E++ R+ F+ NL YID+ +N ++ G++ +++LG N+F+DLT
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDE--HNAAADAGVH-SFRLGLNRFADLT 96
Query: 73 NAEFRASYAG-NSMAITSQHSSFKY--QNLTQVPTSMDWREKGAVTSIKNQGGCAACWAF 129
N E+R +Y G + + S +Y + +P S+DWR KGAV IK+QGGC +CWAF
Sbjct: 97 NEEYRDTYLGLRNKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWAF 156
Query: 130 SAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIATEADY 189
SA+AAVEGI QI +G+LI LSEQ+L+DC ++ N GC G D AF +II N GI TE DY
Sbjct: 157 SAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDTEDDY 216
Query: 190 PYHQVQGSC--GREHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFKNYKG 247
PY C R++A I SYE + E +L KAV+ QPVS+ IE G+ F+ Y
Sbjct: 217 PYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQLYSS 276
Query: 248 GIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD----EGLCG 303
GIF G CGT LDH V +G+G TE+G YW+++NSWG +WGE+GY+R++R+ G CG
Sbjct: 277 GIFTGKCGTALDHGVAAVGYG-TENGKDYWIVRNSWGKSWGESGYVRMERNIKASSGKCG 335
Query: 304 IGTQAAYPI 312
I + +YP+
Sbjct: 336 IAVEPSYPL 344
>sp|Q9STL5|CEP3_ARATH KDEL-tailed cysteine endopeptidase CEP3 OS=Arabidopsis thaliana
GN=CEP3 PE=2 SV=1
Length = 364
Score = 283 bits (725), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 198/313 (63%), Gaps = 22/313 (7%)
Query: 13 HEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLT 72
+E+W H S + E RF +F+ N+ ++ + N N + Y+L N+F+D+T
Sbjct: 38 YERWRGHHSVS-RASHEAIKRFNVFRHNVLHVHRTNKKN-------KPYKLKINRFADIT 89
Query: 73 NAEFRASYAG-----NSMAITSQHSS--FKYQNLTQVPTSMDWREKGAVTSIKNQGGCAA 125
+ EFR+SYAG + M + S F Y+N+T+VP+S+DWREKGAVT +KNQ C +
Sbjct: 90 HHEFRSSYAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGS 149
Query: 126 CWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIAT 185
CWAFS VAAVEGI +I + L+ LSEQ+L+DC + N GC G + AF++I N GI T
Sbjct: 150 CWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNNGGIKT 209
Query: 186 EADYPYHQVQGSCGREHAAAAK---ISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDF 242
E YPY R ++ + I +E +P DE+ LLKAV+ QPVS+ I+ DF
Sbjct: 210 EETYPYDSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDF 269
Query: 243 KNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQR----D 298
+ Y G+F G CGTQL+H V I+G+G T++GTKYW+++NSWG WGE GY+RI+R +
Sbjct: 270 QLYSEGVFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIERGISEN 329
Query: 299 EGLCGIGTQAAYP 311
EG CGI +A+YP
Sbjct: 330 EGRCGIAMEASYP 342
>sp|O23791|BROM1_ANACO Fruit bromelain OS=Ananas comosus PE=1 SV=1
Length = 351
Score = 283 bits (725), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 135/311 (43%), Positives = 204/311 (65%), Gaps = 17/311 (5%)
Query: 9 IAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQF 68
+ ++ E+WMAE+GR YKD+ EK RF+IFK N+++I+ N+ N + +Y LG NQF
Sbjct: 33 MMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNEN------SYTLGINQF 86
Query: 69 SDLTNAEFRASYAGNSMAITSQHS---SFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAA 125
+D+T +EF A Y G S+ + + SF N++ VP S+DWR+ GAV +KNQ C +
Sbjct: 87 TDMTKSEFVAQYTGVSLPLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGS 146
Query: 126 CWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIAT 185
CW+F+A+A VEGI +I +G L+ LSEQ++LDC+ + GC G + A+ +II N G+ T
Sbjct: 147 CWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVS--YGCKGGWVNKAYDFIISNNGVTT 204
Query: 186 EADYPYHQVQGSC-GREHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFKN 244
E +YPY QG+C +A I+ Y + DE++++ AVS QP++ I+ + ++F+
Sbjct: 205 EENYPYLAYQGTCNANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDAS-ENFQY 263
Query: 245 YKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQR----DEG 300
Y GG+F+G CGT L+HA+TIIG+G GTKYW+++NSWG +WGE GY+R+ R G
Sbjct: 264 YNGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSSG 323
Query: 301 LCGIGTQAAYP 311
+CGI +P
Sbjct: 324 VCGIAMAPLFP 334
>sp|Q9STL4|CEP2_ARATH KDEL-tailed cysteine endopeptidase CEP2 OS=Arabidopsis thaliana
GN=CEP2 PE=2 SV=1
Length = 361
Score = 281 bits (720), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 202/317 (63%), Gaps = 28/317 (8%)
Query: 13 HEKWMAEHG--RSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSD 70
+++W + H RS E++ RF +F+ N+ ++ N N R+Y+L N+F+D
Sbjct: 38 YDRWRSHHSVPRSLN---EREKRFNVFRHNVMHVHNTNKKN-------RSYKLKLNKFAD 87
Query: 71 LTNAEFRASYAGNSMAIT---------SQHSSFKYQNLTQVPTSMDWREKGAVTSIKNQG 121
LT EF+ +Y G+++ S+ + ++NL+++P+S+DWR+KGAVT IKNQG
Sbjct: 88 LTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDHENLSKLPSSVDWRKKGAVTEIKNQG 147
Query: 122 GCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQ 181
C +CWAFS VAAVEGI +I + L+ LSEQ+L+DC + N GC G +IAF++I KN
Sbjct: 148 KCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVDCDTKQNEGCNGGLMEIAFEFIKKNG 207
Query: 182 GIATEADYPYHQVQGSC--GREHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTG 239
GI TE YPY + G C +++ I +E +P DE ALLKAV+ QPVS+ I+
Sbjct: 208 GITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGS 267
Query: 240 QDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD- 298
DF+ Y G+F G CGT+L+H V +G+G +E G KYW+++NSWG WGE GY++I+R+
Sbjct: 268 SDFQFYSEGVFTGSCGTELNHGVAAVGYG-SERGKKYWIVRNSWGAEWGEGGYIKIEREI 326
Query: 299 ---EGLCGIGTQAAYPI 312
EG CGI +A+YPI
Sbjct: 327 DEPEGRCGIAMEASYPI 343
>sp|P25250|CYSP2_HORVU Cysteine proteinase EP-B 2 OS=Hordeum vulgare GN=EPB2 PE=1 SV=1
Length = 373
Score = 277 bits (709), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 193/315 (61%), Gaps = 22/315 (6%)
Query: 13 HEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLT 72
+E+W + H R + EK RF FK N +I ++ N+ + Y+L N+F D+
Sbjct: 46 YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFI------HSHNKRGDHPYRLHLNRFGDMD 98
Query: 73 NAEFRASYAGNSMAITSQHSS----FKYQ--NLTQVPTSMDWREKGAVTSIKNQGGCAAC 126
AEFRA++ G+ T F Y N++ +P S+DWR+KGAVT +K+QG C +C
Sbjct: 99 QAEFRATFVGDLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSC 158
Query: 127 WAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIATE 186
WAFS V +VEGI I +G+L+ LSEQ+L+DC + N GC G D AF+YI N G+ TE
Sbjct: 159 WAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLITE 218
Query: 187 ADYPYHQVQGSCGREHAA-----AAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQD 241
A YPY +G+C AA I ++ +P+ E+ L +AV+ QPVS+ +E +G+
Sbjct: 219 AAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKA 278
Query: 242 FKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRDE-- 299
F Y G+F G CGT+LDH V ++G+G EDG YW +KNSWG +WGE GY+R+++D
Sbjct: 279 FMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGA 338
Query: 300 --GLCGIGTQAAYPI 312
GLCGI +A+YP+
Sbjct: 339 SGGLCGIAMEASYPV 353
>sp|A5HII1|ACTN_ACTDE Actinidain OS=Actinidia deliciosa PE=1 SV=1
Length = 380
Score = 277 bits (709), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 195/312 (62%), Gaps = 15/312 (4%)
Query: 9 IAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQF 68
+ +E W+ ++G+SY E + RF+IFK+ L +ID+ N NR+Y++G NQF
Sbjct: 38 VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDE------HNADTNRSYKVGLNQF 91
Query: 69 SDLTNAEFRASYAGNSMAITSQHSSFKYQ-NLTQV-PTSMDWREKGAVTSIKNQGGCAAC 126
+DLT+ EFR++Y G + S +Y+ + QV P+ +DWR GAV IK+QG C C
Sbjct: 92 ADLTDEEFRSTYLGFTSGSNKTKVSNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGC 151
Query: 127 WAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNS-GCVAGKSDIAFKYIIKNQGIAT 185
WAFSA+A VEGI +I +G LI LSEQ+L+DC N+ GC G F++II N GI T
Sbjct: 152 WAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINT 211
Query: 186 EADYPYHQVQGSCGRE--HAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFK 243
E +YPY G C + + I +YE +P +E AL AV+ QPVS+ ++ G FK
Sbjct: 212 EENYPYTAQDGECNLDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFK 271
Query: 244 NYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD---EG 300
+Y GIF G CGT +DHAVTI+G+G TE G YW++KNSW TWGE GYMRI R+ G
Sbjct: 272 HYSSGIFTGPCGTAIDHAVTIVGYG-TEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAG 330
Query: 301 LCGIGTQAAYPI 312
CGI T +YP+
Sbjct: 331 TCGIATMPSYPV 342
>sp|P25249|CYSP1_HORVU Cysteine proteinase EP-B 1 OS=Hordeum vulgare GN=EPB1 PE=2 SV=1
Length = 371
Score = 277 bits (708), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 194/315 (61%), Gaps = 22/315 (6%)
Query: 13 HEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLT 72
+E+W + H R + EK RF FK N +I ++ N+ + Y+L N+F D+
Sbjct: 46 YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFI------HSHNKRGDHPYRLHLNRFGDMD 98
Query: 73 NAEFRASYAGN----SMAITSQHSSFKYQ--NLTQVPTSMDWREKGAVTSIKNQGGCAAC 126
AEFRA++ G+ + A F Y N++ +P S+DWR+KGAVT +K+QG C +C
Sbjct: 99 QAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSC 158
Query: 127 WAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIATE 186
WAFS V +VEGI I +G+L+ LSEQ+L+DC + N GC G D AF+YI N G+ TE
Sbjct: 159 WAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLITE 218
Query: 187 ADYPYHQVQGSCGREHAA-----AAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQD 241
A YPY +G+C AA I ++ +P+ E+ L +AV+ QPVS+ +E +G+
Sbjct: 219 AAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKA 278
Query: 242 FKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRDE-- 299
F Y G+F G CGT+LDH V ++G+G EDG YW +KNSWG +WGE GY+R+++D
Sbjct: 279 FMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGA 338
Query: 300 --GLCGIGTQAAYPI 312
GLCGI +A+YP+
Sbjct: 339 SGGLCGIAMEASYPV 353
>sp|P00785|ACTN_ACTCH Actinidain OS=Actinidia chinensis PE=1 SV=4
Length = 380
Score = 273 bits (699), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 193/312 (61%), Gaps = 15/312 (4%)
Query: 9 IAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQF 68
+ +E W+ ++G+SY E + RF+IFK+ L +ID+ N NR+Y++G NQF
Sbjct: 38 VKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDE------HNADTNRSYKVGLNQF 91
Query: 69 SDLTNAEFRASYAGNSMAITSQHSSFKYQ-NLTQV-PTSMDWREKGAVTSIKNQGGCAAC 126
+DLT+ EFR++Y + S +Y+ + QV P+ +DWR GAV IK+QG C C
Sbjct: 92 ADLTDEEFRSTYLRFTSGSNKTKVSNRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGC 151
Query: 127 WAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNS-GCVAGKSDIAFKYIIKNQGIAT 185
WAFSA+A VEGI +I +G LI LSEQ+L+DC N+ GC G F++II N GI T
Sbjct: 152 WAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFIINNGGINT 211
Query: 186 EADYPYHQVQGSCG--REHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFK 243
E +YPY G C ++ I +YE +P +E AL AV+ QPVS+ ++ G FK
Sbjct: 212 EENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFK 271
Query: 244 NYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD---EG 300
Y GIF G CGT +DHAVTI+G+G TE G YW++KNSW TWGE GYMRI R+ G
Sbjct: 272 QYSSGIFTGPCGTAVDHAVTIVGYG-TEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAG 330
Query: 301 LCGIGTQAAYPI 312
CGI T +YP+
Sbjct: 331 TCGIATMPSYPV 342
>sp|Q9LT77|CPR1_ARATH Probable cysteine proteinase At3g19400 OS=Arabidopsis thaliana
GN=At3g19400 PE=2 SV=1
Length = 362
Score = 273 bits (697), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 196/311 (63%), Gaps = 19/311 (6%)
Query: 13 HEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLT 72
+E+W+ E+ ++Y EK+ RFKIFK NL+++D+ N +RT+++G +F+DLT
Sbjct: 44 YEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDE------HNSVPDRTFEVGLTRFADLT 97
Query: 73 NAEFRASYAGNSMAITS---QHSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAACWAF 129
N EFRA Y M T + + Y+ +P +DWR GAV S+K+QG C +CWAF
Sbjct: 98 NEEFRAIYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAF 157
Query: 130 SAVAAVEGITQISSGNLIRLSEQQLLDCSSNG-NSGCVAGKSDIAFKYIIKNQGIATEAD 188
SAV AVEGI QI++G LI LSEQ+L+DC N+GC G + AF++I+KN GI T+ D
Sbjct: 158 SAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQD 217
Query: 189 YPYHQVQ-GSCGRE---HAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFKN 244
YPY+ G C + + I YE +P DE++L KAV+ QPVS+ IE + Q F+
Sbjct: 218 YPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQL 277
Query: 245 YKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD----EG 300
YK G+ G CG LDH V ++G+G+T G YW+I+NSWG WG++GY+++QR+ G
Sbjct: 278 YKSGVMTGTCGISLDHGVVVVGYGSTS-GEDYWIIRNSWGLNWGDSGYVKLQRNIDDPFG 336
Query: 301 LCGIGTQAAYP 311
CGI +YP
Sbjct: 337 KCGIAMMPSYP 347
>sp|Q9SUT0|CPR3_ARATH Probable cysteine proteinase At4g11310 OS=Arabidopsis thaliana
GN=At4g11310 PE=2 SV=1
Length = 364
Score = 269 bits (688), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 137/312 (43%), Positives = 198/312 (63%), Gaps = 22/312 (7%)
Query: 14 EKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTN 73
E WM +HG+ Y EK+ R IF+ NL +I NN N+ N +Y+LG F+DL+
Sbjct: 50 ESWMVKHGKVYGSVAEKERRLTIFEDNLRFI----NNRNAE---NLSYRLGLTGFADLSL 102
Query: 74 AEFRASYAGNSMAITSQH----SSFKYQNLTQ--VPTSMDWREKGAVTSIKNQGGCAACW 127
E++ G H SS +Y+ +P S+DWR +GAVT +K+QG C +CW
Sbjct: 103 HEYKEVCHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCW 162
Query: 128 AFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIATEA 187
AFS V AVEG+ +I +G L+ LSEQ L++C+ N+GC GK + A+++I+KN G+ T+
Sbjct: 163 AFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKLETAYEFIMKNGGLGTDN 221
Query: 188 DYPYHQVQGSCG---REHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFKN 244
DYPY V G C +E+ I YE LP+ DE AL+KAV+ QPV+ I+ + ++F+
Sbjct: 222 DYPYKAVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQL 281
Query: 245 YKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD----EG 300
Y+ G+F+G CGT L+H V ++G+G TE+G YWL+KNS G TWGEAGYM++ R+ G
Sbjct: 282 YESGVFDGSCGTNLNHGVVVVGYG-TENGRDYWLVKNSRGITWGEAGYMKMARNIANPRG 340
Query: 301 LCGIGTQAAYPI 312
LCGI +A+YP+
Sbjct: 341 LCGIAMRASYPL 352
>sp|Q7XR52|CYSP1_ORYSJ Cysteine protease 1 OS=Oryza sativa subsp. japonica GN=CP1 PE=2
SV=2
Length = 490
Score = 269 bits (687), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 133/295 (45%), Positives = 189/295 (64%), Gaps = 15/295 (5%)
Query: 29 EKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNAEFRASYAGNSMAIT 88
E + RF++F NL+++D N + G ++LG N+F+DLTN EFRA+Y G + A
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGG----FRLGMNRFADLTNGEFRATYLGTTPAGR 139
Query: 89 SQH--SSFKYQNLTQVPTSMDWREKGAVTS-IKNQGGCAACWAFSAVAAVEGITQISSGN 145
+ ++++ + +P S+DWR+KGAV + +KNQG C +CWAFSAVAAVEGI +I +G
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 146 LIRLSEQQLLDCSSNG-NSGCVAGKSDIAFKYIIKNQGIATEADYPYHQVQGSC--GREH 202
L+ LSEQ+L++C+ NG NSGC G D AF +I +N G+ TE DYPY + G C +
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 203 AAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFKNYKGGIFNGVCGTQLDHAV 262
I +E +P DE +L KAV+ QPVS+ I+ G++F+ Y G+F G CGT LDH V
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 263 TIIGFGT-TEDGTKYWLIKNSWGDTWGEAGYMRIQRD----EGLCGIGTQAAYPI 312
+G+GT G YW ++NSWG WGE GY+R++R+ G CGI A+YPI
Sbjct: 320 VAVGYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>sp|Q9SUS9|CPR4_ARATH Probable cysteine proteinase At4g11320 OS=Arabidopsis thaliana
GN=At4g11320 PE=2 SV=1
Length = 371
Score = 267 bits (683), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 136/313 (43%), Positives = 199/313 (63%), Gaps = 24/313 (7%)
Query: 14 EKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTN 73
E WM +HG+ Y EK+ R IF+ NL +I N N S Y+LG N+F+DL+
Sbjct: 57 ESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLS-------YRLGLNRFADLSL 109
Query: 74 AEFRASYAG-------NSMAITSQHSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAAC 126
E+ G N + +TS + +K + +P S+DWR +GAVT +K+QG C +C
Sbjct: 110 HEYGEICHGADPRPPRNHVFMTSSNR-YKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSC 168
Query: 127 WAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIATE 186
WAFS V AVEG+ +I +G L+ LSEQ L++C+ N+GC GK + A+++I+ N G+ T+
Sbjct: 169 WAFSTVGAVEGLNKIVTGELVTLSEQDLINCNKE-NNGCGGGKVETAYEFIMNNGGLGTD 227
Query: 187 ADYPYHQVQGSC-GR--EHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFK 243
DYPY + G C GR E I YE LP+ DE AL+KAV+ QPV+ ++ + ++F+
Sbjct: 228 NDYPYKALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQ 287
Query: 244 NYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD----E 299
Y+ G+F+G CGT L+H V ++G+G TE+G YW++KNS GDTWGEAGYM++ R+
Sbjct: 288 LYESGVFDGTCGTNLNHGVVVVGYG-TENGRDYWIVKNSRGDTWGEAGYMKMARNIANPR 346
Query: 300 GLCGIGTQAAYPI 312
GLCGI +A+YP+
Sbjct: 347 GLCGIAMRASYPL 359
>sp|P14080|PAPA2_CARPA Chymopapain OS=Carica papaya PE=1 SV=2
Length = 352
Score = 265 bits (676), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 135/315 (42%), Positives = 191/315 (60%), Gaps = 21/315 (6%)
Query: 9 IAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQF 68
+ + + WM +H + Y+ EK RF+IF+ NL YID+ N NNS Y LG N F
Sbjct: 44 LIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNS-------YWLGLNGF 96
Query: 69 SDLTNAEFRASYAGNSMAITS-----QHSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGC 123
+DL+N EF+ Y G + + F Y+++T P S+DWR KGAVT +KNQG C
Sbjct: 97 ADLSNDEFKKKYVGFVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGAC 156
Query: 124 AACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGI 183
+CWAFS +A VEGI +I +GNL+ LSEQ+L+DC + + GC G + +Y + N G+
Sbjct: 157 GSCWAFSTIATVEGINKIVTGNLLELSEQELVDCDKH-SYGCKGGYQTTSLQY-VANNGV 214
Query: 184 ATEADYPYHQVQGSC--GREHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQD 241
T YPY Q C + KI+ Y+ +PS E + L A++ QP+S+ +E G+
Sbjct: 215 HTSKVYPYQAKQYKCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKP 274
Query: 242 FKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQR---- 297
F+ YK G+F+G CGT+LDHAVT +G+GT+ DG Y +IKNSWG WGE GYMR++R
Sbjct: 275 FQLYKSGVFDGPCGTKLDHAVTAVGYGTS-DGKNYIIIKNSWGPNWGEKGYMRLKRQSGN 333
Query: 298 DEGLCGIGTQAAYPI 312
+G CG+ + YP
Sbjct: 334 SQGTCGVYKSSYYPF 348
>sp|P05994|PAPA4_CARPA Papaya proteinase 4 OS=Carica papaya PE=1 SV=3
Length = 348
Score = 249 bits (637), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 138/306 (45%), Positives = 190/306 (62%), Gaps = 19/306 (6%)
Query: 16 WMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNAE 75
WM +H ++YK+ EK RF+IFK NL+YID+ N N Y LG N+FSDL+N E
Sbjct: 51 WMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMING-------YWLGLNEFSDLSNDE 103
Query: 76 FRASYAGN-SMAITSQ--HSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAACWAFSAV 132
F+ Y G+ T+Q F +++ +P S+DWR KGAVT +K+QG C +CWAFS V
Sbjct: 104 FKEKYVGSLPEDYTNQPYDEEFVNEDIVDLPESVDWRAKGAVTPVKHQGYCESCWAFSTV 163
Query: 133 AAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIATEADYPYH 192
A VEGI +I +GNL+ LSEQ+L+DC + GC G + +Y+ +N GI A YPY
Sbjct: 164 ATVEGINKIKTGNLVELSEQELVDCDKQ-SYGCNRGYQSTSLQYVAQN-GIHLRAKYPYI 221
Query: 193 QVQGSCGREHAAAAKISSYEV--LPSGDEQALLKAVSMQPVSINIEGTGQDFKNYKGGIF 250
Q +C K+ + V + S +E +LL A++ QPVS+ +E G+DF+NYKGGIF
Sbjct: 222 AKQQTCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGIF 281
Query: 251 NGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQR----DEGLCGIGT 306
G CGT++DHAVT +G+G + LIKNSWG WGE GY+RI+R G+CG+
Sbjct: 282 EGSCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGPGWGENGYIRIRRASGNSPGVCGVYR 340
Query: 307 QAAYPI 312
+ YPI
Sbjct: 341 SSYYPI 346
>sp|P10056|PAPA3_CARPA Caricain OS=Carica papaya PE=1 SV=2
Length = 348
Score = 249 bits (635), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 133/305 (43%), Positives = 185/305 (60%), Gaps = 19/305 (6%)
Query: 16 WMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNAE 75
WM H + Y++ EK RF+IFK NL YID+ N NNS Y LG N+F+DL+N E
Sbjct: 51 WMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNS-------YWLGLNEFADLSNDE 103
Query: 76 FRASYAGNSMAITSQHS---SFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAACWAFSAV 132
F Y G+ + T + S F ++ +P ++DWR+KGAVT +++QG C +CWAFSAV
Sbjct: 104 FNEKYVGSLIDATIEQSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAV 163
Query: 133 AAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIATEADYPYH 192
A VEGI +I +G L+ LSEQ+L+DC + GC G A +Y+ KN GI + YPY
Sbjct: 164 ATVEGINKIRTGKLVELSEQELVDCERRSH-GCKGGYPPYALEYVAKN-GIHLRSKYPYK 221
Query: 193 QVQGSCGREHAAAAKISSYEV--LPSGDEQALLKAVSMQPVSINIEGTGQDFKNYKGGIF 250
QG+C + + + V + +E LL A++ QPVS+ +E G+ F+ YKGGIF
Sbjct: 222 AKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIF 281
Query: 251 NGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQR----DEGLCGIGT 306
G CGT++DHAVT +G+G + LIKNSWG WGE GY+RI+R G+CG+
Sbjct: 282 EGPCGTKVDHAVTAVGYGKSGGKGYI-LIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYK 340
Query: 307 QAAYP 311
+ YP
Sbjct: 341 SSYYP 345
>sp|Q9LXW3|CPR2_ARATH Probable cysteine proteinase At3g43960 OS=Arabidopsis thaliana
GN=At3g43960 PE=2 SV=1
Length = 376
Score = 248 bits (632), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 193/320 (60%), Gaps = 34/320 (10%)
Query: 13 HEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLT 72
+E+W+ E+G++Y EK+ RFKIFK NL+ I++ N++ N R+Y+ G N+FSDLT
Sbjct: 41 YEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPN------RSYERGLNKFSDLT 94
Query: 73 NAEFRASYAGNSM---AITSQHSSFKYQNLTQVPTSMDWREKGAVTS-IKNQGGCAACWA 128
EF+ASY G M +++ ++Y+ +P +DWRE+GAV +K QG C +CWA
Sbjct: 95 ADEFQASYLGGKMEKKSLSDVAERYQYKEGDVLPDEVDWRERGAVVPRVKRQGECGSCWA 154
Query: 129 FSAVAAVEGITQISSGNLIRLSEQQLLDCSS-NGNSGCVAGKSDIAFKYIIKNQGIATEA 187
F+A AVEGI QI++G L+ LSEQ+L+DC N N GC G + AF++I +N GI ++
Sbjct: 155 FAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGGIVSD- 213
Query: 188 DYPYHQVQGSCGREHAA----------AAKISSYEVLPSGDEQALLKAVSMQPVSINIEG 237
+V G G + AA I+ +EV+P DE +L KAV+ QP+S+ I
Sbjct: 214 -----EVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMI-- 266
Query: 238 TGQDFKNYKGGIFNGVCGTQL-DHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQ 296
+ + +YK G++ G C DH V I+G+GT+ D YWLI+NSWG WGE GY+R+Q
Sbjct: 267 SAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSDEGDYWLIRNSWGPEWGEGGYLRLQ 326
Query: 297 RD----EGLCGIGTQAAYPI 312
R+ G C + YPI
Sbjct: 327 RNFHEPTGKCAVAVAPVYPI 346
>sp|P00784|PAPA1_CARPA Papain OS=Carica papaya PE=1 SV=1
Length = 345
Score = 244 bits (624), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 185/315 (58%), Gaps = 26/315 (8%)
Query: 9 IAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQF 68
+ + E WM +H + YK+ EK RF+IFK NL+YID+ N NNS Y LG N F
Sbjct: 44 LIQLFESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNNS-------YWLGLNVF 96
Query: 69 SDLTNAEFRASYAGNSMAITSQHSSFKYQNL-----TQVPTSMDWREKGAVTSIKNQGGC 123
+D++N EF+ Y G S+A + Y+ + +P +DWR+KGAVT +KNQG C
Sbjct: 97 ADMSNDEFKEKYTG-SIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSC 155
Query: 124 AACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGI 183
+CWAFSAV +EGI +I +GNL SEQ+LLDC + GC G A + ++ GI
Sbjct: 156 GSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRR-SYGCNGGYPWSALQ-LVAQYGI 213
Query: 184 ATEADYPYHQVQGSC-GREHAA-AAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQD 241
YPY VQ C RE AAK + +E ALL +++ QPVS+ +E G+D
Sbjct: 214 HYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKD 273
Query: 242 FKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQR---- 297
F+ Y+GGIF G CG ++DHAV +G+G Y LIKNSWG WGE GY+RI+R
Sbjct: 274 FQLYRGGIFVGPCGNKVDHAVAAVGYGPN-----YILIKNSWGTGWGENGYIRIKRGTGN 328
Query: 298 DEGLCGIGTQAAYPI 312
G+CG+ T + YP+
Sbjct: 329 SYGVCGLYTSSFYPV 343
>sp|P20721|CYSPL_SOLLC Low-temperature-induced cysteine proteinase (Fragment) OS=Solanum
lycopersicum PE=2 SV=1
Length = 346
Score = 240 bits (612), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 114/217 (52%), Positives = 152/217 (70%), Gaps = 7/217 (3%)
Query: 102 VPTSMDWREKGAVTSIKNQGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNG 161
+P S+DWREKG + +K+QG C +CWAFSAVAA+E I I +GNLI LSEQ+L+DC +
Sbjct: 18 LPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRSY 77
Query: 162 NSGCVAGKSDIAFKYIIKNQGIATEADYPYHQVQGSCG--REHAAAAKISSYEVLPSGDE 219
N GC G D AF+++IKN GI TE DYPY + G C R++A KI SYE +P +E
Sbjct: 78 NEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYEDVPVNNE 137
Query: 220 QALLKAVSMQPVSINIEGTGQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLI 279
+AL KAV+ QPVSI +E G+DF++YK GIF G CGT +DH V I G+G TE+G YW++
Sbjct: 138 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYG-TENGMDYWIV 196
Query: 280 KNSWGDTWGEAGYMRIQRD----EGLCGIGTQAAYPI 312
+NSWG E GY+R+QR+ GLCG+ + +YP+
Sbjct: 197 RNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPV 233
>sp|Q95029|CATL_DROME Cathepsin L OS=Drosophila melanogaster GN=Cp1 PE=2 SV=2
Length = 371
Score = 230 bits (586), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 188/320 (58%), Gaps = 18/320 (5%)
Query: 9 IAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQF 68
+ E+ + EH ++Y+DE E+ R KIF +N I K +N EG +++L N++
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAK--HNQRFAEG-KVSFKLAVNKY 111
Query: 69 SDLTNAEFRASYAGNSMAITSQ----HSSFKYQNL-----TQVPTSMDWREKGAVTSIKN 119
+DL + EFR G + + Q SFK +P S+DWR KGAVT++K+
Sbjct: 112 ADLLHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKD 171
Query: 120 QGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSN-GNSGCVAGKSDIAFKYII 178
QG C +CWAFS+ A+EG SG L+ LSEQ L+DCS+ GN+GC G D AF+YI
Sbjct: 172 QGHCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIK 231
Query: 179 KNQGIATEADYPYHQVQGSCG-REHAAAAKISSYEVLPSGDEQALLKAV-SMQPVSINIE 236
N GI TE YPY + SC + A + +P GDE+ + +AV ++ PVS+ I+
Sbjct: 232 DNGGIDTEKSYPYEAIDDSCHFNKGTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAID 291
Query: 237 GTGQDFKNYKGGIFNGV-CGTQ-LDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMR 294
+ + F+ Y G++N C Q LDH V ++GFGT E G YWL+KNSWG TWG+ G+++
Sbjct: 292 ASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIK 351
Query: 295 IQRD-EGLCGIGTQAAYPIT 313
+ R+ E CGI + ++YP+
Sbjct: 352 MLRNKENQCGIASASSYPLV 371
>sp|P82474|CPGP2_ZINOF Zingipain-2 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 229 bits (583), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 109/216 (50%), Positives = 149/216 (68%), Gaps = 7/216 (3%)
Query: 102 VPTSMDWREKGAVTSIKNQGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNG 161
+P S+DWRE GAV +KNQGGC +CWAFS VAAVEGI QI +G+LI LSEQQL+DC++
Sbjct: 3 LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTT-A 61
Query: 162 NSGCVAGKSDIAFKYIIKNQGIATEADYPYHQVQGSCGRE-HAAAAKISSYEVLPSGDEQ 220
N GC G + AF++I+ N GI +E YPY G C +A I SYE +PS +EQ
Sbjct: 62 NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNSTVNAPVVSIDSYENVPSHNEQ 121
Query: 221 ALLKAVSMQPVSINIEGTGQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIK 280
+L KAV+ QPVS+ ++ G+DF+ Y+ GIF G C +HA+T++G+GT D +W++K
Sbjct: 122 SLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTEND-KDFWIVK 180
Query: 281 NSWGDTWGEAGYMRIQRD----EGLCGIGTQAAYPI 312
NSWG WGE+GY+R +R+ +G CGI A+YP+
Sbjct: 181 NSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216
>sp|Q26636|CATL_SARPE Cathepsin L OS=Sarcophaga peregrina PE=1 SV=1
Length = 339
Score = 226 bits (577), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 188/317 (59%), Gaps = 17/317 (5%)
Query: 9 IAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQF 68
I E+ + +H ++Y +E+E+ R KIF +N I K +N +G +Y+LG N++
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAK--HNQLFAQG-KVSYKLGLNKY 80
Query: 69 SDLTNAEFRASYAGNSMAITSQH--------SSFKYQNLTQVPTSMDWREKGAVTSIKNQ 120
+D+ + EF+ + G + + +++ VP S+DWRE GAVT +K+Q
Sbjct: 81 ADMLHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQ 140
Query: 121 GGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSN-GNSGCVAGKSDIAFKYIIK 179
G C +CWAFS+ A+EG +G L+ LSEQ L+DCS+ GN+GC G D AF+YI
Sbjct: 141 GHCGSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKD 200
Query: 180 NQGIATEADYPYHQVQGSCGREHAA-AAKISSYEVLPSGDEQALLKAV-SMQPVSINIEG 237
N GI TE YPY + SC A A + + +P GDE+ + KAV +M PVS+ I+
Sbjct: 201 NGGIDTEKSYPYEGIDDSCHFNKATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDA 260
Query: 238 TGQDFKNYKGGIFNGV-CGTQ-LDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRI 295
+ + F+ Y G++N C Q LDH V ++G+GT E G YWL+KNSWG TWGE GY+++
Sbjct: 261 SHESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKM 320
Query: 296 QRDE-GLCGIGTQAAYP 311
R++ CGI T ++YP
Sbjct: 321 ARNQNNQCGIATASSYP 337
>sp|P82473|CPGP1_ZINOF Zingipain-1 OS=Zingiber officinale PE=1 SV=1
Length = 221
Score = 226 bits (576), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 111/216 (51%), Positives = 145/216 (67%), Gaps = 7/216 (3%)
Query: 102 VPTSMDWREKGAVTSIKNQGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNG 161
+P S+DWREKGAV +KNQGGC +CWAF A+AAVEGI QI +G+LI LSEQQL+DCS+
Sbjct: 3 LPDSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTR- 61
Query: 162 NSGCVAGKSDIAFKYIIKNQGIATEADYPYHQVQGSCG-REHAAAAKISSYEVLPSGDEQ 220
N GC G AF+YII N GI +E YPY G+C +E+A I SY +PS DE+
Sbjct: 62 NHGCEGGWPYRAFQYIINNGGINSEEHYPYTGTNGTCDTKENAHVVSIDSYRNVPSNDEK 121
Query: 221 ALLKAVSMQPVSINIEGTGQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIK 280
+L KAV+ QPVS+ ++ G+DF+ Y+ GIF G C +H T +G TE+ YW +K
Sbjct: 122 SLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRT-VGGRETENDKDYWTVK 180
Query: 281 NSWGDTWGEAGYMRIQRD----EGLCGIGTQAAYPI 312
NSWG WGE+GY+R++R+ G CGI +YPI
Sbjct: 181 NSWGKNWGESGYIRVERNIAESSGKCGIAISPSYPI 216
>sp|Q86GF7|CRUST_PANBO Crustapain OS=Pandalus borealis GN=Cys PE=1 SV=1
Length = 323
Score = 218 bits (554), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 125/315 (39%), Positives = 182/315 (57%), Gaps = 9/315 (2%)
Query: 5 ASISIAEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLG 64
A++S + E + + G+ Y + E+ R +F L++I + N + E TY L
Sbjct: 12 AAVSAIGEWENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGE---VTYWLK 68
Query: 65 TNQFSDLTNAEFRASYAGNSMAITSQHSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCA 124
N FSDLT+ E A+ G + K T + +DWR KGAVT +K+QG C
Sbjct: 69 INNFSDLTHEEVLATKTGMTRRRHPLSVLPKSAPTTPMAADVDWRNKGAVTPVKDQGQCG 128
Query: 125 ACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSN-GNSGCVAGKSDIAFKYIIKNQGI 183
+CWAFSAVAA+EG + +G+L+ LSEQ L+DCSS+ GN GC G A++YII N+GI
Sbjct: 129 SCWAFSAVAALEGAHFLKTGDLVSLSEQNLVDCSSSYGNQGCNGGWPYQAYQYIIANRGI 188
Query: 184 ATEADYPYHQVQGSCGREHA-AAAKISSYEVLPSGDEQALLKAVSMQ-PVSINIEGTGQD 241
TE+ YPY + +C + A +SSY SGDE AL AV + PVS+ I+
Sbjct: 189 DTESSYPYKAIDDNCRYDAGNIGATVSSYVEPASGDESALQHAVQNEGPVSVCIDAGQSS 248
Query: 242 FKNYKGGI-FNGVCGTQL-DHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD- 298
F +Y GG+ + C + +HAVT +G+GT +G YW++KNSWG WGE+GY+++ R+
Sbjct: 249 FGSYGGGVYYEPNCDSWYANHAVTAVGYGTDANGGDYWIVKNSWGAWWGESGYIKMARNR 308
Query: 299 EGLCGIGTQAAYPIT 313
+ C I T + YP+
Sbjct: 309 DNNCAIATYSVYPVV 323
>sp|O70370|CATS_MOUSE Cathepsin S OS=Mus musculus GN=Ctss PE=2 SV=2
Length = 340
Score = 217 bits (553), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 123/304 (40%), Positives = 180/304 (59%), Gaps = 12/304 (3%)
Query: 16 WMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNAE 75
W H + YKD+ E+++R I+++NL++I + +N + G++ TYQ+G N D+TN E
Sbjct: 39 WKKTHEKEYKDKNEEEVRRLIWEKNLKFI--MIHNLEYSMGMH-TYQVGMNDMGDMTNEE 95
Query: 76 FRASYAGNSMAITSQHS-SFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAACWAFSAVAA 134
+ S + +F+ + +P ++DWREKG VT +K QG C ACWAFSAV A
Sbjct: 96 ILCRMGALRIPRQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGSCGACWAFSAVGA 155
Query: 135 VEGITQISSGNLIRLSEQQLLDCSSN---GNSGCVAGKSDIAFKYIIKNQGIATEADYPY 191
+EG ++ +G LI LS Q L+DCS+ GN GC G AF+YII N GI +A YPY
Sbjct: 156 LEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIIDNGGIEADASYPY 215
Query: 192 HQVQGSCG-REHAAAAKISSYEVLPSGDEQALLKAVSMQ-PVSINIEGTGQDFKNYKGGI 249
C AA S Y LP GDE AL +AV+ + PVS+ I+ + F YK G+
Sbjct: 216 KATDEKCHYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGIDASHSSFFFYKSGV 275
Query: 250 FNGV-CGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQR-DEGLCGIGTQ 307
++ C ++H V ++G+GT DG YWL+KNSWG +G+ GY+R+ R ++ CGI +
Sbjct: 276 YDDPSCTGNVNHGVLVVGYGTL-DGKDYWLVKNSWGLNFGDQGYIRMARNNKNHCGIASY 334
Query: 308 AAYP 311
+YP
Sbjct: 335 CSYP 338
>sp|Q23894|CYSP3_DICDI Cysteine proteinase 3 OS=Dictyostelium discoideum GN=cprC PE=3 SV=2
Length = 337
Score = 217 bits (552), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 191/311 (61%), Gaps = 25/311 (8%)
Query: 16 WMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNAE 75
WM + ++Y + E R++ FK+N++Y+ +N N ++T LG NQ +DL+N E
Sbjct: 37 WMRSNNKAYTHK-EFMPRYEEFKKNMDYV------HNWNSKGSKTV-LGLNQHADLSNEE 88
Query: 76 FRASYAGNSMAITSQHSSFKYQNL--------TQVPTSMDWREKGAVTSIKNQGGCAACW 127
+R +Y G I + + + +NL + P ++DWREK AVT +K+QG C +C+
Sbjct: 89 YRLNYLGTRAHI--KLNGYHKRNLGLRLNRPQFKQPLNVDWREKDAVTPVKDQGQCGSCY 146
Query: 128 AFSAVAAVEGITQISSGNLIRLSEQQLLDCSSN-GNSGCVAGKSDIAFKYIIKNQGIATE 186
+FS +VEG+T I +G L+ LSEQ +LDCSS+ GN GC G AF+YIIKN G+ +E
Sbjct: 147 SFSTTGSVEGVTAIKTGKLVSLSEQNILDCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSE 206
Query: 187 ADYPYH-QVQGSCG-REHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFKN 244
YPY +V C +E + AAKI+SY+ + +GDE L A+ + PVS+ I+ + F+
Sbjct: 207 EQYPYEMKVNDECKFQEGSVAAKITSYKEIEAGDENDLQNALLLNPVSVAIDASHNSFQL 266
Query: 245 YKGGI-FNGVCGTQ-LDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD-EGL 301
Y G+ + C ++ LDH V +G G T++G Y+++KNSWG +WG GY+ + R+ +
Sbjct: 267 YTAGVYYEPACSSEDLDHGVLAVGMG-TDNGEDYYIVKNSWGPSWGLNGYIHMARNKDNN 325
Query: 302 CGIGTQAAYPI 312
CGI T A+YPI
Sbjct: 326 CGISTMASYPI 336
>sp|P60994|ERVB_TABDI Ervatamin-B OS=Tabernaemontana divaricata PE=1 SV=1
Length = 215
Score = 213 bits (541), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 103/214 (48%), Positives = 139/214 (64%), Gaps = 6/214 (2%)
Query: 102 VPTSMDWREKGAVTSIKNQGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNG 161
+P+ +DWR KGAV SIKNQ C +CWAFSAVAAVE I +I +G LI LSEQ+L+DC +
Sbjct: 1 LPSFVDWRSKGAVNSIKNQKQCGSCWAFSAVAAVESINKIRTGQLISLSEQELVDCDT-A 59
Query: 162 NSGCVAGKSDIAFKYIIKNQGIATEADYPYHQVQGSCGREHAAAAKISSYEVLPSGDEQA 221
+ GC G + AF+YII N GI T+ +YPY VQGSC I+ ++ + +E A
Sbjct: 60 SHGCNGGWMNNAFQYIITNGGIDTQQNYPYSAVQGSCKPYRLRVVSINGFQRVTRNNESA 119
Query: 222 LLKAVSMQPVSINIEGTGQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKN 281
L AV+ QPVS+ +E G F++Y GIF G CGT +H V I+G+G T+ G YW+++N
Sbjct: 120 LQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYG-TQSGKNYWIVRN 178
Query: 282 SWGDTWGEAGYMRIQRD----EGLCGIGTQAAYP 311
SWG WG GY+ ++R+ GLCGI +YP
Sbjct: 179 SWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYP 212
>sp|P25782|CYSP2_HOMAM Digestive cysteine proteinase 2 OS=Homarus americanus GN=LCP2 PE=2
SV=1
Length = 323
Score = 211 bits (538), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 123/307 (40%), Positives = 176/307 (57%), Gaps = 11/307 (3%)
Query: 14 EKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTN 73
E + ++GR Y D E R IF+QN +YI++ N + E T+ L N+F D+T
Sbjct: 21 EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGE---VTFNLAMNKFGDMTL 77
Query: 74 AEFRASYAGNSMAITSQHSSFKYQNLT-QVPTSMDWREKGAVTSIKNQGGCAACWAFSAV 132
EF A GN ++ S F + T T +DWR KGAVT +K+QG C +CWAFS
Sbjct: 78 EEFNAVMKGNIPRRSAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFSTT 137
Query: 133 AAVEGITQISSGNLIRLSEQQLLDCSSN-GNSGCVAGKSDIAFKYIIKNQGIATEADYPY 191
++EG + +G+LI L+EQQL+DCS G GC G + AF YI N GI TEA YPY
Sbjct: 138 GSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAYPY 197
Query: 192 HQVQGSCGRE-HAAAAKISSYEVLPSGDEQALLKAV-SMQPVSINIEGTGQDFKNYKGGI 249
GSC + ++ AA S + + SG E L +AV + P+S+ I+ F+ Y G+
Sbjct: 198 EARDGSCRFDSNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYSSGV 257
Query: 250 F--NGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRDE-GLCGIGT 306
+ + LDHAV +G+G +E G +WL+KNSW +WG+AGY+++ R+ CGI T
Sbjct: 258 YYEPSCSPSYLDHAVLAVGYG-SEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNNCGIAT 316
Query: 307 QAAYPIT 313
A+YP+
Sbjct: 317 VASYPLV 323
>sp|P25326|CATS_BOVIN Cathepsin S OS=Bos taurus GN=CTSS PE=1 SV=2
Length = 331
Score = 211 bits (537), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 121/305 (39%), Positives = 183/305 (60%), Gaps = 15/305 (4%)
Query: 16 WMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNAE 75
W +G+ YK++ E+ R I+++NL+ + +N + G++ +Y+LG N D+T+ E
Sbjct: 31 WKKTYGKQYKEKNEEVARRLIWEKNLKTV--TLHNLEHSMGMH-SYELGMNHLGDMTSEE 87
Query: 76 FRASYAGNSMAITSQ---HSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAACWAFSAV 132
+ + S+ + SQ + ++K ++P SMDWREKG VT +K QG C +CWAFSAV
Sbjct: 88 VISLMS--SLRVPSQWPRNVTYKSDPNQKLPDSMDWREKGCVTEVKYQGACGSCWAFSAV 145
Query: 133 AAVEGITQISSGNLIRLSEQQLLDCSSN--GNSGCVAGKSDIAFKYIIKNQGIATEADYP 190
A+E ++ +G L+ LS Q L+DCS+ GN GC G AF+YII N GI +EA YP
Sbjct: 146 GALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYP 205
Query: 191 YHQVQGSCGRE-HAAAAKISSYEVLPSGDEQALLKAVSMQ-PVSINIEGTGQDFKNYKGG 248
Y + G C + AA S Y LP G E+AL +AV+ + PVS+ I+ + F YK G
Sbjct: 206 YKAMDGKCQYDVKNRAATCSRYIELPFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTG 265
Query: 249 I-FNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRDEG-LCGIGT 306
+ ++ C ++H V ++G+G DG YWL+KNSWG +G+ GY+R+ R+ G CGI
Sbjct: 266 VYYDPSCTQNVNHGVLVVGYGNL-DGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIAN 324
Query: 307 QAAYP 311
+YP
Sbjct: 325 YPSYP 329
>sp|P83654|ERVC_TABDI Ervatamin-C OS=Tabernaemontana divaricata PE=1 SV=1
Length = 208
Score = 209 bits (533), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 102/212 (48%), Positives = 138/212 (65%), Gaps = 9/212 (4%)
Query: 102 VPTSMDWREKGAVTSIKNQGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNG 161
+P +DWR+KGAVT +KNQG C +CWAFS V+ VE I QI +GNLI LSEQ+L+DC
Sbjct: 1 LPEQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKK- 59
Query: 162 NSGCVAGKSDIAFKYIIKNQGIATEADYPYHQVQGSCGREHAAAAKISSYEVLPSGDEQA 221
N GC+ G A++YII N GI T+A+YPY VQG C + + I Y +P +E A
Sbjct: 60 NHGCLGGAFVFAYQYIINNGGIDTQANYPYKAVQGPC-QAASKVVSIDGYNGVPFCNEXA 118
Query: 222 LLKAVSMQPVSINIEGTGQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKN 281
L +AV++QP ++ I+ + F+ Y GIF+G CGT+L+H VTI+G+ YW+++N
Sbjct: 119 LKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGYQAN-----YWIVRN 173
Query: 282 SWGDTWGEAGYMRIQR--DEGLCGIGTQAAYP 311
SWG WGE GY+R+ R GLCGI YP
Sbjct: 174 SWGRYWGEKGYIRMLRVGGCGLCGIARLPYYP 205
>sp|P84346|MEX1_JACME Mexicain OS=Jacaratia mexicana PE=1 SV=1
Length = 214
Score = 209 bits (533), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 103/216 (47%), Positives = 142/216 (65%), Gaps = 13/216 (6%)
Query: 103 PTSMDWREKGAVTSIKNQGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNGN 162
P S+DWREKGAVT +KNQ C +CWAFS VA +EGI +I +G LI LSEQ+LLDC +
Sbjct: 2 PESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYRSH 61
Query: 163 SGCVAGKSDIAFKYIIKNQGIATEADYPYHQVQGSCGREHAAAAK--ISSYEVLPSGDEQ 220
GC G + +Y++ N G+ TE +YPY + QG C + K I+ Y+ +P+ DE
Sbjct: 62 -GCDGGYQTPSLQYVVDN-GVHTEREYPYEKKQGRCRAKDKKGPKVYITGYKYVPANDEI 119
Query: 221 ALLKAVSMQPVSINIEGTGQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIK 280
+L++A++ QPVS+ + G+ F+ YKGGI+ G CGT DHAVT +G+G T Y L+K
Sbjct: 120 SLIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGKT-----YLLLK 174
Query: 281 NSWGDTWGEAGYMRIQ----RDEGLCGIGTQAAYPI 312
NSWG WGE GY+RI+ R +G CG+ T + +PI
Sbjct: 175 NSWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFFPI 210
>sp|P06797|CATL1_MOUSE Cathepsin L1 OS=Mus musculus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 209 bits (532), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 122/313 (38%), Positives = 181/313 (57%), Gaps = 15/313 (4%)
Query: 10 AEKHEKWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFS 69
AE H+ W + H R Y E++ R I+++N+ I +++N SN + + N F
Sbjct: 27 AEWHQ-WKSTHRRLYGTN-EEEWRRAIWEKNMRMI-QLHNGEYSNG--QHGFSMEMNAFG 81
Query: 70 DLTNAEFRASYAGNSMAITSQHSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAACWAF 129
D+TN EFR G + F+ + ++P S+DWREKG VT +KNQG C +CWAF
Sbjct: 82 DMTNEEFRQVVNGYRHQKHKKGRLFQEPLMLKIPKSVDWREKGCVTPVKNQGQCGSCWAF 141
Query: 130 SAVAAVEGITQISSGNLIRLSEQQLLDCS-SNGNSGCVAGKSDIAFKYIIKNQGIATEAD 188
SA +EG + +G LI LSEQ L+DCS + GN GC G D AF+YI +N G+ +E
Sbjct: 142 SASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEES 201
Query: 189 YPYHQVQGSCG-REHAAAAKISSYEVLPSGDEQALLKAV-SMQPVSINIEGTGQDFKNYK 246
YPY GSC R A A + + +P E+AL+KAV ++ P+S+ ++ + + Y
Sbjct: 202 YPYEAKDGSCKYRAEFAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYS 260
Query: 247 GGI-FNGVCGTQ-LDHAVTIIGF---GTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD-EG 300
GI + C ++ LDH V ++G+ GT + KYWL+KNSWG WG GY++I +D +
Sbjct: 261 SGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDN 320
Query: 301 LCGIGTQAAYPIT 313
CG+ T A+YP+
Sbjct: 321 HCGLATAASYPVV 333
>sp|Q8HY81|CATS_CANFA Cathepsin S OS=Canis familiaris GN=CTSS PE=2 SV=1
Length = 331
Score = 209 bits (532), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 120/305 (39%), Positives = 182/305 (59%), Gaps = 15/305 (4%)
Query: 16 WMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNAE 75
W + + YK+E E+ R I+++NL+++ + +N + G++ +Y LG N D+T E
Sbjct: 31 WKKTYSKQYKEENEEVARRLIWEKNLKFV--MLHNLEHSMGMH-SYDLGMNHLGDMTGEE 87
Query: 76 FRASYAGNSMAITSQ---HSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAACWAFSAV 132
S G S+ + SQ + +++ + ++P S+DWREKG VT +K QG C ACWAFSAV
Sbjct: 88 V-ISLMG-SLRVPSQWQRNVTYRSNSNQKLPDSVDWREKGCVTEVKYQGSCGACWAFSAV 145
Query: 133 AAVEGITQISSGNLIRLSEQQLLDCSSN--GNSGCVAGKSDIAFKYIIKNQGIATEADYP 190
A+E ++ +G L+ LS Q L+DCS+ GN GC G AF+YII N GI +EA YP
Sbjct: 146 GALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNNGIDSEASYP 205
Query: 191 YHQVQGSCGRE-HAAAAKISSYEVLPSGDEQALLKAVSMQ-PVSINIEGTGQDFKNYKGG 248
Y + G C + AA S Y LP G E AL +AV+ + PVS+ I+ + F Y+ G
Sbjct: 206 YKAMNGKCRYDSKKRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDASHYSFFLYRSG 265
Query: 249 I-FNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRDEG-LCGIGT 306
+ + C ++H V ++G+G +G YWL+KNSWG +G+ GY+R+ R+ G CGI +
Sbjct: 266 VYYEPSCTQNVNHGVLVVGYGNL-NGKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIAS 324
Query: 307 QAAYP 311
+YP
Sbjct: 325 YPSYP 329
>sp|Q8HY82|CATS_SAIBB Cathepsin S OS=Saimiri boliviensis boliviensis GN=CTSS PE=2 SV=1
Length = 330
Score = 209 bits (532), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 116/304 (38%), Positives = 183/304 (60%), Gaps = 14/304 (4%)
Query: 16 WMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNAE 75
W +G+ YK++ E+ +R I+++NL+++ + +N + G++ +Y LG N D+T+ E
Sbjct: 31 WKKTYGKQYKEKNEEAVRRLIWEKNLKFV--MLHNLEHSMGMH-SYDLGMNHLGDMTSEE 87
Query: 76 FRASYAGNSMAITSQ---HSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAACWAFSAV 132
+ + S+ + +Q + ++K +P S+DWREKG VT +K QG C ACWAFSAV
Sbjct: 88 VMSLMS--SLRVPNQWQRNITYKSNPNQMLPDSVDWREKGCVTEVKYQGSCGACWAFSAV 145
Query: 133 AAVEGITQISSGNLIRLSEQQLLDCSSN-GNSGCVAGKSDIAFKYIIKNQGIATEADYPY 191
A+E ++ +G L+ LS Q L+DCS GN GC G AF+YII N+GI +EA YPY
Sbjct: 146 GALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPY 205
Query: 192 HQVQGSCGREHA-AAAKISSYEVLPSGDEQALLKAVSMQ-PVSINIEGTGQDFKNYKGGI 249
C + AA S Y LP G E L +AV+ + PV + ++ + F Y+ G+
Sbjct: 206 KATDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGVDASHPSFFLYRSGV 265
Query: 250 -FNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRDEG-LCGIGTQ 307
++ C +++H V +IG+G +G +YWL+KNSWG +GE GY+R+ R++G CGI +
Sbjct: 266 YYDPACTQKVNHGVLVIGYGDL-NGKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASY 324
Query: 308 AAYP 311
+YP
Sbjct: 325 PSYP 328
>sp|P83443|MDO1_PSEMR Macrodontain-1 OS=Pseudananas macrodontes PE=1 SV=1
Length = 213
Score = 209 bits (531), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 97/215 (45%), Positives = 142/215 (66%), Gaps = 11/215 (5%)
Query: 102 VPTSMDWREKGAVTSIKNQGGCAACWAFSAVAAVEGITQISSGNLIRLSEQQLLDCSSNG 161
VP S+DWR+ GAV +KNQG C CWAF+A+A VEGI +I GNL+ LSEQ++LDC+
Sbjct: 2 VPQSIDWRDYGAVNEVKNQGPCGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAV-- 59
Query: 162 NSGCVAGKSDIAFKYIIKNQGIATEADYPYHQVQGSCGREHAA-AAKISSYEVLPSGDEQ 220
+ GC G + A+ +II N G+ T+ +YPY QG+C + +A I+ Y + DE
Sbjct: 60 SYGCKGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCNANYFPNSAYITGYSYVRRNDES 119
Query: 221 ALLKAVSMQPVSINIEGTGQDFKNYKGGIFNGVCGTQLDHAVTIIGFGTTEDGTKYWLIK 280
++ AVS QP++ I+ +G +F+ YKGG+++G CG L+HA+TIIG+G YW+++
Sbjct: 120 HMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGYGRDS----YWIVR 175
Query: 281 NSWGDTWGEAGYMRIQRDE----GLCGIGTQAAYP 311
NSWG +WG+ GY+RI+RD G+CGI +P
Sbjct: 176 NSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFP 210
>sp|Q9R014|CATJ_MOUSE Cathepsin J OS=Mus musculus GN=Ctsj PE=2 SV=2
Length = 334
Score = 208 bits (530), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 120/306 (39%), Positives = 179/306 (58%), Gaps = 16/306 (5%)
Query: 16 WMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNAE 75
W ++ +SY + E+ +R ++++N+ I K++N NS N T ++ N+F D T+ E
Sbjct: 32 WKTKYAKSYSPK-EEALRRAVWEENMRMI-KLHNKENSLGKNNFTMKM--NKFGDQTSEE 87
Query: 76 FRASYAGNSMAITSQHSSFKYQNLTQV--PTSMDWREKGAVTSIKNQGGCAACWAFSAVA 133
FR S +++ I + + QN + P DWRE+G VT ++NQG C +CWAF+A
Sbjct: 88 FRKSI--DNIPIPAAMTDPHAQNHVSIGLPDYKDWREEGYVTPVRNQGKCGSCWAFAAAG 145
Query: 134 AVEGITQISSGNLIRLSEQQLLDCSSN-GNSGCVAGKSDIAFKYIIKNQGIATEADYPYH 192
A+EG +GNL LS Q LLDCS GN GC +G + AF+Y++KN+G+ EA YPY
Sbjct: 146 AIEGQMFWKTGNLTPLSVQNLLDCSKTVGNKGCQSGTAHQAFEYVLKNKGLEAEATYPYE 205
Query: 193 QVQGSCG-REHAAAAKISSYEVLPSGDEQALLKAVSMQPVSINIEGTGQDFKNYKGGI-F 250
G C R A+A I+ Y LP + + S+ PVS I+ + F+ Y GGI +
Sbjct: 206 GKDGPCRYRSENASANITDYVNLPPNELYLWVAVASIGPVSAAIDASHDSFRFYNGGIYY 265
Query: 251 NGVCGTQ-LDHAVTIIGFGT---TEDGTKYWLIKNSWGDTWGEAGYMRIQRDE-GLCGIG 305
C + ++HAV ++G+G+ +DG YWLIKNSWG+ WG GYM+I +D CGI
Sbjct: 266 EPNCSSYFVNHAVLVVGYGSEGDVKDGNNYWLIKNSWGEEWGMNGYMQIAKDHNNHCGIA 325
Query: 306 TQAAYP 311
+ A+YP
Sbjct: 326 SLASYP 331
>sp|P22895|P34_SOYBN P34 probable thiol protease OS=Glycine max PE=1 SV=1
Length = 379
Score = 208 bits (529), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 120/317 (37%), Positives = 177/317 (55%), Gaps = 29/317 (9%)
Query: 16 WMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNAE 75
W +EHGR Y + E+ R +IFK N YI +N N S +++LG N+F+D+T E
Sbjct: 47 WKSEHGRVYHNHEEEAKRLEIFKNNSNYIRDMNANRKSP----HSHRLGLNKFADITPQE 102
Query: 76 FRASYAGNSMAITSQ----HSSFKYQNLT--QVPTSMDWREKGAVTSIKNQGGCAACWAF 129
F Y ++ Q + K + + P S DWR+KG +T +K QGGC WAF
Sbjct: 103 FSKKYLQAPKDVSQQIKMANKKMKKEQYSCDHPPASWDWRKKGVITQVKYQGGCGRGWAF 162
Query: 130 SAVAAVEGITQISSGNLIRLSEQQLLDCSSNGNSGCVAGKSDIAFKYIIKNQGIATEADY 189
SA A+E I++G+L+ LSEQ+L+DC + G G +F++++++ GIAT+ DY
Sbjct: 163 SATGAIEAAHAIATGDLVSLSEQELVDCVEE-SEGSYNGWQYQSFEWVLEHGGIATDDDY 221
Query: 190 PYHQVQGSC-GREHAAAAKISSYEVLPSGD-------EQALLKAVSMQPVSINIEGTGQD 241
PY +G C + I YE L D EQA L A+ QP+S++I+ +D
Sbjct: 222 PYRAKEGRCKANKIQDKVTIDGYETLIMSDESTESETEQAFLSAILEQPISVSID--AKD 279
Query: 242 FKNYKGGIFNGVCGTQ---LDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRD 298
F Y GGI++G T ++H V ++G+G+ DG YW+ KNSWG WGE GY+ IQR+
Sbjct: 280 FHLYTGGIYDGENCTSPYGINHFVLLVGYGSA-DGVDYWIAKNSWGFDWGEDGYIWIQRN 338
Query: 299 E----GLCGIGTQAAYP 311
G+CG+ A+YP
Sbjct: 339 TGNLLGVCGMNYFASYP 355
>sp|P25774|CATS_HUMAN Cathepsin S OS=Homo sapiens GN=CTSS PE=1 SV=3
Length = 331
Score = 208 bits (529), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 116/305 (38%), Positives = 183/305 (60%), Gaps = 15/305 (4%)
Query: 16 WMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNAE 75
W +G+ YK++ E+ +R I+++NL+++ + +N + G++ +Y LG N D+T+ E
Sbjct: 31 WKKTYGKQYKEKNEEAVRRLIWEKNLKFV--MLHNLEHSMGMH-SYDLGMNHLGDMTSEE 87
Query: 76 FRASYAGNSMAITSQ---HSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAACWAFSAV 132
+ + S+ + SQ + ++K +P S+DWREKG VT +K QG C ACWAFSAV
Sbjct: 88 VMSLMS--SLRVPSQWQRNITYKSNPNRILPDSVDWREKGCVTEVKYQGSCGACWAFSAV 145
Query: 133 AAVEGITQISSGNLIRLSEQQLLDCSSN--GNSGCVAGKSDIAFKYIIKNQGIATEADYP 190
A+E ++ +G L+ LS Q L+DCS+ GN GC G AF+YII N+GI ++A YP
Sbjct: 146 GALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYP 205
Query: 191 YHQVQGSCGREHA-AAAKISSYEVLPSGDEQALLKAVSMQ-PVSINIEGTGQDFKNYKGG 248
Y + C + AA S Y LP G E L +AV+ + PVS+ ++ F Y+ G
Sbjct: 206 YKAMDQKCQYDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDARHPSFFLYRSG 265
Query: 249 I-FNGVCGTQLDHAVTIIGFGTTEDGTKYWLIKNSWGDTWGEAGYMRIQRDEG-LCGIGT 306
+ + C ++H V ++G+G +G +YWL+KNSWG +GE GY+R+ R++G CGI +
Sbjct: 266 VYYEPSCTQNVNHGVLVVGYGDL-NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIAS 324
Query: 307 QAAYP 311
+YP
Sbjct: 325 FPSYP 329
>sp|P07154|CATL1_RAT Cathepsin L1 OS=Rattus norvegicus GN=Ctsl1 PE=1 SV=2
Length = 334
Score = 207 bits (528), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 120/308 (38%), Positives = 178/308 (57%), Gaps = 14/308 (4%)
Query: 15 KWMAEHGRSYKDELEKDMRFKIFKQNLEYIDKVNNNNNSNEGINRTYQLGTNQFSDLTNA 74
+W + H R Y E++ R ++++N+ I +++N SN T ++ N F D+TN
Sbjct: 31 QWKSTHRRLYGTN-EEEWRRAVWEKNMRMI-QLHNGEYSNGKHGFTMEM--NAFGDMTNE 86
Query: 75 EFRASYAGNSMAITSQHSSFKYQNLTQVPTSMDWREKGAVTSIKNQGGCAACWAFSAVAA 134
EFR G + F+ + Q+P ++DWREKG VT +KNQG C +CWAFSA
Sbjct: 87 EFRQIVNGYRHQKHKKGRLFQEPLMLQIPKTVDWREKGCVTPVKNQGQCGSCWAFSASGC 146
Query: 135 VEGITQISSGNLIRLSEQQLLDCSSN-GNSGCVAGKSDIAFKYIIKNQGIATEADYPYHQ 193
+EG + +G LI LSEQ L+DCS + GN GC G D AF+YI +N G+ +E YPY
Sbjct: 147 LEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYPYEA 206
Query: 194 VQGSCG-REHAAAAKISSYEVLPSGDEQALLKAV-SMQPVSINIEGTGQDFKNYKGGI-F 250
GSC R A A + + +P E+AL+KAV ++ P+S+ ++ + + Y GI +
Sbjct: 207 KDGSCKYRAEYAVANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSSGIYY 265
Query: 251 NGVCGTQ-LDHAVTIIGF---GTTEDGTKYWLIKNSWGDTWGEAGYMRIQRDE-GLCGIG 305
C ++ LDH V ++G+ GT + KYWL+KNSWG WG GY++I +D CG+
Sbjct: 266 EPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNHCGLA 325
Query: 306 TQAAYPIT 313
T A+YPI
Sbjct: 326 TAASYPIV 333
Database: swissprot
Posted date: Mar 23, 2013 2:32 AM
Number of letters in database: 191,569,459
Number of sequences in database: 539,616
Lambda K H
0.314 0.129 0.385
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 116,408,828
Number of Sequences: 539616
Number of extensions: 4887973
Number of successful extensions: 26706
Number of sequences better than 100.0: 50
Number of HSP's better than 100.0 without gapping: 231
Number of HSP's successfully gapped in prelim test: 27
Number of HSP's that attempted gapping in prelim test: 25117
Number of HSP's gapped (non-prelim): 776
length of query: 313
length of database: 191,569,459
effective HSP length: 117
effective length of query: 196
effective length of database: 128,434,387
effective search space: 25173139852
effective search space used: 25173139852
T: 11
A: 40
X1: 16 ( 7.2 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 42 (22.0 bits)
S2: 61 (28.1 bits)