BLASTP 2.2.26 [Sep-21-2011]

Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.

Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= 041120
(340 letters)

Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 204/342 (59%), Positives = 247/342 (72%), Gaps = 28/342 (8%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSE--GYPQKYDPQSMEERFENWLKQYSREYGSEDEWQR 81
M++NA L L L L IP+ A SE P P +M+ R++ WL+QY R+Y ++DE+
Sbjct: 6 MIKNAGLMLITLCTLWIPSIARSEIHSLPIDSAPTAMKVRYDKWLEQYGRKYDTKDEYLL 65
Query: 82 RFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY-- 139
RFGIY SN+Q+I+YINSQNLSFKLTDNKFADL+N+EF S YLGY + R S +
Sbjct: 66 RFGIYHSNIQFIEYINSQNLSFKLTDNKFADLTNDEFNSIYLGYQIRSYKRRNLSHMHEN 125
Query: 140 -LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
LP +VDWR+ GAVTP+KDQGQCGSCWAFSAVAAVEGINK+KTG LVSLSEQELVDCD
Sbjct: 126 STDLPDAVDWRENGAVTPIKDQGQCGSCWAFSAVAAVEGINKIKTGNLVSLSEQELVDCD 185
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
VN +N+GCNGG+MEKAF FI IGG+TTE+DYPY+G + C+ KT +HAV I GYE +P
Sbjct: 186 VNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYPYKGTDGSCEKAKTDNHAVIIGGYETVP 245
Query: 259 AR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKY 296
A Y FQLYS GVF YCG QLNHGVT+VGYG+++G+KY
Sbjct: 246 ANNENSLKVAVSKQPVSVAIDASGYEFQLYSEGVFSGYCGIQLNHGVTIVGYGDNNGQKY 305
Query: 297 WLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
WLVKNSWG WGE+GYIRM R+S S G+CGI M+ SYP+K
Sbjct: 306 WLVKNSWGKGWGESGYIRMKRDS-SDTKGMCGIAMEPSYPIK 346
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 186/343 (54%), Positives = 238/343 (69%), Gaps = 28/343 (8%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGY-PQKYDPQSMEERFENWLKQYSREYGSEDEWQR 81
M RN +L ++W +G+ A+SE + P + + ME+R+E WL Q+ R Y + DEWQR
Sbjct: 5 MFCRNVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQR 64
Query: 82 RFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKP----YNEPRWPSV 137
FGIY SNV++I+YIN+QN SF LTDN+FAD++NEE+ + Y+G N+ +
Sbjct: 65 HFGIYQSNVRFINYINAQNFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSSFKRE 124
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+ LP SVDWRK GAVTPV++QG+CGSCWAFS VAAVEGINK++TGKLVSLSEQEL+DC
Sbjct: 125 RSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDC 184
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
D++S N+GCNGGYM AF+FI + GG+TT +YPY G+ C DK +H V I+GYE +
Sbjct: 185 DIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYETV 244
Query: 258 PAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK 295
P Y FQLYS G+F+ +CG QLNH VTV+GYGED+G+K
Sbjct: 245 PPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYGEDNGKK 304
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YWLVKNSWGT WGEAGY RM R+S GICGI M+ASYP+K
Sbjct: 305 YWLVKNSWGTGWGEAGYARMIRDSRDDE-GICGIAMEASYPIK 346
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 386 bits (991), Expect = e-105, Method: Compositional matrix adjust.
Identities = 186/343 (54%), Positives = 238/343 (69%), Gaps = 28/343 (8%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGY-PQKYDPQSMEERFENWLKQYSREYGSEDEWQR 81
M RN +L ++W +G+ A+SE + P + + ME+R+E WL Q+ R Y + DEWQR
Sbjct: 1 MFCRNVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQR 60
Query: 82 RFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKP----YNEPRWPSV 137
FGIY SNV++I+YIN+QN SF LTDN+FAD++NEE+ + Y+G N+ +
Sbjct: 61 HFGIYQSNVRFINYINAQNFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSSFKRE 120
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+ LP SVDWRK GAVTPV++QG+CGSCWAFS VAAVEGINK++TGKLVSLSEQEL+DC
Sbjct: 121 RSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQELLDC 180
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
D++S N+GCNGGYM AF+FI + GG+TT +YPY G+ C DK +H V I+GYE +
Sbjct: 181 DIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISGYETV 240
Query: 258 PAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK 295
P Y FQLYS G+F+ +CG QLNH VTV+GYGED+G+K
Sbjct: 241 PPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYGEDNGKK 300
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YWLVKNSWGT WGEAGY RM R+S GICGI M+ASYP+K
Sbjct: 301 YWLVKNSWGTGWGEAGYARMIRDSRDDE-GICGIAMEASYPIK 342
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 180/344 (52%), Positives = 227/344 (65%), Gaps = 27/344 (7%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDP-QSMEERFENWLKQYSREYGSEDEW 79
M +LRN+ L+L +L + A YDP +++++RFE WLK +S+ YG DEW
Sbjct: 1 MLNVLRNSNLTLVVLICFVLIASKLCSVNSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEW 60
Query: 80 QRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKP---YNEPRWPS 136
RFGIY SNVQ IDYINS +L FKLTDN+FAD++N EF + +LG N ++ + P
Sbjct: 61 MLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRPV 120
Query: 137 VQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
G +P +VDWR +GAVTP+++QG+CG CWAFSAVAA+EGINK+KTG LVSLSEQ+L+
Sbjct: 121 CDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLI 180
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DCDV + N+GC+GG ME AFEFI GG+TTE DYPY G C +K K+ VTI GY+
Sbjct: 181 DCDVGTYNKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIEGTCDQEKAKNKVVTIQGYQ 240
Query: 256 AIPAR---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE 294
+ + FQLYS GVF YCG LNHGVTVVGYG + +
Sbjct: 241 KVAQNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTVVGYGVEGDQ 300
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
KYW+VKNSWGT WGE GYIRM R S + G CGI M ASYP++
Sbjct: 301 KYWIVKNSWGTGWGEEGYIRMER-GISEDTGKCGIAMLASYPLQ 343
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 355 bits (910), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 178/339 (52%), Positives = 232/339 (68%), Gaps = 26/339 (7%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQK-YDPQSMEERFENWLKQYSREYGSEDEWQRR 82
M LS+ +L L I A A E + + +P M++R+E WLK+Y R Y +EW+ R
Sbjct: 1 MKTTITLSIVIL-NLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVR 59
Query: 83 FGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN-EPRWPSVQYLG 141
F IY SNVQYI++ NSQN S+KL DN+FAD++NEEF STYLGY + + + ++
Sbjct: 60 FDIYQSNVQYIEFYNSQNYSYKLIDNRFADITNEEFKSTYLGYLPRFRVQTEFRYHKHGE 119
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP S+DWRK+GAVT VKDQG+CGSCWAFSAVAAVEGINK+KT LVSLSEQ+L+DCD+ S
Sbjct: 120 LPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIKS 179
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
N+GC GG M AF +I K GG+ T +YPY+G++ C K K++AVTI+GYE++PAR
Sbjct: 180 GNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGYESVPARN 239
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
YAFQ YS G+F CG LNHG+T+VGYGE++G+KYW+V
Sbjct: 240 EKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYGEENGDKYWIV 299
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
KNSW WGE+GY+RM R++ + G CGI M A+YPVK
Sbjct: 300 KNSWANDWGESGYVRMKRDTKDKD-GTCGIAMDATYPVK 337
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 353 bits (907), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 179/344 (52%), Positives = 227/344 (65%), Gaps = 27/344 (7%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDP-QSMEERFENWLKQYSREYGSEDEW 79
M +LRN+ L+L +L + A YDP +++++RFE WLK +S+ YG DEW
Sbjct: 1 MLNVLRNSNLTLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEW 60
Query: 80 QRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKP---YNEPRWPS 136
RFGIY SNVQ IDYINS +L FKLTDN+FAD++N EF + +LG N ++ + P
Sbjct: 61 MLRFGIYQSNVQLIDYINSLHLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQRPV 120
Query: 137 VQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
G +P +VDWR +GAVTP+++QG+CG CWAFSAVAA+EGINK+KTG LVSLSEQ+L+
Sbjct: 121 CDPAGNVPDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLI 180
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DCDV + N+GC+GG ME AFEFI GG+ TE DYPY G C +K+K+ VTI GY+
Sbjct: 181 DCDVGTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQ 240
Query: 256 AIPAR---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE 294
+ + FQLYS GVF YCG LNHGVTVVGYG + +
Sbjct: 241 KVAQNEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGVEGDQ 300
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
KYW+VKNSWGT WGE GYIRM R S + G CGI M ASYP++
Sbjct: 301 KYWIVKNSWGTGWGEEGYIRMER-GVSEDTGKCGIAMMASYPLQ 343
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 177/342 (51%), Positives = 228/342 (66%), Gaps = 30/342 (8%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYP----QKYDPQSMEERFENWLKQYSREYGSEDEW 79
+L + L +L + A + SE P + D ++M++RF+ W+K++ R+Y DE
Sbjct: 5 ILTTTIFILLMLCNTCVIA-SESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDER 63
Query: 80 QRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPY--NEPRWPSV 137
+ RFGIY +NVQYI N+Q S+ LTDNKFADL+NEEF STY+G + + +
Sbjct: 64 EVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMGLSTRLRSHNTGFRYD 123
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
++ LP S DWRKEGAVT + DQGQCG CWAF+AVAAVEGINK+K+GKL+SLSEQEL+DC
Sbjct: 124 EHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISLSEQELIDC 183
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
DV S NQGC GG ME A+ FI + GG+TTE DYPY G + C+ +K H+A +I+GYE +
Sbjct: 184 DVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYAASISGYEEV 243
Query: 258 PAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK 295
PA Y+FQ YS GVF CG QLNHGVTVVGYG++ K
Sbjct: 244 PADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVGYGKETINK 303
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YW+VKNSWG WGE+GYIRM R++ S G+CGI MQASYP+
Sbjct: 304 YWIVKNSWGADWGESGYIRMKRDTLSKE-GMCGIAMQASYPL 344
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 343 bits (881), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 172/306 (56%), Positives = 210/306 (68%), Gaps = 29/306 (9%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
+++R++ W+ +Y R+Y S +EW+RRF IY +NVQYID NS N S L +N FADL+NEE
Sbjct: 15 IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEE 74
Query: 118 FISTYLGYNK---PYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
F +TYLGY P R+ ++ + LP +VDWR+EGAVTP+K+QGQCGSCWAFSAVAA
Sbjct: 75 FKATYLGYKTVSIPDTCFRYGNM--VNLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAVAA 132
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGINK+K GKL+SLSEQELVDCDV S NQGCNGGYM KAFEFI + G+TTE +YPY+G
Sbjct: 133 VEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFIKRT-GLTTEIEYPYQG 191
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD 272
C K K+ V+I+GYE +P FQ YS G+F
Sbjct: 192 AESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQFYSGGIFS 251
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG+QLNHGV +VGYGE + YWLVKNSWGT WGE+GYIRM R+S G CGI M
Sbjct: 252 GNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDRQ-GTCGIAMM 310
Query: 333 ASYPVK 338
ASYP K
Sbjct: 311 ASYPTK 316
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 171/304 (56%), Positives = 209/304 (68%), Gaps = 29/304 (9%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
+++R++ W+ +Y R+Y S +EW+RRF IY +NVQYID NS N S L +N FADL+NEE
Sbjct: 15 IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAENNFADLTNEE 74
Query: 118 FISTYLGYNK---PYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
F +TYLGY P R+ ++ + LP +VDWR+EGAVTP+K+QGQCGSCWAFSAVAA
Sbjct: 75 FKATYLGYKTVSIPDTCFRYGNM--VNLPTNVDWRQEGAVTPIKNQGQCGSCWAFSAVAA 132
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGINK+K GKL+SLSEQELVDCDV S NQGCNGGYM KAFEFI + G+TTE +YPY+G
Sbjct: 133 VEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFIKRT-GLTTEIEYPYQG 191
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD 272
C K K+ V+I+GYE +P FQ YS G+F
Sbjct: 192 AESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQFYSGGIFS 251
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG+QLNHGV +VGYGE + YWLVKNSWGT WGE+GYIRM R+S G CGI M
Sbjct: 252 GNCGNQLNHGVAIVGYGETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDKQ-GTCGIAMM 310
Query: 333 ASYP 336
ASYP
Sbjct: 311 ASYP 314
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 170/310 (54%), Positives = 204/310 (65%), Gaps = 34/310 (10%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
M RFE WLKQ R Y ++EW+ RFGIY +N++YI+ NSQ S+ LTDNKFADL+NEE
Sbjct: 1 MRVRFERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQEXSYNLTDNKFADLTNEE 60
Query: 118 FISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
F+S YLG+ + P ++ LP S DWRKEGAV+ +KDQG CGSCWAFSAV
Sbjct: 61 FVSPYLGFGTRF----LPHTGFMYHEHEDLPESKDWRKEGAVSDIKDQGNCGSCWAFSAV 116
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AAVEGINK+K+GKLVSLSEQE DCDV NQGC GG M+ AF FI K GG+TT DYPY
Sbjct: 117 AAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDYPY 176
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPAR------------------------YAFQLYSH 268
G + C +K HHA I+G+ +PA +AFQLY
Sbjct: 177 EGVDGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLYLK 236
Query: 269 GVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
GVF CG QLNHGVT+VGYG+ +KYW+VKNSWG WGE+GYIRM R++ G CG
Sbjct: 237 GVFSGICGKQLNHGVTIVGYGKGTSDKYWIVKNSWGADWGESGYIRMKRDA-FDKAGTCG 295
Query: 329 ILMQASYPVK 338
I MQASYP+K
Sbjct: 296 IAMQASYPLK 305
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 330 bits (845), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 170/335 (50%), Positives = 219/335 (65%), Gaps = 26/335 (7%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGY-PQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIY 86
A+++L +L L I A A + D + M R+E+WLK+Y ++Y ++DEW+ RF IY
Sbjct: 9 AIINLLVLCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEFRFEIY 68
Query: 87 SSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYN-KPYNEPRWPSVQYLGLPAS 145
+NVQ+I+ NSQN S+KL DNKF DL+NEEF YL Y + + + R+ ++ LP
Sbjct: 69 RANVQFIEVYNSQNYSYKLMDNKFVDLTNEEFRRMYLVYQPRSHLQTRFMYQKHGDLPKR 128
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
+DWR GAVT +KDQG CGSCW+FSAVA VE INK+KTGKLVSLSEQ+L+DCD + N+G
Sbjct: 129 IDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQQLIDCDNRNGNEG 188
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----- 260
CNGG+ME F FITK GG+TT+ +YPY+G + K ++HAV I GYE +PA
Sbjct: 189 CNGGHME-TFTFITKRGGLTTDKNYPYQGSDGDXNKAKVRNHAVAICGYENLPAHNENML 247
Query: 261 -----------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSW 303
YAFQLYS G F CG LNH +T+VGYGE++GEKYWLVKNSW
Sbjct: 248 KAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYGEENGEKYWLVKNSW 307
Query: 304 GTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G +GYIRM R+ P G CG M+ASYP K
Sbjct: 308 ANDXGVSGYIRMKRD-PKDKDGTCGTAMEASYPDK 341
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 172/343 (50%), Positives = 215/343 (62%), Gaps = 32/343 (9%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEW 79
MR+ ++ + L LL+VLG AW S+ + SM ER E W+ QY R Y + E
Sbjct: 1 MRLTKQSQFICLALLFVLG----AWPSKSAARTLQDVSMYERHEQWMAQYGRVYKDDAEK 56
Query: 80 QRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ 138
+ R+ I+ NV ID NSQ S+KL N+FADLSNEEF ++ + P+ +
Sbjct: 57 ETRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKASRNRFKGHMCSPQAGPFR 116
Query: 139 Y---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
Y +PA++DWRK+GAVTPVKDQGQCG CWAFSAVAA+EGIN+L TGKL+SLSEQE+V
Sbjct: 117 YENVSAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSEQEVV 176
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DCD E+QGCNGG M+ AF+FI + G+TTE +YPY G + C T K HA ITG+E
Sbjct: 177 DCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTCNTQKEATHAAKITGFE 236
Query: 256 AIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHG 293
+PA + FQ YS G+F CG QL+HGVT VGYG G
Sbjct: 237 DVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYGISDG 296
Query: 294 EKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
KYWLVKNSWG WGE GYIRM ++ S+ G+CGI MQASYP
Sbjct: 297 TKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGIAMQASYP 338
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 320 bits (819), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 167/333 (50%), Positives = 213/333 (63%), Gaps = 30/333 (9%)
Query: 34 LLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
L++V + G W S+ + + +M ER E W+ +Y R Y E +RRF I+ +NV++
Sbjct: 9 LMFVALLVVGLWVSQAWSRSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 93 IDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNE--PRWPSVQY---LGLPASV 146
I+ N N +KL N+FADL+NEEF ++ GY + N S +Y +P S+
Sbjct: 69 IESFNKPGNRPYKLDINEFADLTNEEFKASRNGYKRSSNVGLSEKSSFRYGNVTAVPTSM 128
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGC 206
DWR++GAVTP+KDQGQCG CWAFSAVAA+EGI KL TGKL+SLSEQELVDCD + E+QGC
Sbjct: 129 DWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGC 188
Query: 207 NGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------ 260
GG M+ AFEFI + GG+TTE +YPY+G + C T+K + A ITGYE +PA
Sbjct: 189 EGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALL 248
Query: 261 ----------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWG 304
AFQ YS GVF CG +L+HGVT VGYG G KYWLVKNSWG
Sbjct: 249 KAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDGTKYWLVKNSWG 308
Query: 305 TSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
TSWGE GYIRM R+ + G+CGI MQ+SYP
Sbjct: 309 TSWGEDGYIRMERDIEAKE-GLCGIAMQSSYPT 340
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 170/344 (49%), Positives = 212/344 (61%), Gaps = 32/344 (9%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEW 79
MR + + L LL++LG AW S+ + M ER E W+ QY R Y ++E
Sbjct: 1 MRFTKQFQFVCLALLFILG----AWPSKSTARTLLDAPMYERHEQWMTQYGRVYKDDNER 56
Query: 80 QRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ 138
R+ I+ NV ID NSQ S+KL N+FADL+NEEF ++ + P+ +
Sbjct: 57 ATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKASRNRFKGHMCSPQAGPFR 116
Query: 139 Y---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
Y +P++VDWRKEGAVTPVKDQGQCG CWAFSAVAA+EGINKL TGKL+SLSEQE+V
Sbjct: 117 YENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQEVV 176
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DCD E+QGCNGG M+ AF+FI + G+TTE +YPY+G + C T+K HA ITG+E
Sbjct: 177 DCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTCNTNKAAIHAAKITGFE 236
Query: 256 AIPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHG 293
+PA FQ YS G+F C QL+HGVT VGYG G
Sbjct: 237 DVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVSDG 296
Query: 294 EKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KYWLVKNSWG WGE GYIRM ++ S+ G+CGI MQASYP
Sbjct: 297 SKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGIAMQASYPT 339
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 161/308 (52%), Positives = 199/308 (64%), Gaps = 29/308 (9%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+SM ER E W+ Q+ R Y + E RF I+ +NV+ I+ N++N FKL N+FADL+N
Sbjct: 35 KSMLERHEQWMAQHGRVYKNAAEKAHRFEIFRANVERIESFNAENHKFKLGVNQFADLTN 94
Query: 116 EEFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
EEF + KP S +Y +PA++DWR +GAVTP+KDQGQCGSCWAFSAV
Sbjct: 95 EEFKTR--NTLKPSKMASTKSFKYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAV 152
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AA EGI KL TGKL+SLSEQE+VDCDV S++QGCNGG M+ AFE+I K G+TTE +YPY
Sbjct: 153 AATEGITKLSTGKLISLSEQEVVDCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPY 212
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
+ + C T K HA +ITGYE + +AFQ+YS GV
Sbjct: 213 KAADGTCNTKKAASHAASITGYEDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGV 272
Query: 271 FDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
F CG L+HGVT+VGYG G KYWLVKNSWGTSWGE GYIRM R+ + G+CGI
Sbjct: 273 FTGDCGTDLDHGVTLVGYGATSDGTKYWLVKNSWGTSWGEDGYIRMERDVDAKE-GLCGI 331
Query: 330 LMQASYPV 337
M ASYP
Sbjct: 332 AMDASYPT 339
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 165/334 (49%), Positives = 209/334 (62%), Gaps = 31/334 (9%)
Query: 34 LLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
L++V + G W S+ + + +M ER E W+ +Y R Y E +RRF I+ +NV++
Sbjct: 9 LMFVALLVVGLWASQAWSRSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 93 IDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKP-----YNEPRWPSVQYLGLPASV 146
I+ N N +KL N+FADL+NEEF + GY + + + +P S+
Sbjct: 69 IESFNKLGNRPYKLDINEFADLTNEEFKVSKNGYKRSSGVGLTEKSSFRYANVTAVPTSM 128
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGC 206
DWR+ GAVTP+KDQGQCG CWAFSAVAA+EGI KL TGKL+SLSEQELVDCD + E+QGC
Sbjct: 129 DWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGC 188
Query: 207 NGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------ 260
GG M+ AFEFI + GG+TTE +YPY+G + C T+K + A ITGYE +PA
Sbjct: 189 EGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSEDALL 248
Query: 261 ----------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSW 303
AFQ YS GVF CG +L+HGVT VGYG D G KYWLVKNSW
Sbjct: 249 KAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDDGTKYWLVKNSW 308
Query: 304 GTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GTSWGE GYIRM R+ + G+CGI MQ SYP
Sbjct: 309 GTSWGEDGYIRMERDIEAKE-GLCGIAMQPSYPT 341
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 312 bits (799), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 165/336 (49%), Positives = 209/336 (62%), Gaps = 33/336 (9%)
Query: 30 LSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+ L LL+VL AW S+ + SM ER E+W+ QY R Y DE +R+ I+
Sbjct: 10 ICLALLFVLA----AWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKD 65
Query: 89 NVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPA 144
NV I+ N + + S+KL+ N+FADL+NEEF ++ + S +Y +P+
Sbjct: 66 NVARIESFNKAMDKSYKLSINEFADLTNEEFGTSRNRFKAHICSTEATSFKYENVTAVPS 125
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
++DWRK+GAVTP+KDQGQCGSCWAFSAVAA+EGI +L TGKL+SLSEQELVDCD + E+Q
Sbjct: 126 TIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---- 260
GCNGG M+ AF+FI + G+TTE +YPY G + C K H A I GYE +PA
Sbjct: 186 GCNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 245
Query: 261 ------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
+ FQ YS GVF CG +L+HGV VGYG D G KYWLVKN
Sbjct: 246 LQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWGT WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 306 SWGTGWGEEGYIRMQRDVTAKE-GLCGIAMQASYPT 340
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 160/306 (52%), Positives = 196/306 (64%), Gaps = 27/306 (8%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNE 116
M ER E W+ QY R Y ++E R+ I+ NV ID NSQ S+KL N+FADL+NE
Sbjct: 1 MYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNE 60
Query: 117 EFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
EF ++ + P+ +Y +P++VDWRKEGAVTPVKDQGQCG CWAFSAVA
Sbjct: 61 EFKASRNRFKGHMCSPQAGPFRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVA 120
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
A+EGINKL TGKL+SLSEQE+VDCD E+QGCNGG M+ AF+FI + G+TTE +YPY+
Sbjct: 121 AMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYK 180
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVF 271
G + C T K+ HA ITG+E +PA FQ YS G+F
Sbjct: 181 GTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSGIF 240
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
C QL+HGVT VGYG G KYWLVKNSWG WGE GYIRM ++ S+ G+CGI M
Sbjct: 241 TGSCDTQLDHGVTAVGYGVSDGSKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGIAM 299
Query: 332 QASYPV 337
QASYP
Sbjct: 300 QASYPT 305
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 166/336 (49%), Positives = 210/336 (62%), Gaps = 33/336 (9%)
Query: 30 LSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+ L LL+VL AW S+ + SM ER E+W+ QY REY DE +R+ I+
Sbjct: 10 ICLALLFVLA----AWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKD 65
Query: 89 NVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPA 144
NV I+ N + + S+KL+ N+FADL+NEEF ++ + S +Y +P+
Sbjct: 66 NVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEATSFKYENVTAVPS 125
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+VDWRK+GAVTP+KDQGQCGSCWAFSAVAA+EGI +L TGKL+SLSEQELVDCD + E+Q
Sbjct: 126 TVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---- 260
GC+GG M+ AF+FI + G+TTE +YPY G + C K H A I GYE +PA
Sbjct: 186 GCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 245
Query: 261 ------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
FQ YS GVF CG +L+HGV+ VGYG D G KYWLVKN
Sbjct: 246 LQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKN 305
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWGT WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 306 SWGTGWGEEGYIRMQRDVTAKE-GLCGIAMQASYPT 340
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 159/345 (46%), Positives = 211/345 (61%), Gaps = 32/345 (9%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M + + +L + L +VL + A + ++ M R E W+ ++ + Y + E
Sbjct: 1 MAFLCKGKILPIALFFVLAMCA---DQAASRELHELEMTGRHEKWMAKHGKVYKDDKEKL 57
Query: 81 RRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR----WP 135
RRF I+ SNV +I+ N+ N S+ L NKFADL+NEEF + + GY +P R +
Sbjct: 58 RRFQIFKSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAFWNGYKRPLGASRKITPFK 117
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
LP+S+DWR +GAVTP+KDQG CGSCWAFSAVAA EGI+KL+TGKLVSLSEQELV
Sbjct: 118 YENVTALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELV 177
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DCDV +++GC GG M AF+FI + GG+T+E +YPY+G++ +C T K AV ITGY+
Sbjct: 178 DCDVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQ 237
Query: 256 AIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DH 292
A+P +FQ Y G+F CG +NHGV VGYG +
Sbjct: 238 AVPKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRSNS 297
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G KYW+VKNSWGT WGE GYIRM R+ S G+CGI M+ SYP
Sbjct: 298 GSKYWIVKNSWGTEWGEKGYIRMKRDVRSKE-GLCGIAMECSYPT 341
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 310 bits (793), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 165/336 (49%), Positives = 208/336 (61%), Gaps = 33/336 (9%)
Query: 30 LSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+ L LL+VL AW S+ + SM ER E+W+ QY REY DE +R+ I+
Sbjct: 10 ICLALLFVLA----AWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKD 65
Query: 89 NVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPA 144
NV I+ N + + S+KL+ N+FADL+NEEF ++ + S +Y +P+
Sbjct: 66 NVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEATSFKYENVTAVPS 125
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+VDWRK+GAVTP+KDQGQCGSCWAFSAVAA+EGI +L TGKL+SLSEQELVDCD + E+Q
Sbjct: 126 TVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---- 260
GC+GG M+ AF+FI + G+TTE +YPY G + C K H A I GYE +PA
Sbjct: 186 GCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 245
Query: 261 ------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
FQ YS GVF CG +L+HGV VGYG D G KYWLVKN
Sbjct: 246 LQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SW T WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 306 SWSTGWGEEGYIRMQRDVTAKE-GLCGIAMQASYPT 340
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 167/347 (48%), Positives = 210/347 (60%), Gaps = 36/347 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWS-EGYPQKYDPQSMEERFENWLKQYSREYGSEDEW 79
M + R F+L +LG+ W+ E ++ SM R E W++ + + Y E
Sbjct: 1 MVSICRRQCFFAFIL-ILGM----WAYEVASRELQEPSMSARHEQWMETFGKVYADAAEK 55
Query: 80 QRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPS 136
+RRF I+ NV+YI+ N+ N +KL+ NKFADL+NEE GY +P + S
Sbjct: 56 ERRFEIFKDNVEYIESFNTAGNKPYKLSVNKFADLTNEELKVARNGYRRPLQTRPMKVTS 115
Query: 137 VQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
+Y +PA++DWRK+GAVTP+KDQGQCGSCWAFS VAA EGIN+L TGKLVSLSEQE
Sbjct: 116 FKYENVTAVPATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQE 175
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
LVDCD E+QGC GG ME FEFI K G+TTE +YPY+ + C + K ITG
Sbjct: 176 LVDCDTQGEDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKEASRIAKITG 235
Query: 254 YEAIPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGE- 290
YE++PA FQ YS GVF CG +L+HGVT VGYGE
Sbjct: 236 YESVPANSEAALLKAVASQPISVSIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGET 295
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G KYWLVKNSWGTSWGE GYIRM R++ + G+CGI M +SYP
Sbjct: 296 SDGTKYWLVKNSWGTSWGEEGYIRMQRDTEAEE-GLCGIAMDSSYPT 341
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 164/336 (48%), Positives = 207/336 (61%), Gaps = 33/336 (9%)
Query: 30 LSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+ L LL+ L AW S+ + SM ER E+W+ QY R Y DE +R+ I+
Sbjct: 10 ICLALLFFLA----AWASQATARNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKD 65
Query: 89 NVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPA 144
NV I+ N + + S+KL+ N+FADL+NEEF ++ + S +Y +P+
Sbjct: 66 NVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEATSFKYEHVAAVPS 125
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+VDWRK+GAVTP+KDQGQCGSCWAFSAVAA+EGI +L TGKL+SLSEQELVDCD + E+Q
Sbjct: 126 TVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---- 260
GCNGG M+ AF+FI + G+ TE +YPY G + C K H A I GYE +PA
Sbjct: 186 GCNGGLMDDAFKFIEQNHGLATEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 245
Query: 261 ------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
+ FQ YS GVF CG +L+HGV VGYG D G KYWLVKN
Sbjct: 246 LQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWGT WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 306 SWGTGWGEVGYIRMQRDVTAKE-GLCGIAMQASYPT 340
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 165/336 (49%), Positives = 207/336 (61%), Gaps = 33/336 (9%)
Query: 30 LSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+ L LL+VL AW S+ + SM ER E+W+ QY REY DE +R+ I+
Sbjct: 10 ICLALLFVLA----AWASQATARXLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKD 65
Query: 89 NVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPA 144
NV I+ N + + S+KL+ N+FADL+NEEF ++ + S +Y +P+
Sbjct: 66 NVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEATSFKYENVTAVPS 125
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+VDWRK+GAVTP+KDQGQCGSCWAFSAVAA+EGI +L TGKL+SLSEQELVDCD + E+Q
Sbjct: 126 TVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---- 260
GC+GG M+ AF+FI + G+TTE +YPY G + C K H A I GYE +PA
Sbjct: 186 GCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 245
Query: 261 ------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
FQ YS GVF CG +L+HGV VGYG D G KYWLVKN
Sbjct: 246 LQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 305
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SW T WGE GYIRM R+ G+CGI MQASYP
Sbjct: 306 SWSTGWGEEGYIRMQRDVTVKE-GLCGIAMQASYPT 340
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 165/336 (49%), Positives = 207/336 (61%), Gaps = 33/336 (9%)
Query: 30 LSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+ L LL+VL AW S + SM ER E+W+ QY R Y E +R+ I+
Sbjct: 10 ICLALLFVLA----AWASHAKARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYKIFKD 65
Query: 89 NVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPA 144
NV I+ N + N S+KL+ N+FADL+NEEF ++ + S +Y +P+
Sbjct: 66 NVARIESFNKAMNKSYKLSINEFADLTNEEFRASRNRFKAHICSTEATSFKYEHVXAVPS 125
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+VDWRK+GAVTP+KDQGQCGSCWAFSAVAA+EGI +L TGKL+SLSEQELVDCD + E+Q
Sbjct: 126 TVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---- 260
GC+GG M+ AF+FI + G+TTE +YPY G + C K H A I GYE +PA
Sbjct: 186 GCSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 245
Query: 261 ------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
+ FQ YS GVF CG +L+HGV+ VGYG D G KYWLVKN
Sbjct: 246 LQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKN 305
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWGT WGE GYIRM R+ G+CGI MQASYP
Sbjct: 306 SWGTGWGEEGYIRMQRDVTEKE-GLCGIAMQASYPT 340
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 162/339 (47%), Positives = 204/339 (60%), Gaps = 38/339 (11%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYS 87
A + + +W + + E Y M R E W+ Y + Y E +RRF I+
Sbjct: 12 AFILILGMWAFEVASRELQESY--------MSARHEQWMATYGKVYVDAAEKERRFKIFK 63
Query: 88 SNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQY---LG 141
+NV+YI+ N+ N +KL+ NKFAD +NE+F GY +P+ + S +Y
Sbjct: 64 NNVEYIESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQTRPMKVTSFKYENVTA 123
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
+PA++DWRK+GAVT +KDQGQCGSCWAFS VAA EGIN+L TGKLVSLSEQELVDCD+
Sbjct: 124 VPATMDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQG 183
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY 261
E+QGC GG ME FEFI K G+TTE +YPY+ + C + K H ITGYE++PA
Sbjct: 184 EDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANS 243
Query: 262 A----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWL 298
FQ YS GVF CG +L+HGVT VGYGE G KYWL
Sbjct: 244 EAELLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWL 303
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
VKNSWGTSWGE GYIRM R+ + G+CGI M +SYP
Sbjct: 304 VKNSWGTSWGEEGYIRMQRDIDTEE-GLCGIAMDSSYPT 341
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 162/346 (46%), Positives = 215/346 (62%), Gaps = 35/346 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M + +++ L LL+ +G+ A S + + SM E + W+ +Y R Y + +E
Sbjct: 1 MALTIKHQCTPLALLFTIGVLA---SLAAARSLNEASMTETHDQWMARYGRVYKTANEKN 57
Query: 81 RRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPY-----NEPRW 134
RR I+ N++YI N + N +KL N+FADL+NEEF ++ + N R+
Sbjct: 58 RRSTIFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTTSRNKFKSHVCATVTNVFRY 117
Query: 135 PSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+V +PA++DWRK+GAVTP+K+QGQCG CWAFSAVAA+EGI +LKTGKL+SLSEQEL
Sbjct: 118 ENV--TAVPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSEQEL 175
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDCD N E+QGC GG M+ AF+FI + G++TE +YPY G + C +K +HA TITG+
Sbjct: 176 VDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATITGH 235
Query: 255 EAIPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-ED 291
E +PA FQ YS GVF CG +L+HGVT VGYG
Sbjct: 236 EDVPANSESALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYGTAA 295
Query: 292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G KYWLVKNSWGTSWGE GYI+M R ++ G+CGI MQASYP
Sbjct: 296 DGTKYWLVKNSWGTSWGEEGYIQMQRGVAAAE-GLCGIAMQASYPT 340
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 158/343 (46%), Positives = 208/343 (60%), Gaps = 30/343 (8%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M RN +SL L+++LG S+ + SM E+ E W+ ++ R Y +E +
Sbjct: 1 MAFTTRNGCISLALIFLLGALV---SQAMARTLQDASMHEKHEEWMSRFGRVYNDGNEKE 57
Query: 81 RRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY 139
R+ I+ NVQ I+ N + S+KL N+FADL+NEEF ++ + + +Y
Sbjct: 58 IRYKIFKENVQRIESFNKASGKSYKLGINQFADLTNEEFKTSRNRFKGHMCSSQAGPFRY 117
Query: 140 LGL---PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
L P+S+DWRK+GAVT +KDQGQCGSCWAFSAVAAVEGI +L T KL+SLSEQELVD
Sbjct: 118 ENLTAAPSSMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVD 177
Query: 197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
CD E+QGC GG M+ AF+FI + G+TTE +YPY G + C T + +HA I G+E
Sbjct: 178 CDTKGEDQGCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFED 237
Query: 257 IPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE 294
+PA + FQ YS G+F CG +L+HGV VGYGE +G
Sbjct: 238 VPANNEGALMKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYGESNGM 297
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YWLVKNSWGT WGE GYIRM ++ + G+CGI MQASYP
Sbjct: 298 NYWLVKNSWGTQWGEEGYIRMQKDIDAKE-GLCGIAMQASYPT 339
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 165/348 (47%), Positives = 212/348 (60%), Gaps = 36/348 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M ++L N ++ + +L V + +WS + SME R + W+ QY R Y E +
Sbjct: 1 MALLLHNKLVLMAMLLVTLWASQSWSRSLHEA----SMELRHKTWMTQYGRVYKGNVEKE 56
Query: 81 RRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP----RWP 135
+RF I+ NV++I+ N+ N +KL N F DL+NEEF +++ GY + R
Sbjct: 57 KRFKIFKENVEFIESFNNNGNKPYKLGINAFTDLTNEEFRASHNGYTMSMSSHQSSYRTK 116
Query: 136 SVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
S +Y +P S+DWR +GAVT +KDQGQCG CWAFSAVAA+EGI KL TG L+SLSEQ
Sbjct: 117 SFRYENVTAVPPSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQ 176
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
ELVDCD + +QGC GG M+ AFEFI + G+TTE +YPY G + C T K +HA IT
Sbjct: 177 ELVDCDTSGMDQGCEGGLMDDAFEFIIENNGLTTEANYPYEGVDGSCNTRKAANHAAKIT 236
Query: 253 GYEAIPA----------------------RYAFQLYSHGVFDEYCGHQLNHGVTVVGYG- 289
GYE +PA AFQ YS G+F CG +L+HGVTVVGYG
Sbjct: 237 GYENVPAYDEEALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGTELDHGVTVVGYGT 296
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
D G KYWLVKNSWGTSWGE GYIRM R+ + G+CGI M+ SYP
Sbjct: 297 SDDGTKYWLVKNSWGTSWGEDGYIRMERDIDAKE-GLCGIAMEPSYPT 343
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 157/343 (45%), Positives = 208/343 (60%), Gaps = 30/343 (8%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M +R+ +SL L++ LG A S+ + S+ E+ E W+ ++ R Y E +
Sbjct: 1 MAFTIRHGCISLALIFFLGALA---SQAIARTLQDASIHEKHEEWMTRFKRVYSDAKEKE 57
Query: 81 RRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY 139
R+ I+ NVQ I+ N + S+KL N+FADL+NEEF ++ + + +Y
Sbjct: 58 IRYKIFKENVQRIESFNKASEKSYKLGINQFADLTNEEFKTSRNRFKGHMCSSQAGPFRY 117
Query: 140 ---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
+P+S+DWRKEGAVT +KDQGQCGSCWAFSAVAAVEGI +L T KL+SLSEQELVD
Sbjct: 118 ENITAVPSSMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVD 177
Query: 197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
CD E+QGC GG M+ AF+FI + G+TTE +YPY G + C T + +HA I G+E
Sbjct: 178 CDTKGEDQGCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFED 237
Query: 257 IPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE 294
+PA + FQ YS G+F CG +L+HGV VGYGE +G
Sbjct: 238 VPANNEGALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYGESNGM 297
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YWLVKNSWGT WGE GYIRM ++ + G+CGI MQASYP
Sbjct: 298 NYWLVKNSWGTQWGEEGYIRMQKDIDAKE-GLCGIAMQASYPT 339
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 162/339 (47%), Positives = 203/339 (59%), Gaps = 38/339 (11%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYS 87
A + + +W + + E Y M R E W+ Y + Y E +RRF I+
Sbjct: 12 AFILILGMWAFEVASRELQESY--------MSARHEQWMATYGKVYVDAAEKERRFKIFK 63
Query: 88 SNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQY---LG 141
+NV+YI+ N+ N +KL+ NKFAD +NE+F GY +P+ + S +Y
Sbjct: 64 NNVEYIESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQTRPMKVTSFKYENVTA 123
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
+PA++DWRK+GAVTP+KDQGQCGSCWAFS VAA EGIN+L TGKLVSLSEQELVDCD
Sbjct: 124 VPATMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQG 183
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY 261
E+QGC GG ME FEFI K G+TTE +YPY+ + C + K H ITGYE++PA
Sbjct: 184 EDQGCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANS 243
Query: 262 A----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWL 298
FQ YS GVF CG +L+HGVT VGYGE G KYWL
Sbjct: 244 EAELLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYGETSDGTKYWL 303
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
VKNSW TSWGE GYIRM R+ + G+CGI M +SYP
Sbjct: 304 VKNSWXTSWGEEGYIRMQRDIDAEE-GLCGIAMDSSYPT 341
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 168/366 (45%), Positives = 216/366 (59%), Gaps = 48/366 (13%)
Query: 5 LFIAIYTNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFEN 64
LF ++ + L L A DM ++ + L P + +E+
Sbjct: 18 LFFSLASFLMLSSASDMSIITYDETHGL---------------NSPPLRTHDQLLSLYES 62
Query: 65 WLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYL 123
WL ++ + Y + E + RFGI+ NV ++D NS +N S+KL NKFADL+N+E+ S YL
Sbjct: 63 WLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRSLYL 122
Query: 124 G----YNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
+ NE + S +++ LP SVDWR GAV PVKDQGQCGSCWAFS V A
Sbjct: 123 SGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCWAFSTVGA 182
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGINK+ TG+L+SLSEQELVDCD N NQGCNGG M+ AFEFI K GG+ TEDDYPY+G
Sbjct: 183 VEGINKIVTGELISLSEQELVDCD-NGYNQGCNGGLMDYAFEFIVKNGGIDTEDDYPYKG 241
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD 272
+ C ++ VTI GYE +P AFQLY GVF
Sbjct: 242 VDGLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGVFT 301
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG +L+HGV VGYG ++G+ YW+V+NSWG WGE+GYIR+ RN S++ G CGI MQ
Sbjct: 302 GQCGTELDHGVVAVGYGSENGKDYWIVRNSWGPDWGESGYIRLERNVASTSTGKCGIAMQ 361
Query: 333 ASYPVK 338
ASYP K
Sbjct: 362 ASYPTK 367
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 303 bits (776), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 165/345 (47%), Positives = 215/345 (62%), Gaps = 36/345 (10%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
M L + ++ + LL ++G+ A S+ + SM ER E+W+ Y R Y E +RR
Sbjct: 1 MALESKIICITLL-IMGVWA---SQALSRTLHEVSMSERHEDWMGLYGRTYKDIAEKERR 56
Query: 83 FGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL- 140
F I+ NV+YI+ +NS N +KL+ N+FAD +NEEF ++ GYN + PR +
Sbjct: 57 FKIFKENVEYIESVNSAGNRRYKLSINEFADQTNEEFKASRNGYNMS-SRPRSSEITSFR 115
Query: 141 -----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
+P+S+DWRK+GAVTP+KDQGQCG CWAFSAVAA+EG+ +LKTG+L+SLSEQELV
Sbjct: 116 YENVAAVPSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELV 175
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DCD + E+QGC GG M+ AFEFI GG+TTE +YPY+G + C K A I YE
Sbjct: 176 DCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYE 235
Query: 256 AIPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGE-DH 292
+PA FQ YS GVF CG +L+HGVT VGYG+ D
Sbjct: 236 DVPANSEAALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDD 295
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G KYWLVKNSWGT WGE GYI M R+ ++ G+CGI M+ASYP
Sbjct: 296 GTKYWLVKNSWGTGWGEDGYIWMERD-IGADEGLCGIAMEASYPT 339
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 303 bits (775), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 154/345 (44%), Positives = 212/345 (61%), Gaps = 32/345 (9%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M ++ + L + L +VL + A + ++ +M ER E W+ ++ + Y ++E
Sbjct: 1 MALLCKGQFLLIALFFVLAMWA---DQASTRELHESTMVERHEKWMAKHGKVYKDDEEKL 57
Query: 81 RRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR----WP 135
RRF I+ +NV++I+ N+ N S+ L N+FADL+NEEF +++ GY +P + R +
Sbjct: 58 RRFQIFKNNVEFIESSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDASRIVTPFK 117
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
LP S+DWR++GAVT +KDQ +CGSCWAFSAVAA EG++KL+TGKLVSLSEQELV
Sbjct: 118 YENVTALPYSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQELV 177
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DCDV E++GC GG ME AF+FI + GG+TTE +Y YRG++ +C T K H ITGY+
Sbjct: 178 DCDVKGEDKGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKITGYQ 237
Query: 256 AIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDH 292
+P +FQ Y G++ CG LNHGV VGYG
Sbjct: 238 VVPENSEAALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTSSS 297
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G KYW+VKNSWG WGE GY+RM R+ +S G+CGI M SYP
Sbjct: 298 GSKYWIVKNSWGPEWGERGYVRMKRD-ITSRKGLCGIAMDCSYPT 341
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 167/351 (47%), Positives = 209/351 (59%), Gaps = 43/351 (12%)
Query: 29 VLSLFLLWVLGIPAGAWSE------GYPQK--YDPQSMEERFENWLKQYSREYGSEDEWQ 80
+L LF + L AG+ S GY K + ++ E +E WL Q+ + Y E Q
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGEKQ 62
Query: 81 RRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGY---------NKPYN 130
RF ++ N YI N+Q N S+KL N+FADLS+EEF +TYLG N P
Sbjct: 63 NRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLGAKLDTKKRLSNSP-- 120
Query: 131 EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
PR+ LP S+DWR++GAVT VKDQG CGSCWAFS VAAVEGIN++ TG L SLS
Sbjct: 121 SPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQELVDCD S NQGCNGG M+ AF+FI GG+ +EDDYPY+ + C + H VT
Sbjct: 181 EQELVDCDT-SYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYRKNAHVVT 239
Query: 251 ITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
I YE +P + AFQ Y GVF CG QL+HGVT+VGY
Sbjct: 240 IDDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVTLVGY 299
Query: 289 GEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
G + G YW+VKNSWG SWGE G+IR+ RN + G+CGI M+ASYP+K+
Sbjct: 300 GSESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGIAMEASYPLKK 350
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 167/349 (47%), Positives = 211/349 (60%), Gaps = 39/349 (11%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQ--KYDPQ------SMEERFENWLKQYSREYGSEDEWQ 80
+L LF + L AG+ S YD Q ++ E +E WL Q+ + Y DE Q
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDEKQ 62
Query: 81 RRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLG--YNKPYNEPRWPSV 137
++F ++ N YI N+Q N S+KL N+FADLS+EEF + YLG + R PS
Sbjct: 63 KKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKAAYLGTKLDAKKRLSRSPSP 122
Query: 138 QYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
+Y LP S+DWR++GAVT VK+QG CGSCWAFS VAAVEGIN++ TG L SLSEQ
Sbjct: 123 RYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQ 182
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
ELVDCD S NQGCNGG M+ AF+FI GG+ +EDDYPY+ N C + H VTI
Sbjct: 183 ELVDCDT-SYNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNGSCDAYRKNAHVVTID 241
Query: 253 GYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
YE +P + AFQ Y GVF CG QL+HGVT+VGYG
Sbjct: 242 DYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVTLVGYGS 301
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
+ G YWLVKNSWG SWGE G+I++ RN ++ G+CGI M+ASYPVK+
Sbjct: 302 ESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPVKK 350
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 163/341 (47%), Positives = 200/341 (58%), Gaps = 32/341 (9%)
Query: 27 NAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEWQRRFGI 85
N F++ L I GAW+ + P+ SM ER E W+ QY R Y E E RF I
Sbjct: 22 NMAFKHFMIAAL-ILLGAWACQATSRTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQI 80
Query: 86 YSSNVQYIDYINSQNL-SFKLTDNKFADLSNEEFISTYLGY-----NKPYNEPRWPSVQY 139
+ NV++I+ N S+KL N+FAD +NEEF ++ GY ++P +
Sbjct: 81 FMDNVKFIEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKMAVSSRPSQTTLFRYENV 140
Query: 140 LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDV 199
+P+S+DWRK+GAVTPVKDQGQCGSCWAFS +AA EGI KLKTGKL+SLSEQELVDCD
Sbjct: 141 TAVPSSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDK 200
Query: 200 NSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA 259
E+QGC GGYME FEFI K G+ E YPY + C + + A I+GYE +PA
Sbjct: 201 TGEDQGCEGGYMEDGFEFIVKNKGIALEASYPYTAADGTCNSKEEASRAAKISGYEKVPA 260
Query: 260 R----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKY 296
AFQ YS GVF CG L+HGVT VGYG+ G KY
Sbjct: 261 NSETALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYGKTSDGTKY 320
Query: 297 WLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
WLVKNSWG SWG++GYI M R + G+CGI M ASYP
Sbjct: 321 WLVKNSWGASWGDSGYIMMQRGVAAKG-GLCGIAMDASYPT 360
>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 164/333 (49%), Positives = 200/333 (60%), Gaps = 53/333 (15%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
M+ RF+ WLK Y ++EW+ RF IY +NV+YI SQ S+ LTDNKFADL+NEE
Sbjct: 1 MKVRFDRWLKXNGXNYEDKEEWEIRFVIYQANVEYIGCKKSQKNSYNLTDNKFADLTNEE 60
Query: 118 FISTYLGY-NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG------------ 164
F+STYLG+ + R+ ++ LP S DWRKEGAVT +KDQG CG
Sbjct: 61 FVSTYLGFATRLIPHTRFKYHEHGNLPXSKDWRKEGAVTDIKDQGNCGKHSTWFSPEISH 120
Query: 165 -----------------SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCN 207
S WAFS VAAVE INK+K+GKLVSLSEQELVD DV ++NQGC
Sbjct: 121 NLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYDVANKNQGCE 180
Query: 208 GGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------- 260
GG M+ F FI K GG+TT DYPY G + C +K HHAV I+GYE P++
Sbjct: 181 GGLMDTTFAFIKKNGGLTTSKDYPYEGVDGSCNKEKALHHAVNISGYERAPSKDEAMLKV 240
Query: 261 ---------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGT 305
YAFQLYS GVF CG +LNHGVT+VGY + +KY VKNS G
Sbjct: 241 AAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGTFDKYRTVKNSXGA 300
Query: 306 SWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
WGE+GYIRM R++ G CGI M+ASYP+K
Sbjct: 301 DWGESGYIRMKRDA-FDKAGTCGIAMKASYPLK 332
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 300 bits (768), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 158/359 (44%), Positives = 215/359 (59%), Gaps = 41/359 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSE----GYPQKYDPQS-------MEERFENWLKQY 69
M + ++ +++FL +LG+ + + + GY + + +S + +E WL ++
Sbjct: 1 MGLCRSSSSMAVFLFLLLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKH 60
Query: 70 SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPY 129
+ Y + E +RRF I+ N+++ID N++N ++K+ N+FADL+NEE+ S YLG
Sbjct: 61 GKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAA 120
Query: 130 NE-------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLK 182
R+ LP SVDWRK+GAV VKDQG CGSCWAFS +AAVEGINK+
Sbjct: 121 KRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIV 180
Query: 183 TGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTD 242
TG L+SLSEQELVDCD S N+GCNGG M+ AFEFI GG+ +E+DYPY+ + RC
Sbjct: 181 TGGLISLSEQELVDCDT-SYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQY 239
Query: 243 KTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLN 280
+ VTI GYE +P FQLY G+F CG L+
Sbjct: 240 RKNAXVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALD 299
Query: 281 HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
HGVT VGYG ++G YW+VKNSWG SWGE GYIRM R+ +S G CGI M+ASYP+K+
Sbjct: 300 HGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKK 358
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 300 bits (768), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 168/367 (45%), Positives = 217/367 (59%), Gaps = 53/367 (14%)
Query: 6 FIAIYTNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENW 65
F+A + L + +AIDM ++ N P++ + +++ +E W
Sbjct: 10 FLATFYFLSVCLAIDMSIIDYNLKHGQV----------------PERTEAETLR-LYEMW 52
Query: 66 LKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLG 124
L +Y + Y + E +RRF I+ N++++D NS N S+KL NKFADLSNEE+ + YLG
Sbjct: 53 LVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRAAYLG 112
Query: 125 YN-----KPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
+ P+ S +YL LP SVDWR++GAV PVKDQGQCGSCWAFS V A
Sbjct: 113 TRMDGKRRLLGGPK--SARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGSCWAFSTVGA 170
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGIN++ TG L SLSEQELVDCD NQGCNGG M+ AFEFI K GG+ TE+DYPY+
Sbjct: 171 VEGINQIVTGNLTSLSEQELVDCD-KVYNQGCNGGLMDYAFEFIMKNGGIDTEEDYPYKA 229
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD 272
+ C ++ VTI GYE +P AFQLY GVF
Sbjct: 230 VDSMCDPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQLYQSGVFT 289
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG QL+HGV VGYG ++G YW+V+NSWG +WGE GYIRM RN S+ G CGI M+
Sbjct: 290 GSCGTQLDHGVVAVGYGTENGVDYWVVRNSWGPAWGENGYIRMERNVASTETGKCGIAME 349
Query: 333 ASYPVKR 339
ASYP K+
Sbjct: 350 ASYPTKK 356
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 300 bits (768), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 158/351 (45%), Positives = 215/351 (61%), Gaps = 35/351 (9%)
Query: 22 RMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEE---RFENWLKQYSREYGSEDE 78
++ + L+ L L + ++ + +P K P++ ++ +E WL ++ + Y + E
Sbjct: 4 KLTILFITLTFTLSLALDMCIISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKNYNALGE 63
Query: 79 WQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGY-------NKPYNE 131
++RF I+ N+ +ID NS+NLSF+L N+FADL+NEE+ + +LG N+ N
Sbjct: 64 KEKRFEIFKDNLGFIDEHNSKNLSFRLGLNRFADLTNEEYRTRFLGTRINPNRRNRKVNS 123
Query: 132 P--RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
R+ + LP SVDWRKEGAV VKDQG CGSCWAFSA+AAVEG+NKL TG L+SL
Sbjct: 124 QTNRYATRVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGVNKLATGDLISL 183
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQELVDCD S N+GCNGG M+ AFEFI + +T E+DYPYR + RC ++ V
Sbjct: 184 SEQELVDCDT-SYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRCDQNRKNAKVV 242
Query: 250 TITGYEAIPA----------------------RYAFQLYSHGVFDEYCGHQLNHGVTVVG 287
+I YE +PA FQLY GVF CG L+HGV VG
Sbjct: 243 SIDQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGTALDHGVAAVG 302
Query: 288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YG ++G+ YW+V+NSWG SWGEAGYIR+ RN +S G CGI ++ SYP+K
Sbjct: 303 YGTENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPSYPIK 353
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 300 bits (767), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 155/304 (50%), Positives = 197/304 (64%), Gaps = 30/304 (9%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E W+ ++ + S E RRF I+ N+++ID N +NLS++L KFADL+N+E+ S
Sbjct: 42 YEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSM 101
Query: 122 YLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
YLG ++ + S++Y +P SVDWRKEGAV VKDQG CGSCWAFS + AVE
Sbjct: 102 YLG-SRLKRKATKTSLRYEARVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVE 160
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
GINK+ TG L+SLSEQELVDCD S N+GCNGG M+ AFEFI K GG+ TE+DYPY+G +
Sbjct: 161 GINKIVTGDLISLSEQELVDCDT-SYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYKGVD 219
Query: 237 DRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEY 274
RC + VTI YE +PA AFQLY G+FD
Sbjct: 220 GRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGI 279
Query: 275 CGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQAS 334
CG L+HGV VGYG ++G+ YW+VKNSWGTSWGE+GYIRM RN SS G CGI ++ S
Sbjct: 280 CGTDLDHGVVAVGYGTENGKDYWIVKNSWGTSWGESGYIRMERNIASS-AGKCGIAVEPS 338
Query: 335 YPVK 338
YP+K
Sbjct: 339 YPIK 342
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 156/307 (50%), Positives = 200/307 (65%), Gaps = 31/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E+WL ++ + Y S E +RRF ++ N+++ID NS+N ++++ N+FADL+NEE+ S
Sbjct: 42 YEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEEYRSM 101
Query: 122 YLGY--NKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
YLG N+ R S +Y LP SVDWRKEGAV VKDQG CGSCWAFSAVAA
Sbjct: 102 YLGALSGIRRNKLRKISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAA 161
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGINK+ TG L+SLSEQELVDCD NS N+GCNGG M+ FEFI GG+ +E+DYPY
Sbjct: 162 VEGINKIVTGDLISLSEQELVDCD-NSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLA 220
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD 272
++ RC T + V+I YE +P FQLYS GVF
Sbjct: 221 RDGRCDTYRKNARVVSIDSYEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFS 280
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG L+HGV VGYG ++G+ YW+V+NSWG SWGE+GY+RMARN GICGI M+
Sbjct: 281 GRCGTALDHGVVAVGYGTENGQDYWIVRNSWGKSWGESGYLRMARNIRKP-TGICGIAME 339
Query: 333 ASYPVKR 339
ASYP+K+
Sbjct: 340 ASYPIKK 346
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 165/338 (48%), Positives = 205/338 (60%), Gaps = 34/338 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL LL+ LG+ A + + SM ER W+ QY + Y E + RF I+ N
Sbjct: 10 ISLALLFCLGLFA---IQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFKEN 66
Query: 90 VQYIDYINSQN--LSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQY---LGL 142
V YI+ N+ + S+KL N+FADL+NEEFI++ + R S +Y G+
Sbjct: 67 VNYIETFNNADDTKSYKLGINQFADLTNEEFIASRNKFKGHMCSSIMRTTSFKYENVSGI 126
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P++VDWRK+GAVTPVK+QGQCG CWAFSAVAA EGI+KL TGKL+SLSEQELVDCD
Sbjct: 127 PSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGV 186
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA 262
+QGC GG M+ AF+FI + G++TE YPY G + C +K AVTITGYE +PA
Sbjct: 187 DQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSE 246
Query: 263 ----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLV 299
FQ Y GVF CG +L+HGVT VGYG + G KYWLV
Sbjct: 247 QALQKAVANQPISVAIDASGSDFQFYKSGVFTGACGTELDHGVTAVGYGVSNDGTKYWLV 306
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWGT WGE GYI M R ++ GICGI MQASYP
Sbjct: 307 KNSWGTDWGEEGYIMMQRGIEAAE-GICGIAMQASYPT 343
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 158/357 (44%), Positives = 214/357 (59%), Gaps = 39/357 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAG------AWSEGYPQKYDPQSMEER---FENWLKQYSR 71
M + ++ +++FL +LG+ + + E + K ++ E+ +E WL ++ +
Sbjct: 1 MGLCRSSSSMAVFLFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGK 60
Query: 72 EYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNE 131
Y + E +RRF I+ N+++ID N++N ++K+ N+FADL+NEE+ S YLG
Sbjct: 61 SYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNEEYRSMYLGTRTAAKR 120
Query: 132 -------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTG 184
R+ LP SVDWRK+GAV VKDQG CGSCWAFS +AAVEGINK+ TG
Sbjct: 121 RSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTG 180
Query: 185 KLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKT 244
L+SLSEQELVDCD S N+GCNGG M+ AFEFI GG+ +E+DYPY+ + RC +
Sbjct: 181 GLISLSEQELVDCDT-SYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRK 239
Query: 245 KHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHG 282
VTI GYE +P FQLY G+F CG L+HG
Sbjct: 240 NAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHG 299
Query: 283 VTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
VT VGYG ++G YW+VKNSWG SWGE GYIRM R+ +S G CGI M+ASYP+K+
Sbjct: 300 VTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKK 356
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 200/336 (59%), Gaps = 31/336 (9%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+ SL LL V G + E + + SM ER E W+ QY + Y E + R I+
Sbjct: 9 ITSLTLLLVFGFLS---FEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKE 65
Query: 89 NVQYID-YINSQNLSFKLTDNKFADLSNEEFIS-TYLGYNKPYNEPRWPSVQY---LGLP 143
NVQ I+ + N+ N S+KL N+FADL+NEEF + + N R P+ +Y +P
Sbjct: 66 NVQRIEAFNNAGNKSYKLGINQFADLTNEEFKARNRFKGHMCSNSTRTPTFKYEHVTSVP 125
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
AS+DWR++GAVTP+KDQGQCG CWAFSAVAA EGI KL TGKL+SLSEQELVDCD +
Sbjct: 126 ASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVD 185
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR--- 260
QGC GG M+ AF+FI + G+ TE YPY+G + C + A +I G+E +PA
Sbjct: 186 QGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSES 245
Query: 261 -------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKN 301
FQ YS GVF CG +L+HGVT VGYG D G KYWLVKN
Sbjct: 246 ALLKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELDHGVTAVGYGSDGGTKYWLVKN 305
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWG WGE GYIRM R+ + G+CG MQASYP
Sbjct: 306 SWGEQWGEQGYIRMQRDVAAEE-GLCGFAMQASYPT 340
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 165/366 (45%), Positives = 217/366 (59%), Gaps = 51/366 (13%)
Query: 5 LFIAIYTNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFEN 64
+F+ ++ + L A DM ++ + + W + D + M +E
Sbjct: 11 MFVLLFLSFTLSSASDMSIISYDQTHATKSSW---------------RTDDEVMA-IYEE 54
Query: 65 WLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG 124
WL + + Y + E ++RF ++ N+++ID NS+N ++KL N FADL+NEE+ STYLG
Sbjct: 55 WLVKQGKVYNALGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEEYRSTYLG 114
Query: 125 YN--KPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEG 177
N R S +Y LP SVDWRKEGAV VKDQG CGSCWAFS +AAVEG
Sbjct: 115 ARGGMKRNRLRKTSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEG 174
Query: 178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
INK+ TG L+SLSEQELVDCD S N+GCNGG M+ AFEFI GG+ TE+DYPY ++
Sbjct: 175 INKIVTGDLISLSEQELVDCDT-SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDG 233
Query: 238 RCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVFDEYC 275
RC T + VTI YE +P FQ Y+ G+F C
Sbjct: 234 RCDTYRKNAKVVTIDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRC 293
Query: 276 GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR--NSPSSNIGICGILMQA 333
G QL+HGV VGYG ++G+ YW+V+NSWG SWGE GY+RMAR NSP+ GICGI M+A
Sbjct: 294 GTQLDHGVAAVGYGTENGKDYWIVRNSWGKSWGENGYLRMARSINSPT---GICGIAMEA 350
Query: 334 SYPVKR 339
SYP+K+
Sbjct: 351 SYPIKK 356
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 159/312 (50%), Positives = 199/312 (63%), Gaps = 43/312 (13%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFADLSNEEF 118
ER E W+ QY R Y E +RR I+ +NV++I+ N +KL+ N+FADL+NEEF
Sbjct: 2 ERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEEF 61
Query: 119 ISTYLGY----------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
++ GY KP+ R+ +V +P+++DWRK+GAVTP+KDQGQCG CWA
Sbjct: 62 QASRNGYKMSAHLSSSSTKPF---RYENVS--AVPSTMDWRKKGAVTPIKDQGQCGCCWA 116
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSAVAA EGI +L TGKL+SLSEQELVDCD + E+QGCNGG M+ AF+FI + G+TTE
Sbjct: 117 FSAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEA 176
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
+YPY+G + C + K A ITGYE +PA AFQ Y
Sbjct: 177 NYPYQGADGACNSGKA---AAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFY 233
Query: 267 SHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
S GVF CG L+HGVT VGYG D G KYWLVKNSWGTSWGE GYIRM R+ + G
Sbjct: 234 SSGVFTGDCGTDLDHGVTAVGYGMSDDGTKYWLVKNSWGTSWGENGYIRMERDIDAQE-G 292
Query: 326 ICGILMQASYPV 337
+CGI M+ASYP
Sbjct: 293 LCGIAMEASYPT 304
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 160/338 (47%), Positives = 204/338 (60%), Gaps = 34/338 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
++L ++ LG+ A + + SM ER E W+ QYS+ Y E + R I+++N
Sbjct: 11 IALTFIFCLGLCA---IQVTSRSLQVDSMYERHEQWMSQYSKVYKDPQEREERHKIFTAN 67
Query: 90 VQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-----GL 142
V YI+ N + N +KL N+FADL+NEEFI++ + + + +
Sbjct: 68 VNYIEVFNNDANNKLYKLGINQFADLTNEEFIASRNKFKGHMCSSIAKTTTFKYENVSAI 127
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P++VDWRK+GAVTPVK+QGQCG CWAFSAVAA EGI KL TGKLVSLSEQELVDCD
Sbjct: 128 PSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGV 187
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA 262
+QGC GG M+ AF+FI + G++TE YPY+G + C +K HA TITGYE +PA
Sbjct: 188 DQGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVDGTCNANKASIHAATITGYEDVPANNE 247
Query: 263 ----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLV 299
FQ Y GVF CG +L+HGVT VGYG + G KYWLV
Sbjct: 248 QALQKAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGYGVGNDGTKYWLV 307
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWGT WGE GYIRM R ++ G+CGI MQASYP
Sbjct: 308 KNSWGTDWGEEGYIRMQRGVDAAE-GLCGIAMQASYPT 344
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 157/313 (50%), Positives = 196/313 (62%), Gaps = 30/313 (9%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
++ E +E WL ++ R Y DE Q+RF ++ N YI N N S+KL N+FADLS+
Sbjct: 36 DAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHNQGNRSYKLGLNQFADLSH 95
Query: 116 EEFISTYLG--YNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EEF +TYLG + R PS +Y LP S+DWR++GAVT VKDQG CGSCWA
Sbjct: 96 EEFKATYLGAKLDTKKRLSRPPSRRYQYSDGEDLPESIDWREKGAVTSVKDQGSCGSCWA 155
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS VAAVEGIN++ TG L+SLSEQELVDCD S NQGCNGG M+ AFEFI GG+ +E+
Sbjct: 156 FSTVAAVEGINQIVTGDLISLSEQELVDCDT-SYNQGCNGGLMDYAFEFIINNGGLDSEE 214
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLY 266
DYPY + C + + H VTI YE +P + FQ Y
Sbjct: 215 DYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGREFQFY 274
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
GVF CG QL+HGVT+VGYG + G YW VKNSWG SWGE G+IR+ RN ++ G+
Sbjct: 275 DSGVFTSTCGTQLDHGVTLVGYGSESGTDYWTVKNSWGKSWGEEGFIRLQRNIEVASTGM 334
Query: 327 CGILMQASYPVKR 339
CGI M+ASYPVK+
Sbjct: 335 CGIAMEASYPVKK 347
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 163/349 (46%), Positives = 213/349 (61%), Gaps = 37/349 (10%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSE-GYPQKYDPQS------MEERFENWLKQYSREYGSE 76
+L +A + LFL ++ A S Y + + S + +E WL ++ + S
Sbjct: 3 LLNSATVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRLYEEWLVKHGKAQNSL 62
Query: 77 DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPS 136
E RRF I+ N+++ID N +NLS++L KFADL+N+E+ S YLG ++ + S
Sbjct: 63 TEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSMYLG-SRLKRKATKSS 121
Query: 137 VQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSE 191
++Y +P SVDWRKEGAV VKDQG CGSCWAFS + AVEGINK+ TG L++LSE
Sbjct: 122 LRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSE 181
Query: 192 QELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTI 251
QELVDCD S N+GCNGG M+ AFEFI GG+ TE+DYPY+G + RC + VTI
Sbjct: 182 QELVDCDT-SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTI 240
Query: 252 TGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG 289
YE +PA AFQLY G+FD CG L+HGV VGYG
Sbjct: 241 DLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYG 300
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
++G+ YW+VKNSWGTSWGE+GYIRM RN SS G CGI ++ SYP+K
Sbjct: 301 TENGKDYWIVKNSWGTSWGESGYIRMERNIASS-AGKCGIAVEPSYPIK 348
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 156/313 (49%), Positives = 201/313 (64%), Gaps = 34/313 (10%)
Query: 58 MEERFENWLKQYSREYGSED----EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+E +E W+ ++ ++ +++ E +RF I+ N+++ID N++NLS+KL +FADL
Sbjct: 46 VERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYKLGLTRFADL 105
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWA 168
+NEE+ S YLG KP S +Y LP SVDWRKEGAV VKDQG CGSCWA
Sbjct: 106 TNEEYRSMYLGA-KPTKRVLKTSDRYQARVGDALPDSVDWRKEGAVADVKDQGSCGSCWA 164
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS + AVEGINK+ TG L+SLSEQELVDCD S NQGCNGG M+ AFEFI K GG+ TE
Sbjct: 165 FSTIGAVEGINKIVTGDLISLSEQELVDCDT-SYNQGCNGGLMDYAFEFIIKNGGIDTEA 223
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
DYPY+ + RC ++ VTI YE +P AFQLY
Sbjct: 224 DYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLY 283
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
S GVFD CG +L+HGV VGYG ++G+ YW+V+NSWG WGE+GYI+MARN + G
Sbjct: 284 SSGVFDGLCGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARNIEAPT-GK 342
Query: 327 CGILMQASYPVKR 339
CGI M+ASYP+K+
Sbjct: 343 CGIAMEASYPIKK 355
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 154/304 (50%), Positives = 196/304 (64%), Gaps = 30/304 (9%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E WL ++ + S E RRF I+ N+++ID N +NLS++L KFADL+N+E+ S
Sbjct: 42 YEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKFADLTNDEYRSM 101
Query: 122 YLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
YLG ++ + S++Y +P SVDWRKEGAV VKDQG CGSCWAFS + AVE
Sbjct: 102 YLG-SRLKRKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVE 160
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
GINK+ TG L++LSEQELVDCD S N+GCNGG M+ AFEFI GG+ TE+DYPY+G +
Sbjct: 161 GINKIVTGDLITLSEQELVDCDT-SYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVD 219
Query: 237 DRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEY 274
RC + VTI YE +PA AFQLY G+FD
Sbjct: 220 GRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGI 279
Query: 275 CGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQAS 334
CG L+HGV VGYG ++G+ YW+VKNSWGTSWGE+GYIRM RN SS G CGI ++ S
Sbjct: 280 CGTDLDHGVVAVGYGTENGKDYWIVKNSWGTSWGESGYIRMERNIASS-AGKCGIAVEPS 338
Query: 335 YPVK 338
YP+K
Sbjct: 339 YPIK 342
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 162/351 (46%), Positives = 212/351 (60%), Gaps = 42/351 (11%)
Query: 29 VLSLFLLWVLGIPAGAWSE--GYPQKYDPQSMEER--------FENWLKQYSREYGSE-- 76
V L L ++G+ A Y +K+ + ER +E W++++ ++ S
Sbjct: 6 VTILLLAMMIGVSYAADMSIISYDEKHHITAENERSDAEVARIYEAWMEKHGKKAQSNGL 65
Query: 77 --DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPY----N 130
+E +RF I+ N+++ID N++NLS+KL +FADL+NEE+ S YLG
Sbjct: 66 VGEEKDQRFEIFKDNLRFIDEHNNKNLSYKLGLTRFADLTNEEYRSIYLGAKSKKRVLKT 125
Query: 131 EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
R+ +P SVDWRKEGAV VKDQG CGSCWAFS + AVEGINK+ TG L+SLS
Sbjct: 126 SDRYQPRVGDAIPDSVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLS 185
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQELVDCD S NQGCNGG M+ AFEFI K GG+ TE+DYPY+ + RC + VT
Sbjct: 186 EQELVDCDT-SYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQTRKNAKVVT 244
Query: 251 ITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
I YE +P AFQLYS GVFD CG +L+HGV VGY
Sbjct: 245 IDAYEDVPENNEAALKKTLANQPISVAIEAGGRAFQLYSSGVFDGICGTELDHGVVAVGY 304
Query: 289 GEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
G ++G+ YW+V+NSWG SWGE+GYI+MARN + G CGI M+ASYP+K+
Sbjct: 305 GTENGKDYWIVRNSWGGSWGESGYIKMARN-IAEPTGKCGIAMEASYPIKK 354
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 149/334 (44%), Positives = 197/334 (58%), Gaps = 30/334 (8%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
FL+ +L + D SM R E W+ +Y R Y E +R ++ +NV +
Sbjct: 82 FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141
Query: 93 IDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNK-PYNEPRWPSVQYL-----GLPASV 146
I+ +N+ N F L N+FAD++ +EF + + GY P N+ R +Y LPAS+
Sbjct: 142 IELVNAGNDKFSLEANQFADMTVDEFRAAHTGYKPVPANKGRTTQFKYANVSLDALPASM 201
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGC 206
DWR +GAVTP+KDQGQCG CWAFS VA+VEGI KL TGKL+SLSEQELVDCDV+ +QGC
Sbjct: 202 DWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDGMDQGC 261
Query: 207 NGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------ 260
GG M+ AFEFI GG+TTE +YPY G +D C ++K + +I GYE +P+
Sbjct: 262 EGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVPSNDETSLL 321
Query: 261 ----------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSW 303
F+ Y GV CG +L+HG+ VGYG G K+WL+KNSW
Sbjct: 322 KAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSDGTKFWLMKNSW 381
Query: 304 GTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GTSWGE G+IRM R+ G+CG+ MQ SYP
Sbjct: 382 GTSWGEKGFIRMERDIADEE-GLCGLAMQPSYPT 414
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 159/318 (50%), Positives = 199/318 (62%), Gaps = 39/318 (12%)
Query: 54 DPQSMEERFENWLKQYSREYGSE--DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFA 111
D SM R E W+ Q+ R Y E D +RF ++ NV+ I+ N +FKL N+FA
Sbjct: 31 DEDSM--RHEEWMSQHGRVYADEQEDHKNKRFNVFKENVERIEEFNDGK-TFKLAINQFA 87
Query: 112 DLSNEEFISTYLGYNKPY------NEP---RWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
DL+NEEF ++Y G+ P +P R+ +V LP SVDWRK+GAVTPVK+QGQ
Sbjct: 88 DLTNEEFRASYNGFKGPMVLSSQITKPTPFRYENVSS-ALPVSVDWRKKGAVTPVKNQGQ 146
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CG CWAFSAVAA+EGI ++ TGKL+SLSEQELVDCD + GC GG M+ AFEFI G
Sbjct: 147 CGCCWAFSAVAAIEGITQISTGKLISLSEQELVDCDTKGIDHGCEGGLMDTAFEFIINNG 206
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------- 260
G+TTE +YPY+G++ C +KT AV+ITGYE +PA
Sbjct: 207 GLTTESNYPYKGEDGTCNFNKTNPIAVSITGYEDVPANDEQALMKAVAHQPVSVAIEAGG 266
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQ YS GVF CG +L+H VT VGYGE + G KYW+VKNSWGT WGE+GYI M ++
Sbjct: 267 SDFQFYSSGVFTGECGTELDHAVTAVGYGESEDGSKYWIVKNSWGTKWGESGYIEMQKDI 326
Query: 320 PSSNIGICGILMQASYPV 337
G+CGI MQASYP
Sbjct: 327 KVKQ-GLCGIAMQASYPT 343
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 156/313 (49%), Positives = 201/313 (64%), Gaps = 34/313 (10%)
Query: 58 MEERFENWLKQYSREYGSED----EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+E +E W+ ++ ++ +++ E +RF I+ N++YID N++NLS+KL +FADL
Sbjct: 46 VERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTKNLSYKLGLTRFADL 105
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWA 168
+N+E+ S YLG KP S +Y LP SVDWRKEGAV VKDQG CGSCWA
Sbjct: 106 TNDEYRSMYLG-AKPVKRVLKTSDRYEARVGDALPDSVDWRKEGAVADVKDQGSCGSCWA 164
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS + AVEGINK+ TG L+SLSEQELVDCD S NQGCNGG M+ AFEFI K GG+ TE
Sbjct: 165 FSTIGAVEGINKIVTGDLISLSEQELVDCDT-SYNQGCNGGLMDYAFEFIIKNGGIDTEA 223
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
DYPY+ + RC ++ VTI YE +P AFQLY
Sbjct: 224 DYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQPISVAIEAGGRAFQLY 283
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
S GVFD CG +L+HGV VGYG ++G+ YW+V+NSWG WGE+GYI+MARN + G
Sbjct: 284 SSGVFDGICGTELDHGVVAVGYGTENGKDYWIVRNSWGNRWGESGYIKMARN-IAEPTGK 342
Query: 327 CGILMQASYPVKR 339
CGI M+ASYP+K+
Sbjct: 343 CGIAMEASYPIKK 355
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 154/321 (47%), Positives = 207/321 (64%), Gaps = 33/321 (10%)
Query: 49 YPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDN 108
YP + D Q + +E WL ++ + Y + E ++RF I+ N+++ID NS + S+K+ N
Sbjct: 39 YPLRTDSQ-VRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDRSYKVGLN 97
Query: 109 KFADLSNEEFISTYLGYNKPYNEPRW---PSVQYL-----GLPASVDWRKEGAVTPVKDQ 160
+FADL+NEE+ + +LG K + R+ S +YL LP +VDWR++GAV PVKDQ
Sbjct: 98 RFADLTNEEYKAMFLG-TKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQ 156
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
GQCGSCWAFS V AVEGIN++ TG+L+SLSEQELVDCD S NQGCNGG M+ AFEFI
Sbjct: 157 GQCGSCWAFSTVGAVEGINQIVTGELISLSEQELVDCD-KSYNQGCNGGLMDYAFEFIIN 215
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------------- 260
GG+ TE+DYPY+ ++ C ++ VTI GYE +P
Sbjct: 216 NGGIDTEEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEA 275
Query: 261 --YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
AFQLY GVF CG +L+HGV VGYG ++G YW+V+NSWG++WGE+GYIRM RN
Sbjct: 276 GGRAFQLYKSGVFTGRCGTELDHGVVAVGYGTENGVNYWIVRNSWGSAWGESGYIRMERN 335
Query: 319 SPSSNIGICGILMQASYPVKR 339
++ G CGI +Q SYP K+
Sbjct: 336 VANTKTGKCGIAIQPSYPTKK 356
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 152/325 (46%), Positives = 208/325 (64%), Gaps = 36/325 (11%)
Query: 49 YPQKYDPQSMEE----RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSF 103
Y +K+ QS E+ R+E WL ++ R Y + E ++RF I+ N+++I+ + NS N ++
Sbjct: 33 YARKWTLQSDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTY 92
Query: 104 KLTDNKFADLSNEEFISTYLGYN----KPYNEPRWPSVQYLG-----LPASVDWRKEGAV 154
K+ N+FADL+NEE+ + YLG + + + + PS +Y +P SVDWRK GAV
Sbjct: 93 KVGLNQFADLTNEEYRTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAV 152
Query: 155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
P+K+QG CGSCWAFS VAAVEGIN++ TG++++LSEQELVDCD +N GCNGG M+ A
Sbjct: 153 APIKNQGSCGSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCD-RVQNSGCNGGLMDYA 211
Query: 215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------- 258
FEFI GG+ TE YPYRG RC + + V+I GYE +P
Sbjct: 212 FEFIISNGGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPRNERALQKAVAHQPVC 271
Query: 259 -----ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
+ AFQLYS GVF CG +++HGV VVGYG + G YW+V+NSWGT WGE GY+
Sbjct: 272 VAIEASGRAFQLYSSGVFTGECGEEVDHGVVVVGYGSEDGVDYWIVRNSWGTKWGENGYV 331
Query: 314 RMARNSPSSNIGICGILMQASYPVK 338
+M RN S++G CGI+ +ASYP K
Sbjct: 332 KMERNVKKSHLGKCGIMTEASYPTK 356
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 200/312 (64%), Gaps = 31/312 (9%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNE 116
+M R E W+ Q+ R Y E R ++ +NV +I+ N++N F L N+FADL+N+
Sbjct: 36 AMAARHEQWMAQFGRVYKDPAEKAHRLEVFKANVAFIESFNAENHEFWLGANQFADLTND 95
Query: 117 EFISTYL-------GYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
EF ++ G ++ V LPASVDWR +GAVTP+K+QGQCGSCWAF
Sbjct: 96 EFRASKTNKGIKQGGVRDAPTGFKYSDVSIDALPASVDWRTKGAVTPIKNQGQCGSCWAF 155
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
SAVAA EG+ KL TGKLVSLSEQELVDCDV+ +QGC GG+M+ AF+FI K GG+TTE +
Sbjct: 156 SAVAATEGVVKLSTGKLVSLSEQELVDCDVHGVDQGCMGGWMDDAFKFIIKNGGLTTEAN 215
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYS 267
YPY G++D+C++++T + A TI GYE +PA FQLY+
Sbjct: 216 YPYTGEDDKCKSNETVNVAATIKGYEDVPANDESALMKAVAHQPVSVVVDGGDMTFQLYA 275
Query: 268 HGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
GV CG +++HG+ +GYG +G KYWL+KNSWGT+WGE G++RMA++ P G+
Sbjct: 276 GGVMTGSCGVEMDHGIAAIGYGATSNGTKYWLMKNSWGTTWGEKGFLRMAKDIPDKR-GM 334
Query: 327 CGILMQASYPVK 338
CG+ M+ SYP +
Sbjct: 335 CGLAMKPSYPTE 346
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 161/357 (45%), Positives = 212/357 (59%), Gaps = 43/357 (12%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSE--GYPQKYDPQS-------MEERFENWLKQYSREY 73
M L + LSLFLL + + Y Q++ +S + +E WL ++ + Y
Sbjct: 1 MGLHRSSLSLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAY 60
Query: 74 GSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKP----- 128
+ E ++RFGI+ N+++ID NSQNL+++L N+FADL+NEE+ S YLG KP
Sbjct: 61 NALGEKEKRFGIFKDNLRFIDEHNSQNLTYRLGLNRFADLTNEEYRSMYLGV-KPGATRV 119
Query: 129 -----YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKT 183
R+ + LP +DWRKEGAV VKDQG CGSCWAFS +AAVEGIN++ T
Sbjct: 120 TRKVSRKSDRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVT 179
Query: 184 GKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK 243
G L+SLSEQELVDCD S N+GCNGG M+ AFEFI GG+ +E+DYPYR + +C +
Sbjct: 180 GDLISLSEQELVDCDT-SYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYR 238
Query: 244 TKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNH 281
+ V+I GYE +P AFQLY GVF CG L+H
Sbjct: 239 KNANVVSIDGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDH 298
Query: 282 GVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GV VGYG ++G+ YW+V NSWG +WGE GYIRM RN S+ G CGI + SYP+K
Sbjct: 299 GVAAVGYGTENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPIK 355
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 160/339 (47%), Positives = 207/339 (61%), Gaps = 36/339 (10%)
Query: 30 LSLFLLWVLGIPAGAWS-EGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+SL L++ LG+ W+ + + SM ER E W+ Y + Y E ++RF I++
Sbjct: 10 ISLALVFCLGL----WAIQVTSRTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTE 65
Query: 89 NVQYIDYINS--QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQY---LG 141
N++YI+ N+ N S+KL N+FADL+NEEF+++ + R + +Y
Sbjct: 66 NMKYIEAFNNGDNNESYKLGINQFADLTNEEFVASRNKFKGHMCSSIIRTTTFKYENVSA 125
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
+P++VDWRK+GAVTPVK+QGQCG CWAFSAVAA EGI+KL TGKLVSLSEQELVDCD
Sbjct: 126 IPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKG 185
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY 261
+QGC GG M+ AF+FI + G+ TE YPY+G + C +K A TITGYE +PA
Sbjct: 186 VDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCNANKASIQATTITGYEDVPANN 245
Query: 262 A----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWL 298
FQ Y GVF CG +L+HGVT VGYG + G KYWL
Sbjct: 246 EQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWL 305
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
VKNSWGT WGE GYI M R ++ G+CGI MQASYP
Sbjct: 306 VKNSWGTDWGEEGYIMMQRGVEAAE-GLCGIAMQASYPT 343
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 159/336 (47%), Positives = 203/336 (60%), Gaps = 33/336 (9%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+L+LFLL +GI E + + S+ ER E W+ +Y + Y E ++RF I+
Sbjct: 11 ILALFLLLAVGISRVISRELHETE---TSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKD 67
Query: 89 NVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN-EPRWPSVQY---LGLP 143
NV++I+ N+ N +KL N ADL+ EEF ++ G + Y+ E S +Y +P
Sbjct: 68 NVEFIESFNAAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYDYEVGTTSFKYENVTAIP 127
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
ASVDWRK+GAVTP+KDQGQCGSCWAFS VAA EGI+K+ TGKLVSLSEQELVDCD +
Sbjct: 128 ASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGTD 187
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----- 258
QGC GGYME FEFI K GG+TTE +YPY+ + C+ A I GYE +P
Sbjct: 188 QGCEGGYMEDGFEFIIKNGGITTEANYPYKAVDGSCKN--ATAPAAQIKGYEKVPVNSEK 245
Query: 259 -----------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKN 301
A +F YS G+F CG +L+HGVT VGYG +G YW+VKN
Sbjct: 246 ALLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYGRANGTDYWIVKN 305
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWGT WGE GYIRM R + G+CGI M +SYP
Sbjct: 306 SWGTVWGEQGYIRMQRGIAAKE-GLCGIAMDSSYPT 340
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 294 bits (753), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 159/337 (47%), Positives = 206/337 (61%), Gaps = 33/337 (9%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L++ LG+ A + + SM ER W+ QY + Y E + RF I++ N
Sbjct: 10 ISLALVFCLGLFA---IQVTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTEN 66
Query: 90 VQYIDYINSQNL-SFKLTDNKFADLSNEEFISTYLGYNKPY--NEPRWPSVQY---LGLP 143
V Y++ N+ + S+KL N+FADL+NEEF+++ + + R + +Y +P
Sbjct: 67 VNYVEASNADDTKSYKLGINQFADLTNEEFVASRNKFKGHMCSSITRTTTFKYENVSAIP 126
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
++VDWRK+GAVTPVK+QGQCG CWAFSAVAA EGI+KL TGKL+SLSEQELVDCD +
Sbjct: 127 STVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVD 186
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA- 262
QGC GG M+ AF+FI + G++TE YPY G + C +K AVTITGYE +PA
Sbjct: 187 QGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQ 246
Query: 263 ---------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVK 300
FQ Y GVF CG +L+HGVT VGYG + G KYWLVK
Sbjct: 247 ALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVK 306
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWGT WGE GYI M R ++ G+CGI MQASYP
Sbjct: 307 NSWGTDWGEEGYIMMQRGVEAAE-GLCGIAMQASYPT 342
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 158/308 (51%), Positives = 197/308 (63%), Gaps = 32/308 (10%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFADLSNE 116
M R E W+ QY R Y +E E +R+ I+ NV+YI+ N +KL N FADL+N+
Sbjct: 33 MAVRHEQWMAQYGRVYKNEVEKTKRYNIFKENVEYIESFNKAGTKPYKLGINAFADLTNK 92
Query: 117 EFISTYLGYNKPY----NEP-RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
EFI++ GY P+ N P R+ +V +P +VDWRK+GAVTPVKDQGQCG CWAFSA
Sbjct: 93 EFIASRNGYILPHECSSNTPFRYENVS--AVPTTVDWRKKGAVTPVKDQGQCGCCWAFSA 150
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
VAA+EGI KL TG L+SLSEQELVDCDV +QGC GG M+ AF FI G+TTE +YP
Sbjct: 151 VAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFTFIINNKGLTTESNYP 210
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHG 269
Y+G + C+ K+ + A I+GYE +PA FQ YS G
Sbjct: 211 YQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSDFQFYSSG 270
Query: 270 VFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
VF CG +L+HGVT VGYG + G KYWLVKNSWGTSWGE GYIRM ++ + G+CG
Sbjct: 271 VFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKE-GLCG 329
Query: 329 ILMQASYP 336
I MQ+SYP
Sbjct: 330 IAMQSSYP 337
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 160/352 (45%), Positives = 213/352 (60%), Gaps = 42/352 (11%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER--------FENWLKQYSREYGSEDEW 79
A+ LF+++ L + + + + Y DP ER +E+WL ++ + Y + E
Sbjct: 11 AISFLFMVFSLSLASMSIID-YDLPADPLQSTERTEAHMMKMYEHWLVKHGKNYNAIGEK 69
Query: 80 QRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP--S 136
+RRF I+ N++++D NS ++KL KFADL+NEE+ + YLG E S
Sbjct: 70 ERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYRAMYLGAKMEKKEKLRTERS 129
Query: 137 VQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+YL LP+ VDWR++GAVT VKDQGQCGSCWAFS V +VEGIN++ TG L+SL
Sbjct: 130 QRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCGSCWAFSTVGSVEGINQIVTGDLISL 189
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQELVDCD + NQGCNGG M+ AFEFI K GG+ +E DYPYR ++ C +++ H V
Sbjct: 190 SEQELVDCD-KAYNQGCNGGLMDYAFEFIIKNGGIDSEADYPYRASDNMCDSNRKNAHVV 248
Query: 250 TITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVG 287
TI GYE +P FQLY GVF CG L+HGV VG
Sbjct: 249 TIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREFQLYQSGVFTGRCGTNLDHGVVAVG 308
Query: 288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
YG ++G YW+V+NSWG WGE+GYIRM RN S++ G CGI M+ASYP K+
Sbjct: 309 YGTENGIDYWIVRNSWGPKWGESGYIRMERNVASTDTGKCGIAMEASYPTKK 360
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 157/337 (46%), Positives = 209/337 (62%), Gaps = 33/337 (9%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L + LG+ A + + S+ ER E W+ Y + Y + E ++R I++ N
Sbjct: 10 VSLALFFCLGLLA---IQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTEN 66
Query: 90 VQYIDYINS--QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQY--LGLP 143
++YI+ N+ N +KL N+FADL+NEEFI++ + R + +Y +P
Sbjct: 67 LKYIEASNNAGNNKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRTTTFKYENTSVP 126
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
++VDWRK+GAVTPVK+QGQCG CWAFSA+AA EGI+K+ TGKLVSLSEQELVDCD N +
Sbjct: 127 STVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVD 186
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA- 262
QGC GG M+ AF+FI + G++TE YPY+G + C+ ++ A TITGYE +PA
Sbjct: 187 QGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNEN 246
Query: 263 ---------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVK 300
FQ Y GVF CG +L+HGVT VGYG + G KYWLVK
Sbjct: 247 ALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVK 306
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWGT WGE GYIRM R+ ++ G+CGI MQASYP
Sbjct: 307 NSWGTDWGEEGYIRMQRSIDAAE-GLCGIAMQASYPT 342
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 293 bits (750), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 158/336 (47%), Positives = 206/336 (61%), Gaps = 35/336 (10%)
Query: 30 LSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+S+ LL++L AW S+ + SM ER E+W+ +Y R Y +E ++RF I+
Sbjct: 10 VSMALLFILA----AWASQATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKD 65
Query: 89 NVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPA 144
NV I+ N + + ++KL+ N+FADL+NEEF S + K + + +Y +P+
Sbjct: 66 NVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNRF-KAHICSEATTFKYENVTAVPS 124
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
++DWRK+GAVTP+KDQ QCG CWAFSAVAA EGI ++ TGKL+SLSEQELVDCD ENQ
Sbjct: 125 TIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQ 184
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---- 260
GC+GG M+ AF FI KI G+ +E YPY G + C + K H A I GYE +PA
Sbjct: 185 GCSGGLMDDAFRFI-KIHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANNEKA 243
Query: 261 ------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
+ FQ Y+ GVF CG +L+HGV VGYG D G YWLVKN
Sbjct: 244 LQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMMYWLVKN 303
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWGT WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 304 SWGTGWGEEGYIRMQRDVTAKE-GLCGIAMQASYPT 338
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 293 bits (750), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 153/301 (50%), Positives = 195/301 (64%), Gaps = 29/301 (9%)
Query: 66 LKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLG 124
L ++ + Y + ++RF I+ N+++ID N N SFKL NKFADLSNEE+ S +LG
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 125 YNKPYNEPRWPSVQY---LG--LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGIN 179
+ + S ++ +G LP SVDWR++GAV PVKDQGQCGSCWAFS VAAVEGIN
Sbjct: 71 GRMVRDRKGFESDRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVEGIN 130
Query: 180 KLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC 239
++ TG L+SLSEQELVDCD NQGCNGG+M+ AFEFI K GG+ TEDDYPY+G + +C
Sbjct: 131 QIATGDLISLSEQELVDCD-KGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDGQC 189
Query: 240 QTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGH 277
++ VTI G+E +P AFQLY G+F+ CG
Sbjct: 190 DQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGLCGT 249
Query: 278 QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
L+HGV VGYG + G+ YW+V+NSWG +WGE GYIR+ RN S+N G CGI MQ SYP
Sbjct: 250 DLDHGVVAVGYGTEDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQPSYPT 309
Query: 338 K 338
K
Sbjct: 310 K 310
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 161/336 (47%), Positives = 204/336 (60%), Gaps = 32/336 (9%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+S L+ LG+ A S Q SM+ER E W+ +Y R Y E ++RF I+ N
Sbjct: 10 VSFALVLCLGLWAFQVSSRTLQD---ASMQERHEQWMARYGRVYKDLQEKEKRFSIFKEN 66
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQY--LGLPA 144
V YI+ N+ + +KL N+FADL+NEEFI+T + + R + +Y + P+
Sbjct: 67 VNYIEASNNAGDKPYKLGVNQFADLTNEEFIATRNKFKGHMSSSITRTTTFKYENVTAPS 126
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+VDWR+EGAVTPVK+QG CG CWAFSAVAA EGI+KL TG LVSLSEQELVDCD + +Q
Sbjct: 127 TVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQ 186
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA-- 262
GC GG M+ AF+FI + GG+ TE YPY+G + C T++ H TITGYE +P+
Sbjct: 187 GCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITGYEDVPSNNEQA 246
Query: 263 --------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
FQ Y GVF CG QL+HGV VVGYG D G KYWLVKN
Sbjct: 247 LQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLVKN 306
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWG WGE GYIRM R+ + G+CG+ MQ SYP
Sbjct: 307 SWGADWGEEGYIRMQRDVDAPE-GLCGLAMQPSYPT 341
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 151/325 (46%), Positives = 207/325 (63%), Gaps = 36/325 (11%)
Query: 49 YPQKYDPQSMEE----RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSF 103
Y +K+ QS E+ R+E WL ++ R Y + E ++RF I+ N+++I+ + NS N ++
Sbjct: 33 YARKWTLQSDEDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTY 92
Query: 104 KLTDNKFADLSNEEFISTYLGYN----KPYNEPRWPSVQYLG-----LPASVDWRKEGAV 154
K+ N+FADL+NEE+ + YLG + + + + PS +Y +P SVDWRK GAV
Sbjct: 93 KVGLNQFADLTNEEYRTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAV 152
Query: 155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
P+K+QG CGSCWAFS VAAV GIN++ TG++++LSEQELVDCD +N GCNGG M+ A
Sbjct: 153 APIKNQGSCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCD-RVQNSGCNGGLMDYA 211
Query: 215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------- 258
FEFI GG+ TE YPYRG RC + + V+I GYE +P
Sbjct: 212 FEFIISNGGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPRNERALQKAVAHQPVC 271
Query: 259 -----ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
+ AFQLYS GVF CG +++HGV VVGYG + G YW+V+NSWGT WGE GY+
Sbjct: 272 VAIEASGRAFQLYSSGVFTGECGEEVDHGVVVVGYGSEDGVDYWIVRNSWGTKWGENGYV 331
Query: 314 RMARNSPSSNIGICGILMQASYPVK 338
+M RN S++G CGI+ +ASYP K
Sbjct: 332 KMERNVKKSHLGKCGIMTEASYPTK 356
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 153/321 (47%), Positives = 202/321 (62%), Gaps = 36/321 (11%)
Query: 53 YDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W+ + R Y + E +RRF ++ N++Y+D N+ SF+L
Sbjct: 34 YGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRL 93
Query: 106 TDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQ 160
N+FADL+N+E+ +TYLG R +YL LP SVDWR +GAV VKDQ
Sbjct: 94 GLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEVKDQ 153
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G CGSCWAFS +AAVEGIN++ TG ++SLSEQELVDCD S NQGCNGG M+ AFEFI
Sbjct: 154 GSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT-SYNQGCNGGLMDYAFEFIIN 212
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------------- 260
GG+ TE+DYPY+G + RC ++ VTI YE +PA
Sbjct: 213 NGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEA 272
Query: 261 --YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
AFQLY+ G+F CG L+HGVT VGYG ++G+ YW+VKNSWG+SWGE+GY+RM RN
Sbjct: 273 GGRAFQLYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERN 332
Query: 319 SPSSNIGICGILMQASYPVKR 339
+S+ G CGI ++ SYP+K+
Sbjct: 333 IKASS-GKCGIAVEPSYPLKK 352
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 157/308 (50%), Positives = 195/308 (63%), Gaps = 32/308 (10%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFADLSNE 116
M R E W+ QY R Y +E E +RF I+ NV+YI+ N +KL N FADL+N+
Sbjct: 33 MVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADLTNQ 92
Query: 117 EFISTYLGYNKPY----NEP-RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
EF ++ GY P+ N P R+ +V +P +VDWR +GAVTPVKDQGQCG CWAFSA
Sbjct: 93 EFKASRNGYKLPHDCSSNTPFRYENVS--SVPTTVDWRTKGAVTPVKDQGQCGCCWAFSA 150
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
VAA+EGI KL TG L+SLSEQELVDCDV +QGC GG M+ AF FI G+TTE +YP
Sbjct: 151 VAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSFIINNKGLTTESNYP 210
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHG 269
Y+G + C+ K+ + A I+GYE +PA FQ YS G
Sbjct: 211 YQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSDFQFYSSG 270
Query: 270 VFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
VF CG +L+HGVT VGYG + G KYWLVKNSWGTSWGE GYIRM ++ + G+CG
Sbjct: 271 VFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKE-GLCG 329
Query: 329 ILMQASYP 336
I MQ+SYP
Sbjct: 330 IAMQSSYP 337
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 149/311 (47%), Positives = 194/311 (62%), Gaps = 28/311 (9%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFA 111
Y+ S++ER E W+ ++ + Y E ++RF I+ NV++I+ N+ N +KL+ N A
Sbjct: 31 YESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHLA 90
Query: 112 DLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
DL+ +EF ++ GY K E S +Y +PA+VDWR +GAVTP+KDQGQCGSCWA
Sbjct: 91 DLTLDEFKASRNGYKKIDREFTTTSFKYENVTAIPAAVDWRVKGAVTPIKDQGQCGSCWA 150
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS VAA EGIN++ TGKLVSLSEQELVDCD E+QGC GG ME FEFI K GG+T+E
Sbjct: 151 FSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSET 210
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLY 266
+YPY+ + C T T A ITGYE +P + +F Y
Sbjct: 211 NYPYKAADGSCNTATTTPVA-KITGYEKVPVNSEKSLLKAVANQPISVSIDASDSSFMFY 269
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
S G++ CG +L+HGVT VGYG +G YW+VKNSWGT WGE GYIRM R + G+
Sbjct: 270 SSGIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIAAKE-GL 328
Query: 327 CGILMQASYPV 337
CGI M +SYP
Sbjct: 329 CGIAMDSSYPT 339
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 152/321 (47%), Positives = 202/321 (62%), Gaps = 36/321 (11%)
Query: 53 YDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W+ + R Y + E +RRF ++ N++Y+D N+ SF+L
Sbjct: 34 YGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRL 93
Query: 106 TDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQ 160
N+FADL+N+E+ +TYLG R +YL LP SVDWR +GAV +KDQ
Sbjct: 94 GLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEIKDQ 153
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G CGSCWAFS +AAVEGIN++ TG ++SLSEQELVDCD S NQGCNGG M+ AFEFI
Sbjct: 154 GSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT-SYNQGCNGGLMDYAFEFIIN 212
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------------- 260
GG+ TE+DYPY+G + RC ++ VTI YE +PA
Sbjct: 213 NGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEA 272
Query: 261 --YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
AFQLY+ G+F CG L+HGVT VGYG ++G+ YW+VKNSWG+SWGE+GY+RM RN
Sbjct: 273 GGRAFQLYNSGIFTGTCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERN 332
Query: 319 SPSSNIGICGILMQASYPVKR 339
+S+ G CGI ++ SYP+K+
Sbjct: 333 IKASS-GKCGIAVEPSYPLKK 352
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 155/320 (48%), Positives = 202/320 (63%), Gaps = 36/320 (11%)
Query: 53 YDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W+ + R Y + E +RR+ ++ N++YID N+ SF+L
Sbjct: 29 YGERSXEEARRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRL 88
Query: 106 TDNKFADLSNEEFISTYLGY-NKPYNE----PRWPSVQYLGLPASVDWRKEGAVTPVKDQ 160
N+FADL+N+E+ +TYLG +P E R+ + LP SVDWR +GAV VKDQ
Sbjct: 89 GLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQ 148
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G CGSCWAFS +AAVEGIN++ TG L+SLSEQELVDCD S NQGCNGG M+ AFEFI
Sbjct: 149 GSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT-SYNQGCNGGLMDYAFEFIIN 207
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------------- 260
GG+ TE DYPY+G + RC ++ VTI YE +PA
Sbjct: 208 NGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEA 267
Query: 261 --YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
AFQLYS G+F CG L+HGVT VGYG ++G+ YW+VKNSWG+SWGE+GY+RM RN
Sbjct: 268 AGTAFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMERN 327
Query: 319 SPSSNIGICGILMQASYPVK 338
+S+ G CGI ++ SYP+K
Sbjct: 328 IKASS-GKCGIAVEPSYPLK 346
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 155/354 (43%), Positives = 216/354 (61%), Gaps = 39/354 (11%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGA-------WSEGYPQKYDPQSMEE---RFENWLKQYSRE 72
M L + +++ LL+ L + + A + + K ++ +E +E+WL ++ +
Sbjct: 1 MKLLSPSMAIALLFALFVASSALDMSIINYDATHASKSSWRTDDEVMAMYESWLVKHGKS 60
Query: 73 YGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNE 131
Y + E ++RF I+ N+++ID N++ NLS+K+ N+FADL+NEE+ STYLG
Sbjct: 61 YNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEEYRSTYLGAKSKPKL 120
Query: 132 PRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKL 186
+ S +Y LP SVDWR +GAV P+KDQG CGSCWAFS V AVEGIN++ TG+L
Sbjct: 121 SKVKSDRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWAFSTVNAVEGINQIVTGEL 180
Query: 187 VSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKH 246
++LSEQELVDCD S N+GC+GG M+ FEFI GG+ T+ DYPY G++ RC +
Sbjct: 181 ITLSEQELVDCD-KSYNEGCDGGLMDYGFEFIINNGGIDTDKDYPYLGRDARCDQYRKNA 239
Query: 247 HAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVT 284
VTI YE +P AFQ Y G+F CG L+HGV
Sbjct: 240 KVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYDSGIFTGKCGTALDHGVN 299
Query: 285 VVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VVGYG + G+ YW+V+NSWG+SWGEAGYIRM RN +++G CGI M+ SYP+K
Sbjct: 300 VVGYGTEKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSVGKCGIAMEPSYPLK 353
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 149/308 (48%), Positives = 195/308 (63%), Gaps = 32/308 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+ WL ++ + Y E +RRF I+ N++++D NS+N S+K+ N+FADL+NEE+ S
Sbjct: 47 YAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSENRSYKVGLNRFADLTNEEYRSM 106
Query: 122 YLG--------YNKPYNEPRWPSVQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
+LG + K + R +VQ LP SVDWR+ GAV P+KDQG CGSCWAFS V
Sbjct: 107 FLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKDQGSCGSCWAFSTV 166
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AAVEG+N++ TG+++ LSEQELVDCD + + GCNGG M+ AFEFI GG+ TE+DYPY
Sbjct: 167 AAVEGVNQIATGEMIQLSEQELVDCD-RTYDAGCNGGLMDYAFEFIINNGGIDTEEDYPY 225
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGV 270
RG + C ++ V+I YE +P + AFQLY GV
Sbjct: 226 RGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEASGRAFQLYLSGV 285
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
F CG L+HGV VVGYG D+G +W+V+NSWGTSWGE GYIRM RN + G CGI
Sbjct: 286 FTGECGRALDHGVVVVGYGTDNGADHWIVRNSWGTSWGENGYIRMERNVVDNFGGKCGIA 345
Query: 331 MQASYPVK 338
MQASYP+K
Sbjct: 346 MQASYPIK 353
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 152/313 (48%), Positives = 201/313 (64%), Gaps = 33/313 (10%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNE 116
++E FE+WL ++ + Y + DE +RF I+ N++YID NS +N S+KL N+FAD++NE
Sbjct: 46 VKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNE 105
Query: 117 EFISTYLGYNKPYNE-------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
E+ + YLG + + R+ V LP S+DWR++GAVT VKDQG CGSCWAF
Sbjct: 106 EYRTGYLGAKRDASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVKDQGSCGSCWAF 165
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
S +AAVEG+N+L TG L+SLSEQELVDCD NQGCNGG M AF+FI K GG+ +E+D
Sbjct: 166 STIAAVEGVNQLATGNLISLSEQELVDCD-RKINQGCNGGDMGYAFQFIIKNGGIDSEED 224
Query: 230 YPYRGKNDRCQTDKTKHHAV-TITGYEAIPAR----------------------YAFQLY 266
YPY GK+ +C + + + V +I GYE +P Y FQLY
Sbjct: 225 YPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDFQLY 284
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
S G+F CG L+HGV VGYG ++G YW+VKNSWG WGE GY+RM RN + G+
Sbjct: 285 SSGIFTGSCGTDLDHGVAAVGYGTENGVDYWIVKNSWGDYWGEKGYVRMQRNV-KAKTGL 343
Query: 327 CGILMQASYPVKR 339
CGI M+ASYP K+
Sbjct: 344 CGIAMEASYPTKK 356
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 150/314 (47%), Positives = 195/314 (62%), Gaps = 33/314 (10%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFA 111
+ + + W+ ++ Y + E +RRF + N++YID N+ SF+L N+FA
Sbjct: 37 EEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFA 96
Query: 112 DLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSC 166
DL+NEE+ STYLG + R S +Y LP SVDWRK+GAV VKDQG CGSC
Sbjct: 97 DLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSC 156
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSA+AAVEGIN++ TG ++ LSEQELVDCD S NQGCNGG M+ AFEFI GG+ +
Sbjct: 157 WAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT-SYNQGCNGGLMDYAFEFIINNGGIDS 215
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E+DYPY+ +++RC +K VTI GYE +P AFQ
Sbjct: 216 EEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQ 275
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
LY G+F CG L+HGV VGYG ++G+ YWLV+NSWG+ WGE GYIRM RN +S+
Sbjct: 276 LYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGSVWGEDGYIRMERNIKASS- 334
Query: 325 GICGILMQASYPVK 338
G CGI ++ SYP K
Sbjct: 335 GKCGIAVEPSYPTK 348
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 159/337 (47%), Positives = 201/337 (59%), Gaps = 32/337 (9%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+ SL LL V G A E + + S++ER E W+ QY + Y E + R I+
Sbjct: 9 ISSLALLLVFGFLA---FEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKE 65
Query: 89 NVQYID-YINSQNLSFKLTDNKFADLSNEEFIS-TYLGYNKPYNEPRWPSVQY---LGLP 143
NVQ I+ + N+ N +KL N+FADL+NEEF + + N R P+ +Y +P
Sbjct: 66 NVQRIEAFNNAGNKPYKLGINQFADLTNEEFKARNRFKGHMCSNSTRTPTFKYEDVSSVP 125
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
AS+DWR++GAVTP+KDQGQCG CWAFSAVAA EGI KL TGKL+SLSEQELVDCD +
Sbjct: 126 ASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVD 185
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR--- 260
QGC GG M+ AF+FI + G+ TE YPY+G + C + A +I G+E +PA
Sbjct: 186 QGCEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSES 245
Query: 261 -------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVK 300
FQ YS G+F CG +L+HGVT VGYG D G KYWLVK
Sbjct: 246 ALLKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVSDDGTKYWLVK 305
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWG WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 306 NSWGEQWGEEGYIRMQRDVAAEE-GLCGIAMQASYPT 341
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 148/311 (47%), Positives = 193/311 (62%), Gaps = 28/311 (9%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFA 111
Y+ S++ER E W+ +Y + Y E ++RF I+ NV++I+ N+ N +KL+ N A
Sbjct: 31 YESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLA 90
Query: 112 DLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
DL+ +EF ++ GY K E S +Y +P +VDWR +GAVTP+KDQGQCGSCWA
Sbjct: 91 DLTLDEFKASRNGYKKIDREFATTSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWA 150
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS VAA+EGIN++ TGKL+SLSEQELVDCD E+QGC GG ME FEFI K GG+T+E
Sbjct: 151 FSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSET 210
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLY 266
+YPY+ + C T T A ITGYE +P + +F Y
Sbjct: 211 NYPYKAADGSCNTATTAPVA-KITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFY 269
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
S G++ CG +L+HGVT VGYG +G YW+VKNSWGT WGE GYIRM R G+
Sbjct: 270 SSGIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKE-GL 328
Query: 327 CGILMQASYPV 337
CGI M +SYP
Sbjct: 329 CGIAMDSSYPT 339
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 160/337 (47%), Positives = 204/337 (60%), Gaps = 33/337 (9%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL LL LG+ A + + SM ER + W+ QY++ Y EW++RF I+ N
Sbjct: 10 ISLALLMCLGLWA---VQVTSRTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIFKEN 66
Query: 90 VQYIDYINSQNLSF-KLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQY---LGLP 143
V YI+ N + F KL N+F DL+NEEFI+ + R + +Y +P
Sbjct: 67 VNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAPRNRFKGHMCSSIIRTNTYKYENVTTVP 126
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
++VDWR++GAVTPVKDQGQCG CWAFSAVAA EGI++L TGKL+SLSEQELVDCD +
Sbjct: 127 SNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDTKGVD 186
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA- 262
QGC GG M+ AF+FI + G+ TE YPY+G + C ++ +A TIT YE +P
Sbjct: 187 QGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVDGTCNANEASINAATITSYEDVPTNNEQ 246
Query: 263 ---------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVK 300
FQ Y+ GVF CG +L+HGVT VGYG D G KYWLVK
Sbjct: 247 ALQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDDGTKYWLVK 306
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWGTSWGE GYIRM R + G+CGI MQASYP+
Sbjct: 307 NSWGTSWGEEGYIRMQRGVDAVE-GLCGIAMQASYPI 342
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 158/346 (45%), Positives = 206/346 (59%), Gaps = 37/346 (10%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQS-------MEERFENWLKQYSREYGSEDEWQR 81
++ LFL++ L Y Q + +S + +E WL ++ + Y + E ++
Sbjct: 2 LMLLFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKEK 61
Query: 82 RFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNK------PYNEPRWP 135
RF I+ N+ +ID NS+N ++ + N+FADL+NEEF S YLG P R+
Sbjct: 62 RFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDRYA 121
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
LP SVDWRKEGAV VKDQG CGSCWAFS +AAVEGINK+ TG L++LSEQELV
Sbjct: 122 PRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGDLIALSEQELV 181
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DCD S N+GCNGG M+ AFEFI GG+ TEDDYPY G++ RC T + V+I YE
Sbjct: 182 DCDT-SYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYE 240
Query: 256 AIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHG 293
+P FQLY+ GVF CG L+HGV VGYG + G
Sbjct: 241 DVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVAAVGYGTEKG 300
Query: 294 EKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
+ YW+V+NSWG SWGE+GYIRM RN +S G CGI ++ SYP+K+
Sbjct: 301 KDYWIVRNSWGKSWGESGYIRMERNI-ASPTGKCGIAIEPSYPIKK 345
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 151/306 (49%), Positives = 192/306 (62%), Gaps = 30/306 (9%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E WL ++ + Y + E ++RF I+ N+ +ID NS+N ++ + N+FADL+NEEF S
Sbjct: 51 YEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEEFRSM 110
Query: 122 YLGYNK------PYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
YLG P R+ LP SVDWRKEGAV VKDQG CGSCWAFS +AAV
Sbjct: 111 YLGTRTGHKKRLPKTSDRYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAV 170
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGINK+ TG L++LSEQELVDCD S N+GCNGG M+ AFEFI GG+ TEDDYPY G+
Sbjct: 171 EGINKIVTGDLIALSEQELVDCDT-SYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGR 229
Query: 236 NDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDE 273
+ RC T + V+I YE +P FQLY+ GVF
Sbjct: 230 DGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTG 289
Query: 274 YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
CG L+HGV VGYG + G+ YW+V+NSWG SWGE+GYIRM RN +S G CGI ++
Sbjct: 290 ECGTSLDHGVAAVGYGTEKGKDYWIVRNSWGKSWGESGYIRMERNI-ASPTGKCGIAIEP 348
Query: 334 SYPVKR 339
SYP+K+
Sbjct: 349 SYPIKK 354
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 153/321 (47%), Positives = 203/321 (63%), Gaps = 34/321 (10%)
Query: 49 YPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFK 104
Y ++ D ++ + W+ + R Y + E +RR+ ++ N++YID N+ SF+
Sbjct: 34 YGERSDEEA-RRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFR 92
Query: 105 LTDNKFADLSNEEFISTYLGY-NKPYNE----PRWPSVQYLGLPASVDWRKEGAVTPVKD 159
L N+FADL+N+E+ +TYLG +P E R+ + LP SVDWR +GAV VKD
Sbjct: 93 LGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAADNEDLPESVDWRAKGAVAEVKD 152
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QG CGSCWAFS +AAVEGIN++ TG L+SLSEQELVDCD S NQGCNGG M+ AFEFI
Sbjct: 153 QGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT-SYNQGCNGGLMDYAFEFII 211
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------- 260
GG+ TE DYPY+G + RC ++ VTI YE +PA
Sbjct: 212 NNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIE 271
Query: 261 ---YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
AFQLYS G+F CG L+HGVT VGYG ++G+ YW+VKNSWG+SWGE+GY+RM R
Sbjct: 272 AAGTAFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMER 331
Query: 318 NSPSSNIGICGILMQASYPVK 338
N +S+ G CGI ++ SYP+K
Sbjct: 332 NIKASS-GKCGIAVEPSYPLK 351
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 291 bits (746), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 157/342 (45%), Positives = 200/342 (58%), Gaps = 30/342 (8%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
M + L ++ LG A + + SM ER E W+ Y R Y +E Q+R
Sbjct: 1 MGFVSQCFCLVVMVTLGALASQLAA--ARSLQDASMRERHEEWMASYGRVYKDINEKQKR 58
Query: 83 FGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY-- 139
+ I+ NV I+ N N +KL+ N+FADL+NEEF ++ + + S +Y
Sbjct: 59 YKIFEENVALIESSNKDANKPYKLSVNQFADLTNEEFKASRNRFKGHICSTKSTSFKYGN 118
Query: 140 -LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
+P+++DWR +GAVTPVKDQGQCG CWAFSAVAA EGI KL TG+L+SLSEQELVDCD
Sbjct: 119 VSAVPSAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELISLSEQELVDCD 178
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
+ +QGC GG M+ AF FI G+ +E +YPY+G + C T+K HA I G+E +P
Sbjct: 179 TSGVDQGCEGGLMDNAFTFIQHNHGLASEANYPYKGVDGTCNTNKQAIHAAEINGFEDVP 238
Query: 259 AR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEK 295
A FQ YS GVF CG QL+HGVT VGYG D G K
Sbjct: 239 ANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIGACGTQLDHGVTAVGYGTSDDGTK 298
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YWLVKNSWGT WGE GYIRM R+ + G+CGI M+ASYP
Sbjct: 299 YWLVKNSWGTQWGEEGYIRMQRDVDAKE-GLCGIAMKASYPT 339
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 156/337 (46%), Positives = 208/337 (61%), Gaps = 33/337 (9%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L + LG+ A + + S+ ER E W+ Y + Y + E ++R I++ N
Sbjct: 10 VSLALFFCLGLLA---IQVTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTEN 66
Query: 90 VQYIDYINS--QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQY--LGLP 143
++YI+ N+ +KL N+FADL+NEEFI++ + R + +Y +P
Sbjct: 67 LKYIEASNNAGNKKPYKLGINQFADLTNEEFIASRNKFKGHMCSSIIRTTTFKYENTSVP 126
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
++VDWRK+GAVTPVK+QGQCG CWAFSA+AA EGI+K+ TGKLVSLSEQELVDCD N +
Sbjct: 127 STVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVD 186
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA- 262
QGC GG M+ AF+FI + G++TE YPY+G + C+ ++ A TITGYE +PA
Sbjct: 187 QGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNEN 246
Query: 263 ---------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVK 300
FQ Y GVF CG +L+HGVT VGYG + G KYWLVK
Sbjct: 247 ALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDGTKYWLVK 306
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWGT WGE GYIRM R+ ++ G+CGI MQASYP
Sbjct: 307 NSWGTDWGEEGYIRMQRSIDAAE-GLCGIAMQASYPT 342
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 291 bits (744), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 161/336 (47%), Positives = 197/336 (58%), Gaps = 55/336 (16%)
Query: 52 KYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFA 111
+ DP M ERFE W+ ++ R Y E QRR +Y NV+ ++ NS ++L DNKFA
Sbjct: 46 RADP--MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFA 103
Query: 112 DLSNEEFISTYLGYNKPYN-----EPRWPSV------------QYLGLPASVDWRKEGAV 154
DL+NEEF + LG+ +P + PS Y LP SVDWR++GAV
Sbjct: 104 DLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAV 163
Query: 155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
PVK QG CGSCWAFSAVAA+EGIN++K GKLVSLSEQELVDCD + GC GGYM A
Sbjct: 164 APVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA--IGCAGGYMSWA 221
Query: 215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI----------------- 257
FEF+ K G+TTE +YPY+G N CQT K K AV+I+GY +
Sbjct: 222 FEFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPV 281
Query: 258 -----PARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE-----------KYWLVKN 301
+ +QLY GVF C +LNHGVTVVGYGE G+ KYW+VKN
Sbjct: 282 SVAVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKN 341
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWG WG+AGYI M R + S G+CGI M SYPV
Sbjct: 342 SWGPEWGDAGYILMQREA-SVASGLCGIAMLPSYPV 376
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 198/336 (58%), Gaps = 55/336 (16%)
Query: 52 KYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFA 111
+ DP M ERFE W+ ++ R Y E QRR +Y NV+ ++ NS ++L DNKFA
Sbjct: 25 RADP--MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFA 82
Query: 112 DLSNEEFISTYLGYNKPYN-----EPRWPSV------------QYLGLPASVDWRKEGAV 154
DL+NEEF + LG+ +P + PS Y LP SVDWR++GAV
Sbjct: 83 DLTNEEFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAV 142
Query: 155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
PVK QG CGSCWAFSAVAA+EGIN++K GKLVSLSEQELVDCD + GC GGYM A
Sbjct: 143 APVKSQGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA--IGCAGGYMSWA 200
Query: 215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI----------------- 257
FEF+ K G+TTE +YPY+G N CQT K K AV+I+GY +
Sbjct: 201 FEFVMKNRGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPV 260
Query: 258 -----PARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE-----------KYWLVKN 301
+ +QLY GVF C +LNHGVTVVGYGE G+ KYW+VKN
Sbjct: 261 SVAVDAGSFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKN 320
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWG WG+AGYI M R + ++ G+CGI M SYPV
Sbjct: 321 SWGPEWGDAGYILMQREASVAS-GLCGIAMLPSYPV 355
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 194/310 (62%), Gaps = 30/310 (9%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSN 115
SM ER E W+ +Y + Y E ++RF I+ NV YI+ + N+ N +KL N+FADL+N
Sbjct: 581 SMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTN 640
Query: 116 EEFISTYLGYNKPYNEP--RWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
EEFI+ + R + +Y +P++VDWR++GAVTP+KDQGQCG CWAFS
Sbjct: 641 EEFIAPRNRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFS 700
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
AVAA EGI+ L +GKL+SLSEQELVDCD +QGC GG M+ AF+F+ + G+ TE +Y
Sbjct: 701 AVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANY 760
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSH 268
PY+G + +C ++ + VTITGYE +PA FQ Y
Sbjct: 761 PYKGVDGKCNANEAANDVVTITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKS 820
Query: 269 GVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVF CG +L+HGVT VGYG + G +YWLVKNSWGT WGE GYIRM R S G+C
Sbjct: 821 GVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEE-GLC 879
Query: 328 GILMQASYPV 337
GI MQASYP
Sbjct: 880 GIAMQASYPT 889
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 152/312 (48%), Positives = 195/312 (62%), Gaps = 30/312 (9%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+ + +E WL ++ + Y + E +RF I+ N+++ID N++N ++KL N+FADL+N
Sbjct: 34 EEVNSLYEEWLVKHGKLYNALGEKDKRFQIFKDNLRFIDQQNAENRTYKLGLNRFADLTN 93
Query: 116 EEFISTYLGYNKPYNEP--RWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EE+ + YLG N R PS +Y LP SVDWRKEGAV PVKDQ CGSCWA
Sbjct: 94 EEYRARYLGTKIDPNRRLGRTPSNRYAPRVGETLPDSVDWRKEGAVVPVKDQASCGSCWA 153
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSA+ AVEGINK+ TG L+SLSEQELVDCD N GCNGG M+ AFEFI K GG+ +E+
Sbjct: 154 FSAIGAVEGINKIVTGDLISLSEQELVDCDT-GYNMGCNGGLMDYAFEFIIKNGGIDSEE 212
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAI---------------PARYA-------FQLY 266
DYPY+G + RC + V+I GYE + P A FQLY
Sbjct: 213 DYPYKGVDGRCDEYRKNAKVVSIDGYEDVNTYDELALKKAVANQPVSVAVEGGGREFQLY 272
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
S GVF CG L+HGV VGYG D+G +W+V+NSWG WGE GYIR+ RN +S G
Sbjct: 273 SSGVFTGRCGTALDHGVVAVGYGTDNGHDFWIVRNSWGADWGEEGYIRLERNLGNSRSGK 332
Query: 327 CGILMQASYPVK 338
CGI ++ SYP+K
Sbjct: 333 CGIAIEPSYPIK 344
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 290 bits (743), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 149/314 (47%), Positives = 197/314 (62%), Gaps = 33/314 (10%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFA 111
+ + + W+ ++ R Y + E +RRF ++ N++YID N+ SF+L N+FA
Sbjct: 35 EEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHSFRLGLNRFA 94
Query: 112 DLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSC 166
DL+NEE+ STYLG + R S +Y LP +VDWRK+GAV +KDQG CGSC
Sbjct: 95 DLTNEEYRSTYLGARTKPDRERKLSARYQADDNEELPETVDWRKKGAVAAIKDQGGCGSC 154
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSA+AAVEGIN++ TG ++ LSEQELVDCD S N+GCNGG M+ AFEFI GG+ +
Sbjct: 155 WAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT-SYNEGCNGGLMDYAFEFIINNGGIDS 213
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E+DYPY+ +++RC +K VTI GYE +P AFQ
Sbjct: 214 EEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQ 273
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
LY G+F CG L+HGV VGYG ++G+ YWLV+NSWGT WGE GYIRM RN +S+
Sbjct: 274 LYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGTVWGEDGYIRMERNIKASS- 332
Query: 325 GICGILMQASYPVK 338
G CGI ++ SYP K
Sbjct: 333 GKCGIAVEPSYPTK 346
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 203/336 (60%), Gaps = 32/336 (9%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+S L+ LG+ A S Q SM ER E W+ +Y + Y E ++RF I+ N
Sbjct: 10 ISFALVLCLGLWAFQVSSRTLQD---ASMHERHEQWMARYGKVYKDLQEKEKRFNIFQEN 66
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQY--LGLPA 144
V+YI+ N+ N +KL N+F DL+N+EFI+T + + R + +Y + P+
Sbjct: 67 VKYIEASNNAGNKPYKLGVNQFTDLTNKEFIATRNKFKGHMSSSITRTTTFKYENVTAPS 126
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+VDWR+EGAVTPVK+QG CG CWAFSAVAA EGI+KL TG LVSLSEQELVDCD + +Q
Sbjct: 127 TVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQ 186
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA-- 262
GC GG M+ AF+FI + GG+ TE YPY+G + C T++ H TITGYE +P+
Sbjct: 187 GCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPSNNEQA 246
Query: 263 --------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
FQ Y GVF CG QL+HGV VVGYG D G KYWLVKN
Sbjct: 247 LQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGTKYWLVKN 306
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWG WGE GYIRM R+ + G+CGI MQ SYP
Sbjct: 307 SWGEDWGEEGYIRMQRDVEAPE-GLCGIAMQPSYPT 341
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 290 bits (742), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 154/310 (49%), Positives = 194/310 (62%), Gaps = 34/310 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFIS 120
+E WL ++ R Y + E +RRF I+ N+++ID NS N S+KL NKFADLSN+E+ S
Sbjct: 25 YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYRS 84
Query: 121 TYLG---------YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
YLG P +E R+ + LP +VDWR++GAV PVKDQGQCGSCWAFS
Sbjct: 85 VYLGTRMDGKGRLLGGPKSE-RYLFKEGDDLPETVDWREKGAVAPVKDQGQCGSCWAFST 143
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
V AVEGIN++ TG L SLSEQELVDCD + N GCNGG M+ AF+FI + GG+ TE+DYP
Sbjct: 144 VGAVEGINQIVTGNLTSLSEQELVDCD-KTYNLGCNGGLMDYAFDFIIENGGIDTEEDYP 202
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHG 269
Y+ + C ++ VTI GYE +P FQLY G
Sbjct: 203 YKAIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQLYQSG 262
Query: 270 VFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
VF CG QL+HGV VGYG +HG YW+V+NSWG +WGE GYIRM R+ S+ G CGI
Sbjct: 263 VFTGSCGTQLDHGVVTVGYGTEHGVDYWIVRNSWGPAWGENGYIRMERDVASTETGKCGI 322
Query: 330 LMQASYPVKR 339
M+ASYP K+
Sbjct: 323 AMEASYPTKK 332
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 160/317 (50%), Positives = 195/317 (61%), Gaps = 38/317 (11%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFAD 112
DP +M R E W+ + R Y E+E Q RF I+ +NV YID N++ + S+ L NKFAD
Sbjct: 48 DP-TMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLEVNKFAD 106
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQYLGL---------PASVDWRKEGAVTPVKDQGQC 163
L+N+EF ++ GY K +P S GL P VDWRKEGAVTPVKDQG C
Sbjct: 107 LTNDEFRASRNGYKK---QPDSDSHVVSGLFRYANVSAVPDEVDWRKEGAVTPVKDQGDC 163
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
G CWAFSAVAA+EGINKL+ GKLVSLSEQELVDCD++ +QGC GG ME AF+FI K G
Sbjct: 164 GCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFIEKRKG 223
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------Y 261
+ E YPY G++ C T K A I+G+E +PA Y
Sbjct: 224 LAAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAIDASGY 283
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
FQ YS GVF CG +L+H +T VGYG G KYWL+KNSWG SWGE GYIR+ R+S
Sbjct: 284 EFQFYSGGVFTGSCGTELDHAITAVGYGATMDGTKYWLMKNSWGASWGENGYIRIKRDSL 343
Query: 321 SSNIGICGILMQASYPV 337
+ G+CGI M SYPV
Sbjct: 344 AKE-GLCGIAMDPSYPV 359
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 147/311 (47%), Positives = 192/311 (61%), Gaps = 28/311 (9%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFA 111
Y+ S++ER E W+ +Y + Y E ++RF I+ NV++I+ N+ N +KL+ N A
Sbjct: 31 YESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLA 90
Query: 112 DLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
DL+ +EF ++ GY K E S +Y +P +VDWR +GAVTP+KDQGQCGSCWA
Sbjct: 91 DLTLDEFKASRNGYKKIDREFATTSFKYENVTAIPEAVDWRVKGAVTPIKDQGQCGSCWA 150
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS VAA+EGIN++ TGKL+SLSEQELVDCD E+QGC GG ME FEFI K GG+T+E
Sbjct: 151 FSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEFIIKNGGITSET 210
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLY 266
+YPY+ + C T A ITGYE +P + +F Y
Sbjct: 211 NYPYKAADGSCSAATTAPVA-KITGYEKVPVNSEISLLKAVANQPISVSIDASDSSFMFY 269
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
S G++ CG +L+HGVT VGYG +G YW+VKNSWGT WGE GYIRM R G+
Sbjct: 270 SSGIYTGECGTELDHGVTAVGYGSANGTDYWIVKNSWGTVWGEKGYIRMQRGIADKE-GL 328
Query: 327 CGILMQASYPV 337
CGI M +SYP
Sbjct: 329 CGIAMDSSYPT 339
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 194/310 (62%), Gaps = 30/310 (9%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSN 115
SM ER E W+ +Y + Y E ++RF I+ NV YI+ + N+ N +KL N+FADL+N
Sbjct: 52 SMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTN 111
Query: 116 EEFISTYLGYNKPYNEP--RWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
EEFI+ + R + +Y +P++VDWR++GAVTP+KDQGQCG CWAFS
Sbjct: 112 EEFIAPRNRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFS 171
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
AVAA EGI+ L +GKL+SLSEQELVDCD +QGC GG M+ AF+F+ + G+ TE +Y
Sbjct: 172 AVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANY 231
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSH 268
PY+G + +C ++ + VTITGYE +PA FQ Y
Sbjct: 232 PYKGVDGKCNANEAANDVVTITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKS 291
Query: 269 GVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVF CG +L+HGVT VGYG + G +YWLVKNSWGT WGE GYIRM R S G+C
Sbjct: 292 GVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEE-GLC 350
Query: 328 GILMQASYPV 337
GI MQASYP
Sbjct: 351 GIAMQASYPT 360
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 290 bits (741), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 157/308 (50%), Positives = 195/308 (63%), Gaps = 32/308 (10%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFADLSNE 116
M R E W+ QY R Y +E E +RF I+ NV+YI+ N +KL N FADL+N+
Sbjct: 35 MVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADLTNQ 94
Query: 117 EFISTYLGYNKPY----NEP-RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
EF ++ GY P+ N P R+ +V +P +VDWR +GAVTPVKDQGQCG CWAFSA
Sbjct: 95 EFKASRNGYKLPHDCSSNTPFRYENVS--SVPTTVDWRTKGAVTPVKDQGQCGCCWAFSA 152
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
VAA+EGI KL TG L+SLSEQELVDCDV +QGC GG M+ AF FI G+TTE +YP
Sbjct: 153 VAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSFIINNKGLTTESNYP 212
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHG 269
Y+G + C+ K+ + A I+GYE +PA FQ YS G
Sbjct: 213 YQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGSDFQFYSSG 272
Query: 270 VFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
VF CG +L+HGVT VGYG + G KYWLVKNSWGTSWGE GYIRM ++ + G+CG
Sbjct: 273 VFTGECGTELDHGVTAVGYGIAEDGSKYWLVKNSWGTSWGEKGYIRMQKDIEAKE-GLCG 331
Query: 329 ILMQASYP 336
I MQ+SYP
Sbjct: 332 IAMQSSYP 339
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 194/310 (62%), Gaps = 30/310 (9%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSN 115
SM ER E W+ +Y + Y E ++RF I+ NV YI+ + N+ N +KL N+FADL+N
Sbjct: 34 SMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTN 93
Query: 116 EEFISTYLGYNKPYNEP--RWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
EEFI+ + R + +Y +P++VDWR++GAVTP+KDQGQCG CWAFS
Sbjct: 94 EEFIAPRNRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFS 153
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
AVAA EGI+ L +GKL+SLSEQELVDCD +QGC GG M+ AF+F+ + G+ TE +Y
Sbjct: 154 AVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANY 213
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSH 268
PY+G + +C ++ + A TITGYE +PA FQ Y
Sbjct: 214 PYKGVDGKCNVNEAANDAATITGYEDVPANNEKALQKAVANQPVSVAIDASGSDFQFYKS 273
Query: 269 GVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVF CG +L+HGVT VGYG + G +YWLVKNSWGT WGE GYIRM R S G+C
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGVSNDGTEYWLVKNSWGTEWGEEGYIRMQRGVNSEE-GLC 332
Query: 328 GILMQASYPV 337
GI MQASYP
Sbjct: 333 GIAMQASYPT 342
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 206/354 (58%), Gaps = 37/354 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER----FENWLKQYSREYGSE 76
M++M+ + S + L + ++ + +P K + + +E WL ++ + Y
Sbjct: 10 MKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGL 69
Query: 77 DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP---- 132
E +RF I+ N+++ID N N +++L +FADL+NEE+ S +LG N
Sbjct: 70 GEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKL 129
Query: 133 ------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKL 186
R+ LP SVDWRKEGAV VKDQ CGSCWAFSA+AAVEGINK+ TG L
Sbjct: 130 GGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDL 189
Query: 187 VSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKH 246
+SLSEQELVDCD S N+GCNGG M+ AFEFI GG+ +EDDYPY+ + RC ++
Sbjct: 190 ISLSEQELVDCDT-SYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNA 248
Query: 247 HAVTITGYEAIPA----------------------RYAFQLYSHGVFDEYCGHQLNHGVT 284
VTI YE +PA FQLY +GVF CG L+HGV
Sbjct: 249 KVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVA 308
Query: 285 VVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VGYG ++G+ YW+V+NSWG SWGE GYIR+ RN SS G CGI ++ SYP+K
Sbjct: 309 AVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIK 362
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 150/321 (46%), Positives = 203/321 (63%), Gaps = 34/321 (10%)
Query: 49 YPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFK 104
Y ++ D ++ + W+ + R Y + +RR+ ++ N++YID N+ SF+
Sbjct: 32 YGERTDEEA-RRMYAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHSFR 90
Query: 105 LTDNKFADLSNEEFISTYLG-YNKPYNE----PRWPSVQYLGLPASVDWRKEGAVTPVKD 159
L N+FADL+N+E+ +TYLG +P + R+ + LP SVDWR +GAV VKD
Sbjct: 91 LGLNRFADLTNDEYPATYLGARTRPQRDRKLGARYHAADNEDLPESVDWRAKGAVAEVKD 150
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QG CG+CWAFS +AAVEGIN++ TG L+SLSEQELVDCD S NQGCNGG M+ AFEFI
Sbjct: 151 QGSCGTCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT-SYNQGCNGGLMDYAFEFII 209
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------- 260
GG+ TE DYPY+G + RC ++ VTI YE +PA
Sbjct: 210 NNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIE 269
Query: 261 ---YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
AFQLYS G+F CG +L+HGVT VGYG ++G+ YW+VKNSWG+SWGE+GY+RM R
Sbjct: 270 AAGTAFQLYSSGIFTGSCGTRLDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMER 329
Query: 318 NSPSSNIGICGILMQASYPVK 338
N +S+ G CGI ++ SYP+K
Sbjct: 330 NIKASS-GKCGIAVEPSYPLK 349
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 154/354 (43%), Positives = 206/354 (58%), Gaps = 37/354 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER----FENWLKQYSREYGSE 76
M++M+ + S + L + ++ + +P K + + +E WL ++ + Y
Sbjct: 10 MKLMIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKEVLTMYEEWLVKHGKSYNGL 69
Query: 77 DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP---- 132
E +RF I+ N+++ID N N +++L +FADL+NEE+ S +LG N
Sbjct: 70 GEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKL 129
Query: 133 ------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKL 186
R+ LP SVDWRKEGAV VKDQ CGSCWAFSA+AAVEGINK+ TG L
Sbjct: 130 GGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDL 189
Query: 187 VSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKH 246
+SLSEQELVDCD S N+GCNGG M+ AFEFI GG+ +EDDYPY+ + RC ++
Sbjct: 190 ISLSEQELVDCDT-SYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNA 248
Query: 247 HAVTITGYEAIPA----------------------RYAFQLYSHGVFDEYCGHQLNHGVT 284
VTI YE +PA FQLY +GVF CG L+HGV
Sbjct: 249 KVVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVA 308
Query: 285 VVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VGYG ++G+ YW+V+NSWG SWGE GYIR+ RN SS G CGI ++ SYP+K
Sbjct: 309 AVGYGTENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIK 362
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 156/322 (48%), Positives = 204/322 (63%), Gaps = 38/322 (11%)
Query: 53 YDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W ++ + Y + E +RR+ + N++YID N+ SF+L
Sbjct: 28 YGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRL 87
Query: 106 TDNKFADLSNEEFISTYLGY-NKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKD 159
N+FADL+NEE+ TYLG NKP E R S +YL LP SVDWR +GAV +KD
Sbjct: 88 GLNRFADLTNEEYRDTYLGLRNKPRRE-RKVSDRYLAADNEALPESVDWRTKGAVAEIKD 146
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QG CGSCWAFSA+AAVEGIN++ TG L+SLSEQELVDCD S N+GCNGG M+ AF+FI
Sbjct: 147 QGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT-SYNEGCNGGLMDYAFDFII 205
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------- 260
GG+ TEDDYPY+GK++RC ++ VTI YE +
Sbjct: 206 NNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIE 265
Query: 261 ---YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
AFQLYS G+F CG L+HGV VGYG ++G+ YW+V+NSWG SWGE+GY+RM R
Sbjct: 266 AGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMER 325
Query: 318 NSPSSNIGICGILMQASYPVKR 339
N +S+ G CGI ++ SYP+K+
Sbjct: 326 NIKASS-GKCGIAVEPSYPLKK 346
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 146/313 (46%), Positives = 192/313 (61%), Gaps = 32/313 (10%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D +M R E W+ QY R Y + E RRF ++ +NV +I+ N+ N F L N+FADL
Sbjct: 29 DDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVNQFADL 88
Query: 114 SNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+N+EF ST + R P+ V LPA++DWR +G VTP+KDQGQCG CW
Sbjct: 89 TNDEFRSTKTNKGFIPSTTRVPTGFRYENVNIDALPATMDWRTKGVVTPIKDQGQCGCCW 148
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA+EGI KL TGKL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+TTE
Sbjct: 149 AFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTE 208
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
+YPY +D+C++ + +I GYE +PA FQ
Sbjct: 209 SNYPYAAADDKCKS--VSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQF 266
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE G++RM ++ S
Sbjct: 267 YKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKD-ISDKR 325
Query: 325 GICGILMQASYPV 337
G+CG+ M+ SYP
Sbjct: 326 GMCGLAMEPSYPT 338
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 156/322 (48%), Positives = 204/322 (63%), Gaps = 38/322 (11%)
Query: 53 YDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W ++ + Y + E +RR+ + N++YID N+ SF+L
Sbjct: 28 YGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRL 87
Query: 106 TDNKFADLSNEEFISTYLGY-NKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKD 159
N+FADL+NEE+ TYLG NKP E R S +YL LP SVDWR +GAV +KD
Sbjct: 88 GLNRFADLTNEEYRDTYLGLRNKPRRE-RKVSDRYLAADNEALPESVDWRTKGAVAEIKD 146
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QG CGSCWAFSA+AAVEGIN++ TG L+SLSEQELVDCD S N+GCNGG M+ AF+FI
Sbjct: 147 QGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT-SYNEGCNGGLMDYAFDFII 205
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------- 260
GG+ TEDDYPY+GK++RC ++ VTI YE +
Sbjct: 206 NNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIE 265
Query: 261 ---YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
AFQLYS G+F CG L+HGV VGYG ++G+ YW+V+NSWG SWGE+GY+RM R
Sbjct: 266 AGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMER 325
Query: 318 NSPSSNIGICGILMQASYPVKR 339
N +S+ G CGI ++ SYP+K+
Sbjct: 326 NIKASS-GKCGIAVEPSYPLKK 346
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 156/322 (48%), Positives = 204/322 (63%), Gaps = 38/322 (11%)
Query: 53 YDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W ++ + Y + E +RR+ + N++YID N+ SF+L
Sbjct: 29 YGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRL 88
Query: 106 TDNKFADLSNEEFISTYLGY-NKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKD 159
N+FADL+NEE+ TYLG NKP E R S +YL LP SVDWR +GAV +KD
Sbjct: 89 GLNRFADLTNEEYRDTYLGLRNKPRRE-RKVSDRYLAADNEALPESVDWRTKGAVAEIKD 147
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QG CGSCWAFSA+AAVEGIN++ TG L+SLSEQELVDCD S N+GCNGG M+ AF+FI
Sbjct: 148 QGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT-SYNEGCNGGLMDYAFDFII 206
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------- 260
GG+ TEDDYPY+GK++RC ++ VTI YE +
Sbjct: 207 NNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIE 266
Query: 261 ---YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
AFQLYS G+F CG L+HGV VGYG ++G+ YW+V+NSWG SWGE+GY+RM R
Sbjct: 267 AGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMER 326
Query: 318 NSPSSNIGICGILMQASYPVKR 339
N +S+ G CGI ++ SYP+K+
Sbjct: 327 NIKASS-GKCGIAVEPSYPLKK 347
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 199/335 (59%), Gaps = 34/335 (10%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
++LFLL LGIP + +K SM ER E W+ +Y + Y E ++RF I+
Sbjct: 10 TIALFLLLALGIP-----QMMSRKLHETSMRERHEQWMAEYGKVYKDAAEKEKRFLIFKH 64
Query: 89 NVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV--QYLGLPAS 145
NV++I+ N+ N +KL N ADL+ EEF ++ G +PY P +PA+
Sbjct: 65 NVEFIESFNAAANKPYKLGVNHLADLTVEEFKASRNGLKRPYELSTTPFKYENVTAIPAA 124
Query: 146 VDWRKEGAVTPVKDQGQC-GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+DWR +GAVT +KDQGQC GSCWAFS VAA EGI+++ TGKLVSLSEQELVDCD +Q
Sbjct: 125 IDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCDTKGVDQ 184
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY--- 261
GC GGYME FEFI K GG+T+E +YPY+ + +C +K I GYE +P
Sbjct: 185 GCEGGYMEDGFEFIIKNGGITSEANYPYKAVDGKC--NKATSPVAQIKGYEKVPPNSEKT 242
Query: 262 -------------------AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNS 302
F YS G+++ CG +L+HGVT VGYG +G YWLVKNS
Sbjct: 243 LQKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGYGIANGTDYWLVKNS 302
Query: 303 WGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
WGT WGE GY+RM R + + G+CGI + +SYP
Sbjct: 303 WGTQWGEKGYVRMQRGVAAKH-GLCGIALDSSYPT 336
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 149/307 (48%), Positives = 193/307 (62%), Gaps = 28/307 (9%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSNE 116
M +R E W+ Q+ R YG E ++R+ I+ N++ I+ + N + +KL NKFADL+NE
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 117 EFISTYLGYNKPYNEPRWPSVQYLGL---PASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
EF + + GY + ++ S ++ L P S+DWRK GAVTPVKDQG CG CWAFSAVA
Sbjct: 61 EFRAMHHGYKRQSSKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCCWAFSAVA 120
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
A+EGI KLKTGKL+SLSEQ+LVDCDV +QGC GG M+ AF+FI + GG+T+E YPY+
Sbjct: 121 AIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSEATYPYQ 180
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
G + C++ KT ITGYE +P Y FQ Y GVF
Sbjct: 181 GVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQFYKSGVF 240
Query: 272 DEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
CG L+H VT +GYG + G YWLVKNSWGTSWGE+GY+RM R + G+CG+
Sbjct: 241 KGDCGTYLDHAVTAIGYGTNSDGTNYWLVKNSWGTSWGESGYMRMQRGIGARE-GLCGVA 299
Query: 331 MQASYPV 337
M ASYP
Sbjct: 300 MDASYPT 306
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 196/313 (62%), Gaps = 32/313 (10%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D +M R E W+ QY R Y + E RRF ++ +NV +I+ N+ N +F L N+FADL
Sbjct: 29 DDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGVNQFADL 88
Query: 114 SNEEF--ISTYLGY----NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+N+EF + T G+ + R+ +V LPA+VDWR +GAVTP+KDQGQCG CW
Sbjct: 89 TNDEFRWMKTNKGFIPSTTRVPTGFRYENVNIDALPATVDWRTKGAVTPIKDQGQCGCCW 148
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA+EGI KL TGKL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+TTE
Sbjct: 149 AFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTE 208
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
+YPY +D+C++ + +I GYE +PA FQ
Sbjct: 209 SNYPYAAADDKCKS--VSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQF 266
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE G++RM ++ S
Sbjct: 267 YKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKD-ISDKR 325
Query: 325 GICGILMQASYPV 337
G+CG+ M+ SYP
Sbjct: 326 GMCGLAMEPSYPT 338
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 159/344 (46%), Positives = 203/344 (59%), Gaps = 38/344 (11%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKY--DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYS 87
L L L+ L I A Y K + + ++ W +S S +E ++RF ++
Sbjct: 4 LLLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHSVPR-SLNEREKRFNVFR 62
Query: 88 SNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNE----PRWPSVQYL--- 140
NV ++ N +N S+KL NKFADL+ EF + Y G N ++ P+ S Q++
Sbjct: 63 HNVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQFMYDH 122
Query: 141 ----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
LP+SVDWRK+GAVT +K+QG+CGSCWAFS VAAVEGINK+KT KLVSLSEQELVD
Sbjct: 123 ENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182
Query: 197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
CD +N+GCNGG ME AFEFI K GG+TTED YPY G + +C K VTI G+E
Sbjct: 183 CDT-KQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHED 241
Query: 257 IP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE 294
+P FQ YS GVF CG +LNHGV VGYG + G+
Sbjct: 242 VPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGSERGK 301
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
KYW+V+NSWG WGE GYI++ R G CGI M+ASYP+K
Sbjct: 302 KYWIVRNSWGAEWGEGGYIKIEREIDEPE-GRCGIAMEASYPIK 344
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 152/321 (47%), Positives = 203/321 (63%), Gaps = 36/321 (11%)
Query: 53 YDPQSMEE---RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W+ + R Y + E +RRF ++ N++Y+D N+ SF+L
Sbjct: 30 YGERSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHSFRL 89
Query: 106 TDNKFADLSNEEFISTYLGY-NKPYNEPR----WPSVQYLGLPASVDWRKEGAVTPVKDQ 160
N+FADL+NEE+ TYLG KP E R + + LP SVDWR++GAV VKDQ
Sbjct: 90 GLNRFADLTNEEYRDTYLGVRTKPVRERRLSGRYQAADNEELPESVDWREKGAVAKVKDQ 149
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G CGSCWAFSA+AAVEGIN++ TG +++LSEQELVDCD S NQGCNGG M+ AFEFI
Sbjct: 150 GGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDT-SYNQGCNGGLMDYAFEFIIN 208
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------------- 260
GG+ +E+DYPY+ +++RC +K VTI GYE +P
Sbjct: 209 NGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPISVAIEA 268
Query: 261 --YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
AFQLY G+F CG L+HGVT VGYG ++G+ YW+VKNSWGT WGE GY+R+ RN
Sbjct: 269 GGRAFQLYKSGIFTGRCGTALDHGVTAVGYGSENGKDYWIVKNSWGTVWGEDGYVRLERN 328
Query: 319 SPSSNIGICGILMQASYPVKR 339
+++ G CGI ++ SYP+K+
Sbjct: 329 IKATS-GKCGIAIEPSYPLKK 348
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 150/316 (47%), Positives = 195/316 (61%), Gaps = 32/316 (10%)
Query: 51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
++ D +M R E W++QY R Y E RRF I+ +NV +I+ N+ N F L+ N+F
Sbjct: 26 EQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKFWLSVNQF 85
Query: 111 ADLSNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCG 164
ADL+N EF +T + R P+ V LPA+VDWR +GAVTP+KDQGQCG
Sbjct: 86 ADLTNYEFRATKTNKGFIPSTVRVPTTFRYENVSIDTLPATVDWRTKGAVTPIKDQGQCG 145
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
CWAFSAVAA+EGI KL TGKL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+
Sbjct: 146 CCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGL 205
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE YPY + +C + + A TI GYE +PA
Sbjct: 206 TTESKYPYTAADGKC--NGGSNSAATIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMT 263
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GV CG L+HG+ +GYG+D G +YWL+KNSWGT+WGE G++RM ++ S
Sbjct: 264 FQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKNSWGTTWGENGFLRMEKD-IS 322
Query: 322 SNIGICGILMQASYPV 337
G+CG+ M+ SYP
Sbjct: 323 DKRGMCGLAMEPSYPT 338
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 155/338 (45%), Positives = 200/338 (59%), Gaps = 34/338 (10%)
Query: 26 RNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
+ +L+L LL + S+ + SM ER E W+K+Y + Y E Q+R I
Sbjct: 7 KQHILALVLLLSI-----CTSQVMSRNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLI 61
Query: 86 YSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV--QYLGL 142
+ NV++I+ N+ N +KL+ N AD +NEEF++++ GY + + P G+
Sbjct: 62 FKDNVEFIESFNAAGNRPYKLSINHLADQTNEEFVASHNGYKHKGSHSQTPFKYENVTGV 121
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P +VDWR+ GAVT VKDQGQCGSCWAFS VAA EGI ++ T L+SLSEQELVDCD S
Sbjct: 122 PNAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCD--SV 179
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-- 260
+ GC+GGYME FEFI K GG+++E +YPY + C +K A I GYE +PA
Sbjct: 180 DHGCDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSE 239
Query: 261 --------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLV 299
AFQ YS GVF CG QL+HGVT VGYG D G +YW+V
Sbjct: 240 DALQKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGTQYWIV 299
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWGT WGE GYIRM R + + G+CGI M ASYP
Sbjct: 300 KNSWGTQWGEEGYIRMQRGTDAQE-GLCGIAMDASYPT 336
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 158/337 (46%), Positives = 201/337 (59%), Gaps = 32/337 (9%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L + LG+ A + Q D + E+ E W+ Y + Y E + R I+ N
Sbjct: 11 ISLALFFCLGLFAIQVTSRTLQ--DDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKEN 68
Query: 90 VQYIDYINS--QNLSFKLTDNKFADLSNEEFISTYLGYN----KPYNEPRWPSVQYLGLP 143
V YI+ N+ N +KL N+FADL+NEEFI++ + + + +P
Sbjct: 69 VNYIEASNNAGNNKLYKLGINQFADLTNEEFIASRNKFKGHMCSSITKTSTFKYENASVP 128
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
++VDWRK+GAVTPVK+QGQCG CWAFSAVAA EGI+KL TGKLVSLSEQELVDCD +
Sbjct: 129 STVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVD 188
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA- 262
QGC GG M+ AF+FI + G+ TE YPY+G + C +K HAVTITGYE +PA
Sbjct: 189 QGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQ 248
Query: 263 ---------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVK 300
FQ Y GVF CG +L+HGVT VGYG + G KYWLVK
Sbjct: 249 ALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVK 308
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWGT WGE GYI+M R ++ G+CGI M+ASYP
Sbjct: 309 NSWGTDWGEEGYIKMQRGVDAAE-GLCGIAMEASYPT 344
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 153/331 (46%), Positives = 206/331 (62%), Gaps = 34/331 (10%)
Query: 38 LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN 97
LGIP S+ + Q+ D + + +E+WL + + Y + E +RRF I+ N+++ID N
Sbjct: 40 LGIPEIPHSDAH-QRPD-EEVAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHN 97
Query: 98 SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ------YLG--LPASVDWR 149
++ ++K+ +FADL+NEE+ + +LG + +PR + + LG LP VDWR
Sbjct: 98 RESRTYKVGLTRFADLTNEEYRARFLG-GRFSRKPRLSAAKSGRYAAALGDDLPDDVDWR 156
Query: 150 KEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGG 209
K+GAV VKDQGQCGSCWAFS+VAAVEGIN++ TG+L+ LSEQELVDCD S N GCNGG
Sbjct: 157 KKGAVATVKDQGQCGSCWAFSSVAAVEGINQIVTGELIPLSEQELVDCD-KSFNMGCNGG 215
Query: 210 YMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR--------- 260
M+ AF+FI GG+ TE+DYPY+G++ C ++ VTI GYE +P
Sbjct: 216 LMDYAFQFIIGNGGIDTEEDYPYKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAV 275
Query: 261 -------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSW 307
AFQLY GVF CG L+HGV VGYG D+G YW+V+NSWG W
Sbjct: 276 ANQPVSVAIEAGGRAFQLYQSGVFTGRCGTDLDHGVVAVGYGTDNGTDYWIVRNSWGKDW 335
Query: 308 GEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GE+GYIR+ RN + G CGI +Q SYP K
Sbjct: 336 GESGYIRLERNVANITTGKCGIAVQPSYPTK 366
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 154/320 (48%), Positives = 198/320 (61%), Gaps = 36/320 (11%)
Query: 53 YDPQSMEE---RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W+ ++ Y + E +RRF + N++YID N+ SF+L
Sbjct: 31 YGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRL 90
Query: 106 TDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQ 160
N+FADL+NEE+ STYLG + R S +Y LP SVDWRK+GAV VKDQ
Sbjct: 91 GLNRFADLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPESVDWRKKGAVGAVKDQ 150
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G CGSCWAFSA+AAVEGIN++ TG ++ LSEQELVDCD S NQGCNGG M+ AFEFI
Sbjct: 151 GGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT-SYNQGCNGGLMDYAFEFIIN 209
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------------- 260
GG+ +E+DYPY+ +++RC +K VTI GYE +P
Sbjct: 210 NGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEA 269
Query: 261 --YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
AFQLY G+F CG L+HGV VGYG ++G+ YWLV+NSWG+ WGE GYIRM RN
Sbjct: 270 GGRAFQLYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGSVWGEDGYIRMERN 329
Query: 319 SPSSNIGICGILMQASYPVK 338
+S+ G CGI ++ SYP K
Sbjct: 330 IKASS-GKCGIAVEPSYPTK 348
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 194/313 (61%), Gaps = 32/313 (10%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D +M R E W+ QY R Y + E RRF ++ +NV +I+ N+ N +F L N+FADL
Sbjct: 29 DDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGVNQFADL 88
Query: 114 SNEEF--ISTYLGYNKPYNEP----RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+N+EF T G+ R+ +V LPA+VDWR +GAVTP+KDQGQCG CW
Sbjct: 89 TNDEFRWTKTNKGFIPSTTRVPTGFRYENVNIDALPATVDWRTKGAVTPIKDQGQCGCCW 148
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA+EGI KL TGKL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+TTE
Sbjct: 149 AFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTE 208
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
+YPY +D+C++ + +I GYE +PA FQ
Sbjct: 209 SNYPYAAADDKCKS--VSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQF 266
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE G++RM ++ S
Sbjct: 267 YKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGTTWGENGFLRMEKD-ISDKR 325
Query: 325 GICGILMQASYPV 337
G+CG+ M+ SYP
Sbjct: 326 GMCGLAMEPSYPT 338
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 160/344 (46%), Positives = 204/344 (59%), Gaps = 38/344 (11%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKY--DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYS 87
L L L+ L I A Y K + + + ++ W +S S E ++RF ++
Sbjct: 4 LLLIFLFSLVILETACGFDYEDKEIESEEGLSKLYDRWRSHHSVPR-SLHEREKRFNVFR 62
Query: 88 SNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG----YNKPYNEPRWPSVQYL--- 140
NV ++ N +N S+KL NKFADL+ EF + Y G +++ P+ S Q++
Sbjct: 63 HNVMHVHNSNKKNRSYKLKLNKFADLTIHEFKNAYTGSKIKHHRMLQGPKRGSKQFMYDH 122
Query: 141 ----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
LP+SVDWRK+GAVT +K+QG+CGSCWAFS VAAVEGINK+KT KLVSLSEQELVD
Sbjct: 123 ENVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELVD 182
Query: 197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
CD N +N+GCNGG ME AFEFI K GG+TTED YPY G + +C K VTI G+E
Sbjct: 183 CDTN-QNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEN 241
Query: 257 IP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE 294
+P FQ YS GVF CG +LNHGV VGYG G+
Sbjct: 242 VPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGYGSQGGK 301
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
KYW+V+NSWGT WGE GYI++ R G CGI M+ASYP+K
Sbjct: 302 KYWIVRNSWGTEWGEGGYIKIERGIDEPE-GRCGIAMEASYPIK 344
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 157/327 (48%), Positives = 196/327 (59%), Gaps = 45/327 (13%)
Query: 56 QSMEERFENWLKQY-------SREYGSED-EWQRRFGIYSSNVQYIDYINSQN-LSFKLT 106
+S+ +E W +Y S G++D E +RRF ++ N +YI N + F+L
Sbjct: 36 ESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFRLA 95
Query: 107 DNKFADLSNEEFISTYLGYNKPYNEP-------RWPSVQYLG-----LPASVDWRKEGAV 154
NKFAD++ +EF TY G ++ S +Y G LP +VDWR+ GAV
Sbjct: 96 LNKFADMTTDEFRRTYAGSRARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWRERGAV 155
Query: 155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
T +KDQGQCGSCWAFSAVAAVEG+NK+KTG+LV+LSEQELVDCD +NQGC+GG M+ A
Sbjct: 156 TGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDT-GDNQGCDGGLMDYA 214
Query: 215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------- 260
F+FI + GG+TTE +YPYR + RC K H VTI GYE +PA
Sbjct: 215 FQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQPV 274
Query: 261 --------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAG 311
FQ YS GVF CG L+HGV VGYG G KYW+VKNSWG WGE G
Sbjct: 275 AVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERG 334
Query: 312 YIRMARNSPSSNIGICGILMQASYPVK 338
YIRM R S + G+CGI M+ASYPVK
Sbjct: 335 YIRMQRGVSSDSNGLCGIAMEASYPVK 361
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 153/315 (48%), Positives = 194/315 (61%), Gaps = 47/315 (14%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E WL ++ + Y E Q RF I+ N++++D NS+NLSFKL N+FADL+NEE+ S
Sbjct: 43 YETWLVKHGKNYNGLGEKQLRFNIFKDNLRFVDERNSENLSFKLGLNRFADLTNEEYRSV 102
Query: 122 YLGYNKPYNEPRWPSVQYLG--------------LPASVDWRKEGAVTPVKDQGQCGSCW 167
YLG PR +V G LP SVDWRK+GAV +KDQG CGSCW
Sbjct: 103 YLG-----TRPRSVAVARSGRSKSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCW 157
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSA+AAVEG+N++ TG L+SLSEQELV+CD S N GC+GG M+ AFEFI K G+ ++
Sbjct: 158 AFSAIAAVEGVNQIVTGDLISLSEQELVECDT-SYNDGCDGGLMDYAFEFIIKNEGIDSD 216
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQL 265
+DYPY G++ RC T++ VTI YE P FQL
Sbjct: 217 EDYPYTGRDGRCDTNRKNAKVVTIDDYEDSPVYDEKSLQKAVANQPVSVAIEGGGRDFQL 276
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS--PSSN 323
Y GVF CG L+HGV VVGYG + G YW+V+NSWG +WGE GYIRM RN+ PS
Sbjct: 277 YDSGVFTGKCGTALDHGVAVVGYGTEDGLDYWIVRNSWGDTWGEGGYIRMQRNTKLPS-- 334
Query: 324 IGICGILMQASYPVK 338
GICGI ++ SYP+K
Sbjct: 335 -GICGIAIEPSYPIK 348
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 154/347 (44%), Positives = 200/347 (57%), Gaps = 37/347 (10%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
M L+L + L I A A S + + E ++ WL ++ + Y DE ++R
Sbjct: 1 MATATTSLALLSFFFLSISASALS-----RRSDGEVREIYDLWLAKHGKAYNGIDEREKR 55
Query: 83 FGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKP--------YNEPRW 134
F I+ N+++ID NS+N ++K+ N FADL+NEE+ + YLG P R
Sbjct: 56 FQIFKENLKFIDDHNSENRTYKVGLNMFADLTNEEYRALYLGTRSPPARRVMKAKTASRR 115
Query: 135 PSVQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
+V L LP S+DWR GAV PVK+QG CGSCWAFS +AAVEGIN++ TG+L+SLSEQE
Sbjct: 116 YAVNNLDRLPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQE 175
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
LV CD N GCNGG M+ AF+FI GG+ TE+DYPY + +C + V+I
Sbjct: 176 LVSCD-KKYNSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDA 234
Query: 254 YEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED 291
YE +PA A QLY GVF CG L+HGV VGYG++
Sbjct: 235 YEDVPANDEESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGKE 294
Query: 292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+G YWLV+NSWGTSWGE GY ++ RN G CGI MQASYPVK
Sbjct: 295 NGVDYWLVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPVK 341
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 149/315 (47%), Positives = 192/315 (60%), Gaps = 33/315 (10%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFAD 112
D M R E W+ QYSR Y E RRF ++ +NVQ+I+ N+ N F L N+FAD
Sbjct: 122 DDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIESFNAGGNNKFWLGVNQFAD 181
Query: 113 LSNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
L+N+EF ST + + P+ V LP ++DWR +GAVTP+KDQGQCG C
Sbjct: 182 LTNDEFRSTKTNKGLKSSNMKIPTGFRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCC 241
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSAVAA EGI K+ TGKLVSL+EQELVDCDV+ E+QGC GG M+ AF+FI K GG+TT
Sbjct: 242 WAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTT 301
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E YPY + +C++ + A TI GYE +PA FQ
Sbjct: 302 ESSYPYTAADGKCKSG--SNSAATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQ 359
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE GY+RM ++ S
Sbjct: 360 FYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKD-ISDK 418
Query: 324 IGICGILMQASYPVK 338
G+CG+ M+ SYP +
Sbjct: 419 RGMCGLAMEPSYPTE 433
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 150/310 (48%), Positives = 192/310 (61%), Gaps = 30/310 (9%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSN 115
SM ER E W+ +Y + Y E ++RF ++ NV YI+ + N+ N S+KL N+FADL+N
Sbjct: 34 SMYERHEQWMTRYGKVYKDPQEREKRFRVFKENVNYIEAFNNAANKSYKLGINQFADLTN 93
Query: 116 EEFISTYLGYNKPYNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+EFI+ G+ + + P++VDWR++GAVTP+KDQGQCG CWAFS
Sbjct: 94 KEFIAPRNGFKGHMCSSIIRTTTFKFENVTATPSTVDWRQKGAVTPIKDQGQCGCCWAFS 153
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
AVAA EGI+ L GKL+SLSEQELVDCD +QGC GG M+ AF+FI + G+ TE +Y
Sbjct: 154 AVAATEGIHALSAGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEANY 213
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSH 268
PY+G + +C ++ +A TITGYE +PA FQ Y
Sbjct: 214 PYKGVDGKCNANEAAKNAATITGYEDVPANNEMALQKAVANQPVSVAIDASGSDFQFYKS 273
Query: 269 GVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVF CG +L+HGVT VGYG D G +YWLVKNSWGT WGE GYIRM R S G+C
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEE-GLC 332
Query: 328 GILMQASYPV 337
GI MQASYP
Sbjct: 333 GIAMQASYPT 342
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 150/316 (47%), Positives = 194/316 (61%), Gaps = 32/316 (10%)
Query: 51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
++ D +M R E W++QY R Y E RRF I+ +NV +I+ N+ N F L N+F
Sbjct: 26 EQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKFWLGVNQF 85
Query: 111 ADLSNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCG 164
ADL+N EF +T + R P+ V LPA+VDWR +GAVTP+KDQGQCG
Sbjct: 86 ADLTNYEFRATKTNKGFIPSTVRVPTTFRYENVSIDTLPATVDWRTKGAVTPIKDQGQCG 145
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
CWAFSAVAA+EGI KL TGKL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+
Sbjct: 146 CCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGL 205
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE YPY + +C + + A TI GYE +PA
Sbjct: 206 TTESKYPYTAADGKC--NGGSNSAATIKGYEEVPANNEAALMKAVANQPVSVAVDGGDMT 263
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GV CG L+HG+ +GYG+D G +YWL+KNSWGT+WGE G++RM ++ S
Sbjct: 264 FQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKNSWGTTWGENGFLRMEKD-IS 322
Query: 322 SNIGICGILMQASYPV 337
G+CG+ M+ SYP
Sbjct: 323 DKRGMCGLAMEPSYPT 338
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 165/357 (46%), Positives = 215/357 (60%), Gaps = 44/357 (12%)
Query: 24 MLRNAVLSLFLLWVLGIPAG-------AWSEGYPQKYDPQSMEE---RFENWLKQYSREY 73
M + +LSLF+L + + + E +P K +S EE +E+WL ++ + Y
Sbjct: 1 MAKLLILSLFVLAAVSSASASADMSIITYDEEHPAKGLSRSDEEVMALYESWLVEHGKSY 60
Query: 74 -GSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYN----- 126
G E +RF I+ N++YID NS+ + S+KL N+FADL+NEE+ STYLG
Sbjct: 61 NGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNRFADLTNEEYRSTYLGAKTDARR 120
Query: 127 ---KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKT 183
K ++ R+ LP S+DWR++GAV VKDQG CGSCWAFS +AAVEGIN++ T
Sbjct: 121 RIAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKDQGSCGSCWAFSTIAAVEGINQIVT 180
Query: 184 GKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK 243
G+L+SLSEQELVDCD S N+GCNGG M+ AFEFI K GG+ TE DYPY G+ RC +
Sbjct: 181 GELISLSEQELVDCDT-SYNEGCNGGLMDYAFEFIIKNGGIDTEADYPYTGRYGRCDQTR 239
Query: 244 TKHHAVTITGYEAI---------------PARYA-------FQLYSHGVFDEYCGHQLNH 281
V+I GYE + P A FQLYS G+F CG L+H
Sbjct: 240 KNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEAGGRDFQLYSSGIFTGSCGTDLDH 299
Query: 282 GVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GVT VGYG ++G YW+VKNSW SWGE GY+RM RN N G+CGI ++ SYP K
Sbjct: 300 GVTAVGYGTENGVDYWIVKNSWAASWGEKGYLRMQRNVKDKN-GLCGIAIEPSYPTK 355
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 157/340 (46%), Positives = 199/340 (58%), Gaps = 37/340 (10%)
Query: 31 SLFLLW--VLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
SL LW +L + GA S + D SM ER W+ ++ R Y E ++R GI+ S
Sbjct: 3 SLVCLWMALLALGLGACSPAAAELGDA-SMAERHVEWMARHGRTYKDAAEKEQRLGIFKS 61
Query: 89 NVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSVQYLG 141
NV+YI+ N+ ++L N+FADL++EEF + + G+ K N R S+
Sbjct: 62 NVEYIESFNAGKRKYQLAANQFADLTHEEFKAMHTGFKPSGTGAKKAGNGFRHGSLS--S 119
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
+P SVDWR +GAVTPVKDQG CGSCWAF+ VAAVEGI K+ TGKL+SLSEQ+LVDCDV+
Sbjct: 120 VPDSVDWRSKGAVTPVKDQGLCGSCWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHG 179
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP--- 258
++QGC GG M+ AFEFI GG+T+E +YPY C TI +E +P
Sbjct: 180 KDQGCQGGDMDAAFEFIVNNGGITSEANYPYEEVQRLCNAHNASFVVATIESHEDVPTND 239
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYW 297
+ FQLYS GVF CG L+H VTVVGYG G KYW
Sbjct: 240 EKALRKAVANQPVSVGIDAGSSLDFQLYSGGVFSGECGTDLDHAVTVVGYGTTSDGTKYW 299
Query: 298 LVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
L KNSWG +WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 300 LAKNSWGETWGENGYIRMERDVAAKE-GLCGIAMQASYPT 338
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 150/316 (47%), Positives = 194/316 (61%), Gaps = 32/316 (10%)
Query: 51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
++ D +M R E W++QY R Y E RRF I+ +NV +I+ N+ N F L N+F
Sbjct: 26 EQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKFWLGVNQF 85
Query: 111 ADLSNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCG 164
ADL+N EF +T + R P+ V LPA+VDWR +GAVTP+KDQGQCG
Sbjct: 86 ADLTNYEFRATKTNKGFIPSTVRVPTTFRYENVSIDTLPATVDWRTKGAVTPIKDQGQCG 145
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
CWAFSAVAA+EGI KL TGKL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+
Sbjct: 146 CCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGL 205
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE YPY + +C + + A TI GYE +PA
Sbjct: 206 TTESKYPYTAADGKC--NGGSNSAATIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMT 263
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GV CG L+HG+ +GYG+D G +YWL+KNSWGT+WGE G++RM ++ S
Sbjct: 264 FQFYSGGVMTGSCGTDLDHGIVAIGYGKDGDGTQYWLLKNSWGTTWGENGFLRMEKD-IS 322
Query: 322 SNIGICGILMQASYPV 337
G+CG+ M+ SYP
Sbjct: 323 DKRGMCGLAMEPSYPT 338
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 286 bits (733), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 160/359 (44%), Positives = 211/359 (58%), Gaps = 39/359 (10%)
Query: 17 IAIDMRMMLRNAVLSLFLLWV----LGIPAGAWSEGYPQKYDPQSMEER----FENWLKQ 68
I M A++ LF ++ L + ++ + K EE +E WL +
Sbjct: 6 ITTSPATMTMAAIVLLFTVFAVSSALDMSIISYDSAHADKAATLRTEEELMSMYEQWLVK 65
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNK 127
+ + Y + E ++RF I+ N+++ID NS ++ ++KL N+FADL+NEE+ + YLG
Sbjct: 66 HGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYLGTKI 125
Query: 128 PYNEP--RWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINK 180
N + PS +Y LP SVDWRKEGAV PVKDQG CGSCWAFSA+ AVEGINK
Sbjct: 126 DPNRRLGKTPSNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINK 185
Query: 181 LKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
+ TG+L+SLSEQELVDCD NQGCNGG M+ AFEFI GG+ +++DYPYRG + RC
Sbjct: 186 IVTGELISLSEQELVDCDTGY-NQGCNGGLMDYAFEFIINNGGIDSDEDYPYRGVDGRCD 244
Query: 241 TDKTKHHAVTITGYEAIPA----------------------RYAFQLYSHGVFDEYCGHQ 278
T + V+I YE +PA FQLY GVF CG
Sbjct: 245 TYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTA 304
Query: 279 LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
L+HGV VGYG G YW+V+NSWG+SWGE GYIR+ RN +S G CGI ++ SYP+
Sbjct: 305 LDHGVVAVGYGTAKGHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 363
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 286 bits (732), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 152/312 (48%), Positives = 196/312 (62%), Gaps = 31/312 (9%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLS 114
+ + +E WL ++ + Y + E ++RF I+ N+++ID NSQ + ++KL N+FADL+
Sbjct: 73 EELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNRFADLT 132
Query: 115 NEEFISTYLGYNKPYNEP--RWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCW 167
NEE+ + YLG N + PS +Y LP SVDWRKEGAV PVKDQG CGSCW
Sbjct: 133 NEEYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLPESVDWRKEGAVPPVKDQGGCGSCW 192
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSA+ AVEGINK+ TG+L+SLSEQELVDCD N+GCNGG M+ AFEFI GG+ +E
Sbjct: 193 AFSAIGAVEGINKIVTGELISLSEQELVDCDTGY-NEGCNGGLMDYAFEFIINNGGIDSE 251
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA----------------------RYAFQL 265
+DYPYRG + RC T + V+I YE +PA FQL
Sbjct: 252 EDYPYRGVDGRCDTYRKNAKVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQL 311
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
Y GVF CG L+HGV VGYG +G YW+V+NSWG SWGE GYIR+ RN +S G
Sbjct: 312 YVSGVFTGRCGTALDHGVVAVGYGTANGHDYWIVRNSWGPSWGEDGYIRLERNLANSRSG 371
Query: 326 ICGILMQASYPV 337
CGI ++ SYP+
Sbjct: 372 KCGIAIEPSYPL 383
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 286 bits (732), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 149/311 (47%), Positives = 190/311 (61%), Gaps = 35/311 (11%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
E +E W ++ S DE +RF ++ +NV Y+ N ++ +KL NKFAD++N EF
Sbjct: 36 ELYERWRSHHTVSR-SLDEKHKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFR 94
Query: 120 STYLGYNKPYNEPRWPSVQYLG---------LPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
Y G ++ + + G +P S+DWRK+GAVTPVKDQGQCGSCWAFS
Sbjct: 95 QHYAGSKIKHHRTLLGASRANGTFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFS 154
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
V AVEGIN++KT KLVSLSEQELVDCD +ENQGCNGG M+ AF+FI K GG+TTE+ Y
Sbjct: 155 TVVAVEGINQIKTKKLVSLSEQELVDCDT-TENQGCNGGLMDPAFDFIKKRGGITTEERY 213
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSH 268
PY+ ++D+C K V+I G+E +P FQ YS
Sbjct: 214 PYKAEDDKCDIQKRNTPVVSIDGHEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSE 273
Query: 269 GVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVF CG +L+HGV +VGYG G KYW+VKNSWG WGE GYIRM R + G+C
Sbjct: 274 GVFTGECGTELDHGVAIVGYGTTVDGTKYWIVKNSWGAGWGEKGYIRMQRKVDAEE-GLC 332
Query: 328 GILMQASYPVK 338
GI MQ SYP+K
Sbjct: 333 GIAMQPSYPIK 343
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 155/322 (48%), Positives = 203/322 (63%), Gaps = 38/322 (11%)
Query: 53 YDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W ++ + Y + E +RR+ + N++YID N+ SF+L
Sbjct: 28 YGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRL 87
Query: 106 TDNKFADLSNEEFISTYLGY-NKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKD 159
N+FADL+NEE+ TYLG NKP E R S +YL LP SVDWR +GAV +KD
Sbjct: 88 GLNRFADLTNEEYRDTYLGLRNKPRRE-RKVSDRYLAADNEALPESVDWRTKGAVAEIKD 146
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QG CGSCWAFSA+AAVE IN++ TG L+SLSEQELVDCD S N+GCNGG M+ AF+FI
Sbjct: 147 QGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDT-SYNEGCNGGLMDYAFDFII 205
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------- 260
GG+ TEDDYPY+GK++RC ++ VTI YE +
Sbjct: 206 NNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIE 265
Query: 261 ---YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
AFQLYS G+F CG L+HGV VGYG ++G+ YW+V+NSWG SWGE+GY+RM R
Sbjct: 266 AGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMER 325
Query: 318 NSPSSNIGICGILMQASYPVKR 339
N +S+ G CGI ++ SYP+K+
Sbjct: 326 NIKASS-GKCGIAVEPSYPLKK 346
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/306 (48%), Positives = 193/306 (63%), Gaps = 31/306 (10%)
Query: 62 FENWLKQYSREYGSED--EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+E WL ++ + E RRF I+ N++++D N +NLS++L +FADL+N+E+
Sbjct: 50 YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR 109
Query: 120 STYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
S YLG R S++Y LP S+DWRK+GAV VKDQG CGSCWAFS + A
Sbjct: 110 SKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGA 169
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGIN++ TG L++LSEQELVDCD S N+GCNGG M+ AFEFI K GG+ T+ DYPY+G
Sbjct: 170 VEGINQIVTGDLITLSEQELVDCDT-SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKG 228
Query: 235 KNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFD 272
+ C + VTI YE +P AFQLY G+FD
Sbjct: 229 VDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFD 288
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG QL+HGV VGYG ++G+ YW+V+NSWG SWGE+GY+RMARN SS+ G CGI ++
Sbjct: 289 GSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSS-GKCGIAIE 347
Query: 333 ASYPVK 338
SYP+K
Sbjct: 348 PSYPIK 353
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/306 (48%), Positives = 193/306 (63%), Gaps = 31/306 (10%)
Query: 62 FENWLKQYSREYGSED--EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+E WL ++ + E RRF I+ N++++D N +NLS++L +FADL+N+E+
Sbjct: 50 YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR 109
Query: 120 STYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
S YLG R S++Y LP S+DWRK+GAV VKDQG CGSCWAFS + A
Sbjct: 110 SKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGA 169
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGIN++ TG L++LSEQELVDCD S N+GCNGG M+ AFEFI K GG+ T+ DYPY+G
Sbjct: 170 VEGINQIVTGDLITLSEQELVDCDT-SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKG 228
Query: 235 KNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFD 272
+ C + VTI YE +P AFQLY G+FD
Sbjct: 229 VDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFD 288
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG QL+HGV VGYG ++G+ YW+V+NSWG SWGE+GY+RMARN SS+ G CGI ++
Sbjct: 289 GSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSS-GKCGIAIE 347
Query: 333 ASYPVK 338
SYP+K
Sbjct: 348 PSYPIK 353
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 150/314 (47%), Positives = 195/314 (62%), Gaps = 33/314 (10%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFA 111
+ + + W+ ++ Y E +RRF + +N++YID N+ SF+L N+FA
Sbjct: 36 EEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRFA 95
Query: 112 DLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSC 166
DL+NEE+ STYLG + R S +Y LP SVDWRK+GAV VKDQG CGSC
Sbjct: 96 DLTNEEYRSTYLGARTKPDRERKLSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSC 155
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSA+AAVEGIN++ TG ++ LSEQELVDCD S NQGCNGG M+ AFEFI GG+ +
Sbjct: 156 WAFSAIAAVEGINQIVTGDMIPLSEQELVDCDT-SYNQGCNGGLMDYAFEFIINNGGIDS 214
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E+DYPY+ +++RC +K VTI GYE +P AFQ
Sbjct: 215 EEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQ 274
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
LY G+F CG L+HGV VGYG ++G+ YWLV+NSWG+ WGE GYIRM RN +S+
Sbjct: 275 LYKSGIFTGTCGTALDHGVAAVGYGTENGKDYWLVRNSWGSVWGENGYIRMERNIKASS- 333
Query: 325 GICGILMQASYPVK 338
G CGI ++ SYP K
Sbjct: 334 GKCGIAVEPSYPTK 347
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 150/307 (48%), Positives = 188/307 (61%), Gaps = 29/307 (9%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSN 115
SM ER E W+K+Y + Y E Q+R I+ NV++I+ N+ N +KL N AD +N
Sbjct: 33 SMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLGINHLADQTN 92
Query: 116 EEFISTYLGYNKPYNEPRWPSV--QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
EEF++++ GY + + P G+P +VDWR+ GAVT VKDQGQCGSCWAFS VA
Sbjct: 93 EEFVASHNGYKHKASHSQTPFKYENVTGVPNAVDWRENGAVTAVKDQGQCGSCWAFSTVA 152
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
A EGI ++ T L+SLSEQELVDCD S + GC+GGYME FEFI K GG+++E +YPY
Sbjct: 153 ATEGIYQITTSMLMSLSEQELVDCD--SVDHGCDGGYMEGGFEFIIKNGGISSEANYPYT 210
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
+ C +K A I GYE +PA AFQ YS GVF
Sbjct: 211 AVDGTCDANKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDAGGSAFQFYSSGVF 270
Query: 272 DEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
CG QL+HGVT VGYG D G +YW+VKNSWGT WGE GYIRM R + + G+CGI
Sbjct: 271 TGQCGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRGTDAQE-GLCGIA 329
Query: 331 MQASYPV 337
M ASYP
Sbjct: 330 MDASYPT 336
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 147/306 (48%), Positives = 193/306 (63%), Gaps = 31/306 (10%)
Query: 62 FENWLKQYSREYGSED--EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+E WL ++ + E RRF I+ N++++D N +NLS++L +FADL+N+E+
Sbjct: 50 YEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTNDEYR 109
Query: 120 STYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
S YLG R S++Y LP S+DWRK+GAV VKDQG CGSCWAFS + A
Sbjct: 110 SKYLGAKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGA 169
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGIN++ TG L++LSEQELVDCD S N+GCNGG M+ AFEFI K GG+ T+ DYPY+G
Sbjct: 170 VEGINQIVTGDLITLSEQELVDCDT-SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKG 228
Query: 235 KNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFD 272
+ C + VTI YE +P AFQLY G+FD
Sbjct: 229 VDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFD 288
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG QL+HGV VGYG ++G+ YW+V+NSWG SWGE+GY+RMARN SS+ G CGI ++
Sbjct: 289 GSCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLRMARNIASSS-GKCGIAIE 347
Query: 333 ASYPVK 338
SYP+K
Sbjct: 348 PSYPIK 353
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 156/327 (47%), Positives = 195/327 (59%), Gaps = 45/327 (13%)
Query: 56 QSMEERFENWLKQY-------SREYGSED-EWQRRFGIYSSNVQYIDYINSQN-LSFKLT 106
+S+ +E W +Y S G++D E +RRF ++ N +YI N + F+L
Sbjct: 36 ESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFRLA 95
Query: 107 DNKFADLSNEEFISTYLGYNKPYNEP-------RWPSVQYLG-----LPASVDWRKEGAV 154
NKFAD++ +EF TY G ++ S +Y G LP +VDWR+ GAV
Sbjct: 96 LNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWRERGAV 155
Query: 155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
T +KDQGQCGSCWAFS VAAVEG+NK+KTG+LV+LSEQELVDCD +NQGC+GG M+ A
Sbjct: 156 TGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDT-GDNQGCDGGLMDYA 214
Query: 215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------- 260
F+FI + GG+TTE +YPYR + RC K H VTI GYE +PA
Sbjct: 215 FQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQPV 274
Query: 261 --------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAG 311
FQ YS GVF CG L+HGV VGYG G KYW+VKNSWG WGE G
Sbjct: 275 AVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRDGTKYWIVKNSWGEDWGERG 334
Query: 312 YIRMARNSPSSNIGICGILMQASYPVK 338
YIRM R S + G+CGI M+ASYPVK
Sbjct: 335 YIRMQRGVSSDSNGLCGIAMEASYPVK 361
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 156/315 (49%), Positives = 189/315 (60%), Gaps = 38/315 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
M RFE W+ ++ R Y + E QRRF +Y N+ I+ NS + LTDNKFADL+NEE
Sbjct: 115 MRMRFEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHGYTLTDNKFADLTNEE 174
Query: 118 FISTYLG-----------YNKPYNEPRWP-SVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
F + LG N P + LP VDWRK+GAV VK+QG CGS
Sbjct: 175 FRAKMLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGS 234
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFSAVAA+EG+N++K GKLVSLSEQELVDCD +E GC GG+M AFEF+ G+T
Sbjct: 235 CWAFSAVAAMEGLNQIKNGKLVSLSEQELVDCD--AEAVGCAGGFMSWAFEFVMANHGLT 292
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAF 263
TE YPY+G N CQT K +V+ITGY + + F
Sbjct: 293 TEASYPYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLF 352
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
QLY+ GVF C Q+NHGVTVVGYGE D EKYW+VKNSWG WGEAGY+ M R++
Sbjct: 353 QLYAGGVFSGPCTAQINHGVTVVGYGETDKAEKYWIVKNSWGPEWGEAGYMLMQRDA-GV 411
Query: 323 NIGICGILMQASYPV 337
G+CGI M ASYPV
Sbjct: 412 PTGLCGIAMLASYPV 426
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 285 bits (730), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 156/329 (47%), Positives = 199/329 (60%), Gaps = 45/329 (13%)
Query: 53 YDPQ--SMEER----FENWLKQYSREYGSE--------DEWQRRFGIYSSNVQYIDYINS 98
YDPQ S EER F++W+ Q+ + Y E R+GI+ N+++I N
Sbjct: 42 YDPQDLSSEERLQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGENE 101
Query: 99 QNLSFKLTDNKFADLSNEEFISTYLG-------YNKPYNEPRWPSVQYLGLPASVDWRKE 151
+N + L N FADL+NEEF + G Y E R+ SVQ LP S+DWR++
Sbjct: 102 KNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSYEEFRYGSVQLKDLPDSIDWREK 161
Query: 152 GAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYM 211
GAV VKDQG CGSCWAFSAVAA+EG+NKL TG+LVSLSEQELVDCD E++GCNGG M
Sbjct: 162 GAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCD-KGEDEGCNGGLM 220
Query: 212 EKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------- 260
+ AF F+ K GG+ TE DYPY+G RC K VTI GYE +P
Sbjct: 221 DYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAH 280
Query: 261 -----------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGE 309
+ Q Y G+F CG L+HGVT VGYG++ G+ YW++KNSWG++WGE
Sbjct: 281 QPVSVAIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWIIKNSWGSNWGE 340
Query: 310 AGYIRMARNSPSSNIGICGILMQASYPVK 338
GYI+MARN+ + G+CGI M+ASYP K
Sbjct: 341 KGYIKMARNTGLA-AGLCGINMEASYPTK 368
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 152/310 (49%), Positives = 191/310 (61%), Gaps = 31/310 (10%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSN 115
M ER W+ QY + Y E ++RF I++ NV YI+ N N + L N+FADL+N
Sbjct: 34 MYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQFADLTN 93
Query: 116 EEFISTYLGYNKPY--NEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+EF S+ + + R + +Y +P+SVDWRK+GAVTPVK+QGQCG CWAFS
Sbjct: 94 DEFTSSRNKFKGHMCSSITRTSTFKYENASAIPSSVDWRKKGAVTPVKNQGQCGCCWAFS 153
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
AVAA EGI+KL TGKL+SLSEQELVDCD +QGC GG M+ AF+FI + G+ TE +Y
Sbjct: 154 AVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEANY 213
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSH 268
PY+G + C +K +AVTITGYE +P FQ Y
Sbjct: 214 PYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPISVAIDASGSDFQFYKS 273
Query: 269 GVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVF CG +L+HGVT VGYG + G KYWLVKNSWGT WGE GYI M R ++ G+C
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGVSNDGTKYWLVKNSWGTEWGEEGYIMMQRGVDAAE-GLC 332
Query: 328 GILMQASYPV 337
GI MQASYP
Sbjct: 333 GIAMQASYPT 342
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 149/319 (46%), Positives = 196/319 (61%), Gaps = 34/319 (10%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFA 111
YD +E WL ++ + Y + E +RRF I+ N+++I+ N + + S+KL NKFA
Sbjct: 39 YDESHTRHVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFA 98
Query: 112 DLSNEEFISTYLGYNK--PYNEP--------RWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
DL+NEE+ + +LG P N+ R+ LPA VDWR++GAVTP+KDQG
Sbjct: 99 DLTNEEYRAMFLGTRTRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGAVTPIKDQG 158
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
QCGSCWAFS V AVEGIN++ TG L SLSEQELVDCD N GCNGG M+ AFEFI +
Sbjct: 159 QCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCD-RGYNMGCNGGLMDYAFEFIVQN 217
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR--------------------- 260
GG+ TE+DYPY K++ C ++ VTI GYE +P
Sbjct: 218 GGIDTEEDYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAG 277
Query: 261 -YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQLY GVF CG L+HGV VGYG ++G YWLV+NSWG++WGE GYI++ RN
Sbjct: 278 GMEFQLYQSGVFTGRCGTNLDHGVVAVGYGTENGTDYWLVRNSWGSAWGENGYIKLERNV 337
Query: 320 PSSNIGICGILMQASYPVK 338
++ G CGI ++ASYP+K
Sbjct: 338 QNTETGKCGIAIEASYPIK 356
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 151/313 (48%), Positives = 191/313 (61%), Gaps = 30/313 (9%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFA 111
D ++ E+ E W+ Y + Y E + R I+ NV YI+ N+ N +KL N+FA
Sbjct: 33 DDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLGINQFA 92
Query: 112 DLSNEEFISTYLGYN----KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
DL+NEEFI++ + + + +P++VDWRK+GAVTPVK+QGQCG CW
Sbjct: 93 DLTNEEFIASRNKFKGHMCSSITKTSTFKYENASVPSTVDWRKKGAVTPVKNQGQCGCCW 152
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA EGI+KL TGKLVSLSEQELVDCD +QGC GG M+ AF+FI + G+ TE
Sbjct: 153 AFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTE 212
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQL 265
YPY+G + C +K HAVTITGYE +PA FQ
Sbjct: 213 AQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQF 272
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GVF CG +L+HGVT VGYG + G KYWLVKNSWGT WGE GYI+M R ++
Sbjct: 273 YKSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGTDWGEEGYIKMQRGVDAAE- 331
Query: 325 GICGILMQASYPV 337
G+CGI M+ASYP
Sbjct: 332 GLCGIAMEASYPT 344
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 149/315 (47%), Positives = 194/315 (61%), Gaps = 33/315 (10%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFAD 112
D +M R E W+ QYSR Y E RRF ++ +NV++I+ N+ N F L N+FAD
Sbjct: 29 DDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKANVKFIESFNAGGNNKFWLGVNQFAD 88
Query: 113 LSNEEF--ISTYLGYN----KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
L+N+EF I T G+ K R+ +V LP ++DWR +GAVTP+KDQGQCG C
Sbjct: 89 LTNDEFRSIKTNKGFKSSNMKIPTGFRYENVSVDALPTTIDWRTKGAVTPIKDQGQCGCC 148
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSAVAA EGI K+ TGKLVSL+EQELVDCDV+ E+QGC GG M+ AF+FI GG+TT
Sbjct: 149 WAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIINNGGLTT 208
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E YPY + +C++ + A TI GYE +PA FQ
Sbjct: 209 ESSYPYTAADGKCKSG--SNSAATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQ 266
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE GY+RM ++ S
Sbjct: 267 FYSSGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKD-ISDK 325
Query: 324 IGICGILMQASYPVK 338
G+CG+ M+ SYP +
Sbjct: 326 RGMCGLAMEPSYPTE 340
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 151/309 (48%), Positives = 199/309 (64%), Gaps = 34/309 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFIS 120
+++W+ Q+ + Y E ++RF I+ N+++ID NS N ++KL NKFADL+N+E+ +
Sbjct: 45 YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYRA 104
Query: 121 TYLGYN----KPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
+LG + + + PS +Y LP SVDWR GAV+PVKDQG CGSCWAFS
Sbjct: 105 KFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSPVKDQGSCGSCWAFST 164
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
+A VEGINK+ +G+LVSLSEQELVDCD S + GCNGG M+ AF+FI GG+ TE DYP
Sbjct: 165 IATVEGINKIVSGELVSLSEQELVDCD-RSYDAGCNGGLMDYAFQFIMDNGGIDTEKDYP 223
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------YAFQLYSHGV 270
Y G N++C K V+I GYE +P AFQLY GV
Sbjct: 224 YLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKKAVAHQPVSIAIEAGGRAFQLYESGV 283
Query: 271 FDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
F+ CG L+HGV VGYG +D+G+ YW+V+NSWG++WGE GYIRM RN ++N G CGI
Sbjct: 284 FNGECGLALDHGVVAVGYGTDDNGQDYWIVRNSWGSNWGENGYIRMERN-INANTGKCGI 342
Query: 330 LMQASYPVK 338
M+ASYPVK
Sbjct: 343 AMEASYPVK 351
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 152/318 (47%), Positives = 198/318 (62%), Gaps = 37/318 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFA 111
+ M +E WL ++ R + E +RRF I+ NV++ID N S + SF+L N+FA
Sbjct: 44 EEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRFA 103
Query: 112 DLSNEEFISTYLGYNKPYN---EPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQC 163
D++NEE+ + YLG +P + R S +Y LP SVDWR +GAVT VKDQG C
Sbjct: 104 DMTNEEYRTVYLG-TRPASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVTTVKDQGSC 162
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFS +AAVEGINK+ TG L+SLSEQELVDCD N +NQGCNGG M+ AFEFI GG
Sbjct: 163 GSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCD-NGQNQGCNGGLMDYAFEFIINNGG 221
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------Y 261
+ TE+DYPY+ ++ +C + V+I GYE +P
Sbjct: 222 IDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGR 281
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQLY G+F CG L+HGV VGYG ++G+ YW+V+NSWG WGE+GYIRM RN +
Sbjct: 282 EFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSWGGDWGESGYIRMERNVNA 341
Query: 322 SNIGICGILMQASYPVKR 339
S G CGI M++SYP K+
Sbjct: 342 S-TGKCGIAMESSYPTKK 358
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 150/306 (49%), Positives = 188/306 (61%), Gaps = 35/306 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNE 116
M ER E W+ QY R Y + E + R+ I+ NV ID NSQ S+ L N+FADLSNE
Sbjct: 1 MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNE 60
Query: 117 EFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
EF ++ + P+ +Y +PA++DWRK+GAVTPVKDQGQC VA
Sbjct: 61 EFKASRNRFKGHMCSPQAGPFRYENVSAVPATMDWRKKGAVTPVKDQGQC--------VA 112
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
A+EGIN+L TGKL+SLSEQE+VDCD E+QGCNGG M+ AF+FI + G+TTE +YPY
Sbjct: 113 AMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYT 172
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
G + C T K HA ITG++ +PA + FQ YS G+F
Sbjct: 173 GTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIF 232
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG +L+HGVT VGYG G KYWLVKNSWG WGE GYIRM ++ S+ G+CGI M
Sbjct: 233 TGSCGTELDHGVTAVGYGGSDGTKYWLVKNSWGAQWGEEGYIRMQKD-ISAKEGLCGIAM 291
Query: 332 QASYPV 337
QASYP
Sbjct: 292 QASYPT 297
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 151/321 (47%), Positives = 201/321 (62%), Gaps = 34/321 (10%)
Query: 49 YPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFK 104
Y ++ D ++ + W+ + R Y + E +RR+ ++ N++YID N+ SF+
Sbjct: 32 YGERSDEEA-RRMYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFR 90
Query: 105 LTDNKFADLSNEEFISTYLGY-NKPYNE----PRWPSVQYLGLPASVDWRKEGAVTPVKD 159
L N+FADL+N+E+ +TYLG +P E R+ + LP SVDWR +GAV VKD
Sbjct: 91 LGLNRFADLTNDEYRATYLGARTRPQRERKLGARYHAADNEDLPESVDWRAKGAVAEVKD 150
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QG GSCWAFS +AAVEGIN++ TG L+SLSEQELVDCD S NQGCNGG M+ AFEFI
Sbjct: 151 QGSYGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT-SYNQGCNGGLMDYAFEFII 209
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------- 260
GG+ TE DYPY+G + RC ++ VTI YE +PA
Sbjct: 210 NNGGIDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIE 269
Query: 261 ---YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
FQLYS G+F CG L+HGVT VGYG ++G+ YW+VKNSWG+SWGE+GY+RM R
Sbjct: 270 AAGTQFQLYSSGIFTGSCGTALDHGVTAVGYGTENGKDYWIVKNSWGSSWGESGYVRMER 329
Query: 318 NSPSSNIGICGILMQASYPVK 338
N +S+ G CGI ++ SYP+K
Sbjct: 330 NIKASS-GKCGIAVEPSYPLK 349
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 151/305 (49%), Positives = 192/305 (62%), Gaps = 32/305 (10%)
Query: 65 WLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG 124
WL+++ + Y E +RF I+ +N+++ID NSQN ++K+ KFADL+N+E+ + +LG
Sbjct: 31 WLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQNRTYKVGLTKFADLTNQEYRAMFLG 90
Query: 125 Y----NKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
+ + + PS +Y LP SVDWR +GAV P+KDQG CGSCWAFS VAAV
Sbjct: 91 TRSDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAVNPIKDQGSCGSCWAFSTVAAV 150
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGIN++ TG+L+SLSEQELVDCD N GCNGG M+ AF+FI GG+ TE DYPY G
Sbjct: 151 EGINQIVTGELISLSEQELVDCD-RFYNAGCNGGLMDYAFQFIINNGGLDTEKDYPYLGN 209
Query: 236 NDRCQTDKTKHHAVTITGYE---------------------AIPAR-YAFQLYSHGVFDE 273
+D C DK K AV+I G+E AI A A Q Y GVF
Sbjct: 210 DDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQPVSVAIEASGMALQFYQSGVFTG 269
Query: 274 YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
CG L+HGV VVGYG + G YWLV+NSWGT WGE GYI+M RN + G CGI M++
Sbjct: 270 ECGTALDHGVVVVGYGTEKGLDYWLVRNSWGTEWGEHGYIKMQRNVRDTYTGRCGIAMES 329
Query: 334 SYPVK 338
SYPVK
Sbjct: 330 SYPVK 334
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 199/330 (60%), Gaps = 40/330 (12%)
Query: 44 AWSEGYPQKY--DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL 101
AWS + +K ++ + +E W + + +G E RRF ++ SNV ++ N +
Sbjct: 20 AWSFDFHEKELETEDNLWDMYERWRHKVATNHG---EKLRRFNVFKSNVLHVHETNKMDK 76
Query: 102 SFKLTDNKFADLSNEEFISTYLG-----YNKPYNEPRWPSVQYL-----GLPASVDWRKE 151
+KL NKFAD++N EF S Y G +++ R S ++ +P SVDWRK+
Sbjct: 77 PYKLKLNKFADMTNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKK 136
Query: 152 GAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYM 211
GAV PVKDQGQCGSCWAFS VAAVEGINK+KT +LVSLSEQELVDCD ENQGCNGG M
Sbjct: 137 GAVAPVKDQGQCGSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDT-LENQGCNGGLM 195
Query: 212 EKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------------- 258
+ AF+FI K GG+T ED YPY ++ +C ++K V+I G+E +P
Sbjct: 196 DLAFDFIKKTGGLTREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVAN 255
Query: 259 ---------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWG 308
FQ YS GVF CG QL+HGV VGYG G KYW+V+NSWG+ WG
Sbjct: 256 QPVAVAIDAGSSDFQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDGTKYWIVRNSWGSEWG 315
Query: 309 EAGYIRMARNSPSSNIGICGILMQASYPVK 338
E GYIRM R S G+CGI M+ASYP+K
Sbjct: 316 EKGYIRMER-GISDKRGLCGIAMEASYPIK 344
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 147/306 (48%), Positives = 192/306 (62%), Gaps = 31/306 (10%)
Query: 62 FENWLKQYSREYGSED--EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+E WL ++ + E RRF I+ N+++ID N +NLS++L +FADL+N+E+
Sbjct: 43 YEAWLVKHGKAQNQNSLVEKDRRFEIFKDNLRFIDDHNKKNLSYRLGLTRFADLTNDEYR 102
Query: 120 STYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
S YLG R S +Y LP S+DWRK+GAV VKDQG CGSCWAFS + A
Sbjct: 103 SKYLGAKMEKKGERRTSQRYEARVGDELPESIDWRKKGAVAEVKDQGSCGSCWAFSTIGA 162
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGIN++ TG L++LSEQELVDCD S N+GCNGG M+ AFEFI K GG+ T+ DYPY+G
Sbjct: 163 VEGINQIVTGDLITLSEQELVDCDT-SYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKG 221
Query: 235 KNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFD 272
+ C + VTI YE +P AFQLY G+FD
Sbjct: 222 VDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRAFQLYDSGIFD 281
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG QL+HGV VGYG ++G+ YW+V+NSWG SWGE+GY++MARN SS+ G CGI ++
Sbjct: 282 GTCGTQLDHGVVAVGYGTENGKDYWIVRNSWGKSWGESGYLKMARNIASSS-GKCGIAIE 340
Query: 333 ASYPVK 338
SYP+K
Sbjct: 341 PSYPIK 346
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 153/342 (44%), Positives = 197/342 (57%), Gaps = 29/342 (8%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
M + + F L + + A EG + + M ER E W+ + + Y E +++
Sbjct: 1 MAFKKVLFQYFTLALCLVFAFCAFEGNARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQK 60
Query: 83 FGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEF--ISTYLGY--NKPYNEPRWPSV 137
+ + NVQ I+ N + N +KL N FADL+NEEF I+ + G+ +K P +
Sbjct: 61 YQTFKENVQRIEAFNHAGNKPYKLGINHFADLTNEEFKAINRFKGHVCSKITRTPTFRYE 120
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+PA++DWR+EGAVTP+KDQGQCG CWAFSAVAA EGI KL TGKL+SLSEQELVDC
Sbjct: 121 NMTAVPATLDWRQEGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDC 180
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
D +QGC GG M+ AF+FI + G+ E YPY G + C +HA +I GYE +
Sbjct: 181 DTKGVDQGCEGGLMDDAFKFILQNKGLAAEAIYPYEGVDGTCNAKAEGNHATSIKGYEDV 240
Query: 258 PAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGE 294
PA + FQ YS GVF CG L+HGVT VGYG D G
Sbjct: 241 PANSESALLKAVANQPVSVAIEASGFEFQFYSGGVFTGSCGTNLDHGVTAVGYGVSDDGT 300
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
KYWLVKNSWG WG+ GYIRM R+ + G+CGI M ASYP
Sbjct: 301 KYWLVKNSWGVKWGDKGYIRMQRDVAAKE-GLCGIAMLASYP 341
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 192/313 (61%), Gaps = 32/313 (10%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D SM R ENW+ QY R Y E ++F ++ +N ++I+ N+ N F L N+FAD+
Sbjct: 29 DDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAEFINSFNAGNHKFWLGINQFADI 88
Query: 114 SNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+NEEF +T N+ R P+ + + LPA++DWR +GAVTP+KDQGQCG CW
Sbjct: 89 TNEEFKATKTNKGFISNKVRVPTGFMYENMSFDALPATIDWRTKGAVTPIKDQGQCGCCW 148
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA+EGI KL TGKLVSLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+T E
Sbjct: 149 AFSAVAAMEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTQE 208
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
+YPY + +C++ + A TI YE +PA FQ
Sbjct: 209 SNYPYDAADGKCKSGSS--SAATIKSYEDVPANNEGALMKAVANQPVSVAVDGGDMTFQF 266
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GV CG L+HG+ +GYG G K+W++KNSWGTSWGE G++RM ++
Sbjct: 267 YSGGVMTGSCGTDLDHGIAAIGYGTTSDGTKFWIMKNSWGTSWGENGFLRMEKDIADKK- 325
Query: 325 GICGILMQASYPV 337
G+CG+ M+ SYP
Sbjct: 326 GMCGLAMEPSYPT 338
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 161/348 (46%), Positives = 208/348 (59%), Gaps = 38/348 (10%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEWQRR 82
M R VL+L +L VL G + + + + + S+ E +E W ++ S +E +R
Sbjct: 1 MKRFIVLALCMLMVLETTKGL--DFHNKDVESENSLWELYERWRSHHTVAR-SLEEKAKR 57
Query: 83 FGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN------EPRWPS 136
F ++ NV++I N ++ S+KL NKF D+++EEF TY G N ++ + S
Sbjct: 58 FNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKS 117
Query: 137 VQYLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
Y LP SVDWRK GAVTPVK+QGQCGSCWAFS V AVEGIN+++T KL SLSEQE
Sbjct: 118 FMYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
LVDCD N +NQGCNGG M+ AFEFI + GG+T+E YPY+ ++ C T+K V+I G
Sbjct: 178 LVDCDTN-QNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDG 236
Query: 254 YEAIPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGED 291
+E +P FQ YS GVF CG +LNHGV VVGYG
Sbjct: 237 HEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTT 296
Query: 292 -HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G KYW+VKNSWG WGE GYIRM R G+CGI M+ASYP+K
Sbjct: 297 IDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKE-GLCGIAMEASYPLK 343
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 148/317 (46%), Positives = 195/317 (61%), Gaps = 35/317 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLS 114
+ + +E WL ++ + Y + E + RF I++ N+++ID N S N S+K+ N+FADL+
Sbjct: 30 EEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGLNQFADLT 89
Query: 115 NEEFISTYLGYN-KPYNE----------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQC 163
NEE+ S YLG PY R+ + PA VDWR+ GAV+PVK+QG C
Sbjct: 90 NEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAVSPVKNQGGC 149
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFS VA+VEGINK+ TG L+SLSEQELVDCD N N GCNGG M+ AF+FI GG
Sbjct: 150 GSCWAFSTVASVEGINKIVTGDLISLSEQELVDCD-NKYNSGCNGGSMDYAFQFIVSNGG 208
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARY 261
+ +E DYPY+G C + K V+I GYE +P +
Sbjct: 209 IDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPVSVGIEASGR 268
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
AFQLY+ GV CG L+HGV VVGYG ++G+ YW+V+NSWG WGE GYIRM RN
Sbjct: 269 AFQLYTSGVLTGSCGTNLDHGVVVVGYGSENGKDYWIVRNSWGPEWGEDGYIRMERNMVD 328
Query: 322 SNIGICGILMQASYPVK 338
+ +G+CGI + ASYP+K
Sbjct: 329 TPVGMCGITLMASYPIK 345
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 153/347 (44%), Positives = 202/347 (58%), Gaps = 35/347 (10%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
M R +L++F + ++ A ++ + + + +E W ++ S E Q R
Sbjct: 1 MDTRKVILAVFSVVLVFRLADSFDYTEEDLASEERLRDLYERWRSHHTVSR-SLAEKQER 59
Query: 83 FGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG- 141
F ++ N+++I +N ++ +KL N FAD++N EF+ Y G + Q G
Sbjct: 60 FNVFKENLKHIHKVNHKDRPYKLKLNSFADMTNHEFLQHYGGSKVSHYRVLRGQRQGTGS 119
Query: 142 -------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
LP+SVDWRK GAVT +KDQG+CGSCWAFS VAAVEGINK+KTG+L+SLSEQEL
Sbjct: 120 MHEDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKIKTGELISLSEQEL 179
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDCD S+N GCNGG ME AF FI +IGG+T+E+ YPYR K + C ++K V I GY
Sbjct: 180 VDCD--SDNHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEEPCDSNKMNSPVVNIDGY 237
Query: 255 EAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH 292
E +P Q YS +F CG +LNHGV +VGYG
Sbjct: 238 EMVPENDENALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELNHGVALVGYGTTQ 297
Query: 293 -GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G KYW+VKNSWGT WGE GYIRM R + G+CGI M+ASYPVK
Sbjct: 298 DGTKYWIVKNSWGTDWGEKGYIRMQRGIDAEE-GLCGITMEASYPVK 343
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 190/307 (61%), Gaps = 31/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E WL ++ + Y + E +RRF I+ N+++I+ N+ N ++K+ N+FADL+NEE+ S
Sbjct: 54 YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVGLNRFADLTNEEYRSR 113
Query: 122 YLGYNKPYNE--------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
YLG R+ LP SVDWR++GAV PVKDQG CGSCWAFS +A
Sbjct: 114 YLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTIA 173
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEGIN++ TG L+SLSEQELVDCD S NQGCNGG M+ AFEFI GG+ +E+DYPYR
Sbjct: 174 AVEGINQIATGDLISLSEQELVDCD-KSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYR 232
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
+ C ++ V+I GYE +P AFQLY GVF
Sbjct: 233 AADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGVF 292
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG QL+HGV VGYG ++ YW+V+NSWG +WGE+GYI++ RN + G CGI +
Sbjct: 293 TGQCGTQLDHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGIAI 352
Query: 332 QASYPVK 338
+ SYP+K
Sbjct: 353 EPSYPIK 359
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 149/312 (47%), Positives = 192/312 (61%), Gaps = 37/312 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDY----INSQNLSFKLTDNKFADLSNEE 117
++ W Q++R Y + DE ++R I+ N+++ID N+ SF+L +FADL+NEE
Sbjct: 47 YQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRFADLTNEE 106
Query: 118 FISTYLGY---------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
+ STYLG N R+ LP S+DWR +GAV VKDQG CGSCWA
Sbjct: 107 YRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQGSCGSCWA 166
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS +AAVEGIN + TG L+SLSEQELVDCD NQGCNGG M+ AFEFI GG+ T++
Sbjct: 167 FSTIAAVEGINHIVTGDLISLSEQELVDCDT-YYNQGCNGGLMDYAFEFIISNGGIDTDE 225
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
DYPY G++ C + H VTI YE +P AFQLY
Sbjct: 226 DYPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAGGRAFQLY 285
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
G+F YCG +L+HGVT +GYG ++G+ YW+VKNSWG+ WGE+GYIRM RN S+ G
Sbjct: 286 ESGIFTGYCGTELDHGVTAIGYGSENGKYYWIVKNSWGSDWGESGYIRMERNINSA-TGK 344
Query: 327 CGILMQASYPVK 338
CGI M+ASYP+K
Sbjct: 345 CGIAMEASYPIK 356
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 162/348 (46%), Positives = 206/348 (59%), Gaps = 38/348 (10%)
Query: 24 MLRNAVLSLFLLWVLGIPAGA-WSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
M R VL+L +L VL + E + D S+ E +E W K + S +E +R
Sbjct: 1 MKRFIVLALCMLMVLETTKSLDFHEKDVESED--SLWELYERW-KSHHTIARSLEEKAKR 57
Query: 83 FGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYN----KPYNEPRWPSVQ 138
F ++ NV++I N + S+KL NKF D+++EEF TY G N + + R +
Sbjct: 58 FNVFKHNVKHIHETNKKENSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGERQTTKS 117
Query: 139 YL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
++ LP SVDWRK GAVTPVK+QGQCGSCWAFS V AVEGIN+++T KL SLSEQE
Sbjct: 118 FMYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
LVDCD N +NQGCNGG M+ AFEFI + GG+T+E YPY+ ++ C T+K V+I G
Sbjct: 178 LVDCDTN-KNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDG 236
Query: 254 YEAIPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGED 291
+E +P FQ YS GVF CG +LNHGV VVGYG
Sbjct: 237 HEDVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTT 296
Query: 292 -HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G KYW+VKNSWG WGE GYIRM R G+CGI M+ASYP+K
Sbjct: 297 IDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKE-GLCGIAMEASYPLK 343
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 193/315 (61%), Gaps = 35/315 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+S+ +E W ++ S DE +RF ++ NV ++ N ++ +KL NKFAD++N
Sbjct: 32 ESLWNLYERWRSHHTVSR-SLDEKHKRFNVFKENVNFVHEFNKKDEPYKLKLNKFADMTN 90
Query: 116 EEFISTYLG----YNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSC 166
EF STY G +++ + + + ++ +P SVDWRK+GAVTP+KDQGQCGSC
Sbjct: 91 HEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQCGSC 150
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V AVEGIN +KT KLVSLSEQELVDCD SENQGCNGG M AFEFI + GG+TT
Sbjct: 151 WAFSTVVAVEGINHIKTNKLVSLSEQELVDCDT-SENQGCNGGLMGYAFEFIKEKGGITT 209
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E YPY ++ C K V+I G+E +P AFQ
Sbjct: 210 EQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSAFQ 269
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GVF CG L+HGV +VGYG G KYW+VKNSWGT WGE GYIRM R S+
Sbjct: 270 FYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKR-GISAK 328
Query: 324 IGICGILMQASYPVK 338
G+CGI ++ASYP+K
Sbjct: 329 EGLCGIAVEASYPIK 343
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 283 bits (724), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 154/329 (46%), Positives = 199/329 (60%), Gaps = 45/329 (13%)
Query: 53 YDPQ--SMEER----FENWLKQYSREYGSE--------DEWQRRFGIYSSNVQYIDYINS 98
YDPQ S EER F++W+ Q+ + Y E R+GI+ N+++I N
Sbjct: 42 YDPQDLSSEERLQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGENE 101
Query: 99 QNLSFKLTDNKFADLSNEEFISTYLG-------YNKPYNEPRWPSVQYLGLPASVDWRKE 151
+N + L N FADL+NEEF + G + E R+ SVQ LP S+DWR++
Sbjct: 102 KNQGYFLGLNAFADLTNEEFRAQRHGGRFDRSRERTSHEEFRYGSVQLKDLPDSIDWREK 161
Query: 152 GAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYM 211
GAV VKDQG CGSCWAFSAVAA+EG+NKL TG+LVSLSEQELVDCD E++GCNGG M
Sbjct: 162 GAVVGVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCD-KGEDEGCNGGLM 220
Query: 212 EKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------- 260
+ AF F+ K GG+ TE DYPY+G RC K VTI GYE +P
Sbjct: 221 DYAFGFVIKNGGLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAH 280
Query: 261 -----------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGE 309
+ Q Y G+F CG L+HGVT VGYG++ G+ YW++KNSWG++WGE
Sbjct: 281 QPVSVAIDAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGKEDGKAYWIIKNSWGSNWGE 340
Query: 310 AGYIRMARNSPSSNIGICGILMQASYPVK 338
GY++MARN+ + G+CGI M+ASYP K
Sbjct: 341 KGYVKMARNTGLA-AGLCGINMEASYPTK 368
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 283 bits (724), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 143/311 (45%), Positives = 190/311 (61%), Gaps = 30/311 (9%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNE 116
+M ER E W+ +++R Y E +RF ++ +NV +I+ N++N F L N+F DL+N+
Sbjct: 32 AMVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAFIESFNAENRKFWLGVNQFTDLTND 91
Query: 117 EFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
EF +T + R P+ V LP +VDWR +G VTP+KDQGQCG CWAFS
Sbjct: 92 EFRATKTNKGLKMSGGRAPTGFKYSNVSIDALPTAVDWRTKGVVTPIKDQGQCGCCWAFS 151
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
AV A EGI KL TGKL+SLSEQELVDCDV+ +QGC GG M+ AF+FI K GG+TTE +Y
Sbjct: 152 AVVATEGIVKLSTGKLISLSEQELVDCDVHGVDQGCEGGEMDDAFKFIIKNGGLTTEANY 211
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSH 268
PY ++ +C+T + TI GYE +PA FQ YS
Sbjct: 212 PYTAQDGQCKTSIASNSVATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDVIFQHYSG 271
Query: 269 GVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GV CG L+HG+ +GYG G KYWL+KNSWGT+WGE+GY+RM ++ S G+C
Sbjct: 272 GVMTGSCGTDLDHGIAAIGYGMTSDGTKYWLLKNSWGTTWGESGYLRMEKD-ISDKSGMC 330
Query: 328 GILMQASYPVK 338
G+ MQ SYP +
Sbjct: 331 GLAMQPSYPTE 341
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 195/307 (63%), Gaps = 33/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E+WL ++ + Y + E ++RF I+ N+++ID N+++ ++K+ N+FADL+N+E+ S
Sbjct: 46 YESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAESRTYKVGLNRFADLTNDEYRSM 105
Query: 122 YLG--------YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
YLG + R+ V LP SVDWR++GAV VKDQG CGSCWAFS +A
Sbjct: 106 YLGARTGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVGVKDQGSCGSCWAFSTIA 165
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEGIN++ TG L+SLSEQELVDCD S N+GCNGG M+ AFEFI K GG+ TE+DYPY
Sbjct: 166 AVEGINQIVTGDLISLSEQELVDCDT-SYNEGCNGGLMDYAFEFIIKNGGIDTEEDYPYN 224
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
++ RC + VTI YE +P AFQ Y GVF
Sbjct: 225 ARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEASGMAFQFYESGVF 284
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG L+HGVT VGYG ++ YW+VKNSWG+SWGE+GYIRM RN+ ++ G CGI +
Sbjct: 285 TGNCGTALDHGVTAVGYGTENSVDYWIVKNSWGSSWGESGYIRMERNTGAT--GKCGIAV 342
Query: 332 QASYPVK 338
+ SYP+K
Sbjct: 343 EPSYPIK 349
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 190/312 (60%), Gaps = 28/312 (8%)
Query: 51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNK 109
+K SM ER E W+ +Y + Y E +RF I+ NV++I+ N+ N +KL N
Sbjct: 27 RKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLGVNH 86
Query: 110 FADLSNEEFISTYLGYNKP--YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
ADL+ EEF ++ G+ +P ++ + +PA++DWR +GAVTP+KDQGQCGSCW
Sbjct: 87 LADLTVEEFKASRNGFKRPHEFSTTTFKYENVTAIPAAIDWRTKGAVTPIKDQGQCGSCW 146
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS +AA EGI+++ TGKLVSLSEQELVDCD +QGC GGYME FEFI K GG+T+E
Sbjct: 147 AFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSE 206
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
+YPY+ + +C +K I GYE +P F
Sbjct: 207 TNYPYKAVDGKC--NKATSPVAQIKGYEKVPPNSETALQKAVANQPVSVSIDADGAGFMF 264
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
YS G+++ CG +L+HGVT VGYG +G YW+VKNSWGT WGE GY+RM R + + G
Sbjct: 265 YSSGIYNGECGTELDHGVTAVGYGTANGTDYWIVKNSWGTQWGEKGYVRMQRGIAAKH-G 323
Query: 326 ICGILMQASYPV 337
+CGI + +SYP
Sbjct: 324 LCGIALDSSYPT 335
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 152/337 (45%), Positives = 200/337 (59%), Gaps = 33/337 (9%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L + LG A + + SM ER E W+ +Y + Y +E ++RF ++ N
Sbjct: 10 ISLALFFCLGFLAFQVA---SRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKEN 66
Query: 90 VQYID-YINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY-----LGLP 143
V YI+ + N+ N +KL N+FADL++EEFI +N + + LP
Sbjct: 67 VNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGHTRSSNTRTTTFKYENVTVLP 126
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
S+DWR++GAVTP+K+QG CG CWAFSA+AA EGI+K+ TGKLVSLSEQE+VDCD +
Sbjct: 127 DSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTD 186
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----- 258
GC GGYM+ AF+FI + G+ TE YPY+G + +C + HA TITGYE +P
Sbjct: 187 HGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHAATITGYEDVPINNEK 246
Query: 259 -----------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVK 300
+ FQ Y G+F CG +L+HGVT VGYGE++ G KYWLVK
Sbjct: 247 ALQKAVANQPVSVAIDASGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEGTKYWLVK 306
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWGT WGE GYI M R + GICGI M ASYP
Sbjct: 307 NSWGTEWGEEGYIMMQRGVKAVE-GICGIAMMASYPT 342
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 152/315 (48%), Positives = 192/315 (60%), Gaps = 35/315 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+S+ + +E W ++ + +E Q+RF ++ SNV ++ N + +KL NKFAD++N
Sbjct: 34 ESLWDLYERWRSHHTVSR-NLNEKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTN 92
Query: 116 EEFISTYLGYNKPYNE-----PRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
EF +TY G ++ PR + PASVDWRK+GAVT VKDQGQCGSC
Sbjct: 93 HEFKTTYAGSKVNHHRMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSC 152
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V AVEGIN++KT +LV LSEQEL+DCD N ENQGCNGG ME AFE+I + GG+TT
Sbjct: 153 WAFSTVVAVEGINQIKTNRLVPLSEQELIDCD-NQENQGCNGGLMEYAFEYIKQKGGITT 211
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E YPY + C K AV+I G+E +PA FQ
Sbjct: 212 ESYYPYTANDGSCDATKENVPAVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQ 271
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GVF CG +LNHGV +VGYG G YW+V+NSWG WGE GYIRM RN S+
Sbjct: 272 FYSEGVFTGDCGKELNHGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGYIRMKRNV-SNK 330
Query: 324 IGICGILMQASYPVK 338
G+CGI M+ASYPVK
Sbjct: 331 EGLCGIAMEASYPVK 345
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 148/315 (46%), Positives = 196/315 (62%), Gaps = 33/315 (10%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFAD 112
D +M R E W+ QY+R Y E +RF ++ +NV++I+ N+ N F L N+FAD
Sbjct: 29 DDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKFIESFNAGGNRKFWLGVNQFAD 88
Query: 113 LSNEEFISTYL--GYN-KPYNEP---RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
L+N+EF +T G+ P P R+ +V LPAS+DWR +GAVTP+KDQGQCG C
Sbjct: 89 LTNDEFRATKTNKGFKPSPVKVPTGFRYENVSVDALPASIDWRTKGAVTPIKDQGQCGCC 148
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSAVAA EGI K+ T KL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+TT
Sbjct: 149 WAFSAVAATEGIVKISTDKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTT 208
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E YPY + +C++ + A I G+E +PA FQ
Sbjct: 209 ESSYPYTATDGKCKSG--TNSAANIKGFEDVPANDEAALMKAVANQPVSVAVDGGDMTFQ 266
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
LYS GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE GY+RM ++ S
Sbjct: 267 LYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKD-ISDK 325
Query: 324 IGICGILMQASYPVK 338
G+CG+ M+ SYP +
Sbjct: 326 RGMCGLAMEPSYPTE 340
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 153/342 (44%), Positives = 204/342 (59%), Gaps = 30/342 (8%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
M + +FL ++L + A A + + M +R E W+ Q+ R YG E ++R
Sbjct: 1 MAAKKCNTRIFLPFLLILAAWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKR 60
Query: 83 FGIYSSNVQYID-YINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG 141
+ I+ N++ I+ + N + +KL NKFADL+NEEF + Y GY + ++ S +Y
Sbjct: 61 YLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYHGYKRQSSKLMSSSFRYEN 120
Query: 142 L---PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
L P S+DWR +GAVTPVKDQG CG CWAFS VAA+EGI KL+TG L+SLSEQ+LVDC
Sbjct: 121 LSDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDC- 179
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
+ N+GC GG M+ AF++I + GG+T+ED+YPY+G + C ++K ITGYE +P
Sbjct: 180 -TAGNKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVDGTCSSEKAASTEAQITGYEDVP 238
Query: 259 ARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEK 295
FQ Y GVF+ CG Q NH VT +GYG D G
Sbjct: 239 QNNENALLQAVAKQPVSVGVDGGGNDFQFYKSGVFNGDCGTQQNHAVTAIGYGTDIDGTD 298
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YWLVKNSWGTSWGE GY+RM R SS G+CG+ M ASYP
Sbjct: 299 YWLVKNSWGTSWGENGYMRMRRGIGSSE-GLCGVAMDASYPT 339
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 164/360 (45%), Positives = 203/360 (56%), Gaps = 50/360 (13%)
Query: 24 MLRNAVLSL--FLLWVLGIPAGAWSEGYP----QKYDPQSMEERFENWLKQY--SREYG- 74
MLR VL+ L VL PA A G P +S+ +E W Y SR G
Sbjct: 1 MLRCLVLAAVSLALLVLAPPARA---GIPFTEKDLASEESLRALYEQWRSHYMVSRPAGL 57
Query: 75 -SEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR 133
+D+ R F ++ NV+YI N + SF+L NKFAD++ +EF Y ++ +
Sbjct: 58 QEQDDKARWFNVFKENVRYIHEANKKGRSFRLALNKFADMTTDEFRRAYAAGSRTRHHRA 117
Query: 134 WPS------------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKL 181
S Q LP +VDWR+ GAVT +KDQGQCGSCWAFS +AAVEGINK+
Sbjct: 118 LSSGIRRHGDGSFMYAQAGNLPLAVDWRQRGAVTGIKDQGQCGSCWAFSTIAAVEGINKI 177
Query: 182 KTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQT 241
+TGKLVSLSEQELVDCD + +NQGCNGG M+ AF++I + GG+TTE +YPY + C
Sbjct: 178 RTGKLVSLSEQELVDCD-DVDNQGCNGGLMDYAFQYIKRNGGITTESNYPYLAEQRSCNK 236
Query: 242 DKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQL 279
K + H VTI GYE +PA FQ YS GVF CG +L
Sbjct: 237 AKERSHDVTIDGYEDVPANNEDALQKAVANQPVSIAIEASGQDFQFYSEGVFTGSCGTEL 296
Query: 280 NHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+HGV VGYG G KYW+VKNSWG WGE GYIRM R S G+CGI M+ SYP K
Sbjct: 297 DHGVAAVGYGITRDGTKYWIVKNSWGEDWGERGYIRMQRGISDSQ-GLCGIAMEPSYPTK 355
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 158/356 (44%), Positives = 214/356 (60%), Gaps = 41/356 (11%)
Query: 23 MMLRNAVLSLFL-LWVLGIPAGAWSEGYPQKYDPQSMEER---FENWLKQYSREYGSED- 77
M++ V +LF + L + ++ + + K +S +E +E W ++ + + D
Sbjct: 10 MLVILIVFTLFTATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNIDG 69
Query: 78 -EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGY----------- 125
E +RF I+ N+++ID N++N ++K+ N+FADLSNEE+ S YLG
Sbjct: 70 SEKDKRFEIFKDNLKFIDEHNAENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMMAR 129
Query: 126 NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGK 185
K + PSV LP SVDWR +GAV VKDQG CGSCWAFS +AAVEGINK+ TG+
Sbjct: 130 TKTRSNRYAPSVGD-KLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVTGE 188
Query: 186 LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK 245
LVSLSEQELVDCD + N GC+GG ME AFEFI GG+ +++DYPYRG + +C K
Sbjct: 189 LVSLSEQELVDCD-RTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVDGKCDQYKKN 247
Query: 246 HHAVTITGYEAIPA----------------------RYAFQLYSHGVFDEYCGHQLNHGV 283
V+I YE +PA FQLY G+F CG L+HGV
Sbjct: 248 ARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGKCGTALDHGV 307
Query: 284 TVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
T VGYG ++G YW+V+NSWG SWGE+GY+RM RN +S G CGI+MQ+SYP+K+
Sbjct: 308 TAVGYGTENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGIVMQSSYPIKK 363
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 149/309 (48%), Positives = 185/309 (59%), Gaps = 33/309 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E WL ++ + Y E +RF I+ N+ +ID N+QN ++K+ NKFAD +NEE+ +
Sbjct: 35 YEEWLVKHHKVYNGLGEKDQRFEIFKDNLGFIDEHNAQNYTYKVGLNKFADTTNEEYRNM 94
Query: 122 YLGYN----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
YLG K R+ LP VDWR +GAV +KDQG CGSCWAFS
Sbjct: 95 YLGTKNDAKRNVMKIKITTGHRYAFNSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFST 154
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
+A VE INK+ TGKLVSLSEQELVDCD + N+GCNGG M+ AFEFI + GG+ TE DYP
Sbjct: 155 IATVEAINKIVTGKLVSLSEQELVDCD-RAFNEGCNGGLMDYAFEFIVENGGIDTEQDYP 213
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHG 269
Y+G RC + V+I GYE +PA A QLY G
Sbjct: 214 YKGFEGRCDPTRKNAKVVSIDGYEDVPAYNENALKKAVFHQPVSVAIEAGGRALQLYQSG 273
Query: 270 VFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
VF CG L+HGV VVGYG ++G YWLV+NSWGT+WGE GY ++ RN N G CGI
Sbjct: 274 VFTGRCGTNLDHGVVVVGYGFENGVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGI 333
Query: 330 LMQASYPVK 338
MQASYPVK
Sbjct: 334 AMQASYPVK 342
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 151/339 (44%), Positives = 195/339 (57%), Gaps = 39/339 (11%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYS 87
A+L + G+ A + D SM R E+W+ QY R Y E R+F ++
Sbjct: 10 AILGCLCFFASGLAA-------RELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFK 62
Query: 88 SNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR------WPSVQYLG 141
+N +ID N++N F L N+FAD++NEEF T N+ R + +V
Sbjct: 63 ANAAFIDSFNAKNHKFWLGINQFADITNEEFKVTKTNKGFISNKVRASTGFSYENVSIDA 122
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LPA++DWR +GAVTPVKDQGQCG CWAFSAVAA EGI KL TGKLVSLSEQELVDCDV+
Sbjct: 123 LPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHG 182
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
E+QGC GG M+ AF+FI GG+T E YPY ++ +C++ A TI YE +PA
Sbjct: 183 EDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSG--SKSAGTIKSYEDVPANN 240
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWL 298
FQ YS GV CG L+HG+ +GYG G KYWL
Sbjct: 241 EGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWL 300
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+KNSWGTSWGE G++RM ++ G+CG+ M+ SYP
Sbjct: 301 MKNSWGTSWGENGFLRMEKDIADKK-GMCGLAMEPSYPT 338
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 148/318 (46%), Positives = 191/318 (60%), Gaps = 34/318 (10%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFAD 112
Y + + +E WL ++ + Y DE ++RF ++ N+ +I N+QN ++ L NKFAD
Sbjct: 27 YSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFAD 86
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQYLG----------LPASVDWRKEGAVTPVKDQGQ 162
++NEE+ + YLG + + R Q G LP VDWR +GAV P+KDQG
Sbjct: 87 ITNEEYRAMYLG-TRTDAKRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGN 145
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS VAAVEGIN + TG+ VSLSEQELVDCD ++GCNGG M+ AF+FI + G
Sbjct: 146 CGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCD-REYDEGCNGGLMDYAFQFIIQNG 204
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------- 260
G+ TE+DYPY+G + C K K V I GYE +P+
Sbjct: 205 GIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVSVAIEASG 264
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
A QLY GVF CG L+HGV VVGYG ++G YWLV+NSWGT WGE GY +M RN
Sbjct: 265 RALQLYQSGVFTGKCGTALDHGVVVVGYGTENGVDYWLVRNSWGTGWGEDGYFKMERNVR 324
Query: 321 SSNIGICGILMQASYPVK 338
S++ G CGI M SYPVK
Sbjct: 325 STSEGKCGIAMDCSYPVK 342
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 156/338 (46%), Positives = 207/338 (61%), Gaps = 35/338 (10%)
Query: 30 LSLFLLWVLGIPAGAWS-EGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+SL LL+ LG W+ + + SM ER E W+ +Y++ Y +E ++RF I+
Sbjct: 10 ISLALLFCLGF----WAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKE 65
Query: 89 NVQYID-YINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPY--NEPRWPSVQY---LGL 142
NV YI+ + N+ N +KL N+FADL+NEEFI+ + + R + +Y L
Sbjct: 66 NVNYIEAFNNAANKPYKLGINQFADLTNEEFIAPRNRFKGHMCSSITRTTTFKYENVTAL 125
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P++VDWR++GAVTP+KDQGQCG CWAFSAVAA EGI+ L +GKL+SLSEQE+VDCD E
Sbjct: 126 PSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGE 185
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA 262
+QGC GG+M+ AF+FI + G+ TE +YPY+ + +C ++ +HA TITGYE +P
Sbjct: 186 DQGCAGGFMDGAFKFIIQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNE 245
Query: 263 ----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLV 299
FQ Y GVF CG QL+HGVT VGYG G +YWLV
Sbjct: 246 KALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSADGTQYWLV 305
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWGT WGE GYI M R + G+CGI M ASYP
Sbjct: 306 KNSWGTEWGEEGYIMMQRGVKAQE-GLCGIAMMASYPT 342
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 154/322 (47%), Positives = 202/322 (62%), Gaps = 38/322 (11%)
Query: 53 YDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W ++ + Y + E +RR+ + N++YID N+ SF+L
Sbjct: 28 YGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRL 87
Query: 106 TDNKFADLSNEEFISTYLGY-NKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKD 159
N+FADL+NEE+ TYLG NKP E R S +YL LP SVDWR +GAV +KD
Sbjct: 88 GLNRFADLTNEEYRDTYLGLRNKPRRE-RKVSDRYLAADNEALPESVDWRTKGAVAEIKD 146
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
Q GSCWAFSA+AAVEGIN++ TG L+SLSEQELVDCD S N+GCNGG M+ AF+FI
Sbjct: 147 QEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT-SYNEGCNGGLMDYAFDFII 205
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------- 260
GG+ TEDDYPY+GK++RC ++ VTI YE +
Sbjct: 206 NNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSVAIE 265
Query: 261 ---YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
AFQLYS G+F CG L+HGV VGYG ++G+ YW+V+NSWG SWGE+GY+RM R
Sbjct: 266 AGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYVRMER 325
Query: 318 NSPSSNIGICGILMQASYPVKR 339
N +S+ G CGI ++ SYP+K+
Sbjct: 326 NIKASS-GKCGIAVEPSYPLKK 346
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 164/365 (44%), Positives = 214/365 (58%), Gaps = 35/365 (9%)
Query: 6 FIAIYT-NLHLKIAI---DMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER 61
F+AI T N H+ I D M+ +N + L +L A+ D SM ER
Sbjct: 76 FLAISTHNSHVLNYIFKRDSTMVAKNHFYHISLAMLLCTAFLAFQVTCCTLQDA-SMYER 134
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSNEEFIS 120
E W+ ++ + Y E ++RF I++ NV Y++ + N+ N +KL N+F DL+N+EFI+
Sbjct: 135 HEQWMTRHGKVYKDPREREKRFRIFNENVNYVEAFNNAANKPYKLGINQFXDLTNQEFIA 194
Query: 121 TYLGYNKPYNEP--RWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
+ R + +Y +P++VDWR+ GAVTPVKDQGQCG CWAFSAVAA
Sbjct: 195 PRNRFKGHMCSSIIRTTTFKYENVTTVPSTVDWRQNGAVTPVKDQGQCGCCWAFSAVAAT 254
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGI+ L GKL+SLSEQELVDCD +QGC GG M+ A++FI + G+ TE +YPY+G
Sbjct: 255 EGIHALSGGKLISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNHGLNTEANYPYKGV 314
Query: 236 NDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVFDE 273
+ +C ++ +HA TITGYE +PA FQ Y G F
Sbjct: 315 DGKCNANEAANHAATITGYEDVPANNEKALQKAVANQPVSVAIDASSSDFQFYKSGAFTG 374
Query: 274 YCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG +L+HGVT VGYG DHG KYWLVKNSWGT WGE GYIRM R S G+CGI MQ
Sbjct: 375 SCGTELDHGVTAVGYGVSDHGTKYWLVKNSWGTEWGEEGYIRMQRGVDSEE-GVCGIAMQ 433
Query: 333 ASYPV 337
ASYP
Sbjct: 434 ASYPT 438
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 152/337 (45%), Positives = 198/337 (58%), Gaps = 33/337 (9%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL LL+ G A + + SM ER E W+ +Y++ Y E +RRF I+ N
Sbjct: 10 ISLALLFCSGFLAFQVT---CRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66
Query: 90 VQYID-YINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQY---LGLP 143
V YI+ + N+ N + L N+FADL+NEEFI+ + R + +Y +P
Sbjct: 67 VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPRNRFKGHMCSSITRTTTFKYENVTAIP 126
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
++VDWR++GAVTP+KDQGQCG CWAFSAVAA EGI+ L GKL+SLSEQE+VDCD E+
Sbjct: 127 STVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTKGED 186
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA- 262
QGC GG+M+ AF+FI + G+ E +YPY+ + +C +H TITGYE +P
Sbjct: 187 QGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVNNEK 246
Query: 263 ---------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVK 300
FQ Y GVF CG +L+HGVT VGYG G +YWLVK
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVK 306
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWGT WGE GYIRM R + G+CGI M ASYP
Sbjct: 307 NSWGTEWGEEGYIRMQRGVKAEE-GLCGIAMMASYPT 342
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 193/315 (61%), Gaps = 33/315 (10%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFAD 112
+ +M R E W+ QYSR Y E RRF ++ +NV++I+ N+ N F L N+FAD
Sbjct: 29 EDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKFIESFNTGGNRKFWLGINQFAD 88
Query: 113 LSNEEFISTYL------GYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
L+N+EF +T +K R+ +V +PA++DWR GAVTP+KDQGQCG C
Sbjct: 89 LTNDEFRTTKTNKGFKPSLDKVSTGFRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCC 148
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSAVAA EGI K+ TGKL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+TT
Sbjct: 149 WAFSAVAATEGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTT 208
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E +YPY + +C++ + A I GYE +P FQ
Sbjct: 209 ESNYPYTAADGKCKSG--SNSAANIKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQ 266
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE GY+RM ++ S
Sbjct: 267 FYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKD-ISDK 325
Query: 324 IGICGILMQASYPVK 338
G+CG+ M+ SYP +
Sbjct: 326 KGMCGLAMEPSYPTE 340
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 155/336 (46%), Positives = 195/336 (58%), Gaps = 52/336 (15%)
Query: 30 LSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+ L LL+VL AW S+ + SM ER E+W+ QY R Y DE +R+ I+
Sbjct: 10 ICLALLFVLA----AWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKD 65
Query: 89 NVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPA 144
NV I+ N + + S+KL+ N+FADL+NEEF ++ + S +Y +P+
Sbjct: 66 NVARIESFNKAMDKSYKLSINEFADLTNEEFGTSRNRFKAHICSTEATSFKYENVTAVPS 125
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
++DWRK+GAVTP+KDQGQCGSCWAFSAVAA+EGI +L TGKL+SLSEQELVDCD + E+Q
Sbjct: 126 TIDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---- 260
GCNG +YPY G + C K H A I GYE +PA
Sbjct: 186 GCNGA-------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 226
Query: 261 ------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
+ FQ YS GVF CG +L+HGV VGYG D G KYWLVKN
Sbjct: 227 LQKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 286
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWGT WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 287 SWGTGWGEEGYIRMQRDVTAKE-GLCGIAMQASYPT 321
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 146/310 (47%), Positives = 188/310 (60%), Gaps = 30/310 (9%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSN 115
SM ER E W+ +Y++ Y E +RRF I+ NV YI+ + N+ N + L N+FADL+N
Sbjct: 34 SMYERHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTN 93
Query: 116 EEFISTYLGYNKPYNEP--RWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
EEFI+ + R + +Y +P++VDWR++GAVTP+KDQGQCG CWAFS
Sbjct: 94 EEFIAPRNRFKGHMCSSITRTTTFKYENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFS 153
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
AVAA EGI+ L GKL+SLSEQE+VDCD E+QGC GG+M+ AF+FI + G+ E +Y
Sbjct: 154 AVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNY 213
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSH 268
PY+ + +C +H TITGYE +P FQ Y
Sbjct: 214 PYKAVDGKCNAKAAANHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDFQFYQS 273
Query: 269 GVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVF CG +L+HGVT VGYG G +YWLVKNSWGT WGE GYIRM R + G+C
Sbjct: 274 GVFTGSCGTELDHGVTAVGYGVSADGTEYWLVKNSWGTEWGEEGYIRMQRGVKAEE-GLC 332
Query: 328 GILMQASYPV 337
GI M ASYP
Sbjct: 333 GIAMMASYPT 342
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 150/308 (48%), Positives = 197/308 (63%), Gaps = 39/308 (12%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFIS 120
++ W++++ + Y S E+++RF I+ NV YI+ N++ N S L NKFADL+N EF
Sbjct: 38 YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEFRG 97
Query: 121 TYLGYNK---PYNEPRWPSVQYLGLPA----SVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
Y+G + P++E V + L A SVDWRK+G VT +KDQG CGSCWAFSAVA
Sbjct: 98 LYVGRLQRPAPFHE-----VGDIALVADTATSVDWRKKGGVTEIKDQGDCGSCWAFSAVA 152
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEG+ L TG LVSLSEQELVDCD + NQGC+GG M+ AF+++ + GG+T++ +YPYR
Sbjct: 153 AVEGLTFLSTGTLVSLSEQELVDCDT-TVNQGCDGGIMDYAFQYMIRNGGITSQSNYPYR 211
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVF 271
C DK K+HA TI G++AIP + FQLYS GVF
Sbjct: 212 ALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYSSGVF 271
Query: 272 DEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
CG L+HGV +VGYG D G +YWLVKNSWG+ WGE+GY+RM R P + G+CGI
Sbjct: 272 TGECGSNLDHGVAIVGYGTDAGGRQYWLVKNSWGSGWGESGYVRMERQGPGA--GVCGIN 329
Query: 331 MQASYPVK 338
+ ASYP K
Sbjct: 330 LDASYPTK 337
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 142/307 (46%), Positives = 193/307 (62%), Gaps = 32/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLSNEEF 118
++ WL + R Y + E +RRF ++ N++++D N+ ++ F+L N+FADL+N+EF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 119 ISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
ST+LG K R +Y LP SVDWR++GAV PVK+QGQCGSCWAFSAV+
Sbjct: 109 RSTFLGA-KVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVS 167
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
VE IN+L TG++++LSEQELV+C N +N GCNGG M+ AF+FI K GG+ TEDDYPY+
Sbjct: 168 TVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYK 227
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
+ +C ++ V+I G+E +P FQLY GVF
Sbjct: 228 AVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVF 287
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG L+HGV VGYG D+G+ YW+V+NSWG WGE+GY+RM RN ++ G CGI M
Sbjct: 288 SGRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERN-INATTGKCGIAM 346
Query: 332 QASYPVK 338
ASYP K
Sbjct: 347 MASYPTK 353
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 154/344 (44%), Positives = 200/344 (58%), Gaps = 35/344 (10%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
M + +L+L LL + S+ + SM ER E W+K+Y + Y E Q+R
Sbjct: 4 MGKKQHILALVLLLSI-----CTSQVMSRNLHEASMSERHEQWMKKYGKVYKDAAEKQKR 58
Query: 83 FGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV--QY 139
I+ NV++I+ N+ N +KL+ N AD +NEEF++++ GY + + P
Sbjct: 59 LLIFKDNVEFIESFNAAGNKPYKLSINHLADQTNEEFVASHNGYKYKGSHSQTPFKYGNV 118
Query: 140 LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDV 199
+P +VDWR+ GAVT VKDQGQCGSCWAFS VAA EGI ++ TG L+SLSEQELVDCD
Sbjct: 119 TDIPTAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCD- 177
Query: 200 NSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA 259
S + GC+GG ME FEFI K GG+++E +YPY + C K A I GYE +PA
Sbjct: 178 -SVDHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPA 236
Query: 260 R----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG--EDHGEK 295
FQ YS GVF CG QL+HGVTVVGYG +D +
Sbjct: 237 NSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGTHE 296
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
YW+VKNSWGT WGE GYIRM R + G+CGI M ASYP+ +
Sbjct: 297 YWIVKNSWGTQWGEEGYIRMQRGIDAQE-GLCGIAMDASYPMGK 339
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 193/318 (60%), Gaps = 38/318 (11%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN--LSFKLTDNKFA 111
D +M++R W+ ++ R Y +E R+ ++ NV+ I+ +N L+FKL N+FA
Sbjct: 29 DEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFA 88
Query: 112 DLSNEEFISTYLGY---------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
DL+NEEF S Y GY KP R+ V LP SVDWRK+GAVTP+KDQG
Sbjct: 89 DLTNEEFRSMYTGYKGNSVLSSRTKP-TSFRYQHVSSDALPISVDWRKKGAVTPIKDQGS 147
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFSAVAA+EG+ ++K GKL+SLSEQELVDCD N + GC GGYM AF + G
Sbjct: 148 CGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDD--GCMGGYMNSAFNYTMTTG 205
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------- 260
G+T+E +YPY+ + C +KTK A +I G+E +PA
Sbjct: 206 GLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGG 265
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQ YS GVF C L+HGV VVGYG+ +G KYW++KNSWG WGE GY+R+ +++
Sbjct: 266 TGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKKDT 325
Query: 320 PSSNIGICGILMQASYPV 337
+ + G CG+ M ASYP
Sbjct: 326 KAKH-GQCGLAMNASYPT 342
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 146/307 (47%), Positives = 192/307 (62%), Gaps = 30/307 (9%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSNE 116
M +R E W+ Q+ R YG E ++R+ I+ N++ I+ + N + +KL NKFADL+NE
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 117 EFISTYLGYNKPYNEPRWPSVQYLGL---PASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
EF + Y GY + ++ S +Y L P S+DWR +GAVTPVKDQG CG CWAFS VA
Sbjct: 61 EFRAMYHGYKRQSSKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVA 120
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
A+EGI KL+TG L+SLSEQ+LVDC + N+GC GG M+ AF++I + GG+T+ED+YPY+
Sbjct: 121 AIEGIIKLQTGNLISLSEQQLVDC--TAGNKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQ 178
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVF 271
G + C ++K ITGYE +P F+ Y GVF
Sbjct: 179 GVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFYKSGVF 238
Query: 272 DEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
+ CG LNHGVT +GYG D G YWLVKNSWGTSWGE+GY RM R +S G+CG+
Sbjct: 239 EGDCGTNLNHGVTAIGYGTDSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASE-GLCGVA 297
Query: 331 MQASYPV 337
M ASYP
Sbjct: 298 MDASYPT 304
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 153/317 (48%), Positives = 192/317 (60%), Gaps = 39/317 (12%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S+ + +E W + SR G E +RF ++ +NV ++ N + +KL NKFAD+
Sbjct: 34 ESLWDLYERWRSHHTVSRSLG---EKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADM 90
Query: 114 SNEEFISTYLG----YNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCG 164
+N EF STY G ++K + + S ++ +PASVDWRK+GAVT VKDQGQCG
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCG 150
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS + AVEGIN++KT KLVSLSEQELVDCD ENQGCNGG ME AFEFI + GG+
Sbjct: 151 SCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGI 209
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE +YPY+ + C K AV+I G+E +P
Sbjct: 210 TTESNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF C LNHGV +VGYG G YW+V+NSWG WGE GYIRM RN S
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN-IS 328
Query: 322 SNIGICGILMQASYPVK 338
G+CGI M ASYP+K
Sbjct: 329 KKEGLCGIAMMASYPIK 345
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 144/305 (47%), Positives = 188/305 (61%), Gaps = 32/305 (10%)
Query: 65 WLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG 124
W+ ++ + Y E ++RF I+ N+++ID N+QN ++K+ N+FADL+NEE+ + YLG
Sbjct: 49 WMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQNRTYKVGLNRFADLTNEEYRAIYLG 108
Query: 125 --------YNKPYN-EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
+ K N PR+ + LP SVDWR+ GAV PVKDQ CGSCWAFS VAAV
Sbjct: 109 TRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAV 168
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGIN++ TG+L+SLSEQELVDCD + GCNGG M+ AF+FI K GG+ TE DYPY G
Sbjct: 169 EGINQIVTGELISLSEQELVDCDTEYD-MGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGF 227
Query: 236 NDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFDE 273
+ C V+I GYE +P A QLY G+F
Sbjct: 228 DGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTG 287
Query: 274 YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
CG L+HG+ VGYG ++G YW+V+NSWG+SWGE GYIRM RN + G CGI M+A
Sbjct: 288 ECGTALDHGIVAVGYGTENGTDYWIVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEA 347
Query: 334 SYPVK 338
SYP+K
Sbjct: 348 SYPIK 352
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 147/318 (46%), Positives = 191/318 (60%), Gaps = 34/318 (10%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFAD 112
Y + + +E WL ++ + Y DE ++RF ++ N+ +I N+QN ++ L NKFAD
Sbjct: 27 YSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGLNKFAD 86
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQYLG----------LPASVDWRKEGAVTPVKDQGQ 162
++N+E+ + YLG + + R Q G LP VDWR +GAV P+KDQG
Sbjct: 87 ITNKEYRAMYLG-TRTDAKRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIKDQGN 145
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS VAAVEGIN + TG+ VSLSEQELVDCD ++GCNGG M+ AF+FI + G
Sbjct: 146 CGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCD-REYDEGCNGGLMDYAFQFIIQNG 204
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------- 260
G+ TE+DYPY+G + C K K V I GYE +P+
Sbjct: 205 GIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVSVAIEASG 264
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
A QLY GVF CG L+HGV VVGYG ++G YWLV+NSWGT WGE GY +M RN
Sbjct: 265 RALQLYQSGVFTGKCGTALDHGVVVVGYGTENGVDYWLVRNSWGTGWGEDGYFKMERNVR 324
Query: 321 SSNIGICGILMQASYPVK 338
S++ G CGI M SYPVK
Sbjct: 325 STSEGKCGIAMDCSYPVK 342
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 187/313 (59%), Gaps = 32/313 (10%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D SM R E+W+ QY R Y E +F ++ +N +ID N+ N F L N+FAD+
Sbjct: 29 DDLSMVARHESWMLQYGRVYKDAAEKASKFEVFKANAGFIDSFNAGNHKFWLGINQFADI 88
Query: 114 SNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+N+EF +T N+ R P+ V + LPAS+DWR +GAVTPVKDQGQCG CW
Sbjct: 89 TNKEFKATKTNKGFISNKVRAPTGFSYENVSFDALPASIDWRTKGAVTPVKDQGQCGCCW 148
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA EGI KL TGKLVSLSEQELVDCDV+ E+QGC GG M+ AF+FI GG+T E
Sbjct: 149 AFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIISNGGLTQE 208
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
YPY ++ +C++ A TI YE +PA FQ
Sbjct: 209 SSYPYDAEDGKCKSG--SKSAGTIKSYEDVPANNEGALMKAVANQPVSVAVDGGDMTFQF 266
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GV CG L+HG+ +GYG G KYWL+KNSWGTSWGE G++RM ++
Sbjct: 267 YSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKYWLMKNSWGTSWGENGFLRMEKDIADKK- 325
Query: 325 GICGILMQASYPV 337
G+CG+ M+ SYP
Sbjct: 326 GMCGLAMEPSYPT 338
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 157/345 (45%), Positives = 208/345 (60%), Gaps = 49/345 (14%)
Query: 30 LSLFLLWVLGIPAGAWS-EGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+SL LL+ LG W+ + + SM ER E W+ +Y++ Y +E ++RF I+
Sbjct: 10 ISLALLFCLGF----WAFQVTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKE 65
Query: 89 NVQYID-YINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNE---------PRWPSVQ 138
NV YI+ + N+ + +KL N+FADL+NEEFI+ P N+ R + +
Sbjct: 66 NVNYIEAFNNAADKPYKLGINQFADLTNEEFIA-------PRNKFKGHMCSSITRTTTFK 118
Query: 139 Y---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
Y LP++VDWR++GAVTP+KDQGQCG CWAFSAVAA EGI+ L +GKL+SLSEQE+V
Sbjct: 119 YENVTALPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVV 178
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DCD E+QGC GG+M+ AF+FI + G+ TE +YPY+ + +C ++ +HA TITGYE
Sbjct: 179 DCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYE 238
Query: 256 AIPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDH 292
+P FQ Y GVF CG QL+HGVT VGYG
Sbjct: 239 DVPVNNEKALQKAVANQPVSVAIDASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSAD 298
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G +YWLVKNSWGT WGE GYI M R + G+CGI M ASYP
Sbjct: 299 GTQYWLVKNSWGTEWGEEGYIMMQRGVKAQE-GLCGIAMMASYPT 342
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 148/312 (47%), Positives = 191/312 (61%), Gaps = 32/312 (10%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFADLSN 115
+M R E W+ Q+ R Y E RR ++ +NV +I+ N+ + + L N+FADL++
Sbjct: 39 AMAARHERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTS 98
Query: 116 EEFISTYL---GYNKPYNEPR------WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
EEF +T G++ P N R + +V LPASVDWR +GAVT +KDQGQCG C
Sbjct: 99 EEFKATMTNSKGFSTPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCC 158
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSAVAA+EGI KL TGKL+SLSEQELVDCDV+ +QGC GG ++ AF+FI GG+T
Sbjct: 159 WAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTA 218
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR--------------------YAFQLY 266
E +YPY ++ RC+T A +I GYE +PA FQ Y
Sbjct: 219 EANYPYTAEDGRCKTTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDASKFQFY 278
Query: 267 SHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
GV CG L+HGVTV+GYG G KYWLVKNSWGT+WGEAGY+RM ++ G
Sbjct: 279 GGGVMAGECGTSLDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKR-G 337
Query: 326 ICGILMQASYPV 337
+CG+ MQ SYP
Sbjct: 338 MCGLAMQPSYPT 349
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 146/308 (47%), Positives = 187/308 (60%), Gaps = 32/308 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E WL ++ + Y + E +RF I+ N+++ID N+ N ++KL N+FADL+NEE+ +
Sbjct: 4 YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNADNRTYKLGLNRFADLTNEEYRAR 63
Query: 122 YLGY----NKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
YLG N+ + + + S +Y LP SVDWR E AV PVKDQG CGSCWAFS +
Sbjct: 64 YLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWAFSTI 123
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AVEGINK+ TG L+SLSEQELVDCD S NQGCNGG M+ A+EFI GG+ +E+DYPY
Sbjct: 124 GAVEGINKIVTGDLISLSEQELVDCDT-SYNQGCNGGLMDYAYEFIINNGGIDSEEDYPY 182
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
R + C + VTI YE +PA FQLY GV
Sbjct: 183 RAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYVSGV 242
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
F CG L+HGV VGYG G YW+V+NSWG SWGE GY+R+ RN S G CGI
Sbjct: 243 FTGRCGTALDHGVVAVGYGSVKGHDYWIVRNSWGASWGEEGYVRLERNLAKSRSGKCGIA 302
Query: 331 MQASYPVK 338
++ SYP+K
Sbjct: 303 IEPSYPIK 310
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 144/310 (46%), Positives = 186/310 (60%), Gaps = 31/310 (10%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
+ R E W+ +Y R Y E RR ++ +NV +I+ +N+ N F L N+FAD++ +E
Sbjct: 29 IAARHEQWMARYGRVYSDVAEKARRLEVFKANVGFIESVNAGNHKFWLEANQFADITKDE 88
Query: 118 FISTYLGYNKPY-------NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
F + + GY R+ +V LPASVDWR GAVTPVKDQGQCG CWAFS
Sbjct: 89 FRAMHKGYKMQVIGSKARATGFRYANVSIDDLPASVDWRANGAVTPVKDQGQCGCCWAFS 148
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
VA++EGI K+ TGKL+SLSEQELVDCDV +N+GC GG M+ AFEFI GG+ TE DY
Sbjct: 149 TVASMEGIVKVSTGKLISLSEQELVDCDVGMQNKGCGGGLMDNAFEFIVNNGGLDTEADY 208
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSH 268
PY G + C ++K + A +I GYE +PA F+ Y
Sbjct: 209 PYTGADGTCNSNKESNIAASIKGYEDVPANDEASLQKAVAAQPVSIAVDGGDDLFRFYKG 268
Query: 269 GVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GV CG +L+HGV VGYG G KYWLVKNSWGTSWGE G+IR+ R+ + G+C
Sbjct: 269 GVLTGACGTELDHGVAAVGYGVAGDGTKYWLVKNSWGTSWGEDGFIRLERD-VADEAGMC 327
Query: 328 GILMQASYPV 337
G+ M+ SYP
Sbjct: 328 GLAMKPSYPT 337
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 159/358 (44%), Positives = 204/358 (56%), Gaps = 43/358 (12%)
Query: 11 TNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYP--QKYDPQSMEERFENWLKQ 68
T L +AI +L +A+ F + GY Q + + E FE+W+ +
Sbjct: 9 TKFSLLVAISASALLCSALARDFSIV-----------GYTPEQLTSTEKLLELFESWMSE 57
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKP 128
+S+ Y S +E RF ++ N+ +ID N++ S+ L N+FADL++EEF YLG KP
Sbjct: 58 HSKVYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRYLGLAKP 117
Query: 129 -YNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLK 182
++ R PS + LP SVDWRK+GAV PVKDQGQCGSCWAFS VAAVEGIN++
Sbjct: 118 QFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVAAVEGINQIT 177
Query: 183 TGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTD 242
TG L SLSEQEL+DCD + N GCNGG M+ AF++I GG+ EDDYPY + CQ
Sbjct: 178 TGNLSSLSEQELIDCDT-TFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYLMEEGICQEQ 236
Query: 243 KTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLN 280
K VTI+GYE +P + FQ Y GVF+ CG L+
Sbjct: 237 KEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVFNGQCGTDLD 296
Query: 281 HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
HGV VGYG G Y +VKNSWG WGE G+IRM RN+ G+CGI ASYP K
Sbjct: 297 HGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPE-GLCGINKMASYPTK 353
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 156/334 (46%), Positives = 204/334 (61%), Gaps = 50/334 (14%)
Query: 53 YDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W ++ + Y + E +RR+ + N++YID N+ SF+L
Sbjct: 28 YGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRL 87
Query: 106 TDNKFADLSNEEFISTYLGY-NKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKD 159
N+FADL+NEE+ TYLG NKP E R S +YL LP SVDWR +GAV +KD
Sbjct: 88 GLNRFADLTNEEYRDTYLGLRNKPRRE-RKVSDRYLAADNEALPESVDWRTKGAVAEIKD 146
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QG CGSCWAFSA+AAVEGIN++ TG L+SLSEQELVDCD S N+GCNGG M+ AF+FI
Sbjct: 147 QGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDT-SYNEGCNGGLMDYAFDFII 205
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKH------------HAVTITGYEAIPAR------- 260
GG+ TEDDYPY+GK++RC ++ VTI YE +
Sbjct: 206 NNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSETSLQK 265
Query: 261 ---------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGT 305
AFQLYS G+F CG L+HGV VGYG ++G+ YW+V+NSWG
Sbjct: 266 AVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGTENGKDYWIVRNSWGK 325
Query: 306 SWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
SWGE+GY+RM RN +S+ G CGI ++ SYP+K+
Sbjct: 326 SWGESGYVRMERNIKASS-GKCGIAVEPSYPLKK 358
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 150/309 (48%), Positives = 199/309 (64%), Gaps = 34/309 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFIS 120
+++W+ Q+ + Y E ++RF I+ N+++ID NS N ++KL NKFADL+N+E+ +
Sbjct: 46 YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYRA 105
Query: 121 TYLGYN----KPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
+LG + + + PS +Y LP SV+WR GAV+ VKDQG CGSCWAFSA
Sbjct: 106 KFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSCGSCWAFSA 165
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
+AAVEGINK+ +G+L+SLSEQELVDCD S + GCNGG M+ AF+FI GG+ TE DYP
Sbjct: 166 IAAVEGINKIVSGELISLSEQELVDCD-RSYDAGCNGGLMDYAFQFIIDNGGIDTEKDYP 224
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------YAFQLYSHGV 270
Y G N++C K V+I GYE +P AFQLY GV
Sbjct: 225 YLGFNNQCDPTKKNAKVVSIDGYEDVPNNENALKKAVAHQPVSIAIEAGGRAFQLYESGV 284
Query: 271 FDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
F+ CG L+HGV VGYG +D+G+ YW+V+NSWG +WGE GYIRM RN ++N G CGI
Sbjct: 285 FNGECGLALDHGVVAVGYGSDDNGQDYWIVRNSWGGNWGENGYIRMERN-INANTGKCGI 343
Query: 330 LMQASYPVK 338
M+ASYPVK
Sbjct: 344 AMEASYPVK 352
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 153/317 (48%), Positives = 191/317 (60%), Gaps = 39/317 (12%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S+ + +E W + SR G E +RF ++ +NV ++ N + +KL NKFAD+
Sbjct: 34 ESLWDLYERWRSHHTVSRSLG---EKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADM 90
Query: 114 SNEEFISTYLG----YNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCG 164
+N EF STY G ++K + + S ++ +PASVDWRK+GAVT VKDQGQCG
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCG 150
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS + AVEGIN++KT KLVSLSEQELVDCD ENQGCNGG ME AFEFI + GG+
Sbjct: 151 SCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGI 209
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE +YPY + C K AV+I G+E +P
Sbjct: 210 TTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF C LNHGV +VGYG G YW+V+NSWG WGE GYIRM RN S
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN-IS 328
Query: 322 SNIGICGILMQASYPVK 338
G+CGI M ASYP+K
Sbjct: 329 KKEGLCGIAMMASYPIK 345
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 280 bits (716), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 142/306 (46%), Positives = 194/306 (63%), Gaps = 33/306 (10%)
Query: 65 WLKQYSREYGSEDEWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSNEEFISTYL 123
WL ++S+ Y E ++RF I+ +N+++ID + NS+N ++K+ +FADL+NEE+ + +L
Sbjct: 51 WLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFL 110
Query: 124 GYN----KPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
G + + + PS +Y LP S+DWR+ GAV+ +KDQG CGSCWAFS +AA
Sbjct: 111 GTKSDPKRRLMKSKNPSQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAA 170
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEG+NK+ TG+L+SLSEQELVDCD S N GCNGG M+ AF+FI GG+ T+ DYPY+
Sbjct: 171 VEGVNKIVTGELISLSEQELVDCD-RSYNAGCNGGLMDNAFQFIINNGGIDTDKDYPYQA 229
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPA----------------------RYAFQLYSHGVFD 272
+ +C T K K+ AVTI G+E + A A Q Y GVF
Sbjct: 230 VDGKCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVSVAIEASGMALQFYQSGVFT 289
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG L+HGV +VGYG + G YWLV+NSWG WGE GYI+M RN + G CGI M+
Sbjct: 290 GECGSALDHGVVIVGYGTEDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGKCGIAME 349
Query: 333 ASYPVK 338
+SYP+K
Sbjct: 350 SSYPIK 355
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 280 bits (716), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 188/313 (60%), Gaps = 32/313 (10%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D SM R E W+ QY R Y E ++F ++ +N ++ID N++N F L N+FADL
Sbjct: 29 DDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFKANARFIDSFNAENHKFWLGINQFADL 88
Query: 114 SNEEFISTYLGYNKPYNEPR------WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+NEEF +T N+ R + +++ LP S+DWR +GAVTPVKDQGQCG CW
Sbjct: 89 TNEEFKATKTNKGFISNKARVSTGFKYENLKIEALPTSIDWRTKGAVTPVKDQGQCGCCW 148
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA EGI KL TGKLVSLSEQELVDCDV+ E+QGC GG M+ AF+FI GG+T E
Sbjct: 149 AFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNGGLTQE 208
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
YPY ++ +C++ A TI YE +PA FQ
Sbjct: 209 SSYPYDAEDGKCKSG--SKSAGTIKSYEDVPANNEGALMKAVANQPVSVAVDGGDMTFQF 266
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GV CG L+HG+ +GYG G K+WL+KNSWGT+WGE G++RM ++
Sbjct: 267 YSGGVMTGSCGTDLDHGIAAIGYGVTSDGTKFWLMKNSWGTTWGENGFLRMEKDIADKK- 325
Query: 325 GICGILMQASYPV 337
G+CG+ M+ SYP
Sbjct: 326 GMCGLAMEPSYPT 338
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 280 bits (716), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 142/306 (46%), Positives = 191/306 (62%), Gaps = 31/306 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ--NLSFKLTDNKFADLSNEEFI 119
++ WL + R Y + E +RRF ++ N+++ D N++ + F+L N+FADL+NEEF
Sbjct: 54 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 113
Query: 120 STYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
+T+LG K R +Y LP SVDWR++GAV PVK+QGQCGSCWAFSAV+
Sbjct: 114 ATFLGA-KVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVST 172
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VE IN+L TG++++LSEQELV+C N +N GCNGG M+ AF+FI K GG+ TEDDYPY+
Sbjct: 173 VESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKA 232
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD 272
+ +C ++ V+I G+E +P FQLY GVF
Sbjct: 233 VDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFS 292
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG L+HGV VGYG D+G+ YW+V+NSWG WGE+GY+RM RN + G CGI M
Sbjct: 293 GRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERN-INVTTGKCGIAMM 351
Query: 333 ASYPVK 338
ASYP K
Sbjct: 352 ASYPTK 357
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 154/337 (45%), Positives = 201/337 (59%), Gaps = 33/337 (9%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL LL+ +G A + + SM ER W+ +Y++ Y E ++RF I+ N
Sbjct: 10 ISLALLFCMGFLAFQVT---CRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIFKEN 66
Query: 90 VQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQYLG---LP 143
V YI+ NS N S+KL N+FADL+NEEFI+ + R + +Y +P
Sbjct: 67 VNYIETFNSADNKSYKLDINQFADLTNEEFIAPRNRFKGHMCSSITRTTTFKYENVTVIP 126
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
++VDWR++GAVTP+KDQGQCG CWAFSAVAA EGI+ L GKL+SLSEQE+VDCD ++
Sbjct: 127 STVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQD 186
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA- 262
QGC GG+M+ AF+FI + G+ TE +YPY+ + +C +HA TITGYE +P
Sbjct: 187 QGCAGGFMDGAFKFIIQNHGLNTEPNYPYKAADGKCNAKAAANHAATITGYEDVPVNNEK 246
Query: 263 ---------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVK 300
FQ Y GVF CG +L+HGVT VGYG G +YWLVK
Sbjct: 247 ALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSADGTEYWLVK 306
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWGT WGE GYIRM R + G+CGI M ASYP
Sbjct: 307 NSWGTEWGEEGYIRMQRGVKAEE-GLCGIAMMASYPT 342
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 147/311 (47%), Positives = 188/311 (60%), Gaps = 35/311 (11%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEF 118
ER ENW+ QY + Y E ++RF I+ +NV +I+ N+ + F L+ N+FADL +EEF
Sbjct: 36 ERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLSINQFADLHDEEF 95
Query: 119 ISTYLGYNKPY---------NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
+ NK E + + L A++DWRK GAVTP+KDQ +CGSCWAF
Sbjct: 96 KALLTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGAVTPIKDQRRCGSCWAF 155
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
SAVAA+EGI+++ T KLVSLSEQELVDC V E++GCNGGYME AFEF+ K GG+ +E
Sbjct: 156 SAVAAIEGIHQITTSKLVSLSEQELVDC-VKGESEGCNGGYMEDAFEFVAKKGGIASESY 214
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAIPARY----------------------AFQLYS 267
YPY+GK+ C+ K H I GYE +P+ AFQ YS
Sbjct: 215 YPYKGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSVYVEAGGNAFQFYS 274
Query: 268 HGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
G+F CG +H +TVVGYG+ G KYWLVKNSWG WGE GYIRM R+ + G+
Sbjct: 275 SGIFTGKCGTNTDHAITVVGYGKSRGGTKYWLVKNSWGAGWGEKGYIRMKRDIRAKE-GL 333
Query: 327 CGILMQASYPV 337
CGI M A YP
Sbjct: 334 CGIAMNAFYPT 344
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 280 bits (716), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 149/290 (51%), Positives = 186/290 (64%), Gaps = 31/290 (10%)
Query: 78 EWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--R 133
E ++R I++ NV YI+ NS N +KL+ NKFADL+NEEFI++ + R
Sbjct: 3 EREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIASRNKFKGHMCSSIIR 62
Query: 134 WPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
+ +Y +P++VDWRK+GAVTPVK+QGQCGSCWAFSAVAA EGI++L TGKLVSLS
Sbjct: 63 TTTFKYENASAIPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLS 122
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQEL+DCD +QGC GG M+ AF+FI + G++TE YPY G + C +K HAVT
Sbjct: 123 EQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKASIHAVT 182
Query: 251 ITGYEAIPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGY 288
ITGYE +PA FQ Y+ GVF CG +L+HGVT VGY
Sbjct: 183 ITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGY 242
Query: 289 G-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G + G KYWLVKNSWG WGE GYIRM R ++ G+CGI MQASYP
Sbjct: 243 GVGNDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAE-GLCGIAMQASYPT 291
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 155/349 (44%), Positives = 207/349 (59%), Gaps = 36/349 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREY---GSED 77
M + +++L + + A + S PQ+ D + M ++ W ++ + + G+E
Sbjct: 1 MGTFQSSPIMALLFFLFIALSAASPSSIIPQRTDDEVMA-LYDQWRAKHGKLHNNLGAEP 59
Query: 78 EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR-WPS 136
E RF I+ N+++ID IN+QNL ++L N FADL+NEE+ S YLG R S
Sbjct: 60 E--NRFHIFKDNLKFIDEINAQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTS 117
Query: 137 VQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSE 191
+YL LP S+DWR +GAV PVKDQG CGSCWAFS VA+VE IN++ TG L++LSE
Sbjct: 118 NRYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSE 177
Query: 192 QELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTI 251
QELVDCD S N+GCNGG M+ AFEFI + GG+ TE+DYPY G + C K V I
Sbjct: 178 QELVDCD-RSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVAI 236
Query: 252 TGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG 289
YE +P +FQLY G+F CG L+HGV VVGYG
Sbjct: 237 DSYEDVPVNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYG 296
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+ G YW+V+NSWG SWGE+GY++M RN +S G+CGI M+ SYP K
Sbjct: 297 SEGGVDYWIVRNSWGGSWGESGYVKMQRNI-ASPTGLCGIAMEPSYPTK 344
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 280 bits (715), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 192/318 (60%), Gaps = 38/318 (11%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFA 111
D +M++R W+ ++ R Y +E R+ ++ NV+ I+ +N L+FKL N+FA
Sbjct: 30 DEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFA 89
Query: 112 DLSNEEFISTYLGY---------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
DL+NEEF S Y G+ KP R+ +V LP SVDWRK+GAVTP+KDQG
Sbjct: 90 DLTNEEFRSMYTGFKGNSVLSSRTKP-TSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGL 148
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFSAVAA+EG+ ++K GKL+SLSEQELVDCD N + GC GG M+ AF + IG
Sbjct: 149 CGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN--DGGCMGGLMDTAFNYTITIG 206
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------- 260
G+T+E +YPY+ N C +KTK A +I G+E +PA
Sbjct: 207 GLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGD 266
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQ YS GVF C L+HGVT VGYG +G KYW++KNSWG WGE GY+R+ ++
Sbjct: 267 IGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKDI 326
Query: 320 PSSNIGICGILMQASYPV 337
+ G CG+ M ASYP
Sbjct: 327 KPKH-GQCGLAMNASYPT 343
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 280 bits (715), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 149/307 (48%), Positives = 187/307 (60%), Gaps = 30/307 (9%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
E FE+W+ ++S+ Y S +E RF ++ N+ +ID N++ S+ L N+FADL++EEF
Sbjct: 49 ELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFK 108
Query: 120 STYLGYNKP-YNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
YLG KP ++ R PS + LP SVDWRK+GAV PVKDQGQCGSCWAFS VA
Sbjct: 109 GRYLGLAKPQFSRKRQPSANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWAFSTVA 168
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEGIN++ TG L SLSEQEL+DCD + N GCNGG M+ AF++I GG+ EDDYPY
Sbjct: 169 AVEGINQITTGNLSSLSEQELIDCDT-TFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYL 227
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVF 271
+ CQ K VTI+GYE +P + FQ Y GVF
Sbjct: 228 MEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKGGVF 287
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
+ CG L+HGV VGYG G Y +VKNSWG WGE G+IRM RN+ G+CGI
Sbjct: 288 NGKCGTDLDHGVAAVGYGSSKGSDYVIVKNSWGPRWGEKGFIRMKRNTGKPE-GLCGINK 346
Query: 332 QASYPVK 338
ASYP K
Sbjct: 347 MASYPTK 353
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 280 bits (715), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 198/340 (58%), Gaps = 34/340 (10%)
Query: 25 LRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFG 84
L + L+LFL++ E + + M ER E W+ + + Y E ++++
Sbjct: 6 LFHCTLALFLIFAF-----CAFEANARTLEDAPMRERHEQWMATHGKVYKHSYEKEQKYQ 60
Query: 85 IYSSNVQYIDYINSQNLS-FKLTDNKFADLSNEEF--ISTYLGY--NKPYNEPRWPSVQY 139
I+ NVQ I+ N+ +KL N FADL+NEEF I+ + G+ +K +
Sbjct: 61 IFMENVQRIEAFNNAGXKPYKLGINHFADLTNEEFKAINRFKGHVCSKRTRTTTFRYENV 120
Query: 140 LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDV 199
+PAS+DWR++GAVTP+KDQGQCG CWAFSAVAA EGI KL+TGKL+SLSEQELVDCD
Sbjct: 121 TAVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQELVDCDT 180
Query: 200 NSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA 259
+QGC GG M+ AF+FI + G+ TE YPY G + C +HA +I GYE +PA
Sbjct: 181 KGVDQGCEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGNHAGSIKGYEDVPA 240
Query: 260 R----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKY 296
+ FQ YS GVF CG L+HGVT VGYG D G KY
Sbjct: 241 NSESALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVGDDGTKY 300
Query: 297 WLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
WLVKNSWG WGE GYIRM R+ + G+CGI M ASYP
Sbjct: 301 WLVKNSWGVKWGEKGYIRMQRDVAAKE-GLCGIAMLASYP 339
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 193/310 (62%), Gaps = 34/310 (10%)
Query: 62 FENWLKQYSREYGSED---EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEF 118
+E WL + + + + + E +RRF ++ N+++ID NS+N S+K+ N+FADL+NEE+
Sbjct: 51 YEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDEHNSENRSYKVGLNRFADLTNEEY 110
Query: 119 ISTYLGYNKPYNEPRWP--SVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
S YLG R S +YL LP SVDWRKEGAV VKDQG CGSCWAFS
Sbjct: 111 RSMYLGARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGSCWAFST 170
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
+AAVEGINK+ TG L+SLSEQELVDCD S N+GCNGG M+ AF+FI GG+ +E+DYP
Sbjct: 171 IAAVEGINKIVTGDLISLSEQELVDCD-RSYNEGCNGGLMDYAFQFIINNGGIDSEEDYP 229
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHG 269
Y ++ C T + VTI YE +P FQ Y G
Sbjct: 230 YLARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQFYQSG 289
Query: 270 VFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
+F CG L+HGV VGYG ++G+ YW+V+NSWG SWGE+GYIRM RN ++ G CGI
Sbjct: 290 IFTGRCGTALDHGVAAVGYGTENGKDYWIVRNSWGKSWGESGYIRMERNIATA-TGKCGI 348
Query: 330 LMQASYPVKR 339
++ SYP+K+
Sbjct: 349 AIEPSYPIKK 358
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 147/319 (46%), Positives = 195/319 (61%), Gaps = 40/319 (12%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNK 109
D +M R E W+ Q+ R Y E + RF ++ +NV++I+ N+ N F L N+
Sbjct: 33 DELAMVARHEQWMVQHGRVYKDETDKAHRFLVFKANVKFIESFNAAAAAGNRKFWLGVNQ 92
Query: 110 FADLSNEEFISTYLGYNKPYNEP--------RWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
FADL+N+EF +T NK +N R+ ++ LP +VDWR +GAVTP+KDQG
Sbjct: 93 FADLTNDEFRATKT--NKGFNPNVVKVPTGFRYQNLSIDALPQTVDWRTKGAVTPIKDQG 150
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
QCG CWAFSAVAA EGI K+ TGKL SLSEQELVDCDV+ E+QGCNGG M+ AF+FI K
Sbjct: 151 QCGCCWAFSAVAATEGIVKISTGKLTSLSEQELVDCDVHGEDQGCNGGEMDDAFKFIIKN 210
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR--------------------- 260
GG+TTE +YPY ++ +C++ + A TI GYE +PA
Sbjct: 211 GGLTTESNYPYTAQDGQCKSG--SNGAATIKGYEDVPANDEAALMKAVASQPVSVAVDGG 268
Query: 261 -YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARN 318
FQ YS GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE G++RM ++
Sbjct: 269 DMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGFLRMEKD 328
Query: 319 SPSSNIGICGILMQASYPV 337
G+CG+ MQ SYP
Sbjct: 329 IADKK-GMCGLAMQPSYPT 346
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 190/315 (60%), Gaps = 35/315 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+S+ + +E W ++ + +E Q+RF ++ SNV ++ N + +KL NKFAD++N
Sbjct: 34 ESLWDLYERWRSHHTVSR-NLNEKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTN 92
Query: 116 EEFISTYLGYNKPYNE-----PRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
EF +TY G ++ PR + PASVDWRK+GAVT VKDQGQCGSC
Sbjct: 93 HEFKTTYAGTKVNHHRMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSC 152
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V AVEGIN++KT +LV LSEQEL+DCD N ENQGCNGG ME AFE+I + GGVTT
Sbjct: 153 WAFSTVVAVEGINQIKTNRLVPLSEQELIDCD-NQENQGCNGGLMEYAFEYIKQKGGVTT 211
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E YPY + C K V+I G+E +PA FQ
Sbjct: 212 ESYYPYTANDGSCDATKENVPTVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQ 271
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GVF CG +LNHGV +VGYG G YW+V+NSWG WGE G IRM RN S+
Sbjct: 272 FYSEGVFTGDCGKELNHGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGCIRMKRNV-SNK 330
Query: 324 IGICGILMQASYPVK 338
G+CGI M+ASYPVK
Sbjct: 331 EGLCGIAMEASYPVK 345
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 146/318 (45%), Positives = 195/318 (61%), Gaps = 32/318 (10%)
Query: 50 PQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNK 109
PQ+ D ++M +E WL + + Y + E +RRF I+ N++++D N+ S+++ N+
Sbjct: 36 PQRTDAEAMA-IYEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNAVAGSYRVGLNR 94
Query: 110 FADLSNEEFISTYLGYNKPYNE-------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
FADL+NEE+ S +LG N E R+ LP SVDWR++GAV+PVKDQGQ
Sbjct: 95 FADLTNEEYRSMFLGGNMEMKERSASTKSDRYAFRAGDKLPGSVDWREKGAVSPVKDQGQ 154
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS ++AVEGIN++ TG+L+SLSEQELVDCD S N GCNGG M+ F+FI G
Sbjct: 155 CGSCWAFSTISAVEGINQIVTGELISLSEQELVDCD-KSYNMGCNGGLMDYGFQFIINNG 213
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------AR 260
G+ TE+DYPYR + C + V+I GYE +P
Sbjct: 214 GIDTEEDYPYRAVDGTCDQFRKNARVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAGG 273
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AFQLY GVF +CG L+HGV VGYG ++G YW V+NSWG WGE GYI++ RN
Sbjct: 274 RAFQLYESGVFTGHCGTNLDHGVVAVGYGTENGVDYWTVRNSWGPKWGENGYIKLERNIN 333
Query: 321 SSNIGICGILMQASYPVK 338
+++ G CGI ASYP K
Sbjct: 334 ATS-GKCGIASMASYPTK 350
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 149/345 (43%), Positives = 203/345 (58%), Gaps = 43/345 (12%)
Query: 26 RNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
+ +L+LFL +GI S+ P+K ++ ER ENW+ +Y + Y E ++RF I
Sbjct: 7 KQHMLALFLFLAVGI-----SQVMPRKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQI 61
Query: 86 YSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPY---------NEPRWP 135
+ NV++I+ N+ N +KL N ADL+ EEF + G + Y N ++
Sbjct: 62 FKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYE 121
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQG-QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+V +P ++DWR +GAVTP+KDQG QCGSCWAFS +AA EGI+++ TG LVSLSEQEL
Sbjct: 122 NVT--DIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLSEQEL 179
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDCD S + GC GG+ME FEFI K GG+T+E +YPY+G + C T I GY
Sbjct: 180 VDCD--SVDDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIKGY 237
Query: 255 EAIPA----------------------RYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH 292
E +P+ F YS G+++ CG L+HGVT VGYG ++
Sbjct: 238 EIVPSYSEEALQKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGTEN 297
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G YW+VKNSWGT WGE GYIRM R + + GICGI + +SYP
Sbjct: 298 GTDYWIVKNSWGTQWGEKGYIRMHRGIAAKH-GICGIALDSSYPT 341
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 195/315 (61%), Gaps = 38/315 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFADLSN 115
M++R W+ ++ R Y E R+ ++ +NV+ I+++NS +FKL N+FADL+N
Sbjct: 34 MQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTN 93
Query: 116 EEFISTYLGY---------NKPYNEP-RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
+EF S Y G+ ++ P R+ +V LP SVDWRK+GAVTP+K+QG CG
Sbjct: 94 DEFCSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGC 153
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFSAVAA+EG ++K GKL+SLSEQ+LVDCD N + GC GG M+ AFE I GG+T
Sbjct: 154 CWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCEGGLMDTAFEHIKATGGLT 211
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAF 263
TE DYPY+G++ C + KT A +ITGYE +P + F
Sbjct: 212 TESDYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDF 271
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
Q YS GVF C L+H VT +GYGE +G KYW++KNSWGT WGE+GY+R+ ++
Sbjct: 272 QFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDK 331
Query: 323 NIGICGILMQASYPV 337
G+CG+ M+ASYP
Sbjct: 332 Q-GLCGLAMKASYPT 345
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 279 bits (713), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 150/312 (48%), Positives = 191/312 (61%), Gaps = 35/312 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFI 119
+E WL + Y E +RRF I+ N++YID N N S+ L +FADL+NEE+
Sbjct: 38 YEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRFADLTNEEYR 97
Query: 120 STYLGYN----KPYNEPRWP------SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
STYLG +P R P S LP VDWR++GAV P+KDQG CGSCWAF
Sbjct: 98 STYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAVAPIKDQGGCGSCWAF 157
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
S VAAVEGIN++ TG L+ LSEQELVDCD + N+GCNGG M+ AF+FI GG+ TE+D
Sbjct: 158 STVAAVEGINQIVTGDLIVLSEQELVDCDT-AYNEGCNGGLMDYAFQFIISNGGIDTEED 216
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAI---------------PARYA-------FQLYS 267
YPY+ ++ C ++ V+I YE + P A FQLY
Sbjct: 217 YPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPVSVAIEGGGRSFQLYK 276
Query: 268 HGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
G+FD CG L+HGV VGYG + G+ YW+V+NSWG SWGEAGYIRM RN PSS+ G C
Sbjct: 277 SGIFDGRCGIDLDHGVVAVGYGTESGKDYWIVRNSWGKSWGEAGYIRMERNLPSSSSGKC 336
Query: 328 GILMQASYPVKR 339
GI ++ SYP+K+
Sbjct: 337 GIAIEPSYPIKK 348
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 148/319 (46%), Positives = 191/319 (59%), Gaps = 39/319 (12%)
Query: 56 QSMEERFENWLKQYS---REYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFAD 112
+S+ +E W ++ R G+E E RRF ++ NV+YI N ++ F+L NKFAD
Sbjct: 34 ESLRGLYETWRSHHTVSRRGLGAEAE-ARRFNVFKENVRYIHEANKKDRPFRLALNKFAD 92
Query: 113 LSNEEFISTYLGYNKPYNEP----------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
++ +EF TY G ++ + LPA+VDWR++GAVTP+KDQGQ
Sbjct: 93 MTTDEFRRTYAGSRVRHHRSLSGGRRQGGGSFMYADAENLPAAVDWRQKGAVTPIKDQGQ 152
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS + AVEGINK++TG+LVSLSEQEL+DC++ EN GCNGG M+ AF+FI + G
Sbjct: 153 CGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNI-GENDGCNGGLMDVAFQFIQQNG 211
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------- 260
G+TTE YPY+G+ + C K H V+I GYE +PA
Sbjct: 212 GITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAIDASG 271
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQ YS GVF G L+HGV VGYG G KYW+VKNSWG WGE GYIRM R
Sbjct: 272 NDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGV 331
Query: 320 PSSNIGICGILMQASYPVK 338
+ G+CGI M+ASYP K
Sbjct: 332 KQAE-GLCGIAMEASYPTK 349
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 190/315 (60%), Gaps = 35/315 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+S+ + +E W ++ + +E Q+RF ++ SNV ++ N + +KL NKFAD++N
Sbjct: 34 ESLWDLYERWRSHHTVSR-NLNEKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTN 92
Query: 116 EEFISTYLGYNKPYNE-----PRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
EF +TY G ++ PR + PASVDWRK+GAVT VKDQGQCGSC
Sbjct: 93 HEFKTTYAGSKVNHHRMFRGTPRVSGTFMYENFTKAPASVDWRKKGAVTDVKDQGQCGSC 152
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V AVEGIN++KT +LV LSEQEL+DCD N ENQGCNGG ME AFE+I + GGVTT
Sbjct: 153 WAFSTVVAVEGINQIKTNRLVPLSEQELIDCD-NQENQGCNGGLMEYAFEYIKQKGGVTT 211
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E YPY + C K V+I G+E +PA FQ
Sbjct: 212 ESYYPYTANDGSCDATKENVPTVSIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQ 271
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GVF CG +LNHGV +VGYG G YW+V+NSWG WGE G IRM RN S+
Sbjct: 272 FYSEGVFTGDCGKELNHGVAIVGYGTTVDGTNYWIVRNSWGAEWGEQGCIRMKRNV-SNK 330
Query: 324 IGICGILMQASYPVK 338
G+CGI M+ASYPVK
Sbjct: 331 EGLCGIAMEASYPVK 345
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 191/313 (61%), Gaps = 32/313 (10%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFADLSN 115
+M R E W+ Q+ R Y E RR ++ +NV +I+ N+ + + L N+FADL++
Sbjct: 39 AMAARHERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTS 98
Query: 116 EEFISTYL---GYNKPYNEPR------WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
EEF +T G++ P N R + +V LPASVDWR +GAVT +KDQGQCG C
Sbjct: 99 EEFKATMTNSKGFSTPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCC 158
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSAVAA+EG KL TGKL+SLSEQELVDCDV+ +QGC GG ++ AF+FI GG+T
Sbjct: 159 WAFSAVAAMEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTA 218
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR--------------------YAFQLY 266
E +YPY ++ RC+T A +I GYE +PA FQ Y
Sbjct: 219 EANYPYTAEDGRCKTTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDASKFQFY 278
Query: 267 SHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
GV CG L+HGVTV+GYG G KYWLVKNSWGT+WGEAGY+RM ++ G
Sbjct: 279 GGGVMAGECGTSLDHGVTVIGYGAASDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKR-G 337
Query: 326 ICGILMQASYPVK 338
+CG+ MQ SYP +
Sbjct: 338 MCGLAMQPSYPTE 350
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 188/307 (61%), Gaps = 32/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E+WL ++ + Y + E RRF I+ N+++ID NS + ++KL NKFADL+NEE+ T
Sbjct: 52 YESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEEYRMT 111
Query: 122 YLGYNKPYNEPRWPSVQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
Y G ++ + ++ LP VDWR++GAVT VKDQG CGSCWAFS
Sbjct: 112 YTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTG 171
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
+VEG+NK+ TG L+S+SEQELV+CD S NQGCNGG M+ AFEFI K GG+ TE+DYPY
Sbjct: 172 SVEGVNKIVTGDLISVSEQELVNCDT-SYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYT 230
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
GK+ +C +K VTI YE +P FQ Y+ G+F
Sbjct: 231 GKDGKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIF 290
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG L+HGV GYG + G+ YWLVKNSWG WGE GY++M RN + G CGI M
Sbjct: 291 TGSCGTALDHGVLAAGYGTEDGKDYWLVKNSWGAEWGEGGYLKMERNIADKS-GKCGIAM 349
Query: 332 QASYPVK 338
+ASYP+K
Sbjct: 350 EASYPIK 356
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 148/346 (42%), Positives = 194/346 (56%), Gaps = 32/346 (9%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
M +L FL + L + A P + +E WL ++ + Y E +RF
Sbjct: 1 MASMTILPFFLFFSLITFSLALDIQLPTGRSNDEVMTMYEEWLVKHQKVYNGLREKDQRF 60
Query: 84 GIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP---------RW 134
I+ N+ +ID N+QN ++ + NKFAD++NEE+ YLG R+
Sbjct: 61 QIFKDNLNFIDEHNAQNYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRY 120
Query: 135 PSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
LP VDWR +GA+T +KDQG CGSCWAFS +A VE INK+ TGKLVSLSEQEL
Sbjct: 121 AYNSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQEL 180
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDCD + N+GCNGG M+ AFEFI GG+ T+ YPY+G RC + K V+I GY
Sbjct: 181 VDCD-RAFNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDGY 239
Query: 255 EAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH 292
E +P+ A QLY GVF CG L+H V +VGYG ++
Sbjct: 240 EDVPSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYGSEN 299
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G YWLV+NSWGT+WGE GY +M RN ++ G CGI ++ASYPVK
Sbjct: 300 GLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPVK 345
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 150/316 (47%), Positives = 198/316 (62%), Gaps = 46/316 (14%)
Query: 62 FENWLKQY--SREYGSEDEWQRRFGIYSSNVQYI-DYINSQNLSFKLTDNKFADLSNEEF 118
+E W + SR+ DE Q+RF ++ N +YI D+ +++ +KL NKFADL+N EF
Sbjct: 38 YERWRSHHTVSRDL---DEKQKRFNVFKENPRYIHDFNKRKDIPYKLRLNKFADLTNHEF 94
Query: 119 ISTYLGYNKPY-------------NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
STY G + N + S+ LPAS+DWR++GAVT VKDQGQCGS
Sbjct: 95 RSTYAGSRINHHRSLRGSRRGGATNSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGS 154
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFS VAAVEGIN++KT KL+SLSEQEL+DCD + EN GCNGG M+ AF+FI K GG++
Sbjct: 155 CWAFSTVAAVEGINQIKTKKLLSLSEQELIDCDTD-ENNGCNGGLMDYAFDFIKKNGGIS 213
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAF 263
+E +YPY ++ C T+K K H V+I G+E +PA Y F
Sbjct: 214 SEAEYPYAAEDSYCATEK-KSHVVSIDGHEDVPANDEDSLLKAVANQPVSIAIEASGYDF 272
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
Q YS GVF G +L+HGV +VGYG+ G KYW+V+NSWG WGE GYIR++ S S
Sbjct: 273 QFYSEGVFTGRSGTELDHGVAIVGYGKTQQGTKYWIVRNSWGAEWGEKGYIRISAASDSK 332
Query: 323 NIGICGILMQASYPVK 338
+CG+ M+ASYP+K
Sbjct: 333 R--LCGLAMEASYPIK 346
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 146/307 (47%), Positives = 189/307 (61%), Gaps = 32/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFIS 120
FE+WL + + Y + E ++RF I+ +N++YID N ++ FKL NKFADL+NEE+ S
Sbjct: 45 FESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEYRS 104
Query: 121 TYLGYNK-------PYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
Y G R+ ++ LP SVDWR+ GAV VKDQG CGSCWAFS ++
Sbjct: 105 KYTGIKSKDLRKKVSAKSGRYATLSGESLPESVDWRESGAVATVKDQGSCGSCWAFSTIS 164
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEGIN++ TGKL++LSEQELVDCD S N+GCNGG M+ AFEFI GG+ T+ DYPY
Sbjct: 165 AVEGINQIATGKLITLSEQELVDCD-RSYNEGCNGGLMDYAFEFIINNGGIDTDVDYPYT 223
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPA----------------------RYAFQLYSHGVF 271
G++ +C + VTI YE +PA FQ Y G+F
Sbjct: 224 GRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYDSGIF 283
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG L+HGV VVGYG ++G+ YW+V+NSWG WGE GY+RM R SS GICGI +
Sbjct: 284 TGKCGIALDHGVVVVGYGTENGKDYWIVRNSWGADWGENGYLRMER-GISSKTGICGIAI 342
Query: 332 QASYPVK 338
+ SYPVK
Sbjct: 343 EPSYPVK 349
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 155/336 (46%), Positives = 194/336 (57%), Gaps = 54/336 (16%)
Query: 30 LSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+ L LL+VL AW S+ + SM ER E+W+ QY REY DE +R+ I+
Sbjct: 10 ICLALLFVLA----AWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKD 65
Query: 89 NVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPA 144
NV I+ N + + S+KL+ N+FADL+NEEF ++ + S +Y +P+
Sbjct: 66 NVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEATSFKYENVTAVPS 125
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+VDWRK+GAVTP+KDQGQCGSCWAFSAVAA+EGI +L TGKL+SLSEQELVDCD + E+Q
Sbjct: 126 TVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---- 260
GC +YPY G + C K H A I GYE +PA
Sbjct: 186 GCT---------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 224
Query: 261 ------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
FQ YS GVF CG +L+HGV+ VGYG D G KYWLVKN
Sbjct: 225 LQKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDDGMKYWLVKN 284
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWGT WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 285 SWGTGWGEEGYIRMQRDVTAKE-GLCGIAMQASYPT 319
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 156/349 (44%), Positives = 209/349 (59%), Gaps = 40/349 (11%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQK--YDPQSMEERFENWLKQYSREYGSEDEWQR 81
M + +++L L+ V G+ A S + +K +S+ + +E W + Y +E +
Sbjct: 3 MEKVILVALSLVLVFGL---AESFDFDEKDLASEESLWDLYERW-RSYHTVSRDLEEKNK 58
Query: 82 RFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYN-KPYNEPRWPSVQYL 140
RF ++ N +++ +N + +KL NKFAD++N EF S+Y G K Y R
Sbjct: 59 RFNVFKENTKHVHKVNQMDKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTG 118
Query: 141 G--------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
G LP SVDWRK+GAVT +KDQG+CGSCWAFS V VEGIN++KT +L+SLSEQ
Sbjct: 119 GFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQ 178
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
+L+DCD S++ GCNGG ME AFEFI K GG+TTE++YPY+ K++RC K VTI
Sbjct: 179 QLIDCD-RSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTID 237
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
G+E++P Q YS GVFD CG +L+HGV +VGYG
Sbjct: 238 GHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGT 297
Query: 291 D-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G KYW+VKNSWG WGE GYIRMAR ++ G CGI M+ASYPVK
Sbjct: 298 TLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAE-GQCGIAMEASYPVK 345
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 150/318 (47%), Positives = 192/318 (60%), Gaps = 37/318 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFA 111
+ M +E WL ++ R Y + E +RRF I+ NV +ID N+ + SF+L N+FA
Sbjct: 44 EEMRILYEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLGLNRFA 103
Query: 112 DLSNEEFISTYLGYNKPYNEPRWPSV-----QYLG---LPASVDWRKEGAVTPVKDQGQC 163
D++NEE+ + YLG +P R V +Y LP SVDWR +GAV VKDQG C
Sbjct: 104 DMTNEEYRAVYLG-TRPAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVKDQGSC 162
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFS VAAVEGINK+ TG L+SLSEQELVDCD N NQGCNGG M+ FEFI GG
Sbjct: 163 GSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCD-NGYNQGCNGGLMDYGFEFIINNGG 221
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------Y 261
+ TE+DYPY ++ +C + V+I GYE +P
Sbjct: 222 IDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGR 281
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQLY G+F CG L+HGV VGYG ++G+ YW+V+NSWG WGE+GYIRM RN +
Sbjct: 282 EFQLYHSGIFTGRCGTDLDHGVVAVGYGTENGKDYWIVRNSWGGDWGESGYIRMERNVNT 341
Query: 322 SNIGICGILMQASYPVKR 339
S G CGI ++ SYP K+
Sbjct: 342 S-TGKCGIAIEPSYPTKK 358
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 152/321 (47%), Positives = 191/321 (59%), Gaps = 32/321 (9%)
Query: 48 GYPQKYDPQSMEE--RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFK 104
G K D ++ EE FE WL + + Y E +RF I+ N++++ NS N S++
Sbjct: 21 GVTAKADHRNPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYE 80
Query: 105 LTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKD 159
L +FADL+NEEF + YL S +YL LP VDWR +GAV PVKD
Sbjct: 81 LGLTRFADLTNEEFRAIYLRSKMERTRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPVKD 140
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QG CGSCWAFSA+ AVEGIN++KTG+LVSLSEQELVDCD S N GC GG M+ AF+FI
Sbjct: 141 QGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDT-SYNNGCGGGLMDYAFQFII 199
Query: 220 KIGGVTTEDDYPYRGKNDR-CQTDKTKHHAVTITGYEAIPAR------------------ 260
GG+ TE+DYPY +D C TDK VTI GYE +P
Sbjct: 200 SNGGIDTEEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPENENSLKKALANQPISVAIE 259
Query: 261 ---YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
FQLY GVF CG L+HGV VGYG G+ YW+++NSWG++WGE+GYI++ R
Sbjct: 260 AGGRGFQLYKSGVFTGTCGTALDHGVVAVGYGTSEGQDYWIIRNSWGSNWGESGYIKLQR 319
Query: 318 NSPSSNIGICGILMQASYPVK 338
N S+ G CG+ M ASYP K
Sbjct: 320 NIKDSS-GKCGVAMMASYPTK 339
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 156/349 (44%), Positives = 209/349 (59%), Gaps = 40/349 (11%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQK--YDPQSMEERFENWLKQYSREYGSEDEWQR 81
M + +++L L+ V G+ A S + +K +S+ + +E W + Y +E +
Sbjct: 1 MEKVILVALSLVLVFGL---AESFDFDEKDLASEESLWDLYERW-RSYHTVSRDLEEKNK 56
Query: 82 RFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYN-KPYNEPRWPSVQYL 140
RF ++ N +++ +N + +KL NKFAD++N EF S+Y G K Y R
Sbjct: 57 RFNVFKENTKHVHKVNQMDKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTG 116
Query: 141 G--------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
G LP SVDWRK+GAVT +KDQG+CGSCWAFS V VEGIN++KT +L+SLSEQ
Sbjct: 117 GFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQ 176
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
+L+DCD S++ GCNGG ME AFEFI K GG+TTE++YPY+ K++RC K VTI
Sbjct: 177 QLIDCD-RSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTID 235
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
G+E++P Q YS GVFD CG +L+HGV +VGYG
Sbjct: 236 GHESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGT 295
Query: 291 D-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G KYW+VKNSWG WGE GYIRMAR ++ G CGI M+ASYPVK
Sbjct: 296 TLDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAE-GQCGIAMEASYPVK 343
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 141/315 (44%), Positives = 192/315 (60%), Gaps = 38/315 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFADLSN 115
M++R + W+ ++ R Y E R+ ++ NV+ I+ +N+ +FKL N+FADL+N
Sbjct: 35 MQKRHDEWMAKHGRVYADMKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFADLTN 94
Query: 116 EEFISTYLGYN----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
+EF S Y GY + R+ +V LP SVDWRK+GAVTP+K+QG CG
Sbjct: 95 DEFRSMYTGYKGGSVLSSQSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTPIKNQGTCGC 154
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFSAVAA+EG K+K GKL+SLSEQ+LVDCD N + GC+GG M+ AFE I GG+T
Sbjct: 155 CWAFSAVAAIEGATKIKKGKLISLSEQQLVDCDTN--DFGCSGGLMDTAFEHIMATGGLT 212
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAF 263
TE +YPY+GK+ C+ TK A +ITGYE +P + F
Sbjct: 213 TESNYPYKGKDATCKIKNTKPTATSITGYEDVPVNDEKALMKAVAHQPVSIGIEGGGFDF 272
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
Q Y GVF C L+H VT VGYG+ +G KYW++KNSWGT WGE+GY+R+ ++
Sbjct: 273 QFYGSGVFTGECTTYLDHAVTAVGYGQSSNGSKYWIIKNSWGTKWGESGYMRIKKDVKDK 332
Query: 323 NIGICGILMQASYPV 337
G+CG+ M+ASYP
Sbjct: 333 K-GLCGLAMKASYPT 346
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 188/317 (59%), Gaps = 39/317 (12%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S+ + +E W + SR G E +RF ++ +N+ ++ N + +KL NKFAD+
Sbjct: 33 ESLWDLYERWRSHHTVSRSLG---EKHKRFNVFKANLMHVHNTNKMDKPYKLKLNKFADM 89
Query: 114 SNEEFISTYLGYN---------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
+N EF STY G P+ + + + +P SVDWRK+GAVT VKDQGQCG
Sbjct: 90 TNHEFRSTYAGSKVNHHRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCG 149
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS V AVEGIN++KT KLV+LSEQELVDCD ENQGCNGG ME AFEFI + GG+
Sbjct: 150 SCWAFSTVVAVEGINQIKTNKLVALSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGI 208
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE +YPY+ + C K AV+I G+E +PA
Sbjct: 209 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 268
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF C LNHGV +VGYG G YW+V+NSWG WGE GYIRM RN S
Sbjct: 269 FQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRN-IS 327
Query: 322 SNIGICGILMQASYPVK 338
G+CGI M SYP+K
Sbjct: 328 KKEGLCGIAMLPSYPIK 344
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 160/342 (46%), Positives = 204/342 (59%), Gaps = 41/342 (11%)
Query: 36 WVLGIPAGAWSEGYPQKY--DPQSMEERFENW-LKQYSREYGSEDEWQRRFGIYSSNVQY 92
W L A S G+ + +S+ ++ W L+ S DE RRF I+ NV++
Sbjct: 17 WTLSANALDSSPGFTDEELESDESLRGLYDKWALQHRSTRSLDSDEHARRFEIFKENVKH 76
Query: 93 IDYINSQNLSFKLTDNKFADLSNEEF----ISTYLGYNKPYNEPRW---PSVQYLG---L 142
ID +N ++ +KL NKFADLSNEEF ++T + +K R S Y L
Sbjct: 77 IDSVNKKDGPYKLGLNKFADLSNEEFKAMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRL 136
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
PAS+DWRK+GAVTPVK+QGQCGSCWAFS +A+VEGIN +KTGKLVSLSEQ+LVDC + E
Sbjct: 137 PASIDWRKKGAVTPVKNQGQCGSCWAFSTIASVEGINYIKTGKLVSLSEQQLVDC--SKE 194
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK--TKHHAVTITGYEAIPAR 260
N GCNGG M+ AF++I GG+ TED+YPY + C T K +K A I G+E +PA
Sbjct: 195 NAGCNGGLMDNAFQYIIDNGGIVTEDEYPYTAEAGECSTTKIESKSIATIIDGFEDVPAN 254
Query: 261 ----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYW 297
+ FQ YS GVF CG +L+HGV VVGYG+ G YW
Sbjct: 255 NEGALKKAVAHQPVSIAIEASGHDFQFYSTGVFTGKCGTELDHGVVVVGYGKSPEGINYW 314
Query: 298 LVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
+V+NSWG WGE GYIRM R ++ G CGI MQASYP K+
Sbjct: 315 IVRNSWGPEWGEQGYIRMQRGIEATE-GKCGISMQASYPTKK 355
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 188/317 (59%), Gaps = 39/317 (12%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S+ + +E W + SR G E +RF ++ +N+ ++ N + +KL NKFAD+
Sbjct: 34 ESLWDLYERWRSHHTVSRSLG---EKHKRFNVFKANLMHVHNTNKMDKPYKLKLNKFADM 90
Query: 114 SNEEFISTYLGYN---------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
+N EF STY G P+ + + + +P SVDWRK+GAVT VKDQGQCG
Sbjct: 91 TNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCG 150
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS V AVEGIN++KT KLV+LSEQELVDCD ENQGCNGG ME AFEFI + GG+
Sbjct: 151 SCWAFSTVVAVEGINQIKTNKLVALSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGI 209
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE +YPY+ + C K AV+I G+E +PA
Sbjct: 210 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 269
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF C LNHGV +VGYG G YW+V+NSWG WGE GYIRM RN S
Sbjct: 270 FQFYSEGVFTGDCSTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEHGYIRMQRNI-S 328
Query: 322 SNIGICGILMQASYPVK 338
G+CGI M SYP+K
Sbjct: 329 KKEGLCGIAMLPSYPIK 345
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 153/344 (44%), Positives = 198/344 (57%), Gaps = 47/344 (13%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL LL+ G A + + SM ER E W+ +Y++ Y E +RRF I+ N
Sbjct: 10 ISLALLFCSGFLAFQVT---CRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66
Query: 90 VQYID-YINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNE---------PRWPSVQY 139
V YI+ + N+ N + L N+FADL+NEEFI+ P N R + +Y
Sbjct: 67 VNYIEAFNNAANKPYTLGINQFADLTNEEFIA-------PRNRFKGHMCSSITRTTTFKY 119
Query: 140 ---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
+P++VDWR++GAVTP+KDQGQCG CWAFSAVAA EGI+ L GKL+SLSEQE+VD
Sbjct: 120 ENVTAIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVD 179
Query: 197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
CD E+QGC GG+M+ AF+FI + G+ E +YPY+ + +C +H TITGYE
Sbjct: 180 CDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYED 239
Query: 257 IPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHG 293
+P FQ Y GVF CG +L+HGVT VGYG G
Sbjct: 240 VPVNNEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSADG 299
Query: 294 EKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+YWLVKNSWGT WGE GYIRM R + G+ GI M ASYP
Sbjct: 300 TEYWLVKNSWGTEWGEEGYIRMQRGVKAEE-GLXGIAMMASYPT 342
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 159/349 (45%), Positives = 202/349 (57%), Gaps = 40/349 (11%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQK--YDPQSMEERFENWLKQYSREYGSEDEWQR 81
M + ++L L VLGI S + +K +S+ + +E W ++ S DE +
Sbjct: 3 MKKFLFVALSLALVLGITE---SLDFHEKDLESEESLWDLYERWRSHHTVST-SLDEKHK 58
Query: 82 RFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG 141
RF ++ NV ++ N +KL NKFAD++N EF S Y G ++ + + G
Sbjct: 59 RFNVFKENVMHVHKTNKMGKPYKLKLNKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGNG 118
Query: 142 ---------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
+P SVDWRK+GAVT VKDQGQCGSCWAFS + AVEGIN +KT +LVSLSEQ
Sbjct: 119 SFMYGKVEKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVSLSEQ 178
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
ELVDCD +ENQGCNGG ME AFEFI K G+TTE YPY+ ++ C K + AV+I
Sbjct: 179 ELVDCDT-TENQGCNGGLMEYAFEFIKKKRGITTESTYPYKAEDGHCDAAKENNPAVSID 237
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
GYE +P FQ YS GVF CG +L+HGV VVGYG
Sbjct: 238 GYEKVPENDEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECGTELDHGVAVVGYGT 297
Query: 291 D-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G KYW+V+NSWG WGE GYIRM R S G+CGI M+ASYP+K
Sbjct: 298 TLDGTKYWIVRNSWGPEWGEKGYIRMQR-GISDKEGLCGIAMEASYPIK 345
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 155/347 (44%), Positives = 202/347 (58%), Gaps = 45/347 (12%)
Query: 26 RNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
+ +L+LFL +GI S+ P+K ++ ER ENW+ +Y + Y E ++RF I
Sbjct: 7 KQHMLALFLFLAVGI-----SQVMPRKLHQTALRERHENWMAEYGKIYKDAAEKEKRFQI 61
Query: 86 YSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPY---------NEPRWP 135
+ NV++I+ N+ N +KL N ADL+ EEF + G + Y N ++
Sbjct: 62 FKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYE 121
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQG-QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+V +P ++DWR +GAVTP+KDQG QCGSCWAFS VAA EGI ++ TG L+SLSEQEL
Sbjct: 122 NV--TDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQEL 179
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDCD S + GC+GG ME FEFI K GG+++E +YPY + C K A I GY
Sbjct: 180 VDCD--SVDHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGY 237
Query: 255 EAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG--E 290
E +PA FQ YS GVF CG QL+HGVTVVGYG +
Sbjct: 238 ETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTD 297
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
D +YW+VKNSWGT WGE GYIRM R + G+CGI M ASYP
Sbjct: 298 DGTHEYWIVKNSWGTQWGEEGYIRMQRGIDALE-GLCGIAMDASYPT 343
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 277 bits (709), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 152/348 (43%), Positives = 197/348 (56%), Gaps = 41/348 (11%)
Query: 21 MRMMLRNAVLSLFLL---WVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSED 77
MR +N L LFL+ W + + SE ER E W+ QY + Y
Sbjct: 1 MRSFSQNHYLILFLILTVWTFHVMSRRLSE--------VCTSERHEKWMAQYGKLYTDAA 52
Query: 78 EWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNK------PYN 130
E ++RF I+ +NVQ+I+ N+ + F L+ N+FADL NEEF ++ + K
Sbjct: 53 EKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETAT 112
Query: 131 EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
E + +P ++DWRK GAVTP+KDQG CGSCWAFS VAA+EGI+++ TGKLVSLS
Sbjct: 113 ETSFRYESITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLS 172
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQELVDC V +++GCN GY E+AFEF+ K GG+ +E YPY+ N C K
Sbjct: 173 EQELVDC-VKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQ 231
Query: 251 ITGYEAIPARY--------------------AFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
I GYE +P+ A Q YS G+F CG NH VTV+GYG+
Sbjct: 232 IKGYENVPSNSEKALLKAVANQPVSVYIDAGALQFYSSGIFTGKCGTAPNHAVTVIGYGK 291
Query: 291 DH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G KYWLVKNSWGT WGE GYI+M R+ + G+CGI ASYP
Sbjct: 292 ARGGAKYWLVKNSWGTKWGEKGYIKMKRDIRAKE-GLCGIATNASYPT 338
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 196/315 (62%), Gaps = 35/315 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+S+ +E W ++ S E +RF ++ N+++I +N ++ +KL NKFAD++N
Sbjct: 34 ESLWNLYERWRSHHTVSR-SLTEKNQRFNVFKENLKHIHKVNQKDRPYKLRLNKFADMTN 92
Query: 116 EEFISTYLG--------YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
EF+ Y G ++ + + LP+S+DWRK+GAVT VKDQG+CGSCW
Sbjct: 93 HEFLQHYGGSKVSHYRMFHGSRRQTGFAHENTSNLPSSIDWRKQGAVTGVKDQGKCGSCW 152
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS+VAAVEGINK+KTG+L+SLSEQELVDC NS N GC+GG ME+AF FI K GG+TTE
Sbjct: 153 AFSSVAAVEGINKIKTGELISLSEQELVDC--NSVNHGCDGGLMEQAFSFIEKTGGLTTE 210
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
++YPYR K+ C + K VTI GYE +P FQ
Sbjct: 211 NNYPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQDFQF 270
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GV+ CG +LNHGV +VGYG G KYW+VKNSWG+ WGE G+IRM R +
Sbjct: 271 YSEGVYTGDCGTELNHGVALVGYGATQDGTKYWIVKNSWGSEWGENGFIRMQRENDVEE- 329
Query: 325 GICGILMQASYPVKR 339
G+CGI ++ASYP+K+
Sbjct: 330 GLCGITLEASYPIKQ 344
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 277 bits (708), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 160/355 (45%), Positives = 201/355 (56%), Gaps = 42/355 (11%)
Query: 17 IAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSE 76
+A M +M+ LFL + L A Y + +E WL ++ + Y
Sbjct: 1 MASIMTLMISTL---LFLSFTLSC---AIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGL 54
Query: 77 DEWQRRFGIYSSNVQYI-DYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP 135
E +RF ++ N+ +I ++ N+QN ++KL NKFAD++NEE+ Y G K + R
Sbjct: 55 GEKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFG-TKSDAKRRLM 113
Query: 136 SVQYLG----------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGK 185
+ G LP VDWR +GAV P+KDQG CGSCWAFS VA VE INK+ TGK
Sbjct: 114 KTKSTGHRYAYSAGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGK 173
Query: 186 LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK 245
VSLSEQELVDCD + NQGCNGG M+ AFEFI + GG+ T+ DYPYRG + C K
Sbjct: 174 FVSLSEQELVDCD-RAYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKN 232
Query: 246 HHAVTITGYEAIP-----------AR-----------YAFQLYSHGVFDEYCGHQLNHGV 283
AV I GYE +P AR A QLY GVF CG L+HGV
Sbjct: 233 AKAVNIDGYEDVPPYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGV 292
Query: 284 TVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VVGYG ++G YWLV+NSWGT WGE GY +M RN + G CGI M+ASYPVK
Sbjct: 293 VVVGYGSENGVDYWLVRNSWGTGWGEDGYFKMQRNVRTPT-GKCGITMEASYPVK 346
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 277 bits (708), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 148/313 (47%), Positives = 187/313 (59%), Gaps = 30/313 (9%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFA 111
D + E+ E W+ Y + Y E + R I+ NV YI+ N+ N +KL N+FA
Sbjct: 33 DDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLGINQFA 92
Query: 112 DLSNEEFISTYLGYN----KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
D++NEEFI++ + + + +P++VDWRK+GAVTPVK+QGQCG CW
Sbjct: 93 DITNEEFIASRNKFKGHMCSSITKTSTFKYENASVPSTVDWRKKGAVTPVKNQGQCGCCW 152
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA EGI+KL TGKLVSLSEQELVDCD +QGC GG M+ AF+FI + G+ TE
Sbjct: 153 AFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLHTE 212
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQL 265
YPY+G + C ++T A TI GYE +PA FQ
Sbjct: 213 AQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISVAIDASGSDFQF 272
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GVF CG QL+HGVT VGYG + G KYWLVKNSWG WGE GYIRM R+ ++
Sbjct: 273 YKSGVFTGSCGTQLDHGVTAVGYGISNDGTKYWLVKNSWGNDWGEEGYIRMQRSVDAAQ- 331
Query: 325 GICGILMQASYPV 337
G+CGI M ASYP
Sbjct: 332 GLCGIAMMASYPT 344
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 277 bits (708), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 201/349 (57%), Gaps = 39/349 (11%)
Query: 28 AVLSLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIY 86
A+ +FLL V LG + ++ +M ER E W+ Q+ R Y E RRF +
Sbjct: 2 AIPKVFLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAF 61
Query: 87 SSNVQYIDYINS--QNLSFKLTDNKFADLSNEEFISTYL--GYNK----------PYNEP 132
+NV +I+ N+ F L N+F DL+N+EF +T G+ K P
Sbjct: 62 RNNVVFIESFNAAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTF 121
Query: 133 RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
R+ +V LPA+VDWR +GAVTP+K+QGQCG CWAFSAVAA EGI +L TGKLV LSEQ
Sbjct: 122 RYSNVSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQ 181
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
ELVDCD N + GC GG M+ AFEFI K GG+T+E +YPY ++ +C+ T + TI
Sbjct: 182 ELVDCDANGADHGCEGGEMDDAFEFIIKNGGLTSETNYPYTAQDGQCKAKNTINSVATIK 241
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG- 289
GYE +PA FQ Y+ GV CG L+HG+ VGYG
Sbjct: 242 GYEDVPANDEASLMKAVAAQPVSVAVDGGDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGA 301
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
D G K+WL+KNSWGT+WGE GYIRM ++ + G+CG+ MQ SYP +
Sbjct: 302 ADDGTKFWLMKNSWGTTWGEDGYIRMEKDVADAG-GMCGLAMQPSYPTE 349
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 277 bits (708), Expect = 6e-72, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 206/349 (59%), Gaps = 38/349 (10%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQY--SREYGSEDEWQ 80
M + A L +L V+ + A + +S+ + +E W + SR+ E +
Sbjct: 1 MKMGKAFLFAVVLAVILVAAMSMEITERDLASEESLWDLYERWRSHHTVSRDL---SEKR 57
Query: 81 RRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV--- 137
+RF ++ +NV +I +N ++ +KL N FAD++N EF Y K Y
Sbjct: 58 KRFNVFKANVHHIHKVNQKDKPYKLKLNSFADMTNHEFREFYSSKVKHYRMLHGSRANTG 117
Query: 138 ----QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
+ LPASVDWRK+GAVT VK+QG+CGSCWAFS V VEGINK+KTG+LVSLSEQE
Sbjct: 118 FMHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQE 177
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
LVDC+ ++N+GCNGG ME A+EFI K GG+TTE YPY+ ++ C + K AVTI G
Sbjct: 178 LVDCE--TDNEGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDSSKMNAPAVTIDG 235
Query: 254 YEAIPAR----------------------YAFQLYSHGVF-DEYCGHQLNHGVTVVGYGE 290
+E +PA Q YS GV+ + CG++L+HGV VVGYG
Sbjct: 236 HEMVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGT 295
Query: 291 D-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G KYW+VKNSWGT WGE GYIRM R ++ G+CGI M+ASYP+K
Sbjct: 296 ALDGTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPLK 344
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 146/313 (46%), Positives = 189/313 (60%), Gaps = 37/313 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFIS 120
+E+WL Q+ + Y + E ++RF I+ N+++ID NS + +FK+ NKFADL+NEEF S
Sbjct: 53 YESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKVGLNKFADLTNEEFRS 112
Query: 121 TYLG--------YNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCW 167
YLG + + S +YL LP +VDWRK GAV VKDQGQCGSCW
Sbjct: 113 VYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNGAVAKVKDQGQCGSCW 172
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS +AAVEGIN++ TG+L+SLSEQELVDCD S N GC+GG M+ A+EFI GG+ T+
Sbjct: 173 AFSTIAAVEGINQIVTGELLSLSEQELVDCDT-SYNSGCDGGLMDYAYEFIINNGGIDTD 231
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
DYPY K+ +C + VTI +E +P FQ
Sbjct: 232 ADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQPVSVAIEAGGSTFQF 291
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
Y GVF CG L+HGV VGYG D G+ YW+V+NSWG WGE+GYIRM RN + G
Sbjct: 292 YQSGVFTGKCGADLDHGVVAVGYGSDDGKDYWIVRNSWGADWGESGYIRMERNLETVKTG 351
Query: 326 ICGILMQASYPVK 338
CGI ++ SYP+K
Sbjct: 352 KCGIAIEPSYPIK 364
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 152/348 (43%), Positives = 196/348 (56%), Gaps = 41/348 (11%)
Query: 21 MRMMLRNAVLSLFLL---WVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSED 77
MR +N L LFL+ W + + SE ER E W+ QY + Y
Sbjct: 1 MRSFSQNHYLILFLILTVWTFHVMSRRLSE--------VCTSERHEKWMAQYGKLYTDAA 52
Query: 78 EWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNK------PYN 130
E ++RF I+ +NVQ+I+ N+ + F L+ N+FADL NEEF ++ + K
Sbjct: 53 EKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGVETAT 112
Query: 131 EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
E + +P ++DWRK GAVTP+KDQG CGSCWAFS VAA+EGI+++ TGKLVSLS
Sbjct: 113 ETSFRYESITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLS 172
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQELVDC V +++GCN GY E+AFEF+ K GG+ +E YPY+ N C K
Sbjct: 173 EQELVDC-VKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQ 231
Query: 251 ITGYEAIPARY--------------------AFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
I GYE +P+ A Q YS G+F CG NH TV+GYG+
Sbjct: 232 IKGYENVPSNSEKALLKAVANQPVSVYIDAGALQFYSSGIFTGKCGTAPNHAATVIGYGK 291
Query: 291 DH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G KYWLVKNSWGT WGE GYIRM R+ + G+CGI ASYP
Sbjct: 292 ARGGAKYWLVKNSWGTKWGEKGYIRMKRDIRAKE-GLCGIATNASYPT 338
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 141/315 (44%), Positives = 195/315 (61%), Gaps = 38/315 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFADLSN 115
M++R W+ ++ R Y E R+ ++ +NV+ I+++NS +FKL N+FADL+N
Sbjct: 34 MQKRHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTN 93
Query: 116 EEFISTYLGY---------NKPYNEP-RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
+EF S Y G+ ++ P R+ +V LP SVDWRK+GAVTP+K+QG CG
Sbjct: 94 DEFRSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGC 153
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFSAVAA+EG ++K GKL+SLSEQ+LVDCD N + GC GG M+ AFE I GG+T
Sbjct: 154 CWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCEGGLMDTAFEHIKATGGLT 211
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAF 263
TE +YPY+G++ C + KT A +ITGYE +P + F
Sbjct: 212 TESNYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDF 271
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
Q YS GVF C L+H VT +GYGE +G KYW++KNSWGT WGE+GY+R+ ++
Sbjct: 272 QFYSSGVFTGECTTYLDHAVTAIGYGESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDK 331
Query: 323 NIGICGILMQASYPV 337
G+CG+ M+ASYP
Sbjct: 332 Q-GLCGLAMKASYPT 345
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 154/336 (45%), Positives = 192/336 (57%), Gaps = 54/336 (16%)
Query: 30 LSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+ L LL+VL AW S+ + SM ER E+W+ QY REY DE +R+ I+
Sbjct: 10 ICLALLFVLA----AWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKD 65
Query: 89 NVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPA 144
NV I+ N + + S+KL+ N+FADL+NEEF ++ + S +Y +P+
Sbjct: 66 NVARIESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICSTEATSFKYENVTAVPS 125
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+VDWRK+GAVTP+KDQGQCGSCWAFSAVAA+EGI +L TGKL+SLSEQELVDCD + E+Q
Sbjct: 126 TVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---- 260
GC +YPY G + C K H A I GYE +PA
Sbjct: 186 GCT---------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 224
Query: 261 ------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
FQ YS GVF CG +L+HGV VGYG D G KYWLVKN
Sbjct: 225 LQKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDGMKYWLVKN 284
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SW T WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 285 SWSTGWGEEGYIRMQRDVTAKE-GLCGIAMQASYPT 319
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 147/311 (47%), Positives = 186/311 (59%), Gaps = 35/311 (11%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
E +E W ++ S DE +RF ++ +NV Y+ N ++ +KL NKFAD++N EF
Sbjct: 36 ELYERWRSHHTVSR-SLDEKDKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFR 94
Query: 120 STYLGYNKPYNEPRWPSVQYLG---------LPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
Y G ++ + + G +P SVDWRK+GAVTPVKDQG+CGSCWAFS
Sbjct: 95 HHYAGSKIKHHRSFLGASRANGTFMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSCWAFS 154
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
V AVEGIN++KT +LVSLSEQELVDCD S+NQGCNGG M+ AFEFI K GG+ TE++Y
Sbjct: 155 TVVAVEGINQIKTNELVSLSEQELVDCDT-SQNQGCNGGLMDMAFEFIKKKGGINTEENY 213
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSH 268
PY + C K V+I GYE +P FQ YS
Sbjct: 214 PYMAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQFYSE 273
Query: 269 GVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVF CG +L+HGV +VGYG G KYW+V+NSWG WGE GYIRM R + G+C
Sbjct: 274 GVFTGDCGTELDHGVAIVGYGTTLDGTKYWIVRNSWGPEWGEKGYIRMQREIDAEE-GLC 332
Query: 328 GILMQASYPVK 338
GI MQ SYP+K
Sbjct: 333 GIAMQPSYPIK 343
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 155/338 (45%), Positives = 205/338 (60%), Gaps = 42/338 (12%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
M L + ++ + LL ++G+ A S+ + SM ER E+W+ Y R Y E +RR
Sbjct: 1 MALESKIICITLL-IMGVWA---SQALSRTLHEVSMSERHEDWMGLYGRTYKDIAEKERR 56
Query: 83 FGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL 142
F I+ NV+YI+ +N FK + N + ++S+ S + R+ +V +
Sbjct: 57 FKIFKENVEYIESVNK----FKASRNGY-NMSSRPRSSEITSF-------RYENVA--AV 102
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P+S+DWRK+GAVTP+KDQGQCG CWAFSAVAA+EG+ +LKTG+L+SLSEQELVDCD + E
Sbjct: 103 PSSMDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGE 162
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA 262
+QGC GG M+ AFEFI GG+TTE +YPY+G + C K A I YE +PA
Sbjct: 163 DQGCGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSE 222
Query: 263 ----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLV 299
FQ YS GVF CG +L+HGVT VGYG+ D G KYWLV
Sbjct: 223 AALLKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDGTKYWLV 282
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWGT WGE GYI M R+ ++ G+CGI M+ASYP
Sbjct: 283 KNSWGTGWGEDGYIWMERD-IGADEGLCGIAMEASYPT 319
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 150/315 (47%), Positives = 188/315 (59%), Gaps = 35/315 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+S+ + +E W ++ S E +RF ++ NV ++ N + +KL NKFAD++N
Sbjct: 34 ESLWDLYERWRSHHTVSR-SLTEKHKRFNVFKENVMHVHNTNKMDKPYKLKLNKFADMTN 92
Query: 116 EEFISTYLG----YNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSC 166
EF STY G ++K + + + ++ +PASVDWRK+GAVT VKDQGQCGSC
Sbjct: 93 HEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSC 152
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V AVEGIN++KT KLVSLSEQELVDCD ENQGCNGG ME AFEFI + GG+TT
Sbjct: 153 WAFSTVVAVEGINQIKTDKLVSLSEQELVDCD-KEENQGCNGGLMESAFEFIKQKGGITT 211
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E +YPY + C K AV+I G+E +P FQ
Sbjct: 212 ESNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQ 271
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV C LNHGV +VGYG G YW+V+NSWG WGE GYIRM RN S
Sbjct: 272 FYSEGVLTGDCNTDLNHGVAIVGYGTTVDGTNYWIVRNSWGPEWGEQGYIRMQRN-ISKK 330
Query: 324 IGICGILMQASYPVK 338
G+CGI M ASYP+K
Sbjct: 331 EGLCGIAMMASYPIK 345
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 192/310 (61%), Gaps = 33/310 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL--SFKLTDNKFADLSNEEFI 119
+E WL ++ R Y + E RRF ++ N++++D N + F+L N+FADL+N+EF
Sbjct: 109 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 168
Query: 120 STYLGYNKPYNEPRWPSV----QYLG----LPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
+ YLG P + R +V ++ G LP SVDWR++GAV PVK+QGQCGSCWAFSA
Sbjct: 169 AAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 228
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
V++VE +N++ TG++V+LSEQELV+C + N GCNGG M+ AF+FI K GG+ TE DYP
Sbjct: 229 VSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDYP 288
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHG 269
Y+ + +C ++ V+I G+E +P FQLY G
Sbjct: 289 YKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYKAG 348
Query: 270 VFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
VF C L+HGV VGYG ++G+ YW+V+NSWG WGE GYIRM RN ++ G CGI
Sbjct: 349 VFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERNV-NATTGKCGI 407
Query: 330 LMQASYPVKR 339
M ASYP K+
Sbjct: 408 AMMASYPTKK 417
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 146/342 (42%), Positives = 206/342 (60%), Gaps = 39/342 (11%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
L+LF ++ LG+ + P Y+ SM R + W+ + + Y +E + RF I+ N
Sbjct: 12 LALFFIF-LGVWRSQVASSRPINYEA-SMRARHDQWIAHHDKVYKDLNEKEMRFKIFKEN 69
Query: 90 VQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGY----------NKPYNEPRWPSVQ 138
V+ I+ N+ ++ +KL NKF+DL+NE+F + GY +KP R+ +V
Sbjct: 70 VERIEAFNAGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRSHPKVMSSSKPKTHFRYANVT 129
Query: 139 YLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
+P ++DWRK+GAVTP+KDQ +CG CWAFSAVAA EG+++LKTGKL+ LSEQELVDCD
Sbjct: 130 --DIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCD 187
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
V E++GC+GG ++ AF+FI K G+TTE +YPY+G++ C K+ A I GYE +P
Sbjct: 188 VEGEDEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVCNKKKSALSAAKIAGYEDVP 247
Query: 259 AR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEK 295
A + FQ YS GVF C LNH VT VGYG G K
Sbjct: 248 ANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTK 307
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YW++KNSWG+ WG++GY+R+ R+ G+CG+ M ASYP
Sbjct: 308 YWIIKNSWGSKWGDSGYMRIKRDVHEKE-GLCGLAMDASYPT 348
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 144/307 (46%), Positives = 190/307 (61%), Gaps = 32/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFIS 120
+E+WL ++ + Y + E ++RF I+ N YID N+ ++ SFKL N+FADL+NEE+ S
Sbjct: 44 YESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNEEYRS 103
Query: 121 TYLGYNKPYNEP-------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
Y G + R+ S+ LP SVDWR+ GAV VKDQGQCGSCWAFS ++
Sbjct: 104 KYTGIRTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQCGSCWAFSTIS 163
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEGIN++ TGKL++LSEQELVDCD S N+GCNGG M+ AF+FI GG+ ++ DYPY
Sbjct: 164 AVEGINQIATGKLITLSEQELVDCD-RSYNEGCNGGLMDDAFQFIINNGGIDSDADYPYT 222
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVF 271
G++ +C + VTI YE +P + FQ Y G+F
Sbjct: 223 GRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYDSGIF 282
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG L+HGV VVGYG ++G+ YW+V+NSWG WGE GY+RM R SS GICGI
Sbjct: 283 TGKCGTDLDHGVVVVGYGTENGKDYWIVRNSWGADWGEKGYLRMERG-ISSKAGICGITS 341
Query: 332 QASYPVK 338
+ SYPVK
Sbjct: 342 EPSYPVK 348
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 145/299 (48%), Positives = 186/299 (62%), Gaps = 30/299 (10%)
Query: 66 LKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLG 124
+ +Y R Y +E ++RF I+ NV I+ N + + ++KL+ N+FADL+NEEF S
Sbjct: 1 MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNR 60
Query: 125 YNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKL 181
+ K + + +Y +P+++DWRK+GAVTP+KDQ QCG CWAFSAVAA EGI ++
Sbjct: 61 F-KAHICSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQI 119
Query: 182 KTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQT 241
TGKL+SLSEQELVDCD ENQGC+GG M+ AF FI KI G+ +E YPY G + C +
Sbjct: 120 TTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFI-KIHGLASEATYPYEGDDGTCNS 178
Query: 242 DKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQL 279
K H A I GYE +PA + FQ Y+ GVF CG +L
Sbjct: 179 KKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTEL 238
Query: 280 NHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+HGV VGYG D G YWLVKNSWGT WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 239 DHGVAAVGYGIGDDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKE-GLCGIAMQASYPT 296
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 140/321 (43%), Positives = 198/321 (61%), Gaps = 34/321 (10%)
Query: 51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL--SFKLTDN 108
++ +P++ +E WL ++ R Y + E RRF ++ N++++D N + F+L N
Sbjct: 42 ERTEPEA-RTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMN 100
Query: 109 KFADLSNEEFISTYLGYNKPYNEPRWPSV----QYLG----LPASVDWRKEGAVTPVKDQ 160
+FADL+N+EF + YLG P + R +V ++ G LP SVDWR++GAV PVK+Q
Sbjct: 101 QFADLTNDEFRAAYLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQ 160
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
GQCGSCWAFSAV++VE +N++ TG++V+LSEQELV+C + N GCNGG M+ AF+FI K
Sbjct: 161 GQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIK 220
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------------- 260
GG+ TE DYPY+ + +C ++ V+I G+E +P
Sbjct: 221 NGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 280
Query: 261 --YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
FQLY GVF C L+HGV VGYG ++G+ YW+V+NSWG WGE GYIRM RN
Sbjct: 281 GGREFQLYKAGVFTGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERN 340
Query: 319 SPSSNIGICGILMQASYPVKR 339
++ G CGI M ASYP K+
Sbjct: 341 V-NATTGKCGIAMMASYPTKK 360
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 155/352 (44%), Positives = 208/352 (59%), Gaps = 44/352 (12%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYP-QKYDPQSME---ERFENWLKQYSREYGSEDEW 79
M + ++L+L VLG ++E + + D +S E + +E W ++ S DE
Sbjct: 1 MKKLLFVALYLALVLG-----FTESFDFHEKDLESEESLWDLYEKWRSHHTVST-SLDEK 54
Query: 80 QRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTY---------LGYNKPYN 130
++RF ++ +NV ++ N + +KL NKFAD++N EF + Y + P
Sbjct: 55 RKRFNVFRANVLHVHNTNKMDKPYKLKLNKFADMTNHEFRTAYASSKVKHHTMFRGAPLG 114
Query: 131 EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
+ +PAS+DWRK+GAVTPVKDQG+CGSCWAFS + AVEGIN +KT KL+SLS
Sbjct: 115 NGSFMYGNIDKVPASIDWRKKGAVTPVKDQGKCGSCWAFSTIVAVEGINFIKTNKLISLS 174
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQELVDC+ EN GCNGG M+ AFEFITK G+TTE +YPYR ++ C +K AV+
Sbjct: 175 EQELVDCNT-GENHGCNGGLMDYAFEFITKQKGITTEANYPYRAQDGHCDANKANQPAVS 233
Query: 251 ITGYEAI---------------PARYA-------FQLYSHGVFDEYCGHQLNHGVTVVGY 288
I G+E + P A FQ YS GVF CG +L+HGV +VGY
Sbjct: 234 IDGHEDVLHNNENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGECGKELDHGVAIVGY 293
Query: 289 GED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
G G KYW+V+NSWG WGE GYIRM R S G+CGI M+ASYP+K+
Sbjct: 294 GTTVDGTKYWIVRNSWGPEWGERGYIRMQR-GISDRRGLCGIAMEASYPIKK 344
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 140/321 (43%), Positives = 197/321 (61%), Gaps = 34/321 (10%)
Query: 51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL--SFKLTDN 108
++ +P++ +E WL ++ R Y + E RRF ++ N++++D N + F+L N
Sbjct: 39 ERTEPEA-RTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMN 97
Query: 109 KFADLSNEEFISTYLGYNKPYNEPRWPSV----QYLG----LPASVDWRKEGAVTPVKDQ 160
+FADL+N+EF + YLG P R +V ++ G LP SVDWR++GAV PVK+Q
Sbjct: 98 QFADLTNDEFRAAYLGARIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQ 157
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
GQCGSCWAFSAV++VE +N++ TG++V+LSEQELV+C + N GCNGG M+ AF+FI K
Sbjct: 158 GQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIK 217
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------------- 260
GG+ TE DYPY+ + +C ++ V+I G+E +P
Sbjct: 218 NGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 277
Query: 261 --YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
FQLY GVF C L+HGV VGYG ++G+ YW+V+NSWG WGE GYIRM RN
Sbjct: 278 GGREFQLYKAGVFSGTCTTNLDHGVVAVGYGTENGKDYWIVRNSWGAKWGEDGYIRMERN 337
Query: 319 SPSSNIGICGILMQASYPVKR 339
++ G CGI M ASYP K+
Sbjct: 338 V-NATTGKCGIAMMASYPTKK 357
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 155/347 (44%), Positives = 206/347 (59%), Gaps = 32/347 (9%)
Query: 21 MRMMLRNAVLSLFLLW---VLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSED 77
M+M +V+S+ LL+ +L + + + Q+ + Q M +E+WL + + Y S D
Sbjct: 1 MKMGSPKSVISMSLLFFSTLLILSSALDIKNSVQRTNDQVMA-MYESWLVEQGKSYNSLD 59
Query: 78 EWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYN---KPYNEPR 133
E + RF I+ N++ ID N+ N S+ L N+FADL++EE+ STYLG+ K R
Sbjct: 60 EKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGPKAKVSNR 119
Query: 134 WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
+ + LP VDWR GAV VKDQG C SCWAFSAVAAVEGINK+ TG L+SLSEQE
Sbjct: 120 YVPKVGVVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQE 179
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
LVDC +GCN GYM AF+FI GG+ TED+YPY ++ +C + VTI
Sbjct: 180 LVDCGRTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRKNQRYVTIDN 239
Query: 254 YEAIPARY----------------------AFQLYSHGVFDEYCGHQLNHGVTVVGYGED 291
YE +PA F+LY+ G++ YCG ++HGVT+VGYG +
Sbjct: 240 YEQLPANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYGTE 299
Query: 292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G YW+VKNSWGT+WGE GYIR+ RN + G CGI M SYPVK
Sbjct: 300 RGLDYWIVKNSWGTNWGENGYIRIQRNIGGA--GKCGIAMVPSYPVK 344
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 147/319 (46%), Positives = 192/319 (60%), Gaps = 35/319 (10%)
Query: 51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNK 109
+K SM ER E W+++Y + Y E Q+RF I+ +NV++I+ N+ N +KL+ N
Sbjct: 27 RKLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNKPYKLSINH 86
Query: 110 FADLSNEEFISTYLGYNKPYNEPRWPSVQ-------YLGLPASVDWRKEGAVTPVKDQGQ 162
AD +NEEF++++ GY + + + Q +P +VDWR++G VT +KDQ Q
Sbjct: 87 LADQTNEEFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPWAVDWRQKGDVTSIKDQAQ 146
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CG+CWAFSAVAA EGI ++ TG LVSLSE+ELVDCD S + GC+GG ME FEFI K G
Sbjct: 147 CGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCD--SVDHGCDGGLMEHGFEFIIKNG 204
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------- 260
G+++E +YPY N C T+K ITGYE +P
Sbjct: 205 GISSEANYPYTAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVANQLTMSVSIDAG 264
Query: 261 -YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARN 318
AFQ Y GVF CG QL+HGVT VGYG D+G +YW+VKNSWGT WGE GYIRM R
Sbjct: 265 GSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTDYGTQYWIVKNSWGTQWGEEGYIRMLRG 324
Query: 319 SPSSNIGICGILMQASYPV 337
+ G+CGI M ASYP
Sbjct: 325 IDAQE-GLCGIAMDASYPT 342
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 150/325 (46%), Positives = 188/325 (57%), Gaps = 43/325 (13%)
Query: 56 QSMEERFENWLKQYSR---EYGSEDEWQ-RRFGIYSSNVQYIDYINSQN-LSFKLTDNKF 110
+S+ +E W Y R G + + Q RRF ++ N +Y+ N ++ F+L NKF
Sbjct: 35 ESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALNKF 94
Query: 111 ADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-------------LPASVDWRKEGAVTPV 157
AD++ +EF TY G ++ + + LP +VDWR GAVT V
Sbjct: 95 ADMTTDEFRRTYAGSRTRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAVTGV 154
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
KDQGQCGSCWAFSA+AAVEG+NK+ TGKLVSLSEQELVDCD + +NQGC+GG M+ AF++
Sbjct: 155 KDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCD-DVDNQGCDGGLMDYAFQY 213
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------- 260
I + GGVTTE +YPY + C K + H VTI GYE +PA
Sbjct: 214 IQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVA 273
Query: 261 -----YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIR 314
FQ YS GVF CG L+HGV VGYG G KYW VKNSWG WGE GYIR
Sbjct: 274 IEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDGTKYWTVKNSWGEDWGERGYIR 333
Query: 315 MARNSPSSNIGICGILMQASYPVKR 339
M R P S G+CGI M+ SYP K+
Sbjct: 334 MQRGVPDSR-GLCGIAMEPSYPTKK 357
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 148/319 (46%), Positives = 189/319 (59%), Gaps = 36/319 (11%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-DYINSQNLSFKLTDNKFA 111
Y + +E WL ++ + Y + +RF ++ N+ +I ++ N+ N ++KL NKFA
Sbjct: 29 YTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQVFKDNLGFIQEHNNNLNNTYKLGLNKFA 88
Query: 112 DLSNEEFISTYLGYNKPYNEPRWPSVQYLG----------LPASVDWRKEGAVTPVKDQG 161
D++NEE+ + YLG K + R + G LP VDWR +GAV P+KDQG
Sbjct: 89 DMTNEEYRAMYLG-TKSNAKRRLMKTKSTGHRYAFSARDRLPVHVDWRMKGAVAPIKDQG 147
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
CGSCWAFS VA VE INK+ TGK VSLSEQELVDCD + N+GCNGG M+ AFEFI +
Sbjct: 148 SCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCD-RAYNEGCNGGLMDYAFEFIIQN 206
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------A 259
GG+ T+ DYPYRG + C K V I GYE +P +
Sbjct: 207 GGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVPPYDENALKKAVAHQPVSVAIEAS 266
Query: 260 RYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
A QLY GVF CG L+HGV VVGYG ++G YWLV+NSWGT WGE GY +M RN
Sbjct: 267 GRALQLYQSGVFTGKCGTSLDHGVVVVGYGSENGVDYWLVRNSWGTGWGEDGYFKMQRNV 326
Query: 320 PSSNIGICGILMQASYPVK 338
+S G CGI M+ASYPVK
Sbjct: 327 RTST-GKCGITMEASYPVK 344
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 151/369 (40%), Positives = 209/369 (56%), Gaps = 51/369 (13%)
Query: 1 MQHRLFIAIYTNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEE 60
+ + I ++T L + A+DM ++ ++ + K +S EE
Sbjct: 7 LMATILIVLFTVLAVSSALDMSII-------------------SYDRSHADKSGWKSDEE 47
Query: 61 R---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
+E WL ++ + Y + +E ++RF I+ N+ +I+ N+ N ++K+ N+F+DLSNEE
Sbjct: 48 VMSIYEEWLVKHGKVYNAVEEKEKRFQIFKDNLNFIEEHNAVNRTYKVGLNRFSDLSNEE 107
Query: 118 FISTYLGYN-KPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
+ S YLG P PS +Y LP SVDWRKEGAV VK+Q +C CWAFSA
Sbjct: 108 YRSKYLGTKIDPSRMMARPSRRYSPRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSA 167
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
+AAVEGINK+ TG L +LSEQEL+DCD + N GC+GG ++ AFEFI GG+ TE+DYP
Sbjct: 168 IAAVEGINKIVTGNLTALSEQELLDCD-RTVNAGCSGGLVDYAFEFIINNGGIDTEEDYP 226
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPA----------------------RYAFQLYSHG 269
++G + C K AVTI GYE +PA FQLY G
Sbjct: 227 FQGADGICDQYKINARAVTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESG 286
Query: 270 VFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
+F CG ++HGVT VGYG ++G YW+VKNSWG +WGEAGY+ M RN G CGI
Sbjct: 287 IFTGTCGTSIDHGVTAVGYGTENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGI 346
Query: 330 LMQASYPVK 338
+ YP+K
Sbjct: 347 AILTLYPIK 355
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 185/324 (57%), Gaps = 46/324 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
M +RFE W+ ++ R Y E QRRF +Y NV+ ++ NS + +KL DNKFADL+NEE
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTNEE 86
Query: 118 FISTYLGYNKPYNEPRWPS-----VQYLG------LPASVDWRKEGAVTPVKDQGQCGSC 166
F + LG+ P+ + + G LP SVDWRK+GAV VK+QG CGSC
Sbjct: 87 FRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGSC 146
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSAVAA+EGIN++K G+LVSLSEQELVDCD E GC GGYM AFEF+ G+TT
Sbjct: 147 WAFSAVAAIEGINQIKNGELVSLSEQELVDCD--DEAVGCGGGYMSWAFEFVVGNHGLTT 204
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-------ARYA---------------FQ 264
E YPY N CQ K AV I GY + AR A FQ
Sbjct: 205 EASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQ 264
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK-----------YWLVKNSWGTSWGEAGYI 313
LY GV+ C +NHGVTVVGYGE + YW+VKNSWG WG+AGYI
Sbjct: 265 LYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYI 324
Query: 314 RMARNSPSSNIGICGILMQASYPV 337
M R+ G+CGI + SYPV
Sbjct: 325 LMQRDVAGLASGLCGIALLPSYPV 348
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 154/352 (43%), Positives = 208/352 (59%), Gaps = 43/352 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREY---GSED 77
M + +++L + + A + S PQ+ D + M ++ W ++ + + G+E
Sbjct: 1 MGTFQSSPIMALLFFLFIALSAASPSSIIPQRTDDEVMA-LYDQWRAKHGKLHNNLGAEP 59
Query: 78 EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR-WPS 136
E RF I+ N+++ID IN+QNL ++L N FADL+NEE+ S YLG R S
Sbjct: 60 E--NRFHIFKDNLKFIDEINAQNLPYRLGLNVFADLTNEEYRSRYLGGKFASGSRRNRTS 117
Query: 137 VQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSE 191
+YL LP S+DWR +GAV PVKDQG CGSCWAFS VA+VE IN++ TG L++LSE
Sbjct: 118 NRYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSE 177
Query: 192 QELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTI 251
QELVDCD S N+GCNGG M+ AFEFI + GG+ TE+DYPY G + C ++ I
Sbjct: 178 QELVDCD-RSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSC----IQYKKNAI 232
Query: 252 TGYEAIPAR-------------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
GYE +P +FQLY G+F CG L+HGV VV
Sbjct: 233 DGYEDVPVNNEKALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVV 292
Query: 287 GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GYG + G YW+V+NSWG SWGE+GY++M RN +S G+CGI M+ SYP K
Sbjct: 293 GYGSEGGVDYWIVRNSWGGSWGESGYVKMQRNI-ASPTGLCGIAMEPSYPTK 343
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 186/308 (60%), Gaps = 32/308 (10%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D +M R E W+ QY R Y + E RRF ++ +N +I+ N+ N F L N+FADL
Sbjct: 29 DDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAFIESFNAGNHKFWLGVNQFADL 88
Query: 114 SNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+N+EF T + R P+ V LPA++DWR +G VTP+KDQGQCG CW
Sbjct: 89 TNDEFRLTKTNKGFIPSTTRVPTGFRYENVNIDALPATMDWRTKGVVTPIKDQGQCGCCW 148
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA+EGI KL TGKL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+TTE
Sbjct: 149 AFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTE 208
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
+YPY +D+C++ + +I GYE +PA FQ
Sbjct: 209 SNYPYAAADDKCKS--VSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGDDMTFQF 266
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GV CG L+HG+ +GYG+ G KYWL+KNSWG +WGE G++RM ++ S
Sbjct: 267 YKGGVMIGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKD-ISDKR 325
Query: 325 GICGILMQ 332
G+CG+ M+
Sbjct: 326 GMCGLAME 333
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 185/324 (57%), Gaps = 46/324 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
M +RFE W+ ++ R Y E QRRF +Y NV+ ++ NS + +KL DNKFADL+NEE
Sbjct: 28 MLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTNEE 87
Query: 118 FISTYLGYNKPYNEPRWPS-----VQYLG------LPASVDWRKEGAVTPVKDQGQCGSC 166
F + LG+ P+ + + G LP SVDWRK+GAV VK+QG CGSC
Sbjct: 88 FRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGSC 147
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSAVAA+EGIN++K G+LVSLSEQELVDCD E GC GGYM AFEF+ G+TT
Sbjct: 148 WAFSAVAAIEGINQIKNGELVSLSEQELVDCD--DEAVGCGGGYMSWAFEFVVGNHGLTT 205
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-------ARYA---------------FQ 264
E YPY N CQ K AV I GY + AR A FQ
Sbjct: 206 EASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQ 265
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK-----------YWLVKNSWGTSWGEAGYI 313
LY GV+ C +NHGVTVVGYGE + YW+VKNSWG WG+AGYI
Sbjct: 266 LYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYI 325
Query: 314 RMARNSPSSNIGICGILMQASYPV 337
M R+ G+CGI + SYPV
Sbjct: 326 LMQRDVAGLASGLCGIALLPSYPV 349
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 143/347 (41%), Positives = 199/347 (57%), Gaps = 54/347 (15%)
Query: 43 GAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ--- 99
G+ + P D M +RF W ++SR Y + +E + R +Y+ N++YI+ N
Sbjct: 23 GSSATSRPATEDADPMAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGA 82
Query: 100 NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR----------------------WPSV 137
L+++L + + DL+++EF + Y P ++ W V
Sbjct: 83 GLTYELGETAYTDLTSDEFTAMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQV 142
Query: 138 ---QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+ G PASVDWR+ GAVT VK+QGQCGSCWAFS VA +EGI+++KTGKL SLSEQEL
Sbjct: 143 YVNESAGAPASVDWRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQEL 202
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDCD + GCNGG +A ++IT GG+T++DDYPY K+D C T K HHA +I+G+
Sbjct: 203 VDCD--KLDHGCNGGVSYRALQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGF 260
Query: 255 EAIPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDH 292
+ + R FQ Y +GV++ CG +LNHGVTVVGYGED
Sbjct: 261 QRVATRSELSLTNAVAMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYGEDE 320
Query: 293 --GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GE YW+VKNSWG WG+ GY+RM + GICGI ++ S+P+
Sbjct: 321 VTGESYWIVKNSWGEKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 155/348 (44%), Positives = 204/348 (58%), Gaps = 35/348 (10%)
Query: 22 RMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSME---ERFENWLKQYSREYGSEDE 78
+ ++ L LFL G GY + D +SM+ E FE+W+ ++ + Y + +E
Sbjct: 7 KTLVLTCSLCLFLSLAFGRDFSIV--GYSSE-DLKSMDKLIELFESWMSRHGKIYETIEE 63
Query: 79 WQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ 138
RF ++ N+++ID N ++ L N+FADLS++EF + YLG ++ R S +
Sbjct: 64 KLLRFEVFKDNLKHIDDRNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRESSEE 123
Query: 139 Y-----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
+ LP SVDWRK+GAVTPVK+QGQCGSCWAFS VAAVEGIN++ TG L SLSEQE
Sbjct: 124 EFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQE 183
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
L+DCD + N GCNGG M+ AF FI K GG+ E+DYPY + C+ K VTI G
Sbjct: 184 LIDCDT-TYNNGCNGGLMDYAFSFIVKNGGLHKEEDYPYIMEESTCEMKKEVSEVVTING 242
Query: 254 YEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED 291
Y +P + FQ YS GVFD +CG +L+HGV+ VGYG
Sbjct: 243 YHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGGVFDGHCGSELDHGVSAVGYGTS 302
Query: 292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
G Y +VKNSWG WGE G+IRM RN S GICG+ ASYP K+
Sbjct: 303 KGLDYIIVKNSWGAKWGEKGFIRMKRNIGKSE-GICGLYKMASYPTKK 349
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 157/350 (44%), Positives = 212/350 (60%), Gaps = 45/350 (12%)
Query: 25 LRNAVLSLFLLWVLGIPAGAWSEGYPQK--YDPQSMEERFENWLKQY--SREYGSEDEWQ 80
L A+LS+ L VLG A A S + +K +S+ +E W + SR+ D+
Sbjct: 4 LSYALLSVVL--VLGSVALAQSIPFDEKDLASEESLWSLYEKWRAHHAVSRDL---DDTD 58
Query: 81 RRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPY--------NE 131
+RF ++ NV++I N ++ ++KL NKF D++N+EF STY G + +
Sbjct: 59 KRFNVFKENVKFIHEFNQKKDATYKLALNKFGDMTNQEFRSTYAGSKIDHHMTLRGVKDA 118
Query: 132 PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSE 191
+ ++ LP SVDWR++GAVT VKDQGQCGSCWAFS V AVEGIN++KT +LVSLSE
Sbjct: 119 GEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGSCWAFSTVVAVEGINQIKTNELVSLSE 178
Query: 192 QELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTI 251
Q+LVDCD ++N GCNGG M+ AF+FI GG+++ED YPY + C ++ VTI
Sbjct: 179 QQLVDCD--TKNSGCNGGLMDYAFDFIKNNGGLSSEDSYPYLAEQKSCGSE-ANSAVVTI 235
Query: 252 TGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG 289
GY+ +P + YAFQ YS GVF +CG +L+HGV VGYG
Sbjct: 236 DGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQFYSQGVFSGHCGTELDHGVAAVGYG 295
Query: 290 -EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+D G+KYW+VKNSWG WGE+GYIRM R G CGI M+ASYP+K
Sbjct: 296 VDDDGKKYWIVKNSWGEGWGESGYIRMERGIKDKR-GKCGIAMEASYPIK 344
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 147/372 (39%), Positives = 209/372 (56%), Gaps = 67/372 (18%)
Query: 1 MQHRLFIAIYTNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEE 60
MQ LF+AI+++ + I++ L N ++ M++
Sbjct: 6 MQIFLFVAIFSSFYFSISLSRP--LDNELI---------------------------MQK 36
Query: 61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFADLSNEEF 118
R W+ ++ R Y E R+ ++ SNV+ I+++N+ +FKL N+FADL+N+EF
Sbjct: 37 RHIEWMTKHGRVYADVKEKSNRYVVFKSNVERIEHLNNIPAGRTFKLAVNQFADLTNDEF 96
Query: 119 ISTYLGYN----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
S Y G+ R+ +V LP SVDWR +GAVTP+K+QG CG CWA
Sbjct: 97 RSMYTGFKGVSSLSSQSQTKTTSFRYQNVSSGALPISVDWRTKGAVTPIKNQGSCGCCWA 156
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSAVAA+EG ++K GKL+SLSEQ+LVDCD N + GC GG M+ AFE I GG+TTE
Sbjct: 157 FSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCEGGLMDTAFEHIMATGGLTTES 214
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
+YPY+G++ C + KT A +ITGYE +P + FQ Y
Sbjct: 215 NYPYKGEDATCNSKKTNPKATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFY 274
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
S GVF C L+H VT +GYG+ +G KYW++KNSWGT WGE+GY+R+ ++ G
Sbjct: 275 SSGVFTGECTTYLDHAVTAIGYGQSTNGSKYWIIKNSWGTKWGESGYMRIQKDIKDKQ-G 333
Query: 326 ICGILMQASYPV 337
+CG+ M+ASYP
Sbjct: 334 LCGLAMKASYPT 345
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 190/314 (60%), Gaps = 38/314 (12%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN--LSFKLTDNKFA 111
D +M++R W+ ++ R Y +E R+ ++ NV+ I+ +N L+FKL N+FA
Sbjct: 23 DEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFA 82
Query: 112 DLSNEEFISTYLGY---------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
DL+NEEF S Y GY KP R+ V LP SVDWRK+GAVTP+KDQG
Sbjct: 83 DLTNEEFRSMYTGYKGNSVLSSRTKP-TSFRYQHVSSDALPISVDWRKKGAVTPIKDQGS 141
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFSAVAA+EG+ ++K GKL+SLSEQELVDCD N + GC GGYM AF + G
Sbjct: 142 CGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDD--GCMGGYMNSAFNYTMTTG 199
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------- 260
G+T+E +YPY+ + C +KTK A +I G+E +PA
Sbjct: 200 GLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGG 259
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQ YS GVF C L+HGV VVGYG+ +G KYW++KNSWG WGE GY+R+ +++
Sbjct: 260 TGFQFYSSGVFSGECSTHLDHGVAVVGYGKSSNGSKYWILKNSWGPKWGERGYMRIKKDT 319
Query: 320 PSSNIGICGILMQA 333
+ + G CG+ M A
Sbjct: 320 KAKH-GQCGLAMNA 332
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 149/317 (47%), Positives = 187/317 (58%), Gaps = 37/317 (11%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S+ +E W Y SR D +RRF ++ N +Y+ N +++ F+L NKFAD+
Sbjct: 35 ESLRGLYERWRSHYTVSRRGLGADAGERRFNVFKQNARYVHEGNKRDMPFRLALNKFADM 94
Query: 114 SNEEFISTYLGYNKPYN---------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
+ +EF TY G ++ + + LP +VDWR++GAVT +KDQGQCG
Sbjct: 95 TTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQCG 154
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS + AVEGINK++TGKLVSLSEQEL+DCD N NQGC+GG M+ AF+FI K G+
Sbjct: 155 SCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCD-NVNNQGCDGGLMDYAFQFIQK-NGI 212
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE +YPY+G+ C K AVTI GYE +PA
Sbjct: 213 TTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQD 272
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF C L+HGV VGYG G KYW+VKNSWG WGE GYIRM R S
Sbjct: 273 FQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGV-S 331
Query: 322 SNIGICGILMQASYPVK 338
G+CGI MQASYP K
Sbjct: 332 QTEGLCGIAMQASYPTK 348
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 147/318 (46%), Positives = 183/318 (57%), Gaps = 35/318 (11%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFK--LTDNKFA 111
D +M +R E W+ ++ R Y + E RR ++ NV +I+ +N+ K L +N+FA
Sbjct: 32 DAAAMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFA 91
Query: 112 DLSNEEFISTYLGY-------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
DL+N EF +T G N+ R+ +V LPASVDWR +GAV PVKDQG CG
Sbjct: 92 DLTNAEFRATRTGLRPSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCG 151
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
CWAFSAVAA+EG KL TGKLVSLSEQ+LV CDV E+QGC GG M+ AF+FI K GG+
Sbjct: 152 CCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGL 211
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
E DYPY +D+C T A TI GYE +PA
Sbjct: 212 AAESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRH 271
Query: 263 FQLYSHGVFDEY--CGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQ Y GV C +L+H +T VGYG G KYWL+KNSWGTSWGE GY+RM R
Sbjct: 272 FQFYKGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGV 331
Query: 320 PSSNIGICGILMQASYPV 337
G+CG+ M ASYP
Sbjct: 332 ADKE-GVCGLAMMASYPT 348
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 149/317 (47%), Positives = 187/317 (58%), Gaps = 37/317 (11%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S+ +E W Y SR D +RRF ++ N +Y+ N +++ F+L NKFAD+
Sbjct: 35 ESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKQNARYVHEGNKRDMPFRLALNKFADM 94
Query: 114 SNEEFISTYLGYNKPYN---------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
+ +EF TY G ++ + + LP +VDWR++GAVT +KDQGQCG
Sbjct: 95 TTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQCG 154
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS + AVEGINK++TGKLVSLSEQEL+DCD N NQGC+GG M+ AF+FI K G+
Sbjct: 155 SCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCD-NVNNQGCDGGLMDYAFQFIQK-NGI 212
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE +YPY+G+ C K AVTI GYE +PA
Sbjct: 213 TTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQD 272
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF C L+HGV VGYG G KYW+VKNSWG WGE GYIRM R S
Sbjct: 273 FQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGV-S 331
Query: 322 SNIGICGILMQASYPVK 338
G+CGI MQASYP K
Sbjct: 332 QTEGLCGIAMQASYPTK 348
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 151/311 (48%), Positives = 184/311 (59%), Gaps = 37/311 (11%)
Query: 62 FENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+E W Y SR D +RRF ++ N +YI N ++ F+L NKFAD++ +EF
Sbjct: 40 YERWRSHYTVSRRGLGADAEERRFNVFKENARYIHEGNKKDRPFRLALNKFADMTTDEFR 99
Query: 120 STYLGYNKPYNEPRWPSVQYLG---------LPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
TY G ++ + G LP +VDWR++GAVT +KDQGQCGSCWAFS
Sbjct: 100 RTYAGSRVRHHLSLSGGRRGDGSFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFS 159
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
+ AVEGINK++TGKLVSLSEQEL+DCD N NQGC+GG M+ AF+FI K G+TTE +Y
Sbjct: 160 TIVAVEGINKIRTGKLVSLSEQELMDCD-NVNNQGCDGGLMDYAFQFIHK-NGITTESNY 217
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSH 268
PY+G+ C K K HAVTI GYE +PA FQ YS
Sbjct: 218 PYQGEQGSCDLAKEKAHAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGNDFQFYSE 277
Query: 269 GVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVF C L+HGV VGYG G KYW+VKNSWG WGE GYIRM R + G C
Sbjct: 278 GVFTGECSTDLDHGVAAVGYGTTRDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQAE-GQC 336
Query: 328 GILMQASYPVK 338
GI MQASYP K
Sbjct: 337 GIAMQASYPTK 347
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 150/348 (43%), Positives = 202/348 (58%), Gaps = 32/348 (9%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER-FENWLKQYSREYGSEDEW 79
M +++ L+L + +L I S ++ R +E WL + + Y E
Sbjct: 1 MATPIKSITLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEK 60
Query: 80 QRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ 138
+ RF I++ N++YI+ NS N +F++ +FADL+N+EF + YL +
Sbjct: 61 ETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGER 120
Query: 139 YL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
YL LP +DWR +GAV PVKDQG CGSCWAFSA+ AVEGIN++KTG+L+SLSEQE
Sbjct: 121 YLYKVGDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR-CQTDKTKHHAVTIT 252
LVDCD S N GC GG M+ AF+FI + GG+ TE+DYPY +D C +DK VTI
Sbjct: 181 LVDCDT-SYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTID 239
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
GYE +P AFQLY GVF CG L+HGV VGYG
Sbjct: 240 GYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYGS 299
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+ G+ YW+V+NSWG++WGE+GY ++ RN S+ G CG+ M ASYP K
Sbjct: 300 EGGQDYWIVRNSWGSNWGESGYFKLERNIKESS-GKCGVAMMASYPTK 346
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 147/345 (42%), Positives = 201/345 (58%), Gaps = 43/345 (12%)
Query: 26 RNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
+ +L+LFL +GI S+ P+K ++ ER ENW+ +Y + Y E ++RF I
Sbjct: 7 KQHMLALFLFLAVGI-----SQVMPRKLHQTALRERHENWMAEYGKMYKDAAEKEKRFQI 61
Query: 86 YSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPY---------NEPRWP 135
+ NV++I+ N+ N +KL N ADL+ EEF + G + Y N ++
Sbjct: 62 FKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYE 121
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQG-QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+V +P ++DWR +GAVTP+KDQG QCG WAFS +AA EGI+++ TG LVSLSEQEL
Sbjct: 122 NVT--DIPEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLSEQEL 179
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDCD S + GC GG+ME FEFI K GG+T+E +YPY+G + C T I GY
Sbjct: 180 VDCD--SVDDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIKGY 237
Query: 255 EAIPA----------------------RYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH 292
E +P+ F YS G+++ CG L+HGVT VGYG ++
Sbjct: 238 EIVPSYSEEALKKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGTEN 297
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G YW+VKNSWGT WGE GYIRM R + + GICGI + +SYP
Sbjct: 298 GTDYWIVKNSWGTQWGEKGYIRMHRGIAAKH-GICGIALDSSYPT 341
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 273 bits (699), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 151/317 (47%), Positives = 186/317 (58%), Gaps = 39/317 (12%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S + +E W + SR G + +RF ++ +NV ++ N + +KL NKFAD+
Sbjct: 34 ESFWDLYERWRSHHTVSRSLGDK---HKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADM 90
Query: 114 SNEEFISTYLGYNKPYNE-----PRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCG 164
+N EF STY G ++ PR + +P SVDWRK GAVT VKDQGQCG
Sbjct: 91 TNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSVDWRKNGAVTGVKDQGQCG 150
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS V AVEGIN++KT KLVSLSEQELVDCD +N GCNGG ME AFEFI + GG+
Sbjct: 151 SCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDT-KKNAGCNGGLMESAFEFIKQKGGI 209
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE +YPY ++ C K AV+I G+E +PA
Sbjct: 210 TTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAGGSD 269
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF C +LNHGV +VGYG G YW V+NSWG WGE GYIRM R S S
Sbjct: 270 FQFYSEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQR-SIS 328
Query: 322 SNIGICGILMQASYPVK 338
G+CGI M ASYP+K
Sbjct: 329 KKEGLCGIAMMASYPIK 345
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 273 bits (698), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 197/340 (57%), Gaps = 37/340 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L I + A++ + ++ +E+WL +Y + Y S EW+RRF I+
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN--------EPRWPSVQYL 140
+++ID N+ N S+K+ N+FADL++EEF STYLG+ N EPR+ V
Sbjct: 70 LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPRFGQV--- 126
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP+ VDWR GAV +K QG+CG CWAFSA+A VEGINK+ TG L+SLSEQEL+DC
Sbjct: 127 -LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-- 258
+GCNGGY+ F+FI GG+ TE++YPY ++ C D VTI YE +P
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYN 245
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
A AF+ YS G+F CG ++H VT+VGYG + G YW+
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSW T+WGE GY+R+ RN + G CGI SYPVK
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA--GTCGIATMPSYPVK 343
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 145/337 (43%), Positives = 202/337 (59%), Gaps = 36/337 (10%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
FLL +LG + S ++ +M ER ENW+ +Y R Y E RRF ++ NV +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVFKDNVAF 66
Query: 93 IDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-------GLPA 144
++ N+ +N F L N+FADL+ EEF + G+ KP + + P+ + LP
Sbjct: 67 VESFNTNKNNKFWLGINQFADLTIEEFKANK-GF-KPISAEKVPTTGFKYENLSVSALPT 124
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+VDWR +GAVTP+K+QGQCG CWAFSAVAA+EGI KL TG L+SLSEQELVDCD +S ++
Sbjct: 125 AVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDE 184
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------ 258
GC GG+M+ AFEF+ K GG+ T YPY+ + +C+ A TI G+E +P
Sbjct: 185 GCEGGWMDSAFEFVIKNGGLATVSSYPYKAVDGKCKGG--SKSAATIKGHEDVPVNDEAA 242
Query: 259 ----------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
+ F LYS GV CG +L+HG+ +GYG E G KYW++KN
Sbjct: 243 LMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYWILKN 302
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
SWGT+WGE G++RM ++ S G+CG+ M+ SYP +
Sbjct: 303 SWGTTWGEKGFLRMEKD-ISDKQGMCGLAMKPSYPTE 338
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 151/335 (45%), Positives = 196/335 (58%), Gaps = 28/335 (8%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L I + A + + + +E+WL + + Y S DE + RF I+ N
Sbjct: 10 MSLLFFSTLLILSSALDIVNSAQRTNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFKDN 69
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNK-PYNEPRWPSVQYLG--LPAS 145
++ ID N+ N SF L N+FADL++EE+ STYLG+ P + V +G LP
Sbjct: 70 LRIIDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFKSGPKAKVSNRYVPKVGDVLPNY 129
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
VDWR GAV VK+QG C SCWAFSAVAAVEGINK+ TG L+SLSEQELVDC +G
Sbjct: 130 VDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSEQELVDCGRTQSTRG 189
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY---- 261
CN GYM AF+FI GG+ TED+YPY ++ +C VTI YE +P+
Sbjct: 190 CNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCNRYLQNQKYVTIDDYENVPSNNEWAL 249
Query: 262 ------------------AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSW 303
F+LY+ G+F +YCG ++HGVT+VGYG + G YW+VKNSW
Sbjct: 250 QNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHGVTIVGYGTERGLDYWIVKNSW 309
Query: 304 GTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GT+WGE GYIR+ RN + G CGI ASYPVK
Sbjct: 310 GTNWGENGYIRIQRNIGGA--GKCGIARMASYPVK 342
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 196/340 (57%), Gaps = 37/340 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L I + A++ + ++ +E+WL +Y + Y S EW+RRF I+
Sbjct: 10 MSLLFFSTLLILSLAFNTKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN--------EPRWPSVQYL 140
+++ID N+ N S+K+ N+FADL++EEF STYLG+ N EPR V
Sbjct: 70 LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPRVGQV--- 126
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP+ VDWR GAV +K QG+CG CWAFSA+A VEGINK+ TG L+SLSEQEL+DC
Sbjct: 127 -LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-- 258
+GCNGGY+ F+FI GG+ TE++YPY ++ C D VTI YE +P
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYN 245
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
A AF+ YS G+F CG ++H VT+VGYG + G YW+
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSW T+WGE GY+R+ RN + G CGI SYPVK
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA--GTCGIATMPSYPVK 343
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 190/308 (61%), Gaps = 34/308 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE+W+ ++ + YGS E +RR I+ N+++I+ N++NLS++L FADLS E+
Sbjct: 49 FESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEV 108
Query: 122 YLGYNK--PYN------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
G + P N R+ + LP SVDWR EGAVT VKDQG C SCWAFS V
Sbjct: 109 CHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVG 168
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEG+NK+ TG+LV+LSEQ+L++C N EN GC GG +E A+EFI K GG+ T++DYPY+
Sbjct: 169 AVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYK 226
Query: 234 GKNDRCQTD-KTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
N C K + V I GYE +PA FQLY GV
Sbjct: 227 AVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGV 286
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
FD CG LNHGV VVGYG ++G YWLVKNS G +WGEAGY++MARN + G+CGI
Sbjct: 287 FDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPR-GLCGIA 345
Query: 331 MQASYPVK 338
M+ASYP+K
Sbjct: 346 MRASYPLK 353
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 144/326 (44%), Positives = 201/326 (61%), Gaps = 41/326 (12%)
Query: 51 QKYDPQSMEERFENWLKQYSREYGS----EDEWQRRFGIYSSNVQYIDYINSQNLS--FK 104
++ +P+ + ++ WL ++ R Y + E E RRF ++ N++++D N + + F+
Sbjct: 47 ERTEPE-VRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGFR 105
Query: 105 LTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV-----QYLG----LPASVDWRKEGAVT 155
L N+FADL+N+EF + YLG P R +V ++ G LP SVDWR++GAV
Sbjct: 106 LGMNQFADLTNDEFRAAYLGAMVP--AARRGAVVGERYRHDGAAEELPESVDWREKGAVA 163
Query: 156 PVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAF 215
PVK+QGQCGSCWAFSAV++VE +N++ TG++V+LSEQELV+C + N GCNGG M+ AF
Sbjct: 164 PVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAF 223
Query: 216 EFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR--------------- 260
+FI K GG+ TEDDYPYR + +C ++ V+I G+E +P
Sbjct: 224 DFIIKNGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPVS 283
Query: 261 -------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
FQLY GVF C L+HGV VGYG ++G+ YW+V+NSWG WGEAGYI
Sbjct: 284 VAIEAGGREFQLYKSGVFSGSCTTNLDHGVVAVGYGAENGKDYWIVRNSWGPKWGEAGYI 343
Query: 314 RMARNSPSSNIGICGILMQASYPVKR 339
RM RN +S G CGI M ASYP K+
Sbjct: 344 RMERNVNAS-TGKCGIAMMASYPTKK 368
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 157/344 (45%), Positives = 198/344 (57%), Gaps = 42/344 (12%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQY--SREYGSEDEWQRRFGIY 86
VLSL L VLG+ A ++ +S+ + +E W + SR G + +RF ++
Sbjct: 10 VLSLSL--VLGV-ANSFDFHDKDLESEESLWDLYERWRSHHTVSRSLGDK---HKRFNVF 63
Query: 87 SSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG---------YNKPYNEPRWPSV 137
+N+ ++ N + +KL NKFAD++N EF STY G + P +
Sbjct: 64 KANMMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRDMPRGNGTFMYE 123
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+ +PASVDWRK+GAVT VKDQG CGSCWAFS V AVEGIN++KT KLVSLSEQELVDC
Sbjct: 124 KVGSVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDC 183
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
D EN GCNGG ME AF+FI + GG+TTE YPY ++ C K AV+I G+E +
Sbjct: 184 DT-EENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKANDLAVSIDGHENV 242
Query: 258 PAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGE 294
P FQ YS GVF C +LNHGV +VGYG G
Sbjct: 243 PGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGATVDGT 302
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YW+V+NSWG WGE GYIRM RN S G+CGI M ASYP+K
Sbjct: 303 SYWIVRNSWGPEWGELGYIRMQRN-ISKKEGLCGIAMLASYPIK 345
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 147/308 (47%), Positives = 190/308 (61%), Gaps = 34/308 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE+W+ ++ + YGS E +RR I+ N+++I+ N++NLS++L FADLS E+
Sbjct: 42 FESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEV 101
Query: 122 YLGYNK--PYN------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
G + P N R+ + LP SVDWR EGAVT VKDQG C SCWAFS V
Sbjct: 102 CHGADPRPPRNHVFMTSSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVG 161
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEG+NK+ TG+LV+LSEQ+L++C N EN GC GG +E A+EFI K GG+ T++DYPY+
Sbjct: 162 AVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYK 219
Query: 234 GKNDRCQTD-KTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
N C K + V I GYE +PA FQLY GV
Sbjct: 220 AVNGVCDGRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGV 279
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
FD CG LNHGV VVGYG ++G YWLVKNS G +WGEAGY++MARN + G+CGI
Sbjct: 280 FDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGITWGEAGYMKMARNIANPR-GLCGIA 338
Query: 331 MQASYPVK 338
M+ASYP+K
Sbjct: 339 MRASYPLK 346
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 196/316 (62%), Gaps = 39/316 (12%)
Query: 62 FENWLKQY----SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS----FKLTDNKFADL 113
++ WL ++ S S + +RRF + N++++D N++ + F+L N+FADL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 114 SNEEFISTYLGYNKPYNEPRWPSV-----QYLG---LPASVDWRKEGAVTPVKDQGQCGS 165
+N+EF + YLG R V ++ G LP +VDWR++GAV PVK+QGQCGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFSAV+ VE IN++ TG++V+LSEQELV+CD+N ++ GCNGG M+ AFEFI K GG+
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAF 263
TEDDYPY+ + RC + V+I G+E +P F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
QLY GVF CG QL+HGV VGYG ++G+ YW+V+NSWG +WGEAGY+RM RN ++
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWIVRNSWGPNWGEAGYLRMERNINVTS 351
Query: 324 IGICGILMQASYPVKR 339
G CGI M +SYP K+
Sbjct: 352 -GKCGIAMMSSYPTKK 366
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 196/316 (62%), Gaps = 39/316 (12%)
Query: 62 FENWLKQY----SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS----FKLTDNKFADL 113
++ WL ++ S S + +RRF + N++++D N++ + F+L N+FADL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 114 SNEEFISTYLGYNKPYNEPRWPSV-----QYLG---LPASVDWRKEGAVTPVKDQGQCGS 165
+N+EF + YLG R V ++ G LP +VDWR++GAV PVK+QGQCGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFSAV+ VE IN++ TG++V+LSEQELV+CD+N ++ GCNGG M+ AFEFI K GG+
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAF 263
TEDDYPY+ + RC + V+I G+E +P F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
QLY GVF CG QL+HGV VGYG ++G+ YW+V+NSWG +WGEAGY+RM RN ++
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWIVRNSWGPNWGEAGYLRMERNINVTS 351
Query: 324 IGICGILMQASYPVKR 339
G CGI M +SYP K+
Sbjct: 352 -GKCGIAMMSSYPTKK 366
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 147/318 (46%), Positives = 190/318 (59%), Gaps = 34/318 (10%)
Query: 51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNK 109
+K SM ER E W+++Y + Y E ++RF I+ +NV++I+ N+ N +KL+ N
Sbjct: 27 RKLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNKPYKLSINH 86
Query: 110 FADLSNEEFISTYLGYNKPYNEPRWPSVQ-------YLGLPASVDWRKEGAVTPVKDQGQ 162
AD +NEEF++++ GY + + + Q +P +VDWR++G T +KDQGQ
Sbjct: 87 LADQTNEEFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPWAVDWRQKGDATSIKDQGQ 146
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CG CWAFSAVAA EGI ++ TG LVSLSEQELVDCD S + GC+GG ME FEFI K G
Sbjct: 147 CGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCD--SVDHGCDGGLMEHGFEFIIKNG 204
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------- 260
G+++E +YPY N C T+K I GYE +P
Sbjct: 205 GISSEANYPYTAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVANQPVSVSIDAGG 264
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
AFQ YS GVF CG QL+HGVT VGYG D G +YW+VKNSWGT WGE GYIRM R
Sbjct: 265 SAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDDGIQYWIVKNSWGTQWGEEGYIRMLRGI 324
Query: 320 PSSNIGICGILMQASYPV 337
+ G+CGI M ASYP
Sbjct: 325 DAQE-GLCGIAMDASYPT 341
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 189/314 (60%), Gaps = 38/314 (12%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFA 111
D +M++R W+ ++ R Y +E R+ ++ NV+ I+ +N L+FKL N+FA
Sbjct: 24 DEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFA 83
Query: 112 DLSNEEFISTYLGY---------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
DL+NEEF S Y G+ KP R+ +V LP SVDWRK+GAVTP+KDQG
Sbjct: 84 DLTNEEFRSMYTGFKGNSVLSSRTKP-TSFRYQNVSSDALPVSVDWRKKGAVTPIKDQGL 142
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFSAVAA+EG+ ++K GKL+SLSEQELVDCD N + GC GG M+ AF + IG
Sbjct: 143 CGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN--DGGCMGGLMDTAFNYTITIG 200
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------- 260
G+T+E +YPY+ N C +KTK A +I G+E +PA
Sbjct: 201 GLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGD 260
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQ YS GVF C L+HGVT VGYG +G KYW++KNSWG WGE GY+R+ ++
Sbjct: 261 IGFQFYSSGVFSGECTTHLDHGVTAVGYGRSKNGLKYWILKNSWGPKWGERGYMRIKKDI 320
Query: 320 PSSNIGICGILMQA 333
+ G CG+ M A
Sbjct: 321 KPKH-GQCGLAMNA 333
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 148/319 (46%), Positives = 188/319 (58%), Gaps = 35/319 (10%)
Query: 53 YDPQSME------ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLT 106
Y P+ + RFE+W+ ++ + Y S +E RF ++ N+ +ID N + S+ L
Sbjct: 389 YSPEDLTCIDKLIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLG 448
Query: 107 DNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQG 161
N+FADLS+EEF S YLG + R S ++ LP SVDWRK+GAVT VK+QG
Sbjct: 449 LNEFADLSHEEFKSKYLGLRAEFPRSRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQG 508
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
CGSCWAFS VAAVEGIN++ TG L +LSEQEL+DCD + N GCNGG M+ AF FI
Sbjct: 509 ACGSCWAFSTVAAVEGINQIVTGNLTTLSEQELIDCD-TTFNSGCNGGLMDYAFAFIASN 567
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR--------------------- 260
GG+ EDDYPY + C+ K VTI+GYE +P +
Sbjct: 568 GGLHKEDDYPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEAS 627
Query: 261 -YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQ YS GVF+ CG +L+HGV VGYG G Y +VKNSWG WGE GYIRM RN+
Sbjct: 628 GRDFQFYSGGVFNGPCGTELDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYIRMKRNT 687
Query: 320 PSSNIGICGILMQASYPVK 338
+ G+CGI ASYP K
Sbjct: 688 GKTE-GLCGINKMASYPTK 705
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 191/308 (62%), Gaps = 34/308 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE+W+ ++ + Y S E +RR I+ N+++I N++NLS++L N+FADLS E+
Sbjct: 56 FESWMVKHGKVYESVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYAQI 115
Query: 122 YLGYNK--PYN------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
G + P N R+ + LP SVDWR EGAVT VKDQGQC SCWAFS V
Sbjct: 116 CHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVG 175
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEG+NK+ TG+LV+LSEQ+L++C N EN GC GG +E A+EFI GG+ T++DYPY+
Sbjct: 176 AVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYK 233
Query: 234 GKNDRCQTD-KTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
N C K + V I GYE +PA FQLY+ GV
Sbjct: 234 ALNGVCNDRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGV 293
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
FD CG LNHGV VVGYG ++G YW+V+NS G +WGEAGY++MARN + G+CGI
Sbjct: 294 FDGTCGTNLNHGVVVVGYGTENGRDYWIVRNSRGNTWGEAGYMKMARNIANPR-GLCGIA 352
Query: 331 MQASYPVK 338
M+ASYP+K
Sbjct: 353 MRASYPLK 360
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 147/348 (42%), Positives = 205/348 (58%), Gaps = 32/348 (9%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER-FENWLKQYSREYGSEDEW 79
M +++ L+L + VL I S + ++ R +E WL + + Y E
Sbjct: 1 MATSIKSITLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEK 60
Query: 80 QRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ 138
+RRF I+ N+++++ +S N ++++ +FADL+N+EF + YL +
Sbjct: 61 ERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYLRSKMERTRVPVKGEK 120
Query: 139 YL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
YL LP ++DWR +GAV PVKDQG CGSCWAFSA+ AVEGIN++KTG+L+SLSEQE
Sbjct: 121 YLYKVGDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQE 180
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN-DRCQTDKTKHHAVTIT 252
LVDCD S N GC GG M+ AF+FI + GG+ TE+DYPY + + C +DK VTI
Sbjct: 181 LVDCDT-SYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTID 239
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
GYE +P AFQLY+ GVF CG L+HGV VGYG
Sbjct: 240 GYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGS 299
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+ G+ YW+V+NSWG++WGE+GY ++ RN S+ G CG+ M ASYP K
Sbjct: 300 EGGQDYWIVRNSWGSNWGESGYFKLERNIKESS-GKCGVAMMASYPTK 346
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 155/311 (49%), Positives = 193/311 (62%), Gaps = 30/311 (9%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFAD 112
DP M E E W+ Q+ + Y + E Q+RFGI+ NV YI+ N+ N S+KL N FAD
Sbjct: 33 DP--MYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNYIEAFNNVGNKSYKLGLNHFAD 90
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQYLGL---PASVDWRKEGAVTPVKDQGQCGSCWAF 169
L+N EFI+ +N + + +Y + P++VDWR+EGAVTPVK+QGQCG CWAF
Sbjct: 91 LTNHEFIAARNKFNGYLHGSIITTFKYKNVSDVPSAVDWRQEGAVTPVKNQGQCGCCWAF 150
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
SAVA+ EGI+KL TG LVSLSEQELVDCD N E+QGC GG M+ AFEFI + G++TE +
Sbjct: 151 SAVASTEGIHKLTTGNLVSLSEQELVDCDTNGEDQGCEGGLMDDAFEFIIQNNGLSTEAE 210
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYS 267
YPY+G + C + A TI+GYE +P FQ Y
Sbjct: 211 YPYQGVDGTCNKTEVGSSAATISGYENVPVNDEQALQKAVANQPVSVAIDASGSDFQFYK 270
Query: 268 HGVFDEYCGHQLNHGVTVVGYGEDHGE-KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
GVF CG +L+HGV VVGYG E +YWLVKNSWGT WGE GYIRM R +S G+
Sbjct: 271 SGVFTGSCGTELDHGVAVVGYGVGEDETEYWLVKNSWGTQWGEEGYIRMQRGVDASE-GL 329
Query: 327 CGILMQASYPV 337
CGI MQ SYP
Sbjct: 330 CGIAMQPSYPT 340
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 139/300 (46%), Positives = 184/300 (61%), Gaps = 38/300 (12%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
Q+M R E W+ +Y R Y E RRF ++ +N+ I+ +N+ N F L N+FADL++
Sbjct: 35 QAMVARHEEWMAKYDRVYSDAAEKARRFEVFKANMALIESVNAGNHKFWLEANRFADLTD 94
Query: 116 EEFISTYLGYNKPYNEP--------------RWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
+EF +T+ GY +P ++ +V +PASVDWR +GAVTP+K+QG
Sbjct: 95 DEFRATWTGY-RPKTAAASSKGRSRTATTGFKYANVSLDDVPASVDWRTKGAVTPIKNQG 153
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
+CG CWAFSAVA++EG+ KL TGKLVSLSEQELVDCDVN +QGC GG M+ AF+FI
Sbjct: 154 ECGCCWAFSAVASMEGVVKLSTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGN 213
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA------RYA------------- 262
GG+TTE YPY + C +++ A +I GYE +PA R A
Sbjct: 214 GGLTTESRYPYTASDGTCNSNEASGDAASIKGYEDVPANDEASLRKAVANQPVSVAVDGG 273
Query: 263 ---FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
F+ Y GV CG +L+HG+ VGYG G KYW++KNSWGTSWGEAGYIRM R+
Sbjct: 274 DSHFRFYKGGVLSGACGTELDHGIAAVGYGVASDGTKYWVMKNSWGTSWGEAGYIRMERD 333
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 188/312 (60%), Gaps = 37/312 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNE 116
++ +E+WL ++ + Y S E +RRF I+ +++ID N+ + S+K+ N+FADL+NE
Sbjct: 34 VKAMYESWLIKHGKSYNSLGERERRFEIFKETLRFIDEHNADTSRSYKVGLNQFADLTNE 93
Query: 117 EFISTYLGYNKPYN--------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EF STYLG+ + N EPR V LP VDWR EGAV +K+QGQCGSCWA
Sbjct: 94 EFRSTYLGFTRGSNKTKVSNRYEPRVGQV----LPDYVDWRSEGAVVDIKNQGQCGSCWA 149
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSA+AAVEGINK+ TG L+SLSEQELVDC +GC+GGYM FEFI GG+ TE+
Sbjct: 150 FSAIAAVEGINKIVTGNLISLSEQELVDCGRTQSTKGCDGGYMTDGFEFIINNGGINTEE 209
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLY 266
+YPY + +C + VTI YE +P A AFQ Y
Sbjct: 210 NYPYTAQEGQCDLNLQNEKYVTIDNYENVPYYNEWALQTAVAYQPVSVALESAGDAFQHY 269
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
S G+F CG +H VT+VGYG + G YW+VKNSW T+WGE GY+R+ RN + G
Sbjct: 270 SSGIFTGPCGTATDHAVTIVGYGTEGGIDYWIVKNSWDTTWGEEGYMRILRNVGGA--GT 327
Query: 327 CGILMQASYPVK 338
CGI SYPVK
Sbjct: 328 CGIATMPSYPVK 339
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 196/340 (57%), Gaps = 37/340 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L I + A++ + ++ +E+WL +Y + Y S EW+RRF I+
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN--------EPRWPSVQYL 140
+++ID N+ N S+K+ N+FADL++EEF STYLG+ N EPR V
Sbjct: 70 LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPRVGQV--- 126
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP+ VDWR GAV +K QG+CG CWAFSA+A VEGINK+ TG L+SLSEQEL+DC
Sbjct: 127 -LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-- 258
+GCNGGY+ F+FI GG+ TE++YPY ++ C D VTI YE +P
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTIDTYENVPYN 245
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
A AF+ YS G+F CG ++H VT+VGYG + G YW+
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSW T+WGE GY+R+ RN + G CGI SYPVK
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA--GTCGIATMPSYPVK 343
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 145/337 (43%), Positives = 200/337 (59%), Gaps = 36/337 (10%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
FLL +LG + S ++ +M ER ENW+ +Y R Y E RRF + NV +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAF 66
Query: 93 IDYINSQNLS-FKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-------GLPA 144
++ N+ + F L N+FADL+ EEF + G+ KP + P+ + LP
Sbjct: 67 VESFNTNKKNKFWLGVNQFADLTTEEFKANK-GF-KPISAEMVPTTGFKYENLSVSALPT 124
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+VDWR +GAVTP+K+QGQCG CWAFSAVAA+EGI KL TG L+SLSEQELVDCD +S ++
Sbjct: 125 AVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDE 184
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------ 258
GC GG+M+ AFEF+ K GG+ TE YPY+ + +C+ A TI G+E +P
Sbjct: 185 GCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGG--SKSAATIKGHEDVPVNDEAA 242
Query: 259 ----------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKN 301
+ F LYS GV CG +L+HG+ +GYG E G KYW++KN
Sbjct: 243 LMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGVESDGTKYWILKN 302
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
SWGT+WGE G++RM ++ S G+CG+ M+ SYP +
Sbjct: 303 SWGTTWGEKGFLRMEKD-ISDKQGMCGLAMKPSYPTE 338
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 145/309 (46%), Positives = 189/309 (61%), Gaps = 29/309 (9%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
+ E+F W ++ + Y S +E R+ ++ N++YI + +N S+ L KFAD++N+E
Sbjct: 42 LSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEKNRSYWLGLTKFADITNDE 101
Query: 118 FISTYLG--YNKPYNEPRWPSVQYLG--LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
F Y G ++ R +Y P SVDWRK+GAVT VKDQG CGSCWAFSA+
Sbjct: 102 FRRQYTGTRIDRSKRSKRKTGFRYADSEAPESVDWRKKGAVTTVKDQGSCGSCWAFSAIG 161
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
+VEGIN ++TG+ VSLSEQELVDCD+ NQGCNGG M+ AF+FI + GG+ TE+DYPY+
Sbjct: 162 SVEGINAIRTGEAVSLSEQELVDCDLEY-NQGCNGGLMDYAFDFILENGGIDTENDYPYK 220
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
G + RC +K H VTI GYE +P FQLYS GVF
Sbjct: 221 GLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYSGGVF 280
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN--IGICGI 329
CG L+HGV VGYG + YW+VKNSWG WGE+GY+RM RN SN G+CGI
Sbjct: 281 TGECGTDLDHGVLAVGYGSEGSLDYWIVKNSWGEYWGESGYLRMQRNIKDSNHQFGLCGI 340
Query: 330 LMQASYPVK 338
++ SY VK
Sbjct: 341 NIEPSYAVK 349
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 196/316 (62%), Gaps = 39/316 (12%)
Query: 62 FENWLKQY----SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS----FKLTDNKFADL 113
++ WL ++ S S + +RRF + N++++D N++ + F+L N+FADL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 114 SNEEFISTYLGYNKPYNEPRWPSV-----QYLG---LPASVDWRKEGAVTPVKDQGQCGS 165
+N+EF + YLG R V ++ G LP +VDWR++GAV PVK+QGQCGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFSAV+ VE IN++ TG++V+LSEQELV+CD+N ++ GCNGG M+ AFEFI K GG+
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAF 263
TEDDYPY+ + RC + V+I G+E +P F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
QLY GVF CG QL+HGV VGYG ++G+ YW+V+NSWG +WGEAGY+RM RN ++
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGTENGKDYWIVRNSWGPNWGEAGYLRMERNINVTS 351
Query: 324 IGICGILMQASYPVKR 339
G CGI M +SYP K+
Sbjct: 352 -GKCGIAMMSSYPTKK 366
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 144/307 (46%), Positives = 184/307 (59%), Gaps = 32/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E WL ++ + Y E RF I+ N+++ID N+QN S+K+ NKFAD++NEE+
Sbjct: 4 YEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQNYSYKVGLNKFADINNEEYRDM 63
Query: 122 YLGYNKPYNEPRWPSVQYLG---------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
YLG K + R + G + VDWR +GAVT +KDQG CGSCWAFS +
Sbjct: 64 YLG-TKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFSTI 122
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
A VE INK+ TGK VSLSEQELVDCD + N+GCNGG M+ AFEFI + GG+ T+ DYPY
Sbjct: 123 ATVEAINKIVTGKFVSLSEQELVDCD-RAFNEGCNGGLMDYAFEFIIRNGGIDTDQDYPY 181
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPARY---------------------AFQLYSHGVF 271
G +C K V+I GYE +P+ A QLY GVF
Sbjct: 182 NGFERKCDPTKKNAKVVSIDGYEDVPSYMNALKKAVAHQPVSVAIAGLGRALQLYQSGVF 241
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG L+HGV VVGYG ++G YWLV+NSWGT+WGE GY ++A + S CGI M
Sbjct: 242 TGKCGTDLDHGVVVVGYGSENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKCGIAM 301
Query: 332 QASYPVK 338
+ASYPVK
Sbjct: 302 EASYPVK 308
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 148/310 (47%), Positives = 186/310 (60%), Gaps = 35/310 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E W ++ S E Q+RF ++ N ++ N + +KL NKFAD++N EF +T
Sbjct: 38 YERWRSHHTVSR-SLHEKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRNT 96
Query: 122 YLGYNKPYNE-----PRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
Y G ++ PR + +PASVDWRK+GAVT VKDQGQCGSCWAFS +
Sbjct: 97 YSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFSTI 156
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AVEGIN++KT KLVSLSEQELVDCD + +NQGCNGG M+ AFEFI + GG+TTE +YPY
Sbjct: 157 VAVEGINQIKTNKLVSLSEQELVDCDTD-QNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
+ C K AV+I G+E +P FQ YS GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275
Query: 271 FDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
F CG +L+HGV +VGYG G KYW VKNSWG WGE GYIRM R S G+CGI
Sbjct: 276 FTGSCGTELDHGVAIVGYGTTIDGTKYWTVKNSWGPEWGEKGYIRMER-GISDKEGLCGI 334
Query: 330 LMQASYPVKR 339
M+ASYP+K+
Sbjct: 335 AMEASYPIKK 344
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 144/336 (42%), Positives = 202/336 (60%), Gaps = 35/336 (10%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
FLL +LG + S ++ +M ER ENW+ +Y R Y E RRF + NV +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAF 66
Query: 93 IDYINSQNLS-FKLTDNKFADLSNEEFISTYLGYNKPYNEP------RWPSVQYLGLPAS 145
++ N+ + F L N+FADL+ EEF + G+ KP E ++ ++ LP +
Sbjct: 67 VESFNTNKKNKFWLGVNQFADLTTEEFKANK-GF-KPTAEKVPTTGFKYENLSVSALPTA 124
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
VDWR +GAVTP+K+QGQCG CWAFSAVAA+EGI KL TG L+SLSEQELVDCD +S ++G
Sbjct: 125 VDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEG 184
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------- 258
C GG+M+ AFEF+ K GG+ TE +YPY+ + +C+ A TI G+E +P
Sbjct: 185 CEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGG--SKSAATIKGHEDVPVNNEAAL 242
Query: 259 ---------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNS 302
+ F LYS GV CG +L+HG+ +GYG E G KYW++KNS
Sbjct: 243 MKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILKNS 302
Query: 303 WGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
WGT+WGE G++RM ++ + G+CG+ M+ SYP +
Sbjct: 303 WGTTWGEKGFLRMEKD-ITDKRGMCGLAMKPSYPTE 337
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 205/340 (60%), Gaps = 35/340 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
L+LF + LG+ + + P Y+ +M R + W+ + + Y +E + RF I+ N
Sbjct: 12 LALFFI-CLGLWSSQVALSRPINYEA-TMRARHDQWIVHHEKVYKDLNEKEVRFQIFKEN 69
Query: 90 VQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPS--------VQYL 140
V+ I+ N+ ++ +KL NKF+DL+NEEF + GY + + + S
Sbjct: 70 VERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHFRYTNVT 129
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
+P ++DWRK+GAVTP+KDQ +CG CWAFSAVAA+EG+++LKTG+L+ LSEQELVDCDV
Sbjct: 130 DIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQELVDCDVE 189
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR 260
E++GC+GG ++ AF+FI K G+TTE +YPY+G++ C K+ A ITGYE +PA
Sbjct: 190 GEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYEDVPAN 249
Query: 261 ----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYW 297
+ FQ YS GVF C LNH VT VGYG G KYW
Sbjct: 250 SEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDGTKYW 309
Query: 298 LVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
++KNSWG+ WG++GY+R+ R+ G+CG+ M ASYP
Sbjct: 310 IIKNSWGSKWGDSGYMRIKRDVHEKE-GLCGLAMDASYPT 348
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 139/301 (46%), Positives = 182/301 (60%), Gaps = 29/301 (9%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNE 116
+M E+ E W+ +++R Y E +RF + +NV +I+ N+ N F L N+F DL+N+
Sbjct: 32 AMVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAFIESFNTGNHKFWLGVNQFTDLTND 91
Query: 117 EFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
EF +T N R P+ V LPA+VDWR +G VTP+KDQGQCG CWAFS
Sbjct: 92 EFRATKTNKGLKRNGARAPTRFKYNNVSTDALPAAVDWRTKGVVTPIKDQGQCGCCWAFS 151
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
AVAA EGI KL TGKLVSLSEQELVDCDV+ +QGC GG M+ AF+FI K GG+TTE +Y
Sbjct: 152 AVAATEGIVKLSTGKLVSLSEQELVDCDVHGVDQGCEGGEMDNAFKFIIKNGGLTTEANY 211
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSH 268
PY ++ +C+T T + TI GYE +PA FQ YS
Sbjct: 212 PYTAQDGQCKTSTTSNSVATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDVIFQHYSG 271
Query: 269 GVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GV CG L+HG+ +GYG G K+WL+KNSWGT+WGE+GY+RM ++ + I
Sbjct: 272 GVMTGSCGTDLDHGIVAIGYGMTSDGTKFWLLKNSWGTTWGESGYLRMEKDISDKSGTII 331
Query: 328 G 328
G
Sbjct: 332 G 332
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 161/363 (44%), Positives = 198/363 (54%), Gaps = 84/363 (23%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFAD 112
DP M ERFE W+ ++ R Y E QRR +Y NV ++ NS N ++L DNKFAD
Sbjct: 26 DP--MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFAD 83
Query: 113 LSNEEFISTYLGYNKPYNEPRWP-------SVQYLG----------LPASVDWRKEGAVT 155
L+NEEF + LG+ +P R +V +G LP SVDWR++GAV
Sbjct: 84 LTNEEFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSDELPKSVDWREKGAVA 143
Query: 156 PVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAF 215
PVK+QG+CGSCWAFSAVAA+EGIN++K GKLVSLSEQELVDCD + GC GGYM AF
Sbjct: 144 PVKNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKA--IGCAGGYMSWAF 201
Query: 216 EFITKIGGVTTEDDYPYRGK----------------------------NDRCQTDKTKHH 247
EF+ G+TTE +YPY+G N CQT K K
Sbjct: 202 EFVMNNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKES 261
Query: 248 AVTITGYEAIPA----------------------RYAFQLYSHGVFDEYCGHQLNHGVTV 285
AV+I+GY + A + +QLY GVF C LNHGVTV
Sbjct: 262 AVSISGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTV 321
Query: 286 VGYGEDH-----------GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQAS 334
VGYGE G+KYW+VKNSWG WG+AGYI M R + S G+CGI + S
Sbjct: 322 VGYGETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREA-SVASGLCGIALLPS 380
Query: 335 YPV 337
YPV
Sbjct: 381 YPV 383
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 150/315 (47%), Positives = 189/315 (60%), Gaps = 28/315 (8%)
Query: 51 QKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
Q + ++ + F WL+ +SR Y S E RF I+ N YI N Q S+ L NKF
Sbjct: 38 QLHSDDAILDVFHQWLETHSRVYRSLSEKHHRFQIFKENFLYIHAHNKQQKSYWLGLNKF 97
Query: 111 ADLSNEEFISTYLGYNKPYNEPRWPS-VQYLGLPA--SVDWRKEGAVTPVKDQGQCGSCW 167
+DL+++EF + YLG KP N R + Y + A VDWR +GAVT VKDQG CGSCW
Sbjct: 98 SDLTHQEFRAQYLG-TKPVNRQRKEANFMYEDVEAEPKVDWRLKGAVTDVKDQGACGSCW 156
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAV +VEG+N +KTG+LVSLSEQELVDCD +NQGCNGG M+ AFEFI K GG+ TE
Sbjct: 157 AFSAVGSVEGVNAIKTGELVSLSEQELVDCD-RKQNQGCNGGLMDYAFEFIIKNGGIDTE 215
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQL 265
DYPY+ ++ RC + V I Y+ +P + FQ
Sbjct: 216 KDYPYKARDGRCDEGRRNSKVVVIDDYQDVPTQSESALMKALTKNPVSVAIEAGGRDFQH 275
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GVF CG +L+HGV VGYG +D G YW+VKNSWG WGE GYIRM R S
Sbjct: 276 YQGGVFTGPCGSELDHGVLAVGYGTDDDGVNYWIVKNSWGPGWGEKGYIRMERFGSDSTD 335
Query: 325 GICGILMQASYPVKR 339
G CGI ++AS+P+K+
Sbjct: 336 GKCGINIEASFPIKK 350
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 138/291 (47%), Positives = 183/291 (62%), Gaps = 32/291 (10%)
Query: 78 EWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRW 134
E +RRF ++ N++++D N+ + F+L N+FADL+NEEF +T+LG K R
Sbjct: 70 EHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFRATFLGA-KVAERSRA 128
Query: 135 PSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+Y LP SVDWR++GAV PVK+QGQCGSCWAFSAV+ VE IN+L TG++++L
Sbjct: 129 AGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITL 188
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQELV+C N +N GCNGG M+ AF+FI K GG+ TEDDYPY+ + +C ++ V
Sbjct: 189 SEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVV 248
Query: 250 TITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVG 287
+I G+E +P FQLY GVF CG L+HGV VG
Sbjct: 249 SIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVG 308
Query: 288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YG D+G+ YW+V+NSWG WGE+GY+RM RN + G CGI M ASYP K
Sbjct: 309 YGTDNGKDYWIVRNSWGPKWGESGYVRMERNI-NVTTGKCGIAMMASYPTK 358
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 145/311 (46%), Positives = 185/311 (59%), Gaps = 35/311 (11%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
E +E W ++ S DE +RF ++ +NV Y+ N ++ +KL NKFAD++N EF
Sbjct: 36 ELYERWRSHHTVSR-SLDEKDKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFR 94
Query: 120 STYLGYNKPYNEPRWPSVQYLG---------LPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
Y G ++ + + G +P +VDWRK+GAVTPVKDQG+CGSCWAFS
Sbjct: 95 HHYAGSKIKHHRTFLGASRANGTFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGSCWAFS 154
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
V AVEGIN++KT +LVSLSEQELVDCD S+NQGCNGG M+ AFEFI K GG+ TE++Y
Sbjct: 155 TVVAVEGINQIKTNELVSLSEQELVDCDT-SQNQGCNGGLMDMAFEFIKKKGGINTEENY 213
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSH 268
PY + C K V+I G+E +P FQ YS
Sbjct: 214 PYMAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQFYSE 273
Query: 269 GVFDEYCGHQLNHGVTVVGYGEDHGE-KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVF CG +L+HGV +VGYG KYW+VKNSWG WGE GYIRM R + G+C
Sbjct: 274 GVFTGDCGTELDHGVAIVGYGTTLDRTKYWIVKNSWGPEWGEKGYIRMQREIDAEE-GLC 332
Query: 328 GILMQASYPVK 338
GI MQ SYP+K
Sbjct: 333 GIAMQPSYPIK 343
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 148/307 (48%), Positives = 185/307 (60%), Gaps = 32/307 (10%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
E FE+W+ ++S+ Y S +E RF I+ N+++ID N + S+ L N+FADLS+EEF
Sbjct: 45 ELFESWMSKHSKTYRSIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFK 104
Query: 120 STYLGYNKPYNEPRWPSVQYLG------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
S YLG + PR S + LP SVDWR +GAVTPVK+QG CGSCWAFS VA
Sbjct: 105 SKYLGLRVEF--PRKRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVA 162
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEGIN++ TG L SLSEQEL+DCD S N GC GG M+ AF++I G+ E+DYPY
Sbjct: 163 AVEGINQIVTGNLTSLSEQELIDCD-RSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYL 221
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
+ RC +K + VTI+GYE +PA FQ Y G+F
Sbjct: 222 MEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIF 281
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG Q++HGVT VGYG G Y +VKNSWG WGE GYIRM RN+ G+CGI
Sbjct: 282 TGRCGTQMDHGVTAVGYGSSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPE-GLCGINQ 340
Query: 332 QASYPVK 338
ASYP K
Sbjct: 341 MASYPTK 347
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 154/339 (45%), Positives = 198/339 (58%), Gaps = 39/339 (11%)
Query: 36 WVLGIPAGAWSEGYPQK--YDPQSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQ 91
WVL A ++ G+ + +S+ ++NW Q+ SR SE E RF I+ NV+
Sbjct: 18 WVLSASASDFTPGFTDEDLESEKSLRSLYDNWALQHRSSRSLDSE-EHAERFEIFKENVK 76
Query: 92 YIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYN---KPYNEPRWPSVQYLG---LPAS 145
YID +N ++ +KL NKFADLSNEEF + Y+G + E + S Y LPAS
Sbjct: 77 YIDSVNKKDSPYKLGLNKFADLSNEEFKAIYMGTKMDLRGDREVQSGSFMYQNSEPLPAS 136
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
+DWR++GAV VK+QG CGSCWAFS VA+VEGIN + TG LVSLSEQ+LVDC ++EN G
Sbjct: 137 IDWRQKGAVAAVKNQGHCGSCWAFSTVASVEGINYITTGNLVSLSEQQLVDC--STENSG 194
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA--VTITGYEAIPAR--- 260
CNGG M+ AF++I GG+ TED+YPY + C + K V I G+E +PA
Sbjct: 195 CNGGLMDTAFQYIINNGGIVTEDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQ 254
Query: 261 -------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVK 300
FQ YS GVF CG L+HGV VGYG G YW+V+
Sbjct: 255 ALKEAVAHQPVSVAIEASGQDFQFYSTGVFTGKCGTALDHGVVAVGYGTSPEGINYWIVR 314
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
NSWG WGE GYIRM + ++ G CGI MQASYP K+
Sbjct: 315 NSWGPKWGEEGYIRMQQGIEAAE-GKCGIAMQASYPTKK 352
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 136/294 (46%), Positives = 185/294 (62%), Gaps = 33/294 (11%)
Query: 78 EWQRRFGIYSSNVQYIDYINSQNLS----FKLTDNKFADLSNEEFISTYLGYNKPYNEP- 132
E +RRF + N++++D N++ + F+L N+FADL+N+EF + YLG P
Sbjct: 70 ERERRFRAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGQRARPG 129
Query: 133 -----RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
R+ LP +VDWR++GAV PVK+QGQCGSCWAFSA++ VE IN++ TG++V
Sbjct: 130 RVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAISTVESINQIVTGEMV 189
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
+LSEQELV+CD N ++ GCNGG M+ AFEFI K GG+ TEDDYPY+ + RC +
Sbjct: 190 TLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAIDGRCDVLRKNAK 249
Query: 248 AVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTV 285
V+I G+E +P FQLY GVF CG QL+HGV
Sbjct: 250 VVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVA 309
Query: 286 VGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
VGYG ++G+ YW+V+NSWG +WGEAGY+RM RN ++ G CGI M +SYP K+
Sbjct: 310 VGYGTENGKDYWIVRNSWGPNWGEAGYLRMERNINVTS-GKCGIAMMSSYPTKK 362
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 148/307 (48%), Positives = 185/307 (60%), Gaps = 32/307 (10%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
E FE+W+ ++S+ Y S +E RF I+ N+++ID N + S+ L N+FADLS+EEF
Sbjct: 45 ELFESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFK 104
Query: 120 STYLGYNKPYNEPRWPSVQYLG------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
S YLG + PR S + LP SVDWR +GAVTPVK+QG CGSCWAFS VA
Sbjct: 105 SKYLGLRVEF--PRKRSSRGFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVA 162
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEGIN++ TG L SLSEQEL+DCD S N GC GG M+ AF++I G+ E+DYPY
Sbjct: 163 AVEGINQIVTGNLTSLSEQELIDCD-RSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYL 221
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
+ RC +K + VTI+GYE +PA FQ Y G+F
Sbjct: 222 MEEGRCIREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIF 281
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG Q++HGVT VGYG G Y +VKNSWG WGE GYIRM RN+ G+CGI
Sbjct: 282 TGRCGTQMDHGVTAVGYGSSEGTDYIIVKNSWGPKWGENGYIRMKRNTGKPE-GLCGINQ 340
Query: 332 QASYPVK 338
ASYP K
Sbjct: 341 MASYPTK 347
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 146/310 (47%), Positives = 187/310 (60%), Gaps = 35/310 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYID-----YINSQNLSFKLTDNKFADLSNE 116
++WL ++ + Y + E ++RF I+ N+++ID F+L NKFADL+N+
Sbjct: 5 LQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTND 64
Query: 117 EFISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
EF Y G +P S +Y LP SVDWRK+GAV+ VKDQGQCGSCWAFSA
Sbjct: 65 EFRRIYFGVKRPEKAESVKSDRYAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAFSA 124
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
+ AVEGINK+ TG L++LSEQELVDCD S N GC+GG M+ AF FI GG+ T+ DYP
Sbjct: 125 IGAVEGINKIVTGDLITLSEQELVDCDT-SYNSGCDGGLMDYAFRFIINNGGIDTDKDYP 183
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPA---------------RYA-------FQLYSHG 269
Y+ + C +++ VTI G E +PA R A FQLY G
Sbjct: 184 YKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYKSG 243
Query: 270 VFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
VF CG L+HGV VGYG D G+ YW+V+NSWG WGE GYIRM RN+ S + G CG
Sbjct: 244 VFTGSCGTSLDHGVVAVGYGTTDDGKDYWIVRNSWGDDWGEDGYIRMERNTESKS-GKCG 302
Query: 329 ILMQASYPVK 338
I ++ SYPVK
Sbjct: 303 IAIEPSYPVK 312
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 155/337 (45%), Positives = 200/337 (59%), Gaps = 33/337 (9%)
Query: 33 FLLWVLGIPAGAWS-EGYPQKYDPQSME---ERFENWLKQYSREYGSEDEWQRRFGIYSS 88
F L+ AG +S GY + D +SM+ E FE+W+ ++ + Y S +E RF I+
Sbjct: 15 FCLFASLAVAGDFSIVGYSSE-DLKSMDKLIELFESWMSRHGKIYQSIEEKLHRFDIFKD 73
Query: 89 NVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY----LGLPA 144
N+++ID N ++ L N+FADLS++EF + YLG Y+ R ++ LP
Sbjct: 74 NLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKYLGLKVDYSRRRESPEEFTYKDFELPK 133
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
SVDWRK+GAVT VK+QG CGSCWAFS VAAVEGIN++ TG L SLSEQEL+DCD + N
Sbjct: 134 SVDWRKKGAVTQVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNN 192
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------ 258
GCNGG M+ AF FI + GG+ E+DYPY + C+ K + VTI+GY +P
Sbjct: 193 GCNGGLMDYAFSFIVENGGLHKEEDYPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQS 252
Query: 259 ----------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNS 302
+ FQ YS GVFD +CG L+HGV VGYG G Y +VKNS
Sbjct: 253 LLKALVNQPLSVAIEASGRDFQFYSGGVFDGHCGSDLDHGVAAVGYGTSKGVNYIIVKNS 312
Query: 303 WGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
WG+ WGE GYIRM RN GICGI ASYP K+
Sbjct: 313 WGSKWGEKGYIRMRRNIGKPE-GICGIYKMASYPTKK 348
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium distachyon]
Length = 457
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 149/310 (48%), Positives = 187/310 (60%), Gaps = 33/310 (10%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
E FE WL ++ + Y S +E RF ++ N+++ID +N + S+ L N+FADL++EEF
Sbjct: 148 ELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVTSYWLGLNEFADLTHEEFK 207
Query: 120 STYLGYN--KPYNEPR----WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
+TYLG P E R + V LP SVDWR +GAVT VK+QGQCGSCWAFS VA
Sbjct: 208 ATYLGLAPPAPARESRGSFKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGSCWAFSTVA 267
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEGIN + TG L +LSEQEL+DC V+ N GCNGG M+ AF +I GG+ TE+ YPY
Sbjct: 268 AVEGINAIVTGNLTALSEQELIDCSVDG-NNGCNGGLMDYAFSYIASSGGLHTEEAYPYL 326
Query: 234 GKNDRC-QTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGV 270
+ C K++ AVTI+GYE +PA FQ YS GV
Sbjct: 327 MEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQPVSVAIEASGRHFQFYSGGV 386
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGE--KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
FD CG QL+HGV VGYG D G+ Y +V+NSWG WGE GYIRM R + G+CG
Sbjct: 387 FDGPCGTQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAKWGEKGYIRMKRGTGKGE-GLCG 445
Query: 329 ILMQASYPVK 338
I ASYP K
Sbjct: 446 INKMASYPTK 455
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 143/292 (48%), Positives = 179/292 (61%), Gaps = 35/292 (11%)
Query: 81 RRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYLGYN----------KP 128
+RF I+ N+++ID N ++N ++KL KF DL+NEE+ S YLG K
Sbjct: 72 KRFNIFKDNLRFIDLHNEKNKNATYKLGLTKFTDLTNEEYRSLYLGARTEPVRRIAKAKN 131
Query: 129 YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
N+ +V +P +VDWR +GAV P+KDQG CGSCWAFS AAVEGINK+ TG+L+S
Sbjct: 132 VNQKYSAAVDGKEVPETVDWRLKGAVNPIKDQGTCGSCWAFSTAAAVEGINKIVTGELIS 191
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
LSEQELVDCD NS NQGCNGG M+ AF+FI K GG+ TE DYPYRG +C +
Sbjct: 192 LSEQELVDCD-NSYNQGCNGGLMDYAFQFIMKNGGLKTEKDYPYRGFGGKCNSFLKNAKV 250
Query: 249 VTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
V+I GYE +P + FQ Y G+F CG L+H V V
Sbjct: 251 VSIDGYEDVPTKDETALKRAISLQPVSVAIEAGGRIFQHYQTGIFTGNCGTNLDHAVVAV 310
Query: 287 GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GYG ++G YW+V+NSWG WGE GYIRM RN SS G CGI ++ASYPVK
Sbjct: 311 GYGSENGVDYWIVRNSWGPRWGEEGYIRMERNLASSKSGKCGIAVEASYPVK 362
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 140/308 (45%), Positives = 192/308 (62%), Gaps = 33/308 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFIS 120
+ +WL ++ + Y + E + RF I+ N++YID N+ + S++L N+FADL+NEE+ +
Sbjct: 49 YNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADLTNEEYRA 108
Query: 121 TYLGYNKPYNEP--------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
YLG + P R+ V+ LP S+DWR++GAV VKDQG CGSCWAFSA+
Sbjct: 109 KYLGTKSRESRPKLSKGPSDRYAPVEGEELPDSIDWREKGAVAAVKDQGSCGSCWAFSAI 168
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AVEGIN++ TG+L++LSEQELVDCD S N+GC GG M+ AF FI K GG+ ++ DYPY
Sbjct: 169 GAVEGINQITTGELITLSEQELVDCD-RSYNEGCEGGLMDYAFNFIIKNGGIDSDLDYPY 227
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGV 270
G++ C +K VTI YE +P FQLY G+
Sbjct: 228 TGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQLYVSGI 287
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
F CG ++HGV VVGYG + G YW+V+NSWG +WGEAGY++M RN S+ G+CGI
Sbjct: 288 FTGKCGTAVDHGVVVVGYGSEEGMDYWIVRNSWGAAWGEAGYLKMQRNVGKSS-GLCGIT 346
Query: 331 MQASYPVK 338
++ SYPVK
Sbjct: 347 IEPSYPVK 354
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 271 bits (693), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 192/319 (60%), Gaps = 38/319 (11%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFA 111
D M+++ + W+ ++ R Y +E R+ ++ NV+ I+ +N+ +FKL N+FA
Sbjct: 30 DELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFA 89
Query: 112 DLSNEEFISTYLGYNKPY----------NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
DL+N+EF Y GY + R+ +V + LP +VDWRK+GAVTP+K+QG
Sbjct: 90 DLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNVFFGALPIAVDWRKKGAVTPIKNQG 149
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
CG CWAFSAVAA+EG ++K GKL+SLSEQ+LVDCD N + GC+GG M+ AFE I
Sbjct: 150 SCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCSGGLMDTAFEHIMAT 207
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR--------------------- 260
GG+TTE +YPY+G++ C+ TK A +ITGYE +P
Sbjct: 208 GGLTTESNYPYKGEDANCKIKSTKPSAASITGYEDVPVNDENALMKAVAHQPVSVGIEGG 267
Query: 261 -YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
+ FQ YS GVF C L+H VT VGY + G KYW++KNSWGT WGE GY+R+ ++
Sbjct: 268 GFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRIKKD 327
Query: 319 SPSSNIGICGILMQASYPV 337
G+CG+ M+ASYP
Sbjct: 328 IKDKE-GLCGLAMKASYPT 345
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 145/304 (47%), Positives = 185/304 (60%), Gaps = 28/304 (9%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE+W+ ++ R Y S +E RF I+ N+ +ID N + ++ L N+FADLS+EEF +
Sbjct: 47 FESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKKVRNYWLGLNEFADLSHEEFKNK 106
Query: 122 YLGYNKPYNE----PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEG 177
YLG ++ P + + + +P SVDWRK+GAVTPVK+QG CGSCWAFS VAAVEG
Sbjct: 107 YLGLKPDLSKRAQCPEEFTYKDVAIPKSVDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEG 166
Query: 178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
IN++ TG L SLSEQEL+DCD + N GCNGG M+ AF +I GG+ E+DYPY +
Sbjct: 167 INQIVTGNLTSLSEQELIDCDT-TYNNGCNGGLMDYAFAYIVANGGLHKEEDYPYIMEEG 225
Query: 238 RCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVFDEYC 275
C K + AVTI+GY +P FQ YS GVFD +C
Sbjct: 226 TCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYSGGVFDGHC 285
Query: 276 GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASY 335
G +L+HGV VGYG G Y +VKNSWG WGE GYIRM R + S GICGI ASY
Sbjct: 286 GTELDHGVAAVGYGTSKGLDYIIVKNSWGPKWGEKGYIRMKRKT-SKPEGICGIYKMASY 344
Query: 336 PVKR 339
P K+
Sbjct: 345 PTKK 348
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 138/291 (47%), Positives = 182/291 (62%), Gaps = 32/291 (10%)
Query: 78 EWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRW 134
E +RRF ++ N++++D N+ + F+L N+FADL+NEEF +T+LG K R
Sbjct: 69 EHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNEEFRATFLGA-KVAERSRA 127
Query: 135 PSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+Y LP SVDWR++GAV PVK+QGQCGSCWAFSAV+ VE IN+L TG++++L
Sbjct: 128 AGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQLVTGEMITL 187
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQELV+C N +N GCNGG M AF+FI K GG+ TEDDYPY+ + +C ++ V
Sbjct: 188 SEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVV 247
Query: 250 TITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVG 287
+I G+E +P FQLY GVF CG L+HGV VG
Sbjct: 248 SIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVG 307
Query: 288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YG D+G+ YW+V+NSWG WGE+GY+RM RN + G CGI M ASYP K
Sbjct: 308 YGTDNGKDYWIVRNSWGPKWGESGYVRMERNI-NVTTGKCGIAMMASYPTK 357
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 145/340 (42%), Positives = 196/340 (57%), Gaps = 37/340 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L I + A++ + ++ +E+WL +Y + Y S EW+RRF I+
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN--------EPRWPSVQYL 140
+++ID N+ N S+K+ N+FADL++EEF STYLG+ N EPR V
Sbjct: 70 LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPRVGQV--- 126
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP+ VDWR GAV +K QG+CG CWAFSA+A VEGINK+ TG L+SLSEQEL+DC
Sbjct: 127 -LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-- 258
+GCNGGY+ F+FI GG+ TE++YPY ++ C + VTI YE +P
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNEKYVTIDTYENVPYN 245
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
A AF+ YS G+F CG ++H VT+VGYG + G YW+
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSW T+WGE GY+R+ RN + G CGI SYPVK
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA--GTCGIATMPSYPVK 343
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 151/305 (49%), Positives = 192/305 (62%), Gaps = 32/305 (10%)
Query: 65 WLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG 124
WL ++ + Y E RF I+ +N+++ID NSQN ++K+ KFADL+NEE+ + +LG
Sbjct: 7 WLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHTYKVGLTKFADLTNEEYRAMFLG 66
Query: 125 Y----NKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
+ + + PS +Y LP SVDWR +GAV P+KDQG CGSCWAFS VAAV
Sbjct: 67 TRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAFSTVAAV 126
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGIN++ TG+L+SLSEQELVDCD + N GCNGG M+ AF+FI GG+ TE DYPY G
Sbjct: 127 EGINQIVTGELISLSEQELVDCD-RTYNAGCNGGLMDYAFQFIINNGGLDTEKDYPYVGD 185
Query: 236 NDRCQTDKTKHHAVTITGYE---------------------AIPAR-YAFQLYSHGVFDE 273
+D+C DK K AV+I G+E AI A A Q Y GVF
Sbjct: 186 DDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQSGVFTG 245
Query: 274 YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
CG L+HGV VVGY ++G YWLV+NSWGT WGE GYI+M RN + G CGI M++
Sbjct: 246 ECGTALDHGVVVVGYASENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGRCGIAMES 305
Query: 334 SYPVK 338
SYPVK
Sbjct: 306 SYPVK 310
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 186/313 (59%), Gaps = 33/313 (10%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSN 115
S +E W+ + R Y E +RRF I+ N +YI+ N Q N ++ L N FAD+++
Sbjct: 29 SFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTH 88
Query: 116 EEFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
+EF + Y G P + +Y LP DWR +GAV VK+QG CGSCWAFS V
Sbjct: 89 DEFKALYFGTKVPLSNTIKSGFRYEDATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTV 148
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AAVEG+N++ TG+LVSLSEQELVDCD +NQGCNGG M+ AFEFI + GG+ +E DYPY
Sbjct: 149 AAVEGVNQIVTGELVSLSEQELVDCD-KQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPY 207
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGV 270
+ + C + H VTI G+E +PA FQLYS GV
Sbjct: 208 KAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGV 267
Query: 271 FDEYCGHQLNHGVTVVGYGEDH-----GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
+ +CG++L+HGV VGYG YW+V+NSWG +WGE+GYIR+ RN SS G
Sbjct: 268 YTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASSR-G 326
Query: 326 ICGILMQASYPVK 338
CGI M ASYPVK
Sbjct: 327 KCGIAMMASYPVK 339
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 146/314 (46%), Positives = 181/314 (57%), Gaps = 35/314 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFK--LTDNKFADLSN 115
M +R E W+ ++ R Y + E RR ++ NV +I+ +N+ K L +N+FADL+N
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 116 EEFISTYLGY-------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EF +T G N+ R+ +V LPASVDWR +GAV PVKDQG CG CWA
Sbjct: 61 AEFRATRTGLRPSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWA 120
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSAVAA+EG KL TGKLVSLSEQ+LV CDV E+QGC GG M+ AF+FI K GG+ E
Sbjct: 121 FSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAES 180
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
DYPY +D+C T A TI GYE +PA FQ Y
Sbjct: 181 DYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFY 240
Query: 267 SHGVFDEY--CGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
GV C +L+H +T VGYG G KYWL+KNSWGTSWGE GY+RM R
Sbjct: 241 KGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKE 300
Query: 324 IGICGILMQASYPV 337
G+CG+ M ASYP
Sbjct: 301 -GVCGLAMMASYPT 313
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 149/317 (47%), Positives = 185/317 (58%), Gaps = 37/317 (11%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S+ +E W Y SR D +RRF ++ N +Y+ N ++ F+L NKFAD+
Sbjct: 35 ESLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYVHEGNKRDRPFRLALNKFADM 94
Query: 114 SNEEFISTYLGYNKPYN---------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
+ +EF TY G ++ + + LP +VDWR++GAVT +KDQGQCG
Sbjct: 95 TTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQGQCG 154
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS + AVEGINK++TGKLVSLSEQEL+DCD N NQGC GG M+ AF+FI K G+
Sbjct: 155 SCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCD-NVNNQGCEGGLMDYAFQFIQK-NGI 212
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE +YPY+G+ C K AVTI GYE +PA
Sbjct: 213 TTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDASGQD 272
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF C L+HGV VGYG G KYW+VKNSWG WGE GYIRM R S
Sbjct: 273 FQFYSEGVFTGECSTDLDHGVAAVGYGATRDGTKYWIVKNSWGEDWGEKGYIRMQRGV-S 331
Query: 322 SNIGICGILMQASYPVK 338
G+CGI MQASYP K
Sbjct: 332 QTEGLCGIAMQASYPTK 348
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 148/314 (47%), Positives = 189/314 (60%), Gaps = 33/314 (10%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+ + E FE WL ++ + Y S +E RF ++ N+++ID IN + S+ L N+FADL++
Sbjct: 43 ERLVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINREVTSYWLGLNEFADLTH 102
Query: 116 EEFISTYLGYNKP------YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
+EF + YLG + R+ V LP SVDWRK+GAVT VK+QGQCGSCWAF
Sbjct: 103 DEFKAAYLGLDAAPARRGSSRSFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSCWAF 162
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
S VAAVEGIN + TG L +LSEQEL+DC V+ N GCNGG M+ AF +I GG+ TE+
Sbjct: 163 STVAAVEGINAIVTGNLTALSEQELIDCSVDG-NSGCNGGLMDYAFSYIASSGGLHTEEA 221
Query: 230 YPYRGKNDRC-QTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
YPY + C K + AVTI+GYE +PA FQ Y
Sbjct: 222 YPYLMEEGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQFY 281
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGE--KYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
S GVFD CG QL+HGV VGYG D G+ Y +V+NSWG WGE GYIRM R + S+
Sbjct: 282 SGGVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVRNSWGAQWGEKGYIRMKRGT-SNGE 340
Query: 325 GICGILMQASYPVK 338
G+CGI ASYP K
Sbjct: 341 GLCGINKMASYPTK 354
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 146/325 (44%), Positives = 190/325 (58%), Gaps = 44/325 (13%)
Query: 56 QSMEERFENWLKQYS----------REYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKL 105
+S+ +E W +Y+ R ++ + RRF ++ NV+YI N ++ F+L
Sbjct: 32 ESLRGLYERWRSRYTVSPSTPGSGLRGKLADHDPARRFNVFKENVKYIHEANKKDRPFRL 91
Query: 106 TDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG---------LPASVDWRKEGAVTP 156
NKFAD++ +E +Y G ++ + G LP +VDWR++GAVT
Sbjct: 92 ALNKFADMTTDELRHSYAGSRVRHHRALSGGRRAQGNFTYSDAENLPPAVDWREKGAVTG 151
Query: 157 VKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFE 216
+KDQGQCGSCWAFS +AAVE INK++TGKLVSLSEQEL+DCD N +QGC+GG M+ AF+
Sbjct: 152 IKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQELMDCD-NVNDQGCDGGLMDYAFQ 210
Query: 217 FITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------- 260
FI K GGVT+E +YPY+G+ + C K H V I GYE +PA
Sbjct: 211 FIQKNGGVTSEANYPYQGQQNTCDQAKENTHDVAIDGYEDVPANDESALQKAVAYQPVSV 270
Query: 261 ------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYI 313
FQ YS GVF C L+HGV VGYG G KYW+VKNSWG WGE GYI
Sbjct: 271 AIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGYGTARDGTKYWIVKNSWGLDWGEKGYI 330
Query: 314 RMARNSPSSNIGICGILMQASYPVK 338
RM R + G+CGI MQASYP+K
Sbjct: 331 RMQRGVSQAE-GLCGIAMQASYPIK 354
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 148/315 (46%), Positives = 191/315 (60%), Gaps = 31/315 (9%)
Query: 54 DPQSME---ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
D +SM+ E FE+W+ ++ + Y S +E RF I+ N+++ID N ++ L N+F
Sbjct: 36 DLKSMDKLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEF 95
Query: 111 ADLSNEEFISTYLGYNKPYNEPRWPSVQY----LGLPASVDWRKEGAVTPVKDQGQCGSC 166
ADLS++EF + YLG Y+ R ++ + LP SVDWRK+GAV PVK+QG CGSC
Sbjct: 96 ADLSHQEFKNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSC 155
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS VAAVEGIN++ TG L SLSEQEL+DCD + N GCNGG M+ AF FI + GG+
Sbjct: 156 WAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENGGLHK 214
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQ 264
E+DYPY + C+ K + VTI+GY +P + FQ
Sbjct: 215 EEDYPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQ 274
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GVFD +CG L+HGV VGYG G Y +VKNSWG+ WGE GYIRM RN
Sbjct: 275 FYSGGVFDGHCGSDLDHGVAAVGYGTAKGVDYIIVKNSWGSKWGEKGYIRMRRNIGKPE- 333
Query: 325 GICGILMQASYPVKR 339
GICGI ASYP K+
Sbjct: 334 GICGIYKMASYPTKK 348
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 202/349 (57%), Gaps = 36/349 (10%)
Query: 22 RMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSME---ERFENWLKQYSREYGSEDE 78
+ ++ L LFL G GY + D +SM+ E FE+W+ ++ + Y + +E
Sbjct: 7 KTLVLTCSLCLFLSLAFGRDFSIV--GYSSE-DLKSMDKLIELFESWMSRHGKIYETIEE 63
Query: 79 WQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ 138
RF ++ N+++ID N ++ L N+FADLS++EF + YLG ++ R S +
Sbjct: 64 KLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVNLSQRRESSNE 123
Query: 139 Y------LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
+ LP SVDWRK+GAVTPVK+QGQCGSCWAFS VAAVEGIN++ TG L SLSEQ
Sbjct: 124 EEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQ 183
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
EL+DCD + N GCNGG M+ AF FI + GG+ EDDYPY + C+ K + VTI
Sbjct: 184 ELIDCDT-TYNNGCNGGLMDYAFSFIVQNGGLHKEDDYPYIMEESTCEMKKEETQVVTIN 242
Query: 253 GYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
GY +P + FQ YS GVFD +CG L+HGV+ VGYG
Sbjct: 243 GYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLDHGVSAVGYGT 302
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
Y +VKNSWG WGE G+IRM RN GICG+ ASYP K+
Sbjct: 303 SKNLDYIIVKNSWGAKWGEKGFIRMKRNIGKPE-GICGLYKMASYPTKK 350
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 150/349 (42%), Positives = 202/349 (57%), Gaps = 44/349 (12%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER----FENWLKQY--SREYGSEDEWQR 81
A S+ L V+ + + P + EE +E W + SR+ E +
Sbjct: 2 ATKSMLLALVVALAFVGVARTIPFNEKDLASEESLWGLYERWRSHHTVSRDL---SEKNK 58
Query: 82 RFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG 141
RF ++ N ++I N ++ +KL NKFAD++N+EF STY G ++ + + + G
Sbjct: 59 RFNVFKENAKFIHEFNKKDAPYKLGLNKFADMTNQEFRSTYAGSKIHHHRTQRGTPRATG 118
Query: 142 ---------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
+PASVDWR +GAV PVKDQGQCGSCWAFS +A+VEGINK+KT +LV LS Q
Sbjct: 119 SFMYENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKTNQLVPLSGQ 178
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
+LVDCD + +N+GCNGG M+ AFEFI GG+T+E YPY + C ++ + VTI
Sbjct: 179 QLVDCDTD-QNEGCNGGLMDYAFEFIKSNGGITSESAYPYTAEQGSCASESSA-PVVTID 236
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
GYE +PA AFQ YS GVF CG++L+HGV VVGYG
Sbjct: 237 GYEDVPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNELDHGVAVVGYGA 296
Query: 291 DH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G KYW+V+NSWG WGE GYIRM R + + G+CGI M+ SYP+K
Sbjct: 297 TRDGTKYWIVRNSWGAEWGEKGYIRMQRGIRARH-GLCGIAMEPSYPLK 344
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 150/349 (42%), Positives = 200/349 (57%), Gaps = 40/349 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEW 79
M +N L LFL+ + W S ++ ER E W+ QY R Y E
Sbjct: 1 MNSFSQNHYLILFLVLAV------WTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEK 54
Query: 80 QRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN------EP 132
++RF ++ +NV +I+ N+ + F L+ N+FADL++EEF + + K + E
Sbjct: 55 EKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTET 114
Query: 133 RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
+ +PA++DWRK GAVTP+KDQG+CGSCWAFSAVAA EGI+++ TGKLV LSEQ
Sbjct: 115 SFRYESVTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQ 174
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
ELVDC V E++GC GGY++ AFEFI K GG+ +E YPY+G N C+ K H I
Sbjct: 175 ELVDC-VKGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIK 233
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFD-EYCGHQLNHGVTVVGYG 289
GYE +P+ +AF+ YS G+F+ CG NH V VVGYG
Sbjct: 234 GYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYG 293
Query: 290 ED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+ G KYWLVKNSWGT WGE GYIR+ R+ + G+CGI YP
Sbjct: 294 KALDGSKYWLVKNSWGTEWGERGYIRIKRDIRAKE-GLCGIAKYPYYPT 341
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 194/340 (57%), Gaps = 37/340 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L + + A++ K ++ +E+WL +Y + Y S EW+RRF I+
Sbjct: 10 MSLLFFSTLLVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKET 69
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN--------EPRWPSVQYL 140
+++ID N+ N S+++ N+FAD +NEEF STYLG+ N EPR V
Sbjct: 70 LRFIDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFTSGSNKMKVSNRYEPRVGQV--- 126
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP VDWR GAV +K QGQCGSCWAFSA+A VEGINK+ TG L+SLSEQELVDC
Sbjct: 127 -LPDYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRT 185
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-- 258
+GC+GG + F+FI GG+ TE +YPY ++ +C D +I YE +P
Sbjct: 186 QNTRGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCNLDLQNEKYASIDTYENVPYN 245
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
A AFQ YS G+F CG ++H VT+VGYG + G YW+
Sbjct: 246 NEWALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWI 305
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSW T+WGE GYIR+ RN + G CGI + SYPVK
Sbjct: 306 VKNSWDTTWGEEGYIRILRNVGGA--GTCGIATKPSYPVK 343
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 190/308 (61%), Gaps = 34/308 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
F++W+ ++ + YGS E +RR I+ N+++I N++NLS++L +FADLS E+
Sbjct: 56 FDSWMVKHGKVYGSVAEKERRLTIFEDNLRFISNRNAENLSYRLGLTQFADLSLHEYGEV 115
Query: 122 YLGYNK--PYN------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
G + P N R+ + LP SVDWR EGAVT VKDQG C SCWAFS V
Sbjct: 116 CHGADPRPPRNHVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVG 175
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEG+NK+ TG+LV+LSEQ+L++C N EN GC GG +E A+EFI K GG+ T++DYPY+
Sbjct: 176 AVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMKNGGLGTDNDYPYK 233
Query: 234 GKNDRCQTD-KTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
N C K + V I G+E +PA FQLY GV
Sbjct: 234 AVNGVCDGRLKENNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESGV 293
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
FD CG LNHGV VVGYG ++G YWLVKNS G +WGEAGY++MARN + G+CGI
Sbjct: 294 FDGSCGTNLNHGVVVVGYGTENGRDYWLVKNSRGNTWGEAGYMKMARNIANPR-GLCGIA 352
Query: 331 MQASYPVK 338
M+ASYP+K
Sbjct: 353 MRASYPLK 360
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 146/314 (46%), Positives = 181/314 (57%), Gaps = 35/314 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFK--LTDNKFADLSN 115
M +R E W+ ++ R Y + E RR ++ NV +I+ +N+ K L +N+FADL+N
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 116 EEFISTYLGY-------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EF +T G N+ R+ +V LPASVDWR +GAV PVKDQG CG CWA
Sbjct: 61 AEFRATRTGLRPSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGCCWA 120
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSAVAA+EG KL TGKLVSLSEQ+LV CDV E+QGC GG M+ AF+FI K GG+ E
Sbjct: 121 FSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLAAES 180
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
DYPY +D+C T A TI GYE +PA FQ Y
Sbjct: 181 DYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHFQFY 240
Query: 267 SHGVFDEY--CGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
GV C +L+H +T VGYG G KYWL+KNSWGTSWGE GY+RM R
Sbjct: 241 KGGVLSGAAGCATELDHAITAVGYGVASDGTKYWLMKNSWGTSWGEDGYVRMERGVADKE 300
Query: 324 IGICGILMQASYPV 337
G+CG+ M ASYP
Sbjct: 301 -GVCGLAMMASYPT 313
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 183/307 (59%), Gaps = 30/307 (9%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
E+F W ++ + Y ++ RF ++ N+ YI + + N ++ L KFADL+NEEF
Sbjct: 52 EQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRH-SETNRTYSLGLTKFADLTNEEFR 110
Query: 120 STYLG--YNKPYNEPRWPSVQYLG--LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
Y G ++ R +Y P SVDWRK GAVT VKDQG CGSCWAFSAV +V
Sbjct: 111 RMYTGTRIDRSRRAKRRTGFRYADSEAPESVDWRKNGAVTSVKDQGSCGSCWAFSAVGSV 170
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGIN ++ G+ VSLSEQELVDCD+ NQGCNGG M+ AF+FI + GG+ TE DYPY+G
Sbjct: 171 EGINAIRNGEAVSLSEQELVDCDLEY-NQGCNGGLMDYAFDFIIQNGGIDTEKDYPYKGF 229
Query: 236 NDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDE 273
+ RC K H VTI GYE +P FQLY+ GVF
Sbjct: 230 DGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYAQGVFSG 289
Query: 274 YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN--IGICGILM 331
CG L+HGV VGYG + G YW+VKNSWG WGE+GY+RM RN SN G+CGI +
Sbjct: 290 ECGTDLDHGVLAVGYGTEDGVDYWIVKNSWGEYWGESGYLRMKRNMKDSNDGPGLCGINI 349
Query: 332 QASYPVK 338
+ SY VK
Sbjct: 350 EPSYAVK 356
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 145/340 (42%), Positives = 194/340 (57%), Gaps = 37/340 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L I + A++ + ++ +E+WL +Y + Y S EW+RRF I+
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN--------EPRWPSVQYL 140
+++ID N+ N S+K+ N+FADL++EEF STYLG+ N EPR V
Sbjct: 70 LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFTSGSNKTKVSNRYEPRVGQV--- 126
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP+ VDWR GAV +K QG+CG CWAFSA+A VEGINK+ TG L+SLSEQEL+DC
Sbjct: 127 -LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-- 258
+GCNG Y+ F FI GG+ TE++YPY ++ C D VTI YE +P
Sbjct: 186 QNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYN 245
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
A AF+ YS G+F CG ++H VT+VGYG + G YW+
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 305
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSW T+WGE GY+R+ RN + G CGI SYPVK
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA--GTCGIATMPSYPVK 343
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 152/349 (43%), Positives = 210/349 (60%), Gaps = 41/349 (11%)
Query: 23 MMLRNAVLSL-FLLWVLGIPAGAWSEGYPQKYDPQS--MEERFENWLKQYSREYGSEDEW 79
M N +++L +LW A A++ YD S + + + W+ QY R Y ++ E
Sbjct: 1 MKHLNPIIALCTMLW-----ACAYTAMSRTLYDETSSVVAKTHQQWMLQYGRSYTNDAEM 55
Query: 80 QRRFGIYSSNVQYIDYINSQ--NLSFKLTDNKFADLSNEEFISTYLGY-------NKPYN 130
++RF I+ N++YI+ N+ N S+KL N+F+DL+NEEFI+++ G +
Sbjct: 56 EKRFKIFMENLEYIEKFNNAPGNKSYKLDLNQFSDLTNEEFIASHTGLMIDPSKPSSSSK 115
Query: 131 EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
S+ P S+DWR++GAVT VK+QG CGSCWAFSAVAAVEGI K+K G L+SLS
Sbjct: 116 RASPASLDLSDTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLS 175
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQ+LVDC N +NQGC GG+M+ AF +IT+ G+ +E+DY YRG CQ ++ A
Sbjct: 176 EQQLVDCASNEQNQGCGGGFMDNAFSYITE-NGIASENDYQYRGGAGTCQNNEMITPAAR 234
Query: 251 ITGYEAIPA--------------------RYAFQLYSHGVFDEYCGHQLNHGVTVVGYG- 289
I+GYE +PA +F LY G++ CG LNHGVT+VGYG
Sbjct: 235 ISGYEDVPAGEDQLLLAVSQQPVSVAIAVGQSFHLYKEGIYSGPCGSSLNHGVTLVGYGT 294
Query: 290 -EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
E+ G KYWL+KNSWG SWGE GY+R+ R S S G CGI ++AS+P
Sbjct: 295 SEEDGTKYWLIKNSWGESWGENGYMRLLRESGQSE-GHCGIAVKASHPT 342
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 154/351 (43%), Positives = 203/351 (57%), Gaps = 44/351 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEW 79
M +N L LFL VL + W S ++ ER E W+ QY R Y E
Sbjct: 1 MNSFSQNHYLILFL--VLSV----WTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEK 54
Query: 80 QRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN-------- 130
++RF ++ +NV +I+ N+ + F L+ N+FADL++EEF + + K +
Sbjct: 55 EKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTQT 114
Query: 131 EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
R+ SV +PA++DWRK GAVTP+KDQG+CGSCWAFSAVAA EGI+++ TGKLV LS
Sbjct: 115 SFRYESVT--KIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLS 172
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQELVDC V E++GC GGY++ AFEFI K GG+ +E YPY+G N C+ K H
Sbjct: 173 EQELVDC-VKGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAE 231
Query: 251 ITGYEAIPAR----------------------YAFQLYSHGVFD-EYCGHQLNHGVTVVG 287
I GYE +P+ +AF+ YS G+F+ CG NH V VVG
Sbjct: 232 IKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAVAVVG 291
Query: 288 YGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YG+ G KYWLVKNSWGT WGE GYIR+ R+ + G+CGI YP
Sbjct: 292 YGKALDGSKYWLVKNSWGTEWGERGYIRIKRDIRAKE-GLCGIAKYPYYPT 341
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 145/340 (42%), Positives = 195/340 (57%), Gaps = 37/340 (10%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+SL L I + A++ + ++ +E+WL +Y + Y S EW+RRF I+
Sbjct: 10 MSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKET 69
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN--------EPRWPSVQYL 140
+++ID N+ N S+K+ N+FADL++EEF STYL + N EPR V
Sbjct: 70 LRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFTSGSNKTKVSNRYEPRVGQV--- 126
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP+ VDWR GAV +K QG+CG CWAFSA+A VEGINK+ TG L+SLSEQEL+DC
Sbjct: 127 -LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRT 185
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-- 258
+GCNGGY+ F+FI GG+ TE++YPY ++ C D VTI YE +P
Sbjct: 186 QNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYN 245
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
A AF+ YS G+F CG ++H VT+VGYG + G YW+
Sbjct: 246 NEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWI 305
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSW T+WGE GY+R+ RN + G CGI SYPVK
Sbjct: 306 VKNSWDTTWGEEGYMRILRNVGGA--GTCGIATMPSYPVK 343
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 150/340 (44%), Positives = 192/340 (56%), Gaps = 39/340 (11%)
Query: 32 LFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQ 91
LFL + L A Y + +E WL ++ + Y E +RF ++ N+
Sbjct: 13 LFLSFTLSC---AIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLG 69
Query: 92 YI-DYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG--------- 141
+I ++ N+QN ++KL N+FAD++NEE+ Y G K + R + G
Sbjct: 70 FIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFG-TKSDAKRRLMKTKSTGHRYAYSAGD 128
Query: 142 -LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP VDWR +GAV P+KDQG CGSCWAFS VA VE INK+ TGK VSLSEQELVDCD
Sbjct: 129 RLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCD-R 187
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-- 258
+ N+GCNGG M+ AFEFI + GG+ T+ DYPYRG + C K V I G+E +P
Sbjct: 188 AYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGFEDVPPY 247
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
+ QLY GVF CG L+HGV VVGYG ++G YWL
Sbjct: 248 DENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYGSENGVDYWL 307
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
V+NSWGT WGE GY +M RN + G CGI M+ASYPVK
Sbjct: 308 VRNSWGTGWGEDGYFKMQRNVRTPT-GKCGITMEASYPVK 346
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 144/308 (46%), Positives = 188/308 (61%), Gaps = 34/308 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE+W+ ++ + Y S E +RR I+ N+++I NS+NL ++L N+FADLS E+
Sbjct: 64 FESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSENLGYRLGLNRFADLSLHEYKEI 123
Query: 122 YLGYN-KP-------YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
G + KP + R+ + LP SVDWR EGAVT VKDQG C SCWAFS V
Sbjct: 124 CHGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVG 183
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEG+NK+ TG+LV+LSEQ+L++C N EN GC GG +E A+EFI GG+ T++DYPY+
Sbjct: 184 AVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIVSNGGLGTDNDYPYK 241
Query: 234 GKNDRCQTD-KTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
N C K V I GYE +PA FQLY GV
Sbjct: 242 AVNGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYESGV 301
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
FD CG LNHGV VVGYG ++G YW+V+NSWG +WGEAGY++MARN + G+CGI
Sbjct: 302 FDGRCGTNLNHGVVVVGYGTENGRNYWIVRNSWGNTWGEAGYMKMARNIANPR-GLCGIA 360
Query: 331 MQASYPVK 338
M+ SYP+K
Sbjct: 361 MRVSYPLK 368
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 139/300 (46%), Positives = 179/300 (59%), Gaps = 48/300 (16%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E WL ++ + Y + E +RRF I+ N+++ID N++N ++K++D
Sbjct: 4 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKISD-------------- 49
Query: 122 YLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKL 181
R+ LP SVDWRK+GAV VKDQG CGSCWAFS +AAVEGINK+
Sbjct: 50 -----------RYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKI 98
Query: 182 KTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQT 241
TG L+SLSEQELVDCD S N+GCNGG M+ AFEFI GG+ +E+DYPY+ + RC
Sbjct: 99 VTGGLISLSEQELVDCDT-SYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQ 157
Query: 242 DKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQL 279
+ VTI GYE +P FQLY G+F CG L
Sbjct: 158 YRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTAL 217
Query: 280 NHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
+HGVT VGYG ++G YW+VKNSWG SWGE GYIRM R+ +S G CGI M+ASYP+K+
Sbjct: 218 DHGVTAVGYGTENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPIKK 277
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 143/314 (45%), Positives = 186/314 (59%), Gaps = 33/314 (10%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLS 114
+S +E W+ + R Y E +RRF I+ N +YI+ N Q N ++ L N FAD++
Sbjct: 28 RSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMT 87
Query: 115 NEEFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
++EF + Y G P + +Y LP DWR +GAV VK+QG CGSCWAFS
Sbjct: 88 HDEFKALYFGTKVPLSNTIKSGFRYKDATNLPLDTDWRSKGAVATVKNQGACGSCWAFST 147
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
VAAVEG+N++ TG+LVSLSEQELVDCD +NQGCNGG M+ AFEFI + GG+ +E DYP
Sbjct: 148 VAAVEGVNQIVTGELVSLSEQELVDCD-KQKNQGCNGGLMDSAFEFIIQNGGLDSEADYP 206
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHG 269
Y+ + C + H VTI G+E +PA FQLYS G
Sbjct: 207 YKAVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGG 266
Query: 270 VFDEYCGHQLNHGVTVVGYGEDH-----GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
V+ +CG++L+HGV VGYG YW+V+NSWG +WGE+GYIR+ RN S
Sbjct: 267 VYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASPR- 325
Query: 325 GICGILMQASYPVK 338
G CGI M ASYPVK
Sbjct: 326 GKCGIAMMASYPVK 339
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/306 (45%), Positives = 192/306 (62%), Gaps = 31/306 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFIS 120
FE+W K++ + Y S+++ RF I+ N +++ NSQ N S+ L+ N FADL++ EF +
Sbjct: 32 FESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFKA 91
Query: 121 TYLGYNK-----PYNEPRWPSVQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
+ LG + + +P ++G +P S+DWRK+GAV+ VKDQG CG+CW+FSA A
Sbjct: 92 SRLGLSAFSTSGKLSRRNFPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATGA 151
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
+EGINK+ TG LVSLSEQELVDCD S N GC GG M+ A++F+ + G+ TE+DYPY+
Sbjct: 152 IEGINKIVTGSLVSLSEQELVDCD-RSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQA 210
Query: 235 KNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFD 272
+ C +K K H VTI GY +P + AFQLYS G+F
Sbjct: 211 REKTCNKEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFT 270
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
C L+H V +VGYG ++G YW+VKNSWGT WG GY+ M RNS +S G+CGI M
Sbjct: 271 GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQ-GLCGINML 329
Query: 333 ASYPVK 338
AS+PVK
Sbjct: 330 ASFPVK 335
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 183/310 (59%), Gaps = 36/310 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E W ++ S DE RF ++ NV ++ N + +KL N+FAD++N EF S
Sbjct: 40 YERWRSHHTVSR-SLDEKHNRFNVFKGNVMHVHSSNKMDKPYKLKLNRFADMTNHEFRSI 98
Query: 122 YLGYNKPYNE-----PRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
Y G ++ PR +P+SVDWRK+GAVT VKDQGQCGSCWAFS +
Sbjct: 99 YAGSKVNHHRMFRGTPRGNGTFMYQNVDRVPSSVDWRKKGAVTDVKDQGQCGSCWAFSTI 158
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AVEGIN++KT KLV LSEQELVDCD ++NQGCNGG ME AFEFI + G+TT +YPY
Sbjct: 159 VAVEGINQIKTHKLVPLSEQELVDCDT-TQNQGCNGGLMESAFEFIKQY-GITTASNYPY 216
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
K+ C K AV+I G+E +P FQ YS GV
Sbjct: 217 EAKDGTCDASKVNEPAVSIDGHENVPVNNEAALLKAVAHQPVSVAIEAGGIDFQFYSEGV 276
Query: 271 FDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
F CG L+HGV +VGYG G KYW VKNSWG+ WGE GYIRM R S S G+CGI
Sbjct: 277 FTGNCGTALDHGVAIVGYGTTQDGTKYWTVKNSWGSEWGEKGYIRMKR-SISVKKGLCGI 335
Query: 330 LMQASYPVKR 339
M+ASYP+K+
Sbjct: 336 AMEASYPIKK 345
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 205/343 (59%), Gaps = 39/343 (11%)
Query: 27 NAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIY 86
N ++ +FL++ + S + Y + + E W+ Q+ + Y E ++RF I+
Sbjct: 6 NFIIPMFLIFTTWMLPYVMSSRVLEPY----LSNKHEKWMTQFGKSYKDAAEKEKRFQIF 61
Query: 87 SSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNE----PRWPSVQY-- 139
+NV++I+ N+ N F L+ N FADL+NEEF ++ G K +++ S +Y
Sbjct: 62 KNNVEFIELFNAVGNKPFNLSINHFADLTNEEFKASLNGNKKLHDKFDILNETTSFRYHN 121
Query: 140 -LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
+PAS+DWRK GAVTP+K+QG CGSCWAFS VA++EGI+++ TG+LVSLSEQEL+DC
Sbjct: 122 VTSVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLSEQELIDC- 180
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
V + GC+GGY+E AF+FI K GG+ +E +YPY+ +++C+ K H I GYE +P
Sbjct: 181 VRGNSSGCSGGYLEDAFKFIAKKGGMASETNYPYKETDEKCKFKKESKHVAEIKGYEKVP 240
Query: 259 AR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE--DHGE 294
+ Y FQ YS G+F CG +H VT+VGYG D+ E
Sbjct: 241 SNSENDLLKAVANQPVSVYVDAGDYVFQFYSGGIFTGKCGTDTDHVVTIVGYGVSLDYTE 300
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YWLVKNSWGT WGE GY+++ RN S G+CGI SYPV
Sbjct: 301 -YWLVKNSWGTGWGEKGYMKLKRNVDSKK-GLCGIATNPSYPV 341
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 268 bits (685), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 190/308 (61%), Gaps = 34/308 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE+W+ ++ + Y S E +RR I+ N+++I N++NLS++L N+FADLS E+
Sbjct: 56 FESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGEI 115
Query: 122 YLGYNK--PYN------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
G + P N R+ + LP SVDWR EGAVT VKDQG C SCWAFS V
Sbjct: 116 CHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVG 175
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEG+NK+ TG+LV+LSEQ+L++C N EN GC GG +E A+EFI GG+ T++DYPY+
Sbjct: 176 AVEGLNKIVTGELVTLSEQDLINC--NKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYK 233
Query: 234 GKNDRCQTD-KTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
N C+ K + V I GYE +PA FQLY GV
Sbjct: 234 ALNGVCEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGV 293
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
FD CG LNHGV VVGYG ++G YW+VKNS G +WGEAGY++MARN + G+CGI
Sbjct: 294 FDGTCGTNLNHGVVVVGYGTENGRDYWIVKNSRGDTWGEAGYMKMARNIANPR-GLCGIA 352
Query: 331 MQASYPVK 338
M+ASYP+K
Sbjct: 353 MRASYPLK 360
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 135/296 (45%), Positives = 185/296 (62%), Gaps = 35/296 (11%)
Query: 78 EWQRRFGIYSSNVQYIDYINSQNLS----FKLTDNKFADLSNEEFISTYLGYNKPYNE-- 131
E +RRF + N++++D N++ + F+L N+FADL+N+EF + YLG
Sbjct: 69 EEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFRAAYLGVKGAGQRRS 128
Query: 132 ------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGK 185
R+ LP +VDWR++GAV PVK+QGQCGSCWAFSAV+AVE IN+L TG+
Sbjct: 129 ARAGVGERYRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSAVESINQLVTGE 188
Query: 186 LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK 245
LV+LSEQELV+CD+N ++ GCNGG M+ AF+FI GG+ TEDDYPY+ + +C ++
Sbjct: 189 LVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINNGGIDTEDDYPYKALDGKCDINRRN 248
Query: 246 HHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGV 283
V+I G+E +P FQLY GVF CG +L+HGV
Sbjct: 249 AKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFTGRCGTELDHGV 308
Query: 284 TVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
VGYG ++G+ YW+V+NSWG WGEAGY+RM RN ++ G CGI M +SYP K+
Sbjct: 309 VAVGYGTENGKDYWIVRNSWGPKWGEAGYLRMERN-INATTGKCGIAMMSSYPTKK 363
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 135/294 (45%), Positives = 184/294 (62%), Gaps = 33/294 (11%)
Query: 78 EWQRRFGIYSSNVQYIDYINSQNLS----FKLTDNKFADLSNEEFISTYLGYNKPYNEP- 132
E +RRF + N+ ++D N++ + ++L N+FADL+N+EF + YLG P
Sbjct: 73 ERERRFRAFWDNLNFVDAHNARAAAGEEGYRLGMNRFADLTNDEFRAAYLGVKAQRARPG 132
Query: 133 -----RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
R+ LP +VDWR++GAV PVK+QGQCGSCWAFSAV+ VE IN++ TG++V
Sbjct: 133 RMVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTGEMV 192
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
+LSEQELV+CD N ++ GCNGG M+ AFEFI K GG+ TEDDYPY+ + RC +
Sbjct: 193 TLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKAIDGRCDVLRKNAK 252
Query: 248 AVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTV 285
V+I G+E +P FQLY GVF CG QL+HGV
Sbjct: 253 VVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQLDHGVVA 312
Query: 286 VGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
VGYG ++G+ YW+V+NSWG +WGE+GY+RM RN ++ G CGI M +SYP K+
Sbjct: 313 VGYGTENGKDYWIVRNSWGPNWGESGYLRMERNINVTS-GKCGIAMMSSYPTKK 365
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 148/312 (47%), Positives = 190/312 (60%), Gaps = 33/312 (10%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
+ E FE WL ++ + Y S +E RF ++ N++ ID IN + S+ L N+FADL+++E
Sbjct: 40 LVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINREVTSYWLGLNEFADLTHDE 99
Query: 118 FISTYLGYNKPYNEP------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
F +TYLG + P R+ +V LP +VDWRK+GAVT VK+QGQCGSCWAFS
Sbjct: 100 FKTTYLGLSPPPARRSSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFST 159
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
VAAVEGIN + TG L +LSEQEL+DC V+ N GCNGG M+ AF +I GG+ TE+ YP
Sbjct: 160 VAAVEGINAIVTGNLTALSEQELIDCSVDG-NSGCNGGMMDYAFSYIASSGGLHTEEAYP 218
Query: 232 YRGKNDRC-QTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSH 268
Y + C K++ AV+I+GYE +P + FQ YS
Sbjct: 219 YLMEEGSCGDGKKSESEAVSISGYEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSG 278
Query: 269 GVFDEYCGHQLNHGVTVVGYGEDHGE--KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
GVFD CG QL+HGV VGYG D G+ Y +VKNSWG WGE GYIRM R + S G+
Sbjct: 279 GVFDGPCGAQLDHGVAAVGYGSDKGKGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSE-GL 337
Query: 327 CGILMQASYPVK 338
CGI ASYP K
Sbjct: 338 CGINKMASYPTK 349
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 144/306 (47%), Positives = 185/306 (60%), Gaps = 36/306 (11%)
Query: 67 KQYSREYGSEDEWQRRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYLG 124
K S G ++ RF I+ N+++ID N ++N ++KL FA+L+N+E+ S YLG
Sbjct: 13 KSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLG 72
Query: 125 YN----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
K N +V + +P +VDWR++GAV +KDQG CGSCWAFS AA
Sbjct: 73 ARTEPVRRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAA 132
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGINK+ TG+LVSLSEQELVDCD S NQGCNGG M+ AF+FI K GG+ TE DYPY G
Sbjct: 133 VEGINKIVTGELVSLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHG 191
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD 272
N +C + VTI GYE +P++ AFQ Y G+F
Sbjct: 192 TNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFT 251
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG ++H V VGYG ++G YW+V+NSWGT WGE GYIRM RN S + G CGI ++
Sbjct: 252 GKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASKS-GKCGIAIE 310
Query: 333 ASYPVK 338
ASYPVK
Sbjct: 311 ASYPVK 316
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 145/323 (44%), Positives = 187/323 (57%), Gaps = 41/323 (12%)
Query: 53 YDPQSME------ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLT 106
Y P+ +E E FENW+ + + Y + +E RF ++ N+++ID N + S+ L
Sbjct: 36 YSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLG 95
Query: 107 DNKFADLSNEEFISTYLGYN---------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPV 157
N+FADLS+EEF YLG + Y E + V+ +P SVDWRK+GAV V
Sbjct: 96 LNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVE--AVPKSVDWRKKGAVAEV 153
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
K+QG CGSCWAFS VAAVEGINK+ TG L +LSEQEL+DCD + N GCNGG M+ AFE+
Sbjct: 154 KNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT-TYNNGCNGGLMDYAFEY 212
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------- 260
I K GG+ E+DYPY + C+ K + VTI G++ +P
Sbjct: 213 IVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVA 272
Query: 261 -----YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
FQ YS GVFD CG L+HGV VGYG G Y +VKNSWG WGE GYIR+
Sbjct: 273 IDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIRL 332
Query: 316 ARNSPSSNIGICGILMQASYPVK 338
RN+ G+CGI AS+P K
Sbjct: 333 KRNTGKPE-GLCGINKMASFPTK 354
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 148/317 (46%), Positives = 184/317 (58%), Gaps = 39/317 (12%)
Query: 56 QSMEERFENW--LKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S + +E W + SR G + +RF ++ +NV ++ N + +KL NKFAD+
Sbjct: 34 ESFWDLYERWRSYRTVSRSLGDK---HKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADM 90
Query: 114 SNEEFISTYLGYNKPYNE-----PRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCG 164
+N EF STY G ++ PR + +P S DWRK GAVT VKDQGQCG
Sbjct: 91 TNHEFRSTYAGSKVNHHRMFQGTPRGNGTFMYEKVGSVPPSADWRKNGAVTGVKDQGQCG 150
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS V AVEGIN++KT KLVSLSEQELVDCD +N GCNGG ME AFEFI + GG+
Sbjct: 151 SCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDT-KKNAGCNGGLMESAFEFIKQKGGI 209
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE +YPY ++ C K AV+I G+E +PA +
Sbjct: 210 TTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAGGFD 269
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ Y GVF C +LNHGV +VGYG G YW V+NSWG WGE GYIRM R S
Sbjct: 270 FQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTNYWTVRNSWGPEWGEQGYIRMQR-SIF 328
Query: 322 SNIGICGILMQASYPVK 338
G+CGI M ASYP+K
Sbjct: 329 KKEGLCGIAMMASYPIK 345
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 139/292 (47%), Positives = 178/292 (60%), Gaps = 35/292 (11%)
Query: 81 RRFGIYSSNVQYIDY--INSQNLSFKLTDNKFADLSNEEFISTYLGYN----------KP 128
+RF I+ N+++ID N++N ++KL KF DL+N+E+ YLG K
Sbjct: 72 KRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKN 131
Query: 129 YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
N+ +V +P +VDWR++GAV P+KDQG CGSCWAFS AAVEGINK+ TG+L+S
Sbjct: 132 VNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELIS 191
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
LSEQELVDCD S NQGCNGG M+ AF+FI K GG+ TE DYPYRG +C +
Sbjct: 192 LSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRV 250
Query: 249 VTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
V+I GYE +P + FQ Y G+F CG L+H V V
Sbjct: 251 VSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAV 310
Query: 287 GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GYG ++G YW+V+NSWG WGE GYIRM RN +S G CGI ++ASYPVK
Sbjct: 311 GYGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVK 362
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 142/306 (46%), Positives = 182/306 (59%), Gaps = 29/306 (9%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNE 116
M ER E W K+Y + Y E Q+R I+ NV++I+ N+ N +KL+ N D +NE
Sbjct: 36 MSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLSINHLTDQTNE 95
Query: 117 EFISTYLGYNKPYNEPRWPSV--QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
EF++++ GY + + P G+P +VDWR+ GAV +KDQGQCG+CWAFS VA
Sbjct: 96 EFVASHNGYKHKGSHSQTPFKYENITGVPNAVDWRENGAVXAMKDQGQCGNCWAFSTVAT 155
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
EGI ++ T L+SLSEQELVDCD S + GC+GGYME FEFI K GG+++E +YPY
Sbjct: 156 TEGIYQITTSMLMSLSEQELVDCD--SVDHGCDGGYMEGGFEFIXKNGGISSEANYPYTA 213
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD 272
+ +K A I GYE +PA AFQ S GVF
Sbjct: 214 VDGTYDANKEASPAAQIKGYETVPANSEDALQKAVANQPVSVTIDVGGSAFQFNSSGVFT 273
Query: 273 EYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG QL+HGVT VGYG D G +YW+VKNSWGT WGE GYIRM R + + G+CGI M
Sbjct: 274 GQCGTQLDHGVTAVGYGSTDDGTQYWIVKNSWGTQWGEEGYIRMQRGTDAQE-GLCGIAM 332
Query: 332 QASYPV 337
ASYP
Sbjct: 333 DASYPT 338
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 185/307 (60%), Gaps = 33/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE W+ +Y + Y S +E RF ++ N+ +ID N + ++ L N FADL+++EF +T
Sbjct: 66 FEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVTTYWLGLNAFADLTHDEFKAT 125
Query: 122 YLGYNKPYNEP------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
YLG +P + R+ V +PASVDWRK+GAVT VK+QGQCGSCWAFS VAAV
Sbjct: 126 YLGLRQPETKKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQGQCGSCWAFSTVAAV 185
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGIN++ TG L SLSEQELVDC + N GCNGG M+ AF +I GG+ TE+ YPY +
Sbjct: 186 EGINQIVTGNLTSLSEQELVDCSTDG-NNGCNGGVMDNAFSYIASSGGLRTEEAYPYLME 244
Query: 236 NDRCQTDKTK--HHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
C DK + VTI+GYE +PA FQ YS GVF
Sbjct: 245 EGDCD-DKARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEASGRHFQFYSGGVF 303
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
+ CG +L+HGV VGYG G+ Y +VKNSWG+ WGE GYIRM R + G+CGI
Sbjct: 304 NGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGSHWGEKGYIRMKRGTGKPE-GLCGINK 362
Query: 332 QASYPVK 338
ASYP K
Sbjct: 363 MASYPTK 369
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 139/292 (47%), Positives = 178/292 (60%), Gaps = 35/292 (11%)
Query: 81 RRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYLGYN----------KP 128
+RF I+ N+++ID N ++N ++KL KF DL+N+E+ YLG K
Sbjct: 72 KRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKN 131
Query: 129 YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
N+ +V +P +VDWR++GAV P+KDQG CGSCWAFS AAVEGINK+ TG+L+S
Sbjct: 132 VNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELIS 191
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
LSEQELVDCD S NQGCNGG M+ AF+FI K GG+ TE DYPYRG +C +
Sbjct: 192 LSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRV 250
Query: 249 VTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
V+I GYE +P + FQ Y G+F CG L+H V V
Sbjct: 251 VSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAV 310
Query: 287 GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GYG ++G YW+V+NSWG WGE GYIRM RN +S G CGI ++ASYPVK
Sbjct: 311 GYGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVK 362
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 144/305 (47%), Positives = 187/305 (61%), Gaps = 29/305 (9%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE+W+ ++ + Y S +E RF I+ N+ +ID N + +++ L N+F+DLS+EEF +
Sbjct: 33 FESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKKVVNYWLGLNEFSDLSHEEFKNK 92
Query: 122 YLGYNKPYNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
YLG +E R S ++ + +P SVDWRK+GAVT VK+QG CGSCWAFS VAAVE
Sbjct: 93 YLGLKVDMSERRECSQEFNYKDVMSIPKSVDWRKKGAVTDVKNQGSCGSCWAFSTVAAVE 152
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
GIN++ TG L SLSEQELVDCD + N GCNGG M+ AF +I GG+ E DYPY +
Sbjct: 153 GINQIVTGNLTSLSEQELVDCDT-TNNYGCNGGLMDYAFSYIISNGGLHKEVDYPYIMEE 211
Query: 237 DRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVFDEY 274
C+ K + VTI+GY +P FQ YS GVFD +
Sbjct: 212 GTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRDFQFYSGGVFDGH 271
Query: 275 CGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQAS 334
CG QL+HGV VGYG +G Y +VKNSWG+ WGE GYIRM RN+ G+CGI AS
Sbjct: 272 CGTQLDHGVAAVGYGSTNGLDYIIVKNSWGSKWGEKGYIRMKRNTGKP-AGLCGINKMAS 330
Query: 335 YPVKR 339
YP K+
Sbjct: 331 YPTKK 335
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 139/292 (47%), Positives = 178/292 (60%), Gaps = 35/292 (11%)
Query: 81 RRFGIYSSNVQYIDY--INSQNLSFKLTDNKFADLSNEEFISTYLGYN----------KP 128
+RF I+ N+++ID N++N ++KL KF DL+N+E+ YLG K
Sbjct: 72 KRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKAKN 131
Query: 129 YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
N+ +V +P +VDWR++GAV P+KDQG CGSCWAFS AAVEGINK+ TG+L+S
Sbjct: 132 VNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSCWAFSTTAAVEGINKIVTGELIS 191
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
LSEQELVDCD S NQGCNGG M+ AF+FI K GG+ TE DYPYRG +C +
Sbjct: 192 LSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYRGFGGKCNSFLKNSRV 250
Query: 249 VTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
V+I GYE +P + FQ Y G+F CG L+H V V
Sbjct: 251 VSIDGYEDVPTKDETALKKAISYQPVRVAIEAGGRIFQHYQSGIFTGSCGTNLDHAVVAV 310
Query: 287 GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GYG ++G YW+V+NSWG WGE GYIRM RN +S G CGI ++ASYPVK
Sbjct: 311 GYGSENGVDYWIVRNSWGPRWGEEGYIRMERNLAASKSGKCGIAVEASYPVK 362
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 267 bits (682), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 147/315 (46%), Positives = 189/315 (60%), Gaps = 31/315 (9%)
Query: 54 DPQSME---ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
D +SM+ E FE+W+ ++ + Y + +E RF I+ N+++ID N ++ L N+F
Sbjct: 37 DLKSMDKLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEF 96
Query: 111 ADLSNEEFISTYLGYNKPYNEPRWPSVQY----LGLPASVDWRKEGAVTPVKDQGQCGSC 166
ADLS+ EF + YLG Y+ R ++ + LP SVDWRK+GAV PVK+QG CGSC
Sbjct: 97 ADLSHREFNNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSC 156
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS VAAVEGIN++ TG L SLSEQEL+DCD + N GCNGG M+ AF FI + GG+
Sbjct: 157 WAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENGGLHK 215
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQ 264
E+DYPY + C+ K + VTI+GY +P + FQ
Sbjct: 216 EEDYPYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQ 275
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GVFD +CG L+HGV VGYG G Y VKNSWG+ WGE GYIRM RN
Sbjct: 276 FYSGGVFDGHCGSDLDHGVAAVGYGTAKGVDYITVKNSWGSKWGEKGYIRMRRNIGKPE- 334
Query: 325 GICGILMQASYPVKR 339
GICGI ASYP K+
Sbjct: 335 GICGIYKMASYPTKK 349
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 144/306 (47%), Positives = 184/306 (60%), Gaps = 36/306 (11%)
Query: 67 KQYSREYGSEDEWQRRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYLG 124
K S G ++ RF I+ N+++ID N ++N ++KL FA+L+N+E+ S YLG
Sbjct: 13 KSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDEYRSLYLG 72
Query: 125 YN----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
K N +V +P +VDWR++GAV +KDQG CGSCWAFS AA
Sbjct: 73 ARTEPVRRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGSCWAFSTAAA 132
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEGINK+ TG+LVSLSEQELVDCD S NQGCNGG M+ AF+FI K GG+ TE DYPY G
Sbjct: 133 VEGINKIVTGELVSLSEQELVDCD-KSYNQGCNGGLMDYAFQFIMKNGGLNTEKDYPYHG 191
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD 272
N +C + VTI GYE +P++ AFQ Y G+F
Sbjct: 192 TNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQHYQSGIFT 251
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG ++H V VGYG ++G YW+V+NSWGT WGE GYIRM RN S + G CGI ++
Sbjct: 252 GKCGTNMDHAVVAVGYGSENGVDYWIVRNSWGTRWGEDGYIRMERNVASKS-GKCGIAIE 310
Query: 333 ASYPVK 338
ASYPVK
Sbjct: 311 ASYPVK 316
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 145/320 (45%), Positives = 191/320 (59%), Gaps = 35/320 (10%)
Query: 53 YDPQSMEER------FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLT 106
Y P+ + R FE+W+ ++ + Y S +E RF I+ N+ +ID N + +++ L
Sbjct: 18 YAPEDLTSRDRIIDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKKVVNYWLG 77
Query: 107 DNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQG 161
N+FADLS+EEF + YLG N + R S ++ +P SVDWRK+GAVT VK+QG
Sbjct: 78 LNEFADLSHEEFKNKYLGLNVDLSNRRECSEEFTYKDVSSIPKSVDWRKKGAVTDVKNQG 137
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
CGSCWAFS VAAVEGIN++ TG L SLSEQELVDCD + N GCNGG M+ AF +I
Sbjct: 138 SCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDT-TYNNGCNGGLMDYAFAYIISN 196
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA------------------- 262
GG+ E+DYPY + C+ K + VTI+GY +P
Sbjct: 197 GGLHKEEDYPYIMEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDAS 256
Query: 263 ---FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQ YS GVFD +CG +L+HGV VGYG G + +VKNSWG+ WGE G+IRM RN+
Sbjct: 257 GRDFQFYSGGVFDGHCGTELDHGVAAVGYGSAKGLDFIVVKNSWGSKWGEKGFIRMKRNT 316
Query: 320 PSSNIGICGILMQASYPVKR 339
G+CGI ASYP K+
Sbjct: 317 GKP-AGLCGINKMASYPTKK 335
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 140/310 (45%), Positives = 182/310 (58%), Gaps = 34/310 (10%)
Query: 62 FENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNLS--FKLTDNKFADLSNEEF 118
+E W+ ++ + + E RRF + N++++D N++ + ++L N+FADL+N EF
Sbjct: 52 YEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYRLGINRFADLTNAEF 111
Query: 119 ISTYL------GYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
+ YL G R+ LP VDWR++GAV PVK+QGQCGSCWAFSAV
Sbjct: 112 RAAYLSAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVAPVKNQGQCGSCWAFSAV 171
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AVEGIN++ TG+LV+LSEQELVDC N +N GC+GG M+ AF FI GG+ T+ DYPY
Sbjct: 172 GAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAFAFIVGNGGIDTDKDYPY 231
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGV 270
++ +C K H V+I G+E +P FQLY GV
Sbjct: 232 TARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVAVAIEAGGREFQLYQSGV 291
Query: 271 FDEYCGHQLNHGVTVVGYG--EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
F CG L+HGV VGYG D G YWLV+NSWG WGE GYIRM RN + G CG
Sbjct: 292 FTGRCGTSLDHGVVAVGYGTEADGGRDYWLVRNSWGADWGEGGYIRMERNV-GARAGKCG 350
Query: 329 ILMQASYPVK 338
I M+ASYPVK
Sbjct: 351 IAMEASYPVK 360
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 266 bits (681), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 193/337 (57%), Gaps = 58/337 (17%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSN 115
+M E F+ W +Y+R Y + +E +RR +Y+ NV+YI+ N+ L+++L + + DL+N
Sbjct: 47 TMMEMFQRWKAEYNRSYATPEEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTN 106
Query: 116 EEFISTYLG---------------------YNKPYNEPRWPSVQY---LGLPASVDWRKE 151
+EF++ Y P +E + P V + G PASVDWR
Sbjct: 107 DEFMAMYTAPPLRSAADDDDDAATTTIITTRAGPVDEHQQPEVYFNESAGAPASVDWRAS 166
Query: 152 GAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYM 211
GAVT VKDQG+CGSCWAFS VA VEGI K+K GKLVSLSEQELVDCD + + GC+GG
Sbjct: 167 GAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCD--TLDSGCDGGVS 224
Query: 212 EKAFEFITKIGGVTTEDDYPYRG-KNDRCQTDKTKHHAVTITGYEAIPARYA-------- 262
+A E+IT GG+TT DDYPY G C K HHA TI G + R
Sbjct: 225 YRALEWITANGGITTRDDYPYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNAAA 284
Query: 263 --------------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDH--------GEKYWLVK 300
FQ Y GV+D CG +LNHGVTVVGYG++ G+KYW++K
Sbjct: 285 AQPVAVSIEAGGDNFQHYRKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIK 344
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWG +WG+ GYI+M ++ G+CGI ++ S+P+
Sbjct: 345 NSWGKNWGDQGYIKMKKDVAGKPEGLCGIAIRPSFPL 381
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 266 bits (681), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 144/348 (41%), Positives = 202/348 (58%), Gaps = 40/348 (11%)
Query: 29 VLSLFLLWVLGIPAGA-------WSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQR 81
+S+ L+ + + A + E + + + +E+WL ++ + Y + E +
Sbjct: 9 TISILLMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNALGEKDK 68
Query: 82 RFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP---SV 137
RF I+ N++YID NS N S+KL KFADL+NEE+ S YLG + + S
Sbjct: 69 RFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLSKNKSD 128
Query: 138 QYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
+YL LP S+DWR++G + VKDQG CGSCWAFSAVAA+E IN + TG L+SLSEQ
Sbjct: 129 RYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQ 188
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
ELVDCD S N+GC+GG M+ AFEF+ K GG+ TE+DYPY+ +N C + V I
Sbjct: 189 ELVDCD-RSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKID 247
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
YE +P FQ Y G+F CG ++HGV + GYG
Sbjct: 248 SYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT 307
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
++G YW+V+NSWG +WGE GY+R+ RN SS+ G+CG+ ++ SYPVK
Sbjct: 308 ENGMDYWIVRNSWGANWGENGYLRVQRNVASSS-GLCGLAIEPSYPVK 354
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 146/311 (46%), Positives = 182/311 (58%), Gaps = 35/311 (11%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
E FE W+ +Y + Y S +E RRF ++ N+ +ID IN + S+ L N+FADL+++EF
Sbjct: 49 ELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTSYWLGLNEFADLTHDEFK 108
Query: 120 STYLGYNKP----------YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
+TYLG P E R+ + +P +DWRK+ AVT VK+QGQCGSCWAF
Sbjct: 109 ATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAF 168
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
S VAAVEGIN + TG L SLSEQEL+DC + N GCNGG M+ AF +I GG+ TE+
Sbjct: 169 STVAAVEGINAIVTGNLTSLSEQELIDCSTDG-NNGCNGGLMDYAFSYIASTGGLRTEEA 227
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYS 267
YPY + C K VTI+GYE +PA FQ YS
Sbjct: 228 YPYAMEEGDCDEGKGA-AVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYS 286
Query: 268 HGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVFD CG QL+HGVT VGYG G+ Y +VKNSWG WGE GYIRM R + G+C
Sbjct: 287 GGVFDGPCGEQLDHGVTAVGYGTSKGQDYIIVKNSWGPHWGEKGYIRMKRGTGKGE-GLC 345
Query: 328 GILMQASYPVK 338
GI ASYP K
Sbjct: 346 GINKMASYPTK 356
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 147/316 (46%), Positives = 190/316 (60%), Gaps = 36/316 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+S+ + +E W ++ S DE RF ++ +NV ++ N + +KL NKFAD++N
Sbjct: 34 KSLWDLYERWRSHHTVTR-SLDEKHNRFNVFKANVMHVHNTNKLDKPYKLKLNKFADMTN 92
Query: 116 EEFISTY----LGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSC 166
EF Y + +++ + + ++ +P+S+DWRK+GAVT VKDQGQCGSC
Sbjct: 93 YEFRRIYADSKVSHHRMFRGMSNENGTFMYENVKNVPSSIDWRKKGAVTDVKDQGQCGSC 152
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS + AVEGIN++KT KLVSLSEQELVDCD N+GCNGG ME AFEFI K G+TT
Sbjct: 153 WAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGG-NEGCNGGLMEYAFEFI-KQNGITT 210
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E +YPY K+ C K V+I GYE +P Y FQ
Sbjct: 211 ESNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAIDAGGYNFQ 270
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGE-KYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GVF +CG LNHGV VVGYG KYW+VKNSWG+ WGE GYIRM R S
Sbjct: 271 FYSEGVFSGHCGTDLNHGVAVVGYGVTQDRTKYWIVKNSWGSEWGEQGYIRMQR-GISHK 329
Query: 324 IGICGILMQASYPVKR 339
G+CGI M+ASYP+K+
Sbjct: 330 EGLCGIAMEASYPIKK 345
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 147/315 (46%), Positives = 190/315 (60%), Gaps = 39/315 (12%)
Query: 62 FENWLKQYSREYGSED----EWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLS 114
++ W+ ++ GS + E++RRF ++ N++++D N+ ++ F+L N+FADL+
Sbjct: 66 YDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGMNRFADLT 125
Query: 115 NEEFISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAV-TPVKDQGQCGSCWA 168
N+EF + YLG P R Y LP SVDWR +GAV +PVK+QGQCGSCWA
Sbjct: 126 NDEFRAAYLG-TTPAGRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWA 184
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSAVAAVEGINK+ TG+LVSLSEQELV+C N N GCNGG M+ AF FIT+ GG+ TE+
Sbjct: 185 FSAVAAVEGINKIVTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFITRNGGLDTEE 244
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
DYPY + +C K V+I G+E +P FQLY
Sbjct: 245 DYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLY 304
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
GVF CG L+HGV VGYG D G YW V+NSWG WGE GYIRM RN ++
Sbjct: 305 DSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNV-TART 363
Query: 325 GICGILMQASYPVKR 339
G CGI M ASYP+K+
Sbjct: 364 GKCGIAMMASYPIKK 378
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 152/348 (43%), Positives = 202/348 (58%), Gaps = 36/348 (10%)
Query: 24 MLRNAVLSLFL-LWVLGIPAGAWS-EGYPQKY--DPQSMEERFENWLKQYSREYGSEDEW 79
+L+ + L+ F L+V + A +S GY ++ + E FE+W+ + + Y S +E
Sbjct: 5 VLKTSFLTFFASLFVCSVLAHDFSIVGYSPEHLTSVDKLVELFESWISGHGKAYNSLEEK 64
Query: 80 QRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ- 138
RF ++ N+++ID N + S+ L N+FADLS+EEF S +LG + PR S +
Sbjct: 65 LHRFEVFKENLKHIDQRNKEVTSYWLGLNEFADLSHEEFKSKFLGLYPEF--PRKKSSED 122
Query: 139 -----YLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
+ LP S+DWRK+GAVTPVK+QG CGSCWAFS VAAVEGIN++ G L SLSEQ+
Sbjct: 123 FSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNLTSLSEQQ 182
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
L+DCD S N GCNGG M+ AFEFI GG+ E+DYPY + C + + VTI+G
Sbjct: 183 LIDCDT-SFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYLMEEGTCDEKREEMEVVTISG 241
Query: 254 YEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED 291
Y +P + FQ YS GVF CG L+HGV VGYG
Sbjct: 242 YHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYSGGVFSGPCGTDLDHGVAAVGYGSS 301
Query: 292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
G Y +VKNSWG WGE GY+RM RN+ G+CGI ASYP K+
Sbjct: 302 SGIDYIIVKNSWGPKWGERGYLRMKRNTGKPE-GLCGINKMASYPTKQ 348
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 144/308 (46%), Positives = 186/308 (60%), Gaps = 34/308 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFIS 120
FE W +Q+ + Y S++E R ++ N ++ NSQ N S+ L+ N FADL++ EF +
Sbjct: 30 FETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHHEFKA 89
Query: 121 TYLGY--------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
+ LG N + + P +PASVDWRK GAVT VKDQG CG+CW+FSA
Sbjct: 90 SRLGLSSAASASLNVDRSNRQIPDF-VADVPASVDWRKNGAVTQVKDQGNCGACWSFSAT 148
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
A+EGINK+ TG LVSLSEQELVDCD S N GC GG M+ AF+F+ G+ TE+DYPY
Sbjct: 149 GAIEGINKIVTGSLVSLSEQELVDCD-KSYNNGCEGGIMDYAFQFVIDNHGIDTEEDYPY 207
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGV 270
+G++ C +K K H VTI GY +P + AFQLYS G+
Sbjct: 208 QGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYSKGI 267
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
F C L+H V +VGYG ++G YW+VKNSWG+ WG GY+ M RNS SS G+CGI
Sbjct: 268 FTGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSR-GLCGIN 326
Query: 331 MQASYPVK 338
M ASYP K
Sbjct: 327 MLASYPKK 334
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 188/315 (59%), Gaps = 39/315 (12%)
Query: 62 FENWLKQYSREYGSED----EWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLS 114
++ W+ ++ GS + E++RRF ++ N++++D N+ ++ F+L N+FADL+
Sbjct: 65 YDLWVARHRHGGGSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLT 124
Query: 115 NEEFISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAV-TPVKDQGQCGSCWA 168
N+EF + YLG P R Y LP SVDWR +GAV PVK+QGQCGSCWA
Sbjct: 125 NDEFRAAYLG-TTPAGRGRHVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWA 183
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSAVAAVEGINK+ TG+LVSLSEQELV+C N N GCNGG M+ AF FI + GG+ TE+
Sbjct: 184 FSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLDTEE 243
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
DYPY + +C K V+I G+E +P FQLY
Sbjct: 244 DYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLY 303
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
GVF CG L+HGV VGYG D G YW V+NSWG WGE GYIRM RN ++
Sbjct: 304 DSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNV-TART 362
Query: 325 GICGILMQASYPVKR 339
G CGI M ASYP+K+
Sbjct: 363 GKCGIAMMASYPIKK 377
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 145/314 (46%), Positives = 195/314 (62%), Gaps = 40/314 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNE 116
+ E F++W +++ + YGSE+E Q+R I+ N ++ N N ++ L+ N FADL++
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 117 EFISTYLGYNKPYNEPRWPSV------QYLG----LPASVDWRKEGAVTPVKDQGQCGSC 166
EF ++ LG + PSV Q LG +P SVDWRK+GAVT VKDQG CG+C
Sbjct: 88 EFKASRLGLSVSA-----PSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGAC 142
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
W+FSA A+EGIN++ TG L+SLSEQEL+DCD S N GCNGG M+ AFEF+ K G+ T
Sbjct: 143 WSFSATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHGIDT 201
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGY------------EAIPAR----------YAFQ 264
E DYPY+ ++ C+ DK K VTI Y EA+ A+ AFQ
Sbjct: 202 EKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQ 261
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
LYS G+F C L+H V +VGYG +G YW+VKNSWG SWG G++ M RN+ +S+
Sbjct: 262 LYSRGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSD- 320
Query: 325 GICGILMQASYPVK 338
G+CGI M ASYP+K
Sbjct: 321 GVCGINMLASYPIK 334
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 266 bits (680), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 208/351 (59%), Gaps = 49/351 (13%)
Query: 29 VLSLF-LLWVLGIPAGAWSEGYPQKYDPQSMEER---FENWLKQY--SREYGSEDEWQRR 82
+ SLF +L VL + G+ ++ D +S + +E W + SR+ D+ Q+R
Sbjct: 1 MASLFPVLLVLALAFGSTLSIPIKEKDLESEDSLWSLYERWRSHHAVSRDL---DQKQKR 57
Query: 83 FGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG 141
F ++ NV++I N +++++FKL NKF D++N+EF + Y G ++ S G
Sbjct: 58 FNVFKENVKFIHEFNKNKDVTFKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRHGSG 117
Query: 142 -----------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
P S+DWR+ GAV VK+QGQCGSCWAFSA+AAVEGIN++ T +LV LS
Sbjct: 118 SGAKFMYENAVAPPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKELVPLS 177
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQEL+DCD + +NQGC+GG M+ AFEFI GG+TTED YPY+ ++ C K AV
Sbjct: 178 EQELIDCDTD-QNQGCSGGLMDYAFEFIKNNGGITTEDVYPYQAEDATC---KKNSPAVV 233
Query: 251 ITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
I GYE +P Y FQ YS GVF CG +L+HGV VVGY
Sbjct: 234 IDGYEDVPTNDEDALMKAVANQPVAVAIEASGYVFQFYSEGVFTGRCGTELDHGVAVVGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G G KYW V+NSWG WGE+GY+RM R +++ G+CGI MQASYP+K
Sbjct: 294 GTTQDGTKYWTVRNSWGADWGESGYVRMQRGIKATH-GLCGIAMQASYPIK 343
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 151/349 (43%), Positives = 202/349 (57%), Gaps = 36/349 (10%)
Query: 22 RMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSME---ERFENWLKQYSREYGSEDE 78
+ ++ L LFL G GY + D +SM+ E FE+W+ ++ + Y + +E
Sbjct: 7 KTLVLTCSLCLFLSLAFGRDFSIV--GYSSE-DLKSMDKLIELFESWMSRHGKIYETIEE 63
Query: 79 WQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ 138
RF ++ N+++ID N ++ L N+FADLS++EF + YLG ++ R S +
Sbjct: 64 KLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFKNKYLGLKVDLSQRRESSNE 123
Query: 139 Y------LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
+ LP SVDWRK+GAVTPVK+QGQCGSCWAFS VAAVEGIN++ TG L SLSEQ
Sbjct: 124 EEFTYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQ 183
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
EL+DCD + N GCNGG M+ AF FI + GG+ E+DYPY + C+ K + VTI
Sbjct: 184 ELIDCDT-TYNNGCNGGLMDYAFSFIGQNGGLHKEEDYPYIMEESTCEMKKEETQVVTIN 242
Query: 253 GYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
GY +P + FQ YS GVFD +CG L+HGV+ VGYG
Sbjct: 243 GYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGGVFDGHCGSDLDHGVSAVGYGT 302
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
Y +VKNSWG WGE G+IRM R+ GICG+ ASYP K+
Sbjct: 303 SKNLDYIIVKNSWGAKWGEKGFIRMKRDIGKPE-GICGLYKMASYPTKK 350
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 145/314 (46%), Positives = 195/314 (62%), Gaps = 40/314 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNE 116
+ E F++W +++ + YGSE+E Q+R I+ N ++ N N ++ L+ N FADL++
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 117 EFISTYLGYNKPYNEPRWPSV------QYLG----LPASVDWRKEGAVTPVKDQGQCGSC 166
EF ++ LG + PSV Q LG +P SVDWRK+GAVT VKDQG CG+C
Sbjct: 88 EFKASRLGLSVSA-----PSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGAC 142
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
W+FSA A+EGIN++ TG L+SLSEQEL+DCD S N GCNGG M+ AFEF+ K G+ T
Sbjct: 143 WSFSATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHGIDT 201
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGY------------EAIPAR----------YAFQ 264
E DYPY+ ++ C+ DK K VTI Y EA+ A+ AFQ
Sbjct: 202 EKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQ 261
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
LYS G+F C L+H V +VGYG +G YW+VKNSWG SWG G++ M RN+ +S+
Sbjct: 262 LYSSGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSD- 320
Query: 325 GICGILMQASYPVK 338
G+CGI M ASYP+K
Sbjct: 321 GVCGINMLASYPIK 334
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 191/317 (60%), Gaps = 40/317 (12%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+++ + +E W + + R +G E RRFG + NV+YI N + + N+F D+
Sbjct: 40 EALWDLYERWQEHHHVPRHHG---EKHRRFGAFKDNVRYIHEHNKRAPGYAPL-NRFGDM 95
Query: 114 SNEEFISTYLGYN------KPYNEPRWPSVQYLG---LPASVDWRKEGAVTPVKDQGQCG 164
EEF +T+ G + P P Y G LP +VDWR++GAVT VKDQG+CG
Sbjct: 96 GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 155
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS V +VEGIN ++TG+LVSLSEQEL+DCD ++N GC GG ME AFE+I GG+
Sbjct: 156 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDT-ADNSGCQGGLMENAFEYIKHSGGI 214
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE YPYR N C + + V I G++ +PA +
Sbjct: 215 TTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQS 274
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF CG L+HGV VVGYGE + G +YW+VKNSWGT+WGE GYIRM R+S
Sbjct: 275 FQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDS-G 333
Query: 322 SNIGICGILMQASYPVK 338
+ G+CGI M+ASYPVK
Sbjct: 334 YDGGLCGIAMEASYPVK 350
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 191/317 (60%), Gaps = 40/317 (12%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+++ + +E W + + R +G E RRFG + NV+YI N + + N+F D+
Sbjct: 40 EALWDLYERWQEHHHVPRHHG---EKHRRFGAFKDNVRYIHEHNKRAPGYPPL-NRFGDM 95
Query: 114 SNEEFISTYLGYN------KPYNEPRWPSVQYLG---LPASVDWRKEGAVTPVKDQGQCG 164
EEF +T+ G + P P Y G LP +VDWR++GAVT VKDQG+CG
Sbjct: 96 GREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCG 155
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS V +VEGIN ++TG+LVSLSEQEL+DCD ++N GC GG ME AFE+I GG+
Sbjct: 156 SCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDT-ADNSGCQGGLMENAFEYIKHSGGI 214
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE YPYR N C + + V I G++ +PA +
Sbjct: 215 TTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQS 274
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF CG L+HGV VVGYGE + G +YW+VKNSWGT+WGE GYIRM R+S
Sbjct: 275 FQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDS-G 333
Query: 322 SNIGICGILMQASYPVK 338
+ G+CGI M+ASYPVK
Sbjct: 334 YDGGLCGIAMEASYPVK 350
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 147/319 (46%), Positives = 193/319 (60%), Gaps = 41/319 (12%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFAD 112
+++ + +E W + + R +G E RRFG + NV+YI N + ++L N+F D
Sbjct: 40 EALWDLYERWQEHHHVPRHHG---EKHRRFGAFKDNVRYIHEHNKRGGRGYRLRLNRFGD 96
Query: 113 LSNEEFISTYLGYN------KPYNEPRWPSVQYLG---LPASVDWRKEGAVTPVKDQGQC 163
+ EEF +T+ G + P P Y G LP +VDWR++GAVT VKDQG+C
Sbjct: 97 MGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKC 156
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFS V +VEGIN ++TG+LVSLSEQEL+DCD ++N GC GG ME AFE+I GG
Sbjct: 157 GSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDT-ADNSGCQGGLMENAFEYIKHSGG 215
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHA-VTITGYEAIPAR---------------------- 260
+TTE YPYR N C + + V I G++ +PA
Sbjct: 216 ITTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGD 275
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
+FQ YS GVF CG L+HGV VVGYGE + G +YW+VKNSWGT+WGE GYIRM R+S
Sbjct: 276 QSFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDGTEYWIVKNSWGTAWGEGGYIRMQRDS 335
Query: 320 PSSNIGICGILMQASYPVK 338
+ G+CGI M+ASYPVK
Sbjct: 336 -GYDGGLCGIAMEASYPVK 353
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/354 (41%), Positives = 204/354 (57%), Gaps = 38/354 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYP-QKYDPQSME------ERFENWLKQYSREY 73
M + + SLFL++V + A + + Y P+ + FE+WL ++S+ Y
Sbjct: 1 MAFIFSSKKTSLFLVFVSVLACSALANEFSILGYAPEDLTSIHKVIHLFESWLAKHSKIY 60
Query: 74 GSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR 133
S DE RF I+ N+++ID N + ++ L N+FADL++EEF + +LG E +
Sbjct: 61 ESLDEKLHRFEIFMDNLKHIDDTNKKVSNYWLGLNEFADLTHEEFKNKFLGLKGELPERK 120
Query: 134 WPSVQ------YLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
S++ ++ LP SVDWRK+GAV PVK+QGQCGSCWAFS VAAVEGIN++ TG L
Sbjct: 121 DESIEEFSYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 180
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
LSEQEL+DCD + N GCNGG M+ AF ++ + G+ E++YPY C K
Sbjct: 181 MLSEQELIDCDT-TFNNGCNGGLMDYAFAYVMR-SGLHKEEEYPYIMSEGTCDEKKDVSE 238
Query: 248 AVTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTV 285
VTI+GY +P + FQ YS GVFD +CG +L+HGV
Sbjct: 239 TVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAA 298
Query: 286 VGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
VGYG G Y +V+NSWG WGE GYIRM R + + G+CG+ M ASYP K+
Sbjct: 299 VGYGTTKGLDYVIVRNSWGPKWGEKGYIRMKRKTGKPH-GMCGLYMMASYPTKQ 351
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 194/339 (57%), Gaps = 38/339 (11%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYS 87
A+L LF W + + SM ER E W+ Q+ + Y E + R+ I+
Sbjct: 13 ALLLLFGFWAF--------SANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQ 64
Query: 88 SNVQYID-YINSQNLSFKLTDNKFADLSNEEF--ISTYLGY--NKPYNEPRWPSVQYLGL 142
NV+ I+ + N+ N S KL N+FADL+ EEF I+ GY +K + +
Sbjct: 65 QNVKGIEGFNNAGNKSHKLGVNQFADLTEEEFKAINKLKGYMWSKISRTSTFKYEHVTKV 124
Query: 143 PASVDWRKEGAVTPVKDQG-QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
PA++DWR++GAVTP+K QG +CGSCWAF+AVAA EGI KL TG+L+SLSEQEL+DCD N
Sbjct: 125 PATLDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNG 184
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
+N GC G +++AF+FI + G+ TE YPY+ + C H +I GYE +PA
Sbjct: 185 DNGGCKWGIIQEAFKFIVQNKGLATEASYPYQAVDGTCNAKVESKHVASIKGYEDVPANN 244
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWL 298
Y F+ YS GV CG +H VTVVGYG D G KYWL
Sbjct: 245 ETALLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDGTKYWL 304
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+KNSWG WGE GYIR+ R+ + G+CGI MQASYP+
Sbjct: 305 IKNSWGVYWGEQGYIRIKRDVAAKE-GMCGIAMQASYPI 342
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/296 (48%), Positives = 177/296 (59%), Gaps = 39/296 (13%)
Query: 78 EWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTY-------LGYNKPY 129
E RRFG + NV++I N + + ++L+ N+F D+ EEF ST+ L +
Sbjct: 57 EKGRRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDMGREEFRSTFADSRINDLRRAESP 116
Query: 130 NEPRWPSVQYLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKL 186
P P Y G LP SVDWRKEGAVT VKDQG CGSCWAFS V +VEGIN ++TG L
Sbjct: 117 AAPAVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRTGSL 176
Query: 187 VSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKH 246
VSLSEQEL+DCD ++ GC GG ME AFEFI GGVTTE YPYR N C + +++
Sbjct: 177 VSLSEQELIDCD--TDENGCQGGLMENAFEFIKSYGGVTTESAYPYRASNGTCDSVRSRR 234
Query: 247 -HAVTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGV 283
V+I G++ +P AFQ YS GVF CG L+HGV
Sbjct: 235 GQIVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGV 294
Query: 284 TVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VGYG D G YW+VKNSWG SWGE GYIRM R + N G+CGI M+AS+P+K
Sbjct: 295 AAVGYGVSDDGTAYWIVKNSWGPSWGEGGYIRMQRG--AGNGGLCGIAMEASFPIK 348
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 145/296 (48%), Positives = 184/296 (62%), Gaps = 37/296 (12%)
Query: 77 DEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP 135
DE RRF ++ NV++I N ++ +KL NKF D++N+EF S Y G ++ +
Sbjct: 54 DEKNRRFNVFKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRG 113
Query: 136 SVQYLG---------LPA-SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGK 185
+ G LPA S+DWR +GAVT VKDQGQCGSCWAFS +A+VEGIN++KTG+
Sbjct: 114 IQKNTGSFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGE 173
Query: 186 LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK 245
LVSLSEQELVDCD S N+GCNGG M+ AFEFI K G+TTED YPY ++ C ++
Sbjct: 174 LVSLSEQELVDCDT-SYNEGCNGGLMDYAFEFIQK-NGITTEDSYPYAEQDGTCASNLLN 231
Query: 246 HHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGV 283
V+I G++ +PA Y FQ YS GVF CG +L+HGV
Sbjct: 232 SPVVSIDGHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGV 291
Query: 284 TVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+VGYG G KYW+VKNSWG WGE+GYIRM R S G CGI M+ASYP+K
Sbjct: 292 AIVGYGATRDGTKYWIVKNSWGEEWGESGYIRMQR-GISDKRGKCGIAMEASYPIK 346
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/315 (46%), Positives = 189/315 (60%), Gaps = 31/315 (9%)
Query: 54 DPQSME---ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
D +SM+ E FE+W+ ++ + Y S +E RF I+ N+++ID N ++ L N+F
Sbjct: 37 DLKSMDKLIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVSNYWLGLNEF 96
Query: 111 ADLSNEEFISTYLGYNKPYNEPRWPSVQY----LGLPASVDWRKEGAVTPVKDQGQCGSC 166
ADLS++EF + YLG Y+ R ++ + LP SVDWRK+GAVT VK+QG CGSC
Sbjct: 97 ADLSHQEFKNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVTQVKNQGSCGSC 156
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS VAAVEGIN++ TG L SLSEQEL+DCD + N GCNGG M+ AF FI + G+
Sbjct: 157 WAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENDGLHK 215
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQ 264
E+DYPY + C+ K + VTI+GY +P + FQ
Sbjct: 216 EEDYPYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQ 275
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GVFD +CG L+HGV VGYG G Y VKNSWG+ WGE GYIRM RN
Sbjct: 276 FYSGGVFDGHCGSDLDHGVAAVGYGTAKGVDYITVKNSWGSKWGEKGYIRMRRNIGKPE- 334
Query: 325 GICGILMQASYPVKR 339
GICGI ASYP K+
Sbjct: 335 GICGIYKMASYPTKK 349
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 188/310 (60%), Gaps = 36/310 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E W +S S E +RF ++ NV ++ N +N +KL N+FAD+++ EF S+
Sbjct: 38 YERWRGHHSVSRASH-EAIKRFNVFRHNVLHVHRTNKKNKPYKLKINRFADITHHEFRSS 96
Query: 122 YLGYNKPYNE----PRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
Y G N ++ P+ S ++ +P+SVDWR++GAVT VK+Q CGSCWAFS V
Sbjct: 97 YAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTV 156
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AAVEGINK++T KLVSLSEQELVDCD ENQGC GG ME AFEFI GG+ TE+ YPY
Sbjct: 157 AAVEGINKIRTNKLVSLSEQELVDCDT-EENQGCAGGLMEPAFEFIKNNGGIKTEETYPY 215
Query: 233 RGKNDR-CQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHG 269
+ + C+ + VTI G+E +P FQLYS G
Sbjct: 216 DSSDVQFCRANSIGGETVTIDGHEHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEG 275
Query: 270 VFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
VF CG QLNHGV +VGYGE +G KYW+V+NSWG WGE GY+R+ R S N G CG
Sbjct: 276 VFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIER-GISENEGRCG 334
Query: 329 ILMQASYPVK 338
I M+ASYP K
Sbjct: 335 IAMEASYPTK 344
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 180/313 (57%), Gaps = 35/313 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-SFKLTDNKFADLSNE 116
M R E W+ ++ R Y E E RR I+ +N ++ID N S +L N+FADL++E
Sbjct: 43 MVSRHEKWMAEHGRTYTDEAEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDE 102
Query: 117 EFISTYLGYNKPYNEP---------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
EF + G+ R+ + SVDWR GAVT VKDQG+CG CW
Sbjct: 103 EFRAARTGFRPRPAPAAAAGSGGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCW 162
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAAVEG+NK++TG+LVSLSEQELVDCDVN E+QGC GG M+ AF+FI + GG+ +E
Sbjct: 163 AFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASE 222
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
YPY+G + C++ A +I G+E +P YAF+
Sbjct: 223 SGYPYQGDDGSCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRF 282
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GV CG LNH +T VGYG G KYWL+KNSWGTSWGE GY+R+ R
Sbjct: 283 YDSGVLGGECGTDLNHAITAVGYGTAADGSKYWLMKNSWGTSWGEGGYVRIRRGVRGE-- 340
Query: 325 GICGILMQASYPV 337
G+CG+ SYPV
Sbjct: 341 GVCGLAKLPSYPV 353
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 189/315 (60%), Gaps = 31/315 (9%)
Query: 54 DPQSME---ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
D +SM+ E FE+W+ ++ + Y + +E RF I+ N+++ID N ++ L ++F
Sbjct: 37 DLKSMDKLIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLSEF 96
Query: 111 ADLSNEEFISTYLGYNKPYNEPRWPSVQY----LGLPASVDWRKEGAVTPVKDQGQCGSC 166
ADLS+ EF + YLG Y+ R ++ + LP SVDWRK+GAV PVK+QG CGSC
Sbjct: 97 ADLSHREFNNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQGSCGSC 156
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS VAAVEGIN++ TG L SLSEQEL+DCD + N GCNGG M+ AF FI + GG+
Sbjct: 157 WAFSTVAAVEGINQIVTGNLTSLSEQELIDCD-RTYNNGCNGGLMDYAFSFIVENGGLHK 215
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQ 264
E+DYPY + C+ K + VTI+GY +P + FQ
Sbjct: 216 EEDYPYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQ 275
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GVFD +CG L+HGV VGYG G Y VKNSWG+ WGE GYIRM RN
Sbjct: 276 FYSGGVFDGHCGSDLDHGVAAVGYGTAKGVDYITVKNSWGSKWGEKGYIRMRRNIGKPE- 334
Query: 325 GICGILMQASYPVKR 339
GICGI ASYP K+
Sbjct: 335 GICGIYKMASYPTKK 349
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 187/310 (60%), Gaps = 36/310 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E W +S S E +RF ++ NV ++ N +N +KL N+FAD+++ EF S+
Sbjct: 37 YERWRDHHSVTRASH-EALKRFNVFRHNVLHVHRTNKKNKPYKLKVNRFADITHHEFRSS 95
Query: 122 YLGYNKPYNE----PRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
Y G N ++ P+ S ++ +P+SVDWR++GAVT VK+Q CGSCWAFS V
Sbjct: 96 YAGSNVKHHRMLRGPKRGSGGFMYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTV 155
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AAVEGINK++T KLVSLSEQELVDCD ENQGC GG ME AFEFI GG+ TE+ YPY
Sbjct: 156 AAVEGINKIRTNKLVSLSEQELVDCDT-EENQGCAGGLMEPAFEFIKNNGGIKTEETYPY 214
Query: 233 RGKNDR-CQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHG 269
+ + C+ VTI G+E +P FQLYS G
Sbjct: 215 DSNDVQFCRAKSIDGETVTIDGHEHVPENDEEALLKAVAHQPVSVAIDAGSSDFQLYSEG 274
Query: 270 VFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
VF CG QLNHGV +VGYGE +G KYW+V+NSWG WGE GY+R+ R S N G CG
Sbjct: 275 VFIGECGTQLNHGVVIVGYGETKNGTKYWIVRNSWGPEWGEGGYVRIER-GISENEGRCG 333
Query: 329 ILMQASYPVK 338
I M+ASYP K
Sbjct: 334 IAMEASYPTK 343
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 150/345 (43%), Positives = 202/345 (58%), Gaps = 38/345 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M ++ L LFL + P+ A S P DP M +RFE W+ +Y R Y +DE
Sbjct: 1 MASKVQLVFLFLFLCAMWASPSAA-SRDEPN--DP--MMKRFEEWMAEYGRVYKDDDEKM 55
Query: 81 RRFGIYSSNVQYIDYINSQNL-SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY 139
RRF I+ +NV++I+ NS+N S+ L N+F D++ EF++ Y G + P N R P V +
Sbjct: 56 RRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPLNIEREPVVSF 115
Query: 140 -----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+P S+DWR GAV VK+Q CGSCW+F+A+A VEGI K+KTG LVSLSEQE+
Sbjct: 116 DDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEV 175
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
+DC V + GC GG++ KA++FI GVTTE++YPY C + + A ITGY
Sbjct: 176 LDCAV---SYGCKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAY-ITGY 231
Query: 255 E---------------------AIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-H 292
I A FQ Y+ GVF CG LNH +T++GYG+D
Sbjct: 232 SYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGYGQDSS 291
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G KYW+V+NSWG+SWGE GY+RMAR SS+ G+CGI M +P
Sbjct: 292 GTKYWIVRNSWGSSWGEGGYVRMARGVSSSS-GVCGIAMAPLFPT 335
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 149/336 (44%), Positives = 197/336 (58%), Gaps = 35/336 (10%)
Query: 32 LFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQ 91
LF +L + + E Q+ + Q M +E+WL ++ + Y S DE + RF I+ N++
Sbjct: 13 LFFSTLLILSSAIDIENSVQRTNDQVMA-MYESWLVEHGKSYNSLDEKEMRFEIFKENLR 71
Query: 92 YIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNK-PYNEPRWPSVQYL-----GLPA 144
ID N+ N S+ L N+FADL++EE+ STYLG + P + S QY+ LP
Sbjct: 72 IIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKRGPKTDV---SNQYMPKVGDALPD 128
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
VDWR GAV VK+QG C SCWAFSAVAAVEGINK+ TG L+SLSEQELVDC +
Sbjct: 129 YVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQITK 188
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY--- 261
GCN G M AF+FI GG+ TE++YPY K+ +C VTI Y+ +P+
Sbjct: 189 GCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCNLSLKNQKYVTIDSYKNVPSNNEMA 248
Query: 262 -------------------AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNS 302
F+LY+ G+F CG ++HGVT+VGYG + G YW+VKNS
Sbjct: 249 LKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYGTERGMDYWIVKNS 308
Query: 303 WGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
WGT+WGE+GYIR+ RN + G CGI SYPVK
Sbjct: 309 WGTNWGESGYIRIQRNIGGA--GKCGIAKMPSYPVK 342
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 148/349 (42%), Positives = 200/349 (57%), Gaps = 40/349 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEW 79
M +N L LFL+ + W S ++ ER E W+ QY R Y E
Sbjct: 1 MNSFSQNHYLILFLVLAV------WTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEK 54
Query: 80 QRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN------EP 132
++RF ++ +NV +I+ N+ + F L+ N+FADL++EEF + + K + E
Sbjct: 55 EKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWVETSTET 114
Query: 133 RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
+ +PA++D RK GAVTP+KDQG+CGSCWAFSAVAA EGI+++ TGKLV LSEQ
Sbjct: 115 SFRYESVTKIPATIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQ 174
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
ELVDC V E++GC GGY++ AFEFI K GG+ +E YPY+G N C+ K H I
Sbjct: 175 ELVDC-VKGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIK 233
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFD-EYCGHQLNHGVTVVGYG 289
GYE +P+ +AF+ YS G+F+ CG NH V VVGYG
Sbjct: 234 GYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYG 293
Query: 290 EDHGE-KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+ + KYWLVKNSWGT WGE GYIR+ R+ + G+CGI YP+
Sbjct: 294 KALDDSKYWLVKNSWGTEWGERGYIRIKRDIRAKE-GLCGIAKYPYYPI 341
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 141/307 (45%), Positives = 181/307 (58%), Gaps = 31/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFIS 120
+E WL + + Y E +RRF I+ N++++D NS + +F++ +FADL+NEEF +
Sbjct: 44 YEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRA 103
Query: 121 TYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
YL N+ + +YL LP VDWR GAV VKDQG CGSCWAFSAV AV
Sbjct: 104 IYLRKKMERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAV 163
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGIN++ TG+L+SLSEQELVDCD N GC+GG M AFEFI K GG+ T+ DYPY
Sbjct: 164 EGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 236 N-DRCQTDKTKH-HAVTITGYEAIP----------------------ARYAFQLYSHGVF 271
+ C DK + VTI GYE +P + AFQLY GV
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVM 283
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG L+HGV VVGYG GE YW+++NSWG +WG++GY+++ RN G CGI M
Sbjct: 284 TGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDP-FGKCGIAM 342
Query: 332 QASYPVK 338
SYP K
Sbjct: 343 MPSYPTK 349
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 149/340 (43%), Positives = 198/340 (58%), Gaps = 32/340 (9%)
Query: 28 AVLSLFLLW---VLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFG 84
+V+S+ LL+ +L + E Q+ + Q M +E+WL + + Y S DE + RF
Sbjct: 6 SVISMSLLFFSTLLILSLALDIENSVQRTNDQVMA-MYESWLVEQGKSYNSLDEKEMRFE 64
Query: 85 IYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNK-PYNEPRWPSVQYLG- 141
I+ N++ ID N+ N S+ L N+FADL++EE+ STYLG P + + +G
Sbjct: 65 IFKENLRIIDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPKTDVSNEYMPKVGE 124
Query: 142 -LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP VDWR GAV VK+QG C SCWAFSAV AVEGINK+ TG L+SLSEQELVDC
Sbjct: 125 ALPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDCGRT 184
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR 260
+GCN G M AF+FI GG+ TED+YPY K+ +C VTI Y+ +P+
Sbjct: 185 QRTKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLKNQKYVTIDNYKNVPSN 244
Query: 261 Y----------------------AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
F+LY+ G+F +CG ++HGVT+VGYG + G YW+
Sbjct: 245 NEMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYGTERGMDYWI 304
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSWGT+WGE GYIR+ RN + G CGI SYPVK
Sbjct: 305 VKNSWGTNWGENGYIRIQRNIGGA--GKCGIARMPSYPVK 342
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 147/314 (46%), Positives = 187/314 (59%), Gaps = 44/314 (14%)
Query: 62 FENWLKQYS--REYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+E W +++ R+ G + RRF ++ +NV+ I N ++ +KL N+F D++ +EF
Sbjct: 49 YERWRGRHALARDLGDK---ARRFNVFKANVRLIHEFNRRDEPYKLRLNRFGDMTADEFR 105
Query: 120 STYLG----YNKPYNEPRWPS--------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
Y G +++ + R S +PASVDWR++GAVT VKDQGQCGSCW
Sbjct: 106 RHYAGSRVAHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCW 165
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS +AAVEGIN +KT L SLSEQ+LVDCD + N GCNGG M+ AF++I K GGV E
Sbjct: 166 AFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKA-NAGCNGGLMDYAFQYIAKHGGVAAE 224
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
D YPYR + C+ K+ VTI GYE +PA FQ
Sbjct: 225 DAYPYRARQASCK--KSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQF 282
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GVF CG +L+HGVT VGYG G KYWLVKNSWG WGE GYIRMAR+ +
Sbjct: 283 YSEGVFSGRCGTELDHGVTAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKE- 341
Query: 325 GICGILMQASYPVK 338
G CGI M+ASYPVK
Sbjct: 342 GHCGIAMEASYPVK 355
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 141/339 (41%), Positives = 193/339 (56%), Gaps = 64/339 (18%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLSNEEF 118
++ WL + R Y + E +RRF ++ N++++D N+ ++ F+L N+FADL+N+EF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 119 ISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQC---------- 163
+T+LG K R +Y LP SVDWR++GAV PVK+QGQC
Sbjct: 109 RATFLGA-KFVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCVDRIIVWNSM 167
Query: 164 ----------------------GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
GSCWAFSAV+ VE IN+L TG++++LSEQELV+C N
Sbjct: 168 VRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELVECSTNG 227
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
+N GCNGG M+ AF+FI K GG+ TEDDYPY+ + +C ++ V+I G+E +P
Sbjct: 228 QNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVPQND 287
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
FQLY GVF CG L+HGV VGYG D+G+ YW+V
Sbjct: 288 EKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGTDNGKDYWIV 347
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+NSWG WGE+GY+RM RN ++ G CGI M ASYP K
Sbjct: 348 RNSWGPKWGESGYVRMERN-INATTGKCGIAMMASYPTK 385
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 148/349 (42%), Positives = 198/349 (56%), Gaps = 42/349 (12%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER--------FENWLKQYSREYGSEDEWQ 80
+SL L+ + + A S+ YD + R +E+WL ++ + Y + E
Sbjct: 9 TISLLLMLIFSTLSSA-SDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNALGEKD 67
Query: 81 RRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP---S 136
+RF I+ N++YID NS N S+KL KFADL+NEE+ S YLG + + S
Sbjct: 68 KRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLSKNKS 127
Query: 137 VQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSE 191
+YL LP SVDWR +G + VKDQG CGSCWAFSAVAA+E IN + TG L+SLSE
Sbjct: 128 DRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSE 187
Query: 192 QELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTI 251
QELVDCD S N+GC+GG M+ AFEF+ GG+ TE+DYPY+ +ND C + V I
Sbjct: 188 QELVDCD-KSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKI 246
Query: 252 TGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG 289
YE +P Q Y G+F CG ++HGV GYG
Sbjct: 247 DSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVVAAGYG 306
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
++G YW+V+NSWG WGE GY+R+ RN SS+ G+CG+ + SYPVK
Sbjct: 307 SENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSS-GLCGLATEPSYPVK 354
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 263 bits (673), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 140/269 (52%), Positives = 172/269 (63%), Gaps = 29/269 (10%)
Query: 97 NSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--RWPSVQY---LGLPASVDWRKE 151
N N +KL NKFADL+NEEF ++ + R + +Y +P++VDWRK+
Sbjct: 4 NVNNKLYKLGINKFADLTNEEFKASRNKFKGHMCSSIIRTTTFKYENASAIPSTVDWRKK 63
Query: 152 GAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYM 211
GAVTPVK+QGQCGSCWAFSAVAA EGI++L TGKLVSLSEQEL+DCD +QGC GG M
Sbjct: 64 GAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGGLM 123
Query: 212 EKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA--------- 262
+ AF+FI + G++TE YPY G + C T++ HAVTITGYE +PA
Sbjct: 124 DDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQKAVAN 183
Query: 263 -------------FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWG 308
FQ Y+ GVF CG +L+HGVT VGYG + G KYWLVKNSWG WG
Sbjct: 184 QPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDGTKYWLVKNSWGADWG 243
Query: 309 EAGYIRMARNSPSSNIGICGILMQASYPV 337
E GYIRM R ++ G+CGI MQASYP
Sbjct: 244 EEGYIRMQRGIDAAE-GLCGIAMQASYPT 271
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 143/295 (48%), Positives = 179/295 (60%), Gaps = 35/295 (11%)
Query: 78 EWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRW 134
E++RRF ++ N++++D N+ ++ F+L N+FADL+N+EF + YLG P R
Sbjct: 85 EYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLG-TTPAGRGRH 143
Query: 135 PSVQYLG-----LPASVDWRKEGAV-TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
Y LP SVDWR +GAV PVK+QGQCGSCWAFSAVAAVEGINK+ TG+LVS
Sbjct: 144 VGEAYRHDGVEVLPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVS 203
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
LSEQELV+C N N GCNGG M+ AF FI + GG+ TE+DYPY + +C K
Sbjct: 204 LSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKKSRKV 263
Query: 249 VTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
V+I G+E +P FQLY GVF CG L+HGV V
Sbjct: 264 VSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSLDHGVVAV 323
Query: 287 GYGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
GYG D G YW V+NSWG WGE GYIRM RN ++ G CGI M ASYP+K+
Sbjct: 324 GYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNV-TARTGKCGIAMMASYPIKK 377
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 149/354 (42%), Positives = 200/354 (56%), Gaps = 38/354 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYP-QKYDPQSME------ERFENWLKQYSREY 73
M + + SL L+V + A + + Y P+ + FE+WL ++S+ Y
Sbjct: 1 MAFIFSSKKTSLLFLFVSILACSALAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFY 60
Query: 74 GSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR 133
S DE RF I+ N+++ID N + ++ L N+FADL++EEF +LG+ E +
Sbjct: 61 ESLDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHKFLGFKGELAERK 120
Query: 134 WPSVQYLG------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
S + G LP SVDWRK+GAV PVK+QGQCGSCWAFS VAAVEGIN++ TG L
Sbjct: 121 DESSKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 180
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
LSEQEL+DCD + N GCNGG M+ AF ++ + G+ E++YPY C K
Sbjct: 181 MLSEQELIDCDT-TFNNGCNGGLMDYAFAYVMR-SGLHKEEEYPYIMSEGTCDEKKDVSE 238
Query: 248 AVTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTV 285
VTI+GY +P + FQ YS GVFD +CG +L+HGV
Sbjct: 239 KVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAA 298
Query: 286 VGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
VGYG G Y +V+NSWG WGE GYIRM R S + G+CG+ M ASYP K+
Sbjct: 299 VGYGTTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPH-GMCGLYMMASYPTKQ 351
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 187/315 (59%), Gaps = 45/315 (14%)
Query: 62 FENWLKQYS--REYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+E W +++ R+ G + RRF ++ +NV+ I N ++ +KL N+F D++ +EF
Sbjct: 156 YERWRGRHALARDLGDK---ARRFNVFKANVRLIHEFNRRDEPYKLRLNRFGDMTADEFR 212
Query: 120 STYLG----YNKPYNEPRWPS---------VQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
Y G +++ + R S +PASVDWR++GAVT VKDQGQCGSC
Sbjct: 213 RHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKDQGQCGSC 272
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS +AAVEGIN +KT L SLSEQ+LVDCD + N GCNGG M+ AF++I K GGV
Sbjct: 273 WAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKA-NAGCNGGLMDYAFQYIAKHGGVAA 331
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
ED YPYR + C+ K+ VTI GYE +PA FQ
Sbjct: 332 EDAYPYRARQASCK--KSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQ 389
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GVF CG +L+HGV VGYG G KYWLVKNSWG WGE GYIRMAR+ ++
Sbjct: 390 FYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWLVKNSWGPEWGEKGYIRMARDV-AAK 448
Query: 324 IGICGILMQASYPVK 338
G CGI M+ASYPVK
Sbjct: 449 EGHCGIAMEASYPVK 463
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 198/337 (58%), Gaps = 33/337 (9%)
Query: 32 LFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQ 91
L+ L + G+ + S + + +E WL ++ + Y E +RF I+ N+
Sbjct: 5 LYSLILFGLITLSLSLDMSSGRSNKEVMTMYEKWLVKHQKVYYGLGEKNQRFQIFKDNLI 64
Query: 92 YIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN-EPRWPSVQYL-------GLP 143
+ID N+ N S+++ N+F+D++N+E+ TYL N + + SV+Y LP
Sbjct: 65 FIDEHNAPNHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRYAYKAGHNNKLP 124
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
SVDWR GA+TP+K+QG CG+CWAFSAVAAVE INK+ TG LVSLSEQELVDCD ++N
Sbjct: 125 VSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQELVDCD-RTKN 181
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI------ 257
+GCNGG A+ FI + GG+ ++ DYPY G+ C K V+I GY+ +
Sbjct: 182 KGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKNTKVVSINGYKNVQRNSES 241
Query: 258 ---------PARYA-------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKN 301
P FQLY GVF CG L+H V VVGYG ++G+ YWLVKN
Sbjct: 242 ALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVVGYGSENGKDYWLVKN 301
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
SWGT+WGE GY+++ RN ++N G CGI M A+YP K
Sbjct: 302 SWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYPTK 338
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 142/333 (42%), Positives = 187/333 (56%), Gaps = 32/333 (9%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
L+LFLL + I S+ +K S+ E ENW+ +Y + Y E + F I+ N
Sbjct: 11 LALFLLLSIEI-----SQVMSRKLHETSLREEHENWIARYGQVYKVAAE-KETFQIFKEN 64
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV--QYLGLPASV 146
V++I+ N+ N +KL N FADL+ EEF G K + P +P ++
Sbjct: 65 VEFIESFNAAANKPYKLGVNLFADLTLEEFKDFRFGLKKTHEFSITPFKYENVTDIPEAL 124
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGC 206
DWR++GAVTP+KDQGQCGSCWAFS VAA EGI+++ TG LVSL EQELV CD +QGC
Sbjct: 125 DWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGVDQGC 184
Query: 207 NGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA---- 262
GGYME FEFI K GG+TT+ +YPY+G N C T I GYE +P+
Sbjct: 185 EGGYMEDGFEFIIKNGGITTKANYPYKGVNGTCNTTIAASTVAQIKGYETVPSYSEEALQ 244
Query: 263 ------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWG 304
F Y+ G++ CG L+HGVT VGYG + YW+VKNSWG
Sbjct: 245 KAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNETDYWIVKNSWG 304
Query: 305 TSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
T W E G+IRM R + G+CG+ + +SYP
Sbjct: 305 TGWDEKGFIRMQRGITVKH-GLCGVALDSSYPT 336
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 151/349 (43%), Positives = 200/349 (57%), Gaps = 38/349 (10%)
Query: 25 LRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSME---ERFENWLKQYSREYGSEDEWQR 81
+R + + L L + AG E + + + Q++E E F+ W++ R Y S +E++R
Sbjct: 1 MRLSCVLLVACSCLAVAAGFPFENH-RLFIQQAVESPREAFDFWVQTLKRAYASAEEYER 59
Query: 82 RFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQ 138
RF ++ N++++ N+ + S L+ +ADLS +E+ S LGYN +E R
Sbjct: 60 RFDVWLDNLRFVHEYNAGHTSHWLSMGVYADLSQDEYRSKALGYNADLHEERPLRAAPFL 119
Query: 139 YLGL--PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
Y G P VDW +GAVTPVK+Q CGSCWAFS AVEG + + TGKL SLSEQ LVD
Sbjct: 120 YEGTVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLASLSEQMLVD 179
Query: 197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
CD +N GC+GG M+ AFEFI K GG+ TEDDYPY + CQ +K + H VTI Y+
Sbjct: 180 CDRERDN-GCHGGLMDFAFEFIMKNGGIDTEDDYPYTAEEGMCQDNKMRRHVVTIDDYQD 238
Query: 257 IPA----------------------RYAFQLYSHGVFDEYCGHQLNHGVTVVGYGE---- 290
+P + AFQLY GVFD CG L+HGV VVGYG
Sbjct: 239 VPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVGYGTASNG 298
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
H YWLVKNSWG WG+ GYIR+ RN G CG+ MQAS+P+K+
Sbjct: 299 THHLPYWLVKNSWGAEWGDKGYIRLLRN--LGEEGQCGVAMQASFPIKK 345
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 143/293 (48%), Positives = 176/293 (60%), Gaps = 35/293 (11%)
Query: 78 EWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRW 134
E +RRF ++ N++++D N+ + F+L N+FADL+N EF +TYLG P R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 135 PSVQYL-----GLPASVDWRKEGAVT-PVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
Y LP SVDWR +GAV PVK+QGQCGSCWAFSAVAAVEGINK+ TG+LVS
Sbjct: 143 VGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVS 202
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
LSEQELV+C N +N GCNGG M+ AF FI + GG+ TE+DYPY + +C K
Sbjct: 203 LSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSRKV 262
Query: 249 VTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
V+I G+E +P FQLY GVF CG L+HGV V
Sbjct: 263 VSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVVAV 322
Query: 287 GYGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GYG D G YW V+NSWG WGE GYIRM RN ++ G CGI M ASYP+
Sbjct: 323 GYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNV-TARTGKCGIAMMASYPI 374
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 138/313 (44%), Positives = 186/313 (59%), Gaps = 40/313 (12%)
Query: 62 FENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQ--NLSFKLTDNKFADLSNEEF 118
+E WL ++ R + E RF ++ N++++D N + F+L N+FADL+N+EF
Sbjct: 56 YELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFADLTNDEF 115
Query: 119 ISTYLGYNKPYNEPRWPSVQYLG----------LPASVDWRKEGAVTPVKDQGQCGSCWA 168
+ YLG P S +G LP SVDWR++GAV PVK+QGQCGSCWA
Sbjct: 116 RAAYLGARIPAAR----SGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQCGSCWA 171
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSAV++VE IN++ TG++V+LSEQELV+C + N GCNGG M+ AF FI K GG+ TED
Sbjct: 172 FSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGGIDTED 231
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
DYPY+ + +C ++ V+I +E +P FQLY
Sbjct: 232 DYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEAGGRQFQLY 291
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
GVF C L+HGV VGYG ++G+ YW+V+NSWG WGEAGYIRM RN ++ G
Sbjct: 292 KSGVFSGSCTTNLDHGVVAVGYGTENGKDYWIVRNSWGPKWGEAGYIRMERN-INATTGK 350
Query: 327 CGILMQASYPVKR 339
CGI M ASYP K+
Sbjct: 351 CGIAMMASYPTKK 363
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 180/307 (58%), Gaps = 31/307 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFIS 120
+E WL + + Y E +RRF I+ N++++D NS + +F++ +FADL+NEEF +
Sbjct: 44 YEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRA 103
Query: 121 TYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
YL + + +YL LP VDWR GAV VKDQG CGSCWAFSAV AV
Sbjct: 104 IYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCGSCWAFSAVGAV 163
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGIN++ TG+L+SLSEQELVDCD N GC+GG M AFEFI K GG+ T+ DYPY
Sbjct: 164 EGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYPYNAN 223
Query: 236 N-DRCQTDKTKH-HAVTITGYEAIP----------------------ARYAFQLYSHGVF 271
+ C DK + VTI GYE +P + AFQLY GV
Sbjct: 224 DLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLYKSGVM 283
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG L+HGV VVGYG GE YW+++NSWG +WG++GY+++ RN G CGI M
Sbjct: 284 TGTCGISLDHGVVVVGYGSTSGEDYWIIRNSWGLNWGDSGYVKLQRNIDDP-FGKCGIAM 342
Query: 332 QASYPVK 338
SYP K
Sbjct: 343 MPSYPTK 349
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 188/311 (60%), Gaps = 30/311 (9%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
M + F WL+++SR Y S E QRRF I+ N+ YI N Q S+ L NKF+DL+++E
Sbjct: 48 MLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEKSYWLGLNKFSDLTHDE 107
Query: 118 FISTYLGYN---KPYNEPRWPSVQYLGLPAS--VDWRKEGAVTPVKDQGQCGSCWAFSAV 172
F + YLG + + Y + A VDWRK+GAV+ VKDQG CGSCWAFSA+
Sbjct: 108 FRALYLGIRPAGRAHGLRNGDRFIYEDVVAEEMVDWRKKGAVSDVKDQGSCGSCWAFSAI 167
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
+VEG+N + TG+L+SLSEQELVDCD +NQGCNGG M+ AF+FI K GG+ TE+DYPY
Sbjct: 168 GSVEGVNAIVTGELISLSEQELVDCD-RGQNQGCNGGLMDYAFDFIIKNGGIDTEEDYPY 226
Query: 233 RGKNDRC-QTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHG 269
+ + +C + K V I Y+ +P + FQ Y G
Sbjct: 227 KATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKNPVSVAIEAGGRDFQHYQGG 286
Query: 270 VFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
VF CG L+HGV VGYG +D G YW+VKNSWG SWGE GYIRM R +S G CG
Sbjct: 287 VFTGPCGTDLDHGVLAVGYGTDDDGVNYWIVKNSWGPSWGEKGYIRMERMGSNSTSGKCG 346
Query: 329 ILMQASYPVKR 339
I ++ S+P+K+
Sbjct: 347 INIEPSFPIKK 357
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 188/316 (59%), Gaps = 36/316 (11%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFAD 112
D +M ER E W+ Y R Y E RRF ++ N+ +++ N+ + F L N+FAD
Sbjct: 33 DDAAMAERHERWMAVYGRVYKDAAEKARRFEVFKDNLAFVESFNADKKNKFWLGVNQFAD 92
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQYL-------GLPASVDWRKEGAVTPVKDQGQCGS 165
L+ EEF + G+ KP + P+ + LP +VDWR +GAVTP+K+QGQCG
Sbjct: 93 LTTEEFKANK-GF-KPISAEEVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGC 150
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFSAVAA+EGI KL T LVSLSEQELVDCD +S ++GC GG+M+ AFEF+ K GG+
Sbjct: 151 CWAFSAVAAMEGIVKLSTDNLVSLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLA 210
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAF 263
TE YPY+ + +C+ A TI G+E +P + F
Sbjct: 211 TESSYPYKAVDGKCKGG--SKSAATIKGHEDVPPNNEAALMKAVASQPVSVAVDASDRTF 268
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
LYS GV CG QL+HG+ +GYG E G KYW++KNSWGT+WGE ++RM ++ S
Sbjct: 269 MLYSGGVMTGSCGTQLDHGIAAIGYGVESDGTKYWILKNSWGTTWGEKRFLRMEKDI-SD 327
Query: 323 NIGICGILMQASYPVK 338
G+CG+ M+ SYP +
Sbjct: 328 KQGMCGLAMKPSYPTE 343
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 184/306 (60%), Gaps = 29/306 (9%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEF 118
+ FE W K++ + Y S++E R ++ N ++ NS+ N S+ L N FADL++ EF
Sbjct: 27 QLFETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEF 86
Query: 119 ISTYLGYNKPYNEPRWPSVQYLG----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
++ LG + +++ G +PAS+DWR +G VT VKDQG CG+CW+FSA A
Sbjct: 87 KTSRLGLSAAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSATGA 146
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
+EGINK+ TG LVSLSEQEL++CD S N GC GG M+ AF+F+ G+ TE+DYPYR
Sbjct: 147 IEGINKIVTGSLVSLSEQELIECD-KSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYRA 205
Query: 235 KNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFD 272
++ C D+ K VTI Y +P + AFQ+YS G+F
Sbjct: 206 RDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSKGIFT 265
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
C L+H V +VGYG ++G YW+VKNSWGT WG GY+ M RNS +S G+CGI M
Sbjct: 266 GPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQ-GVCGINML 324
Query: 333 ASYPVK 338
ASYPVK
Sbjct: 325 ASYPVK 330
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 190/338 (56%), Gaps = 56/338 (16%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI----DYINSQNLSFKLTDNK 109
D M ERF+ W Y++ Y + E +RRF +Y+ N+ YI + L+++L +
Sbjct: 44 DNSPMIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTYELGETA 103
Query: 110 FADLSNEEFISTYLGYNKPYNEP---------------RWPSVQYLG-----------LP 143
+ DL+N+EF++ Y P P R V +G P
Sbjct: 104 YTDLTNQEFMAMYTAAPSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVYVNLSTAAP 163
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
ASVDWR GAVTPVK+QG+CGSCWAFS VA VEGI +++TGKLVSLSEQELVDCD + +
Sbjct: 164 ASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD--TLD 221
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA- 262
GC+GG +A +IT GG+TTE+DYPY G D C K H+A +I G + R
Sbjct: 222 AGCDGGISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLRRVATRSEA 281
Query: 263 ---------------------FQLYSHGVFDEYCGHQLNHGVTVVGYG--EDHGEKYWLV 299
FQ Y GV++ CG LNHGVTVVGYG E+ G+KYW++
Sbjct: 282 SLANAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQEEEDGDKYWII 341
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWG SWG+ GYI+M ++ G+CGI ++ S+P+
Sbjct: 342 KNSWGASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 147/347 (42%), Positives = 198/347 (57%), Gaps = 47/347 (13%)
Query: 26 RNAVLSLFLL---WVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
+ +L +FL+ W + + SE Y + E W+ QY + Y E ++R
Sbjct: 7 KKNILVVFLVLTVWTSQVMSRRLSEAYSS--------VKHEKWMAQYGKVYKDAAEKEKR 58
Query: 83 FGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYL-GYNKPYN-------EPR 133
F I+ +NV +I+ ++ + F L+ N+FADL +F + + G K +N E
Sbjct: 59 FQIFKNNVHFIESFHAAGDKPFNLSINQFADL--HKFKALLINGQKKEHNVRTATATEAS 116
Query: 134 WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
+ +P+S+DWRK GAVTP+KDQG C SCWAFS VA +EG++++ G+LVSLSEQE
Sbjct: 117 FKYDSVTRIPSSLDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQE 176
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
LVDC V +++GC GGY+E AFEFI K GGV +E YPY+G N C+ K H V I G
Sbjct: 177 LVDC-VKGDSEGCYGGYVEDAFEFIAKKGGVASETHYPYKGVNKTCKVKKETHGVVQIKG 235
Query: 254 YEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED 291
YE +P+ YAFQ YS G+F CG ++H VTVVGYG+
Sbjct: 236 YEQVPSNSEKALLKAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSVTVVGYGKA 295
Query: 292 H-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G KYWLVKNSWGT WGE GYIRM R+ + G+CGI A YP
Sbjct: 296 RGGNKYWLVKNSWGTEWGEKGYIRMKRDIRAKE-GLCGIATGALYPT 341
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 143/293 (48%), Positives = 176/293 (60%), Gaps = 35/293 (11%)
Query: 78 EWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRW 134
E +RRF ++ N++++D N+ + F+L N+FADL+N EF +TYLG P R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG-TTPAGRGRR 142
Query: 135 PSVQYL-----GLPASVDWRKEGAVT-PVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
Y LP SVDWR +GAV PVK+QGQCGSCWAFSAVAAVEGINK+ TG+LVS
Sbjct: 143 VGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGELVS 202
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
LSEQELV+C N +N GCNGG M+ AF FI + GG+ TE+DYPY + +C K
Sbjct: 203 LSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRSRKV 262
Query: 249 VTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
V+I G+E +P FQLY GVF CG L+HGV V
Sbjct: 263 VSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGVVAV 322
Query: 287 GYGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GYG D G YW V+NSWG WGE GYIRM RN ++ G CGI M ASYP+
Sbjct: 323 GYGTDAATGAAYWTVRNSWGPDWGENGYIRMERNV-TARTGKCGIAMMASYPI 374
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 151/350 (43%), Positives = 196/350 (56%), Gaps = 41/350 (11%)
Query: 22 RMMLRNAVLSLFLLWVLGIP---AGAWSEGYPQKYDPQSME---ERFENWLKQYSREYGS 75
+++ +S F++ G G W E D SM+ E FE W+ + + Y +
Sbjct: 5 KLLPLAMCMSFFVVTSFGKDFSIVGYWPE------DLTSMDRLIELFEEWISNHGKIYET 58
Query: 76 EDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP 135
+E RF ++ N+++ID N + S+ L N+FADL+++EF + YLG + R
Sbjct: 59 IEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSRTRQS 118
Query: 136 SVQY-----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
++ + LP SVDWRK+GAVT VK+QG CGSCWAFS VAAVEGINK+ G L SLS
Sbjct: 119 PEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTSLS 178
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQEL+DCD N GC+GG M+ AF FI GG+ E+DYPY C K + VT
Sbjct: 179 EQELIDCD-RPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKGELEVVT 237
Query: 251 ITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
I+GY+ +P + FQ YS GVFD CG QL+HGVT VGY
Sbjct: 238 ISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGGVFDGPCGTQLDHGVTAVGY 297
Query: 289 GEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G G Y +VKNSWG WGE GYIRM RN+ G+CGI ASYP K
Sbjct: 298 GSSKGVDYIIVKNSWGPKWGEKGYIRMKRNT-GKPAGLCGINKMASYPTK 346
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 147/312 (47%), Positives = 187/312 (59%), Gaps = 42/312 (13%)
Query: 62 FENWLKQYS--REYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+E W +++ R+ G + RRF ++ NV+ I N ++ +KL N+F D++ +EF
Sbjct: 47 YERWRGRHAVARDLGDK---ARRFNVFKENVRLIHDFNQRDEPYKLRLNRFGDMTADEFR 103
Query: 120 STYLG----YNKPYNEPRWPSVQ---YLG---LPASVDWRKEGAVTPVKDQGQCGSCWAF 169
Y G +++ + R S Y G LP SVDWR++GAVT VKDQGQCGSCWAF
Sbjct: 104 RHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKDQGQCGSCWAF 163
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
S +AAVEGIN +KT L SLSEQ+LVDCD N GC+GG M+ AF++I K GGV ED
Sbjct: 164 STIAAVEGINAIKTKNLTSLSEQQLVDCDTKG-NAGCDGGLMDYAFQYIAKHGGVAAEDA 222
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYS 267
YPY+ + C+ K+ AVTI GYE +PA FQ YS
Sbjct: 223 YPYKARQASCK--KSPAPAVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYS 280
Query: 268 HGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
GVF CG +L+HGVT VGYG G KYW+VKNSWG WGE GYIRMAR+ + G
Sbjct: 281 EGVFAGRCGTELDHGVTAVGYGVAADGTKYWVVKNSWGPEWGEKGYIRMARDVAAKE-GH 339
Query: 327 CGILMQASYPVK 338
CGI M+ASYPVK
Sbjct: 340 CGIAMEASYPVK 351
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 150/346 (43%), Positives = 203/346 (58%), Gaps = 40/346 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M ++ L LFL + P+ A + + DP M +RFE W+ +Y R Y DE
Sbjct: 1 MASKVQLVFLFLFLCVMWASPSAASRD---EPSDP--MMKRFEEWMAEYGRVYKDNDEKM 55
Query: 81 RRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY 139
RRF I+ +NV +I+ N++N S+ L NKF D++N EF++ Y G + P N R P V +
Sbjct: 56 RRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGVSLPLNFKREPVVSF 115
Query: 140 -----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+ S+DWR GAVT VKDQ CGSCWAFSA+A VEGI K+ TG LVSLSEQE+
Sbjct: 116 DDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEV 175
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
+DC V++ GC+GG+++ A++FI GV +E DYPY+ C + + A ITGY
Sbjct: 176 LDCAVSN---GCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDCTANSWPNSAY-ITGY 231
Query: 255 EAIPA------RYA----------------FQLYSHGVFDEYCGHQLNHGVTVVGYGED- 291
+ + +YA FQ Y+ GVF CG LNH +T++GYG+D
Sbjct: 232 SYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQDS 291
Query: 292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G +YW+VKNSWG+SWGE GY+RMAR SS G+CGI M YP
Sbjct: 292 SGTQYWIVKNSWGSSWGERGYVRMARGVSSS--GLCGIAMDPLYPT 335
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/318 (44%), Positives = 182/318 (57%), Gaps = 40/318 (12%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL---------SFKLTDNK 109
E FE W ++ + Y S E R ++ N ++ N+ S+ L N
Sbjct: 39 EPLFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNA 98
Query: 110 FADLSNEEFISTYLG------YNKPYNEPRWP-SVQYLGLPASVDWRKEGAVTPVKDQGQ 162
FADL++ EF + LG P +E + SV +P ++DWR+ GAVT VKDQG
Sbjct: 99 FADLTHAEFRAARLGRLAVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGS 158
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CG+CW+FSA A+EGINK+KTG L+SLSEQEL+DCD S N GC GG M+ A+ F+ K G
Sbjct: 159 CGACWSFSATGAIEGINKIKTGSLISLSEQELIDCD-RSYNAGCGGGLMDYAYRFVIKNG 217
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------------- 260
G+ TEDDYPYR + C +K K H VTI GY +PA
Sbjct: 218 GIDTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSA 277
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AFQLYS G+FD C L+H V +VGYG + G+ YW+VKNSWG WG GY+ M RN+
Sbjct: 278 RAFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMKGYMHMHRNTG 337
Query: 321 SSNIGICGILMQASYPVK 338
SS+ GICGI M AS+P K
Sbjct: 338 SSS-GICGINMMASFPTK 354
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 144/324 (44%), Positives = 186/324 (57%), Gaps = 42/324 (12%)
Query: 53 YDPQSME------ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLT 106
Y P+ +E E FENW+ + + Y + +E RF ++ N+++ID N + S+ L
Sbjct: 36 YSPEDLESHDKLIELFENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKKVKSYWLG 95
Query: 107 DNKFADLSNEEFISTYLGYN---------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPV 157
N+FADLS+EEF YLG + Y E + V+ +P SVDWRK+GAV V
Sbjct: 96 LNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVE--AVPKSVDWRKKGAVAEV 153
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
K+QG CGSCWAFS VAAVEGINK+ TG L +LSEQEL+DCD + N GCNGG M+ AFE+
Sbjct: 154 KNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT-TYNNGCNGGLMDYAFEY 212
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------- 260
I K GG+ E+DYPY + C+ K + VTI G++ +P
Sbjct: 213 IVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTIDGHQDVPTNDEKSLLKALAHQPLSVA 272
Query: 261 -----YAFQLYSH-GVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIR 314
FQ YS VFD CG L+HGV VGYG G Y +VKNSWG WGE GYIR
Sbjct: 273 IDASGREFQFYSGVSVFDGRCGVDLDHGVAAVGYGSSKGSDYIIVKNSWGPKWGEKGYIR 332
Query: 315 MARNSPSSNIGICGILMQASYPVK 338
+ RN+ G+CGI AS+P K
Sbjct: 333 LKRNTGKPE-GLCGINKMASFPTK 355
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 151/342 (44%), Positives = 193/342 (56%), Gaps = 41/342 (11%)
Query: 30 LSLFLLWVLGIP---AGAWSEGYPQKYDPQSME---ERFENWLKQYSREYGSEDEWQRRF 83
+S F++ G G W E D SM+ E FE W+ + + Y + +E RF
Sbjct: 16 MSFFVVTSFGKDFSIVGYWPE------DLTSMDRLIELFEEWISNHGKIYETIEEKWHRF 69
Query: 84 GIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---- 139
++ N+++ID N + S+ L N+FADL+++EF + YLG + R ++
Sbjct: 70 EVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMYLGLKVESSRTRQSPEEFTYKD 129
Query: 140 -LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
+ LP SVDWRK+GAVT VK+QG CGSCWAFS VAAVEGINK+ G L SLSEQEL+DCD
Sbjct: 130 VVDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVAAVEGINKIVGGNLTSLSEQELIDCD 189
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
N GC+GG M+ AF FI GG+ E+DYPY C K + VTI+GY+ +P
Sbjct: 190 -RPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYLEVESTCDNKKGELEVVTISGYKDVP 248
Query: 259 ----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKY 296
+ FQ YS GVFD CG QL+HGVT VGYG G Y
Sbjct: 249 ENNEASLIKALAHQPLSVAIEASGRDFQFYSGGVFDGPCGTQLDHGVTAVGYGSSKGVDY 308
Query: 297 WLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+VKNSWG WGE GYIRM RN+ G+CGI ASYP K
Sbjct: 309 IIVKNSWGPKWGEKGYIRMKRNT-GKPAGLCGINKMASYPTK 349
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 182/313 (58%), Gaps = 39/313 (12%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL------SFKLTDNKFADL 113
E FE W K++S+ Y SE+E R ++ N ++ N S+ L+ N FADL
Sbjct: 31 ELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADL 90
Query: 114 SNEEFISTYLG-------YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
++ EF +T LG + +P N+ S L +P+ +DWR+ GAVTPVKDQ CG+C
Sbjct: 91 THHEFKTTRLGLPLTLLRFKRPQNQQ---SRDLLHIPSQIDWRQSGAVTPVKDQASCGAC 147
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSA A+EGINK+ TG LVSLSEQEL+DCD S N GC GG M+ A++F+ G+ T
Sbjct: 148 WAFSATGAIEGINKIVTGSLVSLSEQELIDCDT-SYNSGCGGGLMDFAYQFVIDNKGIDT 206
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------ARYAFQL 265
EDDYPY+ + C DK K AVTI Y +P + FQL
Sbjct: 207 EDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVASQPVSVGICGSEREFQL 266
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
YS G+F C L+H V +VGYG ++G YW+VKNSWG WG GYI M RNS +S G
Sbjct: 267 YSKGIFTGPCSTFLDHAVLIVGYGSENGVDYWIVKNSWGKYWGMNGYIHMIRNSGNSK-G 325
Query: 326 ICGILMQASYPVK 338
ICGI ASYPVK
Sbjct: 326 ICGINTLASYPVK 338
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 140/295 (47%), Positives = 183/295 (62%), Gaps = 36/295 (12%)
Query: 77 DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTY----LGYNKPYNEP 132
DE RF ++ +NV ++ N + +KL NKF D++N EF Y + +++ +
Sbjct: 54 DEKHNRFNVFKANVMHVHNTNKLDKPYKLKLNKFGDMTNYEFRRIYADSKISHHRMFRGM 113
Query: 133 RWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
+ ++ +P+S+DWR +GAVT VKDQGQCGSCWAFS +AAVEGIN++KT KLV
Sbjct: 114 SHENGTFMYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLV 173
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SLSEQ+LVDCD EN+GCNGG ME AFEFI K G+TTE +YPY K+ C +K +
Sbjct: 174 SLSEQQLVDCDT-EENEGCNGGLMEYAFEFI-KQNGITTESNYPYAAKDGTCDVEK-EDK 230
Query: 248 AVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTV 285
AV+I G+E +P Y FQ YS GVF +C LNHGV +
Sbjct: 231 AVSIDGHENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAI 290
Query: 286 VGYGEDHGE-KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
VGYG KYW++KNSWG+ WGE GYIRM R SS G+CGI M+ASYP+K+
Sbjct: 291 VGYGVTQDRTKYWIMKNSWGSEWGEQGYIRMQR-GISSREGLCGIAMEASYPIKK 344
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 187/322 (58%), Gaps = 40/322 (12%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI---------DYINSQNLSFKLT 106
+++ E + W + E RRFG + SNV +I N+ S++L
Sbjct: 36 EALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRLR 95
Query: 107 DNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQG 161
N+F D+ EF ST+ G + P ++ +P +VDWR++GAVT VKDQG
Sbjct: 96 LNRFGDMDQAEFRSTFAGPLHRHTRPAQSIPGFIYDTVKDIPQAVDWRQKGAVTGVKDQG 155
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT-K 220
+CGSCWAFSAVA+VEG+N ++TG LVSLSEQEL+DCD ++ GC GG ME AFEFI
Sbjct: 156 KCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEFIAHS 215
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA--------------------- 259
GG+ TE YPY N C ++ +V I G++++PA
Sbjct: 216 AGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQPVSVAIDA 275
Query: 260 -RYAFQLYSHGVFDEYCGHQLNHGVTVVGYG--EDHGEKYWLVKNSWGTSWGEAGYIRMA 316
AFQ YS GVF CG +L+HGV VVGYG E+ G++YW+VKNSWG WGE GY+RM
Sbjct: 276 GGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEEDGKEYWIVKNSWGPGWGEHGYVRMQ 335
Query: 317 RNSPSSNIGICGILMQASYPVK 338
R+S + G+CGI M+ASYPVK
Sbjct: 336 RDS-GVDGGLCGIAMEASYPVK 356
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 185/315 (58%), Gaps = 38/315 (12%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+S+ + +E W Q+ + DE ++RF ++ NV +I+ +N +KL N+FAD++N
Sbjct: 34 KSLWDLYERWGSQHMVSR-APDEKKKRFNVFKYNVNHINRVNQLGKPYKLKLNEFADMTN 92
Query: 116 EEFISTY----LGYNKPYNEPR---WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EF + + L + + R + + P S+DWR GAV P+K+QG+CGSCWA
Sbjct: 93 HEFKAGFDSKILHFRMLKGKRRQTPFTHAKTTDPPPSIDWRTNGAVNPIKNQGRCGSCWA 152
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS + VEGINK+KT +LVSLSEQELVDC+ + E GCNGG ME +EFI + GGVTTE
Sbjct: 153 FSTIVGVEGINKIKTNQLVSLSEQELVDCETDCE--GCNGGLMENGYEFIKETGGVTTEQ 210
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
YPY +N RC K V I G+E +PA FQ Y
Sbjct: 211 IYPYFARNGRCDISKRNSPVVKIDGFENVPANDESAMLRAVANQPVSIAIDAGGLNFQFY 270
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMAR--NSPSSN 323
S GVF+ CG +LNHGV +VGYG G YW+V+NSWGT WGE GY+RM R N P
Sbjct: 271 SQGVFNGACGTELNHGVAIVGYGTTQDGTNYWIVRNSWGTGWGEQGYVRMQRGVNVPE-- 328
Query: 324 IGICGILMQASYPVK 338
G+CG+ M ASYP+K
Sbjct: 329 -GLCGLAMDASYPIK 342
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 149/346 (43%), Positives = 200/346 (57%), Gaps = 39/346 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M ++ L LFL + P+ A + + DP M +RFE W+ +Y R Y DE
Sbjct: 1 MASKVQLVFLFLFLCVMWASPSAASRD---EPSDP--MMKRFEEWMAEYGRVYKDNDEKM 55
Query: 81 RRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLG-YNKPYNEPRWPSVQ 138
RRF I+ +NV +I+ NS N S+ L N+F D++ EF++ Y G ++P N R P V
Sbjct: 56 RRFQIFKNNVNHIETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPLNIEREPVVS 115
Query: 139 Y-----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
+ +P S+DWR GAV VK+Q CGSCWAF+A+A VEGI K+KTG LVSLSEQE
Sbjct: 116 FDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQE 175
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
++DC V + GC GG++ KA++FI GVTTE++YPY+ C + + A ITG
Sbjct: 176 VLDCAV---SYGCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTCNANSFPNSAY-ITG 231
Query: 254 YE---------------------AIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED- 291
Y I A FQ Y+ GVF CG LNH +T++GYG+D
Sbjct: 232 YSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGYGQDS 291
Query: 292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G KYW+V+NSWG+SWGE GY+RMAR SS+ G CGI M +P
Sbjct: 292 SGTKYWIVRNSWGSSWGEGGYVRMARGVSSSS-GACGIAMSPLFPT 336
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 141/314 (44%), Positives = 181/314 (57%), Gaps = 32/314 (10%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
Q + +F W ++ + Y + +E RF ++ N++YI + +NLS+ L KFADL+N
Sbjct: 39 QLLAGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEKNLSYWLGLTKFADLTN 98
Query: 116 EEFISTYLGYNKPYNEPRWPSVQYLG--------LPASVDWRKEGAVTPVKDQGQCGSCW 167
EEF Y G + G P S+DWR++GAVT VKDQG CGSCW
Sbjct: 99 EEFRRQYTGTRIDRSRRLKKGRNATGSFRYANSEAPKSIDWREKGAVTSVKDQGSCGSCW 158
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAV +VEGIN ++TG +SLS QELVDCD NQGCNGG M+ AF+F+ + GG+ TE
Sbjct: 159 AFSAVGSVEGINAIRTGDAISLSVQELVDCD-KKYNQGCNGGLMDYAFDFVIQNGGIDTE 217
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
DYPY+G + RC +K VTI YE +P FQL
Sbjct: 218 KDYPYQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQL 277
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN-I 324
YS GVF CG L+HGV VGYG + G YW+VKNSWG WGE+GY+RM RN N
Sbjct: 278 YSGGVFTGRCGTDLDHGVLAVGYGSEKGLDYWIVKNSWGEYWGESGYLRMQRNLKDDNGY 337
Query: 325 GICGILMQASYPVK 338
G+CGI ++ SY VK
Sbjct: 338 GLCGINIEPSYAVK 351
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 188/312 (60%), Gaps = 35/312 (11%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFAD 112
DP M +RFE W+ +Y R Y DE RRF I+ +NV++I+ NS+N S+ L N+F D
Sbjct: 4 DP--MMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTD 61
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQGQCGSCW 167
++ EF++ Y G + P N R P V + +P S+DWR GAV VK+Q CGSCW
Sbjct: 62 MTKSEFVAQYTGVSLPLNIEREPVVSFDDVNISAVPQSIDWRDYGAVNEVKNQNPCGSCW 121
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AF+A+A VEGI K+KTG LVSLSEQE++DC V + GC GG++ KA++FI GVTTE
Sbjct: 122 AFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV---SYGCKGGWVNKAYDFIISNNGVTTE 178
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYE---------------------AIPARYAFQLY 266
++YPY+ C + + A ITGY I A FQ Y
Sbjct: 179 ENYPYQAYQGTCNANSFPNSAY-ITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYY 237
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
+ GVF CG LNH +T++GYG+D G KYW+V+NSWG+SWGE GY+RMAR SS+ G
Sbjct: 238 NGGVFSGPCGTSLNHAITIIGYGQDSSGTKYWIVRNSWGSSWGEGGYVRMARGVSSSS-G 296
Query: 326 ICGILMQASYPV 337
CGI M +P
Sbjct: 297 ACGIAMSPLFPT 308
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 195/321 (60%), Gaps = 47/321 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNE 116
+ E F++W +++ + YGSE+E Q+R I+ N ++ N N ++ L+ N FADL++
Sbjct: 26 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 85
Query: 117 EFISTYLGYNKPYNEPRWPSV------QYLG----LPASVDWRKEGAVTPVKDQGQCGSC 166
EF ++ LG + PSV Q LG +P SVDWRK+GAVT VKDQG CG+C
Sbjct: 86 EFKASRLGLSVSA-----PSVIMASKGQSLGGSVKVPDSVDWRKKGAVTNVKDQGSCGAC 140
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
W+FSA A+EGIN++ TG L+SLSEQEL+DCD S N GCNGG M+ AFEF+ K G+ T
Sbjct: 141 WSFSATGAMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHGIDT 199
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGY------------EAIPAR----------YAFQ 264
E DYPY+ ++ C+ DK K VTI Y EA+ A+ AFQ
Sbjct: 200 EKDYPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQ 259
Query: 265 LYS-------HGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
LYS G+F C L+H V +VGYG +G YW+VKNSWG SWG G++ M R
Sbjct: 260 LYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQR 319
Query: 318 NSPSSNIGICGILMQASYPVK 338
N+ +S+ G+CGI M ASYP+K
Sbjct: 320 NTENSD-GVCGINMLASYPIK 339
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 144/309 (46%), Positives = 192/309 (62%), Gaps = 32/309 (10%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEF 118
E F++W +++ + YGSE+E Q+R I+ N ++ N N ++ L+ N FADL++ EF
Sbjct: 30 ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89
Query: 119 ISTYLGYNKPYNEPRWPSV-QYLG----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
++ LG + + S Q LG +P SVDWRK+GAVT VKDQG CG+CW+FSA
Sbjct: 90 KASRLGLSVSASSLIMASKGQSLGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFSATG 149
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
A+EGIN++ TG L+SLSEQEL+DCD S N GCNGG M+ AFEF+ K G+ TE DYPY+
Sbjct: 150 AMEGINQIVTGDLISLSEQELIDCD-KSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQ 208
Query: 234 GKNDRCQTDKTKHHAVTITGY------------EAIPAR----------YAFQLYSH--G 269
++ C+ DK K VTI Y EA+ A+ AFQLYS G
Sbjct: 209 ERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRVSG 268
Query: 270 VFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
+F C L+H V +VGYG +G YW+VKNSWG SWG G++ M RN+ +S GICGI
Sbjct: 269 IFSGPCSTSLDHAVLIVGYGSQNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSE-GICGI 327
Query: 330 LMQASYPVK 338
M ASYP+K
Sbjct: 328 NMLASYPIK 336
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 189/310 (60%), Gaps = 34/310 (10%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEF 118
ER E W+ QY + Y E ++RF ++ +NVQ+I+ N+ + F L+ N+FADL +EEF
Sbjct: 33 ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF 92
Query: 119 ISTYLGYNKPYN------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQG-QCGSCWAFSA 171
+ K + E + +P+++DWRK GAVTP+KDQG CGSCWAF+
Sbjct: 93 KALLNNVQKKASRVETATETSFRYENVTKIPSTMDWRKRGAVTPIKDQGYTCGSCWAFAT 152
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
VA VE ++++ TG+LVSLSEQELVDC V +++GC GGY+E AFEFI GG+T+E YP
Sbjct: 153 VATVESLHQITTGELVSLSEQELVDC-VRGDSEGCRGGYVENAFEFIANKGGITSEAYYP 211
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHG 269
Y+GK+ C+ K H I GYE++P+ AF+ YS G
Sbjct: 212 YKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSSG 271
Query: 270 VFD-EYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
+F+ CG L+H V VVGYG+ G KYWLVKNSW T+WGE GY+R+ R+ + G+C
Sbjct: 272 IFEARNCGTHLDHAVAVVGYGKLRDGTKYWLVKNSWSTAWGEKGYMRIKRDIRAKK-GLC 330
Query: 328 GILMQASYPV 337
GI ASYP+
Sbjct: 331 GIASNASYPI 340
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 182/315 (57%), Gaps = 36/315 (11%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFAD 112
+ S + FE W +QY + Y SE+E R ++ N ++ NS N S+ L N FAD
Sbjct: 21 EASSTADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFAD 80
Query: 113 LSNEEFISTYLGYN-------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
L++ EF ++ LG++ + P VQ L +P +VDWRK GAVT VKDQG CG
Sbjct: 81 LTHHEFKASRLGFSPGRAQSIRSVGTP----VQELHVPPAVDWRKSGAVTGVKDQGNCGG 136
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CW+FS A+EGINK+ TG LVSLSEQELVDCD S N GC GG M+ A++F+ K G+
Sbjct: 137 CWSFSTTGAIEGINKIVTGSLVSLSEQELVDCD-RSYNSGCEGGLMDYAYQFVIKNQGID 195
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAF 263
+E DYPY G + C +K K H VTI GY IP + F
Sbjct: 196 SEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGSEKTF 255
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
QLYS GV+ C L+H V +VGYG + G +W+VKNSWG WG GYI M RN+ ++
Sbjct: 256 QLYSKGVYTGPCSSTLDHAVLIVGYGTEDGVDFWIVKNSWGEHWGMRGYIHMLRNNGTAE 315
Query: 324 IGICGILMQASYPVK 338
GICGI M ASYP K
Sbjct: 316 -GICGINMLASYPAK 329
>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
Length = 349
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 189/317 (59%), Gaps = 42/317 (13%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+RF+ W +Y+R Y + +E+Q+RF +YS NV++I+ +N S++L +N+FADL+ EEF
Sbjct: 35 DRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGENQFADLTEEEFK 94
Query: 120 STYLGYNKPYNEPRWPSVQYLGL-----------------PASVDWRKEGAVTPVKDQGQ 162
TYL K N P L + P SVDWR +GAVTPVK Q
Sbjct: 95 DTYL--MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKSQQH 152
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAF+AVA++EG++K+KTG+LVSLSEQE+VDCD N GC+GG+ A E++T+ G
Sbjct: 153 CGSCWAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTRNG 212
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA---------------------IPARY 261
G+TTE DYPY G+ +C +DK HHA I G +A I A
Sbjct: 213 GLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSINASR 272
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AFQ Y G+F C NH VTVVGYG + G KYW+VKNSWG WGE GY+RM R
Sbjct: 273 AFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQRGVR 332
Query: 321 SSNIGICGILMQASYPV 337
+ G+CGI + Y V
Sbjct: 333 ARE-GVCGIAIAPFYAV 348
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 143/334 (42%), Positives = 193/334 (57%), Gaps = 52/334 (15%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNK 109
D SM ERF+ W Y++ Y + E +RRF +Y+ N+ YI+ N++ L+++L +
Sbjct: 42 DDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAAGLTYELGETA 101
Query: 110 FADLSNEEFISTYLG---YNKPYNEP----RWPSVQYLG---------------LPASVD 147
+ DL+N+EF++ Y P +E R V +G PASVD
Sbjct: 102 YTDLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSASAPASVD 161
Query: 148 WRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCN 207
WR GAVTPVK+QG+CGSCWAFS VA VEGI +++TGKLVSLSEQELVDCD + + GC+
Sbjct: 162 WRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD--TLDDGCD 219
Query: 208 GGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----- 262
GG +A +I GG+TTE DYPY G D C K H+AV+I G + R
Sbjct: 220 GGISYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEASLAN 279
Query: 263 -----------------FQLYSHGVFDEYCGHQLNHGVTVVGYGED--HGEKYWLVKNSW 303
FQ Y GV++ CG LNHGVTVVGYG++ G++YW+VKNSW
Sbjct: 280 AVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAAGDRYWIVKNSW 339
Query: 304 GTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G WG+ GYIRM ++ G+CGI ++ SYP+
Sbjct: 340 GQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 140/306 (45%), Positives = 184/306 (60%), Gaps = 31/306 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE+WL ++S+ Y S DE RF I+ N+++ID N + ++ L N+FADL++EEF
Sbjct: 49 FESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHK 108
Query: 122 YLGYNKPYNEPRWPSVQYLG------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
+LG+ E + S + G LP SVDWRK+GAV PVK+QGQCG+CWAFS VAAV
Sbjct: 109 FLGFKGELAERKDESSKEFGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGNCWAFSTVAAV 168
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGIN++ TG L LSEQEL+DCD + N GCNGG M+ AF ++ + G+ E++YPY
Sbjct: 169 EGINQIVTGNLTMLSEQELIDCDT-TFNNGCNGGLMDYAFAYVMR-SGLHKEEEYPYIMS 226
Query: 236 NDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFDE 273
C K VTI+GY +P + FQ YS GVFD
Sbjct: 227 EGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDG 286
Query: 274 YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
+CG +L+HGV VGYG G Y +V+NSWG WGE GYIRM R S + G+CG+ M A
Sbjct: 287 HCGTELDHGVAAVGYGTTKGLDYVIVRNSWGPKWGEKGYIRMKRGSGKPH-GMCGLYMMA 345
Query: 334 SYPVKR 339
SYP K+
Sbjct: 346 SYPTKQ 351
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 140/297 (47%), Positives = 182/297 (61%), Gaps = 36/297 (12%)
Query: 76 EDEWQRRFGIYSSNVQYIDYINSQNLS----FKLTDNKFADLSNEEFISTYLGYNKPYN- 130
ED+ RR ++ N++YID N++ + F+L +FADL+ EE+ + L ++ N
Sbjct: 77 EDDDARRLEVFRYNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNG 136
Query: 131 -------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKT 183
R+ + LP +VDWR+ GAV VKDQGQCG+CWAFSAVAAVEGINK+ T
Sbjct: 137 TAVGVVGSRRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGACWAFSAVAAVEGINKIVT 196
Query: 184 GKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK 243
G L+SLSEQEL+DCD ++QGC+GG M+ AF F+ K GG+ TE DYP+ G + C
Sbjct: 197 GSLISLSEQELIDCD-KFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKL 255
Query: 244 TKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNH 281
V+I +E +P +R AFQLYS G+FD CG L+H
Sbjct: 256 KNTRVVSIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDH 315
Query: 282 GVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GVTVVGYG + G+ YW+VKNSWGT WGEAGY+RMARN G CGI M+ YPVK
Sbjct: 316 GVTVVGYGSEGGKDYWIVKNSWGTQWGEAGYVRMARNV-RVRAGKCGIAMEPLYPVK 371
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 259 bits (663), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 142/308 (46%), Positives = 182/308 (59%), Gaps = 29/308 (9%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
+ + FE+W+ ++ + Y S +E RF ++ N+++ID N + S+ L N+FADLS+EE
Sbjct: 44 LTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEE 103
Query: 118 FISTYLGYN----KPYNEPRWPSVQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
F YLG K + P S + + LP SVDWRK+GAV VK+QG CGSCWAFS V
Sbjct: 104 FKRKYLGLKIELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTV 163
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AAVEGIN++ TG L +LSEQEL+DCD N GCNGG M+ AF FI GG+ E+DYPY
Sbjct: 164 AAVEGINQIVTGNLTALSEQELIDCD-KPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPY 222
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGV 270
+ C K + VTI+GY +P + FQ YS G+
Sbjct: 223 VMEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGI 282
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
F+ +CG +L+HGV VGYG G Y VKNSWG+ WGE GYIRM RN GICGI
Sbjct: 283 FNGHCGTELDHGVAAVGYGTSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKPE-GICGIY 341
Query: 331 MQASYPVK 338
ASYP K
Sbjct: 342 KMASYPTK 349
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 142/306 (46%), Positives = 191/306 (62%), Gaps = 31/306 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ--NLSFKLTDNKFADLSNEEFI 119
++ WL + R Y + E +RRF ++ N+++ D N++ + F+L N+FADL+NEEF
Sbjct: 53 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 112
Query: 120 STYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
+T+LG K R +Y LP SVDWR++GAV PVK+QGQCGSCWAFSAV+
Sbjct: 113 ATFLGA-KVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSAVST 171
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VE IN+L TG++++LSEQELV+C N +N GCNGG M+ AF+FI K GG+ TEDDYPY+
Sbjct: 172 VESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTEDDYPYKA 231
Query: 235 KNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD 272
+ +C ++ V+I G+E +P FQLY GVF
Sbjct: 232 VDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFS 291
Query: 273 EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
CG L+HGV VGYG D+G+ YW+V+NSWG WGE+GY+RM RN + G CGI M
Sbjct: 292 GRCGTSLDHGVVAVGYGTDNGKDYWIVRNSWGPKWGESGYVRMERNI-NVTTGKCGIAMM 350
Query: 333 ASYPVK 338
ASYP K
Sbjct: 351 ASYPTK 356
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 139/278 (50%), Positives = 176/278 (63%), Gaps = 30/278 (10%)
Query: 89 NVQYID-YINSQNLSFKLTDNKFADLSNEEFISTYLGYN--KPYNEPRWPSVQYLG---L 142
NV YI+ + N+ N +KL N+FADL++EEFI +N ++ R + +Y L
Sbjct: 7 NVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGHMRFSNTRTTTFKYENVTVL 66
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P S+DWR++GAVTP+K+QG CG CWAFSA+AA EGI+K+ TGKLVSLSEQE+VDCD
Sbjct: 67 PDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGT 126
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---- 258
+ GC GGYM+ AF+FI + G+ TE YPY+G + +C + HA TITGYE +P
Sbjct: 127 DHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGYEDVPINNE 186
Query: 259 -----------------ARYA-FQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLV 299
AR A FQ Y G+F CG +L+HGVT VGYGE++ G KYWLV
Sbjct: 187 KALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEGTKYWLV 246
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWGT WGE GY M R + GICGI M ASYP
Sbjct: 247 KNSWGTEWGEEGYTMMQRGVKAVE-GICGIAMLASYPT 283
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 188/315 (59%), Gaps = 45/315 (14%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFAD 112
D +M R E W+ QYSR Y E RRF ++ +NV++I+ N+ N F L N+FAD
Sbjct: 29 DDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGVNQFAD 88
Query: 113 LSNEEFISTYL--GYN-KPYNEP---RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
L+N+EF +T G+ P P R+ +V LPA++DWR +GAVTP+KDQGQC
Sbjct: 89 LTNDEFRATKTNKGFKPSPVKVPTGFRYENVSVDALPATIDWRTKGAVTPIKDQGQC--- 145
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
EGI K+ TGKL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+TT
Sbjct: 146 ---------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFQFIIKNGGLTT 196
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E YPY + +C++ + A T+ G+E +PA FQ
Sbjct: 197 ESSYPYTAADGKCKSG--SNSAATVKGFEDVPANDEAALMKAVANQPVSVAVDGGDMTFQ 254
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE GY+RM ++ S
Sbjct: 255 FYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKD-ISDK 313
Query: 324 IGICGILMQASYPVK 338
G+CG+ M+ SYP++
Sbjct: 314 RGMCGLAMEPSYPIE 328
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 259 bits (661), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 144/332 (43%), Positives = 183/332 (55%), Gaps = 68/332 (20%)
Query: 30 LSLFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+S+ LL++L AW S+ + SM ER E+W+ +Y R Y +E ++RF I+
Sbjct: 10 VSMALLFILA----AWASQATSRSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKD 65
Query: 89 NVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVDW 148
NV +Q +FK + +P+++DW
Sbjct: 66 NV-------AQATTFKYEN-------------------------------VTAVPSTIDW 87
Query: 149 RKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNG 208
RK+GAVTP+KDQ QCGSCWAFSAVAA EGI ++ TGKL+SLSEQELVDCD ENQGC+G
Sbjct: 88 RKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGCSG 147
Query: 209 GYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------- 260
G + AF FI I G+ +E YPY G + C + K H A I GYE +PA
Sbjct: 148 GLXDDAFRFIX-IHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANNEKALQKA 206
Query: 261 --------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGT 305
+ FQ Y+ GVF CG +L+HGV VGYG D G YWLVKNSWGT
Sbjct: 207 VAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDGMXYWLVKNSWGT 266
Query: 306 SWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
WGE GYIRM R+ + G+CGI MQASYP
Sbjct: 267 GWGEEGYIRMQRDVTAKE-GLCGIAMQASYPT 297
>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
Length = 349
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 188/317 (59%), Gaps = 42/317 (13%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+RF+ W +Y+R Y + +E+Q+RF +YS NV++I+ +N S++L +N+FADL+ EEF
Sbjct: 35 DRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGENRFADLTEEEFK 94
Query: 120 STYLGYNKPYNEPRWPSVQYLGL-----------------PASVDWRKEGAVTPVKDQGQ 162
TYL K N P L + P SVDWR +GAVTPVK Q
Sbjct: 95 DTYL--MKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTPVKSQQH 152
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAF+AVA++EG++K+KTG LVSLSEQE+VDCD N GC+GG+ A E++T+ G
Sbjct: 153 CGSCWAFAAVASIEGVHKIKTGLLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTRNG 212
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA---------------------IPARY 261
G+TTE DYPY G+ +C +DK HHA I G +A I A
Sbjct: 213 GLTTESDYPYVGRQGQCMSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSINASR 272
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AFQ Y G+F C NH VTVVGYG + G KYW+VKNSWG WGE GY+RM R
Sbjct: 273 AFQFYKRGIFSGPCNTTRNHAVTVVGYGANASGHKYWIVKNSWGERWGEKGYVRMQRGVR 332
Query: 321 SSNIGICGILMQASYPV 337
+ G+CGI + Y V
Sbjct: 333 ARE-GVCGIAIAPFYAV 348
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 138/328 (42%), Positives = 186/328 (56%), Gaps = 45/328 (13%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL------------ 101
DP ++E +F+ W ++ + Y + +E R +++ N ++ N++
Sbjct: 28 DPPAIEAQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAA 87
Query: 102 --SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR-------WPSVQYLGLPASVDWRKEG 152
S+ L N FADL++EEF + LG P R W +P ++DWRK G
Sbjct: 88 PPSYTLALNAFADLTHEEFRAARLGRIAPGAALRSRAAPVYWGLGGGAAVPDALDWRKSG 147
Query: 153 AVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYME 212
AVT VKDQG CG+CW+FSA A+EGINK+KTG LVSLSEQEL+DCD S N GC GG M+
Sbjct: 148 AVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCD-RSYNSGCGGGLMD 206
Query: 213 KAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------ 260
A++F+ K GG+ TE+DYPYR + C +K K VTI GY +P+
Sbjct: 207 YAYKFVIKNGGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQ 266
Query: 261 ----------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEA 310
AFQLY G+FD C L+H V +VGYG + G+ YW+VKNSWG SWG
Sbjct: 267 PVSVGICGSARAFQLYYQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMK 326
Query: 311 GYIRMARNSPSSNIGICGILMQASYPVK 338
GY+ M RN+ S G+CGI M AS+P K
Sbjct: 327 GYMHMHRNTGDSK-GVCGINMMASFPTK 353
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 187/306 (61%), Gaps = 32/306 (10%)
Query: 62 FENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS 120
F +W++ + Y +E++R+F ++ N++++ N ++ +FKL FADL+++E+
Sbjct: 48 FSDWVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHNEKDSTFKLGLTNFADLTHDEYRQ 107
Query: 121 TYLGYNKPYNEPRWPSVQYLGL-------PASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
LGY + + G P S+DWRK+GAVT VK+Q QCGSCWAFS
Sbjct: 108 HALGYRPELKGTGLGTGKSTGFQYADYEAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTTG 167
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
+VEG N + +G+LVSLSEQELVDCDV +++ GC+GG M+ AF FI + GG+ TE DY Y+
Sbjct: 168 SVEGANAIYSGELVSLSEQELVDCDV-TQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYK 226
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPA----------------------RYAFQLYSHGVF 271
++ C K K H VTI YE +P + FQLY+ GVF
Sbjct: 227 AQDGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVF 286
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
D CG L+HGV VVGYG D+G YW+VKNSWG WG++GYIR+AR S++ G CGI M
Sbjct: 287 DAPCGTALDHGVLVVGYGSDNGTDYWIVKNSWGDFWGDSGYIRLARGI-SNSAGQCGIAM 345
Query: 332 QASYPV 337
QASYP+
Sbjct: 346 QASYPI 351
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 132/299 (44%), Positives = 180/299 (60%), Gaps = 48/299 (16%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E WL ++ + Y + E +RRF I+ N+++I+ N+ N ++K+ D +++ + E+
Sbjct: 4 YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVGD-RYSFRAGED---- 58
Query: 122 YLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKL 181
LP SVDWR++GAV PVKDQG CGSCWAFS +AAVEGIN++
Sbjct: 59 --------------------LPESVDWREKGAVVPVKDQGNCGSCWAFSTIAAVEGINQI 98
Query: 182 KTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQT 241
TG L+SLSEQELVDCD S NQGCNGG M+ AFEFI GG+ +E+DYPYR + C
Sbjct: 99 ATGDLISLSEQELVDCD-KSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADTTCDP 157
Query: 242 DKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQL 279
++ V+I GYE +P AFQLY GVF CG QL
Sbjct: 158 NRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGVFTGQCGTQL 217
Query: 280 NHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+HGV VGYG ++ YW+V+NSWG +WGE+GYI++ RN + G CGI ++ SYP+K
Sbjct: 218 DHGVVAVGYGTENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGIAIEPSYPIK 276
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 150/347 (43%), Positives = 204/347 (58%), Gaps = 41/347 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M ++ L LFL + P+ A + + DP M +RFE W+ +Y R Y DE
Sbjct: 1 MASKVQLVFLFLFLCVMWASPSAASRD---EPSDP--MMKRFEEWMAEYGRVYKDNDEKM 55
Query: 81 RRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLG-YNKPYNEPRWPSVQ 138
RRF I+ +NV +I+ N++N S+ L NKF D++N EF++ Y G ++P N + P V
Sbjct: 56 RRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPVVS 115
Query: 139 Y-----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
+ + S+DWR GAVT VKDQ CGSCWAFSA+A VEGI K+ TG LVSLSEQE
Sbjct: 116 FDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQE 175
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
++DC V++ GC+GG+++ A++FI GV +E DYPY+ C + + A ITG
Sbjct: 176 VLDCAVSN---GCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSAY-ITG 231
Query: 254 YEAIPA------RYA----------------FQLYSHGVFDEYCGHQLNHGVTVVGYGED 291
Y + + +YA FQ Y+ GVF CG LNH +T++GYG+D
Sbjct: 232 YSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYGQD 291
Query: 292 -HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G +YW+VKNSWG+SWGE GYIRMAR SS G+CGI M YP
Sbjct: 292 SSGTQYWIVKNSWGSSWGERGYIRMARGVSSS--GLCGIAMDPLYPT 336
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium distachyon]
Length = 473
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 150/352 (42%), Positives = 205/352 (58%), Gaps = 40/352 (11%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSE--------GYPQK--YDPQSMEERFENWLKQYSREY 73
M + LSLF L LG A + S GY Q+ P + + F +W ++S+ Y
Sbjct: 1 MAMGSKLSLFFL-SLGFVAYSSSASHNDPSVVGYSQEDLALPYKLVDLFSSWSVKHSKIY 59
Query: 74 GSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP- 132
S +E +R+ ++ N+++I N +N S+ L N+FAD+++EEF STYLG + P
Sbjct: 60 VSPEEKVKRYEVFKQNLKHIVETNRRNGSYWLGLNQFADVAHEEFKSTYLGLKTGMDGPA 119
Query: 133 RWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
R P+ + LP SVDWRK+GAVTPVK+QG+CGSCWAFS VAAVEGIN++ TGKL S
Sbjct: 120 RAPTAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIATGKLES 179
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
LSEQEL+DCD ++ GC GG+M+ AF +I G+ T+DDYPY + C+ + +
Sbjct: 180 LSEQELMDCDTTFDH-GCGGGFMDFAFAYIMGNLGIHTDDDYPYLMEEGYCKEKQPQSKV 238
Query: 249 VTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVV 286
VTI+GYE +P FQ Y GVF+ CG +L+H +T V
Sbjct: 239 VTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQFYKRGVFEGSCGTELDHALTAV 298
Query: 287 GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GYG G+ Y ++KNSWG SWGE GY R+ R + G+C I ASYP K
Sbjct: 299 GYGSSDGQDYIIMKNSWGKSWGEQGYFRIKRGTGKPE-GVCSIYSMASYPTK 349
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 201/343 (58%), Gaps = 35/343 (10%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER---FENWLKQYSREYGSE-DEWQRRF 83
+L L +++VL P+ A +S EE F+ W+ ++ + Y + E +RRF
Sbjct: 10 TILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRF 69
Query: 84 GIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL- 142
+ N+++ID N++NLS++L +FADL+ +E+ + G KP S +Y+ L
Sbjct: 70 QNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQRNLKTSRRYVPLA 129
Query: 143 ----PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
P SVDWR+EGAV+ +KDQG C SCWAFS VAAVEG+NK+ TG+L+SLSEQELVDC
Sbjct: 130 GDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDC- 188
Query: 199 VNSENQGCNG-GYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
N N GC G G M+ AF+F+ G+ +E DYPY+G C + +TI YE +
Sbjct: 189 -NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQVHLLVITIDSYEDV 247
Query: 258 PAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK 295
PA F LY +++ CG L+H + +VGYG ++G+
Sbjct: 248 PANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGSENGQD 307
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YW+V+NSWGT+WG+AGYI++ARN G+CGI M ASYP+K
Sbjct: 308 YWIVRNSWGTTWGDAGYIKIARNFEDPK-GLCGIAMLASYPIK 349
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 141/300 (47%), Positives = 177/300 (59%), Gaps = 40/300 (13%)
Query: 61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS 120
RFE+W+ ++ + Y S +E RF ++ N+ +ID N + S+ L N+FADLS+EEF S
Sbjct: 48 RFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLGLNEFADLSHEEFKS 107
Query: 121 TYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINK 180
+ LP SVDWRK+GAVT VK+QG CGSCWAFS VAAVEGIN+
Sbjct: 108 KDVA----------------DLPESVDWRKKGAVTHVKNQGACGSCWAFSTVAAVEGINQ 151
Query: 181 LKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
+ TG L +LSEQEL+DCD + N GCNGG M+ AF FI GG+ EDDYPY + C+
Sbjct: 152 IVTGNLTTLSEQELIDCDT-TFNSGCNGGLMDYAFAFIASNGGLHKEDDYPYLMEEGTCE 210
Query: 241 TDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQ 278
K VTI+GYE +P + FQ YS GVF+ CG +
Sbjct: 211 EQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYSGGVFNGPCGTE 270
Query: 279 LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
L+HGV VGYG G Y +VKNSWG WGE GYIRM RN+ + G+CGI ASYP K
Sbjct: 271 LDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYIRMKRNTGKTE-GLCGINKMASYPTK 329
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 180/316 (56%), Gaps = 44/316 (13%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFIS 120
+E W + + R + E RRFG + NV++I N + + ++L N+F D+ EEF S
Sbjct: 88 YERW-QTHHRVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRS 146
Query: 121 TYLGYN----KPYNEPRWPSVQYLGL--------PASVDWRKEGAVTPVKDQGQCGSCWA 168
T+ + + P + G P SVDWR+EGAVT VKDQG CGSCWA
Sbjct: 147 TFADSRINDLRRQDSPAARAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWA 206
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS V AVEGIN ++TG L SLSEQEL+DCD ++ GC GG ME AFEFI GG+TTE
Sbjct: 207 FSTVVAVEGINAIRTGSLASLSEQELIDCD--TDENGCQGGLMENAFEFIKSFGGITTEA 264
Query: 229 DYPYRGKNDRCQTDKTKH---HAVTITGYEAIPA----------------------RYAF 263
YPYR N C D+ + V I G++ +PA AF
Sbjct: 265 AYPYRASNGTCDGDRARRGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAF 324
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
Q YS GVF CG L+HGV VGYG D G YW+VKNSWGTSWGE GYIRM R +
Sbjct: 325 QFYSEGVFTGDCGTDLDHGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRG--AG 382
Query: 323 NIGICGILMQASYPVK 338
N G+CGI M+AS+P+K
Sbjct: 383 NGGLCGIAMEASFPIK 398
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 140/317 (44%), Positives = 185/317 (58%), Gaps = 38/317 (11%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+++ E +E W Q+ +R+ G E RRF ++ NV+ I N ++ +KL N+F D+
Sbjct: 42 EALWELYERWRGQHRVARDLG---EKARRFNVFKDNVRLIHEFNRRDEPYKLRLNRFGDM 98
Query: 114 SNEEFISTYLGYNKPYNE------PRWPSVQYLG---LPASVDWRKEGAVTPVKDQGQCG 164
+ +EF Y ++ R Y G LPA+VDWR++GAV VKDQGQCG
Sbjct: 99 TADEFRRAYASSRVSHHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAVKDQGQCG 158
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS +AAVEGIN ++T L +LSEQ+LVDCD + N GC+GG M+ AF++I K GGV
Sbjct: 159 SCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGV 218
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA---------------------- 262
YPYR + C++ AVTI GYE +PA
Sbjct: 219 AASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSH 278
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GVF CG +L+HGV VGYG G KYW+V+NSWG WGE GYIRM R+ S
Sbjct: 279 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDV-S 337
Query: 322 SNIGICGILMQASYPVK 338
+ G+CGI M+ASYP+K
Sbjct: 338 AKEGLCGIAMEASYPIK 354
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 183/318 (57%), Gaps = 48/318 (15%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFIS 120
+E W + + R + E RRFG + NV++I N + + ++L N+F D+ EEF S
Sbjct: 44 YERW-QTHHRVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRS 102
Query: 121 TYLGYNKPYNEPRW---PSVQYLGLPA-----------SVDWRKEGAVTPVKDQGQCGSC 166
T+ + N+ R P+ + +P SVDWR+EGAVT VKDQG CGSC
Sbjct: 103 TFA--DSRINDLRRQDSPAARAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSC 160
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V AVEGIN ++TG L SLSEQEL+DCD ++ GC GG ME AFEFI GG+TT
Sbjct: 161 WAFSTVVAVEGINAIRTGSLASLSEQELIDCD--TDENGCQGGLMENAFEFIKSFGGITT 218
Query: 227 EDDYPYRGKNDRCQTDKTKH---HAVTITGYEAIPA----------------------RY 261
E YPYR N C D+ + V I G++ +PA
Sbjct: 219 EAAYPYRASNGTCDGDRARRGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQ 278
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AFQ YS GVF CG L+HGV VGYG D G YW+VKNSWGTSWGE GYIRM R
Sbjct: 279 AFQFYSEGVFTGDCGTDLDHGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRG-- 336
Query: 321 SSNIGICGILMQASYPVK 338
+ N G+CGI M+AS+P+K
Sbjct: 337 AGNGGLCGIAMEASFPIK 354
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 198/337 (58%), Gaps = 39/337 (11%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
L LFL + P+ A ++ + DP M +RFE W+ +Y R Y DE RRF I+ +N
Sbjct: 10 LFLFLCVMWASPSAASAD---EPSDP--MMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNN 64
Query: 90 VQYIDYINSQNL-SFKLTDNKFADLSNEEFISTYLG-YNKPYNEPRWPSVQY-----LGL 142
V +I+ NS+N S+ L N+F D++N EFI+ Y G ++P N R P V + +
Sbjct: 65 VNHIETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPLNIEREPVVSFDDVDISAV 124
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P S+DWR GAVT VK+Q CG+CWAF+A+A VE I K+K G L LSEQ+++DC ++
Sbjct: 125 PQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC---AK 181
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---- 258
GC GG+ +AFEFI GV + YPY+ C+T+ + A ITGY +P
Sbjct: 182 GYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTCKTNGVPNSAY-ITGYARVPRNNE 240
Query: 259 -----------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVK 300
A FQ Y GVF+ CG LNH VT +GYG+D +G+KYW+VK
Sbjct: 241 SSMMYAVSKQPITVAVDANANFQYYKSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVK 300
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWG WGEAGYIRMAR+ SS+ GICGI + + YP
Sbjct: 301 NSWGARWGEAGYIRMARDVSSSS-GICGIAIDSLYPT 336
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 141/344 (40%), Positives = 203/344 (59%), Gaps = 36/344 (10%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER---FENWLKQYSREYGSE-DEWQRRF 83
+L L +++VL P+ A +S EE F+ W+ ++ + Y + E +RRF
Sbjct: 10 TILFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFIFQMWMSKHGKTYTNALGEKERRF 69
Query: 84 GIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL- 142
+ N+++ID N++NLS++L +FADL+ +E+ + G KP S +Y+ L
Sbjct: 70 QNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDLFPGSPKPKQRNLKTSRRYVPLA 129
Query: 143 ----PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
P SVDWR+EGAV+ +KDQG C SCWAFS VAAVEG+NK+ TG+L+SLSEQELVDC
Sbjct: 130 GDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQELVDC- 188
Query: 199 VNSENQGCNG-GYMEKAFEFITKIGGVTTEDDYPYRGKNDRC-QTDKTKHHAVTITGYEA 256
N N GC G G M+ AF+F+ G+ +E DYPY+G C + T + +TI YE
Sbjct: 189 -NLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITIDSYED 247
Query: 257 IPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE 294
+PA F LY +++ CG L+H + +VGYG ++G+
Sbjct: 248 VPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGSENGQ 307
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YW+V+NSWGT+WG+AGYI++ARN G+CGI M ASYP+K
Sbjct: 308 DYWIVRNSWGTTWGDAGYIKIARNFEDPK-GLCGIAMLASYPIK 350
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 141/300 (47%), Positives = 180/300 (60%), Gaps = 41/300 (13%)
Query: 78 EWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLG-------YNKPY 129
E RRFG + SNV +I N + + ++L N+F D+S EF +T+ G + P
Sbjct: 61 EKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGDMSQAEFRATFAGSRVSDRRRDGPA 120
Query: 130 NEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTG 184
P P Y LP SVDWR++GAVT VK+QG+CGSCWAFS V +VEGIN ++TG
Sbjct: 121 TPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQGKCGSCWAFSTVVSVEGINAIRTG 180
Query: 185 KLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKT 244
KLVSLSEQEL+DCD ++N GC GG M+ AFE+I K GG+TTE YPYR N C+ K
Sbjct: 181 KLVSLSEQELIDCDT-ADNDGCEGGLMDNAFEYIKKNGGLTTEAAYPYRAANGTCKAAKV 239
Query: 245 KHHA---VTITGYEAIPARY----------------------AFQLYSHGVFDEYCGHQL 279
+ V I G++ +PA AF YS GVF CG +L
Sbjct: 240 AKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGIDASGKAFMFYSEGVFTGECGTEL 299
Query: 280 NHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+HGV VVGYG + G+ YW VKNSWG SWGE GYIR+ ++S + G+CGI M+ASY VK
Sbjct: 300 DHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEKGYIRVEKDSGAEG-GLCGIAMEASYAVK 358
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 143/335 (42%), Positives = 192/335 (57%), Gaps = 54/335 (16%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNK 109
D SM ERF+ W Y++ Y + E +RRF + + N+ YI+ N++ L+++L +
Sbjct: 42 DDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAAGLTYELGETA 101
Query: 110 FADLSNEEFISTYLGYNKPYNEPRWPSV-------------------QYLGL----PASV 146
+ DL+N+EF++ Y P P SV Y+ L PASV
Sbjct: 102 YTDLTNQEFMAMYTA-PAPAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSTSAPASV 160
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGC 206
DWR GAVTPVK+QG+CGSCWAFS VA VEGI +++TGKLVSLSEQELVDCD + + GC
Sbjct: 161 DWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD--TLDDGC 218
Query: 207 NGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA---- 262
+GG +A +I GG+TTE DYPY G D C K H+AV+I G + R
Sbjct: 219 DGGISYRALRWIASNGGITTETDYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEASLA 278
Query: 263 ------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDH--GEKYWLVKNS 302
FQ Y GV++ CG LNHGVTVVGYG++ G++YW+VKNS
Sbjct: 279 NAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQEAAGGDRYWIVKNS 338
Query: 303 WGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
WG WG+ GYIRM ++ G+CGI ++ SYP+
Sbjct: 339 WGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 178/311 (57%), Gaps = 39/311 (12%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFIS 120
+E W + + R + E RRFG + N ++I N + + ++L N+F D+ EEF S
Sbjct: 42 YERW-QTHHRVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDMGREEFRS 100
Query: 121 TYLG------YNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
+ +P P P Y LP SVDWR++GAVT VK+QG+CGSCWAFS
Sbjct: 101 GFADSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTAVKNQGRCGSCWAFST 160
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
V AVEGIN ++TG LVSLSEQEL+DCD ++ GC GG ME AFEFI GG+TTE YP
Sbjct: 161 VVAVEGINAIRTGSLVSLSEQELIDCD--TDENGCQGGLMENAFEFIKSHGGITTESAYP 218
Query: 232 YRGKNDRCQTDKTKH-HAVTITGYEAIPA----------------------RYAFQLYSH 268
Y N C + + V I G++A+PA A Q YS
Sbjct: 219 YHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQALQFYSE 278
Query: 269 GVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GVF CG L+HGV VGYG D G YW+VKNSWG SWGE GYIRM R + N G+C
Sbjct: 279 GVFTGDCGTDLDHGVAAVGYGVSDDGTPYWIVKNSWGPSWGEGGYIRMQRG--TGNGGLC 336
Query: 328 GILMQASYPVK 338
GI M+AS+P+K
Sbjct: 337 GIAMEASFPIK 347
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 138/317 (43%), Positives = 186/317 (58%), Gaps = 49/317 (15%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFAD 112
D +M R E W+ QYSR Y E RRF ++ +NV++I+ N+ N F L N+FAD
Sbjct: 29 DDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGVNQFAD 88
Query: 113 LSNEEFISTYLGYNKPYNEP--------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
L+N+EF +T NK + R+ +V LPA++DWR +GAVTP+KDQGQC
Sbjct: 89 LTNDEFRATKT--NKGFKPSPVKVSTGFRYENVSVDALPATIDWRTKGAVTPIKDQGQC- 145
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
EGI K+ TGKL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+
Sbjct: 146 -----------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGL 194
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE YPY + +C++ + A T+ G+E +PA
Sbjct: 195 TTESSYPYTAADGKCKSG--SNSAATVKGFEDVPANDEAALMKAVANQPVSVAVDGGDMT 252
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQ YS GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE GY+RM ++ S
Sbjct: 253 FQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKD-IS 311
Query: 322 SNIGICGILMQASYPVK 338
G+CG+ M+ SYP +
Sbjct: 312 DKRGMCGLAMEPSYPTE 328
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 134/308 (43%), Positives = 179/308 (58%), Gaps = 34/308 (11%)
Query: 61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-SFKLTDNKFADLSNEEFI 119
R E W+ ++ R Y E E RR ++ +N + ID N+ S +L N+FADL+ EEF
Sbjct: 37 RHEKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVEEFR 96
Query: 120 STYLGYNKPYNEP-------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
+ G +P P R+ + SVDWR GAVT VKDQG CG CWAFSAV
Sbjct: 97 AARTGL-RPRPAPSAGAGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGACGCCWAFSAV 155
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AAVEG+NK++TG+LVSLSEQELVDCDV+ +QGC+GG M+ AF+F+ + GG+ +E YPY
Sbjct: 156 AAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPY 215
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
+G++ C++ A +I G+E +P AF+ Y GV
Sbjct: 216 QGRDGPCRSSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDMAFRFYDSGV 275
Query: 271 FDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
CG LNH +T VGYG + G +YWL+KNSWG SWGE GY+R+ R G+CG+
Sbjct: 276 LGGACGTDLNHAITAVGYGTANDGTRYWLMKNSWGASWGEGGYVRIRRGVRGE--GVCGL 333
Query: 330 LMQASYPV 337
SYPV
Sbjct: 334 AKLPSYPV 341
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 138/328 (42%), Positives = 185/328 (56%), Gaps = 39/328 (11%)
Query: 48 GYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI----DYINSQNL-- 101
G + E +FE W ++ + Y + E R ++ N ++ D + S
Sbjct: 25 GRDESVSASDYEAQFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGG 84
Query: 102 -SFKLTDNKFADLSNEEFISTYLGY----NKPYNEPRWPSVQYLG----LPASVDWRKEG 152
S+ L N FADL+++EF + LG P P + G +P ++DWR+ G
Sbjct: 85 PSYTLALNAFADLTHDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSG 144
Query: 153 AVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYME 212
AVT VKDQG CG+CW+FSA A+EGINK+ TG L+SLSEQEL+DCD S N GC GG M
Sbjct: 145 AVTKVKDQGSCGACWSFSATGAMEGINKITTGSLLSLSEQELIDCD-RSYNTGCGGGLMT 203
Query: 213 KAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------ 260
A++F+ K GG+ TEDDYP+R + C +K K H VTI GY+ +P+
Sbjct: 204 YAYKFVIKNGGIDTEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQ 263
Query: 261 ----------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEA 310
AFQLYS G+FD C L+H V +VGYG + G+ YW+VKNSWG WG
Sbjct: 264 PISVGICGSARAFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMK 323
Query: 311 GYIRMARNSPSSNIGICGILMQASYPVK 338
GY+ M RN+ SS+ GICGI M AS+P K
Sbjct: 324 GYMHMHRNTGSSS-GICGINMMASFPTK 350
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 143/311 (45%), Positives = 188/311 (60%), Gaps = 38/311 (12%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEE 117
+ W Q+ +E+E R+ + N++YID N+ SF+L N+FA L+NEE
Sbjct: 43 YAEWTAQHGSPITNEEEG--RYEAFRDNLRYIDEHNAAADAGIHSFRLGLNRFAGLTNEE 100
Query: 118 FISTYLGY---NKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ-CGSCWA 168
+ + YLG + + R PS +Y LP SVDWR++GAV VKDQG+ CGS WA
Sbjct: 101 YRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVDWREKGAVGKVKDQGRSCGSAWA 160
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSA+AAVE IN++ TG+L+SLSEQEL+DCD S N GC+GG M+ AFEFI GG+ T++
Sbjct: 161 FSAIAAVESINQIVTGELISLSEQELMDCDT-SYNAGCDGGLMDDAFEFIISNGGIDTDE 219
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAI--------------PARYA-------FQLYS 267
DYPY+ +ND C +K AVTI YE + P A FQLY
Sbjct: 220 DYPYKARNDSCDANKRNRKAVTIDDYEDLRMNEKSLQKAVSNQPVSVAIEAGGRDFQLYK 279
Query: 268 HGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
G+F CG L+H T+VGYG ++G YW+VK S+GTSWGE+GY RM RN ++ G C
Sbjct: 280 SGIFTGTCGTDLDHATTIVGYGSENGTDYWIVKESYGTSWGESGYARMERNIKETS-GKC 338
Query: 328 GILMQASYPVK 338
GI M SYPVK
Sbjct: 339 GIAMLPSYPVK 349
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 139/326 (42%), Positives = 196/326 (60%), Gaps = 44/326 (13%)
Query: 49 YPQKYDPQSMEE---RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFK 104
Y +K++ ++ +E FE+WL +Y + Y + E +RRF I+ N++++D N+ N S+K
Sbjct: 32 YGEKWEQRTNDEVIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYK 91
Query: 105 LTDNKFADLSNEEFISTYLGYNKPYN----------EPRWPSVQYLGLPASVDWRKEGAV 154
+ N+F+DL++ E+ S YLG +N EPR LP SVDWRK+GAV
Sbjct: 92 VGLNQFSDLTDAEYSSIYLG--TKFNIRMTNVSDRYEPRVGDQ----LPDSVDWRKKGAV 145
Query: 155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
VK+QG CGSCW F+++AAVEGINK+ TG L+SLSEQE+VDC N GCNGG + A
Sbjct: 146 LGVKNQGNCGSCWTFASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGA 205
Query: 215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------- 260
++FI GG+ TE +YPY G++ C +K VTI YE +P+
Sbjct: 206 YQFIINNGGINTEANYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPV 265
Query: 261 --------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGY 312
AF+ Y G+F+ CG +++HGVT+VGYG + G+ YW+V+NSWG +WGE+GY
Sbjct: 266 SVVIASNSTAFKSYKSGIFNGPCGPRIDHGVTIVGYGTEGGKDYWIVRNSWGPNWGESGY 325
Query: 313 IRMARNSPSSNIGICGILMQASYPVK 338
+RM RN S G C I YPVK
Sbjct: 326 VRMQRNVGGS--GKCFIARAPVYPVK 349
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 142/302 (47%), Positives = 176/302 (58%), Gaps = 35/302 (11%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKP 128
Y + Y S +E RRF ++ N+ +ID IN + S+ L N+FADL+++EF +TYLG P
Sbjct: 36 YRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTSYWLGLNEFADLTHDEFKATYLGLTPP 95
Query: 129 ----------YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGI 178
E R+ + +P +DWRK+ AVT VK+QGQCGSCWAFS VAAVEGI
Sbjct: 96 PTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAVEGI 155
Query: 179 NKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR 238
N + TG L SLSEQEL+DC + N GCNGG M+ AF +I GG+ TE+ YPY +
Sbjct: 156 NAIVTGNLTSLSEQELIDCSTDG-NNGCNGGLMDYAFSYIASTGGLRTEEAYPYAMEEGD 214
Query: 239 CQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFDEYCG 276
C K VTI+GYE +PA FQ YS GVFD CG
Sbjct: 215 CDEGKGA-AVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVFDGPCG 273
Query: 277 HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
QL+HGVT VGYG G+ Y +VKNSWG WGE GYIRM R + G+CGI ASYP
Sbjct: 274 EQLDHGVTAVGYGTSKGQDYIIVKNSWGPHWGEKGYIRMKRGTGKGE-GLCGINKMASYP 332
Query: 337 VK 338
K
Sbjct: 333 TK 334
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 140/291 (48%), Positives = 174/291 (59%), Gaps = 39/291 (13%)
Query: 83 FGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG----YNKPYNEPRWPS-- 136
F ++ +NV+ I N ++ +KL N+F D++ +EF Y G +++ + R S
Sbjct: 70 FNVFKANVRLIHEFNRRDEPYKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSSA 129
Query: 137 ------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
+PASVDWR++GAVT VKDQGQCGSCWAFS +AAVEGIN +KT L SLS
Sbjct: 130 SASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLS 189
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQ+LVDCD + N GCNGG M+ AF++I K GGV ED YPYR + C+ K+ VT
Sbjct: 190 EQQLVDCDTKA-NAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASCK--KSPAPVVT 246
Query: 251 ITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
I GYE +PA FQ YS GVF CG +L+HGV VGY
Sbjct: 247 IDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGY 306
Query: 289 G-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G G KYWLVKNSWG WGE GYIRMAR+ + G CGI M+ASYPVK
Sbjct: 307 GVTADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKE-GHCGIAMEASYPVK 356
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 139/314 (44%), Positives = 176/314 (56%), Gaps = 57/314 (18%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D +M R E W+ QYSR Y E RRF KFADL
Sbjct: 29 DDSAMVARHEQWMAQYSRVYKDASEKARRF-------------------------KFADL 63
Query: 114 SNEEF--ISTYLGYN----KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+N EF + T G+ K R+ +V LP ++DWR +G VTP+KDQGQCG C
Sbjct: 64 TNHEFRSVKTNKGFKSSNMKILTGFRYENVSADALPTTIDWRTKGVVTPIKDQGQCGCCS 123
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA EGI K+ TGKLVSL++QELVDCDV+ E+QGC GG M+ AF+FI K GG+TTE
Sbjct: 124 AFSAVAATEGIVKISTGKLVSLADQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTE 183
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
YPY + +C + + A TI GYE +PA F+
Sbjct: 184 SSYPYTAADGKCNSG--SNSAATIKGYEDVPANDEAALMKAMANQPVSVAVDGGDMTFRF 241
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE GY+RM ++ S
Sbjct: 242 YSGGVMTGSCGTDLDHGIAAIGYGKTSDGTKYWLMKNSWGTTWGENGYLRMEKD-ISDKR 300
Query: 325 GICGILMQASYPVK 338
G+CG+ M+ SYP K
Sbjct: 301 GMCGLAMEPSYPTK 314
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 190/322 (59%), Gaps = 39/322 (12%)
Query: 53 YDPQSMEER------FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-SFKL 105
Y P+ + + FE W+ +Y + YGS +E RRF ++ N+ +ID N + + S+ L
Sbjct: 57 YSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWL 116
Query: 106 TDNKFADLSNEEFISTYLGY-NKPYNEPRW----PSVQYLGLPASVDWRKEGAVTPVKDQ 160
N FADL+++EF +TYLG K + R+ +PASVDWRK+GAVT VK+Q
Sbjct: 117 GLNAFADLTHDEFKATYLGLLPKRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQ 176
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
GQCGSCWAFS VAAVEGIN++ TG L SLSEQ+LVDC + N GC+GG M+ AF FI
Sbjct: 177 GQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDG-NNGCSGGVMDNAFSFIAT 235
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHA--VTITGYEAIPAR------------------ 260
G+ +E+ YPY + C D+ + VTI+GYE +PA
Sbjct: 236 GAGLRSEEAYPYLMEEGDCD-DRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAI 294
Query: 261 ----YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMA 316
FQ YS GVFD CG +L+HGV VGYG G+ Y +VKNSWGT WGE GYIRM
Sbjct: 295 EASGRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWGEKGYIRMK 354
Query: 317 RNSPSSNIGICGILMQASYPVK 338
R + G+CGI ASYP K
Sbjct: 355 RGTGKPE-GLCGINKMASYPTK 375
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 133/308 (43%), Positives = 189/308 (61%), Gaps = 31/308 (10%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLS 114
++ E FE W ++ + Y S +E R G+++ N +++ + N+ N S+ L+ N +ADL+
Sbjct: 23 SNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLT 82
Query: 115 NEEFISTYLGYNKPYNE-----PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
+ EF + LG++ P+ PS+ +P S+DWRK+GAVT VKDQG CG+CW+F
Sbjct: 83 HHEFKVSRLGFSPALRNFRPVLPQEPSLPR-DVPDSLDWRKKGAVTAVKDQGSCGACWSF 141
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
SA A+EGIN++ TG L+SLSEQEL+DCD S N GC GG M+ A++F+ G+ TE+D
Sbjct: 142 SATGAMEGINQIMTGSLISLSEQELIDCD-RSYNSGCGGGLMDYAYQFVISNHGIDTEND 200
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYS 267
YPY+ ++ C+ DK + + VTI GY IP + AFQLYS
Sbjct: 201 YPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERAFQLYS 260
Query: 268 HGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
G+F C L+H V +VGYG ++G YW+VKNSWG SWG GY+ M RNS +S G+C
Sbjct: 261 KGIFSGPCSTSLDHAVLIVGYGSENGVDYWIVKNSWGKSWGMDGYMHMQRNSGNSE-GVC 319
Query: 328 GILMQASY 335
GI ASY
Sbjct: 320 GINKLASY 327
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 147/322 (45%), Positives = 190/322 (59%), Gaps = 39/322 (12%)
Query: 53 YDPQSMEER------FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-SFKL 105
Y P+ + + FE W+ +Y + YGS +E RRF ++ N+ +ID N + + S+ L
Sbjct: 71 YSPEDLTQHDRLVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWL 130
Query: 106 TDNKFADLSNEEFISTYLGY-NKPYNEPRW----PSVQYLGLPASVDWRKEGAVTPVKDQ 160
N FADL+++EF +TYLG K + R+ +PASVDWRK+GAVT VK+Q
Sbjct: 131 GLNAFADLTHDEFKATYLGLLPKRTSGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQ 190
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
GQCGSCWAFS VAAVEGIN++ TG L SLSEQ+LVDC + N GC+GG M+ AF FI
Sbjct: 191 GQCGSCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDG-NNGCSGGVMDNAFSFIAT 249
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHA--VTITGYEAIPAR------------------ 260
G+ +E+ YPY + C D+ + VTI+GYE +PA
Sbjct: 250 GAGLRSEEAYPYLMEEGDCD-DRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAI 308
Query: 261 ----YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMA 316
FQ YS GVFD CG +L+HGV VGYG G+ Y +VKNSWGT WGE GYIRM
Sbjct: 309 EASGRHFQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQDYIIVKNSWGTHWGEKGYIRMK 368
Query: 317 RNSPSSNIGICGILMQASYPVK 338
R + G+CGI ASYP K
Sbjct: 369 RGTGKPE-GLCGINKMASYPTK 389
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 135/306 (44%), Positives = 178/306 (58%), Gaps = 31/306 (10%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEF 118
E +FE W ++ R Y + E R ++ N ++ N S+ L N FADL+++EF
Sbjct: 35 EAQFEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEF 94
Query: 119 ISTYLGYNKPYNEPRWPSVQYLGL-------PASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
+ LG R YLG+ P +VDWR+ GAVT VKDQG CG+CW+FSA
Sbjct: 95 RAARLGRLAAAGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSA 154
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
A+EGINK+KTG L+SLSEQEL+DCD S N GC GG M+ A++F+ K GG+ TE DYP
Sbjct: 155 TGAMEGINKIKTGSLISLSEQELIDCD-RSYNSGCGGGLMDYAYKFVVKNGGIDTEADYP 213
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHG 269
YR + C +K K VTI GY+ +PA AFQLYS G
Sbjct: 214 YRETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKG 273
Query: 270 VFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
+FD C L+H + +VGYG + G+ YW+VKNSWG SWG GY+ M RN+ +SN G+CGI
Sbjct: 274 IFDGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSN-GVCGI 332
Query: 330 LMQASY 335
S+
Sbjct: 333 NQMPSF 338
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/320 (44%), Positives = 188/320 (58%), Gaps = 39/320 (12%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+ + E FE ++ +Y + Y S +E RRF ++ N+ +ID N + + L N+FADL++
Sbjct: 46 ERLMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKKITGYWLGLNEFADLTH 105
Query: 116 EEFISTYLGYN-----KPYNEP--RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
+EF + YLG + N+ R+ V+ LP VDWRK+GAVT VK+QGQCGSCWA
Sbjct: 106 DEFKAAYLGLTLTPARRNSNDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQCGSCWA 165
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS VAAVEGIN + TG L LSEQEL+DCD + N GC+GG M+ AF +I GG+ TE+
Sbjct: 166 FSTVAAVEGINAIVTGNLTRLSEQELIDCDTDG-NNGCSGGLMDYAFSYIAANGGLHTEE 224
Query: 229 DYPYRGKNDRCQTDKTKHH-------AVTITGYEAIP----------------------A 259
YPY + C+ T+ AVTI+GYE +P +
Sbjct: 225 SYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALAHQPVSVAIEAS 284
Query: 260 RYAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
FQ YS GVFD CG +L+HGVT VGYG G Y +VKNSWG+ WGE GYIRM R
Sbjct: 285 GRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASKGHDYIIVKNSWGSHWGEKGYIRMRRG 344
Query: 319 SPSSNIGICGILMQASYPVK 338
+ + G+CGI ASYP K
Sbjct: 345 TGKHD-GLCGINKMASYPTK 363
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 152/356 (42%), Positives = 193/356 (54%), Gaps = 56/356 (15%)
Query: 37 VLGIPAGAWS-EGYPQK--YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI 93
L P+G +S GY ++ +S+ E FE WL ++ R Y S +E RRF ++ N+ +I
Sbjct: 31 ALARPSGDFSIVGYSEEDLSSHESLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHI 90
Query: 94 DYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY-------------- 139
D N + S+ L N+FADL+++EF +TYLG +
Sbjct: 91 DETNRKVSSYWLGLNEFADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDG 150
Query: 140 LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDV 199
LP SVDWR +GAVT VK+QGQCGSCWAFS VAAVEGIN++ TG L +LSEQEL+DCD
Sbjct: 151 ASLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDT 210
Query: 200 NSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH------------ 247
+ N GCNGG M+ AF +I GG+ TE+ YPY + CQ +
Sbjct: 211 DG-NNGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDD 269
Query: 248 --AVTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGV 283
VTI+GYE +P + FQ YS GVFD CG QL+HGV
Sbjct: 270 AAVVTISGYEDVPRNNEQALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGV 329
Query: 284 TVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VGYG G Y +VKNSWG SWGE GYIRM R + G+CGI ASYP K
Sbjct: 330 AAVGYGTAAKGHDYIIVKNSWGPSWGEKGYIRMRRGTGKRQ-GLCGINKMASYPTK 384
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 185/321 (57%), Gaps = 38/321 (11%)
Query: 56 QSMEERFENWLKQY----SREYGSEDE-WQRRFGIYSSNVQYIDYINSQNLSFKLTDNKF 110
+S E F+ W+ +R Y S E ++RRF I+ N+++ N+++ S L+ +
Sbjct: 40 ESPREAFDFWVHTVKPPSNRAYASSAEVYERRFNIWLDNLRFAHEYNARHTSHWLSMGVY 99
Query: 111 ADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGL--PASVDWRKEGAVTPVKDQGQCGS 165
ADLS +E+ S LGYN ++ R Y G P VDW GAVTPVKDQ CGS
Sbjct: 100 ADLSQDEYRSKALGYNAHLHKKRPLRAAPFLYKGTVPPEEVDWVAGGAVTPVKDQLLCGS 159
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFS AVEG N + TGKLVSLSEQ LVDCD + GC GG+M+ AF+FI GG+
Sbjct: 160 CWAFSTTGAVEGANAIATGKLVSLSEQMLVDCD-REYDTGCRGGFMDSAFDFIVNNGGID 218
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA----------------------RYAF 263
TEDDYPYR ++ CQ ++T+ H VTI GY+ +P + AF
Sbjct: 219 TEDDYPYRAEDGICQDNRTRRHVVTIDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAF 278
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGE----DHGEKYWLVKNSWGTSWGEAGYIRMARN- 318
QLY GVFD CG L+H V VVGYG H YWLVKNSWG WGE GYIR+ RN
Sbjct: 279 QLYGGGVFDAECGTALDHAVLVVGYGTASNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNL 338
Query: 319 SPSSNIGICGILMQASYPVKR 339
+ G CG+ M AS+P+K+
Sbjct: 339 GKDAPEGQCGLAMYASFPIKK 359
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 139/347 (40%), Positives = 202/347 (58%), Gaps = 39/347 (11%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKY-DPQSMEER--FENWLKQYSREYGSEDEWQ 80
M N + S +L V+ + A ++ P D +++E + FE+W ++ + Y S+ E
Sbjct: 1 MASNMIASTLILLVV-VGATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDLEKA 59
Query: 81 RRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLG-YNKPYNEPRWPS-- 136
RR I+S + YI+ N+Q N +F L NKF+DL+N EF + ++G + +P + R P+
Sbjct: 60 RRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAED 119
Query: 137 --VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
V LP S+DWR++GAVTP+KDQG CGSCWAFSA+A++E + L T +LVSLSEQ+L
Sbjct: 120 EDVDVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQL 179
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK--HHAVTIT 252
+DCD + + GC+GG ME AF+F+ K GGVTTE YPY G C +K + IT
Sbjct: 180 MDCD--TVDAGCDGGLMETAFKFVVKNGGVTTEASYPYTGSVGSCNANKVAIINKVAEIT 237
Query: 253 GYEAIPARYA----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
G++ + A FQ Y G+ CG L+HGV ++GYG
Sbjct: 238 GFKVVTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSLDHGVLLIGYGT 297
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+ G YW++KNSWGTSWGE G++++ R GICG+ +SYP
Sbjct: 298 EGGMPYWIIKNSWGTSWGEDGFMKIERKDGD---GICGMNGDSSYPT 341
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 136/307 (44%), Positives = 179/307 (58%), Gaps = 32/307 (10%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEF 118
E +FE W ++ R Y + E R ++ N ++ N S+ L N FADL+++EF
Sbjct: 35 EAQFEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEF 94
Query: 119 ISTYLGYNKPYNEP-RWPSVQYLGL-------PASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+ LG P R YLG+ P +VDWR+ GAVT VKDQG CG+CW+FS
Sbjct: 95 RAARLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFS 154
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
A A+EGINK+KTG L+SLSEQEL+DCD S N GC GG M+ A++F+ K GG+ TE DY
Sbjct: 155 ATGAMEGINKIKTGSLISLSEQELIDCD-RSYNSGCGGGLMDYAYKFVVKNGGIDTEADY 213
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSH 268
PYR + C +K K VTI GY+ +PA AFQLYS
Sbjct: 214 PYRETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSK 273
Query: 269 GVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
G+FD C L+H + +VGYG + G+ YW+VKNSWG SWG GY+ M RN+ +SN G+CG
Sbjct: 274 GIFDGPCPTSLDHAILIVGYGSEGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSN-GVCG 332
Query: 329 ILMQASY 335
I S+
Sbjct: 333 INQMPSF 339
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 191/309 (61%), Gaps = 37/309 (11%)
Query: 62 FENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS 120
F+ W+ ++ + Y + E +RRF + N+++ID N++NLS++L +FADL+ +E+
Sbjct: 48 FQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRD 107
Query: 121 TYLGYNKPYNEPRWPSVQYLGL-----PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
+ G KP S +Y+ L P SVDWR EGAV+ +KDQG C SCWAFS VAAV
Sbjct: 108 LFPGSPKPKQRNLRISRRYVPLDGDQLPESVDWRNEGAVSAIKDQGTCNSCWAFSTVAAV 167
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNG-GYMEKAFEFITKIGGVTTEDDYPYRG 234
EGINK+ TG+LVSLSEQELVDC N N GC G G M+ AF+F+ GG+ ++ DYPY+G
Sbjct: 168 EGINKIVTGELVSLSEQELVDC--NLVNNGCYGSGTMDAAFQFLINNGGLDSDTDYPYQG 225
Query: 235 KNDRC-QTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVF 271
C + + T + +TI YE +PA F LY G++
Sbjct: 226 SQGYCNRKESTSNKIITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSGIY 285
Query: 272 DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGI 329
+ CG L+H + +VGYG ++G+ YW+V+NSWGT+WG+AGY +MARN PS G+CGI
Sbjct: 286 NGPCGTDLDHALVIVGYGSENGQDYWIVRNSWGTTWGDAGYAKMARNFEYPS---GVCGI 342
Query: 330 LMQASYPVK 338
M ASYPVK
Sbjct: 343 AMLASYPVK 351
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 184/319 (57%), Gaps = 43/319 (13%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFAD 112
D M R++ W+ QY R+Y + E RF ++ +N ++ID N+ + L N+FAD
Sbjct: 51 DEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFAD 110
Query: 113 LSNEEFISTYLGYNKPYNEP----------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
L+++EF + Y G KP P ++ + L VDWR++GAVTPVK+QGQ
Sbjct: 111 LTSKEFAAMYTGLRKPAAVPSGAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQ 170
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CG CWAFSAV A+EG+ + TG LVSLSEQ+++DCD + NQGCNGGYM+ AF+++ G
Sbjct: 171 CGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNG 230
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA----------------------R 260
GVTTED YPY CQ A TI+G++ +P+
Sbjct: 231 GVTTEDAYPYSAVQGTCQ---NVQPAATISGFQDLPSGDENALANAVANQPVSVGVDGGS 287
Query: 261 YAFQLYSHGVFD-EYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
FQ Y G++D + CG +NH VT +GYG +D G +YW++KNSWGT WGE G++++
Sbjct: 288 SPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQL--- 344
Query: 319 SPSSNIGICGILMQASYPV 337
+G CGI ASYP
Sbjct: 345 --QMGVGACGISTMASYPT 361
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 186/312 (59%), Gaps = 35/312 (11%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFAD 112
DP M ERFE W+ +Y R Y E RRF I+ +NV +I+ N+++ S+ L N+F D
Sbjct: 4 DP--MMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTD 61
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQGQCGSCW 167
++N EF++ Y G + P N R P V + +P S+DWR GAVT VK+QG CGSCW
Sbjct: 62 MTNNEFLARYTGASLPLNIERDPVVSFDDVDISAVPQSIDWRDYGAVTSVKNQGSCGSCW 121
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSA+A VEGI K+K G L+SLSEQE++DC + + GC+GG++ KA++FI GVT+
Sbjct: 122 AFSAIATVEGIYKIKAGNLISLSEQEVLDCAL---SYGCDGGWVNKAYDFIISNNGVTSF 178
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGY---------------------EAIPARYAFQLY 266
+ PY+G C + + A ITGY I A FQ Y
Sbjct: 179 ANLPYKGYKGPCNHNDLPNKAY-ITGYTYVQSNNERSMMIAVANQPIAALIDAGGDFQYY 237
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
GVF CG LNH +TV+GYG+ G KYW+VKNSWGTSWGE GYIRMAR+ SS G
Sbjct: 238 KSGVFTGSCGTSLNHAITVIGYGQTSSGTKYWIVKNSWGTSWGERGYIRMARDV-SSPYG 296
Query: 326 ICGILMQASYPV 337
+CGI M +P
Sbjct: 297 LCGIAMAPLFPT 308
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 186/318 (58%), Gaps = 38/318 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLS 114
+++ + +E W + R E RRFG + SN +I N + + ++L N+F D+
Sbjct: 40 EALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDMD 98
Query: 115 NEEFISTYLG---YNKPYNEPRWPSVQYLGL-----PASVDWRKEGAVTPVKDQGQCGSC 166
EF +T++G + P P P Y L P SVDWR++GAVT VKDQG+CGSC
Sbjct: 99 QAEFRATFVGDLRRDTPSKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSC 158
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V +VEGIN ++TG LVSLSEQEL+DCD ++N GC GG M+ AFE+I GG+ T
Sbjct: 159 WAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGGLIT 217
Query: 227 EDDYPYRGKNDRCQTDKTKHHA---VTITGYEAIPARY---------------------- 261
E YPYR C + ++ V I G++ +PA
Sbjct: 218 EAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGK 277
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AF YS GVF CG +L+HGV VVGYG + G+ YW VKNSWG SWGE GYIR+ ++S
Sbjct: 278 AFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 321 SSNIGICGILMQASYPVK 338
+S G+CGI M+ASYPVK
Sbjct: 338 ASG-GLCGIAMEASYPVK 354
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 190/319 (59%), Gaps = 33/319 (10%)
Query: 48 GYPQKYDPQSME---ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFK 104
GY + D +SM+ E FE+W+ ++ + Y S +E RF I+ N+++ID N ++
Sbjct: 31 GYSSE-DLKSMDKLIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYW 89
Query: 105 LTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY----LGLPASVDWRKEGAVTPVKDQ 160
L N+FADLS++EF + YLG Y+ R ++ + LP SVDWRK+GAV PVK+Q
Sbjct: 90 LGLNEFADLSHQEFKNKYLGLKVDYSRRRESPEEFTYKDVELPKSVDWRKKGAVAPVKNQ 149
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G CGSCWAFS VAAVEGIN++ TG L SLSEQEL+DCD N GCNGG M+ AF FI +
Sbjct: 150 GSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSN-GCNGGLMDYAFSFIVE 208
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------- 258
GG+ E+DYPY + C+ K + VTI+GY +P
Sbjct: 209 NGGLHKEEDYPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEA 268
Query: 259 ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
+ FQ YS GVFD +CG L+HGV VGYG G Y +VKNSWG+ WGE GYIRM R
Sbjct: 269 SGRDFQFYSGGVFDGHCGSDLDHGVAAVGYGTAKGVDYIIVKNSWGSKWGEKGYIRM-RG 327
Query: 319 SPSSNIGICGILMQASYPV 337
+ + G L ASYP+
Sbjct: 328 TLETR-GNLRYLQMASYPL 345
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 145/337 (43%), Positives = 198/337 (58%), Gaps = 39/337 (11%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
L LFL + P+ A ++ + DP M +RFE W+ +Y R Y DE RRF I+ +N
Sbjct: 10 LFLFLCVMWASPSAASAD---EPSDP--MMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNN 64
Query: 90 VQYIDYINSQNL-SFKLTDNKFADLSNEEFISTYLG-YNKPYNEPRWPSVQY-----LGL 142
V +I+ NS+N S+ L N+F D++N EF++ Y G ++P N R P V + +
Sbjct: 65 VNHIETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGISRPLNIEREPVVSFDDVDISAV 124
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P S+DWR GAVT VK+Q CG+CWAF+A+A VE I K+K G L LSEQ+++DC ++
Sbjct: 125 PQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDC---AK 181
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---- 258
GC GG+ +AFEFI GV + YPY+ C+T+ + A ITGY +P
Sbjct: 182 GYGCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTCKTNGVPNSAY-ITGYARVPRNNE 240
Query: 259 -----------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVK 300
A Q Y+ GVF+ CG LNH VT +GYG+D +G+KYW+VK
Sbjct: 241 SSMMYAVSKQPITVAVDANANSQYYNSGVFNGPCGTSLNHAVTAIGYGQDSNGKKYWIVK 300
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
NSWG WGEAGYIRMAR+ SS+ GICGI + + YP
Sbjct: 301 NSWGARWGEAGYIRMARDVSSSS-GICGIAIDSLYPT 336
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 133/312 (42%), Positives = 186/312 (59%), Gaps = 35/312 (11%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEF 118
E+ E W++++ + Y E ++RF I+ N+++I+ N+ + F L+ N+F D +N+EF
Sbjct: 33 EKHEQWMEEHGKFYKDAAEKEQRFQIFKENLEFIESFNAAGDNGFNLSINQFGDQTNDEF 92
Query: 119 ISTYL-GYNKPY---------NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
+ YL G KP E + +PA++DWR+ GAVTP+K Q CGSCWA
Sbjct: 93 KANYLNGKKKPLIGVGIAAIEEESVFRYENVTEVPATMDWRERGAVTPIKHQHLCGSCWA 152
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
F+ VAA+EGI+++ TG+LVSLSEQELVDC + GCNGGY+E A +FI K GG+T+E
Sbjct: 153 FATVAAIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDACDFIVKKGGITSET 212
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPA----------------------RYAFQLY 266
+YPY + +C K ++ I GYE +PA + AFQ Y
Sbjct: 213 NYPYTRVDGKCNVRKGTYNVAKIKGYEHVPANNEKALLKAVANQPIAVYIAATKRAFQFY 272
Query: 267 SHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
S G+ CG L+H VT+VGYG D G KYWLVKNSWGT WGE GYI++ R+ + G
Sbjct: 273 SSGILKGKCGIDLDHTVTIVGYGTSDDGVKYWLVKNSWGTKWGEKGYIKIKRDVHAKE-G 331
Query: 326 ICGILMQASYPV 337
CGI M +YP+
Sbjct: 332 SCGIAMVPTYPI 343
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 186/318 (58%), Gaps = 38/318 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLS 114
+++ + +E W + R E RRFG + SN +I N + + ++L N+F D+
Sbjct: 40 EALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDMD 98
Query: 115 NEEFISTYLG---YNKPYNEPRWPSVQYLGL-----PASVDWRKEGAVTPVKDQGQCGSC 166
EF +T++G + P P P Y L P SVDWR++GAVT VKDQG+CGSC
Sbjct: 99 QAEFRATFVGDLRRDTPAKPPSVPGFMYAALNVSDLPPSVDWRQKGAVTGVKDQGKCGSC 158
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V +VEGIN ++TG LVSLSEQEL+DCD ++N GC GG M+ AFE+I GG+ T
Sbjct: 159 WAFSTVVSVEGINAIRTGSLVSLSEQELIDCDT-ADNDGCQGGLMDNAFEYIKNNGGLIT 217
Query: 227 EDDYPYRGKNDRCQTDKTKHHA---VTITGYEAIPARY---------------------- 261
E YPYR C + ++ V I G++ +PA
Sbjct: 218 EAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGK 277
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AF YS GVF CG +L+HGV VVGYG + G+ YW VKNSWG SWGE GYIR+ ++S
Sbjct: 278 AFMFYSEGVFTGDCGTELDHGVAVVGYGVAEDGKAYWTVKNSWGPSWGEQGYIRVEKDSG 337
Query: 321 SSNIGICGILMQASYPVK 338
+S G+CGI M+ASYPVK
Sbjct: 338 ASG-GLCGIAMEASYPVK 354
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 144/318 (45%), Positives = 182/318 (57%), Gaps = 48/318 (15%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFIS 120
+E W + + R + E RRFG + NV++I N + + ++L N+F D+ EEF S
Sbjct: 44 YERW-QTHHRVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRS 102
Query: 121 TYLGYNKPYNEPRW---PSVQYLGLPA-----------SVDWRKEGAVTPVKDQGQCGSC 166
T+ + N+ R P+ + +P SVDWR+EGAVT VK QG CGSC
Sbjct: 103 TFA--DSRINDLRRQDSPAARAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKVQGHCGSC 160
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V AVEGIN ++TG L SLSEQEL+DCD ++ GC GG ME AFEFI GG+TT
Sbjct: 161 WAFSTVVAVEGINAIRTGSLASLSEQELIDCD--TDENGCQGGLMENAFEFIKSFGGITT 218
Query: 227 EDDYPYRGKNDRCQTDKTKH---HAVTITGYEAIPA----------------------RY 261
E YPYR N C D+ + V I G++ +PA
Sbjct: 219 EAAYPYRASNGTCDGDRARRGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQ 278
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AFQ YS GVF CG L+HGV VGYG D G YW+VKNSWGTSWGE GYIRM R
Sbjct: 279 AFQFYSEGVFTGDCGTDLDHGVAAVGYGVGDDGTPYWIVKNSWGTSWGEGGYIRMQRG-- 336
Query: 321 SSNIGICGILMQASYPVK 338
+ N G+CGI M+AS+P+K
Sbjct: 337 AGNGGLCGIAMEASFPIK 354
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 186/314 (59%), Gaps = 32/314 (10%)
Query: 55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLS 114
P + F +W ++S+ Y S E +R+ I+ N+++I N +N S+ L N FAD++
Sbjct: 48 PNKLVGLFTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIA 107
Query: 115 NEEFISTYLGYN--------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
+EEF ++YLG +P+ + + LP +VDWRK+GAVTPVK+QG+CGSC
Sbjct: 108 HEEFKASYLGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSC 167
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS VAAVEGIN++ TGKLVSLSEQEL+DCD N+ N GC GG M+ AF +I G+ T
Sbjct: 168 WAFSTVAAVEGINQIVTGKLVSLSEQELMDCD-NTFNHGCRGGLMDFAFAYIMGNQGIYT 226
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQ 264
E+DYPY + C+ + +TITGYE +PA FQ
Sbjct: 227 EEDYPYLMEEGYCREKQPHSKVITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQ 286
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y G+FD CG Q +H +T VGYG +G+ Y ++KNSWG +WGE GY R+ R +
Sbjct: 287 FYKGGIFDGECGIQPDHALTAVGYGSYYGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPE- 345
Query: 325 GICGILMQASYPVK 338
G+C I ASYP K
Sbjct: 346 GVCDIYKIASYPTK 359
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 253 bits (646), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 138/338 (40%), Positives = 200/338 (59%), Gaps = 38/338 (11%)
Query: 31 SLFLLWVLGIPAGAWSEGYPQKY-DPQSMEER--FENWLKQYSREYGSEDEWQRRFGIYS 87
+L LL V+G A ++ P D +++E + FE+W ++ + Y S+ E RR I+S
Sbjct: 5 TLILLVVVG--ATPFAIARPAALEDGRALEIKNMFEDWAAKHGKSYSSDWEKARRLMIFS 62
Query: 88 SNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLG-YNKPYNEPRWPS----VQYLG 141
+ YI+ N+Q N +F L NKF+DL+N EF + ++G + +P + R P+ V
Sbjct: 63 DTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRPRYQDRLPAEDEDVDVSS 122
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP S+DWR++GAVTP+KDQG CGSCWAFSA+A++E + L T +LVSLSEQ+L+DCD +
Sbjct: 123 LPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDCD--T 180
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY 261
+ GC+GG ME AF+F+ K GGVTTE YPY G C +K K+ ITG++ +
Sbjct: 181 VDAGCDGGLMETAFKFVVKNGGVTTEAAYPYTGSVGSCNANKAKNKVAEITGFKVVTEDS 240
Query: 262 A----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
A FQ Y G+ C L+HGV ++GYG + G YW++
Sbjct: 241 ADALMKAVSKTPVTVSICGSDENFQNYKSGILSGKCDDSLDHGVLLIGYGTEGGMPYWII 300
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWGTSWGE G++++ R G+CG+ +SYP
Sbjct: 301 KNSWGTSWGEDGFMKIERKDGD---GMCGMNGDSSYPT 335
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 134/312 (42%), Positives = 185/312 (59%), Gaps = 36/312 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFADLSNE 116
M ER E W+ +Y R Y E RRF ++ N +++ N+ + F L N+FADL+ E
Sbjct: 1 MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60
Query: 117 EFISTYLGYNKPYNEPRWPSVQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
EF + G+ KP + P+ + LP +VDWR +GAVTP+K+QGQCG CWAF
Sbjct: 61 EFKANK-GF-KPISAEEVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAF 118
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
SA+AA+EGI KL TG LVSLSEQE VDCD ++ ++GC GG+M+ AFEF+ K GG+ TE
Sbjct: 119 SAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLATESS 178
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYS 267
YPY+ + +C+ A TI G+E +P + F LYS
Sbjct: 179 YPYKVVDGKCKGG--SKSAATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFMLYS 236
Query: 268 HGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
GV CG QL+HG+ +GYG E KYW++KNSWGT+WGE G++RM ++ S G+
Sbjct: 237 GGVMTGSCGTQLDHGIAAIGYGVESDDTKYWILKNSWGTTWGEKGFLRMEKDI-SDKRGM 295
Query: 327 CGILMQASYPVK 338
C + M+ SYP +
Sbjct: 296 CDLAMKPSYPTE 307
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 134/311 (43%), Positives = 179/311 (57%), Gaps = 35/311 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNE 116
M +RF +W Y+R Y + +E QRRF +Y N+++I+ N + NL++ L +N+FADL+ E
Sbjct: 45 MMDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEE 104
Query: 117 EFISTYLGYNKPYNEPRW-------PSVQYLGLPASVDWRKEGAVTPVKDQG-QCGSCWA 168
EF+ Y P S + P SVDWR +GAVTP+K+QG C SCWA
Sbjct: 105 EFLDLYTMKGMPVRRDAGKKRANVSSSAAAVDAPTSVDWRSKGAVTPIKNQGPSCSSCWA 164
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
F A +E I K+ TGKLVSLSEQEL+DCD + GCN GY + ++ + GG+TTE
Sbjct: 165 FVTAATIESITKITTGKLVSLSEQELIDCD--PYDGGCNLGYFVNGYRWVIQNGGLTTEA 222
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY--------------------AFQLYSH 268
+YPY+ + C + HA TI+ Y +PA + Q YS
Sbjct: 223 NYPYQARRYACSRSRAAQHAATISDYVQLPAGEGQLQQAVAQQPVAAAIEMGGSLQFYSG 282
Query: 269 GVFDEYCGHQLNHGVTVVGYGED--HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
GVF CG ++NH +TVVGYG D G KYWLVKNSWG SWGE GY+RM R+ G+
Sbjct: 283 GVFSGQCGTRMNHAITVVGYGADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRG--GL 340
Query: 327 CGILMQASYPV 337
CGI + +YPV
Sbjct: 341 CGIALDLAYPV 351
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 184/320 (57%), Gaps = 44/320 (13%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFAD 112
D M R++ W+ QY R+Y + E RF ++ +N ++ID N+ + L N+FAD
Sbjct: 51 DEAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFAD 110
Query: 113 LSNEEFISTYLGYNKPYNEP-----------RWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
L+++EF + Y G KP P ++ + L VDWR++GAVTPVK+QG
Sbjct: 111 LTSKEFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKNQG 170
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
QCG CWAFSAV A+EG+ + TG LVSLSEQ+++DCD + NQGCNGGYM+ AF+++
Sbjct: 171 QCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVINN 230
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA---------------------- 259
GGVTTED YPY CQ A TI+G++ +P+
Sbjct: 231 GGVTTEDAYPYSAVQGTCQ---NVQPAATISGFQDLPSGDENALANAVANQPVSVGVDGG 287
Query: 260 RYAFQLYSHGVFD-EYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
FQ Y G++D + CG +NH VT +GYG +D G +YW++KNSWGT WGE G++++
Sbjct: 288 SSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGADDQGTQYWILKNSWGTGWGENGFMQL-- 345
Query: 318 NSPSSNIGICGILMQASYPV 337
+G CGI ASYP
Sbjct: 346 ---QMGVGACGISTMASYPT 362
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 211/349 (60%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L + +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMSILITLFFVISM------FNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPR-WP 135
RF I+ N+++I+ +N + NLS+KL N+FAD+++EEF++ + G N P Y P P
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMP 116
Query: 136 SVQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
S ++ +P+++DWR+ GAVT VK+QGQCG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRS-QGKTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A + Q Y+ G +D C +++NH VT +GY
Sbjct: 234 VQISNYQVVPEGETSLLQAVTKQPVSIGIAASHDLQFYAGGTYDGSCANRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 136/323 (42%), Positives = 185/323 (57%), Gaps = 43/323 (13%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ---------NLSFK 104
DP++ E F+ W ++ + Y + +E R +++ N ++ N++ S+
Sbjct: 33 DPRAYEALFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYT 92
Query: 105 LTDNKFADLSNEEFISTYLGY----NKPYNEPRWPSVQYL-----GLPASVDWRKEGAVT 155
L N FADL++EEF + LG P P + L +P ++DWR+ GAVT
Sbjct: 93 LALNAFADLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVT 152
Query: 156 PVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAF 215
VKDQG CG+CW+FSA A+EGINK+KTG LVSLSEQEL+DCD S N GC GG M+ A+
Sbjct: 153 KVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCD-RSYNSGCGGGLMDYAY 211
Query: 216 EFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR--------------- 260
+F+ K GG+ TE+DYPYR + C +K K VTI GY +P+
Sbjct: 212 KFVVKNGGIDTEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVS 271
Query: 261 -------YAFQLYS-HGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGY 312
AFQLYS G+FD C L+H V +VGYG + G+ YW+VKNSWG SWG GY
Sbjct: 272 VGICGSARAFQLYSQQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGESWGMKGY 331
Query: 313 IRMARNSPSSNIGICGILMQASY 335
+ M RN+ S G+CGI M AS+
Sbjct: 332 MHMHRNTGDSK-GVCGINMMASF 353
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 147/325 (45%), Positives = 179/325 (55%), Gaps = 47/325 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
M +RFE W+ ++ R Y E QRRF +Y NV+ ++ NS + +KL DNKFADL+NEE
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTNEE 86
Query: 118 FISTYLGYNKPYNEPRWPS-----VQYLG------LPASVDWRKEGAVTPV-KDQGQCGS 165
F + LG+ P+ + + G LP SVDWR +GAV K GS
Sbjct: 87 FRAKMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAVINRWKICVDAGS 146
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFSAVAA+EGIN++K G+LVSLSEQELVDCD E GC GGYM AFEF+ G+T
Sbjct: 147 CWAFSAVAAIEGINQIKNGELVSLSEQELVDCD--DEAVGCGGGYMSWAFEFVVGNHGLT 204
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-------ARYA---------------F 263
TE YPY N CQ K AV I GY + AR A F
Sbjct: 205 TEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMF 264
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK-----------YWLVKNSWGTSWGEAGY 312
QLY GV+ C +NHGVTVVGYGE + YW+VKNSWG WG+AGY
Sbjct: 265 QLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGY 324
Query: 313 IRMARNSPSSNIGICGILMQASYPV 337
I M R+ G+CGI + SYPV
Sbjct: 325 ILMQRDVAGLASGLCGIALLPSYPV 349
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 133/293 (45%), Positives = 189/293 (64%), Gaps = 20/293 (6%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEF 118
E+ E W+ +++R Y + E RF I+ N+++++ N + N ++KL NKF+DL++EEF
Sbjct: 16 EKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDLTDEEF 75
Query: 119 ISTYLGY--------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+ Y+G ++ R+ +V G S+DWR EGAVTPVKDQGQCG CWAF+
Sbjct: 76 QARYMGLVPEGMTGDSQKTVSFRYENVSETG--ESMDWRLEGAVTPVKDQGQCGCCWAFA 133
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
AVAAVEG+ K+ G+LVSLSEQ+LVDC + N GC+GG A+++I + G+T+E++Y
Sbjct: 134 AVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGITSEENY 193
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYS----HGVF-DEYCGHQLNHGVTV 285
PY+ C++ T A TI+GYEA+P L HG+F DEYCG +H VT+
Sbjct: 194 PYQAVQQTCKS--TDPAAATISGYEAVPKDDEEALLKAVSQHGIFEDEYCGTDSHHAVTI 251
Query: 286 VGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
VGYG + G KYWL+KNSWG SWGE GY+R+ R+ G+CG+ +A YPV
Sbjct: 252 VGYGTSEEGIKYWLLKNSWGESWGENGYMRIKRDVDEPQ-GMCGLAHRAYYPV 303
>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/314 (45%), Positives = 196/314 (62%), Gaps = 35/314 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+S+ + +E W K ++ +++ +RF ++ NV ++ +N + +KL NKFAD+SN
Sbjct: 35 ESLWQLYERWGKHHTISRNLKEK-HKRFSVFKENVNHVFTVNQMDKPYKLKLNKFADMSN 93
Query: 116 EEFISTYLGYN----KPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSC 166
EF++ Y N + +E R + ++ LP+SVDWR+ GAV VK+QG+CGSC
Sbjct: 94 YEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDWRERGAVNAVKEQGRCGSC 153
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS+VAAVEGINK+KT +L+SLSEQEL+DC N N+GCNGG+ME AF+FI + GG+ T
Sbjct: 154 WAFSSVAAVEGINKIKTNQLLSLSEQELLDC--NYRNKGCNGGFMEIAFDFIKRNGGIAT 211
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------ARYAFQL 265
E+ YPY G C++ + V I GYE++P A FQ
Sbjct: 212 ENSYPYHGSRGLCRSSRISSPIVKIDGYESVPENEDALMQAVANQPVSVAIDAAGRDFQF 271
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GVFD YCG +LNHGV +GYG + G YWLV+NSWG WGE GY+RM R +
Sbjct: 272 YSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKRGVEQAE- 330
Query: 325 GICGILMQASYPVK 338
G+CGI M+ASYP+K
Sbjct: 331 GLCGIAMEASYPIK 344
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 252 bits (644), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 136/323 (42%), Positives = 189/323 (58%), Gaps = 39/323 (12%)
Query: 50 PQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
P + + +E W ++ +GS+D R ++ N++YID N++ +F+L
Sbjct: 40 PVERADDEVRRMYEAWKSEHGHGHGSDDRL--RLEVFRDNLRYIDAHNAEADAGLHTFRL 97
Query: 106 TDNKFADLSNEEFISTYLGYNKPYNEPRW--------PSVQYLGLPASVDWRKEGAVTPV 157
FADL+ EE+ LG+ P + LP ++DWR+ GAVT V
Sbjct: 98 GLTPFADLTLEEYRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGV 157
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
K+Q QCG CWAFSAVAA+EGIN++ TG LVSLSEQE++DCD +++ GCNGG M+ AF+F
Sbjct: 158 KNQEQCGGCWAFSAVAAIEGINEIVTGNLVSLSEQEIIDCD--TQDGGCNGGEMQNAFQF 215
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI---------------PARYA 262
+ GG+ TE DYPY G + C ++ VTI G+ ++ P A
Sbjct: 216 VINNGGIDTEADYPYLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQPVSVA 275
Query: 263 -------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
FQ Y+ G+F+ CG QL+HGVT VGYG ++G+ YW+VKNSW +SWGEAGYIR+
Sbjct: 276 IDASGRKFQHYTSGIFNGPCGTQLDHGVTAVGYGSENGKDYWIVKNSWSSSWGEAGYIRI 335
Query: 316 ARNSPSSNIGICGILMQASYPVK 338
RN ++ G CGI M ASYPVK
Sbjct: 336 RRNVAAAT-GKCGIAMDASYPVK 357
>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
Length = 289
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 125/220 (56%), Positives = 150/220 (68%), Gaps = 24/220 (10%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
+P SVDWRKEGAV VKDQG CGSCWAFS + AVEGINK+ TG L+SLSEQELVDCD S
Sbjct: 3 IPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDT-S 61
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
NQGCNGG M+ AFEFI K GG+ TE+DYPY+ + RC ++ VTI YE +P
Sbjct: 62 YNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQNRKNAKVVTIDAYEDVPENN 121
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
AFQLYS GVFD CG +L+HGV VGYG ++G+ YW+V
Sbjct: 122 EAALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGTENGKDYWIV 181
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
+NSWG SWGE+GYI+MARN + G CGI M+ASYP+K+
Sbjct: 182 RNSWGGSWGESGYIKMARNIAEAT-GKCGIAMEASYPIKK 220
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 136/307 (44%), Positives = 185/307 (60%), Gaps = 40/307 (13%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEE 117
F+ + ++++ Y S +E RRF ++S N+ +I+ N++ + + N+FADL+NEE
Sbjct: 30 FDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTHTVDVNQFADLTNEE 89
Query: 118 FISTYLGYNKPYNEP---RWPSVQYLGLP--ASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
+ YL +PY R +L P SVDWR++GAVTP+K+QGQCGSCW+FS
Sbjct: 90 YRQLYL---RPYPTELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSCWSFSTT 146
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
+VEG + + TG LVSLSEQ+LVDC + NQGCNGG M+ AF++I GG+ TE DYPY
Sbjct: 147 GSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPY 206
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPA----------------------RYAFQLYSHGV 270
++ C K HAV+I+GY+ +P + +FQ+YS GV
Sbjct: 207 TARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGV 266
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
F CG L+HGV VVGY D YW+VKNSWG SWG+ GYI M R S+ GICGI
Sbjct: 267 FSGPCGTNLDHGVLVVGYTSD----YWIVKNSWGASWGDQGYIMMKRGVSSA--GICGIA 320
Query: 331 MQASYPV 337
MQ SYP+
Sbjct: 321 MQPSYPI 327
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 138/296 (46%), Positives = 179/296 (60%), Gaps = 44/296 (14%)
Query: 81 RRFGIYSSNVQYIDYINSQNLS----FKLTDNKFADLSNEEFISTYLGYNKPYN------ 130
RR ++ N++YID N++ + F+L +FADL+ EE+ + L ++ N
Sbjct: 91 RRLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGV 150
Query: 131 --EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
R+ + LP +VDWR+ GAV VKDQGQCG CWAFSAVAAVEGINK+ TG L+S
Sbjct: 151 VGRRRYLPLAGEQLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTGSLIS 210
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
LSEQEL+DCD ++QGC+GG M+ AF F+ K GG+ TE DYP+ G + C
Sbjct: 211 LSEQELIDCD-KFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKNTRV 269
Query: 249 VTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVV 286
V+I +E +P +R AFQLYS G+FD CG L+HGVTVV
Sbjct: 270 VSIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVV 329
Query: 287 GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN----SPSSNIGICGILMQASYPVK 338
GYG + G+ YW+VKNSWGT WGEAGY+RMARN PS+ GI M+ YPVK
Sbjct: 330 GYGSEGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSA-----GIAMEPLYPVK 380
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 185/314 (58%), Gaps = 32/314 (10%)
Query: 55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLS 114
P + F +W ++S+ Y S E +R+ I+ N+++I N +N S+ L N FAD++
Sbjct: 39 PNKLVGLFTSWSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIA 98
Query: 115 NEEFISTYLGYN--------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
+EEF ++YLG +P+ + + LP +VDWRK+GAVTPVK+QG+CGSC
Sbjct: 99 HEEFKASYLGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSC 158
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS VAAVEGIN++ TGKLVSLSEQEL+DCD N+ N GC GG M+ AF +I G+ T
Sbjct: 159 WAFSTVAAVEGINQIVTGKLVSLSEQELMDCD-NTFNHGCRGGLMDFAFAYIMGNQGIYT 217
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQ 264
E+DYPY + C+ + +TITGYE +P FQ
Sbjct: 218 EEDYPYLMEEGYCREKQPHSKVITITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQ 277
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y G+FD CG Q +H +T VGYG +G+ Y ++KNSWG +WGE GY R+ R +
Sbjct: 278 FYKGGIFDGECGIQPDHALTAVGYGSYYGQDYIIMKNSWGKNWGEQGYFRIRRGTGKPE- 336
Query: 325 GICGILMQASYPVK 338
G+C I ASYP K
Sbjct: 337 GVCDIYKIASYPTK 350
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 136/325 (41%), Positives = 183/325 (56%), Gaps = 39/325 (12%)
Query: 48 GYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI----DYINSQNL-- 101
G + E +FE W ++ + Y + E R ++ N ++ D + S
Sbjct: 25 GRDESVSASDYEAQFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGG 84
Query: 102 -SFKLTDNKFADLSNEEFISTYLGY----NKPYNEPRWPSVQYLG----LPASVDWRKEG 152
S+ L N FADL+++EF + LG P P + G +P ++DWR+ G
Sbjct: 85 PSYTLALNAFADLTHDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVPDALDWRQSG 144
Query: 153 AVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYME 212
AVT VKDQG CG+CW+FSA A+EGINK+ TG L+SLSEQEL+DCD S N GC GG M
Sbjct: 145 AVTKVKDQGSCGACWSFSATGAMEGINKITTGSLLSLSEQELIDCD-RSYNTGCGGGLMT 203
Query: 213 KAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------ 260
A++F+ K GG+ TEDDYP+R + C +K K H VTI GY+ +P+
Sbjct: 204 YAYKFVIKNGGIDTEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQ 263
Query: 261 ----------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEA 310
AFQLYS G+FD C L+H V +VGYG + G+ YW+VKNSWG WG
Sbjct: 264 PISVGICGSARAFQLYSQGIFDGPCPTSLDHAVLIVGYGSEGGKDYWIVKNSWGERWGMK 323
Query: 311 GYIRMARNSPSSNIGICGILMQASY 335
GY+ M RN+ SS+ GICGI M AS+
Sbjct: 324 GYMHMHRNTGSSS-GICGINMMASF 347
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 140/349 (40%), Positives = 207/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKKNMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QGQCG CWAFSAV ++EG K+ TGKL+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAEGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 135/320 (42%), Positives = 184/320 (57%), Gaps = 40/320 (12%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ--NLSFKLTDNKFA 111
D +M ER+E W + R Y E RRF ++ +N +ID N+ S +LT NKFA
Sbjct: 41 DDSAMRERYEKWAADHGRTYKDSLEKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFA 100
Query: 112 DLSNEEFISTYLGYNKPYNEP-------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
DL+NEEF Y +P++ P + +V+ +PA+++WR GAVT VK+Q C
Sbjct: 101 DLTNEEFAEYY---GRPFSTPVIGGSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCA 157
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFSAVAAVEGI+++++ LV+LS Q+L+DC N GCN G M++AF +IT GG+
Sbjct: 158 SCWAFSAVAAVEGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGI 217
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
E DYPY + K A +I G++ +P
Sbjct: 218 AAESDYPYEDRALGTCRASGKPVAASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKV 277
Query: 263 FQLYSHGVF----DEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMAR 317
Q +S GVF +E C LNH +T VGYG D HG KYWL+KNSWGT WGE GY+++AR
Sbjct: 278 SQFFSSGVFGAMQNETCTTDLNHAMTAVGYGTDEHGTKYWLMKNSWGTDWGEGGYMKIAR 337
Query: 318 NSPSSNIGICGILMQASYPV 337
+ +SN G+CG+ MQ SYPV
Sbjct: 338 DV-ASNTGLCGLAMQPSYPV 356
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 137/334 (41%), Positives = 187/334 (55%), Gaps = 29/334 (8%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+ L + ++ + A A S Y D + FE W+ ++ + Y E + RFGI+ N
Sbjct: 4 IVLLVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFRDN 63
Query: 90 VQYI-DYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVDW 148
V +I Y + N+FADL+N+EF++TY G P+ + V + P +DW
Sbjct: 64 VHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAPRPVDPIWTPCCIDW 123
Query: 149 RKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNG 208
R GAVT VKDQG CGSCWAF+AVAA+EG+ K++TG+L LSEQELVDCD NS GC G
Sbjct: 124 RFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTNS--NGCGG 181
Query: 209 GYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK-HHAVTITGYEAIP--------- 258
G+ ++AFE + GG+T E DY Y G +C+ D +HA +I GY A+P
Sbjct: 182 GHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLAT 241
Query: 259 --ARY-----------AFQLYSHGVFDEYCGHQLNHGVTVVGYGED--HGEKYWLVKNSW 303
AR AFQ Y GVF CG NH VT+VGY +D G+KYWL KNSW
Sbjct: 242 AVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWLAKNSW 301
Query: 304 GTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G +WG+ GYI + ++ + G CG+ + YP
Sbjct: 302 GKTWGQQGYILLEKDIVQPH-GTCGLAVSPFYPT 334
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 181/315 (57%), Gaps = 46/315 (14%)
Query: 62 FENWLKQYS--REYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+E W +Q++ R+ G E RRF ++ NV+ I N + +KL N+F D++ +EF
Sbjct: 47 YERWREQHTVARDLG---EKARRFNVFRENVRLIHEFNRGDAPYKLRLNRFGDMTADEFR 103
Query: 120 STYLGYNKPYNEPRWPSVQYLG-------------LPASVDWRKEGAVTPVKDQGQCGSC 166
Y ++ R S++ G +P SVDWR++GAVT VKDQGQCGSC
Sbjct: 104 RAYASSRVSHH--RMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQGQCGSC 161
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS +AAVEGIN +++ L SLSEQ+LVDCD S N GCNGG M+ AF++I K GGV
Sbjct: 162 WAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKS-NAGCNGGLMDYAFQYIAKHGGVAA 220
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
ED YPY+ + +K VTI GYE +PA FQ
Sbjct: 221 EDAYPYKARQ-ASSCNKKPSAVVTIDGYEDVPANDETALKKAVAAQPVAVAIEASGSHFQ 279
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GVF CG +L+HGV VGYG G KYW+VKNSWG WGE GYIRM R+
Sbjct: 280 FYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWIVKNSWGPEWGEKGYIRMKRDVKDKE 339
Query: 324 IGICGILMQASYPVK 338
G+CGI M+ASYPVK
Sbjct: 340 -GLCGIAMEASYPVK 353
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 138/350 (39%), Positives = 208/350 (59%), Gaps = 44/350 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYN--------KPYN 130
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 131 EPRWPSVQYLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
+ + L +P+++DWR+ GAVT VK QGQCG CWAFSAV ++EG K+ TGKL+
Sbjct: 117 STEFKKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLM 176
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K
Sbjct: 177 EFSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQE-KTA 233
Query: 248 AVTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVG 287
AV I+ Y+ +P A Q Y+ G +D C ++NH VT +G
Sbjct: 234 AVQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIG 293
Query: 288 YGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
YG D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 YGTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 342
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 140/345 (40%), Positives = 198/345 (57%), Gaps = 38/345 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M ++ L LFL + P+ A + + DP M ++FE W+ +Y R Y DE
Sbjct: 1 MTSKVQLVFLFLFLCVMWASPSAASCD---EPSDP--MMKQFEEWMAEYGRVYKDNDEKM 55
Query: 81 RRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY 139
RF I+ +NV +I+ N++N S+ L N+F D++N EF++ Y G + P N R P V +
Sbjct: 56 LRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPLNIKREPVVSF 115
Query: 140 -----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+P S+DWR GAVT VK+QG+CGSCWAF+++A VE I K+K G LVSLSEQ++
Sbjct: 116 DDVDISSVPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQV 175
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
+DC V + GC GG++ KA+ FI GV + YPY+ C+T+ + A IT Y
Sbjct: 176 LDCAV---SYGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTCKTNGVPNSAY-ITRY 231
Query: 255 ---------------------EAIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-H 292
A+ A FQ Y GVF CG +LNH + ++GYG+D
Sbjct: 232 TYVQRNNERNMMYAVSNQPIAAALDASGNFQHYKRGVFTGPCGTRLNHAIVIIGYGQDSS 291
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G+K+W+V+NSWG WGE GYIR+AR+ SS+ G+CGI M YP
Sbjct: 292 GKKFWIVRNSWGAGWGEGGYIRLARDV-SSSFGLCGIAMDPLYPT 335
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 184/317 (58%), Gaps = 45/317 (14%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ--NLSFKLTDNKFADLSNEEFI 119
+E W + + R + E RRFG + NV++I N + S++L N+F D+ EEF
Sbjct: 46 YERW-QTHHRVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGDMGPEEFR 104
Query: 120 STYLGYN----KPYNEPRWPSVQYLG--------LPASVDWRKEGAVTPVKDQGQCGSCW 167
ST+ + Y E + G +P SVDWR+ GAVT VK+QG+CGSCW
Sbjct: 105 STFADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHGAVTAVKNQGRCGSCW 164
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS V AVEGIN ++TG LVSLSEQELVDCD +EN GC GG ME AF+FI GG+TTE
Sbjct: 165 AFSTVVAVEGINAIRTGSLVSLSEQELVDCDT-AEN-GCQGGLMENAFDFIKSYGGITTE 222
Query: 228 DDYPYRGKNDRCQTDKTKHHA--VTITGYEAIP-----------AR-----------YAF 263
YPYR N C + + V+I G++ +P AR AF
Sbjct: 223 SAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVSVAIDAGGQAF 282
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYG--EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
Q YS GVF CG L+HGV VVGYG + G YW+VKNSWG SWGE GYIRM R +
Sbjct: 283 QFYSEGVFTGDCGTDLDHGVAVVGYGVSDVDGTPYWIVKNSWGPSWGEGGYIRMQRG--A 340
Query: 322 SNIGICGILMQASYPVK 338
N G+CGI M+AS+P+K
Sbjct: 341 GNGGLCGIAMEASFPIK 357
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 187/340 (55%), Gaps = 47/340 (13%)
Query: 41 PAGAWSEGYPQKYDPQSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYIN- 97
PA A G +S+ +E W ++ SR+ E RRF ++ N + + N
Sbjct: 28 PASAMDFGESDLASEESLWALYERWRARHTVSRDLA---EKSRRFNVFRENARLVHEFNL 84
Query: 98 SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN---EPRWPSVQYL------------GL 142
++ +KL N+FADL+++EF +Y ++ +PR + L
Sbjct: 85 RRDAPYKLRLNRFADLTSDEFRRSYASSRVSHHRMFKPRAANNNDDDDDKGSSFTHGGAL 144
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P SVDWR++GAVT VKDQGQCGSCWAFS +AAVEGIN ++T L SLSEQ+LVDCD +
Sbjct: 145 PTSVDWREKGAVTGVKDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKT- 203
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK-NDRCQTDKTKHHAVTITGYEAIPAR- 260
N GC+GG M+ AF +I K GGV E YPYR + + C + K V+I GYE +P
Sbjct: 204 NAGCDGGLMDDAFSYIAKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRND 263
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWL 298
FQ YS GVF CG +L+HGV VGYG G KYW+
Sbjct: 264 ETALKKAVAAQPVAVAIEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDGTKYWI 323
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSWG WGE GYIRM R+ G+CGI M+ASYPVK
Sbjct: 324 VKNSWGEEWGEKGYIRMKRDVADKE-GLCGIAMEASYPVK 362
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 136/311 (43%), Positives = 184/311 (59%), Gaps = 45/311 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNE 116
M R E W+ QYSR Y E +RF ++ SNV++I+ N+ N F L N+FADL+N+
Sbjct: 1 MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTND 60
Query: 117 EFISTYL--GYN-KPYNEP---RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
EF +T G+ P P R+ ++ LPA++DWR +GAVTP+KDQGQC
Sbjct: 61 EFRATKTNKGFKPSPVKVPTGFRYENISVDALPATIDWRTKGAVTPIKDQGQC------- 113
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
EGI K+ TGKL+SLSEQELVDCDV+ E+QGC GG M+ AF+FI K GG+TTE Y
Sbjct: 114 -----EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTTESSY 168
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSH 268
PY + +C++ + T+ G+E +PA FQ YS
Sbjct: 169 PYTAADGKCKSG--SNSVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTFQFYSG 226
Query: 269 GVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
GV CG L+HG+ +GYG+ G KYWL+KNSWGT+WGE GY+RM ++ S G+C
Sbjct: 227 GVMTGSCGTDLDHGIAAIGYGQTSDGTKYWLLKNSWGTTWGENGYLRMEKDI-SDKRGMC 285
Query: 328 GILMQASYPVK 338
G+ M+ SYP +
Sbjct: 286 GLAMEPSYPTE 296
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 137/336 (40%), Positives = 196/336 (58%), Gaps = 44/336 (13%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
FLL +LG + S ++ +M ER ENW+ +Y R Y E RRF ++ NV +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKDNVAF 66
Query: 93 IDYINS-QNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP------RWPSVQYLGLPAS 145
++ N+ +N F L N+FADL+ EEF + G+ KP E ++ ++ LP +
Sbjct: 67 VESFNTNKNNKFWLGVNQFADLTTEEFKANK-GF-KPTAEKVPTTGFKYENLSVSALPTA 124
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
VDWR +GAVTP+K+QGQC AA+EGI KL TG L+SLSEQELVDCD +S ++G
Sbjct: 125 VDWRTKGAVTPIKNQGQC---------AAMEGIVKLSTGNLISLSEQELVDCDTHSMDEG 175
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------- 258
C GG+M+ AFEF+ K GG+ TE +YPY+ + +C+ A TI G+E +P
Sbjct: 176 CEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGG--SKSAATIKGHEDVPVNNEAAL 233
Query: 259 ---------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNS 302
+ F LYS GV CG +L+HG+ +GYG E G KYW++KNS
Sbjct: 234 MKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGMESDGTKYWILKNS 293
Query: 303 WGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
WGT+WGE G++RM ++ + G+CG+ M+ SYP +
Sbjct: 294 WGTTWGEKGFLRMEKD-ITDKRGMCGLAMKPSYPTE 328
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 142/337 (42%), Positives = 198/337 (58%), Gaps = 40/337 (11%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
L LFL + P+ A + + DP M +RFE W+ +Y R Y DE RRF I+ +N
Sbjct: 10 LFLFLCVMWASPSAASRD---EPSDP--MMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNN 64
Query: 90 VQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY-----LGLP 143
V +I+ NS+N S+ L N+F D++N EF++ Y G + P N R P V + +P
Sbjct: 65 VNHIETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGVSLPLNIEREPVVSFDDVDISAVP 124
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
S+DWR GAVT VK+ CGSCWAF+A+A VE I K+K G L+SLSEQ+++DC V +
Sbjct: 125 QSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDCAV---S 181
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR--CQTDKTKHHAVTITGY------- 254
GC+GG++ KA++FI GV + YPY+ + C+ + + A ITGY
Sbjct: 182 YGCDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTCRINGVPNSAY-ITGYTRVQSNN 240
Query: 255 --------------EAIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLV 299
+I A FQ Y GVF CG LNH +T++GYG+D G+K+W+V
Sbjct: 241 ERSMMYAVSNQPIAASIEASGDFQHYKRGVFSGPCGTSLNHAITIIGYGQDSSGKKFWIV 300
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
+NSWG SWGE GYIRMAR+ SS+ G+CGI ++ YP
Sbjct: 301 RNSWGASWGERGYIRMARDVSSSS-GLCGIAIRPLYP 336
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 133/258 (51%), Positives = 162/258 (62%), Gaps = 34/258 (13%)
Query: 113 LSNEEFISTYLG----YNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQC 163
++N EF STY G +++ + + + ++ +P SVDWRK+GAVTP+KDQGQC
Sbjct: 1 MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 60
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFS V AVEGIN +KT KLVSLSEQELVDCD SENQGCNGG M AFEFI + GG
Sbjct: 61 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDT-SENQGCNGGLMGYAFEFIKEKGG 119
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------Y 261
+TTE YPY ++ C K V+I G+E +P
Sbjct: 120 ITTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGS 179
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AFQ YS GVF CG L+HGV +VGYG G KYW+VKNSWGT WGE GYIRM R
Sbjct: 180 AFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDGTKYWIVKNSWGTDWGENGYIRMKRGI- 238
Query: 321 SSNIGICGILMQASYPVK 338
S+ G+CGI ++ASYP+K
Sbjct: 239 SAKEGLCGIAVEASYPIK 256
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 250 bits (639), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 139/300 (46%), Positives = 176/300 (58%), Gaps = 29/300 (9%)
Query: 66 LKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGY 125
+ ++ + Y S +E RF ++ N+++ID N + S+ L N+FADLS+EEF YLG
Sbjct: 1 MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGL 60
Query: 126 N----KPYNEPRWPSVQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINK 180
K + P S + + LP SVDWRK+GAV VK+QG CGSCWAFS VAAVEGIN+
Sbjct: 61 KIELPKRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVEGINQ 120
Query: 181 LKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
+ TG L +LSEQEL+DCD N GCNGG M+ AF FI GG+ E+DYPY + C
Sbjct: 121 IVTGNLTALSEQELIDCD-KPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEGTCG 179
Query: 241 TDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQ 278
K + VTI+GY +P + FQ YS G+F+ +CG +
Sbjct: 180 EKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHCGTE 239
Query: 279 LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
L+HGV VGYG G Y VKNSWG+ WGE GYIRM RN GICGI ASYP K
Sbjct: 240 LDHGVAAVGYGTSKGVDYITVKNSWGSKWGEKGYIRMKRNVGKPE-GICGIYKMASYPTK 298
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 204/351 (58%), Gaps = 54/351 (15%)
Query: 29 VLSLFLLW--VLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIY 86
+L++FL + L G E P E+ E W+ +++R Y E E + RF I+
Sbjct: 8 ILTIFLSYRTSLATSRGGLFEASPI--------EKHEQWMARFNRVYSDESEKRNRFNIF 59
Query: 87 SSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP------------- 132
N++++ N ++N+++KL N+F+DL++EEF +T+ G P
Sbjct: 60 KKNLEFVQSFNMNKNITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSSDKTVPF 119
Query: 133 RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
R+ +V G S+DWR+EGAVTPVK QG+CG CWAFSAVAAVEGI K+ G+LVSLSEQ
Sbjct: 120 RYGNVSDTG--ESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQ 177
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND---RCQTDKTKHHAV 249
+L+DCD + NQGC+GG M KAFE+I K G+TTED+YPY+ T + A
Sbjct: 178 QLLDCDTDY-NQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAA 236
Query: 250 TITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVG 287
TI+GYE +P F+ YS G+F+ CG L+H VT+VG
Sbjct: 237 TISGYETVPMNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVTIVG 296
Query: 288 YG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YG + G KYW+VKNSWG +WGE G++R+ R+ + G+CG+ M A YP+
Sbjct: 297 YGMSEEGTKYWVVKNSWGETWGEDGFMRIKRDVDAPQ-GMCGLAMLAFYPL 346
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 132/304 (43%), Positives = 184/304 (60%), Gaps = 35/304 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFIS 120
FE+W ++ + Y S+ E RR I+S + YI+ N+Q N +F L NKF+DL+N EF +
Sbjct: 2 FEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
Query: 121 TYLG-YNKPYNEPRWPS----VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
Y+G + P + R P+ V LP S+DWR+EGAVTP+KDQGQCGSCWAFSA+A++
Sbjct: 62 NYVGKFKSPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASI 121
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
E + L T +LVSLSEQ+L+DCD + +QGC GG+ E AF+F+ + GGVTTE+ YPY G
Sbjct: 122 ESAHFLATKELVSLSEQQLIDCD--TVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179
Query: 236 NDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVFDE 273
C +K K V ITGY+ + A FQ Y G+
Sbjct: 180 AGSCNANKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 274 YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
C + +H V V+GYG + G YW++KNSWGTSWGE G++++ + G+CG+ Q+
Sbjct: 238 QCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGENGFMKIKKKDGE---GMCGMNGQS 294
Query: 334 SYPV 337
SYP
Sbjct: 295 SYPT 298
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 140/338 (41%), Positives = 189/338 (55%), Gaps = 32/338 (9%)
Query: 29 VLSLFLLWV---LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
+ S FLL V + + A A S Y D + FE W+ ++ + Y E + RFGI
Sbjct: 1 MASAFLLVVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGI 60
Query: 86 YSSNVQYI-DYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLPA 144
+ NV +I Y + N+FADL+N+EF++TY G P+ + V + P
Sbjct: 61 FRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAPRPVDPIWTPC 120
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
+DWR GAVT VKDQG CGSCWAF+AVAA+EG+ K++TG+L LSEQELVDCD NS
Sbjct: 121 CIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTNS--N 178
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK-HHAVTITGYEAIP----- 258
GC GG+ ++AFE + GG+T E DY Y G +C+ D +HA +I GY A+P
Sbjct: 179 GCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPPNDER 238
Query: 259 ------ARY-----------AFQLYSHGVFDEYCGHQLNHGVTVVGYGED--HGEKYWLV 299
AR AFQ Y GVF CG NH VT+VGY +D G+KYW+
Sbjct: 239 QLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWVA 298
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWG +WG+ GYI + ++ + G CG+ + YP
Sbjct: 299 KNSWGKTWGQQGYILLEKDVLQPH-GTCGLAVSPFYPT 335
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 190/314 (60%), Gaps = 37/314 (11%)
Query: 56 QSMEERFENWLKQYS--REYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+++ + +E W Y+ R +G E Q RF ++ NV+YI+ +N + +KL N+F DL
Sbjct: 38 ETLWDLYERWRSVYTSARSFG---EKQNRFHVFKENVKYINEVNKMDKPYKLRLNQFGDL 94
Query: 114 SNEEFISTYLG---YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+ EF TY NE + + +P S+DWR +GAVTPVK+QG+CG CWAFS
Sbjct: 95 TPSEFARTYANSKIIEGTRNESGGFMYENVEVPRSIDWRVKGAVTPVKNQGRCGGCWAFS 154
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
A AAVEGIN++ TG+L+SLSEQ+L+DCD ++N GC GG M +AFE+I + GG+T+E +Y
Sbjct: 155 AAAAVEGINQITTGQLISLSEQQLIDCD--TQNSGCRGGTMGRAFEYIKQRGGITSEANY 212
Query: 231 PYRGKNDRCQTDKTKHHAVTITGY-------EAIPARYAFQ-----------------LY 266
PY+ + C+ + + V+I GY +A+ A Q Y
Sbjct: 213 PYKAQAGMCKNNLIQRPTVSIDGYYNIRRSEDAVLKILAHQPVSVAVDATTWSSLDWMFY 272
Query: 267 SHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
GVF CG +LNHGVT VGYG + G YW++KNSWG +WGE GY+RM R S G
Sbjct: 273 FQGVFTGPCGTKLNHGVTAVGYGTTNDGYDYWIIKNSWGETWGERGYMRMLRG--VSPYG 330
Query: 326 ICGILMQASYPVKR 339
+CGI MQAS+P+KR
Sbjct: 331 LCGIAMQASFPIKR 344
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 183/312 (58%), Gaps = 37/312 (11%)
Query: 62 FENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS 120
F+ W + +SR Y ++ E++ RF ++ N++Y+ N++ S LT N ADLS E+ S
Sbjct: 13 FKEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNARTTSHWLTLNHLADLSTPEYKS 72
Query: 121 TYLGY--------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
LG+ NK R+ V LP ++DWRK+ AV VK+QGQCGSCWAF+
Sbjct: 73 KLLGFDNQARVARNKLKTGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFATT 132
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
+VEGIN + TG LVSLSEQELVDCD +++GC+GG M+ A+ +I K G+ TE+DYPY
Sbjct: 133 GSVEGINAIVTGSLVSLSEQELVDCDT-EQDKGCSGGLMDYAYAWIIKNKGINTEEDYPY 191
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
+ +C K K VTI YE +P +FQLY GV
Sbjct: 192 TAMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGGV 251
Query: 271 FDE-YCGHQLNHGVTVVGYGED---HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
+D+ CG LNHGV VVGYG+D G YW+VKNSWG WG+AGYIR+ S + G+
Sbjct: 252 YDDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAE-GL 310
Query: 327 CGILMQASYPVK 338
CGI M SYPVK
Sbjct: 311 CGIAMAPSYPVK 322
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 137/348 (39%), Positives = 207/348 (59%), Gaps = 42/348 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L + +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKIDLMSILITLFFVISM------FNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP-YNEPRWPSV 137
RF I+ N+++I+ +N + NLS+KL N+FAD+++EEF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGINIPSYLSPSPMSS 116
Query: 138 QYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+P+++DWR+ GAVT VK+QGQCG CWAFSAV ++EG K+ TG L+
Sbjct: 117 TEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEF 176
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQEL+DC N N GCNGG+M AF+FI + GG+++E DY Y+G+ C++ + K AV
Sbjct: 177 SEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCRSQE-KTAAV 233
Query: 250 TITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG 289
I+ Y+ +P A Q Y+ G +D C ++NH VT +GYG
Sbjct: 234 QISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYG 293
Query: 290 EDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
D G+KYWL+KNSWGTSWGE G++++ R+S + G C I +SYP
Sbjct: 294 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPG-GHCDIAKMSSYP 340
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 135/313 (43%), Positives = 173/313 (55%), Gaps = 39/313 (12%)
Query: 61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-----NLSFKLTDNKFADLSN 115
R E W+ ++ + Y E+E RR ++ +N + ID N+ +L N+FADL++
Sbjct: 41 RHEKWMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTD 100
Query: 116 EEFISTYLGYNKPYNEPRWPSVQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWA 168
+EF + GY +P +L P S+DWR GAVT VKDQG CG CWA
Sbjct: 101 DEFRAARTGYQRPPAAVAGAGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWA 160
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSAVAAVEG+ K++TG+LVSLSEQELVDCDV E+QGC GG M+ AF++I + GG+ E
Sbjct: 161 FSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAES 220
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLY 266
YPYRG D A +I G++ +P A Y F+ Y
Sbjct: 221 SYPYRGV-DGACRAAAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFY 279
Query: 267 SHGVFDEY-CGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
GV CG +LNH VT VGYG G YWL+KNSWG SWGE GY+R+ R
Sbjct: 280 DRGVLGGAGCGTELNHAVTAVGYGTASDGTGYWLMKNSWGASWGEGGYVRIRRGVGRE-- 337
Query: 325 GICGILMQASYPV 337
G CGI ASYPV
Sbjct: 338 GACGIAQMASYPV 350
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 144/353 (40%), Positives = 197/353 (55%), Gaps = 49/353 (13%)
Query: 29 VLSLFLLWVLG-------IPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQR 81
VL+L LL G +PA A + G M +RF W ++R Y S +E +
Sbjct: 12 VLTLALLASCGALLATSMLPARA-TAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQ 70
Query: 82 RFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGY---NKPYNEPRW--- 134
RF +Y N ++ID +N + +L+++L +N+FADL+ EEF++TY GY + P ++
Sbjct: 71 RFDVYRRNAEFIDAVNLRGDLTYRLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTG 130
Query: 135 -----PSVQY-LGLPASVDWRKEGAVTPVKDQ-GQCGSCWAFSAVAAVEGINKLKTGKLV 187
S Y + +PASVDWR +GAV P K Q C SCWAF A +E +N +KTGKLV
Sbjct: 131 AGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLV 190
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SLSEQ+LVDCD S + GCN G +A++++ + GG+TTE DYPY + C K+ HH
Sbjct: 191 SLSEQQLVDCD--SYDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHH 248
Query: 248 AVTITGYEAIPAR---------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
A ITG+ +P R Q Y GV+ CG +L H VTVV
Sbjct: 249 AAKITGFGKVPPRNEAALQAAVARQPVAVAIEVGSGMQFYKGGVYTGPCGTRLAHAVTVV 308
Query: 287 GYGED--HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GYG D G KYW +KNSWG SWGE GYIR+ R+ G+CG+ + +YP
Sbjct: 309 GYGTDASSGAKYWTIKNSWGQSWGERGYIRILRD--VGGPGLCGVTLDIAYPT 359
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 136/342 (39%), Positives = 207/342 (60%), Gaps = 36/342 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L + +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKIDLMSILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ 138
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P + +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPIN 116
Query: 139 YLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
L +P+++DWR+ GAVT VK+QGQCG CWAFSAV ++EG K+ TG L+ SEQEL+
Sbjct: 117 DLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 176
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K AV I+ Y+
Sbjct: 177 DCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQE-KTAAVQISSYQ 233
Query: 256 AIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GE 294
+P A Q Y+ G +D C +++NH VT +GYG D G+
Sbjct: 234 VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQ 293
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
KYWL+KNSWGTSWGE G++++ R+S + G+C I +SYP
Sbjct: 294 KYWLLKNSWGTSWGEDGFMKIIRDSGNP-AGLCDIAKVSSYP 334
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 144/351 (41%), Positives = 208/351 (59%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF +V+ I + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLF--FVISI-FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QGQCG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q YS G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYSGGTYDGSCADRINHAVTAIGY 293
Query: 289 GED-HGEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S PS G+C I +SYP
Sbjct: 294 GTDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPS---GLCDIAKMSSYP 341
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 136/342 (39%), Positives = 207/342 (60%), Gaps = 36/342 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L + +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMSILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ 138
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P + +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPIN 116
Query: 139 YLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
L +P+++DWR+ GAVT VK+QGQCG CWAFSAV ++EG K+ TG L+ SEQEL+
Sbjct: 117 DLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELL 176
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K AV I+ Y+
Sbjct: 177 DCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQE-KTAAVQISSYQ 233
Query: 256 AIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GE 294
+P A Q Y+ G +D C +++NH VT +GYG D G+
Sbjct: 234 VVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGYGTDEKGQ 293
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
KYWL+KNSWGTSWGE G++++ R+S + G+C I +SYP
Sbjct: 294 KYWLLKNSWGTSWGEDGFMKIIRDSGNP-AGLCDIAKVSSYP 334
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 186/314 (59%), Gaps = 32/314 (10%)
Query: 55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLS 114
P + F++W ++ + Y S E +R+GI+ N+ +I N +N S+ L N+FAD++
Sbjct: 38 PNRLVNLFKSWSVKHRKIYVSPKEKLKRYGIFKQNLMHIAETNRKNGSYWLGLNQFADIT 97
Query: 115 NEEFISTYLGYNKPYN----EPRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
+EEF + +LG + + + R P+ LP SVDWR +GAVTPVK+QG+CGSC
Sbjct: 98 HEEFKANHLGLKQGLSRMGAQTRTPTTFRYAAAANLPWSVDWRYKGAVTPVKNQGKCGSC 157
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS+VAAVEGIN++ TGKLVSLSEQEL+DCD ++ GC GG M+ AF +I G+
Sbjct: 158 WAFSSVAAVEGINQIVTGKLVSLSEQELMDCDTMLDH-GCEGGLMDFAFAYIMGSQGIHA 216
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQ 264
EDDYPY + C+ + + VTITGYE +P FQ
Sbjct: 217 EDDYPYLMEEGYCKEKQPYANVVTITGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQ 276
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GVFD C +L+H +T VGYG +G+ Y +KNSWG +WGE GY+R+ +
Sbjct: 277 FYKGGVFDGSCSDELDHALTAVGYGSSYGQNYITMKNSWGKNWGEQGYVRIKMGTGKPE- 335
Query: 325 GICGILMQASYPVK 338
G+CGI ASYPVK
Sbjct: 336 GVCGIYTMASYPVK 349
>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 359
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 149/361 (41%), Positives = 200/361 (55%), Gaps = 47/361 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M M +A L+L +L+ + + + ERF+ W +Y+R Y + +E+Q
Sbjct: 1 MTMATASASLALVMLFACSLLLAG--TAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQ 58
Query: 81 RRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYL---GYNKPYNEPRWP 135
+RF +YS N+++I +N S S++L +N+F DL+ EEF TYL P E P
Sbjct: 59 QRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMPP 118
Query: 136 SVQYLGL------------PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKT 183
V + P SVDWR +GAVTPVK+Q QCGSCWAF+ VA++EG++++KT
Sbjct: 119 IVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKT 178
Query: 184 GKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK 243
G+LVSLSEQE+VDCD + GC GGY A E++T+ GG+TTE DYPY G +C + K
Sbjct: 179 GRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYVGSQRQCMSGK 238
Query: 244 TKHHAVTITGYEA---------------------IPARYAFQLYSHGVFDEYCG-HQLNH 281
HHA I GY+A I A AFQ Y GVF C +NH
Sbjct: 239 LGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDASRAFQFYKRGVFSGPCNTTTVNH 298
Query: 282 GVTVV-----GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
VTVV G G KYW+VKNSWG WGE GY+RMAR + G+C I ++ YP
Sbjct: 299 AVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRARE-GMCAIAIEPYYP 357
Query: 337 V 337
V
Sbjct: 358 V 358
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 144/353 (40%), Positives = 197/353 (55%), Gaps = 49/353 (13%)
Query: 29 VLSLFLLWVLG-------IPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQR 81
VL+L LL G +PA A + G M +RF W ++R Y S +E +
Sbjct: 12 VLTLALLASCGALLATSMLPARA-TAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQ 70
Query: 82 RFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGY---NKPYNEPRW--- 134
RF +Y N ++ID +N + +L+++L +N+FADL+ EEF++TY GY + P ++
Sbjct: 71 RFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTG 130
Query: 135 -----PSVQY-LGLPASVDWRKEGAVTPVKDQ-GQCGSCWAFSAVAAVEGINKLKTGKLV 187
S Y + +PASVDWR +GAV P K Q C SCWAF A +E +N +KTGKLV
Sbjct: 131 AGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLV 190
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SLSEQ+LVDCD S + GCN G +A++++ + GG+TTE DYPY + C K+ HH
Sbjct: 191 SLSEQQLVDCD--SYDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHH 248
Query: 248 AVTITGYEAIPAR---------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
A ITG+ +P R Q Y GV+ CG +L H VTVV
Sbjct: 249 AAKITGFGKVPPRNEAALQAAVARQPVAVAIEVGSGMQFYKGGVYTGPCGTRLAHAVTVV 308
Query: 287 GYGED--HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GYG D G KYW +KNSWG SWGE GYIR+ R+ G+CG+ + +YP
Sbjct: 309 GYGTDASSGAKYWTIKNSWGQSWGERGYIRILRD--VGGPGLCGVTLDIAYPT 359
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 249 bits (635), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 132/313 (42%), Positives = 180/313 (57%), Gaps = 35/313 (11%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D +M R E W+ QY R Y + E RRF ++ +NV +I+ N+ N F L N+FADL
Sbjct: 29 DDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVNQFADL 88
Query: 114 SNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+N+EF ST + R P+ V LPA++DWR +G VTP+KDQGQCG CW
Sbjct: 89 TNDEFRSTKTNKGFIPSTTRVPTGFRNENVNIDALPATMDWRTKGVVTPIKDQGQCGCCW 148
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA+EGI KL TGKL+S S + + + + GC GG M+ AF+FI K GG+TTE
Sbjct: 149 AFSAVAAMEGIVKLSTGKLISHSLNKSL---LTVMSMGCEGGLMDDAFKFIIKNGGLTTE 205
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
+YPY +D+ ++ + +I GYE +PA FQ
Sbjct: 206 SNYPYAAVDDKFKS--VSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQF 263
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GV CG L+HG+ +GYG+ G KYWL+KNSWG +WGE G++RM ++ S
Sbjct: 264 YKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKD-ISDKR 322
Query: 325 GICGILMQASYPV 337
G+CG+ M+ SYP
Sbjct: 323 GMCGLAMEPSYPT 335
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 136/351 (38%), Positives = 210/351 (59%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L + +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMSILITLFFVISM------FNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN-------- 130
RF I+ N+++I+ +N + NLS+KL N+FAD+++EEF++ + G N P +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMS 116
Query: 131 --EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
E + + +P+++DWR+ GAVT VK+QGQCG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDISDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C +++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGY 293
Query: 289 GED-HGEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
G D +G+KYWL+KNSWGTSWGE G++++ R+ +PS G+C I +SYP
Sbjct: 294 GTDENGQKYWLLKNSWGTSWGEKGFMKIIRDYGNPS---GLCDIAKLSSYP 341
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 144/353 (40%), Positives = 197/353 (55%), Gaps = 49/353 (13%)
Query: 29 VLSLFLLWVLG-------IPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQR 81
VL+L LL G +PA A + G M +RF W ++R Y S +E +
Sbjct: 8 VLTLALLASCGALLATSMLPARA-TAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQ 66
Query: 82 RFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGY---NKPYNEPRW--- 134
RF +Y N ++ID +N + +L+++L +N+FADL+ EEF++TY GY + P ++
Sbjct: 67 RFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTG 126
Query: 135 -----PSVQY-LGLPASVDWRKEGAVTPVKDQ-GQCGSCWAFSAVAAVEGINKLKTGKLV 187
S Y + +PASVDWR +GAV P K Q C SCWAF A +E +N +KTGKLV
Sbjct: 127 AGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLV 186
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SLSEQ+LVDCD S + GCN G +A++++ + GG+TTE DYPY + C K+ HH
Sbjct: 187 SLSEQQLVDCD--SYDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHH 244
Query: 248 AVTITGYEAIPAR---------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
A ITG+ +P R Q Y GV+ CG +L H VTVV
Sbjct: 245 AAKITGFGKVPPRNEAALQAAVARQPVAVAIEVGSGMQFYKGGVYTGPCGTRLAHAVTVV 304
Query: 287 GYGED--HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GYG D G KYW +KNSWG SWGE GYIR+ R+ G+CG+ + +YP
Sbjct: 305 GYGTDASSGAKYWTIKNSWGQSWGERGYIRILRD--VGGPGLCGVTLDIAYPT 355
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 180/315 (57%), Gaps = 36/315 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEW-QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSN 115
+ + +W ++ +E S + RRF + N +YI+ N + S++L N+F+DL++
Sbjct: 9 LSGEYASWCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTS 68
Query: 116 EEFISTYLGYNK----------PYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
EEF +LG P + Q + LPASVDWRK GAVT KDQG CG
Sbjct: 69 EEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRKHGAVTAPKDQGSCGG 128
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAF+ A+EGIN++ TG+L+SLSEQEL+DCD ++ +GC+GG ME A++FI + GG+
Sbjct: 129 CWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKAD-KGCDGGLMENAYQFIVENGGLD 187
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAF 263
TE DYPY C K V I GYEAIP A F
Sbjct: 188 TETDYPYHASESHCNMKKLNSRVVAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGASKDF 247
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
Q Y+ GVF +CG ++NHGV +VGYG + G YW+VKNSW +WG+ G+++M RN+
Sbjct: 248 QHYASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRG 307
Query: 324 IGICGILMQASYPVK 338
G+C I ASYPVK
Sbjct: 308 -GLCSINTLASYPVK 321
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 209/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPR-WP 135
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 136 SVQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
S +++ +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 209/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPR-WP 135
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 136 SVQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
S +++ +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 139/349 (39%), Positives = 207/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
L +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 141/314 (44%), Positives = 195/314 (62%), Gaps = 35/314 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+S+ + +E W K ++ +++ +RF ++ NV ++ +N + +KL NKFAD+SN
Sbjct: 35 ESLWQLYERWGKHHTISRNLKEK-HKRFSVFKENVNHVFTVNQMDKPYKLKLNKFADMSN 93
Query: 116 EEFISTYLGYN----KPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSC 166
EF++ Y N + +E R + ++ LP+SVD R+ GAV VK+QG+CGSC
Sbjct: 94 YEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDGRERGAVNAVKEQGRCGSC 153
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS+VAAVEGINK+KT +L+SLSEQEL+DC N N+GCNGG+ME AF+FI + GG+ T
Sbjct: 154 WAFSSVAAVEGINKIKTNQLLSLSEQELLDC--NYRNKGCNGGFMEIAFDFIKRNGGIAT 211
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------ARYAFQL 265
E+ YPY G C++ + V I GYE++P A FQ
Sbjct: 212 ENSYPYHGSRGLCRSSRISSPIVKIDGYESVPENEDALMQAVANQPVSVAIDAAGRDFQF 271
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GVFD YCG +LNHGV +GYG + G YWLV+NSWG WGE GY+RM R +
Sbjct: 272 YSQGVFDGYCGTELNHGVVAIGYGTTEDGTDYWLVRNSWGVGWGEDGYVRMKRGVEQAE- 330
Query: 325 GICGILMQASYPVK 338
G+CGI M+ASYP+K
Sbjct: 331 GLCGIAMEASYPIK 344
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 140/351 (39%), Positives = 209/351 (59%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPR-WP 135
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 136 SVQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
S +++ +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S PS G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPS---GLCDITKMSSYP 341
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 140/351 (39%), Positives = 209/351 (59%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPR-WP 135
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 136 SVQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
S +++ +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S PS G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPS---GLCDIAKMSSYP 341
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 248 bits (633), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 139/302 (46%), Positives = 182/302 (60%), Gaps = 27/302 (8%)
Query: 60 ERFENWLKQYSREYGSEDEWQR---------RFGIYSSNVQYIDYINSQN----LSFKLT 106
ER + ++Q + + SE R R ++ N++YID N++ +F+L
Sbjct: 41 ERADEEVRQLYKTWKSEHGRPRDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLG 100
Query: 107 DNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQG 161
F DL+ EEF + LG+ PR S +YL LP +VDWR++GAVT VK+Q
Sbjct: 101 LTPFTDLTLEEFRAHALGFLNS-TLPRVASDRYLPRAGDDLPDAVDWRQQGAVTGVKNQL 159
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
CG CWAFSAVAA+EGINK+ T L+SLSEQEL+DCD +E+ GC GG M+KAF+F+
Sbjct: 160 DCGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCD--TEDYGCQGGEMQKAFQFVIDN 217
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQLYSH-----GVFDEYCG 276
GG+ TE DYP+ G N C + K V+I YE +P L G+F+ CG
Sbjct: 218 GGIDTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQPGIFNGPCG 277
Query: 277 HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
L+HGVT VGYG D+GE +W+VKNSWG WGE+GYIRM RN +G CGI M ASYP
Sbjct: 278 FILDHGVTAVGYGSDNGEDFWIVKNSWGAEWGESGYIRMKRNVLLP-MGKCGIAMYASYP 336
Query: 337 VK 338
VK
Sbjct: 337 VK 338
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 207/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PELSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y+G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 144/315 (45%), Positives = 185/315 (58%), Gaps = 42/315 (13%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNE 116
M +RF W ++R Y S +E RRF +Y +NV+YID N + L+++L +N+FADL+ E
Sbjct: 41 MMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADLTGE 100
Query: 117 EFISTYLG-------YNKPYNEPRWPSVQYLGL-----PASVDWRKEGAVTPVKDQG-QC 163
EF++ Y G + W S G PASVDWR +GAVTPVK+QG QC
Sbjct: 101 EFLARYAGGHTGSAITTAAEADGLWSSGGSDGSLEADPPASVDWRAKGAVTPVKNQGSQC 160
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
SCWAFSAVA +E + +KTGKLV+LSEQ+LVDCD + GCN GY +AF++I + GG
Sbjct: 161 YSCWAFSAVATMESLYFIKTGKLVALSEQQLVDCD--KYDGGCNKGYYHRAFQWIMENGG 218
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------AR----------YAF 263
+TT YPY+ C K AVTITG+ A+ AR +
Sbjct: 219 ITTAAQYPYKAVRGACSAAKP---AVTITGHLAVAKNELALQSAVARQPIGVAIEVPISM 275
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
Q Y GVF CG Q++H V VGYG D G KYWLVKNSWG +WGEAGYIRM R+
Sbjct: 276 QFYKSGVFSAACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMRRDVGGG 335
Query: 323 NIGICGILMQASYPV 337
G+CGI + +YP
Sbjct: 336 --GLCGIALDTAYPT 348
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 139/352 (39%), Positives = 207/352 (58%), Gaps = 48/352 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYN--------KPYN 130
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P +
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 131 EPRWPSVQYLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
+ + L +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLM 176
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K
Sbjct: 177 EFSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQE-KTA 233
Query: 248 AVTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVG 287
AV I+ Y+ +P A Q Y+ G +D C ++NH VT +G
Sbjct: 234 AVQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGNCADRINHAVTAIG 293
Query: 288 YGED-HGEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYP 336
YG D G+KYWL+KNSWGTSWGE GY+++ R+S PS G+C I +SYP
Sbjct: 294 YGTDEEGQKYWLLKNSWGTSWGENGYMKIIRDSGDPS---GLCDIAKMSSYP 342
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 209/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPR-WP 135
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 136 SVQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
S +++ +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 141/306 (46%), Positives = 182/306 (59%), Gaps = 33/306 (10%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNE 116
M +RF W ++R Y S +E RRF +Y +NV+YID N + L+++L +N+FADL+ E
Sbjct: 41 MMDRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADLTGE 100
Query: 117 EFISTYLGYNKP---YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQG-QCGSCWAFSAV 172
EF++ Y G + PASVDWR +GAVTPVK+QG QC SCWAFSAV
Sbjct: 101 EFLARYAGGHTGSAITTAAEADGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAV 160
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
A +E + +KTGKLV+LSEQ+LVDCD + GCN GY +AF++I + GG+TT YPY
Sbjct: 161 ATMESLYFIKTGKLVALSEQQLVDCD--KYDGGCNKGYYHRAFQWIMENGGITTAAQYPY 218
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIP----------AR----------YAFQLYSHGVFD 272
+ C K AVTITG+ A+ AR + Q Y GVF
Sbjct: 219 KAVRGACSAAKP---AVTITGHLAVAKNELALQSAVARQPIGVAIEVPISMQFYKSGVFS 275
Query: 273 EYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
CG Q++H V VGYG D G KYWLVKNSWG +WGEAGYIRM R+ G+CGI +
Sbjct: 276 AACGIQMSHAVVTVGYGADASGLKYWLVKNSWGQTWGEAGYIRMRRDVGGG--GLCGIAL 333
Query: 332 QASYPV 337
+YP
Sbjct: 334 DTAYPT 339
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 137/348 (39%), Positives = 206/348 (59%), Gaps = 41/348 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M++ L N +++LF + + + G Q S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQP--KLSVSERHELWMSRHGRVYKDEVEKG 57
Query: 81 RRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPR-WPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 58 ERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSS 117
Query: 137 VQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+++ +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 118 TEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEF 177
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K AV
Sbjct: 178 SEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQE-KTAAV 234
Query: 250 TITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG 289
I+ Y+ +P A Q Y+ G +D C ++NH VT +GYG
Sbjct: 235 QISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 290 EDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
D G+KYWL+KNSWGTSWGE G++++ R+S + G+C I +SYP
Sbjct: 295 TDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341
>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
Length = 388
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 137/301 (45%), Positives = 182/301 (60%), Gaps = 23/301 (7%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
+ F W + R Y S E ++R ++ N +++ N++N L N+FADL+ EEF
Sbjct: 44 QAFSQWQMTHGRSYKSASEARKRQAVFVENAKHVAEQNARNSGLVLALNQFADLTLEEFA 103
Query: 120 STYLGYNKPYNEPR---WPSVQYLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
+T+LGYN E + S QY LP++VDWRK+ AVTPVK+Q CGSCWAFSA
Sbjct: 104 ATHLGYNPSLREGKEHTTTSFQYADANDLPSTVDWRKKNAVTPVKNQAMCGSCWAFSATG 163
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEGIN ++TGKLVSLSEQ+LVDCD + ++ GC GG M+ AF++ITK GG+ +EDDY Y
Sbjct: 164 AVEGINAIRTGKLVSLSEQQLVDCD-SEKDLGCGGGLMDFAFDYITKNGGIDSEDDYSYW 222
Query: 234 GKNDRCQTDK-TKHHAVTITGYEAIP-----------ARYAFQLYSHGVF-DEYCGHQLN 280
G CQ K H VTI G+E +P A LY GV D+ C LN
Sbjct: 223 GYGLICQRRKEADRHVVTIDGFEDVPKNDGEALKKAIAHQPVSLYHSGVVGDDACCQDLN 282
Query: 281 HGVTVVGY--GEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
HGV VGY G G ++++KNSWG WGE G+ R+A S ++ G CG+ ASYP+K
Sbjct: 283 HGVLAVGYDDGSKGGTPHYVIKNSWGEGWGEQGFFRLAAKSSEAS-GACGVYKAASYPLK 341
Query: 339 R 339
+
Sbjct: 342 K 342
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 248 bits (632), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 136/349 (38%), Positives = 207/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L + +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMSILITLFFVISM------FNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK+QGQCG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C +++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCANRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNP-AGLCDIAKVSSYP 341
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 247 bits (631), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 138/334 (41%), Positives = 191/334 (57%), Gaps = 37/334 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYP-QKYDPQSME------ERFENWLKQYSREY 73
M + + S FL +G +S + Y P+ + FE+ L ++S+ Y
Sbjct: 1 MAFIFSSKKTSAFLCICIGFGMFGFSHEFSILGYAPEDLTSIHKVIHLFESSLVKHSKIY 60
Query: 74 GSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR 133
S DE RF I+ N+++ID N + ++ L N+FADL++EEF + +LG+ E +
Sbjct: 61 ESFDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKNKFLGFKGELAERK 120
Query: 134 WPSVQ------YLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
S++ ++ LP SVDWRK+GAV+PVK+QGQCGSCWAFS VAAVEGIN++ TG L
Sbjct: 121 DESIEQFRYRDFVDLPKSVDWRKKGAVSPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLT 180
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
LSEQEL+DCD + N GCNGG M+ AF ++T+ G+ E++YPY C +
Sbjct: 181 VLSEQELIDCDT-TFNNGCNGGLMDYAFAYVTR-NGLHKEEEYPYIMSEGTCDEKRDASE 238
Query: 248 AVTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTV 285
VTI+GY +P + FQ YS GVFD +CG +L+HGV
Sbjct: 239 KVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAA 298
Query: 286 VGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
VGYG G Y +V+NSWG WGE GYIRM RN+
Sbjct: 299 VGYGTSKGLDYVIVRNSWGPKWGEKGYIRMKRNT 332
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 192/317 (60%), Gaps = 41/317 (12%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEF 118
E+ E W+ +++R Y E E + RF I+ N++++ N N +++K+ N+F+DL++EEF
Sbjct: 33 EKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDINEFSDLTDEEF 92
Query: 119 ISTYLGYNKPYNEPRWPSV---------QYLGLP---ASVDWRKEGAVTPVKDQGQCGSC 166
+T+ G P R ++ +Y + S+DWR+EGAVTPVK QG+CG C
Sbjct: 93 RATHTGLVVPEAITRISTLSSGKNTVPFRYGNVSDNGESMDWRQEGAVTPVKYQGRCGGC 152
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSAVAAVEGI K+ G+LVSLSEQ+L+DCD + NQGC GG M KAFE+I K G+TT
Sbjct: 153 WAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDY-NQGCRGGIMSKAFEYIIKNQGITT 211
Query: 227 EDDYPYRGKND---RCQTDKTKHHAVTITGYEAIPAR----------------------Y 261
ED+YPY+ T + A TI+GYE +P
Sbjct: 212 EDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGA 271
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
AF+ YS GVF+ CG L+H VT+VGYG + G KYW+VKNSWG +WGE GY+R+ R+
Sbjct: 272 AFRHYSGGVFNGECGTDLHHAVTIVGYGMSEEGTKYWVVKNSWGETWGENGYMRIKRDVD 331
Query: 321 SSNIGICGILMQASYPV 337
+ G+CG+ + A YP+
Sbjct: 332 APQ-GMCGLAILAFYPL 347
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 247 bits (631), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 139/354 (39%), Positives = 199/354 (56%), Gaps = 41/354 (11%)
Query: 19 IDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQK-YDPQSMEERFENWLKQYSREYGSED 77
I + M+ +L +L V+ + A Y ++M+ R + W+ ++ R Y E
Sbjct: 5 IGNKTMITFTAAALMILAVMTMVVEARDLSTSTGGYGEEAMKVRHQQWMAEHGRTYKDEA 64
Query: 78 EWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLGYNK-PYNEPRWP 135
E RRF ++ +N ++D N+ S++L N+FAD++N+EF++ Y G P +
Sbjct: 65 EKARRFQVFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAMYTGLKPVPAGPKKMA 124
Query: 136 SVQYLGLPAS------VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+Y L S VDWR++GAVT +K+QGQCG CWAF+AVAAVE I+++ TG LVSL
Sbjct: 125 GFKYENLTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIHQITTGNLVSL 184
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQ+++DCD + N GCNGGY++ AF++I GG+ TED YPY CQ+ + AV
Sbjct: 185 SEQQVLDCDTDGNN-GCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTCQS--SVQPAV 241
Query: 250 TITGYEAIP---------------------ARYAFQLYSHGVFD-EYCGH-QLNHGVTVV 286
TI+ Y+ +P A FQ YS GV + CG LNH VT V
Sbjct: 242 TISSYQDVPSGDEAALAAAVANQPVAVAIDAHNNFQFYSSGVLTADTCGTPSLNHAVTAV 301
Query: 287 GYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
GY + G YWL+KN WG +WGE GY+R+ R + + CG+ QASYPV R
Sbjct: 302 GYSTAEDGTPYWLLKNQWGQNWGEGGYLRVERGTNA-----CGVAQQASYPVAR 350
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 136/349 (38%), Positives = 206/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM------FNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 207/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D +G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDENGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 247 bits (630), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 136/349 (38%), Positives = 206/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM------FNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 206/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 131/313 (41%), Positives = 176/313 (56%), Gaps = 48/313 (15%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D +M R E W+ QY R Y + E RRF ++ +NV +I+ N+ N F L N+FADL
Sbjct: 29 DDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVNQFADL 88
Query: 114 SNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+N+EF ST + R P+ V LPA++DWR +G VTP+KDQGQCG CW
Sbjct: 89 TNDEFRSTKTNKGFIPSTTRVPTGFRNENVNIDALPATMDWRTKGVVTPIKDQGQCGCCW 148
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAA+E ELVDCDV+ E+QGC GG M+ AF+FI K GG+TTE
Sbjct: 149 AFSAVAAME----------------ELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTE 192
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQL 265
+YPY +D+ ++ + +I GYE +PA FQ
Sbjct: 193 SNYPYAAVDDKFKS--VSNSVASIKGYEDVPANNEAALMKAVANQPVSVAVDGGDMTFQF 250
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GV CG L+HG+ +GYG+ G KYWL+KNSWG +WGE G++RM ++ S
Sbjct: 251 YKGGVMTGSCGTDLDHGIVAIGYGKASDGTKYWLLKNSWGMTWGENGFLRMEKD-ISDKR 309
Query: 325 GICGILMQASYPV 337
G+CG+ M+ SYP
Sbjct: 310 GMCGLAMEPSYPT 322
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 178/311 (57%), Gaps = 27/311 (8%)
Query: 51 QKYDPQ-SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDN 108
Q+ DP S+ ERFE W +Y Y E ++ F I+ NV YIDY N+ N +KL N
Sbjct: 30 QENDPSLSLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAIN 89
Query: 109 KFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
+F D E+ + + +PA+VDWRK GAVTP+K+QG+CGSCWA
Sbjct: 90 RFVDKPIEDSDDGFERTTTTTPTTTFKYENVTDIPATVDWRKRGAVTPIKNQGKCGSCWA 149
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSAVAA+EGI K+ +G LVSLSEQ+LVDCD + +GC+ G M AF+FI + GG+ TE
Sbjct: 150 FSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIATEA 209
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPA---------------------RYAFQLYS 267
+YPY K T K H V I YE +P+ R F+ YS
Sbjct: 210 NYPY--KRVVKGTCKKVSHKVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRGMFKFYS 267
Query: 268 HGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
G+F CG + NH +T+VGYG G KYWLVKNSW WGE GYIR+ R+ + G+
Sbjct: 268 SGIFTGECGTKPNHALTIVGYGTSKDGIKYWLVKNSWSKRWGEKGYIRIKRDIDAKE-GL 326
Query: 327 CGILMQASYPV 337
CGI M+ SYP+
Sbjct: 327 CGIAMKPSYPI 337
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 206/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 131/304 (43%), Positives = 176/304 (57%), Gaps = 29/304 (9%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-DYINSQNLSFKLTDNKFADLSNEEF 118
+ FE W+ ++ + Y E + RFGI+ NV +I Y + N+FADL+N+EF
Sbjct: 18 QMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEF 77
Query: 119 ISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGI 178
++TY G P+ + V + P +DWR GAVT VKDQG CGSCWAF+AVAA+EG+
Sbjct: 78 VATYTGAKPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGL 137
Query: 179 NKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR 238
K++TG+L LSEQELVDCD NS GC GG+ ++AFE + GG+T E DY Y G +
Sbjct: 138 TKIRTGQLTPLSEQELVDCDTNS--NGCGGGHTDRAFELVASKGGITAESDYRYEGFQGK 195
Query: 239 CQTDKTK-HHAVTITGYEAIP-----------ARY-----------AFQLYSHGVFDEYC 275
C+ D +HA +I GY A+P AR AFQ Y GVF C
Sbjct: 196 CRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPC 255
Query: 276 GHQLNHGVTVVGYGED--HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
G NH VT+VGY +D G+KYWL KNSWG +WG+ GYI + ++ + G CG+ +
Sbjct: 256 GASSNHAVTLVGYCQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPH-GTCGLAVSP 314
Query: 334 SYPV 337
YP
Sbjct: 315 FYPT 318
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 132/304 (43%), Positives = 183/304 (60%), Gaps = 35/304 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFIS 120
FE W ++ + Y S+ E RR I+S + YI+ N+ N +F L NKF+DL+N EF +
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
Query: 121 TYLG-YNKPYNEPRWPS----VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
Y+G + P + R P+ V LP S+DWR+EGAVTP+KDQGQCGSCWAFSA+A++
Sbjct: 62 NYVGKFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASI 121
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
E + L T +LVSLSEQ+L+DCD + +QGC GG+ E AF+F+ + GGVTTE+ YPY G
Sbjct: 122 ESAHFLATKELVSLSEQQLIDCD--TVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179
Query: 236 NDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVFDE 273
C +K K V ITGY+ + A FQ Y G+
Sbjct: 180 AGSCNANKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 274 YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
+C + +H V V+GYG + G YW++KNSWGTSWGE G++R+ + G+CG+ Q+
Sbjct: 238 HCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKKDGE---GMCGMNGQS 294
Query: 334 SYPV 337
SYP
Sbjct: 295 SYPT 298
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 138/351 (39%), Positives = 206/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM------FNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S PS G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPS---GLCDIAKMSSYP 341
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 206/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 132/304 (43%), Positives = 183/304 (60%), Gaps = 35/304 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFIS 120
FE W ++ + Y S+ E RR I+S + YI+ N+ N +F L NKF+DL+N EF +
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
Query: 121 TYLG-YNKPYNEPRWPS----VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
Y+G + P + R P+ V LP S+DWR+EGAVTP+KDQGQCGSCWAFSA+A++
Sbjct: 62 NYVGKFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASI 121
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
E + L T +LVSLSEQ+L+DCD + +QGC GG+ E AF+F+ + GGVTTE+ YPY G
Sbjct: 122 ESAHFLATKELVSLSEQQLIDCD--TVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179
Query: 236 NDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVFDE 273
C +K K V ITGY+ + A FQ Y G+
Sbjct: 180 AGSCNANKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 274 YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
+C + +H V V+GYG + G YW++KNSWGTSWGE G++R+ + G+CG+ Q+
Sbjct: 238 HCSNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMRIKKEDGE---GMCGMNGQS 294
Query: 334 SYPV 337
SYP
Sbjct: 295 SYPT 298
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 137/316 (43%), Positives = 192/316 (60%), Gaps = 40/316 (12%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSN 115
S E+ E W+ +++R Y + E RF I+++N+++++ IN + N ++ L N+F+DL++
Sbjct: 30 SAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTD 89
Query: 116 EEFISTYLGYNKPYNEPRWP--------SVQYLGLPA---SVDWRKEGAVTPVKDQGQCG 164
EEF + Y G P R S +Y + S+DW +EGAVT VK Q QCG
Sbjct: 90 EEFKARYTGLVVPEGMTRISTTDSHETVSFRYENVGETGESMDWIQEGAVTSVKHQQQCG 149
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
CWAFSAVAAVEG+ K+ G+LVSLSEQ+L+DC ++EN GC GG M KAF++I + G+
Sbjct: 150 CCWAFSAVAAVEGMTKIANGELVSLSEQQLLDC--STENNGCGGGIMWKAFDYIKENQGI 207
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYA 262
TTED+YPY+G C+++ A TI+GYE +P + Y
Sbjct: 208 TTEDNYPYQGAQQTCESNHLA--AATISGYETVPQNDEEALLKAVSQQPVSVAIEGSGYE 265
Query: 263 FQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
F YS G+F+ CG QL H VT+VGYG + G KYWL+KNSWG SWGE GY+R+ R+ S
Sbjct: 266 FIHYSGGIFNGECGTQLTHAVTIVGYGVSEEGIKYWLLKNSWGESWGENGYMRIMRDVDS 325
Query: 322 SNIGICGILMQASYPV 337
G+CG+ A YPV
Sbjct: 326 PQ-GMCGLASLAYYPV 340
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 147/315 (46%), Positives = 189/315 (60%), Gaps = 39/315 (12%)
Query: 62 FENWLKQYSREYGSED----EWQRRFGIYSSNVQYIDYINSQ---NLSFKLTDNKFADLS 114
++ W+ ++ GS + E++RRF ++ N++++D N+ + F+L N+FADL+
Sbjct: 66 YDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGMNRFADLT 125
Query: 115 NEEFISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAV-TPVKDQGQCGSCWA 168
N+EF + YLG P R Y LP SVDWR +GAV +PVK+QGQCGSCWA
Sbjct: 126 NDEFRAAYLG-TTPAGRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWA 184
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSAVAAVEGINK+ TG+LVSLSEQELV+C N N GCNGG M+ AF FIT+ GG+ TE+
Sbjct: 185 FSAVAAVEGINKIVTGELVSLSEQELVECARNGGNSGCNGGIMDDAFAFITRNGGLDTEE 244
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLY 266
DYPY + +C K V+I G+E +P FQLY
Sbjct: 245 DYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLY 304
Query: 267 SHGVFDEYCGHQLNHGVTVVGYGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
GVF CG L+HGV VGYG D G YW V+NSWG WGE GYIRM RN ++
Sbjct: 305 DSGVFTGRCGTSLDHGVVAVGYGTDAATGTDYWTVRNSWGPDWGENGYIRMERNV-TART 363
Query: 325 GICGILMQASYPVKR 339
G CGI M ASYP+K+
Sbjct: 364 GKCGIAMMASYPIKK 378
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 246 bits (629), Expect = 9e-63, Method: Compositional matrix adjust.
Identities = 140/351 (39%), Positives = 206/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S PS G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPS---GLCDIAKMSSYP 341
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 246 bits (629), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 137/339 (40%), Positives = 188/339 (55%), Gaps = 30/339 (8%)
Query: 25 LRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFG 84
+ +AVL L + ++ + A Y D + FE W+ ++ + Y E + RFG
Sbjct: 7 MASAVL-LVVCTLMALQAMGADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFG 65
Query: 85 IYSSNVQYI-DYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLP 143
I+ NV +I Y + N+FADL+N+EF++TY G P+ + V + P
Sbjct: 66 IFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHPKEAPRPVDPIWTP 125
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
+DWR GAVT VKDQG CGSCWAF+AVAA+EG+ K++TG+L LSEQELVDCD NS
Sbjct: 126 CCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCDTNS-- 183
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK-HHAVTITGYEAIP---- 258
GC GG+ ++AFE + GG+T E DY Y G +C+ D +HA I GY A+P
Sbjct: 184 NGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAARIGGYRAVPPNDE 243
Query: 259 -------ARY-----------AFQLYSHGVFDEYCGHQLNHGVTVVGYGED--HGEKYWL 298
AR AFQ Y GVF CG NH VT+VGY +D G+KYW+
Sbjct: 244 RQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGYCQDGASGKKYWV 303
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWG +WG+ GYI + ++ + G CG+ + YP
Sbjct: 304 AKNSWGKTWGQQGYILLEKDVLQPH-GTCGLAVSPFYPT 341
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/351 (39%), Positives = 206/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S PS G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPS---GLCDITKMSSYP 341
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 140/351 (39%), Positives = 206/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S PS G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPS---GLCDITKMSSYP 341
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 143/351 (40%), Positives = 199/351 (56%), Gaps = 40/351 (11%)
Query: 16 KIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGS 75
K I M ++ S FL++ I A P + + + M +E+WL +Y + Y S
Sbjct: 5 KSFISMSLLF----FSTFLIFSFAIDAKI----SPLRTNDEVMA-LYESWLVKYGKSYNS 55
Query: 76 EDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYN---KPYNE 131
E + R I+ N+++ID N+ N S+ + N+FADL++EE+ STYLG+ K
Sbjct: 56 LGEREMRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSSLKSKVS 115
Query: 132 PRW-PSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
R+ P V + LP VDWR GAV VK+QG C SCWAF+ +A VE IN++ TG L+SLS
Sbjct: 116 NRYMPQVGEV-LPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLS 174
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQELVDC+ N+GC GG+M+ A+EFI GG+ TE++YPY G++D+C K + VT
Sbjct: 175 EQELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCDEPKKNQNYVT 234
Query: 251 ITGYEAIPAR----------------------YAFQLYSHGVFD-EYCGHQLNHGVTVVG 287
I YE +P F+ Y G+F CG LNH VT++G
Sbjct: 235 IDSYEQVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIG 294
Query: 288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YG ++G YW+VKNS+GT WGE+GY ++ RN G CGI YPVK
Sbjct: 295 YGTENGIDYWIVKNSYGTQWGESGYGKVQRNVGGE--GRCGIASYPFYPVK 343
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 206/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 131/304 (43%), Positives = 185/304 (60%), Gaps = 35/304 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFIS 120
FE+W ++ + Y S+ E RR ++S + YI+ N+Q N +F L NKF+DL+N EF +
Sbjct: 2 FEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
Query: 121 TYLG-YNKPYNEPRWPS----VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
Y+G + P + R P+ V LP S+DWR+EGAVTP+KDQGQCGSCWAFSA+A++
Sbjct: 62 NYVGKFKPPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASI 121
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
E + L T +LVSLSEQ+L+DCD + +QGC GG+ + AF+F+ + GGVTTE+ YPY G
Sbjct: 122 ESAHFLATKELVSLSEQQLIDCD--TVDQGCQGGFPDDAFKFVVENGGVTTEEAYPYTGF 179
Query: 236 NDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVFDE 273
C T+K K V ITGY+ + A FQ Y G+
Sbjct: 180 AGSCNTNKNK--VVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 274 YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
C + +H V V+GYG + G YW++KNSWGTSWGE G++++ + G+CG+ Q+
Sbjct: 238 QCCNSRDHAVLVIGYGTEGGMPYWIIKNSWGTSWGEDGFMKIKKKDGE---GMCGMNGQS 294
Query: 334 SYPV 337
SYP
Sbjct: 295 SYPT 298
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 135/333 (40%), Positives = 181/333 (54%), Gaps = 49/333 (14%)
Query: 41 PAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN 100
PA W ++ ++ F ++ Y++ Y +E+E QRR+ I+ +N+ YI N Q
Sbjct: 102 PANIW------EWKEAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQG 155
Query: 101 LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG------------LPASVDW 148
S+ L N F DLS +EF YLG+ K N +LG LPA VDW
Sbjct: 156 YSYSLKMNHFGDLSRDEFRRKYLGFKKSRN----LKSHHLGVATELLNVLPSELPAGVDW 211
Query: 149 RKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNG 208
R G VTPVKDQ CGSCWAFS A+EG + KTGKLVSLSEQEL+DC NQ C+G
Sbjct: 212 RSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSG 271
Query: 209 GYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------- 260
G M AF+++ GG+ +ED YPY +++ C+ ++ V I G++ +P R
Sbjct: 272 GEMNDAFQYVLDSGGICSEDAYPYLARDEECRA-QSCEKVVKILGFKDVPRRSEAAMKAA 330
Query: 261 --------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK--YWLVKNSWG 304
FQ Y GVFD CG L+HGV +VGYG D K +W++KNSWG
Sbjct: 331 LAKSPVSIAIEADQMPFQFYHEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWG 390
Query: 305 TSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
T WG GY+ MA + G CG+L+ AS+PV
Sbjct: 391 TGWGRDGYMYMAMHKGEE--GQCGLLLDASFPV 421
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 138/349 (39%), Positives = 205/349 (58%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/351 (39%), Positives = 207/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PELSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GC+GG+M AF+FI + GG+++E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S PS G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPS---GLCDIAKMSSYP 341
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 246 bits (628), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 141/351 (40%), Positives = 205/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYLG--------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTCRS-REKTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C Q+NH VT +GY
Sbjct: 234 VQISSYKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGNCADQINHAVTAIGY 293
Query: 289 GED-HGEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S PS G+C I +SYP
Sbjct: 294 GTDEEGQKYWLLKNSWGTSWGENGFMKIIRDSGDPS---GLCDIAKMSSYP 341
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 131/299 (43%), Positives = 179/299 (59%), Gaps = 40/299 (13%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYL- 123
YS+ Y SE +R + +N+++I+ N+++ S+ + N+FADL+ +EF++ Y+
Sbjct: 5 YSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALYVP 64
Query: 124 -GYNK--PYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINK 180
+N+ PYN P+ SVDWR +GAVTP+K+QGQCGSCW+FS + EG +
Sbjct: 65 SKFNRTMPYNTVYLPATS----EDSVDWRTKGAVTPIKNQGQCGSCWSFSTTGSTEGAHA 120
Query: 181 LKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
+ TG LVSLSEQ+LVDC + NQGCNGG M+ AF++I G+ TE+DYPY ++ C
Sbjct: 121 IATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTAQDGTCN 180
Query: 241 TDKTKHHAVTITGYEAIPA----------------------RYAFQLYSHGVFDEYCGHQ 278
+K HA TI+ Y +P + FQLY GVFD CG
Sbjct: 181 KEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGVFDGNCGTN 240
Query: 279 LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
L+HGV VVGY +D YW+VKNSWGT+WG GYI M R +S GICGI MQ SYP+
Sbjct: 241 LDHGVLVVGYTDD----YWIVKNSWGTTWGVEGYINMKRGVSAS--GICGIAMQPSYPI 293
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 246 bits (627), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 139/351 (39%), Positives = 207/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PELSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GC+GG+M AF+FI + GG+++E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S PS G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGDPS---GLCDIAKMSSYP 341
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 246 bits (627), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 137/349 (39%), Positives = 206/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVITM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GC+GG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 136/295 (46%), Positives = 176/295 (59%), Gaps = 35/295 (11%)
Query: 76 EDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYLGYNKPYNE 131
E++ + R ++ N++YID N++ +F+L FADL+ EE+ LG+
Sbjct: 82 EEDRRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGRR 141
Query: 132 PRWP-----SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKL 186
SV+ LP ++DWR+ GAVT VKDQ QCG CWAFSAVAA+EG+N + TG L
Sbjct: 142 SGARYGSGYSVRGGDLPDAIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNL 201
Query: 187 VSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKH 246
VSLSEQE++DCD +++ GC+GG ME AF F+ GG+ TE DYP+ G + C K K+
Sbjct: 202 VSLSEQEIIDCD--AQDSGCDGGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKN 259
Query: 247 HAV-TITGY------------EAIPAR----------YAFQLYSHGVFDEYCGHQLNHGV 283
V TI G EA+ + AFQ YS G+F+ CG L+HGV
Sbjct: 260 EKVATIDGLVEVASNNETALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGV 319
Query: 284 TVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
T VGYG + G+ YW+VKNSW SWGEAGYIRM RN P G CGI M ASYPVK
Sbjct: 320 TAVGYGSESGKDYWIVKNSWSASWGEAGYIRMRRNVPRPT-GKCGIAMDASYPVK 373
>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 361
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 147/360 (40%), Positives = 198/360 (55%), Gaps = 47/360 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M M +A L+L +L+ + + + ERF+ W +Y+R Y + +E+Q
Sbjct: 1 MTMATASASLALVMLFACSLLLAG--TAFSDDTIAIPLLERFKAWQAEYNRTYATPEEFQ 58
Query: 81 RRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYL---GYNKPYNEPRWP 135
+RF +YS N+++I +N S S++L +N+F DL+ EEF TYL P E P
Sbjct: 59 QRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMPP 118
Query: 136 SVQYLGL------------PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKT 183
V + P SVDWR +GAVTPVK+Q QCGSCWAF+ VA++EG++++KT
Sbjct: 119 IVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKT 178
Query: 184 GKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK 243
G+LVSLSEQE+VDCD + GC GGY A E++T+ GG+TTE DYPY G +C + K
Sbjct: 179 GRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAMEWVTRNGGLTTESDYPYVGSQRQCMSGK 238
Query: 244 TKHHAVTITGYEA---------------------IPARYAFQLYSHGVFDEYCG-HQLNH 281
HHA I GY+A I A AFQ Y GVF C +NH
Sbjct: 239 LGHHAARIRGYQAVQRKNEAELERAVAGRPVAVVIDASRAFQFYKRGVFSGPCNTTTVNH 298
Query: 282 GVTVV-----GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
VTVV G G KYW+VKNSWG WGE GY+RMAR + G+C I ++ P
Sbjct: 299 AVTVVGYGSAGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRARE-GMCAIAIEPLLP 357
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/351 (39%), Positives = 207/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
L +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+ +P+ G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPA---GLCDIAKMSSYP 341
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 130/304 (42%), Positives = 176/304 (57%), Gaps = 29/304 (9%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-DYINSQNLSFKLTDNKFADLSNEEF 118
+ FE W+ ++ + Y E + RFGI+ NV +I Y + N+FADL+N+EF
Sbjct: 18 QMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEF 77
Query: 119 ISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGI 178
++TY G P+ + V + P +DWR GAVT VKDQG CGSCWAF+AVAA+EG+
Sbjct: 78 VATYTGAKPPHPKEAPRPVDPIWTPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGL 137
Query: 179 NKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR 238
K++TG+L LSEQELVDCD NS GC GG+ ++AFE + GG+T E DY Y G +
Sbjct: 138 TKIRTGQLTPLSEQELVDCDTNS--NGCGGGHTDRAFELVASKGGITAESDYRYEGFQGK 195
Query: 239 CQTDKTK-HHAVTITGYEAIP-----------ARY-----------AFQLYSHGVFDEYC 275
C+ D +HA +I GY A+P AR AFQ Y GVF C
Sbjct: 196 CRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPC 255
Query: 276 GHQLNHGVTVVGYGED--HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
G NH VT+VGY +D G+KYW+ KNSWG +WG+ GYI + ++ + G CG+ +
Sbjct: 256 GASSNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPH-GTCGLAVSP 314
Query: 334 SYPV 337
YP
Sbjct: 315 FYPT 318
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 135/333 (40%), Positives = 181/333 (54%), Gaps = 49/333 (14%)
Query: 41 PAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN 100
PA W ++ ++ F ++ Y++ Y +E+E QRR+ I+ +N+ YI N Q
Sbjct: 101 PANIW------EWKEAHFQDAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYIHTHNQQG 154
Query: 101 LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG------------LPASVDW 148
S+ L N F DLS +EF YLG+ K N +LG LPA VDW
Sbjct: 155 YSYSLKMNHFGDLSRDEFRRKYLGFKKSRN----LKSHHLGVATELLNVLPSELPAGVDW 210
Query: 149 RKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNG 208
R G VTPVKDQ CGSCWAFS A+EG + KTGKLVSLSEQEL+DC NQ C+G
Sbjct: 211 RSRGCVTPVKDQRDCGSCWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGNQSCSG 270
Query: 209 GYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------- 260
G M AF+++ GG+ +ED YPY +++ C+ ++ V I G++ +P R
Sbjct: 271 GEMNDAFQYVLDSGGICSEDAYPYLARDEECRA-QSCEKVVKILGFKDVPRRSEAAMKAA 329
Query: 261 --------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK--YWLVKNSWG 304
FQ Y GVFD CG L+HGV +VGYG D K +W++KNSWG
Sbjct: 330 LAKSPVSIAIEADQMPFQFYHEGVFDASCGTDLDHGVLLVGYGTDKESKKDFWIMKNSWG 389
Query: 305 TSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
T WG GY+ MA + G CG+L+ AS+PV
Sbjct: 390 TGWGRDGYMYMAMHKGEE--GQCGLLLDASFPV 420
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 135/349 (38%), Positives = 206/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N ++++F + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITVFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP---------- 128
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLS 116
Query: 129 YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
E + + +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/351 (39%), Positives = 207/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
L +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STELKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+ +P+ G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPA---GLCDIAKMSSYP 341
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 138/351 (39%), Positives = 209/351 (59%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPR-WP 135
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 136 SVQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
S +++ +P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFIINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+ +P+ G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPA---GLCDIAKMSSYP 341
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 137/349 (39%), Positives = 206/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GC+GG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 134/313 (42%), Positives = 188/313 (60%), Gaps = 40/313 (12%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEF 118
E+ E W+ ++ R Y + E RF I+ N+++++ N + N ++ L N+F+DL++EEF
Sbjct: 33 EKHEQWMSRFHRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNKTYTLDVNEFSDLTDEEF 92
Query: 119 ISTYLGYNKPYNEPRWP--------SVQYLGLPA---SVDWRKEGAVTPVKDQGQCGSCW 167
+ Y G P R S +Y + S+DWR+EGAVT VK Q QCG CW
Sbjct: 93 KARYTGLVVPEGMTRMSTTDSHETVSFRYENVGETGESMDWREEGAVTSVKHQQQCGCCW 152
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAVAAVEG+ K+ G+LVSLSEQ+L+DC ++EN GC+GG M KAF++I + G+T E
Sbjct: 153 AFSAVAAVEGMTKIAKGELVSLSEQQLLDC--STENDGCDGGIMWKAFDYIVENQGITAE 210
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQL 265
D+YPY+G C+++ A TI+GYE +P + Y F
Sbjct: 211 DNYPYQGAQQTCESNHVA--AATISGYETVPQNDEEALLKAVSQQPVSVAIEGSGYEFIH 268
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS G+F+ CG LNH VT+VGYG + G KYWL+KNSWG SWGE GY+R+ R+ +
Sbjct: 269 YSGGIFNGECGTHLNHAVTIVGYGVSEEGIKYWLLKNSWGESWGEDGYMRIMRDVDAPQ- 327
Query: 325 GICGILMQASYPV 337
G+CG+ A YPV
Sbjct: 328 GMCGLASLAYYPV 340
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 132/324 (40%), Positives = 189/324 (58%), Gaps = 41/324 (12%)
Query: 49 YPQKYDPQSMEE---RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFK 104
Y +K++ ++ +E FE+WL +Y + Y + E +RRF I+ N++++D N+ N S+K
Sbjct: 32 YAKKWEQRTNDEVMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYK 91
Query: 105 LTDNKFADLSNEEFISTYLGYNKPYN--------EPRWPSVQYLGLPASVDWRKEGAVTP 156
+ N+F+DL+ EE+ S YLG EPR LP S+DWRK+GAV
Sbjct: 92 VGLNQFSDLTLEEYSSIYLGTKFDMRMTNVSDRYEPRVGDQ----LPNSIDWRKKGAVLG 147
Query: 157 VKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFE 216
VK+QG CGSCW F+ +AAVE IN++ TG L+SLSEQ++VDC S N GC GG A++
Sbjct: 148 VKNQGNCGSCWTFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQ 207
Query: 217 FITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR---------------- 260
FI GG+ TE +YPY+ ++ C K + + VTI YE +P +
Sbjct: 208 FIIDNGGINTEANYPYKAQDGECDEQKNQKY-VTIDRYENVPRKNEKALQKAVSNQLVSV 266
Query: 261 ------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIR 314
F+ Y G+F CG +++H VT+VGYG + G YW+V+NSWG++WGE GY+R
Sbjct: 267 GIASNSSEFKAYKSGIFTGPCGAKIDHAVTIVGYGTEGGMDYWIVRNSWGSNWGENGYVR 326
Query: 315 MARNSPSSNIGICGILMQASYPVK 338
M RN N G C I +YPVK
Sbjct: 327 MQRN--VGNAGTCFIATSPNYPVK 348
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 245 bits (625), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 134/349 (38%), Positives = 206/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L + +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMSILITLFFVISM------FNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENIKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GC+GG+M AF+FI + GG+++E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDS-GNPAGLCDIAKMSSYP 341
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 137/349 (39%), Positives = 206/349 (59%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L + +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMSILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPS-GLCDIAKMSSYP 341
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 137/349 (39%), Positives = 205/349 (58%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGHVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QGQCG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GC+GG+M AF+FI + GG+++E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 136/324 (41%), Positives = 191/324 (58%), Gaps = 38/324 (11%)
Query: 49 YPQK---YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKL 105
YP+K + + +F + + +++ Y +E+E +R+ I+ +N+ YI N Q S+ L
Sbjct: 73 YPEKIWEWKDHHFQSQFYQFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHNMQGYSYVL 132
Query: 106 TDNKFADLSNEEFISTYLGYNKP--YNEPR-----WPSVQYLGLPASVDWRKEGAVTPVK 158
NKF DL+ EEF YLGY KP PR SV+ +P VDWR+ G VT VK
Sbjct: 133 KMNKFGDLTLEEFRQRYLGYKKPDLRTPPREVDTTLESVEDNDIPTHVDWRQRGCVTSVK 192
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
DQG CGSCWAFSA A+EG+ KTGKLV+LS+Q+LVDC NQGC+GG ME+AFE++
Sbjct: 193 DQGDCGSCWAFSATGAMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYV 252
Query: 219 TKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------ 260
+ GG+ + ++YPY K+ C++ + A TITGY ++P R
Sbjct: 253 VENGGICSGENYPYMRKDGVCKSSQCTSVA-TITGYRSVPRRSEKSMKTALALRSPVSVA 311
Query: 261 -----YAFQLYSHGVFDEYCGHQLNHGVTVVGY-GEDHGE-KYWLVKNSWGTSWGEAGYI 313
AFQ Y G+FD CG L+HGV +VGY E G+ YW++KNSWG +WG+ GY+
Sbjct: 312 IQANQAAFQFYYDGIFDAPCGTNLDHGVLLVGYSAETAGQGDYWIMKNSWGAAWGKGGYM 371
Query: 314 RMARNSPSSNIGICGILMQASYPV 337
MA + + G CG+L+ S+PV
Sbjct: 372 LMAMHKGPA--GQCGVLLDGSFPV 393
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 184/311 (59%), Gaps = 37/311 (11%)
Query: 62 FENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS 120
F+ W+ QY++ Y ++ E + RF ++ N+ YI N++ S L N FADL+ +EF
Sbjct: 45 FQQWMMQYTKAYANDIKELETRFSVWLENLNYILAYNARTTSHWLHLNAFADLTTDEF-R 103
Query: 121 TYLGY--------NKPYNEP-RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
LGY N+ + P + +V LP +DWRK+GAVT VK+QGQCGSCWAF+
Sbjct: 104 NRLGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFAT 163
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
+VEGIN + TG+L SLSEQELVDCD + E++GC+GG M+ A+++I K GG+ TEDDYP
Sbjct: 164 TGSVEGINAIVTGELASLSEQELVDCDTD-EDRGCSGGLMDYAYQWIIKNGGLDTEDDYP 222
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHG 269
Y ++ C K VTI GY IP +FQLY G
Sbjct: 223 YTAEDGVCVAAKKNRRVVTIDGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGG 282
Query: 270 VFDE-YCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
V+D+ CG LNHGV VVGYG+D H YW+VKNSWG WG+ GYIR+ R G+C
Sbjct: 283 VYDDPTCGTSLNHGVLVVGYGKDPHFGNYWIVKNSWGPEWGDNGYIRL-RMGAEDVQGMC 341
Query: 328 GILMQASYPVK 338
GI M S+P K
Sbjct: 342 GIAMAPSFPTK 352
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 244 bits (623), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 138/353 (39%), Positives = 198/353 (56%), Gaps = 51/353 (14%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGYP-----QKYDPQSMEERFENWLKQYSREYGSED 77
L+N + L L +L + YP + SM ER ENW+ + R Y +
Sbjct: 5 FFLKNITVVLLLFSILSL--------YPFIVTSRNLKELSMLERHENWMVHHGRVYKDDI 56
Query: 78 EWQRRFGIYSSNVQYIDYINSQNLS-FKLTDNKFADLSNEEFISTYLGYNKPY-----NE 131
E + RF + NV++I+ N +KL NK+ADL+ EEF ++++G + +
Sbjct: 57 EKEHRFKTFKENVEFIESFNKNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQEST 116
Query: 132 PRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
S +Y +P S+DWRK G+VT VKDQG CG CWAFSA AA+EG ++ +L+S
Sbjct: 117 ATTTSFKYDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELIS 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKI--GGVTTEDDYPYRGKNDRCQTDKTKH 246
LSEQ+L+DC +++N+GC GG M A++F+ + GG+TTE +YPY + C+T++
Sbjct: 177 LSEQQLLDC--STQNKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVCKTEQPA- 233
Query: 247 HAVTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVV 286
AVTI GYE +P A F +Y G++D C +LNH VTV+
Sbjct: 234 -AVTINGYEVVPSDESSLLKAVVNQPISVGIAANDEFHMYGSGIYDGSCNSRLNHAVTVI 292
Query: 287 GYG--EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GYG E+ G KYW+VKNSWG+ WGE GY+R+AR+ G CGI AS+P
Sbjct: 293 GYGTSEEDGTKYWIVKNSWGSDWGEEGYMRIARDVGVDG-GHCGIAKVASFPT 344
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 176/308 (57%), Gaps = 35/308 (11%)
Query: 61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-SFKLTDNKFADLSNEEFI 119
R E W+ ++ R Y E E RR ++ +N + ID N+ S +L N+FADL+ +EF
Sbjct: 37 RHEKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVQEFR 96
Query: 120 STYLGYNKPYNEP-------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
+ G +P P R+ + SVDWR GAVT VKDQG G CWAFSAV
Sbjct: 97 AARTGL-RPRPAPSAGAGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGASGCCWAFSAV 155
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AAVEG+NK++TG+LVSLSEQELVDCDV+ +QGC+GG M+ AF+F+ + GG+ +E YPY
Sbjct: 156 AAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESGYPY 215
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
+ ++ C++ A +I G+E +P AF+ Y GV
Sbjct: 216 QCRDGPCRS-SAAAAAASIRGHEDVPRNNEAALAAAVAHQPVSVAINGEDMAFRFYDSGV 274
Query: 271 FDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
CG LNH +T VGYG G +YWL+KNSWG SWGE GY+R+ R G+CG+
Sbjct: 275 LGGACGTDLNHAITAVGYGTAADGTRYWLMKNSWGASWGEGGYVRIRRGVRGE--GVCGL 332
Query: 330 LMQASYPV 337
SYPV
Sbjct: 333 AKLPSYPV 340
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 244 bits (623), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 129/315 (40%), Positives = 179/315 (56%), Gaps = 36/315 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEW-QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSN 115
+ + +W ++ +E S + RF + N +YI+ N + S++L N+F+DL++
Sbjct: 9 LSGEYASWCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTS 68
Query: 116 EEFISTYLGYNK----------PYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
EEF +LG P + Q + LPASVDWR+ GAVT KDQG CG
Sbjct: 69 EEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVDLPASVDWRQHGAVTAPKDQGSCGG 128
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAF+ A+EGIN++ TG+LVSLSEQEL+DCD ++ +GC+GG ME A++FI + GG+
Sbjct: 129 CWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKAD-KGCDGGLMENAYQFIVENGGLD 187
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAF 263
TE DYPY C K V I GY+AIP A F
Sbjct: 188 TETDYPYHASESHCNMKKLNSRVVAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGASKDF 247
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
Q Y+ GVF +CG ++NHGV +VGYG + G YW+VKNSW +WG+ G+++M RN+
Sbjct: 248 QHYASGVFTGHCGEEINHGVLIVGYGTEDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRG 307
Query: 324 IGICGILMQASYPVK 338
G+C I ASYPVK
Sbjct: 308 -GLCSINTLASYPVK 321
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 244 bits (622), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 136/351 (38%), Positives = 206/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM------FNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+ +P+ G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPA---GLCDIAKMSSYP 341
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 138/351 (39%), Positives = 206/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+ +P+ G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPA---GLCDIAKMSSYP 341
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 244 bits (622), Expect = 6e-62, Method: Compositional matrix adjust.
Identities = 138/351 (39%), Positives = 206/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+ +P+ G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPA---GLCDIAKMSSYP 341
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 146/322 (45%), Positives = 180/322 (55%), Gaps = 41/322 (12%)
Query: 45 WSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSF 103
+SE P + Q M F ++KQYS+ Y S E+ RF + +NV+ I N+ N S+
Sbjct: 28 FSEEVPSEVMLQDM---FTAFMKQYSKAY-SHAEFSSRFNQFKANVETIRLHNTLANASY 83
Query: 104 KLTDNKFADLSNEEFISTYLGYN---KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQ 160
+ N+FADLS EEF Y GY + + + P S+DWR AVTP+KDQ
Sbjct: 84 TMGLNEFADLSFEEFKGKYFGYKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQ 143
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGK--LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
GQCGSCWAFSA ++EG L+ GK L SLSEQ+LVDC + N GCNGG M+ AFE+I
Sbjct: 144 GQCGSCWAFSATGSIEGAWVLQ-GKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYI 202
Query: 219 TKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE----------------------A 256
G+ E YPY+G CQ TK VTI+GY+ A
Sbjct: 203 IANKGICAESAYPYKGVGGLCQKSCTK--VVTISGYKDVASGDEASLLNAVGTVGPVSVA 260
Query: 257 IPARYA-FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
I A A FQ YS GVF CGH L+HGV VGYG + YW+VKNSWGTSWGE+GYIRM
Sbjct: 261 IEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYIRM 320
Query: 316 ARNSPSSNIGICGILMQASYPV 337
RN CGI +Q SYP
Sbjct: 321 IRNKNQ-----CGIAIQPSYPT 337
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 154/360 (42%), Positives = 203/360 (56%), Gaps = 48/360 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M M +A L+L L + A+S+ + ERF+ W +Y+R Y + +E+Q
Sbjct: 1 MTMATASASLALMFACSLLLAGTAFSD----DTIAIPLLERFKAWQAEYNRTYATPEEFQ 56
Query: 81 RRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYL---GYNKPYNEPRWP 135
+RF IYS NV++I +N S S++L +N+F DL+ EEF TYL P E P
Sbjct: 57 QRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMGP 116
Query: 136 SVQYLGL------------PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKT 183
+V + P SVDWR +GAVT VKDQ QCGSCWAF+ VA++EG++++KT
Sbjct: 117 TVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIKT 176
Query: 184 GKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK 243
G+LVSLSEQE+VDCD + GC GG A E++T+ GG+TTE DYPY G +C + K
Sbjct: 177 GRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGGLTTESDYPYVGSQRQCMSGK 236
Query: 244 TKHHAVTITGYEA---------------------IPARYAFQLYSHGVFDEYC-GHQLNH 281
HHA I GY+A I A AFQ Y GVF C +NH
Sbjct: 237 LGHHAARIRGYQAVQRNNEAELERAVAERPVAVFIDASRAFQFYKSGVFSGPCDTTTVNH 296
Query: 282 GVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
VTVVGYG + G KYW+VKNSWG WGE GY+RMAR + G+C I ++ YPV
Sbjct: 297 VVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGYVRMARRVRARE-GMCAIAIEPYYPV 355
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 141/351 (40%), Positives = 197/351 (56%), Gaps = 38/351 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIP-AGAWSEGYPQKYDPQSME---ERFENWLKQYSREYGSE 76
M + + L+ L+ +G+ A ++ GY Q D S+E + F++W+ ++++ Y S
Sbjct: 4 MSSISKIIFLATCLIIHMGLSSADFYTVGYSQD-DLTSIERLIQLFDSWMLKHNKIYESI 62
Query: 77 DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYN-------KPY 129
DE RF I+ N+ YID N +N S+ L N FADLSN+EF Y+G+ + +
Sbjct: 63 DEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHF 122
Query: 130 NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+ + P S+DWR +GAVTPVK+QG CGSCWAFS +A VEGINK+ TG L+ L
Sbjct: 123 DNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLEL 182
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQELVDCD +S GC GGY + +++ GV T YPY+ K +C+ V
Sbjct: 183 SEQELVDCDKHS--YGCKGGYQTTSLQYVAN-NGVHTSKVYPYQAKQYKCRATDKPGPKV 239
Query: 250 TITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVG 287
ITGY+ +P+ FQLY GVFD CG +L+H VT VG
Sbjct: 240 KITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVG 299
Query: 288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YG G+ Y ++KNSWG +WGE GY+R+ R S +S G CG+ + YP K
Sbjct: 300 YGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYPFK 349
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 243 bits (621), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 136/351 (38%), Positives = 206/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L + +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMSILITLFFVISM------FNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYKVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+ +PS G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPS---GLCDIAKMSSYP 341
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 141/351 (40%), Positives = 197/351 (56%), Gaps = 38/351 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIP-AGAWSEGYPQKYDPQSME---ERFENWLKQYSREYGSE 76
M + + L+ L+ +G+ A ++ GY Q D S+E + F++W+ ++++ Y S
Sbjct: 4 MSSISKIIFLATCLIIHMGLSSADFYTVGYSQD-DLTSIERLIQLFDSWMLKHNKIYESI 62
Query: 77 DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYN-------KPY 129
DE RF I+ N+ YID N +N S+ L N FADLSN+EF Y+G+ + +
Sbjct: 63 DEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHF 122
Query: 130 NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+ + P S+DWR +GAVTPVK+QG CGSCWAFS +A VEGINK+ TG L+ L
Sbjct: 123 DNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLEL 182
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQELVDCD +S GC GGY + +++ GV T YPY+ K +C+ V
Sbjct: 183 SEQELVDCDKHS--YGCKGGYQTTSLQYVAN-NGVHTSKVYPYQAKQYKCRATDKPGPKV 239
Query: 250 TITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVG 287
ITGY+ +P+ FQLY GVFD CG +L+H VT VG
Sbjct: 240 KITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVG 299
Query: 288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YG G+ Y ++KNSWG +WGE GY+R+ R S +S G CG+ + YP K
Sbjct: 300 YGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYPFK 349
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 243 bits (620), Expect = 9e-62, Method: Compositional matrix adjust.
Identities = 153/361 (42%), Positives = 204/361 (56%), Gaps = 48/361 (13%)
Query: 20 DMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEW 79
+M M +A L+L L + A+S+ + ERF+ W +Y+R Y + +E+
Sbjct: 26 NMTMATASASLALMFACSLLLAGTAFSD----DTIAIPLLERFKAWQAEYNRTYATPEEF 81
Query: 80 QRRFGIYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYL---GYNKPYNEPRW 134
Q+RF IYS NV++I +N S S++L +N+F DL+ EEF TYL P E
Sbjct: 82 QQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMP 141
Query: 135 PSVQYLGL------------PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLK 182
P+V + P SVDWR +GAVT VKDQ QCGSCWAF+ VA++EG++++K
Sbjct: 142 PTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVTRVKDQQQCGSCWAFATVASIEGVHQIK 201
Query: 183 TGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTD 242
TG+LVSLSEQE+VDCD + GC GG A E++T+ GG+TTE DYPY G +C +
Sbjct: 202 TGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAMEWVTRNGGLTTESDYPYVGSQRQCMSG 261
Query: 243 KTKHHAVTITGYEA---------------------IPARYAFQLYSHGVFDEYC-GHQLN 280
K HHA I GY+A + A AFQ Y GVF C +N
Sbjct: 262 KLGHHAARIRGYQAVQRNNEAELERAVAGQPVAVFVDASRAFQFYKSGVFSGPCDTTTVN 321
Query: 281 HGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
H VTVVGYG + G KYW+VKNSWG WGE GY+RMAR + G+C I ++ YP
Sbjct: 322 HVVTVVGYGSTGSDSGGRKYWIVKNSWGQGWGENGYVRMARRVRARE-GMCAIAIEPYYP 380
Query: 337 V 337
V
Sbjct: 381 V 381
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 137/349 (39%), Positives = 204/349 (58%), Gaps = 43/349 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++E K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+S + G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNP-AGLCDIAKMSSYP 341
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/350 (38%), Positives = 205/350 (58%), Gaps = 45/350 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
M++ L N +++LF + + + G Q S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQP--KLSVSERHELWMSRHGRVYKDEVEKG 57
Query: 81 RRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPSV 137
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 58 ERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSS 117
Query: 138 QYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 118 TEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEF 177
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQEL+DC N N GC+GG+M AF+FI + GG++ E DY Y G+ C++ + K AV
Sbjct: 178 SEQELLDCTTN--NYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQE-KTAAV 234
Query: 250 TITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG 289
I+ Y+ +P A Q Y+ G +D C ++NH VT +GYG
Sbjct: 235 QISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGYG 294
Query: 290 ED-HGEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
D +G+KYWL+KNSWGTSWGE G++++ R+ +P+ G+C I +SYP
Sbjct: 295 TDENGQKYWLLKNSWGTSWGENGFMKIIRDYGNPA---GLCDIAKMSSYP 341
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 137/344 (39%), Positives = 198/344 (57%), Gaps = 45/344 (13%)
Query: 26 RNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
+ +V + F+L++ I + + S+ + E W+ + R Y E RR I
Sbjct: 7 KKSVGTFFMLFLTCICRAS-----SRTLSESSIATQHEEWMAMHDRVYADSAEKDRRQQI 61
Query: 86 YSSNVQYIDYINSQNLS-FKLTDNKFADLSNEEFISTYLG--YNKPYNEPRWPSVQYLG- 141
+ N+++I+ N++ + L+ N FADL+NEEF++++ G Y P + LG
Sbjct: 62 FKENLEFIEKHNNEGKKRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKINHSLGF 121
Query: 142 -------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+ AS+DWRK GAV +K+QG+CGSCWAFSAVAAVEGIN++K G+LVSLSEQ L
Sbjct: 122 HKMSVGDIEASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNL 181
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDC + N GC+G Y+EKAF++I G+ E++YPY C + A+ I GY
Sbjct: 182 VDC---ASNDGCHGQYVEKAFDYIRDY-GLANEEEYPYVETVGTCSGNSNP--AIQIRGY 235
Query: 255 EAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH 292
+++ + FQ YS GVF CG +LNH VT+VGYGE+
Sbjct: 236 QSVTPQNEEQLLTAVASQPVSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYGEEA 295
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
KYWL++NSWG SWGE GY+++ R++ + G+CGI MQASYP
Sbjct: 296 EGKYWLIRNSWGKSWGEGGYMKLMRDTGNPQ-GLCGINMQASYP 338
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 122/222 (54%), Positives = 150/222 (67%), Gaps = 28/222 (12%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP SVDWR++GAV P+KDQG CGSCWAFS +A+VEGINK+ TG L+SLSEQELVDCD +
Sbjct: 41 LPDSVDWREKGAVVPIKDQGGCGSCWAFSTIASVEGINKIVTGDLISLSEQELVDCD-KT 99
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
N GCNGG M+ AF+FI GG+ TE DYPY ++ RC + + V+I YE +P
Sbjct: 100 YNDGCNGGLMDYAFQFIIDNGGIDTEKDYPYTEQDGRCDSYRKNAKVVSINSYEDVPVND 159
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
+FQLY+ G+F CG L+HGVTVVGYG + G+ YW+V
Sbjct: 160 EQALKKAAASQPIAVAIDGGGRSFQLYNSGIFTGKCGTSLDHGVTVVGYGSESGKDYWIV 219
Query: 300 KNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYPVKR 339
+NSWG SWGE GYIRMARN SPS GICGI M+ASYP+K+
Sbjct: 220 RNSWGESWGEKGYIRMARNIDSPS---GICGIAMEASYPIKK 258
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 136/302 (45%), Positives = 183/302 (60%), Gaps = 36/302 (11%)
Query: 66 LKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLG 124
+ +Y R Y DE RRF I+ +NV +I+ N++N S+ L NKF D++N EF++ Y G
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 125 -YNKPYNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGI 178
++P N + P V + + S+DWR GAVT VKDQ CGSCWAFSA+A VEGI
Sbjct: 61 GISRPLNIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGI 120
Query: 179 NKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR 238
K+ TG LVSLSEQE++DC V++ GC+GG+++ A++FI GV +E DYPY+
Sbjct: 121 YKIVTGYLVSLSEQEVLDCAVSN---GCDGGFVDNAYDFIISNNGVASEADYPYQAYQGD 177
Query: 239 CQTDKTKHHAVTITGYEAIPA------RYA----------------FQLYSHGVFDEYCG 276
C + + A ITGY + + +YA FQ Y+ GVF CG
Sbjct: 178 CAANSWPNSAY-ITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCG 236
Query: 277 HQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASY 335
LNH +T++GYG+D G +YW+VKNSWG+SWGE GYIRMAR SS G+CGI M Y
Sbjct: 237 TSLNHAITIIGYGQDSSGTQYWIVKNSWGSSWGERGYIRMARGVSSS--GLCGIAMDPLY 294
Query: 336 PV 337
P
Sbjct: 295 PT 296
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 135/351 (38%), Positives = 206/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L + +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMSILITLFFVISM------FNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+ +P+ G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPA---GLCDIAKMSSYP 341
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 242 bits (618), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 138/351 (39%), Positives = 206/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L N +++LF + + + G Q P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMNILITLFFVISM---FNTQTRGRSQ---PKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 VERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
L +P+++DW + GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STELKINDLSDDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q Y+ G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFYAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+ +P+ G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPA---GLCDIAKMSSYP 341
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 137/329 (41%), Positives = 186/329 (56%), Gaps = 37/329 (11%)
Query: 42 AGAWSEGYPQKYDPQSME---ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS 98
A ++ GY Q D S+E + F++W+ ++++ Y S DE RF I+ N+ YID N
Sbjct: 26 ADFYTVGYSQD-DLTSIERLIQLFDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNK 84
Query: 99 QNLSFKLTDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSVQYLGLPASVDWRKE 151
+N S+ L N FADLSN+EF Y+G + ++ + P S+DWR +
Sbjct: 85 KNNSYWLGLNGFADLSNDEFKKKYVGSVAEDFTGLEHFDNEDFTYKHVTNYPQSIDWRAK 144
Query: 152 GAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYM 211
GAVTPVK+QG CGSCWAFS +A VEG+NK+ TG L+ LSEQELVDCD NS GC GGY
Sbjct: 145 GAVTPVKNQGSCGSCWAFSTIATVEGVNKIVTGNLLELSEQELVDCDKNS--HGCKGGYQ 202
Query: 212 EKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------- 260
+ +++ GV T YPY+ K +C+ V ITGY+ +P+
Sbjct: 203 TTSLQYVAD-NGVHTSKVYPYQAKAMQCRATDKPGPKVKITGYKRVPSNCETSFLGALAN 261
Query: 261 -----------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGE 309
FQLY GVFD CG +L+H VT VGYG G+ Y ++KNSWG +WGE
Sbjct: 262 QPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDGKNYIIIKNSWGPNWGE 321
Query: 310 AGYIRMARNSPSSNIGICGILMQASYPVK 338
GY+R+ R S +S G CG+ + YP K
Sbjct: 322 KGYMRLKRQSGNSQ-GTCGVYKSSYYPFK 349
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 129/324 (39%), Positives = 183/324 (56%), Gaps = 42/324 (12%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-----LSFKLTDN 108
D ++M ER+E W+ + R Y E RRF ++ SN +ID N+ KLT N
Sbjct: 12 DDKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTN 71
Query: 109 KFADLSNEEFISTYLGYNKPYNEP---------RWPSVQYLGLPASVDWRKEGAVTPVKD 159
KFADL+ +EF + Y+ ++ P ++ +V +P S+DWR GAVT VKD
Sbjct: 72 KFADLTEDEFRNIYVTGHRVNYRPTSLVTDTVFKFGAVSLSDVPPSIDWRARGAVTSVKD 131
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
Q C CWAFS+ AAVEGI+++ TG VSLS Q+LVDC N+ N+ C G ++KA+E+I
Sbjct: 132 QHLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCS-NAANEKCKAGEIDKAYEYIA 190
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------- 260
+ GG+ + DYPY G + C+ K I+G++ +PAR
Sbjct: 191 RSGGLVADQDYPYEGHSGTCRV-YGKQAVARISGFQYVPARNETALLLAVAHQPVSVALD 249
Query: 261 ---YAFQLYSHGVF---DEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYI 313
A Q G+F E C LNH +T+VGYG D HG +YWL+KNSWG+ WG+ GY+
Sbjct: 250 GLSRALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTDEHGTRYWLMKNSWGSDWGDKGYV 309
Query: 314 RMARNSPSSNIGICGILMQASYPV 337
+ AR+ S G+CG+ ++ASYPV
Sbjct: 310 KFARDVASEINGVCGLALEASYPV 333
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 179/311 (57%), Gaps = 37/311 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
++ ++E + ++ Y E+E R G+++ NVQ I+ NS+ ++ L N+FADL+ EE
Sbjct: 15 IDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHTYTLGVNQFADLTVEE 74
Query: 118 FISTYLGYNKPYNEPRWPSVQYLG--------LPASVDWRKEGAVTPVKDQGQCGSCWAF 169
F TY+G+ KP ++ YLG LP SVDW +GAVTPVK+QGQCGSCW+F
Sbjct: 75 FSKTYMGFKKPAQ--KYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCWSF 132
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
S ++EG N++ TGKLVSLSEQ+ VDC NQGCNGG M+ AF++ + + TE
Sbjct: 133 STTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKY-AEANALCTEQS 191
Query: 230 YPYRGKNDRCQTD--KTKHHAVTITGYEAIPA----------------------RYAFQL 265
YPY+G + CQ T +++GY+ + + + FQL
Sbjct: 192 YPYKGTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKSVFQL 251
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
YS GV CG L+HGV VGYG G YW VKNSWG++WG +GY+ + R S G
Sbjct: 252 YSGGVLTGACGASLDHGVLAVGYGTLSGTDYWKVKNSWGSTWGMSGYVLLQRGKGGS--G 309
Query: 326 ICGILMQASYP 336
CG+L + SYP
Sbjct: 310 ECGLLSEPSYP 320
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 180/322 (55%), Gaps = 41/322 (12%)
Query: 45 WSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSF 103
+SE P + Q M F ++KQYS+ Y S E+ RF + +NV+ I N+ N S+
Sbjct: 28 FSEEVPSEVMLQDM---FTAFMKQYSKAY-SHAEFSSRFNQFKANVETIRLHNTLANASY 83
Query: 104 KLTDNKFADLSNEEFISTYLGYN---KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQ 160
+ N+FADLS EEF Y GY + + + P S+DWR AVTP+KDQ
Sbjct: 84 TMGLNEFADLSFEEFKGKYFGYKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQ 143
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGK--LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
GQCGSCWAFSA ++EG L+ GK L SLSEQ+LVDC + + GCNGG M+ AFE+I
Sbjct: 144 GQCGSCWAFSATGSIEGAWVLQ-GKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYI 202
Query: 219 TKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE----------------------A 256
G+ E YPY+G CQ TK VTI+GY+ A
Sbjct: 203 IANKGICAESAYPYKGVGGLCQKSCTK--VVTISGYKDVASGDEASLLNAVGTVGPVSVA 260
Query: 257 IPARYA-FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
I A A FQ YS GVF CGH L+HGV VGYG + YW+VKNSWGTSWGE+GYIRM
Sbjct: 261 IEADQAGFQFYSSGVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYIRM 320
Query: 316 ARNSPSSNIGICGILMQASYPV 337
RN CGI +Q SYP
Sbjct: 321 IRNKNQ-----CGIAIQPSYPT 337
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 124/221 (56%), Positives = 146/221 (66%), Gaps = 25/221 (11%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
+PASVDWRK+GAVT VKDQGQCGSCWAFS + AVEGIN++KT KLVSLSEQELVDCD +
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTD- 60
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
+NQGCNGG M+ AFEFI + GG+TTE +YPY + C K AV+I G+E +P
Sbjct: 61 QNQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPEND 120
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWL 298
FQ YS GVF CG +L+HGV +VGYG G KYW
Sbjct: 121 ENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDGTKYWT 180
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
VKNSWG WGE GYIRM R S G+CGI M+ASYP+K+
Sbjct: 181 VKNSWGPEWGEKGYIRMER-GISDKEGLCGIAMEASYPIKK 220
>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
Length = 328
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 121/219 (55%), Positives = 144/219 (65%), Gaps = 23/219 (10%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP SVDWRKEGAV VKDQ CGSCWAFSA+AAVEGINK+ TG L+SLSEQELVDCD S
Sbjct: 24 LPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDT-S 82
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA-- 259
N+GCNGG M+ AFEFI GG+ +EDDYPY+ + RC ++ VTI YE +PA
Sbjct: 83 YNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVVTIDDYEDVPAYD 142
Query: 260 --------------------RYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
FQLY +GV CG L+HGV VGYG ++G+ YW+V
Sbjct: 143 ELALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGTALDHGVAAVGYGTENGKDYWIV 202
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+NSWG SWGE GYIR+ RN SS G CGI ++ SYP+K
Sbjct: 203 RNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPIK 241
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 133/302 (44%), Positives = 172/302 (56%), Gaps = 33/302 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNEEFIS 120
+E WL + + Y E +RR I+ N+++ID NS N +F++ +FADL+N+E
Sbjct: 2 YERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDEPKD 61
Query: 121 TYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINK 180
Y E LP +DWR +GAV PVKDQG CGSCWAFSAV AVEGIN+
Sbjct: 62 FMKADRYLYKEGDI-------LPDEIDWRAKGAVVPVKDQGNCGSCWAFSAVGAVEGINQ 114
Query: 181 LKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN-DRC 239
+KTG+L+SLS+QEL+DCD N GC GG M AFEFI GG+ ++ DYPY + C
Sbjct: 115 IKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYPYTATDLGVC 174
Query: 240 QTDKTKH-HAVTITGYEAI----------------------PARYAFQLYSHGVFDEYCG 276
DK + V I GYE + + AF+LY GVF CG
Sbjct: 175 NADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYKSGVFTGTCG 234
Query: 277 HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
L+HGV VVGYG GE YW+++NSWG +WGE GY+++ RN S G CG+ M SYP
Sbjct: 235 IYLDHGVVVVGYGTSSGEDYWIIRNSWGLNWGENGYVKLQRNIDDS-FGKCGVAMMPSYP 293
Query: 337 VK 338
K
Sbjct: 294 TK 295
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 137/340 (40%), Positives = 190/340 (55%), Gaps = 55/340 (16%)
Query: 51 QKYDP---QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN---SQNLSFK 104
++ DP Q+M RF+ W ++ R Y + DE RR +Y+ NV+YI+ N + L+++
Sbjct: 39 EETDPTILQTMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQ 98
Query: 105 LTDNKFADLSNEEF-----------------------ISTYLGYNKPYNEPRWPSVQYLG 141
L + + DL+ +EF I+T G + + +V G
Sbjct: 99 LGETAYTDLTADEFTAMYTSPSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAG 158
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
PASVDWR +GAVT VK+QG+CGSCWAFS VA VEGI++++TG L+SLSEQELVDCD +
Sbjct: 159 APASVDWRAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCD--T 216
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY 261
+ GC+GG A E+I GG+ TE DYPY GK+ C +K HA I+G+ + R
Sbjct: 217 LDYGCDGGVSYHALEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRS 276
Query: 262 A----------------------FQLYSHGVFDEYCGHQLNHGVTVV--GYGEDHGEKYW 297
FQ Y GV++ CG +LNHGVTVV G E GEKYW
Sbjct: 277 EPSLANAVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEKYW 336
Query: 298 LVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+VKNSWG WG+ GY RM ++ G+CGI ++ S+P+
Sbjct: 337 IVKNSWGKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 241 bits (614), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 180/311 (57%), Gaps = 37/311 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNE 116
M +RF W Y+R Y + +E QRRF +Y N+++I+ N + NL++ L +N+FADL+ E
Sbjct: 53 MMDRFLRWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEE 112
Query: 117 EFISTYLGYNKP--------YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQG-QCGSCW 167
EF+ Y P + + SV + P SVDWR GAVTP+K+QG C SCW
Sbjct: 113 EFLDLYTMKGMPPVRRDAGKKQQANFSSV--VDAPTSVDWRSRGAVTPIKNQGPSCSSCW 170
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AF A +E I +++TGKLVSLSEQEL+DCD + GCN GY ++++ + GG+TTE
Sbjct: 171 AFVTAATIESITQIRTGKLVSLSEQELIDCD--PYDGGCNLGYFVNGYKWVIQNGGLTTE 228
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY--------------------AFQLYS 267
+YPY+ + +C K A I+ Y +P + Q YS
Sbjct: 229 ANYPYQARRYQCNRSKAGQRAARISNYRQLPQGEAQLQQAVAQQPVAAAIEMGGSLQFYS 288
Query: 268 HGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
GV+ CG ++NH +TVVGYG D G KYWLVKNSWG +WGE GY+RM ++ G+
Sbjct: 289 GGVWSGQCGTRMNHAITVVGYGADSSGVKYWLVKNSWGQTWGERGYLRMRKDVRQG--GL 346
Query: 327 CGILMQASYPV 337
CGI + +YP+
Sbjct: 347 CGIALDLAYPI 357
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 240 bits (613), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 152/360 (42%), Positives = 196/360 (54%), Gaps = 62/360 (17%)
Query: 38 LGIPAGAWS-EGYPQK--YDPQSMEERFENWLKQYSR-EYGSEDEWQRRFGIYSSNVQYI 93
LG+ G +S GY ++ +S+ E FE WL ++ + Y S +E RRF ++ N+ +I
Sbjct: 21 LGLARGDFSIVGYSEEDLSSHESLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHI 80
Query: 94 DYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEP--------------------- 132
D N + S+ L N+FADL+++EF +TYLG +
Sbjct: 81 DETNRKVSSYWLGLNEFADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSS 140
Query: 133 ------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKL 186
R+ V LP SVDWR +GAVT VK+QGQCGSCWAFS VAAVEGIN++ TG L
Sbjct: 141 SSSFRFRYEGVDAARLPKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNL 200
Query: 187 VSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKH 246
+LSEQELVDCD + N GCNGG M+ AF +I GG+ TE+ YPY + C +
Sbjct: 201 TALSEQELVDCDTDG-NNGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSRGSSA- 258
Query: 247 HAVTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVT 284
VTI+GYE +P + Q YS GVFD CG QL+HGV
Sbjct: 259 AVVTISGYEDVPRNNEQALLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVA 318
Query: 285 VVGY---GEDHGE---KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VGY G+D+G Y +VKNSWG SWGE GYIRM R + G+CGI SYP K
Sbjct: 319 AVGYGTAGKDNGHVVADYIIVKNSWGPSWGEKGYIRMRRGTGKRQ-GLCGINKMPSYPTK 377
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 240 bits (613), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 140/351 (39%), Positives = 196/351 (55%), Gaps = 38/351 (10%)
Query: 21 MRMMLRNAVLSLFLLWVLGIP-AGAWSEGYPQKYDPQSME---ERFENWLKQYSREYGSE 76
M + + L+ L+ +G+ A ++ GY Q D S+E + F++W+ ++++ Y S
Sbjct: 4 MSSISKIIFLATCLIIHMGLSSADFYTVGYSQD-DLTSIERLIQLFDSWMLKHNKIYESI 62
Query: 77 DEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYN-------KPY 129
DE RF I+ N+ YID N +N S+ L N FADLSN+EF Y+G+ + +
Sbjct: 63 DEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKYVGFVAEDFTGLEHF 122
Query: 130 NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+ + P S+DWR +GAVTPVK+QG CGSCWAFS +A VEGINK+ TG L+ L
Sbjct: 123 DNEDFTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLEL 182
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQELVDCD +S GC GGY + +++ GV T YP + K +C+ V
Sbjct: 183 SEQELVDCDKHS--YGCKGGYQTTSLQYVAN-NGVHTSKVYPCQAKQYKCRATDKPGPKV 239
Query: 250 TITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVG 287
ITGY+ +P+ FQLY GVFD CG +L+H VT VG
Sbjct: 240 KITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVG 299
Query: 288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YG G+ Y ++KNSWG +WGE GY+R+ R S +S G CG+ + YP K
Sbjct: 300 YGTSDGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQ-GTCGVYKSSYYPFK 349
>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
Length = 492
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 132/290 (45%), Positives = 173/290 (59%), Gaps = 17/290 (5%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEF 118
E F +WLK + + E+ +R Y +N YI N Q SFKL N F+ L+NEEF
Sbjct: 30 ESDFVSWLKTHHLTFSDAFEYAKRLETYIANDIYILTHNLQESSFKLGHNAFSHLTNEEF 89
Query: 119 ISTYLGYNKP----------YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
+ G+ N + QY+ LP SVDW ++GAVT VK+QG CGSCWA
Sbjct: 90 RQRFNGFKASDDYLTKRLAQSNVASSTNFQYIDLPESVDWVEKGAVTGVKNQGMCGSCWA 149
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS A+EG + +GKLVSLSEQELVDCD N ++ GCNGG M+ AF +I++ G+ +E+
Sbjct: 150 FSTTGAIEGATFISSGKLVSLSEQELVDCDHNGDH-GCNGGLMDHAFSWISEHDGICSEE 208
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPA-RYAFQLYSHGVFDEYCGHQLNHGVTVVG 287
DY Y C++ K V + AI A +FQ Y GV+++ CG QL+HGV VG
Sbjct: 209 DYAYIHSQSLCRSCKPVVSPVAV----AIDAGDRSFQFYQSGVYNKTCGTQLDHGVLTVG 264
Query: 288 YGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YG + G+KYW VKNSWG SWGE GYIR++R+ + G CGI M SYP
Sbjct: 265 YGVEDGQKYWKVKNSWGNSWGEKGYIRLSRDQNGRS-GQCGIAMVPSYPT 313
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 239 bits (611), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 186/343 (54%), Gaps = 61/343 (17%)
Query: 50 PQKYDPQSMEERFENWLKQYSREYGS-------------EDEWQRRFGIYSSNVQYIDYI 96
P + + + +E W ++ R S E++ + R ++ N++YID
Sbjct: 72 PAERADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKH 131
Query: 97 NSQN----LSFKLTDNKFADLSNEEFISTYLGYNKPYN--------------EPRWPSVQ 138
N++ +F+L FADL+ +E+ LG+ PR +
Sbjct: 132 NAEADAGLHTFRLGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDL- 190
Query: 139 YLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
LP ++DWR+ GAVT VKDQ QCG CWAFSAVAA+EGIN + TG LVSLSEQE++DCD
Sbjct: 191 ---LPDAIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCD 247
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV-TITGY--- 254
+++ GC+GG ME AF F+ GG+ TE DYP+ G + C K + V TI G
Sbjct: 248 --AQDSGCDGGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEV 305
Query: 255 ---------EAIPAR----------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK 295
EA+ + AFQ YS G+F+ CG L+HGVT VGYG + G+
Sbjct: 306 ASNNETALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGSESGKD 365
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YW+VKNSW SWGEAGYIRM RN P G CGI M ASYPVK
Sbjct: 366 YWIVKNSWSASWGEAGYIRMRRNVPRPT-GKCGIAMDASYPVK 407
>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
Length = 360
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 139/334 (41%), Positives = 188/334 (56%), Gaps = 47/334 (14%)
Query: 29 VLSLFLLWVLG-------IPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQR 81
VL+L LL G +PA A + G M +RF W ++R Y S +E +
Sbjct: 12 VLTLALLASCGALLATSMLPARA-TAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEALQ 70
Query: 82 RFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGY---NKPYNEPRW--- 134
RF +Y N ++ID +N + +L+++L +N+FADL+ EEF++TY GY + P ++
Sbjct: 71 RFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITTG 130
Query: 135 -----PSVQY-LGLPASVDWRKEGAVTPVKDQ-GQCGSCWAFSAVAAVEGINKLKTGKLV 187
S Y + +PASVDWR +GAV P K Q C SCWAF A +E +N +KTGKLV
Sbjct: 131 AGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTGKLV 190
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SLSEQ+LVDCD S + GCN G +A++++ + GG+TTE DYPY + C K+ HH
Sbjct: 191 SLSEQQLVDCD--SYDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAHH 248
Query: 248 AVTITGYEAIPAR---------------------YAFQLYSHGVFDEYCGHQLNHGVTVV 286
A ITG+ +P R Q Y GV+ CG +L H VTVV
Sbjct: 249 AAKITGFGKVPPRNEAALQAAVARQPVAVAIEVGSGMQFYKGGVYTGPCGTRLAHAVTVV 308
Query: 287 GYGED--HGEKYWLVKNSWGTSWGEAGYIRMARN 318
GYG D G KYW +KNSWG SWGE GYIR+ R+
Sbjct: 309 GYGTDASSGAKYWTIKNSWGQSWGERGYIRILRD 342
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 147/354 (41%), Positives = 199/354 (56%), Gaps = 50/354 (14%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAW-SEGYPQKY-DPQSMEERFENWLKQYSREYGSEDE 78
M + L+ L + LL +LG W S+ P+ + +++ E+ E W+ ++ R Y E
Sbjct: 1 MPLSLQITKLVITLLMILG----TWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAE 56
Query: 79 WQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV 137
+RRF I+ +N+ YI+ N N ++KL NKF+DLS EEF++TY GY P P +
Sbjct: 57 KERRFQIFKNNLDYIENFNKAFNKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTANTT 116
Query: 138 -------QYLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
Y +P S+DWR+ G VT VK+QG+CG CWAFSAVAAVEGI G
Sbjct: 117 VKPTFFSNYYNQDEVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGI----AGNGA 172
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SLS Q+L+DC +N GC GG M KAFE+I + G+ ++ DYPY + C++ +
Sbjct: 173 SLSAQQLLDCV--GDNSGCGGGTMIKAFEYIVQNQGIVSDTDYPYEQTQEMCRSG--SNV 228
Query: 248 AVTITGYEAI--------------PARYA--------FQLYSHGVFD-EYCGHQLNHGVT 284
A ITGYE++ P A F+ Y GVF E CG L H VT
Sbjct: 229 AARITGYESVIQSEEALKRAVAKQPISVAIDASSGPNFKSYISGVFSAEDCGTHLTHAVT 288
Query: 285 VVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+VGYG + G KYWLVKNSWG WGE+GY+R+ R+ + G CGI MQASYP
Sbjct: 289 LVGYGTTEDGTKYWLVKNSWGEEWGESGYMRLQRDVGAME-GPCGIAMQASYPT 341
>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 371
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 141/363 (38%), Positives = 194/363 (53%), Gaps = 50/363 (13%)
Query: 17 IAIDMRMMLRNAVLSLFLLWVLGIPAGAWSE--GYPQKYDPQSMEERFENWLKQYSREYG 74
+A +MLR LF+ PA + G+ + D M +RF W ++R YG
Sbjct: 15 LATTAVLMLRGC---LFVFLTALPPAAIMTPAAGHVVELDDMLMLDRFVRWQAAHNRTYG 71
Query: 75 SEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLGY----NKPY 129
+E RRF +Y +N++YI+ N + L+++L +N+FADL++EEF+S Y ++
Sbjct: 72 DAEERLRRFQVYRANIEYIEATNRRGGLTYELGENQFADLTSEEFLSMYASSYDAGDRAD 131
Query: 130 NEPRWPSVQYLG------------LPASVDWRKEGAVTPVKDQG-QCGSCWAFSAVAAVE 176
+E + G P S DWR +GAVTP K+QG C SCWAF VA +E
Sbjct: 132 DEAALITTDVAGDGAWSDGDLEALPPPSWDWRAKGAVTPPKNQGPTCSSCWAFVTVATIE 191
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
G+ +KTGKL+SLSEQ+LVDCD+ + GCN G + F ++ + GG+TTE +YPY
Sbjct: 192 GLTFIKTGKLISLSEQQLVDCDMY--DGGCNTGSYSRGFRWVLENGGLTTEAEYPYTAAR 249
Query: 237 DRCQTDKTKHHAVTITGYEAIPAR---------------------YAFQLYSHGVFDEYC 275
C K+ HHA ITG IP + Q Y GV+ C
Sbjct: 250 GPCNRAKSAHHAAKITGQGRIPPQNELVMQKAVAGQPVGVAIEVGSGMQFYKTGVYSGPC 309
Query: 276 GHQLNHGVTVVGYGED--HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
G L H VTVVGYG D G KYW+VKNSWG +WGE G+IRM R+ G+CGI +
Sbjct: 310 GTNLAHAVTVVGYGVDPASGAKYWIVKNSWGQAWGERGFIRMRRD--VGGPGLCGIALDV 367
Query: 334 SYP 336
+YP
Sbjct: 368 AYP 370
>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
Length = 214
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 117/217 (53%), Positives = 147/217 (67%), Gaps = 26/217 (11%)
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
SVDWRK+G VT +KDQG CG+CWAFSA+AAVEG+ L TG LVSLSEQELVDCD + NQ
Sbjct: 1 SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDT-TVNQ 59
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA-- 262
GC+GG M+ AF+++ + GG+T++ +YPYR + C DK K+HA TI G++AIP +
Sbjct: 60 GCDGGMMDYAFQYMIRNGGITSQSNYPYRAQRGACDKDKVKYHAATINGFQAIPPQSEEL 119
Query: 263 --------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKN 301
FQLYS GVF CG L+HGV +VGYG D G +YWLVKN
Sbjct: 120 LLRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGTDAGGRQYWLVKN 179
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
SWG+ WGE+GY+RM R P + G+CGI + ASYP K
Sbjct: 180 SWGSGWGESGYVRMERQGPGA--GVCGINLDASYPTK 214
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 134/351 (38%), Positives = 205/351 (58%), Gaps = 47/351 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEW 79
M++ L + +++LF + + ++ + P+ S+ ER E W+ ++ R Y E E
Sbjct: 3 MKVDLMSILITLFFVISM------FNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEK 56
Query: 80 QRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPS 136
RF I+ N+++I+ +N + NLS+KL N+FAD++++EF++ + G N P Y P S
Sbjct: 57 GERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMS 116
Query: 137 VQYL--------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVS 188
+P+++DWR+ GAVT VK QG+CG CWAFSAV ++EG K+ TG L+
Sbjct: 117 STEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLME 176
Query: 189 LSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA 248
SEQEL+DC N N GCNGG+M AF+FI + GG++ E DY Y G+ C++ + K A
Sbjct: 177 FSEQELLDCTTN--NYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQE-KTAA 233
Query: 249 VTITGYEAIP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
V I+ Y+ +P A Q + G +D C ++NH VT +GY
Sbjct: 234 VQISSYQVVPEGETSLLQAVTKQPVSIGIAASQDLQFCAGGTYDGSCADRINHAVTAIGY 293
Query: 289 GEDH-GEKYWLVKNSWGTSWGEAGYIRMARN--SPSSNIGICGILMQASYP 336
G D G+KYWL+KNSWGTSWGE G++++ R+ +P+ G+C I +SYP
Sbjct: 294 GTDEKGQKYWLLKNSWGTSWGENGFMKIIRDYGNPA---GLCDIAKMSSYP 341
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 174/307 (56%), Gaps = 36/307 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
F++W + Y + E R GIY +N+ +I+ NS+ S+KL NKFADL+ EF +
Sbjct: 22 FDSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLAVNKFADLTYPEFAAK 81
Query: 122 YLG--YNKPYNEPRWPSVQYL----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
YLG ++ + + YL LP SVDWR G VTP+KDQGQCGSCW+FS +V
Sbjct: 82 YLGLRFDATNATKSFAASTYLPRMVSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTGSV 141
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EG + KTG+LVSLSEQ LVDC N GCNGG M++AF++I G+ TE YPY +
Sbjct: 142 EGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPYTAQ 201
Query: 236 NDRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQLYSHGVFD 272
+ CQ + A T+ Y+ I ++ +FQ YS GV++
Sbjct: 202 DGTCQFNSANVGA-TVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGVYN 260
Query: 273 E--YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
E QL+HGV VGYG YWLVKNSWGTSWG++GYI M RNS + CGI
Sbjct: 261 EPACSSSQLDHGVLAVGYGTSGSSDYWLVKNSWGTSWGQSGYIWMTRNSNNQ----CGIA 316
Query: 331 MQASYPV 337
ASYP+
Sbjct: 317 TAASYPL 323
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 141/333 (42%), Positives = 186/333 (55%), Gaps = 40/333 (12%)
Query: 32 LFLLWVLGIPAGAWSEGYPQKYDPQ-SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV 90
L ++WV+ + Q+ D ++ ER+++W +Y Y + E ++ I+ NV
Sbjct: 14 LIVIWVM------FPSNQNQENDQSLTLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNV 67
Query: 91 QYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV----QYLGLPAS 145
YID N+ N S+KLT N+FADL E + G+ K EP S+ +PA+
Sbjct: 68 AYIDSFNAAGNKSYKLTINRFADLPTE---PSDDGFKKRKLEPTTSSLFKYKNITDIPAA 124
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
VDWRK GAVTPVK+Q +CGSCWAFSAV A+EGI ++ +G LVSLSEQELVD ++ G
Sbjct: 125 VDWRKRGAVTPVKNQRECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVRSNWTNG 184
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY---- 261
CNGGY+ AFEF+ + GG+ TE YPYRG + K V I YE +P
Sbjct: 185 CNGGYLIDAFEFVLENGGIATEASYPYRGV--KGNNSKKVSRQVQIKSYEQVPRNSEDSL 242
Query: 262 -----------------AFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSW 303
+ YS G+F CG + NH V +VGYG + G KYWLVKNSW
Sbjct: 243 LKVVANQPVSVGIDISGMIRFYSSGIFTGECGTKPNHAVIIVGYGTSNDGTKYWLVKNSW 302
Query: 304 GTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G WGE YIRM R+ + G+CGI M ASYP
Sbjct: 303 GIRWGEKRYIRMKRDIDAKE-GLCGIPMDASYP 334
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 183/317 (57%), Gaps = 40/317 (12%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLS 114
+M R E W+ ++ R Y +E+E RR ++ +N + ID NS ++ + +L N+FADL+
Sbjct: 38 SAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLT 97
Query: 115 NEEFISTYLGYNKPYNEP----------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
+EEF + G +P R+ + S+DWR GAVT VKDQG CG
Sbjct: 98 DEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCG 157
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
CWAFSAVAAVEG+ K++TG+LVSLSEQ+LVDCDV +++GC GG M+ AFE++ GG+
Sbjct: 158 CCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGGL 217
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
TTE YPYRG + C+ + A +I GYE +PA
Sbjct: 218 TTESSYPYRGTDGSCRRSAS---AASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDSV 274
Query: 263 FQLYSHGVF-DEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
F+ Y GV CG +LNH +T VGYG G KYW++KNSWG SWGE GY+R+ R
Sbjct: 275 FRFYDSGVLGGSGCGTELNHAITAVGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRGVR 334
Query: 321 SSNIGICGILMQASYPV 337
G+CG+ ASYPV
Sbjct: 335 GE--GVCGLAQLASYPV 349
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 238 bits (608), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 118/229 (51%), Positives = 149/229 (65%), Gaps = 26/229 (11%)
Query: 133 RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
R+ +V LP ++DWR +GAVTP+KDQGQCG CWAFSAVAA EGI K+ TGKLVSL+EQ
Sbjct: 8 RYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQ 67
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
ELVDCDV+ E+QGC GG M+ AF+FI K GG+TTE YPY + +C++ + A TI
Sbjct: 68 ELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSG--SNSAATIK 125
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
GYE +PA FQ YS GV CG L+HG+ +GYG+
Sbjct: 126 GYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGK 185
Query: 291 -DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G KYWL+KNSWGT+WGE GY+RM ++ S G+CG+ M+ SYP K
Sbjct: 186 TSDGTKYWLMKNSWGTTWGENGYLRMEKDI-SDKRGMCGLAMEPSYPTK 233
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 189/343 (55%), Gaps = 59/343 (17%)
Query: 50 PQKYDPQSMEERFENWLKQYSREYG----SEDEWQRRFGIYSSNVQYIDYINSQN----L 101
P + + + +E W ++ R G + DE + R ++ N++YID N++
Sbjct: 42 PAERADEEVRRMYEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLH 101
Query: 102 SFKLTDNKFADLSNEEFISTYLGYNK-----PYNEPRWPSVQYLG--------------- 141
+F+L FADL+ EE+ LG+ P V G
Sbjct: 102 TFRLGLTPFADLTLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCG 161
Query: 142 -LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP ++DWR+ GAVT VK+Q QCG CWAFSAVAA+EGIN + TG LVSLSEQE++DCD
Sbjct: 162 DLPDAIDWRQLGAVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCD-- 219
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV-TITGY----- 254
+++ GCNGG ME AF+F+ GG+ +E DYP+ + C +K V I G+
Sbjct: 220 TQDSGCNGGQMENAFQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAAIDGFVEVAS 279
Query: 255 -------EAIPAR----------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYW 297
EA+ + AFQ YS G+F+ CG L+HGVTVVGYG ++G+ YW
Sbjct: 280 NNETALQEAVAIQPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYGSENGKAYW 339
Query: 298 LVKNSWGTSWGEAGYIRMARNS--PSSNIGICGILMQASYPVK 338
+VKNSW SWGEAGYIR+ RN P +G CGI M ASYPVK
Sbjct: 340 IVKNSWSDSWGEAGYIRIRRNVFLP---VGKCGIAMDASYPVK 379
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 128/262 (48%), Positives = 154/262 (58%), Gaps = 46/262 (17%)
Query: 102 SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPASVDWRKEGAVTPVK 158
S+KL+ N+FADL+NEEF ++ + S +Y +P++ DWRK+GAVTP+K
Sbjct: 4 SYKLSINEFADLTNEEFGTSRNRFKAHICSTEATSFKYENVTAVPSTXDWRKKGAVTPIK 63
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
DQGQCGSCWAFSAVAA+EGI +L TGKL+SLSEQELVDCD + E+QGC G
Sbjct: 64 DQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXGA--------- 114
Query: 219 TKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------ 260
+YPY G + C K H A I GYE +PA
Sbjct: 115 ----------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAI 164
Query: 261 ----YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRM 315
FQ YS GVF CG +L+HGV VGYG D G KYWLVKNSWGT WGE GYIRM
Sbjct: 165 DAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDGMKYWLVKNSWGTGWGEEGYIRM 224
Query: 316 ARNSPSSNIGICGILMQASYPV 337
R+ + G+CGI MQASYP
Sbjct: 225 QRDVTAKE-GLCGIAMQASYPT 245
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 135/324 (41%), Positives = 177/324 (54%), Gaps = 35/324 (10%)
Query: 46 SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFK 104
SE + SM ER E W+ +YSR Y + E +RRF ++ NV +I ++ N+ K
Sbjct: 19 SEATSRPLHEASMYERHEQWMARYSRNYKDDAEEERRFXMFKDNVDFIQTFDTAGNMPNK 78
Query: 105 LTDNKFADLSNEEF--------ISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTP 156
L N AD+++EEF I LG R +V + P+++DWRK+ VT
Sbjct: 79 LGVNALADMTHEEFRASGNTFKIPPNLGLRSETTSFRHQNVTRI--PSTMDWRKKRTVTH 136
Query: 157 VKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFE 216
+K+Q QCG CWAFSAVAA+EGI KL+T K +SLSEQELVDCD+ N GC GG M+ AF+
Sbjct: 137 IKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCEGGCMDDAFK 196
Query: 217 FITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------------------ 258
FI + G+ +E Y Y+G C K A I YE +P
Sbjct: 197 FIIQNRGLNSEARYLYKGVEGHCNKKKESSRAARINDYENMPEFSEKALLKVVAHQPISV 256
Query: 259 ----ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYI 313
AFQ Y G+ G+ L++GVT GYG G+K+WLVKNSWGT WGE GY
Sbjct: 257 AIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRSADGKKHWLVKNSWGTDWGENGYT 316
Query: 314 RMARNSPSSNIGICGILMQASYPV 337
RM R ++ G+CG MQASYP
Sbjct: 317 RMERGVKATT-GLCGFTMQASYPT 339
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 238 bits (606), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 136/323 (42%), Positives = 184/323 (56%), Gaps = 41/323 (12%)
Query: 48 GYPQKYDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFK 104
GY Q D S+E FE+W + + Y + DE RF I+ N+ YID N +N S+
Sbjct: 6 GYSQD-DLTSIERLVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKKNSSYW 64
Query: 105 LTDNKFADLSNEEFISTYLG-------YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPV 157
L N+FADL+++EF + Y+G + ++ +P + P S+DWR++GAVTPV
Sbjct: 65 LGLNEFADLTHDEFKAKYVGSLGEDSTIIEQSDDEEFPYKHVVDYPESIDWRQKGAVTPV 124
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
K+Q CGSCWAFS VA VEGINK+ TGKL+SLSEQEL+DCD S GC GGY + ++
Sbjct: 125 KNQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRRS--HGCKGGYQTTSLQY 182
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------- 260
+ GV TE +YPY K +C+ K V ITGY+ +PA
Sbjct: 183 VAD-NGVHTEKEYPYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPVSVV 241
Query: 261 -----YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
AFQ Y G+F+ CG +++H VT VGYG++ Y L+KNSWG WGE GYIR+
Sbjct: 242 VESKGRAFQFYKGGIFEGPCGTKVDHAVTAVGYGKN----YILIKNSWGPKWGEKGYIRI 297
Query: 316 ARNSPSSNIGICGILMQASYPVK 338
R S S G CG+ + +P K
Sbjct: 298 KRASGKSK-GTCGVYSSSYFPTK 319
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 134/316 (42%), Positives = 182/316 (57%), Gaps = 40/316 (12%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSN 115
+M R E W+ ++ R Y +E+E RR ++ +N + ID NS ++ + +L N+FADL++
Sbjct: 39 AMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTD 98
Query: 116 EEFISTYLGYNKPYNEP----------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
EEF + G +P R+ + S+DWR GAVT VKDQG CG
Sbjct: 99 EEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCGC 158
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFSAVAAVEG+ K++TG+LVSLSEQ+LVDCDV +++GC GG M+ AFE++ GG+T
Sbjct: 159 CWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGGLT 218
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAF 263
TE YPYRG + C+ + A +I GYE +PA F
Sbjct: 219 TESSYPYRGTDGSCRRSAS---AASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDSVF 275
Query: 264 QLYSHGVF-DEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
+ Y GV CG +LNH +T GYG G KYW++KNSWG SWGE GY+R+ R
Sbjct: 276 RFYDSGVLGGSGCGTELNHAITAAGYGTASDGTKYWIMKNSWGGSWGEGGYVRIRRGVRG 335
Query: 322 SNIGICGILMQASYPV 337
G+CG+ ASYPV
Sbjct: 336 E--GVCGLAQLASYPV 349
>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 237 bits (605), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 136/332 (40%), Positives = 194/332 (58%), Gaps = 47/332 (14%)
Query: 32 LFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREY-GSEDEWQRRFGIYSSNV 90
F+L+ + + S+ Y +K F+ + +Y + Y SE E++++ Y N+
Sbjct: 5 FFVLFAVALSLNLHSDAYYEKL--------FQTFEAKYGKNYLSSEREYRKKVLAY--NM 54
Query: 91 QYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG--YNKPYNEPRWPSVQYLGLPASVDW 148
+I+ NS SF L FAD++N EF ++ L KP N + + + + S+DW
Sbjct: 55 DWIEKFNSDEHSFTLGMTPFADMTNTEFATSKLCGCMKKPLNHKQARVLNNMAVE-SIDW 113
Query: 149 RKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNG 208
R++GAVTPVK+QG CGSCWAFSA A+EG N + TGKLVSLSEQ+LVDCD +E+ GC G
Sbjct: 114 REKGAVTPVKNQGSCGSCWAFSATGALEGGNFVATGKLVSLSEQQLVDCD--TEDAGCGG 171
Query: 209 GYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------- 260
G+M+ AFE++ K G+ TE+DYPY K++ C+ D+ ++ITGYE +PA
Sbjct: 172 GFMDTAFEYVMK-KGLCTEEDYPYHAKDEDCKDDQCT-SVISITGYEDVPANDGVALKQA 229
Query: 261 --------------YAFQLYSHGVFD-EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGT 305
+ FQ+Y+ GV D + CG LNHGV VGY ++ Y +VKNSWG
Sbjct: 230 LTKAPVSVAIQADSFVFQMYTGGVLDSDMCGTSLNHGVLAVGYAKE----YIIVKNSWGA 285
Query: 306 SWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWG+ GY+++A GICGI M ASYP
Sbjct: 286 SWGDKGYVKIAHRDQGE--GICGINMAASYPT 315
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 237 bits (604), Expect = 6e-60, Method: Compositional matrix adjust.
Identities = 133/317 (41%), Positives = 186/317 (58%), Gaps = 39/317 (12%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFA 111
++P S+E + E W+ ++SR Y E E Q R ++ N+++I+ N + N S+KL N+FA
Sbjct: 31 HEPSSLE-KHEQWMARFSRVYRDELEKQMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFA 89
Query: 112 DLSNEEFISTYLGY--------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQC 163
D +NEEF++ + G ++ + W +G+ S DWR EGAVTPVK QGQC
Sbjct: 90 DWTNEEFLAIHTGLKGLSSKVVDETISSRSWNISDMVGV--SKDWRAEGAVTPVKYQGQC 147
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
G CWAFSAVAAVEG+ K+ G LVSLSEQ+L+DCD ++GC+GG M AF +I + G
Sbjct: 148 GCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDCD-REYDRGCDGGIMSDAFNYIIQNRG 206
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY---------------------- 261
+ +E+DY Y+G + RC++ + A I+G++ +P+
Sbjct: 207 IASENDYSYQGSDGRCRS--SARPAARISGFQTVPSNNEQALLEAVSRQPVSVSMDANGD 264
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSP 320
F YS GV+D CG NH VT VGYG G KYWL KNSWG +WGE GYIR+ R+
Sbjct: 265 GFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIRRDVA 324
Query: 321 SSNIGICGILMQASYPV 337
G+CG+ A YPV
Sbjct: 325 WPQ-GMCGVAQYAFYPV 340
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 237 bits (604), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 124/220 (56%), Positives = 143/220 (65%), Gaps = 27/220 (12%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
+PASVDWR++GAVT VKDQGQCGSCWAFS +AAVEGIN +KT L SLSEQ+LVDCD +
Sbjct: 43 VPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKA 102
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
N GCNGG M+ AF++I K GGV ED YPYR + C+ K+ VTI GYE +PA
Sbjct: 103 -NAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASCK--KSPAPVVTIDGYEDVPAND 159
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWL 298
FQ YS GVF CG +L+HGV VGYG G KYWL
Sbjct: 160 ESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVTADGTKYWL 219
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSWG WGE GYIRMAR+ + G CGI M+ASYPVK
Sbjct: 220 VKNSWGPEWGEKGYIRMARDVAAKE-GHCGIAMEASYPVK 258
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 236 bits (603), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 182/318 (57%), Gaps = 42/318 (13%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSN 115
+ME R + W+ ++ R Y E RRF ++ +NV ID N+ N ++L N+F DL++
Sbjct: 37 TMEARHDKWMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTD 96
Query: 116 EEFISTYLGYN------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
EF + Y GYN N S + PA VDWR++GAVT VK+Q CG CWAF
Sbjct: 97 AEFAAMYTGYNPANTMYAAANATTRLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCWAF 156
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
S VAAVEGI+++ TG+LVSLSEQ+L+DC ++N GC GG ++ AF+++ GGVTTE
Sbjct: 157 STVAAVEGIHQITTGELVSLSEQQLLDC---ADNGGCTGGSLDNAFQYMANSGGVTTEAA 213
Query: 230 YPYRGKNDRCQTD---KTKHHAVTITGYEAI---------------PARYA-------FQ 264
Y Y+G CQ D A TI+GY+ + P A F+
Sbjct: 214 YAYQGAQGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFR 273
Query: 265 LYSHGVFD-EYCGHQLNHGVTVVGYGEDH----GEKYWLVKNSWGTSWGEAGYIRMARNS 319
Y GVF + CG +L+H V VVGYG + G YW++KNSWGT+WG+ GY+++ ++
Sbjct: 274 HYGSGVFTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDV 333
Query: 320 PSSNIGICGILMQASYPV 337
S G CG+ M SYPV
Sbjct: 334 GSQ--GACGVAMAPSYPV 349
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 178/313 (56%), Gaps = 38/313 (12%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-DYINSQNLSFKLTDNKFADLSNEEF 118
+ FE W+ ++ ++Y E + RFG++ NV++I Y + L N+FADL+N+EF
Sbjct: 39 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 98
Query: 119 ISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGI 178
+ST+ G P + V + LP +DWR +GAVT VKDQG CGSCWAF+AVAA+EG+
Sbjct: 99 VSTHTGAKPPCPKDAPRGVDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFAAVAAIEGL 158
Query: 179 NKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR 238
+++TGKL LSEQELVDCD S GC GG+ ++AFE + GG+T E Y Y G +
Sbjct: 159 TQIRTGKLTPLSEQELVDCDTGS--SGCAGGHTDRAFELVAAKGGITAESGYRYEGYRGK 216
Query: 239 CQTDKTK-HHAVTITGYEAIP-----------ARY-----------AFQLYSHGVFDEYC 275
C+ D +HA I G+ A+P AR AFQ Y GVF C
Sbjct: 217 CRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGSGVFPGPC 276
Query: 276 GH---------QLNHGVTVVGYGED--HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
G NH VT+VGY +D G+KYW+ KNSWG +WGE GYI + ++ S +
Sbjct: 277 GSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILLEKDVASPH- 335
Query: 325 GICGILMQASYPV 337
G CG+ + YP
Sbjct: 336 GTCGVAVSPFYPT 348
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 134/327 (40%), Positives = 183/327 (55%), Gaps = 39/327 (11%)
Query: 48 GYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTD 107
GY Q+ D F +W ++ + Y S E R+ I+ N+ +I N +N S+ L
Sbjct: 31 GYSQE-DLALPSSLFRSWSVKHGKLYASPTEKLERYEIFKQNLMHIAETNRKNGSYWLGL 89
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-----------GLPASVDWRKEGAVTP 156
N+FAD+++EEF ++YLG + P + LP SVDWR +GAVTP
Sbjct: 90 NQFADVAHEEFKASYLGLKRALPRAGAPQTRTPTAFRYAAAAAGSLPWSVDWRYKGAVTP 149
Query: 157 VKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFE 216
VK+QG+CGSCWAFS+VAAVEGIN++ TGKLVSLSEQELVDCD + + GC GG M+ AF
Sbjct: 150 VKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQELVDCD-TTLDHGCEGGTMDLAFA 208
Query: 217 FITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT---ITGYEAIP--------------- 258
++ G+ EDDYPY + C+ + +T +TG+E +P
Sbjct: 209 YMMGSQGIHAEDDYPYLMEEGYCKEKQPCVLGITEQDLTGFEDVPENSEISLLKALAHQP 268
Query: 259 -------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAG 311
FQ Y GVFD C +L+H +T VGYG +G+ Y +KNSWG +WGE G
Sbjct: 269 VSVGIAAGSRDFQFYRGGVFDGACSVELDHALTAVGYGSSYGQNYITMKNSWGKNWGEQG 328
Query: 312 YIRMARNSPSSNIGICGILMQASYPVK 338
Y+R+ + G+CGI ASYPVK
Sbjct: 329 YVRIKMGTGKPE-GVCGIYTMASYPVK 354
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 133/315 (42%), Positives = 175/315 (55%), Gaps = 38/315 (12%)
Query: 55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-SFKLTDNKFADL 113
P +ME FE W + + + Y E R ++ +N +D N + S+ L N FADL
Sbjct: 25 PLNME--FEAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADL 82
Query: 114 SNEEFISTYLGYNKPYNEPR-------WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
++EEF YLG N PR P+ LP SVDWR G VTPVKDQGQCGSC
Sbjct: 83 THEEFKRFYLGTKVDLNRPRSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSC 142
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
W+FS +VEG + KTG+LVSLSEQ LVDC NQGCNGG M+ AF++I G+ T
Sbjct: 143 WSFSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDT 202
Query: 227 EDDYPYRGKNDRCQ-------------------TDKTKHHAVTITGYEAI---PARYAFQ 264
E YPY K+ C+ ++ +AV G ++ ++ +FQ
Sbjct: 203 EASYPYTAKDGTCKFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQ 262
Query: 265 LYSHGVFDEY--CGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
LY+ GV++E L+HGV GYG +G YWLVKNSWG+SWG+AGYI M+RN+ +
Sbjct: 263 LYTSGVYNEKKCSSTSLDHGVLAAGYGTSNGTPYWLVKNSWGSSWGQAGYIWMSRNANNQ 322
Query: 323 NIGICGILMQASYPV 337
CGI ASYP+
Sbjct: 323 ----CGIATSASYPI 333
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 182/318 (57%), Gaps = 42/318 (13%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSN 115
+ME R + W+ ++ R Y E RRF ++ +NV ID N+ N ++L N+F DL++
Sbjct: 27 TMEARHDKWMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRFTDLTD 86
Query: 116 EEFISTYLGYN------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
EF + Y GYN N S + PA VDWR++GAVT VK+Q CG CWAF
Sbjct: 87 AEFAAMYTGYNPANTMYAAANATTRLSSEDDQQPAEVDWRQQGAVTGVKNQRSCGCCWAF 146
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
S VAAVEGI+++ TG+LVSLSEQ+L+DC ++N GC GG ++ AF+++ GGVTTE
Sbjct: 147 STVAAVEGIHQITTGELVSLSEQQLLDC---ADNGGCTGGSLDNAFQYMANSGGVTTEAA 203
Query: 230 YPYRGKNDRCQTD---KTKHHAVTITGYEAI---------------PARYA-------FQ 264
Y Y+G CQ D A TI+GY+ + P A F+
Sbjct: 204 YAYQGAQGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGSGAMFR 263
Query: 265 LYSHGVFD-EYCGHQLNHGVTVVGYGEDH----GEKYWLVKNSWGTSWGEAGYIRMARNS 319
Y GVF + CG +L+H V VVGYG + G YW++KNSWGT+WG+ GY+++ ++
Sbjct: 264 HYGSGVFTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMKLEKDV 323
Query: 320 PSSNIGICGILMQASYPV 337
S G CG+ M SYPV
Sbjct: 324 GSQ--GACGVAMAPSYPV 339
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 236 bits (601), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 129/306 (42%), Positives = 168/306 (54%), Gaps = 45/306 (14%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+ E FE+W+ ++ + Y S +E R ++ N+ +ID N ++ L N+FADLS+
Sbjct: 41 HKLTELFESWMSKHGKTYESIEEKLHRLEVFKDNLMHIDRRNRDVTTYWLALNEFADLSH 100
Query: 116 EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
EEF S A + ++GAV PVK+QG CGSCWAFS VAAV
Sbjct: 101 EEFKSKL---------------------AQIRRLEKGAVAPVKNQGSCGSCWAFSTVAAV 139
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EGIN++ TG L SLSEQEL+DCD S N GCNGG M+ AF++I GG+ E+DYPY +
Sbjct: 140 EGINQIVTGNLTSLSEQELIDCDT-SFNSGCNGGLMDYAFDYIVNNGGLHKEEDYPYLME 198
Query: 236 NDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHGVFDE 273
C + + VTI+GY +P + FQ Y GVF+
Sbjct: 199 EGTCDEKREEMEVVTISGYHDVPENNEESLLKALAHQPLSIAIEASGRDFQFYGRGVFNG 258
Query: 274 YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
CG L+HGV VGYG G Y +VKNSWG WGE GYIRM RN+ G+CGI A
Sbjct: 259 PCGTDLDHGVAAVGYGSSKGLDYIIVKNSWGPKWGEKGYIRMKRNTGKPE-GLCGINKMA 317
Query: 334 SYPVKR 339
SYP K+
Sbjct: 318 SYPTKK 323
>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
Length = 396
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 136/331 (41%), Positives = 186/331 (56%), Gaps = 55/331 (16%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTD----NKFADL 113
+E+ F+ WL +Y +E + +E +R I+ N ++ N++ ++ K++ NKFA
Sbjct: 68 IEDAFDAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVEMNKFAAH 127
Query: 114 SNEEFISTYLGYNKPYNEPR-----------WPSVQYLGL--PASVDWRKEGAVTPVKDQ 160
+ EE+ LG+ K + W +Y G+ P S+DW EG +T K+Q
Sbjct: 128 TREEY-RKMLGFKKSLRRKKDSGEAAKDVSLW---EYEGVEAPESIDWVDEGVITTPKNQ 183
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G CGSCWAFSA+ AVEGIN ++TGKLVSLSEQELV C NQGCNGG M+ AFE+I +
Sbjct: 184 GSCGSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFEWIVE 243
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA--------------------- 259
GGV +E Y Y+ D C+T KT H +I G+ +P+
Sbjct: 244 NGGVDSEKQYQYKASFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVSVAIEA 303
Query: 260 -RYAFQLYSHGVFD-EYCGHQLNHGVTVVGYGEDHG----------EKYWLVKNSWGTSW 307
+ +FQLY GV+ E CG QL+HGV VVGYG DH +KYW +KNSW W
Sbjct: 304 DQRSFQLYGGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNSWSEQW 363
Query: 308 GEAGYIRMARNSPSSNIGICGILMQASYPVK 338
GE GYIR+AR+ S + G+CG+ ASYP K
Sbjct: 364 GEGGYIRIARDVESPS-GMCGVAEMASYPEK 393
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 134/358 (37%), Positives = 197/358 (55%), Gaps = 46/358 (12%)
Query: 17 IAIDMRMMLRNAV-LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGS 75
I ++ ++ AV L++ + + A S Y ++M+ R + W+ ++ R Y
Sbjct: 5 IVVNKTVITFTAVALTILAVTTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRD 64
Query: 76 EDEWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLSNEEFISTYLGYNK-PYNE 131
E E RF ++ +N ++D N+ S++L N+FAD++N+EF++ Y G P
Sbjct: 65 EAEKAHRFQVFKANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGLRPVPAGA 124
Query: 132 PRWPSVQYLGLPAS--------VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKT 183
+ +Y + S VDWR++GAVT +K+QGQCG CWAF+AVAAVEGI+++ T
Sbjct: 125 KKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITT 184
Query: 184 GKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK 243
G LVSLSEQ+++DCD + N GCNGGY++ AF++I GG+ TED YPY CQ
Sbjct: 185 GNLVSLSEQQVLDCDTDGNN-GCNGGYIDNAFQYIVGNGGLGTEDAYPYTAAQAMCQ--- 240
Query: 244 TKHHAVTITGYEAIPA--------------------RYAFQLYSHGVFDEY-CGH--QLN 280
+ I+GY+ +P+ + FQLY GV C LN
Sbjct: 241 SVQPVAAISGYQDVPSGDEAALAAAVANQPVSVAIDAHNFQLYGGGVMTAASCSTPPNLN 300
Query: 281 HGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
H VT VGYG + G YWL+KN WG +WGE GY+R+ R + + CG+ QASYPV
Sbjct: 301 HAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLERGANA-----CGVAQQASYPV 353
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 174/308 (56%), Gaps = 36/308 (11%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+++ E +E W Q+ +R+ G E RRF ++ NV+ I N ++ +KL N+F D+
Sbjct: 42 EALWELYERWRGQHRVARDLG---EKARRFNVFKDNVRLIHEFNRRDEPYKLRLNRFGDM 98
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
+ +E Y ++ + G R GAV VKDQGQCGSCWAFS +A
Sbjct: 99 TADESAGAYASSRVSHHR------MFRGRGEKAQ-RLHGAVGAVKDQGQCGSCWAFSTIA 151
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
AVEGIN ++T L +LSEQ+LVDCD + N GC+GG M+ AF++I K GGV YPYR
Sbjct: 152 AVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVAASSAYPYR 211
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------FQLYSHGVF 271
+ C++ AVTI GYE +PA FQ YS GVF
Sbjct: 212 ARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHFQFYSEGVF 271
Query: 272 DEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
CG +L+HGV VGYG G KYW+V+NSWG WGE GYIRM R+ S+ G+CGI
Sbjct: 272 AGKCGTELDHGVAAVGYGTTVDGTKYWIVRNSWGADWGEKGYIRMKRDV-SAKEGLCGIA 330
Query: 331 MQASYPVK 338
M+ASYP+K
Sbjct: 331 MEASYPIK 338
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 178/313 (56%), Gaps = 38/313 (12%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-DYINSQNLSFKLTDNKFADLSNEEF 118
+ FE W+ ++ ++Y E + RFG++ NV++I Y + L N+FADL+N+EF
Sbjct: 17 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 76
Query: 119 ISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGI 178
+ST+ G P + V + LP +DWR +GAVT VKDQG CGSCWAF+AVAA+EG+
Sbjct: 77 VSTHTGAKPPCPKDAPRGVDPIWLPCCIDWRYKGAVTDVKDQGACGSCWAFAAVAAIEGL 136
Query: 179 NKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR 238
+++TGKL LSEQELVDCD S GC GG+ ++AFE + GG+T E Y Y G +
Sbjct: 137 TQIRTGKLTPLSEQELVDCDTGS--SGCAGGHTDRAFELVAAKGGITAESGYRYEGYRGK 194
Query: 239 CQTDKTK-HHAVTITGYEAIP-----------ARY-----------AFQLYSHGVFDEYC 275
C+ D +HA I G+ A+P AR AFQ Y GVF C
Sbjct: 195 CRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGSGVFPGPC 254
Query: 276 GH---------QLNHGVTVVGYGED--HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
G NH VT+VGY +D G+KYW+ KNSWG +WGE GYI + ++ S +
Sbjct: 255 GSGSGAAAAAPTTNHAVTLVGYCQDGASGKKYWVAKNSWGKTWGEKGYILLEKDVASPH- 313
Query: 325 GICGILMQASYPV 337
G CG+ + YP
Sbjct: 314 GTCGVAVSPFYPT 326
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 128/284 (45%), Positives = 168/284 (59%), Gaps = 38/284 (13%)
Query: 83 FGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTY-----LGYNKPYNEPRWPSV 137
F + +N++ I+ N+ N SF + +FADL+ EF S Y + +P NE W +
Sbjct: 48 FRCHLANLRVIEAHNAGNSSFTMGITQFADLTAAEF-SAYVKRFPMNVTRPRNE-VWITE 105
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
L VDWR++ AVT +K+QGQCGSCW+FS +VEG + + TGKLVSLSEQ+L+DC
Sbjct: 106 APL---QEVDWRQKNAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDC 162
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
N GCNGG M+ AFE++ GG+ TE+DYPY ++ +C T+K K HA I G+ +
Sbjct: 163 STRYGNHGCNGGLMDYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEIHGFRNV 222
Query: 258 PARY----------------------AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK 295
P + FQ Y+ GVFD CG L+HGV VVGY +D
Sbjct: 223 PKEHEDQLAAAVSIGPVSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSDD---- 278
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
YW+VKNSWG SWGE GYIR+ R G+CGI MQASYP KR
Sbjct: 279 YWIVKNSWGKSWGEEGYIRLKRGVDKK--GMCGITMQASYPEKR 320
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 235 bits (599), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 126/294 (42%), Positives = 169/294 (57%), Gaps = 38/294 (12%)
Query: 59 EERFEN----WLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLS 114
EE F+N + Y + Y +E+E Q+R+ I+ +N+ YI N Q S+ L N F DLS
Sbjct: 112 EEHFQNAFGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQGYSYSLKMNHFGDLS 171
Query: 115 NEEFISTYLGYNKPYN--------EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
EEF YLGYNK N V +P++VDWR++G VTPVKDQ CGSC
Sbjct: 172 REEFRRKYLGYNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSC 231
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSA A+EG + KTG+L+SLSEQELVDC + NQGC+GG M AF+++ GG+ +
Sbjct: 232 WAFSATGALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCS 291
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQ 264
E+ YPY ++ C+ K VTI+G++ +P + FQ
Sbjct: 292 EEGYPYLARDGECKRACKK--VVTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQ 349
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEK--YWLVKNSWGTSWGEAGYIRMA 316
Y GVFD CG L+HGV +VGYG D K +W++KNSWG+ WG GY+ MA
Sbjct: 350 FYHEGVFDASCGTDLDHGVLLVGYGTDKETKKDFWIMKNSWGSGWGRDGYMYMA 403
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 234 bits (598), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 133/358 (37%), Positives = 196/358 (54%), Gaps = 46/358 (12%)
Query: 17 IAIDMRMMLRNAV-LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGS 75
I ++ ++ AV L++ + + A S Y ++M+ R + W+ ++ R Y
Sbjct: 5 IVVNKTVIAFTAVALTILAVKTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRD 64
Query: 76 EDEWQRRFGIYSSNVQYIDYINS---QNLSFKLTDNKFADLSNEEFISTYLGYNK-PYNE 131
E E RF ++ +N ++D N+ S+++ N+FAD++N+EF++ Y G P
Sbjct: 65 EAEKAHRFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGLRPVPAGA 124
Query: 132 PRWPSVQYLGLPAS--------VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKT 183
+ +Y + S VDWR++GAVT +K+QGQCG CWAF+AVAAVEGI+++ T
Sbjct: 125 KKMAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITT 184
Query: 184 GKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK 243
G LVSLSEQ+++DCD N GCNGGY++ AF++I GG+ TED YPY CQ
Sbjct: 185 GNLVSLSEQQVLDCDTEGNN-GCNGGYIDNAFQYIAGNGGLATEDAYPYTAAQAMCQ--- 240
Query: 244 TKHHAVTITGYEAIPA--------------------RYAFQLYSHGVFDEY-CGH--QLN 280
+ I+GY+ +P+ + FQLY GV C LN
Sbjct: 241 SVQPVAAISGYQDVPSGDEAALAAAVANQPVSVAIDAHNFQLYGGGVMTAASCSTPPNLN 300
Query: 281 HGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
H VT VGYG + G YWL+KN WG +WGE GY+R+ R + + CG+ QASYPV
Sbjct: 301 HAVTAVGYGTAEDGTPYWLLKNQWGQNWGEGGYLRLERGANA-----CGVAQQASYPV 353
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 136/320 (42%), Positives = 180/320 (56%), Gaps = 46/320 (14%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFIS 120
F+ WL ++ + YGS +E RR I+ +N+QYI N + N SF+L NKFADL+NEEF +
Sbjct: 43 FDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNEEFKT 102
Query: 121 TYLGYN-KPYNEPRWPSVQYLGL-----------------PASVDWRKEGAVTPVKDQGQ 162
Y G N K + + R ++ L +S+DWRK+GAVT VKDQ Q
Sbjct: 103 RYFGKNSKQWRDRRRTELEGAELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVKDQAQ 162
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS A+EG+N + TGKLVSLSEQELV CD + N GC GG M+ AF ++ + G
Sbjct: 163 CGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACD--ATNYGCEGGDMDYAFTWVIQNG 220
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------ARY 261
G+ TE DY Y G + C T+K V+I GY + +
Sbjct: 221 GIDTEKDYSYTGVDSTCNTNKEAKKIVSIDGYTDVSPDDSALLCAAGSQPVSVGIDGSAI 280
Query: 262 AFQLYSHGVFDEYCG---HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
FQLY+ G++D C ++H V VVGY +G+ YW+VKNSWGT WG GY + RN
Sbjct: 281 DFQLYTGGIYDGDCSGNPDDIDHAVLVVGYSAKNGKDYWIVKNSWGTDWGLEGYFYILRN 340
Query: 319 SPSSNIGICGILMQASYPVK 338
+ G+C I ASYP K
Sbjct: 341 TELP-YGVCAINAMASYPTK 359
>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
sativus]
Length = 235
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 115/219 (52%), Positives = 141/219 (64%), Gaps = 23/219 (10%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP +VDWR++GAV +K+QG CGSCWAFS A VEGINK+ TG+L+SLSEQELVDCD S
Sbjct: 4 LPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDCD-KS 62
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
NQGCNGG M+ AF+FI K GG+ TE DYPYRG + +C + VTI GYE +P
Sbjct: 63 YNQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSDGKCNSLLKNSKVVTIDGYEDVPTND 122
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
FQ Y G+F CG +++H V VGYG ++G YW+V
Sbjct: 123 ETALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYGSENGVDYWIV 182
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+NSWG WGE GYIR+ RN SS G CGI ++ASYPVK
Sbjct: 183 RNSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPVK 221
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 234 bits (597), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 132/341 (38%), Positives = 193/341 (56%), Gaps = 40/341 (11%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
MR++L A++ FL+ I + + + QK + F+NW+ ++ + Y + DE+
Sbjct: 1 MRLVL--ALIFCFLI----INCCSAARIFSQK----QYQTAFQNWMVKHQKSY-TNDEFG 49
Query: 81 RRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL 140
R+ ++ N+ + N + + L N ADL+NEEF YLG + V
Sbjct: 50 SRYSVFQDNMDIVAKWNQKGSNTILGLNVMADLTNEEFKKLYLGTKANVTYKKKTLVGVS 109
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
GLPASVDWR GAVT VK+QGQCG C+AFS +VEGI+++ + +LV LSEQ+++DC +
Sbjct: 110 GLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDCSGS 169
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI--- 257
N GC+GG M +FE+I +GG+ TE YPY G+ +C+ +K K+ TITGY+ +
Sbjct: 170 EGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYTGEVGKCKFNK-KNIGATITGYKNVESG 228
Query: 258 -------------------PARYAFQLYSHGVF--DEYCGHQLNHGVTVVGYGEDHGEKY 296
++ +FQLY+ GV+ E QL+HGV VGYG G+ Y
Sbjct: 229 SESDLQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYGSQSGQDY 288
Query: 297 WLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
W+VKNSWG WGE G+I MARN ++ CGI AS+P
Sbjct: 289 WIVKNSWGADWGENGFILMARNKDNN----CGIATMASFPT 325
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 234 bits (597), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 128/314 (40%), Positives = 179/314 (57%), Gaps = 43/314 (13%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+ ++E +E WL ++ + Y E+++RF I+ N+++ID NS+N ++K+ + DL+N
Sbjct: 39 EEVKEIYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNSENHTYKMGLTPYTDLTN 98
Query: 116 EEFISTYLG--------YNKPYN-EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
EEF + YLG + N R+ LP +DWRK+GAVTPVK+QG+CGSC
Sbjct: 99 EEFQAIYLGTRSDTIHRLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSC 158
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS V+ VE IN+++TG L+SLSEQ+LVDC N +N GC GG A+++I GG+ T
Sbjct: 159 WAFSTVSTVESINQIRTGNLISLSEQQLVDC--NKKNHGCKGGAFVYAYQYIIDNGGIDT 216
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQ 264
E +YPY+ C+ K V I GY+ +P + FQ
Sbjct: 217 EANYPYKAVQGPCRAAK---KVVRIDGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQ 273
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y G+F CG +LNHGV +VGY +D YW+V+NSWG WGE GYIRM R
Sbjct: 274 HYKSGIFSGPCGTKLNHGVVIVGYWKD----YWIVRNSWGRYWGEQGYIRMKR---VGGC 326
Query: 325 GICGILMQASYPVK 338
G+CGI YP K
Sbjct: 327 GLCGIARLPYYPTK 340
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 535
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 178/315 (56%), Gaps = 40/315 (12%)
Query: 55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN--LSFKLTDNKFAD 112
P E F W+K +S + E+ +R Y +N YI N +N KL N+F+
Sbjct: 22 PLEYEHEFSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSS 81
Query: 113 LSNEEFISTYLGYNKP--YNEPR--------WPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
+S EEF GY P Y E R W VQ +P SVDW+ +G VTPVK+QG
Sbjct: 82 MSFEEFKFKMTGYVMPEGYLEQRLASRVDNLWSDVQ---VPDSVDWQDKGGVTPVKNQGM 138
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS AVEG + +GKLVSLSEQELVDCD N + GCNGG M+ AF +I G
Sbjct: 139 CGSCWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHNGD-MGCNGGLMDHAFAWIEDNG 197
Query: 223 GVTTEDDYPYRGKNDRCQ-------------TDKTKHHAVTITGYE-----AIPA-RYAF 263
G+ +EDDY Y+ K C+ + HA+ + + AI A + AF
Sbjct: 198 GICSEDDYEYKAKAQVCRDCEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAF 257
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR--NSPS 321
Q Y GVF+ CG +L+HGV VGYG ++G+K+W VKNSWG+SWGE GYIR+AR N P+
Sbjct: 258 QFYKSGVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLAREENGPA 317
Query: 322 SNIGICGILMQASYP 336
G CGI SYP
Sbjct: 318 ---GQCGIASVPSYP 329
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 233 bits (595), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 127/325 (39%), Positives = 177/325 (54%), Gaps = 46/325 (14%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL--------SFKLTDN 108
+M R E+W+ ++ R Y +E RR I+ +N + ID NS+ S +L N
Sbjct: 38 AMASRHESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATN 97
Query: 109 KFADLSNEEFISTYLGYNKPYNEPRWPSVQY--------LGLPASVDWRKEGAVTPVKDQ 160
+FADL++EEF + G +P + S+DWR GAVT VKDQ
Sbjct: 98 RFADLTDEEFRAARTGLRRPAAVAGAVGGGFRYENFSLQADAAGSMDWRAMGAVTGVKDQ 157
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G CG CWAFSAVAA+EG+ K++TG+LVSLSEQ+LVDCDV ++QGC GG M+ AF++I++
Sbjct: 158 GSCGCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISR 217
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------------- 260
GG+ +E YPY G++ A +I G+E +PA
Sbjct: 218 QGGLASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVAING 277
Query: 261 --YAFQLYSH----GVFDEYC-GHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGY 312
Y F+ Y + C +L+H +T VGYG G YWL+KNSWG+ WGE+GY
Sbjct: 278 GDYVFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTGYWLMKNSWGSGWGESGY 337
Query: 313 IRMARNSPSSNIGICGILMQASYPV 337
+R+ R S G+CG+ ASYPV
Sbjct: 338 VRIRRGSRGE--GVCGLAKLASYPV 360
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 233 bits (595), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 178/315 (56%), Gaps = 40/315 (12%)
Query: 55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN--LSFKLTDNKFAD 112
P E F W+K +S + E+ +R Y +N YI N +N KL N+F+
Sbjct: 22 PLEYEHEFSAWMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSS 81
Query: 113 LSNEEFISTYLGYNKP--YNEPR--------WPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
+S EEF GY P Y E R W VQ +P SVDW+ +G VTPVK+QG
Sbjct: 82 MSFEEFKFKMTGYVMPEGYLEQRLASRVDNLWSDVQ---VPDSVDWQDKGGVTPVKNQGM 138
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS AVEG + +GKLVSLSEQELVDCD N + GCNGG M+ AF +I G
Sbjct: 139 CGSCWAFSTTGAVEGAAFVSSGKLVSLSEQELVDCDHNGD-MGCNGGLMDHAFAWIEDNG 197
Query: 223 GVTTEDDYPYRGKNDRCQ-------------TDKTKHHAVTITGYE-----AIPA-RYAF 263
G+ +EDDY Y+ K C+ + HA+ + + AI A + AF
Sbjct: 198 GICSEDDYEYKAKAQVCRDCEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAF 257
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR--NSPS 321
Q Y GVF+ CG +L+HGV VGYG ++G+K+W VKNSWG+SWGE GYIR+AR N P+
Sbjct: 258 QFYKSGVFNLTCGTRLDHGVLAVGYGSENGQKFWKVKNSWGSSWGEKGYIRLAREENGPA 317
Query: 322 SNIGICGILMQASYP 336
G CGI SYP
Sbjct: 318 ---GQCGIASVPSYP 329
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 137/339 (40%), Positives = 191/339 (56%), Gaps = 45/339 (13%)
Query: 34 LLWVLGIPAGAWS-EGYPQKYDPQSMEER----FENWLKQYSREYGSEDEWQRRFGIYSS 88
L+ +G+ + +S GY Q D + ER FE+W+ ++ R Y + +E RF I+
Sbjct: 17 LIVHVGLSSADFSIVGYSQ--DDLTSTERLIRLFESWMLKHDRVYNNIEEKIHRFEIFKD 74
Query: 89 NVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG-------YNKPYNEPRWPSVQYLG 141
N+ YID N +N S+ L N+F DL+++EF Y+G + N+ +P +
Sbjct: 75 NLMYIDETNKKNNSYWLGLNEFVDLTHDEFKEKYVGSIGEDFVTIEQSNDEEFPYKHVVD 134
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
P S+DWR +GAVTPVK CGSCWAFS VA VEGINK+ TGKL+SLSEQEL+DCD S
Sbjct: 135 YPESIDWRDKGAVTPVKPN-PCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRRS 193
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
GC GGY + +++ GV TE +YPY K +C+ + K V ITGY+ +PA
Sbjct: 194 --HGCKGGYQTTSLQYVVD-NGVHTEKEYPYEKKQGKCRAKEKKGTKVQITGYKRVPAND 250
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
AFQLY G+F+ CG +L+H VT +GY G+ Y L+
Sbjct: 251 EISLIQAIANQPVSVLLESKGRAFQLYKGGIFNGPCGTKLDHAVTAIGY----GKTYILI 306
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
KNSWG +WGE GY+++ R S S G CG+ + +P K
Sbjct: 307 KNSWGPNWGEKGYLKIKRASGKSE-GTCGVYKSSYFPTK 344
>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
Length = 245
Score = 233 bits (594), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 116/219 (52%), Positives = 139/219 (63%), Gaps = 23/219 (10%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP SVDWR+ GAV PVKDQ CGSCWAFS VAAVEGIN++ TG+L+SLSEQELVDCD
Sbjct: 6 LPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEY 65
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP--- 258
+ GCNGG M+ AF+FI K GG+ TE DYPY G + C V+I GYE +P
Sbjct: 66 D-MGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFD 124
Query: 259 -------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
A QLY G+F CG L+HG+ VGYG ++G YW+V
Sbjct: 125 EKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGTENGTDYWIV 184
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
+NSWG+SWGE GYIRM RN + G CGI M+ASYP+K
Sbjct: 185 RNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPIK 223
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 233 bits (594), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 127/294 (43%), Positives = 172/294 (58%), Gaps = 43/294 (14%)
Query: 82 RFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV 137
R ++ N+Q++D N+ +F L N+FADL+NEE+ + +L + ++ R +
Sbjct: 73 RLEVFKENLQFVDEHNAAADRGEHTFLLGMNRFADLTNEEYRTRFL---RDFSRLRRSAS 129
Query: 138 QYLG----------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
+ LP S+DWR+ GAV PVK+QG CGSCWAFS VAAVEGIN++ TG L+
Sbjct: 130 GKISSRYRLREGDDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLI 189
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SLSEQ+LVDC + N GC GG+M AF+FI GG+ +E+ YPYRG+N C +
Sbjct: 190 SLSEQQLVDC--TTANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNS-TVNAP 246
Query: 248 AVTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTV 285
V+I YE +P A FQLY G+F C NH +TV
Sbjct: 247 VVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTV 306
Query: 286 VGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
VGYG ++ + +W+VKNSWG +WGE+GYIR RN + N G CGI ASYPVK+
Sbjct: 307 VGYGTENDKDFWIVKNSWGKNWGESGYIRAERNIENPN-GKCGITRFASYPVKK 359
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 181/315 (57%), Gaps = 38/315 (12%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFK----LTDNKFADLSN 115
E F+ W +++ + Y +E ++RF + N++YI N++ + K + NKFAD+SN
Sbjct: 47 EIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKFADMSN 106
Query: 116 EEFISTYLG-YNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EEF YL KP N+ S VQ P+S+DWR G VT VKDQG CGSCWA
Sbjct: 107 EEFRKAYLSKVKKPINKGITLSRNMRRKVQSCDAPSSLDWRNYGVVTAVKDQGSCGSCWA 166
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS+ A+EGIN L TG L+SLSEQELV+CD + N GC GGYM+ AFE++ GG+ +E
Sbjct: 167 FSSTGAMEGINALVTGDLISLSEQELVECD--TSNYGCEGGYMDYAFEWVINNGGIDSES 224
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------ARYAFQLYS 267
DYPY G + C T K + V+I GY+ + + FQLY+
Sbjct: 225 DYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSDSALLCAVAQQPVSVGIDGSAIDFQLYT 284
Query: 268 HGVFDEYCG---HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
G++D C ++H V +VGYG + E+YW+VKNSWGTSWG GY + R++
Sbjct: 285 GGIYDGSCSDDPDDIDHAVLIVGYGSEDSEEYWIVKNSWGTSWGIDGYFYLKRDTDLP-Y 343
Query: 325 GICGILMQASYPVKR 339
G+C + ASYP K+
Sbjct: 344 GVCAVNAMASYPTKQ 358
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 132/319 (41%), Positives = 180/319 (56%), Gaps = 38/319 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI---DYINSQNLSFKLTDNKFAD 112
+S+ E F+ W ++ + Y E ++R+ + N++YI + L + NKFAD
Sbjct: 44 ESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFAD 103
Query: 113 LSNEEFISTYLG-YNKPYNEPRWPS-------VQYLGLPASVDWRKEGAVTPVKDQGQCG 164
LSNEEF YL KP N R + +Q P+S+DWRK+G VT VKDQG CG
Sbjct: 104 LSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDAPSSLDWRKKGVVTAVKDQGDCG 163
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCW+FS A+EGIN + TG L+SLSEQELVDCD + N GC GGYM+ AFE++ GG+
Sbjct: 164 SCWSFSTTGAIEGINAIVTGDLISLSEQELVDCD--TTNYGCEGGYMDYAFEWVINNGGI 221
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------ARYAF 263
TE +YPY G + C T K + V+I GY + + F
Sbjct: 222 DTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETDSALLCATVQQPISVGMDGSALDF 281
Query: 264 QLYSHGVFDEYCG---HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
QLY+ G++D C + ++H V +VGYG ++GE YW+VKNSWGT WG GY + RN+
Sbjct: 282 QLYTGGIYDGDCSDDPNDIDHAVLIVGYGSENGEDYWIVKNSWGTEWGMEGYFYIKRNTD 341
Query: 321 SSNIGICGILMQASYPVKR 339
G+C I +ASYP K
Sbjct: 342 LP-YGVCAINAEASYPTKE 359
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 181/318 (56%), Gaps = 37/318 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ--NLSFKLTDNKFADL 113
+S+ E F+ W ++ + Y +E ++RFG + N++YI + L ++ NKFADL
Sbjct: 37 ESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFADL 96
Query: 114 SNEEFISTYLG-YNKPYNEPRWPS-------VQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
SNEEF YL KP N+ R + +Q P+S+DWRK+G VT VKDQG CGS
Sbjct: 97 SNEEFKQLYLSKVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDCGS 156
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CW+FS A+EGIN + T L+SLSEQELVDCD + N GC GGYM+ AFE++ GG+
Sbjct: 157 CWSFSTTGAIEGINAIVTSDLISLSEQELVDCD--TTNYGCEGGYMDYAFEWVINNGGID 214
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------ARYAFQ 264
TE +YPY G + C T K + V+I GY+ + + FQ
Sbjct: 215 TEANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETDSALLCAAAQQPISVGIDGSAIDFQ 274
Query: 265 LYSHGVF---DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
LY+ G++ ++H V +VGYG ++GE YW+VKNSWGTSWG GY + RN+
Sbjct: 275 LYTGGIYDGDCSDDPDDIDHAVLIVGYGSENGEDYWIVKNSWGTSWGIEGYFYIKRNTDL 334
Query: 322 SNIGICGILMQASYPVKR 339
G+C I ASYP K
Sbjct: 335 P-YGVCAINAMASYPTKE 351
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 119/261 (45%), Positives = 159/261 (60%), Gaps = 36/261 (13%)
Query: 109 KFADLSNEEFISTYLGYN----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVK 158
+FA+++N+EF S Y GY R+ +V LP +VDWRK+GAVTP+K
Sbjct: 1 QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
+QG CG CWAFSAVAA+EG ++K GKL+SLSEQ+LVDCD N + GC+GG ++ AFE I
Sbjct: 61 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTN--DFGCSGGLIDTAFEHI 118
Query: 219 TKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------------------ 260
GG+TTE +YPY+G++ C+ T A +ITGYE +P
Sbjct: 119 MATGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGI 178
Query: 261 ----YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRM 315
+ FQ YS GVF C L+H VT VGY + G KYW++KNSWGT WGE GY+R+
Sbjct: 179 EGGGFDFQFYSSGVFTGECTTYLDHAVTAVGYSQSSAGSKYWIIKNSWGTKWGEGGYMRI 238
Query: 316 ARNSPSSNIGICGILMQASYP 336
++ G+CG+ M+ASYP
Sbjct: 239 KKDIKDKE-GLCGLAMKASYP 258
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 180/314 (57%), Gaps = 33/314 (10%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN---LSFKLTDNKFAD 112
+ + E F+ W +++ + Y +E +RR G + N++YI N + L K+ NKFAD
Sbjct: 44 EGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFAD 103
Query: 113 LSNEEFISTYLG-YNKPYN---EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
LSNEEF YL KP + + +Q P+S+DWR +G VT VKDQG CGSCW+
Sbjct: 104 LSNEEFREMYLSKVKKPITIEEKRKHRHLQTCDAPSSLDWRNKGVVTAVKDQGDCGSCWS 163
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS A+E IN + TG L+SLSEQELVDCD + N GC GG M+ AF+++ GG+ TE
Sbjct: 164 FSTTGAIEAINAIVTGDLISLSEQELVDCDT-TNNYGCEGGDMDSAFQWVIGNGGIDTEA 222
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAI-PARYA--------------------FQLYS 267
DYPY G + C T K + V+I GY + P+ A FQLY+
Sbjct: 223 DYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSDSALLCATVQQPISVGMDGSALDFQLYT 282
Query: 268 HGVFDEYCG---HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
G++D C + ++H + +VGYG ++ E YW+VKNSWGT WG GY + RN+ S
Sbjct: 283 GGIYDGDCSGDPNDIDHAILIVGYGSENDEDYWIVKNSWGTEWGMEGYFYIRRNT-SKPY 341
Query: 325 GICGILMQASYPVK 338
G+C I ASYP K
Sbjct: 342 GVCAINADASYPTK 355
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 140/331 (42%), Positives = 191/331 (57%), Gaps = 46/331 (13%)
Query: 40 IPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-----D 94
IP+ +EG +E +FE + + R Y S + R I+ +N+Q+I D
Sbjct: 19 IPSMLLTEG--------ELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNID 70
Query: 95 YINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ----YLGLPASVDWRK 150
Y N + +F ++ N F DLSNEEF +T+ GY + SV LPA+VDW
Sbjct: 71 YFNGDS-TFSVSVNNFTDLSNEEFRATFNGYRRLAAVSLADSVHADNDVEALPATVDWTT 129
Query: 151 EGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGY 210
+G VTP+K+Q QCGSCWAFSAVA++EG + LKTGKLVSLSEQ LVDC + GC+GG+
Sbjct: 130 KGVVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGW 189
Query: 211 MEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK-----TKHHAVTI-TGYEAI------- 257
M+ AF+++ + G+ TE YPY+ ++ C+ + T H V + TG E+
Sbjct: 190 MDYAFKYVIQNRGIDTEASYPYKAIDESCEFKRNSVGATIHSFVDVKTGDESALQNAVAS 249
Query: 258 ---------PARYAFQLYSHGVFDEY-CGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTS 306
A+ +FQ YS GV++E C + L+HGVT VGYG +G YW VKNSWGTS
Sbjct: 250 IGPISVAIDAAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGAPYWKVKNSWGTS 309
Query: 307 WGEAGYIRMARNSPSSNIGICGILMQASYPV 337
WG GYI M+RN + CGI +ASYPV
Sbjct: 310 WGRKGYIFMSRNKQNQ----CGIATKASYPV 336
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 114/229 (49%), Positives = 147/229 (64%), Gaps = 26/229 (11%)
Query: 133 RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
R+ +V +PA++DWR GAVTP+KDQGQCG CWAFSAVAA EGI K+ TGKL+SLSEQ
Sbjct: 7 RYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQ 66
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTIT 252
ELVDCDV E+QGC GG M+ AF+FI K GG+TTE +YPY + +C++ + A I
Sbjct: 67 ELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSG--SNSAANIK 124
Query: 253 GYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
GYE +P FQ YS GV CG L+HG+ +GYG+
Sbjct: 125 GYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGK 184
Query: 291 -DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G KYWL+KNSWGT+WGE GY+RM ++ S G+CG+ ++ SYP +
Sbjct: 185 TSDGTKYWLMKNSWGTTWGENGYLRMEKD-ISDKKGMCGLAIEPSYPTE 232
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 181/321 (56%), Gaps = 42/321 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFA 111
+ QSM ++ E W+ ++SREY E E R ++ N+++I+ N + N S+KL N+FA
Sbjct: 30 FREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFA 89
Query: 112 DLSNEEFISTYLGYN------------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKD 159
D +NEEF++ + G K + W + S DWR EGAVTPVK
Sbjct: 90 DWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTWNVSDMV--VESKDWRAEGAVTPVKY 147
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QGQCG CWAFSAVAAVEG+ K+ G LVSLSEQ+L+DCD ++GC+GG M AF ++
Sbjct: 148 QGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCD-REYDRGCDGGIMSDAFNYVV 206
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY------------------ 261
+ G+ +E+DY Y+G + C+++ A I+G++ +P+
Sbjct: 207 QNRGIASENDYSYQGSDGGCRSN--ARPAARISGFQTVPSNNERALLEAVSRQPVSVSMD 264
Query: 262 ----AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMA 316
F YS GV+D CG NH VT VGYG G KYWL KNSWG +WGE GYIR+
Sbjct: 265 ATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWGEKGYIRIR 324
Query: 317 RNSPSSNIGICGILMQASYPV 337
R+ G+CG+ A YPV
Sbjct: 325 RDVAWPQ-GMCGVAQYAFYPV 344
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 184/319 (57%), Gaps = 43/319 (13%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN---SQN-LSFKLTDNKFADL 113
++E++ ++ Q+S+ Y SE E + R I+ N + N SQ + FKL NK+AD+
Sbjct: 23 VQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNKYADM 82
Query: 114 SNEEFISTYLGYNKPYNE----------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQC 163
+ EF+ST G+NK N R+ S + LP +VDWR +GAVT VKDQG C
Sbjct: 83 LHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTEVKDQGHC 142
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCW+FSA ++EG + KTGKLVSLSEQ LVDC N GCNGG M+ AF +I GG
Sbjct: 143 GSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDNGG 202
Query: 224 VTTEDDYPYRGKNDRCQ--------TDK-----------TKHHAVTITGYEAI---PARY 261
+ TE YPY ++++C TDK AV G +I +
Sbjct: 203 IDTEKSYPYLAEDEKCHYKAQNSGATDKGFVDIEEANEDDLKAAVATVGPVSIAIDASHE 262
Query: 262 AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
FQLYS GV+ D C Q L+HGV VVGYG D G+ YWLVKNSWG SWG GYI+MARN
Sbjct: 263 TFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQDYWLVKNSWGPSWGLNGYIKMARN 322
Query: 319 SPSSNIGICGILMQASYPV 337
+ +CG+ QASYP+
Sbjct: 323 QDN----MCGVASQASYPL 337
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 180/316 (56%), Gaps = 39/316 (12%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFAD 112
+ Q+ + F W++++ R Y S +E+ R+ + N+ +I NSQ L KFAD
Sbjct: 24 FSSQTYQTSFIGWMRKHDRAY-SHEEFTDRYQAFKENMDFIHKWNSQESDTVLGLTKFAD 82
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQYLGL-------PASVDWRKEGAVTPVKDQGQCGS 165
L+NEE+ YLG N + + GL P S+DWR++GAV+ VKDQGQCGS
Sbjct: 83 LTNEEYKKHYLGI--KVNVKKNLNAAQKGLKFFKFTGPDSIDWREKGAVSQVKDQGQCGS 140
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CW+FS AVEG +++K+G +VSLSEQ LVDC NQGC GG M AFE+I GG+
Sbjct: 141 CWSFSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIA 200
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAF 263
TE YPY RC+ K+ + A I GY+ IP + +F
Sbjct: 201 TESSYPYTAAQGRCKFTKSMNGA-NIIGYKEIPQGEEDSLTAALAKQPVSVAIDASHMSF 259
Query: 264 QLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
QLYS GV+DE C + L+HGV VGYG G+ Y+++KNSWG +WG+ GYI M+RN+ +
Sbjct: 260 QLYSSGVYDEPACSSEALDHGVLAVGYGTLEGKDYYIIKNSWGPTWGQDGYIFMSRNAQN 319
Query: 322 SNIGICGILMQASYPV 337
CG+ ASYP+
Sbjct: 320 Q----CGVATMASYPI 331
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 140/339 (41%), Positives = 194/339 (57%), Gaps = 47/339 (13%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
LS+FL L + + P K DP +E W + ++Y ++ E R ++
Sbjct: 3 TLSVFLAICLAVVSAI-----PLK-DPS-----WEAWKSFHGKKYHNQGEDDFRHYVFLQ 51
Query: 89 NVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGY----NKPYNEPR-WPSVQYLGLP 143
N++ I N+++ +FK+ N+F+DL+ +EF+ TY GY K N+P + + +P
Sbjct: 52 NIKTIAAHNAKS-TFKMAINEFSDLTRKEFVKTYNGYRLSMKKSTNKPSTFMAPLNTNMP 110
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
VDWRKEG VTP+K+QG+CGSCWAFS ++EG + KTGKLVSLSEQ L+DC N
Sbjct: 111 TEVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLIDCSAAEGN 170
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE-------- 255
GC GG+M+ AFE+I G+ TE YPY G++D C+ KT A+ TGY
Sbjct: 171 DGCGGGFMDDAFEYIKLNNGIDTEASYPYEGRDDICRYKKTNKGAID-TGYMDIKQYSED 229
Query: 256 --------------AIPARY-AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWL 298
AI A + +F +Y GV+ E C L+HGV VVGYG ++GE YWL
Sbjct: 230 DLKAAVATVGPISVAIDASHKSFHMYHTGVYHEPECSQTVLDHGVLVVGYGTENGEDYWL 289
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
VKNSWGT WG GYI+M+RN ++ CGI ASYP+
Sbjct: 290 VKNSWGTDWGMNGYIKMSRNRSNN----CGIATNASYPL 324
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 131/339 (38%), Positives = 192/339 (56%), Gaps = 41/339 (12%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+ LLW P + S+ E + W+ +Y R Y + E ++R I+ N
Sbjct: 7 FCIILLWACAYPT------MSRTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKEN 60
Query: 90 VQYID-YINSQNLSFKLTDNKFADLSNEEFISTYLGY--NKPYNEPRWPSVQYL-----G 141
++YI+ + N N S+KL N+++DL++EEFI+++ G+ + ++ + SV
Sbjct: 61 LEYIENFNNVGNKSYKLGLNRYSDLTSEEFIASHTGFKVSDQLSDSKMRSVAIPFNLNDD 120
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
+P + DWR++G VT VK+Q QCG CWAF+AVAAVEGI K+K G L+SLSEQ+LVDCD
Sbjct: 121 VPTNFDWREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCD--R 178
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN-DRCQTDKTKHHAVTITGYEAIPAR 260
++ GC GG AF+ I K G+ EDDYPY+ + CQ + A I GY +PA
Sbjct: 179 QSSGCGGGDFVLAFDSIIKSRGIVKEDDYPYKANDVQTCQLGQIP-GAAQINGYFKVPAN 237
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWL 298
Y F Y GV++ CG +LNH VT++GYG + G+KYWL
Sbjct: 238 DEQQLLRAVLQQPVSVAISTSYDFHHYMGGVYEGSCGPKLNHAVTIIGYGVSEAGKKYWL 297
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+KNSWG +WGE GY+++ R S ++ G C I + A+YP
Sbjct: 298 IKNSWGETWGEKGYMKVLRESSATG-GQCSIAVHAAYPT 335
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 135/307 (43%), Positives = 178/307 (57%), Gaps = 36/307 (11%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFI 119
E + W +Y + Y S E R I+ N Y++ NS + SF+L N+FADL+ EEF
Sbjct: 27 EEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYVNEHNSMDSSFQLEVNEFADLTAEEFS 86
Query: 120 STYLGYNKPYNEPRWPSV---QYLG--LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
S Y GY K N + +Y G +P SVDWR +G VTPVK+Q QCGSCWAFS +
Sbjct: 87 SIYNGYGKGRNRENHENTTIYRYTGGAIPDSVDWRTKGLVTPVKNQKQCGSCWAFSTTGS 146
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
+EG + KTGKLVSLSEQ LVDCD ++ GC GG M AF++I + G+ TE+ YPY+
Sbjct: 147 LEGAHAKKTGKLVSLSEQNLVDCD--KKDHGCQGGLMTTAFKYIEENKGIDTEESYPYKA 204
Query: 235 KNDRCQTDKT-------KHHAVTITGYEAIPARYA---------------FQLYSHGVFD 272
KN RC+ K +H ++ T EA+ A FQLY G++D
Sbjct: 205 KNGRCEFKKDDIGATVERHVSILTTDCEALKKAVAEIGPISVAMDASHSSFQLYKSGIYD 264
Query: 273 -EYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
+ C +L+HGV VVGYG++ GE+YWLVKNSWG +WG GY ++A S +CGI
Sbjct: 265 PKICSSRKLDHGVLVVGYGKEDGEEYWLVKNSWGKNWGMEGYFKIA-----SKKNLCGIC 319
Query: 331 MQASYPV 337
A YPV
Sbjct: 320 TSACYPV 326
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 193/331 (58%), Gaps = 46/331 (13%)
Query: 40 IPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-----D 94
IP+ +EG +E +FE + + R Y S + R I+ +N+Q+I D
Sbjct: 19 IPSMLLTEG--------ELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNID 70
Query: 95 YINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQ----YLGLPASVDWRK 150
Y N + +F ++ N F DLSNEEF +T+ GY + SV LPA+VDW
Sbjct: 71 YFNGDS-TFSVSVNNFTDLSNEEFRATFNGYRRLAAVSLADSVHADNDVEALPATVDWTT 129
Query: 151 EGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGY 210
+G VTP+K+Q QCGSCWAFSAVA++EG + LKTGKLVSLSEQ LVDC + GC+GG+
Sbjct: 130 KGVVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGW 189
Query: 211 MEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK-----TKHHAVTI-TGYE--------- 255
M+ AF+++ + G+ TE YPY+ ++ C+ + T H V + TG E
Sbjct: 190 MDYAFKYVIQNRGIDTEASYPYKAIDESCEFKRNSIGATIHSFVDVKTGDESALQNAVAS 249
Query: 256 ------AIPA-RYAFQLYSHGVFDEY-CGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTS 306
AI A + +FQ YS GV++E C + L+HGVT VGYG +G YW VKNSWGTS
Sbjct: 250 IGPISVAIDASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGVPYWKVKNSWGTS 309
Query: 307 WGEAGYIRMARNSPSSNIGICGILMQASYPV 337
WG+ GYI M+RN + CGI +ASYPV
Sbjct: 310 WGQKGYIFMSRNKQNQ----CGIATKASYPV 336
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 182/320 (56%), Gaps = 39/320 (12%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFA 111
Y P S+ + + W+ Q+SR Y E E Q R + + N+++I+ N+ N S+KL N+F
Sbjct: 30 YKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFT 89
Query: 112 DLSNEEFISTYLGY-----NKPY-----NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
D + EEF++TY G P+ +P W L + DWR EGAVTPVK QG
Sbjct: 90 DWTKEEFLATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTPVKSQG 149
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
+CG CWAFSA+AAVEG+ K+ G L+SLSEQ+L+DC +N GC GG AF +I K
Sbjct: 150 ECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDC-TREQNNGCKGGTFVNAFNYIIKH 208
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------A 259
G+++E++YPY+ K C+++ A+ I G+E +P +
Sbjct: 209 RGISSENEYPYQVKEGPCRSN--ARPAILIRGFENVPSNNERALLEAVSRQPVAVAIDAS 266
Query: 260 RYAFQLYSHGVFD-EYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMAR 317
F YS GV++ CG +NH VT+VGYG G KYWL KNSWG +WGE GYIR+ R
Sbjct: 267 EAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPEGMKYWLAKNSWGKTWGENGYIRIRR 326
Query: 318 NSPSSNIGICGILMQASYPV 337
+ G+CG+ ASYPV
Sbjct: 327 DVEWPQ-GMCGVAQYASYPV 345
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 231 bits (589), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 127/294 (43%), Positives = 173/294 (58%), Gaps = 43/294 (14%)
Query: 82 RFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV 137
R ++ N+Q++D N+ +F+L N+FADL+NEE+ + +L + ++ R +
Sbjct: 71 RLEVFKENLQFVDKHNAAADRGEHTFRLGMNRFADLTNEEYRTRFL---RDFSRLRRSAS 127
Query: 138 QYLG----------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
+ LP S+DWR++GAV PVK+QG CGSCWAFS VAAVEGIN++ TG L+
Sbjct: 128 GKISSRYRLREGDDLPDSIDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLI 187
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SLSEQ+LVDC + N GC GG+M AF+FI GG+ +E+ YPYRG+N C +
Sbjct: 188 SLSEQQLVDC--TTANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNS-TVNAP 244
Query: 248 AVTITGYEAIP----------------------ARYAFQLYSHGVFDEYCGHQLNHGVTV 285
V+I YE +P A FQLY G+F C NH +TV
Sbjct: 245 VVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTV 304
Query: 286 VGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
VGYG ++ + Y VKNSWG +WGE+GYIR+ RN + N G CGI ASYPVK+
Sbjct: 305 VGYGTENDKDYRTVKNSWGKNWGESGYIRVERNIGNPN-GKCGITRFASYPVKK 357
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 127/308 (41%), Positives = 179/308 (58%), Gaps = 37/308 (12%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFIST 121
E W+ Q+ + Y E +R I+ +N+++I+ + + SF L+ N+FADL +EEF +
Sbjct: 33 EKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFDVCGDKSFNLSTNQFADLHDEEFKAL 92
Query: 122 YL-GYNKPYNEPRWPSVQYL-------GLPASVDWRKEGAVTPVKDQGQCGSCWAFS-AV 172
G+ K ++ W + + L +PAS+DWRK G VTP+KDQG+C SCWAFS V
Sbjct: 93 LTNGHKKEHS--LWTTTETLFRYDNVTKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLCV 150
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
A +EG++++ T +LV LSEQELVD V E++GC G Y+E AF+FITK G + +E YPY
Sbjct: 151 ATIEGLHQIITSELVPLSEQELVDF-VKGESEGCYGDYVEDAFKFITKKGRIESETHYPY 209
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
+G N+ C+ K H I GY+ +P++ AFQ YS G+
Sbjct: 210 KGVNNTCKVKKETHGVAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQFYSSGI 269
Query: 271 FDEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
F CG +H V + YGE G KYWL KNSWGT WGE GYIR+ + P+ G+CGI
Sbjct: 270 FTGKCGTDTDHRVALASYGESGDGTKYWLAKNSWGTEWGEKGYIRIKXDIPAKE-GLCGI 328
Query: 330 LMQASYPV 337
YP+
Sbjct: 329 AKYPYYPI 336
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 231 bits (588), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 125/280 (44%), Positives = 162/280 (57%), Gaps = 37/280 (13%)
Query: 90 VQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYN--------EPRWPSVQYL 140
+++ID N+ N S+K+ N+FADL+ EEF STYLG+ N EPR V
Sbjct: 1 LRFIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFTGGSNKTKVSNRYEPRVSQV--- 57
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP+ VDWR GAV +K QG+CG CWAFSA+A VEGINK+ TG L+SLSEQEL+ C
Sbjct: 58 -LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGT 116
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-- 258
+GCNGGY+ F+FI GG+ T ++YPY ++ C D VTI Y +P
Sbjct: 117 QNTRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPYN 176
Query: 259 --------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
A AF+ YS G+F CG ++H VT+VGYG + G YW+
Sbjct: 177 NEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWI 236
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
V+NSW T+WGE GY+R+ RN + G CGI SYPVK
Sbjct: 237 VENSWDTTWGEEGYMRILRNVGGA--GTCGIATMPSYPVK 274
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 199/342 (58%), Gaps = 40/342 (11%)
Query: 30 LSLFLLWVLGIPAGAWS-EGYPQKYDPQSMEER----FENWLKQYSREYGSEDEWQRRFG 84
+++ L + + G +S GY Q D + ER F +W+ +++ Y + DE RF
Sbjct: 13 VAICLFVHMSVSFGDFSIVGYSQ--DDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFE 70
Query: 85 IYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG------YNKPYNEPRWPSVQ 138
I+ N+ YID N +N S++L N+FADLSN+EF Y+G + Y+E + +
Sbjct: 71 IFKDNLNYIDETNKKNNSYRLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDE-EFINED 129
Query: 139 YLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
+ LP +VDWRK+GAVTPV+ QG CGSCWAFSAVA VEGINK++TGKLV LSEQELVDC+
Sbjct: 130 IVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCE 189
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG----- 253
S GC GGY A E++ K G+ YPY+ K C+ + V +G
Sbjct: 190 RRS--HGCKGGYPPYALEYVAK-NGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQ 246
Query: 254 -------YEAIPAR----------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKY 296
AI + FQLY G+F+ CG +++H VT VGYG+ G+ Y
Sbjct: 247 PNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGY 306
Query: 297 WLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
L+KNSWGT+WGE GYIR+ R +P ++ G+CG+ + YP+K
Sbjct: 307 ILIKNSWGTAWGEKGYIRIKR-APGNSPGVCGLYKSSYYPIK 347
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 230 bits (587), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 134/310 (43%), Positives = 180/310 (58%), Gaps = 37/310 (11%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFADLSNEE 117
E +E+W K++ + Y S+ E R I+ +N +Y+D N+ + F + N+FADL + E
Sbjct: 20 EEWESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSE 79
Query: 118 FISTYLGYN-KPY---NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
F Y GYN KP + + S + LP SVDWR +G VT +K+QGQCGSCWAFSAVA
Sbjct: 80 FGRLYNGYNNKPSMKKAQSKVFSTKVGDLPTSVDWRTKGFVTAIKNQGQCGSCWAFSAVA 139
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
+EG + TG LVSLSEQ LVDC NQGCNGG M+ AF+++ K GG+ TE YPY+
Sbjct: 140 GLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTEASYPYK 199
Query: 234 GKNDRCQTDKTKHHAVTITGYE-----------------------AIPARY-AFQLYSHG 269
+ +C+ + + T +G+ AI A + +FQLY G
Sbjct: 200 AVDQKCKFN-AANVGSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQLYKSG 258
Query: 270 VFDEYCGHQ--LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
V+ E Q L+HGVT VGY G YW+VKNSWGT+WG+AGYI M+RN + C
Sbjct: 259 VYSESACSQTSLDHGVTAVGYDSSSGVAYWIVKNSWGTTWGQAGYIWMSRNKNNQ----C 314
Query: 328 GILMQASYPV 337
GI ASYP+
Sbjct: 315 GIATAASYPI 324
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 180/314 (57%), Gaps = 45/314 (14%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEE 117
+E WL ++ + Y S E +RF I+ N++YID N N ++F L N+FADL+ +E
Sbjct: 34 YEEWLVKHQKLYSSLGEKIKRFEIFKDNLRYIDQQNHYNKVNHMNFTLGLNQFADLTLDE 93
Query: 118 FISTYLGYNKPYNE-----PRWPSVQ-------YLGLPASVDWRKEGAVTPVKDQGQCGS 165
F S YLG + Y + P V+ + LP SVDWR++G V P+++QG+CGS
Sbjct: 94 FSSIYLGTSVDYEQIISSNPNHDDVEEDILKEDVVELPDSVDWREKGVVFPIRNQGKCGS 153
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CW FSAVA++E +N +K G +++LSEQEL+DC+ S QGC GG+ AF ++ K G+T
Sbjct: 154 CWTFSAVASIETLNGIKKGHMIALSEQELLDCETIS--QGCKGGHYNNAFAYVAK-NGIT 210
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA----------------------F 263
+E+ YPY + +C K V I+GY+ +P F
Sbjct: 211 SEEKYPYIFRQGQCY---QKEKVVKISGYKRVPRNNGGQLQSAVAQQVVSVAVKCESKDF 267
Query: 264 QLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
Q Y G+F CG L+H V +VGYG G YW+++NSWGT+WGE GY+R+ +NS
Sbjct: 268 QFYDRGIFSGACGPILDHAVNIVGYGSKGGANYWIMRNSWGTNWGENGYMRIQKNSKHYE 327
Query: 324 IGICGILMQASYPV 337
G CGI MQ SYPV
Sbjct: 328 -GHCGIAMQPSYPV 340
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 229 bits (585), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 139/305 (45%), Positives = 184/305 (60%), Gaps = 39/305 (12%)
Query: 67 KQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTY 122
KQ+ R Y +E + RF I+ N+QYI+ N + S+ L N+FAD+ NEEF Y
Sbjct: 47 KQHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEF-RMY 105
Query: 123 LGYNKPYNEPRWPSV------QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
G + YN R +YL P VDWRK+G VT VK+QGQCGSCW+FS ++E
Sbjct: 106 NGLRRDYNYSREVQCSNHLTPEYLVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGSLE 165
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
G + K+GKLVSLSEQ+LVDC N+GCNGG M++AFE+I GG+ TE++YPY +
Sbjct: 166 GQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPYDARQ 225
Query: 237 DRCQTDKTKHHAVTI------TGYE---------------AIPARY-AFQLYSHGVFDE- 273
+RC K++ A +G E AI A + +FQLYS GV+DE
Sbjct: 226 ERCHFKKSEVAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVYDEP 285
Query: 274 YCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
C +L+HGV VVGYG D G+ YWLVKNSWGT+WG GY++M+RN + CG+ Q
Sbjct: 286 KCSSTELDHGVLVVGYGTDDGQDYWLVKNSWGTTWGLEGYVKMSRNQDNQ----CGVATQ 341
Query: 333 ASYPV 337
ASYP+
Sbjct: 342 ASYPL 346
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 229 bits (583), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 133/314 (42%), Positives = 172/314 (54%), Gaps = 37/314 (11%)
Query: 55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLS 114
P F N+ +Y + Y +E RFGI+ +NV I N++NL+F L N+F DL+
Sbjct: 20 PPDYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLT 79
Query: 115 NEEFISTYLGYNKPYNE----PRWPSVQYLGLP--ASVDWRKEGAVTPVKDQGQCGSCWA 168
EE ++Y G KP + PR + +Y G P +SVDW +G VTPVK+QGQCGSCW+
Sbjct: 80 QEELAASYTGL-KPASLWSGLPRLSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWS 138
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS A+EG L TG LVSLSEQ+ VDCD + + GCNGG+M+ AF F K + TE
Sbjct: 139 FSTTGALEGAWALSTGNLVSLSEQQFVDCD--TTDSGCNGGWMDNAFSFAKK-NSICTEG 195
Query: 229 DYPYRGKNDRCQ----------------TD-KTKHHAVTITGYEAIPA-------RYAFQ 264
YPY + C TD T ++ P +Y+FQ
Sbjct: 196 SYPYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQ 255
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
LYS GV CG +L+HGV VGYG + G YW VKNSWG+SWGE GY+R+ R
Sbjct: 256 LYSSGVLTASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRG--KGGA 313
Query: 325 GICGILM-QASYPV 337
G CG+L SYPV
Sbjct: 314 GECGLLAGPPSYPV 327
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 228 bits (582), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 133/323 (41%), Positives = 180/323 (55%), Gaps = 47/323 (14%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTD-----NKFADL 113
+E FE W++++ + Y E RR+ + SN+ ++ N++ + N FADL
Sbjct: 48 QELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADL 107
Query: 114 SNEEFISTY------------LGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
SNEEF Y G + E R V PAS+DWRK GAVT VK+QG
Sbjct: 108 SNEEFREVYSSRVLRKKAAEGRGARRRAGEGR--VVAGCDAPASLDWRKRGAVTAVKNQG 165
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
CGSCWAFS+ A+EGIN + TG+L+SLSEQELVDCD + N+GC+GGYM+ AFE++
Sbjct: 166 DCGSCWAFSSTGAMEGINAITTGELISLSEQELVDCD--TTNEGCDGGYMDYAFEWVINN 223
Query: 222 GGVTTEDDYPYRGKNDR-CQTDKTKHHAVTITGYEAIPARYA------------------ 262
GG+ +E +YPY G+ D C T K + V+I GYE + +
Sbjct: 224 GGIDSEANYPYTGQADSVCNTTKEEIKVVSIDGYEDVATSESALLCAAVQQPVSVGIDGS 283
Query: 263 ---FQLYSHGVFDEYCG---HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMA 316
FQLY+ G++D C ++H V VVGYG+ G YW+VKNSWGT WG GYI +
Sbjct: 284 SLDFQLYAGGIYDGDCSGNPDDIDHAVLVVGYGQQGGTDYWIVKNSWGTDWGMQGYIYIR 343
Query: 317 RNSPSSNIGICGILMQASYPVKR 339
RN+ G+C I ASYP K+
Sbjct: 344 RNT-GLPYGVCAIDAMASYPTKQ 365
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 139/319 (43%), Positives = 187/319 (58%), Gaps = 44/319 (13%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLTDNKFADL 113
++E++ + ++++Y S+ E + R I+ N + N +Q L SFKL NK+AD+
Sbjct: 23 VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADM 82
Query: 114 SNEEFISTYLGYNKPYNEPRW----PSVQYL-----GLPASVDWRKEGAVTPVKDQGQCG 164
+ EF+ G+N+ + R SV +L LP +DWR +GAVTPVKDQGQCG
Sbjct: 83 LHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCG 142
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCW+FSA ++EG + K+GKLVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 143 SCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGI 202
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGY----------------------EAIPARY- 261
TE YPY+ ++++C K K+ T GY AI A +
Sbjct: 203 DTEQAYPYKAEDEKCHY-KPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQ 261
Query: 262 AFQLYSHGVFDE--YCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQLYS GV+ E QL+HGV VVGYG ED G YWLVKNSWG SWG+ GYI+MARN
Sbjct: 262 SFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARN 321
Query: 319 SPSSNIGICGILMQASYPV 337
++ CGI +ASYP+
Sbjct: 322 RDNN----CGIATEASYPL 336
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 133/314 (42%), Positives = 172/314 (54%), Gaps = 37/314 (11%)
Query: 55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLS 114
P F N+ +Y + Y +E RFGI+ +NV I N++NL+F L N+F DL+
Sbjct: 20 PPDYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALGVNEFTDLT 79
Query: 115 NEEFISTYLGYNKPYNE----PRWPSVQYLGLP--ASVDWRKEGAVTPVKDQGQCGSCWA 168
EEF ++Y G KP + PR + +Y G P +SVDW +G VTPVK+QGQCGSCW+
Sbjct: 80 QEEFAASYTGL-KPASLWSGLPRLSTHEYNGAPLASSVDWTTQGVVTPVKNQGQCGSCWS 138
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS A+EG L TG LVSLSEQ+ DCD + + GCNGG+M+ AF F K + TE
Sbjct: 139 FSTTGALEGAWALSTGNLVSLSEQQFEDCD--TTDSGCNGGWMDNAFSFAKK-NSICTEG 195
Query: 229 DYPYRGKNDRCQ----------------TD-KTKHHAVTITGYEAIPA-------RYAFQ 264
YPY + C TD T ++ P +Y+FQ
Sbjct: 196 SYPYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIAIEADQYSFQ 255
Query: 265 LYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
LYS GV CG +L+HGV VGYG + G YW VKNSWG+SWGE GY+R+ R
Sbjct: 256 LYSSGVLTASCGTRLDHGVLAVGYGSEAGTDYWKVKNSWGSSWGEQGYVRLQRG--KGGA 313
Query: 325 GICGILM-QASYPV 337
G CG+L SYPV
Sbjct: 314 GECGLLAGPPSYPV 327
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 134/316 (42%), Positives = 180/316 (56%), Gaps = 40/316 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNE 116
+ E + W+ ++SR Y E E Q RF ++ N+++I+ N + + ++KL N+FAD + E
Sbjct: 34 VAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTKE 93
Query: 117 EFISTYLGYNK----PYNE------PRWP-SVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
EFI+T+ G P +E P W +V + P DWR EGAVTPVK QGQCG
Sbjct: 94 EFIATHTGLKGFNGIPSSEFVDEMIPSWNWNVSDVAGPEIKDWRYEGAVTPVKYQGQCGC 153
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFS+VAAVEG+ K+ G LVSLSEQ+L+DCD +N GCNGG M AF +I K G+
Sbjct: 154 CWAFSSVAAVEGLTKIVGGNLVSLSEQQLLDCDRERDN-GCNGGIMSDAFSYIIKNRGIA 212
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY----------------------AF 263
+E YPY+ C+ + + I G++ +P+ F
Sbjct: 213 SEASYPYQETEGTCRYNAKP--SAWIRGFQTVPSNNERALLEAVSRQPVSVSIDADGPGF 270
Query: 264 QLYSHGVFDE-YCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
YS GV+DE YCG +NH VT VGYG G KYWL KNSWG +WGE GYIR+ R+
Sbjct: 271 MHYSGGVYDEPYCGTDVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAW 330
Query: 322 SNIGICGILMQASYPV 337
G+CG+ A YPV
Sbjct: 331 PQ-GMCGVAQYAFYPV 345
>gi|222632170|gb|EEE64302.1| hypothetical protein OsJ_19139 [Oryza sativa Japonica Group]
Length = 1105
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 112/219 (51%), Positives = 142/219 (64%), Gaps = 24/219 (10%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
+P +VDWR+ GAVT VKDQG CG+CW+FSA A+EGINK+KTG L+SLSEQEL+DCD S
Sbjct: 129 VPDAVDWRQSGAVTKVKDQGSCGACWSFSATGAMEGINKIKTGSLISLSEQELIDCD-RS 187
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
N GC GG M+ A++F+ K GG+ TE DYPYR + C +K K VTI GY+ +PA
Sbjct: 188 YNSGCGGGLMDYAYKFVVKNGGIDTEADYPYRETDGTCNKNKLKRRVVTIDGYKDVPANN 247
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
AFQLYS G+FD C L+H + +VGYG + G+ YW+V
Sbjct: 248 EDMLLQAVAQQPVSVGICGSARAFQLYSKGIFDGPCPTSLDHAILIVGYGSEGGKDYWIV 307
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
KNSWG SWG GY+ M RN+ +SN G+CGI S+P K
Sbjct: 308 KNSWGESWGMKGYMYMHRNTGNSN-GVCGINQMPSFPTK 345
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 134/347 (38%), Positives = 189/347 (54%), Gaps = 54/347 (15%)
Query: 17 IAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSE 76
+A+ + L A+L +F W A Q + ++ E+ E W+ ++ R Y
Sbjct: 1 MALSLEKKLAIALLVVFSTWASQAMA-------RQLINEDALVEKHEQWMARHGRTYQDS 53
Query: 77 DEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP 135
+E +RRF I+ SN++YID N + N +++L N FADLS+EE+++TY P
Sbjct: 54 EEKERRFQIFKSNLEYIDNFNKASNQTYQLGLNNFADLSHEEYVATYTARKMP------- 106
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
+ +P S+DWR GAVTP+K+Q QCG CWAFSA AAVEGI VSLS Q+L+
Sbjct: 107 ----VEVPESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGI----VANGVSLSAQQLL 158
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DC S+NQGC GG+M AF +I + G+ E DYPY+ C ++ A I+G+E
Sbjct: 159 DCV--SDNQGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQMC---SSRMAAAQISGFE 213
Query: 256 AIPARYA-----------------------FQLYSHGVFDEY-CGHQLNHGVTVVGYG-E 290
+ + F+LY GVF CG+ +H VT+VGYG
Sbjct: 214 DVTPKDEEALMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVTLVGYGTS 273
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+ G KYWL KNSWG +WGE+GY+R+ R+ G CGI + ASYP
Sbjct: 274 EDGTKYWLAKNSWGETWGESGYMRLQRDIGLEG-GPCGIALYASYPT 319
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 182/320 (56%), Gaps = 42/320 (13%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-----DYINSQNLSFKLTDNKFA 111
S+ + F W +++ + Y SE+E + R I++ N +++ +Y N ++ F + N A
Sbjct: 63 SLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHF-VGLNHLA 121
Query: 112 DLSNEEFISTYLGYNKPYNEPRWP----SVQYLGL--PASVDWRKEGAVTPVKDQGQCGS 165
DL+ +EF LGYN R P + +Y + P +DW GAVTPVK+Q QCGS
Sbjct: 122 DLTKDEF-KKMLGYNAALRASRAPVDASTWEYADVTPPEEIDWVASGAVTPVKNQKQCGS 180
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFS AVEG+N +KTGKL+SLSE+EL+ C N N GCNGG M+ FE+I G+
Sbjct: 181 CWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNG-NMGCNGGLMDNGFEWIVNNRGID 239
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA----------------------RYAF 263
TED + Y K ++C + H AV I G++ +P+ +F
Sbjct: 240 TEDGWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQSF 299
Query: 264 QLYSHGVFD-EYCGHQLNHGVTVVGYGED----HGEKYWLVKNSWGTSWGEAGYIRMARN 318
QLY+ GV+ + CG +L+HGV +VGYG D + +W +KNSWG +WGE GYIR+A+
Sbjct: 300 QLYAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAKG 359
Query: 319 SPSSNIGICGILMQASYPVK 338
S G CG+ MQ SYP K
Sbjct: 360 G-SGVEGQCGVAMQPSYPTK 378
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 137/348 (39%), Positives = 189/348 (54%), Gaps = 50/348 (14%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+ LFL+ + I A + + + + + M + E + + Y S+ E + R I+ N
Sbjct: 1 MKLFLILFITIFATVHAVSFFELVNQEWMTFKME-----HKKAYKSDVEERFRMKIFMDN 55
Query: 90 VQYIDYINS----QNLSFKLTDNKFADLSNEEFISTYLGYNKPYN----EPRWP------ 135
I NS + +S+KL NK+ D+ + EF++ G+NK N R P
Sbjct: 56 KHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASFI 115
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
+ LP VDWRKEGAVTPVKDQG CGSCW+FSA A+EG + +TG LVSLSEQ L+
Sbjct: 116 EPANVALPKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLI 175
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DC N GCNGG M++AF++I G+ TE YPY +ND+C+ + A+ + GY
Sbjct: 176 DCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDV-GYI 234
Query: 256 AIP-----------------------ARYAFQLYSHGVF--DEYCGHQLNHGVTVVGYG- 289
IP + +FQ YS GV+ E +L+HGV V+GYG
Sbjct: 235 DIPTGNEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGT 294
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
++GE YWLVKNSWG +WG GYI+MARN + CGI ASYP+
Sbjct: 295 NENGEDYWLVKNSWGETWGNNGYIKMARNK----LNHCGIASSASYPL 338
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 128/304 (42%), Positives = 175/304 (57%), Gaps = 33/304 (10%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEF 118
E++ QY R+YG E R ++ N Q ++ N + ++FK+ N+F D++NEEF
Sbjct: 13 EHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEEF 72
Query: 119 ISTYLGYNKPYNEPRWPSVQYLGLP--ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
+ GY K G P A VDWR +GAVTPVKDQGQCGSCWAFSA ++E
Sbjct: 73 NAVMKGYKKGSRGEPTTVFTAEGRPMAADVDWRTKGAVTPVKDQGQCGSCWAFSATGSLE 132
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
G + LK +LVSLSEQELVDC N GC GG+M AF++I GG+ TE YPY ++
Sbjct: 133 GQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAQD 192
Query: 237 DRCQ------------------TDKTKHHAVTITGYEAI---PARYAFQLYSHGV-FDEY 274
C+ T++ H AV+ G ++ + ++FQ YS GV +++
Sbjct: 193 RSCRFDANSIGATCTGFVEVQHTEEALHEAVSDIGPISVAIDASHFSFQFYSSGVYYEKK 252
Query: 275 CG-HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
C L+HGV VGYG + E YWLVKNSWG+ WG+AGYI+M+RN ++ CGI +
Sbjct: 253 CSPTNLDHGVLAVGYGTESTEDYWLVKNSWGSGWGDAGYIKMSRNRDNN----CGIASEP 308
Query: 334 SYPV 337
SYP
Sbjct: 309 SYPT 312
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 139/319 (43%), Positives = 187/319 (58%), Gaps = 44/319 (13%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLTDNKFADL 113
++E++ + ++++Y S+ E + R I+ N + N +Q L SFKL NK+AD+
Sbjct: 23 VQEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADM 82
Query: 114 SNEEFISTYLGYNKPYNEPRW----PSVQYL-----GLPASVDWRKEGAVTPVKDQGQCG 164
+ EF+ G+N+ + R SV +L LP +DWR +GAVTPVKDQGQCG
Sbjct: 83 LHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCG 142
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCW+FSA ++EG + K+GKLVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 143 SCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGI 202
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGY----------------------EAIPARY- 261
TE YPY+ ++++C K K+ T GY AI A +
Sbjct: 203 DTEQAYPYKAEDEKCHY-KPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQ 261
Query: 262 AFQLYSHGVF--DEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQLYS GV+ E QL+HGV VVGYG ED G YWLVKNSWG SWG+ GYI+MARN
Sbjct: 262 SFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARN 321
Query: 319 SPSSNIGICGILMQASYPV 337
++ CGI +ASYP+
Sbjct: 322 RDNN----CGIATEASYPL 336
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 139/319 (43%), Positives = 187/319 (58%), Gaps = 44/319 (13%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLTDNKFADL 113
++E++ + ++++Y SE E + R I+ N + N +Q L SFKL NK+AD+
Sbjct: 23 VQEQWGAFKMTHNKQYQSETEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADM 82
Query: 114 SNEEFISTYLGYNKPYNEPRW----PSVQYL-----GLPASVDWRKEGAVTPVKDQGQCG 164
+ EF+ G+N+ + R SV +L LP +DWR +GAVTPVKDQGQCG
Sbjct: 83 LHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCG 142
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCW+FSA ++EG + ++GKLVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 143 SCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGI 202
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGY----------------------EAIPARY- 261
TE YPY+ ++++C K K+ T GY AI A +
Sbjct: 203 DTEQAYPYKAEDEKCHY-KPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAIDASHQ 261
Query: 262 AFQLYSHGVFDE--YCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQLYS GV+ E QL+HGV VVGYG ED G YWLVKNSWG SWG+ GYI+MARN
Sbjct: 262 SFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDGTDYWLVKNSWGKSWGDQGYIKMARN 321
Query: 319 SPSSNIGICGILMQASYPV 337
++ CGI +ASYP+
Sbjct: 322 RNNN----CGIATEASYPL 336
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 228 bits (580), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 126/314 (40%), Positives = 175/314 (55%), Gaps = 34/314 (10%)
Query: 49 YPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDN 108
+ +DP + F +W++++ + Y +E E+ R+ ++ N YI+ N QN SF L N
Sbjct: 19 FAVSHDP--LTGVFADWMQEHQKSYANE-EFVYRWNVWRENYLYIEAHNHQNKSFHLAMN 75
Query: 109 KFADLSNEEFISTYLGYNKPYNEPRWPS--VQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
KF DL+N EF + G + ++ + S GLPA DWR++GAVT VK+QGQCGSC
Sbjct: 76 KFGDLTNAEFNKLFKGLSITADQAKQESDIAPAPGLPADFDWRQKGAVTHVKNQGQCGSC 135
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
W+FS + EG N LK G+L SLSEQ LVDC + N GCNGG M+ AFE+I + G+ T
Sbjct: 136 WSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGNHGCNGGLMDYAFEYIIRNKGIDT 195
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQ 264
E+ YPY C+ +K +H + Y +P + +FQ
Sbjct: 196 EESYPYHASQGTCRYNK-QHSGGELVSYTNVPSGNEGALLNAVATQPTSVAIDASHSSFQ 254
Query: 265 LYSHGVFDE--YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
Y GV+DE +L+HGV VG+G G+ YWLVKNSWG WG +GYI M+RN +
Sbjct: 255 FYKGGVYDEPACSSSRLDHGVLAVGWGVRDGKDYWLVKNSWGADWGLSGYIEMSRNKHNQ 314
Query: 323 NIGICGILMQASYP 336
CGI AS+P
Sbjct: 315 ----CGIATAASHP 324
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 133/314 (42%), Positives = 177/314 (56%), Gaps = 38/314 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
+ +E + Q+++ Y S E RF I++ N + N++ +S+KL NKF DL
Sbjct: 23 LRTEWEAFKSQHNKAYSSHVEELLRFKIFTENTLLVAKHNAKYAKGLVSYKLAMNKFGDL 82
Query: 114 SNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
EF GY N+ + P+ + LP +VDWRK+GAVTPVK+QGQCGSCW
Sbjct: 83 LPHEFAKMVNGYRGKQNKEQRPTFIPPANLNDSSLPTTVDWRKKGAVTPVKNQGQCGSCW 142
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS ++EG + KTGKLVSLSEQ LVDC + NQGCNGG M+ F++I GG+ TE
Sbjct: 143 AFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDFGNQGCNGGLMDNGFQYIKANGGIDTE 202
Query: 228 DDYPYRGKNDRCQTDKTK-------------------HHAVTITG--YEAIPARY-AFQL 265
+ +PY ++ C+ K AV G AI A + +FQL
Sbjct: 203 ESHPYTAQDGDCKFKKADVGATDAGFVDIQQGSEDDLKKAVATVGPVSVAIDASHGSFQL 262
Query: 266 YSHGVFDE--YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV+DE QL+HGV VGYG +G+KYWLVKNSWG WG+ GYI M+R+ +
Sbjct: 263 YSQGVYDEPDCSSSQLDHGVLTVGYGVKNGKKYWLVKNSWGGDWGDNGYILMSRDKDNQ- 321
Query: 324 IGICGILMQASYPV 337
CGI ASYP+
Sbjct: 322 ---CGIASSASYPL 332
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 132/343 (38%), Positives = 191/343 (55%), Gaps = 42/343 (12%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQ 80
MR++L A++ FL+ V I A + + + + F+NW+ ++ + Y + DE+
Sbjct: 1 MRIIL--ALVFCFLI-VNCISA-------ARVFSQKQYQTAFQNWMVKHQKSY-TNDEFG 49
Query: 81 RRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRW--PSVQ 138
R+ I+ N+ ++ N + L N ADL+N+E+ YLG +P
Sbjct: 50 SRYTIFQDNMDFVTKWNQKGSDTILGLNSMADLTNQEYQRIYLGTKTTVKKPNLIIGVTD 109
Query: 139 YLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
PASVDWR GAVT VK+QGQCG C++FS +VEGI+++ + +LVSLSEQ+++DC
Sbjct: 110 VSKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCS 169
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI- 257
+ N GC+GG M +FE+I +GG+ TE YPY G +C+ +K A TITGY+ +
Sbjct: 170 GSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKANIGA-TITGYKNVK 228
Query: 258 ---------------------PARYAFQLYSHGVFDE-YCGH-QLNHGVTVVGYGEDHGE 294
++ +FQLYS GV+ E C QL+HGV VGYG G+
Sbjct: 229 SGSESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYGSQSGQ 288
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YW+VKNSWG WGE G+I MARN ++ CGI ASYP
Sbjct: 289 DYWIVKNSWGADWGEKGFILMARNKHNN----CGIATMASYPT 327
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 139/364 (38%), Positives = 196/364 (53%), Gaps = 54/364 (14%)
Query: 21 MRMMLRNAVLSLFLLWVLGIPAGAWS---EGYPQKYDPQSME-----------ERFENWL 66
M L+ + LFL+W G+W+ G P +Y ++E E F+ W
Sbjct: 1 MGCQLKTQLFLLFLVW------GSWTFLCYGLPSEYSILALEIDKFPSEEGVIELFQRWK 54
Query: 67 KQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS---FKLTDNKFADLSNEEFISTYL 123
++ + Y S D+ + RF + N++YI NS+ +S L N+FAD+SNEEF S +
Sbjct: 55 EENKKIYRSPDQEKLRFENFKRNLKYIAEKNSKRISPYGQSLGLNRFADMSNEEFKSKFT 114
Query: 124 G-YNKPYNEPRWPSVQYLGL---PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGIN 179
KP+++ S + P S+DWRK+G VT VKDQG CG CWAFS+ A+EGIN
Sbjct: 115 SKVKKPFSKRNGLSGKDHSCEDAPYSLDWRKKGVVTAVKDQGYCGCCWAFSSTGAIEGIN 174
Query: 180 KLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC 239
+ +G L+SLSE ELVDCD N GC+GG+M+ AFE++ GG+ TE +YPY G + C
Sbjct: 175 AIVSGDLISLSEPELVDCD--RTNDGCDGGHMDYAFEWVMHNGGIDTETNYPYSGADGTC 232
Query: 240 QTDKTKHHAVTITGYEAIP---------------------ARYAFQLYSHGVFDEYCG-- 276
K + + I GY + + + FQLY G++D C
Sbjct: 233 NVAKEETKVIGIDGYYNVEQSDRSLLCATVKQPISAGIDGSSWDFQLYIGGIYDGDCSSD 292
Query: 277 -HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASY 335
++H + VVGYG + E YW+VKNSWGTSWG GYI + RN+ + G+C I ASY
Sbjct: 293 PDDIDHAILVVGYGSEGDEDYWIVKNSWGTSWGMEGYIYIRRNT-NLKYGVCAINYMASY 351
Query: 336 PVKR 339
P K
Sbjct: 352 PTKE 355
>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 291
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 119/220 (54%), Positives = 140/220 (63%), Gaps = 26/220 (11%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
+P+SVDWR++GAVT VKDQGQCGSCWAFS +AAVEGIN ++T L SLSEQ+LVDCD S
Sbjct: 61 VPSSVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCDTKS 120
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR- 260
N GCNGG M+ AF++I K GGV ED YPY+ + +K VTI GYE +PA
Sbjct: 121 -NAGCNGGLMDYAFQYIAKHGGVAAEDAYPYKARQ-ASSCNKKPSAVVTIDGYEDVPAND 178
Query: 261 ---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGED-HGEKYWL 298
FQ YS GVF CG +L+HGV VGYG G KYW+
Sbjct: 179 ETALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDGTKYWI 238
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
VKNSWG WGE GYIRM R+ G+CGI M+ASYPVK
Sbjct: 239 VKNSWGPEWGEKGYIRMKRDVEDKE-GLCGIAMEASYPVK 277
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 138/308 (44%), Positives = 173/308 (56%), Gaps = 36/308 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-SFKLTDNKFADLSNEEFIS 120
F W ++R+Y S E R IY SN++ I+ N+ S+ L N+F DL++ EF +
Sbjct: 21 FAEWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAA 80
Query: 121 TYLG--YNKPYNEPRWPSVQYL----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
YLG +N + S YL LP SVDWR G VTPVK+QGQCGSCW+FS +
Sbjct: 81 KYLGVRFNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGS 140
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
VEG + KTG LVSLSEQ LVDC N+GCNGG M+ AFE+I K GG+ TE YPY
Sbjct: 141 VEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYTA 200
Query: 235 KNDRCQTDKTKHHAVT------ITGYE---------------AIPARYA-FQLYSHGVFD 272
C+ + A ITG E AI A + FQ Y GV++
Sbjct: 201 TTGTCKFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVYN 260
Query: 273 E-YCG-HQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
E C QL+HGV VGYG G+ YWLVKNSWG +WG+AGYI M+RN+ + CGI
Sbjct: 261 EKKCSTTQLDHGVLAVGYGTSTEGKDYWLVKNSWGATWGKAGYIWMSRNADNQ----CGI 316
Query: 330 LMQASYPV 337
ASYP+
Sbjct: 317 ATSASYPL 324
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 227 bits (579), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 180/316 (56%), Gaps = 45/316 (14%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEE 117
F+ W + + Y + +E +++ + +N I N Q S++L N++ DL++EE
Sbjct: 29 FQTWKNLWKKVYQTVEEEEQKMATWFNNWNKISEHNMQYSLKQKSYRLEMNEYGDLTSEE 88
Query: 118 FISTYLGYNKPYNEPRWPS--VQYLGL---------PASVDWRKEGAVTPVKDQGQCGSC 166
F S GY R + YL L P VDWRK G VTPVK+QGQCGSC
Sbjct: 89 FSSMMNGYRNDIRLKRKSTGGSTYLNLLSFGSQIQLPTLVDWRKHGLVTPVKNQGQCGSC 148
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
W+FSA ++EG +K KTGKLVSLSEQ L+DC N GCNGG M++AF++I GG+ T
Sbjct: 149 WSFSATGSLEGQHKKKTGKLVSLSEQNLIDCSTPEGNDGCNGGLMDQAFKYIKIQGGIDT 208
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGY----------------------EAIPARY-AF 263
E YPY K+D C+ + T A T TG+ AI A + +F
Sbjct: 209 EAYYPYEAKDDTCRFNITDSGA-TDTGFVDIKSGDEEMLKEAAATVGPISVAIDASHTSF 267
Query: 264 QLYSHGVFDEYC--GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
Q YS+GV+ E L+HGV VVGYG ++G+ YWLVKNSWG WGEAGYI+M+RN+ +
Sbjct: 268 QFYSNGVYSETACSSTMLDHGVLVVGYGTENGKDYWLVKNSWGEGWGEAGYIKMSRNADN 327
Query: 322 SNIGICGILMQASYPV 337
CGI QASYP+
Sbjct: 328 Q----CGIATQASYPL 339
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 227 bits (578), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 135/354 (38%), Positives = 189/354 (53%), Gaps = 55/354 (15%)
Query: 25 LRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFG 84
+R A+++L + V A ++SE ++++ +E R + Y E R
Sbjct: 1 MRFALITLLIALVAMTQAVSYSELVREEWNTFKLEHR---------KNYADSTEETFRMK 51
Query: 85 IYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYN-----------KPY 129
I++ N +I N + +S+KL NK+AD+ + EF T G+N + +
Sbjct: 52 IFNENKHHIAKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESF 111
Query: 130 NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+ S +++ LP +VDWR +GAVT VKDQG CGSCWAFS+ A+EG + K+G LVSL
Sbjct: 112 TGVTFISPEHVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSL 171
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQ LVDC N GCNGG M+ AF ++ GG+ TE Y Y G +D C DK A
Sbjct: 172 SEQNLVDCSTKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDKNSIGA- 230
Query: 250 TITGYEAIP-----------------------ARYAFQLYSHGVFDE--YCGHQLNHGVT 284
T G+ IP ++ +FQ YS GV+DE L+HGV
Sbjct: 231 TDRGFADIPQGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAENLDHGVL 290
Query: 285 VVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
VVGYG E G YWLVKNSWGT+WG+ G+I+M+RN + CGI +SYP+
Sbjct: 291 VVGYGTEKDGSDYWLVKNSWGTTWGDKGFIKMSRNKENQ----CGIASASSYPL 340
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 136/316 (43%), Positives = 183/316 (57%), Gaps = 38/316 (12%)
Query: 55 PQS-MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNK 109
P+S ++ ++ +LK + ++YG+E+E +RR I+ N+ YI+ N + SF L N+
Sbjct: 19 PKSELDSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNE 77
Query: 110 FADLSNEEFISTYLGYNKPYNEPR----WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
+ D++NEEF ST GY R P LP +VDWR +G VTP+K+QGQCGS
Sbjct: 78 YGDMTNEEFRSTMNGYKMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGS 137
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CW+FSA ++EG KTGKL SLSEQ LVDC N GC GG M+ AF++I G+
Sbjct: 138 CWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGID 197
Query: 226 TEDDYPYRGKNDRCQ--------TD------KTK-----HHAVTITGYEAI---PARYAF 263
TE YPY KN +C+ TD K+K AV G A+ + +F
Sbjct: 198 TESSYPYEAKNGKCRFNAANVGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHMSF 257
Query: 264 QLYSHGVFDE-YCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
QLY GV+ E +C +L+HGV VGYG + G+ YWLVKNSWG SWG+ GYI M+RN +
Sbjct: 258 QLYKSGVYHEFFCSETRLDHGVLAVGYGTESGKDYWLVKNSWGESWGQKGYIMMSRNKRN 317
Query: 322 SNIGICGILMQASYPV 337
+ CGI ASYP
Sbjct: 318 N----CGIATSASYPT 329
>gi|357153071|ref|XP_003576329.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 398
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/346 (38%), Positives = 185/346 (53%), Gaps = 63/346 (18%)
Query: 52 KYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTD 107
K++ M RF+ W+ R Y + +E RRF +Y SNV+YI+ +N++ L+F+L +
Sbjct: 52 KHNDLLMMGRFQGWMAAQGRSYWTAEETARRFEVYKSNVRYIEAVNAEAATTGLTFELGE 111
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQ--------------------YLGL----- 142
F DL++EEF + Y G P E +Q + L
Sbjct: 112 GPFTDLTHEEFSALYNGSMPPPEEEEGDDIQEEDEQVIATVVDGVDVNVAVHTNLSAGGP 171
Query: 143 ----PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
P S DWRK GAVTP+KDQG+CGSCWAF VA +EG +K+ G LVSLSEQ+L+DCD
Sbjct: 172 RPWPPRSRDWRKHGAVTPIKDQGRCGSCWAFPTVATIEGKHKIVRGNLVSLSEQQLIDCD 231
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
N GC GG++ +A+ +I KIGG+TT YPY+G +C K + A I G+ ++
Sbjct: 232 YT--NSGCKGGFVIRAYRWIRKIGGLTTSSAYPYKGARGKCM--KRRRAAARIAGWRSVR 287
Query: 259 ARYA----------------------FQLYSHGVFDEYCG-HQLNHGVTVVGYGE--DHG 293
+R FQ Y G+ + C +LNH VTVVGYG D G
Sbjct: 288 SRSEVALVNAVAGQPVAVYISASGKNFQHYKKGILNGPCDTARLNHAVTVVGYGRQADTG 347
Query: 294 EKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
KYW+VKNSWGT+WG+ GYI M R + + G CGI +P+ +
Sbjct: 348 AKYWIVKNSWGTTWGQEGYILMKRGTRNPR-GQCGIATSPVFPLMK 392
>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
Length = 340
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 126/302 (41%), Positives = 178/302 (58%), Gaps = 32/302 (10%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN--LSFKLTDNKFADLSNEEFI 119
F +++ ++S+ Y S++E++ R Y SN+ +I+ NSQN SF L N AD +++E+
Sbjct: 42 FVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQNDGTSFTLGPNHLADYTHDEY- 100
Query: 120 STYLGYNKPYNEPR---WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
LGY KP N+ + + +P S+DWR++GAV VKDQGQCGSCWAFS +A++E
Sbjct: 101 KKMLGY-KPRNKTGKEVYSTPNLKDIPESIDWREKGAVNAVKDQGQCGSCWAFSTIASLE 159
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
++TGKL SLSEQ+LVDC N N+GCNGG M A ++I GGV TE DYPY GK+
Sbjct: 160 SRYFIETGKLQSLSEQQLVDCSKNG-NEGCNGGDMGLAMDYIASAGGVETEKDYPYVGKD 218
Query: 237 DRCQTDKTKHHAVTITGYEAIPARYA---------------------FQLYSHGVFD-EY 274
C + +K A +P ++A FQ Y G+FD +
Sbjct: 219 QTCAFEASKEVATDKGHINIVPGKFATLQAAIAEGPVSVAIEADSLFFQFYRSGIFDSSW 278
Query: 275 CGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQAS 334
CG L+HGV VGYG D+G++Y++V+NSW SWG GYI + N + G+CGI M+
Sbjct: 279 CGTNLDHGVAAVGYGVDNGKQYYIVRNSWSDSWGLKGYINIIANGDGN--GMCGIQMEPV 336
Query: 335 YP 336
P
Sbjct: 337 VP 338
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/311 (42%), Positives = 168/311 (54%), Gaps = 36/311 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN--LSFKLTDNKFADLSN 115
+ W ++ + Y + E R + +N +YID N + L N+F DL N
Sbjct: 18 FSKELRAWKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLEN 77
Query: 116 EEFISTYLGY---NKPYN-EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
EF S Y GY N P +P P+ + LPASVDW K+G VTPVK+QGQCGSCW+FSA
Sbjct: 78 SEFKSLYNGYRMSNAPRKGKPFVPAARVQDLPASVDWSKKGWVTPVKNQGQCGSCWSFSA 137
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
++EG + TG L+SLSEQ LVDC N GCNGG M+ AFE++ K G+ TE YP
Sbjct: 138 TGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASYP 197
Query: 232 YRGKNDRCQTDKTKHHAVTITGY----------------------EAIPARY-AFQLYSH 268
YR + C+ + T TI+GY AI A + +FQ YS
Sbjct: 198 YRAVDSTCKFN-TADVGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYSS 256
Query: 269 GVFDEY--CGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
GV+D L+HGV VGYG D + YWLVKNSWG SWG +GYI M RN +
Sbjct: 257 GVYDPLICSSTNLDHGVLAVGYGTDGSKDYWLVKNSWGASWGMSGYIEMVRNHNNK---- 312
Query: 327 CGILMQASYPV 337
CGI ASYPV
Sbjct: 313 CGIATSASYPV 323
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 226 bits (577), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 126/312 (40%), Positives = 180/312 (57%), Gaps = 34/312 (10%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFA 111
Y Q+ + F W+K++ R Y E+ ++ + N+ +I N+ +N L +FA
Sbjct: 24 YSAQTYQTSFLGWMKKHDRSY-HHHEFNNKYQAFKDNMDFIHNWNTNKNSKTVLGLTQFA 82
Query: 112 DLSNEEFISTYLG--YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
DL+NEE+ YLG N + + + + G P S+DWR +GAV+ VKDQGQCGSCW+F
Sbjct: 83 DLTNEEYRKIYLGTKVNVAPEKHNFNMIHFTG-PDSIDWRTKGAVSHVKDQGQCGSCWSF 141
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
S +VEG +++KTG +V+LSEQ LVDC N GC+GG M AF+FI GGV TED
Sbjct: 142 STTGSVEGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFIMSQGGVATEDS 201
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAI----------------------PARYAFQLYS 267
YPY +C+ K+ A I+GY+ I ++ +FQLY
Sbjct: 202 YPYNAVQGKCKFTKSMVGA-NISGYKEITQGSELELQAALTKQPVSIAIDASQQSFQLYK 260
Query: 268 HGVFD--EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
GV+D E +QL+HGV VGYG ++G+ Y++VKNSW SWG+ GYI M+RN+ +
Sbjct: 261 SGVYDEPECSSYQLDHGVLAVGYGTENGKDYYIVKNSWADSWGQDGYIFMSRNAKNQ--- 317
Query: 326 ICGILMQASYPV 337
CG+ ASYP+
Sbjct: 318 -CGVATMASYPI 328
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 136/306 (44%), Positives = 174/306 (56%), Gaps = 42/306 (13%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLG 124
+ +EY SE E R IY N I N + S+KL N+F DL + EF+ST G
Sbjct: 57 HGKEYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNG 116
Query: 125 YNKPY-NEPRWPS-------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
+ + Y + PR S ++ LP +VDWRK+GAVTPVK+QGQCGSCWAFS ++E
Sbjct: 117 FKRNYRSTPREGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLE 176
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
G + KTG++VSLSEQ LVDC N GC GG M+ AF++I GG+ TE YPY G +
Sbjct: 177 GQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGIDTELSYPYNGTD 236
Query: 237 DRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGVFDE 273
C +K+ A T TG+ IP + +FQ YS GV+DE
Sbjct: 237 GICHFEKSDVGA-TDTGFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYSQGVYDE 295
Query: 274 -YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
C + L+HGV VVGYG G+ YWLVKNSWGT+WG+ GYI M RN + CGI
Sbjct: 296 PECSSESLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDDGYIYMTRNKENQ----CGIAS 351
Query: 332 QASYPV 337
ASYP+
Sbjct: 352 SASYPL 357
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 131/310 (42%), Positives = 181/310 (58%), Gaps = 39/310 (12%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFIST 121
+ W+ +SR Y E E Q R +++ N+++I+ N+ + S+KL NKF D + EEF++T
Sbjct: 39 QKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSYKLGVNKFTDWTKEEFLAT 98
Query: 122 YLGYN-----KPY---NE--PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
+ G + P+ NE P W L + DWR EGAVTPVK QG+CG CWAFSA
Sbjct: 99 HTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGAVTPVKYQGECGGCWAFSA 158
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
+AAVEG+ K+ G L+SLSEQ+L+DC +N GC GG M +AF +I K GGV++E+ YP
Sbjct: 159 IAAVEGLTKIARGNLISLSEQQLLDC-AREQNNGCKGGTMIEAFNYIVKNGGVSSENAYP 217
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLYSHG 269
Y+ K C+++ A+ I G+E +P + F YS G
Sbjct: 218 YQVKEGPCRSNDIP--AIVIRGFENVPSNNERALLEAVSRQPVAVDIDASETGFIHYSGG 275
Query: 270 VFDEY-CGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
V++ CG +NH VT+VGYG G KYWL KNSWG +WGE GYIR+ R+ G+C
Sbjct: 276 VYNARDCGTSVNHAVTLVGYGTSQEGIKYWLAKNSWGKTWGENGYIRIRRDVEWPQ-GMC 334
Query: 328 GILMQASYPV 337
G+ ASYPV
Sbjct: 335 GVAQYASYPV 344
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 179/321 (55%), Gaps = 42/321 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFA 111
+ QSM ++ E W+ ++SREY E E R ++ N+++I+ N + N S+KL N+FA
Sbjct: 30 FREQSMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFA 89
Query: 112 DLSNEEFISTYLGYN------------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKD 159
D +NEEF++ + G K + W + S DWR EGAVTPVK
Sbjct: 90 DWTNEEFLAIHTGLKGLTEVSPSKVVAKTISSQTWNVSDMV--VESKDWRAEGAVTPVKY 147
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QGQCG CWAFSAVAAVEG+ K+ G LVSLSEQ+L+DCD ++ C+GG M AF ++
Sbjct: 148 QGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCD-REYDRDCDGGIMSDAFNYVV 206
Query: 220 KIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY------------------ 261
+ G+ +E+DY Y+G + C+++ A I+G++ +P+
Sbjct: 207 QNRGIASENDYSYQGSDGGCRSN--ARPAARISGFQTVPSNNERALLEAVSRQPVSVSMD 264
Query: 262 ----AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMA 316
F YS GV+D CG NH VT VGYG G KYWL KNSWG +W E GYIR+
Sbjct: 265 ATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDGTKYWLAKNSWGETWEEKGYIRIR 324
Query: 317 RNSPSSNIGICGILMQASYPV 337
R+ G+CG+ A YPV
Sbjct: 325 RDVAWPQ-GMCGVAQYAFYPV 344
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 130/300 (43%), Positives = 169/300 (56%), Gaps = 32/300 (10%)
Query: 65 WLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG 124
W +++ Y E E R+ I+ N+ I NS++ + L N F D++N EF + G
Sbjct: 30 WKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVILRMNHFGDMTNTEFRAKMNG 89
Query: 125 Y--NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLK 182
+K N + + P +VDWR EG VTPVK+QGQCGSCWAFS+ A+EG + K
Sbjct: 90 LLLHKHQNGSTFLVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQHFKK 149
Query: 183 TGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTD 242
TG+LVSLSEQ LVDC + N GCNGG M+ AF +I GG+ TE YPY G++ C+
Sbjct: 150 TGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTETGYPYEGQDGTCRYS 209
Query: 243 KTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGVFDE-YCG-H 277
K+ A TG+ IP + +FQ Y GV+DE C
Sbjct: 210 KSSIGA-DDTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQCSPS 268
Query: 278 QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
L+HGV VVGYG D+G+ YWLVKNSWGT WG GYI M+RN N CGI +ASYP+
Sbjct: 269 ALDHGVLVVGYGTDNGKDYWLVKNSWGTGWGTEGYIYMSRN----NQNQCGIASKASYPL 324
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 135/350 (38%), Positives = 191/350 (54%), Gaps = 42/350 (12%)
Query: 22 RMMLRNAVLSLFLLWV---LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDE 78
+ ++ AV L +L + +G A Y ++M R E W+ ++ R Y E E
Sbjct: 9 KPLITAAVALLTVLAIANCIGCAVAARDLSSSTGYGEEAMTARHEKWMVEHGRTYKDEAE 68
Query: 79 WQRRFGIYSSNVQYIDYINSQ--NLSFKLTDNKFADLSNEEFISTYLGYNK-PYNEPRWP 135
RRF ++ +N ++D N+ + L N+FAD++++EF++ Y G+ P + P
Sbjct: 69 KARRFQVFKANAAFVDTSNAAAGGKKYHLAINRFADMTHDEFMARYTGFKPLPATGKKMP 128
Query: 136 SVQYLGLPAS------VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+Y + S VDWRK+GAVT VK+Q +CG CWAFSAVAA+EG++++ TG+LVSL
Sbjct: 129 GFKYANVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQINTGELVSL 188
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQ+LVDC N N GC GG ME AF+++ G+ TE YPY CQ + AV
Sbjct: 189 SEQQLVDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQGMCQNVQP---AV 245
Query: 250 TITGYEAIP--------ARYA------------FQLYSHGVFD-EYCGHQLNHGVTVVGY 288
+ Y+ +P A A FQ Y GV + CG LNH VT VGY
Sbjct: 246 AVRSYQQVPRDDEDALAAAVAGQPVSVAVDANNFQFYKGGVMTADSCGTNLNHAVTAVGY 305
Query: 289 G-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G + G YWL+KN WG++WGE GY+R+ R +G CG+ ASYPV
Sbjct: 306 GTAEDGTPYWLLKNQWGSTWGEEGYLRLQR-----GVGACGVAKDASYPV 350
>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 533
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 176/318 (55%), Gaps = 46/318 (14%)
Query: 55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN--LSFKLTDNKFAD 112
P E F W+ + + E+ RR Y +N YI N++N KL N F+
Sbjct: 21 PLEYEHEFSAWMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHNAFSH 80
Query: 113 LSNEEFISTYLGYNKP--YNEPR--------WPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
+S +EF G P Y E R W V+ +P++VDW +G VTPVK+QG
Sbjct: 81 MSFDEFKFKMTGLVLPEGYLEQRLASRVDGLWSDVE---VPSAVDWVDKGGVTPVKNQGM 137
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS AVEG + +GKL+SLSEQELVDCD N + GCNGG M+ AF++I G
Sbjct: 138 CGSCWAFSTTGAVEGATFVSSGKLLSLSEQELVDCDHNGD-MGCNGGLMDHAFQWIEDHG 196
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE---------------------AIPA-R 260
G+ +EDDY Y+ K C+ + V +TG++ AI A +
Sbjct: 197 GICSEDDYEYKAKAQVCRKCDS---VVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQ 253
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR--N 318
AFQ Y GVF+ CG +L+HGV VGYG D+G+K+W VKNSWG SWGE GYIR+AR N
Sbjct: 254 KAFQFYKSGVFNLTCGTRLDHGVLAVGYGNDNGQKFWKVKNSWGASWGEQGYIRLAREEN 313
Query: 319 SPSSNIGICGILMQASYP 336
P+ G CGI SYP
Sbjct: 314 GPA---GQCGIASVPSYP 328
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 133/321 (41%), Positives = 181/321 (56%), Gaps = 46/321 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFADL 113
++E ++ + ++ ++Y E E + R I++ N I N + +SFK+ NK+AD+
Sbjct: 24 IKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYADM 83
Query: 114 SNEEFISTYLGYN-----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
+ EF T G+N + + S +++ LP SVDWR +GAVT VKDQG
Sbjct: 84 LHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQGH 143
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+ A+EG + KTG L+SLSEQ LVDC N GCNGG M+ AF +I G
Sbjct: 144 CGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 203
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------A 259
G+ TE YPY G +D C +K A T G+ IP +
Sbjct: 204 GIDTEKSYPYEGIDDSCHFNKGTIGA-TDRGFTDIPQGDEKKLAQAVATIGPVSVAIDAS 262
Query: 260 RYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMA 316
+FQ YS GV+DE C Q L+HGV VVGYG D +G+ YWLVKNSWGT+WG+ G+I+MA
Sbjct: 263 HESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTDENGKDYWLVKNSWGTTWGDKGFIKMA 322
Query: 317 RNSPSSNIGICGILMQASYPV 337
RN + CGI +SYP+
Sbjct: 323 RNDDNQ----CGIATASSYPL 339
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 226 bits (575), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 181/317 (57%), Gaps = 40/317 (12%)
Query: 55 PQS-MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNK 109
P+S ++ ++ +LK + ++YG+E+E +RR I+ N+ YI+ N + SF L N+
Sbjct: 19 PKSELDSEWQLYLKAHGKQYGAEEEARRRV-IWEGNLDYIEKHNLAADRGDYSFWLGMNE 77
Query: 110 FADLSNEEFISTYLGYNKPYNEPR----WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
+ D++NEEF ST GY R P LP +VDWR +G VTP+K+QGQCGS
Sbjct: 78 YGDMTNEEFRSTMNGYKMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGS 137
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CW+FSA ++EG KTGKL SLSEQ LVDC N GC GG M+ AF++I G+
Sbjct: 138 CWSFSATGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGID 197
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-----------------------YA 262
TE YPY KN +C+ + A T +G+ I ++ +
Sbjct: 198 TESSYPYEAKNGKCRFNAANVGA-TDSGFTDIKSKSESDLQSAVATVGPISVAIDASHMS 256
Query: 263 FQLYSHGVFDE-YCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
FQLY GV+ E +C +L+HGV VGYG + G+ YWLVKNSWG SWG+ GYI M+RN
Sbjct: 257 FQLYRSGVYHEFFCSETRLDHGVLAVGYGTESGKDYWLVKNSWGESWGQKGYIMMSRNKR 316
Query: 321 SSNIGICGILMQASYPV 337
++ CGI ASYP
Sbjct: 317 NN----CGIATSASYPT 329
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 178/314 (56%), Gaps = 38/314 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
+ +E + + + Y S E RF I++ N +I N + +S+KL N+FADL
Sbjct: 23 LRTEWEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADL 82
Query: 114 SNEEFISTYLGYNKPYNEPRWPS------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
EF+ GY R + + LP +VDWRK+GAVTPVKDQGQCGSCW
Sbjct: 83 LPHEFVKMMNGYQGKRLAGRGSTYLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCW 142
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS+ ++EG + LKTGKLVSLSEQ LVDC NQGCNGG M+ +F +I GG+ TE
Sbjct: 143 AFSSTGSLEGQHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGIDTE 202
Query: 228 DDYPYRGKNDRCQ-------------------TDKTKHHAVTITGYEAI---PARYAFQL 265
D YPY ++ C+ ++K AV G ++ ++ +FQL
Sbjct: 203 DSYPYEAEDGDCRYKKEDVGATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDASQQSFQL 262
Query: 266 YSHGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV+DE C + L+HGV VGYG +G+KYWLVKNSW +WG+ GYI M+R+ +
Sbjct: 263 YSEGVYDEPNCSSESLDHGVLAVGYGVKNGKKYWLVKNSWAETWGQDGYILMSRDKNNQ- 321
Query: 324 IGICGILMQASYPV 337
CGI ASYP+
Sbjct: 322 ---CGIASSASYPL 332
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 139/323 (43%), Positives = 188/323 (58%), Gaps = 39/323 (12%)
Query: 48 GYPQKYDPQSMEER----FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSF 103
GY Q D + ER F +W+ +++ Y + DE RF I+ N+ YID N +N S+
Sbjct: 32 GYSQ--DDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSY 89
Query: 104 KLTDNKFADLSNEEFISTYLG------YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPV 157
L N+FADLSN+EF Y+G + Y+E + + + LP +VDWRK+GAVTPV
Sbjct: 90 WLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDE-EFINEDTVNLPENVDWRKKGAVTPV 148
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
+ QG CGSCWAFSAVA VEGINK++TGKLV LSEQELVDC+ S GC GGY A E+
Sbjct: 149 RHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRS--HGCKGGYPPYALEY 206
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG------------YEAIPAR----- 260
+ K G+ YPY+ K C+ + V +G AI +
Sbjct: 207 VAK-NGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVV 265
Query: 261 -----YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
FQLY G+F+ CG +++H VT VGYG+ G+ Y L+KNSWGT+WGE GYIR+
Sbjct: 266 VESKGRPFQLYKGGIFEGPCGTKVDHAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRI 325
Query: 316 ARNSPSSNIGICGILMQASYPVK 338
R +P ++ G+CG+ + YP K
Sbjct: 326 KR-APGNSPGVCGLYKSSYYPTK 347
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 135/334 (40%), Positives = 178/334 (53%), Gaps = 59/334 (17%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFAD 112
+S+ +E W Y +R++G E RRF ++ N + I N Q N ++ L N+F+D
Sbjct: 42 ESLWALYERWCAHYNMARDHG---EKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSD 98
Query: 113 LSNEEFISTYLGY------------------------NKPYNEPRWPSVQYLGLPASVDW 148
+++EEF + G + +N LG P +VDW
Sbjct: 99 MTDEEFNRSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDW 158
Query: 149 RKEGAVTPVKDQG-QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCN 207
R AVT VKDQG CGSCWAFSA+AAVEGIN ++T LV LSEQ+LVDCD N GCN
Sbjct: 159 RGR-AVTRVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCD--KLNHGCN 215
Query: 208 GGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP--------- 258
GG M AF F+ + GV E YPY G+ RC+ VTI GY+ +P
Sbjct: 216 GGLMTTAFSFVVRNRGVVPEGAYPYMGREGRCK--HVMAPPVTIYGYQRVPRFDANALMN 273
Query: 259 -------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGT 305
+ + F+ Y GVF+ CG +L H T VGYG D G +W+VKNSWG
Sbjct: 274 AVAAQPVSVAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYGADAGGPFWIVKNSWGP 333
Query: 306 SWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
WGE GY+R++RN+P G+CGIL + SYPVKR
Sbjct: 334 GWGEGGYVRISRNTPVRQ-GVCGILTENSYPVKR 366
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 132/325 (40%), Positives = 180/325 (55%), Gaps = 38/325 (11%)
Query: 44 AWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSF 103
A + G+ K+D E++++ W ++++Y + E R I+ N++ I N++ SF
Sbjct: 12 AVASGFVVKFDED--EQQWQAWKLFHTKKYTTVTEEGARKAIWRDNLKKIQKHNAEGHSF 69
Query: 104 KLTDNKFADLSNEEFISTYLGYNKPYNE------PRWPSVQYLGLPASVDWRKEGAVTPV 157
L N DL+ +EF Y G Y+ + + ++ +P +VDWRKEG VTPV
Sbjct: 70 TLAMNHLGDLTQDEFRYFYTGMRSHYSNYTKKQGSAFLAPSHVQVPDTVDWRKEGYVTPV 129
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
K+QGQCGSCWAFS ++EG N KTGKLVSLSEQ LVDC N GC GG M+ AF++
Sbjct: 130 KNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCQGGLMDYAFKY 189
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY--------EAI------------ 257
I + GG+ TE+ YPY +NDRC+ K+ AV TG+ EA+
Sbjct: 190 IKENGGIDTEESYPYEARNDRCRFQKSNIGAVD-TGFVDVTHGDEEALKTAAGTVGPISV 248
Query: 258 ---PARYAFQLYSHGVFDEY--CGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGY 312
+FQ Y GV++ L+HGV VVGYG G YWLVKNSWG WG GY
Sbjct: 249 AIDAGHMSFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQGSDYWLVKNSWGERWGMEGY 308
Query: 313 IRMARNSPSSNIGICGILMQASYPV 337
I M+RN + CG+ QASYP+
Sbjct: 309 IMMSRNKNNQ----CGVATQASYPL 329
>gi|326493706|dbj|BAJ85314.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 131/330 (39%), Positives = 175/330 (53%), Gaps = 52/330 (15%)
Query: 55 PQS-MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSF---------- 103
P+S + ERF W+ +YS+ Y + E + RF ++ +N I ++ QN +
Sbjct: 40 PESEVRERFSKWMIKYSKHYSCKQEEEMRFQVFKNNTNSIGQLDRQNPNPGVGGALGPSG 99
Query: 104 -------KLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL----PASVDWRKEG 152
K++ N+F DLS E I Y G N R S YL P VDWR G
Sbjct: 100 SQVHTFQKVSMNRFGDLSPREVIQQYTGLNT--TSFRTASPTYLPYHSFKPCCVDWRSSG 157
Query: 153 AVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYME 212
AVT VK QG CGSCWAF+AVAA+EG+NK++TG+LVSLSEQ LVDCD S GC GG+ +
Sbjct: 158 AVTGVKHQGTCGSCWAFAAVAAIEGMNKIRTGELVSLSEQVLVDCDTVST--GCGGGHSD 215
Query: 213 KAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK-HHAVTITGYEAIPAR----------- 260
A + GG+T+E+ YPY G +C DK H +I G++A+P+
Sbjct: 216 SAMALVAARGGITSEERYPYAGFQGKCDVDKLMFDHQASIKGFKAVPSNNEAQLAIAVAM 275
Query: 261 -----------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE--KYWLVKNSWGTSW 307
AFQ YS G++ C +NH VT+VGY E GE KYW+ KNSW W
Sbjct: 276 QPVTVYIDASGSAFQFYSGGIYRGPCSANVNHAVTIVGYCEGPGEGNKYWIAKNSWSNDW 335
Query: 308 GEAGYIRMARNSPSSNIGICGILMQASYPV 337
GE GY+ +A++ S G CG+ YP
Sbjct: 336 GEQGYVYLAKDVAWST-GTCGLATSPFYPT 364
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 180/321 (56%), Gaps = 46/321 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
++E + + ++ + Y E E + R I++ N I N + ++FK+ NK+AD+
Sbjct: 23 IKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADM 82
Query: 114 SNEEFISTYLGYNKPYN------EPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
+ EF T G+N + +P + + ++ LP SVDWR++GAVT VKDQG
Sbjct: 83 LHHEFRETMNGFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVKDQGH 142
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+ A+EG + KTG LVSLSEQ LVDC N GCNGG M+ AF +I G
Sbjct: 143 CGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKDNG 202
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------A 259
G+ TE YPY G +D C +K A T G+ IP +
Sbjct: 203 GIDTEKSYPYEGIDDSCHFNKDSVGA-TDRGFADIPQGNEKKMAEAVATIGPVSVAIDAS 261
Query: 260 RYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMA 316
+FQ YS G+++E C Q L+HGV VVGYG D G+ YWLVKNSWGT+WG+ G+I+MA
Sbjct: 262 HESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTDESGKDYWLVKNSWGTTWGDKGFIKMA 321
Query: 317 RNSPSSNIGICGILMQASYPV 337
RN + CGI +SYP+
Sbjct: 322 RNEDNQ----CGIASASSYPL 338
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 116/223 (52%), Positives = 144/223 (64%), Gaps = 28/223 (12%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP SVDWR++GAVT VKDQG+CGSCWAFS V +VEGIN ++TG LVSLSEQEL+DCD +
Sbjct: 4 LPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD-TA 62
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHA---VTITGYEAIP 258
+N GC GG M+ AFE+I GG+ TE YPYR C + ++ V I G++ +P
Sbjct: 63 DNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVP 122
Query: 259 ARY----------------------AFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEK 295
A AF YS GVF CG +L+HGV VVGYG + G+
Sbjct: 123 ANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAEDGKA 182
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YW VKNSWG SWGE GYIR+ ++S +S G+CGI M+ASYPVK
Sbjct: 183 YWTVKNSWGPSWGEQGYIRVEKDSGASG-GLCGIAMEASYPVK 224
>gi|157278117|ref|NP_001098157.1| cathepsin S precursor [Oryzias latipes]
gi|50251130|dbj|BAD27582.1| cathepsin S [Oryzias latipes]
Length = 327
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 173/314 (55%), Gaps = 42/314 (13%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLTDNKFADL 113
+++ + W K YS+ Y E E R I+ N++ I N S L S++L N DL
Sbjct: 21 LDQHWNLWKKTYSKTYSHEIEEFGRRRIWEENLEMISVHNLEVSLGLHSYELAMNHLGDL 80
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQY------LGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+ EE I++ G P R + Y +P SVDWR+ G VT VK QG+CGSCW
Sbjct: 81 TIEELIASLTGTVAPVGLER---IHYDLVKINTSVPESVDWREGGLVTSVKTQGRCGSCW 137
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSAV A+EG K TG L SLS Q LVDC N GC GG+M AF+++ K G++++
Sbjct: 138 AFSAVGALEGQLKKTTGILTSLSPQNLVDCSTKYGNYGCKGGFMSNAFQYVIKNQGISSD 197
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQ 264
YPY GK D+C+ D +KH A TGY +P +R F
Sbjct: 198 AAYPYIGKRDKCKYD-SKHRAANCTGYNFLPKGDEFALKVGVATIGPISVAIDASRPKFL 256
Query: 265 LYSHGVF-DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
Y HGV+ D C H +NHGV VVGYG ++GE YWLVKNSWG +G+ GYI+MARN +
Sbjct: 257 FYRHGVYKDHSCSHNVNHGVLVVGYGTENGEDYWLVKNSWGERYGDGGYIKMARNRRNQ- 315
Query: 324 IGICGILMQASYPV 337
CGI + A +PV
Sbjct: 316 ---CGIALYACFPV 326
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 225 bits (573), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 141/355 (39%), Positives = 193/355 (54%), Gaps = 61/355 (17%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
+L L VL I A ++ YD + E ++ + ++ + Y ++ E + R I+
Sbjct: 3 ILFFIALTVLSINAVSF-------YD--LVMEEWQLFKAEHKKNYNNDVEEKFRMKIFMD 53
Query: 89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPS-------- 136
N Q I N++ + +KL NK++D+ + EFI+T+ G+NK P S
Sbjct: 54 NKQKITKHNTKYQRGEVGYKLGLNKYSDMLHHEFINTFNGFNKSIIPPHLRSNNGKTHLK 113
Query: 137 ------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
+ LP VDW K GAVTPVKDQG CGSCWAFSA A+EG++ KT LVSLS
Sbjct: 114 GSFFIPPANVKLPKHVDWVKLGAVTPVKDQGHCGSCWAFSATGALEGLHFRKTKVLVSLS 173
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQ L+DC N GCNGG M++AF+++ GG+ TE YPY G ND C+ + A+
Sbjct: 174 EQNLIDCSTEEGNNGCNGGLMDQAFQYVRINGGIDTERSYPYEGNNDVCRYEPENSGAID 233
Query: 251 ITGYEAIP-----------------------ARYAFQLYSHGV-FDEYCGHQ---LNHGV 283
TGY +P ++ +FQLYS GV F+ C ++ L+HGV
Sbjct: 234 -TGYTDVPLGDEDALKSAVATVGPVSVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGV 292
Query: 284 TVVGYGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
VVGYG D + YWLVKNSWG SWGE GYI+MARN+ + CGI Q S+P
Sbjct: 293 LVVGYGTDEETQQDYWLVKNSWGDSWGENGYIKMARNADNQ----CGIATQPSFP 343
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 181/318 (56%), Gaps = 44/318 (13%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
E +E + ++S++Y SE E R I++ N I N + ++KL+ NK+ D+ +
Sbjct: 27 EEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDMLH 86
Query: 116 EEFISTYLGYNKPYNEPRWPSVQYLG-----------LPASVDWRKEGAVTPVKDQGQCG 164
EF+ST G+ + + Y G LP +VDWR +GAVTP+KDQGQCG
Sbjct: 87 HEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQGQCG 146
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFSA A+EG KTG+LVSLSEQ LVDC N GCNGG M+ AFE++ + GG+
Sbjct: 147 SCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKENGGI 206
Query: 225 TTEDDYPYRGKNDRCQ-------------------TDKTKHHAVTITG--YEAIPARY-A 262
TE+ YPY ++++C ++ AV G AI A + +
Sbjct: 207 DTEESYPYDAEDEKCHYNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVSVAIDASHES 266
Query: 263 FQLYSHGVF--DEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQ YSHGV+ E L+HGV VVGYG +D G YWLVKNSWGT+WG+ GY++MARN
Sbjct: 267 FQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDGTDYWLVKNSWGTTWGDQGYVKMARNR 326
Query: 320 PSSNIGICGILMQASYPV 337
+ CGI AS+P+
Sbjct: 327 DNQ----CGIASSASFPL 340
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 179/313 (57%), Gaps = 37/313 (11%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFAD 112
S+ ++++N+ ++ R Y S E + R ++ N Q+ID N++ ++F L N+F D
Sbjct: 17 SLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGD 76
Query: 113 LSNEEFISTYLGY-NKPYNEPRWP-SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+++EE ++T G+ P P LP VDWR +GAVTPVKDQ QCGSCWAFS
Sbjct: 77 MTSEEIVATMNGFLGAPTRRPAAVLKADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFS 136
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
++EG + LK GKLVSLSEQ LVDC N GC GG M++AF +I G+ TED Y
Sbjct: 137 TTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANKGIDTEDSY 196
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQLYS 267
PY ++ +C+ D + A T TGY + ++ F Y
Sbjct: 197 PYEAQDGKCRFDASNVGA-TDTGYVDVEHGSESALKKAVATIGPISVGIDASQSTFHFYH 255
Query: 268 HGVF-DEYCGH-QLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
GV+ D++C L+HGV VGYG D +G +WLVKNSW TSWG+ GYI+M+RN ++
Sbjct: 256 TGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRNNN-- 313
Query: 325 GICGILMQASYPV 337
CGI QASYP+
Sbjct: 314 --CGIASQASYPL 324
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 132/306 (43%), Positives = 173/306 (56%), Gaps = 42/306 (13%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLG 124
+ +EY SE E R IY N I N + +S+KL N++ D+ + EF+ST G
Sbjct: 36 HGKEYQSETEEYYRLKIYMENRMMIARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNG 95
Query: 125 YNKPY-NEPRWPS-------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
+ + Y ++PR S ++ LP +VDWRK+GAVTPVK+QGQCGSCWAFS ++E
Sbjct: 96 FRRDYRSKPRQGSFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLE 155
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
G + K+G +VSLSEQ LVDC N GC GG M+ AF++I GG+ TE YPY G +
Sbjct: 156 GQHFRKSGDMVSLSEQNLVDCSTAFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTD 215
Query: 237 DRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGVFDE 273
C K+ A T TG+ IP + +FQ YS GV+DE
Sbjct: 216 GTCHFKKSDVGA-TDTGFVDIPEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDE 274
Query: 274 -YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
C + L+HGV VVGYG + YWLVKNSWGT+WG+ GYI M RN + CGI
Sbjct: 275 PECSSENLDHGVLVVGYGTKDDQDYWLVKNSWGTTWGDGGYIYMTRNKDNQ----CGIAS 330
Query: 332 QASYPV 337
ASYP+
Sbjct: 331 SASYPL 336
>gi|357477225|ref|XP_003608898.1| Cysteine proteinase, partial [Medicago truncatula]
gi|355509953|gb|AES91095.1| Cysteine proteinase, partial [Medicago truncatula]
Length = 260
Score = 224 bits (572), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 127/263 (48%), Positives = 157/263 (59%), Gaps = 51/263 (19%)
Query: 108 NKFADLSNEEFISTY----LGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVK 158
NKFAD++N EF S Y + +++ + + ++ G+P+S+DWRK GAVT VK
Sbjct: 3 NKFADMTNYEFRSIYADSKVNHHRMFRGMSHDNGPFMYENVEGVPSSIDWRKIGAVTGVK 62
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
DQGQCGSCWAFS + AVEGIN++KT KLVSLSEQELVDCD NQGCNGG ME AFEFI
Sbjct: 63 DQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTEV-NQGCNGGLMEYAFEFI 121
Query: 219 TKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA---------------- 262
K G+TTE +YPY K+ C K AV+I G+E +PA
Sbjct: 122 -KQNGITTETNYPYAAKDGTCNIQKENKPAVSIDGHENVPANNEKALLKAAANQPISVAI 180
Query: 263 ------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMA 316
FQ YS GVF +CG +LNHGV NSWG+ WGE GYIRM
Sbjct: 181 DAGGSDFQFYSEGVFTGHCGTELNHGV-----------------NSWGSEWGEQGYIRMQ 223
Query: 317 RNSPSSNIGICGILMQASYPVKR 339
R + S G+CGI M+ASYP+K+
Sbjct: 224 R-AISHKQGLCGIAMEASYPIKK 245
>gi|403333364|gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 192/315 (60%), Gaps = 34/315 (10%)
Query: 51 QKYDPQSMEER-FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDN 108
Q Y P + E+ F N++ +Y + YG+++E+ R ++ N+ + N++N ++++L N
Sbjct: 31 QLYTPITAEDHAFTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNARNDVTYRLGLN 90
Query: 109 KFADLSNEEFISTYLGYNKPYNE-PRWPSVQYLGLPAS--VDWRKEGAVTPVKDQGQCGS 165
KFAD + E+ LG+ N+ PR +++ LG P + V+W ++GAVTPVKDQGQCGS
Sbjct: 91 KFADYTEAEY-KRLLGFGGQKNKNPR--NIKVLGAPKNDGVNWVEQGAVTPVKDQGQCGS 147
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CW+FSA A+EG K++ G L SLSEQ+LVDC N+GC GG+M++AF+++ + +
Sbjct: 148 CWSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQT-ALE 206
Query: 226 TEDDYPYRGKNDRCQ------------TDKTKHHAVTITGY-------EAIPA-RYAFQL 265
TED YPY +D C+ D T ++ + AI A + FQ
Sbjct: 207 TEDQYPYEAVDDTCRASSAGVVKVDSFVDVTPNNVNELKAALDKGPVSVAIEADQMVFQF 266
Query: 266 YSHGVF-DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GV D CG L+HGV VGYG + G+ Y+LVKNSWG SWGE GY+++A SP +
Sbjct: 267 YSGGVINDASCGTTLDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKIAA-SPDN-- 323
Query: 325 GICGILMQASYPVKR 339
ICGIL QASYP+ +
Sbjct: 324 -ICGILSQASYPIMK 337
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 137/320 (42%), Positives = 174/320 (54%), Gaps = 45/320 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS----QNLSFKLTDNKFADL 113
++E + + Q+ + Y +E E + R I++ N I N +S+KL NK+AD+
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQYLG----------LPASVDWRKEGAVTPVKDQGQC 163
+ EF T GYN + +G +P SVDWR+ GAVT VKDQG C
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFS+ A+EG + K G LVSLSEQ LVDC N GCNGG M+ AF +I GG
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------AR 260
+ TE YPY G +D C +K A T TG+ IP +
Sbjct: 204 IDTEKSYPYEGIDDSCHFNKATIGA-TDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASH 262
Query: 261 YAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQLYS GV++E C Q L+HGV VVGYG D G YWLVKNSWGT+WGE GYI+MAR
Sbjct: 263 ESFQLYSEGVYNEPECDEQNLDHGVLVVGYGTDESGMDYWLVKNSWGTTWGEQGYIKMAR 322
Query: 318 NSPSSNIGICGILMQASYPV 337
N + CGI +SYP
Sbjct: 323 NQNNQ----CGIATASSYPT 338
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 140/349 (40%), Positives = 189/349 (54%), Gaps = 52/349 (14%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
MLR ++L F++ + A S + + ++E + + + Y S E RF
Sbjct: 1 MLRISLLCAFVV----VTTAASSH--------EILRTQWEAFKATHKKSYQSNMEELLRF 48
Query: 84 GIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPS--- 136
I+S N + N + +S+KL N+F DL EF + GY R +
Sbjct: 49 KIFSENSLLVARHNEKYARGLVSYKLGMNQFGDLLPHEFARMFNGYRGARTAGRGSTFLP 108
Query: 137 ---VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
V Y LP S+DWR++GAVTPVK+QGQCGSCWAFS ++EG + LKTG LVSLSEQ
Sbjct: 109 PANVNYSSLPQSMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQN 168
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
LVDC N GC GG M+ AF++I GG+ TE YPY ++ C+ K ++ T TG
Sbjct: 169 LVDCSETFGNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEAEDGECRF-KKQNVGATDTG 227
Query: 254 Y----------------------EAIPARY-AFQLYSHGVFDEY--CGHQLNHGVTVVGY 288
+ AI A + +FQLYS GV+DE QL+HGV VVGY
Sbjct: 228 FVDIEQGSEDDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETECSSEQLDHGVLVVGY 287
Query: 289 GEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G + G+KYWLVKNSW SWG+ GYI+M+R+ + CGI ASYP+
Sbjct: 288 GVEDGKKYWLVKNSWAESWGDNGYIKMSRDKDNQ----CGIASAASYPL 332
>gi|66378053|gb|AAY45871.1| cathepsin L-like cysteine proteinase [Longidorus elongatus]
Length = 358
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 130/315 (41%), Positives = 180/315 (57%), Gaps = 47/315 (14%)
Query: 64 NWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFI 119
N+ ++++ Y ++DE RF +++SN + I+ N + SF L+ NKFAD++N EF
Sbjct: 45 NFKLKHAKSYKTKDEELLRFQVFASNHKVIEQHNIEYEAGQHSFALSLNKFADMTNAEFR 104
Query: 120 STYLGYNKPYNEPRWPSVQY------------LGLPASVDWRKEGAVTPVKDQGQCGSCW 167
G+ P S + +P SVDWRKEG VT VKDQG CGSCW
Sbjct: 105 QRMNGFKLPAKRKLAKSQPLKEDGMIFEMPDNVTIPDSVDWRKEGYVTKVKDQGSCGSCW 164
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSA ++EG + +TGKLVSLSEQ LVDCDVN +++GCNGGYM+ AF+++ G+ TE
Sbjct: 165 AFSATGSLEGQHYKQTGKLVSLSEQNLVDCDVNGDDEGCNGGYMDGAFQYVETNKGIDTE 224
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQ 264
YPY+G++ RC+ K++ T TG+ IP A + FQ
Sbjct: 225 ASYPYKGRDGRCRF-KSEDVGATDTGFVDIPEGNETLLEAAIATVGPVSVAIDAASFKFQ 283
Query: 265 LYSHGV-FDEYCGHQ-LNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
YSHGV +D C + L+HGV VGY G++Y++VKNSW WG+ GYI M+R +
Sbjct: 284 FYSHGVYYDRSCSPEYLDHGVLAVGYNSTKDGKQYYIVKNSWSEDWGDDGYILMSRRKNN 343
Query: 322 SNIGICGILMQASYP 336
+ CGI ASYP
Sbjct: 344 N----CGIATMASYP 354
>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 356
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 180/320 (56%), Gaps = 47/320 (14%)
Query: 61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEEFI 119
R E W+ ++ R Y E RR ++ +N +Y+D +N + N ++ L NKF+DL+++EF+
Sbjct: 38 RHEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFV 97
Query: 120 STYLGYN-------KPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCW 167
T+LGY +P E V LG +P SVDWR +GAVT VK+QG CG CW
Sbjct: 98 QTHLGYRGHQQGGLRP-EEENVSKVAALGYGQADMPESVDWRAQGAVTGVKNQGSCGCCW 156
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG----CNGGYMEKAFEFITKIGG 223
AF+AVAA EG+ K+ TG L+S+SEQ+++DC S G C+GG+++ A ++ G
Sbjct: 157 AFAAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAASRG 216
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHA--------VTITGYE--------------AIPARY 261
+ E Y Y G CQ+ T + A VT+ G E ++ A
Sbjct: 217 LQPEAAYAYTGLQGACQSGFTPNSAASFGEPQTVTLQGDEGRLQGLVAGQPIAVSVEASD 276
Query: 262 AFQLYSHGVF---DEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMAR 317
F+ Y GVF CG +LNH VTVVGYG D G++YWLVKN WGTSWGE GY+R+AR
Sbjct: 277 DFRHYMSGVFTAGTSSCGQRLNHAVTVVGYGSADGGQEYWLVKNQWGTSWGEGGYMRIAR 336
Query: 318 NSPSSNIGICGILMQASYPV 337
+ + N CGI A YP
Sbjct: 337 GNGAPN---CGISAYAYYPT 353
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 224 bits (570), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 181/314 (57%), Gaps = 38/314 (12%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFAD 112
S+ +++ ++ ++ R Y S E + R ++ N Q+ID N++ ++F L N+F D
Sbjct: 19 SLRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGD 78
Query: 113 LSNEEFISTYLGY-NKPYNEPR--WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
+++EEF +T G+ N P P + LP VDWR +GAVTPVKDQ QCGSCWAF
Sbjct: 79 MTSEEFTATMNGFLNVPSRRPTAILRADPDETLPKEVDWRTKGAVTPVKDQKQCGSCWAF 138
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
S ++EG + LK GKLVSLSEQ LVDC N GC GG M++AF +I G+ TED
Sbjct: 139 STTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDS 198
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQLY 266
YPY ++ +C+ D + A T TGY + ++ +FQ Y
Sbjct: 199 YPYEAQDGKCRFDASNVGA-TDTGYVDVEHGSESALKKAVATIGPISVAIDASQPSFQFY 257
Query: 267 SHGV-FDEYCGH-QLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
GV ++E C L+HGV VGYGE + GE YWLVKNSW TSWG GYI+M+R+ ++
Sbjct: 258 HDGVYYEEGCSSTMLDHGVLAVGYGETEKGEAYWLVKNSWNTSWGNKGYIQMSRDKKNN- 316
Query: 324 IGICGILMQASYPV 337
CGI QASYP+
Sbjct: 317 ---CGIASQASYPL 327
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 140/354 (39%), Positives = 186/354 (52%), Gaps = 57/354 (16%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+ LFLL V + A + ++E + + Q+ ++Y SE E + R IY N
Sbjct: 1 MKLFLLLVSFLAAANAVSIF------NLVKEEWNAFKLQHRKKYDSESEERIRMKIYVQN 54
Query: 90 VQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYNKP---------------YN 130
I N + F+L NK+ADL +EEF+ T G+N+
Sbjct: 55 KHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSAAAGSKLLGREQLMTIE 114
Query: 131 EP-RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
EP W + +P ++DWR++GAVTPVKDQG CGSCW+FSA A+EG + KTGKLVSL
Sbjct: 115 EPITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSL 174
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQ LVDC N GCNGG M+ AF+++ G+ TE YPY +D C + K
Sbjct: 175 SEQNLVDCSTKYGNNGCNGGLMDNAFQYVKDNKGIDTEKAYPYEAIDDECHYN-PKAIGA 233
Query: 250 TITGYEAIP-----------------------ARYAFQLYSHGVFDE-YC-GHQLNHGVT 284
T G+ IP + +FQ YS GV+ E C QL+HGV
Sbjct: 234 TDKGFVDIPQGDEKALKKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVL 293
Query: 285 VVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
VGYG + GE YWLVKNSWGT+WG+ GY++MARN + CGI ASYP+
Sbjct: 294 AVGYGTTEDGEDYWLVKNSWGTTWGDQGYVKMARNRENH----CGIATTASYPL 343
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 128/300 (42%), Positives = 169/300 (56%), Gaps = 33/300 (11%)
Query: 65 WLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEF--ISTY 122
W +++ Y + E R+ I+ N + I N Q F L N+F D++N EF + Y
Sbjct: 30 WKMAHNKAYSHDGEETVRYTIWKDNERRIREHNLQGGDFLLEMNQFGDMTNNEFKDFNGY 89
Query: 123 LGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLK 182
L + K + + + P SVDWR EG VTPVKDQGQCGSCWAFS ++EG N K
Sbjct: 90 LSH-KHVSGSTFLTPNSFVAPDSVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKK 148
Query: 183 TGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTD 242
TGKLVSLSEQ LVDC N GCNGG M+ AF +I + G+ +E YPY K+ +C
Sbjct: 149 TGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENNGIDSEASYPYTAKDGKCAFT 208
Query: 243 KTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGVFDEY--CGH 277
K + A T TG+ IP + ++FQ Y GV++E
Sbjct: 209 K-PNVAATDTGFVDIPSGDENKLKEAVASVGPISVAIDASHFSFQFYRKGVYNERKCSST 267
Query: 278 QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+L+HGV VVGYG + G+ YWLVKNSW TSWG+ GYI+M+RN+ + CGI ASYP+
Sbjct: 268 ELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMSRNAKNQ----CGIATNASYPL 323
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 179/313 (57%), Gaps = 37/313 (11%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFAD 112
S+ ++++N+ ++ R Y S E + R ++ N Q+ID N++ ++F L N+F D
Sbjct: 18 SLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGD 77
Query: 113 LSNEEFISTYLGY-NKPYNEPRWP-SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+++EE ++T G+ P P LP VDWR +GAVTPVKDQ QCGSCWAFS
Sbjct: 78 MTSEEIVATMNGFLGAPTRRPAAVLKADDETLPEKVDWRTKGAVTPVKDQKQCGSCWAFS 137
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
++EG + LK GKLVSLSEQ LVDC N GC GG M++AF +I G+ TED Y
Sbjct: 138 TTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDTEDSY 197
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQLYS 267
PY ++ +C+ D + A T TGY + ++ F Y
Sbjct: 198 PYEAQDGKCRFDASNVGA-TDTGYVDVEHGSESALKKAVATIGPISVGIDASQSTFHFYH 256
Query: 268 HGVF-DEYCGH-QLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
GV+ D++C L+HGV VGYG D +G +WLVKNSW TSWG+ GYI+M+RN ++
Sbjct: 257 TGVYHDDHCSSTMLDHGVLAVGYGSDENGGDFWLVKNSWNTSWGDKGYIKMSRNRNNN-- 314
Query: 325 GICGILMQASYPV 337
CGI QASYP+
Sbjct: 315 --CGIASQASYPL 325
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 179/316 (56%), Gaps = 40/316 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNE 116
+ E + W+ ++SR Y E E Q RF ++ N+++I+ N + + ++KL N+FAD + E
Sbjct: 43 VAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTRE 102
Query: 117 EFISTYLGYNK----PYNE------PRWP-SVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
EFI+T+ G P +E P W +V + + DWR EGAVTPVK QGQCG
Sbjct: 103 EFIATHTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRETKDWRYEGAVTPVKYQGQCGC 162
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFS+VAAVEG+ K+ LVSLSEQ+L+DCD +N GCNGG M AF +I K G+
Sbjct: 163 CWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDN-GCNGGIMSDAFSYIIKNRGIA 221
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY----------------------AF 263
+E YPY+ C+ + + I G++ +P+ F
Sbjct: 222 SEASYPYQAAEGTCRYNGKP--SAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGF 279
Query: 264 QLYSHGVFDE-YCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
YS GV+DE YCG +NH VT VGYG G KYWL KNSWG +WGE GYIR+ R+
Sbjct: 280 MHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAW 339
Query: 322 SNIGICGILMQASYPV 337
G+CG+ A YPV
Sbjct: 340 PQ-GMCGVAQYAFYPV 354
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 142/350 (40%), Positives = 186/350 (53%), Gaps = 55/350 (15%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
FL+ +LG A A + + ++E + + Q+ ++Y SE E + R IY N
Sbjct: 4 FLILILGFVAAANAISIFE-----LVKEEWTAFKLQHRKKYDSETEERIRMKIYVQNKHK 58
Query: 93 IDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYN--------------KPYNEP-R 133
I N + F+L NK+ADL +EEF+ T G+N KP EP
Sbjct: 59 IAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSVSGKGQLLRGELKPIEEPVT 118
Query: 134 WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
W + +P ++DWR +GAVT VKDQG CGSCW+FSA A+EG + KTGKLVSLSEQ
Sbjct: 119 WIEPANVDVPTAMDWRTKGAVTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQN 178
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
LVDC N GCNGG M+ AF++I G+ TE YPY +D C + K T G
Sbjct: 179 LVDCSQKYGNNGCNGGMMDFAFQYIKDNKGIDTEKSYPYEAIDDECHYN-PKAVGATDKG 237
Query: 254 YEAIP-----------------------ARYAFQLYSHGVFDE-YC-GHQLNHGVTVVGY 288
+ IP + +FQ YS GV+ E C QL+HGV VGY
Sbjct: 238 FVDIPQGNEKALMKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGY 297
Query: 289 G-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G + GE YWLVKNSWGT+WG+ GY++MARN + CGI ASYP+
Sbjct: 298 GTTEDGEDYWLVKNSWGTTWGDQGYVKMARNRDNH----CGIATTASYPL 343
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 224 bits (570), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 132/320 (41%), Positives = 184/320 (57%), Gaps = 51/320 (15%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYID---YINSQ-NLSFKLTDNKFADLS 114
E+ ++++ + R YG +E QR+ ++ +N++ I+ Y++SQ S+++ N+FAD+
Sbjct: 41 EKLWQDFKTVHERNYGETEEMQRK-EVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADME 99
Query: 115 NEEFISTYLGY------------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
+EF S G+ + Y P P + LPA VDWRKEG VTP+KDQG
Sbjct: 100 VKEFASVVNGFRMNNRTKVRDHLHSHYISPAIP----VSLPAEVDWRKEGYVTPIKDQGH 155
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCW+FS A+EG + KTGKLVSLSEQ L+DC + N GCNGG M+ AF++I
Sbjct: 156 CGSCWSFSTTGALEGQHFRKTGKLVSLSEQNLIDCSTSYGNNGCNGGVMDYAFQYIKDND 215
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------A 259
G TED YPY + C+ K ++ T TGY +P +
Sbjct: 216 GDDTEDSYPYEAADGPCRF-KKEYVGATDTGYTDLPKGDEEKMKEAVAMVGPVSVAIDAS 274
Query: 260 RYAFQLYSHGVFDEY-CGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ+Y GV+DE C + L+HGV VVGYG + G+ YWLVKNSWGT WG+ GYI+M+R
Sbjct: 275 HTSFQMYQSGVYDEVECDPEGLDHGVLVVGYGTELGQDYWLVKNSWGTKWGDEGYIKMSR 334
Query: 318 NSPSSNIGICGILMQASYPV 337
N + CGI ASYP+
Sbjct: 335 NKNNQ----CGISSMASYPL 350
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 224 bits (570), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 138/313 (44%), Positives = 178/313 (56%), Gaps = 39/313 (12%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
+ F + K + +EY +E E R I+ N + I+ NS+ +SFKL N AD+
Sbjct: 25 DEFTLFKKFHRKEYDNELEESYRKKIFLENKKRIEKHNSRYKQGKVSFKLKLNHLADMLI 84
Query: 116 EEFISTYLGYNKPYNEPRWPSVQYLGLPAS-------VDWRKEGAVTPVKDQGQCGSCWA 168
E+ YLG+NK Y +P + VDWR +GAVTPVK+QG CGSCWA
Sbjct: 85 HEYSDVYLGFNKSSKANNNKLQSYTFIPPAHVTLNKEVDWRTKGAVTPVKNQGHCGSCWA 144
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS A+EG N KTGKLVSLSEQ LVDC + N GC GG M+ AF++I + G+ TE
Sbjct: 145 FSTTGALEGQNFRKTGKLVSLSEQNLVDCSGSYGNNGCEGGLMDNAFQYIKENHGIDTEK 204
Query: 229 DYPYRGKNDRCQTDKTKHHA-----VTIT-GYE---------------AIPARY-AFQLY 266
YPY G+++ C+ KT A V IT G E AI A + +FQ Y
Sbjct: 205 SYPYEGEDETCRFRKTSIGATDSGFVDITQGDEEALMQAVATIGPISVAIDASHQSFQFY 264
Query: 267 SHGVF--DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
S GV+ E L+HGV VVGYG + +KYWLVKNSWGT WG+ GYI+MAR+ ++
Sbjct: 265 SEGVYYEPECSSENLDHGVLVVGYGVEDNQKYWLVKNSWGTQWGDGGYIKMARDQDNN-- 322
Query: 325 GICGILMQASYPV 337
CGI QASYP+
Sbjct: 323 --CGIATQASYPL 333
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 178/321 (55%), Gaps = 46/321 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
++E + + Q+ + Y SE E + R IY N I N + ++L NK+ADL
Sbjct: 23 VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADL 82
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQY-----------LGLPASVDWRKEGAVTPVKDQGQ 162
+EEF+ T G+N+ ++ V+ + +P +VDWRK+GAVTPVKDQG
Sbjct: 83 LHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGH 142
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCW+FSA A+EG + KTGKLVSLSEQ LVDC N GCNGG M+ AF++I G
Sbjct: 143 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNG 202
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------A 259
G+ TE YPY +D C + K T GY IP +
Sbjct: 203 GIDTEKSYPYEAIDDTCHFN-PKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDAS 261
Query: 260 RYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMA 316
+FQ YS GV+ E C + L+HGV VGYG + GE YWLVKNSWGT+WG+ GY++MA
Sbjct: 262 HESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMA 321
Query: 317 RNSPSSNIGICGILMQASYPV 337
RN + CG+ ASYP+
Sbjct: 322 RNRDNH----CGVATCASYPL 338
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 179/316 (56%), Gaps = 40/316 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNE 116
+ E + W+ ++SR Y E E Q RF ++ N+++I+ N + + ++KL N+FAD + E
Sbjct: 19 VAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTRE 78
Query: 117 EFISTYLGYNK----PYNE------PRWP-SVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
EFI+T+ G P +E P W +V + + DWR EGAVTPVK QGQCG
Sbjct: 79 EFIATHTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAGRETKDWRYEGAVTPVKYQGQCGC 138
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFS+VAAVEG+ K+ LVSLSEQ+L+DCD +N GCNGG M AF +I K G+
Sbjct: 139 CWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDN-GCNGGIMSDAFSYIIKNRGIA 197
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY----------------------AF 263
+E YPY+ C+ + + I G++ +P+ F
Sbjct: 198 SEASYPYQAAEGTCRYNGKP--SAWIRGFQTVPSNNERALLEAVSKQPVSVSIDADGPGF 255
Query: 264 QLYSHGVFDE-YCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
YS GV+DE YCG +NH VT VGYG G KYWL KNSWG +WGE GYIR+ R+
Sbjct: 256 MHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEGIKYWLAKNSWGETWGENGYIRIRRDVAW 315
Query: 322 SNIGICGILMQASYPV 337
G+CG+ A YPV
Sbjct: 316 PQ-GMCGVAQYAFYPV 330
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 134/373 (35%), Positives = 195/373 (52%), Gaps = 67/373 (17%)
Query: 19 IDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDE 78
I +++M R + L + ++ I + + + FENW+ ++ ++Y E
Sbjct: 145 IFLKIMNRYINILLLIFGLIAISNALL-------FSEEQYKNEFENWIDRFEKKYDVS-E 196
Query: 79 WQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNK------PYNEP 132
+++RF I+ SN+ ++ NS+N L N ADL+N E+ YLG +K P N
Sbjct: 197 FKKRFSIFKSNMDFVHSWNSKNSQTVLGLNHLADLTNLEYRQFYLGTHKKAVLGTPGNHE 256
Query: 133 RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQ 192
G A+VDWR++GAV+P+KDQGQCGSCW+FS +VEG +++K+G +V LSEQ
Sbjct: 257 VSNLQSVFGDSATVDWRQKGAVSPIKDQGQCGSCWSFSTTGSVEGAHQIKSGNMVELSEQ 316
Query: 193 ELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN-DRCQTDKTKHHAVTI 251
LVDC + N GCNGG M+ AFE+I G+ TE YPY + C+ +K A TI
Sbjct: 317 NLVDCSTSEGNMGCNGGLMDYAFEYIITNNGIDTESSYPYTASSGTTCKYNKANSGA-TI 375
Query: 252 TGYEAIPA-----------------------RYAFQLYSHGV-FDEYCGH-QLNHGVTVV 286
+ Y+ I A +FQLYSHG+ +D C L+HGV VV
Sbjct: 376 SSYKNITAGSESDLADAVKNAGPVSVAIDASHNSFQLYSHGIYYDASCSSVNLDHGVLVV 435
Query: 287 GYGE----------------------DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
GYG D + YW+VKNSWGTSWG+ G+I M+++ ++
Sbjct: 436 GYGSGTPDSDSRVHKGSQVRVKVPKTDDTKNYWIVKNSWGTSWGDKGFIYMSKDRDNN-- 493
Query: 325 GICGILMQASYPV 337
CGI ASYP+
Sbjct: 494 --CGIASCASYPI 504
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 223 bits (569), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 133/323 (41%), Positives = 178/323 (55%), Gaps = 48/323 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
+ E + + ++S+ Y SE E + R IY N I N + +S+KL NK+AD+
Sbjct: 23 VREEWSAFKLEHSKRYDSEVEDKFRMKIYLENKHRIAKHNQRFEQGAVSYKLRPNKYADM 82
Query: 114 SNEEFISTYLGYNKPYNEPR-------------WPSVQYLGLPASVDWRKEGAVTPVKDQ 160
+ EF+ G+NK P+ + + ++ P VDWRK+GAVT VKDQ
Sbjct: 83 LSHEFVHVMNGFNKTLKHPKAVHGKGRESRPATFIAPAHVTYPDHVDWRKKGAVTEVKDQ 142
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G+CGSCWAFS A+EG + KTG LVSLSEQ L+DC N GCNGG M+ AF++I
Sbjct: 143 GKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKD 202
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------- 258
GG+ TE YPY G +D+C+ + K+ G+ IP
Sbjct: 203 NGGIDTEKAYPYEGVDDKCRYN-AKNSGADDVGFVDIPQGDEEKLMQAVATVGPVSVAID 261
Query: 259 -ARYAFQLYSHGV-FDEYCGH-QLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIR 314
++ +FQ YS GV +DE C L+HGV VVGYG D G YWLVKNSWG +WG+ GYI+
Sbjct: 262 ASQESFQFYSDGVYYDENCSSTDLDHGVMVVGYGTDEQGGDYWLVKNSWGRTWGDLGYIK 321
Query: 315 MARNSPSSNIGICGILMQASYPV 337
MARN + CGI ASYP+
Sbjct: 322 MARNKNNH----CGIASSASYPL 340
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 223 bits (569), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 136/338 (40%), Positives = 183/338 (54%), Gaps = 44/338 (13%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYS 87
A L+ L+ VL Q + S + ++ W + + Y E+E RR I++
Sbjct: 3 AFLACLLVAVL----------IAQCFSELSQDRQWHAWKDFHGKTYTGEEEDLRR-AIWN 51
Query: 88 SNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNE---PRWPSVQYLGLPA 144
N++ + N++N S+KL N FADL+ EF ++GY N + + + LPA
Sbjct: 52 DNLEIVKKHNAENHSYKLDMNHFADLTVTEFKQRFMGYRAASNSTGGSTFLPLSNVQLPA 111
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQ 204
VDWR +G VT VK+QGQCGSCWAFS+ ++EG + KTGKLVSLSEQ LVDC N
Sbjct: 112 EVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDCSKKYGNN 171
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE--------- 255
GC GG M+ AF++I G+ TE YPY ++ +C K T+TGY
Sbjct: 172 GCEGGLMDYAFKYIKNNDGIDTEQSYPYTARDGQCHF-KPGSVGATVTGYTDVQRGSEGD 230
Query: 256 -------------AIPARY-AFQLYSHGVFDE--YCGHQLNHGVTVVGYGEDHGEKYWLV 299
AI A + +FQLY GV+ E QL+HGV VGYG + G+ YWLV
Sbjct: 231 LQSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQLDHGVLAVGYGAEDGKDYWLV 290
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWG WG GYI+M+RN + CGI QASYP+
Sbjct: 291 KNSWGEGWGMNGYIKMSRNKDNQ----CGIATQASYPL 324
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 135/319 (42%), Positives = 181/319 (56%), Gaps = 44/319 (13%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLTDNKFADL 113
++E++ + + ++Y SE E + R I+ N + N +Q L SFKL NK++D+
Sbjct: 23 VQEQWGAFKVTHKKQYESETEERFRMKIFMENAHKVAKHNKLYAQGLVSFKLGVNKYSDM 82
Query: 114 SNEEFISTYLGYNKPYNEPRW----PSVQYLG-----LPASVDWRKEGAVTPVKDQGQCG 164
N EF+ T GYN+ R S+ ++ LP +DWRK GAVTPVKDQGQCG
Sbjct: 83 LNHEFVHTLNGYNRSKTPLRSGELDESITFIPPANVELPKQIDWRKLGAVTPVKDQGQCG 142
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCW+FS ++EG + K+ KLVSLSEQ L+DC N GCNGG M+ AF +I GG+
Sbjct: 143 SCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDCSEKYGNNGCNGGLMDNAFRYIKDNGGI 202
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGY--------EAIPARYA-------------- 262
TE YPY+ ++++C K ++ T G+ E + A A
Sbjct: 203 DTEQSYPYKAEDEKCHY-KPRNKGATDRGFVDIESGDEEKLKAAVATVGPISVAIDASHP 261
Query: 263 -FQLYSHGVF--DEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
FQ YS GV+ E QL+HGV VVGYG D G YWLVKNSWG SWG+ GYI+MARN
Sbjct: 262 TFQQYSEGVYYEPECSSEQLDHGVLVVGYGTDEDGNDYWLVKNSWGDSWGDQGYIKMARN 321
Query: 319 SPSSNIGICGILMQASYPV 337
++ CGI QASYP+
Sbjct: 322 RDNN----CGIATQASYPL 336
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 130/318 (40%), Positives = 178/318 (55%), Gaps = 41/318 (12%)
Query: 38 LGIPAGAWS-EGYPQKYDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYI 93
LG+ + +S GY Q D S+E FE+W+ ++ + Y + DE RF + N+ YI
Sbjct: 21 LGLSSADFSIVGYSQD-DLTSIESSIRLFESWMLKHDKVYKTIDEKIYRFETFKDNLMYI 79
Query: 94 DYINSQNLSFKLTDNKFADLSNEEFISTYLG-------YNKPYNEPRWPSVQYLGLPASV 146
D N +N S+ L N+FADL+++EF Y+G + ++ +P+ + P S+
Sbjct: 80 DETNKKNNSYWLGLNEFADLTHDEFKEKYVGSIPEDSMIIEQSDDVEFPNKHVVDYPESI 139
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGC 206
DWR++GAVTPVK+Q CGSCWAFS VA VEGINK+ TG L+SLSEQEL+DCD S GC
Sbjct: 140 DWRQKGAVTPVKNQNPCGSCWAFSTVATVEGINKIVTGNLISLSEQELLDCDRRS--HGC 197
Query: 207 NGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR------ 260
GGY + +++ GV TE +YPY K C+ K V I GY+ +P+
Sbjct: 198 KGGYQTTSLKYVVD-NGVHTEKEYPYEKKQGNCRAKNKKGLKVYINGYKRVPSNDEISLI 256
Query: 261 ----------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWG 304
FQ Y GVF CG +L+H VT VGYG+D Y L+KNSWG
Sbjct: 257 KTISIQPVSVLVESKGRPFQFYKGGVFGGPCGTKLDHAVTAVGYGKD----YILIKNSWG 312
Query: 305 TSWGEAGYIRMARNSPSS 322
WG+ GYI++ R S S
Sbjct: 313 PKWGDKGYIKIKRASGQS 330
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 134/293 (45%), Positives = 172/293 (58%), Gaps = 37/293 (12%)
Query: 76 EDEWQRRFGIYSSNVQYIDYINSQNLS---FKLTDNKFADLSNEEFISTYLGY---NKPY 129
++E RRF I+ N+ I+ N N S F L N+FAD++N EF + LG NK
Sbjct: 43 QEELIRRF-IFEDNLHTIEEFNRVNASLAGFTLGVNEFADMTNTEFSNMLLGLGGRNKIA 101
Query: 130 NEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
+ + S LPA VDW ++G VT VK+QGQCGSCWAFS ++EG KTGKLVSL
Sbjct: 102 GDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCGSCWAFSTTGSLEGQVFKKTGKLVSL 161
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQ LVDC + NQGCNGG M++AF +I K GG+ TE YPY G + C+ + K A
Sbjct: 162 SEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGIDTEAAYPYTGSDGTCRFLENKVGA- 220
Query: 250 TITGY------------EAI----PARYA-------FQLYSHGVFDE-YCGH-QLNHGVT 284
T++G+ EA+ P A FQ Y GV++ +C +L+HGV
Sbjct: 221 TVSGFVDVKSGDENALKEAVATVGPISVAIDASSIFFQFYRGGVYNPWFCSSTELDHGVL 280
Query: 285 VVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
VVGYG + G+ YWLVKNSWG+SWG GYI+M RN + CGI QASYP
Sbjct: 281 VVGYGTEGGKDYWLVKNSWGSSWGLKGYIKMVRNKKNR----CGIATQASYPT 329
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 178/321 (55%), Gaps = 46/321 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
++E + + Q+ + Y SE E + R IY N I N + ++L NK+ADL
Sbjct: 23 VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADL 82
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQY-----------LGLPASVDWRKEGAVTPVKDQGQ 162
+EEF+ T G+N+ ++ V+ + +P +VDWRK+GAVTPVKDQG
Sbjct: 83 LHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGH 142
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCW+FSA A+EG + KTGKLVSLSEQ LVDC N GCNGG M+ AF++I G
Sbjct: 143 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNG 202
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------A 259
G+ TE YPY +D C + K T GY IP +
Sbjct: 203 GIDTEKSYPYEAIDDTCHFN-PKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDAS 261
Query: 260 RYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMA 316
+FQ YS GV+ E C + L+HGV VGYG + GE YWLVKNSWGT+WG+ GY++MA
Sbjct: 262 HESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKMA 321
Query: 317 RNSPSSNIGICGILMQASYPV 337
RN + CG+ ASYP+
Sbjct: 322 RNHDNH----CGVATCASYPL 338
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 177/320 (55%), Gaps = 43/320 (13%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS---FKLTDNKFADLSNE 116
E F+ W +++ + Y E +++F + N++Y+ N + + + NKFAD+SNE
Sbjct: 49 ELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVGLNKFADMSNE 108
Query: 117 EFISTYLG-YNKPYNEPRWPSVQYLGL------------PASVDWRKEGAVTPVKDQGQC 163
EF Y+ KP ++ + G P S+DWRK G VT VKDQG C
Sbjct: 109 EFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGIVTGVKDQGDC 168
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFS+ A+EGIN L G L+SLSEQELVDCD S N GC GGYM+ AFE++ GG
Sbjct: 169 GSCWAFSSTGAIEGINALANGDLISLSEQELVDCD--STNDGCEGGYMDYAFEWVMSNGG 226
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA--------------------- 262
+ TE DYPY G++ C T K + AV+I GYE + +
Sbjct: 227 IDTETDYPYTGEDGTCNTTKEETKAVSIDGYEDVAEEESALFCAVLKQPISVGIDGGAID 286
Query: 263 FQLYSHGVF---DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
FQLY+ G++ ++H V VVGYG + GE+YW++KNSWGT WG GY + RN+
Sbjct: 287 FQLYTGGIYDGDCSDDPDDIDHAVLVVGYGAESGEEYWIIKNSWGTDWGMKGYAYIKRNT 346
Query: 320 PSSNIGICGILMQASYPVKR 339
S + G+C I ASYP K
Sbjct: 347 -SKDYGVCAINAMASYPTKE 365
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 130/318 (40%), Positives = 171/318 (53%), Gaps = 37/318 (11%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+ + E F W +++ R Y +E +RF I+ N++Y+ NS+ L NKFAD+SN
Sbjct: 40 ERVRELFHLWKERHKRVYKHAEETAKRFEIFKENLKYVIERNSKGHRHTLGMNKFADMSN 99
Query: 116 EEFISTYLGYNKPYNEP-----RWPSVQYLGL-----PASVDWRKEGAVTPVKDQGQCGS 165
EEF YL K R Q G P+S+DWRK+G VT +KDQG CGS
Sbjct: 100 EEFKEKYLSKIKKPINKKNNYLRRSMQQKKGTASCEAPSSLDWRKKGVVTGIKDQGDCGS 159
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFS+ A+EGIN + TG L+SLSEQELVDCD N GC GGYM+ AFE++ GG+
Sbjct: 160 CWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTT--NYGCEGGYMDYAFEWVISNGGID 217
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------ARYAFQ 264
+E DYPY G + C T K V+I GY+ + + FQ
Sbjct: 218 SESDYPYTGTDGTCNTTKEDTKVVSIDGYKDVDESDSALLCAAVNQPISVGMDGSALDFQ 277
Query: 265 LYSHGVF---DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
LY+ G++ ++H V +VGYG + E YW+ KNSWGTSWG GY + RN+
Sbjct: 278 LYTSGIYAGDCSDDPDDIDHAVLIVGYGSEDSEDYWICKNSWGTSWGMEGYFYIKRNTDL 337
Query: 322 SNIGICGILMQASYPVKR 339
G C I ASYP K
Sbjct: 338 P-YGECAINAMASYPTKE 354
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 129/338 (38%), Positives = 183/338 (54%), Gaps = 42/338 (12%)
Query: 31 SLFLLWVLGIPAGAWSEGYPQKYDP--QSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
SL L GIP+ + P + + E F+ W K++ + Y +E R +
Sbjct: 18 SLTFLSCYGIPSEYSILAFDLNKFPSEEQVVELFQQWKKEHQKFYIHPEEAALRLENFKR 77
Query: 89 NVQYI---DYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLPAS 145
N++YI + + + + L N+FAD+SNEEF + ++ + ++ P S
Sbjct: 78 NLKYIVERNAMRNSPVGHHLGLNRFADMSNEEFKNKFISKVESCDDA----------PYS 127
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
+DWRK+G VT VKDQG CGSCW+FS+ A+EG+N + TG L+SLSEQELVDCD + N G
Sbjct: 128 LDWRKKGVVTGVKDQGNCGSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCD--TTNDG 185
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------- 258
C GGYM+ AFE++ GG+ TE DYPY G C K + VTI GY +
Sbjct: 186 CEGGYMDYAFEWVINNGGIDTEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQSDSALF 245
Query: 259 --------------ARYAFQLYSHGVFDEYCG---HQLNHGVTVVGYGEDHGEKYWLVKN 301
+ FQLY+ G++D C ++H V +VGYG D + YW+VKN
Sbjct: 246 CATVKQPISVGIDGSTLDFQLYTGGIYDGDCSSNPDDIDHAVLIVGYGSDGNQDYWIVKN 305
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
SWGTSWG G+I + RN+ + G+C I AS+P K
Sbjct: 306 SWGTSWGIEGFIYIRRNT-NLKYGVCAINYMASFPTKE 342
>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
Length = 355
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 127/298 (42%), Positives = 176/298 (59%), Gaps = 34/298 (11%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
FE WL ++ + Y + E ++RF I+ +N+++ID NS N ++KL N FADL+N E+ +
Sbjct: 45 FEEWLVKHDKVYNALGEKEKRFQIFKNNLRFIDERNSLNRTYKLGLNVFADLTNAEYRAM 104
Query: 122 YLGY--NKPY----NEPRWPSVQYLG--LPASVDWRKEGAVTPVKDQG-QCGSCWAFSAV 172
YL + P PR V +G +P SVDWRKEGAVTPVK+QG C SCWAF+AV
Sbjct: 105 YLRTWDDGPRLDLDTPPRNRYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAV 164
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
AVE + K+KTG L+SLSEQE+VDC S ++GC GG ++ + +I K G++ E DYPY
Sbjct: 165 GAVESLVKIKTGDLISLSEQEVVDC-TTSSSRGCGGGDIQHGYIYIRK-NGISLEKDYPY 222
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGV 270
RG +C ++K K+ VTI G+ +P + Y FQ Y+ GV
Sbjct: 223 RGDEGKCDSNK-KNAIVTIDGHGWVPTQLEEALKQGIANQPVAVPIPADDYEFQYYTSGV 281
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
F CG +LNH + +VGYG + YW+ KNS+ WGE GYIR+ R + G G
Sbjct: 282 FKGKCGTELNHALLLVGYGAEKDGDYWIAKNSYSDKWGENGYIRIQRKLSTCKFGNGG 339
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 176/312 (56%), Gaps = 35/312 (11%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSN 115
S E +F +W+K+++ + + EW RF ++ N Q I+ N + SF + N+++ L+
Sbjct: 23 SYEAKFLSWMKKFAVKL-NPLEWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTF 81
Query: 116 EEF--ISTYLGYNKPYNEPRW------PSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+EF + T L + Y + R P+V +P +DW ++G VTPVK+QG CGSCW
Sbjct: 82 DEFKKLRTGLRVSPSYIQSRAKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCW 141
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS A+EG + + +LVS+SEQELVDCD N + GCNGG M+ AF+++ G+ E
Sbjct: 142 AFSTTGAIEGAAFVSSKQLVSVSEQELVDCDHNGD-MGCNGGLMDNAFKWVKTHKGLCKE 200
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA----------------------RYAFQL 265
+DYPY K C K K +T + +PA + FQ
Sbjct: 201 EDYPYHAKEGTCALKKCKP-VTKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQF 259
Query: 266 YSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
Y GVFD+ CG +L+HGV VVGYGE+ G+KYW VKNSWG WG+ GYI++AR G
Sbjct: 260 YKSGVFDKSCGTKLDHGVLVVGYGEEGGKKYWKVKNSWGADWGDKGYIKLAREF-GPETG 318
Query: 326 ICGILMQASYPV 337
CG+ M SYP
Sbjct: 319 QCGVAMVPSYPT 330
>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 126/332 (37%), Positives = 182/332 (54%), Gaps = 58/332 (17%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
M RF W+ +R Y + E RF +Y SN++YI+ +N++ +++L + F DL
Sbjct: 56 MMARFHVWMTVQNRSYPTSSEKAHRFKVYRSNMRYIEALNAEATTSGFTYELGEGPFTDL 115
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQ------------------------YLGLPASVDWR 149
++EEFIS Y G P ++ R V G P +DWR
Sbjct: 116 TDEEFISLYTG-KIPDDDHREDGVHDEQIITTHAGSVNGAEGVTVYANFSAGAPIRMDWR 174
Query: 150 KEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGG 209
K GAVTPVKDQG+CGSCWAF VA +EGI+K+K G+LVSLSEQ+LVDCD + GCNGG
Sbjct: 175 KRGAVTPVKDQGKCGSCWAFPTVATIEGIHKIKRGRLVSLSEQQLVDCDF--LDGGCNGG 232
Query: 210 YMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY-------- 261
+ AF++I + GG+TT Y Y+ +C+ ++ A ITGY + +
Sbjct: 233 WPRNAFQWIIQNGGITTTSSYTYKAAEGQCKGNRKP--AAKITGYRKVKSNSEVSMVNIV 290
Query: 262 --------------AFQLYSHGVFDEYCG-HQLNHGVTVVGYGED-HGEKYWLVKNSWGT 305
FQ Y G+++ C +LNH +T+VGYG+ +G KYW+VKNSWG
Sbjct: 291 ANQPIAASIVVHGGQFQHYKGGIYNGPCATSKLNHVITIVGYGQQAYGAKYWIVKNSWGA 350
Query: 306 SWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+WG GY+ M R + + +G CGI ++ +P+
Sbjct: 351 AWGNKGYMLMKRGTKNP-LGQCGIAVRPIFPL 381
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 134/322 (41%), Positives = 178/322 (55%), Gaps = 47/322 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
++E + + Q+ ++Y SE E + R IY N I N + F+L NK+ DL
Sbjct: 23 VKEEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDL 82
Query: 114 SNEEFISTYLGYNKP-YNEPRWPSVQY-----------LGLPASVDWRKEGAVTPVKDQG 161
+EEF+ T G+N+ +P V+ + +P +VDWR++GAVTPVKDQG
Sbjct: 83 LHEEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQG 142
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
CGSCW+FSA A+EG + KTGKLVSLSEQ LVDC N GCNGG M+ AF++I
Sbjct: 143 HCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDN 202
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------- 258
GG+ TE YPY +D C + K T G+ IP
Sbjct: 203 GGIDTEKAYPYEAIDDTCHYN-PKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDA 261
Query: 259 ARYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRM 315
+ +FQ YS GV+ E C + L+HGV VGYG + GE YWLVKNSWGT+WG+ GY++M
Sbjct: 262 SHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGEDYWLVKNSWGTTWGDQGYVKM 321
Query: 316 ARNSPSSNIGICGILMQASYPV 337
ARN + CGI ASYP+
Sbjct: 322 ARNRDNH----CGIATAASYPL 339
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 178/314 (56%), Gaps = 41/314 (13%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDY--INSQNLSFKLTDNKFADLS 114
S + +E+W +++++Y + E R+ I+ N + I+ NS F L NKF DL
Sbjct: 17 SFSQDWEDWKNEHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLE 76
Query: 115 NEEFISTYLGYNKPYNEPRWPSVQ-YLGLP-----ASVDWRKEGAVTPVKDQGQCGSCWA 168
+ EF + GY + R S + ++ P +VDWR +GAVT VK+QGQCGSCWA
Sbjct: 77 SHEFAEMFNGY---MMQARSNSTKVFVADPNYKADPTVDWRTKGAVTGVKNQGQCGSCWA 133
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FS ++EG + LKTGKLVSLSEQ LVDC N+GCNGG M++AFE+I K GG+ TE
Sbjct: 134 FSTTGSLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYIKKNGGIDTEA 193
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGY----------------------EAIPARY-AFQL 265
YPY+ ++RC+ K T TGY AI A + +FQL
Sbjct: 194 SYPYQAHDERCRF-KASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSFQL 252
Query: 266 YSHGVFDEYCGHQ--LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
Y GV+ E Q L+HGV +GYG + G YWLVKNSWGT WG GYI M+RN ++
Sbjct: 253 YRSGVYYERECSQTALDHGVLAIGYGTEGGSDYWLVKNSWGTDWGMEGYIMMSRNRNNN- 311
Query: 324 IGICGILMQASYPV 337
CGI +ASYP
Sbjct: 312 ---CGIATEASYPT 322
>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/315 (41%), Positives = 191/315 (60%), Gaps = 34/315 (10%)
Query: 51 QKYDPQSMEER-FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDN 108
Q Y P + E+ F N++ +Y + YG+++E+ R ++ N+ + N +N ++++L N
Sbjct: 31 QLYTPITPEDHAFTNFVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNVRNDVTYRLGLN 90
Query: 109 KFADLSNEEFISTYLGYNKPYNE-PRWPSVQYLGLPAS--VDWRKEGAVTPVKDQGQCGS 165
KFAD + E+ LG+ N+ PR +++ LG P + V+W ++GAVTPVKDQGQCGS
Sbjct: 91 KFADYTEAEY-KRLLGFGGQKNKNPR--NIKVLGAPKNDGVNWVEQGAVTPVKDQGQCGS 147
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CW+FSA A+EG K++ G L SLSEQ+LVDC N+GC GG+M++AF+++ + +
Sbjct: 148 CWSFSATGAMEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQT-ALE 206
Query: 226 TEDDYPYRGKNDRCQ------------TDKTKHHAVTITGY-------EAIPA-RYAFQL 265
TED YPY +D C+ D T ++ + AI A + FQ
Sbjct: 207 TEDQYPYEAVDDTCRASSAGVVKVDSFVDVTPNNVNELKAALDKGPVSVAIEADQMVFQF 266
Query: 266 YSHGVF-DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
YS GV D CG L+HGV VGYG + G+ Y+LVKNSWG SWGE GY+++A SP +
Sbjct: 267 YSGGVINDASCGTTLDHGVLAVGYGNESGQDYFLVKNSWGASWGEEGYVKIAA-SPDN-- 323
Query: 325 GICGILMQASYPVKR 339
ICGIL QASYP+ +
Sbjct: 324 -ICGILSQASYPIMK 337
>gi|414887427|tpg|DAA63441.1| TPA: hypothetical protein ZEAMMB73_713985 [Zea mays]
Length = 355
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 133/335 (39%), Positives = 185/335 (55%), Gaps = 52/335 (15%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYS 87
A+L L+ VL AGA S G D M +RF W Y+R Y + E RRF +Y
Sbjct: 9 ALLCACLMLVL--MAGAASGGRVDVED-MLMMDRFRAWQATYNRSYLTAAERLRRFEVYR 65
Query: 88 SNVQYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNK------------------- 127
N++ I+ N + LS++L++ F DL++EEF++T+ +
Sbjct: 66 QNMELIEATNRRAELSYQLSETPFTDLTSEEFLATHTMSTRLHASEAARRHRELITTHAG 125
Query: 128 PYNE--PRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLK 182
P ++ +W Y L +P SVDWR +GAVT VKDQG CG CW+F+ VAA+EG++K++
Sbjct: 126 PVSDGGRQWNRRNYTTDLDVPESVDWRTKGAVTTVKDQGACGGCWSFATVAAIEGLHKIR 185
Query: 183 TGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTD 242
TG+LVSLSEQE++DC + N GC+GG A ++++ GG+TTE DYPY G+ +C+ D
Sbjct: 186 TGQLVSLSEQEVLDCS-SPPNNGCHGGNPAAAIDWVSANGGLTTESDYPYEGRQGKCKLD 244
Query: 243 KTKHHAVTITGYEAI---------------PARYAF------QLYSHGVFDEYCGHQ-LN 280
K ++H I G + + P Q Y GVF C + LN
Sbjct: 245 KARNHVAKIRGRKLVDQNNEAALEVAVAQQPVAVGMNVHPIQQHYKSGVFHGPCDPEDLN 304
Query: 281 HGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIR 314
H VT+VGYG E G KYW+VKNSWG WGE GY R
Sbjct: 305 HAVTMVGYGAESGGRKYWIVKNSWGEKWGEKGYFR 339
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 180/321 (56%), Gaps = 46/321 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
++E++ + Q+ ++Y S+ E + R I+ N + N +S+KL NK+AD+
Sbjct: 23 VQEQWGTFKLQHKKQYKSDTEEKFRMKIFMENSHKVAKXNKLYEMGLVSYKLKINKYADM 82
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQ-----------YLGLPASVDWRKEGAVTPVKDQGQ 162
+ EF+ T G+N+ N P + + + P +VDWR+ GAVT VKDQG
Sbjct: 83 LHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFIAPANVKFPENVDWREHGAVTXVKDQGH 142
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCW+FSA A+EG + KT KLVSLSEQ LVDC N GCNGG M+ AF+++
Sbjct: 143 CGSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDCSTKFGNDGCNGGLMDNAFKYVKYNH 202
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------A 259
G+ TE YPY +++C + K T G+ IP +
Sbjct: 203 GIDTEASYPYHADDEKCHYN-PKTSGATDRGFVDIPTGDEEKLMAAVATVGPVSVAIDAS 261
Query: 260 RYAFQLYSHGV-FDEYC-GHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMA 316
+FQLYS GV +D C +L+HGV VVGYG D +G+ YW+VKNSWG SWGE GYI+MA
Sbjct: 262 HESFQLYSEGVYYDPECSSEELDHGVLVVGYGTDENGQDYWIVKNSWGESWGEQGYIKMA 321
Query: 317 RNSPSSNIGICGILMQASYPV 337
RN ++ CGI QASYP+
Sbjct: 322 RNRDNN----CGIATQASYPL 338
>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 360
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 130/325 (40%), Positives = 181/325 (55%), Gaps = 49/325 (15%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNE 116
M +RF W +++ Y S +E RRF +Y NV+YI+ N + +L+++L +N+FADL+ E
Sbjct: 38 MMDRFLMWQATHNQSYRSAEERLRRFQVYRDNVEYIETTNRRGDLTYQLGENQFADLTRE 97
Query: 117 EFISTYLGYNKPY------------------NEPRWPSV-QYLGL-PASVDWRKEGAVTP 156
EFI+ + YN + W S + L P SVDWR +GAV P
Sbjct: 98 EFIARFTSYNGDDDRTGDDDSVITTAAVGGGDPDLWSSGGDDVSLDPPSVDWRAKGAVVP 157
Query: 157 VKDQGQCGSC-WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAF 215
K Q S WAF AVA +E ++ +KTGKLV+LSEQ+LVDCD + GCN G +AF
Sbjct: 158 PKSQSSSCSSSWAFVAVATIESLHAIKTGKLVALSEQQLVDCD--QYDGGCNRGTFRRAF 215
Query: 216 EFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA------RYA------- 262
++ + GG+TTE +YPY C + K+ HH I+G+ ++P ++A
Sbjct: 216 HWVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHVAAISGHASVPGSNELAMKHAVATQPVA 275
Query: 263 --------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDH--GEKYWLVKNSWGTSWGEAGY 312
Q Y GV+ CG +L H VTVVGYG D G+KYW+VKNSWG +WGE GY
Sbjct: 276 AAIELGSDMQFYKSGVYSGPCGARLEHAVTVVGYGADESTGDKYWIVKNSWGQTWGERGY 335
Query: 313 IRMARNSPSSNIGICGILMQASYPV 337
IRM R G+CGI++ +YP
Sbjct: 336 IRMQRKILGP--GLCGIMLDVAYPT 358
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 222 bits (566), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 130/307 (42%), Positives = 177/307 (57%), Gaps = 45/307 (14%)
Query: 67 KQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-------SFKLTDNKFADLSNEEFI 119
KQY++ Y +E+E +RR ++ SN +D+I NL +F + N++ D++NEEF
Sbjct: 32 KQYNKLYQNEEEARRRL-VWESN---LDFITLHNLAADRGEHTFWVGMNEYGDMTNEEFT 87
Query: 120 STYLGY---NKPYNEPRWPSVQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
T GY NK N P + +G LP +VDWR +G VTP+K+QGQCGSCW+FSA ++
Sbjct: 88 KTMNGYRMRNKTSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSL 147
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EG KTGKLVSLSEQ LVDC N GC GG M+ AF +I G+ TE YPY+ +
Sbjct: 148 EGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYIKANNGIDTEASYPYKAR 207
Query: 236 NDRCQTDKTKHHAVTITGYEAIPAR-----------------------YAFQLYSHGVF- 271
+ +C+ K+ T TG+ I + +FQLY GV+
Sbjct: 208 DGKCEF-KSADVGATDTGFVDIKTKDEEALKQAVATVGPISVAIDASHMSFQLYRTGVYH 266
Query: 272 DEYCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
D +C +L+HGV VGYG + + YWLVKNSWG SWG+ GYI+M+RN ++ CGI
Sbjct: 267 DWFCSQTKLDHGVLAVGYGTEDSKDYWLVKNSWGESWGQKGYIQMSRNRRNN----CGIA 322
Query: 331 MQASYPV 337
ASYP
Sbjct: 323 TSASYPT 329
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 174/310 (56%), Gaps = 45/310 (14%)
Query: 68 QYSREYGSEDEWQRRFGIYSSNVQYIDYINS----QNLSFKLTDNKFADLSNEEFISTYL 123
++ + Y S+ E + R I+ N I NS + +S+KL NK+ D+ + EF++
Sbjct: 40 EHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILN 99
Query: 124 GYNKPYN----EPRWP------SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
G+NK N R P + LP VDWRKEGAVTPVKDQG CGSCW+FSA
Sbjct: 100 GFNKSINTQLRSERLPVGASFIEPANVVLPKKVDWRKEGAVTPVKDQGHCGSCWSFSATG 159
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
A+EG + +TG LVSLSEQ L+DC N GCNGG M++AF++I G+ TE YPY
Sbjct: 160 ALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYE 219
Query: 234 GKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGV 270
+ND+C+ + A+ + GY IP + +FQ YS GV
Sbjct: 220 AENDKCRYNPANSGAIDV-GYIDIPTGDEKLLKAAVATIGPVSVAIDASHQSFQFYSEGV 278
Query: 271 F--DEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
+ E +L+HGV V+GYG ++G+ YWLVKNSWG +WG GYI+MARN + C
Sbjct: 279 YYEPECSSEELDHGVLVIGYGTNENGQDYWLVKNSWGETWGNNGYIKMARNK----LNHC 334
Query: 328 GILMQASYPV 337
GI ASYP+
Sbjct: 335 GIASSASYPL 344
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 137/345 (39%), Positives = 189/345 (54%), Gaps = 49/345 (14%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
L +L + A A + Y ++E ++ + ++ + Y E E + R I++ N
Sbjct: 3 ILFALLALVAVAQAVSYAD-----VIKEEWQTFKLEHRKNYVDETEERFRLKIFNENKHK 57
Query: 93 IDYIN----SQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN------EPRWPSVQYLG- 141
I N S +SFK+ NK+AD+ + EF +T G+N + +P + V ++
Sbjct: 58 IAKHNQRYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLRASDPSFVGVTFISP 117
Query: 142 ----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+P SVDWR +GAVT VKDQG CGSCWAFS+ A+EG + K G L+SLSEQ LVDC
Sbjct: 118 EHVKIPKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDC 177
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ----------------- 240
N GCNGG M+ AF +I GG+ TE YPY G +D C
Sbjct: 178 STKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDRGSVDIP 237
Query: 241 --TDKTKHHAVTITG--YEAIPARY-AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH- 292
+K AV G AI A + +FQ YS G+++E C Q L+HGV VVGYG D
Sbjct: 238 QGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGTDES 297
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G+ YWLVKNSWGT+WG+ G+I+MARN+ + CGI +SYP+
Sbjct: 298 GQDYWLVKNSWGTTWGDKGFIKMARNADNQ----CGIASASSYPL 338
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 122/283 (43%), Positives = 160/283 (56%), Gaps = 40/283 (14%)
Query: 53 YDPQSME------ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLT 106
Y P+ +E E FENW+ + + Y + +E RF ++ N+++ID N + S+ L
Sbjct: 36 YSPEDLESHDKLIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLG 95
Query: 107 DNKFADLSNEEFISTYLGYN---------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPV 157
N+FADLS+EEF YLG + Y E + V+ +P SVDWRK+GAV V
Sbjct: 96 LNEFADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVE--AVPKSVDWRKKGAVAEV 153
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
K+QG CGSCWAFS VAAVEGINK+ TG L +LSEQEL+DCD + N GCNGG M+ AFE+
Sbjct: 154 KNQGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDT-TYNNGCNGGLMDYAFEY 212
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------- 260
I K GG+ E+DYPY + C+ K + VTI G++ +P
Sbjct: 213 IVKNGGLRKEEDYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVA 272
Query: 261 -----YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
FQ YS GVFD CG L+HGV VGYG G Y +
Sbjct: 273 IDASGREFQFYSGGVFDGRCGVDLDHGVAAVGYGSSKGSDYII 315
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 135/307 (43%), Positives = 173/307 (56%), Gaps = 44/307 (14%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYI-----DYINSQNLSFKLTDNKFADLSNEEFISTYL 123
+ ++Y S+ E R IY N I Y SQ +S+KL N+F DL + EF+ST
Sbjct: 34 HGKDYASDTEEYYRLKIYMENRLKIARHNEKYAKSQ-VSYKLAMNEFGDLLHHEFVSTRN 92
Query: 124 GYNKPYNE-PRWPSV-------QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
G+ + Y + PR S + L LP +VDWRK+GAVTPVK+QGQCGSCWAFS ++
Sbjct: 93 GFKRNYRDSPREGSFFVEPEGFEDLQLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSL 152
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EG + KT KLVSLSEQ LVDC + N GC GG M+ AF++I G+ TE YPY
Sbjct: 153 EGPHFRKTRKLVSLSEQNLVDCSRSFGNNGCEGGLMDNAFKYIKSNKGIDTEWSYPYNAT 212
Query: 236 NDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGVFD 272
+ C +++ A T TG+ IP + +FQ YS GV+D
Sbjct: 213 DGVCHFNRSDVGA-TDTGFVDIPEGDENKLKKAVAAVGPVSVAIDASHESFQFYSEGVYD 271
Query: 273 --EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
E QL+HGV VVGYG G+ YWLVKNSWGT+WG+ GYI M RN + CGI
Sbjct: 272 EPECSSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEGYIYMTRNKDNQ----CGIA 327
Query: 331 MQASYPV 337
ASYP+
Sbjct: 328 SSASYPL 334
>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
Length = 319
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 128/286 (44%), Positives = 164/286 (57%), Gaps = 33/286 (11%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS-QNLSFKLTDNKFADLSNE 116
+++ F ++KQYS+ Y S E+ RF + ++V+ I N+ N S+ + N+FADLS E
Sbjct: 38 LQDMFTAFMKQYSKAY-SHAEFSSRFNQFKASVETIRLHNTLANASYTMGLNEFADLSFE 96
Query: 117 EFISTYLG---YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
EF Y G + + + P S+DWR AVTP+KDQGQCGSCWAFSA
Sbjct: 97 EFKGKYFGCKHVEREFARSNNLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATG 156
Query: 174 AVEGINKLKTGK--LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
++EG L+ GK L SLSEQ+LVDC + N GCNGG M+ AFE+I G+ E YP
Sbjct: 157 SIEGAWVLQ-GKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYP 215
Query: 232 YRGKNDRCQTDKTKHHAVTITGYE----------------------AIPARYA-FQLYSH 268
Y+G CQ TK VTI+G++ AI A A FQ YS
Sbjct: 216 YKGVGGLCQKSCTK--VVTISGHKDVASGDEASSLNAVGTVGPVSVAIEADQAGFQFYSS 273
Query: 269 GVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIR 314
GVF CGH L+HGV VGYG + YW+VKNSWGTSWGE+GYIR
Sbjct: 274 GVFSGTCGHNLDHGVLAVGYGTTGSQDYWIVKNSWGTSWGESGYIR 319
>gi|357139514|ref|XP_003571326.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 363
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 121/338 (35%), Positives = 175/338 (51%), Gaps = 48/338 (14%)
Query: 46 SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS--- 102
+ G P D + +R+ W +YS+ Y S +E ++RFG++ N I ++ +
Sbjct: 27 AAGKPAADDDSELRQRWSKWQAKYSKRYPSHEEQEKRFGVFRDNSNSIGAFSAPQTTTSA 86
Query: 103 -------------FKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG----LPAS 145
++ N+F DL E + + G+N + P L P
Sbjct: 87 VVGSFGAPQTVTTVRVGMNRFGDLQPREVLDQFTGFNNTAAVLKTPPPTRLPHHSRKPCC 146
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
VDWR GAVT VK QG C SCWAF+AVAA+EG+NK++TG LVSLSEQ+LVDCD S G
Sbjct: 147 VDWRSSGAVTGVKFQGSCQSCWAFAAVAAIEGMNKIRTGTLVSLSEQQLVDCDNGSS--G 204
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK-HHAVTITGYEAIP------ 258
C GG + A + + + GG+T+ + Y Y G N RC+ DK H + G++A+P
Sbjct: 205 CAGGRTDTALDLVARRGGITSGERYAYGGFNGRCKVDKLLFDHGAAVGGFKAVPPNDEHQ 264
Query: 259 ----------------ARYAFQLYSHGVFDEYCG---HQLNHGVTVVGYGEDHGEKYWLV 299
+ + FQ YS G+F C ++NH VT+VGY E+ G+K+W+
Sbjct: 265 LAMAVARQPVTAYVDASTWEFQFYSGGIFRGPCSGDPARVNHAVTIVGYCEEFGDKFWIA 324
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSW WG+ GYI +A++ SS G CG+ YP
Sbjct: 325 KNSWSDDWGDQGYILLAKDVLSSPNGTCGLATSPFYPT 362
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 134/320 (41%), Positives = 177/320 (55%), Gaps = 45/320 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS----QNLSFKLTDNKFADL 113
++E ++ + ++ + Y SE E + R I++ N I N +SFKL NK+AD+
Sbjct: 23 IKEEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADM 82
Query: 114 SNEEFISTYLGYN----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQC 163
+ EF T GYN + +N + S + +P +VDWR+ GAVT VKDQG C
Sbjct: 83 LHHEFKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQGHC 142
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCW+FS+ ++EG + K G LVSLSEQ LVDC N GCNGG M+ AF +I GG
Sbjct: 143 GSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 202
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------AR 260
V TE YPY G +D C +K A T TG+ IP +
Sbjct: 203 VDTEKSYPYEGIDDSCHFNKATVGA-TDTGFVDIPQGDEEAMMKAVATMGPVAVAIDASN 261
Query: 261 YAFQLYSHGVF-DEYC-GHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQLYS GV+ D C L+HGV VVGYG D G+ YWLVKNSWGT+WG+ GYI+MAR
Sbjct: 262 ESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDKDGQDYWLVKNSWGTTWGDQGYIKMAR 321
Query: 318 NSPSSNIGICGILMQASYPV 337
N + CGI +S+P
Sbjct: 322 NQDNQ----CGIATASSFPT 337
>gi|310942960|pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 110/219 (50%), Positives = 136/219 (62%), Gaps = 24/219 (10%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP VDWR GAV +KDQGQCGSCWAFS +AAVEGINK+ TG L+SLSEQELVDC
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSCWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP--- 258
+GC+GG+M F+FI GG+ TE +YPY + +C D + V+I YE +P
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120
Query: 259 -------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
A Y FQ YS G+F CG ++H VT+VGYG + G YW+V
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
KNSWGT+WGE GY+R+ RN +G CGI +ASYPVK
Sbjct: 181 KNSWGTTWGEEGYMRIQRN--VGGVGQCGIAKKASYPVK 217
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 127/306 (41%), Positives = 174/306 (56%), Gaps = 36/306 (11%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEF 118
E + ++ R+Y +E + R ++ N+QYI+ N + +++ L N+F+D++NE+F
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80
Query: 119 ISTYLGYNK-PYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEG 177
+ GY K P + S VDWR +GAVTPVKDQGQCGSCWAFS +EG
Sbjct: 81 NAVMKGYKKGPRPAAVFTSTDAAPESTEVDWRTKGAVTPVKDQGQCGSCWAFSTTGGIEG 140
Query: 178 INKLKTGKLVSLSEQELVDCDVNS-ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
+ LKTG+LVSLSEQ+LVDC S NQGCNGG++E+A ++ GGV TE YPY ++
Sbjct: 141 QHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTESSYPYEARD 200
Query: 237 DRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQLYSHGVFDE 273
+ C+ + A T TGY I + +FQ Y GV+ E
Sbjct: 201 NTCRFNSNTIGA-TCTGYVGIAQGSESALKTATRDIGPISVAIDASHRSFQSYYTGVYYE 259
Query: 274 --YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
QL+H V VGYG + G+ +WLVKNSW TSWGE+GYI+MARN ++ CGI
Sbjct: 260 PSCSSSQLDHAVLAVGYGSEGGQDFWLVKNSWATSWGESGYIKMARNRNNN----CGIAT 315
Query: 332 QASYPV 337
A YP
Sbjct: 316 DACYPT 321
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 123/327 (37%), Positives = 192/327 (58%), Gaps = 32/327 (9%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER---FENWLKQYSREYGSE-DEWQRRFG 84
+++L LL + +P + + +S EE F+ W+ ++ + Y + + ++RF
Sbjct: 9 MITLSLLIIFLLPPSSAMDLSVTSGGLRSNEEVGFIFQTWMSKHGKTYTNALGDKEQRFQ 68
Query: 85 IYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL---- 140
+ N+++ID N++NLS++L +FADL+ +E+ + G +P + + V +
Sbjct: 69 NFKDNLRFIDQHNAKNLSYRLGLTQFADLTVQEYQDLFSG--RPIQKQKALRVTHRYVPL 126
Query: 141 ---GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
LP SVDWR++GAV+ +KDQG+C VE INK+ TG+L+SLSEQELVDC
Sbjct: 127 AEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSEQELVDC 176
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDK-TKHHAVTITGYEA 256
+ +N GCNGG M+ AF+F+ G+ + DYPY+ C ++ T + I GYE
Sbjct: 177 SI--DNHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKIDGYED 234
Query: 257 IPARYAFQL-----YSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAG 311
+PA L + G++ CG L+H V +VGYG ++G+ YW+V+NSWGT WGEAG
Sbjct: 235 VPANNENSLQKAVAHQPGIYTGPCGTDLDHAVVIVGYGTENGQDYWIVRNSWGTVWGEAG 294
Query: 312 YIRMARNSPSSNIGICGILMQASYPVK 338
Y ++ARN + G+CGI M ASYP+K
Sbjct: 295 YAKIARNFENPT-GVCGIAMVASYPIK 320
>gi|357127811|ref|XP_003565571.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 364
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 123/329 (37%), Positives = 177/329 (53%), Gaps = 49/329 (14%)
Query: 55 PQS-MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS----------- 102
P+S + +R+ NW +YS+ Y S +E ++RFG++ N+ I ++ +
Sbjct: 38 PESELRQRWTNWQAKYSKTYPSHEEQEKRFGVFRGNINNIGAFSAAQTTTTAVVGSFGAP 97
Query: 103 -----FKLTDNKFADLSNEEFISTYLGYNKP--YNEPRWPSVQYLGL-PASVDWRKEGAV 154
++ N+F DL E + + G+N P+ + Y P VDWR GAV
Sbjct: 98 QTVTTVRVGMNRFGDLQPSEVLEQFTGFNSTVVLKTPKPTRLPYHSRKPCCVDWRSSGAV 157
Query: 155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
T VK QG C SCWAF+AVAA+EG+NK++TG LVSLSEQ+LVDCD S GC GG + A
Sbjct: 158 TGVKFQGSCLSCWAFAAVAAIEGMNKIRTGTLVSLSEQQLVDCDKGSS--GCAGGRTDTA 215
Query: 215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTK-HHAVTITGYEAIP--------------- 258
+ + K GG+T+E+ YPY G N +C DK HA + G++A+P
Sbjct: 216 LDLVAKRGGITSEEKYPYGGFNGKCNVDKLLFEHAAIVKGFKAVPPNDEHQLALAVAQQP 275
Query: 259 -------ARYAFQLYSHGVFDEYCG---HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWG 308
+ + FQ YS G+F C ++NH VT+VGY ED GEK+W+ KNSW WG
Sbjct: 276 VTVYVDASTWEFQFYSGGIFRGPCSTDPARVNHAVTIVGYCEDFGEKFWIAKNSWSNDWG 335
Query: 309 EAGYIRMARNSPSSNIGICGILMQASYPV 337
+ GYI +A++ + G C + YP
Sbjct: 336 DQGYIYLAKDV-AWPTGTCSLASSPFYPT 363
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 177/319 (55%), Gaps = 44/319 (13%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS----QNLSFKLTDNKFADL 113
++E ++ + ++ + + SE E + R I++ N I N +SFKL NK++D+
Sbjct: 23 IKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYSDM 82
Query: 114 SNEEFISTYLGYN----KPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCG 164
EF T GYN K + + Y+ +P SVDWR+ GAVT VKDQG CG
Sbjct: 83 LYHEFKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQIPKSVDWRQHGAVTAVKDQGHCG 142
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS+ AA+EG + K G LVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 143 SCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 202
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARY 261
TE YPY G +D C K+ A T TG+ IP +
Sbjct: 203 DTEKSYPYEGIDDSCHFTKSGVGA-TDTGFVDIPQGDEEALMKAVATMGPVSVAIDASHE 261
Query: 262 AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQLYS GV++E C Q L+HGV VVGYG D G YWLVKNSWGT+WG+ GYI+MARN
Sbjct: 262 SFQLYSEGVYNEPECDAQNLDHGVLVVGYGTDKTGLDYWLVKNSWGTTWGDQGYIKMARN 321
Query: 319 SPSSNIGICGILMQASYPV 337
+ CGI +SYP
Sbjct: 322 QDNQ----CGIATASSYPT 336
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 133/325 (40%), Positives = 178/325 (54%), Gaps = 48/325 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNE 116
M +RF + Y+R Y S +E RRF +Y NV YI+ +N + +L+++L +N+FADL+ +
Sbjct: 36 MMDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQ 95
Query: 117 EFISTYLGYNKPYNEP-RWPSVQYLGL---------------------PASVDWRKEGAV 154
EF + Y + + P W Q + P SVDWR +GAV
Sbjct: 96 EFRAMYTMPARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVDWRSKGAV 155
Query: 155 TPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKA 214
TPVKDQG CG CWAF+ VA +EG++K+KTG+LVSLSEQELVDCD + G E A
Sbjct: 156 TPVKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDADDGCGGG--LPEIA 213
Query: 215 FEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE------------------- 255
E++ GG+TTE +YPY GK +C K +HA I +
Sbjct: 214 MEWVAHNGGLTTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERAVARQPV 273
Query: 256 --AIPARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGY 312
AI A + Y GV+ C + +H VTVVGYG D+ G KYW++KNSW +WGE GY
Sbjct: 274 AVAINAPDSLMFYKSGVYSGPCTAEFDHAVTVVGYGADNKGHKYWIIKNSWAETWGEKGY 333
Query: 313 IRMARNSPSSNIGICGILMQASYPV 337
RM R + G+CGI ASYPV
Sbjct: 334 GRMQRGVAAKE-GLCGIATHASYPV 357
>gi|405971603|gb|EKC36430.1| Cathepsin L [Crassostrea gigas]
Length = 360
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 133/315 (42%), Positives = 184/315 (58%), Gaps = 38/315 (12%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS----QNLSFKLTDNKFA 111
Q E+ ++ + + + Y + +E RRF I+ NVQ I+ N S+ L N+F+
Sbjct: 50 QPYEQAWKEFKILHDKTYDALEEESRRFEIFRENVQKIEEHNKLYHLGKKSYYLGVNQFS 109
Query: 112 DLSNEEFISTYLGYNKPYNE----PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
DL +EEF+ Y G K + + + L P SVDWRK+G VT VK+QGQCGSCW
Sbjct: 110 DLKHEEFVK-YNGLKKTSLKDGGCSSYLAANNLVEPDSVDWRKKGYVTDVKNQGQCGSCW 168
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
+FS ++EG + K+GKLVSLSE +LVDC + N+GCNGG M+ AF++I +GG+ +E
Sbjct: 169 SFSTTGSLEGQHFRKSGKLVSLSESQLVDCSQSFGNEGCNGGLMDNAFKYIKSVGGLESE 228
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTI------TGYE---------------AIPARY-AFQL 265
+DYPY+ K C+ D TK A +G E AI A + +FQ
Sbjct: 229 EDYPYKPKQGTCKFDDTKVAATDTGCVDVESGSESALKKAVSEVGPVSVAIDASHSSFQS 288
Query: 266 YSHGVFD--EYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
Y+ GV+D E QL+HGV VGYG +D G+ YW+VKNSWG WGE GY++M+RN +
Sbjct: 289 YAGGVYDEPECSSEQLDHGVLCVGYGTDDQGQDYWIVKNSWGAEWGEDGYVKMSRNKKNQ 348
Query: 323 NIGICGILMQASYPV 337
CGI QASYP+
Sbjct: 349 ----CGIATQASYPL 359
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 120/320 (37%), Positives = 181/320 (56%), Gaps = 41/320 (12%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSN 115
+S+ + ++ W + R + +E RF ++ +N +++ +N S KL N+FAD+S+
Sbjct: 35 KSLMQLYKRW-SSHHRISRNANEMHNRFKVFKNNAKHVFKVNLMGKSLKLKLNQFADMSD 93
Query: 116 EEFISTYLGYNKPYNEPRWPSVQYLG-------------LPASVDWRKEGAVTPVKDQGQ 162
+EF + Y Y + ++ G +P+S+DWRK+GAV +K+QG+
Sbjct: 94 DEFRNMYSSNITYYKDLHAKKIEATGGRIGGFMYEHANNIPSSIDWRKKGAVNAIKNQGR 153
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAF+AVAAVE I+++KT +LVSLSE+E++DCD + GC GG+ AFEF+
Sbjct: 154 CGSCWAFAAVAAVESIHQIKTNELVSLSEEEVLDCDY--RDGGCRGGFYNSAFEFMMDND 211
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA--RYA------------------ 262
GVT ED+YPY N C+ ++ V I GYE +P YA
Sbjct: 212 GVTIEDNYPYYEGNGYCRRRGGRNKRVRIDGYENVPRNNEYALMKAVAHQPVAVAIASGG 271
Query: 263 --FQLYSHGVFDE--YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
F+ Y G+F E +CG ++H V VVGYG D YW+++N +G WG GY++M R
Sbjct: 272 SDFKFYGGGMFTENDFCGFNIDHTVVVVGYGTDEDGDYWIIRNQYGHRWGMNGYMKMQRG 331
Query: 319 SPSSNIGICGILMQASYPVK 338
+ S G+CG+ MQ +YPVK
Sbjct: 332 AHSPQ-GVCGMAMQPAYPVK 350
>gi|2098464|pdb|1PCI|A Chain A, Procaricain
gi|2098465|pdb|1PCI|B Chain B, Procaricain
gi|2098466|pdb|1PCI|C Chain C, Procaricain
Length = 322
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 138/323 (42%), Positives = 187/323 (57%), Gaps = 39/323 (12%)
Query: 48 GYPQKYDPQSMEER----FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSF 103
GY Q D + ER F +W+ +++ Y + DE RF I+ N+ YID N +N S+
Sbjct: 6 GYSQ--DDLTSTERLIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSY 63
Query: 104 KLTDNKFADLSNEEFISTYLG------YNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPV 157
L N+FADLSN+EF Y+G + Y+E + + + LP +VDWRK+GAVTPV
Sbjct: 64 WLGLNEFADLSNDEFNEKYVGSLIDATIEQSYDE-EFINEDIVNLPENVDWRKKGAVTPV 122
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
+ QG CGSCWAFSAVA VEGINK++TGKLV LSEQELVDC+ S GC GGY A E+
Sbjct: 123 RHQGSCGSCWAFSAVATVEGINKIRTGKLVELSEQELVDCERRS--HGCKGGYPPYALEY 180
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG------------YEAIPAR----- 260
+ K G+ YPY+ K C+ + V +G AI +
Sbjct: 181 VAK-NGIHLRSKYPYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVV 239
Query: 261 -----YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
FQLY G+F+ CG +++ VT VGYG+ G+ Y L+KNSWGT+WGE GYIR+
Sbjct: 240 VESKGRPFQLYKGGIFEGPCGTKVDGAVTAVGYGKSGGKGYILIKNSWGTAWGEKGYIRI 299
Query: 316 ARNSPSSNIGICGILMQASYPVK 338
R +P ++ G+CG+ + YP K
Sbjct: 300 KR-APGNSPGVCGLYKSSYYPTK 321
>gi|219362839|ref|NP_001136636.1| uncharacterized protein LOC100216764 precursor [Zea mays]
gi|194696462|gb|ACF82315.1| unknown [Zea mays]
gi|413934556|gb|AFW69107.1| hypothetical protein ZEAMMB73_554980 [Zea mays]
Length = 361
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 130/330 (39%), Positives = 178/330 (53%), Gaps = 55/330 (16%)
Query: 56 QSMEERFENWLKQY--SREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
+S+ +E W Y +R+ G E RRF ++ N I N N ++ L N+F+D+
Sbjct: 41 ESLWALYERWCAHYNMARDLG---EKTRRFNLFKENAHRIYEHNQGNATYTLGLNRFSDM 97
Query: 114 SNEEFISTYLGY---------------------NKPYNEPRWPSVQYLGLPASVDWRKEG 152
++EEF + G + +N + LGLP SVDWR
Sbjct: 98 TDEEFSRSPYGRCLFAPVQRISDGENEELQQHEDVSFNLTHGGATAALGLPPSVDWRGR- 156
Query: 153 AVTPVKDQG-QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYM 211
+VT VKDQG CGSCWAF+A+AAVEGIN ++T LV+LSEQ+LVDCD + + GC GG++
Sbjct: 157 SVTRVKDQGLTCGSCWAFAAIAAVEGINAIRTWSLVTLSEQQLVDCD--NVDHGCAGGWI 214
Query: 212 EKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI-------------- 257
A +FI + G+ E YPY G RC+ VTI GY +
Sbjct: 215 PSALDFIVRNRGIVPEGTYPYIGTQGRCR--HVMAPPVTIDGYRRVLPFDVNALMSAVAA 272
Query: 258 --------PARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGE 309
+ +AF+ Y GVF+ CG +L H VVGYG+ G +W+VKNSWG WGE
Sbjct: 273 QPVAVAMESSAWAFRHYQGGVFNGNCGGRLGHAAAVVGYGDGAGGPFWIVKNSWGPKWGE 332
Query: 310 AGYIRMARNSPSSNIGICGILMQASYPVKR 339
GY+R++RN+P + +GICGIL Q YPVKR
Sbjct: 333 GGYVRISRNAP-NRLGICGILTQPLYPVKR 361
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 135/309 (43%), Positives = 175/309 (56%), Gaps = 39/309 (12%)
Query: 63 ENWLK---QYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
E WL Q+ + Y + E R +Y N + ID N + +S+KL N F DL
Sbjct: 24 EEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFGDLMQ 83
Query: 116 EEF--ISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVA 173
EF ++ K N LPA VDWR++GAVTPVKD GQCGSCWAFS+
Sbjct: 84 HEFKALNKLKRSAKQQNSGEVFRATGGKLPAKVDWRQKGAVTPVKDPGQCGSCWAFSSTG 143
Query: 174 AVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYR 233
++ G LK KLVSLSEQ+LVDC N N GC+GG M +AF++I GG+ TE YPY
Sbjct: 144 SLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYIKGNGGIDTEGSYPYE 203
Query: 234 GKNDRCQTDKTKHHAVTITGY------------EAI-----------PARYAFQLYSHGV 270
++D+C+ KTK A T GY EA+ +FQ YS G+
Sbjct: 204 AEDDKCRY-KTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLSFQFYSEGI 262
Query: 271 FDE-YCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
+DE +C + +L+HGV VVGYG ++G+ YWLVKNSWG SWGE GYI++ARN + CG
Sbjct: 263 YDEPFCSNTELDHGVLVVGYGTENGQDYWLVKNSWGPSWGENGYIKIARNHNNH----CG 318
Query: 329 ILMQASYPV 337
I ASYP+
Sbjct: 319 IASMASYPI 327
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 170/310 (54%), Gaps = 42/310 (13%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEE 117
F + QY R+Y + E + R +Y N+++I+ N Q +++ L N+F D++NEE
Sbjct: 22 FHQFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDMTNEE 81
Query: 118 FISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
I+ + P +E R V LG LPA VDWR +GAVTPVKDQ CGSCWAFSA
Sbjct: 82 -INAVMNGLLPASESR--GVAVLGGRDDTLPAEVDWRTKGAVTPVKDQKACGSCWAFSAT 138
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
++EG + LK GKLVSLSEQ LVDC + GC GG M+ AF +I GG+ TE YPY
Sbjct: 139 GSLEGQHFLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDFAFTYIKDNGGIDTEASYPY 198
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQLYSHG 269
+ +CQ + A T+TGY + +R F Y G
Sbjct: 199 EATDGKCQYNPANSGA-TVTGYVDVEHDSEDALQKAVATIGPISVAIDASRSTFHFYHKG 257
Query: 270 V-FDEYCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
V +D+ C L+HGV VGYG G YWLVKNSW +WG G+I M+RN ++ C
Sbjct: 258 VYYDKECSSTSLDHGVLAVGYGTQDGTDYWLVKNSWNITWGNHGFIEMSRNRNNN----C 313
Query: 328 GILMQASYPV 337
GI QASYP+
Sbjct: 314 GIATQASYPL 323
>gi|224460525|gb|ACN43674.1| cathepsin L [Paralichthys olivaceus]
Length = 334
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 130/318 (40%), Positives = 182/318 (57%), Gaps = 46/318 (14%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLS 114
+ F W ++ R Y S E +R I+ N + + N+ + +++L +ADL
Sbjct: 23 DHDFHAWKLKFGRSYNSSSEEDKRMQIWLRNREIVMAHNAMADQGHSTYRLGMTFYADLE 82
Query: 115 NEEFISTYLG-----YNKPYNEPRWPSV-----QYLGLPASVDWRKEGAVTPVKDQGQCG 164
+EEF T G +N ++PR S ++ LP ++DWR+ G VTPVK+QG CG
Sbjct: 83 HEEFKQTVFGVCLGSFNA--SKPRGGSSFLKMHRFYNLPQTIDWRQWGFVTPVKNQGSCG 140
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCW+FS+ A+EG N KTG+LVSLSEQELVDC N N GCNGG+M+ AF +I GG+
Sbjct: 141 SCWSFSSTGALEGQNFRKTGRLVSLSEQELVDCSGNYGNYGCNGGWMDNAFRYIVNKGGI 200
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARY 261
TED YPY G+ +C+ + + A T TGY IP +
Sbjct: 201 HTEDSYPYEGQVGQCRANYGEIGA-TCTGYYDIPSGNEHALKEAVATFGPVSVAIHASDQ 259
Query: 262 AFQLYSHGVFDE-YC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
+FQLY GV++ YC G L+H V +VGYG ++G+ YWLVKNSWG +WG+ GYI+M+RN
Sbjct: 260 SFQLYHSGVYNNPYCSGTALDHAVLIVGYGTEYGQDYWLVKNSWGPAWGDQGYIKMSRNR 319
Query: 320 PSSNIGICGILMQASYPV 337
+ CGI AS+P+
Sbjct: 320 YNQ----CGIASAASFPL 333
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 133/344 (38%), Positives = 191/344 (55%), Gaps = 38/344 (11%)
Query: 28 AVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYS 87
AV + L +L + AW+ P + + ++ W ++ + R ++
Sbjct: 20 AVSVVPPLDILTLSKQAWAA--PAGRSDEEVRIIYQEWRVKHRPAENDQYVGDYRLEVFK 77
Query: 88 SNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV--QYL- 140
N++++D N+ +++L N+FADL+NEE+ + +L + QY
Sbjct: 78 ENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFLRDLSRLGRSTSGEISNQYRL 137
Query: 141 ----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
LP S+DWR++GAV VK+QG+CGSCWAF+A+AAVEGIN++ TG L+SLSEQ+LVD
Sbjct: 138 REGDVLPDSIDWREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQLVD 197
Query: 197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
C ++ N GC GG+ +AF++I GGV +E+ YPY G N C T K H V+I Y
Sbjct: 198 C--STRNYGCEGGWPYRAFQYIINNGGVNSEEHYPYTGTNGTCNTTKENAHVVSIDSYRN 255
Query: 257 IPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGE 294
+P+ FQLY G+F C LNHGVTVVGYG ++G
Sbjct: 256 VPSNDEKSLQKAAANQPISVGIDASGRNFQLYHSGIFTGSCNTSLNHGVTVVGYGTENGN 315
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YW+VKNSWG +WG +GYI M RN S+ G CGI + SYP+K
Sbjct: 316 DYWIVKNSWGENWGNSGYILMERNIAESS-GKCGIAISPSYPIK 358
>gi|66810271|ref|XP_638859.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
gi|166201983|sp|Q23894.2|CYSP3_DICDI RecName: Full=Cysteine proteinase 3; AltName: Full=Cysteine
proteinase II; Flags: Precursor
gi|60467526|gb|EAL65548.1| cysteine proteinase 3 [Dictyostelium discoideum AX4]
Length = 337
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 138/347 (39%), Positives = 192/347 (55%), Gaps = 45/347 (12%)
Query: 25 LRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFG 84
+R ++ +F L VL I + + K ++ F +W++ ++ Y + E+ R+
Sbjct: 1 MRLSITLIFTLIVLSISFISAGNVFSHK----QYQDSFIDWMRSNNKAY-THKEFMPRYE 55
Query: 85 IYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLG---------YNKPYNEPRWP 135
+ N+ Y+ NS+ L N+ ADLSNEE+ YLG Y+K R
Sbjct: 56 EFKKNMDYVHNWNSKGSKTVLGLNQHADLSNEEYRLNYLGTRAHIKLNGYHKRNLGLRLN 115
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
Q+ P +VDWR++ AVTPVKDQGQCGSC++FS +VEG+ +KTGKLVSLSEQ ++
Sbjct: 116 RPQF-KQPLNVDWREKDAVTPVKDQGQCGSCYSFSTTGSVEGVTAIKTGKLVSLSEQNIL 174
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK-NDRCQTDKTKHHAVTITGY 254
DC + N+GCNGG M AFE+I K G+ +E+ YPY K ND C+ + A IT Y
Sbjct: 175 DCSSSFGNEGCNGGLMTNAFEYIIKNNGLNSEEQYPYEMKVNDECKFQEGS-VAAKITSY 233
Query: 255 EAIPA----------------------RYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGE 290
+ I A +FQLY+ GV+ E C + L+HGV VG G
Sbjct: 234 KEIEAGDENDLQNALLLNPVSVAIDASHNSFQLYTAGVYYEPACSSEDLDHGVLAVGMGT 293
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
D+GE Y++VKNSWG SWG GYI MARN ++ CGI ASYP+
Sbjct: 294 DNGEDYYIVKNSWGPSWGLNGYIHMARNKDNN----CGISTMASYPI 336
>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 320
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 127/304 (41%), Positives = 173/304 (56%), Gaps = 33/304 (10%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEF 118
+++ QY R+YG E R ++ N Q I+ N + ++FK+ N+F D++NEEF
Sbjct: 20 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 79
Query: 119 ISTYLGYNK-PYNEPRWPSVQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
+ GY K EP+ G + A VDWR + VTPVKDQ QCGSCWAFSA A+E
Sbjct: 80 NAVMKGYKKGSRGEPKAVFTAEAGPMAADVDWRTKALVTPVKDQEQCGSCWAFSATGALE 139
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
G + LK +LVSLSEQ+LVDC + N GC GG+M AF++I GG+ TE YPY ++
Sbjct: 140 GQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAED 199
Query: 237 DRCQTDKTKHHAVTITGYE--------------------AIPA-RYAFQLYSHGV-FDEY 274
C+ D A+ E AI A ++FQ YS GV +++
Sbjct: 200 RSCRFDANSIGAICTGSVEVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQN 259
Query: 275 CGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
C L+HGV VGYG + + YWLVKNSWG+SWG+AGYI+M+RN ++ CGI +
Sbjct: 260 CSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNN----CGIASEP 315
Query: 334 SYPV 337
SYP
Sbjct: 316 SYPT 319
>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
Length = 321
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 127/304 (41%), Positives = 173/304 (56%), Gaps = 33/304 (10%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEF 118
+++ QY R+YG E R ++ N Q I+ N + ++FK+ N+F D++NEEF
Sbjct: 21 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 80
Query: 119 ISTYLGYNK-PYNEPRWPSVQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
+ GY K EP+ G + A VDWR + VTPVKDQ QCGSCWAFSA A+E
Sbjct: 81 NAVMKGYKKGSRGEPKAVFTAEAGPMAADVDWRTKALVTPVKDQEQCGSCWAFSATGALE 140
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
G + LK +LVSLSEQ+LVDC + N GC GG+M AF++I GG+ TE YPY ++
Sbjct: 141 GQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAED 200
Query: 237 DRCQTDKTKHHAVTITGYE--------------------AIPA-RYAFQLYSHGV-FDEY 274
C+ D A+ E AI A ++FQ YS GV +++
Sbjct: 201 RSCRFDANSIGAICTGSVEVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQN 260
Query: 275 CGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
C L+HGV VGYG + + YWLVKNSWG+SWG+AGYI+M+RN ++ CGI +
Sbjct: 261 CSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNN----CGIASEP 316
Query: 334 SYPV 337
SYP
Sbjct: 317 SYPT 320
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 137/322 (42%), Positives = 185/322 (57%), Gaps = 43/322 (13%)
Query: 54 DPQSMEE-RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI---DYINSQNL-SFKLTDN 108
D S+EE F W ++ R Y + E +R I+ +N + + + + Q + S++L
Sbjct: 18 DGMSLEEMEFHAWKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMT 77
Query: 109 KFADLSNEEFISTY-LGYNKPYN--EPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQ 160
+FAD+ NEE+ S LG + +N PR S + LP +VDWR +G VT VKDQ
Sbjct: 78 QFADMDNEEYKSLISLGCLRAFNTSAPRRGSAFFRLAEGTHLPTTVDWRDKGYVTGVKDQ 137
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
QCGSCWAFSA ++EG N KTGKLVSLSEQ+LVDC + N GCNGG M+ AF++I +
Sbjct: 138 KQCGSCWAFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQE 197
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY------------EAI----------- 257
GG+ TE YPY ++ +C+ K ++ TGY EA+
Sbjct: 198 NGGIDTEKSYPYEAEDGQCRF-KPENVGAKCTGYVDVTVGDEDALKEAVATIGPVSVGID 256
Query: 258 PARYAFQLYSHGVFDEY-CGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRM 315
+ +FQLY GV+DE C Q L+HGV VGYG D+G+ YWLVKNSWG WG+ GYI M
Sbjct: 257 ASHSSFQLYDSGVYDEQDCSSQDLDHGVLAVGYGTDNGQDYWLVKNSWGLGWGQEGYIMM 316
Query: 316 ARNSPSSNIGICGILMQASYPV 337
+RN + CGI ASYP+
Sbjct: 317 SRNKDNQ----CGIATAASYPL 334
>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
Length = 341
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 129/323 (39%), Positives = 180/323 (55%), Gaps = 48/323 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
+ E + + ++ ++Y SE E + R IY+ N + N + +S++L NK++D+
Sbjct: 23 VREEWNTFKLEHKKQYDSETEEKFRMKIYAENKHKVAKHNQRYQKGLVSYRLKTNKYSDM 82
Query: 114 SNEEFISTYLGYNKPYNEPR-------------WPSVQYLGLPASVDWRKEGAVTPVKDQ 160
+ EF++T G+NK + + S + P +VDWR+ GAVTPVKDQ
Sbjct: 83 LHHEFVNTMNGFNKTVKHNKGLYAKGNDIRGATFVSPANVAAPPTVDWRQHGAVTPVKDQ 142
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G+CGSCW+FS A+EG + K+G LVSLSEQ L+DC N GCNGG M+ AF++I
Sbjct: 143 GKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLIDCSSAYGNNGCNGGLMDNAFKYIKD 202
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA--------------------- 259
G+ TE YPY +D+C+ + K+ G+ IPA
Sbjct: 203 NDGIDTEKTYPYEAVDDKCRYN-PKNSGAEDVGFVDIPAGDEHKLMLALATVGPVSVAID 261
Query: 260 --RYAFQLYSHGV-FDEYCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIR 314
+ +FQLYS GV +DE C + L+HGV VVGYG D G YWLVKNSWG SWG+ GYI+
Sbjct: 262 ASQESFQLYSDGVYYDENCSSENLDHGVLVVGYGTDEDGGDYWLVKNSWGPSWGDEGYIK 321
Query: 315 MARNSPSSNIGICGILMQASYPV 337
MARN + CGI ASYP+
Sbjct: 322 MARNRDNH----CGIASSASYPL 340
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 134/347 (38%), Positives = 187/347 (53%), Gaps = 49/347 (14%)
Query: 31 SLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV 90
+ +L +L + A A + Y + ++E + + ++ + Y E E + R I++ N
Sbjct: 3 TALILPLLALVAVAQAVSYAE-----VIQEEWHTFKLEHRKNYQDETEERFRLKIFNENK 57
Query: 91 QYIDYIN----SQNLSFKLTDNKFADLSNEEFISTYLGYN-----------KPYNEPRWP 135
I N + +SFK+ NK+AD+ + EF ST G+N + + +
Sbjct: 58 HKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFI 117
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
S +++ LP VDWR +GAVT VKDQG CGSCWAFS+ A+EG + K+G LVSLSEQ LV
Sbjct: 118 SPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLV 177
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ--------------- 240
DC N GCNGG M+ AF +I GG+ TE YPY +D C
Sbjct: 178 DCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGSIGATDRGFVD 237
Query: 241 ----TDKTKHHAVTITGYEAI---PARYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGED 291
+K AV G A+ + +FQ YS GV++E C Q L+HGV VVG+G D
Sbjct: 238 IPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGTD 297
Query: 292 H-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GE YWLVKNSWGT+WG+ G+I+M RN + CGI +SYP+
Sbjct: 298 ESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQ----CGIASASSYPL 340
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 130/334 (38%), Positives = 176/334 (52%), Gaps = 40/334 (11%)
Query: 32 LFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQ 91
+L VL + WS + Q + + W + + Y E + R I+ N++
Sbjct: 4 FLVLCVLVASSRGWSVRFGQ-------DSEWVAWKSYHGKSYSDVHEERTRMAIWQQNLE 56
Query: 92 YIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-----GLPASV 146
I N+++ S+K+ N DL+ +EF YLG +N + Y+ +P+SV
Sbjct: 57 KIKRHNAEDHSYKMAMNHLGDLTEDEFRYFYLGVRAHHNSTKRGWATYMPPSNVKIPSSV 116
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGC 206
DW ++G VT VK+QGQCGSCWAFS +VEG + KTG LVSLSEQ L+DC + N GC
Sbjct: 117 DWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSYGNNGC 176
Query: 207 NGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA------- 259
GG M+ AF +I GG+ TE YPY G+ C + H +TGY+ IP
Sbjct: 177 QGGLMDNAFRYIESNGGIDTESSYPYLGQQGSCHF-SSSHVGARVTGYQDIPQGSEQALQ 235
Query: 260 --------------RYAFQLYSHGVFDE-YCGH-QLNHGVTVVGYGEDHGEKYWLVKNSW 303
+Q YS GV+D YC QL+HGV V+GYG +G+ YWLVKNSW
Sbjct: 236 SAVATVGPVSVAVDASQWQFYSSGVYDNPYCSSTQLDHGVLVIGYGNYNGQDYWLVKNSW 295
Query: 304 GTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G SWG GYI M+RN + CGI ASYP+
Sbjct: 296 GYSWGVEGYIMMSRNKNNQ----CGIASSASYPL 325
>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
Length = 533
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 135/318 (42%), Positives = 172/318 (54%), Gaps = 46/318 (14%)
Query: 55 PQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN--LSFKLTDNKFAD 112
P E F W+ + + E+ RR Y N YI N++N L N F+
Sbjct: 21 PLEYEHEFSAWMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHNAFSH 80
Query: 113 LSNEEFISTYLGYNKP--YNEPR--------WPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
+S +EF G P Y E R W V+ +P++VDW +G VTPVK+QG
Sbjct: 81 MSFDEFKFKMTGLVLPEGYLEQRLASRVDGLWSDVE---VPSAVDWVDKGGVTPVKNQGM 137
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS AVEG + +GKL SLSEQELVDCD N + GCNGG M+ AF++I G
Sbjct: 138 CGSCWAFSTTGAVEGATFVSSGKLPSLSEQELVDCDHNGD-MGCNGGLMDHAFQWIEDHG 196
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE---------------------AIPA-R 260
G+ +EDDY Y+ K C+ + V +TG++ AI A +
Sbjct: 197 GICSEDDYEYKAKAQVCRECDS---VVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQ 253
Query: 261 YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR--N 318
AFQ Y GVF+ CG +L+HGV VGYG D+G K+W VKNSWG SWGE GYIR+AR N
Sbjct: 254 KAFQFYKSGVFNLTCGTRLDHGVLAVGYGNDNGHKFWKVKNSWGASWGEQGYIRLAREEN 313
Query: 319 SPSSNIGICGILMQASYP 336
P+ G CGI SYP
Sbjct: 314 GPA---GQCGIASVPSYP 328
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 130/343 (37%), Positives = 182/343 (53%), Gaps = 42/343 (12%)
Query: 25 LRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFG 84
+R AV + L +L I A + + Q+ + F W+K++++ Y E+ ++
Sbjct: 1 MRLAVFLIVSLVILSINVCAAT----NLFSAQTYQTSFLGWMKKHNKAY-HHHEFNDKYQ 55
Query: 85 IYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL-- 142
+ N+ +I NS+ L N+FADL+NEE+ TYLG + N R V GL
Sbjct: 56 TFKDNMDFIHNWNSKESDTVLGLNRFADLTNEEYKKTYLGMSINVN-LRANQVPMNGLNF 114
Query: 143 -----PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
P+S+DWR+ GAV VKDQG CGSCWAF+ AVEG +++KTG +V+ SEQ LVDC
Sbjct: 115 ERFTGPSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDC 174
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
N GC+GG M AF++I G+ TE+ YPY +RC + T I+GY+ +
Sbjct: 175 SGRYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTATQNRCVYNTTM-LGTAISGYKDV 233
Query: 258 P----------------------ARYAFQLYSHGVFDEYC--GHQLNHGVTVVGYGEDHG 293
P + FQLY GV+ E ++LNHGV VGYG G
Sbjct: 234 PRGSESALTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHGVLAVGYGTLEG 293
Query: 294 EKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
+ Y++VKNSW +WG GYI MARN+ + CGI ASY
Sbjct: 294 KDYYIVKNSWAETWGNQGYILMARNANNH----CGIATMASYA 332
>gi|42564163|gb|AAS20593.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 324
Score = 220 bits (561), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 183/317 (57%), Gaps = 47/317 (14%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS-----FKLTDNKFA 111
S +E+++N+ +S+ Y + E +RRF I+ SN+ I+ N QN S +++ NKFA
Sbjct: 18 SDKEKWQNFKINFSKSYQNVVEEKRRFNIFLSNLLRIEEHN-QNFSRGLSTYEMGVNKFA 76
Query: 112 DLSNEEFISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSC 166
DL+ EEF+ + K +P++ S Q LPA VDW K+GAVT VK QG CGSC
Sbjct: 77 DLTPEEFMERFRPLRK--TKPKFLSEQAKFNFDGDLPAEVDWTKQGAVTEVKSQGSCGSC 134
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS +VE N +KTGKL+SLSEQ+LVDC N N GC GG+M+ A E+I + G+ +
Sbjct: 135 WAFSTTGSVESHNFIKTGKLISLSEQQLVDCVKN--NSGCAGGWMDIALEYI-EADGIMS 191
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQ 264
EDDYPY +N C+ + +K AV I Y+AI AFQ
Sbjct: 192 EDDYPYEERNTTCRFNNSK-AAVQIKSYKAIKKNDEIDLQKAVALEGPVSVAIEVTIAFQ 250
Query: 265 LYSHGVF-DEYCGH---QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
LY+ G+ D C + L H V V GYG G+ YW+VKNSWG +G GY+RM+RN+
Sbjct: 251 LYARGILNDPQCKNTEGDLTHAVLVTGYGSQDGKDYWIVKNSWGAEYGMDGYLRMSRNAD 310
Query: 321 SSNIGICGILMQASYPV 337
+ CGI +ASYPV
Sbjct: 311 NQ----CGIATRASYPV 323
>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 498
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 136/324 (41%), Positives = 173/324 (53%), Gaps = 54/324 (16%)
Query: 59 EERFENWLKQYSREYG-SEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEE 117
E F W Q++R Y E+ RR G+++ NV+ I N +N L N++AD + EE
Sbjct: 37 ERAFGLWATQHARTYSEGSPEYTRRLGVFADNVRAIAEQNRRNTGITLALNEYADETWEE 96
Query: 118 FISTYLGYNKPYNEP---------------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
F + LG + R+ VQ PA+VDWR + AVT VK+QGQ
Sbjct: 97 FAAKRLGLKISQEQLKAREARSSSSSSSSWRYAQVQ---TPAAVDWRAKNAVTQVKNQGQ 153
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFSAV ++EG N L TG+LV+LSEQ+LVDCD S N GC+GG M+ AF+++ G
Sbjct: 154 CGSCWAFSAVGSIEGANALATGQLVALSEQQLVDCDTAS-NMGCSGGLMDDAFKYVLDNG 212
Query: 223 GVTTEDDYPYRGK-------NDRCQTDKTKHHAVTITGYEAIP----------------- 258
G+ TE+DY Y N R QTD+ AV+I GYE +P
Sbjct: 213 GIDTEEDYSYWSGYGFGFWCNKRKQTDRP---AVSIDGYEDVPTSEPALLKAVAGQPVAV 269
Query: 259 ---ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIR 314
A Q YS GV + C LNHGV VGY D + YW+VKNSWG SWGE GY R
Sbjct: 270 AICASANMQFYSSGVINSCC-EGLNHGVLAVGYDTSDKAQPYWIVKNSWGGSWGEQGYFR 328
Query: 315 MARNSPSSNIGICGILMQASYPVK 338
+ G+CGI ASY VK
Sbjct: 329 LKMGEGPK--GLCGIASAASYAVK 350
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 141/326 (43%), Positives = 178/326 (54%), Gaps = 43/326 (13%)
Query: 54 DP-QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFK----LTDN 108
DP ++E RF+ WL + + Y E +R I++ N +++ N + + K L N
Sbjct: 61 DPVATIEARFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLN 120
Query: 109 KFADLSNEEFISTYLGYN--KPYNEPRWPSV-----QYLGL--PASVDWRKEGAVTPVKD 159
ADL+ EEF LGY+ K E P V +Y + P ++DW GAVTPVK+
Sbjct: 121 HLADLTREEF-KHMLGYDASKKRVESSSPPVDAANWEYADVTPPETMDWVSRGAVTPVKN 179
Query: 160 QGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFIT 219
QGQCGSCWAFS V AVEG+ +KTG L+SLSEQELV C N GC GG M+ FE+I
Sbjct: 180 QGQCGSCWAFSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIV 239
Query: 220 KIGGVTTEDDYPYRGKNDRCQ-TDKTKHHAVTITGYEAIPA------------------- 259
+ GV E+D+ Y K+ RC K + A +I G++ +P
Sbjct: 240 ENRGVDDEEDWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAI 299
Query: 260 ---RYAFQLYSHGVFDEYCGHQLNHGVTVVGY---GEDHGEK-YWLVKNSWGTSWGEAGY 312
FQLYS GVFD CG L+HGV VVGY GE G K YW VKNSWG WGE GY
Sbjct: 300 EADHREFQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGY 359
Query: 313 IRMARNSPSSNIGICGILMQASYPVK 338
IR+AR G CG+ MQASYP K
Sbjct: 360 IRIARGG-MGPAGQCGVAMQASYPTK 384
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 133/311 (42%), Positives = 177/311 (56%), Gaps = 42/311 (13%)
Query: 63 ENWLK---QYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ---NLS-FKLTDNKFADLSN 115
E W++ + ++ Y + E Q+RF I+ +++ I+ N + LS FKL KFADL+
Sbjct: 21 EEWVQFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTKFADLTE 80
Query: 116 EEFISTYLGYNKPYNEPRWPSVQYL----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
+EF S LG ++ R + L LP+ DWR++GAVT VKDQG CGSCW+FS
Sbjct: 81 KEF-SDMLGISRSTKSSRPRVIHSLTPVKDLPSKFDWREKGAVTEVKDQGSCGSCWSFST 139
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
VEG LKTGKLVSLSEQ LVDC + GC+GGYM+KA E+I GG+ +E+DYP
Sbjct: 140 TGTVEGAYFLKTGKLVSLSEQNLVDC-AKEDCYGCSGGYMDKALEYIETAGGIMSENDYP 198
Query: 232 YRGKNDRCQTDKTK-------------------HHAVTITG--YEAIPARYAFQLYSHGV 270
Y G +D+C+ D +K +AV G AI A + FQLY G+
Sbjct: 199 YEGIDDKCRFDSSKVAAKISNFTYIKKNDEDDLKNAVIAKGPISVAIDASFNFQLYDSGI 258
Query: 271 FDEYCGH----QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
D+ + LNHGV VVGYG + + YW+VKNSWG WG GYI M+RN +
Sbjct: 259 LDDSSCYSDFNSLNHGVLVVGYGTEKEQDYWIVKNSWGADWGMDGYIWMSRNKNNQ---- 314
Query: 327 CGILMQASYPV 337
CGI A+YP
Sbjct: 315 CGIATDATYPT 325
>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
Length = 322
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 130/302 (43%), Positives = 176/302 (58%), Gaps = 37/302 (12%)
Query: 68 QYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYL 123
QY R YG+ E R ++ N Q+I+ N++ ++F L N+F D+++EEF +T
Sbjct: 25 QYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEFAATMN 84
Query: 124 GY-NKPYNEP-RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKL 181
G+ N P P LP VDWR +GAVTPVKDQ QCGSCWAFS ++EG + L
Sbjct: 85 GFLNVPTRHPVAILEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFL 144
Query: 182 KTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQT 241
K GKLVSLSEQ LVDC N GC GG M++AF++I + G+ TE+ YPY ++ +C+
Sbjct: 145 KDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESYPYEAQDGKCRF 204
Query: 242 DKTKHHAVTITGY----------------------EAIPARY-AFQLYSHGVF--DEYCG 276
D + A T TG+ AI A + +FQ Y GV+ E
Sbjct: 205 DSSNVGA-TDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGVYYEKECSS 263
Query: 277 HQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASY 335
L+HGV +GYGE D G++YWLVKNSW TSWG+ G+I+M+RN ++ CGI QASY
Sbjct: 264 TMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNN----CGIASQASY 319
Query: 336 PV 337
P+
Sbjct: 320 PL 321
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 129/310 (41%), Positives = 170/310 (54%), Gaps = 39/310 (12%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFADLSNEE 117
+E+W +Y + Y E R ++ SN+Q + N +++L N +ADL NEE
Sbjct: 19 WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78
Query: 118 FI-----STYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
F+ S L + + + + LP+SVDWR +G VTPVKDQGQCGSCW+FSA
Sbjct: 79 FMALKGSSGILQAKDQSSTQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFSAT 138
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
++EG + KTG LVSLSEQ+LVDC + N GC+GG ME A+++I GGV E YPY
Sbjct: 139 GSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLESAYPY 198
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHG 269
+N RC D++K A T TG+ AIP + Y FQLY G
Sbjct: 199 TAQNGRCHFDQSKAVA-TCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQLYESG 257
Query: 270 VFDEY--CGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
V+D L+HGV GYG + G YWLVKNSWG WG GYI+M+RN + C
Sbjct: 258 VYDRSRCSSSSLDHGVLAAGYGTEGGNDYWLVKNSWGPGWGAQGYIKMSRNKSNQ----C 313
Query: 328 GILMQASYPV 337
GI A YP+
Sbjct: 314 GIATMACYPL 323
>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
Length = 306
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 130/302 (43%), Positives = 176/302 (58%), Gaps = 37/302 (12%)
Query: 68 QYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYL 123
QY R YG+ E R ++ N Q+I+ N++ ++F L N+F D+++EEF +T
Sbjct: 9 QYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEFAATMN 68
Query: 124 GY-NKPYNEP-RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKL 181
G+ N P P LP VDWR +GAVTPVKDQ QCGSCWAFS ++EG + L
Sbjct: 69 GFLNVPTRHPVAILEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAFSTTGSLEGQHFL 128
Query: 182 KTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQT 241
K GKLVSLSEQ LVDC N GC GG M++AF++I + G+ TE+ YPY ++ +C+
Sbjct: 129 KDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEESYPYEAQDGKCRF 188
Query: 242 DKTKHHAVTITGY----------------------EAIPARY-AFQLYSHGVF--DEYCG 276
D + A T TG+ AI A + +FQ Y GV+ E
Sbjct: 189 DSSNVGA-TDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSFQFYHQGVYYEKECSS 247
Query: 277 HQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASY 335
L+HGV +GYGE D G++YWLVKNSW TSWG+ G+I+M+RN ++ CGI QASY
Sbjct: 248 TMLDHGVLAIGYGETDDGKEYWLVKNSWNTSWGDKGFIQMSRNKKNN----CGIASQASY 303
Query: 336 PV 337
P+
Sbjct: 304 PL 305
>gi|326501772|dbj|BAK02675.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 333
Score = 220 bits (560), Expect = 9e-55, Method: Compositional matrix adjust.
Identities = 127/315 (40%), Positives = 174/315 (55%), Gaps = 41/315 (13%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
+ + ++ W + ++ Y +E RR + N+Q + N Q ++ L NK+AD+
Sbjct: 24 LNQHWKLWKEANNKRYSDAEEHVRR-ATWEGNLQKVQEHNLQADLGVHTYWLGMNKYADM 82
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQY------LGLPASVDWRKEGAVTPVKDQGQCGSCW 167
+ EF+ GYN R + LP +VDWR +G VT VKDQGQCGSCW
Sbjct: 83 TVTEFVKVMNGYNATMRGQRTQDRHTFSFNSKIALPDTVDWRDKGYVTDVKDQGQCGSCW 142
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS A+EG + +TGKLVSLSEQ LVDC N GCNGG M++AFE+I + G+ TE
Sbjct: 143 AFSTTGALEGQHFKQTGKLVSLSEQNLVDCSGKQGNMGCNGGLMDQAFEYIKENNGIDTE 202
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-----------------------YAFQ 264
D YPY +++C+ K + T TG+ I ++ +FQ
Sbjct: 203 DSYPYEAVDNQCRF-KAANVGATDTGFTDITSKDESALQQAVATVGPISVAIDAGHTSFQ 261
Query: 265 LYSHGVFDE-YCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
LY HGV++E +C +L+HGV VGYG D G+ YWLVKNSWG WG+ GYI+M RN +
Sbjct: 262 LYKHGVYNEPFCSQTRLDHGVLAVGYGTDSGKDYWLVKNSWGEGWGDKGYIKMTRNKRNQ 321
Query: 323 NIGICGILMQASYPV 337
CGI ASYP+
Sbjct: 322 ----CGIATAASYPL 332
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 119/277 (42%), Positives = 164/277 (59%), Gaps = 38/277 (13%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEE 117
F+++ + ++Y S +E RRF I++ N+ +I N++ + + N+FADL+NEE
Sbjct: 20 FDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEE 79
Query: 118 FISTYLGYNKPYNEP---RWPSVQYLGLP--ASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
+ YL +PY R +L P SVDWR++GAVTP+K+QGQCGSCW+FS
Sbjct: 80 YRQLYL---RPYPTELLGRERQEVWLDGPNAGSVDWRQKGAVTPIKNQGQCGSCWSFSTT 136
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
+VEG + + TG LVSLSEQ+LVDC + NQGCNGG M+ AF++I GG+ TE DYPY
Sbjct: 137 GSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPY 196
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIPA----------------------RYAFQLYSHGV 270
++ C K HAV+I+GY+ +P + +FQ+YS GV
Sbjct: 197 TARDGVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGV 256
Query: 271 FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSW 307
F CG L+HGV VVGY D YW+VKNSWG SW
Sbjct: 257 FSGPCGTNLDHGVLVVGYTSD----YWIVKNSWGASW 289
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 131/320 (40%), Positives = 181/320 (56%), Gaps = 45/320 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS---QNL-SFKLTDNKFADL 113
++E++ ++ Q+ ++Y SE E + R I+ N + N Q L +KL NK+ DL
Sbjct: 23 VQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAKHNKLFEQGLYPYKLAMNKYGDL 82
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQ---------YLGLPASVDWRKEGAVTPVKDQGQCG 164
+ EF+ G+N+ + +Q ++ +P +VDWR+EGAVTPVKDQG CG
Sbjct: 83 LHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVDIPDTVDWRQEGAVTPVKDQGHCG 142
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCW+FSA A+EG + +T KLVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 143 SCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDNAFRYIKNNGGI 202
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARY 261
TE YPY G++++ + K+ T G+ IP +
Sbjct: 203 DTEAAYPYMGEDEKFRY-SAKNRGATDKGFVDIPSGDEDKLKAAVATVGPISIAIDASHE 261
Query: 262 AFQLYSHGVF-DEYCGH-QLNHGVTVVGYGEDH--GEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQLYS+GV+ D C +L+HGV VVGYG D G YWLVKNSWG +WG GYI+MAR
Sbjct: 262 SFQLYSNGVYSDPTCSSTELDHGVLVVGYGTDEKTGMDYWLVKNSWGDTWGLDGYIKMAR 321
Query: 318 NSPSSNIGICGILMQASYPV 337
N + CG+ QASYP+
Sbjct: 322 NQDNQ----CGVATQASYPL 337
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 175/314 (55%), Gaps = 39/314 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
+ ++E + + + Y S E RF I++ N I N++ +S+KL N+F DL
Sbjct: 23 LRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDL 82
Query: 114 SNEEFISTYLGYN---KPYNEPRWP--SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EF + G++ K P +V LP +VDWRK+GAVTPVKDQGQCGSCWA
Sbjct: 83 LAHEFARIFNGHHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWA 142
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSA ++EG + LK G+LVSLSEQ LVDC + N GC GG ME AF++I G+ TE
Sbjct: 143 FSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEK 202
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPA-----------------------RYAFQL 265
YPY + C+ K A T TGY I A +FQL
Sbjct: 203 SYPYEAVDGECRFKKEDVGA-TDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQL 261
Query: 266 YSHGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV+DE C + L+HGV VVGYG G+KYWLVKNSW SWG+ GYI M+R+ N
Sbjct: 262 YSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRD----N 317
Query: 324 IGICGILMQASYPV 337
CGI QASYP+
Sbjct: 318 NNQCGIASQASYPL 331
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 144/348 (41%), Positives = 187/348 (53%), Gaps = 51/348 (14%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
MLR +VL ++ + A S+ + + ++E + + + Y S E RF
Sbjct: 1 MLRLSVLCA----IVAVTVAASSQ--------EILRTQWEAFKTTHKKTYQSHMEELLRF 48
Query: 84 GIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYLGYN---KPYNEPRWP- 135
I++ N I N++ +S+KL N+F DL EF + GY+ K P
Sbjct: 49 KIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYHGSRKSGGSTFLPP 108
Query: 136 -SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+V LP +VDWRK+GAVTPVKDQGQCGSCWAFS ++EG + LK G+LVSLSEQ L
Sbjct: 109 ANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNL 168
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDC + N GC GG ME AF++I G+ TE YPY + C+ K A T TGY
Sbjct: 169 VDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGA-TDTGY 227
Query: 255 EAIPA-----------------------RYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG 289
I A +FQLYS GV+DE C + L+HGV VVGYG
Sbjct: 228 VEIKAGCEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG 287
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G+KYWLVKNSW SWG+ GYI M+R+ N CGI QASYP+
Sbjct: 288 VKGGKKYWLVKNSWAESWGDQGYILMSRD----NNNQCGIASQASYPL 331
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 139/368 (37%), Positives = 197/368 (53%), Gaps = 56/368 (15%)
Query: 8 AIYTNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLK 67
++ L ++M + L +LSL L P + DP ++ ++ W
Sbjct: 3 VLFLARRLSRFVNMNVCL--TILSLCLGLAFAAP----------RVDP-DLDSHWQLWKS 49
Query: 68 QYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYL 123
+S++Y +E RR ++ N++ I+ N + S+KL N+F D++ EEF
Sbjct: 50 WHSKDYHEREESWRRV-VWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMN 108
Query: 124 GYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGI 178
GY +E ++ Q+L P SVDWR++G VTPVKDQGQCGSCWAFS A+EG
Sbjct: 109 GYKHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQ 168
Query: 179 NKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR 238
+ KTGKLVSLSEQ LVDC NQGCNGG M++AF+++ GG+ +E+ YPY K+D
Sbjct: 169 HFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDE 228
Query: 239 CQTDKTKHHAVTITGYEAIPARY-----------------------AFQLYSHGVFDE-- 273
K +++A TG+ IP + +FQ Y G++ E
Sbjct: 229 DCRYKAEYNAANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPD 288
Query: 274 YCGHQLNHGVTVVGY---GED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
L+HGV VVGY GED G+KYW+VKNSWG WG+ GYI MA++ + CGI
Sbjct: 289 CSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNH----CGI 344
Query: 330 LMQASYPV 337
ASYP+
Sbjct: 345 ATAASYPL 352
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 144/348 (41%), Positives = 187/348 (53%), Gaps = 51/348 (14%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
MLR +VL ++ + A S+ + + ++E + + + Y S E RF
Sbjct: 1 MLRLSVLCA----IVAVTVAASSQ--------EILRTQWEAFKTTHKKTYQSHMEELLRF 48
Query: 84 GIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYLGYN---KPYNEPRWP- 135
I++ N I N++ +S+KL N+F DL EF + G+ K P
Sbjct: 49 KIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHRGTRKTGGSTFLPP 108
Query: 136 -SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+V LP +VDWRK+GAVTPVKDQGQCGSCWAFSA ++EG + LK G+LVSLSEQ L
Sbjct: 109 ANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNL 168
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDC + N GC GG ME AF++I G+ TE YPY + C+ K A T TGY
Sbjct: 169 VDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGA-TDTGY 227
Query: 255 EAIPA-----------------------RYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG 289
I A +FQLYS GV+DE C + L+HGV VVGYG
Sbjct: 228 VEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG 287
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G+KYWLVKNSW SWG+ GYI M+R+ N CGI QASYP+
Sbjct: 288 VKGGKKYWLVKNSWAESWGDQGYILMSRD----NNNQCGIASQASYPL 331
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 127/320 (39%), Positives = 182/320 (56%), Gaps = 44/320 (13%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFAD 112
++++ ++ W +S++Y ++E RR I+ N++ I N + S++L N F D
Sbjct: 24 ALDDHWQAWKTWHSKKYHQQEEGWRRM-IWEKNLKMIQLHNLDHSLGKHSYRLGMNHFGD 82
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCW 167
++NEEF GY E ++ ++L +P SVDWR++G VTPVKDQGQCGSCW
Sbjct: 83 MTNEEFRQVMNGYKHSKTEKKYRGSEFLEPNFLVVPKSVDWREKGYVTPVKDQGQCGSCW 142
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS ++EG + KTGKLVSLSEQ LVDC NQGCNGG M++AFE+I GG+ +E
Sbjct: 143 AFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFEYIADNGGIDSE 202
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQ 264
+ YPY K+D K++ +A TG+ +P + FQ
Sbjct: 203 ESYPYIAKDDEDCLYKSEFNAANDTGFVDVPEGHERALMKAVAAVGPVSVAIDASHSTFQ 262
Query: 265 LYSHGV-FDEYC-GHQLNHGVTVVGYG-----EDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
Y G+ +D C +L+HGV VVGYG +D+ +KYW+VKNSW WG+ GYI MA+
Sbjct: 263 FYESGIYYDPDCSSEELDHGVLVVGYGFEGTDDDNKKKYWIVKNSWSDKWGDKGYILMAK 322
Query: 318 NSPSSNIGICGILMQASYPV 337
+ + CGI ASYP+
Sbjct: 323 DRNNH----CGIATAASYPL 338
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 130/306 (42%), Positives = 173/306 (56%), Gaps = 40/306 (13%)
Query: 68 QYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYL 123
++ + Y SE E R IY N I N + + + + N+F D+ + EF+ST
Sbjct: 33 KHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTRN 92
Query: 124 GYNKPY-NEPRWPS-------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
G+ + Y ++PR S ++ LP +VDWR +GAVTPVK+QGQCGSCWAFSA ++
Sbjct: 93 GFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGSL 152
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EG + K+G +VSLSEQ LVDC + N GC GG M+ AF++I G+ TE YPY G
Sbjct: 153 EGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPYNGT 212
Query: 236 NDRCQTDKTK-------------------HHAVTITG--YEAIPARY-AFQLYSHGVFDE 273
+ C K+ AV G AI A + +FQ YS GV+DE
Sbjct: 213 DGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYDE 272
Query: 274 -YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
C + L+HGV VVGYG +G YWLVKNSWGT+WG+ GYIRM+RN + CGI
Sbjct: 273 PECDSESLDHGVLVVGYGTLNGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQ----CGIAS 328
Query: 332 QASYPV 337
ASYP+
Sbjct: 329 SASYPL 334
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus
pulchellus]
Length = 331
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 133/307 (43%), Positives = 174/307 (56%), Gaps = 44/307 (14%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYI-----DYINSQNLSFKLTDNKFADLSNEEFISTYL 123
+ +EY S+ E R IY N I Y SQ +S+KL N+F D+ + EF+ST
Sbjct: 30 HGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQ-VSYKLAMNEFGDMLHHEFVSTRN 88
Query: 124 GYNKPYNE-PRWPS-------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
G+ + Y + PR S ++ LP +VDWRK+GAVTPVK+QGQCGSCW+FS ++
Sbjct: 89 GFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVKNQGQCGSCWSFSTTGSL 148
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EG + K KLVSLSEQ L+DC + N GC GG M+ AF++I G+ TE YPY
Sbjct: 149 EGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYIKANKGIDTEQSYPYNAT 208
Query: 236 NDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGVFD 272
+ C +K+ A T TG+ IP + +FQ YS GV+D
Sbjct: 209 DGVCHFNKSAVGA-TDTGFVDIPEGDENKLKKAVATVGPVSVAIDASHESFQFYSEGVYD 267
Query: 273 E-YC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
E C QL+HGV VVGYG G+ YWLVKNSWGT+WG+ GYI M+RN + CGI
Sbjct: 268 EPECDSEQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDGGYIYMSRNKDNQ----CGIA 323
Query: 331 MQASYPV 337
ASYP+
Sbjct: 324 SAASYPL 330
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 175/314 (55%), Gaps = 39/314 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
+ ++E + + + Y S E RF I++ N I N++ +S+KL N+F DL
Sbjct: 23 LRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDL 82
Query: 114 SNEEFISTYLGYN---KPYNEPRWP--SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EF + G++ K P +V LP VDWRK+GAVTPVKDQGQCGSCWA
Sbjct: 83 LAHEFARIFNGHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWA 142
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSA ++EG + LK G+LVSLSEQ LVDC + N GC GG ME AF++I G+ TE
Sbjct: 143 FSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEK 202
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPA-----------------------RYAFQL 265
YPY+ + C+ K A T TGY I A +FQL
Sbjct: 203 SYPYKAVDGECRFKKEDVGA-TDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQL 261
Query: 266 YSHGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV+DE C + L+HGV VVGYG G+KYWLVKNSW SWG+ GYI M+R+ N
Sbjct: 262 YSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRD----N 317
Query: 324 IGICGILMQASYPV 337
CGI QASYP+
Sbjct: 318 NNQCGIASQASYPL 331
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 219 bits (558), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 133/347 (38%), Positives = 187/347 (53%), Gaps = 49/347 (14%)
Query: 31 SLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV 90
+ +L +L + A A + Y + ++E + + ++ + Y E E + R I++ N
Sbjct: 3 TALILPLLALVAVAQAVSYAE-----VIQEEWHTFKLEHRKNYQDETEERFRLKIFNENK 57
Query: 91 QYIDYIN----SQNLSFKLTDNKFADLSNEEFISTYLGYN-----------KPYNEPRWP 135
I N + +SFK+ NK+AD+ + EF ST G+N + + +
Sbjct: 58 HKIAKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFI 117
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
S +++ LP VDWR +GAVT VKDQG CGSCWAFS+ A+EG + K+G LVSLSEQ LV
Sbjct: 118 SPEHVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLV 177
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ--------------- 240
DC N GCNGG M+ AF +I GG+ TE YPY +D C
Sbjct: 178 DCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRGFVD 237
Query: 241 ----TDKTKHHAVTITGYEAI---PARYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGED 291
+K AV G A+ + +FQ YS GV++E C Q L+HGV VVG+G D
Sbjct: 238 IPQGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGTD 297
Query: 292 H-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G+ YWLVKNSWGT+WG+ G+I+M RN + CGI +SYP+
Sbjct: 298 ESGQDYWLVKNSWGTTWGDKGFIKMLRNKENQ----CGIASASSYPL 340
>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
Length = 344
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 134/326 (41%), Positives = 176/326 (53%), Gaps = 51/326 (15%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
+ E + + ++S++Y SE E + R IY N I N + +S+KL NK+AD+
Sbjct: 23 VREEWNAFKMEHSKQYDSEVEDKFRMKIYVENKHRIAKHNQRFEQRLVSYKLKPNKYADM 82
Query: 114 SNEEFISTYLGYNKPYNE----------------PRWPSVQYLGLPASVDWRKEGAVTPV 157
+ EF+ T G+NK + + ++ P VDWRK+GAVT V
Sbjct: 83 LHHEFVHTMNGFNKTAKHGGRNKAVHSKGRDGRAATFIAPAHVSYPDHVDWRKKGAVTDV 142
Query: 158 KDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEF 217
KDQG+CGSCWAFS A+EG + KTG LVSLSEQ LVDC N GCNGG M+ AF++
Sbjct: 143 KDQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVDCSAAYGNNGCNGGLMDNAFKY 202
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------------------- 258
I GG+ TE YPY +D+C+ + K+ G+ IP
Sbjct: 203 IKDNGGIDTEKSYPYEAVDDKCRYN-PKNSGADDVGFVDIPQGDEEKLMQAVATVGPISV 261
Query: 259 ----ARYAFQLYSHGV-FDEYCGH-QLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAG 311
++ FQ YS GV +DE C L+HGV VVGYG E+ G YWLVKNSWG SWGE G
Sbjct: 262 AIDASQETFQFYSKGVYYDENCSSTDLDHGVMVVGYGTEEEGGDYWLVKNSWGRSWGELG 321
Query: 312 YIRMARNSPSSNIGICGILMQASYPV 337
YI+MA N + CGI ASYP+
Sbjct: 322 YIKMAHNKNNH----CGIASSASYPL 343
>gi|389608655|dbj|BAM17937.1| cathepsin L [Papilio xuthus]
Length = 341
Score = 219 bits (558), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 178/323 (55%), Gaps = 48/323 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
++E + + ++ ++Y SE E + R IY+ N I N + +SF+L NK+ D+
Sbjct: 23 VKEEWNAFKMEHQKQYDSEVEDKFRMKIYAENKHNIAKHNQKYARGEVSFRLKQNKYGDM 82
Query: 114 SNEEFISTYLGYNKPYNEPR-------------WPSVQYLGLPASVDWRKEGAVTPVKDQ 160
+ EF+ T G+NK + + + + LP VDWRK GAVT VKDQ
Sbjct: 83 LHHEFVHTMNGFNKTTKNSKGLFGKSAGERGATFITPANVHLPDHVDWRKHGAVTEVKDQ 142
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G+CGSCW+FS+ A+EG + +T LVSLSEQ L+DC N GCNGG M+ AF++I
Sbjct: 143 GKCGSCWSFSSTGALEGQHYRRTNILVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKD 202
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------- 258
G+ TE YPY G +D+C+ + K+ G+ IP
Sbjct: 203 NRGIDTEKSYPYEGIDDKCRYN-PKNTGADDNGFVDIPSGDEGKLMAAVATVGPVSVAID 261
Query: 259 -ARYAFQLYSHGV-FDEYC-GHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIR 314
++ +FQ YS GV FDE C L+HGV VVGYG D +G YWLVKNSWG SWG+ GYI+
Sbjct: 262 ASQSSFQFYSDGVYFDENCSSSSLDHGVLVVGYGTDENGGDYWLVKNSWGRSWGDLGYIK 321
Query: 315 MARNSPSSNIGICGILMQASYPV 337
MARN + CGI ASYP+
Sbjct: 322 MARNRDNH----CGIATAASYPL 340
>gi|413953048|gb|AFW85697.1| hypothetical protein ZEAMMB73_051316 [Zea mays]
Length = 298
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 131/297 (44%), Positives = 170/297 (57%), Gaps = 45/297 (15%)
Query: 85 IYSSNVQYIDYIN--SQNLSFKLTDNKFADLSNEEFISTYL---GYNKPYNEPRWPSVQY 139
+YS NV++I +N S S++L +N+F DL+ EEF TYL P E P+V
Sbjct: 2 VYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMPPTVGT 61
Query: 140 LGL------------PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
+ P SVDWR +GAVTPVK+Q QCGSCWAF+ VA++EG++++KTG+LV
Sbjct: 62 MSTAGMSNGDNTGEAPNSVDWRTKGAVTPVKNQQQCGSCWAFATVASIEGVHQIKTGRLV 121
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SLSEQ++VDCD + GC+GGY A E++T+ GG+TTE DYPY G +C + K H
Sbjct: 122 SLSEQQIVDCDRGGNDHGCHGGYPRSAMEWVTRNGGLTTESDYPYVGSQRQCMSGKLGHQ 181
Query: 248 AVTITGYEA---------------------IPARYAFQLYSHGVFDEYCG-HQLNHGVTV 285
A I GY+A I A AFQ Y GVF C +NH VTV
Sbjct: 182 AARIRGYQAVQRKNEAELERAVAGRPVAVVIDASRAFQFYKRGVFSGPCNTTTVNHAVTV 241
Query: 286 V-----GYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
V G G KYW+VKNSWG WGE GY+RMAR + G+C I ++ YPV
Sbjct: 242 VGYGSTGSDSGGGRKYWIVKNSWGQRWGENGYVRMARRVRARE-GMCAIAIEPYYPV 297
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 136/337 (40%), Positives = 187/337 (55%), Gaps = 40/337 (11%)
Query: 33 FLLWVLGI-PAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQ 91
F+ W++G+ P +++ K DP +++ + W K YS++Y E+E R I+ N++
Sbjct: 8 FMKWLVGLLPLCSYAVAQVHK-DP-TLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLK 65
Query: 92 YIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPA 144
++ N ++ S+ L N D++ EE IS P R + S LP
Sbjct: 66 FVMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVTYRSNSNQKLPD 125
Query: 145 SVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-EN 203
SVDWR++G VT VK QG CG+CWAFSAV A+E KLKTGKLVSLS Q LVDC N
Sbjct: 126 SVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGN 185
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----- 258
+GCNGG+M AF++I G+ +E YPY+ N +C+ D +K A T + Y +P
Sbjct: 186 KGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYD-SKKRAATCSKYTELPFGSED 244
Query: 259 ------------------ARYAFQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLV 299
+ Y+F LY GV+ E C +NHGV VVGYG +G+ YWLV
Sbjct: 245 ALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLV 304
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
KNSWG ++G+ GYIRMARNS + CGI SYP
Sbjct: 305 KNSWGLNFGDQGYIRMARNSGNH----CGIASYPSYP 337
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 140/356 (39%), Positives = 196/356 (55%), Gaps = 54/356 (15%)
Query: 15 LKIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYG 74
L+++ M+++ AV+ L A A S +P ++ + +EN+ +++++Y
Sbjct: 50 LRVSAGMKLLAVLAVIGL---------ASALSP------NP-NLNQHWENFKAEHNKKYE 93
Query: 75 SEDEWQRRFGIYSSNVQYIDYINSQN-LSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR 133
S E R I+ N Q+I+ NS+ F L N F DL+N+E+ YLGY +P N P
Sbjct: 94 SFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHFGDLTNKEYRERYLGYRRPENTPS 153
Query: 134 WPSVQYL------GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLV 187
S + +P +DWR +G VTPVK+QGQCGSCWAFSAV ++EG + TGKLV
Sbjct: 154 KASYIFSRAEKIEDVPDQIDWRDQGFVTPVKNQGQCGSCWAFSAVGSLEGQHFKSTGKLV 213
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SLSEQ LVDC N GCNGG+M++AFE++ G+ TED YPY G + C K K
Sbjct: 214 SLSEQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHGIDTEDSYPYVGTDGSCHF-KNKSI 272
Query: 248 AVTITGY--------EAI--------PARYA-------FQLYSHGVFD-EYCG-HQLNHG 282
T+ G+ EA+ P A FQ Y GV++ +C +L+HG
Sbjct: 273 GATLKGFMDVKEGDEEALRQAVGVAGPVSVAIDASSMLFQFYRGGVYNVPWCSTSELDHG 332
Query: 283 VTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
V VVGYG+ G+ +W+VKNSWG WG GYI M+RN + CGI +AS P
Sbjct: 333 VLVVGYGKQFQGKDFWMVKNSWGVGWGIYGYIEMSRNKGNQ----CGIASKASIPT 384
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 144/348 (41%), Positives = 188/348 (54%), Gaps = 51/348 (14%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
MLR +VL ++ + A S+ + + ++E + + + Y S E RF
Sbjct: 1 MLRLSVLCA----IVAVTVAASSQ--------EILRTQWEAFKTTHKKTYQSHMEELLRF 48
Query: 84 GIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYLGYN---KPYNEPRWP- 135
I++ N I N++ +S+KL N+F DL EF + G++ K P
Sbjct: 49 KIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSTFLPP 108
Query: 136 -SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+V LP VDWRK+GAVTPVKDQGQCGSCWAFSA ++EG + LK G+LVSLSEQ L
Sbjct: 109 ANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNL 168
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDC + N GC GG ME AF++I + G+ TE YPY + C+ K A T TGY
Sbjct: 169 VDCSQSFGNNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAVDGECRFKKEDVGA-TDTGY 227
Query: 255 EAIPA-----------------------RYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG 289
I A +FQLYS GV+DE C + L+HGV VVGYG
Sbjct: 228 VEIKAGSEDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG 287
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G+KYWLVKNSW SWG+ GYI M+R+ N CGI QASYP+
Sbjct: 288 VKGGKKYWLVKNSWAESWGDQGYILMSRD----NNNQCGIASQASYPL 331
>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
Length = 344
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 121/317 (38%), Positives = 175/317 (55%), Gaps = 51/317 (16%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEE 117
F + QY ++Y S+ + R +Y N +++ N + +++K+ N AD+ E
Sbjct: 23 FTRFKSQYRKDYPSDSVERYRKKVYKQNEKFVREHNERYERGEVTYKMALNHLADMHPRE 82
Query: 118 FISTYLGYNK------------PYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
F++T+LG+N+ P+ + +Q VDWR++GA++PVKDQG CGS
Sbjct: 83 FMATFLGFNRSLRATNKVPEGIPFRHNKDAVIQ-----KEVDWRQKGAISPVKDQGHCGS 137
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFS+ A+E LK G+ VSLSEQ L+DC +N N GC GG ME+AF+++ G+
Sbjct: 138 CWAFSSTGALEAHTFLKKGRRVSLSEQNLIDCSLNYGNNGCEGGLMEQAFQYVRDNDGID 197
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYA 262
TE+ YPY G++ C+ K A T G+ IP + +
Sbjct: 198 TEEAYPYEGEDSECRFKKNNVGA-TDAGFVTIPSGDEQALMEAVATQGPLSIAIDASNPS 256
Query: 263 FQLYSHGVF--DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
FQ YS GV+ E QL+HGV +VGYG + +KYWLVKNSW WGE GYI+MARN
Sbjct: 257 FQFYSEGVYYEPECSSAQLDHGVLLVGYGVEKDQKYWLVKNSWSEQWGENGYIKMARNKD 316
Query: 321 SSNIGICGILMQASYPV 337
++ CGI QAS+P+
Sbjct: 317 NN----CGIATQASFPI 329
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 144/348 (41%), Positives = 187/348 (53%), Gaps = 51/348 (14%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
MLR +VL ++ + A S+ + + ++E + + + Y S E RF
Sbjct: 1 MLRLSVLCA----IVAVTVAASSQ--------EILRTQWEAFKTTHKKSYQSHMEELLRF 48
Query: 84 GIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYLGYN---KPYNEPRWP- 135
I++ N I N++ +S+KL N+F DL EF + G++ K P
Sbjct: 49 KIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSTFLPP 108
Query: 136 -SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+V LP VDWRK+GAVTPVKDQGQCGSCWAFSA ++EG + LK G+LVSLSEQ L
Sbjct: 109 ANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNL 168
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDC + N GC GG ME AF++I G+ TE YPY + C+ K A T TGY
Sbjct: 169 VDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGA-TDTGY 227
Query: 255 EAIPA-----------------------RYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG 289
I A +FQLYS GV+DE C + L+HGV VVGYG
Sbjct: 228 VEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG 287
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G+KYWLVKNSW SWG+ GYI M+R+ N CGI QASYP+
Sbjct: 288 VKGGKKYWLVKNSWAESWGDQGYILMSRD----NNNQCGIASQASYPL 331
>gi|293334761|ref|NP_001168296.1| uncharacterized protein LOC100382061 [Zea mays]
gi|223947281|gb|ACN27724.1| unknown [Zea mays]
Length = 322
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 122/305 (40%), Positives = 172/305 (56%), Gaps = 49/305 (16%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ-NLSFKLTDNKFADLSNE 116
M +RF W Y+R Y + E RRF +Y N++ I+ N + LS++L++ F DL++E
Sbjct: 3 MMDRFRAWQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRAELSYQLSETPFTDLTSE 62
Query: 117 EFISTYLGYNK-------------------PYNE--PRWPSVQY---LGLPASVDWRKEG 152
EF++T+ + P ++ +W Y L +P SVDWR +G
Sbjct: 63 EFLATHTMSTRLHASEAARRHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESVDWRTKG 122
Query: 153 AVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYME 212
AVT VKDQG CG CW+F+ VAA+EG++K++TG+LVSLSEQE++DC + N GC+GG
Sbjct: 123 AVTTVKDQGACGGCWSFATVAAIEGLHKIRTGQLVSLSEQEVLDCS-SPPNNGCHGGNPA 181
Query: 213 KAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI--------------- 257
A ++++ GG+TTE DYPY G+ +C+ DK ++H I G + +
Sbjct: 182 AAIDWVSANGGLTTESDYPYEGRQGKCKLDKARNHVAKIRGRKLVDQNNEAALEVAVAQQ 241
Query: 258 PARYAF------QLYSHGVFDEYCGHQ-LNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGE 309
P Q Y GVF C + LNH VT+VGYG E G KYW+VKNSWG WGE
Sbjct: 242 PVAVGMNVHPIQQHYKSGVFHGPCDPEDLNHAVTMVGYGAESGGRKYWIVKNSWGEKWGE 301
Query: 310 AGYIR 314
GY R
Sbjct: 302 KGYFR 306
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 132/348 (37%), Positives = 189/348 (54%), Gaps = 50/348 (14%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+ LFLL ++ I A A + + + + + + + ++++ Y ++ E + R I+ N
Sbjct: 1 MKLFLLLIVAILATAQAISFFE-----LVNQEWTTFKMEHNKVYKNDIEERFRMKIFMDN 55
Query: 90 VQYIDYINS----QNLSFKLTDNKFADLSNEEFISTYLGYNKPYN----EPRWP------ 135
I N + +S+KL NK+ D+ + EF++T G+NK N R P
Sbjct: 56 KHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIGASFI 115
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
+ LP +VDWR+ GAVTPVKDQG CGSCW+FSA A+EG + +TG L+ LSEQ L+
Sbjct: 116 EPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLI 175
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DC N GCNGG M++AF++I G+ TE YPY +ND+C+ + A + GY
Sbjct: 176 DCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDV-GYV 234
Query: 256 AIP-----------------------ARYAFQLYSHGVF--DEYCGHQLNHGVTVVGYGE 290
IP + +FQ YS GV+ E L+HGV VGYG
Sbjct: 235 DIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYGT 294
Query: 291 D-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
D +G+ YWLVKNSWG +WG+ GYI+MARN + CGI ASYP+
Sbjct: 295 DENGQDYWLVKNSWGETWGDNGYIKMARNK----LNHCGIASTASYPL 338
>gi|391332597|ref|XP_003740719.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 131/317 (41%), Positives = 177/317 (55%), Gaps = 48/317 (15%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-------SFKLTDNKFA 111
+E +E + +Q+++ Y + + RR I+ +N++ I N+ NL S++L N FA
Sbjct: 23 DEHWELFKRQHNKTYLQKQDVGRR-AIFEANIKKI---NAHNLLYDLGRSSYRLGLNGFA 78
Query: 112 DLSNEEFISTYLGYNKPYNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQGQCGSC 166
D++ +EF Y G NE R +Q+ + +P +VDWR EG VTPVK+QG CGSC
Sbjct: 79 DMTPDEF-EKYRGTRFEANEARVSKLQHRDNRSMHVPDTVDWRTEGYVTPVKNQGVCGSC 137
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS A+EG + ++G LVSLSEQ LVDC N GCNGG M+ AF FI GG+ T
Sbjct: 138 WAFSTTGALEGQHFRRSGDLVSLSEQMLVDCSAVYGNAGCNGGLMDNAFRFIKDAGGLET 197
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-----------------------YAF 263
E YPY GK+ C D + +TG+ +P+R F
Sbjct: 198 EKSYPYTGKDGTCHFD-ARGIGAKLTGFVDVPSRDEEALKEAAGVVGPVSVAIDASGQNF 256
Query: 264 QLYSHGVFDEYC--GHQLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSP 320
Q Y GV+DE L+HGV VVGYG G+ YWLVKNSWG+SWG++GYI+M+RN
Sbjct: 257 QFYKDGVYDEITCSSTSLDHGVLVVGYGTTRDGKDYWLVKNSWGSSWGQSGYIQMSRNKE 316
Query: 321 SSNIGICGILMQASYPV 337
+ CGI ASYP
Sbjct: 317 NQ----CGIATMASYPT 329
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 144/348 (41%), Positives = 187/348 (53%), Gaps = 51/348 (14%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
MLR +VL ++ + A S+ + + ++E + + + Y S E RF
Sbjct: 1 MLRLSVLCA----IVAVTVAASSQ--------EILRTQWEAFKTTHKKTYQSHMEELLRF 48
Query: 84 GIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYLGYN---KPYNEPRWP- 135
I++ N I N++ +S+KL N+F DL EF + G++ K P
Sbjct: 49 KIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHGTRKTGGSSFLPP 108
Query: 136 -SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQEL 194
+V LP VDWRK+GAVTPVKDQGQCGSCWAFSA ++EG + LK G+LVSLSEQ L
Sbjct: 109 ANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNL 168
Query: 195 VDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGY 254
VDC + N GC GG ME AF++I G+ TE YPY + C+ K A T TGY
Sbjct: 169 VDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGA-TDTGY 227
Query: 255 EAIPA-----------------------RYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG 289
I A +FQLYS GV+DE C + L+HGV VVGYG
Sbjct: 228 VEIKAGSEVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYG 287
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G+KYWLVKNSW SWG+ GYI M+R+ N CGI QASYP+
Sbjct: 288 VKGGKKYWLVKNSWAESWGDQGYILMSRD----NNNQCGIASQASYPL 331
>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 135/307 (43%), Positives = 170/307 (55%), Gaps = 37/307 (12%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSN 115
SM ER E + +Y + Y +D +R F NV YI+ N + N +K N+FA
Sbjct: 34 SMXERHEQRMTRYGKVY--KDPPKRXF---KENVNYIEACNNAANKPYKRGINQFA--PR 86
Query: 116 EEFISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
F ++ +V P++VD R++GAVTP+KDQGQCG CWAFSAVAA
Sbjct: 87 NRFKGHMCSSIIRITTFKFENV--TATPSTVDCRQKGAVTPIKDQGQCGCCWAFSAVAAT 144
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP-YRG 234
EGI+ L GKL+SLSEQELVDCD + GC GG M+ AF+FI + G+ P Y G
Sbjct: 145 EGIHALSAGKLISLSEQELVDCDTKGVDXGCEGGLMDDAFKFIIQNHGLKHXSQLPLYMG 204
Query: 235 KNDRCQTDKTKHHAVT-ITGYEAIPARYA-----------------------FQLYSHGV 270
+ +C ++ +A T ITGYE +PA FQ Y GV
Sbjct: 205 VDGKCNANEAAKNAATIITGYEDVPANNEKAHLQKAVANNPVSEAIDASGSDFQFYKSGV 264
Query: 271 FDEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
F CG +L+HGVT VGYG D G +YWLVKNSWGT WGE GYIRM R S +CGI
Sbjct: 265 FTGSCGTELDHGVTAVGYGVSDDGTEYWLVKNSWGTEWGEEGYIRMQRGVDSEE-ALCGI 323
Query: 330 LMQASYP 336
+QASYP
Sbjct: 324 AVQASYP 330
>gi|298705581|emb|CBJ28832.1| Cathepsin L-like proteinase [Ectocarpus siliculosus]
Length = 553
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 129/326 (39%), Positives = 178/326 (54%), Gaps = 51/326 (15%)
Query: 61 RFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN--LSFKLTDNKFADLSNEEF 118
RF W+ Q+ +G++ E+ RR I++ N ID N+ N +F L+ N+F+ LS +EF
Sbjct: 42 RFRAWMAQHGVTFGTKGEFDRRLKIFAENSDLIDTHNTANDGSTFTLSHNEFSHLSWDEF 101
Query: 119 ISTYLGYNKPYNEP--------RWPSVQYLG------------LPASVDWRKEGAVTPVK 158
T+ GY + ++P R P + G +P VDW +EGAVTPV+
Sbjct: 102 KETHFGYKRSSDKPKPARQTPERRPMEKVAGGRRRLVELTGSEIPDEVDWVREGAVTPVQ 161
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
+QG CGSCWAFS + A+EG L T L+ SE++LVDCD ++GC GG ME+AF++I
Sbjct: 162 NQGMCGSCWAFSTIGAMEGAYYLATDDLIKFSEEQLVDCD--KVDKGCFGGDMEQAFDWI 219
Query: 219 TKIGGVTTEDDYPYRG---KNDRCQT-------------------DKTKHHAVTITGYEA 256
+ GGV ED+YPY G C T D+ A+ G A
Sbjct: 220 KENGGVCPEDEYPYVGLWPPFKTCATTCTPVEGSQVKEWAQVKATDEALMTALATVGPIA 279
Query: 257 IPA---RYAFQLYSHGVFDEYCGHQLNHGVTVVGYGE-DHGEKYWLVKNSWGTSWGEAGY 312
I + AFQ YS GV+ CG +L+HGV VGYG + G YW VKNSWG SWG+ GY
Sbjct: 280 IAIEADQMAFQFYSDGVYTAPCGDKLDHGVLAVGYGTWEDGTDYWKVKNSWGDSWGQGGY 339
Query: 313 IRMAR-NSPSSNIGICGILMQASYPV 337
I + R +S G CG+L++A YP+
Sbjct: 340 ILLERADSEEDEGGQCGLLIEAIYPI 365
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 218 bits (556), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 138/362 (38%), Positives = 190/362 (52%), Gaps = 50/362 (13%)
Query: 17 IAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSE 76
IAI + M+ NA + +F+L A+ + P++ + ++ + Y E
Sbjct: 64 IAIVVVMLFVNAFILVFIL----KKRKAYQNLKATEEQPRTSYAATSTHVLEHRKNYLDE 119
Query: 77 DEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFADLSNEEFISTYLGYN------ 126
E + R I++ N I N S +S+KL NK+AD+ + EF G+N
Sbjct: 120 TEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMNGFNYTLHKE 179
Query: 127 -----KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKL 181
+ + + S +++ LP SVDWR +GAVT VKDQG CGSCWAFS+ A+EG +
Sbjct: 180 LRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSSTGALEGQHYR 239
Query: 182 KTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQT 241
K+G LVSLSEQ LVDC N GCNGG M+ AF +I GG+ TE YPY +D C
Sbjct: 240 KSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEALDDSCHF 299
Query: 242 DKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGVFDE-YCGH 277
+K A T G+ IP + +FQ YS GV+ E C
Sbjct: 300 NKGTIGA-TDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEGVYVEPACDA 358
Query: 278 Q-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASY 335
Q L+HGV VVG+G D G+ YWLVKNSWGT+WG+ G+I+M RN + CGI +SY
Sbjct: 359 QNLDHGVLVVGFGTDESGQDYWLVKNSWGTTWGDKGFIKMLRNKDNQ----CGIASASSY 414
Query: 336 PV 337
P+
Sbjct: 415 PL 416
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 125/290 (43%), Positives = 170/290 (58%), Gaps = 36/290 (12%)
Query: 82 RFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV 137
R ++ N++++D N+ +++L N+FADL+NEE+ + +L +
Sbjct: 63 RLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFLRDLSRLGRSTSGEI 122
Query: 138 --QYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
QY LP S+DWR++GAV VK QG+CGSCWAF+A+A VEGIN++ TG L+SLS
Sbjct: 123 SNQYRLREGDVLPDSIDWREKGAVVAVKSQGRCGSCWAFAAIATVEGINQIVTGDLISLS 182
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQ+LVDC ++ N GC GG+ +AF++I GGV +E+ YPY G N C T K H V+
Sbjct: 183 EQQLVDC--STRNHGCEGGWPYRAFQYIINNGGVNSEEHYPYTGTNGTCNTTKGNAHVVS 240
Query: 251 ITGYEAIPAR----------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
I Y +P+ FQLY G+F C LNHGVTVVGY
Sbjct: 241 IDSYRNVPSNDEKSLQKAVANQPISVGINASGRNFQLYHSGIFTGSCNTSLNHGVTVVGY 300
Query: 289 GEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G +G YW+VKNSWG SWG++GYI M RN S+ G CGI + SYP+K
Sbjct: 301 GTVNGNDYWIVKNSWGESWGDSGYILMERNIAESS-GKCGIAISPSYPIK 349
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 137/314 (43%), Positives = 174/314 (55%), Gaps = 39/314 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
+ ++E + + + Y S E RF I++ N I N++ +S+KL N+F DL
Sbjct: 23 LRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDL 82
Query: 114 SNEEFISTYLGYN---KPYNEPRWP--SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EF + G++ K P +V LP VDWRK+GAVTPVKDQGQCGSCWA
Sbjct: 83 LAHEFARIFNGHHGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGSCWA 142
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSA ++EG + LK G+LVSLSEQ LVDC + N GC GG ME AF++I G+ TE
Sbjct: 143 FSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEK 202
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPA-----------------------RYAFQL 265
YPY + C+ K A T TGY I A +FQL
Sbjct: 203 SYPYEAVDGECRFKKEDVGA-TDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSFQL 261
Query: 266 YSHGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV+DE C + L+HGV VVGYG G+KYWLVKNSW SWG+ GYI M+R+ N
Sbjct: 262 YSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRD----N 317
Query: 324 IGICGILMQASYPV 337
CGI QASYP+
Sbjct: 318 NNQCGIASQASYPL 331
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 139/352 (39%), Positives = 195/352 (55%), Gaps = 55/352 (15%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
ML AV++L L L P+ DPQ +++ +E W +S++Y ++E RR
Sbjct: 1 MLPLAVVALCLSAALSAPS----------LDPQ-LDDHWELWKSWHSKKYHEKEEGWRRM 49
Query: 84 GIYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSV-- 137
++ N++ I+ N ++ S++L N F D+++EEF GY + S+
Sbjct: 50 -VWEKNLKKIELHNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYKRKAETKARGSLFL 108
Query: 138 --QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
+L P SVDWR G VTPVKDQGQCGSCWAFS A+EG + KTGKLVSLSEQ LV
Sbjct: 109 EPNFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLV 168
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR-CQTDKTKHHAVTITGY 254
DC N+GCNGG M++AF+++ G+ +ED YPY G +D+ C D T +++V TG+
Sbjct: 169 DCSRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPT-YNSVNDTGF 227
Query: 255 EAIPA-----------------------RYAFQLYSHGVF--DEYCGHQLNHGVTVVGY- 288
IP+ +FQ Y G++ E +L+HGV VVGY
Sbjct: 228 VDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYG 287
Query: 289 --GED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GED G+KYW+VKNSW WG+ GYI MA++ + CGI ASYP+
Sbjct: 288 FQGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNH----CGIATAASYPL 335
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 127/310 (40%), Positives = 176/310 (56%), Gaps = 37/310 (11%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
E +E + + + Y ++ E R I+ +N + I+ N++ +S+K+ N F DL +
Sbjct: 25 EEWETFKVVHGKNYKNQFEEMFRRKIFMNNKKRIEAHNAKYEQGEVSYKMKMNHFGDLMS 84
Query: 116 EEFISTYLGYNKPYNEPRWPSVQYLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
E + G+ N R + + LP SVDWR++GAVTPVKDQGQCGSCW+FSA
Sbjct: 85 HEIKALMNGFKMTPNTKREGKIYFPSNDKLPKSVDWRQKGAVTPVKDQGQCGSCWSFSAT 144
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
++EG LK GKLVSLSEQ L+DC N GC GG M+KAF++++ G+ TE YPY
Sbjct: 145 GSLEGQIFLKKGKLVSLSEQNLMDCSKEYGNNGCEGGLMDKAFQYVSDNKGIDTESSYPY 204
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHG 269
++ C+ K K T GY IP + +F YS G
Sbjct: 205 EARDYACRFKKDKVGG-TDKGYVDIPEGDEKALQNALATVGPISVAIDASHESFHFYSEG 263
Query: 270 VFDE-YC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
V++E YC + L+HGV VGYG ++G+ YWLVKNSWG SWGE+GYI++ARN + C
Sbjct: 264 VYNEPYCSSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGESGYIKIARNHSNH----C 319
Query: 328 GILMQASYPV 337
GI ASYP+
Sbjct: 320 GIASMASYPI 329
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 173/319 (54%), Gaps = 46/319 (14%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
E + + ++ + Y E E + R I++ N I N + +SFKL NK+ADL +
Sbjct: 57 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 116
Query: 116 EEFISTYLGYN-----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
EF G+N + + + S ++ LP SVDWR +GAVT VKDQG CG
Sbjct: 117 HEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCG 176
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS+ A+EG + K+G LVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 177 SCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 236
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARY 261
TE YPY +D C +K A T G+ IP +
Sbjct: 237 DTEKSYPYEAIDDSCHFNKGTVGA-TDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHE 295
Query: 262 AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQ YS GV++E C Q L+HGV VVG+G D GE YWLVKNSWGT+WG+ G+I+M RN
Sbjct: 296 SFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRN 355
Query: 319 SPSSNIGICGILMQASYPV 337
+ CGI +SYP+
Sbjct: 356 KENQ----CGIASASSYPL 370
>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
Length = 331
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 134/336 (39%), Positives = 189/336 (56%), Gaps = 41/336 (12%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
+LLWV + + A + + DP +++ ++ W K YS++Y ++E R I+ N+++
Sbjct: 3 WLLWVALVCSSAMARLHK---DP-TLDNHWDLWKKTYSKQYKEKNEEVARRLIWEKNLKF 58
Query: 93 IDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPAS 145
+ N ++ S+ L+ N D+++EE +S P R + S LP S
Sbjct: 59 VMLHNLEHSMGMHSYDLSMNHLGDMTSEEVMSLMSSLRVPSQWQRNVTFKSNPNQKLPDS 118
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-ENQ 204
+DWR++G VT VK QG CG+CWAFSAV A+E KLKTGKLVSLS Q LVDC N+
Sbjct: 119 LDWREKGCVTDVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNK 178
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------ 258
GCNGG+M +AF++I G+ +E YPY+ + +CQ D K+ A T + Y +P
Sbjct: 179 GCNGGFMTRAFQYIIDNNGIDSEASYPYKATDGKCQYD-PKNRAATCSKYTELPYGSEDA 237
Query: 259 -----------------ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVK 300
+R +F LY GV +D C +NHGV VVGYG +G+ YWLVK
Sbjct: 238 LKEAVANKGPVSVGIDASRPSFFLYKSGVYYDPSCTDNVNHGVLVVGYGNLNGKDYWLVK 297
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
NSWG ++GE GYIRMARNS + CGI SYP
Sbjct: 298 NSWGLNFGEQGYIRMARNSGNH----CGIASFPSYP 329
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 114/233 (48%), Positives = 151/233 (64%), Gaps = 6/233 (2%)
Query: 32 LFLLWVLGIPAGAW-SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV 90
+ + + L GAW S+ + SM ER E W+ Y+R Y +E Q R+ I+ NV
Sbjct: 8 ICITFALFFSIGAWTSQCMARTLQEASMYERHEQWMASYARVYKDANEKQMRYKIFKENV 67
Query: 91 QYIDYINSQ-NLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY---LGLPASV 146
Q ID NS+ + S+KL N+FADL+NEEF S G+ + +Y +PAS+
Sbjct: 68 QRIDSFNSESDKSYKLAVNQFADLTNEEFKSLRNGFKGHMCSAQAGHFRYENVTAVPASI 127
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGC 206
DWRK+GAVT +K+QGQCGSCWAFSAVAAVEGI ++KTGKL+SLSEQELVDCD NSE+QGC
Sbjct: 128 DWRKKGAVTQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSEDQGC 187
Query: 207 NGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA 259
GG M+ AF+FI + G+ +E YPY + C+T + + ITGYE +PA
Sbjct: 188 QGGLMDDAFKFIEQ-HGLASEATYPYDAADSTCKTKEEAKPSAKITGYEDVPA 239
>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 121/319 (37%), Positives = 167/319 (52%), Gaps = 45/319 (14%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSNEE 117
R E W+ +Y R Y E RR ++++N ++ID +N + N ++ L N F+DL+NEE
Sbjct: 38 RHRHERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEE 97
Query: 118 FISTYLGY-------------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
F T+LGY + P Q P SVDWR GAVTPVK QG CG
Sbjct: 98 FAQTHLGYRHQPGPGGLRPEDSSPAAAVNVTDAQLQSTPDSVDWRARGAVTPVKHQGHCG 157
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAF+AVAA EG+ ++ TG L+S+SEQ+++DC + + C GY+ A +IT GG+
Sbjct: 158 SCWAFAAVAATEGLVQIATGNLISMSEQQVLDCTGGTSS--CKSGYVNAALTYITASGGL 215
Query: 225 TTEDDYPYRGKNDRCQTDKTK---------HHAVTITGYE--------------AIPARY 261
TE Y Y + C++ H + + G E A+ A
Sbjct: 216 QTEAAYAYSAEQGACRSGGASPNSAAAVGVHRSAMLNGDEGALQVLVAGQPVAVAVEAEP 275
Query: 262 AFQLYSHGVF--DEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARN 318
F Y GV+ CG +L+H VTVVGYG D G+ YW+VKN WG WGE GY+R+ R
Sbjct: 276 DFHHYKSGVYVGSPSCGQKLHHAVTVVGYGADGDGQGYWVVKNQWGAGWGEVGYMRLTRG 335
Query: 319 SPSSNIGICGILMQASYPV 337
+ +N CG+ A YP
Sbjct: 336 NGGNN---CGMATHAYYPT 351
>gi|310942958|pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942959|pdb|3P5V|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942961|pdb|3P5X|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 109/219 (49%), Positives = 135/219 (61%), Gaps = 24/219 (10%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP VDWR GAV +KDQGQCGS WAFS +AAVEGINK+ TG L+SLSEQELVDC
Sbjct: 1 LPDYVDWRSSGAVVDIKDQGQCGSXWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQ 60
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP--- 258
+GC+GG+M F+FI GG+ TE +YPY + +C D + V+I YE +P
Sbjct: 61 NTRGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQCNLDLQQEKYVSIDTYENVPYNN 120
Query: 259 -------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
A Y FQ YS G+F CG ++H VT+VGYG + G YW+V
Sbjct: 121 EWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGTEGGIDYWIV 180
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
KNSWGT+WGE GY+R+ RN +G CGI +ASYPVK
Sbjct: 181 KNSWGTTWGEEGYMRIQRN--VGGVGQCGIAKKASYPVK 217
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 174/316 (55%), Gaps = 43/316 (13%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
+ ++E + + + Y S E RF I++ N I N++ +S+KL N+F DL
Sbjct: 23 LRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDL 82
Query: 114 SNEEFISTYLGY-------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
EF + GY + P +V LP++VDWRK+GAVTPVKDQGQCGSC
Sbjct: 83 LAHEFAKIFNGYRGQRTSRGSTFMPP--ANVNDSSLPSTVDWRKKGAVTPVKDQGQCGSC 140
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSA ++EG + LK G+LVSLSEQ LVDC + N GC GG M+ AF++I G+
Sbjct: 141 WAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFKYIKANDGIDA 200
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAI-----------------------PARYAF 263
E+ YPY +D+C+ K A T TG+ I +F
Sbjct: 201 EESYPYEAMDDKCRFKKEDVGA-TDTGFVDIEGGSEDDLKKAVATVGPISVAIDAGHSSF 259
Query: 264 QLYSHGVFD--EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
QLYS GV+D E +L+HGV VGYG G+KYWLVKNSWG SWG+ GYI M+R+ +
Sbjct: 260 QLYSEGVYDEPECSSEELDHGVLAVGYGVKDGKKYWLVKNSWGGSWGDNGYILMSRDKNN 319
Query: 322 SNIGICGILMQASYPV 337
CGI ASYP+
Sbjct: 320 Q----CGIASAASYPL 331
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 137/344 (39%), Positives = 188/344 (54%), Gaps = 51/344 (14%)
Query: 38 LGIPAG-------AWS------EGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFG 84
+G PAG AW+ P DP +++ + W K Y ++Y ++E R
Sbjct: 1 MGAPAGSTIRTWLAWALLACSYAAAPVDRDP-ALDHHWNLWKKTYGKQYKEKNEEVARRL 59
Query: 85 IYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSV 137
I+ N++++ N ++ S+ L N D+++EE IS P PR + S
Sbjct: 60 IWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSN 119
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
LP SVDWR++G VT VK QG CG+CWAFSAV A+E KLKTGKLVSLS Q LVDC
Sbjct: 120 SNQKLPDSVDWREKGCVTKVKYQGACGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDC 179
Query: 198 DVNS-ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
N+GCNGG+M +AF++I G+ +E YPY+ + +C+ D +K+ A T + Y
Sbjct: 180 STEKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKATDGKCRYD-SKNRAATCSKYTE 238
Query: 257 IP----------------------ARY-AFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDH 292
+P AR+ +F LY GV +D C +NHGV VVGYG +
Sbjct: 239 LPSGSEDDLKEAVANKGPVSVAIDARHSSFFLYRSGVYYDPSCTQNVNHGVLVVGYGNLN 298
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
G+ YWLVKNSWG ++G+ GYIRMARNS + CGI SYP
Sbjct: 299 GKDYWLVKNSWGLNFGDQGYIRMARNSGNH----CGIASYPSYP 338
>gi|340508003|gb|EGR33817.1| papain family cysteine protease, putative [Ichthyophthirius
multifiliis]
Length = 334
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 142/347 (40%), Positives = 192/347 (55%), Gaps = 41/347 (11%)
Query: 16 KIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSME--ERFENWLKQYSREY 73
KI I M+ +L A+ S LL V P S P Q++ FEN+ +Y+++Y
Sbjct: 3 KIKIQMKGLLLLALASFTLLSVG--PILLLSPQTPLIRSSQNVNYVSEFENFNFKYNKQY 60
Query: 74 GSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPR 133
S+ ++Q R +++ N++YI+ N ++ SF L N + L+ EEFI TYLG N P
Sbjct: 61 QSQQQYQYRLQVFTENLKYIEQQNKKSQSFTLGVNSISHLTREEFIQTYLGLNIINYYPE 120
Query: 134 WPSVQYLG---LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
S + + LP SVDWR +GAVTPVKDQGQCGSCWAFS ++EG N L+ L + S
Sbjct: 121 NISQEIVNVEDLPDSVDWRTQGAVTPVKDQGQCGSCWAFSTTGSLEGANYLQNKTLSAFS 180
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQ+L+DC N GCNGG M +AF+++ GVTTED YPY K+ + K K+
Sbjct: 181 EQQLMDCSWLYGNLGCNGGLMPRAFKWVAS-HGVTTEDKYPYEAKSHF--SCKNKNGEFK 237
Query: 251 ITGYEAIPA--------------------RYAFQLYSHGVFDEYCGHQLNHGVTVVGYGE 290
I+ Y+ IP +Q YS GVFD+ C +LNHGV VGY
Sbjct: 238 ISSYQEIPVGDCDALAQSVSQRPTSIAVDASNWQSYSSGVFDD-CATRLNHGVLAVGYTS 296
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+ YW+VKNSW TSWG+ GYI + R + CG+ ASYPV
Sbjct: 297 E----YWIVKNSWNTSWGQQGYINLKRGN------TCGLCNSASYPV 333
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 173/319 (54%), Gaps = 46/319 (14%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
E + + ++ + Y E E + R I++ N I N + +SFKL NK+ADL +
Sbjct: 61 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 120
Query: 116 EEFISTYLGYN-----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
EF G+N + + + S ++ LP SVDWR +GAVT VKDQG CG
Sbjct: 121 HEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCG 180
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS+ A+EG + K+G LVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 181 SCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 240
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARY 261
TE YPY +D C +K A T G+ IP +
Sbjct: 241 DTEKSYPYEAIDDSCHFNKGTVGA-TDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHE 299
Query: 262 AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQ YS GV++E C Q L+HGV VVG+G D GE YWLVKNSWGT+WG+ G+I+M RN
Sbjct: 300 SFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRN 359
Query: 319 SPSSNIGICGILMQASYPV 337
+ CGI +SYP+
Sbjct: 360 KENQ----CGIASASSYPL 374
>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 218 bits (554), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 108/218 (49%), Positives = 137/218 (62%), Gaps = 24/218 (11%)
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P SVDWR +G + VKDQG CGSCWAFSAVAA+E IN + TG L+SLSEQELVDCD S
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSY 60
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA 262
N+GC+GG M+ AFEF+ GG+ TE+DYPY+ +N C + VTI YE +P
Sbjct: 61 NEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNGVCDQYRKNAKVVTIDSYEDVPVNNE 120
Query: 263 ----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVK 300
FQ Y G+F CG ++HGV V GYG ++G YW+V+
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVVAGYGTENGMDYWIVR 180
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
NSWG WGE GY+R+ RN SS+ G+CG+ ++ SYPVK
Sbjct: 181 NSWGAKWGEKGYLRVQRNVASSS-GLCGLAIEPSYPVK 217
>gi|6630974|gb|AAF19631.1|AF194427_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 127/311 (40%), Positives = 174/311 (55%), Gaps = 41/311 (13%)
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFADLSNEE 117
+E+W +Y + Y E R ++ SN+Q + N +++L N +ADL NEE
Sbjct: 19 WESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEE 78
Query: 118 FIST------YLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSA 171
F++ +K + P V + LP+SVDWR +G VTPVKDQGQCGSCW FSA
Sbjct: 79 FMALKGSGGLLQAKDKSSTQTFKPLVG-VTLPSSVDWRNQGYVTPVKDQGQCGSCWTFSA 137
Query: 172 VAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYP 231
++EG + KTG L+SLSEQ+LVDC N GCNGG ME A+++I +GGV E YP
Sbjct: 138 TGSLEGQHFAKTGNLLSLSEQQLVDCAGRYGNYGCNGGLMESAYDYIKGVGGVELESAYP 197
Query: 232 YRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSH 268
Y ++ RC+ D++K A T GY IP + Y+FQLY
Sbjct: 198 YTARDGRCKFDRSKVVA-TCKGYVVIPVGDEQALMQAVGTIGPVAVSIDASGYSFQLYES 256
Query: 269 GVFD-EYCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGI 326
GV+D C L+HGV VGYG + G+ YWLVKNSWG WG+ GYI+M+++ +
Sbjct: 257 GVYDFRRCSSTNLDHGVLAVGYGTEGGQNYWLVKNSWGPGWGDQGYIKMSKDKNNQ---- 312
Query: 327 CGILMQASYPV 337
CGI + YP+
Sbjct: 313 CGIATDSCYPL 323
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 175/314 (55%), Gaps = 39/314 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
+ ++E + + + Y S E RF I++ + I N++ +S+KL N+F DL
Sbjct: 23 LRTQWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIARHNAKYAKGLVSYKLGMNQFGDL 82
Query: 114 SNEEFISTYLGYN---KPYNEPRWP--SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EF + G++ K P +V LP +VDWRK+GAVTPVKDQGQCGSCWA
Sbjct: 83 LAHEFARIFNGHHGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWA 142
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSA ++EG + LK G+LVSLSEQ LVDC + N GC GG ME AF++I G+ TE
Sbjct: 143 FSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEK 202
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPA-----------------------RYAFQL 265
YPY + C+ K A T TGY I A +FQL
Sbjct: 203 SYPYEAVDGECRFKKEDVGA-TDTGYVEIKAGSEDDLKKAVATVGPISVAIDASHSSFQL 261
Query: 266 YSHGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV+DE C + L+HGV VVGYG G+KYWLVKNSW SWG+ GYI M+R+ N
Sbjct: 262 YSEGVYDEPECSSEDLDHGVLVVGYGVKGGKKYWLVKNSWAESWGDQGYILMSRD----N 317
Query: 324 IGICGILMQASYPV 337
CGI QASYP+
Sbjct: 318 NNQCGIASQASYPL 331
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 217 bits (553), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 173/319 (54%), Gaps = 46/319 (14%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
E + + ++ + Y E E + R I++ N I N + +SFKL NK+ADL +
Sbjct: 27 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 86
Query: 116 EEFISTYLGYN-----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
EF G+N + + + S ++ LP SVDWR +GAVT VKDQG CG
Sbjct: 87 HEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCG 146
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS+ A+EG + K+G LVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 147 SCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 206
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARY 261
TE YPY +D C +K A T G+ IP +
Sbjct: 207 DTEKSYPYEAIDDSCHFNKGTVGA-TDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHE 265
Query: 262 AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQ YS GV++E C Q L+HGV VVG+G D GE YWLVKNSWGT+WG+ G+I+M RN
Sbjct: 266 SFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRN 325
Query: 319 SPSSNIGICGILMQASYPV 337
+ CGI +SYP+
Sbjct: 326 KENQ----CGIASASSYPL 340
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 136/347 (39%), Positives = 191/347 (55%), Gaps = 52/347 (14%)
Query: 31 SLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV 90
++FLL LGI A A + + + E + + + + Y S+ E R I+ N
Sbjct: 4 AIFLL--LGILAAAQAISFFNL-----VTEEWNTFKVTHRKAYDSKIEESFRMKIFMENW 56
Query: 91 QYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYNKPYNE----------PRWPS 136
I N + +S+KL NK+ D+ + EFI+T G+NK + R+
Sbjct: 57 HKIALHNQKYELNEVSYKLGMNKYGDMLHHEFINTLNGFNKSVSAQLRAQRRPIGSRFIE 116
Query: 137 VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
+ +P+SVDWR GAVTP+KDQG CGSCW+FSA A+EG + TGKLVSLSEQ L+D
Sbjct: 117 PANVEIPSSVDWRTHGAVTPIKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLID 176
Query: 197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
C N GCNGG M++AF++I G+ TE YPY +ND+C+ + +++ T +GY
Sbjct: 177 CSGRYGNNGCNGGLMDQAFQYIKDNHGLDTEISYPYEAENDKCRYN-PRNNGATDSGYVD 235
Query: 257 IP-----------------------ARYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG-E 290
IP + +FQ Y GV+ E C + L+HGV VVGYG +
Sbjct: 236 IPEGNEKKLKAAVATIGPVSVAIDASAESFQFYREGVYYEPRCSSENLDHGVLVVGYGTD 295
Query: 291 DHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
D+ + YWLVKNSWG +WG+ GYI+MARN + CGI ASYP+
Sbjct: 296 DNDQDYWLVKNSWGVTWGDEGYIKMARNKDNH----CGIASSASYPL 338
>gi|228245|prf||1801240C Cys protease 3
Length = 321
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 122/305 (40%), Positives = 172/305 (56%), Gaps = 34/305 (11%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEF 118
+++ QY R+YG E R ++ N Q I+ N + ++FK+ N+F D++NEEF
Sbjct: 20 DHFKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEF 79
Query: 119 ISTYLGYNK-PYNEPRWP-SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
+ GY K EP+ + + + VDWR + VTPVKDQ QCGSCWAFSA A+E
Sbjct: 80 NAVMKGYKKGSRGEPKAVFTAEGRPMARDVDWRTKALVTPVKDQEQCGSCWAFSATGALE 139
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
G + LK +LVSLSEQ+LVDC + N GC GG+M AF++I GG+ TE YPY ++
Sbjct: 140 GQHFLKNDELVSLSEQQLVDCSTDYGNDGCGGGWMTSAFDYIKDNGGIDTESSYPYEAED 199
Query: 237 DRCQTDKTKHHAVTITGYEAI----------------------PARYAFQLYSHGV-FDE 273
C+ D A+ E + + ++FQ YS GV +++
Sbjct: 200 RSCRFDANSIGAICTGSVEIVQHTEEALQEAVSGVGPISVAIDASHFSFQFYSSGVYYEQ 259
Query: 274 YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQ 332
C L+HGV VGYG + + YWLVKNSWG+SWG+AGYI+M+RN ++ CGI +
Sbjct: 260 NCSPTFLDHGVLAVGYGTESTKDYWLVKNSWGSSWGDAGYIKMSRNRDNN----CGIASE 315
Query: 333 ASYPV 337
SYP
Sbjct: 316 PSYPT 320
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 134/345 (38%), Positives = 197/345 (57%), Gaps = 47/345 (13%)
Query: 32 LFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQ 91
+ L +L + GA S P DP ++ + + +W +S++Y ++E RR I+ N++
Sbjct: 1 MIYLCILALSFGA-SFAAP-GLDP-ALNDHWLSWKSWHSKKYHEKEEGWRRM-IWEKNLK 56
Query: 92 YIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-----GL 142
I+ N + S++L N F D++NEEF G+ + ++ ++ Q+L
Sbjct: 57 MIELHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGFKQSRSQRKYKGSQFLEPNFLQA 116
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P SVDWR++G VTPVKDQGQCGSCWAFSA A+EG + KTGKLVSLSEQ L+DC
Sbjct: 117 PKSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQNLIDCSGPEG 176
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---- 258
NQGCNGG M++AF++I G+ +E+ YPY GK+D K ++++ TG+ IP
Sbjct: 177 NQGCNGGLMDQAFQYIKDNNGIDSEESYPYIGKDDEDCLYKPEYNSANDTGFVDIPEGRE 236
Query: 259 -------------------ARYAFQLYSHGVFDE-YC-GHQLNHGVTVVGYG-----EDH 292
+ +FQ Y GV+ E C +L+HGV VVGYG +D+
Sbjct: 237 RALMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEELDHGVLVVGYGYEGTDDDN 296
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
++YW+VKNSW WG+ GYI MA++ ++ CGI ASYP+
Sbjct: 297 KKRYWIVKNSWSEKWGDQGYIHMAKDRSNN----CGIASAASYPM 337
>gi|194352764|emb|CAQ00110.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 406
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 125/347 (36%), Positives = 181/347 (52%), Gaps = 70/347 (20%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
M +RF W+ ++R Y + E RRF +Y SN+++I+ +N++ L+++L + F DL
Sbjct: 59 MMDRFHVWMTVHNRSYSTAGEKARRFEVYRSNMRFIEAVNAEAATSGLTYELGEGPFTDL 118
Query: 114 SNEEFISTYLG------------YNKPYNEPRWPSVQYLGL--------------PASVD 147
+NEEF+ Y G ++ S+ LG P S+D
Sbjct: 119 TNEEFMELYTGQILEDDQSEDGDDDEQIITTHAGSIDGLGTHKGATVYANFSASAPTSID 178
Query: 148 WRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCN 207
WRK G VTPVK+Q QCGSCWAF VA +EGI+K+K G LVSLSEQ+L+DCD + GC
Sbjct: 179 WRKRGVVTPVKNQKQCGSCWAFPTVATIEGIHKIKRGTLVSLSEQQLIDCDY--LDNGCK 236
Query: 208 GGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYAFQL-- 265
GG + +AF++I K GG+T+ Y Y+ RC + + A I G+ + + L
Sbjct: 237 GGLVTRAFQWIKKNGGITSTSSYKYKAVRGRCL--RNRKPAAKIVGFRKVKSNSEVSLMN 294
Query: 266 --------------------YSHGVFDEYCG-HQLNHGVTVVGYGEDH------------ 292
Y G+++ C +LNH VTVVGYG+
Sbjct: 295 AVANQPVAVSISSHSSHFHHYKGGIYNGPCSTTKLNHAVTVVGYGQQQQNGADSVHASAP 354
Query: 293 GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
G KYW+VKNSWGT+WG+ GYI M R + S+ G CGI + +P+ +
Sbjct: 355 GAKYWIVKNSWGTTWGDKGYILMKRGTKHSS-GQCGIATRPVFPLMK 400
>gi|67605684|ref|XP_666697.1| cryptopain precursor [Cryptosporidium hominis TU502]
gi|54657738|gb|EAL36466.1| cryptopain precursor [Cryptosporidium hominis]
Length = 401
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 129/364 (35%), Positives = 195/364 (53%), Gaps = 49/364 (13%)
Query: 21 MRMMLRNAVLSLFLLWVLGIP-------AGAWSEGYPQKY-DPQSMEER--FENWLKQYS 70
++ ++ ++++F++ V+ + + + P Y DP + E R FE + K+Y+
Sbjct: 35 LKNIIIATLIAIFIVLVVTVSLYITNNTSDKIDDFVPGDYVDPATREYRKSFEEFKKKYN 94
Query: 71 REYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKP-- 128
+ Y S +E +RF IY N+ +I NSQ S+ L N+F DLS EEF++ + GY K
Sbjct: 95 KTYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMARFTGYIKDSK 154
Query: 129 -----YNEPRWPSVQY---LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINK 180
+ R + + P S++W + G V P+++Q CGSCWAFSAVAA+EG
Sbjct: 155 DDERVFKSSRVSASELEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSAVAALEGATC 214
Query: 181 LKTGK-LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC 239
+T + L SLSEQ+ VDC + N GC+GG M AF++ K + T DDYPY + C
Sbjct: 215 AQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTNDDYPYFAEEKTC 274
Query: 240 QTDKTKHH-AVTITGYEAIPAR-----------------------YAFQLYSHGVFDEYC 275
+++ + + Y+ + R FQ Y GVFD C
Sbjct: 275 MDSFCENYIEIPVKAYKYVFPRNINTLKTALAKYGPISVAIQADQTPFQFYKSGVFDAPC 334
Query: 276 GHQLNHGVTVVGY--GEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
G ++NHGV +VGY ED ++YWLV+NSWG +WGE GYI++A +S G CGIL++
Sbjct: 335 GTKVNHGVVLVGYDMDEDTNKEYWLVRNSWGEAWGEKGYIKLALHSGKK--GTCGILVEP 392
Query: 334 SYPV 337
YPV
Sbjct: 393 VYPV 396
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 180/319 (56%), Gaps = 38/319 (11%)
Query: 50 PQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKL 105
P DP +++ + W K Y ++Y ++E R I+ N++++ N ++ S+ L
Sbjct: 14 PVDRDP-ALDHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDL 72
Query: 106 TDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPASVDWRKEGAVTPVKDQGQ 162
N D+++EE IS P PR + S LP SVDWR++G VT VK QG
Sbjct: 73 GMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGA 132
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-ENQGCNGGYMEKAFEFITKI 221
CG+CWAFSAV A+E KLKTGKLVSLS Q LVDC N+GCNGG+M +AF++I
Sbjct: 133 CGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDN 192
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------A 259
G+ +E YPY+ + +C+ D +K+ A T + Y +P A
Sbjct: 193 NGIDSEASYPYKATDGKCRYD-SKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDA 251
Query: 260 RY-AFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
R+ +F LY GV +D C +NHGV VVGYG +G+ YWLVKNSWG ++G+ GYIRMAR
Sbjct: 252 RHSSFFLYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKNSWGLNFGDQGYIRMAR 311
Query: 318 NSPSSNIGICGILMQASYP 336
NS + CGI SYP
Sbjct: 312 NSGNH----CGIASYPSYP 326
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 130/348 (37%), Positives = 188/348 (54%), Gaps = 50/348 (14%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+ LFL ++ + A A + + + + + + + ++++ Y ++ E + R I+ N
Sbjct: 1 MKLFLFLIVAVLATAQAISFFE-----LVNQEWTTFKMEHNKVYKNDVEERFRMKIFMDN 55
Query: 90 VQYIDYINS----QNLSFKLTDNKFADLSNEEFISTYLGYNKPYN----EPRWPSVQY-- 139
I N + +S+KL NK+ D+ + EF++T G+NK N R P
Sbjct: 56 KHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIAASFI 115
Query: 140 ----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
+ LP +VDWR+ GAVTPVKDQG CGSCW+FSA A+EG + +TG L+ LSEQ L+
Sbjct: 116 EPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNLI 175
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE 255
DC N GCNGG M++AF++I G+ TE YPY +ND+C+ + A + GY
Sbjct: 176 DCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDV-GYV 234
Query: 256 AIP-----------------------ARYAFQLYSHGVF--DEYCGHQLNHGVTVVGYGE 290
IP + +FQ YS GV+ E L+HGV VGYG
Sbjct: 235 DIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYGT 294
Query: 291 D-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
D +G+ YWLVKNSWG +WG+ GYI+MARN + CGI ASYP+
Sbjct: 295 DENGQDYWLVKNSWGETWGDNGYIKMARNK----LNHCGIASTASYPL 338
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 132/304 (43%), Positives = 171/304 (56%), Gaps = 39/304 (12%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFADLSNEEFISTYLG 124
+ +EY S+ E + R IY N + N S+ + NKF DL + EF S G
Sbjct: 34 HKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNG 93
Query: 125 Y-NKPYNEPRWPSV------QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEG 177
Y +K N R S + +P SVDWR++GA+TPVKDQGQCGSCWAFS+ A+EG
Sbjct: 94 YQHKKQNSSRAESTFTFMEPANVTVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEG 153
Query: 178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
KTGKLVSLSEQ L+DC N+GCNGG M++AF++I G+ TE+ YPY ++D
Sbjct: 154 QTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDD 213
Query: 238 RC--------------------QTDKTKHHAVTITGYE-AIPARY-AFQLYSHGVFDE-Y 274
C + DK K T+ AI A + +FQ YS GV+ E
Sbjct: 214 VCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPS 273
Query: 275 C-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
C L+HGV VVGYG D+G+ YWLVKNSW WG+ GYI+MARN + CG+ A
Sbjct: 274 CDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKMARNRKNH----CGVASAA 329
Query: 334 SYPV 337
SYP+
Sbjct: 330 SYPL 333
>gi|118424553|gb|ABK90824.1| cathepsin L-like cysteine proteinase [Spodoptera exigua]
Length = 344
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 171/316 (54%), Gaps = 51/316 (16%)
Query: 68 QYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYL 123
++S++Y SE E + R IY N I N + +S+KL NK+AD+ + EF+ T
Sbjct: 33 EHSKQYDSEVEDKFRMKIYVENKHRITKHNQRFEQRLVSYKLKPNKYADMLHHEFVHTMN 92
Query: 124 GYNKPYNE----------------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
G+NK + + ++ P VDWRK+GAVT VKDQG+CGSCW
Sbjct: 93 GFNKTAKHGGRNKNVHGKGHDGRAATFIAPAHVSYPDHVDWRKKGAVTDVKDQGKCGSCW 152
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS A+EG + KTG LVSLSEQ L+DC N GCNGG M+ AF++I GG+ TE
Sbjct: 153 AFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNGGIDTE 212
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQ 264
YPY +D+C+ + K G+ IP ++ FQ
Sbjct: 213 KSYPYEAVDDKCRYN-PKESGADDVGFVDIPQGDEEKLMQAVATVGPISVAIDASQETFQ 271
Query: 265 LYSHGV-FDEYCGH-QLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
YS GV +DE C L+HGV VVGYG E+ G WLVKNSWG SWGE GYI+MARN +
Sbjct: 272 FYSKGVYYDENCSSTDLDHGVMVVGYGTEEDGSDDWLVKNSWGRSWGELGYIKMARNKNN 331
Query: 322 SNIGICGILMQASYPV 337
CGI ASYP+
Sbjct: 332 H----CGIASSASYPL 343
>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
purpuratus]
gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 334
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 134/343 (39%), Positives = 187/343 (54%), Gaps = 45/343 (13%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
+ F++ +L + AGA + P + +E ++ W+ + +EY + E R I+ N
Sbjct: 1 MKTFIIVLLSV-AGALATRLPSR----DFDEEWKEWVDYHGKEYSAMGEEMERRMIWEDN 55
Query: 90 VQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYL-----GYNKPYNEPRWPSVQYL 140
++ I N ++ +++L N+F D++N EF++T G K + ++L
Sbjct: 56 LRIITKHNLEHSQGKTTYRLGMNEFGDMTNAEFVATRTMKKMSGVPKVGQGSTFLPSEFL 115
Query: 141 GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVN 200
LP SVDWR EG VTPVKDQGQCGSCWAFS V A+EG + +KTG LVSLSEQ LVDC
Sbjct: 116 QLPDSVDWRTEGYVTPVKDQGQCGSCWAFSTVGALEGQHFVKTGTLVSLSEQNLVDCSQA 175
Query: 201 SENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA- 259
N GCNGG+ A E+I GG+ TE YPY G +D C +T TITG+ + A
Sbjct: 176 EGNDGCNGGWPAWADEYIKSNGGIDTEVGYPYEGVDDSCHY-RTSDVGATITGFAEVEAD 234
Query: 260 ----------------------RYAFQLYSHGVFDE--YCGHQLNHGVTVVGYGED-HGE 294
+ +FQLY GV+DE L+H VT VGY G+
Sbjct: 235 SEKALEKALAQVGPISVCIDATQPSFQLYESGVYDEPDCSSTALDHCVTAVGYDSTADGD 294
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KY++VKNSWGT+WG+ GYI M+R+ CGI A+YP+
Sbjct: 295 KYYIVKNSWGTTWGQEGYIWMSRDKQKQ----CGIATNATYPL 333
>gi|403302736|ref|XP_003942009.1| PREDICTED: cathepsin K isoform 2 [Saimiri boliviensis boliviensis]
Length = 383
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 139/342 (40%), Positives = 190/342 (55%), Gaps = 51/342 (14%)
Query: 33 FLLW---VLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSS 88
F++W VL +P +++ YP++ ++ +E W K + ++Y S+ DE RR I+
Sbjct: 53 FVMWGLKVLLLPMVSFAL-YPEEI----LDTHWELWKKTHRKQYTSKVDEISRRL-IWEK 106
Query: 89 NVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG--- 141
N++YI N + +F+L N D+++EE + G P + R Y+
Sbjct: 107 NLKYISIHNLEASLGVHTFELAMNHLGDMTSEEVVQKMTGLKVPTSFSRSNDTLYIPDWE 166
Query: 142 --LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDV 199
P SVD+RK+G VTPVK+QGQCGSCWAFS+V A+EG K KTGKL++LS Q LVDC
Sbjct: 167 GRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 225
Query: 200 NSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP- 258
SEN GC GGYM AF+++ K G+ +ED YPY G+ + C + T A GY IP
Sbjct: 226 -SENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPE 283
Query: 259 ----------ARY------------AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGE 294
AR +FQ YS GV +DE C LNH V VGYG G
Sbjct: 284 GNEKALKRAVARVGPISVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGN 343
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
K+W++KNSWG +WG GYI MARN ++ CGI AS+P
Sbjct: 344 KHWIIKNSWGENWGNKGYILMARNKNNA----CGIANLASFP 381
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 132/338 (39%), Positives = 185/338 (54%), Gaps = 38/338 (11%)
Query: 31 SLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNV 90
S+ + W++ + G S Q + +++ ++ W K Y ++Y ++E R I+ N+
Sbjct: 9 SIIMKWLVLVLLGC-SSAMAQLHKDPTLDRHWDLWKKTYGKQYKEKNEEGVRRLIWEKNL 67
Query: 91 QYIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLP 143
+++ N ++ S+ L N D+++EE + P R + S LP
Sbjct: 68 KFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTALMSSLRVPSQWQRNVTYKSNPNQKLP 127
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-E 202
SVDWR +G VT VK QG CGSCWAFSAV A+E KLKTGKLVSLS Q LVDC V
Sbjct: 128 DSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSVGKYS 187
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---- 258
N+GCNGG+M +AF++I G+ +E YPY+ + +CQ D +K+ A T + Y +P
Sbjct: 188 NRGCNGGFMTEAFQYIIDNNGIESEASYPYKAMDGKCQYD-SKYRAATCSRYTELPEDSE 246
Query: 259 -------------------ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWL 298
+ +F LY GV +D C +NHGV VVGYG +G+ YWL
Sbjct: 247 DALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDPACTLHVNHGVLVVGYGNLNGKDYWL 306
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
VKNSWG +G+ GYIRMARNS + CGI ASYP
Sbjct: 307 VKNSWGLHFGDQGYIRMARNSGNH----CGIASYASYP 340
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 127/308 (41%), Positives = 176/308 (57%), Gaps = 38/308 (12%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFADLSNEEF 118
E + ++ R+Y +E + R ++ N+QYI+ N S +++ L N+F+DL+N+EF
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQFSDLTNDEF 80
Query: 119 ISTYLGYN---KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
S GY +P + S VDWR +G VT VKDQGQCGSCWAFSA ++
Sbjct: 81 NSMMKGYKTSLRPKPVAVFTSTDAAPETTEVDWRTKGCVTHVKDQGQCGSCWAFSATGSL 140
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNS-ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
EG + LK G+LVSL+EQ+LVDC NQGCNGG++ +AF++I GG+ TE YPY
Sbjct: 141 EGQHFLKYGELVSLAEQQLVDCAGGIYYNQGCNGGWVNQAFKYIKANGGIDTESSYPYEA 200
Query: 235 KNDRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQLYSHGVF 271
+++ C+ + + A T +G+ +I A +FQ YS GV+
Sbjct: 201 RDNTCRFN-SNSVAATCSGFVSIAQGSESPEVRRTTNTGPISVAIDAAHRSFQSYSSGVY 259
Query: 272 DE--YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
E QL+H V VGYG + G+ +WLVKNSWGTSWG AGYI MARN ++ CGI
Sbjct: 260 YEPSCSSSQLDHAVLAVGYGSEGGQDFWLVKNSWGTSWGSAGYINMARNRNNN----CGI 315
Query: 330 LMQASYPV 337
ASYP
Sbjct: 316 ATDASYPT 323
>gi|356552228|ref|XP_003544471.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 351
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 131/336 (38%), Positives = 191/336 (56%), Gaps = 45/336 (13%)
Query: 28 AVLSLFLLWVLGIPA--------GAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEW 79
A++ LF+++ + A ++ ++ D + M FE WL ++ + Y + E
Sbjct: 4 AIVLLFMVFAVSSALDMSIISHDNAHADRATRRTDDEVMS-MFEEWLVKHDKVYNALGEK 62
Query: 80 QRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYL---------GYNKPYN 130
++RF I+ +N+++ID NS N ++KL N FADL+N E+ + YL + P
Sbjct: 63 EKRFQIFKNNLRFIDERNSLNRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTP-- 120
Query: 131 EPRWPSVQYLG--LPASVDWRKEGAVTPVKDQG-QCGSCWAFSAVAAVEGINKLKTGKLV 187
PR V +G +P SVDWRKEGAVTPVK+QG C SCWAF+AV AVE + K+KTG L+
Sbjct: 121 -PRNHYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLI 179
Query: 188 SLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHH 247
SLSEQE+VDC S ++GC GG ++ + +I K G++ E DYPYRG +C ++K K+
Sbjct: 180 SLSEQEVVDC-TTSSSRGCGGGDIQHGYIYIRK-NGISLEKDYPYRGDEGKCDSNK-KNA 236
Query: 248 AVTITGYEAIPARY------------AFQLY------SHGVFDEYCGHQLNHGVTVVGYG 289
VTI G+ +P + A+ LY GVF CG +LNH + +VGYG
Sbjct: 237 IVTIDGHGWVPTQLEEALNRALFCYCAYFLYVDKFFLCQGVFKGKCGTELNHALLLVGYG 296
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
+ YW+ KNS+ WGE GYIR+ R + G
Sbjct: 297 TEKDGDYWIAKNSYSDKWGENGYIRIQRKLSTCKFG 332
>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 217 bits (552), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 108/218 (49%), Positives = 135/218 (61%), Gaps = 24/218 (11%)
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P SVDWR +G + VKDQG CGSCWAFSAVAA+E IN + TG L+SLSEQELVDCD S
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGDLISLSEQELVDCD-KSY 60
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-- 260
NQGC+GG M+ AFEF+ GG+ TE+DYPY+ +ND C + V I YE +P
Sbjct: 61 NQGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNE 120
Query: 261 --------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVK 300
FQ Y G+F CG ++HGV GYG ++G YW+V+
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVR 180
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
NSWG WGE GY+R+ RN SS+ G+CG+ + SYPVK
Sbjct: 181 NSWGAKWGEKGYLRVQRNIASSS-GLCGLATEPSYPVK 217
>gi|402856109|ref|XP_003892642.1| PREDICTED: cathepsin K [Papio anubis]
Length = 348
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y S+ DE RR I+ N++YI N + +++L
Sbjct: 36 YPEEILDTHWELWKKTHRKQYNSKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 94
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D++NEE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 95 NHLGDMTNEEVVQKMTGLKVPASHSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 154
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 155 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 212
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 213 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 271
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 272 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 331
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 332 NKNNA----CGIANLASFP 346
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 172/317 (54%), Gaps = 43/317 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFAD 112
+DP + F W++ S+ Y S +E+ R+ ++ N Q I+ N N + L NKF D
Sbjct: 23 HDP--LTGVFAEWMRDNSKSY-SNEEFVFRWNVWRENQQLIEEHNRSNKTSFLAMNKFGD 79
Query: 113 LSNEEFISTYLGY--------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
L+N EF + G NK E P+ GL A DWR++GAVT VK+QGQCG
Sbjct: 80 LTNAEFNKLFKGLAFDYSFHANKAAAEKAVPAP---GLSADFDWRQKGAVTHVKNQGQCG 136
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCW+FS + EG N LKTG+L SLSEQ L+DC + N GCNGG M+ AFE+I G+
Sbjct: 137 SCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGI 196
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYE---------------------AIPARY-A 262
TE YPY+ CQ + ++T Y AI A + +
Sbjct: 197 DTEASYPYQTAQYTCQYNPANSGG-SLTSYTDVSSGDENALLNAVATEPTSVAIDASHNS 255
Query: 263 FQLYSHGVFDEYC--GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
FQ YS GV+ E QL+HGV VG+G + G+ YWLVKNSWG WG AGYI+MARN
Sbjct: 256 FQFYSGGVYYESACSSTQLDHGVLAVGWGTEDGQDYWLVKNSWGADWGLAGYIKMARNRS 315
Query: 321 SSNIGICGILMQASYPV 337
++ CGI ASYP
Sbjct: 316 NN----CGIATSASYPT 328
>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
Length = 330
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 178/314 (56%), Gaps = 37/314 (11%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNK 109
DP +++ + W K Y ++Y ++E R I+ N++++ N ++ S+ L N
Sbjct: 21 DP-TLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNH 79
Query: 110 FADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
D+++EE +S P R + S LP SVDWR++G VT VK QG CG+C
Sbjct: 80 LGDMTSEEVMSLMSSLRVPSQWQRNITYKSNPNQMLPDSVDWREKGCVTEVKYQGSCGAC 139
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSAV A+E KLKTGKLVSLS Q LVDC N+GCNGG+M +AF++I G+ +
Sbjct: 140 WAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKGIDS 199
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAF 263
E YPY+ + +CQ D +K+ A T + Y +P + +F
Sbjct: 200 EASYPYKAMDQKCQYD-SKYRAATCSKYTELPYGREDVLKEAVANKGPVCVGVDASHSSF 258
Query: 264 QLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
LY GV +D C +NHGV V+GYG+ +GE+YWLVKNSWG+++GE GYIRMARN +
Sbjct: 259 FLYRSGVYYDPACTQNVNHGVLVIGYGDLNGEEYWLVKNSWGSNFGERGYIRMARNKGNH 318
Query: 323 NIGICGILMQASYP 336
CGI SYP
Sbjct: 319 ----CGIASYPSYP 328
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 135/336 (40%), Positives = 186/336 (55%), Gaps = 40/336 (11%)
Query: 34 LLWVLGI-PAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
+ W++G+ P +++ K DP +++ + W K YS++Y E+E R I+ N+++
Sbjct: 1 MKWLVGLLPLCSYAVAQVHK-DP-TLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKF 58
Query: 93 IDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPAS 145
+ N ++ S+ L N D++ EE IS P R + S LP S
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISLMGSLRVPSQWQRNVTYRSNSNQKLPDS 118
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-ENQ 204
VDWR++G VT VK QG CG+CWAFSAV A+E KLKTGKLVSLS Q LVDC N+
Sbjct: 119 VDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNK 178
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------ 258
GCNGG+M AF++I G+ +E YPY+ N +C+ D +K A T + Y +P
Sbjct: 179 GCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYD-SKKRAATCSKYTELPFGSEDA 237
Query: 259 -----------------ARYAFQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVK 300
+ Y+F LY GV+ E C +NHGV VVGYG +G+ YWLVK
Sbjct: 238 LKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKDYWLVK 297
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
NSWG ++G+ GYIRMARNS + CGI SYP
Sbjct: 298 NSWGLNFGDQGYIRMARNSGNH----CGIASYPSYP 329
>gi|344275468|ref|XP_003409534.1| PREDICTED: cathepsin K-like [Loxodonta africana]
Length = 329
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 132/319 (41%), Positives = 180/319 (56%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y S+ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTQWELWKKTYGKQYNSKVDEISRRL-IWEKNLKYISIHNLEASLGAHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P ++ R Y+ P S+D+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPPSDSRNNDTLYIPDWEGRAPDSIDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQDESCMYNPT-GKAAKCRGYREIPVGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 121/308 (39%), Positives = 176/308 (57%), Gaps = 35/308 (11%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDY--INSQNLSFKLTDNKFADLSNEE 117
+ F++W +Y++ Y +++ R I+ SN ++++ NS F + N+FADL E
Sbjct: 22 QEFQDWKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAGE 81
Query: 118 FISTYLGY-NKP--YNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
F + G +P YN + +P +VDW+++GAVTP+K+QGQCGSCW+FS+ +
Sbjct: 82 FGRIFNGLLPRPSSYNSTNIYKPSGVKVPDTVDWKEKGAVTPIKNQGQCGSCWSFSSTGS 141
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
+EG + + TG LVSLSEQ+L+DC N GCNGG M+ +F ++ + G TED+YPY
Sbjct: 142 LEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFRYLKSVAGDETEDNYPYTA 201
Query: 235 KNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGVF 271
+N C+ D + VT Y IP + +FQLY+ GV+
Sbjct: 202 ENGVCRYD-SSLAVVTDKSYVDIPQGDEDSLKDAVANVGPISVAIDASHSSFQLYNSGVY 260
Query: 272 --DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
QL+HGV +GYG + G+ YWLVKNSWGTSWG GYI+M+RN ++ CGI
Sbjct: 261 YASTCSSTQLDHGVLAIGYGTEDGKDYWLVKNSWGTSWGMEGYIKMSRNRNNN----CGI 316
Query: 330 LMQASYPV 337
QASYP
Sbjct: 317 ATQASYPT 324
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/306 (41%), Positives = 172/306 (56%), Gaps = 33/306 (10%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEF 118
+E + W +++ Y + E R+ I+ N + I N + F L N+F D++N EF
Sbjct: 24 DESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFLLKMNQFGDMTNSEF 83
Query: 119 ISTYLGY--NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
+ GY +K N + + P +VDWR EG VTPVKDQGQCGSCWAFS ++E
Sbjct: 84 -KAFNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLE 142
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
G + KTGKLVSLSEQ LVDC N GCNGG M+ AF +I + G+ +E YPY ++
Sbjct: 143 GQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENKGIDSEASYPYTAED 202
Query: 237 DRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGVFDE 273
+C K A T TG+ +P + +FQ YS GV++E
Sbjct: 203 GKC-VFKKPSVAATDTGFVDLPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVYNE 261
Query: 274 -YCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
C +L+HGV VVGYG + G+ YWLVKNSW TSWG+ GYI+M RN+ + CGI
Sbjct: 262 PSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNAKNQ----CGIAT 317
Query: 332 QASYPV 337
+ASYP+
Sbjct: 318 KASYPL 323
>gi|114559412|ref|XP_001171151.1| PREDICTED: cathepsin K isoform 4 [Pan troglodytes]
gi|410221358|gb|JAA07898.1| cathepsin K [Pan troglodytes]
gi|410248298|gb|JAA12116.1| cathepsin K [Pan troglodytes]
gi|410301088|gb|JAA29144.1| cathepsin K [Pan troglodytes]
gi|410351445|gb|JAA42326.1| cathepsin K [Pan troglodytes]
Length = 329
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y ++ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AFE++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFEYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV FDE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSRGVYFDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 330
Score = 216 bits (551), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 119/318 (37%), Positives = 175/318 (55%), Gaps = 37/318 (11%)
Query: 52 KYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
++D S++ ++E W + REY E R I+ N++ I+ N + SF++
Sbjct: 17 RFDESSLDAQWEEWKSTHRREYNGLGEEGIRRAIWEKNMRMIEAHNEEAALGIHSFEMGM 76
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLGLPA----SVDWRKEGAVTPVKDQGQC 163
N D+++EE + G P N+ R ++ +P+ SVD+RK+G VT VK+QG C
Sbjct: 77 NHLGDMTSEEVVEKMTGLQIPMNQERSFTLAMDDMPSKIPKSVDYRKKGMVTSVKNQGAC 136
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFSA A+EG TGKLV LS Q LVDC N GCNGG+M +AF+++ G
Sbjct: 137 GSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVIDNHG 196
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA-----------------------R 260
+ ++ YPY G++++C+ + A + Y+ +P R
Sbjct: 197 IDSDASYPYTGRDEQCRYNPAT-RAANCSSYQFLPEGDENALKQALATIGPISVAIDARR 255
Query: 261 YAFQLYSHGVF-DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
F Y GV+ D C ++NHGV VGYG +G+ YWLVKNSWG+++G+ GYIRMARN+
Sbjct: 256 PRFSFYRSGVYNDPSCTQEVNHGVLAVGYGSLNGQDYWLVKNSWGSTFGDQGYIRMARNT 315
Query: 320 PSSNIGICGILMQASYPV 337
+ CGI + A YPV
Sbjct: 316 GNQ----CGIALYACYPV 329
>gi|74136185|ref|NP_001027984.1| cathepsin K precursor [Macaca mulatta]
gi|47117667|sp|P61276.1|CATK_MACFA RecName: Full=Cathepsin K; Flags: Precursor
gi|47117668|sp|P61277.1|CATK_MACMU RecName: Full=Cathepsin K; Flags: Precursor
gi|3236470|gb|AAC23694.1| cathepsin K [Macaca fascicularis]
gi|4927694|gb|AAD33249.1| cathepsin K [Macaca mulatta]
gi|355558400|gb|EHH15180.1| hypothetical protein EGK_01237 [Macaca mulatta]
gi|355763132|gb|EHH62118.1| hypothetical protein EGM_20317 [Macaca fascicularis]
gi|380809978|gb|AFE76864.1| cathepsin K preproprotein [Macaca mulatta]
gi|383416065|gb|AFH31246.1| cathepsin K preproprotein [Macaca mulatta]
gi|384945478|gb|AFI36344.1| cathepsin K preproprotein [Macaca mulatta]
Length = 329
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y S+ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYNSKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D++NEE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTNEEVVQKMTGLKVPASHSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|66475996|ref|XP_627814.1| cryptopain - cysteine proteinase secreted, possible transmembrane
domain near N-terminus [Cryptosporidium parvum Iowa II]
gi|32399065|emb|CAD98305.1| cryptopain precursor [Cryptosporidium parvum]
gi|46229218|gb|EAK90067.1| cryptopain - cysteine proteinase secreted, possible transmembrane
domain near N-terminus [Cryptosporidium parvum Iowa II]
gi|76160841|gb|ABA40395.1| cryptopain-1 [Cryptosporidium parvum]
Length = 401
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 131/374 (35%), Positives = 196/374 (52%), Gaps = 49/374 (13%)
Query: 11 TNLHLKIAIDMRMMLRNAVLSLFLLWVLGIP-------AGAWSEGYPQKY-DPQSMEER- 61
TN + ++ ++ ++++F++ V+ + + + P Y DP + E R
Sbjct: 25 TNQQREPNKKLKNIIIATLIAIFIVLVVTVSLYITNNTSDKIDDFVPGDYVDPATREYRK 84
Query: 62 -FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS 120
FE + K+Y + Y S +E +RF IY N+ +I NSQ S+ L N+F DLS EEF++
Sbjct: 85 SFEEFKKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMA 144
Query: 121 TYLGYNKPYNEPRW----------PSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+ GY K + S + P S++W + G V P+++Q CGSCWAFS
Sbjct: 145 RFTGYIKDSKDDERVFKSSRVSASESEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFS 204
Query: 171 AVAAVEGINKLKTGK-LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
AVAA+EG +T + L SLSEQ+ VDC + N GC+GG M AF++ K + T DD
Sbjct: 205 AVAALEGATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTNDD 264
Query: 230 YPYRGKNDRCQTDKTKHH-AVTITGYEAIPAR-----------------------YAFQL 265
YPY + C +++ + + Y+ + R FQ
Sbjct: 265 YPYFAEEKTCMDSFCENYIEIPVKAYKYVFPRNINALKTALAKYGPISVAIQADQTPFQF 324
Query: 266 YSHGVFDEYCGHQLNHGVTVVGY--GEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
Y GVFD CG ++NHGV +VGY ED ++YWLV+NSWG +WGE GYI++A +S
Sbjct: 325 YKSGVFDAPCGTKVNHGVVLVGYDMDEDTNKEYWLVRNSWGEAWGEKGYIKLALHSGKK- 383
Query: 324 IGICGILMQASYPV 337
G CGIL++ YPV
Sbjct: 384 -GTCGILVEPVYPV 396
>gi|198432215|ref|XP_002130162.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 331
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 129/316 (40%), Positives = 180/316 (56%), Gaps = 41/316 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
+ +E W Y + Y +E+E +R++ I+ N++Y+ N + ++K+ N+FADL
Sbjct: 20 FQNEWEEWKTLYGKVYRAEEELKRQY-IWLENLKYVTQHNLEADEGKHTYKVDTNQFADL 78
Query: 114 SNEEFISTYLG-YNKPYNEPRWPSVQYLGL------PASVDWRKEGAVTPVKDQGQCGSC 166
SN+E+ +P N+ + ++ ++ + P +VDWRKEG VTPVKDQ QCGSC
Sbjct: 79 SNDEWRELMTSQVTRPTNQMSFCNMTFMTVGDHVIAPKNVDWRKEGYVTPVKDQKQCGSC 138
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS ++EG + KTGKLVSLSEQ LVDC + N GC GG M+ FE+I GG+ T
Sbjct: 139 WAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSMKEGNHGCQGGLMDLGFEYIFDNGGIDT 198
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITG----------------------YEAIPARY-AF 263
E YPY KN+ K + T+TG AI A + +F
Sbjct: 199 ESSYPYMAKNEPQCMYKRSNSGATLTGCVDIKRGSESALMKAVADVGPISVAIDAGHKSF 258
Query: 264 QLYSHGVFDE-YCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
Q+Y GV+ E C +L+HGV VG+G D+GE +WLVKNSWG WG GYI M+RN +
Sbjct: 259 QMYKSGVYYEPSCSSVKLDHGVLAVGFGADNGEDFWLVKNSWGPIWGMEGYIMMSRNRDN 318
Query: 322 SNIGICGILMQASYPV 337
+ CGI QASYP+
Sbjct: 319 N----CGIATQASYPL 330
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 127/306 (41%), Positives = 173/306 (56%), Gaps = 33/306 (10%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEF 118
+E + W +++ Y + E R+ I+ N + I N + F L N+F D++N EF
Sbjct: 24 DESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFILKMNQFGDMTNSEF 83
Query: 119 ISTYLGY--NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVE 176
+ GY +K N + + P +VDWR EG VTPVKDQGQCGSCWAFS ++E
Sbjct: 84 -KAFNGYLSHKHVNGSTFLTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLE 142
Query: 177 GINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN 236
G + KTGKLVSLSEQ LVDC N GC+GG M+ AF +I + G+ +E YPY ++
Sbjct: 143 GQHFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPYTAED 202
Query: 237 DRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGVFDE 273
+C K+ A T TG+ IP + +FQ YS GV++E
Sbjct: 203 GKCVFKKSS-VAATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQFYSSGVYNE 261
Query: 274 -YCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
C +L+HGV VVGYG + G+ YWLVKNSW TSWG+ GYI+M RN+ + CGI
Sbjct: 262 PSCSSTELDHGVLVVGYGTESGKDYWLVKNSWNTSWGDKGYIKMRRNAKNQ----CGIAT 317
Query: 332 QASYPV 337
+ASYP+
Sbjct: 318 KASYPL 323
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 126/303 (41%), Positives = 172/303 (56%), Gaps = 41/303 (13%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLG 124
+ + Y ++ E R I+ N + I+ N++ +S+K+ N F DL EF + G
Sbjct: 34 HGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHEFKALMNG 93
Query: 125 YNKPYNEPR-----WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGIN 179
+ + R +PS LP +VDWR++GAVTPVKDQGQCGSCW+FSA ++EG
Sbjct: 94 FKMSPDTKRNGELYFPSNS--NLPKTVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQV 151
Query: 180 KLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC 239
LKTGKLVSLSEQ LVDC + N GC GG M++AF++++ G+ TE YPY + + C
Sbjct: 152 FLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYVSDNKGIDTEASYPYEARENTC 211
Query: 240 QTDKTKHHAVTITGYEAIPA-----------------------RYAFQLYSHGVFDE--Y 274
+ K K T G+ IPA +FQ YS GV++E
Sbjct: 212 RFKKNKVGG-TDKGHVDIPAGDEKALQNALATVGPISVAIDANHGSFQFYSKGVYNEPNC 270
Query: 275 CGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQAS 334
+ L+HGV VGYG ++G+ YWLVKNSWG SWGE GYI++ARN + CGI AS
Sbjct: 271 SSYDLDHGVLAVGYGTENGQDYWLVKNSWGPSWGENGYIKIARNHSNH----CGIASMAS 326
Query: 335 YPV 337
YP+
Sbjct: 327 YPL 329
>gi|344271892|ref|XP_003407771.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 334
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 138/343 (40%), Positives = 190/343 (55%), Gaps = 49/343 (14%)
Query: 31 SLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGS-EDEWQRRFGIYSS 88
SLFL + LGI + A QK+D +S++E++ W Y + Y + E++W+R ++
Sbjct: 4 SLFLTALCLGIASAA------QKHD-ESLDEQWYQWKSLYKKPYAANEEDWRR--AVWEK 54
Query: 89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGY-NKPYNEPRWPSVQYLG-L 142
N++ I+ N + F +T N F D++NEEF G+ N+ + + G +
Sbjct: 55 NMKMIERHNQEYSQGKHGFTMTMNAFGDMTNEEFRQVMNGFQNQKRIQGKLLYEPVFGHI 114
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P SVDW ++G VTPVKDQGQCGSCWAFSA A+EG KTGKLVSLSEQ LVDC
Sbjct: 115 PKSVDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRREG 174
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-- 260
N+GCNGG M+ AF++I GG+ +E+ YPY + + K+ A TG+ IP +
Sbjct: 175 NEGCNGGLMDNAFQYIKDNGGLDSEESYPYTAMDKQDCRYNPKYSAANDTGFVDIPPQEK 234
Query: 261 --------------------YAFQLYSHGV-FDEYCGHQ-LNHGVTVVGYG----EDHGE 294
+FQ Y G+ +D C + LNHGV VVGYG +
Sbjct: 235 ALMKAVATVGPISVAVDAGHESFQFYKSGIYYDSNCSSKDLNHGVLVVGYGFEGIDSANN 294
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+YWLVKNSWGT WG GYI+MA++ + CGI ASYP
Sbjct: 295 RYWLVKNSWGTGWGTDGYIKMAKDRNNH----CGIATAASYPT 333
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 130/319 (40%), Positives = 173/319 (54%), Gaps = 46/319 (14%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
E + + ++ + Y + E + R I++ N I N + +SFKL NK+ADL +
Sbjct: 27 EEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 86
Query: 116 EEFISTYLGYN-----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
EF G+N + + + S ++ LP SVDWR +GAVT VKDQG CG
Sbjct: 87 HEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCG 146
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS+ A+EG + K+G LVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 147 SCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 206
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARY 261
TE YPY +D C +K A T G+ IP +
Sbjct: 207 DTEKSYPYEAIDDSCHFNKGTIGA-TDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASHE 265
Query: 262 AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQ YS GV++E C Q L+HGV VVG+G D GE YWLVKNSWGT+WG+ G+I+M RN
Sbjct: 266 SFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGEDYWLVKNSWGTTWGDKGFIKMLRN 325
Query: 319 SPSSNIGICGILMQASYPV 337
+ CGI +SYP+
Sbjct: 326 KENQ----CGIASASSYPL 340
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 216 bits (550), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 130/324 (40%), Positives = 184/324 (56%), Gaps = 44/324 (13%)
Query: 52 KYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
+ DP+ ++ ++ W ++++Y +E RR ++ N++ I+ N + S+KL
Sbjct: 1 RADPE-LDGHWQLWKSWHNKDYHEREESWRRV-VWEKNLKMIELHNLDHTLGKHSYKLGM 58
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N+F D++ EEF GY +E ++ Q+L P SVDWR++G VTPVKDQGQ
Sbjct: 59 NQFGDMTTEEFRQLMNGYAHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQ 118
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS A+EG + KTGKLVSLSEQ LVDC NQGCNGG M++AF+++ G
Sbjct: 119 CGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNG 178
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY--------------------- 261
G+ +E+ YPY K+D K +++A TG+ IP +
Sbjct: 179 GIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAG 238
Query: 262 --AFQLYSHGVFDE--YCGHQLNHGVTVVGY---GED-HGEKYWLVKNSWGTSWGEAGYI 313
+FQ Y G++ E L+HGV VVGY GED G+KYW+VKNSWG WG+ GYI
Sbjct: 239 HSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYI 298
Query: 314 RMARNSPSSNIGICGILMQASYPV 337
MA++ + CGI ASYP+
Sbjct: 299 YMAKDRKNH----CGIATAASYPL 318
>gi|403302730|ref|XP_003942006.1| PREDICTED: cathepsin S isoform 1 [Saimiri boliviensis boliviensis]
Length = 339
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 130/342 (38%), Positives = 186/342 (54%), Gaps = 48/342 (14%)
Query: 38 LGIPAGA------------WSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
+G PAG+ S Q + +++ + W K Y ++Y ++E R I
Sbjct: 1 MGAPAGSITMKQLVCVLFVCSSAVTQLHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLI 60
Query: 86 YSSNVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQ 138
+ N++++ N ++ S+ L N D+++EE +S P R + S
Sbjct: 61 WEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVPNQWQRNITYKSNP 120
Query: 139 YLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
LP SVDWR++G VT VK QG CG+CWAFSAV A+E KLKTGKLVSLS Q LVDC
Sbjct: 121 NQMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCS 180
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
N+GCNGG+M +AF++I G+ +E YPY+ + +CQ D +K+ A T + Y +P
Sbjct: 181 EKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKCQYD-SKYRAATCSKYTELP 239
Query: 259 -----------------------ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGE 294
+ +F LY GV +D C ++NHGV V+GYG+ +G+
Sbjct: 240 YGREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYGDLNGK 299
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
+YWLVKNSWG+++GE GYIRMARN + CGI SYP
Sbjct: 300 EYWLVKNSWGSNFGEQGYIRMARNKGNH----CGIASYPSYP 337
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 130/304 (42%), Positives = 172/304 (56%), Gaps = 39/304 (12%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFADLSNEEFISTYLG 124
+ +EY S+ E + R IY N + N S+++ NKF DL + EF S G
Sbjct: 38 HKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNG 97
Query: 125 Y-NKPYNEPRWPSV------QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEG 177
Y +K N R S + +P SVDWR++GA+TPVKDQGQCGSCWAFS+ A+EG
Sbjct: 98 YQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEG 157
Query: 178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
KTGKL+SLSEQ L+DC N+GCNGG M++AF++I G+ TE+ YPY ++D
Sbjct: 158 QTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDD 217
Query: 238 RC--------------------QTDKTKHHAVTITGYE-AIPARY-AFQLYSHGVFDE-Y 274
C + DK K T+ AI A + +FQ YS GV+ E
Sbjct: 218 VCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPS 277
Query: 275 C-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
C L+HGV VVGYG D+G+ YWLVKNSW WG+ GYI++ARN + CG+ A
Sbjct: 278 CDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARNRKNH----CGVATAA 333
Query: 334 SYPV 337
SYP+
Sbjct: 334 SYPL 337
>gi|62510453|sp|Q8HY82.1|CATS_SAIBB RecName: Full=Cathepsin S; Flags: Precursor
gi|27497536|gb|AAO13008.1| cathepsin S preproprotein [Saimiri boliviensis]
Length = 330
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 126/322 (39%), Positives = 180/322 (55%), Gaps = 36/322 (11%)
Query: 46 SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL---- 101
S Q + +++ + W K Y ++Y ++E R I+ N++++ N ++
Sbjct: 12 SSAVTQLHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMH 71
Query: 102 SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPASVDWRKEGAVTPVK 158
S+ L N D+++EE +S P R + S LP SVDWR++G VT VK
Sbjct: 72 SYDLGMNHLGDMTSEEVMSLMSSLRVPNQWQRNITYKSNPNQMLPDSVDWREKGCVTEVK 131
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
QG CG+CWAFSAV A+E KLKTGKLVSLS Q LVDC N+GCNGG+M +AF++I
Sbjct: 132 YQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYI 191
Query: 219 TKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-------------------- 258
G+ +E YPY+ + +CQ D +K+ A T + Y +P
Sbjct: 192 IDNKGIDSEASYPYKATDQKCQYD-SKYRAATCSKYTELPYGREDVLKEAVANKGPVCVG 250
Query: 259 ---ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIR 314
+ +F LY GV +D C ++NHGV V+GYG+ +G++YWLVKNSWG+++GE GYIR
Sbjct: 251 VDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYGDLNGKEYWLVKNSWGSNFGEQGYIR 310
Query: 315 MARNSPSSNIGICGILMQASYP 336
MARN + CGI SYP
Sbjct: 311 MARNKGNH----CGIASYPSYP 328
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/323 (40%), Positives = 180/323 (55%), Gaps = 37/323 (11%)
Query: 46 SEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL---- 101
S Q + +++ ++ W K Y ++Y ++E R I+ N++ + N ++
Sbjct: 12 SSAMAQVHRDPTLDHHWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNLEHSMGMH 71
Query: 102 SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPASVDWRKEGAVTPVK 158
S++L N D+++EE IS+ P PR + S LP S+DWR++G VT VK
Sbjct: 72 SYELGMNHLGDMTSEEVISSMSSLRVPSQWPRNVTYKSSPNQKLPDSLDWREKGCVTEVK 131
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD-VNSENQGCNGGYMEKAFEF 217
QG CGSCWAFSAV A+E KLKTGKLVSLS Q LVDC V N+GCNGG+M +AF++
Sbjct: 132 YQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAFQY 191
Query: 218 ITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------------------- 258
I G+ +E YPY+ + RCQ D K+ A T + Y +P
Sbjct: 192 IIDNNGIDSEASYPYKAMDGRCQYD-VKNRAATCSRYIELPFGSEEALKEAVANKGPVSV 250
Query: 259 ----ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYI 313
+ +F LY GV +D C +NHGV VVGYG +G+ YWLVKNSWG ++G+ GYI
Sbjct: 251 GIDAKQTSFFLYKTGVYYDPSCTQNVNHGVLVVGYGSLNGKDYWLVKNSWGLNFGDQGYI 310
Query: 314 RMARNSPSSNIGICGILMQASYP 336
RMARNS + CGI SYP
Sbjct: 311 RMARNSGNH----CGIANFPSYP 329
>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
Length = 1039
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 106/197 (53%), Positives = 132/197 (67%), Gaps = 24/197 (12%)
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFS +AAVEGIN++ TG L+SLSEQELVDCD S NQGCNGG M+ AFEFI GG
Sbjct: 713 GSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDT-SYNQGCNGGLMDYAFEFIINNGG 771
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------Y 261
+ TE DYPY+G + RC ++ VTI YE +PA
Sbjct: 772 IDTEKDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGT 831
Query: 262 AFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQLYS G+F CG L+HGVTVVGYG ++G+ YW++KNSWG+SWGE+GY+RM RN +
Sbjct: 832 TFQLYSSGIFTGSCGTALDHGVTVVGYGTENGKDYWIMKNSWGSSWGESGYVRMERNIKA 891
Query: 322 SNIGICGILMQASYPVK 338
S+ G CGI ++ SYP+K
Sbjct: 892 SS-GKCGIAVEPSYPLK 907
>gi|291398027|ref|XP_002715626.1| PREDICTED: cathepsin S [Oryctolagus cuniculus]
Length = 331
Score = 216 bits (549), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/338 (38%), Positives = 181/338 (53%), Gaps = 45/338 (13%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
+L+W L + S Q + +++ + W K Y ++Y ++E R I+ N+++
Sbjct: 3 WLVWALLVC----SSTVAQLHRDPTLDHHWHLWKKAYGKQYKEKNEEAARRLIWEKNLKF 58
Query: 93 IDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQY-----LGLP 143
+ N ++ S+ + N AD+++EE +S P+ PR +V Y LP
Sbjct: 59 VTLHNLEHSMGMHSYDVGMNHLADMTSEEVVSLMSSLRIPHQWPR--NVTYKLNPNQKLP 116
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-E 202
SVDWR+ G VT VK QG CG+CWAFSAV A+E KLKTG LVSLS Q LVDC
Sbjct: 117 DSVDWRERGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTTKYG 176
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---- 258
N+GCNGG+M +AF++I G+ +E YPY+ + +C D +KH A T + Y +P
Sbjct: 177 NKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDQKCHYD-SKHRAATCSKYTELPFGSE 235
Query: 259 -------------------ARYAFQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWL 298
+ +F LY GV+ E C +NHGV VGYG G+ YWL
Sbjct: 236 EALKEAVANKGPVSVAIDASHSSFFLYRSGVYYEPSCTQNVNHGVLAVGYGNLKGKDYWL 295
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
VKNSWG +GE GYIRMARNS + CGI SYP
Sbjct: 296 VKNSWGIHFGEQGYIRMARNSKNH----CGIANYPSYP 329
>gi|395729888|ref|XP_002810309.2| PREDICTED: cathepsin K [Pongo abelii]
Length = 343
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 190/342 (55%), Gaps = 51/342 (14%)
Query: 33 FLLW---VLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSS 88
F++W VL +P +++ YP+ + ++ +E W K + ++Y ++ DE RR I+
Sbjct: 13 FVMWGLKVLLLPVVSFAL-YPE----EILDTHWELWKKTHRKQYNNKVDEISRRL-IWEK 66
Query: 89 NVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG--- 141
N++YI N + +++L N D+++EE + G P + R Y+
Sbjct: 67 NLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPDWE 126
Query: 142 --LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDV 199
P SVD+RK+G VTPVK+QGQCGSCWAFS+V A+EG K KTGKL++LS Q LVDC
Sbjct: 127 GRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 185
Query: 200 NSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP- 258
SEN GC GGYM AF+++ K G+ +ED YPY G+ + C + T A GY IP
Sbjct: 186 -SENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPE 243
Query: 259 ----------ARY------------AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGE 294
AR +FQ YS GV +DE C LNH V VGYG G
Sbjct: 244 GNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGN 303
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
K+W++KNSWG +WG GYI MARN ++ CGI AS+P
Sbjct: 304 KHWIIKNSWGENWGNKGYILMARNKNNA----CGIANLASFP 341
>gi|47523662|ref|NP_999467.1| cathepsin K precursor [Sus scrofa]
gi|15213940|sp|Q9GLE3.1|CATK_PIG RecName: Full=Cathepsin K; Flags: Precursor
gi|10048286|gb|AAG12340.1|AF292030_1 cathepsin K precursor [Sus scrofa]
Length = 330
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 180/319 (56%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 18 YPEEILDTQWELWKKTYRKQYNSKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 76
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P S+D+RK+G VTPVK+QGQ
Sbjct: 77 NHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQ 136
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 137 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 194
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 195 GIDSEDAYPYVGQDENCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 253
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G+K+W++KNSWG +WG GYI MAR
Sbjct: 254 LTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGKKHWIIKNSWGENWGNKGYILMAR 313
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 314 NKNNA----CGIANLASFP 328
>gi|149751227|ref|XP_001490649.1| PREDICTED: cathepsin K-like [Equus caballus]
Length = 329
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 179/319 (56%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 17 YPEEILDTQWELWKKTYRKQYNSKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P S+D+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPPSHTRSNDTLYIPDWEGRAPDSIDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQDESCMYNPTG-KAAKCRGYREIPQGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSRGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANMASFP 327
>gi|397492864|ref|XP_003817340.1| PREDICTED: cathepsin K [Pan paniscus]
Length = 343
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 137/342 (40%), Positives = 190/342 (55%), Gaps = 51/342 (14%)
Query: 33 FLLW---VLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSS 88
F++W VL +P +++ YP+ + ++ +E W K + ++Y ++ DE RR I+
Sbjct: 13 FVMWGLKVLLLPVVSFAL-YPE----EILDTHWELWKKTHRKQYNNKVDEISRRL-IWEK 66
Query: 89 NVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG--- 141
N++YI N + +++L N D+++EE + G P + R Y+
Sbjct: 67 NLKYISIHNLEASLGVHTYELAMNHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPDWE 126
Query: 142 --LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDV 199
P SVD+RK+G VTPVK+QGQCGSCWAFS+V A+EG K KTGKL++LS Q LVDC
Sbjct: 127 GRAPDSVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV- 185
Query: 200 NSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP- 258
SEN GC GGYM AF+++ K G+ +ED YPY G+ + C + T A GY IP
Sbjct: 186 -SENDGCGGGYMTNAFQYVQKNRGIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPE 243
Query: 259 ----------ARY------------AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGE 294
AR +FQ YS GV +DE C LNH V VGYG G
Sbjct: 244 GNEKALKRAVARVGPVSVAIDASLTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGN 303
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
K+W++KNSWG +WG GYI MARN ++ CGI AS+P
Sbjct: 304 KHWIIKNSWGENWGNKGYILMARNKNNA----CGIANLASFP 341
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/319 (40%), Positives = 172/319 (53%), Gaps = 46/319 (14%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
E + + ++ + Y + E + R I++ N I N + +SFKL NK+ADL +
Sbjct: 27 EEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADLLH 86
Query: 116 EEFISTYLGYN-----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
EF G+N + + S ++ LP SVDWR +GAVT VKDQG CG
Sbjct: 87 HEFRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCG 146
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS+ A+EG + K+G LVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 147 SCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 206
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARY 261
TE YPY +D C +K A T G+ IP +
Sbjct: 207 DTEKSYPYEAIDDSCHFNKGAIGA-TDRGFTDIPQGDEKKMAEAVATVGPVAVAIDASHE 265
Query: 262 AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQ YS GV++E C Q L+HGV VVGYG D G+ YWLVKNSWGT+WG+ G+I+M RN
Sbjct: 266 SFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTDESGDDYWLVKNSWGTTWGDKGFIKMLRN 325
Query: 319 SPSSNIGICGILMQASYPV 337
+ CGI +SYP+
Sbjct: 326 KDNQ----CGIASASSYPL 340
>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 343
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 132/360 (36%), Positives = 187/360 (51%), Gaps = 65/360 (18%)
Query: 5 LFIAIYTNLHLKIAIDMRMMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEER--- 61
+FI +T L + A+D+ ++ ++ + K +S EE
Sbjct: 11 IFILFFTVLAVSSALDLSII-------------------SYDRSHADKSGWRSDEEVMSI 51
Query: 62 FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIST 121
+E L ++ + Y + DE + RF I N+++++ N+ N ++K+ N+FAD S
Sbjct: 52 YEEXLAKHGKVYNAIDEMEERFQISKENLKFVEQHNAGNRTYKVGLNRFADRSRM----- 106
Query: 122 YLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKL 181
+P R+ L SVDWRKEGAV VK Q +C SC F+ +AAVEGINK+
Sbjct: 107 ---MTRP--SSRYAPRVSDNLSESVDWRKEGAVVRVKTQSECESCRTFTVIAAVEGINKI 161
Query: 182 KTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQT 241
TG L +LS DCD + N GC+GG + A EFI GG+ TE+DYP++G C
Sbjct: 162 VTGNLTALS-----DCD-RTVNAGCSGGLADYALEFIINNGGIDTEEDYPFQGAVGIC-- 213
Query: 242 DKTKHHAVTITGYEAIPA-----------------------RYAFQLYSHGVFDEYCGHQ 278
D+ K +AV GYE +PA FQLY G+F CG
Sbjct: 214 DQYKINAVD--GYERVPAYDELALKKAVANQPVSVAYIEAYGKEFQLYESGIFTGKCGTS 271
Query: 279 LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
++HGVT VGYG ++G YW+VKNSWG +WGEAGY+RM RN+ G CGI + YP+K
Sbjct: 272 IDHGVTAVGYGTENGIDYWIVKNSWGENWGEAGYVRMERNTAEDTAGKCGIAILTLYPIK 331
>gi|94421564|gb|ABF18889.1| cathepsin-L [Lygus lineolaris]
Length = 314
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 175/320 (54%), Gaps = 42/320 (13%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGS-EDEWQRRFGIYS 87
VL+ L +G+ P D Q+ +E++ +Y + Y S E+E RR +
Sbjct: 4 VLAFACLVAVGLA-------LPLSDDNQA---EWESYKAKYGKTYESNENEAARRTIYFM 53
Query: 88 SNVQYIDY---INSQNLSFKLTDNKFADLSNEEFISTYLGYNK--PYNEPRWPSVQYLGL 142
+ + +++ +S+KL N FAD+ N EF GY + P N + L
Sbjct: 54 AKEKVMEHNARFEQGLVSYKLGLNSFADMHNGEFRKMMNGYRRGTPRNSVVVHVESNITL 113
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
PASVDWR +GAVTP+K+QGQCGSCWAFS ++EG + LK GKLVSLSEQELVDC
Sbjct: 114 PASVDWRTKGAVTPIKNQGQCGSCWAFSTTGSLEGQHALKKGKLVSLSEQELVDCSAAEG 173
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI----- 257
N GC+GG M+ AF +I K G+ TE YPY G++ C K+ A T+TG+ +
Sbjct: 174 NDGCDGGLMDDAFTYIKKNNGIDTEQSYPYTGEDGTCSFKKSD-VAATVTGFVDVTSGSE 232
Query: 258 ------------------PARYAFQLYSHGVFD--EYCGHQLNHGVTVVGYGEDHGEKYW 297
+ + FQLY GV+D + +L+HGV VVGYG D G YW
Sbjct: 233 SGLQDASATIGPISVAIDASSWDFQLYESGVYDVSDCSTTELDHGVLVVGYGTDDGTAYW 292
Query: 298 LVKNSWGTSWGEAGYIRMAR 317
LVKNSWGT WG GYI+M+R
Sbjct: 293 LVKNSWGTDWGHHGYIQMSR 312
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 116/261 (44%), Positives = 153/261 (58%), Gaps = 35/261 (13%)
Query: 53 YDPQSMEER---FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKL 105
Y +S EE + W+ + R Y + E +RRF ++ N++Y+D N+ SF+L
Sbjct: 34 YGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRL 93
Query: 106 TDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQ 160
N+FADL+N+E+ +TYLG R +YL LP SVDWR +GAV VKDQ
Sbjct: 94 GLNRFADLTNDEYRATYLGVRSRPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEVKDQ 153
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G CGSCWAFS +AAVEGIN++ TG ++SLSEQELVDCD S NQGCNGG M+ AFEFI
Sbjct: 154 GSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDT-SYNQGCNGGLMDYAFEFIIN 212
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------------- 260
GG+ TE+DYPY+G + RC ++ VTI YE +PA
Sbjct: 213 NGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPISVAIEA 272
Query: 261 --YAFQLYSHGVFDEYCGHQL 279
AFQLY+ G+F CG+ +
Sbjct: 273 GGRAFQLYNSGIFTGTCGNSV 293
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/322 (40%), Positives = 182/322 (56%), Gaps = 44/322 (13%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNK 109
DP+ ++ ++ W + ++Y +E RR ++ N++ I+ N + S+KL N+
Sbjct: 127 DPE-LDGHWQLWKSWHRKDYHEREEGWRRV-VWEKNLKMIEIHNLDHALGKHSYKLGMNQ 184
Query: 110 FADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCG 164
F D++ EEF GY +E ++ Q+L P SVDWR++G VTPVKDQGQCG
Sbjct: 185 FGDMTTEEFRQLMNGYVHKKSERKYRGSQFLEPNFLEAPRSVDWREKGYVTPVKDQGQCG 244
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS A+EG + KTGKLVSLSEQ LVDC NQGCNGG M++AF+++ GG+
Sbjct: 245 SCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGI 304
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY----------------------- 261
+E+ YPY K+D K +++A TG+ IP +
Sbjct: 305 DSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHS 364
Query: 262 AFQLYSHGVFDE--YCGHQLNHGVTVVGY---GED-HGEKYWLVKNSWGTSWGEAGYIRM 315
+FQ Y G++ E L+HGV VVGY GED G+KYW+VKNSWG WG+ GYI M
Sbjct: 365 SFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYM 424
Query: 316 ARNSPSSNIGICGILMQASYPV 337
A++ + CGI ASYP+
Sbjct: 425 AKDRKNH----CGIATAASYPL 442
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 136/352 (38%), Positives = 194/352 (55%), Gaps = 55/352 (15%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
ML AVL++ L L P+ DPQ ++E ++ W ++++Y ++E RR
Sbjct: 1 MLPVAVLAVCLSAALSAPS----------LDPQ-LDEHWDLWKSWHTKKYHEKEEGWRRM 49
Query: 84 GIYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKP----YNEPRWP 135
++ N++ I+ N ++ +++L N F D+++EEF GY + + +
Sbjct: 50 -VWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYKRKSERKFKGSLFM 108
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
+L P SVDWR G VTPVKDQGQCGSCWAFS A+EG + KTGKLVSLSEQ LV
Sbjct: 109 EPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLV 168
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR-CQTDKTKHHAVTITGY 254
DC N+GCNGG M++AF++I G+ +ED YPY G +D+ C D K+++ TG+
Sbjct: 169 DCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYD-PKYNSANDTGF 227
Query: 255 EAIPA-----------------------RYAFQLYSHGVF--DEYCGHQLNHGVTVVGY- 288
IP+ +FQ Y G++ E +L+HGV VVGY
Sbjct: 228 IDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYG 287
Query: 289 --GED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GED G+KYW+VKNSW WG+ GYI MA++ + CGI ASYP+
Sbjct: 288 FEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNH----CGIATAASYPL 335
>gi|23344734|gb|AAN28680.1| cathepsin L [Theromyzon tessulatum]
Length = 351
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/306 (41%), Positives = 173/306 (56%), Gaps = 39/306 (12%)
Query: 68 QYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYL 123
++++ Y +E R I+++N ++I N+ + SF + N+FAD++ EF
Sbjct: 47 EHNKVYVGIEEESLRKTIFATNYKFIKDHNALHATGEKSFTVGVNEFADMTVHEFAQMMN 106
Query: 124 GYNKPYNEPRWPSVQYLG------LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEG 177
G KP + R YL LP VDWR +G V+ VK+QG CGSCWAFS ++EG
Sbjct: 107 GL-KP-DSTRVSGSTYLSPNIDAPLPVEVDWRTKGLVSEVKNQGSCGSCWAFSTTGSLEG 164
Query: 178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
+ KTG +V LSEQ LVDC + N GCNGG M AF++I G+ TE+ YPY G++
Sbjct: 165 QHMRKTGTMVDLSEQNLVDCSTSYGNDGCNGGLMTNAFKYIKDNKGIDTEEAYPYAGRDG 224
Query: 238 RCQTDKTKHHAVTITGYEAIPA-----------------------RYAFQLYSHGVFDE- 273
C+ K K A T+TG+ IPA +F LY GV+DE
Sbjct: 225 DCKFKKNKVGA-TVTGFVEIPAGNEKKLQEALATVGPVSVAIDANHQSFMLYKSGVYDEP 283
Query: 274 YC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS-PSSNIGICGILM 331
C QL+HGV VGYG HG+ Y++VKNSWGT+WGE GYIR + + P + GICGIL+
Sbjct: 284 ECDSAQLDHGVLAVGYGSIHGKDYYIVKNSWGTTWGEQGYIRFSTTAVPDAIGGICGILL 343
Query: 332 QASYPV 337
ASYPV
Sbjct: 344 DASYPV 349
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 127/302 (42%), Positives = 169/302 (55%), Gaps = 39/302 (12%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLG 124
+ + Y ++ E R ++ N + ID N++ S+K+ N DL EF + G
Sbjct: 20 HGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMVHEFKALMNG 79
Query: 125 YNKPYNEPR-----WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGIN 179
+ K N R PS + LP SVDWR+ GAVTPVKDQG CGSCW+FSA ++EG
Sbjct: 80 FKKTPNAERNGKIYVPSNE--NLPKSVDWRQRGAVTPVKDQGHCGSCWSFSATGSLEGQL 137
Query: 180 KLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC 239
LKTG+LVSLSEQ LVDC N GC GG M +AF+++ G+ TE YPY + + C
Sbjct: 138 FLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGIDTEASYPYEARENNC 197
Query: 240 Q-------------------TDKTKHHAVTITGYEAI---PARYAFQLYSHGVFDE-YCG 276
+ ++K AV G ++ + +FQ YS GV+ E YC
Sbjct: 198 RFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESFQFYSEGVYKEQYCS 257
Query: 277 -HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASY 335
QL+HGV VGYG ++G+ YWLVKNSWG SWGE+GYI++ARN + CGI ASY
Sbjct: 258 PSQLDHGVLTVGYGTENGQDYWLVKNSWGPSWGESGYIKIARNHKNH----CGIASMASY 313
Query: 336 PV 337
PV
Sbjct: 314 PV 315
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 119/290 (41%), Positives = 170/290 (58%), Gaps = 34/290 (11%)
Query: 78 EWQRRFGIYSSNVQYID-YINSQNLSFKLTDNKFADLSNEEFISTYLGY--NKPYNEPRW 134
E ++R I+ +N++YI+ + N+ N S+KL N+++DL+++EF++++ G +K + +
Sbjct: 78 ELEKRKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQLSSSKM 137
Query: 135 PSVQYL-----GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSL 189
S +P + DWR++GAVT VKDQG CG CWAFS VAAVEG K+ TG+L+SL
Sbjct: 138 RSAAVPFNLNDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGELISL 197
Query: 190 SEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAV 249
SEQ+LVDCD N GC+GG M+ AF++I + G+ +E DYPY+ + CQ +
Sbjct: 198 SEQQLVDCD--ERNSGCHGGNMDSAFKYIIQ-KGIVSEADYPYQEGSQTCQLNDQMKFEA 254
Query: 250 TITGYEAIPAR---------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGY 288
IT + +PA FQ Y V+ CG +NH VT VGY
Sbjct: 255 QITNFIDVPANDEQQLLQAVAQQPVSVGIEVGDEFQHYMGDVYSGTCGQSMNHAVTAVGY 314
Query: 289 G-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
G + G KYWL+KNSWG WGE GY+++ R S G CGI ASYP+
Sbjct: 315 GVSEDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPG-GQCGIAAHASYPI 363
>gi|395514298|ref|XP_003761356.1| PREDICTED: cathepsin L1-like [Sarcophilus harrisii]
Length = 365
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 140/375 (37%), Positives = 191/375 (50%), Gaps = 85/375 (22%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
++SL L V IP K D ++++ ++ W Q+ R+YG ++W+R I+
Sbjct: 7 LVSLCLGLVAAIP----------KLD-RTLDAQWYQWKAQHRRDYGENEDWRR--AIWEK 53
Query: 89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYN----------KPYNEP-- 132
N++ I+ N + SF++ NKF D++NEEF G++ + + EP
Sbjct: 54 NLRSIEMHNLEYSAGKHSFQMEMNKFGDMTNEEFRQVMNGFSTHRVQRRTKGRLFREPLL 113
Query: 133 -------RWPSVQYLG------------------LPASVDWRKEGAVTPVKDQGQCGSCW 167
W Y+ +P SVDWR +G VTPVK+QGQCGSCW
Sbjct: 114 VQIPKSVDWRDKGYVTPVKNQLVRRLFREPLLVQIPKSVDWRDKGYVTPVKNQGQCGSCW 173
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSA ++EG KTGKLVSLSEQ LVDC N GC GG M+ AFE++ + GG+ TE
Sbjct: 174 AFSATGSLEGQWFRKTGKLVSLSEQNLVDCSTAQGNSGCQGGLMDNAFEYVKENGGIDTE 233
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARY-----------------------AFQ 264
+ YPY +D CQ K ++ ITGY IP+R +FQ
Sbjct: 234 ESYPYIAADDTCQY-KPQYSGANITGYVDIPSRMEKALEKAVATVGPISVAIDAGHSSFQ 292
Query: 265 LYSHGVF--DEYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
Y GV+ E L+HGV VGYG + KYW+VKNSWG WG++GYI MAR+ +
Sbjct: 293 FYRSGVYYEPECSSEDLDHGVLAVGYGVQGKNGKYWIVKNSWGEEWGDSGYILMARDRNN 352
Query: 322 SNIGICGILMQASYP 336
CGI ASYP
Sbjct: 353 H----CGIATAASYP 363
>gi|417409774|gb|JAA51378.1| Putative cathepsin k, partial [Desmodus rotundus]
Length = 331
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 182/320 (56%), Gaps = 43/320 (13%)
Query: 52 KYDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLT 106
+Y + ++ ++E W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 18 QYPEEILDTQWEQWKKTYRKQYNSKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELA 76
Query: 107 DNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQG 161
N D+++EE + G P + R +Y+ +P S+D+RK+G VTPVK+QG
Sbjct: 77 MNHLGDMTSEEVVQKMTGLKVPPSHSRSNDTRYVPDWEGKVPDSIDYRKKGYVTPVKNQG 136
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
QCGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF ++ K
Sbjct: 137 QCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFHYVQKN 194
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY--------- 261
G+ +ED YPY G+++ C + T A GY+ IP AR
Sbjct: 195 QGIDSEDAYPYVGQDESCMYNPTG-KAAKCRGYKEIPEGNEKALKRAVARVGPISVAIDA 253
Query: 262 ---AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMA 316
+FQ YS GV +D+ C LNH V VGYG +K+W++KNSWG SWG GYI MA
Sbjct: 254 SLTSFQFYSKGVYYDKNCNSDNLNHAVLAVGYGIQKRKKHWIIKNSWGESWGNKGYILMA 313
Query: 317 RNSPSSNIGICGILMQASYP 336
RN ++ CGI AS+P
Sbjct: 314 RNKNNA----CGIANLASFP 329
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 173/319 (54%), Gaps = 46/319 (14%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
E + + ++ + Y + E + R I++ N I N + +SFKL NK+ADL +
Sbjct: 27 EEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 86
Query: 116 EEFISTYLGYN-----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
EF G+N + + + S ++ LP SVDWR +GAVT VKDQG CG
Sbjct: 87 HEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGHCG 146
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS+ A+EG + K+G LVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 147 SCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 206
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARY 261
TE YPY +D C +K A T G+ IP +
Sbjct: 207 DTEKSYPYEAIDDSCHFNKGTIGA-TDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHE 265
Query: 262 AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQ YS GV++E C Q L+HGV VVG+G D G+ YWLVKNSWGT+WG+ G+I+M RN
Sbjct: 266 SFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFIKMLRN 325
Query: 319 SPSSNIGICGILMQASYPV 337
+ CGI +SYP+
Sbjct: 326 KENQ----CGIASASSYPL 340
>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
Length = 337
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 129/344 (37%), Positives = 182/344 (52%), Gaps = 46/344 (13%)
Query: 26 RNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGI 85
R +L+ LL L + A A + ++ +E W K + + Y +E E RR +
Sbjct: 7 RGLMLASLLLVSLCVEAAAMLD--------VRLDVHWELWKKSHGKTYPNEVEDVRRREL 58
Query: 86 YSSNVQYIDYIN---SQNL-SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG 141
+ N+ I N S L ++ L+ N DL+ EE + +Y P + R P+ ++G
Sbjct: 59 WERNLMLITKHNLEASMGLQTYDLSMNHMGDLTTEEIMQSYATLTPPADIQRAPA-PFVG 117
Query: 142 ----LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+P SVDWR +G VT VK QG CGSCWAFSA A+EG TGKLV LS Q LVDC
Sbjct: 118 SGADVPVSVDWRLQGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDC 177
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
+ N+GCNGG+M++AF+++ G+ +E YPYRG+ +C + + + A + Y +
Sbjct: 178 SLKYGNKGCNGGFMDRAFQYVIDNKGIDSEASYPYRGQLQQCSYNPS-YRAANCSRYSFL 236
Query: 258 P-----------------------ARYAFQLYSHGVF-DEYCGHQLNHGVTVVGYGEDHG 293
P R F Y GV+ D C ++NHGV VGYG + G
Sbjct: 237 PEGDEGALKNALATIGPISVAIDATRPTFAFYRSGVYNDPTCTQRVNHGVLAVGYGTESG 296
Query: 294 EKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+ YWLVKNSWGTS+G+ GYIRM+RN CGI + SYP+
Sbjct: 297 QDYWLVKNSWGTSFGDKGYIRMSRNKNDQ----CGIALYCSYPI 336
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 130/314 (41%), Positives = 179/314 (57%), Gaps = 39/314 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
+ ++E + + + Y S+ E R+ I++ N I N++ +S+KL N+F DL
Sbjct: 3 LRTQWEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDL 62
Query: 114 SNEEFISTYLGYN---KPYNEPRWP--SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
EF + GY+ K P +V LP +VDWRK+GAVTPVKDQGQCGSCWA
Sbjct: 63 LPHEFAKMFNGYHGERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWA 122
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSA ++EG + LK+GKLVSLSEQ L+DC + N+GC GG M+ AF++I G+ TE+
Sbjct: 123 FSATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGIDTEE 182
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGY----------------------EAIPARY-AFQL 265
YPY + C+ K + T TG+ AI A + +FQL
Sbjct: 183 SYPYEAMDGDCRF-KKEDVGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSSFQL 241
Query: 266 YSHGVFDE--YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
YS GV+DE +L+HGV VGYG +G+KYWLVKNSW +WG+ GYI M+R+ +
Sbjct: 242 YSEGVYDEPNCSSEELDHGVLAVGYGVKNGKKYWLVKNSWAETWGDNGYILMSRDKDNQ- 300
Query: 324 IGICGILMQASYPV 337
CGI ASYP+
Sbjct: 301 ---CGIASSASYPL 311
>gi|313118760|gb|ADR32292.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 136/218 (62%), Gaps = 24/218 (11%)
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P SVDWR +G + VKDQG CGSCWAFSAVAA+E IN + TG L+SLSEQELVDCD S
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSY 60
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA 262
N+GC+GG M+ AFEF+ GG+ +E+DYPY+ +ND C + V I YE +P
Sbjct: 61 NEGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNE 120
Query: 263 ----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVK 300
FQ Y G+F CG ++HGV GYG ++G YW+V+
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVR 180
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
NSWG +WGE GY+R+ RN SS+ G+CG+ + SYPVK
Sbjct: 181 NSWGANWGEKGYLRVQRNIASSS-GLCGLATEPSYPVK 217
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 175/316 (55%), Gaps = 42/316 (13%)
Query: 56 QSMEERFENWLKQYSREYGS-EDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKF 110
Q+++ ++ W Q+ R Y + ED W+R + N++ I+ N + SF+L NKF
Sbjct: 23 QTLDSQWHQWKAQHRRTYAANEDGWRR--ATWEKNLKMIEMHNLEYSAGKHSFQLGMNKF 80
Query: 111 ADLSNEEFISTYLGYNKPYNEPRWPSVQY-----LGLPASVDWRKEGAVTPVKDQGQCGS 165
D++ EEF GYN ++ R Y LP SVDWR++G VTPVK+QGQCGS
Sbjct: 81 GDMTTEEFKQVMNGYNSNGSQKRTKGSLYREPLLAQLPKSVDWREKGYVTPVKNQGQCGS 140
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFSA ++EG KT KLVSLSEQ LVDC + N GC+GG M+ AFE++ GG+
Sbjct: 141 CWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTSEGNNGCSGGLMDNAFEYVKNNGGID 200
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA-----------------------RYA 262
TE YPY G+++ C+ + + +TG+ IP+ +
Sbjct: 201 TEQAYPYLGQDNECKY-RAECSGANVTGFVDIPSMNERALMKAVANVGPISVAIDAGNPS 259
Query: 263 FQLYSHGVFDE--YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
FQ Y GV+ E QL+HGV VVGYG ++YW+VKNSWG WG+ GY+ MA+
Sbjct: 260 FQFYESGVYYEPQCSSSQLDHGVLVVGYGSIGKDEYWIVKNSWGEEWGKKGYVLMAKFRN 319
Query: 321 SSNIGICGILMQASYP 336
+ CGI ASYP
Sbjct: 320 NH----CGIATAASYP 331
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 214 bits (546), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 136/352 (38%), Positives = 194/352 (55%), Gaps = 55/352 (15%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
ML AVL++ L L P+ DPQ ++E ++ W ++++Y ++E RR
Sbjct: 1 MLPVAVLAVCLSAALSAPS----------LDPQ-LDEHWDLWKSWHTKKYHEKEEGWRRM 49
Query: 84 GIYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKP----YNEPRWP 135
++ N++ I+ N ++ +++L N F D+++EEF GY + + +
Sbjct: 50 -VWEKNLKKIELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYKRKSERKFKGSLFM 108
Query: 136 SVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELV 195
+L P SVDWR G VTPVKDQGQCGSCWAFS A+EG + KTGKLVSLSEQ LV
Sbjct: 109 EPNFLEAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLV 168
Query: 196 DCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDR-CQTDKTKHHAVTITGY 254
DC N+GCNGG M++AF++I G+ +ED YPY G +D+ C D K+++ TG+
Sbjct: 169 DCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYD-PKYNSANDTGF 227
Query: 255 EAIPA-----------------------RYAFQLYSHGVF--DEYCGHQLNHGVTVVGY- 288
IP+ +FQ Y G++ E +L+HGV VVGY
Sbjct: 228 IDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYG 287
Query: 289 --GED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
GED G+KYW+VKNSW WG+ GYI MA++ + CGI ASYP+
Sbjct: 288 FEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNH----CGIATAASYPL 335
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 135/341 (39%), Positives = 188/341 (55%), Gaps = 41/341 (12%)
Query: 30 LSLFLLWVLGIPAGAWS-EGYPQKYDPQSME---ERFENWLKQYSREYGSEDEWQRRFGI 85
+++ L +G+ G +S GY Q D S E + FE+W+ ++++ Y + DE RF I
Sbjct: 13 VAICLFVYMGLSFGDFSIVGYSQN-DLTSTERLIQLFESWMLKHNKIYKNIDEKIYRFEI 71
Query: 86 YSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGY---NKPYNEPRWPSVQYLG- 141
+ N++YID N +N S+ L N FAD+SN+EF Y G N E + V G
Sbjct: 72 FKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKYTGSIAGNYTTTELSYEEVLNDGD 131
Query: 142 --LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDV 199
+P VDWR++GAVTPVK+QG CGSCWAFSAV +EGI K++TG L SEQEL+DCD
Sbjct: 132 VNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDR 191
Query: 200 NSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI-- 257
S GCNGGY A + + + G+ + YPY G C++ + +A G +
Sbjct: 192 RS--YGCNGGYPWSALQLVAQY-GIHYRNTYPYEGVQRYCRSREKGPYAAKTDGVRQVQP 248
Query: 258 --------------------PARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYW 297
A FQLY G+F CG++++H V VGYG + Y
Sbjct: 249 YNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGPN----YI 304
Query: 298 LVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
L+KNSWGT WGE GYIR+ R + +S G+CG+ + YPVK
Sbjct: 305 LIKNSWGTGWGENGYIRIKRGTGNS-YGVCGLYTSSFYPVK 344
>gi|403302734|ref|XP_003942008.1| PREDICTED: cathepsin K isoform 1 [Saimiri boliviensis boliviensis]
Length = 329
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 133/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y S+ DE RR I+ N++YI N + +F+L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYTSKVDEISRRL-IWEKNLKYISIHNLEASLGVHTFELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPTSFSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
Length = 342
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 133/336 (39%), Positives = 186/336 (55%), Gaps = 41/336 (12%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
+L+W L + + A ++ + DP +++ ++ W K Y ++Y ++E R I+ N++
Sbjct: 14 WLVWALLLCSSAMAQVH---RDP-TLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 69
Query: 93 IDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPAS 145
+ N ++ S++L N D+++EE IS P PR + S LP S
Sbjct: 70 VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKLPDS 129
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-ENQ 204
+DWR++G VT VK QG CGSCWAFSAV A+E KLKTGKLVSLS Q LVDC N+
Sbjct: 130 MDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNK 189
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------ 258
GCNGG+M +AF++I G+ +E YPY+ + +CQ D K+ A T + Y +P
Sbjct: 190 GCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYD-VKNRAATCSRYIELPFGSEEA 248
Query: 259 -----------------ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVK 300
+ +F LY GV +D C +NHGV VVGYG G+ YWLVK
Sbjct: 249 LKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVK 308
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
NSWG +G+ GYIRMARNS + CGI SYP
Sbjct: 309 NSWGLHFGDQGYIRMARNSGNH----CGIASYPSYP 340
>gi|50657029|emb|CAH04632.1| cathepsin L [Suberites domuncula]
Length = 324
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 128/308 (41%), Positives = 174/308 (56%), Gaps = 35/308 (11%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS--QNLSFKLTDNKFADLSNEE 117
E + W +++S+EY E E RR I+ SN ++ID NS + L N+F DLS E
Sbjct: 21 EEWVAWKQEHSKEYTEELEELRRHTIWQSNKKFIDSHNSVSDKFGYTLEMNEFGDLSGVE 80
Query: 118 FISTYLGY---NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
F Y GY + + + + Y+ ASVDWR++G V+ VK+QGQCGSCW+FSA +
Sbjct: 81 FKQIYNGYIMQERANDTKLFTASPYMEPAASVDWRQKGVVSEVKNQGQCGSCWSFSATGS 140
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
+EG + LK G+LVSLSEQ L+DC N GC GG M+ AF ++ GV TE YPY
Sbjct: 141 LEGQHALKMGRLVSLSEQNLMDCSSRFGNHGCKGGIMDDAFRYVISNHGVDTESSYPYTA 200
Query: 235 KNDRCQTDKTKHHAVTITGYE----------------------AIPARY-AFQLYSHGVF 271
K+ C+ ++ A T T Y AI A + +FQ Y +GV+
Sbjct: 201 KDGYCRFNQNNVGA-TETSYRDIARGSESSLTQASAQIGPISVAIDASHRSFQFYKNGVY 259
Query: 272 DE--YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
E +L+HGV VVGYG + G+ Y++VKNSWGT WG GYI M+RN ++ CGI
Sbjct: 260 YEPSCSSSRLDHGVLVVGYGTEGGQDYFIVKNSWGTRWGMDGYIMMSRNRRNN----CGI 315
Query: 330 LMQASYPV 337
QASYP+
Sbjct: 316 ASQASYPI 323
>gi|390457768|ref|XP_002742793.2| PREDICTED: cathepsin L2 [Callithrix jacchus]
Length = 588
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 135/343 (39%), Positives = 187/343 (54%), Gaps = 48/343 (13%)
Query: 30 LSLFLL-WVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
LSL L + LGI + A K+D Q+++ ++ W + R YG+ +E RR ++
Sbjct: 3 LSLVLAAFCLGIASAA------PKFD-QNLDTQWYQWKATHRRLYGTNEEGWRR-AVWEK 54
Query: 89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGY--NKPYNEPRWPSVQYLGL 142
N++ I+ N + F + N F D++NEEF + + K N + L L
Sbjct: 55 NMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQVMVCFRNQKHKNRKVFRGPLLLNL 114
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P SVDWRK+G VTPVK+Q QCGSCWAFSA A+EG KTGKLVSLSEQ LVDC
Sbjct: 115 PKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHPQG 174
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA--- 259
NQGCNGG+M AF+++ + GG+ +E YPY K+ C+ K ++ TG+ IPA
Sbjct: 175 NQGCNGGFMNNAFQYVKENGGLDSEASYPYVAKDGSCKY-KPENSVANDTGFVVIPAHEK 233
Query: 260 -------------------RYAFQLYSHGV-FDEYCGHQ-LNHGVTVVGYG----EDHGE 294
+FQ Y G+ F++ C + L+HGV VVGYG +
Sbjct: 234 ELMKAVATVGPISVAVDASHSSFQFYKSGIYFEQDCSSKNLDHGVLVVGYGFEGTNSNNN 293
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YWL+KNSWG WG GYI++A++ + CGI ASYP+
Sbjct: 294 NYWLIKNSWGPEWGSNGYIKIAKDRNNH----CGIATAASYPI 332
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 171/314 (54%), Gaps = 45/314 (14%)
Query: 64 NWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINS----QNLSFKLTDNKFADLSNEEFI 119
N+ ++ + Y E E + R IY N I N + ++++L NK+ D+ N EF
Sbjct: 30 NFKMEHKKCYKHEAEERLRMKIYMKNKLQIAQHNCDYELKKVTYRLKINKYGDMLNHEFK 89
Query: 120 STYLGYNKPYNEP----RWPSVQY------LGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
+ GYN+ N R P + LP VDWRK GAVT VKDQG CGSCWAF
Sbjct: 90 NMLNGYNRTINHTLRNERLPVGAAFIEPCNVELPKMVDWRKCGAVTEVKDQGHCGSCWAF 149
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
SA ++EG + +TG LVSLSEQ L+DC + N GCNGG M++AF +I G+ TE
Sbjct: 150 SATGSLEGQHFRRTGVLVSLSEQNLIDCSGSYGNNGCNGGLMDQAFSYIKDNKGLDTEKT 209
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLY 266
YPY G++D+C+ DK A + G+ IP + +FQ Y
Sbjct: 210 YPYEGEDDKCRYDKRSSGASDV-GFVDIPVGDEQKLKAAVATVGPVSVAIDASHQSFQFY 268
Query: 267 SHGV-FDEYCGH-QLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
S G+ F+ C L+HGV VVGYG D G YW+VKNSWG SWGE GYI+MARN +
Sbjct: 269 SDGIYFEPECSSTNLDHGVLVVGYGTDEEGRDYWIVKNSWGESWGEKGYIKMARNIDNH- 327
Query: 324 IGICGILMQASYPV 337
CGI ASYP+
Sbjct: 328 ---CGIASSASYPI 338
>gi|313118766|gb|ADR32295.1| C14 cysteine protease [Solanum demissum]
gi|313118774|gb|ADR32299.1| C14 cysteine protease [Solanum verrucosum]
gi|313118776|gb|ADR32300.1| C14 cysteine protease [Solanum verrucosum]
gi|313118778|gb|ADR32301.1| C14 cysteine protease [Solanum verrucosum]
Length = 217
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 135/218 (61%), Gaps = 24/218 (11%)
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P SVDWR +G + VKDQG CGSCWAFSAVAA+E IN + TG L+SLSEQELVDCD S
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSY 60
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA 262
N+GC+GG M+ AFEF+ GG+ +E+DYPY+ +ND C + V I YE +P
Sbjct: 61 NEGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNE 120
Query: 263 ----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVK 300
FQ Y G+F CG ++HGV GYG ++G YW+V+
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVR 180
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
NSWG WGE GY+R+ RN SS+ G+CG+ + SYPVK
Sbjct: 181 NSWGAKWGEKGYLRVQRNIASSS-GLCGLATEPSYPVK 217
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 172/319 (53%), Gaps = 46/319 (14%)
Query: 60 ERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSN 115
E + + ++ + Y + E + R I++ N I N + +SFKL NK+ADL +
Sbjct: 27 EEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 86
Query: 116 EEFISTYLGYN-----------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
EF G+N + + S ++ LP SVDWR +GAVT VKDQG CG
Sbjct: 87 HEFRQLMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGHCG 146
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFS+ A+EG + K+G LVSLSEQ LVDC N GCNGG M+ AF +I GG+
Sbjct: 147 SCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGI 206
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARY 261
TE YPY +D C +K A T G+ IP +
Sbjct: 207 DTEKSYPYEAIDDSCHFNKGTIGA-TDRGFTDIPQGDEKKMAEAVATVGPVSVAIDASHE 265
Query: 262 AFQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQ YS GV++E C Q L+HGV VVG+G D G+ YWLVKNSWGT+WG+ G+I+M RN
Sbjct: 266 SFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTDESGDDYWLVKNSWGTTWGDKGFIKMLRN 325
Query: 319 SPSSNIGICGILMQASYPV 337
+ CGI +SYP+
Sbjct: 326 KDNQ----CGIASASSYPL 340
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 128/316 (40%), Positives = 173/316 (54%), Gaps = 41/316 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
+E + W ++ + Y S++E R I+ N+ + N + + ++ L N+FADL
Sbjct: 24 FDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADL 83
Query: 114 SNEEFISTYLGY------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCW 167
NEEF++ G+ PS LP +VDWR +G VTPVKDQGQCGSCW
Sbjct: 84 KNEEFVAMMTGFRVNGTSKAAKGSTFLPSNNIGELPKTVDWRTKGYVTPVKDQGQCGSCW 143
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS ++EG + TGKLVSLSEQ LVDC N+GC+GG M++AF++I K GG+ TE
Sbjct: 144 AFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKEGNEGCDGGLMDQAFQYIIKAGGIDTE 203
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYE----------------------AIPARY-AFQ 264
+ YPY+ + C K A T+TGY AI A + +FQ
Sbjct: 204 ESYPYKAVDGECHFKKANIGA-TVTGYTDVTSDSETALQKAVAHIGPISVAIDASHMSFQ 262
Query: 265 LYSHGVFDE-YCGHQ-LNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
LY GV++E C L+HGV VGYG G YW+VKNSW +WG GY+ M+RN +
Sbjct: 263 LYKSGVYNEPDCSSTLLDHGVLAVGYGTTSDGTDYWIVKNSWAETWGMNGYLWMSRNKDN 322
Query: 322 SNIGICGILMQASYPV 337
CGI QASYP+
Sbjct: 323 Q----CGIATQASYPL 334
>gi|94480716|emb|CAI91577.1| cathepsin L [Aphrocallistes vastus]
Length = 329
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 134/339 (39%), Positives = 187/339 (55%), Gaps = 49/339 (14%)
Query: 32 LFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQ 91
+FLL LG+ AGA + + Q +E W +Y+R YG ++E +++ I+++N+
Sbjct: 4 VFLL--LGLFAGACVCLQCETEEVQDFA--WEGWKLKYNRSYGLDEELRKK--IWANNML 57
Query: 92 YIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWP----------SVQYLG 141
Y+ N++ S+KL N+FADL+N E+ YLGY+ NE R ++
Sbjct: 58 YVKEFNAEGHSYKLAANQFADLTNLEYRQIYLGYD---NEARLSRKREGKVFQRKMKDED 114
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP +VDWR +G VTPVK+QGQCGSCW+FSA ++EG +K+GKLVS SEQELVDC +
Sbjct: 115 LPTTVDWRSKGVVTPVKNQGQCGSCWSFSATGSLEGQYAIKSGKLVSFSEQELVDCSTSL 174
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC----QTDKTKHHAVTITGYEAI 257
N GC GG M+ AF++ + E DY Y KN +C Q TK + T E
Sbjct: 175 GNHGCQGGLMDYAFKYW-ETNLAEKESDYTYTAKNGKCKYNAQLGVTKDSSFTDIPSENC 233
Query: 258 PA------------------RYAFQLYSHGVFDEY-CGH-QLNHGVTVVGYGEDHGEKYW 297
A +FQ+Y G++ + C +L+HGV VVGYG D+G YW
Sbjct: 234 DALKEAVANKGPIAVAMDASHTSFQMYHSGIYTPFLCSKTKLDHGVLVVGYGTDNGVDYW 293
Query: 298 LVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
L+KNSWG +WG GY ++ S CGI QASYP
Sbjct: 294 LIKNSWGMAWGMDGYFKIEMKSDK-----CGICTQASYP 327
>gi|301767944|ref|XP_002919404.1| PREDICTED: cathepsin K-like [Ailuropoda melanoleuca]
gi|281352889|gb|EFB28473.1| hypothetical protein PANDA_008011 [Ailuropoda melanoleuca]
Length = 330
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 179/319 (56%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 18 YPEEILDTQWELWKKTYGKQYNSKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 76
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P S+D+RK+G VTPVK+QGQ
Sbjct: 77 NHLGDMTSEEVVQKMTGLKVPPSHSRNNDTLYIPDWESRAPDSIDYRKKGYVTPVKNQGQ 136
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 137 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 194
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 195 GIDSEDAYPYVGQDESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDAS 253
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 254 LTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 313
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 314 NKNNA----CGIANLASFP 328
>gi|351694420|gb|EHA97338.1| Cathepsin K [Heterocephalus glaber]
Length = 329
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 130/319 (40%), Positives = 178/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREY-GSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y G DE RR I+ N++YI N + +++L+
Sbjct: 17 YPEEILDTQWELWKKTYQKQYNGKVDELSRRL-IWEKNLKYISIHNLEASLGVHTYELSM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D++NEE + G P Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTNEEVVQKMTGLKVPPAHSHSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ +
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQQNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY +P AR
Sbjct: 194 GIDSEDAYPYVGQDESCMYNPTG-KAAKCRGYREVPVGNEKALKRAVARVGPISVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C G LNH V VGYG G K+W++KNSWG +WG GY+ +AR
Sbjct: 253 LTSFQFYSKGVYYDESCDGDNLNHAVLAVGYGIQRGHKHWILKNSWGENWGNKGYVLLAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNT----CGIANLASFP 327
>gi|77404197|ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris]
gi|122056102|sp|Q3ZKN1.1|CATK_CANFA RecName: Full=Cathepsin K; Flags: Precursor
gi|58047562|gb|AAW65150.1| cathepsin K [Canis lupus familiaris]
Length = 330
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 179/319 (56%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +++ W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 18 YPEEILDTQWDLWKKTYRKQYNSKVDELSRRL-IWEKNLKHISIHNLEASLGVHTYELAM 76
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 77 NHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWESRAPDSVDYRKKGYVTPVKNQGQ 136
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 137 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 194
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 195 GIDSEDAYPYVGQDESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDAS 253
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 254 LTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 313
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 314 NKNNA----CGIANLASFP 328
>gi|2804264|dbj|BAA24443.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 132/312 (42%), Positives = 176/312 (56%), Gaps = 45/312 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI---DYINSQN-LSFKLTDNKFADL 113
++E++ ++ Q+S+ Y SE E + R I+ N + + SQ + FKL NK+AD+
Sbjct: 23 VQEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHSKLFSQGFVKFKLGLNKYADM 82
Query: 114 SNEEFISTYLGYNKPYNE----------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQC 163
+ EF+ST G+NK N R+ S + LP +VDWR +GAVT VKDQG C
Sbjct: 83 LHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHC 142
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCW+FS ++EG + KTGKLVSLSEQ LVDC N GCNGG M+ AF +I GG
Sbjct: 143 GSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDNGG 202
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE----------------------AIPARY 261
+ TE YPY ++++C KT++ T G+ AI A Y
Sbjct: 203 IDTEQSYPYLAEDEKCHY-KTQNSGATDKGFVDIEEGNEDDLKAAVATVGPISIAIDASY 261
Query: 262 -AFQLYSHGVFD--EYCGHQLNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
FQLYS GV+ E +L+HGV VVGYG D G+ YWLVKNSW S G GYI+MAR
Sbjct: 262 ETFQLYSDGVYSDPECISQELDHGVLVVGYGTSDDGQDYWLVKNSWRPSCGLNGYIKMAR 321
Query: 318 NSPSSNIGICGI 329
N + +CG+
Sbjct: 322 NQDN----MCGV 329
>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
Length = 333
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 136/346 (39%), Positives = 184/346 (53%), Gaps = 56/346 (16%)
Query: 31 SLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
SLFL + LGI + A K+D QS+ ++ W + R YG +E RR ++ N
Sbjct: 4 SLFLTALCLGIASAA------PKFD-QSLNAQWYQWKATHRRLYGMNEEGWRR-AVWEKN 55
Query: 90 VQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSVQ 138
++ I+ N + F + N F D++NEEF G+ K + EP + +
Sbjct: 56 MKMIELHNREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKMFQEPLFAEI- 114
Query: 139 YLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCD 198
P SVDWR++G VTPVK+QGQCGSCWAFSA A+EG KTGKLVSLSEQ LVDC
Sbjct: 115 ----PKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170
Query: 199 VNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP 258
N+GCNGG M+ AF ++ GG+ +E+ YPY G++ K + A TG+ +P
Sbjct: 171 RAQGNEGCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLP 230
Query: 259 AR----------------------YAFQLYSHGV-FDEYCGHQ-LNHGVTVVGY---GED 291
R +FQ Y G+ FD C + L+HGV VVGY G D
Sbjct: 231 QREKALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTD 290
Query: 292 HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
K+W+VKNSWG WG GY++MA++ + CGI ASYP
Sbjct: 291 SNNKFWIVKNSWGPEWGWNGYVKMAKDQNNH----CGIATAASYPT 332
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 128/321 (39%), Positives = 181/321 (56%), Gaps = 51/321 (15%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI---DYINSQNLS-FKLTDNKFADL 113
E+ ++++ + R YG +E QR+ ++ +N++ I ++++ Q S +++ N+FAD+
Sbjct: 39 FEKLWQDFKTVHERTYGETEESQRK-EVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADM 97
Query: 114 SNEEFISTYLGY------------NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
EF S G+ + Y P P + +PA VDWRKEG VTPVK+QG
Sbjct: 98 EANEFASIMNGFRMNNRTEVRDHLHANYISPAIP----VSVPAEVDWRKEGYVTPVKNQG 153
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
QCGSCWAFS ++EG + KTGKLVSLSEQ LVDC + N+GCNGG ++ AF++I
Sbjct: 154 QCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSTSYGNEGCNGGIVDYAFQYIKDN 213
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------- 258
G TE YPY + C+ K+ T TGY +P
Sbjct: 214 DGDDTEACYPYEAVDGTCRF-KSVCVGATCTGYTDLPKGDEAKMKEAVALVGPVSVAIDA 272
Query: 259 ARYAFQLYSHGVF--DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMA 316
+ +FQ+Y G++ E QL+H V VVGYG + G+ YWLVKNSWGT+WG+ GYI+MA
Sbjct: 273 SHSSFQMYQSGIYVEQECSPKQLDHAVLVVGYGTEQGQDYWLVKNSWGTTWGDEGYIKMA 332
Query: 317 RNSPSSNIGICGILMQASYPV 337
RN + CGI QASYP+
Sbjct: 333 RNMDNQ----CGIASQASYPL 349
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 214 bits (545), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 131/317 (41%), Positives = 176/317 (55%), Gaps = 46/317 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFADL 113
+ ++ +L++Y R Y S+ E +RR GI++ N I N +S+ + N F+D
Sbjct: 63 LNSMWQAFLEKYKRVYDSKLEEERRLGIFTENFIRISEHNLLFEKGEVSYSMGINAFSDK 122
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQYLGL----PASVDWRKEGAVTPVKDQGQCGSCWAF 169
+N E + G+ R S QY+ PA VDWR +GAVTPVK+QG CGSCWAF
Sbjct: 123 TNSE-LDVLRGFRHSSKASRSGS-QYIPFDAAPPAEVDWRTKGAVTPVKNQGDCGSCWAF 180
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
SA +EG + L TGKLVSLSEQ+LVDC +S N GC+GG M+ AFE++ + G+ TE
Sbjct: 181 SATGGIEGQHYLATGKLVSLSEQQLVDC--SSSNDGCDGGLMDLAFEYVKEHKGIDTEVH 238
Query: 230 YPYRGKND----RCQTDKTKHHAVTITGYEAIP-----------------------ARYA 262
YPY N +C D K+ AV +TGY IP +
Sbjct: 239 YPYVSGNTGYARQCSFDP-KYAAVNVTGYVDIPEGQELLLQQAVGFHGPISVGINAGLPS 297
Query: 263 FQLYSHGVF-DEYCG-HQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
F Y G++ D C H L+HGV VVGYG D+G YWL+KNSWG WGE GY+R+ RN
Sbjct: 298 FMAYESGIYSDHRCNPHDLDHGVLVVGYGVDNGVPYWLIKNSWGEDWGENGYVRILRNHN 357
Query: 321 SSNIGICGILMQASYPV 337
+ +CG+ ASYP+
Sbjct: 358 N----LCGVATMASYPL 370
>gi|344271925|ref|XP_003407787.1| PREDICTED: cathepsin L1-like [Loxodonta africana]
Length = 333
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 137/348 (39%), Positives = 186/348 (53%), Gaps = 58/348 (16%)
Query: 30 LSLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYG-SEDEWQRRFGIYS 87
LSLFL + LGI + A K+D QS++ ++ W Y + Y +E++W+R ++
Sbjct: 3 LSLFLAALCLGIASAA------PKFD-QSLDAQWNQWRSTYKKVYAVNEEDWRR--AVWE 53
Query: 88 SNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYN-------KPYNEPRWPS 136
N++ I+ N + F + N F D +NEEF G+ K + EP +
Sbjct: 54 KNMKMIERHNQEYSQGKHGFTMAMNAFGDKTNEEFRQLMNGFQSQKHKKGKLFYEPVFGH 113
Query: 137 VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
+ P SVDW ++G VTPVKDQGQCGSCWAFSA A+EG KTGKLVSLSEQ LVD
Sbjct: 114 I-----PTSVDWTQKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVD 168
Query: 197 CDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEA 256
C N+GCNGG M+ AF+++ GG+ +E+ YPY + + K+ A TG+
Sbjct: 169 CSWREGNEGCNGGLMDNAFQYVKDNGGLDSEESYPYTATDTQDCRYNPKYSAANDTGFVD 228
Query: 257 IP----------------------ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYG---- 289
IP + +FQ YS G+ FD C +NHGV VGYG
Sbjct: 229 IPPQEKALMKAVATVGPISVAIDAGQVSFQFYSSGIYFDPACRLTVNHGVLAVGYGFEGT 288
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+ KYWLVKNSWG SWG GYI++A++ + CGI ASYP
Sbjct: 289 DPDKNKYWLVKNSWGKSWGADGYIKIAKDRNNH----CGIARAASYPT 332
>gi|313118764|gb|ADR32294.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 106/218 (48%), Positives = 135/218 (61%), Gaps = 24/218 (11%)
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P SVDWR +G + VKDQG CGSCWAFSAVAA+E IN + TG L+SLSEQELVDCD S
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSY 60
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA 262
NQGC+GG M+ AFEF+ GG+ +E+DYPY+ +N C + V I YE +P
Sbjct: 61 NQGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNGVCDQYRKNAKVVVIDSYEDVPVNNE 120
Query: 263 ----------------------FQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVK 300
FQ Y G+F CG ++HGV GYG ++G YW+V+
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGLDYWIVR 180
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
NSWG WGE GY+R+ RN SS+ G+CG+ ++ SYPVK
Sbjct: 181 NSWGADWGEKGYLRVQRNVASSS-GLCGLAIEPSYPVK 217
>gi|60654335|gb|AAX29858.1| cathepsin K [synthetic construct]
gi|60654337|gb|AAX29859.1| cathepsin K [synthetic construct]
Length = 330
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y ++ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|77735825|ref|NP_001029607.1| cathepsin K precursor [Bos taurus]
gi|59858469|gb|AAX09069.1| cathepsin K preproprotein [Bos taurus]
gi|83638771|gb|AAI09854.1| Cathepsin K [Bos taurus]
gi|296489554|tpg|DAA31667.1| TPA: cathepsin K [Bos taurus]
Length = 334
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 178/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 22 YPEEILDTQWELWKKTYRKQYNSKGDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 80
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 81 NHLGDMTSEEVVQKMTGLKVPASRSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 140
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 141 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 198
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 199 GIDSEDAYPYVGQDENCMYNPT-GKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDAS 257
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ Y GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 258 LTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 317
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 318 NKNNA----CGIANLASFP 332
>gi|2804266|dbj|BAA24444.1| cysteine proteinase [Sitophilus zeamais]
Length = 331
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 132/312 (42%), Positives = 176/312 (56%), Gaps = 45/312 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSN----VQYIDYINSQNLSFKLTDNKFADL 113
++E++ ++ Q+S+ Y SE E + R I+ N ++ + + FKL NK+AD+
Sbjct: 23 VQEQWSSFKMQHSKNYDSETEERFRMKIFMENDHKVAKHSKLFSQGFVKFKLGLNKYADM 82
Query: 114 SNEEFISTYLGYNKPYNE----------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQC 163
+ EF+ST G+NK N R+ S + LP +VDWR +GAVT VKDQG C
Sbjct: 83 LHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTKVKDQGHC 142
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCW+FS ++EG + KTGKLVSLSEQ LVDC N GCNGG M+ AF +I GG
Sbjct: 143 GSCWSFSGSGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNTGCNGGLMDNAFRYIKDNGG 202
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYE----------------------AIPARY 261
+ TE YPY ++++C KT++ T G+ AI A Y
Sbjct: 203 IDTEQSYPYLAEDEKCHY-KTQNSGATDKGFVDIEEGNEDDLKAAVATVGPVSIAIDASY 261
Query: 262 -AFQLYSHGVF-DEYCGHQ-LNHGVTVVGYG-EDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
FQLYS GV+ D C Q L+HGV VVGYG D G+ YWLVKNSW S G GYI+MAR
Sbjct: 262 ETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDGQDYWLVKNSWRPSCGLNGYIKMAR 321
Query: 318 NSPSSNIGICGI 329
N + +CG+
Sbjct: 322 NQDN----MCGV 329
>gi|49456399|emb|CAG46520.1| CTSK [Homo sapiens]
Length = 329
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y ++ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 130/321 (40%), Positives = 175/321 (54%), Gaps = 50/321 (15%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFA 111
Q+++ +++ W + R YG +E RR ++ N++ I+ N + SF L N F
Sbjct: 23 QTLDAQWDQWKAAHGRLYGLNEEGWRR-AVWEKNLRMIELHNGEYSQGRHSFTLGMNHFG 81
Query: 112 DLSNEEFISTYLGYN-------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
D++NEEF G+ K Y EP L LP SVDWR++G VT VK+QGQCG
Sbjct: 82 DMTNEEFRQVMNGFQHQKHKTGKMYQEPL-----LLQLPKSVDWREKGYVTEVKNQGQCG 136
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFSA ++EG KTG LVSLSEQ LVDC NQGCNGG M+ AF+++ G+
Sbjct: 137 SCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDCSRPQGNQGCNGGLMDFAFQYVKDNKGL 196
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------YA 262
E YPY GK+ C+ K + A TG+ +P R +
Sbjct: 197 EAEKSYPYVGKDGECKY-KPELSAANDTGFVDVPQREKVVQKALATVGPLSVAIDAGLQS 255
Query: 263 FQLYSHGV-FDEYCGHQ-LNHGVTVVGYGEDHGE----KYWLVKNSWGTSWGEAGYIRMA 316
FQ Y G+ +D C + LNHGV +VGYG D E YWL+KNSWGT+WG GY+++A
Sbjct: 256 FQFYKEGIYYDPGCSSRDLNHGVLLVGYGTDASETGKGDYWLIKNSWGTTWGADGYVKIA 315
Query: 317 RNSPSSNIGICGILMQASYPV 337
RN + CG+ ASYP+
Sbjct: 316 RNRNNH----CGVATAASYPL 332
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 214 bits (544), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 132/335 (39%), Positives = 181/335 (54%), Gaps = 38/335 (11%)
Query: 34 LLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI 93
+ W+L + G S Q + +++ ++ W K Y ++Y E+E R I+ N++Y+
Sbjct: 10 MKWLLLVLLGC-SSAMAQLHKDPTLDHHWDLWKKTYGKQYTEENEEVTRRFIWEKNLKYV 68
Query: 94 DYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPASV 146
N ++ S+ L N AD+++EE + P R + S LP S+
Sbjct: 69 MLHNLEHSMGMHSYDLGMNHLADMTSEEVMLLMSSLRVPSQWQRNVTFKSNPNQKLPDSM 128
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-ENQG 205
DWR +G VT VK QG CGSCWAFSAV A+E KLKTGKLVSLS Q LVDC N+G
Sbjct: 129 DWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQNLVDCSTGKYSNKG 188
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------- 258
CNGG+M +AF++I G+ +E YPY+ + +CQ D K+ A T + Y +P
Sbjct: 189 CNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYD-VKNRAATCSKYVELPFGNEEAL 247
Query: 259 ----------------ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKN 301
+ +F LY GV +D+ C +NHGV VGYG +G+ YWLVKN
Sbjct: 248 KEAVANKGPVSVAIDASHPSFFLYRSGVYYDKACTLNVNHGVLAVGYGNYNGKDYWLVKN 307
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
SWG +GE GYIRMARNS + CGI SYP
Sbjct: 308 SWGLHFGEQGYIRMARNSGNH----CGIASYPSYP 338
>gi|395535911|ref|XP_003769964.1| PREDICTED: cathepsin K [Sarcophilus harrisii]
Length = 332
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 136/338 (40%), Positives = 184/338 (54%), Gaps = 44/338 (13%)
Query: 35 LWVLGIPAGAWSEGYPQKYDPQSM-EERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQY 92
+W I + P+ M + +++ W + Y +EY S+ DE RR I+ N++Y
Sbjct: 1 MWEFSILLLLLPSVVSSAHHPEEMLDTQWKLWKQSYGKEYNSKVDEISRRL-IWEKNLKY 59
Query: 93 IDYIN---SQNL-SFKLTDNKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LP 143
I N S L +F+L N D+++EE + G P + + Y+ P
Sbjct: 60 ISTHNLEFSLGLHTFELAMNHLGDMTSEEVVQKMTGLKMPLSRSQNNDTLYIPDWEGRTP 119
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
SVD+RK+G VTPVK+QGQCGSCWAFS+V A+EG K KTGKL++LS Q LVDC S+N
Sbjct: 120 ESVDYRKKGYVTPVKNQGQCGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SKN 177
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----- 258
GC GGYM AF+++ + G+ +ED YPY G+++ C + T A GY IP
Sbjct: 178 DGCGGGYMTNAFQYVQENRGIDSEDAYPYIGQDESCMYNPTG-KAAKCRGYREIPEGSEK 236
Query: 259 ------ARY------------AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWL 298
AR +FQ YS GV +DE C G LNH V VGYG G K+W+
Sbjct: 237 ALKRAVARVGPVAVAIDASLSSFQFYSKGVYYDENCNGDNLNHAVLAVGYGIQRGTKHWI 296
Query: 299 VKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
+KNSWG WG GYI MARN ++ CGI AS+P
Sbjct: 297 IKNSWGEEWGNKGYILMARNKKNA----CGIANLASFP 330
>gi|301789679|ref|XP_002930256.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
gi|281343339|gb|EFB18923.1| hypothetical protein PANDA_020645 [Ailuropoda melanoleuca]
Length = 334
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 137/343 (39%), Positives = 187/343 (54%), Gaps = 49/343 (14%)
Query: 31 SLFL-LWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
SLFL + LGI + A K D QS+ E++ W + R YG +E RR ++ N
Sbjct: 4 SLFLSVLCLGIASAA------PKLD-QSLTEQWYQWKATHRRLYGMNEEGWRR-AVWEKN 55
Query: 90 VQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGY--NKPYNEPRWPSVQYLGLP 143
++ ID N + F + N F D++NEEF G+ KP + + +P
Sbjct: 56 MKMIDLHNREYSQGQHGFTMAMNAFGDMTNEEFRQVMNGFRNQKPRKGKVFQEPLFAEIP 115
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
SVDW +G VTPVK+QGQCGSCWAFSA A+EG KTGKLVSLSEQ LVDC + N
Sbjct: 116 KSVDWTLKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRSQGN 175
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKN-DRCQTDKTKHHAVTITGYEAIPAR-- 260
+GCNGG M+ AF+++ + GG+ +E+ YPY G + D C+ K + A TG+ IP R
Sbjct: 176 EGCNGGLMDNAFQYVKENGGLDSEESYPYLGTDTDSCKY-KPECSAANDTGFVDIPQREK 234
Query: 261 --------------------YAFQLYSHGV-FDEYCGHQ-LNHGVTVVGYG----EDHGE 294
+FQ Y G+ +D C + L+HGV VVGYG + +
Sbjct: 235 ALMKAVATVGPISVAIDAGHQSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTDSNNN 294
Query: 295 KYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
K+W+VKNSWG WG GY++MA++ + CGI ASYP
Sbjct: 295 KFWIVKNSWGPEWGTNGYVKMAKDQNNH----CGIATAASYPT 333
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 214 bits (544), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 125/310 (40%), Positives = 167/310 (53%), Gaps = 36/310 (11%)
Query: 59 EERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEF 118
+ ++ W + +EY +++E R I+ +N++ I N SFKL N D+++ E
Sbjct: 26 DPNWKAWKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHNEGKHSFKLAMNHLGDMTSLEI 85
Query: 119 ISTYLGYNKPYNEPRWPSVQYLGLPA------SVDWRKEGAVTPVKDQGQCGSCWAFSAV 172
T LG + P PA S+DWR +G VTPVK+QGQCGSCWAFS
Sbjct: 86 SQTLLGLKLKKHAESQPKGATFLPPANVKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTT 145
Query: 173 AAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPY 232
A+EG + KTGKLVSLSEQ LVDC N GC GG M+ AF++I + GG+ TE YPY
Sbjct: 146 GALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPY 205
Query: 233 RGKNDRCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHG 269
K+ C +K+ A TG+ IP ++ F Y G
Sbjct: 206 LAKDGVCHYNKSAIGAKD-TGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQG 264
Query: 270 VFD--EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGIC 327
V+D + +L+HGV VGYG D G+ YWLVKNSWG SWGE GYI++ARN C
Sbjct: 265 VYDDPDCSSTRLDHGVLAVGYGTDDGKDYWLVKNSWGPSWGEEGYIKIARNDHDK----C 320
Query: 328 GILMQASYPV 337
G+ +ASYP+
Sbjct: 321 GVASKASYPL 330
>gi|5381317|gb|AAD42940.1|AF091366_1 cryptopain precursor [Cryptosporidium parvum]
Length = 401
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 130/374 (34%), Positives = 195/374 (52%), Gaps = 49/374 (13%)
Query: 11 TNLHLKIAIDMRMMLRNAVLSLFLLWVLGIP-------AGAWSEGYPQKY-DPQSMEER- 61
TN + ++ ++ ++++F++ V+ + + + P Y DP + E R
Sbjct: 25 TNQQREPNKKLKNIIIATLIAIFIVLVVTVSLYITNNTSDKIDDFVPGDYVDPATREYRK 84
Query: 62 -FENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS 120
FE + K+Y + Y S +E +RF IY N+ +I NSQ S+ L N+F DLS EEF++
Sbjct: 85 SFEEFKKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMA 144
Query: 121 TYLGYNKPYNEPRW----------PSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+ GY K + S + P S++W + G V P+++Q CGSCWAFS
Sbjct: 145 RFTGYIKDSKDDERVFKSSRVSASESEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFS 204
Query: 171 AVAAVEGINKLKTGK-LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
AVAA+EG +T + L SLSEQ+ VDC + N GC+GG M AF++ K + T DD
Sbjct: 205 AVAALEGATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTNDD 264
Query: 230 YPYRGKNDRCQTDKTKHH-AVTITGYEAIPAR-----------------------YAFQL 265
YPY + C +++ + + Y+ + R FQ
Sbjct: 265 YPYFAEEKTCMDSFCENYIEIPVKAYKYVFPRNINALKTALAKYGPISVAIQADQTPFQF 324
Query: 266 YSHGVFDEYCGHQLNHGVTVVGY--GEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSN 323
Y GVFD CG ++NHGV +V Y ED ++YWLV+NSWG +WGE GYI++A +S
Sbjct: 325 YKSGVFDAPCGTKVNHGVVLVEYDMDEDTNKEYWLVRNSWGEAWGEKGYIKLALHSGKK- 383
Query: 324 IGICGILMQASYPV 337
G CGIL++ YPV
Sbjct: 384 -GTCGILVEPVYPV 396
>gi|4503151|ref|NP_000387.1| cathepsin K preproprotein [Homo sapiens]
gi|1168793|sp|P43235.1|CATK_HUMAN RecName: Full=Cathepsin K; AltName: Full=Cathepsin O; AltName:
Full=Cathepsin O2; AltName: Full=Cathepsin X; Flags:
Precursor
gi|562757|emb|CAA57649.1| Cathepsin O [Homo sapiens]
gi|606923|gb|AAA65233.1| cathepsin O [Homo sapiens]
gi|1195556|gb|AAB35521.1| cathepsin O2 [Homo sapiens]
gi|16359188|gb|AAH16058.1| Cathepsin K [Homo sapiens]
gi|49456311|emb|CAG46476.1| CTSK [Homo sapiens]
gi|60823594|gb|AAX36649.1| cathepsin K [synthetic construct]
gi|119573901|gb|EAW53516.1| cathepsin K (pycnodysostosis), isoform CRA_b [Homo sapiens]
gi|307685681|dbj|BAJ20771.1| cathepsin K [synthetic construct]
gi|312150424|gb|ADQ31724.1| cathepsin K [synthetic construct]
Length = 329
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y ++ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 176/319 (55%), Gaps = 48/319 (15%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL-------SFKLTDNK 109
S++ + + +++++Y E R G++ ++ ++YI NL SF++ N+
Sbjct: 17 SLDREWGMFKVRHNKQYKDNQEEAYRKGVF---MKAVEYIQQHNLEADRGVHSFRVGINE 73
Query: 110 FADLSNEEFISTYLGYNKPYNEPRWPSVQYL------GLPASVDWRKEGAVTPVKDQGQC 163
+AD+ NEEF+ GY P+ P+ Y+ LPA+VDWR +G VT VK+QGQC
Sbjct: 74 YADMPNEEFVRVMNGYKMQEQRPKAPT--YMPPSNVGDLPATVDWRTKGYVTEVKNQGQC 131
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFS+ ++EG K KL+SLSEQ LVDC N GC GG M++AF +I G
Sbjct: 132 GSCWAFSSTGSLEGQTFKKYNKLISLSEQNLVDCSTEQGNMGCGGGLMDQAFTYIKVNDG 191
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR----------------------- 260
+ TE YPY + +C+ +K A TGY I ++
Sbjct: 192 IDTETSYPYEAASGKCRFNKANVGA-NDTGYTDIKSKSESDLQSAVATVGPIAVAIDASH 250
Query: 261 YAFQLYSHGVFDE-YCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQLY GV+ +C +L+HGV VGYG D G+ YWLVKNSWG +WG+ GYI M+RN
Sbjct: 251 MSFQLYKSGVYHYIFCSQTRLDHGVLAVGYGTDSGKDYWLVKNSWGATWGQQGYIMMSRN 310
Query: 319 SPSSNIGICGILMQASYPV 337
++ CGI QASYP
Sbjct: 311 RDNN----CGIATQASYPT 325
>gi|109940312|sp|Q5E968.2|CATK_BOVIN RecName: Full=Cathepsin K; Flags: Precursor
Length = 329
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 178/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 17 YPEEILDTQWELWKKTYRKQYNSKGDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPASRSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQDENCMYNPT-GKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ Y GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 130/304 (42%), Positives = 170/304 (55%), Gaps = 39/304 (12%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFADLSNEEFISTYLG 124
+ +EY S+ E + R IY N + N S+++ NKF DL + EF S G
Sbjct: 38 HKKEYPSQLEEKLRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNG 97
Query: 125 Y-NKPYNEPRWPSV------QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEG 177
Y +K N R S + +P SVDWR++GA+TPVKDQGQCGSCWAFS+ A+EG
Sbjct: 98 YQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEG 157
Query: 178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
KTGKLVSLSEQ L+DC N+GCNGG M++AF++I G+ TE+ YPY ++
Sbjct: 158 QTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDG 217
Query: 238 RC--------------------QTDKTKHHAVTITGYE-AIPARY-AFQLYSHGVFDE-Y 274
C + DK K T+ AI A + +FQ YS G + E
Sbjct: 218 VCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGXYYEPS 277
Query: 275 C-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
C L+HGV VVGYG D+GE YWLVKNSW WG+ GYI++ARN + CG+ A
Sbjct: 278 CDSDDLDHGVLVVGYGSDNGEDYWLVKNSWSEHWGDEGYIKIARNRKNH----CGVATAA 333
Query: 334 SYPV 337
SYP+
Sbjct: 334 SYPL 337
>gi|440906717|gb|ELR56946.1| Cathepsin K [Bos grunniens mutus]
Length = 338
Score = 213 bits (543), Expect = 7e-53, Method: Compositional matrix adjust.
Identities = 130/319 (40%), Positives = 178/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 26 YPEEILDTQWELWKKTYRKQYNSKGDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 84
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P S+D+RK+G VTPVK+QGQ
Sbjct: 85 NHLGDMTSEEVVQKMTGLKVPASRSRSNDTLYIPDWEGRAPDSIDYRKKGYVTPVKNQGQ 144
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 145 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 202
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 203 GIDSEDAYPYVGQDENCMYNPT-GKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDAS 261
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ Y GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 262 LTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 321
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 322 NKNNA----CGIANLASFP 336
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 213 bits (543), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 130/304 (42%), Positives = 171/304 (56%), Gaps = 39/304 (12%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFADLSNEEFISTYLG 124
+ +EY S+ E + R IY N + N S+++ NKF DL + EF S G
Sbjct: 38 HKKEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNG 97
Query: 125 Y-NKPYNEPRWPSV------QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEG 177
Y +K N R S + +P SVDWR +GA+TPVKDQGQCGSCWAFS+ A+EG
Sbjct: 98 YQHKKQNSSRAESTFTFMEPANVEVPESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEG 157
Query: 178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
KTGKL+SLSEQ L+DC N+GCNGG M++AF++I G+ TE+ YPY +++
Sbjct: 158 QTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDN 217
Query: 238 RC--------------------QTDKTKHHAVTITGYE-AIPARY-AFQLYSHGVFDE-Y 274
C + DK K T+ AI A + +FQ YS GV+ E
Sbjct: 218 VCRYNPRNRGAIDRGFVHIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPS 277
Query: 275 C-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
C L+HGV VVGYG D+G+ YWLVKNSW WG+ GYI++ARN + CGI A
Sbjct: 278 CDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDEGYIKIARNRKNH----CGIATAA 333
Query: 334 SYPV 337
SYP+
Sbjct: 334 SYPL 337
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 213 bits (543), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 126/316 (39%), Positives = 175/316 (55%), Gaps = 43/316 (13%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADL 113
+E ++ W ++ + Y S++E R I+ N+ + N + + ++ L N+FADL
Sbjct: 24 FDEDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIRHNLKYDLGHFTYDLGMNQFADL 83
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQYL------GLPASVDWRKEGAVTPVKDQGQCGSCW 167
N+EF++ G+ +L LP +VDWR +G VTPVKDQGQCGSCW
Sbjct: 84 QNKEFVAMMTGFRVNGTSKAAKGSTFLPPNNVGKLPKTVDWRTKGYVTPVKDQGQCGSCW 143
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFSA ++EG + KTGKLVSLSEQ LVDC + +N GCNGG M++AF++I GG+ TE
Sbjct: 144 AFSATGSLEGQHFKKTGKLVSLSEQNLVDC--SDKNYGCNGGLMDRAFQYIIDAGGIDTE 201
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQ 264
+ YPY + C KT + T+TGY + + ++FQ
Sbjct: 202 ESYPYIAMDGNCHF-KTANVGATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHFSFQ 260
Query: 265 LYSHGVFDEY-CGHQ-LNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
LY GV++E C L+HGV VGYG G YW+VKNSW +WG GYI M+RN +
Sbjct: 261 LYQSGVYNEPGCSSTLLDHGVLAVGYGTTIDGTDYWIVKNSWAETWGMNGYIWMSRNKDN 320
Query: 322 SNIGICGILMQASYPV 337
CGI QASYP+
Sbjct: 321 Q----CGIATQASYPL 332
>gi|426331364|ref|XP_004026652.1| PREDICTED: cathepsin K [Gorilla gorilla gorilla]
Length = 329
Score = 213 bits (543), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y ++ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 130/322 (40%), Positives = 178/322 (55%), Gaps = 41/322 (12%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D ++ F + K++ + Y ++DE +R I+ N+ YI+ +N+QNLS+KL N++ DL
Sbjct: 19 DLETSSLAFIGFQKKHGKSYDNKDEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVNEYTDL 78
Query: 114 SNEEFISTYL-------GYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
+ EEF + L G + P+ L P SVDWRK+G + PVKDQG CGSC
Sbjct: 79 TLEEFAALKLSSTDMSEGMGDGFVAGAGPTTTTL--PTSVDWRKKGVLNPVKDQGYCGSC 136
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSA+ A+E + TGKL+SLSEQ+LVDC N+GCNGG M+KAFE+I K GV
Sbjct: 137 WAFSAIGALEPRYAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEYI-KATGVDK 195
Query: 227 EDDYPYRGKNDRCQ------TDKTKHHAVT------------ITGYEAIP---ARYA--- 262
E YPY G ++ CQ TD VT + G A P A YA
Sbjct: 196 ESTYPYVGSDETCQATVENKTDGLPVGEVTGNQMLHQTEKALMEGVAAAPVSIAMYANLQ 255
Query: 263 -FQLYSHGVF-DEYC---GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
FQ Y GV+ D C G ++HGV VGYG ++G+ Y++++NSWG SWG+ GY+ + R
Sbjct: 256 SFQHYKSGVYSDPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYVYLKR 315
Query: 318 NSPSSNIGICGILMQASYPVKR 339
S G C I P +
Sbjct: 316 GVGS--FGQCNIYKYMCVPTLK 335
>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
Length = 331
Score = 213 bits (543), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 133/336 (39%), Positives = 185/336 (55%), Gaps = 41/336 (12%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
+L+W L + + A + + DP +++ ++ W K Y ++Y ++E R I+ N++
Sbjct: 3 WLVWALLLCSSAMAHVH---RDP-TLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 58
Query: 93 IDYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPAS 145
+ N ++ S++L N D+++EE IS P PR + S LP S
Sbjct: 59 VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVPSQWPRNVTYKSDPNQKLPDS 118
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-ENQ 204
+DWR++G VT VK QG CGSCWAFSAV A+E KLKTGKLVSLS Q LVDC N+
Sbjct: 119 MDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTAKYGNK 178
Query: 205 GCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------ 258
GCNGG+M +AF++I G+ +E YPY+ + +CQ D K+ A T + Y +P
Sbjct: 179 GCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQYD-VKNRAATCSRYIELPFGSEEA 237
Query: 259 -----------------ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVK 300
+ +F LY GV +D C +NHGV VVGYG G+ YWLVK
Sbjct: 238 LKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLDGKDYWLVK 297
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
NSWG +G+ GYIRMARNS + CGI SYP
Sbjct: 298 NSWGLHFGDQGYIRMARNSGNH----CGIANYPSYP 329
>gi|426216528|ref|XP_004002514.1| PREDICTED: cathepsin K [Ovis aries]
Length = 330
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 178/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 18 YPEEILDTQWELWKKTYRKQYNSKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 76
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 77 NHLGDMTSEEVVQKMTGLKVPASRSRSNDTLYIPDWEGRTPDSVDYRKKGYVTPVKNQGQ 136
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 137 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 194
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 195 GIDSEDAYPYVGQDENCMYNPT-GKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDAS 253
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ Y GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 254 LTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 313
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 314 NKNNA----CGIANLASFP 328
>gi|227018328|gb|ACP18830.1| cysteine proteinase 1 [Chrysomela tremula]
Length = 323
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 124/318 (38%), Positives = 180/318 (56%), Gaps = 45/318 (14%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNK 109
+ S +E + ++ K + + Y S E + RF I+ ++ I N++ ++ L N+
Sbjct: 15 NAASDQELWADFKKAHGKTYKSLREEKLRFNIFQDTLREIAAHNAKYESGESTYYLAINQ 74
Query: 110 FADLSNEEFISTYLGYNKPYNEPRWPSVQYLGL--------PASVDWRKEGAVTPVKDQG 161
F+D+++EEF + + N PS++ + + P S+DWR EGAV P+++Q
Sbjct: 75 FSDITDEEFRAMLM-----KNVESRPSLEDMEIANLTVGAAPESIDWRTEGAVLPIRNQE 129
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
CGSCWAFSAVAAVEG +K+G LS Q+LVDC N GCNGG M AF++I K
Sbjct: 130 DCGSCWAFSAVAAVEGQAAIKSGSKTPLSVQQLVDCSTEGGNSGCNGGLMNGAFDYI-KA 188
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA------------------- 262
G+ ++ YPY G +D C+ DK+ V +TGY+ + + A
Sbjct: 189 NGLESDAKYPYTGTDDSCKADKSS-SLVKLTGYKKVASSEASLKEAVGTVGPISVAVYAD 247
Query: 263 -FQLYSHGVFDEY--CGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNS 319
++ Y G+F+ G L+HGVT VGYG D+G+KYW VKNSWG SWGE GYIRMAR++
Sbjct: 248 LWRSYGGGIFNNILCLGFGLDHGVTAVGYGTDNGKKYWPVKNSWGESWGEEGYIRMARDT 307
Query: 320 PSSNIGICGILMQASYPV 337
+ CGI QASYP+
Sbjct: 308 LHN----CGINQQASYPI 321
>gi|6435586|pdb|7PCK|A Chain A, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435587|pdb|7PCK|B Chain B, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435588|pdb|7PCK|C Chain C, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435589|pdb|7PCK|D Chain D, Crystal Structure Of Wild Type Human Procathepsin K
gi|6435592|pdb|1BY8|A Chain A, The Crystal Structure Of Human Procathepsin K
Length = 314
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y ++ DE RR I+ N++YI N + +++L
Sbjct: 2 YPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 60
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 61 NHLGDMTSEEVVQKMTGLKVPLSHSRSNDTLYIPEWEGRAPDSVDYRKKGYVTPVKNQGQ 120
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 121 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 178
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 179 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 237
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 238 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 297
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 298 NKNNA----CGIANLASFP 312
>gi|355681653|gb|AER96814.1| cathepsin K [Mustela putorius furo]
Length = 329
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 130/319 (40%), Positives = 179/319 (56%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y ++ DE RR I+ N+++I N + +++L
Sbjct: 18 YPEEILDTQWELWKKTYGKQYNNKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAM 76
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P S+D+RK+G VTPVK+QGQ
Sbjct: 77 NHLGDMTSEEVVQKMTGLKVPPSHSRSNDSLYIPDWESRAPDSIDYRKKGYVTPVKNQGQ 136
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 137 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 194
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 195 GIDSEDAYPYVGQDESCMYNPTG-KAAKCKGYREIPEGNEKALKRAVARVGPISVAIDAS 253
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 254 LTSFQFYSKGVYYDENCNSDNLNHAVLAVGYGVQKGNKHWIIKNSWGENWGNKGYILMAR 313
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 314 NKNNA----CGIANLASFP 328
>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 350
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 121/305 (39%), Positives = 170/305 (55%), Gaps = 37/305 (12%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYIN-SQNLSFKLTDNKFADLSN 115
++ R E W+ ++ R Y +E RR ++ +N +Y+D +N + N ++ L N+F+DL++
Sbjct: 35 TVAARHEQWMAKFGRVYTDANEKARRQAVFGANARYVDAVNRAGNRTYTLGLNEFSDLTD 94
Query: 116 EEFISTYLGYNKPYNEPRW------PSVQYLG-LPASVDWRKEGAVTPVKDQGQCGSCWA 168
EF T+LGY + E P G +P S DWR +GAVT VK QG CG CWA
Sbjct: 95 NEFAKTHLGYREFRPETANISKGVDPGYGLAGNIPKSFDWRTKGAVTEVKSQGGCGCCWA 154
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
F+AVAA EG+ K+ G L+S+SEQ+++DC + N C GGYM A ++ GG+ TE+
Sbjct: 155 FAAVAATEGLVKIAKGTLISMSEQQVLDC--TTGNNTCKGGYMNDALSYVFASGGLQTEE 212
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIP------------ARYA-----------FQL 265
DY Y + C+ D T + A ++ E +P AR F+
Sbjct: 213 DYEYNAEKGACRRDVTPNPATSVGHAEYMPLDGNEFLLQKLVARQPVVVAVEAYGTDFKN 272
Query: 266 YSHGVF--DEYCGHQLNHGVTVVGYGEDHGEK--YWLVKNSWGTSWGEAGYIRMARNSPS 321
Y GVF CG L+H TVVGYG G K YWLVKN WGTSWGE+GY+R+AR S +
Sbjct: 273 YGGGVFTGSPSCGQNLDHFFTVVGYGFADGGKQMYWLVKNQWGTSWGESGYMRIARGSSA 332
Query: 322 SNIGI 326
N G+
Sbjct: 333 RNCGM 337
>gi|261289787|ref|XP_002611755.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
gi|229297127|gb|EEN67765.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
Length = 327
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 178/317 (56%), Gaps = 41/317 (12%)
Query: 57 SMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFAD 112
+++ ++E + + ++Y SE E R+ I+ N + + N + +F + NKF D
Sbjct: 15 AIDPQWEAFKLLHGKQY-SEYEDGARYAIFQENSRIVKQHNEEAAMGKHTFFMRMNKFGD 73
Query: 113 LSNEEFISTYLGYNKPYNEPR-------WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGS 165
++NEEF +G Y+ + S+ L + +VDWR++GAVT VK+Q QCGS
Sbjct: 74 MTNEEFQMLVIGSGLLYSNKTQQTEGGVFESLPGLKVNDTVDWRQKGAVTKVKNQEQCGS 133
Query: 166 CWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVT 225
CWAFS ++EG + LK+G LVSLSEQ LVDC N+GC GG M++AF++I GG+
Sbjct: 134 CWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCQGGLMDQAFKYIKTNGGID 193
Query: 226 TEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI-----------------------PARYA 262
TE+ YPY+GKN+R K+ T++ Y I + +
Sbjct: 194 TEECYPYKGKNERKCEYKSSCSGATLSSYVDIKTGDEDALMQASATIGPISVGIDASHPS 253
Query: 263 FQLYSHGVFDEY--CGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSP 320
FQLY HGV+ E +L+HGV VVGYG D + YWLVKNSWG WG GYI+M+RN
Sbjct: 254 FQLYDHGVYHEKRCSSKKLDHGVLVVGYGTDGEKDYWLVKNSWGEEWGMEGYIKMSRNKD 313
Query: 321 SSNIGICGILMQASYPV 337
+ CGI QASYPV
Sbjct: 314 NQ----CGIATQASYPV 326
>gi|431896622|gb|ELK06034.1| Cathepsin K [Pteropus alecto]
Length = 330
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 130/319 (40%), Positives = 178/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K Y ++Y S+ DE RR I+ N+++I N + +++L
Sbjct: 18 YPEEILDTHWELWKKSYGKQYDSKVDETSRRL-IWEKNLKHISIHNLEAALGVHTYELAM 76
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 77 NHLGDMTSEEVVQKMTGLKVPPSRSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 136
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 137 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 194
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+++ C + T A GY+ IP AR
Sbjct: 195 GIDSEDAYPYVGQDESCMYNPTG-KAAKCRGYKEIPEGNEKALKRAVARVGPISVAIDAS 253
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ Y GV +DE C LNH V VGYG G K+W++KNSWG +WG GY+ MAR
Sbjct: 254 LTSFQFYRKGVYYDENCNSDNLNHAVLAVGYGIQKGRKHWIIKNSWGENWGNKGYVLMAR 313
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 314 NKNNA----CGIANLASFP 328
>gi|410978262|ref|XP_003995514.1| PREDICTED: cathepsin L1-like [Felis catus]
Length = 333
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 134/343 (39%), Positives = 191/343 (55%), Gaps = 50/343 (14%)
Query: 31 SLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDE-WQRRFGIYSS 88
SLFL + LGI + A PQ QS++E + W + + YG ++E W+R ++
Sbjct: 4 SLFLAALCLGIASAA-----PQLN--QSLDELWSQWKATHGKLYGMDEEGWRRE--VWKK 54
Query: 89 NVQYIDYINSQNL----SFKLTDNKFADLSNEEF--ISTYLGYNKPYNEPRWPSVQYLGL 142
N++ I N ++ SF + N F D++NEEF + L K + + + +
Sbjct: 55 NMKMIRQHNWEHSQGKHSFTVAMNGFGDMTNEEFKQVMNGLQMQKHKKGKMFQAPLFAKI 114
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P+SVDWR++G VTPVKDQG CGSCWAFSA A+EG KTGKLVSLSEQ LVDC
Sbjct: 115 PSSVDWREKGYVTPVKDQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQAEG 174
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---- 258
N+GCNGG M AF+++ GG+ +E+ YPY +++ C+ K + A TG+ IP
Sbjct: 175 NEGCNGGLMNNAFQYVKDNGGLDSEESYPYHAQDESCKY-KPQDSAANDTGFFDIPQQEK 233
Query: 259 ------------------ARYAFQLYSHGV-FDEYCGHQ-LNHGVTVVGYGEDHGEK--- 295
+ + FQ Y G+ +D C + L+HGV V+GYG + G+
Sbjct: 234 ALMVAVATKGPISVGIDASHFTFQFYHEGIYYDPDCSSEDLDHGVLVIGYGTEIGQSINK 293
Query: 296 -YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YW+VKNSWG +WG GYI+MA++ + CGI AS+PV
Sbjct: 294 TYWIVKNSWGANWGIDGYIKMAKDRKNH----CGIATMASFPV 332
>gi|403367386|gb|EJY83513.1| Cathepsin L [Oxytricha trifallax]
Length = 339
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 135/343 (39%), Positives = 195/343 (56%), Gaps = 37/343 (10%)
Query: 25 LRNAVLSLFLL-WVLGIPAGAWSEGYPQKYDPQSMEER-----FENWLKQYSREYGSEDE 78
+R++V++L ++ V I A SE P +ME F N+L +Y + YG+++E
Sbjct: 1 MRSSVITLAVVGTVAAIAVVALSE-MPSSTSLYTMEVTQENVDFANYLAKYGKSYGTKEE 59
Query: 79 WQRRFGIYSSNVQYIDYINSQNL-SFKLTDNKFADLSNEEFISTYLGYNK-PYNEPRWPS 136
+Q RF Y N+ I + NS N +F L NKFAD + E+ LGY + P ++
Sbjct: 60 FQFRFQQYQQNMALIAHHNSNNENTFTLASNKFADYTPAEY-KKLLGYKRMPKANAQYAE 118
Query: 137 VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVD 196
+P S+DWR +GAVTPVKDQGQCGSCWAFS ++EG + + TG L S SEQ+LVD
Sbjct: 119 FDLTAVPDSIDWRTKGAVTPVKDQGQCGSCWAFSTTGSLEGRDAIATGTLQSYSEQQLVD 178
Query: 197 CDVNSE-NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRC--QTDK--TKHHAVTI 251
CD +++ NQGCNGG M A ++ K + E DYPY+ + +C + DK +K+ T
Sbjct: 179 CDYSTDGNQGCNGGDMGLAMDYSAK-NPLELESDYPYKAIDGKCSYKADKGHSKNKGHTN 237
Query: 252 TGYEAIPARYA-----------------FQLYSHGVFD-EYCGHQLNHGVTVVGYGEDHG 293
++P A FQ Y+ G+ + + CG L+HGV VGYG ++
Sbjct: 238 VKQNSLPDLKAAIAQGPVSVAIEADTMVFQFYNGGILNSKSCGTNLDHGVLAVGYGSENN 297
Query: 294 EKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
+ Y++VKNSWG SWGE GY+R+A+ GICGI M+ +P
Sbjct: 298 KPYYIVKNSWGPSWGEQGYLRIAQ---VDGAGICGIQMEPVFP 337
>gi|157829826|pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+
Length = 218
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 105/219 (47%), Positives = 132/219 (60%), Gaps = 24/219 (10%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP+ VDWR GAV +K QG+CG CWAFSA+A VEGINK+ TG L+SLSEQEL+DC
Sbjct: 1 LPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQ 60
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP--- 258
+GCNGGY+ F+FI GG+ TE++YPY ++ C D VTI YE +P
Sbjct: 61 NTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTIDTYENVPYNN 120
Query: 259 -------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
A AF+ YS G+F CG ++H VT+VGYG + G YW+V
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGTEGGIDYWIV 180
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
KNSW T+WGE GY+R+ RN + G CGI SYPVK
Sbjct: 181 KNSWDTTWGEEGYMRILRNVGGA--GTCGIATMPSYPVK 217
>gi|330801846|ref|XP_003288934.1| hypothetical protein DICPUDRAFT_153222 [Dictyostelium purpureum]
gi|325081026|gb|EGC34558.1| hypothetical protein DICPUDRAFT_153222 [Dictyostelium purpureum]
Length = 334
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 123/338 (36%), Positives = 183/338 (54%), Gaps = 45/338 (13%)
Query: 29 VLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
VLSL L + I + + + P + F W+K + + Y S DE+ R++ +
Sbjct: 8 VLSLLFLSINIIAS-------SRVFTPNQYQSSFVQWMKSHGKAY-SHDEFARKYRTFQD 59
Query: 89 NVQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYN---KPYNEPRWPSVQYLGLPAS 145
N+ Y+ NS+N L N FAD++N E+ +T LG + +P+ PR + + LP S
Sbjct: 60 NMDYVHQWNSKNSETVLGLNNFADMNNVEYRNTLLGASIEVEPFRTPR--TFSRIQLPTS 117
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
VDWR++GAV +KDQG CGSC++FSA+ A E + G++++LSEQ ++DC + N+G
Sbjct: 118 VDWREKGAVHDIKDQGHCGSCYSFSAIGAAESAYYIANGEMLTLSEQNILDCSRSYGNEG 177
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK-------------------- 245
CNGGYM ++F+F+ GG +E YPY K+ C+ D K
Sbjct: 178 CNGGYMLESFQFLLDQGGAVSEASYPYEAKDASCRFDSVKTPIVATFNGTVEIRRGDEGD 237
Query: 246 -HHAVTITGYEAI---PARYAFQLYSHGVFDE-YC-GHQLNHGVTVVGYGEDH--GEKYW 297
A+ G A+ +FQLY GV+ E YC + L+H V VGY D G+ YW
Sbjct: 238 LQQAIATHGPVAVAIDAGHISFQLYKTGVYYEPYCSSYSLSHAVLAVGYDTDSVTGKDYW 297
Query: 298 LVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASY 335
+V NSWG WG++G+I+MARN + CGI +SY
Sbjct: 298 IVANSWGLKWGDSGFIKMARNRGNH----CGISTMSSY 331
>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
boliviensis]
gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
boliviensis]
gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
boliviensis]
Length = 333
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 175/317 (55%), Gaps = 42/317 (13%)
Query: 56 QSMEERFENWLKQYSREYG-SEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKF 110
S+E ++ W ++R YG +E+EW+R ++ N++ I+ N + SF + N F
Sbjct: 23 HSLEAQWIKWKAMHNRLYGKNEEEWRR--AVWEKNMKTIELHNHEYNQGKHSFTMAMNTF 80
Query: 111 ADLSNEEFISTYLGYN--KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
D++NEEF G+ KP N + P SVDWR++G VTPVK+QGQCGSCWA
Sbjct: 81 GDMTNEEFRQVMNGFQNRKPRNGKVFQEPLLHEAPRSVDWREKGYVTPVKNQGQCGSCWA 140
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSA A+EG KTGKLVSLSEQ LVDC NQGCNGG M+ AF+++ + GG+ +E+
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNQGCNGGLMDYAFQYVQENGGLDSEE 200
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLY 266
YPY + C+ + K+ TG+ IP +FQ Y
Sbjct: 201 SYPYEATEESCKYN-PKYSVANDTGFVDIPKLEKALMKAVATVGPISVAIDAGHESFQFY 259
Query: 267 SHGV-FDEYCGHQ-LNHGVTVVGYGEDH----GEKYWLVKNSWGTSWGEAGYIRMARNSP 320
G+ F+ C + ++HGV VVGYG + KYWLVKNSWG WG GYI+MA++
Sbjct: 260 KEGIYFEPECSSEDMDHGVLVVGYGFERTGSDNSKYWLVKNSWGEEWGMDGYIKMAKDRK 319
Query: 321 SSNIGICGILMQASYPV 337
+ CGI ASYP
Sbjct: 320 NH----CGIASAASYPT 332
>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
Length = 331
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 130/335 (38%), Positives = 186/335 (55%), Gaps = 41/335 (12%)
Query: 34 LLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI 93
L+WVL + + A ++ + DP +++ ++ W K Y ++Y ++E R I+ N++ +
Sbjct: 4 LVWVLLLCSSAMAQLHR---DP-TLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKTV 59
Query: 94 DYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPASV 146
N ++ S+ L N D+++EE IS P PR + S LP S+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVRVPSQWPRNVTYKSNPNQKLPDSM 119
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-ENQG 205
DWR++G VT VK QG CGSCWAFSAV A+E K+KTG+LVSLS Q LVDC N+G
Sbjct: 120 DWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCSTEKYRNKG 179
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------- 258
CNGG+M +AF++I G+ +E YPY+ + +C+ D +K+ A T + Y +P
Sbjct: 180 CNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKCKYD-SKNRAATCSRYTELPFADEYAL 238
Query: 259 ----------------ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKN 301
+F Y GV +D C +NHGV VVGYG +G+ YWLVKN
Sbjct: 239 KEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKDYWLVKN 298
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
SWG ++G+ GYIRMARNS + CGI SYP
Sbjct: 299 SWGLNFGDGGYIRMARNSENH----CGIANYPSYP 329
>gi|384941728|gb|AFI34469.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 135/349 (38%), Positives = 186/349 (53%), Gaps = 59/349 (16%)
Query: 30 LSLFLL-WVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
LSL L + LGI + K+D Q+++ ++ W + R YG+ +E RR ++
Sbjct: 3 LSLVLAAFCLGIASAV------PKFD-QNLDTKWYQWKATHRRLYGASEEGWRR-AVWEK 54
Query: 89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTY-------LGYNKPYNEPRWPSV 137
N++ I+ N + F + N F D++NEEF L K + EP
Sbjct: 55 NMKMIELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFREPL---- 110
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+L LP SVDWRK+G VTPVK+Q QCGSCWAFSA A+EG KTGKLVSLSEQ LVDC
Sbjct: 111 -FLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
NQGCNGG+M AF ++ + GG+ +E+ YPY + C+ ++++ TG+E +
Sbjct: 170 SRPQGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKY-RSENSVANDTGFEVV 228
Query: 258 PA-----------------------RYAFQLYSHGV-FDEYCGHQ-LNHGVTVVGYG--- 289
PA +FQ Y G+ F+ C + L+HGV VVGYG
Sbjct: 229 PAGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEG 288
Query: 290 -EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KYWLVKNSWG WG GY+++A++ + CGI ASYP
Sbjct: 289 ANSDNNKYWLVKNSWGPEWGSNGYVKIAKDKDNH----CGIATAASYPT 333
>gi|28932704|gb|AAO60046.1| midgut cysteine proteinase 3 [Rhipicephalus appendiculatus]
Length = 334
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 139/306 (45%), Positives = 171/306 (55%), Gaps = 43/306 (14%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYI-----DYINSQNLSFKLTDNKFADLSNEEFISTYL 123
+ +EY S+ E R IY N I Y SQ +S+KL N+F DL + EF+ST
Sbjct: 34 HGKEYDSDTEEYYRLKIYMENRLKIARHNEKYAKSQ-VSYKLAMNEFGDLLHHEFVSTRN 92
Query: 124 GYNKPYNE-PRWPSV-------QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
G+ + Y + PR S + L LP +VDWRK+GAVTPVK+QGQCGSCWAFS ++
Sbjct: 93 GFKRNYRDTPREGSFFIEPEGFEDLHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSL 152
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EG + K KLVSLSEQ LVDC N GC GG M+ AF++I G+ TE YPY
Sbjct: 153 EGQHFRKMRKLVSLSEQNLVDCMQKLGNNGCGGGLMDNAFKYIKANKGIDTELSYPYNAT 212
Query: 236 NDRCQTDKTKHHAVTITGYEAIPAR----------------------YAFQLYSHGVFD- 272
+ C K+ A T TG+E IPAR +FQ YS GV D
Sbjct: 213 DGVCHFKKSGVGA-TATGFEDIPARDENSWDAVAPVGPVSVAIDASHESFQFYSEGVLDE 271
Query: 273 -EYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
E QL+HGV VVGYG G+ YWLVKNSWGT+WG+ GYI M RN + CGI
Sbjct: 272 PECSSDQLDHGVLVVGYGTKDGQDYWLVKNSWGTTWGDEGYIYMTRNKDNQ----CGIAS 327
Query: 332 QASYPV 337
ASYP+
Sbjct: 328 SASYPL 333
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 129/341 (37%), Positives = 185/341 (54%), Gaps = 42/341 (12%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
LS+ L +L + GA S + Q + ++N+ ++++Y R I+ N
Sbjct: 4 LSMKFL-ILAVLVGAASAALTLE---QLFDAEWQNFKVHHNKKYEGSTVEAFRKKIFLQN 59
Query: 90 VQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYLGY---NKPYNEPRWPSVQYLGL 142
I N ++ ++KL N+F D+ + EF+ST G N+ Y W + + L
Sbjct: 60 THLIARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLRSNRTYFGSTWIEPESVSL 119
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P SVDWR++GAVTPVK+QG CGSCW+FS A+EG KTG+LVSLSEQ L+DC +
Sbjct: 120 PKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYG 179
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---- 258
N GC GG M+ AF +I + G+ TE+ YPY GK +C+ K + A TG+ IP
Sbjct: 180 NNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHK-EDSAGRDTGFVDIPSGNE 238
Query: 259 -------------------ARYAFQLYSHGVFD--EYCGHQLNHGVTVVGYG-EDHGEKY 296
+ +FQ Y GV++ + H L+HGV VGYG D G+ Y
Sbjct: 239 RALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQDY 298
Query: 297 WLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+++KNSWG WG+ GY+ MARNS + CG+ QASYP+
Sbjct: 299 YIIKNSWGERWGQEGYVLMARNSKNE----CGVATQASYPL 335
>gi|332220191|ref|XP_003259241.1| PREDICTED: cathepsin K [Nomascus leucogenys]
Length = 329
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 131/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y ++ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYNNKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPPSHSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|159792912|gb|ABW98676.1| cathepsin L [Apostichopus japonicus]
Length = 332
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 177/317 (55%), Gaps = 42/317 (13%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFA 111
+ ++ +E W + +S++Y E+E RR I+ N+Q + N+++ S+ L NK+A
Sbjct: 22 KGFDDTWEAWKQTHSKQYTKEEEDNRR-KIWEDNLQKVSKHNTEHSLGLHSYTLGMNKYA 80
Query: 112 DLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSC 166
DL EEF+ G + R +++L P SVDWR EG VTPVKDQGQCGSC
Sbjct: 81 DLRGEEFVQMMNGLKFDASRER-QGIKFLSYAKFQAPDSVDWRDEGYVTPVKDQGQCGSC 139
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFS ++EG + TG L SLSEQ LVDC ++ N GC GG M+ AF++I G+ T
Sbjct: 140 WAFSTTGSLEGQHFRSTGVLTSLSEQNLVDCSISYGNNGCEGGLMDYAFQYIKDNLGIDT 199
Query: 227 EDDYPYRGKNDRCQTDKTKHHAVTITGY----------------------EAIPARY-AF 263
ED YPY ++D C+ + T +GY AI A + +F
Sbjct: 200 EDKYPYEAEDDTCRF-SPDNVGATDSGYVDVDSGDEDALKEACAANGPISVAIDASHESF 258
Query: 264 QLYSHGVFD-EYCGH-QLNHGVTVVGYGEDH-GEKYWLVKNSWGTSWGEAGYIRMARNSP 320
QLY GV+D E C +L+HGV VVGYG D G YW+VKNSWG SWG+ GYI M+RN
Sbjct: 259 QLYESGVYDEESCSSIELDHGVLVVGYGTDSVGGDYWIVKNSWGLSWGQEGYIWMSRNKD 318
Query: 321 SSNIGICGILMQASYPV 337
+ CGI ASYP
Sbjct: 319 NQ----CGIATSASYPT 331
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 129/304 (42%), Positives = 170/304 (55%), Gaps = 39/304 (12%)
Query: 69 YSREYGSEDEWQRRFGIYSSNVQYIDYIN----SQNLSFKLTDNKFADLSNEEFISTYLG 124
+ +EY S+ E + R IY N + N S+++ NKF DL + EF S G
Sbjct: 34 HKKEYPSQLEEKFRMKIYLENKHKVAKHNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNG 93
Query: 125 Y-NKPYNEPRWPSV------QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEG 177
Y +K N R S + +P SVDWR++GA+TPVKDQGQCG CWAFS+ A+EG
Sbjct: 94 YQHKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQCGPCWAFSSTGALEG 153
Query: 178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
KTGKLVSL EQ L+DC N+GCNGG M++AF++I G+ TE+ YPY ++D
Sbjct: 154 QTFRKTGKLVSLREQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDD 213
Query: 238 RC--------------------QTDKTKHHAVTITGYE-AIPARY-AFQLYSHGVFDE-Y 274
C + DK K T+ AI A + +FQ YS GV+ E
Sbjct: 214 VCRYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPS 273
Query: 275 C-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQA 333
C L+HGV VVGYG D+G+ YWLVKNSW WG+ GYI++ARN + CG+ A
Sbjct: 274 CDSDDLDHGVLVVGYGSDNGKDYWLVKNSWSEHWGDQGYIKIARNRKNH----CGVATAA 329
Query: 334 SYPV 337
SYP+
Sbjct: 330 SYPL 333
>gi|417399134|gb|JAA46597.1| Putative cathepsin l1 [Desmodus rotundus]
Length = 335
Score = 213 bits (541), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 126/321 (39%), Positives = 179/321 (55%), Gaps = 41/321 (12%)
Query: 52 KYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTD 107
K+D S+ + W Y R YG+++E RR ++ N + I+ N + F +
Sbjct: 20 KFD-HSLNAEWYQWKATYRRLYGADEEGWRR-AVWEKNRKMIELHNREYSQRKHGFTMAM 77
Query: 108 NKFADLSNEEF---ISTYLGYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
N F D++NEEF ++ +L + N + + +P+SVDWR++G VTPVK+QGQCG
Sbjct: 78 NAFGDMTNEEFRQVMNGFLKQKQHRNGRLFREPLFAEIPSSVDWRQKGYVTPVKNQGQCG 137
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
SCWAFSA A+EG KTGKLVSLSEQ LVDC + NQGCNGG M+ AF+++ G+
Sbjct: 138 SCWAFSANGALEGQMFRKTGKLVSLSEQNLVDCSHSQGNQGCNGGLMDNAFQYVKDNKGL 197
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYA 262
+E+ YPY G+ + ++ A TG+ IP +
Sbjct: 198 DSEESYPYLGRESNTCNYRPEYSAANDTGFVDIPQHERGLMKAVATVGPISVAIDAGHSS 257
Query: 263 FQLYSHGVFDE-YCGHQ-LNHGVTVVGYGEDHGE----KYWLVKNSWGTSWGEAGYIRMA 316
FQ YS G++ E C + L+HGV VVGYG + + K+W+VKNSWGT WG +GY++MA
Sbjct: 258 FQFYSEGIYYEPNCSSKDLDHGVLVVGYGSEGAQSDSNKFWIVKNSWGTGWGMSGYVKMA 317
Query: 317 RNSPSSNIGICGILMQASYPV 337
R+ + CGI ASYP
Sbjct: 318 RDQSNH----CGIATAASYPT 334
>gi|228244|prf||1801240B Cys protease 2
Length = 323
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 172/312 (55%), Gaps = 47/312 (15%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEF 118
E++ +Y R+Y +E R I+ N +YI+ N + ++F L NKF D++ EEF
Sbjct: 21 EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80
Query: 119 ISTYLGYNKPYNEPR--------WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+ G N PR +P + VDWR +GAVTPVKDQGQCGSCWAFS
Sbjct: 81 NAVMKG-----NIPRRSAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFS 135
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
++EG + LKTG L+SL+EQ+LVDC QGCNGG+M AF++I G+ TE Y
Sbjct: 136 TTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEASY 195
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQLYS 267
PY ++ C+ D + A T +G+ I A +FQ YS
Sbjct: 196 PYEARDGSCRFD-SNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYS 254
Query: 268 HGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
GV+ E C L+H V VGYG + G+ +WLVKNSW TSWG+AGYI+M+RN ++
Sbjct: 255 SGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNN--- 311
Query: 326 ICGILMQASYPV 337
CGI ASYP+
Sbjct: 312 -CGIATVASYPL 322
>gi|313118762|gb|ADR32293.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 134/218 (61%), Gaps = 24/218 (11%)
Query: 143 PASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSE 202
P SVDWR +G + VKDQG CGSCWAFSAVAA+E IN + TG L+SLSEQELVDCD S
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCD-KSY 60
Query: 203 NQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-- 260
N+GC+GG M+ AFEF+ GG+ +E+DYPY+ +ND C + V I YE +P
Sbjct: 61 NEGCDGGLMDYAFEFVINNGGIDSEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNE 120
Query: 261 --------------------YAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLVK 300
FQ Y G+F CG ++HGV GYG ++G YW+V+
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGTENGMDYWIVR 180
Query: 301 NSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
NSWG WGE GY+R+ RN S+ G+CG+ + SYPVK
Sbjct: 181 NSWGAKWGEKGYLRVQRNIARSS-GLCGLATEPSYPVK 217
>gi|302831223|ref|XP_002947177.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
gi|300267584|gb|EFJ51767.1| hypothetical protein VOLCADRAFT_103269 [Volvox carteri f.
nagariensis]
Length = 514
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 134/343 (39%), Positives = 171/343 (49%), Gaps = 72/343 (20%)
Query: 62 FENWLKQYSREYGSED-EWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADLSNEEFIS 120
F W +QY R Y + E+ RR I+S NV+ I + ++ L N++ADL+ EEF S
Sbjct: 38 FTLWSRQYGRTYVEQSPEYTRRLSIFSDNVRAIQESHEKDPGVTLALNEYADLTWEEFSS 97
Query: 121 TYLGYNKPYNE------------PRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
T LG ++ W + P ++DWR++GAV VK+QGQCGSCWA
Sbjct: 98 TRLGLRIDQDQLDRRSRRSASRRNAWRYAAAVDNPKAIDWREKGAVAEVKNQGQCGSCWA 157
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDV-------------------------NSEN 203
FS A+EGIN + TG+L SLSEQ+LVDCD N N
Sbjct: 158 FSTTGAIEGINAIVTGQLQSLSEQQLVDCDTGKRTVTRSKRSCTVILPSYSSNSCRNESN 217
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK-------NDRCQTDKTKHHAVTITGYEA 256
GC+GG M+ AF+++ + GG+ TE DY Y N R QTD+ AV+I GYE
Sbjct: 218 MGCSGGLMDDAFKYVIQNGGLDTEQDYAYWSGYGLGFWCNKRKQTDRP---AVSIDGYED 274
Query: 257 IP--------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYG-EDHGEK 295
+P A + Q YS GV C LNHGV VGY GEK
Sbjct: 275 VPQGEDNLLKAVAHQPVAVAICAGASMQFYSRGVISTCC-EGLNHGVLTVGYNVSQDGEK 333
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
YW+VKNSWG WGE GY R+ G+CGI ASYP K
Sbjct: 334 YWIVKNSWGAGWGEQGYFRLKMG--VGETGLCGIASAASYPTK 374
>gi|300175245|emb|CBK20556.2| unnamed protein product [Blastocystis hominis]
Length = 325
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/315 (38%), Positives = 172/315 (54%), Gaps = 41/315 (13%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D S+E +F + K++ + Y E+E + R ++S+N++ +DY NS+ SF L F DL
Sbjct: 16 DTLSVELQFAAFEKKFGKTYVGEEERRFRMSVFSNNLKIVDYYNSKQSSFVLGITPFIDL 75
Query: 114 SNEEFISTYLGYNKPYNEP---------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCG 164
SN+EF + N + + + S Y LP S+DWR + V+ VKDQ CG
Sbjct: 76 SNDEFRERFAS-NTAFEKKAKSVESSSSQQTSQDYSSLPRSIDWRAKNTVSSVKDQKNCG 134
Query: 165 SCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGV 224
+CWAF+AVA++EG+ KTGK++ S Q+LVDCD +S GC+GG M A+E++ G+
Sbjct: 135 ACWAFAAVASIEGVYAQKTGKILDFSPQQLVDCDYSS--LGCSGGLMTYAYEYVMN-NGI 191
Query: 225 TTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPA----------------------RYA 262
+ E DYPY+ C K +I GY +P
Sbjct: 192 SLESDYPYKASQGSC---KKVDFVTSIMGYYEVPVGSTYELLKATTKNPVSVAIGADSIF 248
Query: 263 FQLYSHGVF-DEYCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
FQLY+ G+ +E CG LNHGV +VGY D + +VKNSWG SWGE GYIR+A +
Sbjct: 249 FQLYTSGILAEELCGTTLNHGVLLVGYELDTATPFLIVKNSWGASWGEKGYIRLALS--D 306
Query: 322 SNIGICGILMQASYP 336
S G CGI + ASYP
Sbjct: 307 SYAGTCGINLMASYP 321
>gi|261289783|ref|XP_002611753.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
gi|229297125|gb|EEN67763.1| hypothetical protein BRAFLDRAFT_236364 [Branchiostoma floridae]
Length = 307
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 125/297 (42%), Positives = 171/297 (57%), Gaps = 42/297 (14%)
Query: 78 EWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYLG---YNKPYN 130
E RR I+ +N + I+ N++ ++ L N+FA ++N+EF++ +G ++ +
Sbjct: 15 EESRRMEIFENNTKLINLHNNEADLGMHTYWLGHNQFAHMTNDEFVANVIGGCLLDRNAS 74
Query: 131 EPRWPSV-QY----LGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGK 185
+ V QY + LP +VDWR +G VTPVK+Q QCGSCWAFS ++EG KTGK
Sbjct: 75 KSTADRVHQYDSNLVELPDTVDWRTKGYVTPVKNQEQCGSCWAFSTTGSLEGQTFKKTGK 134
Query: 186 LVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTK 245
LVSLSEQ LVDC NQGCNGG M+ AF++I GG+ TED YPY ++ +C+ K
Sbjct: 135 LVSLSEQNLVDCSGEFGNQGCNGGLMDDAFKYIKANGGIDTEDSYPYEARDGKCRF-KPA 193
Query: 246 HHAVTITGYEAI-----------------------PARYAFQLYSHGVFDE-YCGH-QLN 280
T+TGY I + + FQ+YSHGV+ E C +L+
Sbjct: 194 DVGATVTGYTDISEGDEGALTQAVATVGPISVAIDASHHTFQMYSHGVYYEPQCSSTELD 253
Query: 281 HGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
HGV VGYG + G+ YWLVKNSWG WG+ GYI M+RN + CGI ASYP+
Sbjct: 254 HGVLAVGYGTEGGKDYWLVKNSWGEVWGQNGYIMMSRNKNNQ----CGIATSASYPL 306
>gi|351629617|gb|AEQ54772.1| KDEL-tailed cysteine proteinase CP4, partial [Coffea canephora]
Length = 215
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 108/202 (53%), Positives = 134/202 (66%), Gaps = 26/202 (12%)
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G+CGSCWAFS V VEGINK+KTG+LVSLSEQELVDC+ ++N+GCNGG ME A+EFI K
Sbjct: 1 GKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQELVDCE--TDNEGCNGGLMENAYEFIKK 58
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPAR-------------------- 260
GG+TTE YPY+ ++ C + K AVTI G+E +PA
Sbjct: 59 SGGITTERLYPYKARDGSCDSSKMNAPAVTIDGHEMVPANDENALMKAVANQPVSVAIDA 118
Query: 261 --YAFQLYSHGVFD-EYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIRMA 316
Q YS GV+ + CG++L+HGV VVGYG G KYW+VKNSWGT WGE GYIRM
Sbjct: 119 SGSDMQFYSEGVYTGDSCGNELDHGVAVVGYGTALDGTKYWIVKNSWGTGWGEQGYIRMQ 178
Query: 317 RNSPSSNIGICGILMQASYPVK 338
R ++ G+CGI M+ASYP+K
Sbjct: 179 RGVDAAEGGVCGIAMEASYPLK 200
>gi|390476660|ref|XP_003735160.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin K [Callithrix jacchus]
Length = 329
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 132/319 (41%), Positives = 177/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ +E W K + ++Y S+ DE RR I+ N++YI N + +++L
Sbjct: 17 YPEEILDTHWELWKKTHRKQYNSKVDEISRRL-IWEKNLKYISIHNLEASLGVHTYELAM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + R Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPTSYSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPISVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MAR
Sbjct: 253 LTSFQFYSKGVYYDESCNSDNLNHAVLAVGYGILKGNKHWIIKNSWGENWGNKGYILMAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 130/308 (42%), Positives = 168/308 (54%), Gaps = 37/308 (12%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEEF 118
E+W + + Y S E + R I+ N I N++ + ++ + N + DL + EF
Sbjct: 30 ESWKLTHQKGYDSSVEEKLRLKIFMENSLRISRHNAEAIQGRHTYFMKMNHYGDLLHHEF 89
Query: 119 ISTYLGY---NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
++ GY NK + + + LP VDWR+EGAVTPVK+QGQCGSCW+FSA ++
Sbjct: 90 VAMVNGYIYNNKTTLGGTFIPSKNINLPEHVDWREEGAVTPVKNQGQCGSCWSFSATGSL 149
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EG + KTGKL+SLSEQ LVDC N GC GG M+ AF++I G+ TE YPY G
Sbjct: 150 EGQDFRKTGKLISLSEQNLVDCSRKYGNNGCEGGLMDYAFKYIQDNNGIDTEASYPYEGI 209
Query: 236 NDRCQTD-------------------KTKHHAVTITG--YEAIPARY-AFQLYSHGVFDE 273
+ C D K A+ G AI A + +FQ YSHGV+ E
Sbjct: 210 DGHCHYDPKNKGGSDIGFVDIKKGSEKDLQKALATVGPISVAIDASHMSFQFYSHGVYSE 269
Query: 274 -YCG-HQLNHGVTVVGYGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
C L+HGV VGYG D GE YWLVKNSW WGE GYI+MARN + +CGI
Sbjct: 270 KKCSPENLDHGVLAVGYGTDEVTGEDYWLVKNSWSEKWGEDGYIKMARNKDN----MCGI 325
Query: 330 LMQASYPV 337
ASYPV
Sbjct: 326 ASSASYPV 333
>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
Length = 331
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/335 (39%), Positives = 182/335 (54%), Gaps = 38/335 (11%)
Query: 34 LLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI 93
+ W+ + G + + DP +++ ++ W K YS+ Y + E R I+ N++++
Sbjct: 1 MKWLACVLLGCSAAVAQLQRDP-TLDRHWDLWKKTYSKHYREKIEEVARRLIWEKNLKFV 59
Query: 94 DYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPASV 146
N ++ S+ L N D+++EE IS P R + S LP S+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMGSLTVPSQWQRNVTYKSNPNQKLPDSL 119
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-ENQG 205
DWR +G VT VK QG CGSCWAFSAV A+E KLKTGKLVSLS Q LVDC N+G
Sbjct: 120 DWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYSNKG 179
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------- 258
CNGG+M AF++I G+ +E YPY+ ++ +CQ D +K A T + Y +P
Sbjct: 180 CNGGFMTSAFQYIIDNNGIDSEASYPYKAQDGKCQYD-SKFRAATCSKYTELPFGSEEAL 238
Query: 259 ----------------ARYAFQLYSHGV-FDEYCGHQLNHGVTVVGYGEDHGEKYWLVKN 301
+ +F LY GV +D+ C ++NHGV VVGYG G+ YWLVKN
Sbjct: 239 KEAVANKGPVSVAIDASHPSFFLYRSGVYYDQSCTLKVNHGVLVVGYGNLDGKDYWLVKN 298
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
SWG ++G+ GYIRMARNS + CGI SYP
Sbjct: 299 SWGLNFGDKGYIRMARNSGNH----CGIASYPSYP 329
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 131/308 (42%), Positives = 170/308 (55%), Gaps = 37/308 (12%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEEF 118
E+W + + Y S E + R I+ N I N++ + S+ + N + DL + EF
Sbjct: 28 ESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDLLHHEF 87
Query: 119 ISTYLGY---NKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
++ GY NK + + + LP VDWR++GAVTPVK+QGQCGSCWAFS+ ++
Sbjct: 88 VAMVNGYEYVNKTSLGGSFIPSKNVKLPTHVDWREDGAVTPVKNQGQCGSCWAFSSTGSL 147
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EG KTGKL+ LSEQ LVDC N GC GG M+ AF +I G+ TE YPY G
Sbjct: 148 EGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGIDTEGSYPYEGV 207
Query: 236 NDRCQTDKTKHHAVTI------TGYE---------------AIPARY-AFQLYSHGV-FD 272
RC D +K + I G E AI A + +FQ YSHGV F+
Sbjct: 208 GGRCHYDPSKKGSSDIGFVDVKKGSEEELLKAVASVGPVSVAIDASHMSFQFYSHGVYFE 267
Query: 273 EYCG-HQLNHGVTVVGYG--EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGI 329
C L+HGV VVGYG E+ GE YWLVKNSW +WG+ GYI+MARN + +CGI
Sbjct: 268 SKCSPENLDHGVLVVGYGTDENSGEDYWLVKNSWSENWGDQGYIKMARNKKN----MCGI 323
Query: 330 LMQASYPV 337
ASYPV
Sbjct: 324 ASSASYPV 331
>gi|410968296|ref|XP_003990643.1| PREDICTED: cathepsin K [Felis catus]
Length = 330
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/314 (41%), Positives = 177/314 (56%), Gaps = 43/314 (13%)
Query: 58 MEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFAD 112
++ ++E W K Y ++Y ++ DE RR I+ N+++I N + +++L N D
Sbjct: 23 LDTQWELWKKTYGKQYNNKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAMNHLGD 81
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCW 167
+++EE + G P + R Y+ P S+D+RK+G VTPVK+QGQCGSCW
Sbjct: 82 MTSEEVVQKMTGLKVPPSRSRSNDTLYIPDWESRAPDSIDYRKKGYVTPVKNQGQCGSCW 141
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K G+ +E
Sbjct: 142 AFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNRGIDSE 199
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY------------AFQ 264
D YPY G+++ C + T A GY IP AR +FQ
Sbjct: 200 DAYPYVGQDESCMYNPT-GKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQ 258
Query: 265 LYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
YS GV +DE C LNH V VGYG G K+W++KNSWG +WG GYI MARN ++
Sbjct: 259 FYSKGVYYDENCNSDNLNHAVLAVGYGIQKGNKHWIIKNSWGENWGNKGYILMARNKNNA 318
Query: 323 NIGICGILMQASYP 336
CGI AS+P
Sbjct: 319 ----CGIANLASFP 328
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 126/338 (37%), Positives = 184/338 (54%), Gaps = 43/338 (12%)
Query: 33 FLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQY 92
FL+ + + A + + Q +D + ++N+ ++++Y R I+ N
Sbjct: 3 FLILAVLVGAASAALTLEQLFDAE-----WQNFKVHHNKKYEGSTVEAFRKKIFLQNTHL 57
Query: 93 IDYINSQN----LSFKLTDNKFADLSNEEFISTYLGY---NKPYNEPRWPSVQYLGLPAS 145
I N ++ ++KL N+F D+ + EF+ST G N+ Y W + + LP S
Sbjct: 58 IARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLRSNRTYFGSTWIEPESVSLPKS 117
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
VDWR++GAVTPVK+QG CGSCW+FS A+EG KTG+LVSLSEQ L+DC + N G
Sbjct: 118 VDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNG 177
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------- 258
C GG M+ AF +I + G+ TE+ YPY GK +C+ K + A TG+ IP
Sbjct: 178 CGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHK-EDSAGRDTGFVDIPSGNERAL 236
Query: 259 ----------------ARYAFQLYSHGVFD--EYCGHQLNHGVTVVGYG-EDHGEKYWLV 299
+ +FQ Y GV++ + H L+HGV VGYG D G+ Y+++
Sbjct: 237 AKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDGQDYYII 296
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KNSWG WG+ GY+ MARNS + CG+ QASYP+
Sbjct: 297 KNSWGERWGQEGYVLMARNSKNE----CGVATQASYPL 330
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/306 (41%), Positives = 170/306 (55%), Gaps = 40/306 (13%)
Query: 68 QYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYL 123
++ + Y SE E R IY N I N + + + + N+F D+ + EF+ST
Sbjct: 33 KHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTRN 92
Query: 124 GYNKPY-NEPRWPS-------VQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAV 175
G+ + Y ++PR S ++ LP +VDWR +GAVTPVK+QGQCGSCWAFSA ++
Sbjct: 93 GFKRNYKDQPREGSTYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATGSL 152
Query: 176 EGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGK 235
EG + K+G +VSLSEQ LV C + N GC GG M+ AF++I G+ TE YPY G
Sbjct: 153 EGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPYNGT 212
Query: 236 NDRCQTDKTK-------------------HHAVTITG--YEAIPARY-AFQLYSHGVFDE 273
+ C K+ AV G AI A + +FQ YS GV+DE
Sbjct: 213 DGTCHFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDGVYDE 272
Query: 274 -YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILM 331
C + L+HGV VVGYG +G YW VKNSWGT+WG+ GYIRM+RN + CGI
Sbjct: 273 PECDSESLDHGVLVVGYGTLNGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQ----CGIAS 328
Query: 332 QASYPV 337
AS P+
Sbjct: 329 SASIPL 334
>gi|118123|sp|P25782.1|CYSP2_HOMAM RecName: Full=Digestive cysteine proteinase 2; Flags: Precursor
gi|11053|emb|CAA45128.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 323
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 172/312 (55%), Gaps = 47/312 (15%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEF 118
E++ +Y R+Y +E R I+ N +YI+ N + ++F L NKF D++ EEF
Sbjct: 21 EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80
Query: 119 ISTYLGYNKPYNEPR--------WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFS 170
+ G N PR +P + VDWR +GAVTPVKDQGQCGSCWAFS
Sbjct: 81 NAVMKG-----NIPRRSAPVSVFYPKKETGPQATEVDWRTKGAVTPVKDQGQCGSCWAFS 135
Query: 171 AVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDY 230
++EG + LKTG L+SL+EQ+LVDC QGCNGG+M AF++I G+ TE Y
Sbjct: 136 TTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAAY 195
Query: 231 PYRGKNDRCQTDKTKHHAVTITGYEAI-----------------------PARYAFQLYS 267
PY ++ C+ D + A T +G+ I A +FQ YS
Sbjct: 196 PYEARDGSCRFD-SNSVAATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSFQFYS 254
Query: 268 HGVFDE-YCGHQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIG 325
GV+ E C L+H V VGYG + G+ +WLVKNSW TSWG+AGYI+M+RN ++
Sbjct: 255 SGVYYEPSCSPSYLDHAVLAVGYGSEGGQDFWLVKNSWATSWGDAGYIKMSRNRNNN--- 311
Query: 326 ICGILMQASYPV 337
CGI ASYP+
Sbjct: 312 -CGIATVASYPL 322
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 129/323 (39%), Positives = 173/323 (53%), Gaps = 48/323 (14%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADL 113
++E + + Q+ Y SE E R IY+ + I N + +S+KL NK+ D+
Sbjct: 23 VKEEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIAKHNQKYEMGLVSYKLGMNKYGDM 82
Query: 114 SNEEFISTYLGYNKPYNE-------------PRWPSVQYLGLPASVDWRKEGAVTPVKDQ 160
+ EF+ T G+NK ++ S + LP VDWRK GAVT +KDQ
Sbjct: 83 LHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISPANVKLPEQVDWRKHGAVTDIKDQ 142
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G+CGSCW+FS A+EG + ++G LVSLSEQ L+DC N GCNGG M+ AF++I
Sbjct: 143 GKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDCSEQYGNNGCNGGLMDNAFKYIKD 202
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP---------------------- 258
GG+ TE YPY G +D+C+ + K+ G+ IP
Sbjct: 203 NGGIDTEQTYPYEGVDDKCRYN-PKNTGAEDVGFVDIPEGDEQKLMEAVATVGPVSVAID 261
Query: 259 -ARYAFQLYSHGVF--DEYCGHQLNHGVTVVGYGED-HGEKYWLVKNSWGTSWGEAGYIR 314
+ +FQLYS GV+ +E L+HGV VVGYG D G YWLVKNSWG SWGE GYI+
Sbjct: 262 ASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGTDEQGVDYWLVKNSWGRSWGELGYIK 321
Query: 315 MARNSPSSNIGICGILMQASYPV 337
M RN + CGI ASYP+
Sbjct: 322 MIRNKNNR----CGIASSASYPL 340
>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
Length = 333
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 177/317 (55%), Gaps = 42/317 (13%)
Query: 56 QSMEERFENWLKQYSREYG-SEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKF 110
+S+E ++ W ++R YG +E+EW+R ++ N++ I+ N + SF + N F
Sbjct: 23 RSLEAQWIKWKAMHNRLYGMNEEEWRR--AVWEKNMKMIELHNHEYNQGKHSFTMAMNAF 80
Query: 111 ADLSNEEFISTYLGYN--KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWA 168
D++NEEF G+ KP N + + P SVDWR++G VTPVK+QGQCGSCWA
Sbjct: 81 GDMTNEEFRQVMNGFQNRKPRNGKVFQEPLFHEAPRSVDWREKGYVTPVKNQGQCGSCWA 140
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSA A+EG KTGKLVSLSEQ LVDC NQGC+GG M+ AF+++ + GG+ +E+
Sbjct: 141 FSATGALEGQMFRKTGKLVSLSEQNLVDCSGPQGNQGCDGGLMDYAFQYVQENGGLDSEE 200
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIP----------------------ARYAFQLY 266
YPY + C+ + ++ TG+ IP +FQ Y
Sbjct: 201 SYPYEATEESCKYN-PEYSVANDTGFVDIPKLEKALMKAVATVGPISVAIDAGHESFQFY 259
Query: 267 SHGV-FDEYCGHQ-LNHGVTVVGYGEDH----GEKYWLVKNSWGTSWGEAGYIRMARNSP 320
G+ F+ C + ++HGV VVGYG + KYWLVKNSWG WG GYI+MA++
Sbjct: 260 KEGIYFEPECSSEDMDHGVLVVGYGFERTGSDNSKYWLVKNSWGEKWGMDGYIKMAKDRK 319
Query: 321 SSNIGICGILMQASYPV 337
+ CGI ASYP
Sbjct: 320 NH----CGIASAASYPT 332
>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
Length = 331
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 128/335 (38%), Positives = 183/335 (54%), Gaps = 41/335 (12%)
Query: 34 LLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI 93
L+W L + A ++ + DP +++ + W K Y ++Y ++E R I+ N++++
Sbjct: 4 LVWTLLVCCSAMAQLHR---DP-ALDHHWHLWKKTYGKQYTEKNEETERRLIWEKNLKFV 59
Query: 94 DYINSQNL----SFKLTDNKFADLSNEEFISTYLGYNKPYNEPR---WPSVQYLGLPASV 146
N ++ S+ L N D+++EE +S P R + S LP S+
Sbjct: 60 MLHNLEHSMGMHSYDLGMNHLGDMTSEEVVSLMTCLKVPRQSQRNVTYKSSPNQKLPDSL 119
Query: 147 DWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS-ENQG 205
DWR++G VT VK QG CGSCWAFSAV A+E KL TGKLVSLS Q LVDC N+G
Sbjct: 120 DWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEKYRNEG 179
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP------- 258
C+GG+M +AF++I G+ +E YPY+ +++CQ D +K+ A T + Y +P
Sbjct: 180 CHGGFMTEAFQYIIDNNGIDSEASYPYKAMDEKCQYD-SKNRAATCSKYTELPFGSEEAL 238
Query: 259 ----------------ARYAFQLYSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVKN 301
+ +F LY GV+ E C +NHGV VVGYG +G YWLVKN
Sbjct: 239 KEAVASKGPVSVAIDASHSSFFLYRSGVYYEPACTQVVNHGVLVVGYGNLNGNDYWLVKN 298
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYP 336
SWG +G+ GYIRMARN + CGI +SYP
Sbjct: 299 SWGLYFGDKGYIRMARNRENH----CGIASYSSYP 329
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 128/302 (42%), Positives = 167/302 (55%), Gaps = 37/302 (12%)
Query: 68 QYSREYGSEDEWQRRFGIYSSNVQYIDYINSQN----LSFKLTDNKFADLSNEEFISTYL 123
+Y ++Y S E R +Y N ++I+ N Q +SF L N+F D++ EE +
Sbjct: 28 RYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTTEEINAAMN 87
Query: 124 GY-NKPYNEPRWPSVQYL--GLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINK 180
G+ + PR Q L LP +VDWR +GAVTPVKDQ CGSCWAFSA ++EG +
Sbjct: 88 GFLSAGKKVPRGTMYQPLVDELPDTVDWRDKGAVTPVKDQKACGSCWAFSATGSLEGQHF 147
Query: 181 LKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQ 240
L TGKLVSLSEQ LVDC N GC GG M+ AF +I G+ TE+ YPY KN C+
Sbjct: 148 LSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDTEESYPYEAKNGPCR 207
Query: 241 TDKTKHHAVTITGY----------------EAIPARYA-------FQLYSHGV-FDEYCG 276
+ + + T++ Y E P A F YS G+ +DE C
Sbjct: 208 FN-SDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTFHFYSRGIYYDEKCS 266
Query: 277 HQ-LNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASY 335
L+HGV VGYG D YWLVKNSW +WG++GYI+M+RN ++ CGI QASY
Sbjct: 267 SSFLDHGVLAVGYGTDDSSDYWLVKNSWNETWGDSGYIKMSRNRNNN----CGIASQASY 322
Query: 336 PV 337
PV
Sbjct: 323 PV 324
>gi|73946536|ref|XP_541257.2| PREDICTED: cathepsin L1 [Canis lupus familiaris]
Length = 333
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 135/342 (39%), Positives = 191/342 (55%), Gaps = 48/342 (14%)
Query: 31 SLFLLWV-LGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
SLFL + LGI + A PQ+ S++ + W + + + Y ++E RR ++ N
Sbjct: 4 SLFLAALCLGIASAA-----PQQ--DHSLDAHWSQWKEAHGKLYDKDEEGWRR-TVWERN 55
Query: 90 VQYIDYINSQ----NLSFKLTDNKFADLSNEEF--ISTYLGYNKPYNEPRWPSVQYLGLP 143
++ I+ N + SF L N F D++NEEF + K +P+ + +P
Sbjct: 56 MEMIEQHNQEYSQGEHSFTLAMNAFGDMTNEEFKQVLNDFKIQKHKKGKVFPAPLFAEVP 115
Query: 144 ASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSEN 203
+SVDWR++G VTPVKDQGQC CWAFSA A+EG KTGKLVSLSEQ LVDC + N
Sbjct: 116 SSVDWREQGYVTPVKDQGQCLGCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWSQGN 175
Query: 204 QGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI------ 257
+GCNGG ME AF+++ GG+ +E+ YPY +N+ C+ + + A +T + I
Sbjct: 176 RGCNGGLMEYAFQYVKDNGGLDSEESYPYLARNEPCKY-RPEKSAANVTAFWPILNEEDG 234
Query: 258 ---------PARYA-------FQLYSHGV-FDEYCGHQ-LNHGVTVVGYG----EDHGEK 295
P A FQ Y G+ +D C ++ LNHGV VVGYG E +K
Sbjct: 235 LMTTVATVGPVSAAVDSSPQSFQFYKKGIYYDPKCSNKLLNHGVLVVGYGFEGAESDNKK 294
Query: 296 YWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
YW+VKNSWGT+WG GY+ +A++ + CGI +ASYPV
Sbjct: 295 YWIVKNSWGTNWGMQGYMLLAKDRDNH----CGIATRASYPV 332
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 130/309 (42%), Positives = 166/309 (53%), Gaps = 38/309 (12%)
Query: 63 ENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLS----FKLTDNKFADLSNEEF 118
E+W + + Y S E + R IY N I NS+ L+ + + N + DL + EF
Sbjct: 31 ESWKLMHGKTYSSSIEEKLRLKIYMENSLKISRHNSEALNGIHPYYMKMNHYGDLLHHEF 90
Query: 119 ISTYLGYNKPYNEPRWPSV----QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAA 174
++ GY + + LP VDWR+EGAVTPVK+QGQCGSCW+FSA A
Sbjct: 91 VAMVNGYQYANKTASLGGTYIPNKNIQLPTHVDWREEGAVTPVKNQGQCGSCWSFSATGA 150
Query: 175 VEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRG 234
+EG + KTGKL+SLSEQ LVDC N GC GG M+ AF +I G+ TE YPY G
Sbjct: 151 LEGQDFRKTGKLISLSEQNLVDCSRKFGNNGCEGGLMDFAFTYIRDNKGIDTEASYPYEG 210
Query: 235 KNDRCQ-------------------TDKTKHHAVTITG--YEAIPARY-AFQLYSHGVFD 272
+ C ++K AV G AI A + +FQ YSHGV+
Sbjct: 211 IDGHCHYNPKNKGGSDIGFVDIKKGSEKDLKKAVAGVGPISVAIDASHMSFQFYSHGVYV 270
Query: 273 EY--CGHQLNHGVTVVGYGED--HGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICG 328
E +L+HGV VVG+G D GE YWLVKNSW WG+ GYI+MARN + +CG
Sbjct: 271 ESKCSSEELDHGVLVVGFGTDSVSGEDYWLVKNSWSEKWGDQGYIKMARNKEN----MCG 326
Query: 329 ILMQASYPV 337
I ASYPV
Sbjct: 327 IASSASYPV 335
>gi|281204231|gb|EFA78427.1| cysteine proteinase 3 [Polysphondylium pallidum PN500]
Length = 329
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 121/336 (36%), Positives = 179/336 (53%), Gaps = 39/336 (11%)
Query: 30 LSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSN 89
L L ++G+ AG+ + + + + +F NW+ R+Y + E++ R+ + N
Sbjct: 3 LLLAFFMIVGLAAGS------RLFAEKHYQNQFTNWMVVQDRQYDAY-EFRTRYSAFKDN 55
Query: 90 VQYIDYINSQNLSFKLTDNKFADLSNEEFISTYLGYNKPYN----EPRWPSVQYLGLPAS 145
+ +I N+ N +L FADL+NEE+ + YLG N + +P Y + ++
Sbjct: 56 LDFIHRWNAVNKETELGATVFADLTNEEYRAVYLGMNVDASNFAAQPATLDQVYQPVRST 115
Query: 146 VDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQG 205
+DWR GAV VKDQGQCGSCWAFS AVEG +++ TG VSLSEQ+L+DC + N G
Sbjct: 116 LDWRNNGAVGRVKDQGQCGSCWAFSTTGAVEGAHQIATGNFVSLSEQQLMDCSRSYGNHG 175
Query: 206 CNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI-------- 257
C GG M+ A +I K GG+ TE+ YPY ++ ++ ++GY I
Sbjct: 176 CQGGLMDSAMSYIVKQGGINTEESYPYEMRDSYTCKYNPANNGAKLSGYSNIKRGSEADL 235
Query: 258 --------------PARYAFQLYSHGVF-DEYCGH-QLNHGVTVVGYGEDHGEKYWLVKN 301
+ +FQLY GVF D C L+HGV VGYG + YW+VKN
Sbjct: 236 AAKLNIGPVAIALDASHSSFQLYKSGVFYDPACSSTSLSHGVLAVGYGTEGSSAYWIVKN 295
Query: 302 SWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
SWGT WG+AGYI +A++ + CG+ +S P+
Sbjct: 296 SWGTRWGDAGYIWIAKDRNNH----CGVATMSSIPI 327
>gi|37905511|gb|AAO64477.1| cathepsin S precursor [Fundulus heteroclitus]
Length = 337
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 121/313 (38%), Positives = 169/313 (53%), Gaps = 39/313 (12%)
Query: 58 MEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFADL 113
+++ +E W K + +EY +E+E R ++ N+ I N + ++ L+ N DL
Sbjct: 30 LDDHWELWKKTHGKEYQNEEENVHRRDLWEKNLMLITTHNLEASMGFHTYDLSMNFMGDL 89
Query: 114 SNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCWA 168
S EE + Y P + R PS ++G +P ++D R++G VT V+ QG CGSCWA
Sbjct: 90 SQEEILQFYTTLTTPTDLQRAPS-SFVGASGADVPDTLDLREKGLVTAVRMQGACGSCWA 148
Query: 169 FSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTED 228
FSA A+EG KTGKL +LS Q LVDC N GCNGG+M KAF+++ G+ +ED
Sbjct: 149 FSAAGALEGQLAKKTGKLQNLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVIDNQGIDSED 208
Query: 229 DYPYRGKNDRCQTDKTKHHAVTITGYEAIPA-----------------------RYAFQL 265
YPYRG++ +CQ + A + Y+ +P R F
Sbjct: 209 SYPYRGRDQQCQYNPAT-RAANCSRYDFLPEGDEQALKEAIATIGPISVAIDARRPRFAF 267
Query: 266 YSHGVFDE-YCGHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNI 324
Y GV+D+ C +NH V VGYG G+ YWLVKNSWGTS+G+ GYIRMARN
Sbjct: 268 YRSGVYDDSSCTQNVNHAVLAVGYGSLGGQDYWLVKNSWGTSFGDQGYIRMARNKNDQ-- 325
Query: 325 GICGILMQASYPV 337
CGI + A YP+
Sbjct: 326 --CGIALYACYPI 336
>gi|27806673|ref|NP_776457.1| cathepsin L2 precursor [Bos taurus]
gi|1542853|emb|CAA62870.1| cathepsin L [Bos taurus]
Length = 334
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 135/348 (38%), Positives = 185/348 (53%), Gaps = 59/348 (16%)
Query: 31 SLFL-LWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYG-SEDEWQRRFGIYSS 88
S FL + LG+ + A K DP +++ + W + R YG +E+EW+R ++
Sbjct: 4 SFFLTVLCLGVASAA------PKLDP-NLDAHWHQWKATHRRLYGMNEEEWRR--AVWEK 54
Query: 89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSV 137
N + ID N + +F++ N F D++NEEF G+ K ++EP
Sbjct: 55 NKKIIDLHNQEYSEGKHAFRMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKLFHEPL---- 110
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+ +P SVDW K+G VTPVK+QGQCGSCWAFSA A+EG KTGKLVSLSEQ LVDC
Sbjct: 111 -LVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
NQGCNGG M+ AF++I GG+ +E+ YPY + K + A TG+ I
Sbjct: 170 SRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI 229
Query: 258 PAR----------------------YAFQLYSHGV-FDEYCG-HQLNHGVTVVGYG---- 289
P R +FQ Y G+ +D C L+HGV VVGYG
Sbjct: 230 PQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGT 289
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+ + K+W+VKNSWG WG GY++MA++ + CGI ASYP
Sbjct: 290 DSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNH----CGIATAASYPT 333
>gi|402898110|ref|XP_003912074.1| PREDICTED: cathepsin L2 [Papio anubis]
Length = 334
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 135/349 (38%), Positives = 185/349 (53%), Gaps = 59/349 (16%)
Query: 30 LSLFLL-WVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
LSL L + LGI + K+D Q+++ ++ W + R YG+ +E RR ++
Sbjct: 3 LSLVLAAFCLGIASAV------PKFD-QNLDTKWYQWKATHRRLYGASEEGWRR-AVWEK 54
Query: 89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTY-------LGYNKPYNEPRWPSV 137
N++ I+ N + F + N F D++NEEF L K + EP
Sbjct: 55 NMKMIELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFREPL---- 110
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+L LP SVDWRK+G VTPVK+Q QCGSCWAFSA A+EG KTGKLVSLSEQ LVDC
Sbjct: 111 -FLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
NQGCNGG+M AF ++ + GG+ +E+ YPY + C+ + ++ TG+E +
Sbjct: 170 SRPQGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKY-RPENSVANDTGFEVV 228
Query: 258 PA-----------------------RYAFQLYSHGV-FDEYCGHQ-LNHGVTVVGYG--- 289
PA +FQ Y G+ F+ C + L+HGV VVGYG
Sbjct: 229 PAGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEG 288
Query: 290 -EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KYWLVKNSWG WG GY+++A++ + CGI ASYP
Sbjct: 289 ANSDNNKYWLVKNSWGPEWGSNGYVKIAKDKDNH----CGIATAASYPT 333
>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 129/322 (40%), Positives = 178/322 (55%), Gaps = 41/322 (12%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQNLSFKLTDNKFADL 113
D ++ F + K++ + Y +++E +R I+ N+ YI+ +N+QNLS+KL N++ DL
Sbjct: 19 DLEAAGLAFIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVNAQNLSYKLGVNEYTDL 78
Query: 114 SNEEFISTYL-------GYNKPYNEPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSC 166
+ EEF + L G + P+ L P SVDWRK+G + PVKDQG CGSC
Sbjct: 79 TLEEFAALKLSSTDMSEGMGDGFVAGAGPTTTTL--PTSVDWRKKGVLNPVKDQGYCGSC 136
Query: 167 WAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTT 226
WAFSA+ A+E + TGKL+SLSEQ+LVDC N+GCNGG M+KAFE+I K GV
Sbjct: 137 WAFSAIGALEPRYAIATGKLLSLSEQQLVDCAGAYGNEGCNGGLMDKAFEYI-KATGVDK 195
Query: 227 EDDYPYRGKNDRCQ------TDKTKHHAVT------------ITGYEAIP---ARYA--- 262
E YPY G ++ CQ TD VT + G A P A YA
Sbjct: 196 ESTYPYVGSDETCQATVENKTDGLPVGEVTGNQMLHQTEKALMEGVAAAPVSIAMYANLQ 255
Query: 263 -FQLYSHGVF-DEYC---GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
FQ Y GV+ D C G ++HGV VGYG ++G+ Y++++NSWG SWG+ GY+ + R
Sbjct: 256 SFQHYKSGVYSDPNCNAKGGSIDHGVVAVGYGTENGQDYFIIRNSWGRSWGQDGYVYLKR 315
Query: 318 NSPSSNIGICGILMQASYPVKR 339
S G C I P +
Sbjct: 316 GVGS--FGQCNIYKYMCVPTLK 335
>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
Length = 334
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 132/326 (40%), Positives = 182/326 (55%), Gaps = 53/326 (16%)
Query: 52 KYDPQSMEERFENWLKQYSREYGS-EDEWQRRFGIYSSNVQYI-----DYINSQNLSFKL 105
K+D Q+ + W + R YG+ E+EW+R I+ N++ I +Y N Q+ F +
Sbjct: 20 KFD-QTFSAEWHQWKSTHRRLYGTNEEEWRR--AIWEKNMRMIQLHNGEYSNGQH-GFSM 75
Query: 106 TDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVK 158
N F D++NEEF GY + + EP L +P SVDWR++G VTPVK
Sbjct: 76 EMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPL-----MLKIPKSVDWREKGCVTPVK 130
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
+QGQCGSCWAFSA +EG LKTGKL+SLSEQ LVDC NQGCNGG M+ AF++I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDYAFQYI 190
Query: 219 TKIGGVTTEDDYPYRGKNDRC------------------QTDKTKHHAVTITGYEAI--- 257
+ GG+ +E+ YPY K+ C Q +K AV G ++
Sbjct: 191 KENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 258 PARYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAG 311
+ + Q YS G++ E C + L+HGV +VGYG + + KYWLVKNSWG+ WG G
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEG 310
Query: 312 YIRMARNSPSSNIGICGILMQASYPV 337
YI++A++ + CG+ ASYPV
Sbjct: 311 YIKIAKDRDNH----CGLATAASYPV 332
>gi|13928758|ref|NP_113748.1| cathepsin K precursor [Rattus norvegicus]
gi|12585195|sp|O35186.1|CATK_RAT RecName: Full=Cathepsin K; Flags: Precursor
gi|2305208|gb|AAB65743.1| cathepsin K [Rattus norvegicus]
gi|50927597|gb|AAH78793.1| Cathepsin K [Rattus norvegicus]
gi|149030667|gb|EDL85704.1| cathepsin K, isoform CRA_a [Rattus norvegicus]
Length = 329
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 128/320 (40%), Positives = 179/320 (55%), Gaps = 51/320 (15%)
Query: 56 QSMEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKF 110
++++ ++E W K + ++Y S+ DE RR I+ N++ I N + +++L N
Sbjct: 20 ETLDTQWELWKKTHGKQYNSKVDEISRRL-IWEKNLKKISVHNLEASLGAHTYELAMNHL 78
Query: 111 ADLSNEEFISTYLGYNKPYNE---------PRWPSVQYLGLPASVDWRKEGAVTPVKDQG 161
D+++EE + G P + P W +P S+D+RK+G VTPVK+QG
Sbjct: 79 GDMTSEEVVQKMTGLRVPPSRSFSNDTLYTPEWEGR----VPDSIDYRKKGYVTPVKNQG 134
Query: 162 QCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKI 221
QCGSCWAFS+ A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ +
Sbjct: 135 QCGSCWAFSSAGALEGQLKKKTGKLLALSPQNLVDCV--SENYGCGGGYMTTAFQYVQQN 192
Query: 222 GGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY--------- 261
GG+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 193 GGIDSEDAYPYVGQDESCMYNATA-KAAKCRGYREIPVGNEKALKRAVARVGPVSVSIDA 251
Query: 262 ---AFQLYSHGV-FDEYCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMA 316
+FQ YS GV +DE C +NH V VVGYG G KYW++KNSWG SWG GY+ +A
Sbjct: 252 SLTSFQFYSRGVYYDENCDRDNVNHAVLVVGYGTQKGNKYWIIKNSWGESWGNKGYVLLA 311
Query: 317 RNSPSSNIGICGILMQASYP 336
RN ++ CGI AS+P
Sbjct: 312 RNKNNA----CGITNLASFP 327
>gi|391328505|ref|XP_003738729.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 323
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 135/352 (38%), Positives = 185/352 (52%), Gaps = 68/352 (19%)
Query: 24 MLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRF 83
MLR + L+ +LG+ + W + + K + + YG ++E RR
Sbjct: 1 MLRTTAI---LVALLGLASANW-----------------DLYKKVHGKSYGHDEEHFRRQ 40
Query: 84 GIYSSNVQYIDYINSQNL-------SFKLTDNKFADLSNEEFIS----TYLGYNKPYNEP 132
Y S + IN+ NL ++++ NKF D+++EEF + + N
Sbjct: 41 LFYKS----VAKINAHNLRHDLGLTTYRMGLNKFTDMTSEEFRNFKGLKFDATKTKRNGT 96
Query: 133 RWPSVQYLG--LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLS 190
R+ + LG LP VDWR++G VTPVK+QGQCGSCWAFS ++EG + TGKLVSLS
Sbjct: 97 RFQK-ELLGEALPTQVDWREKGYVTPVKNQGQCGSCWAFSTTGSLEGQHFKATGKLVSLS 155
Query: 191 EQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVT 250
EQ LVDC N GCNGG M+ F +I + GG+ TE+ YPY GK+ C ++ A
Sbjct: 156 EQNLVDCSRVEGNNGCNGGLMDNGFTYIQQNGGIDTEESYPYTGKDGDCAFNENSVGA-R 214
Query: 251 ITGYEAIPAR-----------------------YAFQLYSHGVFDE-YCG-HQLNHGVTV 285
+ G+ +P R +FQ Y GV+DE C QL+HGV V
Sbjct: 215 VKGFVDVPQRDEAALQAAVASVGPVSVAIDASNDSFQYYKEGVYDEPSCSFSQLDHGVLV 274
Query: 286 VGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
VGYG ++G YWLVKNSWG +WG+ GYI+M RN + CGI ASYP
Sbjct: 275 VGYGTENGVDYWLVKNSWGPTWGQDGYIKMMRNKENQ----CGIASMASYPT 322
>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
Length = 334
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 132/326 (40%), Positives = 182/326 (55%), Gaps = 53/326 (16%)
Query: 52 KYDPQSMEERFENWLKQYSREYGS-EDEWQRRFGIYSSNVQYI-----DYINSQNLSFKL 105
K+D Q+ + W + R YG+ E+EW+R I+ N++ I +Y N Q+ F +
Sbjct: 20 KFD-QTFSAEWHQWKSTHRRLYGTNEEEWRR--AIWEKNMRMIQLHNGEYSNGQH-GFSM 75
Query: 106 TDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVK 158
N F D++NEEF GY + + EP L +P SVDWR++G VTPVK
Sbjct: 76 EMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPL-----MLKIPKSVDWREKGCVTPVK 130
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
+QGQCGSCWAFSA +EG LKTGKL+SLSEQ LVDC NQGCNGG M+ AF++I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190
Query: 219 TKIGGVTTEDDYPYRGKNDRC------------------QTDKTKHHAVTITGYEAI--- 257
+ GG+ +E+ YPY K+ C Q +K AV G ++
Sbjct: 191 KENGGLDSEESYPYEAKDGSCKYRAEFAVANGTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 258 PARYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAG 311
+ + Q YS G++ E C + L+HGV +VGYG + + KYWLVKNSWG+ WG G
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEG 310
Query: 312 YIRMARNSPSSNIGICGILMQASYPV 337
YI++A++ + CG+ ASYPV
Sbjct: 311 YIKIAKDRDNH----CGLATAASYPV 332
>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; AltName: Full=p39 cysteine proteinase;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
Length = 334
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 132/326 (40%), Positives = 182/326 (55%), Gaps = 53/326 (16%)
Query: 52 KYDPQSMEERFENWLKQYSREYGS-EDEWQRRFGIYSSNVQYI-----DYINSQNLSFKL 105
K+D Q+ + W + R YG+ E+EW+R I+ N++ I +Y N Q+ F +
Sbjct: 20 KFD-QTFSAEWHQWKSTHRRLYGTNEEEWRR--AIWEKNMRMIQLHNGEYSNGQH-GFSM 75
Query: 106 TDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVK 158
N F D++NEEF GY + + EP L +P SVDWR++G VTPVK
Sbjct: 76 EMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPL-----MLKIPKSVDWREKGCVTPVK 130
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
+QGQCGSCWAFSA +EG LKTGKL+SLSEQ LVDC NQGCNGG M+ AF++I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190
Query: 219 TKIGGVTTEDDYPYRGKNDRC------------------QTDKTKHHAVTITGYEAI--- 257
+ GG+ +E+ YPY K+ C Q +K AV G ++
Sbjct: 191 KENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 258 PARYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAG 311
+ + Q YS G++ E C + L+HGV +VGYG + + KYWLVKNSWG+ WG G
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEG 310
Query: 312 YIRMARNSPSSNIGICGILMQASYPV 337
YI++A++ + CG+ ASYPV
Sbjct: 311 YIKIAKDRDNH----CGLATAASYPV 332
>gi|291383484|ref|XP_002708316.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 123/316 (38%), Positives = 171/316 (54%), Gaps = 40/316 (12%)
Query: 56 QSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFA 111
+S++ R+ W Q+ R Y +EW+RR ++ N++ I+ N + F + N +
Sbjct: 23 RSLDARWSQWKAQHRRAYSPHEEWRRR-AVWEKNMRMIELHNGEYSQGKRGFSMAMNAYG 81
Query: 112 DLSNEEFISTYLGYNKPYN--EPRWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAF 169
D+++EEF G++ + E + + +P+SVDWR +G VTPVK QG+CGSCWAF
Sbjct: 82 DMTSEEFRQVMNGFHHQPDKKEKVFGKAVFQEVPSSVDWRDKGYVTPVKKQGRCGSCWAF 141
Query: 170 SAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDD 229
SA A+EG KTG+LVSLSEQ L+DC + N GC GG + AF+++ GG+ +ED
Sbjct: 142 SATGALEGQMFRKTGRLVSLSEQNLIDCSWPAGNHGCRGGLTDHAFQYVKDNGGLDSEDS 201
Query: 230 YPYRGKNDRCQTDKTKHHAVTITGYEAIPARY----------------------AFQLYS 267
YPY +N C+ D K A TG+ IP + +FQ Y
Sbjct: 202 YPYEARNLPCRYDPQKSVA-NGTGFVRIPRQENALMEAVATVGPIAVAIDAGHPSFQFYK 260
Query: 268 HGVFDE--YCGHQLNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPS 321
G++ E NH V VVGYG E KYWLVKNSWG WGEAGYIR+A++ +
Sbjct: 261 EGIYYEPNCSSKHHNHAVLVVGYGYEGAESDSNKYWLVKNSWGKRWGEAGYIRIAKDRNN 320
Query: 322 SNIGICGILMQASYPV 337
CGI ASYP
Sbjct: 321 H----CGIASHASYPT 332
>gi|74834619|sp|O97397.1|CATLL_PHACE RecName: Full=Cathepsin L-like proteinase; Flags: Precursor
gi|4210800|emb|CAA76927.1| thiol protease [Phaedon cochleariae]
Length = 324
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 116/319 (36%), Positives = 178/319 (55%), Gaps = 46/319 (14%)
Query: 54 DPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSSNVQYI-----DYINSQNLSFKLTDN 108
+ S +E + ++ K ++R Y S E + RF I+ ++ I Y N ++ ++ L N
Sbjct: 15 NAASDQELWADFKKTHARTYKSLREEKLRFNIFQDTLRQIAEHNVKYENGES-TYYLAIN 73
Query: 109 KFADLSNEEFISTYLGYNKPYNEPRWPSVQYL--------GLPASVDWRKEGAVTPVKDQ 160
KF+D+++EEF + NE P+++ L P S+DWR +G V PV++Q
Sbjct: 74 KFSDITDEEFRDMLM-----KNEASRPNLEGLEVADLTVGAAPESIDWRSKGVVLPVRNQ 128
Query: 161 GQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITK 220
G+CGSCWA S AA+E + +K+G V LS Q+LVDC + N GCNGG+ FE++ K
Sbjct: 129 GECGSCWALSTAAAIESQSAIKSGSKVPLSPQQLVDCSTSYGNHGCNGGFAVNGFEYV-K 187
Query: 221 IGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIPARYA------------------ 262
G+ ++ DYPY GK D+C+ + V +TGY+ + A
Sbjct: 188 DNGLESDADYPYSGKEDKCKANDKSRSVVELTGYKKVTASETSLKEAVGTIGPISAVVFG 247
Query: 263 --FQLYSHGVFDEYC--GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
+ Y G+FD+ G L+HGV VVGYG ++G+KYW++KN+WG WGE+GYIR+ R+
Sbjct: 248 KPMKSYGGGIFDDSSCLGDNLHHGVNVVGYGIENGQKYWIIKNTWGADWGESGYIRLIRD 307
Query: 319 SPSSNIGICGILMQASYPV 337
+ S CG+ ASYP+
Sbjct: 308 TDHS----CGVEKMASYPI 322
>gi|109940313|sp|P25975.3|CATL1_BOVIN RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|74354943|gb|AAI02313.1| CTSL2 protein [Bos taurus]
gi|154425700|gb|AAI51426.1| Cathepsin L2 [Bos taurus]
gi|296484466|tpg|DAA26581.1| TPA: cathepsin L2 precursor [Bos taurus]
gi|440898893|gb|ELR50299.1| Cathepsin L1 [Bos grunniens mutus]
Length = 334
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 135/348 (38%), Positives = 185/348 (53%), Gaps = 59/348 (16%)
Query: 31 SLFL-LWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYG-SEDEWQRRFGIYSS 88
S FL + LG+ + A K DP +++ + W + R YG +E+EW+R ++
Sbjct: 4 SFFLTVLCLGVASAA------PKLDP-NLDAHWHQWKATHRRLYGMNEEEWRR--AVWEK 54
Query: 89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSV 137
N + ID N + F++ N F D++NEEF G+ K ++EP
Sbjct: 55 NKKIIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQNQKHKKGKLFHEPL---- 110
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+ +P SVDW K+G VTPVK+QGQCGSCWAFSA A+EG KTGKLVSLSEQ LVDC
Sbjct: 111 -LVDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
NQGCNGG M+ AF++I GG+ +E+ YPY + K + A TG+ I
Sbjct: 170 SRAQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDI 229
Query: 258 PAR----------------------YAFQLYSHGV-FDEYCGHQ-LNHGVTVVGYG---- 289
P R +FQ Y G+ +D C + L+HGV VVGYG
Sbjct: 230 PQREKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGT 289
Query: 290 EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
+ + K+W+VKNSWG WG GY++MA++ + CGI ASYP
Sbjct: 290 DSNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNH----CGIATAASYPT 333
>gi|130502110|ref|NP_001076110.1| cathepsin K precursor [Oryctolagus cuniculus]
gi|1168794|sp|P43236.1|CATK_RABIT RecName: Full=Cathepsin K; AltName: Full=Protein OC-2; Flags:
Precursor
gi|454187|dbj|BAA03125.1| OC-2 protein [Oryctolagus cuniculus]
Length = 329
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 129/314 (41%), Positives = 177/314 (56%), Gaps = 43/314 (13%)
Query: 58 MEERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYINSQNL----SFKLTDNKFAD 112
++ ++E W K YS++Y S+ DE RR I+ N+++I N + +++L N D
Sbjct: 22 LDTQWELWKKTYSKQYNSKVDEISRRL-IWEKNLKHISIHNLEASLGVHTYELAMNHLGD 80
Query: 113 LSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQCGSCW 167
+++EE + G P + Y+ P S+D+RK+G VTPVK+QGQCGSCW
Sbjct: 81 MTSEEVVQKMTGLKVPPSRSHSNDTLYIPDWEGRTPDSIDYRKKGYVTPVKNQGQCGSCW 140
Query: 168 AFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTE 227
AFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ + G+ +E
Sbjct: 141 AFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENYGCGGGYMTNAFQYVQRNRGIDSE 198
Query: 228 DDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY------------AFQ 264
D YPY G+++ C + T A GY IP AR +FQ
Sbjct: 199 DAYPYVGQDESCMYNPTG-KAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQ 257
Query: 265 LYSHGV-FDEYCGH-QLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSS 322
YS GV +DE C +NH V VGYG G K+W++KNSWG SWG GYI MARN ++
Sbjct: 258 FYSKGVYYDENCSSDNVNHAVLAVGYGIQKGNKHWIIKNSWGESWGNKGYILMARNKNNA 317
Query: 323 NIGICGILMQASYP 336
CGI AS+P
Sbjct: 318 ----CGIANLASFP 327
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 128/307 (41%), Positives = 168/307 (54%), Gaps = 42/307 (13%)
Query: 68 QYSREYGSEDEWQRRFGIYSSNVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTYL 123
+++ Y S E R IY N + I N + +++KL NK+ D+ + EF++T
Sbjct: 35 HHNKVYKSPVEEGYRMKIYMDNKRKIAEHNRKYELNEVTYKLGMNKYGDMLHHEFVNTLN 94
Query: 124 GYNKPYNEP------RWPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEG 177
G+NK + S + LP VDW K+GAVT VKDQG CGSCWAFS+ A+EG
Sbjct: 95 GFNKSVTAGIETEGVTFISPANVKLPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGALEG 154
Query: 178 INKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKND 237
+ TG LVSLSEQ L+DC N GCNGG M+ AF++I G+ TE YPY +ND
Sbjct: 155 QHFRSTGYLVSLSEQNLIDCSGKYGNNGCNGGLMDYAFQYIKDNKGLDTEKTYPYEAEND 214
Query: 238 RCQTDKTKHHAVTITGYEAIP-----------------------ARYAFQLYSHGV-FDE 273
RC+ + ++ T GY IP + +FQLYS GV +D
Sbjct: 215 RCRYN-PRNSGATDKGYVDIPQGDEEKLKAAVATIGPISVAIDASHESFQLYSEGVYYDP 273
Query: 274 YC-GHQLNHGVTVVGYGEDH--GEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGIL 330
C L+HGV +VGYG D G YWLVKNSWG +WG+ GYI+MARN + CGI
Sbjct: 274 DCSAENLDHGVLIVGYGTDETSGHDYWLVKNSWGKTWGQKGYIKMARNKNNH----CGIA 329
Query: 331 MQASYPV 337
ASYP+
Sbjct: 330 SSASYPL 336
>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
Length = 221
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 108/220 (49%), Positives = 138/220 (62%), Gaps = 26/220 (11%)
Query: 142 LPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNS 201
LP S+DWR+ GAV PVK+QG CGSCWAFS VAAVEGIN++ TG L+SLSEQ+LVDC +
Sbjct: 3 LPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDC--TT 60
Query: 202 ENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP--- 258
N GC GG+M AF+FI GG+ +E+ YPYRG++ C + V+I YE +P
Sbjct: 61 ANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGICNS-TVNAPVVSIDSYENVPSHN 119
Query: 259 -------------------ARYAFQLYSHGVFDEYCGHQLNHGVTVVGYGEDHGEKYWLV 299
A FQLY G+F C NH +TVVGYG ++ + +W+V
Sbjct: 120 EQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKDFWIV 179
Query: 300 KNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVKR 339
KNSWG +WGE+GYIR RN + + G CGI ASYPVK+
Sbjct: 180 KNSWGKNWGESGYIRAERNIENPD-GKCGITRFASYPVKK 218
>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 135/349 (38%), Positives = 185/349 (53%), Gaps = 59/349 (16%)
Query: 30 LSLFLL-WVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRRFGIYSS 88
LSL L + LGI + K+D Q+++ ++ W + R YG+ +E RR ++
Sbjct: 3 LSLVLAAFCLGIASAV------PKFD-QNLDTKWYQWKATHRRLYGASEEGWRR-AVWEK 54
Query: 89 NVQYIDYINSQ----NLSFKLTDNKFADLSNEEFISTY-------LGYNKPYNEPRWPSV 137
N++ I+ N + F + N F D++NEEF L K + EP
Sbjct: 55 NMKMIELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFRNQKLRKGKLFREPL---- 110
Query: 138 QYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDC 197
+L LP SVDWRK+G VTPVK+Q QCGSCWAFSA A+EG KTGKLVSLSEQ LVDC
Sbjct: 111 -FLDLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDC 169
Query: 198 DVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAI 257
NQGCNGG+M AF ++ + GG+ +E+ YPY + C+ + ++ TG+E +
Sbjct: 170 SHPQGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKY-RPENSVANDTGFEVV 228
Query: 258 PA-----------------------RYAFQLYSHGV-FDEYCGHQ-LNHGVTVVGYG--- 289
PA +FQ Y G+ F+ C + L+HGV VVGYG
Sbjct: 229 PAGKEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEG 288
Query: 290 -EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPV 337
KYWLVKNSWG WG GY+++A++ + CGI ASYP
Sbjct: 289 ANSDNNKYWLVKNSWGPEWGSNGYVKIAKDKDNH----CGIATAASYPT 333
>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
Length = 334
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 132/326 (40%), Positives = 182/326 (55%), Gaps = 53/326 (16%)
Query: 52 KYDPQSMEERFENWLKQYSREYGS-EDEWQRRFGIYSSNVQYI-----DYINSQNLSFKL 105
K+D Q+ + W + R YG+ E+EW+R I+ N++ I +Y N Q+ F +
Sbjct: 20 KFD-QTFSAEWHQWKSTHRRLYGTNEEEWRR--AIWEKNMRIIQLHNGEYSNGQH-GFSM 75
Query: 106 TDNKFADLSNEEFISTYLGYN-------KPYNEPRWPSVQYLGLPASVDWRKEGAVTPVK 158
N F D++NEEF GY + + EP L +P SVDWR++G VTPVK
Sbjct: 76 EMNAFGDMTNEEFRQVVNGYRHQKHKKGRLFQEPL-----MLKIPKSVDWREKGCVTPVK 130
Query: 159 DQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFI 218
+QGQCGSCWAFSA +EG LKTGKL+SLSEQ LVDC NQGCNGG M+ AF++I
Sbjct: 131 NQGQCGSCWAFSASGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYI 190
Query: 219 TKIGGVTTEDDYPYRGKNDRC------------------QTDKTKHHAVTITGYEAI--- 257
+ GG+ +E+ YPY K+ C Q +K AV G ++
Sbjct: 191 KENGGLDSEESYPYEAKDGSCKYRAEFAVANDTGFVDIPQQEKALMKAVATVGPISVAMD 250
Query: 258 PARYAFQLYSHGVFDE-YCGHQ-LNHGVTVVGYG----EDHGEKYWLVKNSWGTSWGEAG 311
+ + Q YS G++ E C + L+HGV +VGYG + + KYWLVKNSWG+ WG G
Sbjct: 251 ASHPSLQFYSSGIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEG 310
Query: 312 YIRMARNSPSSNIGICGILMQASYPV 337
YI++A++ + CG+ ASYPV
Sbjct: 311 YIKIAKDRDNH----CGLATAASYPV 332
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 133/354 (37%), Positives = 187/354 (52%), Gaps = 54/354 (15%)
Query: 23 MMLRNAVLSLFLLWVLGIPAGAWSEGYPQKYDPQSMEERFENWLKQYSREYGSEDEWQRR 82
M L VLSL L L P+ DP ++ +E W + + Y ++E RR
Sbjct: 1 MRLPFVVLSLCLAGGLAAPS----------LDP-GLDTHWEQWKSWHGKSYEQKEETWRR 49
Query: 83 FGIYSSNVQYIDYINSQNL----SFKLTDNKFADLSNEEFISTYLGY-----NKPYNEPR 133
++ +++ I+ N ++ SF+L N F D+ NEEF GY +K
Sbjct: 50 M-VWEKHLRVIEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH 108
Query: 134 WPSVQYLGLPASVDWRKEGAVTPVKDQGQCGSCWAFSAVAAVEGINKLKTGKLVSLSEQE 193
+ +L +P VDWR EG VTPVKDQGQCGSCWAFS A+EG + +TG+LVSLSEQ
Sbjct: 109 FLEPNFLEVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQN 168
Query: 194 LVDCDVNSENQGCNGGYMEKAFEFITKIGGVTTEDDYPYRGKNDRCQTDKTKHHAVTITG 253
LV+C N+GCNGG M++AF+++ GG+ +ED YPY G +D +++A TG
Sbjct: 169 LVECSKPEGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTG 228
Query: 254 YEAIPA-----------------------RYAFQLYSHGV-FDEYCGH-QLNHGVTVVGY 288
+ IP+ +FQ Y G+ F+ C L+HGV VVGY
Sbjct: 229 FVDIPSGKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGY 288
Query: 289 G----EDHGEKYWLVKNSWGTSWGEAGYIRMARNSPSSNIGICGILMQASYPVK 338
G + G+KYW+VKNSW WG+ GYI MA++ + CGI ASYP++
Sbjct: 289 GVEKRDTDGKKYWIVKNSWSEKWGQNGYILMAKDKDNH----CGIATAASYPLE 338
>gi|348586441|ref|XP_003478977.1| PREDICTED: cathepsin K-like [Cavia porcellus]
Length = 329
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 129/319 (40%), Positives = 178/319 (55%), Gaps = 43/319 (13%)
Query: 53 YDPQSMEERFENWLKQYSREY-GSEDEWQRRFGIYSSNVQYIDYINSQNL----SFKLTD 107
Y + ++ ++E W K Y ++Y G DE RR I+ N++YI N + +++L+
Sbjct: 17 YPEEILDTQWELWKKTYRKQYNGKVDEISRRI-IWEKNLKYISIHNLEASLGVHTYELSM 75
Query: 108 NKFADLSNEEFISTYLGYNKPYNEPRWPSVQYLG-----LPASVDWRKEGAVTPVKDQGQ 162
N D+++EE + G P + Y+ P SVD+RK+G VTPVK+QGQ
Sbjct: 76 NHLGDMTSEEVVQKMTGLKVPPSHSHSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQ 135
Query: 163 CGSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIG 222
CGSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ +
Sbjct: 136 CGSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQENR 193
Query: 223 GVTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY---------- 261
G+ +ED YPY G+ + C + T A GY IP AR
Sbjct: 194 GIDSEDAYPYVGQEESCMYNPTG-KAAKCRGYREIPVGNEKALKRAVARVGPVSVAIDAS 252
Query: 262 --AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMAR 317
+FQ YS GV +DE C G LNH + VGYG G K+W++KNSWG +WG GY+ +AR
Sbjct: 253 LSSFQFYSKGVYYDESCNGEDLNHALLAVGYGMQRGNKHWILKNSWGENWGNKGYVLLAR 312
Query: 318 NSPSSNIGICGILMQASYP 336
N ++ CGI AS+P
Sbjct: 313 NKNNA----CGIANLASFP 327
>gi|334324659|ref|XP_001371004.2| PREDICTED: cathepsin K-like [Monodelphis domestica]
Length = 332
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 134/318 (42%), Positives = 177/318 (55%), Gaps = 44/318 (13%)
Query: 55 PQSM-EERFENWLKQYSREYGSE-DEWQRRFGIYSSNVQYIDYIN---SQNL-SFKLTDN 108
P+ M + +++ W Y +EY S+ DE RR I+ N++YI N S L +F+L N
Sbjct: 21 PEEMLDTQWKLWKDSYRKEYNSKVDEISRRL-IWEKNLKYISTHNLEFSLGLHTFELAMN 79
Query: 109 KFADLSNEEFISTYLGYNKPYNEPRWPSVQYL-----GLPASVDWRKEGAVTPVKDQGQC 163
D+++EE + G P + + Y P S+D+RK+G VTPVK+QGQC
Sbjct: 80 HLGDMTSEEVVQKMTGLKVPLSRSQNNDTLYFPDWETKTPDSIDYRKKGYVTPVKNQGQC 139
Query: 164 GSCWAFSAVAAVEGINKLKTGKLVSLSEQELVDCDVNSENQGCNGGYMEKAFEFITKIGG 223
GSCWAFS+V A+EG K KTGKL++LS Q LVDC SEN GC GGYM AF+++ K G
Sbjct: 140 GSCWAFSSVGALEGQLKKKTGKLLNLSPQNLVDCV--SENDGCGGGYMTNAFQYVQKNRG 197
Query: 224 VTTEDDYPYRGKNDRCQTDKTKHHAVTITGYEAIP-----------ARY----------- 261
+ +ED YPY G+++ C + T A GY IP AR
Sbjct: 198 IDSEDAYPYIGEDESCMYNPT-GKAAKCRGYREIPEGSEKALKRAVARVGPVAVAIDASL 256
Query: 262 -AFQLYSHGV-FDEYC-GHQLNHGVTVVGYGEDHGEKYWLVKNSWGTSWGEAGYIRMARN 318
+FQ YS GV +DE C LNH V VGYG G K+W++KNSWG WG GYI MARN
Sbjct: 257 SSFQFYSKGVYYDENCNSDNLNHAVLAVGYGIQRGTKHWIIKNSWGEQWGNKGYILMARN 316
Query: 319 SPSSNIGICGILMQASYP 336
++ CGI AS+P
Sbjct: 317 KNNA----CGIANLASFP 330
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.135 0.430
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,853,929,499
Number of Sequences: 23463169
Number of extensions: 263114752
Number of successful extensions: 597166
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6764
Number of HSP's successfully gapped in prelim test: 841
Number of HSP's that attempted gapping in prelim test: 570292
Number of HSP's gapped (non-prelim): 10647
length of query: 340
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 197
effective length of database: 9,003,962,200
effective search space: 1773780553400
effective search space used: 1773780553400
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 77 (34.3 bits)