BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy14862
(263 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|71993922|ref|NP_505215.2| Protein TAG-196 [Caenorhabditis elegans]
gi|351050011|emb|CCD64084.1| Protein TAG-196 [Caenorhabditis elegans]
Length = 477
Score = 103 bits (258), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 57/145 (39%), Positives = 85/145 (58%), Gaps = 4/145 (2%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N F DFV +E++Y + E+ +RF +F+ N K I K+EQGTA YG +F+DMT EF
Sbjct: 172 NSFLDFVDRHEKKYTNKREVLKRFRVFKKNAKVIRELQKNEQGTAVYGFTKFSDMTTMEF 231
Query: 137 NHGLSSLDWEQ--IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ WEQ ++ FE + + N L ES ++++KG V +V++Q CGSCW
Sbjct: 232 KKIMLPYQWEQPVYPMEQANFEKHDV-TINEEDLPESFDWREKGAV-TQVKNQGNCGSCW 289
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A S +E A+ I N+L+ LS+Q
Sbjct: 290 AFSTTGNVEGAWFIAKNKLVSLSEQ 314
>gi|308506829|ref|XP_003115597.1| CRE-TAG-196 protein [Caenorhabditis remanei]
gi|308256132|gb|EFP00085.1| CRE-TAG-196 protein [Caenorhabditis remanei]
Length = 475
Score = 101 bits (252), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 56/145 (38%), Positives = 82/145 (56%), Gaps = 4/145 (2%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N F DF+ +E++Y + E+ +RF F+ N K I K+EQGTA YG +F+DMT EF
Sbjct: 170 NSFLDFIDRHEKRYSNKREVLKRFRTFKKNAKAIRELQKNEQGTAVYGFTKFSDMTTMEF 229
Query: 137 NHGLSSLDWEQ--IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ WEQ ++ FE S L ES +++DKG V +V++Q CGSCW
Sbjct: 230 KQTMLPYQWEQPVYPMDQADFEKEGITISEE-DLPESFDWRDKGAV-TQVKNQGNCGSCW 287
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A S +E A+ + N+L+ LS+Q
Sbjct: 288 AFSTTGNVEGAWFLAKNKLVSLSEQ 312
>gi|383863617|ref|XP_003707276.1| PREDICTED: uncharacterized protein LOC100880620 [Megachile
rotundata]
Length = 884
Score = 100 bits (248), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 70/206 (33%), Positives = 97/206 (47%), Gaps = 33/206 (16%)
Query: 31 CVHTYLGWHPRWTGR-----VHNLILQRSQPNSYGSEEASTFDLEEFLDHGNQ--FKDFV 83
C+ T W W R N L+ Q S + S L+ D+ ++ F+DFV
Sbjct: 526 CLFTI--WSQPWIDRGNPKITINCDLKNKQKRSLRGSQYSLKMLKMAEDYKDELLFEDFV 583
Query: 84 REYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSL 143
+ Y + Y S E R+ +FR NLK I+ K EQGTA YGV FAD+T EF L
Sbjct: 584 KTYNKTYLSAKEKADRYKVFRKNLKMIEKLRKFEQGTAVYGVTMFADLTPEEFKTKYLGL 643
Query: 144 DWEQIENLKSTFETYSFNSSNSYGLAESI----------NYKDKGKVLPKVQDQHLCGSC 193
+ N N L E++ ++++ V P V+DQ CGSC
Sbjct: 644 -------------KTNLNQENDIPLQEAVIPDIDLPPKFDWREYNAVTP-VKDQGQCGSC 689
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA+ +E YAIKH +L+ LS+Q
Sbjct: 690 WAFSAIGNIEGQYAIKHKKLLSLSEQ 715
>gi|339246873|ref|XP_003375070.1| viral cathepsin [Trichinella spiralis]
gi|316971622|gb|EFV55373.1| viral cathepsin [Trichinella spiralis]
Length = 496
Score = 100 bits (248), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 88/144 (61%), Gaps = 11/144 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
QFK+F++ +++ Y S+ E+ +R+DIF+ N+KT++ K+EQGTA YGV FAD+T EF
Sbjct: 195 QFKEFLKTFKKWYLSEKELLKRYDIFKVNMKTVEMLQKNEQGTAVYGVTFFADLTPEEFR 254
Query: 138 HGLSSLDW--EQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
S W +Q+ K++ + + ++++ V +V++Q +CGSCWA
Sbjct: 255 KFYLSPQWKRDQLPQRKASIPKGK--------IEDRWDWREHNAV-TEVKNQGMCGSCWA 305
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
+ +A +E +A+K EL+ LS+Q
Sbjct: 306 FATIANVEGVWAVKKGELVSLSEQ 329
>gi|312080834|ref|XP_003142769.1| ctsf protein [Loa loa]
Length = 437
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/161 (36%), Positives = 90/161 (55%), Gaps = 3/161 (1%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N F DF+++++R+Y S +E RF + NL ++ E+GTA YGV +F+DM+ EF
Sbjct: 133 NSFLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEF 192
Query: 137 NHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
L SL W+++ + ++ FN + + L E +++ KG V P V++Q CGSCWA
Sbjct: 193 QKTMLPSLWWDRVVSNGVEYDLKKFNLTFN-NLPEQFDWRTKGVVTP-VKNQGSCGSCWA 250
Query: 196 HSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLP 236
S +E +AIK +LI LS+Q R KG LP
Sbjct: 251 FSVTGNIEGLWAIKTGKLISLSEQELIDCDRIDKGCNGGLP 291
>gi|393906608|gb|EFO21301.2| ctsf protein [Loa loa]
Length = 472
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 59/161 (36%), Positives = 90/161 (55%), Gaps = 3/161 (1%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N F DF+++++R+Y S +E RF + NL ++ E+GTA YGV +F+DM+ EF
Sbjct: 168 NSFLDFIKKFKREYSSVAEQLDRFKKYMQNLHFVEKLQHEEKGTAIYGVTQFSDMSPEEF 227
Query: 137 NHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
L SL W+++ + ++ FN + + L E +++ KG V P V++Q CGSCWA
Sbjct: 228 QKTMLPSLWWDRVVSNGVEYDLKKFNLTFN-NLPEQFDWRTKGVVTP-VKNQGSCGSCWA 285
Query: 196 HSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLP 236
S +E +AIK +LI LS+Q R KG LP
Sbjct: 286 FSVTGNIEGLWAIKTGKLISLSEQELIDCDRIDKGCNGGLP 326
>gi|341878608|gb|EGT34543.1| hypothetical protein CAEBREN_26318 [Caenorhabditis brenneri]
Length = 478
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 83/144 (57%), Gaps = 3/144 (2%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N F DF+ +E++Y++ E+ +RF +F+ N K I K+EQGTA YG +F+DMT EF
Sbjct: 174 NSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEF 233
Query: 137 NHGLSSLDWEQ-IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ WEQ + ++ FE S L +S ++++ G V +V++Q CGSCWA
Sbjct: 234 KETMLPYQWEQPVPMDQANFEKEGVTISEE-DLPDSFDWREHGAV-TQVKNQGSCGSCWA 291
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E A+ + +L+ LS+Q
Sbjct: 292 FSTTGNIEGAWFLAKKKLVSLSEQ 315
>gi|341878637|gb|EGT34572.1| hypothetical protein CAEBREN_13324 [Caenorhabditis brenneri]
Length = 478
Score = 99.4 bits (246), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 52/144 (36%), Positives = 83/144 (57%), Gaps = 3/144 (2%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N F DF+ +E++Y++ E+ +RF +F+ N K I K+EQGTA YG +F+DMT EF
Sbjct: 174 NSFLDFIDRHEKRYENKREVLKRFRVFKRNAKVIRELQKNEQGTAVYGFTKFSDMTTMEF 233
Query: 137 NHGLSSLDWEQ-IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ WEQ + ++ FE S L +S ++++ G V +V++Q CGSCWA
Sbjct: 234 KETMLPYQWEQPVPMDQANFEKEGVTISEE-DLPDSFDWREHGAVT-QVKNQGSCGSCWA 291
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E A+ + +L+ LS+Q
Sbjct: 292 FSTTGNIEGAWFLAKKKLVSLSEQ 315
>gi|46948154|gb|AAT07059.1| cathepsin F-like cysteine proteinase, partial [Brugia malayi]
Length = 461
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 56/142 (39%), Positives = 81/142 (57%), Gaps = 3/142 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F+++++R+Y S E RF I+ N+ E+GTA YG +F+DMT EF
Sbjct: 159 FMTFIKKFKREYSSIEEQLDRFRIYLQNMNFAKKLQFEEKGTAIYGATKFSDMTAEEFQK 218
Query: 139 -GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L S+ W+++E+ TF FN S Y L +++ +G V P V+DQ CGSCWA S
Sbjct: 219 IMLPSIWWDRVESNGITFNLNDFNLS-IYNLPSKFDWRTEGVVTP-VKDQGSCGSCWAFS 276
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+ES +AIK +LI LS+Q
Sbjct: 277 VTGNIESLWAIKTGKLISLSEQ 298
>gi|348528696|ref|XP_003451852.1| PREDICTED: cathepsin F-like [Oreochromis niloticus]
Length = 475
Score = 98.6 bits (244), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 65/193 (33%), Positives = 101/193 (52%), Gaps = 20/193 (10%)
Query: 38 WHPRWTGRVHNLILQRSQPN-------SYGSEEASTFD-LEEFLDHGNQFKDFVREYERQ 89
W W G L+ Q+ QP S E+ ST LEE ++ QFK+F+ +Y +
Sbjct: 129 WDIPWQG-ASTLLKQKCQPKVESEVEESNKVEDPSTSQPLEESVELLGQFKEFMTKYNKV 187
Query: 90 YDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG-LSSL--DWE 146
Y S E++RR IF NLKT + +QG+A YGV +F+D+T+ EF L+ L W
Sbjct: 188 YSSQEEVDRRLRIFHENLKTAEKLQALDQGSAEYGVTKFSDLTEEEFRSTYLNPLLSQWT 247
Query: 147 QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAY 206
+ +K + +S +++D G V P V++Q +CGSCWA S + +E +
Sbjct: 248 LHQPMKPATPAKGPS-------PDSWDWRDHGAVSP-VKNQGMCGSCWAFSVIGNIEGQW 299
Query: 207 AIKHNELIELSKQ 219
+K+ L+ LS+Q
Sbjct: 300 FLKNGTLLSLSEQ 312
>gi|324522685|gb|ADY48108.1| Cathepsin L, partial [Ascaris suum]
Length = 308
Score = 98.6 bits (244), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 86/149 (57%), Gaps = 6/149 (4%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
+ HG F+ Y R Y + E+ +RF I++ NL+ + +EQGTA YG +F+D+T
Sbjct: 1 MVHGISVDGFIGRYNRTYSNKKEMLKRFRIYKRNLRAAKIWQANEQGTAIYGETQFSDLT 60
Query: 133 DSEFNHGLSSLDWE--QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
+EF + WE ++ N + F+ + ++ + ES ++++K V +V++Q C
Sbjct: 61 QAEFRKIMLPYKWETPKVPNKMANFKEFGIAQND---IPESFDWREKNAVT-EVKNQGSC 116
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA S +E A+AIK ++L+ LS+Q
Sbjct: 117 GSCWAFSVTGNIEGAWAIKTSKLVSLSEQ 145
>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
Length = 1032
Score = 97.4 bits (241), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 60/145 (41%), Positives = 81/145 (55%), Gaps = 12/145 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++FV Y R Y ++ E R IFR NL I K+EQGT YGVN+FAD++ EF+
Sbjct: 727 FENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQGTGQYGVNQFADVSTEEFHA 786
Query: 139 ---GL-SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
GL L E L+ E NS+ +++ KG V P V++Q +CGSCW
Sbjct: 787 FYLGLRPDLRTENNIPLRQA-EIPDIELPNSF------DWRQKGAVTP-VKNQGMCGSCW 838
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A S +E YAIKHN+L+ LS+Q
Sbjct: 839 AFSVTGNVEGQYAIKHNKLLSLSEQ 863
>gi|432880227|ref|XP_004073613.1| PREDICTED: cathepsin F-like [Oryzias latipes]
Length = 473
Score = 97.4 bits (241), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 65/212 (30%), Positives = 103/212 (48%), Gaps = 12/212 (5%)
Query: 14 ASVRGLTFQYEAEWASGCVHTYLGWHPRWTGRVHNLILQRSQPNS------YGSEEASTF 67
A VR F E + V + W W G L+ Q+ P + +E ++
Sbjct: 105 AMVRTCDFYPETQKLKTEVCVFEVWDIPWQG-TSTLLKQKCSPKAEVEETNRVAEPTNSQ 163
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
+EE + QFKDF+ +Y++ Y S E ERR IF+ NLKT + +QG+A YGV +
Sbjct: 164 PVEESVQLLGQFKDFMVKYKKDYSSQEEAERRLQIFQENLKTAEKLQALDQGSAEYGVTK 223
Query: 128 FADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
F+D+T+ EF S + + + +S +++D G V P V++Q
Sbjct: 224 FSDLTEEEFR----STYLNPLLSQWTLHRGMKPAPPAKTPAPDSWDWRDHGAVSP-VKNQ 278
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+CGSCWA S +E + +K+ L+ LS+Q
Sbjct: 279 GMCGSCWAFSVTGNIEGQWFLKNGTLLSLSEQ 310
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 96.7 bits (239), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 85/145 (58%), Gaps = 10/145 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F++++ EY R Y D E RRF IF+NN+K I+ + + + T G+N+F DMT SEF
Sbjct: 36 RFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFV 95
Query: 138 HGLSSLDWEQIENLKSTFE---TYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ + +L E SF+ N + +SI+++D G V +V++Q+ CGSCW
Sbjct: 96 AQYTGV------SLPLNIEREPVVSFDDVNISAVPQSIDWRDYGAV-NEVKNQNPCGSCW 148
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
+ +A+A +E Y IK L+ LS+Q
Sbjct: 149 SFAAIATVEGIYKIKTGYLVSLSEQ 173
>gi|410913409|ref|XP_003970181.1| PREDICTED: cathepsin F-like [Takifugu rubripes]
Length = 476
Score = 96.3 bits (238), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 66/195 (33%), Positives = 100/195 (51%), Gaps = 24/195 (12%)
Query: 38 WHPRWTGRVHNLILQRSQPNSYGSEE-------ASTFD-LEEFLDHGNQFKDFVREYERQ 89
W W G +L+ Q+ QP +E AST L+E ++ FK+F+ +Y +
Sbjct: 130 WDIPWQG-TSSLLSQKCQPKVELQQEETNEVTEASTRQPLKESVELLGLFKEFMTKYNKV 188
Query: 90 YDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-----NHGLSSLD 144
Y S E +RR IF+ NLKT + ++G+A YGV +F+D+T+ EF N LS
Sbjct: 189 YSSQEEADRRLQIFKENLKTAEKIQSLDEGSAEYGVTKFSDLTEEEFRLTYLNPLLS--Q 246
Query: 145 WEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLES 204
W +K S + S +++D G V P V++Q LCGSCWA S +E
Sbjct: 247 WTLRRPMKPASPARSPAPA-------SWDWRDHGAVSP-VKNQGLCGSCWAFSVTGNIEG 298
Query: 205 AYAIKHNELIELSKQ 219
+ +KH +L+ LS+Q
Sbjct: 299 QWFLKHGKLLSLSEQ 313
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 95.1 bits (235), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 55/147 (37%), Positives = 86/147 (58%), Gaps = 13/147 (8%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF- 136
+F++++ EY R Y + E RRF IF+NN+ I+ + H + T G+N+F DMT SEF
Sbjct: 36 RFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSHNGNSYTLGINQFTDMTKSEFV 95
Query: 137 ---NHGLS-SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
G+S L+ E+ SF+ N + +SI+++D G V +V++Q+ CGS
Sbjct: 96 AQYTGGISRPLNIER-------EPVVSFDDVNISAVPQSIDWRDYGAV-NEVKNQNPCGS 147
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA +A+A +E Y IK L+ LS+Q
Sbjct: 148 CWAFAAIATVEGIYKIKTGYLVSLSEQ 174
>gi|268554660|ref|XP_002635317.1| C. briggsae CBR-TAG-196 protein [Caenorhabditis briggsae]
Length = 477
Score = 94.7 bits (234), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 80/145 (55%), Gaps = 4/145 (2%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N F DF+ +E++Y + E+ +RF F+ N K I K+EQG+A YG +F+DMT EF
Sbjct: 172 NSFLDFIDRHEKRYSNKREVLKRFRTFKKNAKVIRELQKNEQGSAVYGFTKFSDMTTMEF 231
Query: 137 NHGLSSLDWEQ--IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ WEQ ++ FE S L +S +++D G V +V++Q CGSCW
Sbjct: 232 KQTMLPYQWEQPVYPMAEADFEKEGVTISED-DLPDSFDWRDHGAV-TQVKNQGNCGSCW 289
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A S +E A+ + +L+ LS+Q
Sbjct: 290 AFSTTGNVEGAWYLAKKKLVSLSEQ 314
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 84/145 (57%), Gaps = 10/145 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F++++ EY R Y + E RRF IF+NN+K I+ + + T G+N+F DMT SEF
Sbjct: 9 RFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDMTKSEFV 68
Query: 138 HGLSSLDWEQIENLKSTFE---TYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ + +L E SF+ N + +SI+++D G V +V++Q+ CGSCW
Sbjct: 69 AQYTGV------SLPLNIEREPVVSFDDVNISAVPQSIDWRDYGAV-NEVKNQNPCGSCW 121
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A +A+A +E Y IK L+ LS+Q
Sbjct: 122 AFAAIATVEGIYKIKTGYLVSLSEQ 146
>gi|244790097|ref|NP_001156454.1| cathepsin F isoform 2 precursor [Acyrthosiphon pisum]
Length = 586
Score = 94.7 bits (234), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 81/152 (53%), Gaps = 5/152 (3%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
++++ L F++F+ + + Y S E RRF IF N+K + HEQG+A YG +
Sbjct: 269 NIDDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQ 328
Query: 128 FADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
FAD+T +EF LD K T S S + ++++ V P V++Q
Sbjct: 329 FADLTKNEFKKKYLGLDSSMTS--KKTLPMAVIPQSAS--IPNEFDWRNHNVVTP-VKNQ 383
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SA+A +E YA+K EL+ LS+Q
Sbjct: 384 GACGSCWAFSAIANIEGQYALKSKELLSLSEQ 415
>gi|322801532|gb|EFZ22193.1| hypothetical protein SINV_14496 [Solenopsis invicta]
Length = 781
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 60/142 (42%), Positives = 73/142 (51%), Gaps = 6/142 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F DFV Y R Y S E R IFR NL I+ K EQ T YGVN FADM+ EF
Sbjct: 524 FDDFVATYNRTYSSPDERNLRLQIFRENLGIIELLQKTEQATGRYGVNMFADMSREEFRT 583
Query: 139 GLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L + Q EN E N L + +++ KG V P V++Q CGSCWA S
Sbjct: 584 RYLGLRPDLQSENEIPLQEAKFPN----IELPPTFDWRKKGVVTP-VKNQGGCGSCWAFS 638
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+E YAIKH +L+ LS+Q
Sbjct: 639 VTGNVEGQYAIKHGQLLSLSEQ 660
>gi|244790093|ref|NP_001156453.1| cathepsin F isoform 1 precursor [Acyrthosiphon pisum]
Length = 586
Score = 94.4 bits (233), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 81/152 (53%), Gaps = 5/152 (3%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
++++ L F++F+ + + Y S E RRF IF N+K + HEQG+A YG +
Sbjct: 269 NIDDRLQLKTDFENFIMTHNKIYTSLEEKSRRFRIFAANMKKVKLLQNHEQGSAIYGATQ 328
Query: 128 FADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
FAD+T +EF LD K T S S + ++++ V P V++Q
Sbjct: 329 FADLTKNEFKKKYLGLDSSMTS--KKTLPMAVIPQSAS--IPNEFDWRNHNVVTP-VKNQ 383
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SA+A +E YA+K EL+ LS+Q
Sbjct: 384 GACGSCWAFSAIANIEGQYALKSKELLSLSEQ 415
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 82/153 (53%), Gaps = 4/153 (2%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R Y D E E RF F+ N++ I+ + K+ VN++AD+T EF LD
Sbjct: 50 RVYKDDIEKEHRFKTFKENVEFIESFNKNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSL 109
Query: 148 IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYA 207
+ +ST T SF + + S++++ +G V V+DQ +CG CWA SA A +E AY
Sbjct: 110 LSQQESTATTTSFKYDSVTEVPNSMDWRKRGSV-TGVKDQGVCGCCWAFSAAAAIEGAYQ 168
Query: 208 IKHNELIELSKQPP---KTHGRFYKGGVMNLPH 237
I +NELI LS+Q T + +GG+M + +
Sbjct: 169 IANNELISLSEQQLLDCSTQNKGCEGGLMTVAY 201
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 84/145 (57%), Gaps = 10/145 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F++++ EY R Y + E RRF IF+NN+ I+ + + T G+N+F DMT++EF
Sbjct: 36 RFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFV 95
Query: 138 HGLSSLDWEQIENLKSTFE---TYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ + +L F+ SF+ N + +SI+++D G V +V+DQ+ CGSCW
Sbjct: 96 TQYTGV------SLPLNFKREPVVSFDDVNISAVGQSIDWRDYGAVT-EVKDQNPCGSCW 148
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+A +E Y I L+ LS+Q
Sbjct: 149 AFSAIATVEGIYKIVTGYLVSLSEQ 173
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 94.4 bits (233), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 82/142 (57%), Gaps = 4/142 (2%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F++++ EY R Y+ ++E RRF IF+NN+ I+ + + T GVN+F DMT++EF
Sbjct: 9 RFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMTNNEF- 67
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L+ + SF+ + + +SI+++D G V V++Q CGSCWA S
Sbjct: 68 --LARYTGASLPLNIERDPVVSFDDVDISAVPQSIDWRDYGAVT-SVKNQGSCGSCWAFS 124
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+A +E Y IK LI LS+Q
Sbjct: 125 AIATVEGIYKIKAGNLISLSEQ 146
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 82/142 (57%), Gaps = 4/142 (2%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
QF++++ EY R Y + E RF IF+NN+ I+ + + T G+N+F DMT++EF
Sbjct: 36 QFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNEFV 95
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ L N+K SF+ + + +SI+++D G V V++Q CGSCWA +
Sbjct: 96 AQYTGLSLPL--NIKRE-PVVSFDDVDISSVPQSIDWRDSGAVT-SVKNQGRCGSCWAFA 151
Query: 198 AVACLESAYAIKHNELIELSKQ 219
++A +ES Y IK L+ LS+Q
Sbjct: 152 SIATVESIYKIKRGNLVSLSEQ 173
>gi|402584107|gb|EJW78049.1| hypothetical protein WUBG_11042, partial [Wuchereria bancrofti]
Length = 213
Score = 93.6 bits (231), Expect = 8e-17, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 75/134 (55%), Gaps = 2/134 (1%)
Query: 86 YERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDW 145
Y R+Y S E +RF I++ NL+ E+GTA YG ++DMT EF + W
Sbjct: 1 YNRKYRSKKEFLKRFRIYKRNLRLAKLIQNKEEGTAIYGETPYSDMTQEEFRKIMLPYKW 60
Query: 146 EQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESA 205
EN K + + ++ + ES +++DKG V+ +V++Q CGSCWA S +E A
Sbjct: 61 PLNENKKQMIDLAEYGITDD-EIPESFDWRDKG-VVTEVKNQGSCGSCWAFSVTGNIEGA 118
Query: 206 YAIKHNELIELSKQ 219
+AIK +LI LS+Q
Sbjct: 119 WAIKKGKLISLSEQ 132
>gi|357619727|gb|EHJ72186.1| cathepsin [Danaus plexippus]
Length = 336
Score = 93.2 bits (230), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 87/144 (60%), Gaps = 11/144 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F+REY ++YDS E E RF IF NNLK I+ H+ A +G+N+F D++ EF
Sbjct: 41 FENFIREYNKKYDS-KEKEERFKIFVNNLKRINDLN-HKSTNAVHGINKFTDLSKEEFKK 98
Query: 139 GLSSLDWEQI---ENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ ++ +N+K + SFN + + +++DKG V+ +V++Q CGSCWA
Sbjct: 99 FYTGFKPDKSFLDDNIKKPSQ-LSFNIT----APPAFDWRDKG-VVTRVKNQGTCGSCWA 152
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S + +ES AIKH L+ELS+Q
Sbjct: 153 FSTIGNVESVNAIKHGNLVELSEQ 176
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 85/147 (57%), Gaps = 13/147 (8%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF- 136
+F++++ EY R Y + E RRF IF+NN+ I+ + + T G+N+F DMT++EF
Sbjct: 36 RFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFV 95
Query: 137 ---NHGLS-SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
G+S L+ E+ SF+ N + +SI+++D G V +V+DQ+ CGS
Sbjct: 96 AQYTGGISRPLNIEK-------EPVVSFDDVNISAVGQSIDWRDYGAVT-EVKDQNPCGS 147
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA SA+A +E Y I L+ LS+Q
Sbjct: 148 CWAFSAIATVEGIYKIVTGYLVSLSEQ 174
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 93.2 bits (230), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 87/149 (58%), Gaps = 8/149 (5%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFAD 130
+ G F+ F ++ + Y + +E +RF IFR NL+ I+ + +++QG +Y G+N+FAD
Sbjct: 21 EDGAHFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFAD 80
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
MT +EF L++ Q++ S T +F ++ + ESI+++ + V P ++DQ C
Sbjct: 81 MTRAEFKAMLAT----QVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTP-IKDQAQC 135
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA + V E AYA+ +L S+Q
Sbjct: 136 GSCWAFAVVGSTEGAYALSTGKLTRFSEQ 164
>gi|312378084|gb|EFR24752.1| hypothetical protein AND_10451 [Anopheles darlingi]
Length = 1785
Score = 92.8 bits (229), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 56/143 (39%), Positives = 83/143 (58%), Gaps = 5/143 (3%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
QF+ F ++RQY S E E R++IFRNNL ID +HE+GT YGV +FADMT +E+
Sbjct: 1477 QFEKFKLHHQRQYASSFEHEMRYNIFRNNLYKIDQLNRHERGTGKYGVTKFADMTTAEYR 1536
Query: 138 HGLSSLDWEQIEN-LKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ +Q N +++ T S ++ L S +++D G V V++Q CGSCWA
Sbjct: 1537 AHTGLIVPKQHSNHIRNPIATVSTERTS---LPTSFDWRDHGAVT-GVKNQGNCGSCWAF 1592
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SA+ +E + IK +L S+Q
Sbjct: 1593 SAIGNIEGLHQIKTKKLEAYSEQ 1615
>gi|242014216|ref|XP_002427787.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
gi|212512256|gb|EEB15049.1| Cathepsin F precursor, putative [Pediculus humanus corporis]
Length = 434
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 53/148 (35%), Positives = 83/148 (56%), Gaps = 23/148 (15%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FKDFV ++ + Y S E ++RF IFR N+K I++ K E+GTA YG+ F+D++ +EF +
Sbjct: 134 FKDFVLKFNKVYFSKEEFKKRFRIFRANMKKINFLNKAEKGTAQYGITEFSDLSVTEFKN 193
Query: 139 GL-------SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
L S L +I ++K L ++ +++ V P V++Q CG
Sbjct: 194 YLGLKKKPESKLPTAEIPDVK---------------LPDNFDWRHYNAVTP-VKNQGSCG 237
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
SCWA S +E +AIK +EL+ LS+Q
Sbjct: 238 SCWAFSVTGNIEGLWAIKKHELLSLSEQ 265
>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
Length = 291
Score = 92.4 bits (228), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 54/147 (36%), Positives = 85/147 (57%), Gaps = 13/147 (8%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF- 136
+F++++ EY R Y + E RRF IF+NN+ I+ + + T G+N+F DMT++EF
Sbjct: 36 RFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFV 95
Query: 137 ---NHGLS-SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
G+S L+ E+ SF+ N + +SI+++D G V +V+DQ+ CGS
Sbjct: 96 AQYTGGISRPLNIEKE-------PVVSFDDVNISAVGQSIDWRDYGAVT-EVKDQNPCGS 147
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA SA+A +E Y I L+ LS+Q
Sbjct: 148 CWAFSAIATVEGIYKIVTGYLVSLSEQ 174
>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
Length = 884
Score = 92.0 bits (227), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 59/154 (38%), Positives = 81/154 (52%), Gaps = 6/154 (3%)
Query: 67 FDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN 126
L E + F+ F++++ + Y+S E RF IF+ NLK I+ E+GTA YGV
Sbjct: 567 LKLAEEIKDETLFEAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERGTAEYGVT 626
Query: 127 RFADMTDSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQ 185
FAD+T EF L E + EN E + S L +++D V P V+
Sbjct: 627 MFADLTPKEFKARYLGLRPELKHENEIPLPEAEIPDVS----LPLKFDWRDHSVVTP-VK 681
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
DQ CGSCWA S +E YAIKHN+L+ LS+Q
Sbjct: 682 DQGQCGSCWAFSVTGNVEGQYAIKHNQLLSLSEQ 715
>gi|186688051|gb|ACC86111.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 100/194 (51%), Gaps = 22/194 (11%)
Query: 38 WHPRWTGRVHNLILQRSQPN-SYGSEEASTFD-------LEEFLDHGNQFKDFVREYERQ 89
W W G L+ Q+ QP + +E + + LEE ++ QFK+F+ +Y +
Sbjct: 129 WDIPWQG-TSTLLKQKCQPKVEFQVKETNEVEDLSINPPLEESVELLGQFKEFMVKYNKV 187
Query: 90 YDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG-LSSL--DWE 146
Y S E +RR IF NLKT + +QG+A YGV +F+D+T+ EF L+ L W
Sbjct: 188 YSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFRSTYLNPLLSQWT 247
Query: 147 QIENLKSTFETYSFNSSNSYGLAE-SINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESA 205
+K +S + G A S +++D G V V++Q +CGSCWA S +E
Sbjct: 248 LHRPMKP--------ASPAKGPAPASWDWRDHGAV-SSVKNQGMCGSCWAFSVTGNIEGQ 298
Query: 206 YAIKHNELIELSKQ 219
+ +K+ L+ LS+Q
Sbjct: 299 WFLKNGTLVSLSEQ 312
>gi|224555777|gb|ACN56478.1| cathepsin F [Paralichthys olivaceus]
Length = 475
Score = 91.7 bits (226), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 64/194 (32%), Positives = 100/194 (51%), Gaps = 22/194 (11%)
Query: 38 WHPRWTGRVHNLILQRSQPN-SYGSEEASTFD-------LEEFLDHGNQFKDFVREYERQ 89
W W G L+ Q+ QP + +E + + LEE ++ QFK+F+ +Y +
Sbjct: 129 WDIPWQG-TSTLLKQKCQPKVEFQVKETNEVEDLSINPPLEESVELLGQFKEFMVKYNKV 187
Query: 90 YDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG-LSSL--DWE 146
Y S E +RR IF NLKT + +QG+A YGV +F+D+T+ EF L+ L W
Sbjct: 188 YSSQDEADRRLSIFHENLKTAEKLQSLDQGSAEYGVTKFSDLTEEEFRSTYLNPLLSQWT 247
Query: 147 QIENLKSTFETYSFNSSNSYGLAE-SINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESA 205
+K +S + G A S +++D G V V++Q +CGSCWA S +E
Sbjct: 248 LHRPMKP--------ASPAKGPAPASWDWRDHGAV-SSVKNQGMCGSCWAFSVTGNIEGQ 298
Query: 206 YAIKHNELIELSKQ 219
+ +K+ L+ LS+Q
Sbjct: 299 WFLKNGTLVSLSEQ 312
>gi|67773374|gb|AAY81944.1| cysteine protease 6 [Paragonimus westermani]
Length = 325
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 58/170 (34%), Positives = 89/170 (52%), Gaps = 28/170 (16%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ F R+Y + Y +D + E+RF IF++NL Y EQGTA YGV +F+D+T EF
Sbjct: 32 YEQFKRDYGKSYANDDD-EKRFAIFKDNLVRAQNYQLQEQGTARYGVTQFSDLTPEEFAA 90
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
LSS +Q+E ++ ++ ES+++++ G V P V+DQ CGSCWA S
Sbjct: 91 KFLSSRFDDQVERVQ---------LNDLKAAPESVDWRELGAVAP-VEDQGSCGSCWAFS 140
Query: 198 AVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGG 231
+E + +K +L+ LSKQ PP T+G + G
Sbjct: 141 VAGNVEGQWFLKTGQLVSLSKQQLVDCDVQDSGCDGGYPPTTYGEIIRMG 190
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 91.3 bits (225), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 87/149 (58%), Gaps = 8/149 (5%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFAD 130
+ G F+ F ++ + Y + +E +RF IFR NL+ I+ + +++QG +Y G+N+FAD
Sbjct: 21 EDGVHFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFAD 80
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
MT +EF L++ Q++ S T +F ++ + ESI+++ + V P ++DQ C
Sbjct: 81 MTRAEFKAMLAT----QVKTKPSIVATKTFQLADGVSVPESIDWRSRNVVTP-IKDQAQC 135
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCW+ + V E AYA+ +L S+Q
Sbjct: 136 GSCWSFAVVGSTEGAYALSTGKLTRFSEQ 164
>gi|390994427|gb|AFM37363.1| cathepsin F1 [Dictyocaulus viviparus]
Length = 459
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 82/147 (55%), Gaps = 9/147 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
NQF DF+ +E+ Y+S + +RF +F+ NLK I + + E+GTA YG+ +F+D+T EF
Sbjct: 155 NQFVDFMGRHEKVYNSKHDTLKRFRVFKRNLKAIRSWQEKEEGTAVYGITQFSDLTPEEF 214
Query: 137 NHGLSSLDWEQ--IEN--LKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
W++ + N + T E N + L ES +++D G V V++Q CGS
Sbjct: 215 KKIYLPYIWDEPIVPNRMVDLTAEGVHLNET----LPESFDWRDHGAVT-DVKNQGFCGS 269
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA S +E + + +L+ LS+Q
Sbjct: 270 CWAFSTTGNIEGQWFLAKKKLVSLSEQ 296
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 91.3 bits (225), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 95/184 (51%), Gaps = 12/184 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
+K +V ++ + Y+ E E+RF+IF++NL+ ID + + T G+N+FAD+T+ E+
Sbjct: 46 YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYRA 105
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L + + +KS + + L +S+N++D G V +V+DQ CGSCWA S
Sbjct: 106 KFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAV-SRVKDQGSCGSCWAFS 164
Query: 198 AVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLNHAVLNVG 255
A+A +E I ELI LS+Q R Y G C+ G Y+ + N G
Sbjct: 165 AIAAVEGINKIVSGELISLSEQELVDCDRSYDAG--------CNGGLMDYAFQFIIDNGG 216
Query: 256 YDNE 259
D E
Sbjct: 217 IDTE 220
>gi|223648298|gb|ACN10907.1| Cathepsin F precursor [Salmo salar]
Length = 474
Score = 90.9 bits (224), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 64/201 (31%), Positives = 100/201 (49%), Gaps = 20/201 (9%)
Query: 32 VHTYLGWHPRWTGRVHNLILQRSQP---------NSYGSEEASTFDLEEFLDHG---NQF 79
V + W W + ++ Q+ QP N ST +EE +D QF
Sbjct: 118 VCVFEVWDIPWESK-STILKQKCQPAVDPKPVETNKVEFLPLSTKPVEESVDSVELLGQF 176
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
K+F+ Y R Y S E +RR +F NLKT + +QGTA YGV +F+D+T+ EF
Sbjct: 177 KEFMVRYNRTYSSQEEADRRLRVFHENLKTAEKLQSLDQGTAEYGVTKFSDLTEEEFRTL 236
Query: 140 -LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L+ L +Q NL+ + + + + S ++++ G V P V++Q +CGSCWA S
Sbjct: 237 YLNPLLSQQ--NLQQSMKPAAMPRGPA---PPSWDWREHGAVSP-VKNQGMCGSCWAFSV 290
Query: 199 VACLESAYAIKHNELIELSKQ 219
+E + K +L+ LS+Q
Sbjct: 291 TGNIEGQWFAKTGKLVSLSEQ 311
>gi|390339264|ref|XP_791714.3| PREDICTED: putative cysteine proteinase CG12163-like
[Strongylocentrotus purpuratus]
Length = 453
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 86/152 (56%), Gaps = 13/152 (8%)
Query: 70 EEFLDHGNQF-KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRF 128
+E+ D ++F F REY RQ D +E E R+ +F N+ T++ + + EQGTA YG +F
Sbjct: 150 DEYRDLFDKFLMTFKREY-RQNDGTNEYEYRYSVFVQNMLTVEMFNQFEQGTAKYGPTKF 208
Query: 129 ADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG-LAESINYKDKGKVLPKVQDQ 187
ADMT++EF S LK T ++ G + E +++ G V P V++Q
Sbjct: 209 ADMTEAEFRKLQSG-------PLKKT--GIKKQAAIPQGPVPEEYDWRTHGAVTP-VKNQ 258
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+CGSCWA SA+ +E + IK ELI LS+Q
Sbjct: 259 GMCGSCWAFSAIGNMEGQWQIKKGELISLSEQ 290
>gi|357619726|gb|EHJ72185.1| cathepsin [Danaus plexippus]
Length = 1118
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 64/173 (36%), Positives = 94/173 (54%), Gaps = 21/173 (12%)
Query: 50 ILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKT 109
I + Q Y EEA T F+ F+++Y ++YD +SE E RF IF NNLK
Sbjct: 284 IRELGQRRLYSLEEAPTL-----------FEQFIKDYNKEYD-ESEKEERFKIFVNNLKD 331
Query: 110 IDYYTKHEQGTATYGVNRFADMTDSEFNH---GLSSLDWEQIENLKSTFETYSFNSSNSY 166
I+ + A YG+N+F+D++ EF GL E+ KST SFN +
Sbjct: 332 INAMNER-SSNAVYGINKFSDLSKEEFIKYYTGLKRDRCTTTEHHKSTDLPKSFNIT--- 387
Query: 167 GLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ +++ KG V+ V++Q CGSCWA SA A +ES +AIK +LI++S+Q
Sbjct: 388 -APDQFDWRKKG-VVSSVKNQRHCGSCWAFSAAANVESIHAIKTGKLIDVSEQ 438
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/168 (35%), Positives = 91/168 (54%), Gaps = 21/168 (12%)
Query: 55 QPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT 114
Q Y EEA T F+ F+++Y ++YD +SE E RF IF NNLK I+
Sbjct: 506 QRRLYSLEEAPTL-----------FEQFIKDYNKEYD-ESEKEERFKIFVNNLKDINAMN 553
Query: 115 KHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQI---ENLKSTFETYSFNSSNSYGLAES 171
+ A YG+N+F+D++ EF + L E+ E+ K T SFN + +
Sbjct: 554 ER-SSNAVYGINKFSDLSKEEFIKYYTGLKREESPSNEDHKKTDLPESFNVT----APDQ 608
Query: 172 INYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+++ KG V+ +++Q CGSCWA SA +ES +AIK +L+ +S+Q
Sbjct: 609 FDWRKKG-VVSSIKNQKHCGSCWAFSAAGNVESIHAIKTGKLVHVSEQ 655
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 59/168 (35%), Positives = 89/168 (52%), Gaps = 21/168 (12%)
Query: 55 QPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT 114
Q + Y EEA T F+ F+++Y ++YD +SE E RF IF NNLK I+
Sbjct: 806 QKHLYSLEEAPTL-----------FEQFIKDYNKEYD-ESEKEERFKIFVNNLKDINAMN 853
Query: 115 KHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQI---ENLKSTFETYSFNSSNSYGLAES 171
+ A YG+N+F+D++ EF + L E+ E+ K T SFN + +
Sbjct: 854 ER-SSNAVYGINKFSDLSKDEFVKFYTGLKREESPSNEDHKKTDLPKSFNVT----APDQ 908
Query: 172 INYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+++ KG V+ V+ Q C SCWA S +ES AIK +LI++S+Q
Sbjct: 909 FDWRKKG-VVSSVKFQGHCVSCWAFSVAGNVESINAIKTGKLIDVSEQ 955
Score = 63.2 bits (152), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 66/119 (55%), Gaps = 8/119 (6%)
Query: 121 ATYGVNRFADMTDSEFNHGLSSLDWEQI---ENLKSTFETYSFNSSNSYGLAESINYKDK 177
A YG+N+F+D++ EF + L E+ E+ K T SFN + + +++ K
Sbjct: 8 AVYGINKFSDLSKEEFVKYYTGLKREESPSNEDHKKTDLPESFNVT----APDQFDWRKK 63
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLP 236
G V+ +++Q CGSCWA SA A +ES +AIK +LI++S+Q ++ G LP
Sbjct: 64 G-VVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKLIDVSEQQLLDCDKYDSGCSGGLP 121
>gi|213513816|ref|NP_001133678.1| Cathepsin F precursor [Salmo salar]
gi|209154908|gb|ACI33686.1| Cathepsin F precursor [Salmo salar]
Length = 475
Score = 90.5 bits (223), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 61/182 (33%), Positives = 98/182 (53%), Gaps = 18/182 (9%)
Query: 40 PRWTGRVHNLILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERR 99
P T +V L L S+P + E+F++ QFK+F+ Y R Y S + +RR
Sbjct: 147 PVETNKVEFLSLSTSKPVE---------ETEDFVELLGQFKEFMVRYNRTYSSQEDTDRR 197
Query: 100 FDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG-LSSLDWEQIENLKSTFETY 158
IF NLKT + + GTA YGV +F+D+T+ EF L+ L +Q L+ + +
Sbjct: 198 LRIFHENLKTAEKLQSLDLGTAEYGVTKFSDLTEEEFRTLYLNPLLSQQ--KLQRSMKP- 254
Query: 159 SFNSSNSYGLA-ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELS 217
++ +G A S ++++ G V P V++Q +CGSCWA S +E + +K +L+ LS
Sbjct: 255 ---AAMPHGPAPPSWDWREHGAVSP-VKNQGMCGSCWAFSVTGNIEGQWFVKTGKLVSLS 310
Query: 218 KQ 219
+Q
Sbjct: 311 EQ 312
>gi|357619725|gb|EHJ72184.1| hypothetical protein KGM_03271 [Danaus plexippus]
Length = 338
Score = 90.1 bits (222), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 68/203 (33%), Positives = 104/203 (51%), Gaps = 30/203 (14%)
Query: 46 VHNLIL---------QRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEI 96
VH L+L + Q Y EEA T F+ F+++Y ++YD +SE
Sbjct: 10 VHVLVLFSIDQCKVRELGQRRLYSLEEAPTL-----------FEQFIKDYNKEYD-ESEK 57
Query: 97 ERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQI---ENLKS 153
E RF IF NNLK I+ + A YG+N+F+D++ EF + L E+ E+ K
Sbjct: 58 EERFKIFVNNLKDINAMNER-SSNAVYGINKFSDLSKEEFIKYYTGLKREESPSNEDHKK 116
Query: 154 TFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNEL 213
T SFN + + +++ KG V+ +++Q CGSCWA SA A +ES +AIK +L
Sbjct: 117 TDLPESFNVT----APDQFDWRKKG-VVSSIKNQKHCGSCWAFSAAANVESIHAIKTGKL 171
Query: 214 IELSKQPPKTHGRFYKGGVMNLP 236
I++S+Q ++ G LP
Sbjct: 172 IDVSEQQLLDCDKYDSGCSGGLP 194
>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
Length = 774
Score = 90.1 bits (222), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 59/144 (40%), Positives = 77/144 (53%), Gaps = 11/144 (7%)
Query: 79 FKDFVREYERQYDSDSEIER--RFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
F +F+ Y R Y S +ER RF IFR NL I+ + EQGT YGVN FADM+ EF
Sbjct: 470 FNNFMTTYNRTYSS---LERNLRFKIFRENLNFIEELRETEQGTGIYGVNMFADMSQKEF 526
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSS-NSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
L +L+S E + L S +++ KG V P V++Q CGSCWA
Sbjct: 527 RTRYLGLR----PDLQSENEIPLPKAEIPDIDLPSSFDWRQKGVVTP-VKNQGQCGSCWA 581
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E YAIKH +L+ LS+Q
Sbjct: 582 FSVTGNVEGQYAIKHGQLLSLSEQ 605
>gi|47212989|emb|CAF92720.1| unnamed protein product [Tetraodon nigroviridis]
Length = 142
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 81/147 (55%), Gaps = 15/147 (10%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF- 136
QFK+F+ +Y + Y+S E + R IF+ NLKT + ++G+A YG+ +F+D+T+ EF
Sbjct: 6 QFKEFMMKYSKVYNSQEEADHRLKIFKENLKTAEKIQSLDEGSAEYGITKFSDLTEEEFR 65
Query: 137 ----NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
N LS W + +K S S +++D G V P V++Q +CGS
Sbjct: 66 LTYLNPLLS--QWTLRQPMKRA-------SPARSPAPASWDWRDHGAVSP-VKNQGMCGS 115
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA S +E + +KH +L+ LS+Q
Sbjct: 116 CWAFSVTGNIEGQWFLKHGKLLSLSEQ 142
>gi|195054270|ref|XP_001994049.1| GH22731 [Drosophila grimshawi]
gi|193895919|gb|EDV94785.1| GH22731 [Drosophila grimshawi]
Length = 617
Score = 89.7 bits (221), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 82/141 (58%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F +Y+RQY + +E + R IFR NL+TI+ +E+G+A YG+ +FADMT +E+
Sbjct: 311 FHKFQLKYKRQYANTAEHQMRLRIFRQNLRTIEELNANERGSAKYGITQFADMTSTEYK- 369
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L + W++ E+ K T + + + + +++ K K + V++Q CGSCWA S
Sbjct: 370 -LHAGLWQRSED-KPTGGAAAVVPPYAGEMPKEFDWRQK-KAVTHVKNQGQCGSCWAFSV 426
Query: 199 VACLESAYAIKHNELIELSKQ 219
+E YAIK EL E S+Q
Sbjct: 427 TGNIEGLYAIKTGELEEFSEQ 447
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 98/176 (55%), Gaps = 20/176 (11%)
Query: 49 LILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLK 108
L + + P++ ++E S ++ +F++++ EY R Y + E RRF IF+NN+
Sbjct: 14 LCVMWASPSAASADEPSDPMMK-------RFEEWMVEYGRVYKDNDEKMRRFQIFKNNVN 66
Query: 109 TIDYYTKHEQGTATYGVNRFADMTDSEF----NHGLS-SLDWEQIENLKSTFETYSFNSS 163
I+ + + + T G+N+F DMT++EF G+S L+ E+ SF+
Sbjct: 67 HIETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPLNIER-------EPVVSFDDV 119
Query: 164 NSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ + +SI+++D G V V++Q+ CG+CWA +A+A +ES Y IK L LS+Q
Sbjct: 120 DISAVPQSIDWRDYGAVT-SVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQ 174
>gi|389986405|gb|AFL46179.1| INT9 protein, partial [Solanum lycopersicum]
Length = 189
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 53/148 (35%), Positives = 82/148 (55%), Gaps = 8/148 (5%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLK 152
E E RF F+ N++ I+ + K+ GT Y +N++AD+T EF LD + +
Sbjct: 4 EKEHRFKTFKENVEFIESFNKN--GTQRYKLAINKYADLTTEEFTTSFMGLDTSLLSQQE 61
Query: 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE 212
ST T SF + + S++++ +G V V+DQ +CG CWA SA A +E AY I +NE
Sbjct: 62 STATTTSFKYDSVTEVPNSMDWRKRGSV-TGVKDQGVCGCCWAFSAAAAIEGAYQIANNE 120
Query: 213 LIELSKQP---PKTHGRFYKGGVMNLPH 237
LI LS+Q T + +GG+M + +
Sbjct: 121 LISLSEQQLLDCSTQNKGCEGGLMTVAY 148
>gi|30387350|ref|NP_848429.1| cathepsin [Choristoneura fumiferana MNPV]
gi|1168799|sp|P41715.1|CATV_NPVCF RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332509|gb|AAA96732.1| cathepsin [Choristoneura fumiferana MNPV]
gi|30270084|gb|AAP29900.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 89.4 bits (220), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 57/162 (35%), Positives = 89/162 (54%), Gaps = 10/162 (6%)
Query: 59 YGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQ 118
YG+ + + +D+ L N F+DF+ ++ + Y S+SE RRF IFR+NL+ I H
Sbjct: 11 YGAVQCAAYDV---LKAPNYFEDFLHKFNKSYSSESEKLRRFQIFRHNLEEI-INKNHND 66
Query: 119 GTATYGVNRFADMTDSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDK 177
TA Y +N+FAD++ E + L Q +N E + G E +++
Sbjct: 67 STAQYEINKFADLSKDETISKYTGLSLPLQTQNF---CEVVVLDRPPDKGPLE-FDWRRL 122
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
KV V++Q +CG+CWA + + LES +AIKHN+ I LS+Q
Sbjct: 123 NKV-TSVKNQGMCGACWAFATLGSLESQFAIKHNQFINLSEQ 163
>gi|387015020|gb|AFJ49629.1| Cathepsin H [Crotalus adamanteus]
Length = 337
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 81/143 (56%), Gaps = 11/143 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
FK + ++ R Y S+ E R IF +N + ID KH G +++ G+N+F+DMT +EF
Sbjct: 36 FKAWASQHRRAYRSEEEFRHRLQIFLDNKQKID---KHNAGNSSFRMGLNQFSDMTFTEF 92
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
W++ +N +T + ++ ++I+++ KGK + V++Q CGSCW
Sbjct: 93 RK---KYLWQEPQNCSATMGNFPRSAGPC---PKAIDWRKKGKFVSPVKNQGSCGSCWTF 146
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S CLESA AIK +L+ L++Q
Sbjct: 147 STTGCLESAIAIKTGKLLNLAEQ 169
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 56/176 (31%), Positives = 98/176 (55%), Gaps = 20/176 (11%)
Query: 49 LILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLK 108
L + + P++ ++E S ++ +F++++ EY R Y + E RRF IF+NN+
Sbjct: 14 LCVMWASPSAASADEPSDPMMK-------RFEEWMVEYGRVYKDNDEKMRRFQIFKNNVN 66
Query: 109 TIDYYTKHEQGTATYGVNRFADMTDSEF----NHGLS-SLDWEQIENLKSTFETYSFNSS 163
I+ + + + T G+N+F DMT++EF G+S L+ E+ SF+
Sbjct: 67 HIETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGISRPLNIER-------EPVVSFDDV 119
Query: 164 NSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ + +SI+++D G V V++Q+ CG+CWA +A+A +ES Y IK L LS+Q
Sbjct: 120 DISAVPQSIDWRDYGAVT-SVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQ 174
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 80/142 (56%), Gaps = 13/142 (9%)
Query: 83 VREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF----NH 138
+ EY R Y + E RRF IF+NN+ I+ + + T G+N+F DMT++EF
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 139 GLS-SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
G+S L+ E+ SF+ N + +SI+++D G V +V+DQ+ CGSCWA S
Sbjct: 61 GISRPLNIEK-------EPVVSFDDVNISAVGQSIDWRDYGAVT-EVKDQNPCGSCWAFS 112
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+A +E Y I L+ LS+Q
Sbjct: 113 AIATVEGIYKIVTGYLVSLSEQ 134
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 61/186 (32%), Positives = 98/186 (52%), Gaps = 17/186 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
+K +++++ + Y+ E +RF+IF+NNL+ ID + + T G+ +FAD+T+ E+
Sbjct: 28 YKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQNR-TYKVGLTKFADLTNQEYRA 86
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G S ++ K+ E Y++ + + L ES++++ KG V P ++DQ CGSCWA
Sbjct: 87 MFLGTRSDPKRRLMKSKNPSERYAYKAGDK--LPESVDWRGKGAVNP-IKDQGSCGSCWA 143
Query: 196 HSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLNHAVLN 253
S VA +E I ELI LS+Q RFY G C+ G Y+ + N
Sbjct: 144 FSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAG--------CNGGLMDYAFQFIINN 195
Query: 254 VGYDNE 259
G D E
Sbjct: 196 GGLDTE 201
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 89.0 bits (219), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 96/186 (51%), Gaps = 17/186 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
+ +++ ++ + Y+ E ERRF+IF++NLK +D + E + G+NRFAD+T+ E+
Sbjct: 47 YAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNS-ENRSYKVGLNRFADLTNEEYRS 105
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G + + KS Y+ S+ L ES+++++ G V P ++DQ CGSCWA
Sbjct: 106 MFLGTKTDSKRRFMKSKSASRRYAVQDSDM--LPESVDWRESGAVAP-IKDQGSCGSCWA 162
Query: 196 HSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLNHAVLN 253
S VA +E I E+I+LS+Q R Y G C+ G Y+ + N
Sbjct: 163 FSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAG--------CNGGLMDYAFEFIINN 214
Query: 254 VGYDNE 259
G D E
Sbjct: 215 GGIDTE 220
>gi|90592736|ref|YP_529689.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
gi|71559186|gb|AAZ38185.1| VCATH [Agrotis segetum nucleopolyhedrovirus]
Length = 343
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 85/145 (58%), Gaps = 11/145 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
F+ F+ +Y +QY +++E RF+IF +N++ I+ +A Y +NRFADMT +E
Sbjct: 45 FEQFISQYNKQYKNEAEKRHRFNIFMHNIEEINQKNSRND-SAVYKINRFADMTKNEVVI 103
Query: 137 -NHGLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ GL+S I L S F ET + S +++ KV V+DQ +CG+CW
Sbjct: 104 RHTGLAS-----IGELNSNFCETVVVDGPGQRQRPSSFDWRTYNKV-TSVKDQSMCGACW 157
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A +++ LES YAIK++ LI+L++Q
Sbjct: 158 AFASLGALESQYAIKYDRLIDLAEQ 182
>gi|307175778|gb|EFN65613.1| Putative cysteine proteinase CG12163 [Camponotus floridanus]
Length = 887
Score = 88.6 bits (218), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 55/142 (38%), Positives = 72/142 (50%), Gaps = 6/142 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F +FV Y R Y + E R IFR NL I K E+GTA Y VN FADM+ EF
Sbjct: 582 FNNFVVTYNRTYSTPEERNLRLRIFRENLGIIQLLRKTERGTAHYDVNMFADMSPEEFRS 641
Query: 139 GLSSLDWEQIENLKSTFETYSFNSS-NSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L +L+S + + L ++++K V P V+DQ +CGSCWA S
Sbjct: 642 RYLGLR----PDLRSENDIPLREAEIPDVELPPKFDWREKSVVTP-VKDQGMCGSCWAFS 696
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+E YAIKH L+ LS+Q
Sbjct: 697 VTGNIEGQYAIKHGRLLSLSEQ 718
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 88.6 bits (218), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 82/145 (56%), Gaps = 10/145 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F++++ EY R Y + E RRF IF+NN+ I+ + + T G+N+F DMT++EF
Sbjct: 36 RFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSRNGNSYTLGINQFTDMTNNEFV 95
Query: 138 HGLSSLDWEQIENLKSTFE---TYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ + +L E SF+ + + +SI++++ G V V++ CGSCW
Sbjct: 96 AQYTGV------SLPLNIEREPVVSFDDVDISAVPQSIDWRNYGAVT-SVKNHIPCGSCW 148
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A +A+A +ES Y IK LI LS+Q
Sbjct: 149 AFAAIATVESIYKIKRGYLISLSEQ 173
>gi|403183546|gb|EJY58173.1| AAEL017153-PA [Aedes aegypti]
Length = 1165
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/152 (37%), Positives = 81/152 (53%), Gaps = 14/152 (9%)
Query: 74 DHGNQ-FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
DH F+ F ++ R+Y S E E RF IF+NNL I+ K+EQGTA YG+ FADMT
Sbjct: 852 DHARHLFEKFKLKHSREYQSTLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGITHFADMT 911
Query: 133 DSEFNHGLSSL-----DWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
+E+ + D + N K+ + + L ES ++++ G V P V++Q
Sbjct: 912 SAEYRQRTGLVIPRDEDRNHVGNPKAEID-------ENMELPESFDWRELGAVSP-VKNQ 963
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S V +E + IK L E S+Q
Sbjct: 964 GNCGSCWAFSVVGNIEGLHQIKTKVLEEYSEQ 995
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 78/142 (54%), Gaps = 8/142 (5%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--NHG 139
++ EY+R Y +E RRF++F++N ++ + ++ GVN+FAD+T EF N G
Sbjct: 8 WMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEEFKANKG 67
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ E++ +E S ++ L +++++ KG V P +++Q CG CWA SA+
Sbjct: 68 FKPISAEEVPTTGFKYENLSVSA-----LPTAVDWRTKGAVTP-IKNQGQCGCCWAFSAI 121
Query: 200 ACLESAYAIKHNELIELSKQPP 221
A +E + L+ LS+Q P
Sbjct: 122 AAMEGIVKLSTGNLVSLSEQEP 143
>gi|15320768|ref|NP_203280.1| V-CATH [Epiphyas postvittana NPV]
gi|37077652|sp|Q91GE3.1|CATV_NPVEP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15213236|gb|AAK85675.1| V-CATH [Epiphyas postvittana NPV]
Length = 323
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 55/161 (34%), Positives = 91/161 (56%), Gaps = 9/161 (5%)
Query: 59 YGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQ 118
YG ++ +D+ L N F++FVR+Y +QYDS+ E RR+ IF++NL D TK+
Sbjct: 11 YGVVCSAAYDI---LKAPNYFEEFVRQYNKQYDSEYEKLRRYKIFQHNLN--DIITKNRN 65
Query: 119 GTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKG 178
TA Y +N+F+D++ E + L + ++ E + G E +++
Sbjct: 66 DTAVYKINKFSDLSKDETIAKYTGLSLPL--HTQNFCEVVVLDRPPGKGPLE-FDWRRFN 122
Query: 179 KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
K+ V++Q +CG+CWA + +A LES +AI H+ LI LS+Q
Sbjct: 123 KI-TSVKNQGMCGACWAFATLASLESQFAIAHDRLINLSEQ 162
>gi|74229746|ref|YP_308950.1| cathepsin [Trichoplusia ni SNPV]
gi|72259660|gb|AAZ67431.1| cathepsin [Trichoplusia ni SNPV]
Length = 344
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 90/157 (57%), Gaps = 5/157 (3%)
Query: 64 ASTFDLEEFLDHGNQ-FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTAT 122
A F L+ L+ Q F+ F +Y++ Y D+E + R+ IF+ NL+ I+ + +A
Sbjct: 31 APHFKLQYNLERAPQYFETFQTKYKKVYADDNERDYRYKIFKTNLEIINL-KNQQNDSAV 89
Query: 123 YGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLP 182
Y +N+FAD+T +E + L + NLK+ + + + Y E+ +++ K+
Sbjct: 90 YNINKFADLTKNEVIAKFTGLGVKS-PNLKNFCDPLIVDGPSKY-TQETFDWRQFNKI-T 146
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CGSCWA S +A LES YAIK+NE I+LS+Q
Sbjct: 147 SVKDQGFCGSCWAFSTIAGLESQYAIKYNEHIDLSEQ 183
>gi|161408101|dbj|BAF94154.1| cathepsin F-like cysteine protease [Plautia stali]
Length = 803
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 82/155 (52%), Gaps = 11/155 (7%)
Query: 71 EFLDHGNQFKDFVREY------ERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYG 124
E L Q KD+++ +R Y + E+++RF IFR N+K DY K EQGTA YG
Sbjct: 486 ELLGVDEQDKDYIKFKFFTKKFQRSYKTTEELKKRFRIFRANMKKADYLQKTEQGTAKYG 545
Query: 125 VNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
V F+D++ EF L ++ ++K E + L E ++++ V P V
Sbjct: 546 VTIFSDISSKEFKKHYLGLK-KRTPDIKFKQEMAQI---PNITLPEEYDWRNYNAVTP-V 600
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++Q +CGSCWA S +E YAIK L+ LS+Q
Sbjct: 601 KNQGMCGSCWAFSVTGNIEGQYAIKTGNLVSLSEQ 635
>gi|195152617|ref|XP_002017233.1| GL22196 [Drosophila persimilis]
gi|194112290|gb|EDW34333.1| GL22196 [Drosophila persimilis]
Length = 627
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 77/143 (53%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R+YD+ +E + R IFR NLKTI+ +E G+A YG+ FADMT +E+
Sbjct: 321 FHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTSTEYKE 380
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K T + + + +++ K V P V++Q CGSCWA
Sbjct: 381 RTGL----WQRDEQ-KPTGGAPAVVPAYEGEFPKEFDWRQKNAVTP-VKNQGSCGSCWAF 434
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E YA+K EL E S+Q
Sbjct: 435 SVTGNIEGLYAVKTGELKEFSEQ 457
>gi|390178852|ref|XP_003736743.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
gi|388859612|gb|EIM52816.1| GA27408, isoform B [Drosophila pseudoobscura pseudoobscura]
Length = 477
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 77/143 (53%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R+YD+ +E + R IFR NLKTI+ +E G+A YG+ FADMT +E+
Sbjct: 171 FHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTSTEYKE 230
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K T + + + +++ K V P V++Q CGSCWA
Sbjct: 231 RTGL----WQRDEQ-KPTGGAPAVVPAYEGEFPKEFDWRQKNAVTP-VKNQGSCGSCWAF 284
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E YA+K EL E S+Q
Sbjct: 285 SVTGNIEGLYAVKTGELKEFSEQ 307
>gi|198453932|ref|XP_002137768.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
gi|198132577|gb|EDY68326.1| GA27408, isoform A [Drosophila pseudoobscura pseudoobscura]
Length = 629
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 53/143 (37%), Positives = 77/143 (53%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R+YD+ +E + R IFR NLKTI+ +E G+A YG+ FADMT +E+
Sbjct: 323 FHKFQIRFGRRYDNTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADMTSTEYKE 382
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K T + + + +++ K V P V++Q CGSCWA
Sbjct: 383 RTGL----WQRDEQ-KPTGGAPAVVPAYEGEFPKEFDWRQKNAVTP-VKNQGSCGSCWAF 436
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E YA+K EL E S+Q
Sbjct: 437 SVTGNIEGLYAVKTGELKEFSEQ 459
>gi|341888719|gb|EGT44654.1| hypothetical protein CAEBREN_19265 [Caenorhabditis brenneri]
Length = 396
Score = 88.2 bits (217), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 57/165 (34%), Positives = 91/165 (55%), Gaps = 11/165 (6%)
Query: 64 ASTFDLE----EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
ASTF + +F QFKDF +++ R++ S E + RF++F+ NL+ I+ +
Sbjct: 69 ASTFKIRAEKLKFFGLQQQFKDFNKKFGREHKSLEEYKMRFEVFQKNLRDIEELN-LKNP 127
Query: 120 TATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSF-----NSSNSYGLAESINY 174
+ YG+NRF+D T+SE + L + S+ +T S N + + I++
Sbjct: 128 SVQYGINRFSDKTESELKNLLMDKKFMDSSLSNSSLKTLSSYRNPRNIIKNVQRPDYIDW 187
Query: 175 KDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++ GKV+ V+DQ CGSCWA + VA +ES YAI+ L LS+Q
Sbjct: 188 RNVGKVM-SVKDQGQCGSCWAFATVAAVESQYAIRKGTLWSLSEQ 231
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 58/184 (31%), Positives = 94/184 (51%), Gaps = 12/184 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
+K +V ++ + Y+ E E+RF+IF++NL+ ID + + T G+N+FAD+T+ E+
Sbjct: 45 YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYRA 104
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L + + +KS + + L +S++++D G V P V+DQ CGSCWA S
Sbjct: 105 KFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSP-VKDQGSCGSCWAFS 163
Query: 198 AVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLNHAVLNVG 255
+A +E I EL+ LS+Q R Y G C+ G Y+ + N G
Sbjct: 164 TIATVEGINKIVSGELVSLSEQELVDCDRSYDAG--------CNGGLMDYAFQFIMDNGG 215
Query: 256 YDNE 259
D E
Sbjct: 216 IDTE 219
>gi|194898683|ref|XP_001978897.1| GG11133 [Drosophila erecta]
gi|190650600|gb|EDV47855.1| GG11133 [Drosophila erecta]
Length = 615
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R+Y S +E + R IFR NLKTI+ +E G+A YG+ FAD+T SE+
Sbjct: 309 FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNTNEMGSAKYGITEFADLTSSEYKE 368
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K+T + + + L + +++ K V P V++Q CGSCWA
Sbjct: 369 RTGL----WQRDE-AKATGGSAAVVPAYHGELPKEFDWRQKNAVTP-VKNQGSCGSCWAF 422
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E YA+K EL E S+Q
Sbjct: 423 SVTGNIEGLYAVKTGELKEFSEQ 445
>gi|163914827|ref|NP_001106423.1| cathepsin F precursor [Xenopus (Silurana) tropicalis]
gi|157423494|gb|AAI53364.1| LOC100127591 protein [Xenopus (Silurana) tropicalis]
Length = 463
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 78/150 (52%), Gaps = 6/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+E L FKDFV Y ++Y E RR IF NLK + +QGTA YGV +++
Sbjct: 157 DEMLKTLTLFKDFVTTYNKKYSDQEEAARRLQIFSQNLKKAQMIQEMDQGTAEYGVTKYS 216
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T+ EF SL + + K ++ N + +++D G V +V++Q +
Sbjct: 217 DLTEDEFR----SLYLNPLLSSKPLYQMKKAIVPN-MSAPDQWDWRDHGAVT-EVKNQGM 270
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S + +E + +K L+ LS+Q
Sbjct: 271 CGSCWAFSVIGNIEGQWFLKKGSLVSLSEQ 300
>gi|20069912|ref|NP_613116.1| cathepsin [Mamestra configurata NPV-A]
gi|37077373|sp|Q8QLK1.1|CATV_NPVMC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|20043306|gb|AAM09141.1| cathepsin [Mamestra configurata NPV-A]
gi|33331744|gb|AAQ11052.1| putative cysteine proteinase [Mamestra configurata NPV-A]
Length = 337
Score = 87.8 bits (216), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 55/169 (32%), Positives = 93/169 (55%), Gaps = 23/169 (13%)
Query: 55 QPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT 114
+PN Y A + F+ F+ +Y +QY S+ E + R++IFR+N+++I+
Sbjct: 27 KPNLYNINSAPLY-----------FEKFISQYNKQYSSEDEKKYRYNIFRHNIESINA-K 74
Query: 115 KHEQGTATYGVNRFADMTDSEFNH---GLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAE 170
+A Y +NRFADMT +E + GL+S D + + F ET +
Sbjct: 75 NSRNDSAVYKINRFADMTKNEVVNRHTGLASGD------IGANFCETIVVDGPGQRQRPA 128
Query: 171 SINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ ++++ KV V+DQ +CG+CWA + + LES YAIK++ LI+L++Q
Sbjct: 129 NFDWRNYNKV-TSVKDQGMCGACWAFAGLGALESQYAIKYDRLIDLAEQ 176
>gi|313235882|emb|CBY11269.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 61/158 (38%), Positives = 88/158 (55%), Gaps = 9/158 (5%)
Query: 67 FDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN 126
FD+ E D QF++F+ E+ + Y S+ E RF F NLK I ++ EQG+A YGV
Sbjct: 39 FDVSE-DDARKQFENFLLEHPKMY-SEQESHSRFQTFWENLKRIKFHNHIEQGSAKYGVT 96
Query: 127 RFADMTDSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLA----ESINYKDKGKVL 181
FAD++D EF L E +I N K +E S NSS A E+ ++ +KG V
Sbjct: 97 EFADLSDFEFRRHYLGLKPELKIPNRKK-YERKSRNSSKKLKFAKTVDETFDWVEKGAV- 154
Query: 182 PKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+V++Q +CGSCWA S +E A+ +L+ LS+Q
Sbjct: 155 TEVKNQGMCGSCWAFSTTGNIEGAWFKATGDLVSLSEQ 192
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 87.8 bits (216), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 76/138 (55%), Gaps = 3/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ + R Y DSE RF+IF+ NLK ++ + + T VN+F+D+TD EF
Sbjct: 21 WMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDLTDEEFQARYM 80
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
L E + + +T SF N ES++++ +G V P V+DQ CG CWA +AVA
Sbjct: 81 GLVPEGMTG--DSQKTVSFRYENVSETGESMDWRLEGAVTP-VKDQGQCGCCWAFAAVAA 137
Query: 202 LESAYAIKHNELIELSKQ 219
+E I + EL+ LS+Q
Sbjct: 138 VEGVTKIANGELVSLSEQ 155
>gi|68304200|ref|YP_249668.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
gi|67973029|gb|AAY83995.1| VCATH [Chrysodeixis chalcites nucleopolyhedrovirus]
Length = 344
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/157 (35%), Positives = 89/157 (56%), Gaps = 5/157 (3%)
Query: 64 ASTFDLEEFLDHGNQ-FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTAT 122
A F L+ L+ Q F+ F +Y++ Y D+E + R+ IF+ NL+ I+ + +A
Sbjct: 31 APQFKLQYNLERAPQYFETFQTKYKKVYADDNERDYRYKIFKTNLEIINL-KNQQNDSAV 89
Query: 123 YGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLP 182
Y +N+FAD+T +E + L LK++ E + + Y E+ +++ K+
Sbjct: 90 YNINKFADLTKNEVIAKFTGLGIRS-PALKNSCEPVIVDGPSKY-TQETFDWRQFNKI-T 146
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CGSCWA S +A LES YAIK+NE ++LS+Q
Sbjct: 147 SVKDQGFCGSCWAFSTIAGLESQYAIKYNEHVDLSEQ 183
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 51/138 (36%), Positives = 79/138 (57%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +SE ERRF+IFRNN++ I+ + K +N FAD+T+ EF S
Sbjct: 41 WMVKYGRVYKDNSEKERRFEIFRNNVEFIESFNKPGNRPYKLDINEFADLTNEEFKA--S 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+++ N+ + E SF N + S++++ KG V P ++DQ CG CWA SAVA
Sbjct: 99 RNGYKRSSNVGLS-EKSSFRYGNVTAVPTSMDWRQKGAVTP-IKDQGQCGCCWAFSAVAA 156
Query: 202 LESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 157 MEGITKLSTGKLISLSEQ 174
>gi|195395906|ref|XP_002056575.1| GJ11017 [Drosophila virilis]
gi|194143284|gb|EDW59687.1| GJ11017 [Drosophila virilis]
Length = 599
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 80/143 (55%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F +Y+R+Y + +E + R IFR +LKTI +EQG+A YG+ FADMT +E+
Sbjct: 293 FHKFQVKYKRRYANSAEHQMRLRIFRQSLKTIQELNANEQGSAKYGITEFADMTSTEYAQ 352
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K T + + + L + +++ K V V++Q CGSCWA
Sbjct: 353 RAGL----WQRSEG-KPTGGAAAVVPAYAGELPKEFDWRQKNAVT-HVKNQGQCGSCWAF 406
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E AYAIK +L E S+Q
Sbjct: 407 SVTGNIEGAYAIKTGDLQEFSEQ 429
>gi|195111686|ref|XP_002000409.1| GI10216 [Drosophila mojavensis]
gi|193917003|gb|EDW15870.1| GI10216 [Drosophila mojavensis]
Length = 605
Score = 87.4 bits (215), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F +Y+R+Y + E + R IFR NL+TI +EQG+A YG+ FADMT SE+
Sbjct: 299 FHVFQIKYKRRYANSMEHQMRLRIFRQNLRTIQELNDNEQGSAKYGITEFADMTSSEYTQ 358
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ N K T + + L + ++++K V +V++Q CGSCWA
Sbjct: 359 RAGL----WQRSAN-KPTGGKPAVVPAYKGELPKEFDWREKNAVT-QVKNQGSCGSCWAF 412
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E YAIK EL E S+Q
Sbjct: 413 SVTGNIEGLYAIKTGELREFSEQ 435
>gi|118197532|ref|YP_874244.1| cathepsin [Ectropis obliqua NPV]
gi|113472527|gb|ABI35734.1| cathepsin [Ectropis obliqua NPV]
Length = 299
Score = 87.4 bits (215), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 80/139 (57%), Gaps = 4/139 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
FV Y + YD D E +R+ IFR+NL+ I+ K G+A Y +N+F+D++ SE +
Sbjct: 2 FVANYNKMYDDDLEKTKRYSIFRDNLRDINIKNKL-NGSAVYRINKFSDLSTSEIVLKYT 60
Query: 142 SLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
L E L + F +T + G + +++ + KV +++Q +CG+CWA + +A
Sbjct: 61 GLSVPPTERLTTNFCKTIVLDQPPGKG-PLNFDWRHQNKV-TSIKNQGVCGACWAFATLA 118
Query: 201 CLESAYAIKHNELIELSKQ 219
+ES YAIKHN I LS+Q
Sbjct: 119 SIESQYAIKHNVQINLSEQ 137
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 61/189 (32%), Positives = 104/189 (55%), Gaps = 24/189 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
++ ++ ++ + Y++ E ERRF+IF++NL+ I+ +H TY G+NRFAD+T+ E+
Sbjct: 54 YEAWLVKHGKSYNALGERERRFEIFKDNLRFIE---EHNAVNRTYKVGLNRFADLTNEEY 110
Query: 137 NHGLSSLDWEQIENLKST--FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
E L+++ + YSF + L ES+++++KG V+P V+DQ CGSCW
Sbjct: 111 RSRYLGRRDETRRGLRASRVSDRYSFRAGED--LPESVDWREKGAVVP-VKDQGNCGSCW 167
Query: 195 AHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHA 250
A S +A +E I +LI LS+Q K++ + GG+M+ Y+
Sbjct: 168 AFSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMD----------YAFEFI 217
Query: 251 VLNVGYDNE 259
+ N G D+E
Sbjct: 218 INNGGIDSE 226
>gi|194746631|ref|XP_001955780.1| GF16067 [Drosophila ananassae]
gi|190628817|gb|EDV44341.1| GF16067 [Drosophila ananassae]
Length = 620
Score = 87.0 bits (214), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 80/143 (55%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R+Y S +E + R IFR NLKTI+ +E G+A YG+ FADMT +E+
Sbjct: 314 FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSTEYKE 373
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K+T + + + S L + +++ K V V++Q CGSCWA
Sbjct: 374 RTGL----WQRDE-AKATGGSPAVVPAYSGELPKEFDWRSKNAVT-GVKNQGQCGSCWAF 427
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E YA+K+ EL E S+Q
Sbjct: 428 SVTGNIEGLYALKYGELKEFSEQ 450
>gi|313220237|emb|CBY31096.1| unnamed protein product [Oikopleura dioica]
Length = 371
Score = 87.0 bits (214), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 60/158 (37%), Positives = 88/158 (55%), Gaps = 9/158 (5%)
Query: 67 FDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN 126
FD+ E D QF++F+ E+ + Y S+ E RF F NLK I ++ EQG+A YGV
Sbjct: 39 FDVSE-DDARKQFENFLLEHPKMY-SEQESHSRFQTFWENLKRIKFHNHIEQGSAKYGVT 96
Query: 127 RFADMTDSEFNHGLSSLDWEQIENL-KSTFETYSFNSSNSYGLA----ESINYKDKGKVL 181
F D++D EF L E ++NL + +E S NSS A E+ ++ +KG V
Sbjct: 97 EFTDLSDFEFRRHYLGLKPE-LKNLNRKKYERKSRNSSKKLKFAKTADETFDWVEKGAV- 154
Query: 182 PKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+V++Q +CGSCWA S +E A+ +LI LS+Q
Sbjct: 155 TEVKNQGMCGSCWAFSTTGNIEGAWFKATGDLISLSEQ 192
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 87.0 bits (214), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 49/159 (30%), Positives = 89/159 (55%), Gaps = 6/159 (3%)
Query: 61 SEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT 120
+++A++ +L E L+ + + ++ ++ + Y D E RRF IF++N+ I+ + +
Sbjct: 22 ADQAASRELHE-LEMTGRHEKWMAKHGKVYKDDKEKLRRFQIFKSNVVFIESFNTAGNKS 80
Query: 121 ATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKV 180
G+N+FAD+T+ EF + L ++ + F N L SI+++ KG V
Sbjct: 81 YMLGINKFADLTNEEFRAFWNGYK----RPLGASRKITPFKYENVTALPSSIDWRSKGAV 136
Query: 181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P ++DQ +CGSCWA SAVA E + ++ +L+ LS+Q
Sbjct: 137 TP-IKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQ 174
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 80/140 (57%), Gaps = 2/140 (1%)
Query: 81 DFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSEFNHG 139
+++ E+ R Y +E R+ +F+ N++ I+ + G T VN+FAD+T+ EF
Sbjct: 40 EWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSM 99
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ + + ++ ++ + + +S L S++++ KG V P ++DQ LCGSCWA SAV
Sbjct: 100 YTGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTP-IKDQGLCGSCWAFSAV 158
Query: 200 ACLESAYAIKHNELIELSKQ 219
A +E IK +LI LS+Q
Sbjct: 159 AAIEGVAQIKKGKLISLSEQ 178
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 62/188 (32%), Positives = 95/188 (50%), Gaps = 17/188 (9%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
+ +K ++ ++ + Y+ E RF+IF+NNL+ ID + T G+ +FAD+T+ E+
Sbjct: 2 SMYKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNH-TYKVGLTKFADLTNEEY 60
Query: 137 NH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
G S ++ KS E Y+F + + L ES++++ KG V P ++DQ CGSC
Sbjct: 61 RAMFLGTRSDAKRRLMKSKSPSERYAFKAGDK--LPESVDWRAKGAVNP-IKDQGSCGSC 117
Query: 194 WAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLNHAV 251
WA S VA +E I ELI LS+Q R Y G C+ G Y+ +
Sbjct: 118 WAFSTVAAVEGINQIVTGELISLSEQELVDCDRTYNAG--------CNGGLMDYAFQFII 169
Query: 252 LNVGYDNE 259
N G D E
Sbjct: 170 NNGGLDTE 177
>gi|6649575|gb|AAF21461.1|U69120_1 cysteine proteinase PWCP1 [Paragonimus westermani]
Length = 427
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 85/141 (60%), Gaps = 6/141 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R++ + Y SD+ +R+ +F+ NL + + E+GTA YG+ +F+D++ EF H
Sbjct: 127 FEEFQRKFRKSYSSDTA--KRYALFKYNLLKMQLIQRLEKGTANYGITKFSDLSAEEFRH 184
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L+++ ++ ++ S ET F ++ L S +++ G V +V+DQ +CGSCWA +
Sbjct: 185 SLANM--KRRKSKGSQMETAIFPTTIQ-SLPPSFDWRANGAV-TEVKDQGMCGSCWAFAT 240
Query: 199 VACLESAYAIKHNELIELSKQ 219
+E + K N+LI LS+Q
Sbjct: 241 TGNIEGQWFRKTNKLISLSEQ 261
Score = 38.9 bits (89), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 18/31 (58%), Positives = 23/31 (74%), Gaps = 2/31 (6%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGY 256
+FY GG+ + PHMLCS+ L+HAVL VGY
Sbjct: 351 QFYLGGISHPPHMLCSEA--GLDHAVLLVGY 379
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 59/186 (31%), Positives = 98/186 (52%), Gaps = 19/186 (10%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ + + Y++ E ERRF+IF++NL+ +D + G+ G+NRFAD+T+ E+
Sbjct: 47 YEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNA-VAGSYRVGLNRFADLTNEEYRS 105
Query: 139 GLSSLDWEQIENLKST-FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ E E ST + Y+F + + L S+++++KG V P V+DQ CGSCWA S
Sbjct: 106 MFLGGNMEMKERSASTKSDRYAFRAGDK--LPGSVDWREKGAVSP-VKDQGQCGSCWAFS 162
Query: 198 AVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVLN 253
++ +E I ELI LS+Q K++ GG+M+ Y + N
Sbjct: 163 TISAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGGLMD----------YGFQFIINN 212
Query: 254 VGYDNE 259
G D E
Sbjct: 213 GGIDTE 218
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 80/140 (57%), Gaps = 2/140 (1%)
Query: 81 DFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSEFNHG 139
+++ E+ R Y +E R+ +F+ N++ I+ + G T VN+FAD+T+ EF
Sbjct: 34 EWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEFRSM 93
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ + + ++ ++ + + +S L S++++ KG V P ++DQ LCGSCWA SAV
Sbjct: 94 YTGFKGNSVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTP-IKDQGLCGSCWAFSAV 152
Query: 200 ACLESAYAIKHNELIELSKQ 219
A +E IK +LI LS+Q
Sbjct: 153 AAIEGVAQIKKGKLISLSEQ 172
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 53/140 (37%), Positives = 81/140 (57%), Gaps = 7/140 (5%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEFNHG 139
++ E R Y++ E ERRF +F +NLK +D + E G G+NRFAD+T+ EF
Sbjct: 52 WLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFRST 111
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
L + +E ++ E Y + L ES+++++KG V P V++Q CGSCWA SAV
Sbjct: 112 F--LGAKVVERSRAAGERYRHDGVEE--LPESVDWREKGAVAP-VKNQGQCGSCWAFSAV 166
Query: 200 ACLESAYAIKHNELIELSKQ 219
+ +ES + E+I LS+Q
Sbjct: 167 STVESINQLVTGEMITLSEQ 186
>gi|156546466|ref|XP_001607324.1| PREDICTED: hypothetical protein LOC100123649 [Nasonia vitripennis]
Length = 1036
Score = 86.7 bits (213), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 75/142 (52%), Gaps = 6/142 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F +F+ +Y++ Y + E E RF IF++NL I+ ++E GT YGV +F D+T +EF
Sbjct: 731 FHEFMGKYKKMYHNKEEKEMRFQIFKDNLNLIEELQRNEMGTGRYGVTQFTDLTKAEFKA 790
Query: 139 GLSSLDWEQIENLKSTFET-YSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L LKS + + L +++ V P V+DQ CGSCWA S
Sbjct: 791 RHLGLK----PTLKSENDIPMPMATIPDIELPSDYDWRHHNVVTP-VKDQGSCGSCWAFS 845
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+E YAIKH EL+ LS+Q
Sbjct: 846 VTGNIEGQYAIKHGELLSLSEQ 867
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 86.7 bits (213), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 78/143 (54%), Gaps = 4/143 (2%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+ ++ + R Y DSE RF+IF+ NLK ++ + + T T VN F+D+TD EF
Sbjct: 36 EQWMSRFHRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNKTYTLDVNEFSDLTDEEFKAR 95
Query: 140 LSSLDW-EQIENLKST--FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ L E + + +T ET SF N ES++++++G V V+ Q CG CWA
Sbjct: 96 YTGLVVPEGMTRMSTTDSHETVSFRYENVGETGESMDWREEGAV-TSVKHQQQCGCCWAF 154
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAVA +E I EL+ LS+Q
Sbjct: 155 SAVAAVEGMTKIAKGELVSLSEQ 177
>gi|358339354|dbj|GAA47434.1| cathepsin F [Clonorchis sinensis]
Length = 603
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 57/174 (32%), Positives = 87/174 (50%), Gaps = 25/174 (14%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
+++F ++Y++ Y +D + E RF +F+ NL EQGTA YGV +F D+T EF
Sbjct: 307 YEEFKQKYKKTYVNDDD-EYRFSVFKENLLRAHQLQTMEQGTAEYGVTQFFDLTSQEFQI 365
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAE-SINYKDKGKVLPKVQDQHLCGSCWAHS 197
+E ++ +T + S + E S +++D G V P V DQ CGSCWA S
Sbjct: 366 QYLGFKYEDMQ------DTEEMSPSTRVVMDEDSFDWRDHGAVGP-VLDQGKCGSCWAFS 418
Query: 198 AVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNL 235
+ +E + +K EL+ LS+Q PPKT+G K G + L
Sbjct: 419 TIGNIEGQWFLKTGELLSLSEQQLIDCDNVDEGCNGGYPPKTYGAVIKMGGLEL 472
Score = 42.0 bits (97), Expect = 0.27, Method: Compositional matrix adjust.
Identities = 21/50 (42%), Positives = 31/50 (62%), Gaps = 1/50 (2%)
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++ +++ G V P V +Q CGSCWA SAV +E + +K EL+ LS Q
Sbjct: 41 DNFDWRQHGAVGP-VWNQGPCGSCWAFSAVGNIEGQWFLKSGELLHLSVQ 89
Score = 39.7 bits (91), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 19/35 (54%), Positives = 24/35 (68%), Gaps = 2/35 (5%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
+FYK G+M+LP C P +LNHAVL VGY E+
Sbjct: 529 KFYKTGIMHLPVASCF--PRALNHAVLTVGYGTEN 561
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 79/138 (57%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +SE ERRF+IFRNN++ I+ + K +N FAD+T+ EF +S
Sbjct: 41 WMAKYGRVYKDNSEKERRFEIFRNNVEFIESFNKLGNRPYKLDINEFADLTNEEFK--VS 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+++ + T E SF +N + S++++ G V P ++DQ CG CWA SAVA
Sbjct: 99 KNGYKRSSGVGLT-EKSSFRYANVTAVPTSMDWRQNGAVTP-IKDQGQCGCCWAFSAVAA 156
Query: 202 LESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 157 MEGITKLSTGKLISLSEQ 174
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 83/142 (58%), Gaps = 4/142 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++ + Y++ E ++RF IFR+NLK ID E + G+NRFAD+T+ E+
Sbjct: 50 FESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNEEYRT 109
Query: 139 GLSSLDWEQIENL-KSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
G + N+ KS + Y+ + +S L +SI++++KG V V+DQ CGSCWA S
Sbjct: 110 GYLGAKRDASRNMVKSKSDRYAPVAGDS--LPDSIDWREKGAVT-GVKDQGSCGSCWAFS 166
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+A +E + LI LS+Q
Sbjct: 167 TIAAVEGVNQLATGNLISLSEQ 188
>gi|347968729|ref|XP_003436276.1| AGAP002879-PC [Anopheles gambiae str. PEST]
gi|333467870|gb|EGK96737.1| AGAP002879-PC [Anopheles gambiae str. PEST]
Length = 953
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 80/143 (55%), Gaps = 4/143 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + RQY S E E RF+IFRNNL I+ K E+GTA YGV +FADMT +E+
Sbjct: 643 FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEYRA 702
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL ++ ++ + + + + L S +++D G V +V++Q CGSCWA
Sbjct: 703 HTGLVVPKHDRANHVGNRVASEE-DVAGVGDLPRSFDWRDHGAVT-EVKNQGSCGSCWAF 760
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAV +E + IK +L S+Q
Sbjct: 761 SAVGNVEGLHQIKTKKLESYSEQ 783
>gi|22549430|ref|NP_689203.1| cath gene product [Mamestra configurata NPV-B]
gi|215401259|ref|YP_002332563.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|22476609|gb|AAM95015.1| putative cysteine proteinase [Mamestra configurata NPV-B]
gi|198448759|gb|ACH88549.1| cathepsin [Helicoverpa armigera multiple nucleopolyhedrovirus]
gi|390165231|gb|AFL64878.1| cathepsin [Mamestra brassicae MNPV]
gi|401665635|gb|AFP95747.1| putative cysteine proteinase [Mamestra brassicae MNPV]
Length = 341
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 86/145 (59%), Gaps = 12/145 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F+ +Y +QY S+ E + R++IFR+N+++I+ +A Y +NRFADMT +E +
Sbjct: 44 FEKFITQYNKQYSSEDEKKYRYNIFRHNIESINA-KNSRNDSAVYKINRFADMTKNEVVN 102
Query: 139 ---GLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
GL+S D + F ET + + ++++ KV V+DQ +CG+CW
Sbjct: 103 RHTGLASGD------TGANFCETIVVDGPGQRQRPANFDWRNYNKV-TSVKDQGMCGACW 155
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A + + LES YAIK++ LI+L++Q
Sbjct: 156 AFAGLGALESQYAIKYDRLIDLAEQ 180
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 86.3 bits (212), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 79/134 (58%), Gaps = 5/134 (3%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R DSD E RRF+IF+ N+K ID K + G G+N+FAD+++ EF + E+
Sbjct: 56 RSLDSD-EHARRFEIFKENVKHIDSVNKKD-GPYKLGLNKFADLSNEEFKAMHMTTKMEK 113
Query: 148 IENLKST--FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESA 205
++L+ E+ SF NS L SI+++ KG V P V++Q CGSCWA S +A +E
Sbjct: 114 HKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAVTP-VKNQGQCGSCWAFSTIASVEGI 172
Query: 206 YAIKHNELIELSKQ 219
IK +L+ LS+Q
Sbjct: 173 NYIKTGKLVSLSEQ 186
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 85.9 bits (211), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 83/153 (54%), Gaps = 6/153 (3%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH--- 138
++ ++ + Y+ E E+RF+IF+NNL+ ID + + T G+ RFAD+T+ E+
Sbjct: 51 WLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFL 110
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
G S ++ K+ + Y+F + + L ESI+++ G V ++DQ CGSCWA S
Sbjct: 111 GTKSDPKRRLMKSKNPSQRYAFKAGDV--LPESIDWRQSGAV-SAIKDQGSCGSCWAFST 167
Query: 199 VACLESAYAIKHNELIELSKQPPKTHGRFYKGG 231
+A +E I ELI LS+Q R Y G
Sbjct: 168 IAAVEGVNKIVTGELISLSEQELVDCDRSYNAG 200
>gi|182892046|gb|AAI65744.1| Ctsf protein [Danio rerio]
Length = 473
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 82/156 (52%), Gaps = 15/156 (9%)
Query: 69 LEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRF 128
++E ++ FK+F+ Y R Y S E E+R IF+ N+KT EQG+A YG+ +F
Sbjct: 165 MKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKF 224
Query: 129 ADMTDSEF-----NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPK 183
+D+T+ EF N LS W + +K S ++ +++D G V P
Sbjct: 225 SDLTEDEFRMMYLNPMLS--QWSLKKEMKPA-------IPASAPAPDTWDWRDHGAVSP- 274
Query: 184 VQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V++Q +CGSCWA S +E + K +L+ LS+Q
Sbjct: 275 VKNQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQ 310
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 87/142 (61%), Gaps = 5/142 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++D++ ++ + Y+S E ERRF++F++NL+ ID + E T G+NRFAD+T+ E+
Sbjct: 42 YEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNS-ENRTYRVGLNRFADLTNEEYRS 100
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L +L + L+ + Y+ +S L +S++++ +G V+ V+DQ CGSCWA S
Sbjct: 101 MYLGALSGIRRNKLRKISDRYTPRVGDS--LPDSVDWRKEGAVV-GVKDQGSCGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
AVA +E I +LI LS+Q
Sbjct: 158 AVAAVEGINKIVTGDLISLSEQ 179
>gi|341888721|gb|EGT44656.1| hypothetical protein CAEBREN_22029 [Caenorhabditis brenneri]
Length = 396
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 53/154 (34%), Positives = 86/154 (55%), Gaps = 7/154 (4%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
+F QFKDF +++R++ + E + RF+IF+ NL+ I+ + + YG+N+F+D
Sbjct: 80 KFFGLQQQFKDFNAKFQREHKTLEEYKMRFEIFQKNLRDIEELN-LKNPSVQYGINKFSD 138
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSF-----NSSNSYGLAESINYKDKGKVLPKVQ 185
T+SE + L + ST +T S N + + I++++ GKV+ V+
Sbjct: 139 KTESELKNLLMDKKFLDSSLSNSTLKTLSSYRNPRNIIKNVQRPDYIDWRNDGKVM-SVK 197
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
DQ CGSCWA + VA +ES YAI+ L LS+Q
Sbjct: 198 DQGQCGSCWAFATVAAVESQYAIRKGTLWSLSEQ 231
>gi|117606135|ref|NP_001071036.1| cathepsin F precursor [Danio rerio]
gi|115313533|gb|AAI24244.1| Cathepsin F [Danio rerio]
Length = 473
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 82/156 (52%), Gaps = 15/156 (9%)
Query: 69 LEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRF 128
++E ++ FK+F+ Y R Y S E E+R IF+ N+KT EQG+A YG+ +F
Sbjct: 165 MKESVELLTMFKNFMITYNRTYSSQEEAEKRLRIFQQNMKTAQTLQSLEQGSAEYGITKF 224
Query: 129 ADMTDSEF-----NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPK 183
+D+T+ EF N LS W + +K S ++ +++D G V P
Sbjct: 225 SDLTEDEFRMMYLNPMLS--QWSLKKEMKPA-------IPASAPAPDTWDWRDHGAVSP- 274
Query: 184 VQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V++Q +CGSCWA S +E + K +L+ LS+Q
Sbjct: 275 VKNQGMCGSCWAFSVTGNIEGQWFKKTGQLLSLSEQ 310
>gi|195343593|ref|XP_002038380.1| GM10654 [Drosophila sechellia]
gi|194133401|gb|EDW54917.1| GM10654 [Drosophila sechellia]
Length = 615
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R+Y S +E + R IFR NLKTI+ +E G+A YG+ FADMT SE+
Sbjct: 309 FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKE 368
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K+T + + + L + +++ K V +V++Q CGSCWA
Sbjct: 369 RTGL----WQRDE-AKATGGSAAVVPAYHGELPKEFDWRQKDAV-TQVKNQGSCGSCWAF 422
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E YA+K EL E S+Q
Sbjct: 423 SVTGNIEGLYAVKTGELKEFSEQ 445
>gi|167833701|gb|ACA02577.1| cathepsin [Spodoptera frugiperda MNPV]
Length = 340
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 86/145 (59%), Gaps = 12/145 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
F+ F+ +Y +QY S+ E + R++IFR+N+++I+ +A Y +NRFADMT +E
Sbjct: 43 FEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQ-KNSRNDSAVYKINRFADMTKNEIVI 101
Query: 137 -NHGLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ GL+S + L + F ET + + +++ KV V+DQ +CG+CW
Sbjct: 102 RHTGLASGE------LGANFCETVVVDGPAQRQRPANFDWRTLNKV-TSVKDQGMCGACW 154
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A + + LES YAIK++ LI+L++Q
Sbjct: 155 AFAGLGALESQYAIKYDRLIDLAEQ 179
>gi|125860143|ref|YP_001036312.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|120969288|gb|ABM45731.1| viral cathepsin [Spodoptera frugiperda MNPV]
gi|319997353|gb|ADV91251.1| V-CATH [Spodoptera frugiperda MNPV]
gi|384087478|gb|AFH58958.1| v-cath [Spodoptera frugiperda MNPV]
Length = 339
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 86/145 (59%), Gaps = 12/145 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
F+ F+ +Y +QY S+ E + R++IFR+N+++I+ +A Y +NRFADMT +E
Sbjct: 42 FEKFIAQYNKQYKSEDEKKYRYNIFRHNIESINQ-KNSRNDSAVYKINRFADMTKNEIVI 100
Query: 137 -NHGLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ GL+S + L + F ET + + +++ KV V+DQ +CG+CW
Sbjct: 101 RHTGLASGE------LGANFCETVVVDGPAQRQRPANFDWRTLNKV-TSVKDQGMCGACW 153
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A + + LES YAIK++ LI+L++Q
Sbjct: 154 AFAGLGALESQYAIKYDRLIDLAEQ 178
>gi|347968733|ref|XP_312034.5| AGAP002879-PA [Anopheles gambiae str. PEST]
gi|333467868|gb|EAA08025.5| AGAP002879-PA [Anopheles gambiae str. PEST]
Length = 1810
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 80/143 (55%), Gaps = 4/143 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + RQY S E E RF+IFRNNL I+ K E+GTA YGV +FADMT +E+
Sbjct: 1500 FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEYRA 1559
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL ++ ++ + + + + L S +++D G V +V++Q CGSCWA
Sbjct: 1560 HTGLVVPKHDRANHVGNRVASEE-DVAGVGDLPRSFDWRDHGAVT-EVKNQGSCGSCWAF 1617
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAV +E + IK +L S+Q
Sbjct: 1618 SAVGNVEGLHQIKTKKLESYSEQ 1640
>gi|74273320|gb|ABA01328.1| secreted cathepsin F [Teladorsagia circumcincta]
Length = 364
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 77/143 (53%), Gaps = 1/143 (0%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N F F+ +++ Y ++SE +RF IF+ NL+ I ++++GTA YG+N+FAD++ EF
Sbjct: 62 NHFTSFIERHDKVYRNESEALKRFGIFKRNLEIIRSAQENDKGTAIYGINQFADLSPEEF 121
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
W+Q ++ + + L ES ++++ G V KV+ + C +CWA
Sbjct: 122 KKTHLPHTWKQPDHPNRIVDLAAEGVDPKEPLPESFDWREHGAVT-KVKTEGHCAACWAF 180
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E + + +L+ LS Q
Sbjct: 181 SVTGNIEGQWFLAKKKLVSLSAQ 203
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 56/172 (32%), Positives = 83/172 (48%), Gaps = 28/172 (16%)
Query: 54 SQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY 113
SQ S EAS ++ E D++ Y R Y +E E+RF IF++N+ I+ +
Sbjct: 23 SQATSRSLHEASMYERHE---------DWMARYGRMYKDANEKEKRFKIFKDNVARIESF 73
Query: 114 TKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTF------ETYSFNSSNSYG 167
K T +N FAD+T+ EF +L++ F E +F N
Sbjct: 74 NKAMDKTYKLSINEFADLTNEEFR------------SLRNRFKAHICSEATTFKYENVTA 121
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ +I+++ KG V P ++DQ CG CWA SAVA E I +LI LS+Q
Sbjct: 122 VPSTIDWRKKGAVTP-IKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQ 172
>gi|24644155|ref|NP_730901.1| CG12163, isoform A [Drosophila melanogaster]
gi|32699625|sp|Q9VN93.2|CPR1_DROME RecName: Full=Putative cysteine proteinase CG12163; Flags:
Precursor
gi|23170427|gb|AAF52055.2| CG12163, isoform A [Drosophila melanogaster]
gi|27819876|gb|AAO24986.1| LP08529p [Drosophila melanogaster]
Length = 614
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R+Y S +E + R IFR NLKTI+ +E G+A YG+ FADMT SE+
Sbjct: 308 FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKE 367
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K+T + + + L + +++ K V +V++Q CGSCWA
Sbjct: 368 RTGL----WQRDE-AKATGGSAAVVPAYHGELPKEFDWRQKDAVT-QVKNQGSCGSCWAF 421
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E YA+K EL E S+Q
Sbjct: 422 SVTGNIEGLYAVKTGELKEFSEQ 444
>gi|347968731|ref|XP_003436277.1| AGAP002879-PB [Anopheles gambiae str. PEST]
gi|333467869|gb|EGK96736.1| AGAP002879-PB [Anopheles gambiae str. PEST]
Length = 1834
Score = 85.9 bits (211), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 80/143 (55%), Gaps = 4/143 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + RQY S E E RF+IFRNNL I+ K E+GTA YGV +FADMT +E+
Sbjct: 1524 FDKFRHHHRRQYASSMEHEMRFNIFRNNLFKIEQLNKFERGTAKYGVTKFADMTVAEYRA 1583
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL ++ ++ + + + + L S +++D G V +V++Q CGSCWA
Sbjct: 1584 HTGLVVPKHDRANHVGNRVASEE-DVAGVGDLPRSFDWRDHGAVT-EVKNQGSCGSCWAF 1641
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAV +E + IK +L S+Q
Sbjct: 1642 SAVGNVEGLHQIKTKKLESYSEQ 1664
>gi|24644153|ref|NP_649521.1| CG12163, isoform B [Drosophila melanogaster]
gi|23170426|gb|AAN13266.1| CG12163, isoform B [Drosophila melanogaster]
gi|378548248|gb|AFC17498.1| FI18603p1 [Drosophila melanogaster]
Length = 475
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R+Y S +E + R IFR NLKTI+ +E G+A YG+ FADMT SE+
Sbjct: 169 FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKE 228
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K+T + + + L + +++ K V +V++Q CGSCWA
Sbjct: 229 RTGL----WQRDE-AKATGGSAAVVPAYHGELPKEFDWRQKDAV-TQVKNQGSCGSCWAF 282
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E YA+K EL E S+Q
Sbjct: 283 SVTGNIEGLYAVKTGELKEFSEQ 305
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/128 (39%), Positives = 72/128 (56%), Gaps = 6/128 (4%)
Query: 94 SEIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENL 151
E ERRF +F +NLK +D + H E G G+NRFAD+T+ EF + L
Sbjct: 85 GEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGMNRFADLTNDEFR--AAYLGTTPAGRG 142
Query: 152 KSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN 211
+ E Y + + L +S++++DKG V+ V++Q CGSCWA SAVA +E I
Sbjct: 143 RHVGEMYRHDGVEA--LPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTG 200
Query: 212 ELIELSKQ 219
EL+ LS+Q
Sbjct: 201 ELVSLSEQ 208
>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
Length = 2676
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 90/163 (55%), Gaps = 15/163 (9%)
Query: 63 EASTFDLEEFLDHGNQFKDFVREYERQYDSDS-EIERRFDIFRNNLKTIDYYTKHEQGTA 121
EA+T ++ L + F +F+ Y+ +Y D ++ +RF+IF+ N++ + HE+GTA
Sbjct: 2355 EAATAEVYHHLQAEHLFYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHERGTA 2414
Query: 122 TYGVNRFADMTDSEFN---HGL--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD 176
TYGV RFAD+T EF+ G+ S D Q++ K+ + +S +++D
Sbjct: 2415 TYGVTRFADLTYEEFSTKHMGMKASLRDPNQVQFRKAVIPNVT--------APDSFDWRD 2466
Query: 177 KGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G V V+DQ CGSCWA S +E + +K +L+ LS+Q
Sbjct: 2467 HGAVT-GVKDQGSCGSCWAFSVTGNIEGQWKMKTGDLVSLSEQ 2508
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 49/128 (38%), Positives = 72/128 (56%), Gaps = 6/128 (4%)
Query: 94 SEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQIENL 151
E ERRF +F +NLK +D + H G + G+NRFAD+T+ EF + L
Sbjct: 85 GEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGMNRFADLTNDEFR--AAYLGTTPAGRG 142
Query: 152 KSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN 211
+ E Y + + L +S++++DKG V+ V++Q CGSCWA SAVA +E I
Sbjct: 143 RHVGEMYRHDGVEA--LPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIVTG 200
Query: 212 ELIELSKQ 219
EL+ LS+Q
Sbjct: 201 ELVSLSEQ 208
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 58/187 (31%), Positives = 100/187 (53%), Gaps = 19/187 (10%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ E+ + Y++ E E+RF+IF++NL+ ID + ++ + G+NRFAD+T+ E+
Sbjct: 50 MYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDR-SYKVGLNRFADLTNEEYK 108
Query: 138 HGLSSLDWEQIEN-LKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
E+ L + + Y F + L E++++++KG V+P V+DQ CGSCWA
Sbjct: 109 AMFLGTKMERKNRFLGTRSQRYLFKDGDD--LPENVDWREKGAVVP-VKDQGQCGSCWAF 165
Query: 197 SAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVL 252
S V +E I ELI LS+Q K++ + GG+M+ Y+ +
Sbjct: 166 STVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMD----------YAFEFIIN 215
Query: 253 NVGYDNE 259
N G D E
Sbjct: 216 NGGIDTE 222
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 76/140 (54%), Gaps = 4/140 (2%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ Y + Y E ERRF IF+ N+ I+ + T G+N+FAD+T+ EF
Sbjct: 40 EEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEF--- 96
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
++ + + S T +F N + +++++ KG V P ++DQ CG CWA SAV
Sbjct: 97 IAPRNRFKGHMCSSITRTTTFKYENVTAIPSTVDWRQKGAVTP-IKDQGQCGCCWAFSAV 155
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E +A+ +LI LS+Q
Sbjct: 156 AATEGIHALSAGKLISLSEQ 175
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 76/140 (54%), Gaps = 4/140 (2%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ Y + Y E ERRF IF+ N+ I+ + T G+N+FAD+T+ EF
Sbjct: 40 EEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEF--- 96
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
++ + + S T +F N + +++++ KG V P ++DQ CG CWA SAV
Sbjct: 97 IAPRNRFKGHMCSSITRTTTFKYENVTAIPSTVDWRQKGAVTP-IKDQGQCGCCWAFSAV 155
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E +A+ +LI LS+Q
Sbjct: 156 AATEGIHALSAGKLISLSEQ 175
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 76/140 (54%), Gaps = 4/140 (2%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ Y + Y E ERRF IF+ N+ I+ + T G+N+FAD+T+ EF
Sbjct: 40 EEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEF--- 96
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
++ + + S T +F N + +++++ KG V P ++DQ CG CWA SAV
Sbjct: 97 IAPRNRFKGHMCSSITRTTTFKYENVTAIPSTVDWRQKGAVTP-IKDQGQCGCCWAFSAV 155
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E +A+ +LI LS+Q
Sbjct: 156 AATEGIHALSAGKLISLSEQ 175
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 99/189 (52%), Gaps = 23/189 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ E+ + Y++ E ++RF IF++NLK ID + G+ +FAD+T+ E+
Sbjct: 49 YESWLIEHGKSYNALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRS 108
Query: 139 ----GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
SS D ++ KS + Y +S L ES++++DKG VL V+DQ CGSCW
Sbjct: 109 IYLGTKSSGDRRKLSKNKS--DRYLPKVGDS--LPESVDWRDKG-VLVGVKDQGSCGSCW 163
Query: 195 AHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHA 250
A SAVA +ES AI LI LS+Q K++ GG+M+ Y+
Sbjct: 164 AFSAVAAMESINAIVTGNLISLSEQELVDCDKSYNEGCDGGLMD----------YAFEFV 213
Query: 251 VLNVGYDNE 259
+ N G D E
Sbjct: 214 INNGGIDTE 222
>gi|195379496|ref|XP_002048514.1| GJ14012 [Drosophila virilis]
gi|194155672|gb|EDW70856.1| GJ14012 [Drosophila virilis]
Length = 327
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/148 (36%), Positives = 88/148 (59%), Gaps = 10/148 (6%)
Query: 76 GNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGTATY--GVNRFADMT 132
++F+ F EYE+ Y+ D E + R IF++N + ID + ++ G TY GVN+F DM
Sbjct: 26 ASEFESFKVEYEKSYEDDGEEQLRMQIFKDNKQLIDRHNERYAAGEETYEMGVNQFTDML 85
Query: 133 DSEFNH-GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
+EF L +L+ I + S+ E Y ++ +N+ + +++++KG V P V++Q CG
Sbjct: 86 ATEFRKIMLVNLN---ISDFTSSIE-YIYSPANAE-IPSQVDWREKGAVTP-VKNQGRCG 139
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
SCWA SA LE + I+ +LI LS+Q
Sbjct: 140 SCWAFSAAGALEGQHFIQTKQLIPLSEQ 167
>gi|344310882|gb|AEN03980.1| cathepsin-like cysteine proteinase [Helicoverpa armigera NPV strain
Australia]
Length = 367
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 83/152 (54%), Gaps = 13/152 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHE-----------QGTATYGVNR 127
FK F+++Y + YD E + R+++F++NL I+ + +A +GVN+
Sbjct: 57 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116
Query: 128 FADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
F+D T E H + + + + E + + L + +++D KV P ++DQ
Sbjct: 117 FSDKTPDEVLHSNTGF-FLNLSQHYTLCENRIVKGAPNIRLPDYYDWRDTNKVTP-IKDQ 174
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+CGSCWA A+ +ES YAI+HN+LI+LS+Q
Sbjct: 175 GVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQ 206
>gi|341886805|gb|EGT42740.1| hypothetical protein CAEBREN_23878 [Caenorhabditis brenneri]
Length = 396
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 92/164 (56%), Gaps = 13/164 (7%)
Query: 61 SEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT 120
+E+ +F L++ QFKDF +++ R++ S E + RF++F+ NL+ + + + +
Sbjct: 76 AEKLKSFGLQQ------QFKDFNKKFGREHKSLEEYKMRFEVFQKNLREFEELNQ-KNPS 128
Query: 121 ATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSF-----NSSNSYGLAESINYK 175
YG+N+F+D T+SE + L + ST +T S N + + I+++
Sbjct: 129 VQYGINKFSDKTESELKNLLMDKKFLDSSLSNSTLKTLSSYRNPRNIIKNVQRPDYIDWR 188
Query: 176 DKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ GKV+ V+DQ CGSCWA + VA +ES YAI+ L LS+Q
Sbjct: 189 NDGKVM-SVKDQGQCGSCWAFATVAAVESQYAIRKGTLWSLSEQ 231
>gi|12597541|ref|NP_075125.1| cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15426394|ref|NP_203611.1| cathepsin [Helicoverpa armigera NPV]
gi|12483807|gb|AAG53799.1|AF271059_56 cathepsin [Helicoverpa armigera nucleopolyhedrovirus G4]
gi|15384470|gb|AAK96381.1|AF303045_123 cathepsin [Helicoverpa armigera NPV]
gi|18027090|gb|AAL55725.1|AF268612_1 cathepsin [Helicoverpa armigera NPV]
Length = 365
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 83/152 (54%), Gaps = 13/152 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-----------TATYGVNR 127
FK F+++Y + YD E + R+++F++NL I+ + +A +GVN+
Sbjct: 55 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 114
Query: 128 FADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
F+D T E H + + + + E + + L + +++D KV P ++DQ
Sbjct: 115 FSDKTPDEVLHSNTGF-FLNLSQHYTLCENRIVKGAPNIRLPDYYDWRDTNKVTP-IKDQ 172
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+CGSCWA A+ +ES YAI+HN+LI+LS+Q
Sbjct: 173 GVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQ 204
>gi|121531592|gb|ABM55481.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 318
Score = 85.5 bits (210), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 59/146 (40%), Positives = 81/146 (55%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y S E RF IF+NNL+TI+ + K+++G TY GVN+FADMT
Sbjct: 18 DQWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEEHNAKYDKGEETYYMGVNQFADMTA 77
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF H L + + NL +T +S N ESI++ KG L V+DQ CGSC
Sbjct: 78 EEFRHMLGLQNGAR-PNLNATLHVFSENLQ----APESIDWTQKGADLG-VKDQGKCGSC 131
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA S+ LE AI H LS+Q
Sbjct: 132 WAFSSTGSLEGQNAIHHKVKTPLSEQ 157
>gi|215401412|ref|YP_002332715.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
gi|209483953|gb|ACI47386.1| cathepsin [Spodoptera litura nucleopolyhedrovirus II]
Length = 337
Score = 85.1 bits (209), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 87/145 (60%), Gaps = 12/145 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
F+ F+ +Y ++Y ++ E + R++IFR+N+++I++ +A Y +NRFADMT +E
Sbjct: 40 FEKFIAQYNKKYKTEDEKKYRYNIFRHNMESINHKNSRND-SAIYKINRFADMTKNEVVI 98
Query: 137 -NHGLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ GL+S + L + F ET + S +++ KV V+DQ +CG+CW
Sbjct: 99 RHTGLASGE------LGANFCETIVVDGPAQRQRPTSFDWRTLNKV-TSVKDQGMCGACW 151
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A + + LES YAIK++ LI+L++Q
Sbjct: 152 AFAGLGALESQYAIKYDRLIDLAEQ 176
>gi|18138384|ref|NP_542680.1| cathepsin [Helicoverpa zea SNPV]
gi|209401110|ref|YP_002273979.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
gi|37077430|sp|Q8V5U0.1|CATV_NPVHZ RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|18028766|gb|AAL56202.1|AF334030_127 ORF57 [Helicoverpa zea SNPV]
gi|209364362|dbj|BAG74621.1| viral cathepsin-like protein [Helicoverpa armigera NPV NNg1]
Length = 367
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 82/152 (53%), Gaps = 13/152 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHE-----------QGTATYGVNR 127
FK F+++Y + YD E + R+++F++NL I+ + +A +GVN+
Sbjct: 57 FKHFLQQYNKSYDDPKEYQYRYNVFKDNLNKINSQNRENLLNNKNNNDSLSTSAQFGVNK 116
Query: 128 FADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
F+D T E H + + + + E + L + +++D KV P ++DQ
Sbjct: 117 FSDKTPDEVLHSNTGF-FLNLSQHYTLCENRIVKGAPDIRLPDYYDWRDTNKVTP-IKDQ 174
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+CGSCWA A+ +ES YAI+HN+LI+LS+Q
Sbjct: 175 GVCGSCWAFVAIGNIESQYAIRHNKLIDLSEQ 206
>gi|294661899|gb|ADF28790.1| RE01479p [Drosophila melanogaster]
Length = 334
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R+Y S +E + R IFR NLKTI+ +E G+A YG+ FADMT SE+
Sbjct: 169 FYKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEELNANEMGSAKYGITEFADMTSSEYKE 228
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K+T + + + L + +++ K V +V++Q CGSCWA
Sbjct: 229 RTGL----WQRDEA-KATGGSAAVVPAYHGELPKEFDWRQKDAV-TQVKNQGSCGSCWAF 282
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E YA+K EL E S+Q
Sbjct: 283 SVTGNIEGLYAVKTGELKEFSEQ 305
>gi|195453400|ref|XP_002073772.1| GK14287 [Drosophila willistoni]
gi|194169857|gb|EDW84758.1| GK14287 [Drosophila willistoni]
Length = 610
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 59/196 (30%), Positives = 90/196 (45%), Gaps = 19/196 (9%)
Query: 38 WHPRWTGRVHNLILQ-RSQPNSYGSEEASTFDLEEFLDHGNQ-----------FKDFVRE 85
W W R H + + R+QP S E+ H F F +
Sbjct: 250 WAQPWLDRGHEITFKCRNQPIVLARHSRSVEWAEKKTGHKKHNHHSLDKVEHLFHKFQIK 309
Query: 86 YERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH--GLSSL 143
+ER+Y + E + R IFR NL+ I+ +E G+A YG+ FADMT +E+ GL
Sbjct: 310 FERRYVNSVERQMRLRIFRQNLRIIEQLNANEMGSAKYGITEFADMTSTEYKERTGL--- 366
Query: 144 DWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLE 203
W++ E + + S L + +++ KG V V++Q CGSCWA S + +E
Sbjct: 367 -WQRTEGQPTGGQKAVVPSYPGGELPKEFDWRQKGAV-SSVKNQGSCGSCWAFSTIGNIE 424
Query: 204 SAYAIKHNELIELSKQ 219
A+K +L E S+Q
Sbjct: 425 GLNAVKTGQLKEFSEQ 440
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/136 (32%), Positives = 75/136 (55%), Gaps = 8/136 (5%)
Query: 86 YERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--NHGLSSL 143
Y R Y +E RRF++F++NL ++ + ++ GVN+FAD+T EF N G +
Sbjct: 48 YGRVYKDAAEKARRFEVFKDNLAFVESFNADKKNKFWLGVNQFADLTTEEFKANKGFKPI 107
Query: 144 DWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLE 203
E++ +E S ++ L +++++ KG V P +++Q CG CWA SAVA +E
Sbjct: 108 SAEEVPTTGFKYENLSVSA-----LPTAVDWRTKGAVTP-IKNQGQCGCCWAFSAVAAME 161
Query: 204 SAYAIKHNELIELSKQ 219
+ + L+ LS+Q
Sbjct: 162 GIVKLSTDNLVSLSEQ 177
>gi|198427474|ref|XP_002119872.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 596
Score = 85.1 bits (209), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 83/142 (58%), Gaps = 6/142 (4%)
Query: 79 FKDFVREYERQYDSDS-EIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
F F+ +Y R Y S S E RF+IF+ N + + + + E+GTA YG+ +F DM++ E++
Sbjct: 169 FDMFLEKYPRTYSSSSDEYNERFEIFKTNYQVVQHLNEIERGTAVYGITKFMDMSEEEYH 228
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L+ + ++ T + +++N + +S++++ G V +V++Q CGSCWA S
Sbjct: 229 RTLAPGFTRPLVPIQ-TLNSAELDTTN---IPDSMDWRKHGAVT-EVKNQGSCGSCWAFS 283
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+E + +KH +LI LS+Q
Sbjct: 284 TTGNVEGQWFLKHKKLISLSEQ 305
Score = 44.3 bits (103), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 20/52 (38%), Positives = 34/52 (65%), Gaps = 1/52 (1%)
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ +S++++ G V +V++Q CGSCWA S +E + +KH +LI LS+Q
Sbjct: 475 IPDSMDWRKHGAVT-EVKNQGSCGSCWAFSTTGNVEGQWFLKHKKLISLSEQ 525
>gi|380025691|ref|XP_003696602.1| PREDICTED: putative cysteine proteinase CG12163-like [Apis florea]
Length = 881
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 74/144 (51%), Gaps = 4/144 (2%)
Query: 67 FDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN 126
L + + + F+DF+ ++ + + S +E + RF IF+ NLK I EQGTA YGV
Sbjct: 564 LKLAQNIKYETLFEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIIKELQTFEQGTAEYGVT 623
Query: 127 RFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
FAD+T EF E + ++ S+ + L +++D V P V+D
Sbjct: 624 MFADLTPKEFKTRYLGFRPELKQ--ENEIPLAKIEVSDIF-LPPKFDWRDYNAVTP-VKD 679
Query: 187 QHLCGSCWAHSAVACLESAYAIKH 210
Q LCGSCWA S +E YAIK+
Sbjct: 680 QGLCGSCWAFSVTGNVEGQYAIKY 703
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/168 (33%), Positives = 89/168 (52%), Gaps = 16/168 (9%)
Query: 58 SYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHE 117
SY + A+T +E L + ++ ++ ++ + Y++ E E+RF IF++NL+ ID + E
Sbjct: 60 SYDNAHAATSRSDEEL--MSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQE 117
Query: 118 QGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG------LAES 171
T G+NRFAD+T+ E+ + L T SN Y L ES
Sbjct: 118 DRTYKLGLNRFADLTNEEYRAKYLGTKIDPNRRLGKT-------PSNRYAPRVGDKLPES 170
Query: 172 INYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++++ +G V P V+DQ CGSCWA SA+ +E I ELI LS+Q
Sbjct: 171 VDWRKEGAV-PPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQ 217
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 55/165 (33%), Positives = 89/165 (53%), Gaps = 16/165 (9%)
Query: 61 SEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT 120
+++A+T EE L + ++ ++ ++ + Y++ E E+RF IF++NL+ ID + E T
Sbjct: 43 ADKAATLRTEEEL--MSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRT 100
Query: 121 ATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG------LAESINY 174
G+NRFAD+T+ E+ + L T SN Y L +S+++
Sbjct: 101 YKLGLNRFADLTNEEYRAKYLGTKIDPNRRLGKT-------PSNRYAPRVGDKLPDSVDW 153
Query: 175 KDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ +G V P V+DQ CGSCWA SA+ +E I ELI LS+Q
Sbjct: 154 RKEGAV-PPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQ 197
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 57/160 (35%), Positives = 81/160 (50%), Gaps = 7/160 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+ ++ + R Y DSE RF+IF NNLK ++ + T T VN F+D+TD EF
Sbjct: 36 EQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEEFKAR 95
Query: 140 LSSLDW-EQIENLKST--FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ L E + + +T ET SF N ES+++ +G V V+ Q CG CWA
Sbjct: 96 YTGLVVPEGMTRISTTDSHETVSFRYENVGETGESMDWIQEGAV-TSVKHQQQCGCCWAF 154
Query: 197 SAVACLESAYAIKHNELIELSKQPP---KTHGRFYKGGVM 233
SAVA +E I + EL+ LS+Q T GG+M
Sbjct: 155 SAVAAVEGMTKIANGELVSLSEQQLLDCSTENNGCGGGIM 194
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 84.7 bits (208), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 79/142 (55%), Gaps = 8/142 (5%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--N 137
++++ EY R Y +E RRF++F++N+ ++ + ++ G+N+FAD+T EF N
Sbjct: 37 ENWMVEYGRVYKDAAEKARRFEVFKDNVAFVESFNTNKNNKFWLGINQFADLTIEEFKAN 96
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
G + E++ +E S ++ L +++++ KG V P +++Q CG CWA S
Sbjct: 97 KGFKPISAEKVPTTGFKYENLSVSA-----LPTAVDWRTKGAVTP-IKNQGQCGCCWAFS 150
Query: 198 AVACLESAYAIKHNELIELSKQ 219
AVA +E + LI LS+Q
Sbjct: 151 AVAAMEGIVKLSTGNLISLSEQ 172
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 82/149 (55%), Gaps = 8/149 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F+D+ ++ + Y SD E RR IF + L I+ + T T G+N+F+D+T
Sbjct: 35 LEIKNMFEDWAAKHGKSYSSDLEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLT 94
Query: 133 DSEFN--HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
++EF H + L + E +S L S++++ KG V P ++DQ C
Sbjct: 95 NAEFRAMHVGKFKRPRYQDRLPAEDEDVDVSS-----LPTSLDWRQKGAVTP-IKDQGDC 148
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA SA+A +ESA+ + EL+ LS+Q
Sbjct: 149 GSCWAFSAIASIESAHFLATKELVSLSEQ 177
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 92/162 (56%), Gaps = 10/162 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ ++ + Y++ E E+RF IF++NL+ ID + E + G+NRFAD+T+ E+
Sbjct: 49 MYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNEEYR 108
Query: 138 HG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
L + ++ +KS + Y+ +S L ES++++ KG V P ++DQ CGSCWA
Sbjct: 109 STYLGAKSKPKLSKVKS--DRYAPRVGDS--LPESVDWRAKGAVAP-IKDQGSCGSCWAF 163
Query: 197 SAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMN 234
S V +E I ELI LS+Q K++ GG+M+
Sbjct: 164 STVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMD 205
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 60/186 (32%), Positives = 93/186 (50%), Gaps = 18/186 (9%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ E + Y+ E ERRF+IF++NLK ++ ++ T G+ RFAD+T+ EF
Sbjct: 42 MYERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFR 101
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
E+ + E Y + +S L ++I+++ KG V P V+DQ CGSCWA S
Sbjct: 102 AIYLRSKMERTR-VPVKGEKYLYKVGDS--LPDAIDWRAKGAVNP-VKDQGSCGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQPPKTHGRFYK----GGVMNLPHMLCSKGPYSLNHAVLN 253
A+ +E IK ELI LS+Q Y GG+M+ Y+ + N
Sbjct: 158 AIGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGGLMD----------YAFKFIIEN 207
Query: 254 VGYDNE 259
G D E
Sbjct: 208 GGIDTE 213
>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
Length = 320
Score = 84.7 bits (208), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 51/172 (29%), Positives = 93/172 (54%), Gaps = 8/172 (4%)
Query: 49 LILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLK 108
L++ + P + + + + L+ N F+D+ ++ + Y SD E RR IF + L
Sbjct: 13 LVVVGAAP--FAIARPAALEDDRALEIKNMFEDWAAKHGKSYSSDWEKARRMTIFSDTLA 70
Query: 109 TIDYYTKHEQGTATYGVNRFADMTDSEFNHG-LSSLDWEQIENLKSTFETYSFNSSNSYG 167
I+ + T T G+N+F+D+T++EF + + ++ + + SS
Sbjct: 71 YIEKHNALPNTTFTLGLNKFSDLTNAEFRANYVGKFKPPRYQDRRPAKDVDVDVSS---- 126
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L S++++ +G V P ++DQ CGSCWA SA+A +ESA+ + N+L+ LS+Q
Sbjct: 127 LPTSLDWRQEGAVTP-IKDQGQCGSCWAFSAIASIESAHFLATNQLVSLSEQ 177
>gi|9635308|ref|NP_059206.1| ORF58 [Xestia c-nigrum granulovirus]
gi|13124001|sp|Q9PYY5.1|CATV_GVXN RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|6175702|gb|AAF05172.1|AF162221_58 ORF58 [Xestia c-nigrum granulovirus]
Length = 346
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 83/143 (58%), Gaps = 4/143 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F +FV +Y + Y D E E RF+IF+ NL I+ E +A + +N AD++ +E
Sbjct: 43 FNEFVVKYNKVYKDDQEKEARFEIFKQNLADINARNALED-SAMFEINSRADISSNELLQ 101
Query: 139 GLSSLDWEQIEN-LKSTFETYSFNSSNSYG-LAESINYKDKGKVLPKVQDQHLCGSCWAH 196
L+ L + K++F T + S +S G + +S +++D+ V V+ Q CGSCWA
Sbjct: 102 KLTGLKLSLMRGEKKNSFCTPTVISGDSSGKVPDSFDWRDRNSV-TSVKMQKECGSCWAF 160
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAVA +ES Y IKHN ++LS+Q
Sbjct: 161 SAVANIESLYHIKHNVSLDLSEQ 183
>gi|46309423|ref|YP_006313.1| ORF31 [Agrotis segetum granulovirus]
gi|46200640|gb|AAS82707.1| ORF31 [Agrotis segetum granulovirus]
Length = 327
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 88/148 (59%), Gaps = 11/148 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSE--- 135
F+DFV++Y + Y S+ E + +FD F+NN+++I+ +A Y +N ++DM +E
Sbjct: 25 FEDFVQKYNKSYSSEEERQIKFDNFKNNIRSINE-KNSLSNSAVYDINFYSDMNKNELLR 83
Query: 136 ----FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
F L + + N+K + N + + L +S +++D+ V+ V++Q CG
Sbjct: 84 KQTGFKINLKKNNLDLSWNIKCNKKL--INGNPAVLLPDSFDWRDR-HVITSVKNQRDCG 140
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
SCWA S +A +ES YAIK+N+L++LS+Q
Sbjct: 141 SCWAFSTIANIESLYAIKYNKLLDLSEQ 168
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 84.3 bits (207), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 64/189 (33%), Positives = 99/189 (52%), Gaps = 23/189 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ E+ + Y++ E ++RF IF++NL+ ID + G+ +FAD+T+ E+
Sbjct: 49 YESWLIEHGKSYNALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRS 108
Query: 139 ----GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
SS D +++ KS + Y +S L ESI++++KG VL V+DQ CGSCW
Sbjct: 109 IYLGTKSSGDRKKLSKNKS--DRYLPKVGDS--LPESIDWREKG-VLVGVKDQGSCGSCW 163
Query: 195 AHSAVACLESAYAIKHNELIELSKQPPKTHGRFYK----GGVMNLPHMLCSKGPYSLNHA 250
A SAVA +ES AI LI LS+Q R Y GG+M+ Y+
Sbjct: 164 AFSAVAAMESINAIVTGNLISLSEQELVDCDRSYNEGCDGGLMD----------YAFEFV 213
Query: 251 VLNVGYDNE 259
+ N G D E
Sbjct: 214 IKNGGIDTE 222
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 82/149 (55%), Gaps = 8/149 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F+D+ ++ + Y SD E RR IF + L I+ + T T G+N+F+D+T
Sbjct: 31 LEIKNMFEDWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLT 90
Query: 133 DSEFN--HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
++EF H + L + E +S L S++++ KG V P ++DQ C
Sbjct: 91 NAEFRAMHVGKFKRPRYQDRLPAEDEDVDVSS-----LPTSLDWRQKGAVTP-IKDQGDC 144
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA SA+A +ESA+ + EL+ LS+Q
Sbjct: 145 GSCWAFSAIASIESAHFLATKELVSLSEQ 173
>gi|328788558|ref|XP_392381.3| PREDICTED: putative cysteine proteinase CG12163-like [Apis
mellifera]
Length = 881
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 49/132 (37%), Positives = 71/132 (53%), Gaps = 4/132 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+DF+ ++ + + S +E + RF IF+ NLK I+ EQGTA YGV FAD+T EF
Sbjct: 576 FEDFIIKFNKTFSSTNEKQNRFQIFKQNLKIINELQTFEQGTAEYGVTMFADLTPKEFKT 635
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
E + ++ S+ + L +++D V P V+DQ LCGSCWA S
Sbjct: 636 RYLGFRPELKQ--ENEIPLAKIEVSDIF-LPLKFDWRDYNVVTP-VKDQGLCGSCWAFSV 691
Query: 199 VACLESAYAIKH 210
+E YAIK+
Sbjct: 692 TGNVEGQYAIKY 703
>gi|405977658|gb|EKC42097.1| Cathepsin F [Crassostrea gigas]
Length = 715
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 50/141 (35%), Positives = 76/141 (53%), Gaps = 6/141 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F ++R Y S E + RF IF N++ E+GTA YGV +FADM++SEF
Sbjct: 418 FQQFQAAFKRLYMSKQEEKTRFKIFCENMRKAKKLQDVEKGTAVYGVTKFADMSESEFKQ 477
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + W+Q N + NS L S ++++ G V +V++Q CGSCWA S
Sbjct: 478 YVGKV-WDQ--NANKGMKKAKIPEMNS--LPNSFDWREHGAVT-EVKNQGSCGSCWAFST 531
Query: 199 VACLESAYAIKHNELIELSKQ 219
+E +AI +L+ LS+Q
Sbjct: 532 TGNIEGQWAISKKKLVSLSEQ 552
>gi|341876229|gb|EGT32164.1| hypothetical protein CAEBREN_11106 [Caenorhabditis brenneri]
Length = 389
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 49/155 (31%), Positives = 83/155 (53%), Gaps = 10/155 (6%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
++E+ + F DF+ +Y R+Y+ E+ RR+ IF N+K + K G VN
Sbjct: 77 NMEQEAKYFRMFNDFILKYNRRYEQPGELSRRYLIFVKNVKEFEAEEKKHLGV-DLDVNE 135
Query: 128 FADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSS---NSYGLAESINYKDKGKVLPKV 184
+ D TD E + + +N+ + E F S + SI+++D+GK+ P +
Sbjct: 136 YTDWTDDELKRMVI-----EKKNVITDLEAVRFEGSYLESGVKRPASIDWRDQGKLTP-I 189
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++Q CGSCWA + VA +E+ +AIK +L+ LS+Q
Sbjct: 190 KNQGQCGSCWAFATVAAVEAQHAIKKGQLVSLSEQ 224
>gi|292397748|ref|YP_003517814.1| cathepsin [Lymantria xylina MNPV]
gi|291065465|gb|ADD73783.1| cathepsin [Lymantria xylina MNPV]
Length = 335
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 90/158 (56%), Gaps = 11/158 (6%)
Query: 65 STFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH--EQGTAT 122
+ ++L+ D+ F+ FV Y + Y SD E +R+ IF++NL I+ + + TAT
Sbjct: 24 TAYNLQRAPDY---FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTAT 80
Query: 123 YGVNRFADMTDSEFNHGLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYKDKGKVL 181
YG+N+F+D++ SE + L I S F +T N G +++++ KV
Sbjct: 81 YGINKFSDLSKSELIAKFTGL---SIPQRASNFCKTIVLNQPPDKGPLH-FDWREQNKV- 135
Query: 182 PKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+++Q CG+CWA + +A +ES +A++HN L++LS+Q
Sbjct: 136 TSIKNQGACGACWAFATLASVESQFAMRHNRLVDLSEQ 173
>gi|209170907|ref|YP_002268053.1| agip23 [Agrotis ipsilon multiple nucleopolyhedrovirus]
gi|208436498|gb|ACI28725.1| viral cathepsin [Agrotis ipsilon multiple nucleopolyhedrovirus]
Length = 364
Score = 84.3 bits (207), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 51/145 (35%), Positives = 85/145 (58%), Gaps = 12/145 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
F+ F+ +Y + Y ++ E + R++IFR+N+++I++ +A Y +NRFADMT +E
Sbjct: 67 FEKFISQYNKHYKNEDEKKYRYNIFRHNIESINH-KNSRNDSAVYKINRFADMTKNEVVI 125
Query: 137 -NHGLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ GL+S + L F ET + S +++ KV V+DQ +CG+CW
Sbjct: 126 RHTGLASGE------LGVNFCETIVVDGPGQRQRPTSFDWRTLNKV-TSVKDQGMCGACW 178
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A + + LES YAIK++ LI+LS+Q
Sbjct: 179 AFAGLGALESQYAIKYDRLIDLSEQ 203
>gi|229366026|gb|ACQ57993.1| Cathepsin H precursor [Anoplopoma fimbria]
Length = 247
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 78/143 (54%), Gaps = 12/143 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG--TATYGVNRFADMTDSEF 136
FK ++ ++ R Y S E RF IF N + ID KH +G T T G+N+F+DMT SEF
Sbjct: 26 FKSWMAQHNRVY-SMQEYHERFQIFSENKRRID---KHNEGNHTFTMGLNQFSDMTFSEF 81
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
S W + +N +T Y SN ++I+++ KG + V++Q CGSCW
Sbjct: 82 R---KSFLWSEPQNCSATKGNYF---SNDGPHPDTIDWRKKGNYVTDVKNQGACGSCWTF 135
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S CLES AI +L+ LS+Q
Sbjct: 136 STTGCLESVTAISTGKLVPLSEQ 158
>gi|308462787|ref|XP_003093674.1| hypothetical protein CRE_29187 [Caenorhabditis remanei]
gi|308249538|gb|EFO93490.1| hypothetical protein CRE_29187 [Caenorhabditis remanei]
Length = 392
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 86/150 (57%), Gaps = 4/150 (2%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+EF ++ +F+DF +++++ + + E + RF IFR NLK ++ + + +N+F+
Sbjct: 82 QEFFENLQEFRDFNQKFQKIHKNSVEFKERFLIFRGNLKKLEIL-RSSNPDIDFSINQFS 140
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
DM+++E L + ST + SF+ + E I+++D GKV+ V++Q
Sbjct: 141 DMSENELKLILLDKKLLERNFQNSTLK--SFDLPMNLTRPERIDWRDSGKVM-SVKNQGA 197
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA + VA +ES YAI+ L LS+Q
Sbjct: 198 CGSCWAFATVAAVESQYAIRKGTLWSLSEQ 227
>gi|327239614|gb|AEA39651.1| cathepsin H [Epinephelus coioides]
Length = 261
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 79/143 (55%), Gaps = 12/143 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG--TATYGVNRFADMTDSEF 136
FK ++ +Y R Y S E R IF N + ID KH +G + T G+N+F+DMT EF
Sbjct: 5 FKSWMAQYNRVY-SLQEYYERLQIFTENKRRID---KHNEGNHSFTMGLNQFSDMTSKEF 60
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
S W + +N +T Y F+S+ Y ++I+++ KG + V++Q CGSCW
Sbjct: 61 K---KSFLWSEPQNCSATKGNY-FSSNGPY--PDTIDWRKKGNYVTDVKNQGGCGSCWTF 114
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S CLES AI +L+ LS+Q
Sbjct: 115 STTGCLESVIAINKGKLVPLSEQ 137
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 84.0 bits (206), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 81/142 (57%), Gaps = 6/142 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+D+ ++ + Y SDSE RR IF + L I+ + T T G+N+F+D+T++EF
Sbjct: 2 FEDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ + ++ + + SS L S++++ +G V P ++DQ CGSCWA S
Sbjct: 62 NYVGKFKSPRYQDRRPAKDVDVDVSS----LPTSLDWRQEGAVTP-IKDQGQCGSCWAFS 116
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+A +ESA+ + EL+ LS+Q
Sbjct: 117 AIASIESAHFLATKELVSLSEQ 138
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 83.6 bits (205), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 61/185 (32%), Positives = 93/185 (50%), Gaps = 21/185 (11%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH--- 138
++ ++ + Y+ E E+RF+IF++NLK ID + + T G+NRFAD+T+ E+
Sbjct: 49 WMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQNR-TYKVGLNRFADLTNEEYRAIYL 107
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
G S + LK+ Y+ L ES+++++ G V P V+DQ CGSCWA S
Sbjct: 108 GTRSDPKRRFAKLKNASPRYAVMPGEV--LPESVDWRETGAVNP-VKDQRSCGSCWAFST 164
Query: 199 VACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMNLPHMLCSKGPYSLNHAVLNV 254
VA +E I ELI LS+Q Y GG+M+ Y+ + + N
Sbjct: 165 VAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMD----------YAFDFIIKNG 214
Query: 255 GYDNE 259
G D E
Sbjct: 215 GLDTE 219
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 83.6 bits (205), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 78/142 (54%), Gaps = 8/142 (5%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--N 137
++++ EY R Y +E RRF+ F++N+ ++ + +++ GVN+FAD+T EF N
Sbjct: 37 ENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFADLTTEEFKAN 96
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
G + E + +E S ++ L +++++ KG V P +++Q CG CWA S
Sbjct: 97 KGFKPISAEMVPTTGFKYENLSVSA-----LPTAVDWRTKGAVTP-IKNQGQCGCCWAFS 150
Query: 198 AVACLESAYAIKHNELIELSKQ 219
AVA +E + LI LS+Q
Sbjct: 151 AVAAMEGIVKLSTGNLISLSEQ 172
>gi|449469923|ref|XP_004152668.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449520697|ref|XP_004167370.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 371
Score = 83.6 bits (205), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 78/141 (55%), Gaps = 7/141 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+DF ++ + Y +D E + RF +F+ NL+ + K + A +GV RF+D+T+SEF
Sbjct: 58 FQDFKLKFGKTYTTDEEHDYRFRVFKANLRKAKRHQKLDP-DAVHGVTRFSDLTESEFRE 116
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L+ L+ + + + LA +++D+G V P V+DQ CGSCW+ SA
Sbjct: 117 NFVGLN-----RLRLPADAHQAPILPTDNLASDFDWRDQGAVTP-VKDQGSCGSCWSFSA 170
Query: 199 VACLESAYAIKHNELIELSKQ 219
V LE A + +LI LS+Q
Sbjct: 171 VGALEGANFLSTGKLISLSEQ 191
>gi|114679921|ref|YP_758371.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
gi|39598652|gb|AAR28838.1| cathepsin [Leucania separata nuclear polyhedrosis virus]
Length = 359
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/173 (31%), Positives = 91/173 (52%), Gaps = 22/173 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ FVR+Y R Y E E+R++ F NLK I+ + Q A+Y +N+F+D+T E
Sbjct: 54 FERFVRDYNRTYIDSVEREQRYETFVQNLKNINRLNQKSQ--ASYDINKFSDLTKDEVVA 111
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESI-------------NYKDKGKVLPKVQ 185
+ LD +L + Y+ N+ Y L + + ++++ KV V+
Sbjct: 112 RFTGLD----PSLAAA--AYTDNNGTQYQLCKVVVVDGTPGRVPDLWDWRNSQKV-TSVK 164
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHM 238
Q +CGSCWA ++VA +ES YAI+H+ L++LS+Q + +G L H+
Sbjct: 165 QQGVCGSCWAFASVANIESQYAIRHDRLLDLSEQQLVDCDQIDQGCSGGLMHL 217
>gi|23577865|ref|NP_703114.1| viral cathepsin [Rachiplusia ou MNPV]
gi|37077115|sp|Q8B9D5.1|CATV_NPVR1 RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|23476510|gb|AAN28057.1| viral cathepsin [Rachiplusia ou MNPV]
Length = 323
Score = 83.6 bits (205), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 88/162 (54%), Gaps = 11/162 (6%)
Query: 59 YGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQ 118
YG ++ +DL L N F++FV + + Y S+ E RRF IF++NL I K++
Sbjct: 11 YGVVNSAAYDL---LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI--IIKNQN 65
Query: 119 GTATYGVNRFADMTDSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDK 177
+A Y +N+F+D++ E + L Q +N + G E +++
Sbjct: 66 DSAKYEINKFSDLSKDETIAKYTGLSLPIQTQNFCKVI---VLDQPPGKGPLE-FDWRRL 121
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
KV V++Q +CG+CWA + +A LES +AIKHN+LI LS+Q
Sbjct: 122 NKV-TSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQ 162
>gi|67773376|gb|AAY81945.1| cysteine protease 7 [Paragonimus westermani]
Length = 325
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 80/142 (56%), Gaps = 12/142 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ F R+Y + Y ++ + ++RF IF++NL Y EQGTA YGV +F+D+T EF
Sbjct: 32 YEQFKRDYGKAYANEDD-QKRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTPEEFAA 90
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L S E+++ ++ N + S++++ KG V P V+DQ CGSCWA S
Sbjct: 91 MYLGSRIDERVDRVQ-------LNDLQT--APASVDWRKKGAVGP-VEDQGSCGSCWAFS 140
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A +E + +K L+ LSKQ
Sbjct: 141 VTANVEGQWFLKTGRLVSLSKQ 162
Score = 38.5 bits (88), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 17/34 (50%), Positives = 23/34 (67%), Gaps = 2/34 (5%)
Query: 226 RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE 259
+FY+ G+++ +CS P LNHAVL VGYD E
Sbjct: 251 QFYQSGILHPSKAMCS--PEGLNHAVLTVGYDTE 282
>gi|74834619|sp|O97397.1|CATLL_PHACE RecName: Full=Cathepsin L-like proteinase; Flags: Precursor
gi|4210800|emb|CAA76927.1| thiol protease [Phaedon cochleariae]
Length = 324
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 53/144 (36%), Positives = 78/144 (54%), Gaps = 9/144 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY--GVNRFADMTDSE 135
+ DF + + R Y S E + RF+IF++ L+ I ++ K+E G +TY +N+F+D+TD E
Sbjct: 23 WADFKKTHARTYKSLREEKLRFNIFQDTLRQIAEHNVKYENGESTYYLAINKFSDITDEE 82
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
F L + E + E ESI+++ KG VLP V++Q CGSCWA
Sbjct: 83 FRDMLM-----KNEASRPNLEGLEVADLTVGAAPESIDWRSKGVVLP-VRNQGECGSCWA 136
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S A +ES AIK + LS Q
Sbjct: 137 LSTAAAIESQSAIKSGSKVPLSPQ 160
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 49/140 (35%), Positives = 74/140 (52%), Gaps = 4/140 (2%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+D++ Y R Y +E ERRF IF+ N++ I+ +N FAD T+ EF
Sbjct: 37 EDWMGLYGRTYKDIAEKERRFKIFKENVEYIESVNSAGNRRYKLSINEFADQTNEEFK-- 94
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+S + + + + E SF N + S++++ KG V P ++DQ CG CWA SAV
Sbjct: 95 -ASRNGYNMSSRPRSSEITSFRYENVAAVPSSMDWRKKGAVTP-IKDQGQCGCCWAFSAV 152
Query: 200 ACLESAYAIKHNELIELSKQ 219
A +E +K ELI LS+Q
Sbjct: 153 AAMEGVTQLKTGELISLSEQ 172
>gi|309752918|gb|ADO85436.1| cathepsin [Pieris rapae granulovirus]
Length = 339
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 89/164 (54%), Gaps = 13/164 (7%)
Query: 63 EASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTAT 122
E T++LE + N F+DF+++Y + Y +D E +++ F+NNLK I+ + A
Sbjct: 23 ETVTYNLE---NSDNIFEDFIKKYNKSYATDQERAIKYENFKNNLKMINDKNNGSK-DAV 78
Query: 123 YGVNRFADMTDSE-------FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYK 175
+ +N F+D+ ++ F GL + ++ S L ES +++
Sbjct: 79 FDINAFSDLNKNDLLRRTTGFRMGLKKNSY-YTPDVSKECNVQVIKSEPQIILPESFDWR 137
Query: 176 DKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
DK V P V++Q CGSCWA SA+A +ES Y IKHN+ ++LS+Q
Sbjct: 138 DKHGVTP-VKNQLECGSCWAFSAIANIESLYNIKHNKELDLSEQ 180
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 83.6 bits (205), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 71/127 (55%), Gaps = 6/127 (4%)
Query: 95 EIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLK 152
E ERRF +F +NLK +D + E G G+NRFAD+T+ EF + L +
Sbjct: 85 EYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFR--AAYLGTTPAGRGR 142
Query: 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE 212
E Y + + L +S++++DKG V+ V++Q CGSCWA SAVA +E I E
Sbjct: 143 HVGEAYRHDGVEA--LPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 200
Query: 213 LIELSKQ 219
L+ LS+Q
Sbjct: 201 LVSLSEQ 207
>gi|91992516|gb|ABE72974.1| cathepsin L [Ochlerotatus atropalpus]
Length = 313
Score = 83.2 bits (204), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 85/151 (56%), Gaps = 9/151 (5%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
EE LD N+F F ++ + Y +D E +RR DIFR NL+ I + + +G T VN A
Sbjct: 2 EEHLD--NEFSRFKNKHGKNYHNDKEHDRRRDIFRQNLRFIHSHNRAGKGF-TVAVNHLA 58
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG-LAESINYKDKGKVLPKVQDQH 188
D TD E L +L + N+ + + +N + L ES++++ G V P V+DQ
Sbjct: 59 DRTDEE----LKALRGFKSSNVYNGGLPFPYNPKDFEDELPESLDWRIAGAVTP-VKDQS 113
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+CGSCW+ +ESAY +K+N+L+ S+Q
Sbjct: 114 VCGSCWSFGTAGHIESAYFLKYNKLMRFSQQ 144
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 83.2 bits (204), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 78/142 (54%), Gaps = 3/142 (2%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ E + Y+ E ERRF IF++NLK I+ + + G+N+F+D+T EF
Sbjct: 40 MYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQ 99
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
E+ ++L E Y + + L + ++++++G V+P+V+ Q CGSCWA +
Sbjct: 100 ASYLGGKMEK-KSLSDVAERYQYKEGDV--LPDEVDWRERGAVVPRVKRQGECGSCWAFA 156
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A +E I EL+ LS+Q
Sbjct: 157 ATGAVEGINQITTGELVSLSEQ 178
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 83.2 bits (204), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 72/127 (56%), Gaps = 6/127 (4%)
Query: 95 EIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLK 152
E ERRF +F +NLK +D + E+G G+NRFAD+T+ EF + L +
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRA--TYLGTTPAGRGR 141
Query: 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE 212
E Y + + L +S++++DKG V+ V++Q CGSCWA SAVA +E I E
Sbjct: 142 RVGEAYRHDGVEA--LPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 213 LIELSKQ 219
L+ LS+Q
Sbjct: 200 LVSLSEQ 206
>gi|403342666|gb|EJY70658.1| Cysteine protease [Oxytricha trifallax]
Length = 367
Score = 83.2 bits (204), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 77/144 (53%), Gaps = 4/144 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F +FV ++R + + E + R IFR+ + + + E + +N+F+DM+ EF+
Sbjct: 57 FNNFVSRHQRSFLTQEEYKARLAIFRDTFEAVQLHNSLESKSYKLAINKFSDMSKDEFSK 116
Query: 139 GLSSLDW---EQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
SSL + E + ++ + G +S++++DKG V P + LCG+C+A
Sbjct: 117 -FSSLQLPAEDDEEEESNQYQEDDDDDDLLLGAPQSLDWRDKGAVNPVFEQTKLCGACYA 175
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E+AY IK +L+ELSKQ
Sbjct: 176 MSTTGAVEAAYKIKTGKLVELSKQ 199
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 78/142 (54%), Gaps = 3/142 (2%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ E + Y+ E ERRF IF++NLK I+ + + G+N+F+D+T EF
Sbjct: 40 MYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSDLTADEFQ 99
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
E+ ++L E Y + + L + ++++++G V+P+V+ Q CGSCWA +
Sbjct: 100 ASYLGGKMEK-KSLSDVAERYQYKEGDV--LPDEVDWRERGAVVPRVKRQGECGSCWAFA 156
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A +E I EL+ LS+Q
Sbjct: 157 ATGAVEGINQITTGELVSLSEQ 178
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 83/151 (54%), Gaps = 4/151 (2%)
Query: 69 LEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRF 128
LE +L N+ + ++ ++ + Y +E E+RF IF+NN++ I+ + +N F
Sbjct: 29 LEPYLS--NKHEKWMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFNAVGNKPFNLSINHF 86
Query: 129 ADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
AD+T+ EF L+ + + + ET SF N + S++++ +G V P +++Q
Sbjct: 87 ADLTNEEFKASLNG-NKKLHDKFDILNETTSFRYHNVTSVPASMDWRKRGAVTP-IKNQG 144
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S VA +E + I EL+ LS+Q
Sbjct: 145 SCGSCWAFSTVASIEGIHQITTGELVSLSEQ 175
>gi|9627870|ref|NP_054157.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|114680178|ref|YP_758591.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
gi|115751|sp|P25783.1|CATV_NPVAC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|332491|gb|AAA46752.1| viral cathepsin [Autographa californica nucleopolyhedrovirus]
gi|559196|gb|AAA66757.1| viral cathepsin-like protein [Autographa californica
nucleopolyhedrovirus]
gi|113015253|gb|ABE68510.1| viral cathepsin [Plutella xylostella multiple nucleopolyhedrovirus]
Length = 323
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 88/162 (54%), Gaps = 11/162 (6%)
Query: 59 YGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQ 118
YG ++ +DL L N F++FV + + Y S+ E RRF IF++NL I K++
Sbjct: 11 YGVVNSAAYDL---LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI--INKNQN 65
Query: 119 GTATYGVNRFADMTDSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDK 177
+A Y +N+F+D++ E + L Q +N + G E +++
Sbjct: 66 DSAKYEINKFSDLSKDETIAKYTGLSLPIQTQNFCKVI---VLDQPPGKGPLE-FDWRRL 121
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
KV V++Q +CG+CWA + +A LES +AIKHN+LI LS+Q
Sbjct: 122 NKV-TSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQ 162
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/148 (33%), Positives = 89/148 (60%), Gaps = 8/148 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTID---YYTKHEQGTATYGVNRFADMTDSE 135
+++++ ++++ Y S E +RF+IF++NL+ ID +Y K T G+N+FAD+T E
Sbjct: 34 YEEWLVKHQKLYSSLGEKIKRFEIFKDNLRYIDQQNHYNKVNHMNFTLGLNQFADLTLDE 93
Query: 136 FN--HGLSSLDWEQIENLKSTFETY--SFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
F+ + +S+D+EQI + + + L +S+++++KG V P +++Q CG
Sbjct: 94 FSSIYLGTSVDYEQIISSNPNHDDVEEDILKEDVVELPDSVDWREKGVVFP-IRNQGKCG 152
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
SCW SAVA +E+ IK +I LS+Q
Sbjct: 153 SCWTFSAVASIETLNGIKKGHMIALSEQ 180
>gi|339244639|ref|XP_003378245.1| cathepsin F [Trichinella spiralis]
gi|316972864|gb|EFV56510.1| cathepsin F [Trichinella spiralis]
Length = 366
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 84/158 (53%), Gaps = 7/158 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK F+ E+ + Y+++ +++IF++N+ + EQGTA YG FADMT EF
Sbjct: 66 FKQFMVEFNKWYETEKLTAEKYNIFKSNMVIAKRLQEEEQGTAIYGPTIFADMTPEEFRK 125
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ L++ N+K + SN ++E ++++ K + V+DQ CGSCWA
Sbjct: 126 --THLNFNP-NNVKKPKRMANIPKSN---ISERMDWR-KFNAVTSVKDQGNCGSCWAFCT 178
Query: 199 VACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLP 236
VA +E A+A+K +LI LS+Q R G LP
Sbjct: 179 VANIEGAWAVKTAQLISLSEQQLVDCDRLDDGCEGGLP 216
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 72/127 (56%), Gaps = 6/127 (4%)
Query: 95 EIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLK 152
E ERRF +F +NLK +D + E+G G+NRFAD+T+ EF + L +
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRA--TYLGTTPAGRGR 141
Query: 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE 212
E Y + + L +S++++DKG V+ V++Q CGSCWA SAVA +E I E
Sbjct: 142 RVGEAYRHDGVEA--LPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 213 LIELSKQ 219
L+ LS+Q
Sbjct: 200 LVSLSEQ 206
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 57/179 (31%), Positives = 91/179 (50%), Gaps = 18/179 (10%)
Query: 85 EYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLD 144
++ + Y++ E+RF+IF++NL+ ID + K + G+N+FAD+++ E+ L
Sbjct: 13 KHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMF--LG 70
Query: 145 WEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLES 204
+ + K FE+ F L +S+++++KG V P V+DQ CGSCWA S VA +E
Sbjct: 71 GRMVRDRKG-FESDRFKYGVGDELPQSVDWREKGAVAP-VKDQGQCGSCWAFSTVAAVEG 128
Query: 205 AYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE 259
I +LI LS+Q K + GG M+ Y+ V N G D E
Sbjct: 129 INQIATGDLISLSEQELVDCDKGFNQGCNGGFMD----------YAFEFIVKNGGIDTE 177
>gi|195497262|ref|XP_002096026.1| GE25302 [Drosophila yakuba]
gi|194182127|gb|EDW95738.1| GE25302 [Drosophila yakuba]
Length = 615
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R+Y S +E + R IFR NLKTI+ +E G+A YG+ FADMT SE+
Sbjct: 309 FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGITEFADMTSSEYKE 368
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K+T + + + L + +++ K V +V++Q CGSCWA
Sbjct: 369 RTGL----WQRNE-AKATGGSVAVVPAYHGELPKEFDWRQKNAV-TQVKNQGSCGSCWAF 422
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E +A+K +L E S+Q
Sbjct: 423 SVTGNIEGLHAVKTGDLKEFSEQ 445
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 94/200 (47%), Gaps = 18/200 (9%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ E + Y+ E ERRF IF++NLK +D + T G+ RFAD+T+ EF
Sbjct: 43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ +++E K + +T + L + ++++ G V+ V+DQ CGSCWA S
Sbjct: 103 ---AIYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVV-SVKDQGNCGSCWAFS 158
Query: 198 AVACLESAYAIKHNELIELSKQPPKTHGRFY-----KGGVMNLPHMLCSKG--------- 243
AV +E I ELI LS+Q R + GG+MN K
Sbjct: 159 AVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDY 218
Query: 244 PYSLNHAVLNVGYDNESTRT 263
PY+ N L N +TR
Sbjct: 219 PYNANDLGLCNADKNNNTRV 238
>gi|86355549|ref|YP_473217.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
gi|86198154|dbj|BAE72318.1| Cathepsin [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 86/150 (57%), Gaps = 7/150 (4%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
+ L + F+DF+ ++ + Y S+SE RRF IF++NL+ I +++ TA Y +N+F+D
Sbjct: 20 DLLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDT-TAQYEINKFSD 78
Query: 131 MTDSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
++ E + L Q +N E N G E +++ KV V++Q +
Sbjct: 79 LSKDETISKYTGLALPLQTQNF---CEVVVLNRPPDKGPLE-FDWRRLNKV-TSVKNQGI 133
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CG+CWA + +A LES +AIKHN+LI LS+Q
Sbjct: 134 CGACWAFATLASLESQFAIKHNQLINLSEQ 163
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 85/166 (51%), Gaps = 15/166 (9%)
Query: 54 SQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY 113
SQ + EAS ++ E D++ +Y R+Y E +R+ IF++N+ I+ +
Sbjct: 23 SQATARSLHEASMYERHE---------DWMVQYGREYKDADEKSKRYKIFKDNVARIESF 73
Query: 114 TKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESIN 173
K + +N FAD+T+ EF + ++ ST E SF N + +++
Sbjct: 74 NKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICST-EATSFKYENVTAVPSTVD 128
Query: 174 YKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++ KG V P ++DQ CGSCWA SAVA +E + +LI LS+Q
Sbjct: 129 WRKKGAVTP-IKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQ 173
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 75/140 (53%), Gaps = 4/140 (2%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ Y + Y E E+RF IF+ N+ I+ + G+N+FAD+T+ EF
Sbjct: 40 EEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLGINQFADLTNEEF--- 96
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
++ + + S T +F N L +++++ KG V P ++DQ CG CWA SAV
Sbjct: 97 IAPRNRFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTP-IKDQGQCGCCWAFSAV 155
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E +A+ +LI LS+Q
Sbjct: 156 AATEGIHALNSGKLISLSEQ 175
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 77/140 (55%), Gaps = 6/140 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+D++ +Y R+Y E +R+ IF++N+ I+ + K + +N FAD+T+ EF
Sbjct: 40 EDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRAS 99
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ ++ ST E SF N + +++++ KG V P ++DQ CGSCWA SAV
Sbjct: 100 RNRFK----AHICST-EATSFKYENVTAVPSTVDWRKKGAVTP-IKDQGQCGSCWAFSAV 153
Query: 200 ACLESAYAIKHNELIELSKQ 219
A +E + +LI LS+Q
Sbjct: 154 AAMEGITQLSTGKLISLSEQ 173
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 83.2 bits (204), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 55/189 (29%), Positives = 103/189 (54%), Gaps = 21/189 (11%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
+ +++ E R Y++ E ERRF++FR+NL+ +D + + G ++ G+NRFAD+T+
Sbjct: 41 MYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHSFRLGLNRFADLTNE 100
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
E+ + + + + + + ++++ L ES+++++KG V KV+DQ CGSCW
Sbjct: 101 EYRDTYLGVRTKPVRERRLSGR---YQAADNEELPESVDWREKGAV-AKVKDQGGCGSCW 156
Query: 195 AHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHA 250
A SA+A +E I ++I LS+Q ++ + GG+M+ Y+
Sbjct: 157 AFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLMD----------YAFEFI 206
Query: 251 VLNVGYDNE 259
+ N G D+E
Sbjct: 207 INNGGIDSE 215
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 78/161 (48%), Gaps = 9/161 (5%)
Query: 62 EEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA 121
+EA T D H + ++ E+ R Y ++ E RR ++FR N K ID + E T
Sbjct: 31 DEAITVDAAMVSRH----EKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTH 86
Query: 122 TYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE---SINYKDKG 178
NRFAD+TD EF + L + F N + LA+ S++++ G
Sbjct: 87 RLATNRFADLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYEN-FSLADAAGSMDWRAMG 145
Query: 179 KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V V+DQ CG CWA SAVA +E I+ L+ LS+Q
Sbjct: 146 AV-TGVKDQGSCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQ 185
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 85/166 (51%), Gaps = 15/166 (9%)
Query: 54 SQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY 113
SQ + EAS ++ E D++ +Y R+Y E +R+ IF++N+ I+ +
Sbjct: 23 SQATARSLHEASMYERHE---------DWMVQYGREYKDADEKSKRYKIFKDNVARIESF 73
Query: 114 TKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESIN 173
K + +N FAD+T+ EF + ++ ST E SF N + +++
Sbjct: 74 NKAMDKSYKLSINEFADLTNEEFRASRNRFK----AHICST-EATSFKYENVTAVPSTVD 128
Query: 174 YKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++ KG V P ++DQ CGSCWA SAVA +E + +LI LS+Q
Sbjct: 129 WRKKGAVTP-IKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQ 173
>gi|431910221|gb|ELK13294.1| Cathepsin F [Pteropus alecto]
Length = 458
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 83/159 (52%), Gaps = 8/159 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK+FV Y R Y++ E + R +F NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 161 FKEFVITYNRTYETKEEAQWRMSVFINNMMRAQKIQALDRGTARYGVTKFSDLTEEEFRT 220
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L+ L ++ L+S + + S ++++KG V KV+DQ +CGSCWA S
Sbjct: 221 IYLNPL----LKELRSKRMPLAMSVSGP--APPEWDWRNKGAVT-KVKDQGMCGSCWAFS 273
Query: 198 AVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLP 236
+E + +K +L+ LS+Q + K + LP
Sbjct: 274 VTGNVEGQWFLKRGDLLSLSEQELVDCDKLDKACLGGLP 312
>gi|13124026|sp|Q9WGE0.1|CATV_NPVHC RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|4884631|gb|AAD31760.1|AF120926_1 cysteine proteinase [Hyphantria cunea nucleopolyhedrovirus]
Length = 324
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 86/150 (57%), Gaps = 7/150 (4%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
+ L + F+DF+ ++ + Y S+SE RRF IF++NL+ I +++ TA Y +N+F+D
Sbjct: 20 DLLKAPSYFEDFLHKFNKHYSSESEKLRRFQIFQHNLEEIIIKNQNDT-TAQYEINKFSD 78
Query: 131 MTDSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
++ E + L Q +N E N G E +++ KV V++Q +
Sbjct: 79 LSKDETISKYTGLALPLQTQNF---CEVVVLNRPPDKGPLE-FDWRRLNKV-TSVKNQGI 133
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CG+CWA + +A LES +AIKHN+LI LS+Q
Sbjct: 134 CGACWAFATLASLESQFAIKHNQLINLSEQ 163
>gi|288804650|ref|YP_003429335.1| cathepsin [Pieris rapae granulovirus]
gi|270161225|gb|ACZ63497.1| cathepsin [Pieris rapae granulovirus]
Length = 339
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/164 (32%), Positives = 89/164 (54%), Gaps = 13/164 (7%)
Query: 63 EASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTAT 122
E T++LE + N F+DF+++Y + Y +D E +++ F+NNLK I+ + A
Sbjct: 23 ETVTYNLE---NSDNIFEDFIKKYNKSYATDQERAIKYENFKNNLKMINDKNNGSK-YAV 78
Query: 123 YGVNRFADMTDSE-------FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYK 175
+ +N F+D+ ++ F GL + ++ S L ES +++
Sbjct: 79 FDINAFSDLNKNDLLRRTTGFRMGLKKNSY-YTPDVSKECNVQVIKSEPQIILPESFDWR 137
Query: 176 DKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
DK V P V++Q CGSCWA SA+A +ES Y IKHN+ ++LS+Q
Sbjct: 138 DKHGVTP-VKNQLECGSCWAFSAIANIESLYNIKHNKELDLSEQ 180
>gi|346469497|gb|AEO34593.1| hypothetical protein [Amblyomma maculatum]
Length = 557
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 60/172 (34%), Positives = 78/172 (45%), Gaps = 11/172 (6%)
Query: 53 RSQPNSYGSEEASTFDLEEFLDHGN-----QFKDFVREYERQYDSDSEIERRFDIFRNNL 107
RS P A + EFLD N F+DF + R YD E +RR DIFR NL
Sbjct: 226 RSFPGPGAERLALHNPMAEFLDGHNGHTEQSFEDFKETHRRTYDHSVEHDRRRDIFRQNL 285
Query: 108 KTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG 167
+ ID + G VN AD T E + L + F Y F +
Sbjct: 286 RFIDSTNRANLGYQV-AVNHLADRTPEEISVMRGRLQSRDGSSTAEPFPRYKFTAK---- 340
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L E I+++ G V P V+DQ +CGSCW+ LE AY K +L+ LS+Q
Sbjct: 341 LPEQIDWRLYGAVTP-VKDQAVCGSCWSFGTTGELEGAYFRKTGKLVRLSEQ 391
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 77/140 (55%), Gaps = 6/140 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+D++ +Y R+Y E +R+ IF++N+ I+ + K + +N FAD+T+ EF
Sbjct: 40 EDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRAS 99
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ ++ ST E SF N + +++++ KG V P ++DQ CGSCWA SAV
Sbjct: 100 RNRFK----AHICST-EATSFKYENVTAVPSTVDWRKKGAVTP-IKDQGQCGSCWAFSAV 153
Query: 200 ACLESAYAIKHNELIELSKQ 219
A +E + +LI LS+Q
Sbjct: 154 AAMEGITQLSTGKLISLSEQ 173
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 77/140 (55%), Gaps = 6/140 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+D++ +Y R+Y E +R+ IF++N+ I+ + K + +N FAD+T+ EF
Sbjct: 40 EDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRAS 99
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ ++ ST E SF N + +++++ KG V P ++DQ CGSCWA SAV
Sbjct: 100 RNRFK----AHICST-EATSFKYENVTAVPSTVDWRKKGAVTP-IKDQGQCGSCWAFSAV 153
Query: 200 ACLESAYAIKHNELIELSKQ 219
A +E + +LI LS+Q
Sbjct: 154 AAMEGITQLSTGKLISLSEQ 173
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 54/171 (31%), Positives = 85/171 (49%), Gaps = 15/171 (8%)
Query: 49 LILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLK 108
L SQ + EAS ++ E D++ +Y R Y E +R+ IF++N+
Sbjct: 18 LAAWASQATARNLHEASMYERHE---------DWMAQYGRVYKDADEKSKRYKIFKDNVA 68
Query: 109 TIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGL 168
I+ + K + +N FAD+T+ EF + ++ ST E SF N +
Sbjct: 69 RIESFNKAMDKSYKLSINEFADLTNEEFGTSRNRFK----AHICST-EATSFKYENVTAV 123
Query: 169 AESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+I+++ KG V P ++DQ CGSCWA SAVA +E + +LI LS+Q
Sbjct: 124 PSTIDWRKKGAVTP-IKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQ 173
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 75/140 (53%), Gaps = 4/140 (2%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ Y + Y E E+RF IF+ N+ I+ + G+N+FAD+T+ EF
Sbjct: 40 EEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLGINQFADLTNEEF--- 96
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
++ + + S T +F N L +++++ KG V P ++DQ CG CWA SAV
Sbjct: 97 IAPRNKFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTP-IKDQGQCGCCWAFSAV 155
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E +A+ +LI LS+Q
Sbjct: 156 AATEGIHALNSGKLISLSEQ 175
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 53/166 (31%), Positives = 84/166 (50%), Gaps = 15/166 (9%)
Query: 54 SQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY 113
SQ + EAS ++ E D++ +Y R Y E +R+ IF++N+ I+ +
Sbjct: 23 SQATARNLHEASMYERHE---------DWMAQYGRVYKDADEKSKRYKIFKDNVARIESF 73
Query: 114 TKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESIN 173
K + +N FAD+T+ EF + ++ ST E SF N + +I+
Sbjct: 74 NKAMDKSYKLSINEFADLTNEEFGTSRNRFK----AHICST-EATSFKYENVTAVPSTID 128
Query: 174 YKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++ KG V P ++DQ CGSCWA SAVA +E + +LI LS+Q
Sbjct: 129 WRKKGAVTP-IKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQ 173
>gi|67773378|gb|AAY81946.1| cysteine protease 8 [Paragonimus westermani]
Length = 325
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/147 (34%), Positives = 81/147 (55%), Gaps = 22/147 (14%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ F R+Y + Y ++ + ++RF IF++NL Y EQGTA YGV +F+D+T EF
Sbjct: 32 YEQFKRDYGKAYANEDD-QKRFAIFKDNLVRAQQYQMQEQGTAKYGVTQFSDLTPEEFEA 90
Query: 139 ---GL---SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
GL +D Q+ +L++ S+++++KG V P +++Q CGS
Sbjct: 91 KYLGLRIDEQVDRVQLNDLQTA--------------PASVDWREKGAVGP-IENQGSCGS 135
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA S V +E + +K L+ LSKQ
Sbjct: 136 CWAFSVVGNIEGQWFLKTGYLVSLSKQ 162
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 82.8 bits (203), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 58/194 (29%), Positives = 91/194 (46%), Gaps = 13/194 (6%)
Query: 49 LILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLK 108
L + S S + S F+ H + ++ + R Y ++E RF+IF+ NL+
Sbjct: 9 LTIFLSYRTSLATSRGSLFEASAIEKH----EQWMARFNRVYSDETEKRNRFNIFKKNLE 64
Query: 109 TIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSL----DWEQIENLKSTFETYSFNSSN 164
+ + + + T +N F+D+TD EF + L +I L S T F N
Sbjct: 65 FVQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRYGN 124
Query: 165 SYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTH 224
ES++++ +G V P V+ Q CG CWA SAVA +E I EL+ LS+Q
Sbjct: 125 VSDNGESMDWRQEGAVTP-VKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDC 183
Query: 225 GRFY----KGGVMN 234
R Y +GG+M+
Sbjct: 184 DRDYNQGCRGGIMS 197
>gi|268534724|ref|XP_002632495.1| Hypothetical protein CBG13738 [Caenorhabditis briggsae]
Length = 341
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 95/189 (50%), Gaps = 30/189 (15%)
Query: 39 HPRWTGRVHNL--ILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEI 96
P T R+ NL ILQR Q T D++ + F++F+ +Y R+Y S+ EI
Sbjct: 20 EPLPTFRIENLELILQRHQ--------IPTPDVK----YTEAFQNFLVKYLREYKSEEEI 67
Query: 97 ERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFE 156
+RF IF N+ ++ Y K G TY +N F+D++D E+ L +K +
Sbjct: 68 VKRFTIFSRNMDLVERYNKEGAGKVTYELNDFSDLSDEEWKQFL----------MKPRPQ 117
Query: 157 TYSFNSSNSYGL----AESINYKDKGKV--LPKVQDQHLCGSCWAHSAVACLESAYAIKH 210
+S +S+ + L ES++++++ + ++ Q CGSCWA + A +ESA +I
Sbjct: 118 KFSKSSNFKFPLKKEIPESVDWRNRNGQSHVTGIKYQGPCGSCWAFATAAAIESAVSISG 177
Query: 211 NELIELSKQ 219
L LS Q
Sbjct: 178 GGLTSLSSQ 186
>gi|42572491|ref|NP_974341.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332642714|gb|AEE76235.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 290
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/199 (30%), Positives = 94/199 (47%), Gaps = 18/199 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ E + Y+ E ERRF IF++NLK +D + T G+ RFAD+T+ EF
Sbjct: 44 YEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR- 102
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ +++E K + +T + L + ++++ G V+ V+DQ CGSCWA SA
Sbjct: 103 --AIYLRKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVS-VKDQGNCGSCWAFSA 159
Query: 199 VACLESAYAIKHNELIELSKQPPKTHGRFY-----KGGVMNLPHMLCSKG---------P 244
V +E I ELI LS+Q R + GG+MN K P
Sbjct: 160 VGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYP 219
Query: 245 YSLNHAVLNVGYDNESTRT 263
Y+ N L N +TR
Sbjct: 220 YNANDLGLCNADKNNNTRV 238
>gi|5231178|gb|AAD41105.1|AF157961_1 cysteine proteinase [Hypera postica]
Length = 324
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 84/147 (57%), Gaps = 10/147 (6%)
Query: 76 GNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQGTATY--GVNRFADMT 132
G +F+ F E+ + Y + +E +RF+IF +N++ I+ + +EQG +Y G+N+F DM+
Sbjct: 23 GAKFQAFKLEHGKTYLNQAEESKRFNIFTDNVRAIEAHNALYEQGKVSYKKGINKFTDMS 82
Query: 133 DSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
EF L+ + K T ET S+ + + S++++ +G+V V+DQ CGS
Sbjct: 83 QEEFKTMLT-----LSASRKPTLETTSYVKTG-VEIPSSVDWRKEGRV-TGVKDQGDCGS 135
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA S E AYA K +L+ LS+Q
Sbjct: 136 CWAFSITGSTEGAYARKSGKLVSLSEQ 162
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 82/141 (58%), Gaps = 5/141 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ EY + Y++ E ERRF+IF++NL+ +D + + G+N+F+D+TD+E+
Sbjct: 48 FESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTDAEY-- 105
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
SS+ N++ T + + L +S++++ KG VL V++Q CGSCW ++
Sbjct: 106 --SSIYLGTKFNIRMTNVSDRYEPRVGDQLPDSVDWRKKGAVL-GVKNQGNCGSCWTFAS 162
Query: 199 VACLESAYAIKHNELIELSKQ 219
+A +E I LI LS+Q
Sbjct: 163 IAAVEGINKIVTGNLISLSEQ 183
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 70/127 (55%), Gaps = 6/127 (4%)
Query: 95 EIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLK 152
E ERRF +F +NLK +D + E G G+NRFAD+T+ EF + L +
Sbjct: 85 EYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFR--AAYLGTTPAGRGR 142
Query: 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE 212
E Y + L +S++++DKG V+ V++Q CGSCWA SAVA +E I E
Sbjct: 143 HVGEAYRHDGVEV--LPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 200
Query: 213 LIELSKQ 219
L+ LS+Q
Sbjct: 201 LVSLSEQ 207
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 78/161 (48%), Gaps = 9/161 (5%)
Query: 62 EEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA 121
+EA T D H + ++ E+ R Y ++ E RR ++FR N K ID + E T
Sbjct: 31 DEAITVDSAMVSRH----EKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTH 86
Query: 122 TYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE---SINYKDKG 178
NRFAD+TD EF + L + F N + LA+ S++++ G
Sbjct: 87 RLATNRFADLTDEEFRAARTGLRRPPAAAAGAGSGAGGFRYEN-FSLADAAGSMDWRAMG 145
Query: 179 KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V V+DQ CG CWA SAVA +E I+ L+ LS+Q
Sbjct: 146 AV-TGVKDQGSCGCCWAFSAVAAVEGLTKIRTGRLVSLSEQ 185
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 82.4 bits (202), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 94/200 (47%), Gaps = 18/200 (9%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ E + Y+ E ERRF IF++NLK +D + T G+ RFAD+T+ EF
Sbjct: 43 MYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFR 102
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ +++E K + +T + L + ++++ G V+ V+DQ CGSCWA S
Sbjct: 103 ---AIYLRKKMERNKDSVKTERYLYKEGDVLPDEVDWRANGAVV-SVKDQGNCGSCWAFS 158
Query: 198 AVACLESAYAIKHNELIELSKQPPKTHGRFY-----KGGVMNLPHMLCSKG--------- 243
AV +E I ELI LS+Q R + GG+MN K
Sbjct: 159 AVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDY 218
Query: 244 PYSLNHAVLNVGYDNESTRT 263
PY+ N L N +TR
Sbjct: 219 PYNANDLGLCNADKNNNTRV 238
>gi|9634237|ref|NP_037776.1| ORF16 cathepsin [Spodoptera exigua MNPV]
gi|37077857|sp|Q9J8B9.1|CATV_NPVSE RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|6960476|gb|AAF33546.1|AF169823_16 ORF16 cathepsin [Spodoptera exigua MNPV]
Length = 337
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 84/145 (57%), Gaps = 12/145 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
F+ F+ +Y +QY S+ E + R++IFR+N+++I+ +A Y +NRFADM +E
Sbjct: 40 FEKFITQYNKQYKSEDEKKYRYNIFRHNIESINQ-KNSRNDSAVYKINRFADMPKNEIVI 98
Query: 137 -NHGLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ GL+S + L F ET + S +++ K+ V+DQ +CG+CW
Sbjct: 99 RHTGLASGE------LGLNFCETIVVDGPAQRQRPVSFDWRSMNKI-TSVKDQGMCGACW 151
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
+++ LES YAIK++ LI+LS+Q
Sbjct: 152 RFASLGALESQYAIKYDRLIDLSEQ 176
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 53/185 (28%), Positives = 92/185 (49%), Gaps = 15/185 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ +Y + Y++ E ERRF+IF++NLK +D + + G+N+FAD+++ E+
Sbjct: 49 YEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRA 108
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ L ++ + + L ES+++++KG V P V+DQ CGSCWA S
Sbjct: 109 AYLGTRMDGKRRLLGGPKSARYLFKDGDDLPESVDWREKGAVAP-VKDQGQCGSCWAFST 167
Query: 199 VACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVLNV 254
V +E I L LS+Q K + + GG+M+ Y+ + N
Sbjct: 168 VGAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMD----------YAFEFIMKNG 217
Query: 255 GYDNE 259
G D E
Sbjct: 218 GIDTE 222
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 83/142 (58%), Gaps = 4/142 (2%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+++++ ++ + Y++ E E+RF+IF++NL ID + E T T G+NRFAD+T+ EF
Sbjct: 41 MYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS-ENRTYTVGLNRFADLTNEEFR 99
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ L T + Y+ +S L +S++++ +G V +V+DQ CGSCWA S
Sbjct: 100 SMYLGTRTGHKKRLPKTSDRYAPRVGDS--LPDSVDWRKEGAV-AEVKDQGGCGSCWAFS 156
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+A +E I +LI LS+Q
Sbjct: 157 TIAAVEGINKIVTGDLIALSEQ 178
>gi|226821417|gb|ACO82384.1| cathepsin H-like protein [Lutjanus argentimaculatus]
Length = 255
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 54/143 (37%), Positives = 78/143 (54%), Gaps = 12/143 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG--TATYGVNRFADMTDSEF 136
FK ++ +Y + Y+ E R IF N + ID KH +G + T G+N+F+DMT EF
Sbjct: 27 FKSWMAQYNKAYNM-REYYERLQIFTENKRRID---KHNEGNHSFTMGLNQFSDMTFGEF 82
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
S W + +N +T Y SSN L +SI+++ KG + V++Q CGSCW
Sbjct: 83 R---KSFLWSEPQNCSATKGNYF--SSNG-ALPDSIDWRKKGNYVTPVKNQGGCGSCWTF 136
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S CLES AI +L+ LS+Q
Sbjct: 137 STTGCLESVIAINKGKLVPLSEQ 159
>gi|38048171|gb|AAR09988.1| similar to Drosophila melanogaster CG12163, partial [Drosophila
yakuba]
Length = 213
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R+Y S +E + R IFR NLKTI+ +E G+A YG+ FADMT SE+
Sbjct: 37 FHKFQVRFGRRYVSTAERQMRLRIFRQNLKTIEQLNVNEMGSAKYGITEFADMTSSEYKE 96
Query: 139 --GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
GL W++ E K+T + + + L + +++ K V +V++Q CGSCWA
Sbjct: 97 RTGL----WQRNE-AKATGGSVAVVPAYHGELPKEFDWRQKNAV-TQVKNQGSCGSCWAF 150
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E +A+K +L E S+Q
Sbjct: 151 SVTGNIEGLHAVKTGDLKEFSEQ 173
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 83/142 (58%), Gaps = 4/142 (2%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+++++ ++ + Y++ E E+RF+IF++NL ID + E T T G+NRFAD+T+ EF
Sbjct: 50 MYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNS-ENRTYTVGLNRFADLTNEEFR 108
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ L T + Y+ +S L +S++++ +G V +V+DQ CGSCWA S
Sbjct: 109 SMYLGTRTGHKKRLPKTSDRYAPRVGDS--LPDSVDWRKEGAVA-EVKDQGGCGSCWAFS 165
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+A +E I +LI LS+Q
Sbjct: 166 TIAAVEGINKIVTGDLIALSEQ 187
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 77/141 (54%), Gaps = 5/141 (3%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTDSEFNH 138
+ ++ +Y + Y E E+RF IF N+ I+ + K + T GVN+FAD+T+ EF
Sbjct: 39 RQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQFADLTNDEFT- 97
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
SS + + S T +F N+ + S++++ KG V P V++Q CG CWA SA
Sbjct: 98 --SSRNKFKGHMCSSITRTSTFKYENASAIPSSVDWRKKGAVTP-VKNQGQCGCCWAFSA 154
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA E + + +LI LS+Q
Sbjct: 155 VAATEGIHKLSTGKLISLSEQ 175
>gi|121531590|gb|ABM55480.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 321
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 57/146 (39%), Positives = 81/146 (55%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y S E RF IF+NNL+TI+ + K+++G TY GVN+FADMT
Sbjct: 21 DQWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEEHNAKYDKGEETYYMGVNQFADMTA 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF H L + + NL +T +S N ESI++ KG L V++Q CGSC
Sbjct: 81 EEFRHMLGLQNGAR-PNLNATLHVFSENLQ----APESIDWTQKGADLG-VKNQGKCGSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA S+ LE AI H LS++
Sbjct: 135 WAFSSTGSLEGQNAIHHKVKTPLSER 160
>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 361
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 81/150 (54%), Gaps = 9/150 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTDSEF 136
+FK + EY R Y + E ++RF ++ NL+ I + G++ G N+F D+T+ EF
Sbjct: 39 RFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEEF 98
Query: 137 NHG-LSSLD-----WEQIENLKSTFETYSFNSSNSYGLAE-SINYKDKGKVLPKVQDQHL 189
L LD E + + T T ++ ++ G A S++++ KG V P V++Q
Sbjct: 99 KDTYLMKLDEQPPAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTP-VKNQQQ 157
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA + VA +E + IK L+ LS+Q
Sbjct: 158 CGSCWAFATVASIEGVHQIKTGRLVSLSEQ 187
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 74/141 (52%), Gaps = 12/141 (8%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN---H 138
++ EY + Y E E+RF IF++N++ I+ + + VN AD+T EF +
Sbjct: 43 WMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLDEFKASRN 102
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
G +D E F T SF N + E+++++ KG V P ++DQ CGSCWA S
Sbjct: 103 GYKKIDRE--------FATTSFKYENVTAIPEAVDWRVKGAVTP-IKDQGQCGSCWAFST 153
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 154 VAAIEGINQITTGKLISLSEQ 174
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 74/141 (52%), Gaps = 12/141 (8%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN---H 138
++ EY + Y E E+RF IF++N++ I+ + + VN AD+T EF +
Sbjct: 43 WMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLSVNHLADLTLDEFKASRN 102
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
G +D E F T SF N + E+++++ KG V P ++DQ CGSCWA S
Sbjct: 103 GYKKIDRE--------FATTSFKYENVTAIPEAVDWRVKGAVTP-IKDQGQCGSCWAFST 153
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 154 VAAIEGINQITTGKLISLSEQ 174
>gi|348668979|gb|EGZ08802.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 376
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 83/148 (56%), Gaps = 12/148 (8%)
Query: 79 FKDFVREYERQYDSDSE----IERRFDIFRNNLKTIDYY-TKHEQG--TATYGVNRFADM 131
F D+ +YE+ Y SD+ ++ RF F NL+ I+ + E+G + T G+N AD+
Sbjct: 39 FVDYALDYEKSYRSDANDQALVQHRFRAFATNLQRIEAHNAAFERGEFSFTLGLNDLADL 98
Query: 132 TDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
+D+E+ LS + + K ET+S + + L +S +++ G V P V++Q CG
Sbjct: 99 SDAEYKQLLSY----RARDSKGASETFSVSPEDVKDLPDSWDWRQHGAVTP-VKNQGQCG 153
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
SCWA SAVA +ESAY + +L S+Q
Sbjct: 154 SCWAFSAVAAMESAYQLSTGKLESFSEQ 181
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 82.0 bits (201), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 48/140 (34%), Positives = 71/140 (50%), Gaps = 19/140 (13%)
Query: 86 YERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDW 145
Y R Y +E E+RF IF++N+ I+ + K T +N FAD+T+ EF
Sbjct: 4 YGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEF--------- 54
Query: 146 EQIENLKSTF------ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+L++ F E +F N + +I+++ KG V P ++DQ CG CWA SAV
Sbjct: 55 ---RSLRNRFKAHICSEATTFKYENVTAVPSTIDWRKKGAVTP-IKDQQQCGCCWAFSAV 110
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E I +LI LS+Q
Sbjct: 111 AATEGITQITTGKLISLSEQ 130
>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 359
Score = 82.0 bits (201), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 81/150 (54%), Gaps = 9/150 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTDSEF 136
+FK + EY R Y + E ++RF ++ NL+ I + G++ G N+F D+T+ EF
Sbjct: 39 RFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTEEEF 98
Query: 137 NHG-LSSLD-----WEQIENLKSTFETYSFNSSNSYGLAE-SINYKDKGKVLPKVQDQHL 189
L LD E + + T T ++ ++ G A S++++ KG V P V++Q
Sbjct: 99 KDTYLMKLDEQPPAAEAMPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTP-VKNQQQ 157
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA + VA +E + IK L+ LS+Q
Sbjct: 158 CGSCWAFATVASIEGVHQIKTGRLVSLSEQ 187
>gi|9631045|ref|NP_047715.1| cathepsin-like proteinase [Lymantria dispar MNPV]
gi|13124028|sp|Q9YMP9.1|CATV_NPVLD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3822313|gb|AAC70264.1| cathepsin-like proteinase [Lymantria dispar MNPV]
Length = 356
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 82/143 (57%), Gaps = 6/143 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEF 136
F+ FV Y + Y SD E +R+ IF++NL I+ + + TATY +N+F+D++ SE
Sbjct: 56 FESFVENYNKNYTSDWEKNKRYSIFKDNLHEINAKNGNATDGPTATYKINKFSDLSKSEL 115
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ L E + + +T N G +++++ KV +++Q CG+CWA
Sbjct: 116 IAKFTGLSIP--ERVSNFCKTIILNQPPDKG-PLHFDWREQNKV-TSIKNQGACGACWAF 171
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
+ +A +ES +A++HN LI+LS+Q
Sbjct: 172 ATLASVESQFAMRHNRLIDLSEQ 194
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 59/183 (32%), Positives = 93/183 (50%), Gaps = 17/183 (9%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH--- 138
++ ++ + Y+ E E+RF IF+ NLK ID + E T G+N FAD+T+ E+
Sbjct: 38 WLAKHGKAYNGIDEREKRFQIFKENLKFIDDHN-SENRTYKVGLNMFADLTNEEYRALYL 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
G S ++ K+ Y+ N+ + L ES++++ +G V P V++Q CGSCWA S
Sbjct: 97 GTRSPPARRVMKAKTASRRYAVNNLDR--LPESMDWRTRGAVAP-VKNQGSCGSCWAFST 153
Query: 199 VACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLNHAVLNVGY 256
+A +E I ELI LS+Q + + Y G C+ G Y+ + N G
Sbjct: 154 IAAVEGINQIVTGELISLSEQELVSCDKKYNSG--------CNGGLMDYAFQFIIDNGGL 205
Query: 257 DNE 259
D E
Sbjct: 206 DTE 208
>gi|2731635|gb|AAB93494.1| pre-procathepsin L [Paragonimus westermani]
Length = 325
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 81/142 (57%), Gaps = 12/142 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ F R+Y + Y ++ + ++RF IF++NL Y EQGTA YGV +F+D+T+ EF
Sbjct: 32 YEQFKRDYGKAYANEDD-QKRFAIFKDNLVRAQQYQTQEQGTAKYGVTQFSDLTNEEFAA 90
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L S E+++ ++ N + S+++++KG V P V+ Q CGSCWA S
Sbjct: 91 MYLGSRIDERVDRVQ-------LNDLQT--APASVDWREKGAVGP-VEHQGSCGSCWAFS 140
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A +E + +K L+ LSKQ
Sbjct: 141 VTANVEGQWFLKTGRLVSLSKQ 162
>gi|4760897|gb|AAD29130.1| cysteine proteinase 1 precursor [Clonorchis sinensis]
Length = 328
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 100/208 (48%), Gaps = 39/208 (18%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKT 90
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + S E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDPSPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNLPHMLC 240
S + +E + K +L+ LS+Q PPKT+G K G + L
Sbjct: 143 SVIGNVEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEIEKMGGLE----LA 198
Query: 241 SKGPYS-------LNHAVLNVGYDNEST 261
S PY+ +N + V Y NEST
Sbjct: 199 SDYPYTGVDGICYMNQSKF-VAYVNEST 225
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 74/146 (50%), Gaps = 5/146 (3%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
D+ F +F +Y + Y+ +E RF IF+ N+ I Y T T GVN F D+T
Sbjct: 22 DYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDII-YATNARNLTFALGVNEFTDLTQ 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF + L + + T+ +N + LA S+++ +G V P V++Q CGSC
Sbjct: 81 EEFAASYTGLKPASLWSGLPRLSTHEYNGAP---LASSVDWTTQGVVTP-VKNQGQCGSC 136
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
W+ S LE A+A+ L+ LS+Q
Sbjct: 137 WSFSTTGALEGAWALSTGNLVSLSEQ 162
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 81/142 (57%), Gaps = 6/142 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+D+ ++++ Y SD E RR +F + L I+ + T T G+N+F+D+T++EF
Sbjct: 2 FEDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRA 61
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ + ++ + + SS L S++++ +G V P ++DQ CGSCWA S
Sbjct: 62 NYVGKFKPPRYQDRRPAKDVDVDVSS----LPTSLDWRQEGAVTP-IKDQGQCGSCWAFS 116
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+A +ESA+ + EL+ LS+Q
Sbjct: 117 AIASIESAHFLATKELVSLSEQ 138
>gi|308454069|ref|XP_003089698.1| hypothetical protein CRE_27947 [Caenorhabditis remanei]
gi|308269277|gb|EFP13230.1| hypothetical protein CRE_27947 [Caenorhabditis remanei]
Length = 243
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 78/147 (53%), Gaps = 6/147 (4%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDS 134
+ N F+DF+ +Y R+Y ++ E+ +RF IF N+ ++ Y K + G TY +N F+D++D
Sbjct: 46 YTNAFQDFLVKYLREYKTEDELVKRFTIFSRNMDLVERYNKEDLGKVTYELNDFSDLSDE 105
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD-KGKV-LPKVQDQHLCGS 192
E+ L + S T F + ES+++++ KG + ++ Q CGS
Sbjct: 106 EWKKFLMTPK----PKSPSKSATKLFTPKEKRVIPESVDWRNVKGNNHVTGIKYQGPCGS 161
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA + A +ESA +I L LS Q
Sbjct: 162 CWAFATAAAIESAVSISGGGLQSLSSQ 188
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 79/139 (56%), Gaps = 2/139 (1%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSEFNHGL 140
++ E+ R Y +E R+ +F+ N+++I+ + + G T VN+FAD+T+ EF
Sbjct: 40 WMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMY 99
Query: 141 SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
+ + + ++ ++ + +S L S++++ KG V P ++DQ CGSCWA SAVA
Sbjct: 100 TGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTP-IKDQGSCGSCWAFSAVA 158
Query: 201 CLESAYAIKHNELIELSKQ 219
+E IK +LI LS+Q
Sbjct: 159 AIEGVAQIKKGKLISLSEQ 177
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 79/139 (56%), Gaps = 2/139 (1%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSEFNHGL 140
++ E+ R Y +E R+ +F+ N+++I+ + + G T VN+FAD+T+ EF
Sbjct: 34 WMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMY 93
Query: 141 SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
+ + + ++ ++ + +S L S++++ KG V P ++DQ CGSCWA SAVA
Sbjct: 94 TGYKGNSVLSSRTKPTSFRYQHVSSDALPISVDWRKKGAVTP-IKDQGSCGSCWAFSAVA 152
Query: 201 CLESAYAIKHNELIELSKQ 219
+E IK +LI LS+Q
Sbjct: 153 AIEGVAQIKKGKLISLSEQ 171
>gi|189239337|ref|XP_973607.2| PREDICTED: similar to cathepsin F-like cysteine protease [Tribolium
castaneum]
Length = 1726
Score = 81.6 bits (200), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 70/122 (57%), Gaps = 7/122 (5%)
Query: 99 RFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFET- 157
RF++F NL I EQGTATYG+ RFADMT EF+ L +L++ ET
Sbjct: 1442 RFNVFVQNLMQIRVLNTFEQGTATYGITRFADMTQKEFSRSLGLR-----TDLRNENETP 1496
Query: 158 YSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELS 217
++ + L + +++ K V+ +V++Q CGSCWA S +E YA++H +L+E S
Sbjct: 1497 FAQAKIPNIELPKEFDWRKKN-VVTEVKNQEQCGSCWAFSVTGNVEGQYALRHGKLLEFS 1555
Query: 218 KQ 219
+Q
Sbjct: 1556 EQ 1557
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 81.3 bits (199), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 79/141 (56%), Gaps = 3/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ E+ + Y+ E ERRF IF++NLK I+ + + G+N+F+D+T EF
Sbjct: 41 YERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSYDRGLNQFSDLTVDEFQA 100
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
E+ ++L E Y + + L + ++++++G V+P+V+ Q CGSCWA +A
Sbjct: 101 SYLGGKIEK-KSLSDVAERYQYKEGDI--LPDEVDWRERGAVVPRVKRQGDCGSCWAFAA 157
Query: 199 VACLESAYAIKHNELIELSKQ 219
+E I EL+ LS+Q
Sbjct: 158 TGAVEGINQITTGELLSLSEQ 178
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 98/186 (52%), Gaps = 19/186 (10%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ ++ + Y++ E ERRF IF++NL+ ID + E T G+NRFAD+T+ E+
Sbjct: 53 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNA-ENRTYKVGLNRFADLTNEEYRS 111
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L + + + + Y+F +S L ES++++ KG V+ +V+DQ CGSCWA S
Sbjct: 112 MYLGTRTAAKRRSSNKISDRYAFRVGDS--LPESVDWRKKGAVV-EVKDQGSCGSCWAFS 168
Query: 198 AVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVLN 253
+A +E I LI LS+Q ++ GG+M+ Y+ + N
Sbjct: 169 TIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMD----------YAFEFIINN 218
Query: 254 VGYDNE 259
G D+E
Sbjct: 219 GGIDSE 224
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 79/138 (57%), Gaps = 3/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +E E+RF +F+NN+ I+ + +N+FAD+ D EF L
Sbjct: 40 WMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLI 99
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
++ ++ ++++ ET SF + + +I+++ +G V P ++DQ CGSCWA SAVA
Sbjct: 100 NVQ-KKASWVETSTET-SFRYESVTKIPATIDWRKRGAVTP-IKDQGRCGSCWAFSAVAA 156
Query: 202 LESAYAIKHNELIELSKQ 219
E + I +L+ LS+Q
Sbjct: 157 TEGIHQITTGKLVPLSEQ 174
>gi|270011071|gb|EFA07519.1| cystatin [Tribolium castaneum]
Length = 1761
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 46/122 (37%), Positives = 70/122 (57%), Gaps = 7/122 (5%)
Query: 99 RFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFET- 157
RF++F NL I EQGTATYG+ RFADMT EF+ L +L++ ET
Sbjct: 1477 RFNVFVQNLMQIRVLNTFEQGTATYGITRFADMTQKEFSRSLGLR-----TDLRNENETP 1531
Query: 158 YSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELS 217
++ + L + +++ K V+ +V++Q CGSCWA S +E YA++H +L+E S
Sbjct: 1532 FAQAKIPNIELPKEFDWRKKN-VVTEVKNQEQCGSCWAFSVTGNVEGQYALRHGKLLEFS 1590
Query: 218 KQ 219
+Q
Sbjct: 1591 EQ 1592
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 96/188 (51%), Gaps = 16/188 (8%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ ++ + Y++ E ERRF+IF++NL+ +D T G+ +FAD+T+ E+
Sbjct: 51 MYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADLTNEEYR 110
Query: 138 HGLSSLDWEQIENLKST-FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
E+ E L++ + Y + N L +++++KG V +V+DQ CGSCWA
Sbjct: 111 AMYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVT-EVKDQGQCGSCWAF 169
Query: 197 SAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVL 252
S V +E I +LI LS+Q K + + GG+M+ Y+ +
Sbjct: 170 STVGSVEGINQIVTGDLISLSEQELVDCDKAYNQGCNGGLMD----------YAFEFIIK 219
Query: 253 NVGYDNES 260
N G D+E+
Sbjct: 220 NGGIDSEA 227
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 75/138 (54%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y E ERR +IF+NN++ I+ + K + VN FAD+T+ EF +
Sbjct: 7 WMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEEFQ---A 63
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
S + ++ S+ T F N + +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 64 SRNGYKMSAHLSSSSTKPFRYENVSAVPSTMDWRKKGAVTP-IKDQGQCGCCWAFSAVAA 122
Query: 202 LESAYAIKHNELIELSKQ 219
E + +LI LS+Q
Sbjct: 123 TEGITQLSTGKLISLSEQ 140
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 77/143 (53%), Gaps = 13/143 (9%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF----- 136
++ +Y R Y SE RRF++F+ N++ I+ + GVN+FAD+T+ EF
Sbjct: 133 WMAQYSRVYKDASEKARRFEVFKANVQFIESFNAGGNNKFWLGVNQFADLTNDEFRSTKT 192
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
N GL S N+K + + + ++ L +I+++ KG V P ++DQ CG CWA
Sbjct: 193 NKGLKS------SNMKIP-TGFRYENVSADALPTTIDWRTKGAVTP-IKDQGQCGCCWAF 244
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAVA E I +L+ L++Q
Sbjct: 245 SAVAATEGIVKISTGKLVSLAEQ 267
>gi|328711164|ref|XP_003244460.1| PREDICTED: cathepsin O-like [Acyrthosiphon pisum]
Length = 339
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 77/144 (53%), Gaps = 4/144 (2%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++F F++ Y + Y +++E +RF+ F+ +LKTI ++ G YG+ F+D++ EF
Sbjct: 34 DKFNKFIKMYNKSYMNETEHNKRFEHFKKSLKTIQLLSQKCNGCTNYGITEFSDLSTEEF 93
Query: 137 NHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
L+S+ TF S + SI+++DKG V+ V++Q CG+CWA
Sbjct: 94 TKIYLNSVTLRTPRT--GTFSMARSKRSITTATLSSIDWRDKG-VVTSVRNQKNCGACWA 150
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S V +ES YAIK L S Q
Sbjct: 151 ISVVELIESVYAIKTGLLQTFSVQ 174
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 80/150 (53%), Gaps = 9/150 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTDSEF 136
+FK + EY R Y + E ++RF I+ N++ I + G++ G N+F D+T+ EF
Sbjct: 63 RFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEEF 122
Query: 137 NHG-LSSLD-----WEQIENLKSTFETYSFNSSNSYGLAE-SINYKDKGKVLPKVQDQHL 189
L LD E + T T ++ N+ G A S++++ KG V +V+DQ
Sbjct: 123 KDTYLMKLDEQPPAAEAMPPTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVT-RVKDQQQ 181
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA + VA +E + IK L+ LS+Q
Sbjct: 182 CGSCWAFATVASIEGVHQIKTGRLVSLSEQ 211
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 58/186 (31%), Positives = 98/186 (52%), Gaps = 19/186 (10%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ ++ + Y++ E ERRF IF++NL+ ID + E T G+NRFAD+T+ E+
Sbjct: 51 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNA-ENRTYKVGLNRFADLTNEEYRS 109
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L + + + + Y+F +S L ES++++ KG V+ +V+DQ CGSCWA S
Sbjct: 110 MYLGTRTAAKRRSSNKISDRYAFRVGDS--LPESVDWRKKGAVV-EVKDQGSCGSCWAFS 166
Query: 198 AVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVLN 253
+A +E I LI LS+Q ++ GG+M+ Y+ + N
Sbjct: 167 TIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMD----------YAFEFIINN 216
Query: 254 VGYDNE 259
G D+E
Sbjct: 217 GGIDSE 222
>gi|290984408|ref|XP_002674919.1| predicted protein [Naegleria gruberi]
gi|284088512|gb|EFC42175.1| predicted protein [Naegleria gruberi]
Length = 353
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 78/145 (53%), Gaps = 4/145 (2%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
+ F DF R+++R Y E E R +FR N++T E G YG+ +F+D+T EF
Sbjct: 35 DHFLDFTRKFQRFYKGPEEYEYRLKVFRENIETSRRMNIRE-GNNNYGITKFSDLTSDEF 93
Query: 137 N--HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ + ++I+ + ++S + + ++++ G + V+DQ CGSCW
Sbjct: 94 RKFYLMEKKTPKEIQKMMRMDSNKMVSNSYAKPAPDHYDWRNHGAIT-GVKDQGQCGSCW 152
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+ +E +YAIKH +L+ S+Q
Sbjct: 153 AFSAIGSIEGSYAIKHKQLVSFSEQ 177
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 60/184 (32%), Positives = 88/184 (47%), Gaps = 14/184 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ E + Y+ E E RF+IF +NLK I+ + T G+ RFAD+T+ EF
Sbjct: 42 MYEQWLVENRKNYNGLGEKETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFR 101
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
E+ + E Y + ++ L + I+++ KG V P V+DQ CGSCWA S
Sbjct: 102 AIYLRSKMERTR-VPVKGERYLYKVGDT--LPDQIDWRAKGAVNP-VKDQGNCGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLNHAVLNVG 255
A+ +E IK ELI LS+Q Y GG C G Y+ + N G
Sbjct: 158 AIGAVEGINQIKTGELISLSEQELVDCDTSYNGG--------CGGGLMDYAFKFIIENGG 209
Query: 256 YDNE 259
D E
Sbjct: 210 IDTE 213
>gi|31096290|gb|AAP43630.1| chabaupain-2 [Plasmodium chabaudi chabaudi]
Length = 471
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 80/153 (52%), Gaps = 7/153 (4%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F +F++++ +QY+S E++ RF IF NLK ++ + K ++ G+N F+DM
Sbjct: 148 LESVNIFYNFMKKFNKQYNSAEEMQERFYIFTENLKKVEKHNKEKKYMYKKGINPFSDMR 207
Query: 133 DSEFNHGL--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD----KGKVLPKVQD 186
EF S L I +L+ YS S + +NYK + + V+D
Sbjct: 208 PEEFKMRYLNSKLSESTIIDLRHLI-PYSAAISKYKSPTDKVNYKSFDWREHNAIIAVKD 266
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q C SCWA + +E+ YAI+ N+ I LS+Q
Sbjct: 267 QKRCASCWAFATAGVIEAQYAIRQNKKISLSEQ 299
>gi|195379512|ref|XP_002048522.1| GJ11311 [Drosophila virilis]
gi|194155680|gb|EDW70864.1| GJ11311 [Drosophila virilis]
Length = 311
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 96/199 (48%), Gaps = 36/199 (18%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
++K F + R Y DSE R IF +N K ID + ++E G TY GVN F D+ S
Sbjct: 26 EWKSFKEMHGRSYAGDSEELLRRRIFEDNKKLIDTHNARYEAGKETYKMGVNEFTDLLPS 85
Query: 135 EF-NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF + + SL N + Y + S + + ESI+++ KG V P V++Q CGSC
Sbjct: 86 EFVSRMMGSL------NRTAVTADYIYEPSANLQIPESIDWRTKGAVSP-VKNQGTCGSC 138
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ--------PP-KTHGRFYKGGVMNLPHMLCSKG- 243
W +AV LE ++ ++ELS+Q PP + HG C +G
Sbjct: 139 WTFAAVGTLEGQSFLRTKRMVELSEQNLLDCSSHPPYRNHG--------------CQRGY 184
Query: 244 PY-SLNHAVLNVGYDNEST 261
PY +L + N G D S+
Sbjct: 185 PYDALRYVKDNQGLDTRSS 203
>gi|118397782|ref|XP_001031222.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89285547|gb|EAR83559.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 331
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 57/161 (35%), Positives = 82/161 (50%), Gaps = 14/161 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
+ F R Y R Y +++E + R IF N + I + + + T GVNRF+DMT EF+
Sbjct: 32 YNKFTRNYPRIYLNEAESDYRLAIFLENYQKIQDHNNNPENTYQIGVNRFSDMTQQEFSQ 91
Query: 139 GLSSLDWEQIENLKSTFETYSFN---SSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ Q N+ S + Y S N A SI+++ KG V P V++Q CGSCWA
Sbjct: 92 KIL-----QNPNILSNGKNYVQKQQASVNDVQPATSIDWRTKGVVTP-VKNQGECGSCWA 145
Query: 196 HSAVACLESAYAIKHNELIELSKQ-----PPKTHGRFYKGG 231
SA A +ES AI + L+ S+Q + +G FY G
Sbjct: 146 FSATAAMESYNAIHNKVLLRFSEQEFVDCTTEKNGGFYSFG 186
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 81.3 bits (199), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 91/164 (55%), Gaps = 10/164 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+++++ ++++ Y+ E ++RF +F++NL I + ++ T G+N+FADMT+ E+
Sbjct: 39 MYEEWLVKHQKVYNGLGEKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYR 98
Query: 138 ---HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
G S ++ KST Y++++ + L ++++ KG V P ++DQ CGSCW
Sbjct: 99 VMYFGTKSDAKRRLMKTKSTGHRYAYSAGDQ--LPVHVDWRVKGAVAP-IKDQGSCGSCW 155
Query: 195 AHSAVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMN 234
A S VA +E+ I + + LS+Q R Y GG+M+
Sbjct: 156 AFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNQGCNGGLMD 199
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 76/143 (53%), Gaps = 13/143 (9%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF----- 136
++ +Y R Y SE RRF++F+ N+K I+ + GVN+FAD+T+ EF
Sbjct: 40 WMAQYSRVYKDASEKARRFEVFKANVKFIESFNAGGNNKFWLGVNQFADLTNDEFRSIKT 99
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
N G S + + + +E S ++ L +I+++ KG V P ++DQ CG CWA
Sbjct: 100 NKGFKSSNMKIPTGFR--YENVSVDA-----LPTTIDWRTKGAVTP-IKDQGQCGCCWAF 151
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAVA E I +L+ L++Q
Sbjct: 152 SAVAATEGIVKISTGKLVSLAEQ 174
>gi|348505824|ref|XP_003440460.1| PREDICTED: pro-cathepsin H-like [Oreochromis niloticus]
Length = 324
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 55/143 (38%), Positives = 79/143 (55%), Gaps = 12/143 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG--TATYGVNRFADMTDSEF 136
FK ++ +Y ++Y+ E +R IF N K ID KH +G + T G+N F+DMT SEF
Sbjct: 26 FKSWMAQYNKEYNLK-EYYQRLQIFTENKKRID---KHNEGNHSFTMGLNEFSDMTFSEF 81
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
S + +N +T Y SSN L +SI+++ KG + V++Q CGSCW
Sbjct: 82 R---KSFLMSEPQNCSATKGNYF--SSNGL-LPDSIDWRKKGNYVTPVKNQGGCGSCWTF 135
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S CLES AI +L+ LS+Q
Sbjct: 136 STTGCLESVTAINKGKLVPLSEQ 158
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 74/138 (53%), Gaps = 6/138 (4%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y D+E E R++IF+ N+ ID + + GVN+FAD+++ EF +
Sbjct: 42 WMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKASRN 101
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + + + F N + +++++ KG V P V+DQ CG CWA SAVA
Sbjct: 102 -----RFKGHMCSPQAGPFRYENVSAVPATMDWRKKGAVTP-VKDQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 156 MEGINQLTTGKLISLSEQ 173
>gi|397133545|gb|AFO10079.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus S2]
Length = 323
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 54/162 (33%), Positives = 88/162 (54%), Gaps = 11/162 (6%)
Query: 59 YGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQ 118
Y +++ +DL L N F++FV + + Y S+ E RRF IF++NL I K +
Sbjct: 11 YAVVKSAAYDL---LKAPNYFEEFVHRFNKDYGSEVEKLRRFKIFQHNLNEI--INKDQN 65
Query: 119 GTATYGVNRFADMTDSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDK 177
+A Y +N+F+D++ E + L Q +N + + G E +++
Sbjct: 66 DSAKYEINKFSDLSKDETIAKYTGLSLPIQTQNF---CKVIVLDQPPGKGPLE-FDWRRL 121
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
KV V++Q +CG+CWA + +A LES +AIKHN+LI LS+Q
Sbjct: 122 NKV-TSVKNQGMCGACWAFATLASLESQFAIKHNQLINLSEQ 162
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 78/138 (56%), Gaps = 2/138 (1%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y + Y +E E+RF +F+NN++ I+ + +N+FAD+ D EF L+
Sbjct: 38 WMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEFKALLN 97
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
++ ++ +++ ET SF N + +++++ +G V P + CGSCWA + VA
Sbjct: 98 NVQ-KKASRVETATET-SFRYENVTKIPSTMDWRKRGAVTPIKDQGYTCGSCWAFATVAT 155
Query: 202 LESAYAIKHNELIELSKQ 219
+ES + I EL+ LS+Q
Sbjct: 156 VESLHQITTGELVSLSEQ 173
>gi|291230041|ref|XP_002734978.1| PREDICTED: cysteine proteinase inhibitor-like [Saccoglossus
kowalevskii]
Length = 352
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 75/142 (52%), Gaps = 7/142 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+DF++ Y+++YD++ E + R+ IF++NL + + EQ T YGV +F D+++ EF
Sbjct: 54 FQDFMKTYDKKYDTEEEHQLRYQIFQDNLLKAERLQQTEQATGQYGVTKFMDLSEEEFRK 113
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGK-VLPKVQDQHLCGSCWAHS 197
+ W + E ++ +++D K + KV++Q CGSCWA S
Sbjct: 114 YYLTPVWRGSDPHMKKAEIPKGTPPAAF------DWRDADKNAVTKVKNQGTCGSCWAFS 167
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+E + IK L+ LS+Q
Sbjct: 168 TTGNIEGQWKIKKGTLVSLSEQ 189
>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
Length = 349
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 84/151 (55%), Gaps = 11/151 (7%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTDSE 135
++F+ + EY R Y + E ++RF ++ N+K I+ T ++ G++ G NRFAD+T+ E
Sbjct: 35 DRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIE--TMNQPGSSYELGENRFADLTEEE 92
Query: 136 FNHG-LSSLD--WEQIENLKSTFETY----SFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
F L LD E + T +T + SN+ S++++ KG V P V+ Q
Sbjct: 93 FKDTYLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTP-VKSQQ 151
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA +AVA +E + IK L+ LS+Q
Sbjct: 152 HCGSCWAFAAVASIEGVHKIKTGLLVSLSEQ 182
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 71/140 (50%), Gaps = 1/140 (0%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ EY + Y +E E+RF IF++N++ I+ + GVN AD+T EF
Sbjct: 39 ENWMAEYGKIYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDS 98
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ L E +TF+ F N + E+I+++ KG V P CGSCWA S V
Sbjct: 99 RNGLK-RTYEFSTTTFKLNGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTV 157
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E Y I L+ LS+Q
Sbjct: 158 AATEGIYQISTGMLMSLSEQ 177
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 61/190 (32%), Positives = 97/190 (51%), Gaps = 24/190 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ ++++ Y E E+RF+IF++NLK ID + E T G+ + D+T+ EF
Sbjct: 45 YELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNS-ENHTYKMGLTPYTDLTNEEFQA 103
Query: 139 GLSSLDWEQIENLKSTF---ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ I LK T E Y++ + ++ L E I+++ KG V P V++Q CGSCWA
Sbjct: 104 IYLGTRSDTIHRLKRTINISERYAYEAGDN--LPEQIDWRKKGAVTP-VKNQGKCGSCWA 160
Query: 196 HSAVACLESAYAIKHNELIELSKQP-----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHA 250
S V+ +ES I+ LI LS+Q K HG KGG Y+ +
Sbjct: 161 FSTVSTVESINQIRTGNLISLSEQQLVDCNKKNHG--CKGGAF----------VYAYQYI 208
Query: 251 VLNVGYDNES 260
+ N G D E+
Sbjct: 209 IDNGGIDTEA 218
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 55/186 (29%), Positives = 91/186 (48%), Gaps = 15/186 (8%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+ ++ ++ + Y++ E E RF IF++NL+ ID + + G+NRFAD+T+ E+
Sbjct: 48 MYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADLTNEEYR 107
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ S + + L +SI++++KG V V+DQ CGSCWA S
Sbjct: 108 AKYLGTKSRESRPKLSKGPSDRYAPVEGEELPDSIDWREKGAV-AAVKDQGSCGSCWAFS 166
Query: 198 AVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMNLPHMLCSKGPYSLNHAVLN 253
A+ +E I ELI LS+Q R Y +GG+M+ Y+ N + N
Sbjct: 167 AIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMD----------YAFNFIIKN 216
Query: 254 VGYDNE 259
G D++
Sbjct: 217 GGIDSD 222
>gi|334347644|ref|XP_001379528.2| PREDICTED: cathepsin W-like [Monodelphis domestica]
Length = 619
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 79/150 (52%), Gaps = 13/150 (8%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
D +QFK F +Y + Y +E ERRF+IF +NL T+ G A +GV +F+D+T+
Sbjct: 261 DLMDQFKAFQIQYNKSYADPAEQERRFEIFADNLAWAQQLTEKHGGMAQFGVTQFSDLTE 320
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSN----SYGLAESINYKDKGKVLPKVQDQHL 189
EF+ + + +S+++ S + L S +++ G VL V+ Q
Sbjct: 321 EEFH--------QHYQPAQSSYKEPSLKTRKHPRLQRPLIRSCDWRKAG-VLTPVRKQKK 371
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
C SCWA +AV +E+ +AI + + ELS Q
Sbjct: 372 CRSCWAIAAVGNVEALWAIHYEQHFELSVQ 401
>gi|227018328|gb|ACP18830.1| cysteine proteinase 1 [Chrysomela tremula]
Length = 323
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 82/146 (56%), Gaps = 13/146 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ DF + + + Y S E + RF+IF++ L+ I + K+E G +TY +N+F+D+TD E
Sbjct: 23 WADFKKAHGKTYKSLREEKLRFNIFQDTLREIAAHNAKYESGESTYYLAINQFSDITDEE 82
Query: 136 FNHGLSSLDWEQIENLKS--TFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
F L ++N++S + E + ESI+++ +G VLP +++Q CGSC
Sbjct: 83 FRAML-------MKNVESRPSLEDMEIANLTVGAAPESIDWRTEGAVLP-IRNQEDCGSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SAVA +E AIK LS Q
Sbjct: 135 WAFSAVAAVEGQAAIKSGSKTPLSVQ 160
>gi|85068712|gb|ABC69436.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 80.9 bits (198), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 97/200 (48%), Gaps = 34/200 (17%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKT 90
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + S E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDPSPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNLPHMLC 240
S + +E + K +L+ LS+Q PPKT+G K G + L
Sbjct: 143 SVIGNVEGQWFRKTGDLLALSEQQLVDCDHLEKGCNGGYPPKTYGEIEKMGGLE----LA 198
Query: 241 SKGPYSLNHAVLNVGYDNES 260
S PY+ V + Y N+S
Sbjct: 199 SDYPYT---GVDGICYMNQS 215
>gi|72005575|ref|XP_783218.1| PREDICTED: cathepsin L2-like isoform 2 [Strongylocentrotus
purpuratus]
gi|390337647|ref|XP_003724610.1| PREDICTED: cathepsin L2-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 334
Score = 80.9 bits (198), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 87/150 (58%), Gaps = 9/150 (6%)
Query: 74 DHGNQFKDFVREYERQYDS-DSEIERRFDIFRNNLKTIDYYT-KHEQGTATY--GVNRFA 129
D ++K++V + ++Y + E+ERR I+ +NL+ I + +H QG TY G+N F
Sbjct: 23 DFDEEWKEWVDYHGKEYSAMGEEMERRM-IWEDNLRIITKHNLEHSQGKTTYRLGMNEFG 81
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
DMT++EF +++ +++ + + +F S L +S++++ +G V P V+DQ
Sbjct: 82 DMTNAEF---VATRTMKKMSGVPKVGQGSTFLPSEFLQLPDSVDWRTEGYVTP-VKDQGQ 137
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S V LE + +K L+ LS+Q
Sbjct: 138 CGSCWAFSTVGALEGQHFVKTGTLVSLSEQ 167
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 82/147 (55%), Gaps = 7/147 (4%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++F+ ++ + R Y E +RRF+++R N++ ++ + G N+FAD+T+ EF
Sbjct: 30 DRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNGY-KLADNKFADLTNEEF 88
Query: 137 NHGLSS----LDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
+ + QI N S SS+ L +S++++ KG V+ +V++Q CGS
Sbjct: 89 RAKMLGFRPHVTIPQISNTCSADIAMPGESSDDI-LPKSVDWRKKGAVV-EVKNQGDCGS 146
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA SAVA +E IK+ EL+ LS+Q
Sbjct: 147 CWAFSAVAAIEGINQIKNGELVSLSEQ 173
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 77/143 (53%), Gaps = 4/143 (2%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+ ++ + R Y +SE RF+IF+ NL+ + + ++ T VN F+D+TD EF
Sbjct: 36 EQWMARFNRVYSDESEKRNRFNIFKKNLEFVQSFNMNKNITYKLDVNEFSDLTDEEFRAT 95
Query: 140 LSSLDW-EQIENLK--STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ L E+I + S+ +T F N ES++++ +G V P V+ Q CG CWA
Sbjct: 96 HTGLVVPEEITGISTLSSDKTVPFRYGNVSDTGESMDWRQEGAVTP-VKYQGRCGGCWAF 154
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAVA +E I EL+ LS+Q
Sbjct: 155 SAVAAVEGITKITKGELVSLSEQ 177
>gi|67773382|gb|AAY81948.1| cysteine protease 11 [Paragonimus westermani]
Length = 322
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 79/143 (55%), Gaps = 13/143 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ F R+Y + Y ++ + ++RF IF++NL +QGTA YGV +F+D+T EF
Sbjct: 27 YEQFKRDYGKVYANEDD-QKRFAIFKDNLVRAQKLQLRDQGTARYGVTQFSDLTPEEFAA 85
Query: 139 GLSS--LDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
S L+ +Q+E ++ T E ++++ KG V P V++Q CGSCWA
Sbjct: 86 KYLSPPLNSDQVERVQPT---------GLKAAPERMDWRAKGAVTP-VENQGECGSCWAF 135
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E + IK +L+ LSKQ
Sbjct: 136 STAGNVEGQWFIKTGQLVSLSKQ 158
>gi|195379514|ref|XP_002048523.1| GJ11310 [Drosophila virilis]
gi|194155681|gb|EDW70865.1| GJ11310 [Drosophila virilis]
Length = 328
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 66/199 (33%), Positives = 96/199 (48%), Gaps = 36/199 (18%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
++K F + R Y DSE R IF +N K ID + ++E G TY GVN F D+ S
Sbjct: 26 EWKSFKEMHGRSYAGDSEELLRRRIFEDNKKLIDTHNARYEAGKETYKMGVNEFTDLLPS 85
Query: 135 EF-NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF + + SL N + Y + S + + ESI+++ KG V P V++Q CGSC
Sbjct: 86 EFVSRMMGSL------NRTAVTADYIYEPSANLQIPESIDWRTKGAVSP-VKNQGTCGSC 138
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ--------PP-KTHGRFYKGGVMNLPHMLCSKG- 243
W +AV LE ++ ++ELS+Q PP + HG C +G
Sbjct: 139 WTFAAVGTLEGQSFLRTKRMVELSEQNLLDCSSHPPYRNHG--------------CQRGY 184
Query: 244 PY-SLNHAVLNVGYDNEST 261
PY +L + N G D S+
Sbjct: 185 PYDALRYVKDNQGLDTRSS 203
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 73/138 (52%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ Y + Y E E+RF +F+ N+ I+ + + G+N+FAD+T+ EF ++
Sbjct: 42 WMTRYGKVYKDPQEREKRFRVFKENVNYIEAFNNAANKSYKLGINQFADLTNKEF---IA 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + S T +F N +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 99 PRNGFKGHMCSSIIRTTTFKFENVTATPSTVDWRQKGAVTP-IKDQGQCGCCWAFSAVAA 157
Query: 202 LESAYAIKHNELIELSKQ 219
E +A+ +LI LS+Q
Sbjct: 158 TEGIHALSAGKLISLSEQ 175
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 80/150 (53%), Gaps = 9/150 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTDSEF 136
+FK + EY R Y + E ++RF I+ N++ I + G++ G N+F D+T+ EF
Sbjct: 37 RFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTEEEF 96
Query: 137 NHG-LSSLD-----WEQIENLKSTFETYSFNSSNSYGLAE-SINYKDKGKVLPKVQDQHL 189
L LD E + T T ++ N+ G A S++++ KG V +V+DQ
Sbjct: 97 KDTYLMKLDEQPPAAEAMGPTVGTMSTAGMSNGNNTGEAPNSVDWRTKGAVT-RVKDQQQ 155
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA + VA +E + IK L+ LS+Q
Sbjct: 156 CGSCWAFATVASIEGVHQIKTGRLVSLSEQ 185
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 80.5 bits (197), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 78/142 (54%), Gaps = 6/142 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--N 137
K ++ +Y R Y + E E+RF IF+ N++ I+ + + G+N F D+T+ EF +
Sbjct: 39 KTWMTQYGRVYKGNVEKEKRFKIFKENVEFIESFNNNGNKPYKLGINAFTDLTNEEFRAS 98
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
H ++ + + +S++ T SF N + S++++ KG V ++DQ CG CWA S
Sbjct: 99 HNGYTM---SMSSHQSSYRTKSFRYENVTAVPPSLDWRTKGAV-THIKDQGQCGCCWAFS 154
Query: 198 AVACLESAYAIKHNELIELSKQ 219
AVA +E + LI LS+Q
Sbjct: 155 AVAAMEGITKLSTGTLISLSEQ 176
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 57/187 (30%), Positives = 95/187 (50%), Gaps = 19/187 (10%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ ++ R Y++ E ERRF+IF++NLK ID + + G+N+FAD+++ E+
Sbjct: 25 YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYRS 84
Query: 139 GL--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ +D + E Y F + L E++++++KG V P V+DQ CGSCWA
Sbjct: 85 VYLGTRMDGKGRLLGGPKSERYLFKEGDD--LPETVDWREKGAVAP-VKDQGQCGSCWAF 141
Query: 197 SAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVL 252
S V +E I L LS+Q KT+ GG+M+ Y+ + +
Sbjct: 142 STVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMD----------YAFDFIIE 191
Query: 253 NVGYDNE 259
N G D E
Sbjct: 192 NGGIDTE 198
>gi|157113282|ref|XP_001657758.1| cathepsin l [Aedes aegypti]
gi|108877803|gb|EAT42028.1| AAEL006389-PA, partial [Aedes aegypti]
Length = 538
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 85/151 (56%), Gaps = 9/151 (5%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
EE LD N+F F ++ + Y ++ E + R DIFR NL+ I + + +G T VN A
Sbjct: 227 EEHLD--NEFTRFRYKHGKSYHNEKEHDLRRDIFRQNLRFIHSHNRAGKGF-TVAVNHLA 283
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG-LAESINYKDKGKVLPKVQDQH 188
D TD E L +L + N+ + + + +N + L ES++++ G V P V+DQ
Sbjct: 284 DRTDEE----LKALRGFKSSNIYNGGQPFPYNPEDFKDELPESLDWRIAGAVTP-VKDQS 338
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+CGSCW+ +ESAY +K+N+L+ S+Q
Sbjct: 339 VCGSCWSFGTAGHIESAYFLKYNKLMRFSQQ 369
>gi|91992512|gb|ABE72972.1| cathepsin L [Aedes aegypti]
Length = 548
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 85/151 (56%), Gaps = 9/151 (5%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
EE LD N+F F ++ + Y ++ E + R DIFR NL+ I + + +G T VN A
Sbjct: 237 EEHLD--NEFTRFRYKHGKSYHNEKEHDLRRDIFRQNLRFIHSHNRAGKGF-TVAVNHLA 293
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG-LAESINYKDKGKVLPKVQDQH 188
D TD E L +L + N+ + + + +N + L ES++++ G V P V+DQ
Sbjct: 294 DRTDEE----LKALRGFKSSNIYNGGQPFPYNPEDFKDELPESLDWRIAGAVTP-VKDQS 348
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+CGSCW+ +ESAY +K+N+L+ S+Q
Sbjct: 349 VCGSCWSFGTAGHIESAYFLKYNKLMRFSQQ 379
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 75/138 (54%), Gaps = 3/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +E E+RF +F+NN+ I+ + +N+FAD+ D EF L
Sbjct: 40 WMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALL- 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
++ ++ + T SF + + +I+++ +G V P ++DQ CGSCWA SAVA
Sbjct: 99 -INVQKKASWVETSTQTSFRYESVTKIPATIDWRKRGAVTP-IKDQGRCGSCWAFSAVAA 156
Query: 202 LESAYAIKHNELIELSKQ 219
E + I +L+ LS+Q
Sbjct: 157 TEGIHQITTGKLVPLSEQ 174
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 56/164 (34%), Positives = 88/164 (53%), Gaps = 14/164 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ + + Y++ E E+RF IF+NNL+ ID E G+N+FAD+T+ E+
Sbjct: 45 FESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEYRS 104
Query: 139 ---GLSSLDWEQIENLKS-TFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
G+ S D + + KS + T S S L ES+++++ G V V+DQ CGSCW
Sbjct: 105 KYTGIKSKDLRKKVSAKSGRYATLSGES-----LPESVDWRESGAV-ATVKDQGSCGSCW 158
Query: 195 AHSAVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMN 234
A S ++ +E I +LI LS+Q R Y GG+M+
Sbjct: 159 AFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMD 202
>gi|401758208|gb|AFQ01139.1| cathepsin F-like protease, partial [Chilo suppressalis]
Length = 537
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 75/144 (52%), Gaps = 8/144 (5%)
Query: 79 FKDFVREYERQYDSD-SEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
F +F+ Y+ +Y +D E+ +RF+IF+ N+K I HE+GT Y V RF D+T EF
Sbjct: 231 FFNFITTYKPEYINDHVEMTKRFEIFKENVKKIHELNTHERGTGVYAVTRFTDLTYEEFK 290
Query: 138 HGLSSLDWEQIENLK--STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
L+ NLK + + L S +++ G V +V+DQ CGSCWA
Sbjct: 291 SKYLGLN----PNLKKPNQIPMRQAEIPKVHQLPASFDWRPLGAVT-EVKDQGACGSCWA 345
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E + +K +L+ LS+Q
Sbjct: 346 FSVTGNIEGQWKLKTGKLLSLSEQ 369
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 82/147 (55%), Gaps = 7/147 (4%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++F+ ++ + R Y E +RRF+++R N++ ++ + G N+FAD+T+ EF
Sbjct: 29 DRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGY-KLADNKFADLTNEEF 87
Query: 137 NHGLSS----LDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
+ + QI N S SS+ L +S++++ KG V+ +V++Q CGS
Sbjct: 88 RAKMLGFRPHVTIPQISNTCSADIAMPGESSDDI-LPKSVDWRKKGAVV-EVKNQGDCGS 145
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA SAVA +E IK+ EL+ LS+Q
Sbjct: 146 CWAFSAVAAIEGINQIKNGELVSLSEQ 172
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 80.5 bits (197), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 75/138 (54%), Gaps = 1/138 (0%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +E+ Y +E E RF IF+ N++ I+ + E G N+F+D+T+ EF +
Sbjct: 45 WIVHHEKVYKDLNEKEVRFQIFKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHT 104
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + S+ F +N + +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 105 GYKRSHPKVMTSSKGKTHFRYTNVTDIPPTMDWRKKGAVTP-IKDQKECGCCWAFSAVAA 163
Query: 202 LESAYAIKHNELIELSKQ 219
+E + +K ELI LS+Q
Sbjct: 164 MEGLHQLKTGELIPLSEQ 181
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 76/140 (54%), Gaps = 6/140 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+D++ +Y R Y E +R+ IF++N+ I+ + K + +N FAD+T+ EF
Sbjct: 40 EDWMAQYGRVYKDAGEKSKRYKIFKDNVARIESFNKAMNKSYKLSINEFADLTNEEFRAS 99
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ ++ ST E SF + + +++++ KG V P ++DQ CGSCWA SAV
Sbjct: 100 RNRFK----AHICST-EATSFKYEHVXAVPSTVDWRKKGAVTP-IKDQGQCGSCWAFSAV 153
Query: 200 ACLESAYAIKHNELIELSKQ 219
A +E + +LI LS+Q
Sbjct: 154 AAMEGITQLSTGKLISLSEQ 173
>gi|28932702|gb|AAO60045.1| midgut cysteine proteinase 2 [Rhipicephalus appendiculatus]
Length = 564
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 57/172 (33%), Positives = 82/172 (47%), Gaps = 11/172 (6%)
Query: 53 RSQPNSYGSEEASTFDLEEFLDHGN-----QFKDFVREYERQYDSDSEIERRFDIFRNNL 107
RS P A + EFL + + F+DF ++R Y+ D+E +RR DIFR NL
Sbjct: 230 RSFPGPGAERLALHNPMAEFLGNHDGHTKHSFEDFKETHKRTYELDTEHDRRRDIFRQNL 289
Query: 108 KTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG 167
+ ID + G VN AD T E + L + + F + F +
Sbjct: 290 RFIDSKNRANLGY-NLAVNHLADRTREEISVLRGRLQSKDGSSRAEPFPRHRFTAK---- 344
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L + I+++ G V P V+DQ +CGSCW+ V LE AY K L+ LS+Q
Sbjct: 345 LPDQIDWRPYGAVTP-VKDQAVCGSCWSFGTVGELEGAYFRKTGRLVRLSEQ 395
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 56/153 (36%), Positives = 88/153 (57%), Gaps = 18/153 (11%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFAD 130
+ G +F+ F ++ + Y + E RF+IF++NL+ I+ + +EQG +Y G+NRF D
Sbjct: 20 ETGVKFQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTD 79
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSY--GLA--ESINYKDKGKVLPKVQD 186
MT EF L+ L S+ + + FN++ GLA +SI+++ KG+V V+D
Sbjct: 80 MTQEEFRAFLT---------LSSSKKPH-FNTTEHVLTGLAVPDSIDWRTKGQV-TGVKD 128
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA S E+AY K +L+ LS+Q
Sbjct: 129 QGNCGSCWAFSVTGSTEAAYYRKAGKLVSLSEQ 161
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 85/160 (53%), Gaps = 8/160 (5%)
Query: 61 SEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT 120
+++AST +L E + + ++ ++ + Y D E RRF IF+NN++ I+ +
Sbjct: 22 ADQASTRELHEST-MVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEFIESSNAAGNNS 80
Query: 121 ATYGVNRFADMTDSEFNHGLSSLDWEQIEN-LKSTFETYSFNSSNSYGLAESINYKDKGK 179
G+NRFAD+T+ EF W + L ++ F N L S++++ KG
Sbjct: 81 YMLGINRFADLTNEEFR-----ASWNGYKRPLDASRIVTPFKYENVTALPYSMDWRRKGA 135
Query: 180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V ++DQ CGSCWA SAVA E + ++ +L+ LS+Q
Sbjct: 136 V-TSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQ 174
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 76/142 (53%), Gaps = 13/142 (9%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN---H 138
++ +Y++ Y +E E+RF IF++N++ I+ + GVN AD+T EF +
Sbjct: 44 WMAKYDKVYKDAAEKEKRFLIFKDNVEFIESFNAAGNKPYKLGVNHLADLTIEEFKASRN 103
Query: 139 GLS-SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
GL S D+E T SF N + S++++ KG V P ++DQ CGSCWA S
Sbjct: 104 GLKRSYDYE--------VGTTSFKYENVTAIPASVDWRKKGAVTP-IKDQGQCGSCWAFS 154
Query: 198 AVACLESAYAIKHNELIELSKQ 219
VA E + I +L+ LS+Q
Sbjct: 155 TVAATEGIHKISTGKLVSLSEQ 176
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 80.1 bits (196), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 81/141 (57%), Gaps = 2/141 (1%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ +Y + Y +E ++RF IF+NN+ I+ + +N+FAD+ D EF
Sbjct: 39 ENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLSINQFADLHDEEFKAL 98
Query: 140 LSSLDWEQIENLKSTFET-YSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L++ + + + + ET SF + L +++++ +G V P ++DQ CGSCWA SA
Sbjct: 99 LTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGAVTP-IKDQRRCGSCWAFSA 157
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E + I ++L+ LS+Q
Sbjct: 158 VAAIEGIHQITTSKLVSLSEQ 178
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 55/186 (29%), Positives = 96/186 (51%), Gaps = 12/186 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N+++ ++ E+ R Y++ E E+RF+IF++NL+ I+ + T G+N+FAD+T+ E+
Sbjct: 48 NRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNEEY 107
Query: 137 NHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
L + + +KS + + S + + S++++ +G V P +++Q CGSCWA
Sbjct: 108 RTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAP-IKNQGSCGSCWA 166
Query: 196 HSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLNHAVLN 253
S VA +E I E+I LS+Q R G C+ G Y+ + N
Sbjct: 167 FSTVAAVEGINQIVTGEMITLSEQELVDCDRVQNSG--------CNGGLMDYAFEFIISN 218
Query: 254 VGYDNE 259
G D E
Sbjct: 219 GGMDTE 224
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 81/146 (55%), Gaps = 9/146 (6%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
D ++++ ++ +Y RQY S E ERRF I++ N++ ID + + T N FAD+T+
Sbjct: 14 DIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNH-SHTLAENNFADLTN 72
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF + L ++ + + F N L +++++ +G V P +++Q CGSC
Sbjct: 73 EEFK--ATYLGYKTV-----SIPDTCFRYGNMVNLPTNVDWRQEGAVTP-IKNQGQCGSC 124
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SAVA +E IK +LI LS+Q
Sbjct: 125 WAFSAVAAVEGINKIKAGKLISLSEQ 150
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 71/140 (50%), Gaps = 1/140 (0%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ EY + Y +E E+RF IF++N++ I+ + GVN AD+T EF
Sbjct: 39 ENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDS 98
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ L E +TF+ F N + E+I+++ KG V P CGSCWA S +
Sbjct: 99 RNGLK-RTYEFSTTTFKLNGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTI 157
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E + I L+ LS+Q
Sbjct: 158 AATEGIHQISTGNLVSLSEQ 177
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 81/146 (55%), Gaps = 9/146 (6%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
D ++++ ++ +Y RQY S E ERRF I++ N++ ID + + T N FAD+T+
Sbjct: 14 DIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNH-SHTLAENNFADLTN 72
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF + L ++ + + F N L +++++ +G V P +++Q CGSC
Sbjct: 73 EEFK--ATYLGYKTV-----SIPDTCFRYGNMVNLPTNVDWRQEGAVTP-IKNQGQCGSC 124
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SAVA +E IK +LI LS+Q
Sbjct: 125 WAFSAVAAVEGINKIKAGKLISLSEQ 150
>gi|356553413|ref|XP_003545051.1| PREDICTED: cysteine proteinase 15A-like [Glycine max]
Length = 367
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 49/175 (28%), Positives = 90/175 (51%), Gaps = 15/175 (8%)
Query: 48 NLILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNL 107
++++++ P++ G EA+ + + L+ + F F ++ ++Y + E +RRF +F++NL
Sbjct: 24 DILIRQVVPDAVG--EAAEKEEDHLLNAEHHFASFKAKFGKKYATKEEHDRRFGVFKSNL 81
Query: 108 KTIDYYTKHEQGTATYGVNRFADMTDSEFNH---GLSSLDWEQIENLKSTFETYSFNSSN 164
+ + K + +A +GV +F+D+T +EF G L T
Sbjct: 82 RRARLHAKLDP-SAVHGVTKFSDLTPAEFRRQFLGFKPLRLPANAQKAPILPTKD----- 135
Query: 165 SYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L + +++DKG V V+DQ CGSCW+ S LE A+ + EL+ LS+Q
Sbjct: 136 ---LPKDFDWRDKGAVT-NVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQ 186
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 75/127 (59%), Gaps = 7/127 (5%)
Query: 95 EIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLK 152
E ERRF +F +NLK +D + E+G G+NRFAD+T+ EF + L + E +
Sbjct: 70 EHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNEEFR--ATFLGAKVAERSR 127
Query: 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE 212
+ E Y + L ES+++++KG V P V++Q CGSCWA SAV+ +ES + E
Sbjct: 128 AAGERYRHDGVEE--LPESVDWREKGAVAP-VKNQGQCGSCWAFSAVSTVESINQLVTGE 184
Query: 213 LIELSKQ 219
+I LS+Q
Sbjct: 185 MITLSEQ 191
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 80.1 bits (196), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 77/143 (53%), Gaps = 12/143 (8%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--- 136
++++ ++ R Y E E+R+ IF+ N++ I+ + GVN+FAD+T+ EF
Sbjct: 6 EEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAM 65
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+HG Q L S+ SF N + S++++ G V P V+DQ CG CWA
Sbjct: 66 HHGYK----RQSSKLMSS----SFRHENLSAIPTSMDWRKAGAVTP-VKDQGTCGCCWAF 116
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAVA +E +K +LI LS+Q
Sbjct: 117 SAVAAIEGIIKLKTGKLISLSEQ 139
>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
Length = 349
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 52/151 (34%), Positives = 84/151 (55%), Gaps = 11/151 (7%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTDSE 135
++F+ + EY R Y + E ++RF ++ N+K I+ T ++ G++ G N+FAD+T+ E
Sbjct: 35 DRFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIE--TMNQPGSSYELGENQFADLTEEE 92
Query: 136 FNHG-LSSLD--WEQIENLKSTFETY----SFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
F L LD E + T +T + SN+ S++++ KG V P V+ Q
Sbjct: 93 FKDTYLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAPNSVDWRTKGAVTP-VKSQQ 151
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA +AVA +E + IK L+ LS+Q
Sbjct: 152 HCGSCWAFAAVASIEGVHKIKTGRLVSLSEQ 182
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 81/139 (58%), Gaps = 6/139 (4%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATYGVNRFADMTDSEFNHGL 140
++ E R Y++ E ERRF +F +NL+ D + + + G+NRFAD+T+ EF
Sbjct: 57 WLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR--A 114
Query: 141 SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
+ L + +E ++ E Y + L ES+++++KG V P V++Q CGSCWA SAV+
Sbjct: 115 TFLGAKVVERSRAAGERYRHDGVEE--LPESVDWREKGAVAP-VKNQGQCGSCWAFSAVS 171
Query: 201 CLESAYAIKHNELIELSKQ 219
+ES + E+I LS+Q
Sbjct: 172 TVESINQLVTGEMITLSEQ 190
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 91/164 (55%), Gaps = 10/164 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+++++ ++++ Y+ E ++RF +F++NL I + ++ T G+N+FADMT+ E+
Sbjct: 39 MYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYR 98
Query: 138 ---HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
G S ++ KST Y++++ + L ++++ KG V P ++DQ CGSCW
Sbjct: 99 VMYFGTKSDAKRRLMKTKSTGHRYAYSAGDR--LPVHVDWRVKGAVAP-IKDQGSCGSCW 155
Query: 195 AHSAVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMN 234
A S VA +E+ I + + LS+Q R Y GG+M+
Sbjct: 156 AFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNEGCNGGLMD 199
>gi|302143414|emb|CBI21975.3| unnamed protein product [Vitis vinifera]
Length = 286
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 76/140 (54%), Gaps = 6/140 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+D++ +Y R Y E +R+ IF++N+ I+ + K + +N FAD+T+ EF
Sbjct: 40 EDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRAS 99
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ ++ ST E SF + + +++++ KG V P ++DQ CGSCWA SAV
Sbjct: 100 RNRFK----AHICST-EATSFKYEHVAAVPSTVDWRKKGAVTP-IKDQGQCGSCWAFSAV 153
Query: 200 ACLESAYAIKHNELIELSKQ 219
A +E + +LI LS+Q
Sbjct: 154 AAMEGITQLSTGKLISLSEQ 173
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/140 (32%), Positives = 76/140 (54%), Gaps = 6/140 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+D++ +Y R Y E +R+ IF++N+ I+ + K + +N FAD+T+ EF
Sbjct: 40 EDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFRAS 99
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ ++ ST E SF + + +++++ KG V P ++DQ CGSCWA SAV
Sbjct: 100 RNRFK----AHICST-EATSFKYEHVAAVPSTVDWRKKGAVTP-IKDQGQCGSCWAFSAV 153
Query: 200 ACLESAYAIKHNELIELSKQ 219
A +E + +LI LS+Q
Sbjct: 154 AAMEGITQLSTGKLISLSEQ 173
>gi|67773380|gb|AAY81947.1| cysteine protease 9 [Paragonimus westermani]
Length = 322
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 82/143 (57%), Gaps = 13/143 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
++ F R+Y + Y ++ + ++RF IF++NL +QGTA YGV +F+D+T EF
Sbjct: 27 YEQFKRDYGKVYANEDD-QKRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 85
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ ++++ +Q+E ++ T E +++++KG V V++Q CGSCWA
Sbjct: 86 KYLRAAVNNDQVERVRPT---------GLKAAPERMDWREKGAV-TAVENQGSCGSCWAF 135
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SA +E + IK +L+ LSKQ
Sbjct: 136 SAAGNVEGQWFIKTGQLVSLSKQ 158
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/186 (30%), Positives = 92/186 (49%), Gaps = 16/186 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ ++ + Y++ E E RF IF++N+ +D + + G+N+FAD+T+ E+
Sbjct: 60 YESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTNDEYRS 119
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
LS ++ + F + F + L ES++++D+G V P V+DQ CGSCWA S
Sbjct: 120 LYLSGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAP-VKDQGQCGSCWAFS 178
Query: 198 AVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMNLPHMLCSKGPYSLNHAVLN 253
V +E I ELI LS+Q Y GG+M+ Y+ V N
Sbjct: 179 TVGAVEGINKIVTGELISLSEQELVDCDNGYNQGCNGGLMD----------YAFEFIVKN 228
Query: 254 VGYDNE 259
G D E
Sbjct: 229 GGIDTE 234
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 81/139 (58%), Gaps = 6/139 (4%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATYGVNRFADMTDSEFNHGL 140
++ E R Y++ E ERRF +F +NL+ D + + + G+NRFAD+T+ EF
Sbjct: 56 WLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR--A 113
Query: 141 SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
+ L + +E ++ E Y + L ES+++++KG V P V++Q CGSCWA SAV+
Sbjct: 114 TFLGAKVVERSRAAGERYRHDGVEE--LPESVDWREKGAVAP-VKNQGQCGSCWAFSAVS 170
Query: 201 CLESAYAIKHNELIELSKQ 219
+ES + E+I LS+Q
Sbjct: 171 TVESINQLVTGEMITLSEQ 189
>gi|9630063|ref|NP_046281.1| cathepsin [Orgyia pseudotsugata MNPV]
gi|2499880|sp|O10364.1|CATV_NPVOP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|7435821|pir||T10394 cathepsin - Orgyia pseudotsugata nuclear polyhedrosis virus
gi|1911371|gb|AAC59124.1| cathepsin [Orgyia pseudotsugata MNPV]
Length = 324
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/161 (34%), Positives = 89/161 (55%), Gaps = 10/161 (6%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G A+T+DL L N F+DF+ ++ + Y S+SE RF IF++NL+ I +++
Sbjct: 12 GVVHAATYDL---LKAPNYFEDFLHKFNKNYSSESEKLHRFKIFQHNLEEIINKNQND-S 67
Query: 120 TATYGVNRFADMTDSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKG 178
TA Y +N+F+D++ E + L Q +N E + G E +++
Sbjct: 68 TAQYEINKFSDLSKEEAISKYTGLSLPHQTQNF---CEVVILDRPPDRGPLE-FDWRQFN 123
Query: 179 KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
KV V++Q +CG+CWA + + LES +AIK+N LI LS+Q
Sbjct: 124 KV-TSVKNQGVCGACWAFATLGSLESQFAIKYNRLINLSEQ 163
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 101/188 (53%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ +++ E+ R Y++ E ERRF++FR+NL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 41 YAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHSFRLGLNRFADLTNEE 100
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ S+ + + + + + + ++ L E+++++ KG V ++DQ CGSCWA
Sbjct: 101 YR---STYLGARTKPDRERKLSARYQADDNEELPETVDWRKKGAV-AAIKDQGGCGSCWA 156
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
SA+A +E I ++I LS+Q ++ GG+M+ Y+ +
Sbjct: 157 FSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMD----------YAFEFII 206
Query: 252 LNVGYDNE 259
N G D+E
Sbjct: 207 NNGGIDSE 214
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/140 (34%), Positives = 78/140 (55%), Gaps = 7/140 (5%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG 139
++ +Y + Y E E RF IF+ N+ I+ + + T +Y G+N+FAD+T+ EF
Sbjct: 42 WMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADD-TKSYKLGINQFADLTNEEF--- 97
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
++S + + S T SF N G+ +++++ KG V P V++Q CG CWA SAV
Sbjct: 98 IASRNKFKGHMCSSIMRTTSFKYENVSGIPSTVDWRKKGAVTP-VKNQGQCGCCWAFSAV 156
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E + + +LI LS+Q
Sbjct: 157 AATEGIHKLSTGKLISLSEQ 176
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 82/165 (49%), Gaps = 13/165 (7%)
Query: 55 QPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT 114
Q S EAS F+ E ++ +Y R Y ++E RF IF +N+K I+ +
Sbjct: 42 QATSRTLPEASMFERHE---------QWMIQYGRVYKDEAEKSVRFQIFMDNVKFIEEFN 92
Query: 115 KHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINY 174
K + + VN FAD T+ EF +S + ++ +T F N + S+++
Sbjct: 93 KDGRQSYKLAVNEFADQTNEEFQ---ASRNGYKMAVSSRPSQTTLFRYENVTAVPSSMDW 149
Query: 175 KDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ KG V P V+DQ CGSCWA S +A E +K +LI LS+Q
Sbjct: 150 RKKGAVTP-VKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQ 193
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 73/146 (50%), Gaps = 5/146 (3%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
D+ F +F +Y + Y+ +E RF IF+ N+ I Y T T GVN F D+T
Sbjct: 22 DYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDII-YATNARNLTFALGVNEFTDLTQ 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
E + L + + T+ +N + LA S+++ +G V P V++Q CGSC
Sbjct: 81 EELAASYTGLKPASLWSGLPRLSTHEYNGAP---LASSVDWTTQGVVTP-VKNQGQCGSC 136
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
W+ S LE A+A+ L+ LS+Q
Sbjct: 137 WSFSTTGALEGAWALSTGNLVSLSEQ 162
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 88/166 (53%), Gaps = 13/166 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG---TATYGVNRFADMTDSE 135
++ + ++ R Y++ E E+R +IFR+NL+ ID + + G+ RFAD+T+ E
Sbjct: 47 YQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRFADLTNEE 106
Query: 136 FNH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
+ G+ + + N Y F SS+ L +SI+++DKG V+ V+DQ CGS
Sbjct: 107 YRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDD--LPDSIDWRDKGAVV-DVKDQGSCGS 163
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMN 234
CWA S +A +E I +LI LS+Q +Y GG+M+
Sbjct: 164 CWAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMD 209
>gi|285002340|ref|YP_003422404.1| cathepsin [Pseudaletia unipuncta granulovirus]
gi|197343600|gb|ACH69415.1| cathepsin [Pseudaletia unipuncta granulovirus]
Length = 338
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 53/163 (32%), Positives = 86/163 (52%), Gaps = 12/163 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
F +FV +Y + Y +D+E + RFD+F+ NL I+ E+ +AT+G+N ++D++ +E
Sbjct: 37 FDEFVTKYGKVYANDAERKSRFDVFKANLAIINERNAQEE-SATFGINFYSDLSSNELLR 95
Query: 137 -NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G + E ++ L E+ N++D V V+ Q CGSCWA
Sbjct: 96 KQTGFKTALHNDNEKKSKYCTRRVITGPSTRLLPEAFNWRDSDAV-TSVKQQRDCGSCWA 154
Query: 196 HSAVACLESAYAIKHNELIELSKQ-----PPKTHGRFYKGGVM 233
SAVA +ES Y IK+ + ++LS+Q P +G GG+M
Sbjct: 155 FSAVANIESQYYIKNKQYVDLSEQQIVDCDPINNG--CNGGLM 195
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 88/165 (53%), Gaps = 12/165 (7%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQ 118
GS+ S FDL + Q+ F + +QY SD+E R IF N T+ + K + Q
Sbjct: 13 GSQAVSFFDLVQ-----EQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQ 67
Query: 119 GTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKS--TFETYSFNSSNSYGLAESINY 174
G ++ G+N++ADM EF L+ + + L+S + ++ +F + L I++
Sbjct: 68 GLVSFKLGINKYADMLHHEFVQVLNGFNRTK-SGLRSGESDDSVTFLPPANVQLPGQIDW 126
Query: 175 KDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+DKG V P V+DQ CGSCW+ SA LE + K +L+ LS+Q
Sbjct: 127 RDKGAVTP-VKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQ 170
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 79.7 bits (195), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 41/125 (32%), Positives = 69/125 (55%), Gaps = 1/125 (0%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKST 154
+ ++RF++F+ N+K I + + + T +N+F DMT+ EF + + L+
Sbjct: 56 DTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDMTNQEFRSTYAGSKIDHHMTLRGV 115
Query: 155 FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELI 214
+ F+ + L S+++++KG V V+DQ CGSCWA S V +E IK NEL+
Sbjct: 116 KDAGEFSYEKFHDLPTSVDWREKGAV-TGVKDQGQCGSCWAFSTVVAVEGINQIKTNELV 174
Query: 215 ELSKQ 219
LS+Q
Sbjct: 175 SLSEQ 179
>gi|126338866|ref|XP_001379280.1| PREDICTED: cathepsin F-like [Monodelphis domestica]
Length = 567
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 76/147 (51%), Gaps = 18/147 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
FKDF+ Y + Y + +E +RR IF NL+ + +QG+A YGV +F+D+T+ EF
Sbjct: 270 FKDFLTTYNKSYANATETQRRLGIFARNLELAHKLQELDQGSAQYGVTKFSDLTEEEFRM 329
Query: 137 ---NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE-SINYKDKGKVLPKVQDQHLCGS 192
N LSSL + + + G A S +++D G L ++Q +CGS
Sbjct: 330 FYLNPLLSSLPGRALR-----------PAPRARGPAPASWDWRDHG-ALTAAKNQGMCGS 377
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA S +E + ++ L+ LS+Q
Sbjct: 378 CWAFSVTGNVEGQWFLRRGALLTLSEQ 404
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 78/138 (56%), Gaps = 3/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +E E+RF +F+NN+ I+ + +N+FAD+ D EF L
Sbjct: 40 WMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLI 99
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
++ ++ ++++ ET SF + + +I+ + +G V P ++DQ CGSCWA SAVA
Sbjct: 100 NVQ-KKASWVETSTET-SFRYESVTKIPATIDRRKRGAVTP-IKDQGRCGSCWAFSAVAA 156
Query: 202 LESAYAIKHNELIELSKQ 219
E + I +L+ LS+Q
Sbjct: 157 TEGIHQITTGKLVPLSEQ 174
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 81/147 (55%), Gaps = 10/147 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSE 135
N F F Y + Y ++ E ++R+ IF+NNL I +T ++QG + + +N F D++ E
Sbjct: 117 NAFGSFRATYGKSYATEEETQKRYAIFKNNLAYI--HTHNQQGYSYSLKMNHFGDLSREE 174
Query: 136 FNHGLSSLDWEQIENLKST---FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
F L + + NLKS T S S + ++++++KG V P V+DQ CGS
Sbjct: 175 FRR--KYLGYNKSRNLKSNNLGVATELLKVSPS-DVPSAVDWREKGCVTP-VKDQRDCGS 230
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA SA LE A+ K EL+ LS+Q
Sbjct: 231 CWAFSATGALEGAHCAKTGELLSLSEQ 257
>gi|85068708|gb|ABC69434.1| cysteine protease [Clonorchis sinensis]
gi|85068710|gb|ABC69435.1| cysteine protease [Clonorchis sinensis]
Length = 328
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 61/200 (30%), Positives = 97/200 (48%), Gaps = 34/200 (17%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKT 90
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + + E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDLTPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNLPHMLC 240
S + +E + K +L+ LS+Q PPKT+G K G + L
Sbjct: 143 SVIGNVEGQWFRKTGDLLALSEQQLVDCDHLDKGCNGGYPPKTYGEIEKMGGLE----LA 198
Query: 241 SKGPYSLNHAVLNVGYDNES 260
S PY+ V + Y N+S
Sbjct: 199 SDYPYT---GVDGICYMNQS 215
Score = 40.0 bits (92), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 36/131 (27%), Positives = 51/131 (38%), Gaps = 15/131 (11%)
Query: 129 ADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
D D N G + +IE + S Y + I Y ++ K + V D
Sbjct: 170 CDHLDKGCNGGYPPKTYGEIEKMGG----LELASDYPYTGVDGICYMNQSKFVAYVNDS- 224
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKGPYSLN 248
+ + E A K E+ LS +FY GG++ LC+ P+ LN
Sbjct: 225 --------TVLPLSEKIQAQKLKEIGPLSSALNAVLLQFYLGGIIFPIPFLCN--PHGLN 274
Query: 249 HAVLNVGYDNE 259
HAVL VGY E
Sbjct: 275 HAVLTVGYGTE 285
>gi|45822205|emb|CAE47499.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 317
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 55/155 (35%), Positives = 81/155 (52%), Gaps = 8/155 (5%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
Q+ F + ++Y E + RF +F NL+ I+ + +++ G ++ GVN+FADMT
Sbjct: 15 QWAQFKVNHSKKYGHLKEEQVRFQVFSQNLQKIEQHNARYQNGEVSFYLGVNQFADMTSE 74
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + LD + I K T F + + ESI++++KG V P V+DQ CGSCW
Sbjct: 75 EFK---AMLDSQLIHKPKRDI-TSRFVADPQLTVPESIDWREKGAVNP-VRDQEQCGSCW 129
Query: 195 AHSAVACLESAYAIKHNELIELSKQPPKTHGRFYK 229
A SA LE +K +L LS Q R YK
Sbjct: 130 AFSAAGALEGQRFLKEGKLEVLSTQQLVDCSRDYK 164
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 105/191 (54%), Gaps = 28/191 (14%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
++ ++ ++ + Y++ E +RRF IF++NL+ ID +H G TY G+N+FAD+T+ E+
Sbjct: 52 YESWLVKHGKTYNALGEKDRRFQIFKDNLRFID---EHNSGDHTYKLGLNKFADLTNEEY 108
Query: 137 NH---GLSSLD-WEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
G+ ++D +++ +KS + Y++ S +S L E ++++++G V V+DQ CGS
Sbjct: 109 RMTYTGIKTIDDKKKLSKMKS--DRYAYRSGDS--LPEYVDWREQGAVT-DVKDQGSCGS 163
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLN 248
CWA S +E I +LI +S+Q ++ + GG+M+ Y+
Sbjct: 164 CWAFSTTGSVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGGLMD----------YAFE 213
Query: 249 HAVLNVGYDNE 259
+ N G D E
Sbjct: 214 FIIKNGGIDTE 224
>gi|413947586|gb|AFW80235.1| hypothetical protein ZEAMMB73_542371 [Zea mays]
Length = 264
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 42/140 (30%), Positives = 74/140 (52%), Gaps = 8/140 (5%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--NHG 139
++ EY R Y ++ RRF++F++N ++ + ++ GVN+FAD+T F N G
Sbjct: 44 WMAEYGRVYKDAADKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTEAFKANKG 103
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ E+ +E S ++ L +++++ KG V P +++Q CG CWA SAV
Sbjct: 104 FKPISAEKAPTTGFKYENLSISA-----LPTAVDWRTKGAVTP-IKNQGQCGCCWAFSAV 157
Query: 200 ACLESAYAIKHNELIELSKQ 219
A +E + L+ LS+Q
Sbjct: 158 AAVEGIVKLSTGNLVSLSEQ 177
>gi|55735421|gb|AAV59468.1| cathepsin [Bombyx mori NPV]
Length = 323
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 82/148 (55%), Gaps = 8/148 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L N F++FV + + Y S+ E RRF IF++NL I K++ +A Y +N+F+D++
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 133 DSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
E + L Q +N + + G E +++ KV V++Q +CG
Sbjct: 80 KDETIAKYTGLSLPTQTQNF---CKVILLDQPPGKGPLE-FDWRRLNKV-TSVKNQGMCG 134
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
+CWA + +A LES +AIKHN+LI LS+Q
Sbjct: 135 ACWAFATLASLESQFAIKHNQLINLSEQ 162
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 56/165 (33%), Positives = 88/165 (53%), Gaps = 12/165 (7%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQ 118
GS+ S FDL + Q+ F + +QY SD+E R IF N T+ + K + Q
Sbjct: 13 GSQAVSFFDLVQ-----EQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQ 67
Query: 119 GTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKS--TFETYSFNSSNSYGLAESINY 174
G ++ G+N++ADM EF L+ + + L+S + ++ +F + L I++
Sbjct: 68 GLVSFKLGINKYADMLHHEFVQVLNGFNRTK-SGLRSGESDDSVTFLPPANVQLPGQIDW 126
Query: 175 KDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+DKG V P V+DQ CGSCW+ SA LE + K +L+ LS+Q
Sbjct: 127 RDKGAVTP-VKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQ 170
>gi|13124011|sp|Q9YWK4.1|CATV_NPVBS RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|3882976|gb|AAC77812.1| cathepsin [Buzura suppressaria NPV]
Length = 331
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 83/150 (55%), Gaps = 7/150 (4%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
+ L G+ F+ F+ Y + Y+ SE ERRF IF+ L+ I+Y + +A Y +N+FAD
Sbjct: 23 DLLKAGDYFETFLANYNKMYNDTSEKERRFSIFQQTLEEINYKNRLND-SAVYQINKFAD 81
Query: 131 MTDSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
++ +E + L+ Q N T + G + +++ + KV +++Q
Sbjct: 82 LSKNEIISKYTGLNMPVQTTNFCKTI---VIDQPPGKG-PLNFDWRQQNKV-TSIKNQKA 136
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CG+CWA + +A +ES YAIK+N I+LS+Q
Sbjct: 137 CGACWAFATLASIESQYAIKNNVHIDLSEQ 166
>gi|255626679|gb|ACU13684.1| unknown [Glycine max]
Length = 229
Score = 79.3 bits (194), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 53/189 (28%), Positives = 98/189 (51%), Gaps = 20/189 (10%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+++++ ++++ Y+ E ++RF +F++NL I + ++ T G+N+FADMT+ E+
Sbjct: 39 MYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYR 98
Query: 138 ---HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
G S ++ KST Y++++ + L ++++ KG V P ++DQ CGSCW
Sbjct: 99 VMYFGTKSDAKRRLMKTKSTGHRYAYSAGDR--LPVHVDWRVKGAVAP-IKDQGSCGSCW 155
Query: 195 AHSAVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMNLPHMLCSKGPYSLNHA 250
A S VA +E+ I + + LS+Q R Y GG+M+ Y+
Sbjct: 156 AFSTVATVEATNKIVTGKFVSLSEQELVDCDRAYNERCNGGLMD----------YAFEFI 205
Query: 251 VLNVGYDNE 259
+ N G D +
Sbjct: 206 IQNGGIDTD 214
>gi|170032975|ref|XP_001844355.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167873312|gb|EDS36695.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 1454
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 79/144 (54%), Gaps = 8/144 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F + R Y S E E RF IF+NNL I+ K+EQGTA YG+ FADMT +E+
Sbjct: 1146 FDKFKTRHNRTYQSSLEHEMRFRIFKNNLFKIEQLNKYEQGTAKYGITHFADMTSAEYRA 1205
Query: 139 --GL-SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
GL + +++ ++++ L ++ ++++ G V +V++Q CGSCWA
Sbjct: 1206 RTGLVVPREGDEVNHIRNPMAEI----DEHMELPDAFDWRELGAV-SEVKNQGNCGSCWA 1260
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S V +E + +K +L E S+Q
Sbjct: 1261 FSVVGNIEGLHQVKTKKLEEYSEQ 1284
>gi|324514421|gb|ADY45863.1| Viral cathepsin [Ascaris suum]
Length = 399
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/173 (30%), Positives = 89/173 (51%), Gaps = 12/173 (6%)
Query: 54 SQPNSYGSEEASTFDLEEFLD----HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKT 109
S S GS E D+ E + + + F F++EY+RQY S+ E RF RN ++
Sbjct: 70 SSDPSAGSLETILADMGELSNDYPIYIDSFVKFMQEYDRQYSSNDETRLRF---RNFVRN 126
Query: 110 IDYYTKHEQG--TATYGVNRFADMTDSEFNHGLSSLDWEQIE-NLKSTFETYSFNSSNSY 166
+ + K ++G +G+ RF D +++E ++ DW E + T + S +
Sbjct: 127 MKFIKKAQKGRDNVVFGITRFTDWSEAEM-KSMTCEDWAANEVGSEITLDDDQDESDEVF 185
Query: 167 GLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++ +++ K V+ ++DQ CGSCWA +A+ +ES AI N LI LS+Q
Sbjct: 186 DRPDAFDWRTKS-VVTDIKDQERCGSCWAFAAIGVVESMNAIAKNPLISLSEQ 237
>gi|227018354|gb|ACP18843.1| cysteine proteinase 4 [Chrysomela tremula]
Length = 161
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/150 (35%), Positives = 81/150 (54%), Gaps = 8/150 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFA 129
L Q+ F + + + Y S E RF IF+NNL+ I+ + TK++ G TY GV +FA
Sbjct: 17 LTDSEQWVAFKQTHGKTYKSALEESLRFSIFKNNLRKIEEHNTKYDNGEETYYLGVTKFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
DM+ EF L+ Q++ S + ++ + +S+++++KG VLP +++Q
Sbjct: 77 DMSSEEFEDLLN----RQMKERPSLNSSLKHEYDSNQEIPDSVDWREKGAVLP-IRNQGS 131
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SA LE AIK LS Q
Sbjct: 132 CGSCWAFSAAGALEGQNAIKSGVKSPLSIQ 161
>gi|195997891|ref|XP_002108814.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
gi|190589590|gb|EDV29612.1| hypothetical protein TRIADDRAFT_20325 [Trichoplax adhaerens]
Length = 333
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 73/143 (51%), Gaps = 8/143 (5%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+FK F+ +Y R Y + E E RF F+ N + I ATYGVN+FAD TD EF
Sbjct: 35 RFKSFITDYNRNYTTKEEHEFRFQTFKKNFRRI---ASTNANGATYGVNKFADWTDEEFK 91
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD-KGKVLPKVQDQHLCGSCWAH 196
L + E + S +S ++ S+++++ K ++ V++Q CG CWA
Sbjct: 92 ELLGNRQVPTQEIVNSELH----HSLSTAKFPSSLDWREHKRNIVGPVRNQGRCGCCWAF 147
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S V + SA+A+ N ELS Q
Sbjct: 148 STVETIASAWALAGNSFTELSVQ 170
>gi|443724292|gb|ELU12369.1| hypothetical protein CAPTEDRAFT_165495 [Capitella teleta]
Length = 351
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 79/144 (54%), Gaps = 5/144 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGTATY--GVNRFADMTDSE 135
++DF +ER Y E++R+ ++FRNNLK I+ + H QG ++Y G+N+FADM E
Sbjct: 44 WQDFKTVHERNYGETEEMQRK-EVFRNNLKKIEMHNYLHSQGKSSYRMGINQFADMEVKE 102
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
F ++ ++ ++ + + L ++++ +G V P ++DQ CGSCW+
Sbjct: 103 FASVVNGFRMNNRTKVRDHLHSHYISPAIPVSLPAEVDWRKEGYVTP-IKDQGHCGSCWS 161
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S LE + K +L+ LS+Q
Sbjct: 162 FSTTGALEGQHFRKTGKLVSLSEQ 185
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 74/127 (58%), Gaps = 7/127 (5%)
Query: 95 EIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLK 152
E ERRF +F +NLK +D + E G G+NRFAD+T+ EF + L + E +
Sbjct: 69 EHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNEEFR--ATFLGAKVAERSR 126
Query: 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE 212
+ E Y + L ES+++++KG V P V++Q CGSCWA SAV+ +ES + E
Sbjct: 127 AAGERYRHDGVEE--LPESVDWREKGAVAP-VKNQGQCGSCWAFSAVSTVESINQLVTGE 183
Query: 213 LIELSKQ 219
+I LS+Q
Sbjct: 184 MITLSEQ 190
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 58/187 (31%), Positives = 93/187 (49%), Gaps = 20/187 (10%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
F+ ++ E + Y+ E ++RF+IF +NLK + + + G+ RFAD+T+ EF
Sbjct: 36 MFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGLTRFADLTNEEFR 95
Query: 138 HGLSSLDWEQI-ENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
E+ +++KS E Y N + L + ++++ KG V+P V+DQ CGSCWA
Sbjct: 96 AIYLRSKMERTRDSVKS--ERYLHNVGDK--LPDEVDWRAKGAVVP-VKDQGSCGSCWAF 150
Query: 197 SAVACLESAYAIKHNELIELSKQPPKTHGRFYK----GGVMNLPHMLCSKGPYSLNHAVL 252
SA+ +E IK EL+ LS+Q Y GG+M+ Y+ +
Sbjct: 151 SAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMD----------YAFQFIIS 200
Query: 253 NVGYDNE 259
N G D E
Sbjct: 201 NGGIDTE 207
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 78/142 (54%), Gaps = 9/142 (6%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--N 137
++++ EY R Y +E RRF+ F++N+ ++ + +++ GVN+FAD+T EF N
Sbjct: 37 ENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFADLTTEEFKAN 96
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
G E++ +E S ++ L +++++ KG V P +++Q CG CWA S
Sbjct: 97 KGFKPTA-EKVPTTGFKYENLSVSA-----LPTAVDWRTKGAVTP-IKNQGQCGCCWAFS 149
Query: 198 AVACLESAYAIKHNELIELSKQ 219
AVA +E + LI LS+Q
Sbjct: 150 AVAAMEGIVKLSTGNLISLSEQ 171
>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
Length = 363
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 79/150 (52%), Gaps = 6/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+ LD + FK F ++ R YD++ E E R +F++NL+ + + TA +GV +F+
Sbjct: 41 DPLLDPEHHFKLFKNKFGRTYDTEEEHEYRLTVFKSNLRRAKRHQVLDP-TAKHGVTKFS 99
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T SEF L LK + + L + +++DKG V P V++Q
Sbjct: 100 DLTPSEFRKKYLGLK----SKLKLPADANKAPILPTSNLPQDFDWRDKGAVTP-VKNQGS 154
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ S LE ++ ++ EL+ LS+Q
Sbjct: 155 CGSCWSFSTTGALEGSHFLQTGELVSLSEQ 184
>gi|47779249|gb|AAT38521.1| cysteine protease [Bombyx mori NPV]
Length = 323
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 81/148 (54%), Gaps = 8/148 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L N F++FV + + Y S+ E RRF IF++NL I K++ +A Y +N+F+D++
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 133 DSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
E + L Q +N + + G E +++ KV V++Q +CG
Sbjct: 80 KDETIAKYTGLSLPTQTQNF---CKVILLDQPPGKGPLE-FDWRRLNKV-TSVKNQGMCG 134
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
+CWA + + LES +AIKHNELI LS+Q
Sbjct: 135 ACWAFATLGSLESQFAIKHNELINLSEQ 162
>gi|237643659|ref|YP_002884349.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
gi|229358205|gb|ACQ57300.1| V-CATH [Bombyx mandarina nucleopolyhedrovirus]
Length = 323
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 81/148 (54%), Gaps = 8/148 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L N F++FV + + Y S+ E RRF IF++NL I K++ +A Y +N+F+D++
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 133 DSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
E + L Q +N + + G E +++ KV V++Q +CG
Sbjct: 80 KDETIAKYTGLSLPTQTQNF---CKVIILDQPPGKGPLE-FDWRRLNKV-TSVKNQGMCG 134
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
+CWA + + LES +AIKHNELI LS+Q
Sbjct: 135 ACWAFATLGSLESQFAIKHNELINLSEQ 162
>gi|27819101|gb|AAO23117.1| cysteine proteinase [Bombyx mori NPV]
Length = 323
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 81/148 (54%), Gaps = 8/148 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L N F++FV + + Y S+ E RRF IF++NL I K++ +A Y +N+F+D++
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 133 DSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
E + L Q +N + + G E +++ KV V++Q +CG
Sbjct: 80 KDETIAKYTGLSLPTQTQNF---CKVILLDQPPGKGPLE-FDWRRLNKV-TSVKNQGMCG 134
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
+CWA + + LES +AIKHNELI LS+Q
Sbjct: 135 ACWAFATLGSLESQFAIKHNELINLSEQ 162
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/168 (30%), Positives = 86/168 (51%), Gaps = 11/168 (6%)
Query: 64 ASTFDLEEFLDHGN-----QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQ 118
+S D+E + N ++ ++ E+ + Y+S E E RF+IF+ NL+ ID +
Sbjct: 22 SSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFKENLRIIDDHNADAN 81
Query: 119 GTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKG 178
+ + G+NRFAD+TD E+ L ++ + + ++ L + ++++ G
Sbjct: 82 RSYSLGLNRFADLTDEEYRSTYLGLKRGPKTDVSNQYMPKVGDA-----LPDYVDWRTVG 136
Query: 179 KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGR 226
V+ V++Q LC SCWA SAVA +E I LI LS+Q GR
Sbjct: 137 AVV-GVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGR 183
>gi|29567137|ref|NP_818699.1| cathepsin [Adoxophyes honmai NPV]
gi|37076951|sp|Q80LP4.1|CATV_NPVAH RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|29467913|dbj|BAC67303.1| cathepsin [Adoxophyes honmai NPV]
Length = 337
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 79/147 (53%), Gaps = 8/147 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F+ Y +QY RF IF+ NL+ I+ K +A Y +N+F+D++ +E
Sbjct: 32 FETFIINYNKQYPDTKTKNYRFKIFKQNLEDINEKNKLND-SAIYNINKFSDLSKNELLT 90
Query: 139 GLSSLDWEQIENLKSTFETYS----FNSSNSY--GLAESINYKDKGKVLPKVQDQHLCGS 192
+ L ++ N+ + + ++ L ++ +++ K + V+DQ CGS
Sbjct: 91 KYTGLTSKKPSNMVRSTSNFCNVIHLDAPPDVHDELPQNFDWRVNNK-MTSVKDQGACGS 149
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWAH+AV LE+ YAIKHN LI LS+Q
Sbjct: 150 CWAHAAVGTLETLYAIKHNYLINLSEQ 176
>gi|9630927|ref|NP_047524.1| Cystein Protease [Bombyx mori NPV]
gi|1168798|sp|P41721.1|CATV_NPVBM RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|540066|gb|AAB49542.1| cysteine protease [Bombyx mori NPV]
gi|3745946|gb|AAC63793.1| Cystein Protease [Bombyx mori NPV]
Length = 323
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 81/148 (54%), Gaps = 8/148 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L N F++FV + + Y S+ E RRF IF++NL I K++ +A Y +N+F+D++
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 133 DSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
E + L Q +N + + G E +++ KV V++Q +CG
Sbjct: 80 KDETIAKYTGLSLPTQTQNF---CKVILLDQPPGKGPLE-FDWRRLNKV-TSVKNQGMCG 134
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
+CWA + + LES +AIKHNELI LS+Q
Sbjct: 135 ACWAFATLGSLESQFAIKHNELINLSEQ 162
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/187 (28%), Positives = 91/187 (48%), Gaps = 16/187 (8%)
Query: 79 FKDFVREYERQYDS-DSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ E+ + Y+ E ++RF+IF++NL+ ID + G+NRFAD+T+ E+
Sbjct: 49 YESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGLNRFADLTNEEYR 108
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ + T + L +SI++++KG V +V+DQ CGSCWA S
Sbjct: 109 STYLGAKTDARRRIAKTKSDRRYAPKAGGSLPDSIDWREKGAVA-EVKDQGSCGSCWAFS 167
Query: 198 AVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVLN 253
+A +E I ELI LS+Q ++ GG+M+ Y+ + N
Sbjct: 168 TIAAVEGINQIVTGELISLSEQELVDCDTSYNEGCNGGLMD----------YAFEFIIKN 217
Query: 254 VGYDNES 260
G D E+
Sbjct: 218 GGIDTEA 224
>gi|393660044|gb|AFN09033.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 81/148 (54%), Gaps = 8/148 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L N F++FV + + Y S+ E RRF IF++NL I K++ +A Y +N+F+D++
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 133 DSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
E + L Q +N + + G E +++ KV V++Q +CG
Sbjct: 80 KDETIAKYTGLSLPTQTQNF---CKVILLDQPPGKGPLE-FDWRRLNKV-TSVKNQGMCG 134
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
+CWA + + LES +AIKHNELI LS+Q
Sbjct: 135 ACWAFATLGSLESQFAIKHNELINLSEQ 162
>gi|1323748|gb|AAC49287.1| thiol protease [Triticum aestivum]
Length = 374
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/160 (34%), Positives = 85/160 (53%), Gaps = 12/160 (7%)
Query: 69 LEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRF 128
LE+ L +F ++ ++ + Y E RRFDIFR N++ I+ + + + T GVN+F
Sbjct: 40 LEDSLLMMERFHGWMAKHGKSYAGVEEKLRRFDIFRRNVEFIEAANRDGRLSYTLGVNQF 99
Query: 129 ADMTDSEFNHGLSSLDWEQI----ENLKSTFETYSFNSSNSY----GLAESINYKDKGKV 180
AD+T EF L++ ++ E + +T +N + SIN+ ++ KV
Sbjct: 100 ADLTHEEF---LATHTSRRVVPSEEMVITTRAGVVVEGANCQPAPNAVPRSINWVNQSKV 156
Query: 181 LPKVQDQHLCGSCWAHSAVACLESAYAI-KHNELIELSKQ 219
P +CG+CWA SAVA +ESAYAI K E LS+Q
Sbjct: 157 TPVKNQGKVCGACWAFSAVATIESAYAIAKRGEPPVLSEQ 196
>gi|393717301|gb|AFN21222.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 80/148 (54%), Gaps = 8/148 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L N F++FV + + Y S+ E RRF IF++NL I K++ +A Y +N+F+D++
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 133 DSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
E + L Q +N + G E +++ KV V++Q +CG
Sbjct: 80 KDETIAKYTGLSLPTQTQNFCKVI---LLDQPPGKGPLE-FDWRRLNKV-TSVKNQGMCG 134
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
+CWA + + LES +AIKHNELI LS+Q
Sbjct: 135 ACWAFATLGSLESQFAIKHNELINLSEQ 162
>gi|393717160|gb|AFN21082.1| V-Cath [Bombyx mori NPV]
gi|393717442|gb|AFN21362.1| V-Cath [Bombyx mori NPV]
Length = 323
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 51/148 (34%), Positives = 80/148 (54%), Gaps = 8/148 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L N F++FV + + Y S+ E RRF IF++NL I K++ +A Y +N+F+D++
Sbjct: 22 LKAPNYFEEFVHRFNKNYSSEVEKLRRFKIFQHNLNEI--INKNQNDSAKYEINKFSDLS 79
Query: 133 DSEFNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
E + L Q +N + G E +++ KV V++Q +CG
Sbjct: 80 KDETIAKYTGLSLPTQTQNFCKVI---LLDQPPGKGPLE-FDWRRLNKV-TSVKNQGMCG 134
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
+CWA + + LES +AIKHNELI LS+Q
Sbjct: 135 ACWAFATLGSLESQFAIKHNELINLSEQ 162
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 70/138 (50%), Gaps = 7/138 (5%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ EY + Y +E ++RF IF++N++ I+ + GVN AD+T EF +
Sbjct: 41 WMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLGVNHLADLTVEEFKASRN 100
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
F T +F N + +I+++ KG V P ++DQ CGSCWA S +A
Sbjct: 101 GF------KRPHEFSTTTFKYENVTAIPAAIDWRTKGAVTP-IKDQGQCGSCWAFSTIAA 153
Query: 202 LESAYAIKHNELIELSKQ 219
E + I +L+ LS+Q
Sbjct: 154 TEGIHQITTGKLVSLSEQ 171
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 80/162 (49%), Gaps = 21/162 (12%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ E + Y+ E ERR IF+ NLK ID + T G+ RFAD+T+ E
Sbjct: 1 MYERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDEPK 60
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ + + Y + + L + I+++ KG V+P V+DQ CGSCWA S
Sbjct: 61 DFMKA-------------DRYLYKEGDI--LPDEIDWRAKGAVVP-VKDQGNCGSCWAFS 104
Query: 198 AVACLESAYAIKHNELIELSKQPPKTHGRFY-----KGGVMN 234
AV +E IK ELI LS Q R + +GGVMN
Sbjct: 105 AVGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMN 146
>gi|26245865|gb|AAN77408.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 173
Score = 79.0 bits (193), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 78/146 (53%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y S E RF IF+NNL+TI+ + K+E+G TY V +FADMT
Sbjct: 21 DQWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAKYEEGKVTYYMAVTQFADMTR 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF L L + NL +T + + L E I++ +KG VLP V++Q C SC
Sbjct: 81 DEFRKKLG-LQNNRRPNLNATLQVFP----EDLELPEQIDWTEKGAVLP-VKNQGNCRSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA S LE AI + LS+Q
Sbjct: 135 WAFSTTGSLEGQNAIHNKVKTPLSEQ 160
>gi|341879557|gb|EGT35492.1| hypothetical protein CAEBREN_11857 [Caenorhabditis brenneri]
Length = 340
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 82/162 (50%), Gaps = 20/162 (12%)
Query: 49 LILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLK 108
L+LQR Q T D + + N F+DF+ +Y R+Y S+ E+ +RF IF N
Sbjct: 31 LVLQRHQ--------IPTPDAK----YTNAFQDFLVKYMREYKSEEEMVKRFTIFSRNAD 78
Query: 109 TIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGL 168
++ Y K + G TY +N F+D+TD E+ L + + L++ + N L
Sbjct: 79 LVERYNKEDAGKVTYELNDFSDLTDEEWKQFLMK---PKPKKLQTPVKKPEIKVEN---L 132
Query: 169 AESINYK--DKGKVLPKVQDQHLCGSCWAHSAVACLESAYAI 208
ES++++ D + ++ Q CGSCWA + A +ESA +
Sbjct: 133 PESVDWRNFDGKNHVTGIKYQGPCGSCWAFATAAAIESAMSF 174
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/185 (30%), Positives = 95/185 (51%), Gaps = 17/185 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ + + Y++ E ERRF+IF++NL+ ID + + E T G+ RFAD+T+ E+
Sbjct: 62 YESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNR-ESRTYKVGLTRFADLTNEEYRA 120
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + L S ++ + ++ L + ++++ KG V V+DQ CGSCWA S+
Sbjct: 121 RFLGGRFSRKPRL-SAAKSGRYAAALGDDLPDDVDWRKKGAVA-TVKDQGQCGSCWAFSS 178
Query: 199 VACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVLNV 254
VA +E I ELI LS+Q K+ GG+M+ Y+ + N
Sbjct: 179 VAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMD----------YAFQFIIGNG 228
Query: 255 GYDNE 259
G D E
Sbjct: 229 GIDTE 233
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 72/138 (52%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ Y + Y E E+RF IF+ N+ I+ + +N+FAD+T+ EF ++
Sbjct: 42 WMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEF---IA 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + S T +F N + +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 99 PRNRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTP-IKDQGQCGCCWAFSAVAA 157
Query: 202 LESAYAIKHNELIELSKQ 219
E +A+ +LI LS+Q
Sbjct: 158 TEGIHALTSGKLISLSEQ 175
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 55/199 (27%), Positives = 99/199 (49%), Gaps = 38/199 (19%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N+F++++ +E++YD SE ++RF IF++N+ + + T G+N AD+T+ E+
Sbjct: 179 NEFENWIDRFEKKYDV-SEFKKRFSIFKSNMDFVHSWNSKNSQTV-LGLNHLADLTNLEY 236
Query: 137 NH---------GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
L + ++ NL+S F G + +++++ KG V P ++DQ
Sbjct: 237 RQFYLGTHKKAVLGTPGNHEVSNLQSVF-----------GDSATVDWRQKGAVSP-IKDQ 284
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQP----PKTHGRF-YKGGVMNLPHMLCSK 242
CGSCW+ S +E A+ IK ++ELS+Q + G GG+M+
Sbjct: 285 GQCGSCWSFSTTGSVEGAHQIKSGNMVELSEQNLVDCSTSEGNMGCNGGLMD-------- 336
Query: 243 GPYSLNHAVLNVGYDNEST 261
Y+ + + N G D ES+
Sbjct: 337 --YAFEYIITNNGIDTESS 353
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 73/140 (52%), Gaps = 4/140 (2%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+ ++ Y + Y E E+RF IF+ N+ I+ + +N+FAD+T+ EF
Sbjct: 58 EQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEF--- 114
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
++ + + S T +F N + +++++ KG V P ++DQ CG CWA SAV
Sbjct: 115 IAPRNRFKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTP-IKDQGQCGCCWAFSAV 173
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E +A+ +LI LS+Q
Sbjct: 174 AATEGIHALTSGKLISLSEQ 193
>gi|27462834|gb|AAO15606.1| cathepsin L-like protease [Sarcoptes scabiei type hominis]
Length = 245
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 86/153 (56%), Gaps = 17/153 (11%)
Query: 73 LDHGNQFKDFVREYERQYDSD-SEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFA 129
+DH Q+ F +Y RQ+ + E+ R+ RN + + K+E G +TY GVN+F
Sbjct: 29 IDH--QWTVFKAKYNRQFRTVYDELLRKLIFQRNYIYIRKHNEKYEAGLSTYELGVNQFT 86
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYS---FNSSNSYGLAESINYKDKGKVLPKVQD 186
D+T+ E+N +Q+ LK + S F++ + L + +++ K V P ++D
Sbjct: 87 DLTNKEYN--------DQMNRLKVKHDVQSEHVFDNEDVSDLPDEVDWTLKNVVAP-IKD 137
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA SAVA +ES A+K +L+ELS+Q
Sbjct: 138 QKQCGSCWAFSAVASMESQNALKTGQLVELSEQ 170
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 78.6 bits (192), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 43/134 (32%), Positives = 70/134 (52%), Gaps = 4/134 (2%)
Query: 86 YERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDW 145
Y + Y E E+RF IF+ N+ I+ + +N+FAD+T+ EF ++ +
Sbjct: 593 YGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQFADLTNEEF---IAPRNR 649
Query: 146 EQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESA 205
+ S T +F N + +++++ KG V P ++DQ CG CWA SAVA E
Sbjct: 650 FKGHMCSSIIRTTTFKYENVTAVPSTVDWRQKGAVTP-IKDQGQCGCCWAFSAVAATEGI 708
Query: 206 YAIKHNELIELSKQ 219
+A+ +LI LS+Q
Sbjct: 709 HALTSGKLISLSEQ 722
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 75/142 (52%), Gaps = 19/142 (13%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG-------LSSLDW 145
E RRFD+F+ N + I Y + QG ATY G+NRF+DMTD EFN +
Sbjct: 63 EKTRRFDLFKENARRI--YEHNHQGNATYTLGLNRFSDMTDEEFNRSPYGGCLTAPRMSD 120
Query: 146 EQIENLKSTF----ETYSFNSSNSYG---LAESINYKDKGKVLPKVQDQ-HLCGSCWAHS 197
++IE L + SFN ++ G L +G+ + +V+DQ CGSCWA S
Sbjct: 121 DEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWRGRAVTRVKDQGPTCGSCWAFS 180
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+A +E AI+ L+ LS+Q
Sbjct: 181 AIAAVEGINAIRTRNLVPLSEQ 202
>gi|17569349|ref|NP_509408.1| Protein R09F10.1 [Caenorhabditis elegans]
gi|351061560|emb|CCD69414.1| Protein R09F10.1 [Caenorhabditis elegans]
Length = 383
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/153 (33%), Positives = 82/153 (53%), Gaps = 12/153 (7%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTAT-YGVNRFA 129
E L H F DF+ +++R+Y S E E R+ IF N+ I++ + E+ VN F
Sbjct: 74 ENLKHEQMFNDFILKFDRKYTSVEEFEYRYQIFLRNV--IEFEAEEERNLGLDLDVNEFT 131
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSS---NSYGLAESINYKDKGKVLPKVQD 186
D TD E + ++ + K F+T F S SI+++++GK+ P +++
Sbjct: 132 DWTDEELQKMV-----QENKYTKYDFDTPKFEGSYLETGVIRPASIDWREQGKLTP-IKN 185
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA + VA +E+ AIK +L+ LS+Q
Sbjct: 186 QGQCGSCWAFATVASVEAQNAIKKGKLVSLSEQ 218
>gi|432091081|gb|ELK24293.1| Cathepsin F, partial [Myotis davidii]
Length = 410
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 80/165 (48%), Gaps = 6/165 (3%)
Query: 72 FLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADM 131
+L + FK F+ Y R Y+++ E + R +F NN+ ++GTA YGV +F+D+
Sbjct: 106 YLRMASLFKYFITTYNRTYETEEEAQWRMSVFINNMIRAQKIQALDRGTAQYGVTKFSDL 165
Query: 132 TDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
T+ EF L+ E L F + +++ KG V KV++Q +CG
Sbjct: 166 TEEEFRT--MYLNPLLKEELGKKMRLVKFVGDPA---PPEWDWRKKGAVT-KVKNQGMCG 219
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLP 236
SCWA S +E + +K +L+ LS+Q + K + LP
Sbjct: 220 SCWAFSVTGNVEGQWFLKRGDLLSLSEQELVDCDKVDKACMGGLP 264
>gi|58201366|gb|AAW66804.1| cysteine protease [Pinus taeda]
gi|58201368|gb|AAW66805.1| cysteine protease [Pinus taeda]
gi|58201392|gb|AAW66817.1| cysteine protease [Pinus taeda]
gi|58201394|gb|AAW66818.1| cysteine protease [Pinus taeda]
gi|58201398|gb|AAW66820.1| cysteine protease [Pinus taeda]
Length = 193
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 52/142 (36%), Positives = 83/142 (58%), Gaps = 11/142 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG 139
+V E+++ Y+ E ++RF +F++N Y +H QG +Y G+N+FAD++ EF
Sbjct: 45 WVAEHKKAYNGLDEKQKRFTVFKDNFL---YIHEHNQGNRSYKLGLNQFADLSHEEFKAT 101
Query: 140 L--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ LD ++ L+S Y + S+ L +SI++++KG V P V+DQ CGSCWA S
Sbjct: 102 YLGAKLDTKK-RLLRSPSPRYQY--SDGEDLPKSIDWREKGAVAP-VKDQGACGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 158 TVAAVEGINQIVTGDLISLSEQ 179
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 74/138 (53%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ Y + Y E E+RF IF+ N+ I+ + + + +N+FAD+T+ EF ++
Sbjct: 42 WMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLDINQFADLTNEEF---IA 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + S T +F N + +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 99 PRNRFKGHMCSSITRTTTFKYENVTVIPSTVDWRQKGAVTP-IKDQGQCGCCWAFSAVAA 157
Query: 202 LESAYAIKHNELIELSKQ 219
E +A+ +LI LS+Q
Sbjct: 158 TEGIHALNAGKLISLSEQ 175
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 76/128 (59%), Gaps = 4/128 (3%)
Query: 94 SEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKS 153
SE ++RF+IF++NLK ID + E T G+NRFAD+++ E+ + I + +
Sbjct: 70 SEKDKRFEIFKDNLKFIDEHNA-ENRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMMA 128
Query: 154 TFETYSFNSSNSYG--LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN 211
+T S + S G L +S++++ +G V+ +V+DQ CGSCWA S +A +E I
Sbjct: 129 RTKTRSNRYAPSVGDKLPKSVDWRSQGAVV-QVKDQGSCGSCWAFSTIAAVEGINKIVTG 187
Query: 212 ELIELSKQ 219
EL+ LS+Q
Sbjct: 188 ELVSLSEQ 195
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 75/151 (49%), Gaps = 13/151 (8%)
Query: 72 FLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADM 131
L+ + F F ++ + Y + E + RF +FR NL+ + K + +A +GV +F+D+
Sbjct: 37 MLNAEHHFTTFKTKFGKSYATQEEHDYRFGVFRANLRRAKLHAKLDP-SAEHGVTKFSDL 95
Query: 132 TDSEFNH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
T EF GL L N T L E+ +++DKG V P V++Q
Sbjct: 96 TPEEFKRQYLGLKPLRLPSTANKAPILPTSD--------LPENFDWRDKGAVTP-VKNQG 146
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S LE A+ + EL+ LS+Q
Sbjct: 147 SCGSCWAFSTTGALEGAHYLSTGELVSLSEQ 177
>gi|194352774|emb|CAQ00115.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 190
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 78/142 (54%), Gaps = 2/142 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
+K +V ++ + Y+ E E+RF+IF++NL +D + + T G+N+FAD+T+ E+
Sbjct: 49 YKSWVIQHGKAYNGIGEEEKRFEIFKDNLMFVDEHNSNNNTTYKLGLNKFADLTNQEYRA 108
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSY-GLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ L + S N+ + L +S+N++D G V +V+DQ GSCWA S
Sbjct: 109 KFLGTRTDPRRRLMKSKIPSSRNAHRAADNLPDSVNWRDHGAV-SRVKDQGSWGSCWAFS 167
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+A +E I E I LS+Q
Sbjct: 168 AIAAVEGINKIVSGEPISLSEQ 189
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 56/188 (29%), Positives = 100/188 (53%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ +++ E+ Y+ E ERRF+ FRNNL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 42 YAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRFADLTNEE 101
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ S+ + + + + + ++++ L ES++++ KG V V+DQ CGSCWA
Sbjct: 102 YR---STYLGARTKPDRERKLSARYQAADNDELPESVDWRKKGAV-GAVKDQGGCGSCWA 157
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
SA+A +E I ++I LS+Q ++ + GG+M+ Y+ +
Sbjct: 158 FSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMD----------YAFEFII 207
Query: 252 LNVGYDNE 259
N G D+E
Sbjct: 208 NNGGIDSE 215
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 78.6 bits (192), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 44/132 (33%), Positives = 75/132 (56%), Gaps = 3/132 (2%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R Y ++E + RF IF+NN+ ID + + T VN+FAD+T+ EF S +++
Sbjct: 64 RIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLEVNKFADLTNDEFR--ASRNGYKK 121
Query: 148 IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYA 207
+ S + F +N + + ++++ +G V P V+DQ CG CWA SAVA +E
Sbjct: 122 QPDSDSHVVSGLFRYANVSAVPDEVDWRKEGAVTP-VKDQGDCGCCWAFSAVAAMEGINK 180
Query: 208 IKHNELIELSKQ 219
+++ +L+ LS+Q
Sbjct: 181 LENGKLVSLSEQ 192
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 74/138 (53%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y D+E RRF++F+ N I+ + GVN+FAD+T+ EF L+
Sbjct: 40 WMAQYGRMYKDDAEKARRFEVFKANAAFIESFNAGNH-KFWLGVNQFADLTNDEFR--LT 96
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ I + + + + N L +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 97 KTNKGFIPSTTRVPTGFRYENVNIDALPATMDWRTKGVVTP-IKDQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 156 MEGIVKLSTGKLISLSEQ 173
>gi|58201360|gb|AAW66801.1| cysteine protease [Pinus taeda]
Length = 193
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 84/142 (59%), Gaps = 11/142 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG 139
++ E+++ Y+ E ++RF +F++N Y +H QG +Y G+N+FAD++ EF
Sbjct: 45 WLAEHKKAYNGLDEKQKRFTVFKDNFL---YIHEHNQGNRSYKLGLNQFADLSHEEFKAT 101
Query: 140 L--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ LD ++ L+S Y + S+ L +SI++++KG V+P V+DQ CGSCWA S
Sbjct: 102 YLGAKLDTKK-RLLRSPSPRYQY--SDGEDLPKSIDWREKGAVVP-VKDQGACGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 158 TVAAVEGINQIVTGDLISLSEQ 179
>gi|7638427|gb|AAF65468.1| cysteine protease falcipain-2 [Plasmodium falciparum]
Length = 484
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 83/153 (54%), Gaps = 8/153 (5%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
+H NQF F++ +QY+S +E++ RF +F N ++ + ++ +NRFAD+T
Sbjct: 160 EHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTY 219
Query: 134 SEF-NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE------SINYKDKGKVLPKVQD 186
EF N LS + ++N K + ++ E + +++ V P V+D
Sbjct: 220 HEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYRGEENFDHAAYDWRLHSGVTP-VKD 278
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA S++ +ES YAI+ N+LI LS+Q
Sbjct: 279 QKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 311
>gi|429328943|gb|AFZ80702.1| cysteine protease [Babesia equi]
Length = 441
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 81/157 (51%), Gaps = 19/157 (12%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSE 135
+F +F + Y R++ E RF FRNN + K + G +Y G+N+F+DMTD E
Sbjct: 122 EFDEFNKFYSREHADADERRVRFLAFRNNYNAV----KAQTGEESYEKGINKFSDMTDEE 177
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNS---------SNSYGLAESINYKD----KGKVLP 182
FN +L E+++ + F S + G+ +S++ +D K +
Sbjct: 178 FNLRFPALSVEELKKSLEVSASEEFTSPEHLDKVRIAKGLGVEDSVDGEDLDWRKLNGVT 237
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CGSCWA +AV +ES Y IK + ++LS+Q
Sbjct: 238 PVKDQGNCGSCWAFAAVGSVESLYLIKKGQALDLSEQ 274
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 49/164 (29%), Positives = 89/164 (54%), Gaps = 10/164 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+++++ +++ Y+ + ++RF +F++NL I + + T G+N+FADMT+ E+
Sbjct: 37 MYEEWLVRHQKGYNELGKKDKRFQVFKDNLGFIQEHNNNLNNTYKLGLNKFADMTNEEYR 96
Query: 138 H---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
G S ++ KST Y+F++ + L ++++ KG V P ++DQ CGSCW
Sbjct: 97 AMYLGTKSNAKRRLMKTKSTGHRYAFSARDR--LPVHVDWRMKGAVAP-IKDQGSCGSCW 153
Query: 195 AHSAVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMN 234
A S VA +E+ I + + LS+Q R Y GG+M+
Sbjct: 154 AFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNEGCNGGLMD 197
>gi|7542559|gb|AAF63497.1|AF239801_1 falcipain 2 [Plasmodium falciparum]
gi|9719446|gb|AAF97805.1|AF282975_1 falcipain 2 [Plasmodium falciparum]
gi|9719448|gb|AAF97806.1|AF282976_1 falcipain 2 [Plasmodium falciparum]
gi|9719450|gb|AAF97807.1|AF282977_1 falcipain 2 [Plasmodium falciparum]
Length = 484
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 83/153 (54%), Gaps = 8/153 (5%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
+H NQF F++ +QY+S +E++ RF +F N ++ + ++ +NRFAD+T
Sbjct: 160 EHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTY 219
Query: 134 SEF-NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE------SINYKDKGKVLPKVQD 186
EF N LS + ++N K + ++ E + +++ V P V+D
Sbjct: 220 HEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYRGEENFDHAAYDWRLHSGVTP-VKD 278
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA S++ +ES YAI+ N+LI LS+Q
Sbjct: 279 QKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 311
>gi|9719454|gb|AAF97809.1|AF282979_1 falcipain 2 [Plasmodium falciparum]
gi|12744515|gb|AAK06665.1|AF317909_1 falcipain-2 [Plasmodium falciparum]
Length = 484
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 83/153 (54%), Gaps = 8/153 (5%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
+H NQF F++ +QY+S +E++ RF +F N ++ + ++ +NRFAD+T
Sbjct: 160 EHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTY 219
Query: 134 SEF-NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE------SINYKDKGKVLPKVQD 186
EF N LS + ++N K + ++ E + +++ V P V+D
Sbjct: 220 HEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYRGEENFDHAAYDWRLHSGVTP-VKD 278
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA S++ +ES YAI+ N+LI LS+Q
Sbjct: 279 QKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 311
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 72/138 (52%), Gaps = 6/138 (4%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y D+E R+ IF+ N+ ID + + GVN+FAD+T+ EF +
Sbjct: 42 WMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKASRN 101
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + + + F N + +++++ +G V P V+DQ CG CWA SAVA
Sbjct: 102 -----RFKGHMCSPQAGPFRYENVSAVPSTVDWRKEGAVTP-VKDQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 156 MEGINKLTTGKLISLSEQ 173
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 74/143 (51%), Gaps = 14/143 (9%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF----- 136
++ +Y R Y D+E RRF++F+ N+ I+ + GVN+FAD+T+ EF
Sbjct: 40 WMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNH-NFWLGVNQFADLTNDEFRWTKT 98
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
N G I + + + + N L +++++ KG V P ++DQ CG CWA
Sbjct: 99 NKGF-------IPSTTRVPTGFRYENVNIDALPATVDWRTKGAVTP-IKDQGQCGCCWAF 150
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAVA +E + +LI LS+Q
Sbjct: 151 SAVAAMEGIVKLSTGKLISLSEQ 173
>gi|9719452|gb|AAF97808.1|AF282978_1 falcipain 2 [Plasmodium falciparum]
Length = 484
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 83/153 (54%), Gaps = 8/153 (5%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
+H NQF F++ +QY+S +E++ RF +F N ++ + ++ +NRFAD+T
Sbjct: 160 EHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTY 219
Query: 134 SEF-NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE------SINYKDKGKVLPKVQD 186
EF N LS + ++N K + ++ E + +++ V P V+D
Sbjct: 220 HEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYRGEENFDHAAYDWRLHSGVTP-VKD 278
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA S++ +ES YAI+ N+LI LS+Q
Sbjct: 279 QKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 311
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 74/143 (51%), Gaps = 14/143 (9%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF----- 136
++ +Y R Y D+E RRF++F+ N+ I+ + GVN+FAD+T+ EF
Sbjct: 40 WMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNH-NFWLGVNQFADLTNDEFRWMKT 98
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
N G I + + + + N L +++++ KG V P ++DQ CG CWA
Sbjct: 99 NKGF-------IPSTTRVPTGFRYENVNIDALPATVDWRTKGAVTP-IKDQGQCGCCWAF 150
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAVA +E + +LI LS+Q
Sbjct: 151 SAVAAMEGIVKLSTGKLISLSEQ 173
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 75/138 (54%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y D+E RRF++F+ N+ I+ + GVN+FAD+T+ EF +
Sbjct: 40 WMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNH-KFWLGVNQFADLTNDEFRSTKT 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + I + + + + N L +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 99 NKGF--IPSTTRVPTGFRYENVNIDALPATMDWRTKGVVTP-IKDQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 156 MEGIVKLSTGKLISLSEQ 173
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 81/146 (55%), Gaps = 12/146 (8%)
Query: 81 DFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSEFN-- 137
+++ ++ R Y E R+ +F+NN++ I++ G T VN+FAD+T+ EF
Sbjct: 40 EWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSM 99
Query: 138 ----HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
G+S+L + ++ + + + +S L S++++ KG V P +++Q CG C
Sbjct: 100 YTGFKGVSALSSQS----QTKMSPFRYQNVSSGALPVSVDWRKKGAVTP-IKNQGSCGCC 154
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SAVA +E A IK +LI LS+Q
Sbjct: 155 WAFSAVAAIEGATQIKKGKLISLSEQ 180
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 78.2 bits (191), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 101/188 (53%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ +++ E+ Y++ E ERRF+ FR+NL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 43 YAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFADLTNEE 102
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ S+ + + + + + ++++ L ES++++ KG V V+DQ CGSCWA
Sbjct: 103 YR---STYLGARTKPDRERKLSARYQAADNDELPESVDWRKKGAV-GAVKDQGGCGSCWA 158
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
SA+A +E I ++I LS+Q ++ + GG+M+ Y+ +
Sbjct: 159 FSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMD----------YAFEFII 208
Query: 252 LNVGYDNE 259
N G D+E
Sbjct: 209 NNGGIDSE 216
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 48/129 (37%), Positives = 71/129 (55%), Gaps = 7/129 (5%)
Query: 94 SEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQIENL 151
SE +RF++F+ N K I + K + A Y G+N+FADMT+ EF +
Sbjct: 54 SEKNKRFNVFKENAKFIHEFNKKD---APYKLGLNKFADMTNQEFRSTYAGSKIHHHRTQ 110
Query: 152 KSTFE-TYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKH 210
+ T T SF N + + S++++ +G V P V+DQ CGSCWA S +A +E IK
Sbjct: 111 RGTPRATGSFMYENVHSIPASVDWRTQGAVAP-VKDQGQCGSCWAFSTIASVEGINKIKT 169
Query: 211 NELIELSKQ 219
N+L+ LS Q
Sbjct: 170 NQLVPLSGQ 178
>gi|124803848|ref|XP_001347832.1| falcipain-2B [Plasmodium falciparum 3D7]
gi|23496084|gb|AAN35745.1| falcipain-2B [Plasmodium falciparum 3D7]
gi|62240121|gb|AAX77225.1| falcipain-2' [Plasmodium falciparum]
Length = 482
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 86/154 (55%), Gaps = 8/154 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
++H NQF F++ +QY+S +E++ RF +F N + + +++ +NRFAD+T
Sbjct: 157 VEHINQFYTFIKTNNKQYNSPNEMKERFQVFLQNAHKVKMHNNNKKSLYKKELNRFADLT 216
Query: 133 DSEFNHGLSSL-DWEQIENLKSTFETYSFNSS-NSYGLAESI-----NYKDKGKVLPKVQ 185
EF +L + ++N K + ++++ Y E+ +++ V P V+
Sbjct: 217 YHEFKSKYLTLRSSKPLKNSKYLLDQINYDAVIKKYKGNENFDHAAYDWRLHSGVTP-VK 275
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
DQ CGSCWA S++ +ES YAI+ N+LI LS+Q
Sbjct: 276 DQKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 309
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 72/138 (52%), Gaps = 6/138 (4%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y D+E R+ IF+ N+ ID + + GVN+FAD+T+ EF +
Sbjct: 8 WMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKASRN 67
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + + + F N + +++++ +G V P V+DQ CG CWA SAVA
Sbjct: 68 -----RFKGHMCSPQAGPFRYENVSAVPSTVDWRKEGAVTP-VKDQGQCGCCWAFSAVAA 121
Query: 202 LESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 122 MEGINKLTTGKLISLSEQ 139
>gi|1594287|gb|AAC48340.1| cathepsin L-like cysteine proteinase [Toxocara canis]
Length = 360
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 56/159 (35%), Positives = 79/159 (49%), Gaps = 7/159 (4%)
Query: 67 FDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTAT-YGV 125
+DL L ++F++F+R+Y++ YDS+ E RF I+ NN+ + + T YG
Sbjct: 38 YDLTRELRLLDRFEEFIRKYDKVYDSNEEFAERFRIYVNNMLEAQKLNQRNRDYGTIYGE 97
Query: 126 NRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD-----KGKV 180
N FAD +EF L D+ + KSTF SF LA D V
Sbjct: 98 NEFADWNVNEFREILLPKDFFKNLRKKSTF-IDSFIDPPETVLARREEIPDHFDWRPYNV 156
Query: 181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ V+ Q CGSCWA + V +ESAYA+ EL LS+Q
Sbjct: 157 VTPVKSQFKCGSCWAFATVGTVESAYALGTGELRSLSEQ 195
>gi|326526731|dbj|BAK00754.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 341
Score = 78.2 bits (191), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 73/142 (51%), Gaps = 6/142 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F++F + + Y S E R+ F +NL+ + ++ G +GV +F DMT +EF
Sbjct: 31 KFQEFTARFSKNYKSVEEYTTRYATFLDNLERVA--KLNQDGRGVFGVTKFMDMTPAEFK 88
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+++ K+ N+ G S++++ KG V P V+DQ CGSCWA S
Sbjct: 89 ATYLGFKPDEMAPPKAPVARPHRAKRNATG---SVDWRTKGAVTP-VKDQAQCGSCWAFS 144
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A +ES + + NELI LS Q
Sbjct: 145 ATEQIESNWFLAGNELISLSPQ 166
>gi|96979798|ref|YP_611001.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|37077647|sp|Q91CL9.1|CATV_NPVAP RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|16041073|dbj|BAB69773.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|94983331|gb|ABF50271.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
gi|146229694|gb|ABQ12259.1| cathepsin [Antheraea pernyi nucleopolyhedrovirus]
Length = 324
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/163 (33%), Positives = 91/163 (55%), Gaps = 12/163 (7%)
Query: 59 YGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQ 118
YG+ + +DL L + F++F+ ++ + Y S+SE RRF IF++NL+ I K++
Sbjct: 11 YGATLGAAYDL---LKAPSYFEEFLHKFNKNYSSESEKLRRFKIFQHNLEEI--INKNQN 65
Query: 119 GT-ATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYKD 176
T A Y +N+F+D++ E +S + K F E + G E +++
Sbjct: 66 DTSAQYEINKFSDLSKDE---TISKYTGLSLPLQKQNFCEVVVLDRPPDKGPLE-FDWRR 121
Query: 177 KGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
KV V++Q +CG+CWA + + LES +AIKH++LI LS+Q
Sbjct: 122 LNKV-TSVKNQGMCGACWAFATLGSLESQFAIKHDQLINLSEQ 163
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 75/138 (54%), Gaps = 1/138 (0%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +++ Y +E E RF IF+ N++ I+ + E GVN+F+D+T+ +F +
Sbjct: 45 WIAHHDKVYKDLNEKEMRFKIFKENVERIEAFNAGEDKGYKLGVNKFSDLTNEKFRVLHT 104
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + S+ F +N + +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 105 GYKRSHPKVMSSSKPKTHFRYANVTDIPPTMDWRKKGAVTP-IKDQKECGCCWAFSAVAA 163
Query: 202 LESAYAIKHNELIELSKQ 219
E + +K +LI LS+Q
Sbjct: 164 TEGLHQLKTGKLIPLSEQ 181
>gi|124803863|ref|XP_001347836.1| falcipain-2A [Plasmodium falciparum 3D7]
gi|23496088|gb|AAN35749.1| falcipain-2A [Plasmodium falciparum 3D7]
Length = 484
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 50/153 (32%), Positives = 83/153 (54%), Gaps = 8/153 (5%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
+H NQF F++ +QY+S +E++ RF +F N ++ + ++ +NRFAD+T
Sbjct: 160 EHINQFYMFIKTNNKQYNSPNEMKERFQVFLQNAHKVNMHNNNKNSLYKKELNRFADLTY 219
Query: 134 SEF-NHGLSSLDWEQIENLKSTFETYSFN------SSNSYGLAESINYKDKGKVLPKVQD 186
EF N LS + ++N K + ++ N + +++ V P V+D
Sbjct: 220 HEFKNKYLSLRSSKPLKNSKYLLDQMNYEEVIKKYKGNENFDHAAYDWRLHSGVTP-VKD 278
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA S++ +ES YAI+ N+LI LS+Q
Sbjct: 279 QKNCGSCWAFSSIGSVESQYAIRKNKLITLSEQ 311
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/138 (33%), Positives = 69/138 (50%), Gaps = 6/138 (4%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ Y R Y + +E RR IF+ NLK I + K GVN FAD+T+ EF +
Sbjct: 42 WMARYGRVYKTANEKNRRSTIFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTTSRN 101
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ T F N + +++++ KG V P +++Q CG CWA SAVA
Sbjct: 102 KFKSHVCATV-----TNVFRYENVTAVPATMDWRKKGAVTP-IKNQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELIELSKQ 219
+E +K +LI LS+Q
Sbjct: 156 MEGITQLKTGKLISLSEQ 173
>gi|47606558|gb|AAT36263.1| vivapain-2 [Plasmodium vivax]
Length = 487
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 85/157 (54%), Gaps = 13/157 (8%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F FV+EY R+Y ++ E+++R+ F NL+ I + E G+N+F D++
Sbjct: 161 LESVNSFYLFVKEYGRKYKTEEEMQQRYLAFVENLEKIKAHNSRENVLYRKGMNQFGDLS 220
Query: 133 DSEFNHG---LSSLDWE-------QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLP 182
EF L S D++ +I N + + Y + ++ A S +++ V P
Sbjct: 221 FGEFKKKYLTLKSFDFKTFGGKLKRITNYEDVIDKYKPKDA-TFDHA-SYDWRLHKGVTP 278
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CGSCWA S V +ES YAI+ N+L+ +S+Q
Sbjct: 279 -VKDQANCGSCWAFSTVGVVESQYAIRKNQLVSISEQ 314
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 77/138 (55%), Gaps = 3/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y + Y +E E+RF IF+NN++ I+ + +N+FAD+ + EF L
Sbjct: 40 WMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASL- 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
++ ++ E+ T SF + + +++++ +G V P ++DQ CGSCWA S VA
Sbjct: 99 -INVQKKESGVETATETSFRYESITKIPVTMDWRKRGAVTP-IKDQGNCGSCWAFSTVAA 156
Query: 202 LESAYAIKHNELIELSKQ 219
+E + I +L+ LS+Q
Sbjct: 157 IEGIHQITTGKLVSLSEQ 174
>gi|31322338|gb|AAP20039.1| vivapain 2 [Plasmodium vivax]
Length = 486
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 85/157 (54%), Gaps = 13/157 (8%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F FV+EY R+Y ++ E+++R+ F NL+ I + E G+N+F D++
Sbjct: 160 LESVNSFYLFVKEYGRKYKTEEEMQQRYLAFVENLEKIKAHNSRENVLYRKGMNQFGDLS 219
Query: 133 DSEFNHG---LSSLDWE-------QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLP 182
EF L S D++ +I N + + Y + ++ A S +++ V P
Sbjct: 220 FGEFKKKYLTLKSFDFKTFGGKLKRITNYEDVIDKYKPKDA-TFDHA-SYDWRLHKGVTP 277
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CGSCWA S V +ES YAI+ N+L+ +S+Q
Sbjct: 278 -VKDQANCGSCWAFSTVGVVESQYAIRKNQLVSISEQ 313
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 75/143 (52%), Gaps = 13/143 (9%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF----- 136
++ +Y R Y +E +RF++F+ N+K I+ + GVN+FAD+T+ EF
Sbjct: 40 WMAQYNRVYKDATEKAQRFEVFKANVKFIESFNAGGNRKFWLGVNQFADLTNDEFRATKT 99
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
N G + + +E S ++ L SI+++ KG V P ++DQ CG CWA
Sbjct: 100 NKGFKPSPVKVPTGFR--YENVSVDA-----LPASIDWRTKGAVTP-IKDQGQCGCCWAF 151
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAVA E I ++LI LS+Q
Sbjct: 152 SAVAATEGIVKISTDKLISLSEQ 174
>gi|47606538|gb|AAT36253.1| vivapain-2 [Plasmodium vivax]
gi|47606540|gb|AAT36254.1| vivapain-2 [Plasmodium vivax]
gi|47606542|gb|AAT36255.1| vivapain-2 [Plasmodium vivax]
Length = 487
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 51/157 (32%), Positives = 83/157 (52%), Gaps = 13/157 (8%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F FV+EY R+Y ++ E+++R+ F NL+ I + E G+N+F D++
Sbjct: 161 LESVNSFYLFVKEYGRKYKTEEEMQQRYLAFVENLEKIKAHNSRENVLYRKGMNQFGDLS 220
Query: 133 DSEFNHG---LSSLDWE-------QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLP 182
EF L S D++ +I N + + Y ++ S +++ V P
Sbjct: 221 FGEFKKKYLTLKSFDFKTFGGKLKRITNYEDVIDKY--KPKDATFDHASYDWRLHKGVTP 278
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CGSCWA S V +ES YAI+ N+L+ +S+Q
Sbjct: 279 -VKDQANCGSCWAFSTVGVVESQYAIRKNQLVSISEQ 314
>gi|156098484|ref|XP_001615274.1| vivapain-2 [Plasmodium vivax Sal-1]
gi|47606518|gb|AAT36243.1| vivapain-2 [Plasmodium vivax]
gi|47606520|gb|AAT36244.1| vivapain-2 [Plasmodium vivax]
gi|47606522|gb|AAT36245.1| vivapain-2 [Plasmodium vivax]
gi|47606524|gb|AAT36246.1| vivapain-2 [Plasmodium vivax]
gi|47606526|gb|AAT36247.1| vivapain-2 [Plasmodium vivax]
gi|47606528|gb|AAT36248.1| vivapain-2 [Plasmodium vivax]
gi|47606530|gb|AAT36249.1| vivapain-2 [Plasmodium vivax]
gi|47606532|gb|AAT36250.1| vivapain-2 [Plasmodium vivax]
gi|47606534|gb|AAT36251.1| vivapain-2 [Plasmodium vivax]
gi|47606536|gb|AAT36252.1| vivapain-2 [Plasmodium vivax]
gi|47606544|gb|AAT36256.1| vivapain-2 [Plasmodium vivax]
gi|47606546|gb|AAT36257.1| vivapain-2 [Plasmodium vivax]
gi|47606548|gb|AAT36258.1| vivapain-2 [Plasmodium vivax]
gi|47606550|gb|AAT36259.1| vivapain-2 [Plasmodium vivax]
gi|47606552|gb|AAT36260.1| vivapain-2 [Plasmodium vivax]
gi|47606554|gb|AAT36261.1| vivapain-2 [Plasmodium vivax]
gi|47606556|gb|AAT36262.1| vivapain-2 [Plasmodium vivax]
gi|148804148|gb|EDL45547.1| vivapain-2 [Plasmodium vivax]
Length = 487
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 85/157 (54%), Gaps = 13/157 (8%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F FV+EY R+Y ++ E+++R+ F NL+ I + E G+N+F D++
Sbjct: 161 LESVNSFYLFVKEYGRKYKTEEEMQQRYLAFVENLEKIKAHNSRENVLYRKGMNQFGDLS 220
Query: 133 DSEFNHG---LSSLDWE-------QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLP 182
EF L S D++ +I N + + Y + ++ A S +++ V P
Sbjct: 221 FGEFKKKYLTLKSFDFKTFGGKLKRITNYEDVIDKYKPKDA-TFDHA-SYDWRLHKGVTP 278
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CGSCWA S V +ES YAI+ N+L+ +S+Q
Sbjct: 279 -VKDQANCGSCWAFSTVGVVESQYAIRKNQLVSISEQ 314
>gi|395544492|ref|XP_003774144.1| PREDICTED: cathepsin F [Sarcophilus harrisii]
Length = 451
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 76/147 (51%), Gaps = 18/147 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
FKDF+ Y + Y + +E +RR IF NL+ + ++G+A YGV +F+D+T+ EF
Sbjct: 154 FKDFLTTYNKSYANATETQRRLGIFARNLELARKVQELDRGSAEYGVTKFSDLTEEEFRT 213
Query: 137 ---NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE-SINYKDKGKVLPKVQDQHLCGS 192
N LSSL + +T G A S +++D G V V++Q CGS
Sbjct: 214 SYLNPLLSSLPGRALRPGPAT-----------RGPAPASWDWRDHGAVT-GVKNQGACGS 261
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA S +E + ++ L+ LS+Q
Sbjct: 262 CWAFSVTGNVEGQWFLRRGALLALSEQ 288
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 82/145 (56%), Gaps = 7/145 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
++ ++ ++ R Y++ E ERRF+IF++N+ ID + + G ++ G+NRFADMT+ E
Sbjct: 50 YEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLGLNRFADMTNEE 109
Query: 136 FNHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ L + + + Y +N+ L ES++++ KG V V+DQ CGSCW
Sbjct: 110 YRAVYLGTRPAGHRRRARVGSDRYRYNAGED--LPESVDWRAKGAV-AAVKDQGSCGSCW 166
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A S VA +E I +LI LS+Q
Sbjct: 167 AFSTVAAVEGINKIVTGDLISLSEQ 191
>gi|196014793|ref|XP_002117255.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
gi|190580220|gb|EDV20305.1| hypothetical protein TRIADDRAFT_61245 [Trichoplax adhaerens]
Length = 353
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 43/156 (27%), Positives = 73/156 (46%), Gaps = 9/156 (5%)
Query: 67 FDLEEFLDHGNQFKD---FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY 123
F + H FK+ F++EY + Y++ E+ R+ +F N+ + KH+ T Y
Sbjct: 40 FSQDTATHHDPMFKNYLQFIKEYNKSYNNIQELNYRYQVFTKNMARAMLFQKHDNATGRY 99
Query: 124 GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPK 183
G + +D+TD E + W Q + T N L +S +++ KG V
Sbjct: 100 GFTKLSDLTDQEVKSFYAMKKWPQ-----QLYPTKKANIPQLNSLPQSFDWRSKGAVTA- 153
Query: 184 VQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CG+CWA + +E + + +L LS+Q
Sbjct: 154 VKDQKRCGACWAFATTGNIEGQWYLNKGKLYSLSEQ 189
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 54/186 (29%), Positives = 95/186 (51%), Gaps = 12/186 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N+++ ++ E+ R Y++ E E+RF+IF++NL+ I+ + T G+N+FAD+T+ E+
Sbjct: 48 NRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFADLTNEEY 107
Query: 137 NHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
L + + +KS + + S + + S++++ +G V P +++Q CGSCWA
Sbjct: 108 RTMYLGTKSDARRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAP-IKNQGSCGSCWA 166
Query: 196 HSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLNHAVLN 253
S VA + I E+I LS+Q R G C+ G Y+ + N
Sbjct: 167 FSTVAAVGGINQIVTGEMITLSEQELVDCDRVQNSG--------CNGGLMDYAFEFIISN 218
Query: 254 VGYDNE 259
G D E
Sbjct: 219 GGMDTE 224
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 98/190 (51%), Gaps = 22/190 (11%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSE 135
+++++ ++ + Y+ E ++RF+IF++NLK ID +H +TY G+ RFAD+T+ E
Sbjct: 54 MYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFID---EHNGLNSTYRLGLTRFADLTNEE 110
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYG--LAESINYKDKGKVLPKVQDQHLCGSC 193
+ + +K + S + G L ES++++ +G V+ V+DQ CGSC
Sbjct: 111 YRSKFLGTKIDPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVV-GVKDQASCGSC 169
Query: 194 WAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNH 249
WA SA+A +E I +LI LS+Q ++ GG+M+ Y+
Sbjct: 170 WAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFEF 219
Query: 250 AVLNVGYDNE 259
+ N G D+E
Sbjct: 220 IISNGGIDSE 229
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 77.8 bits (190), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 52/160 (32%), Positives = 85/160 (53%), Gaps = 13/160 (8%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF---NH 138
++ + R Y + E + R D+F+ NLK I+ + K + GVN FAD T+ EF +
Sbjct: 42 WMARFSRVYRDELEKQMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHT 101
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
GL L + ++ T + S+N S+ G+++ +++ +G V P V+ Q CG CWA SA
Sbjct: 102 GLKGLSSKVVDE---TISSRSWNISDMVGVSK--DWRAEGAVTP-VKYQGQCGCCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMN 234
VA +E I L+ LS+Q + + R GG+M+
Sbjct: 156 VAAVEGVTKIAGGNLVSLSEQQLLDCDREYDRGCDGGIMS 195
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 77/142 (54%), Gaps = 4/142 (2%)
Query: 81 DFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSEFNHG 139
+++ ++ R Y E R+ +F+ N++ I+ G T VN+FAD+T+ EF
Sbjct: 41 EWMAKHGRVYADMKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSM 100
Query: 140 LSSLDWEQIENLKSTFETYSFNSSN--SYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ + + +S +T SF N S L S++++ KG V P +++Q CG CWA S
Sbjct: 101 YTGYKGGSVLSSQSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTP-IKNQGTCGCCWAFS 159
Query: 198 AVACLESAYAIKHNELIELSKQ 219
AVA +E A IK +LI LS+Q
Sbjct: 160 AVAAIEGATKIKKGKLISLSEQ 181
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 78/142 (54%), Gaps = 7/142 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ +Y R+YD+ E RF I+ +N++ I+Y + N+FAD+T+ EFN
Sbjct: 45 RYDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQNL-SFKLTDNKFADLTNDEFN 103
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
S QI + K S NS L +++++++ G V P ++DQ CGSCWA S
Sbjct: 104 ---SIYLGYQIRSYKR--RNLSHMHENSTDLPDAVDWRENGAVTP-IKDQGQCGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
AVA +E IK L+ LS+Q
Sbjct: 158 AVAAVEGINKIKTGNLVSLSEQ 179
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 79/152 (51%), Gaps = 12/152 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F+ ++ + R Y E +RR +++R N+ ++ + G N+FAD+T+ EF
Sbjct: 31 RFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLTNEEFR 90
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYG----------LAESINYKDKGKVLPKVQDQ 187
+ + ++T T + + G L +S+++++KG V P V++Q
Sbjct: 91 AKMLGFG-RPPPHGRATGHTTTPGTVACIGSGLGRRYSDELPKSVDWREKGAVAP-VKNQ 148
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAVA +E IK+ +L+ LS+Q
Sbjct: 149 GECGSCWAFSAVAAIEGINQIKNGKLVSLSEQ 180
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 77/138 (55%), Gaps = 3/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y + Y +E E+RF IF+NN++ I+ + +N+FAD+ + EF L
Sbjct: 40 WMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASL- 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
++ ++ E+ T SF + + +++++ +G V P ++DQ CGSCWA S VA
Sbjct: 99 -INVQKKESGVETATETSFRYESITKIPVTMDWRKRGAVTP-IKDQGNCGSCWAFSIVAA 156
Query: 202 LESAYAIKHNELIELSKQ 219
+E + I +L+ LS+Q
Sbjct: 157 IEGIHQITTGKLVSLSEQ 174
>gi|224105327|ref|XP_002313770.1| predicted protein [Populus trichocarpa]
gi|222850178|gb|EEE87725.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 77/144 (53%), Gaps = 13/144 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F R++++ Y S E + RF +F++NL+ + K + TA++GV +F+D+T +EF
Sbjct: 53 FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDP-TASHGVTQFSDLTSAEFRK 111
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
GL L + N T L E ++++KG V P V++Q CGSCW+
Sbjct: 112 QVLGLRKLRLPKDANTAPILPTND--------LPEDFDWREKGAVGP-VKNQGSCGSCWS 162
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S LE A+ + EL+ LS+Q
Sbjct: 163 FSTTGALEGAHFLATGELVSLSEQ 186
>gi|221056028|ref|XP_002259152.1| P.knowlesi ortholog of falcipain [Plasmodium knowlesi strain H]
gi|193809223|emb|CAQ39925.1| P.knowlesi ortholog of falcipain [Plasmodium knowlesi strain H]
Length = 495
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 86/157 (54%), Gaps = 13/157 (8%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L++ N F F++E+ ++Y + E++ R+ F NL I+ + E + G+NRF DM+
Sbjct: 167 LENVNSFYLFIKEHGKKYQTPDEMQHRYLSFVENLAKINAHNNKENVSYKKGMNRFGDMS 226
Query: 133 DSEFNHG---LSSLDWEQIENLKST-FETYSFNSSNSYGLAESI------NYKDKGKVLP 182
EF L + D++ LKST F +Y + + Y + ++++ V P
Sbjct: 227 FEEFEKKYLTLKTFDFKS-NGLKSTRFISYD-DVIHKYKPKDGTFDYLKHDWRELNAVTP 284
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CG+CWA S V +ES YAI+ NEL+ LS+Q
Sbjct: 285 -VKDQKNCGACWAFSTVGVVESQYAIRKNELVSLSEQ 320
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 79/143 (55%), Gaps = 8/143 (5%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--- 136
+ ++ +YER Y + SE+E+R IF+ NL+ I+ + + G+NR++D+T EF
Sbjct: 34 QQWMMKYERTYTNSSEMEKRKKIFKENLEYIENFNNVGNKSYKLGLNRYSDLTSEEFIAS 93
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ G D ++S ++ N + + ++++KG V+ V++Q CG CWA
Sbjct: 94 HTGFKVSDQLSDSKMRSVAIPFNLNDD----VPTNFDWREKG-VVTDVKNQRQCGCCWAF 148
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
+AVA +E IK+ LI LS+Q
Sbjct: 149 TAVAAVEGIVKIKNGNLISLSEQ 171
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 74/144 (51%), Gaps = 10/144 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG---TATYGVNRFADMTDSE 135
F DF +E+QY+S E RRF IF +NL I + T T GVN+FAD+T+ E
Sbjct: 20 FDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEE 79
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ L E L + + N A S++++ KG V P +++Q CGSCW+
Sbjct: 80 YRQ--LYLRPYPTELLGRERQEVWLDGPN----AGSVDWRQKGAVTP-IKNQGQCGSCWS 132
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E A+AI L+ LS+Q
Sbjct: 133 FSTTGSVEGAHAIATGNLVSLSEQ 156
>gi|118481169|gb|ABK92536.1| unknown [Populus trichocarpa]
Length = 368
Score = 77.8 bits (190), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 77/144 (53%), Gaps = 13/144 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F R++++ Y S E + RF +F++NL+ + K + TA++GV +F+D+T +EF
Sbjct: 53 FSLFKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDP-TASHGVTQFSDLTSAEFRK 111
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
GL L + N T L E ++++KG V P V++Q CGSCW+
Sbjct: 112 QVLGLRKLRLPKDANTAPILPTND--------LPEDFDWREKGAVGP-VKNQGSCGSCWS 162
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S LE A+ + EL+ LS+Q
Sbjct: 163 FSTTGALEGAHFLATGELVSLSEQ 186
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 49/145 (33%), Positives = 83/145 (57%), Gaps = 7/145 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
++ ++ ++ R ++ E ERRF+IF++N++ ID + + G ++ G+NRFADMT+ E
Sbjct: 50 YEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRFADMTNEE 109
Query: 136 FNHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ L + + + Y +N+ L ES++++DKG V V+DQ CGSCW
Sbjct: 110 YRTVYLGTRPASHRRRARLGSDRYRYNAGEE--LPESVDWRDKGAVT-TVKDQGSCGSCW 166
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A S +A +E I +LI LS+Q
Sbjct: 167 AFSTIAAVEGINKIVTGDLISLSEQ 191
>gi|17978639|gb|AAL48318.1| berghepain-2 [Plasmodium berghei]
Length = 468
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/155 (34%), Positives = 81/155 (52%), Gaps = 12/155 (7%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F +F++EY +QY+S EI+ RF IF NLK I+ + K E T G+N F+DM
Sbjct: 146 LESVNIFYNFMKEYNKQYNSAEEIQERFYIFSENLKKIEKHNK-ENHLYTKGINAFSDMR 204
Query: 133 DSEF-----NHGLS---SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
EF N+ L S+D + + T + S S +++D V+ V
Sbjct: 205 HEEFKMKYLNNKLKENHSIDLRHL--IPYTTAISKYKSPTDKVNYTSFDWRDYN-VIIGV 261
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+DQ C SCWA + + + YAI+ N+ + LS+Q
Sbjct: 262 KDQQKCASCWAFATAGVVAAQYAIRKNQKVSLSEQ 296
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 80/138 (57%), Gaps = 6/138 (4%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +++E +RF+IF+ N++ I+ + K G+N FAD+T+ EF +
Sbjct: 40 WMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADLTNQEFK---A 96
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
S + ++ + S+ + + + +S + +++++ KG V P V+DQ CG CWA SAVA
Sbjct: 97 SRNGYKLPHDCSSNTPFRYENVSS--VPTTVDWRTKGAVTP-VKDQGQCGCCWAFSAVAA 153
Query: 202 LESAYAIKHNELIELSKQ 219
+E + LI LS+Q
Sbjct: 154 MEGITKLSTGNLISLSEQ 171
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 98/190 (51%), Gaps = 22/190 (11%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSE 135
+++++ ++ + Y+ E ++RF+IF++NLK ID +H +TY G+ RFAD+T+ E
Sbjct: 54 MYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFID---EHNGLNSTYRLGLTRFADLTNEE 110
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYG--LAESINYKDKGKVLPKVQDQHLCGSC 193
+ + +K + S + G L ES++++ +G V+ V+DQ CGSC
Sbjct: 111 YRSKFLGTKIDPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVV-GVKDQASCGSC 169
Query: 194 WAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNH 249
WA SA+A +E I +LI LS+Q ++ GG+M+ Y+
Sbjct: 170 WAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFEF 219
Query: 250 AVLNVGYDNE 259
+ N G D+E
Sbjct: 220 IISNGGIDSE 229
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 94/188 (50%), Gaps = 20/188 (10%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSE 135
++ ++ ++ + Y++ E E+RF IF++NL+ ID +H + TY G+NRFAD+T+ E
Sbjct: 45 MYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFID---EHNAESRTYKVGLNRFADLTNDE 101
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ L + + + L +S+++++KG V+ V+DQ CGSCWA
Sbjct: 102 YRSMYLGARTGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVV-GVKDQGSCGSCWA 160
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
S +A +E I +LI LS+Q ++ GG+M+ Y+ +
Sbjct: 161 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFEFII 210
Query: 252 LNVGYDNE 259
N G D E
Sbjct: 211 KNGGIDTE 218
>gi|68076993|ref|XP_680416.1| falcipain 2 precursor [Plasmodium berghei strain ANKA]
gi|56501341|emb|CAI05700.1| falcipain 2 precursor, putative [Plasmodium berghei]
Length = 470
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/155 (34%), Positives = 81/155 (52%), Gaps = 12/155 (7%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F +F++EY +QY+S EI+ RF IF NLK I+ + K E T G+N F+DM
Sbjct: 148 LESVNIFYNFMKEYNKQYNSAEEIQERFYIFSENLKKIEKHNK-ENHLYTKGINAFSDMR 206
Query: 133 DSEF-----NHGLS---SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
EF N+ L S+D + + T + S S +++D V+ V
Sbjct: 207 HEEFKMKYLNNKLKENHSIDLRHL--IPYTTAISKYKSPTDKVNYTSFDWRDYN-VIIGV 263
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+DQ C SCWA + + + YAI+ N+ + LS+Q
Sbjct: 264 KDQQKCASCWAFATAGVVAAQYAIRKNQKVSLSEQ 298
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 79/142 (55%), Gaps = 6/142 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ + ++ + Y SD E RR IF + L I+ + T T G+N+F+D+T++EF
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ + ++ + + SS L S++++ +G V P ++DQ CGSCWA S
Sbjct: 62 NYVGKFKPPRYQDRRPAKDVDVDVSS----LPTSLDWRQEGAVTP-IKDQGQCGSCWAFS 116
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+A +ESA+ + EL+ LS+Q
Sbjct: 117 AIASIESAHFLATKELVSLSEQ 138
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 77.4 bits (189), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 79/142 (55%), Gaps = 6/142 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ + ++ + Y SD E RR IF + L I+ + T T G+N+F+D+T++EF
Sbjct: 2 FEGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRA 61
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ + ++ + + SS L S++++ +G V P ++DQ CGSCWA S
Sbjct: 62 NYVGKFKPPRYQDRRPAKDVDVDVSS----LPTSLDWRQEGAVTP-IKDQGQCGSCWAFS 116
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+A +ESA+ + EL+ LS+Q
Sbjct: 117 AIASIESAHFLATKELVSLSEQ 138
>gi|198435380|ref|XP_002128293.1| PREDICTED: similar to cathepsin H [Ciona intestinalis]
Length = 438
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 52/149 (34%), Positives = 76/149 (51%), Gaps = 9/149 (6%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
++ N FK + E+ +QY + E E+RF IF +LKTI + T G+N F+D T
Sbjct: 129 VEERNLFKGWQIEHGKQYINQEEAEKRFQIFSKSLKTIKEFNNRVDRTWEMGLNEFSDRT 188
Query: 133 DSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
EF +S+ +N +T + S + IN +KG + V++Q CGS
Sbjct: 189 FEEF----ASIRLMMPQNCSATKGNH---VSLGFEPPAQINCLEKGNFVTAVKNQGSCGS 241
Query: 193 CWAHSAVACLESAYAI--KHNELIELSKQ 219
CW S CLESA AI + N L+ LS+Q
Sbjct: 242 CWTFSTTGCLESATAIHKEGNPLVSLSEQ 270
>gi|42564149|gb|AAS20588.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 322
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 78/146 (53%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y S E RF IF+NNL+TI+ + K+E+G TY V +FADMT
Sbjct: 21 DQWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAKYEEGKVTYYMAVTQFADMTR 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF L L + NL +T + + L E I++ +KG VLP V++Q C SC
Sbjct: 81 DEFRKKLG-LQNNRRPNLNATLQVFP----EDLELPEQIDWTEKGAVLP-VKNQGNCRSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA S LE AI + LS+Q
Sbjct: 135 WAFSTTGSLEGQNAIHNKVKTPLSEQ 160
>gi|58201356|gb|AAW66799.1| cysteine protease [Pinus taeda]
gi|58201376|gb|AAW66809.1| cysteine protease [Pinus taeda]
gi|58201388|gb|AAW66815.1| cysteine protease [Pinus taeda]
gi|58201400|gb|AAW66821.1| cysteine protease [Pinus taeda]
gi|58201406|gb|AAW66824.1| cysteine protease [Pinus taeda]
gi|167345244|gb|ABZ69062.1| cysteine protease [Pinus taeda]
Length = 193
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 83/142 (58%), Gaps = 11/142 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG 139
++ E+++ Y+ E ++RF +F++N Y +H QG +Y G+N+FAD++ EF
Sbjct: 45 WLAEHKKAYNGLDEKQKRFTVFKDNFL---YIHEHNQGNQSYKLGLNQFADLSHEEFKAT 101
Query: 140 L--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ LD ++ L+S Y + S+ L +SI++++KG V P V+DQ CGSCWA S
Sbjct: 102 YLGAKLDTKK-RLLRSPSPRYQY--SDGEDLPKSIDWREKGAVAP-VKDQGACGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 158 TVAAVEGINQIVTGDLISLSEQ 179
>gi|301607871|ref|XP_002933519.1| PREDICTED: cathepsin O-like [Xenopus (Silurana) tropicalis]
Length = 370
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 55/152 (36%), Positives = 82/152 (53%), Gaps = 13/152 (8%)
Query: 75 HGNQFKDFVREYERQYDSDSEI-ERRFDIFRNNLKTIDYYTKHEQGT-----ATYGVNRF 128
H N F DF+++Y R Y S++ + R+ IF + + +Y T A YG+N+F
Sbjct: 62 HSNAFLDFIQKYGRGYKDGSQVFQERYQIFLKSTERQNYLNAIALPTNLTSAAHYGINQF 121
Query: 129 ADMTDSEFNHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
+D++ EF + L S + K F+ NS+ Y L +++DK V P V++Q
Sbjct: 122 SDLSAEEFFYTYLRSFPTGNYTSNKP-FK----NSAQQYFLPLRFDWRDKKLVTP-VKNQ 175
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CG+CWA S V +ESAYAIK + L ELS Q
Sbjct: 176 LSCGACWAFSVVGAVESAYAIKWHTLEELSVQ 207
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 83/144 (57%), Gaps = 11/144 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F ++ + + Y++ E RF+IF++NL ID T + + G+N FAD+++ EFN
Sbjct: 48 FNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDE-TNKKNNSYWLGLNEFADLSNDEFNE 106
Query: 139 G-LSSLDWEQIENLKSTFETYS--FNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ SL IE ++Y F + ++ L E+++++ KG V P V+ Q CGSCWA
Sbjct: 107 KYVGSLIDATIE------QSYDEEFINEDTVNLPENVDWRKKGAVTP-VRHQGSCGSCWA 159
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
SAVA +E I+ +L+ELS+Q
Sbjct: 160 FSAVATVEGINKIRTGKLVELSEQ 183
>gi|47606560|gb|AAT36264.1| vivapain-2 [Plasmodium vivax]
Length = 487
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 52/157 (33%), Positives = 85/157 (54%), Gaps = 13/157 (8%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F FV+EY R+Y ++ E+++R+ F NL+ I + E G+N+F D++
Sbjct: 161 LESVNSFYLFVKEYGRKYKTEEEMQQRYLAFVENLEKIKAHNSRENVLYRKGMNQFGDLS 220
Query: 133 DSEFNHG---LSSLDWE-------QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLP 182
EF L S D++ +I N + + Y + ++ A S +++ V P
Sbjct: 221 FGEFKKKYLTLKSFDFKTFGGKLKRITNYEDLIDKYKPKDA-TFDHA-SYDWRLHKGVTP 278
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CGSCWA S V +ES YAI+ N+L+ +S+Q
Sbjct: 279 -VKDQANCGSCWAFSTVGVVESQYAIRKNQLVSISEQ 314
>gi|209978824|ref|YP_002300567.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
gi|192758806|gb|ACF05341.1| cathepsin [Adoxophyes orana nucleopolyhedrovirus]
Length = 337
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 75/150 (50%), Gaps = 14/150 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F+ Y +QY RF IF NL+ I+ K +A Y +N+F+D++ +E
Sbjct: 32 FETFIVNYNKQYADTKTKNYRFKIFVQNLEYINEKNKLND-SAIYNINKFSDLSKNELLT 90
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK---------GKVLPKVQDQHL 189
+ L + N+ + + N N L + +D+ + V+DQ
Sbjct: 91 KYTGLTSRKPSNMVKS----TSNFCNVIHLDAPPDARDELPQNFDWRVNNKMTSVKDQGA 146
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWAH+AV LE+ YAIKHN LI LS+Q
Sbjct: 147 CGSCWAHAAVGTLETLYAIKHNYLINLSEQ 176
>gi|58201346|gb|AAW66794.1| cysteine protease [Pinus taeda]
gi|58201348|gb|AAW66795.1| cysteine protease [Pinus taeda]
gi|58201362|gb|AAW66802.1| cysteine protease [Pinus taeda]
gi|58201364|gb|AAW66803.1| cysteine protease [Pinus taeda]
gi|58201370|gb|AAW66806.1| cysteine protease [Pinus taeda]
gi|58201372|gb|AAW66807.1| cysteine protease [Pinus taeda]
gi|58201374|gb|AAW66808.1| cysteine protease [Pinus taeda]
gi|58201380|gb|AAW66811.1| cysteine protease [Pinus taeda]
gi|58201382|gb|AAW66812.1| cysteine protease [Pinus taeda]
gi|58201384|gb|AAW66813.1| cysteine protease [Pinus taeda]
gi|58201386|gb|AAW66814.1| cysteine protease [Pinus taeda]
gi|58201402|gb|AAW66822.1| cysteine protease [Pinus taeda]
Length = 193
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 83/142 (58%), Gaps = 11/142 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG 139
++ E+++ Y+ E ++RF +F++N Y +H QG +Y G+N+FAD++ EF
Sbjct: 45 WLAEHKKAYNGLDEKQKRFTVFKDNFL---YIHEHNQGNRSYKLGLNQFADLSHEEFKAT 101
Query: 140 L--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ LD ++ L+S Y + S+ L +SI++++KG V P V+DQ CGSCWA S
Sbjct: 102 YLGAKLDTKK-RLLRSPSPRYQY--SDGEDLPKSIDWREKGAVAP-VKDQGACGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 158 TVAAVEGINQIVTGDLISLSEQ 179
>gi|58201352|gb|AAW66797.1| cysteine protease [Pinus taeda]
Length = 192
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 83/142 (58%), Gaps = 11/142 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG 139
++ E+++ Y+ E ++RF +F++N Y +H QG +Y G+N+FAD++ EF
Sbjct: 44 WLAEHKKAYNGLDEKQKRFTVFKDNFL---YIHEHNQGNRSYKLGLNQFADLSHEEFKAT 100
Query: 140 L--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ LD ++ L+S Y + S+ L +SI++++KG V P V+DQ CGSCWA S
Sbjct: 101 YLGAKLDTKK-RLLRSPSPRYQY--SDGEDLPKSIDWREKGAVAP-VKDQGACGSCWAFS 156
Query: 198 AVACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 157 TVAAVEGINQIVTGDLISLSEQ 178
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 81/146 (55%), Gaps = 12/146 (8%)
Query: 81 DFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSEFN-- 137
+++ ++ R Y E R+ +F+NN++ I++ G T VN+FAD+T+ EF
Sbjct: 40 EWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSM 99
Query: 138 ----HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
G+S+L + ++ + + + +S L S++++ KG V P +++Q CG C
Sbjct: 100 YTGFKGVSALSSQS----QTKMSPFRYQNVSSGALPVSVDWRKKGAVTP-IKNQGSCGCC 154
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SAVA +E A IK +LI LS+Q
Sbjct: 155 WAFSAVAAIEGATQIKKGKLISLSEQ 180
>gi|167345238|gb|ABZ69059.1| cysteine protease [Pinus radiata]
gi|167345240|gb|ABZ69060.1| cysteine protease [Pinus radiata]
Length = 185
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 83/142 (58%), Gaps = 11/142 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG 139
++ E+++ Y+ E ++RF +F++N Y +H QG +Y G+N+FAD++ EF
Sbjct: 45 WLAEHKKAYNGLDEKQKRFTVFKDNFL---YIHEHNQGNRSYKLGLNQFADLSHEEFKAT 101
Query: 140 L--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ LD ++ L+S Y + S+ L +SI++++KG V P V+DQ CGSCWA S
Sbjct: 102 YLGAKLDTKK-RLLRSPSPRYQY--SDGEDLPKSIDWREKGAVAP-VKDQGACGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 158 TVAAVEGINQIVTGDLISLSEQ 179
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 101/188 (53%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ +++ E+ Y++ E ERRF+ FR+NL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 43 YAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFADLTNEE 102
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ S+ + + + + + ++++ L ES++++ KG V V+DQ CGSCWA
Sbjct: 103 YR---STYLGARTKPDRERKLSARYQAADNDELPESVDWRKKGAV-GAVKDQGGCGSCWA 158
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
SA+A +E I ++I LS+Q ++ + GG+M+ Y+ +
Sbjct: 159 FSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMD----------YAFEFII 208
Query: 252 LNVGYDNE 259
N G D+E
Sbjct: 209 NNGGIDSE 216
>gi|116779325|gb|ABK21238.1| unknown [Picea sitchensis]
gi|148905850|gb|ABR16087.1| unknown [Picea sitchensis]
gi|148908434|gb|ABR17330.1| unknown [Picea sitchensis]
gi|148908881|gb|ABR17545.1| unknown [Picea sitchensis]
gi|224286109|gb|ACN40765.1| unknown [Picea sitchensis]
Length = 366
Score = 77.4 bits (189), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 77/141 (54%), Gaps = 8/141 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F+R Y ++Y E E RF +F++NL + K + A++GV +F+D+T EF H
Sbjct: 57 FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDP-RASHGVTKFSDLTQEEFRH 115
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L + L+ + +++ L E ++++KG V +V++Q CGSCWA S
Sbjct: 116 QYLGL---RAPPLRDAHDAPILPTND---LPEDFDWREKGAVT-EVKNQGSCGSCWAFST 168
Query: 199 VACLESAYAIKHNELIELSKQ 219
LE A +K EL+ LS+Q
Sbjct: 169 TGALEGANFLKTGELVSLSEQ 189
>gi|417401303|gb|JAA47542.1| Putative cathepsin f [Desmodus rotundus]
Length = 459
Score = 77.0 bits (188), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 74/155 (47%), Gaps = 34/155 (21%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK F+ Y R Y+++ E + R IF NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 162 FKHFIATYNRTYETEEEAQWRMSIFINNMVRAQEIQALDRGTAQYGVTKFSDLTEEEFR- 220
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESI--------------NYKDKGKVLPKV 184
T+ N GL + + ++++KG V KV
Sbjct: 221 ------------------TFYLNPLLKEGLGKKMRLAKPVDDPAPPEWDWRNKGAVT-KV 261
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++Q +CGSCWA S +E + +K +L+ LS+Q
Sbjct: 262 KNQGMCGSCWAFSVTGNVEGQWFLKQGDLLSLSEQ 296
>gi|116787909|gb|ABK24688.1| unknown [Picea sitchensis]
gi|224284108|gb|ACN39791.1| unknown [Picea sitchensis]
gi|224285024|gb|ACN40241.1| unknown [Picea sitchensis]
Length = 366
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 77/141 (54%), Gaps = 8/141 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F+R Y ++Y E E RF +F++NL + K + A++GV +F+D+T EF H
Sbjct: 57 FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDP-RASHGVTKFSDLTQEEFRH 115
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L + L+ + +++ L E ++++KG V +V++Q CGSCWA S
Sbjct: 116 QYLGL---RAPPLRDAHDAPILPTND---LPEDFDWREKGAVT-EVKNQGSCGSCWAFST 168
Query: 199 VACLESAYAIKHNELIELSKQ 219
LE A +K EL+ LS+Q
Sbjct: 169 TGALEGANFLKTGELVSLSEQ 189
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 53/171 (30%), Positives = 83/171 (48%), Gaps = 17/171 (9%)
Query: 64 ASTFDLEEFLDHGN-----QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQ 118
+S D++ + N ++ ++ E + Y+S E E RF+IF+ NL+ ID +
Sbjct: 24 SSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRIIDDHNADAN 83
Query: 119 GTATYGVNRFADMTDSEFNH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYK 175
+ + G+NRFAD+TD E+ G S ++ N + L ++++
Sbjct: 84 RSYSLGLNRFADLTDEEYRSTYLGFKSGPKAKVSN--------RYVPKVGVVLPNYVDWR 135
Query: 176 DKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGR 226
G V+ V+DQ LC SCWA SAVA +E I LI LS+Q GR
Sbjct: 136 TVGAVV-GVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGR 185
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 47/135 (34%), Positives = 75/135 (55%), Gaps = 4/135 (2%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSEFNHGLSSLDWE 146
+ Y+ E ERRF+IF +NL+ ID + + E + T G+ RFAD+T+ E+ +
Sbjct: 47 KAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRFADLTNEEYRSTYLGVKPG 106
Query: 147 QIENLKSTFETYSFN--SSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLES 204
Q+ ++ S+N L + +++++KG V P ++DQ CGSCWA S VA +E
Sbjct: 107 QVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAVAP-IKDQGGCGSCWAFSTVAAVEG 165
Query: 205 AYAIKHNELIELSKQ 219
I +LI LS+Q
Sbjct: 166 INQIVTGDLIVLSEQ 180
>gi|427777627|gb|JAA54265.1| Putative cathepsin f-like cysteine protease [Rhipicephalus
pulchellus]
Length = 475
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 68/128 (53%), Gaps = 3/128 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F R Y + Y E E RF IF+NNLK I + + E+GTA YG+ F+D++ SEF
Sbjct: 166 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 225
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L + E+ K+ + N L + +++ KG V +V++Q +CGSCWA S
Sbjct: 226 HYLGLKKDLAEH-KAEVKPIKVGPVNE-PLPDLFDWRTKGAVT-EVKNQGMCGSCWAFSV 282
Query: 199 VACLESAY 206
+E +
Sbjct: 283 TGNVEGQW 290
>gi|58201350|gb|AAW66796.1| cysteine protease [Pinus taeda]
gi|58201354|gb|AAW66798.1| cysteine protease [Pinus taeda]
gi|58201358|gb|AAW66800.1| cysteine protease [Pinus taeda]
gi|58201378|gb|AAW66810.1| cysteine protease [Pinus taeda]
gi|58201390|gb|AAW66816.1| cysteine protease [Pinus taeda]
gi|58201396|gb|AAW66819.1| cysteine protease [Pinus taeda]
gi|58201404|gb|AAW66823.1| cysteine protease [Pinus taeda]
gi|58201408|gb|AAW66825.1| cysteine protease [Pinus taeda]
Length = 193
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 83/142 (58%), Gaps = 11/142 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG 139
++ E+++ Y+ E ++RF +F++N Y +H QG +Y G+N+FAD++ EF
Sbjct: 45 WLAEHKKAYNGLDEKQKRFTVFKDNFL---YIHEHNQGNRSYKLGLNQFADLSHEEFKAT 101
Query: 140 L--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ LD ++ L+S Y + S+ L +SI++++KG V P V+DQ CGSCWA S
Sbjct: 102 YLGAKLDTKK-RLLRSPSPRYQY--SDGEDLPKSIDWREKGAVAP-VKDQGACGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 158 TVAAVEGINQIVTGDLISLSEQ 179
>gi|67773372|gb|AAY81943.1| cysteine protease 5 [Paragonimus westermani]
Length = 325
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 78/142 (54%), Gaps = 12/142 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ F R+Y + Y +D + ++RF IF++NL ++GTA YGV +F+D+T EF
Sbjct: 32 YEQFKRDYGKVYANDDD-QKRFAIFKDNLVRAQKLQLKDRGTARYGVTQFSDLTPEEFAA 90
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
LS +Q+E ++ T E +++++ G V P V++Q CGSCWA S
Sbjct: 91 KYLSRPMNDQVERVRPT---------GLKAAPERMDWREWGAVGP-VENQGSCGSCWAFS 140
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+E + +K +L+ LSKQ
Sbjct: 141 VAGNVEGQWFLKTGQLVSLSKQ 162
Score = 39.7 bits (91), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 44/87 (50%), Gaps = 11/87 (12%)
Query: 174 YKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVM 233
Y +K K+L K+ D + G+ A AY +H L S + +FY+ G+
Sbjct: 208 YLNKEKLLAKIDDLIVLGAYEEEHA------AYLAEHGPL---SSALNAGYLQFYQSGIS 258
Query: 234 NLPHMLCSKGPYSLNHAVLNVGYDNES 260
+ + CS P SLNHAVL VGYD E+
Sbjct: 259 HPSYEECS--PASLNHAVLTVGYDTEN 283
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 58/188 (30%), Positives = 95/188 (50%), Gaps = 16/188 (8%)
Query: 81 DFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSEFNHG 139
+++ ++ R Y E R+ +F++N++ I++ G T VN+FAD+T+ EF
Sbjct: 40 EWMTKHGRVYADVKEKSNRYVVFKSNVERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSM 99
Query: 140 LSSLDWEQIENLKSTFETYSFNSSN--SYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ + +S +T SF N S L S++++ KG V P +++Q CG CWA S
Sbjct: 100 YTGFKGVSSLSSQSQTKTTSFRYQNVSSGALPISVDWRTKGAVTP-IKNQGSCGCCWAFS 158
Query: 198 AVACLESAYAIKHNELIELSKQPP---KTHGRFYKGGVMN--LPHMLCSKG-------PY 245
AVA +E A IK +LI LS+Q T+ +GG+M+ H++ + G PY
Sbjct: 159 AVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCEGGLMDTAFEHIMATGGLTTESNYPY 218
Query: 246 SLNHAVLN 253
A N
Sbjct: 219 KGEDATCN 226
>gi|167345242|gb|ABZ69061.1| cysteine protease [Pinus sylvestris]
Length = 214
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 83/142 (58%), Gaps = 11/142 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG 139
++ E+++ Y+ E ++RF +F++N Y +H QG +Y G+N+FAD++ EF
Sbjct: 45 WLAEHKKAYNGLDEKQKRFTVFKDNFL---YIHEHNQGNRSYKLGLNQFADLSHEEFKAT 101
Query: 140 L--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ LD ++ L+S Y + S+ L +SI++++KG V P V+DQ CGSCWA S
Sbjct: 102 YLGAKLDTKK-RLLRSPSPRYQY--SDGEDLPKSIDWREKGAVAP-VKDQGQCGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 158 TVAAVEGINQIVTGDLISLSEQ 179
>gi|332374900|gb|AEE62591.1| unknown [Dendroctonus ponderosae]
Length = 359
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 81/149 (54%), Gaps = 17/149 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH-EQGTATY--GVNRFADMTDSE 135
F F ++Y + Y +DSE+ R +IF+ NL I+ + K +Q +Y G+N+F+D+T++E
Sbjct: 24 FVTFQQKYGKVYQNDSELSVREEIFKENLAKIEEHNKQFQQNLVSYELGLNQFSDLTEAE 83
Query: 136 FNHGLSSLDW-----EQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
F L+ +Q+E S F+ + S+N+ +KG V P V++Q C
Sbjct: 84 FQALLTMSPLTDQLTKQMEKYNSEFDIKT--------APVSVNWAEKGVVTP-VKNQGNC 134
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCW + +ES A+K L+ LS+Q
Sbjct: 135 GSCWTFTTTGTIESRLALKTGSLVSLSEQ 163
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 77.0 bits (188), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 78/154 (50%), Gaps = 12/154 (7%)
Query: 69 LEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRF 128
L E L + + ++ E+ + Y+ E E+RF IF++N++ I+ + + VN
Sbjct: 30 LYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLSVNHL 89
Query: 129 ADMTDSEFN---HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQ 185
AD+T EF +G +D E F T SF N + +++++ KG V P ++
Sbjct: 90 ADLTLDEFKASRNGYKKIDRE--------FTTTSFKYENVTAIPAAVDWRVKGAVTP-IK 140
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
DQ CGSCWA S VA E I +L+ LS+Q
Sbjct: 141 DQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQ 174
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 85/155 (54%), Gaps = 12/155 (7%)
Query: 97 ERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNH---GLSSLDWEQIENL 151
++RF+IF++NL+ ID + ++ + ATY G+ +F D+T+ E+ G + +I
Sbjct: 71 DKRFNIFKDNLRFIDLHNENNK-NATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKA 129
Query: 152 KSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN 211
K+ + YS + N + E+++++ KG V P ++DQ CGSCWA S A +E I
Sbjct: 130 KNVNQKYSA-AVNGKEVPETVDWRQKGAVNP-IKDQGTCGSCWAFSTTAAVEGINKIVTG 187
Query: 212 ELIELSKQP----PKTHGRFYKGGVMNLPHMLCSK 242
ELI LS+Q K++ + GG+M+ K
Sbjct: 188 ELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMK 222
>gi|55979119|gb|AAV69023.1| cysteine protease [Opisthorchis viverrini]
gi|224923980|gb|ACN68966.1| cathepsin F-like cysteine protease [Opisthorchis viverrini]
Length = 326
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 65/208 (31%), Positives = 99/208 (47%), Gaps = 37/208 (17%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
+++F +Y++ Y +D + E RF IF++NL+ EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFRIFKDNLERAKRLQAMEQGTAEYGVTQFSDLTSEEFKT 90
Query: 139 GLSSLDW-EQIENLKSTF-ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ + E I N T E + ++SN +++D G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDEPIVNEDPTPQEDVTMDNSN-------FDWRDHGAVGP-VLDQGDCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNLPHMLC 240
S + +E + K +L+ LS+Q PP+T+ + G + L
Sbjct: 143 SVIGNVEGQWFRKTGDLLGLSEQQLIDCDHSDQGCDGGYPPQTYSAIEEMGGLELR---- 198
Query: 241 SKGPYSLNHAVLN------VGYDNESTR 262
S PY+ + V Y N STR
Sbjct: 199 SDYPYTGKDGICYMDQSKFVAYVNGSTR 226
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/141 (35%), Positives = 76/141 (53%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++ + Y+S E RF+IF +NLK ID T + G+N FAD+T EF +
Sbjct: 49 FESWLAKHSKIYESLDEKLHRFEIFMDNLKHIDD-TNKKVSNYWLGLNEFADLTHEEFKN 107
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L E E + E +S+ + L +S++++ KG V P V++Q CGSCWA S
Sbjct: 108 KFLGLKGELPERKDESIEEFSYR--DFVDLPKSVDWRKKGAVAP-VKNQGQCGSCWAFST 164
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I L LS+Q
Sbjct: 165 VAAVEGINQIVTGNLTMLSEQ 185
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 55/190 (28%), Positives = 97/190 (51%), Gaps = 21/190 (11%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
+ +++++ ++ + Y+ E E+RF +F++NL I + + T T G+N+FAD+T+ E+
Sbjct: 34 DMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQDHNA-QNNTYTLGLNKFADITNKEY 92
Query: 137 NH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
G + ++ ++T Y++NS + L ++++ KG V P ++DQ CGSC
Sbjct: 93 RAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQ--LPVHVDWRLKGAVGP-IKDQGNCGSC 149
Query: 194 WAHSAVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMNLPHMLCSKGPYSLNH 249
WA S VA +E I E + LS+Q R Y GG+M+ Y+
Sbjct: 150 WAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYDEGCNGGLMD----------YAFQF 199
Query: 250 AVLNVGYDNE 259
+ N G D E
Sbjct: 200 IIQNGGIDTE 209
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 75/139 (53%), Gaps = 5/139 (3%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTDSEFNHGL 140
++ Y + Y E E+RF IF N+K I+ + + + G+N+FAD+T+ EF +
Sbjct: 42 WMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLGINQFADLTNEEF---V 98
Query: 141 SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
+S + + S T +F N + +++++ KG V P V++Q CG CWA SAVA
Sbjct: 99 ASRNKFKGHMCSSIIRTTTFKYENVSAIPSTVDWRKKGAVTP-VKNQGQCGCCWAFSAVA 157
Query: 201 CLESAYAIKHNELIELSKQ 219
E + + +L+ LS+Q
Sbjct: 158 ATEGIHKLSTGKLVSLSEQ 176
>gi|197258084|gb|ACH56226.1| cathepsin L-like cysteine proteinase [Radopholus similis]
Length = 417
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 54/167 (32%), Positives = 79/167 (47%), Gaps = 15/167 (8%)
Query: 67 FDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA---TY 123
F EE + +F R + R+++S+ E RF +F NL I + T TY
Sbjct: 85 FFPEELPEVVREFDQIQRTFSREWNSERERWERFKLFERNLAEIARLNAEAKRTGRNMTY 144
Query: 124 GVNRFADMTDSEFNHGLSSLDWEQIENLKSTF---------ETYSFNSSNSYG--LAESI 172
GVN AD T+ E L LD + +++ F ++ S+ G
Sbjct: 145 GVNGMADWTEEEMGRMLLPLDHFKRRRVEAKFIRKMNPILRRAFTDRSAEEPGSEYPRHF 204
Query: 173 NYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+++ +G V P V+ Q CGSCWA +AVA ESAYA+ H L LS+Q
Sbjct: 205 DWRPRGVVTP-VKAQGQCGSCWAFAAVATTESAYAVAHGHLRSLSEQ 250
>gi|47224192|emb|CAG13112.1| unnamed protein product [Tetraodon nigroviridis]
Length = 327
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 76/143 (53%), Gaps = 12/143 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG--TATYGVNRFADMTDSEF 136
FK ++ + + Y S E +R IF N + I+ KH G + T G+N+F+DMT +EF
Sbjct: 29 FKSWMALHNKAY-SVQEFHQRLQIFTENKRRIE---KHNGGNHSFTMGLNQFSDMTFAEF 84
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
W + +N +T +Y +S ESI+++ KG + V++Q CGSCW
Sbjct: 85 R---KRFLWSEPQNCSATKGSYMKTNSPQ---PESIDWRTKGNYVTPVKNQGACGSCWTF 138
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S CLES AI +L+ LS+Q
Sbjct: 139 STTGCLESVTAINTGKLVPLSEQ 161
>gi|311247276|ref|XP_003122571.1| PREDICTED: cathepsin W-like [Sus scrofa]
Length = 367
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 73/141 (51%), Gaps = 3/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F +Y R Y + +E RR DIF NL + + GTA +GV F+D+T+ EF
Sbjct: 42 FTLFQIQYNRSYSNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTPFSDLTEEEFGQ 101
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L W + + S S + + +S +++ K V+ ++ Q C CWA +A
Sbjct: 102 -LHGHHWGAGKAPSMGIKVGSEESGET--VPQSCDWRKKPGVISAIKHQKDCNCCWAMAA 158
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +E+ +AIK+++ ++LS Q
Sbjct: 159 VDNVEAQWAIKYHQAVQLSVQ 179
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 85/155 (54%), Gaps = 12/155 (7%)
Query: 97 ERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNH---GLSSLDWEQIENL 151
++RF+IF++NL+ ID + ++ + ATY G+ +F D+T+ E+ G + +I
Sbjct: 71 DKRFNIFKDNLRFIDLHNENNK-NATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKA 129
Query: 152 KSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN 211
K+ + YS + N + E+++++ KG V P ++DQ CGSCWA S A +E I
Sbjct: 130 KNVNQKYSA-AVNGKEVPETVDWRQKGAVNP-IKDQGTCGSCWAFSTTAAVEGINKIVTG 187
Query: 212 ELIELSKQP----PKTHGRFYKGGVMNLPHMLCSK 242
ELI LS+Q K++ + GG+M+ K
Sbjct: 188 ELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMK 222
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 99/190 (52%), Gaps = 23/190 (12%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
+ +++ + R Y++ E ERR+ +FR+NL+ ID + + G ++ G+NRFAD+T+
Sbjct: 40 MYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTND 99
Query: 135 EFNHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
E+ L + Q E ++++++ L ES++++ KG V +V+DQ CGSC
Sbjct: 100 EYRATYLGARTRPQRERKLGA----RYHAADNEDLPESVDWRAKGAV-AEVKDQGSCGSC 154
Query: 194 WAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNH 249
WA S +A +E I +LI LS+Q ++ + GG+M+ Y+
Sbjct: 155 WAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMD----------YAFEF 204
Query: 250 AVLNVGYDNE 259
+ N G D E
Sbjct: 205 IINNGGIDTE 214
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 77.0 bits (188), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 55/190 (28%), Positives = 97/190 (51%), Gaps = 21/190 (11%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
+ +++++ ++ + Y+ E E+RF +F++NL I + + T T G+N+FAD+T+ E+
Sbjct: 34 DMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQDHNA-QNNTYTLGLNKFADITNEEY 92
Query: 137 NH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
G + ++ ++T Y++NS + L ++++ KG V P ++DQ CGSC
Sbjct: 93 RAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQ--LPVHVDWRLKGAVGP-IKDQGNCGSC 149
Query: 194 WAHSAVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMNLPHMLCSKGPYSLNH 249
WA S VA +E I E + LS+Q R Y GG+M+ Y+
Sbjct: 150 WAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYDEGCNGGLMD----------YAFQF 199
Query: 250 AVLNVGYDNE 259
+ N G D E
Sbjct: 200 IIQNGGIDTE 209
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 76.6 bits (187), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 99/190 (52%), Gaps = 23/190 (12%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
+ +++ + R Y++ E ERR+ +FR+NL+ ID + + G ++ G+NRFAD+T+
Sbjct: 45 MYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTND 104
Query: 135 EFNHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
E+ L + Q E ++++++ L ES++++ KG V +V+DQ CGSC
Sbjct: 105 EYRATYLGARTRPQRERKLGA----RYHAADNEDLPESVDWRAKGAV-AEVKDQGSCGSC 159
Query: 194 WAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNH 249
WA S +A +E I +LI LS+Q ++ + GG+M+ Y+
Sbjct: 160 WAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMD----------YAFEF 209
Query: 250 AVLNVGYDNE 259
+ N G D E
Sbjct: 210 IINNGGIDTE 219
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 64/211 (30%), Positives = 99/211 (46%), Gaps = 23/211 (10%)
Query: 56 PNSYGSEE--ASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY 113
P SEE + FD L HG + D + Q E R+ IF++NL+ I
Sbjct: 44 PQDLSSEERLQALFD-SWMLQHGKSYADNALSGDSQA---GEKATRYGIFKDNLRFIHGE 99
Query: 114 TKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESIN 173
+ QG G+N FAD+T+ EF +++ +++ E + + S L +SI+
Sbjct: 100 NEKNQGY-FLGLNAFADLTNEEFRAQRHGGRFDRSRE-RTSHEEFRYGSVQLKDLPDSID 157
Query: 174 YKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYK 229
+++KG V+ V+DQ CGSCWA SAVA +E + EL+ LS+Q K
Sbjct: 158 WREKGAVV-GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCN 216
Query: 230 GGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
GG+M+ Y+ + N G D E+
Sbjct: 217 GGLMD----------YAFGFVIKNGGLDTEA 237
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 74/138 (53%), Gaps = 3/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +E RRF++F+ N+K I+ + G+N+FAD+T+ EF +
Sbjct: 40 WMAQYSRVYKDAAEKARRFEVFKANVKFIESFNTGGNRKFWLGINQFADLTNDEFRTTKT 99
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ ++ +L + + + + + +I+++ G V P ++DQ CG CWA SAVA
Sbjct: 100 NKGFK--PSLDKVSTGFRYENVSVDAIPATIDWRTNGAVTP-IKDQGQCGCCWAFSAVAA 156
Query: 202 LESAYAIKHNELIELSKQ 219
E I +LI LS+Q
Sbjct: 157 TEGIVKISTGKLISLSEQ 174
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 57/184 (30%), Positives = 94/184 (51%), Gaps = 21/184 (11%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG 139
++ E++R Y+ E ++RF +F++N Y +H QG +Y G+N+FAD++ EF
Sbjct: 45 WLAEHKRAYNGLDEKQKRFSVFKDNFL---YIHEHNQGNRSYKLGLNQFADLSHEEFKAT 101
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ + L S + + S+ L ESI++++KG V V+DQ CGSCWA S V
Sbjct: 102 YLGAKLDTKKRL-SRPPSRRYQYSDGEDLPESIDWREKGAVT-SVKDQGSCGSCWAFSTV 159
Query: 200 ACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVLNVG 255
A +E I +LI LS+Q ++ + GG+M+ Y+ + N G
Sbjct: 160 AAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMD----------YAFEFIINNGG 209
Query: 256 YDNE 259
D+E
Sbjct: 210 LDSE 213
>gi|343473370|emb|CCD14732.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 75/145 (51%), Gaps = 11/145 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT---ATYGVNRFADMTDS 134
QF F ++Y R Y +E RF +F+ N++ K E AT+GV RF+DM+
Sbjct: 40 QFAAFKQKYSRSYKDATEEAFRFRVFKQNMER----AKEEAAANPYATFGVTRFSDMSPE 95
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + LK + + ++ + E+++++ KG V P V+DQ CGSCW
Sbjct: 96 EFRATYHNGAEYYAAALKRPRKVVNVSTGKA---PEAVDWRKKGAVTP-VKDQGACGSCW 151
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+ +E + + +EL LS+Q
Sbjct: 152 AFSAIGNIEGQWKVAGHELTSLSEQ 176
>gi|42564153|gb|AAS20589.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 322
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 55/146 (37%), Positives = 77/146 (52%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y S E RF IF+NNL+TI+ + K+E+G TY V +FADMT
Sbjct: 21 DQWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAKYEEGKVTYYMAVTQFADMTR 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF L L + NL +T + L E I++ +KG VLP V++Q C SC
Sbjct: 81 DEFRKKLG-LQNNRRPNLNATLRVFP----EDLELPEQIDWTEKGAVLP-VKNQGNCRSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA S LE AI + LS+Q
Sbjct: 135 WAFSTTGSLEGQNAIHNKVKTPLSEQ 160
>gi|440792185|gb|ELR13413.1| cathepsin L, putative [Acanthamoeba castellanii str. Neff]
Length = 331
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 68/142 (47%), Gaps = 4/142 (2%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
QF FV+ Y + Y S E E+RF IF NL +G +G+ +FADM+ EF
Sbjct: 33 QFNAFVQRYGKSYASAEEAEQRFAIFTQNLAETAALNIKYEGKTQFGITKFADMSQEEFQ 92
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ + T + Y + + ++++K V+ V DQ CGSCWA S
Sbjct: 93 SRVLMSN----PPPPPTEKPYRGPKFEGFTAPSTFDWRNKPGVVTPVYDQGQCGSCWAFS 148
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A +ES +A+ ++L LS Q
Sbjct: 149 ATENIESQWALAGHKLTGLSMQ 170
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 76.6 bits (187), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 77/140 (55%), Gaps = 4/140 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEFNHG 139
++ E+ R Y++ E +RRF +F +NL+ +D + + E G G+N+FAD+T+ EF
Sbjct: 52 WLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGF-RLGMNQFADLTNDEFRAA 110
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ + + L ES+++++KG V P V++Q CGSCWA SAV
Sbjct: 111 YLGARIPAARRRGTAVGERYRHGGGAEELPESVDWREKGAVAP-VKNQGQCGSCWAFSAV 169
Query: 200 ACLESAYAIKHNELIELSKQ 219
+ +ES I E++ LS+Q
Sbjct: 170 SSVESVNQIVTGEMVTLSEQ 189
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 84/145 (57%), Gaps = 7/145 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSE 135
+++++ ++ + Y++ E E+RF+IF++NL ID +H ++ G+NRFAD+T+ E
Sbjct: 46 MYEEWLVKHGKNYNALGEKEKRFEIFKDNLGFID---EHNSKNLSFRLGLNRFADLTNEE 102
Query: 136 F-NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ L + N K +T + + L ES++++ +G V+ V+DQ CGSCW
Sbjct: 103 YRTRFLGTRINPNRRNRKVNSQTNRYATRVGDKLPESVDWRKEGAVV-GVKDQGSCGSCW 161
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+A +E + +LI LS+Q
Sbjct: 162 AFSAIAAVEGVNKLATGDLISLSEQ 186
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 63/211 (29%), Positives = 100/211 (47%), Gaps = 23/211 (10%)
Query: 56 PNSYGSEE--ASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY 113
P SEE + FD L HG + + + Q E R+ IF++NL+ I
Sbjct: 44 PQDLSSEERLQALFD-SWMLQHGKSYAENALSGDSQA---GEKATRYGIFKDNLRFIHGE 99
Query: 114 TKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESIN 173
+ QG G+N FAD+T+ EF +++ ++++E + + S L +SI+
Sbjct: 100 NEKNQGY-FLGLNAFADLTNEEFRAQRHGGRFDRSRE-RTSYEEFRYGSVQLKDLPDSID 157
Query: 174 YKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYK 229
+++KG V+ V+DQ CGSCWA SAVA +E + EL+ LS+Q K
Sbjct: 158 WREKGAVV-GVKDQGSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCN 216
Query: 230 GGVMNLPHMLCSKGPYSLNHAVLNVGYDNES 260
GG+M+ Y+ + N G D E+
Sbjct: 217 GGLMD----------YAFGFVIKNGGLDTEA 237
>gi|22096273|gb|AAC17994.2| cysteine protease [Babesia equi]
Length = 438
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 81/157 (51%), Gaps = 19/157 (12%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSE 135
+F +F + Y R++ E RF FR+N + K + G +Y G+N+F+DMTD E
Sbjct: 122 EFDEFNKFYSREHADADERRVRFLAFRDNYNAV----KAQTGEESYEKGINKFSDMTDEE 177
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNS---------SNSYGLAESINYKD----KGKVLP 182
FN +L E+++ + F S + G+ +S++ +D K +
Sbjct: 178 FNLRFPALSVEELKKSLEVSASEEFTSPEHLDKVRIAKGLGVEDSVDGEDLDWRKLNGVT 237
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CGSCWA +AV +ES Y IK + ++LS+Q
Sbjct: 238 PVKDQGNCGSCWAFAAVGSVESLYLIKKGQALDLSEQ 274
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 80/138 (57%), Gaps = 6/138 (4%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y+++ E +RF+IF+ N++ I+ + K G+N FAD+T+ EF +
Sbjct: 42 WMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADLTNQEFK---A 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
S + ++ + S+ + + + +S + +++++ KG V P V+DQ CG CWA SAVA
Sbjct: 99 SRNGYKLPHDCSSNTPFRYENVSS--VPTTVDWRTKGAVTP-VKDQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELIELSKQ 219
+E + LI LS+Q
Sbjct: 156 MEGITKLSTGNLISLSEQ 173
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 82/144 (56%), Gaps = 11/144 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F ++ + + Y++ E RF+IF++NL ID T + + G+N FAD+++ EFN
Sbjct: 48 FNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDE-TNKKNNSYRLGLNEFADLSNDEFNE 106
Query: 139 G-LSSLDWEQIENLKSTFETYS--FNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ SL IE ++Y F + + L E+++++ KG V P V+ Q CGSCWA
Sbjct: 107 KYVGSLIDATIE------QSYDEEFINEDIVNLPENVDWRKKGAVTP-VRHQGSCGSCWA 159
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
SAVA +E I+ +L+ELS+Q
Sbjct: 160 FSAVATVEGINKIRTGKLVELSEQ 183
>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 400
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/139 (31%), Positives = 79/139 (56%), Gaps = 3/139 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y + Y+ +E+E+RF IF+NN++ I+ + +N+F D+ D EF L
Sbjct: 118 WMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFNVAGDKPFNIRINQFPDLHDEEFKALLI 177
Query: 142 SLDWEQIENLKSTFETYSFN-SSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
+ ++ +++ E SF S + +++ + KG V P ++DQ + GSCWA SAVA
Sbjct: 178 NGQ-RKVSGVETATEETSFRYGSVVTNIPATMDGRKKGVVTP-IKDQGIIGSCWALSAVA 235
Query: 201 CLESAYAIKHNELIELSKQ 219
+E + I ++L+ LSKQ
Sbjct: 236 AIEGIHQITTSKLMFLSKQ 254
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 74/145 (51%), Gaps = 9/145 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTID-YYTKHEQGTATYGV--NRFADMTDS 134
Q++DF E+ R+Y S E R +F N + ID + + E G T+ + N+F DMT
Sbjct: 23 QWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSE 82
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF ++ N+ S T + L + ++++ KG V P V+DQ CGSCW
Sbjct: 83 EFTATMNGF-----LNVPSRRPTAILRADPDETLPKEVDWRTKGAVTP-VKDQKQCGSCW 136
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A S LE + +K +L+ LS+Q
Sbjct: 137 AFSTTGSLEGQHFLKDGKLVSLSEQ 161
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 78/144 (54%), Gaps = 6/144 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--- 136
+ ++ + R Y + E + RFD+F+ NLK I+ + K T GVN FAD T EF
Sbjct: 24 QQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIAT 83
Query: 137 NHGLSSLD-WEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ GL ++ E + ++++N S+ G E+ +++ +G V P V+ Q CG CWA
Sbjct: 84 HTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAG-RETKDWRYEGAVTP-VKYQGQCGCCWA 141
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S+VA +E I N L+ LS+Q
Sbjct: 142 FSSVAAVEGLTKIVGNNLVSLSEQ 165
>gi|403376023|gb|EJY87990.1| Cathepsin L [Oxytricha trifallax]
Length = 343
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 74/141 (52%), Gaps = 8/141 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F +++ +Y + Y + E + R++ ++ N+ + Y T G+N+F D T E+
Sbjct: 43 FANYLAKYGKSYGTKEEFQFRYEQYQKNMAKVAQYNGQNGNTFRLGINKFTDYTPEEYKV 102
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L + ++ T E + N+ SI++++KG V P V+DQ CGSCWA SA
Sbjct: 103 LLGY----KPQSKPMTLEASYLSEENT---PASIDWREKGAVTP-VKDQGQCGSCWAFSA 154
Query: 199 VACLESAYAIKHNELIELSKQ 219
LE Y I +N+LI +S+Q
Sbjct: 155 TGALEGHYQISNNKLISISEQ 175
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 77/140 (55%), Gaps = 4/140 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEFNHG 139
++ E+ R Y++ E +RRF +F +NL+ +D + + E G G+N+FAD+T+ EF
Sbjct: 112 WLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGF-RLGMNQFADLTNDEFRAA 170
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ + + L ES+++++KG V P V++Q CGSCWA SAV
Sbjct: 171 YLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAP-VKNQGQCGSCWAFSAV 229
Query: 200 ACLESAYAIKHNELIELSKQ 219
+ +ES I E++ LS+Q
Sbjct: 230 SSVESVNQIVTGEMVTLSEQ 249
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 76.6 bits (187), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 74/138 (53%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +E +R+F++F+ N ID + G+N+FAD+T+ EF ++
Sbjct: 40 WMSQYGRSYKDAAEKDRKFEVFKANAAFIDSFNAKNH-KFWLGINQFADITNEEFK--VT 96
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ I N +S+ + + L +I+++ KG V P V+DQ CG CWA SAVA
Sbjct: 97 KTNKGFISNKVRASTGFSYENVSIDALPATIDWRTKGAVTP-VKDQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELIELSKQ 219
E + +L+ LS+Q
Sbjct: 156 TEGIVKLSTGKLVSLSEQ 173
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/144 (32%), Positives = 82/144 (56%), Gaps = 7/144 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
++ ++ ++ + Y++ E ++RFDIF++NL+ ID H TY G+NRFAD+T+ E+
Sbjct: 4 YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFID---DHNADNRTYKLGLNRFADLTNEEY 60
Query: 137 NHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
L + +K+ ++ + L ES++++++ VLP V+DQ CGSCWA
Sbjct: 61 RARYLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLP-VKDQGNCGSCWA 119
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S + +E I +LI LS+Q
Sbjct: 120 FSTIGAVEGINKIVTGDLISLSEQ 143
>gi|410493601|ref|YP_006908539.1| V-CATH [Epinotia aporema granulovirus]
gi|354805035|gb|AER41457.1| V-CATH [Epinotia aporema granulovirus]
Length = 329
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/143 (34%), Positives = 79/143 (55%), Gaps = 5/143 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT-ATYGVNRFADMTDSEFN 137
F DFV +Y + Y +D E +++IFRNNL I+ K+ + T A Y +NR +D+ +E
Sbjct: 28 FDDFVIKYNKVYATDEERAAKYEIFRNNLVVIN--EKNSKTTNALYDINRLSDLNKNELL 85
Query: 138 HGLS-SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
S++ ++ N E + S L S +++ V P V++Q CGSCWA
Sbjct: 86 RSTGFSVNLKKNLNPSKECEYVLVADAPSRSLPASFDWRANNAVTP-VKNQLDCGSCWAF 144
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +A +ES YAIK+ ++L++Q
Sbjct: 145 STIANIESLYAIKYGVEVDLAEQ 167
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 82/144 (56%), Gaps = 9/144 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F ++ ++ + Y+S E RF+IFR+NL ID T + + G+N FAD+++ EF
Sbjct: 48 FDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDE-TNKKNNSYWLGLNGFADLSNDEFKK 106
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G + D+ +E+ + E +++ +Y +SI+++ KG V P V++Q CGSCWA
Sbjct: 107 KYVGFVAEDFTGLEHFDN--EDFTYKHVTNY--PQSIDWRAKGAVTP-VKNQGACGSCWA 161
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +A +E I L+ELS+Q
Sbjct: 162 FSTIATVEGINKIVTGNLLELSEQ 185
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 78/144 (54%), Gaps = 6/144 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--- 136
+ ++ + R Y + E + RFD+F+ NLK I+ + K T GVN FAD T EF
Sbjct: 48 QQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTREEFIAT 107
Query: 137 NHGLSSLD-WEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ GL ++ E + ++++N S+ G E+ +++ +G V P V+ Q CG CWA
Sbjct: 108 HTGLKGVNGIPSSEFVDEMIPSWNWNVSDVAG-RETKDWRYEGAVTP-VKYQGQCGCCWA 165
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S+VA +E I N L+ LS+Q
Sbjct: 166 FSSVAAVEGLTKIVGNNLVSLSEQ 189
>gi|407424636|gb|EKF39072.1| cysteine protease, putative [Trypanosoma cruzi marinkellei]
Length = 438
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 45/142 (31%), Positives = 71/142 (50%), Gaps = 4/142 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++ ++Y E +R IF NL I + + G+N+F+DMT EFN
Sbjct: 43 FEKYISDFGKRYADPEEHRKRNAIFNENLAKIRAFNGVLGRSYRLGINKFSDMTKEEFNA 102
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD-KGKVLPKVQDQHLCGSCWAHS 197
+ S Y + + E++N+++ K VL V+DQ CGSCWAH+
Sbjct: 103 KFNGRVATPQSTRLSQSPPYKYTRTT---FPEALNWQEAKNPVLTPVKDQGSCGSCWAHA 159
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A +ES YAI +L+ LS Q
Sbjct: 160 ATESVESMYAISTGKLLTLSTQ 181
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 82/144 (56%), Gaps = 9/144 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F ++ ++ + Y+S E RF+IFR+NL ID T + + G+N FAD+++ EF
Sbjct: 48 FDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDE-TNKKNNSYWLGLNGFADLSNDEFKK 106
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G + D+ +E+ + E +++ +Y +SI+++ KG V P V++Q CGSCWA
Sbjct: 107 KYVGFVAEDFTGLEHFDN--EDFTYKHVTNY--PQSIDWRAKGAVTP-VKNQGACGSCWA 161
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +A +E I L+ELS+Q
Sbjct: 162 FSTIATVEGINKIVTGNLLELSEQ 185
>gi|56718881|gb|AAW28151.1| westerpain-1 [Paragonimus westermani]
Length = 322
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 79/143 (55%), Gaps = 13/143 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
++ F R+Y + Y ++ + ++RF IF++NL +QGTA YGV +F+D+T EF
Sbjct: 27 YEQFKRDYGKVYANEDD-QKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 85
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ + ++ +Q++ ++ T E I+++ KG V V++Q CGSCWA
Sbjct: 86 KYLSAPVNNDQVKRVRPT---------GLKAAPERIDWRAKGAVT-AVENQGSCGSCWAF 135
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E + IK +L+ LSKQ
Sbjct: 136 STAGNVEGQWFIKTGQLVSLSKQ 158
>gi|67773370|gb|AAY81942.1| cysteine protease 3 [Paragonimus westermani]
Length = 321
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 79/143 (55%), Gaps = 13/143 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
++ F R+Y + Y ++ + ++RF IF++NL +QGTA YGV +F+D+T EF
Sbjct: 27 YEQFKRDYGKVYANEDD-QKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 85
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ + ++ +Q++ ++ T E I+++ KG V V++Q CGSCWA
Sbjct: 86 KYLSAPVNNDQVKRVRPT---------GLKAAPERIDWRAKGAVT-AVENQGSCGSCWAF 135
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E + IK +L+ LSKQ
Sbjct: 136 STAGNVEGQWFIKTGQLVSLSKQ 158
>gi|60396844|gb|AAX19661.1| cysteine proteinase [Populus tomentosa]
Length = 374
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 76/141 (53%), Gaps = 13/141 (9%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH--- 138
F R++++ Y S E + RF +F++NL+ + K + TA++GV +F+D+T +EF
Sbjct: 62 FKRKFKKSYLSQEEHDYRFSVFKSNLRRAARHQKLDP-TASHGVTQFSDLTSAEFRKQVL 120
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
GL L + N T L E ++++KG V P V++Q CGSCW+ S
Sbjct: 121 GLRKLRLPKDANKAPILPTND--------LPEDFDWREKGAVGP-VKNQGSCGSCWSFST 171
Query: 199 VACLESAYAIKHNELIELSKQ 219
LE A+ + EL+ LS+Q
Sbjct: 172 TGALEGAHFLATGELVSLSEQ 192
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 84/155 (54%), Gaps = 12/155 (7%)
Query: 97 ERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNH---GLSSLDWEQIENL 151
++RF+IF++NL+ ID + + + ATY G+ +F D+T+ E+ G + +I
Sbjct: 71 DKRFNIFKDNLRFIDLHNEDNK-NATYKLGLTKFTDLTNDEYRKLYLGARTEPARRIAKA 129
Query: 152 KSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN 211
K+ + YS + N + E+++++ KG V P ++DQ CGSCWA S A +E I
Sbjct: 130 KNVNQKYSA-AVNGKEVPETVDWRQKGAVNP-IKDQGTCGSCWAFSTTAAVEGINKIVTG 187
Query: 212 ELIELSKQP----PKTHGRFYKGGVMNLPHMLCSK 242
ELI LS+Q K++ + GG+M+ K
Sbjct: 188 ELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMK 222
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/165 (32%), Positives = 88/165 (53%), Gaps = 12/165 (7%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQ 118
GS+ S FDL + Q+ F + +QY S++E R IF N T+ + K + Q
Sbjct: 13 GSQAVSFFDLVQ-----EQWGAFKMTHNKQYQSETEERFRMKIFMENSHTVAKHNKLYAQ 67
Query: 119 GTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKS--TFETYSFNSSNSYGLAESINY 174
G ++ G+N++ADM EF L+ + + L+S + ++ +F + L I++
Sbjct: 68 GLVSFKLGINKYADMLHHEFVQVLNGFNRTK-SGLRSGESDDSVTFLPPANVQLPGQIDW 126
Query: 175 KDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+DKG V P V+DQ CGSCW+ SA LE + + +L+ LS+Q
Sbjct: 127 RDKGAVTP-VKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQ 170
>gi|326520659|dbj|BAJ92693.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 289
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 101/188 (53%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ +++ E+ Y++ E ERRF+ FR+NL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 43 YAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFADLTNEE 102
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ S+ + + + + + ++++ L ES++++ KG V V+DQ CGSCWA
Sbjct: 103 YR---STYLGARTKPDRERKLSARYQAADNDELPESVDWRKKGAV-GAVKDQGGCGSCWA 158
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
SA+A +E I ++I LS+Q ++ + GG+M+ Y+ +
Sbjct: 159 FSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMD----------YAFEFII 208
Query: 252 LNVGYDNE 259
N G D+E
Sbjct: 209 NNGGIDSE 216
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 86/166 (51%), Gaps = 11/166 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++ + ++ + R+Y + E R D+F+ NLK I+ + K + GVN FAD T+ EF
Sbjct: 37 DKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEF 96
Query: 137 ---NHGLSSLDWEQIENLKSTFETYSFNSSN-SYGLAESINYKDKGKVLPKVQDQHLCGS 192
+ GL L ++ K +T S + N S + ES +++ +G V P V+ Q CG
Sbjct: 97 LAIHTGLKGL--TEVSPSKVVAKTISSQTWNVSDMVVESKDWRAEGAVTP-VKYQGQCGC 153
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMN 234
CWA SAVA +E I L+ LS+Q + + R GG+M+
Sbjct: 154 CWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDRGCDGGIMS 199
>gi|358347416|ref|XP_003637753.1| Cysteine proteinase [Medicago truncatula]
gi|355503688|gb|AES84891.1| Cysteine proteinase [Medicago truncatula]
Length = 323
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 44/147 (29%), Positives = 77/147 (52%), Gaps = 12/147 (8%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
+++++ R Y D E E+RF IF NL+ I+ + + T G+N+F D+T EF +
Sbjct: 37 WMKDFGRTYADDVEKEKRFKIFAKNLEYIENFNRAGNETYELGLNQFLDLTKKEFTSKYT 96
Query: 142 ------SLDWEQIENLKSTFETYSFNSSNSYG-----LAESINYKDKGKVLPKVQDQHLC 190
L+ + ++ + F +++NS + ESI++++ G V V+ Q C
Sbjct: 97 CANLKGKLESSMVASVAALFNVSKISTNNSLKGKRKPIPESIDWREGGAV-TSVKRQGAC 155
Query: 191 GSCWAHSAVACLESAYAIKHNELIELS 217
SCWA + +A +E IK+ EL+ LS
Sbjct: 156 ASCWAFATLAAVEGIVQIKNRELVSLS 182
>gi|343472324|emb|CCD15484.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 75/145 (51%), Gaps = 11/145 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT---ATYGVNRFADMTDS 134
QF F ++Y R Y +E RF +F+ N++ K E AT+GV RF+DM+
Sbjct: 40 QFAAFKQKYSRSYKDATEEAFRFRVFKQNMER----AKEEAAANPYATFGVTRFSDMSPE 95
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + LK + + ++ + E+++++ KG V P V+DQ CGSCW
Sbjct: 96 EFRATYHNGAEYYAAALKRPRKVVTVSTGKA---PEAVDWRKKGAVTP-VKDQGQCGSCW 151
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+ +E + + +EL LS+Q
Sbjct: 152 AFSAIGNIEGQWKVAGHELTSLSEQ 176
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 82/144 (56%), Gaps = 9/144 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F ++ ++ + Y+S E RF+IFR+NL ID T + + G+N FAD+++ EF
Sbjct: 48 FDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDE-TNKKNNSYWLGLNGFADLSNDEFKK 106
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G + D+ +E+ + E +++ +Y +SI+++ KG V P V++Q CGSCWA
Sbjct: 107 KYVGFVAEDFTGLEHFDN--EDFTYKHVTNY--PQSIDWRAKGAVTP-VKNQGACGSCWA 161
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +A +E I L+ELS+Q
Sbjct: 162 FSTIATVEGINKIVTGNLLELSEQ 185
>gi|224077886|ref|XP_002305451.1| predicted protein [Populus trichocarpa]
gi|222848415|gb|EEE85962.1| predicted protein [Populus trichocarpa]
Length = 368
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 79/141 (56%), Gaps = 7/141 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F ++++ Y S E + RF +F+ NL+ + + + TA++GV +F+D+T +EF
Sbjct: 53 FSLFKSKFKKSYGSQEEHDYRFSVFKANLRRAARHQELDP-TASHGVTQFSDLTPAEFRK 111
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ L ++ K E +S+ L E +++DKG V P +++Q CGSCW+ SA
Sbjct: 112 QV--LGLRRLRLPKDANEAPILPTSD---LPEDFDWRDKGAVGP-IKNQGSCGSCWSFSA 165
Query: 199 VACLESAYAIKHNELIELSKQ 219
LE A+ + EL+ LS+Q
Sbjct: 166 TGALEGAHFLATGELVSLSEQ 186
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 72/141 (51%), Gaps = 3/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F+ +Y + Y S E RRF++F++NL ID K G G+N FAD+T EF
Sbjct: 52 FEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKKITGY-WLGLNEFADLTHDEFKA 110
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L S + + + + L + ++++ KG V +V++Q CGSCWA S
Sbjct: 111 AYLGLTLTPARR-NSNDQLFRYEEVEAASLPKEVDWRKKGAV-TEVKNQGQCGSCWAFST 168
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E AI L LS+Q
Sbjct: 169 VAAVEGINAIVTGNLTRLSEQ 189
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 81/147 (55%), Gaps = 17/147 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ EY + Y++ E ERRF+IF++NL+ +D + + G+N+F+D+T E+
Sbjct: 48 FESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTLEEY-- 105
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSY------GLAESINYKDKGKVLPKVQDQHLCGS 192
SS+ L + F+ N S+ Y L SI+++ KG VL V++Q CGS
Sbjct: 106 --SSI------YLGTKFDMRMTNVSDRYEPRVGDQLPNSIDWRKKGAVL-GVKNQGNCGS 156
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CW + +A +E+ I LI LS+Q
Sbjct: 157 CWTFAPIAAVEAINQIVTGNLISLSEQ 183
>gi|268570635|ref|XP_002640795.1| Hypothetical protein CBG15672 [Caenorhabditis briggsae]
Length = 396
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 85/155 (54%), Gaps = 16/155 (10%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
+ D +FK+F ++++R +++ ++ RF +F NLK I+ G A + +N F D
Sbjct: 87 QLTDSLRKFKEFNQKFQRIHENSDDLNFRFQLFSKNLKEIEILNSQNSG-AKFEINEFTD 145
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESIN------YKDKGKVLPKV 184
++ E S+D + ++NL + +NS LA S+N +++ GKV+ V
Sbjct: 146 RSEEELRR--YSMDQKFVKNLSN------LKFANSTILAGSLNRSGYRDWRNDGKVMS-V 196
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++Q CGSCWA S V+ +ES +AIK L LS+Q
Sbjct: 197 KNQGQCGSCWAFSIVSAVESQFAIKKGTLWSLSEQ 231
>gi|2098464|pdb|1PCI|A Chain A, Procaricain
gi|2098465|pdb|1PCI|B Chain B, Procaricain
gi|2098466|pdb|1PCI|C Chain C, Procaricain
Length = 322
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 82/144 (56%), Gaps = 11/144 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F ++ + + Y++ E RF+IF++NL ID T + + G+N FAD+++ EFN
Sbjct: 22 FNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDE-TNKKNNSYWLGLNEFADLSNDEFNE 80
Query: 139 G-LSSLDWEQIENLKSTFETYS--FNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ SL IE ++Y F + + L E+++++ KG V P V+ Q CGSCWA
Sbjct: 81 KYVGSLIDATIE------QSYDEEFINEDIVNLPENVDWRKKGAVTP-VRHQGSCGSCWA 133
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
SAVA +E I+ +L+ELS+Q
Sbjct: 134 FSAVATVEGINKIRTGKLVELSEQ 157
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 57/191 (29%), Positives = 94/191 (49%), Gaps = 29/191 (15%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
+++++ ++ + Y++ E ++RF IF++NL+ ID E T G+NRFAD+T+ E+
Sbjct: 40 YEEWLVKHGKLYNALGEKDKRFQIFKDNLRFIDQ-QNAENRTYKLGLNRFADLTNEEYRA 98
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYG------LAESINYKDKGKVLPKVQDQHLCGS 192
+ L T SN Y L +S++++ +G V+P V+DQ CGS
Sbjct: 99 RYLGTKIDPNRRLGRT-------PSNRYAPRVGETLPDSVDWRKEGAVVP-VKDQASCGS 150
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMNLPHMLCSKGPYSLN 248
CWA SA+ +E I +LI LS+Q Y GG+M+ Y+
Sbjct: 151 CWAFSAIGAVEGINKIVTGDLISLSEQELVDCDTGYNMGCNGGLMD----------YAFE 200
Query: 249 HAVLNVGYDNE 259
+ N G D+E
Sbjct: 201 FIIKNGGIDSE 211
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 74/132 (56%), Gaps = 6/132 (4%)
Query: 91 DSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQ 147
+S E ERRF F +NL +D + + G Y G+NRFAD+T+ EF + ++
Sbjct: 69 NSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLGMNRFADLTNDEFRAAYLGVKAQR 128
Query: 148 IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYA 207
+ E Y + + L E++++++KG V P V++Q CGSCWA SAV+ +ES
Sbjct: 129 ARPGRMVGERYRHDGAEE--LPEAVDWREKGAVAP-VKNQGQCGSCWAFSAVSTVESINQ 185
Query: 208 IKHNELIELSKQ 219
I E++ LS+Q
Sbjct: 186 IVTGEMVTLSEQ 197
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/170 (31%), Positives = 86/170 (50%), Gaps = 16/170 (9%)
Query: 51 LQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI 110
L+R++P E + +DL +HG R Y + + E +RRF +F +NL+ +
Sbjct: 46 LERTEP-----EVRAMYDLW-LAEHG-------RAYNALGEGEGERDRRFLVFWDNLRFV 92
Query: 111 DYYTKHEQGTA-TYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLA 169
D + + G+N+FAD+T+ EF E Y + + L
Sbjct: 93 DAHNERAGARGFRLGMNQFADLTNDEFRAAYLGAMVPAARRGAVVGERYRHDGAAEE-LP 151
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
ES+++++KG V P V++Q CGSCWA SAV+ +ES I E++ LS+Q
Sbjct: 152 ESVDWREKGAVAP-VKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQ 200
>gi|945081|gb|AAC49361.1| P21 [Petunia x hybrida]
Length = 358
Score = 76.3 bits (186), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 80/147 (54%), Gaps = 12/147 (8%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTD 133
H F F R Y ++YDS EI++RFDIF +NL+ I+ + +++G + GVN F+D+T
Sbjct: 55 HALSFARFARRYGKRYDSVEEIKQRFDIFLDNLEMIN--SHNDKGLSYKLGVNEFSDLTW 112
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFN-SSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
EF +++ ++ T N L E+ ++++ G V P V++Q CGS
Sbjct: 113 DEFRR-------DRLGAAQNCSATTKGNLKLRDAVLPETKDWREAGIVSP-VKNQGKCGS 164
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CW S LE+AY K + I LS+Q
Sbjct: 165 CWTFSTTGALEAAYTQKFGKGISLSEQ 191
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 48/157 (30%), Positives = 79/157 (50%), Gaps = 15/157 (9%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++F+ F Y R Y S E RRF+++R N+ I+ + T G N+FAD+T EF
Sbjct: 38 DRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQEF 97
Query: 137 NHGL---SSLD-----WEQIENLKSTFETYSFNSSNSYGLA------ESINYKDKGKVLP 182
+ +D W + + + + + + + Y A S++++ KG V P
Sbjct: 98 RAMYTMPARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVDWRSKGAVTP 157
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CG CWA + VA +E + IK +L+ LS+Q
Sbjct: 158 -VKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQ 193
>gi|26391875|sp|Q94714.1|CATL1_PARTE RecName: Full=Cathepsin L 1; Flags: Precursor
gi|1403087|emb|CAA62869.1| cathepsin L [Paramecium tetraurelia]
Length = 314
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 83/157 (52%), Gaps = 9/157 (5%)
Query: 65 STFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY 123
+T ++ + +D N + ++ +Y R+Y + + R+ +F +NL I +Y E+ T T
Sbjct: 12 NTQEVSDEIDTANLYANWKMKYNRRYTNQRDEMYRYKVFTDNLNYIRAFYESPEEATFTL 71
Query: 124 GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKV-LP 182
+N+FADM+ EF SL + L + NS+ Y AE +++ D KV P
Sbjct: 72 ELNQFADMSQQEFAQTYLSLKVPRTAKLNAA------NSNFQYKGAE-VDWTDNKKVKYP 124
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V++Q CGSCWA SAV LE I+ N ELS+Q
Sbjct: 125 AVKNQGSCGSCWAFSAVGALEINTDIELNRKYELSEQ 161
>gi|357437717|ref|XP_003589134.1| Cysteine proteinase [Medicago truncatula]
gi|355478182|gb|AES59385.1| Cysteine proteinase [Medicago truncatula]
Length = 299
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 56/190 (29%), Positives = 98/190 (51%), Gaps = 22/190 (11%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSE 135
+++++ ++ + Y+ E ++RF+IF++NLK ID +H +TY G+ RFAD+T+ E
Sbjct: 54 MYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFID---EHNGLNSTYRLGLTRFADLTNEE 110
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYG--LAESINYKDKGKVLPKVQDQHLCGSC 193
+ + +K + S + G L ES++++ +G V+ V+DQ CGSC
Sbjct: 111 YRSKFLGTKIDPNRRMKKLGGSKSNRYAPRVGDKLPESVDWRKEGAVV-GVKDQASCGSC 169
Query: 194 WAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNH 249
WA SA+A +E I +LI LS+Q ++ GG+M+ Y+
Sbjct: 170 WAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFEF 219
Query: 250 AVLNVGYDNE 259
+ N G D+E
Sbjct: 220 IISNGGIDSE 229
>gi|407859260|gb|EKG06954.1| cysteine protease, putative [Trypanosoma cruzi]
Length = 422
Score = 75.9 bits (185), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 71/142 (50%), Gaps = 4/142 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++ ++Y E +R IF+ NL + + + G+N+F+DMT EFN
Sbjct: 27 FEKYIADFGKRYADPEEHRKRAAIFKENLAEVRAFNGVLGRSYRLGINKFSDMTKEEFNA 86
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD-KGKVLPKVQDQHLCGSCWAHS 197
+ +Y + E++N+++ K VL V+DQ CGSCWAH+
Sbjct: 87 KFNGRVAAPQSTRSPQRASYKHTKAT---FPEALNWQEAKNPVLTPVKDQGSCGSCWAHA 143
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A +ES YAI +L+ LS Q
Sbjct: 144 ATESVESMYAISSGKLLTLSTQ 165
>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
Length = 360
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/152 (31%), Positives = 76/152 (50%), Gaps = 6/152 (3%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
D ++ L F F+ Y + Y ++E RF +F++NL+ + + + TA +GV R
Sbjct: 34 DQQQLLSAEAHFSSFLSRYGKSYADEAEHAYRFSVFKSNLRRARRHQRLDP-TAVHGVTR 92
Query: 128 FADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
FAD+T SEF L + ST + ++ L +++D G V P V++Q
Sbjct: 93 FADLTPSEFRRTYLGLR-RRPRTAGSTHDAPILPTNE---LPADFDWRDHGAVTP-VKNQ 147
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ SA LE A + L+ LS+Q
Sbjct: 148 GSCGSCWSFSAAGALEGANYLSTGNLVSLSEQ 179
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 84/160 (52%), Gaps = 11/160 (6%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ + R Y+ +E E R+ IF+ N++ I+ + K + G+N+FAD+T+ EF
Sbjct: 40 EEWMSRFGRVYNDGNEKEIRYKIFKENVQRIESFNKASGKSYKLGINQFADLTNEEFKTS 99
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ + F + ++ S S++++ KG V ++DQ CGSCWA SAV
Sbjct: 100 RNRFKGHMCSSQAGPFRYENLTAAPS-----SMDWRKKGAVTA-IKDQGQCGSCWAFSAV 153
Query: 200 ACLESAYAIKHNELIELSKQP-----PKTHGRFYKGGVMN 234
A +E + ++LI LS+Q K + +GG+M+
Sbjct: 154 AAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMD 193
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 58/189 (30%), Positives = 95/189 (50%), Gaps = 22/189 (11%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSE 135
QF + ++ + Y + E RF ++++NL +Y +H + +Y G+ +FAD+T+ E
Sbjct: 44 QFAAWAHKHGKVYSAAEERAHRFLVWKDNL---EYIQRHSEKNLSYWLGLTKFADLTNEE 100
Query: 136 FNHGLSSLDWEQIENLKSTFE-TYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
F + ++ LK T SF +NS +SI++++KG V V+DQ CGSCW
Sbjct: 101 FRRQYTGTRIDRSRRLKKGRNATGSFRYANSEA-PKSIDWREKGAV-TSVKDQGSCGSCW 158
Query: 195 AHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHA 250
A SAV +E AI+ + I LS Q K + + GG+M+ Y+ +
Sbjct: 159 AFSAVGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMD----------YAFDFV 208
Query: 251 VLNVGYDNE 259
+ N G D E
Sbjct: 209 IQNGGIDTE 217
>gi|154550449|gb|ABS83496.1| cysteine protease [Pinus pinaster]
Length = 187
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 82/142 (57%), Gaps = 11/142 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHG 139
++ E+++ Y+ E ++RF +F++N Y +H QG +Y G+N+FAD++ EF
Sbjct: 45 WLAEHKKAYNGLDEKQKRFTVFKDNFL---YIHEHNQGNRSYKLGLNKFADLSHEEFKAT 101
Query: 140 L--SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ LD ++ L+S Y + S+ L +SI+++ KG V P V+DQ CGSCWA S
Sbjct: 102 YLGAKLDTKK-RLLRSPSPRYQY--SDGEDLPKSIDWRVKGAVAP-VKDQGSCGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 158 TVAAVEGINQIVTGDLISLSEQ 179
>gi|28971813|dbj|BAC65418.1| cathepsin L [Pandalus borealis]
Length = 318
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 49/147 (33%), Positives = 80/147 (54%), Gaps = 12/147 (8%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGTATYGV--NRFADMTD 133
+++++F + + Y E R IF NN K ++ + + QG T+ + NRF DMT
Sbjct: 16 SEWENFKLTHAKVYTHGKEDLYRRSIFENNQKVVEEHNERFRQGLVTFDLKMNRFGDMTT 75
Query: 134 SEFNHGLSSLDWEQIE-NLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
EF ++ L+ ++E + F Y A++++++DKG V P V+DQ CGS
Sbjct: 76 EEFVSQMTGLN--KVERTVGKVFAHYP-----EVERADTVDWRDKGAVTP-VKDQGQCGS 127
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA S LE A+ +KH +L+ LS+Q
Sbjct: 128 CWAFSTTGALEGAHFLKHGDLVSLSEQ 154
>gi|449452572|ref|XP_004144033.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
gi|449500499|ref|XP_004161114.1| PREDICTED: thiol protease aleurain-like [Cucumis sativus]
Length = 356
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 61/190 (32%), Positives = 91/190 (47%), Gaps = 17/190 (8%)
Query: 46 VHNLILQRSQPNSYGSEEASTFDLEEF-----LDHGNQFKDFVREYERQYDSDSEIERRF 100
V + S P S+ +LE + H +F F Y ++Y++ E++ RF
Sbjct: 19 VAGSVFDDSNPIRMVSDRLRELELEVVRVLGQVPHALRFARFAHRYGKKYETAEEMKLRF 78
Query: 101 DIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTDSEF-NHGLSSLDWEQIENLKSTFETY 158
IF +L+ I + ++QG + GVN+FAD T EF H L + +N +T T
Sbjct: 79 GIFLESLELIK--STNKQGLSYKLGVNQFADWTWEEFRKHRLGA-----AQNCSAT--TK 129
Query: 159 SFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSK 218
+ L ES +++ G V P V+DQ CGSCW S LE+AYA H + I LS+
Sbjct: 130 GSHKLTDTALPESKDWRKDGIVSP-VKDQGHCGSCWTFSTTGALEAAYAQAHGKGISLSE 188
Query: 219 QPPKTHGRFY 228
Q GR +
Sbjct: 189 QQLVDCGRGF 198
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/166 (32%), Positives = 86/166 (51%), Gaps = 11/166 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++ + ++ + R+Y + E R D+F+ NLK I+ + K + GVN FAD T+ EF
Sbjct: 37 DKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEF 96
Query: 137 ---NHGLSSLDWEQIENLKSTFETYSFNSSN-SYGLAESINYKDKGKVLPKVQDQHLCGS 192
+ GL L ++ K +T S + N S + ES +++ +G V P V+ Q CG
Sbjct: 97 LAIHTGLKGL--TEVSPSKVVAKTISSQTWNVSDMVVESKDWRAEGAVTP-VKYQGQCGC 153
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMN 234
CWA SAVA +E I L+ LS+Q + + R GG+M+
Sbjct: 154 CWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDRDCDGGIMS 199
>gi|145538079|ref|XP_001454745.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124422522|emb|CAK87348.1| unnamed protein product [Paramecium tetraurelia]
Length = 324
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 83/157 (52%), Gaps = 9/157 (5%)
Query: 65 STFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY 123
+T ++ + +D N + ++ +Y R+Y + + R+ +F +NL I +Y E+ T T
Sbjct: 22 NTQEVSDEIDTANLYANWKMKYNRRYTNQRDEMYRYKVFTDNLNYIRAFYESPEEATFTL 81
Query: 124 GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKV-LP 182
+N+FADM+ EF SL + L + NS+ Y AE +++ D KV P
Sbjct: 82 ELNQFADMSQQEFAQTYLSLKVPRTAKLNAA------NSNFQYKGAE-VDWTDNKKVKYP 134
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V++Q CGSCWA SAV LE I+ N ELS+Q
Sbjct: 135 AVKNQGSCGSCWAFSAVGALEINTDIELNRKYELSEQ 171
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/142 (32%), Positives = 79/142 (55%), Gaps = 8/142 (5%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFN 137
+ ++ +Y + Y+ E E+RF IF+ N+ I+ T +++G Y GVN+F D+T+ EF
Sbjct: 40 QQWMGQYAKIYNDHQEWEKRFQIFKENVNYIE--TSNKEGGRFYKLGVNQFVDLTNEEF- 96
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
++ + + S T ++ N + +++++ KG V P V+DQ CG CWA S
Sbjct: 97 --IAPRNRFKGHMCSSIIRTNTYKYENVTTVPSNVDWRQKGAVTP-VKDQGQCGCCWAFS 153
Query: 198 AVACLESAYAIKHNELIELSKQ 219
AVA E + + +LI LS+Q
Sbjct: 154 AVAATEGIHQLSTGKLISLSEQ 175
>gi|281207374|gb|EFA81557.1| hypothetical protein PPL_05546 [Polysphondylium pallidum PN500]
Length = 341
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 80/146 (54%), Gaps = 5/146 (3%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
D +F+ ++ ++E+ Y DSE R + NL+T+ Y K G A + N+F+D++
Sbjct: 33 DFEGEFRQWMTKHEKSYADDSEYYLRLSHYIKNLRTVADYNKKHAGMAKFAPNKFSDLSI 92
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF G + ++ +ST + + + ++ + S++++ KG V P V++Q CGSC
Sbjct: 93 EEFRAGYLNYVPNKLIKDRSTKQNFDYPAN----IPVSLDWRQKGFVTP-VKNQEQCGSC 147
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA +E+AY + N +S+Q
Sbjct: 148 WAFSAGEQIETAYIMAGNAAQNVSEQ 173
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 77/140 (55%), Gaps = 4/140 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFADMTDSEFNHG 139
++ E+ R Y++ E +RRF +F +NL+ +D + + E G G+N+FAD+T+ EF
Sbjct: 55 WLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGF-RLGMNQFADLTNDEFRAA 113
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ + + L ES+++++KG V P V++Q CGSCWA SAV
Sbjct: 114 YLGARIPASRRRGTAVGERYRHGGGAEELPESVDWREKGAVAP-VKNQGQCGSCWAFSAV 172
Query: 200 ACLESAYAIKHNELIELSKQ 219
+ +ES I E++ LS+Q
Sbjct: 173 SSVESVNQIVTGEMVTLSEQ 192
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 98/188 (52%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ ++ E+ + Y++ E ERR+ FR+NL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ L + K + + ++++ L ES++++ KG V +++DQ CGSCWA
Sbjct: 100 YRDTYLGLRNKPRRERKVSDR---YLAADNEALPESVDWRTKGAV-AEIKDQGGCGSCWA 155
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
SA+A +E I +LI LS+Q ++ GG+M+ Y+ + +
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFDFII 205
Query: 252 LNVGYDNE 259
N G D E
Sbjct: 206 NNGGIDTE 213
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 98/188 (52%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ ++ E+ + Y++ E ERR+ FR+NL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 40 YAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ L + K + + ++++ L ES++++ KG V +++DQ CGSCWA
Sbjct: 100 YRDTYLGLRNKPRRERKVSDR---YLAADNEALPESVDWRTKGAV-AEIKDQGGCGSCWA 155
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
SA+A +E I +LI LS+Q ++ GG+M+ Y+ + +
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFDFII 205
Query: 252 LNVGYDNE 259
N G D E
Sbjct: 206 NNGGIDTE 213
>gi|168059933|ref|XP_001781954.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162666600|gb|EDQ53250.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 76/150 (50%), Gaps = 11/150 (7%)
Query: 72 FLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHE--QGTATYGVNRFA 129
L QF+ F++E+ + Y + E E RF +F++NL KH+ TA++GV F+
Sbjct: 49 LLGAEKQFESFIKEFGKVYHTVEEYEHRFKVFKSNLLRA---LKHQALDPTASHGVTMFS 105
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T+ EF L + T E L S ++++KG V P V++Q
Sbjct: 106 DLTEEEFATQYLGLKRPSALSTAPTAEPLPTGD-----LPPSFDWREKGAVGP-VKNQGS 159
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S +E A+ + +L+ LS+Q
Sbjct: 160 CGSCWAFSTTGAVEGAHFLATGKLLSLSEQ 189
>gi|71662527|ref|XP_818269.1| cysteine protease [Trypanosoma cruzi strain CL Brener]
gi|70883510|gb|EAN96418.1| cysteine protease, putative [Trypanosoma cruzi]
Length = 434
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 74/149 (49%), Gaps = 18/149 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++ ++Y E +R IF+ NL + + + G+N+F+DMT EFN
Sbjct: 39 FEKYIADFGKRYADPEEHRKRAAIFKENLAKVRAFNGALGRSYRLGINKFSDMTKEEFNA 98
Query: 139 GLS-------SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD-KGKVLPKVQDQHLC 190
+ S Q K T T+ E++N+++ K VL V+DQ C
Sbjct: 99 KFNGRVAAPQSTQSPQRAPYKRTKATFP----------EALNWQEAKNPVLTPVKDQGSC 148
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWAH+A +ES YAI +L+ LS Q
Sbjct: 149 GSCWAHAATESVESMYAISSGKLLTLSTQ 177
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 82/144 (56%), Gaps = 7/144 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ ++ + Y+ E + RF+IF++NL+ +D E + G+NRFAD+T+ E+
Sbjct: 43 YETWLVKHGKNYNGLGEKQLRFNIFKDNLRFVDE-RNSENLSFKLGLNRFADLTNEEYRS 101
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G + +S + Y+F + ++ L ES++++ KG V ++DQ CGSCWA
Sbjct: 102 VYLGTRPRSVAVARSGRSKSDRYAFRAGDT--LPESVDWRKKGAV-AGIKDQGSCGSCWA 158
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
SA+A +E I +LI LS+Q
Sbjct: 159 FSAIAAVEGVNQIVTGDLISLSEQ 182
>gi|15617524|ref|NP_258322.1| cathepsin-like cysteine proteinase [Spodoptera litura NPV]
gi|37077642|sp|Q91BH1.1|CATV_NPVST RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|15553260|gb|AAL01738.1|AF325155_50 cathepsin-like cysteine proteinase [Spodoptera litura NPV]
Length = 337
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 87/165 (52%), Gaps = 7/165 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
+++F++++ ++Y + + + F F+ NL ++ + A YG+N+F+D+ F +
Sbjct: 33 YENFIKQHNKEYTTPDQRDAAFVNFKRNLADMNA-MNNVSNQAVYGINKFSDIDKITFVN 91
Query: 139 GLSSLDWEQIENLKSTFETYSFN-----SSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
+ L I + S F+ Y + S ES +++ KV KV++Q +CGSC
Sbjct: 92 EHAGLVSNLINSTDSNFDPYRLCEYVTVAGPSARTPESFDWRKLNKV-TKVKEQGVCGSC 150
Query: 194 WAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHM 238
WA +A+ +ES YAI H+ LI+LS+Q R +G L H+
Sbjct: 151 WAFAAIGNIESQYAIMHDSLIDLSEQQLLDCDRVDQGCDGGLMHL 195
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 98/188 (52%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ ++ E+ + Y++ E ERR+ FR+NL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 41 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 100
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ L + K + + ++++ L ES++++ KG V +++DQ CGSCWA
Sbjct: 101 YRDTYLGLRNKPRRERKVSDR---YLAADNEALPESVDWRTKGAV-AEIKDQGGCGSCWA 156
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
SA+A +E I +LI LS+Q ++ GG+M+ Y+ + +
Sbjct: 157 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFDFII 206
Query: 252 LNVGYDNE 259
N G D E
Sbjct: 207 NNGGIDTE 214
>gi|340505373|gb|EGR31705.1| hypothetical protein IMG5_103490 [Ichthyophthirius multifiliis]
Length = 351
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/152 (35%), Positives = 79/152 (51%), Gaps = 11/152 (7%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
E ++ QF+ ++++Y + + RR +F NL+ I + GT T N+FA
Sbjct: 45 ETNINKDTQFQQWLQKYNVKLAGEDFTYRR-QVFFENLEKISKNNSEDNGT-TQEANQFA 102
Query: 130 DMTDSEFNHGLSSLDWEQ--IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
+T SEFN L +Q I+NLK + L SI+++ K V P V+DQ
Sbjct: 103 ILTSSEFNQMYKGLKRQQNNIQNLKLQI------VDETAPLPASIDWRKKKAVNP-VKDQ 155
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV LE AYAI + +L+ S+Q
Sbjct: 156 GQCGSCWAFSAVGGLEGAYAIANKKLVSFSEQ 187
>gi|443708542|gb|ELU03619.1| hypothetical protein CAPTEDRAFT_17807 [Capitella teleta]
Length = 350
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 76/144 (52%), Gaps = 5/144 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQGTATY--GVNRFADMTDSE 135
++DF +ER Y E +R+ ++FRNNLK I + HEQG + Y G+N+FADM +E
Sbjct: 43 WQDFKTVHERTYGETEESQRK-EVFRNNLKKIQAHNHLHEQGKSPYRMGINQFADMEANE 101
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
F ++ ++ + + + ++++ +G V P V++Q CGSCWA
Sbjct: 102 FASIMNGFRMNNRTEVRDHLHANYISPAIPVSVPAEVDWRKEGYVTP-VKNQGQCGSCWA 160
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S LE + K +L+ LS+Q
Sbjct: 161 FSTTGSLEGQHFRKTGKLVSLSEQ 184
>gi|121531600|gb|ABM55485.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 326
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/146 (36%), Positives = 80/146 (54%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y + E + RF IF+ NL I ++ ++++G TY GV RFAD+T
Sbjct: 21 DQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFADLTH 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF L QI+N K + +SI++ +KG VL +V+DQ+ CGSC
Sbjct: 81 EEFKDILKG----QIKN-KPRLNATPTVFPEDLEVPDSIDWTEKGAVL-EVKDQNPCGSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA LE AI +N I LS+Q
Sbjct: 135 WAFSATGALEGQNAILNNVKISLSEQ 160
>gi|289740839|gb|ADD19167.1| cysteine proteinase cathepsin F [Glossina morsitans morsitans]
Length = 471
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 76/144 (52%), Gaps = 12/144 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F +++R Y + E + RF IF+ NL+ I+ ++EQG+A YG+ FADMT E+
Sbjct: 166 FAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITEFADMTSPEYKQ 225
Query: 139 --GLSSLDWEQ-IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
GL D ++ N K+ L + ++++KG + V++Q CGSCWA
Sbjct: 226 RTGLWQRDPQKAASNPKAEIPNID--------LPKEFDWREKG-AISAVKNQGNCGSCWA 276
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E +A++ L + S+Q
Sbjct: 277 FSVTGNIEGLHAVRTGVLEQYSEQ 300
Score = 39.3 bits (90), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 60/133 (45%), Gaps = 27/133 (20%)
Query: 130 DMTDSEFNHGLSSLDWEQIEN-----LKSTFETYSFNSSNSYGLAESINYKDKGKV-LPK 183
D +DS N GL +E IE L+S + Y + + I+ K KG V LPK
Sbjct: 306 DTSDSACNGGLPDNAYEAIEKIGGLELESDY-PYHARKDQCHFNSTKIHVKVKGHVDLPK 364
Query: 184 VQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG 243
+ + A +A + I N + +FY+GGV + PH+LCS+
Sbjct: 365 NE------TAIAQWLIANGPISIGINANAM------------QFYRGGVSHPPHILCSR- 405
Query: 244 PYSLNHAVLNVGY 256
+L+H VL VGY
Sbjct: 406 -KNLDHGVLIVGY 417
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 76/145 (52%), Gaps = 20/145 (13%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ Y R Y +E + R+ IF+ N++ ID + + VN+FAD+T+ EF
Sbjct: 42 WMASYARVYKDANEKQMRYKIFKENVQRIDSFNSESDKSYKLAVNQFADLTNEEF----- 96
Query: 142 SLDWEQIENLKSTFETYS-------FNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
++L++ F+ + F N + SI+++ KG V ++++Q CGSCW
Sbjct: 97 -------KSLRNGFKGHMCSAQAGHFRYENVTAVPASIDWRKKGAV-TQIKEQGQCGSCW 148
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SAVA +E IK +LI LS+Q
Sbjct: 149 AFSAVAAVEGITEIKTGKLISLSEQ 173
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 76/141 (53%), Gaps = 6/141 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ ++ R Y+ E ERRF IFR+N + I+ + + T G+N FADMT EF
Sbjct: 34 YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK- 92
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+L + L +T ++ F ++ L +++ KG V V++Q CGSCWA S
Sbjct: 93 ---ALYFGTKVPLSNTIKS-GFRYKDATNLPLDTDWRSKGAV-ATVKNQGACGSCWAFST 147
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I EL+ LS+Q
Sbjct: 148 VAAVEGVNQIVTGELVSLSEQ 168
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 98/188 (52%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ ++ E+ + Y++ E ERR+ FR+NL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 40 YAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ L + K + + ++++ L ES++++ KG V +++DQ CGSCWA
Sbjct: 100 YRDTYLGLRNKPRRERKVSDR---YLAADNEALPESVDWRTKGAV-AEIKDQGGCGSCWA 155
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
SA+A +E I +LI LS+Q ++ GG+M+ Y+ + +
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFDFII 205
Query: 252 LNVGYDNE 259
N G D E
Sbjct: 206 NNGGIDTE 213
>gi|83944664|gb|ABC48936.1| cathepsin F like protease [Glossina morsitans morsitans]
Length = 471
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 76/144 (52%), Gaps = 12/144 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F +++R Y + E + RF IF+ NL+ I+ ++EQG+A YG+ FADMT E+
Sbjct: 166 FAKFQIKFKRNYHTTMEKQMRFRIFKQNLQLIEELNRNEQGSAKYGITEFADMTSPEYKQ 225
Query: 139 --GLSSLDWEQ-IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
GL D ++ N K+ L + ++++KG + V++Q CGSCWA
Sbjct: 226 RTGLWQRDPQKAASNPKAEIPNID--------LPKEFDWREKGAI-SAVKNQGNCGSCWA 276
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E +A++ L + S+Q
Sbjct: 277 FSVTGNIEGLHAVRTGVLEQYSEQ 300
Score = 39.3 bits (90), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 41/133 (30%), Positives = 60/133 (45%), Gaps = 27/133 (20%)
Query: 130 DMTDSEFNHGLSSLDWEQIEN-----LKSTFETYSFNSSNSYGLAESINYKDKGKV-LPK 183
D +DS N GL +E IE L+S + Y + + I+ K KG V LPK
Sbjct: 306 DTSDSACNGGLPDNAYEAIEKIGGLELESDY-PYHARKDQCHFNSTKIHVKVKGHVDLPK 364
Query: 184 VQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG 243
+ + A +A + I N + +FY+GGV + PH+LCS+
Sbjct: 365 NE------TAIAQWLIANGPISIGINANAM------------QFYRGGVSHPPHILCSR- 405
Query: 244 PYSLNHAVLNVGY 256
+L+H VL VGY
Sbjct: 406 -KNLDHGVLIVGY 417
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 75.9 bits (185), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 98/188 (52%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ ++ E+ + Y++ E ERR+ FR+NL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ L + K + + ++++ L ES++++ KG V +++DQ CGSCWA
Sbjct: 100 YRDTYLGLRNKPRRERKVSDR---YLAADNEALPESVDWRTKGAV-AEIKDQGGCGSCWA 155
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
SA+A +E I +LI LS+Q ++ GG+M+ Y+ + +
Sbjct: 156 FSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFDFII 205
Query: 252 LNVGYDNE 259
N G D E
Sbjct: 206 NNGGIDTE 213
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 55/191 (28%), Positives = 101/191 (52%), Gaps = 27/191 (14%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ +++ + R Y++ E ERRF++FR+NL+ +D + + G ++ G+NRFAD+T+ E
Sbjct: 46 YAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTNDE 105
Query: 136 FNH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
+ G+ S + + + + Y + ++ L ES++++ KG V +V+DQ CGS
Sbjct: 106 YRATYLGVRS----RPQRERRLGDRYL--AGDNEDLPESVDWRAKGAV-AEVKDQGSCGS 158
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLN 248
CWA S +A +E I ++I LS+Q ++ + GG+M+ Y+
Sbjct: 159 CWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMD----------YAFE 208
Query: 249 HAVLNVGYDNE 259
+ N G D E
Sbjct: 209 FIINNGGIDTE 219
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 78/141 (55%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++++ Y S E RF++F++NLK ID + E + G+N FAD+T EF
Sbjct: 49 FEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINR-EVTSYWLGLNEFADLTHDEFKA 107
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
LD + + ++ + ++ L +S++++ KG V +V++Q CGSCWA S
Sbjct: 108 AYLGLDAAPAR--RGSSRSFRYEDVSASDLPKSVDWRKKGAV-TEVKNQGQCGSCWAFST 164
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E AI L LS+Q
Sbjct: 165 VAAVEGINAIVTGNLTALSEQ 185
>gi|427778331|gb|JAA54617.1| Putative cysteine proteinase cathepsin f [Rhipicephalus pulchellus]
Length = 361
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/119 (36%), Positives = 65/119 (54%), Gaps = 3/119 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F R Y + Y E E RF IF+NNLK I + + E+GTA YG+ F+D++ SEF
Sbjct: 34 FSVFARTYNKTYKDKEEHEARFMIFKNNLKRIALFNRLEEGTAHYGLTEFSDLSPSEFER 93
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L + E+ K+ + N L + +++ KG V +V++Q +CGSCWA S
Sbjct: 94 HYLGLKKDLAEH-KAEVKPIKVGPVNE-PLPDLFDWRTKGAVT-EVKNQGMCGSCWAFS 149
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 77/138 (55%), Gaps = 9/138 (6%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ ++ R Y + +E RF+IFR N++ I+ + E GVN+FAD+T+ EF +
Sbjct: 44 WMAQHGRVYKNAAEKAHRFEIFRANVERIESFNA-ENHKFKLGVNQFADLTNEEFKT-RN 101
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+L ++ + KS F N + +++++ KG V P ++DQ CGSCWA SAVA
Sbjct: 102 TLKPSKMASTKS------FKYENVTAVPATMDWRTKGAVTP-IKDQGQCGSCWAFSAVAA 154
Query: 202 LESAYAIKHNELIELSKQ 219
E + +LI LS+Q
Sbjct: 155 TEGITKLSTGKLISLSEQ 172
>gi|343477445|emb|CCD11724.1| unnamed protein product, partial [Trypanosoma congolense IL3000]
Length = 380
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 75/145 (51%), Gaps = 11/145 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT---ATYGVNRFADMTDS 134
QF F ++Y R Y +E RF +F+ N++ K E AT+GV RF+DM+
Sbjct: 40 QFAAFKQKYSRSYKDATEEAFRFRVFKQNMER----AKEEAAANPYATFGVTRFSDMSPE 95
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + LK + + ++ + E+++++ KG V P V+DQ CGSCW
Sbjct: 96 EFRATYHNGAEYYAAALKRPRKVVNVSTGKA---PEAVDWRKKGAVTP-VKDQGQCGSCW 151
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+ +E + + +EL LS+Q
Sbjct: 152 AFSAIGNIEGQWKVAGHELTSLSEQ 176
>gi|33242865|gb|AAQ01137.1| cathepsin [Branchiostoma lanceolatum]
Length = 328
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 77/145 (53%), Gaps = 5/145 (3%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH-EQGTATY--GVNRFADMTDS 134
Q++ F + Y R Y ++ E RR IF +NLKTI + + ++G T+ GVN++ADMT
Sbjct: 22 QWEVFKKAYNRVYAAEEEFARRL-IFEDNLKTIQMHNEEADRGLHTFRLGVNQYADMTHK 80
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + KST + L ++++++DKG V P V++Q CGSCW
Sbjct: 81 EFLENVIGGCLLDTNTSKSTADHVHEYDPTLTDLPDTVDWRDKGYVTP-VKNQEQCGSCW 139
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A S LE + +L+ LS+Q
Sbjct: 140 AFSTTGSLEGQHFKSTQKLVSLSEQ 164
>gi|21218381|gb|AAM44058.1|AF510740_1 cathepsin L1 [Schistosoma japonicum]
Length = 317
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 84/154 (54%), Gaps = 11/154 (7%)
Query: 67 FDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN 126
F+L E + G + F Y +QY +++ E+RF IF++NL Y E+G+A YGV
Sbjct: 10 FELPE--NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVT 66
Query: 127 RFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG-LAESINYKDKGKVLPKVQ 185
++D+T EF+ + W +++ + + G + + ++++KG V +V+
Sbjct: 67 PYSDLTTDEFSRTHLTAPW------RASSKRNTIPPRREVGDIPNNFDWREKGAV-TEVK 119
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+Q +CGSCWA S +ES + K +L+ LS+Q
Sbjct: 120 NQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQ 153
>gi|408009|gb|AAA18215.1| cysteine protease precursor [Trypanosoma congolense]
Length = 444
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 75/145 (51%), Gaps = 11/145 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT---ATYGVNRFADMTDS 134
QF F ++Y R Y +E RF +F+ N++ K E AT+GV RF+DM+
Sbjct: 40 QFAAFKQKYSRSYKDATEEAFRFRVFKQNMER----AKEEAAANPYATFGVTRFSDMSPE 95
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + LK + + ++ + E+++++ KG V P V+DQ CGSCW
Sbjct: 96 EFRATYHNGAEYYAAALKRPRKVVNVSTGKA---PEAVDWRKKGAVTP-VKDQGQCGSCW 151
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+ +E + + +EL LS+Q
Sbjct: 152 AFSAIGNIEGQWKVAGHELTSLSEQ 176
>gi|414887429|tpg|DAA63443.1| TPA: hypothetical protein ZEAMMB73_816727 [Zea mays]
Length = 334
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/187 (26%), Positives = 87/187 (46%), Gaps = 23/187 (12%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ D+E+ L ++F+ + Y R Y + +E RRF+++R N++ I+ +
Sbjct: 43 GAASGGRVDVEDML-MMDRFRGWQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRAGL 101
Query: 120 TATYGVNRFADMTDSEF--NHGLSS-------------LDWEQIENLKSTFETYSFNSSN 164
+ G F D+T EF H +S+ L + ++ N +
Sbjct: 102 SYQLGETPFTDLTSEEFLATHTMSTRLHASEAARRHRELITTHAGPVSDGGRQWNRNYTT 161
Query: 165 SYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQP---- 220
+ ES++++ KG V P V+DQ CGSCW+ VA +E + I+ +L+ LS+Q
Sbjct: 162 DLDVPESVDWRTKGAVTP-VKDQGACGSCWSFVTVAAIEGLHKIRTGQLVSLSEQAVLDC 220
Query: 221 --PKTHG 225
P HG
Sbjct: 221 SSPPNHG 227
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 76/141 (53%), Gaps = 6/141 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ ++ R Y+ E ERRF IFR+N + I+ + + T G+N FADMT EF
Sbjct: 34 YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFK- 92
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+L + L +T ++ F ++ L +++ KG V V++Q CGSCWA S
Sbjct: 93 ---ALYFGTKVPLSNTIKS-GFRYEDATNLPLDTDWRSKGAV-ATVKNQGACGSCWAFST 147
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I EL+ LS+Q
Sbjct: 148 VAAVEGVNQIVTGELVSLSEQ 168
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 74/143 (51%), Gaps = 12/143 (8%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN-- 137
++++ ++ R Y E E+R+ IF+ N++ I+ + GVN+FAD+T+ EF
Sbjct: 41 EEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAM 100
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
HG + S + SF N + S+++++ G V P V+DQ CG CWA
Sbjct: 101 YHGY--------KRQSSKLMSSSFRYENLSDIPTSMDWRNDGAVTP-VKDQGTCGCCWAF 151
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S VA +E ++ LI LS+Q
Sbjct: 152 STVAAIEGIIKLQTGNLISLSEQ 174
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 54/183 (29%), Positives = 95/183 (51%), Gaps = 9/183 (4%)
Query: 69 LEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNR 127
L++ L + +++ E+ R Y +E R+ +F+ N++ I+ G T VN+
Sbjct: 28 LDDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQ 87
Query: 128 FADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSY--GLAESINYKDKGKVLPKVQ 185
FAD+T+ EF + + + +S ++ SF N + L +++++ KG V P ++
Sbjct: 88 FADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNVFFGALPIAVDWRKKGAVTP-IK 146
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP---KTHGRFYKGGVMN--LPHMLC 240
+Q CG CWA SAVA +E A IK +LI LS+Q T+ GG+M+ H++
Sbjct: 147 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCSGGLMDTAFEHIMA 206
Query: 241 SKG 243
+ G
Sbjct: 207 TGG 209
>gi|443685370|gb|ELT89004.1| hypothetical protein CAPTEDRAFT_95613, partial [Capitella teleta]
Length = 295
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 81/150 (54%), Gaps = 9/150 (6%)
Query: 93 DSEIERRFDIFRNNLKTIDYYT-KHEQGTA--TYGVNRFADMTDSEFNHGLSSLDWEQIE 149
++E +R ++FRNN+K I + HEQG + T G+N+F+DM + EF+ ++
Sbjct: 1 ETEENQRKEVFRNNIKKIQMHNYLHEQGKSPFTMGINQFSDMDEKEFSTIMNGFRMNNRT 60
Query: 150 NLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIK 209
++ ++ + + + ++++ KG V P V++Q CGSCWA SA+ LE + K
Sbjct: 61 KVRDHLHSHYISPAIPVSVPAEVDWRKKGYVTP-VKNQGQCGSCWAFSAIGALEGQHFRK 119
Query: 210 HNELIELSKQPPKTHGRFY-----KGGVMN 234
+L+ LS+Q + Y GGVM+
Sbjct: 120 TGKLVSLSEQNLVDCSKSYGNNGCNGGVMD 149
>gi|308465858|ref|XP_003095186.1| hypothetical protein CRE_22071 [Caenorhabditis remanei]
gi|308246042|gb|EFO89994.1| hypothetical protein CRE_22071 [Caenorhabditis remanei]
Length = 326
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 72/136 (52%), Gaps = 6/136 (4%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDS 134
+ N F+DF+ +Y R+Y ++ E+ RF IF N+ ++ Y K + G TY +N F+D++D
Sbjct: 29 YTNAFQDFLVKYLREYKTEDELVMRFTIFSRNMDLVERYNKEDLGKVTYELNDFSDLSDE 88
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD-KGKV-LPKVQDQHLCGS 192
E+ L + S F + ES+++++ KG + ++ Q CGS
Sbjct: 89 EWKKFLMTPK----PKSPSKSAAKPFTPKEKRVIPESVDWRNVKGNNHVTGIKYQGPCGS 144
Query: 193 CWAHSAVACLESAYAI 208
CWA + A +ESA +I
Sbjct: 145 CWAFATAAAIESAVSI 160
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 79/148 (53%), Gaps = 10/148 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
++ ++ ++ + Y++ E E+RF IF++NL+ ID + + T G+N+FAD+T+ EF
Sbjct: 53 YESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKVGLNKFADLTNEEFRS 112
Query: 138 ------HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
SS K + Y F + L E+++++ G V KV+DQ CG
Sbjct: 113 VYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDE--LPEAVDWRKNGAV-AKVKDQGQCG 169
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
SCWA S +A +E I EL+ LS+Q
Sbjct: 170 SCWAFSTIAAVEGINQIVTGELLSLSEQ 197
>gi|303275866|ref|XP_003057227.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226461579|gb|EEH58872.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 329
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/149 (30%), Positives = 75/149 (50%), Gaps = 7/149 (4%)
Query: 75 HGNQFKDFVREYERQYDSDS-EIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
H F FV E+ + Y SD+ E +R +IF N+ + + A YG FAD+T+
Sbjct: 4 HERDFDAFVLEHGKTYASDAKEYAKRLEIFAENMARAKEMSARD--GAEYGATPFADLTE 61
Query: 134 SEFNHGL---SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
EF L +D ++E LK + + + + +++ G V P V++Q +C
Sbjct: 62 DEFASSLLMREPIDAARVERLKRHESSRVLPHLPTENIPLNFDWRALGAVTP-VKNQGMC 120
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCW+ SA +E A+ +K L+ LS+Q
Sbjct: 121 GSCWSFSATGAVEGAHFVKSGALVSLSEQ 149
>gi|219362839|ref|NP_001136636.1| uncharacterized protein LOC100216764 precursor [Zea mays]
gi|194696462|gb|ACF82315.1| unknown [Zea mays]
gi|413934556|gb|AFW69107.1| hypothetical protein ZEAMMB73_554980 [Zea mays]
Length = 361
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 59/176 (33%), Positives = 90/176 (51%), Gaps = 34/176 (19%)
Query: 61 SEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT 120
SEE+ E + H N +D E RRF++F+ N I +H QG
Sbjct: 39 SEESLWALYERWCAHYNMARDL-----------GEKTRRFNLFKENAHRI---YEHNQGN 84
Query: 121 ATY--GVNRFADMTDSEFNHG---------LSSLDWEQIENLKSTFETYSFN-----SSN 164
ATY G+NRF+DMTD EF+ + + + E L+ E SFN ++
Sbjct: 85 ATYTLGLNRFSDMTDEEFSRSPYGRCLFAPVQRISDGENEELQQ-HEDVSFNLTHGGATA 143
Query: 165 SYGLAESINYKDKGKVLPKVQDQHL-CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ GL S++++ G+ + +V+DQ L CGSCWA +A+A +E AI+ L+ LS+Q
Sbjct: 144 ALGLPPSVDWR--GRSVTRVKDQGLTCGSCWAFAAIAAVEGINAIRTWSLVTLSEQ 197
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 85/160 (53%), Gaps = 11/160 (6%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ ++R Y E E R+ IF+ N++ I+ + K + + G+N+FAD+T+ EF
Sbjct: 40 EEWMTRFKRVYSDAKEKEIRYKIFKENVQRIESFNKASEKSYKLGINQFADLTNEEFKTS 99
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ + + + + F N + S++++ +G V ++DQ CGSCWA SAV
Sbjct: 100 RN-----RFKGHMCSSQAGPFRYENITAVPSSMDWRKEGAVTA-IKDQGQCGSCWAFSAV 153
Query: 200 ACLESAYAIKHNELIELSKQP-----PKTHGRFYKGGVMN 234
A +E + ++LI LS+Q K + +GG+M+
Sbjct: 154 AAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMD 193
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/143 (32%), Positives = 75/143 (52%), Gaps = 12/143 (8%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN-- 137
++++ ++ R Y E E+R+ IF+ N++ I+ + GVN+FAD+T+ EF
Sbjct: 6 EEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFRAM 65
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
HG Q L S+ SF N + S+++++ G V P V+DQ CG CWA
Sbjct: 66 YHGYK----RQSSKLMSS----SFRYENLSDIPTSMDWRNDGAVTP-VKDQGTCGCCWAF 116
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S VA +E ++ LI LS+Q
Sbjct: 117 STVAAIEGIIKLQTGNLISLSEQ 139
>gi|56718883|gb|AAW28152.1| westerpain-10 [Paragonimus westermani]
Length = 327
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 78/143 (54%), Gaps = 13/143 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
++ F R Y + Y ++ + ++RF IF++NL +QGTA YGV +F+D+T EF
Sbjct: 32 YEQFKRGYGKVYANEDD-QKRFAIFKDNLVRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 90
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ + ++ +Q++ ++ T E I+++ KG V V++Q CGSCWA
Sbjct: 91 KYLSAPVNDDQVKRMRPT---------GLKAAPERIDWRAKGAVT-AVENQGSCGSCWAF 140
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E + IK +L+ LSKQ
Sbjct: 141 STAGNVEGQWFIKTGQLVSLSKQ 163
>gi|6649577|gb|AAF21462.1|U69121_1 cysteine proteinase PWCP2 [Paragonimus westermani]
Length = 260
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 78/143 (54%), Gaps = 13/143 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
++ F R Y + Y ++ + ++RF IF++NL +QGTA YGV +F+D+T EF
Sbjct: 6 YEQFKRXYGKVYANEDD-QKRFAIFKDNLMRAQKLQLKDQGTARYGVTQFSDLTPEEFAA 64
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ + ++ +Q++ ++ T E I+++ KG V V++Q CGSCWA
Sbjct: 65 KYLSAPVNNDQVKRVRPT---------GLKAAPERIDWRAKGAVT-AVENQGSCGSCWAF 114
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E + IK +L+ LSKQ
Sbjct: 115 STAGNVEGQWFIKTGQLVSLSKQ 137
>gi|18414611|ref|NP_567489.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|2244977|emb|CAB10398.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|7268368|emb|CAB78661.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|14517442|gb|AAK62611.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22136546|gb|AAM91059.1| AT4g16190/dl4135w [Arabidopsis thaliana]
gi|22530956|gb|AAM96982.1| cysteine proteinase [Arabidopsis thaliana]
gi|23397184|gb|AAN31875.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|110740834|dbj|BAE98514.1| cysteine proteinase like protein [Arabidopsis thaliana]
gi|332658313|gb|AEE83713.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 373
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 79/150 (52%), Gaps = 5/150 (3%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
E+ L+ + F F +YE+ Y + E + RF +F+ NL+ + +A +GV +F+
Sbjct: 46 EQLLNAEHHFTLFKSKYEKTYATQVEHDHRFRVFKANLRRA-RRNQLLDPSAVHGVTQFS 104
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T EF L + + +T + + L +++++G V P V++Q +
Sbjct: 105 DLTPKEFRRKFLGL---KRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVTP-VKNQGM 160
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ SA+ LE A+ + EL+ LS+Q
Sbjct: 161 CGSCWSFSAIGALEGAHFLATKELVSLSEQ 190
>gi|118350036|ref|XP_001008299.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89290066|gb|EAR88054.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 332
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 73/146 (50%), Gaps = 12/146 (8%)
Query: 76 GNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSE 135
++++F++++ Y + E RF +FR+NLK I +G + YG+ +F D+T E
Sbjct: 40 AQKWQEFLKKHSITYKTIEEKLHRFAVFRDNLKKI-------EGHSNYGITKFMDLTSEE 92
Query: 136 FNHGLSSLDWEQI--ENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
F L I +N KS + N G I++ KG V P V+DQ CGSC
Sbjct: 93 FQQRYLRLKTNTIKRQNFKSNPKNAQLNMK--LGDDIIIDWTKKGAVTP-VKDQEQCGSC 149
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA LESA I L LS+Q
Sbjct: 150 WAFSATGALESATFISTGTLPSLSEQ 175
>gi|170064305|ref|XP_001867470.1| cathepsin l [Culex quinquefasciatus]
gi|167881732|gb|EDS45115.1| cathepsin l [Culex quinquefasciatus]
Length = 547
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 79/158 (50%), Gaps = 35/158 (22%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++F F ++ + Y+ + E +RR DIFR NL+ I + + +G T VN AD TD
Sbjct: 242 DEFTRFKYKHGKTYNGEKEHDRRQDIFRQNLRFIHSHNRANKGY-TVAVNHLADRTD--- 297
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYG---------------LAESINYKDKGKVL 181
E+I+ L+ F SSNSY L ES++++ G V
Sbjct: 298 ---------EEIQALRG------FKSSNSYNGGQPFPYNVKDFMDELPESLDWRIPGAVT 342
Query: 182 PKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P V+DQ +CGSCW+ +ESAY +K +L+ S+Q
Sbjct: 343 P-VKDQSVCGSCWSFGTAGHIESAYFLKTKKLMRFSQQ 379
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 83/146 (56%), Gaps = 10/146 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTAT----YGVNRFADMTDS 134
+ ++ ++ + Y++ E E+RF IFR+NL+ ID + + G G+N+FAD+T+
Sbjct: 5 LQSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTND 64
Query: 135 EFNHGLSSLDW-EQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF + E+ E++KS + Y+ + L ES++++ KG V V+DQ CGSC
Sbjct: 65 EFRRIYFGVKRPEKAESVKS--DRYAVKEGDE--LPESVDWRKKGAV-SHVKDQGQCGSC 119
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA+ +E I +LI LS+Q
Sbjct: 120 WAFSAIGAVEGINKIVTGDLITLSEQ 145
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 71/141 (50%), Gaps = 9/141 (6%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ E+ R Y ++E RR ++FR N + ID + + NRFAD+T EF +
Sbjct: 41 WMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVEEFRAART 100
Query: 142 SLDWEQIENLKST---FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L + + +E +S + A+S++++ G V V+DQ CG CWA SA
Sbjct: 101 GLRPRPAPSAGAGRFRYENFSLADA-----AQSVDWRAMGAVT-GVKDQGACGCCWAFSA 154
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I+ L+ LS+Q
Sbjct: 155 VAAVEGLNKIRTGRLVSLSEQ 175
>gi|209732052|gb|ACI66895.1| Cathepsin H precursor [Salmo salar]
Length = 275
Score = 75.5 bits (184), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 66/216 (30%), Positives = 98/216 (45%), Gaps = 47/216 (21%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT--ATYGVNRFADMTDSEF 136
FK ++ +Y + YD + E R IF N + IDY H +G T G+N+F+D+T +EF
Sbjct: 28 FKLWMSQYNKVYDME-EYYHRLQIFIENKRRIDY---HNEGNHKFTMGLNQFSDLTFAEF 83
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
S + +N +T ++ +S+ Y ES++++ KG + V++Q CGSCW
Sbjct: 84 R---KSFLLTEPQNCSATKGSH-VSSNGPY--PESVDWRKKGNYVTAVKNQGSCGSCWTF 137
Query: 197 SAVACLESAYAIKHNELIELSKQP------------------PKTHGRF----------- 227
S CLES AI +L++LS+Q RF
Sbjct: 138 STTGCLESVTAIATGKLLQLSEQQLVDCAQAFNITKYDEMGMVDAVARFNPVSLAYEVTS 197
Query: 228 ----YKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE 259
Y GGV C ++NHAVL VGY E
Sbjct: 198 DFMHYDGGVYTSKE--CHNTTDTVNHAVLAVGYGEE 231
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 46/152 (30%), Positives = 79/152 (51%), Gaps = 4/152 (2%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
+L + L + + ++ +Y R Y +E ++F++F+ N + ID + E G+N+
Sbjct: 26 ELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFKANARFIDSFNA-ENHKFWLGINQ 84
Query: 128 FADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
FAD+T+ EF ++ + I N + + + L SI+++ KG V P V+DQ
Sbjct: 85 FADLTNEEFKATKTNKGF--ISNKARVSTGFKYENLKIEALPTSIDWRTKGAVTP-VKDQ 141
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CG CWA SAVA E + +L+ LS+Q
Sbjct: 142 GQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQ 173
>gi|56755191|gb|AAW25775.1| SJCHGC00511 protein [Schistosoma japonicum]
Length = 454
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 89/167 (53%), Gaps = 11/167 (6%)
Query: 54 SQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY 113
S P + E F+L E + G + F Y +QY +++ E+RF IF++NL Y
Sbjct: 134 SIPRMPQNLEYLGFELPE--NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLY 190
Query: 114 TKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG-LAESI 172
E+G+A YGV ++D+T EF+ + W +++ + + + G + +
Sbjct: 191 QVLERGSAVYGVTPYSDLTTDEFSRTHLTAPW------RASSKRNTISPRREVGDIPNNF 244
Query: 173 NYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++++KG V +V++Q +CGSCWA S +ES + K +L+ LS+Q
Sbjct: 245 DWREKGAV-TEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQ 290
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 75.1 bits (183), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/128 (36%), Positives = 68/128 (53%), Gaps = 6/128 (4%)
Query: 94 SEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQIENL 151
E +RRF F +NL+ +D + G Y G+NRFAD+T++EF S
Sbjct: 68 GEHDRRFRAFWDNLRFVDAHNAR-AGARGYRLGINRFADLTNAEFRAAYLSAGARNGTAT 126
Query: 152 KSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN 211
+T E Y + + L E ++++ KG V P V++Q CGSCWA SAV +E I
Sbjct: 127 AATGERYRHDGVEA--LPEFVDWRQKGAVAP-VKNQGQCGSCWAFSAVGAVEGINQIVTG 183
Query: 212 ELIELSKQ 219
EL+ LS+Q
Sbjct: 184 ELVTLSEQ 191
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 74/141 (52%), Gaps = 10/141 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH-EQGTATYGV--NRFADMTDSEFNH 138
F +Y R+Y E R +F+ N + ++ + K E G T+ V N+F DMT+ EFN
Sbjct: 15 FKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEEFNA 74
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ S E + ++ +A ++++ KG V P V+DQ CGSCWA SA
Sbjct: 75 VMKGY------KKGSRGEPTTVFTAEGRPMAADVDWRTKGAVTP-VKDQGQCGSCWAFSA 127
Query: 199 VACLESAYAIKHNELIELSKQ 219
LE + +K+NEL+ LS+Q
Sbjct: 128 TGSLEGQHFLKNNELVSLSEQ 148
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/138 (32%), Positives = 72/138 (52%), Gaps = 3/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ Y R Y D+E ERRF +F++N+ I + GVN ADMT EF S
Sbjct: 38 WMARYSRNYKDDAEEERRFXMFKDNVDFIQTFDTAGNMPNKLGVNALADMTHEEFR--AS 95
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
++ NL ET SF N + +++++ K + + +++Q CG CWA SAVA
Sbjct: 96 GNTFKIPPNLGLRSETTSFRHQNVTRIPSTMDWRKK-RTVTHIKNQLQCGGCWAFSAVAA 154
Query: 202 LESAYAIKHNELIELSKQ 219
+E ++ ++ I LS+Q
Sbjct: 155 MEGIAKLQTSKSISLSEQ 172
>gi|149018922|gb|EDL77563.1| cathepsin H, isoform CRA_b [Rattus norvegicus]
Length = 270
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 81/160 (50%), Gaps = 13/160 (8%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
+ E + +E+F F +++++++ Y S E R +F NN + I + +
Sbjct: 19 ATAELTVNAIEKF-----HFTSWMKQHQKTYSSR-EYSHRLQVFANNWRKIQAHNQRNH- 71
Query: 120 TATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGK 179
T G+N+F+DM+ +E H W + +N +T Y + Y S++++ KG
Sbjct: 72 TFKMGLNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PSSMDWRKKGN 125
Query: 180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+ V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 VVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQ 165
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/140 (34%), Positives = 72/140 (51%), Gaps = 6/140 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ Y R Y +E ++R+ IF N+ I+ K VN+FAD+T+ EF
Sbjct: 39 EEWMASYGRVYKDINEKQKRYKIFEENVALIESSNKDANKPYKLSVNQFADLTNEEFKAS 98
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ I + KST SF N + +++++ KG V P V+DQ CG CWA SAV
Sbjct: 99 RNRFK-GHICSTKST----SFKYGNVSAVPSAMDWRMKGAVTP-VKDQGQCGCCWAFSAV 152
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E + ELI LS+Q
Sbjct: 153 AATEGITKLTTGELISLSEQ 172
>gi|300175245|emb|CBK20556.2| unnamed protein product [Blastocystis hominis]
Length = 325
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 73/142 (51%), Gaps = 2/142 (1%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
QF F +++ + Y + E R +F NNLK +DYY +Q + G+ F D+++ EF
Sbjct: 23 QFAAFEKKFGKTYVGEEERRFRMSVFSNNLKIVDYY-NSKQSSFVLGITPFIDLSNDEFR 81
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+S + + + S + L SI+++ K V V+DQ CG+CWA +
Sbjct: 82 ERFASNTAFEKKAKSVESSSSQQTSQDYSSLPRSIDWRAKNTV-SSVKDQKNCGACWAFA 140
Query: 198 AVACLESAYAIKHNELIELSKQ 219
AVA +E YA K ++++ S Q
Sbjct: 141 AVASIEGVYAQKTGKILDFSPQ 162
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 74/139 (53%), Gaps = 2/139 (1%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ ++ R Y +E RR ++F+ N+ I+ + + GVN+FAD+T EF ++
Sbjct: 47 WMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMT 106
Query: 142 -SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
S + N + + + ++ L S++++ KG V +++DQ CG CWA SAVA
Sbjct: 107 NSKGFSTPNNGVRVSTGFKYENVSADALPASVDWRTKGAV-TRIKDQGQCGCCWAFSAVA 165
Query: 201 CLESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 166 AMEGIVKLSTGKLISLSEQ 184
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 82/146 (56%), Gaps = 6/146 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F+ ++ ++ R Y + E +RRF++++ NL I+ + G T N+FAD+T+ EF
Sbjct: 118 RFEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHGY-TLTDNKFADLTNEEFR 176
Query: 138 H----GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
GL + + ++ + NS L + ++++ KG V+ +V++Q CGSC
Sbjct: 177 AKMLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVV-EVKNQGSCGSC 235
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SAVA +E IK+ +L+ LS+Q
Sbjct: 236 WAFSAVAAMEGLNQIKNGKLVSLSEQ 261
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 75/138 (54%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +E RRF+IF+ N+ I+ + GVN+FAD+T+ EF +
Sbjct: 40 WMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNH-KFWLGVNQFADLTNYEFRATKT 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + I + T+ + + + L +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 99 NKGF--IPSTVRVPTTFRYENVSIDTLPATVDWRTKGAVTP-IKDQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 156 MEGIVKLSTGKLISLSEQ 173
>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
Length = 318
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 78/146 (53%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N F ++ EY++ Y E RF+IF++NLK ID T + T G+ F D+T+ EF
Sbjct: 46 NLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDE-TNKKNNTYWLGLTSFTDLTNDEF 104
Query: 137 NHG-LSSLDWEQIENLKSTFET--YSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
+ S+ EN +T E+ F + + SI+++ KG V P V++Q CGSC
Sbjct: 105 KEKYVGSIP----ENWSTTEESNDKEFIYDDVVNIPASIDWRQKGAVTP-VRNQGSCGSC 159
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
W S+VA +E I +L+ LS+Q
Sbjct: 160 WTFSSVAAVEGINKIVTGQLVSLSEQ 185
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 55/190 (28%), Positives = 97/190 (51%), Gaps = 23/190 (12%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
+ +++ + R Y++ E ERRF++FR+NL+ +D + + G ++ G+NRFAD+T+
Sbjct: 45 MYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTND 104
Query: 135 EFNHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
E+ L Q E + + ++ L ES++++ KG V +V+DQ CGSC
Sbjct: 105 EYRATYLGVRSRPQRERRLGD----RYLAGDNEDLPESVDWRAKGAV-AEVKDQGSCGSC 159
Query: 194 WAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNH 249
WA S +A +E I ++I LS+Q ++ + GG+M+ Y+
Sbjct: 160 WAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMD----------YAFEF 209
Query: 250 AVLNVGYDNE 259
+ N G D E
Sbjct: 210 IINNGGIDTE 219
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 75/138 (54%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +E RRF+IF+ N+ I+ + GVN+FAD+T+ EF +
Sbjct: 40 WMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNH-KFWLGVNQFADLTNYEFRATKT 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + I + T+ + + + L +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 99 NKGF--IPSTVRVPTTFRYENVSIDTLPATVDWRTKGAVTP-IKDQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 156 MEGIVKLSTGKLISLSEQ 173
>gi|3377952|emb|CAA08906.1| cysteine proteinase [Cicer arietinum]
Length = 362
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/150 (31%), Positives = 81/150 (54%), Gaps = 6/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
++ L+ + F F ++ + Y + E + RF +F++NLK + K + +A +GV +F+
Sbjct: 38 DQLLNAEHHFTTFKSKFSKSYATKEEHDYRFGVFKSNLKKAKLHQKLDP-SAEHGVTKFS 96
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T SEF L ++ L + + +N+ L E ++++KG V P V+DQ
Sbjct: 97 DLTASEFRRQFLGL--KKRLRLPAHAQKAPILPTNN--LPEDFDWREKGAVTP-VKDQGS 151
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S LE A + +L+ LS+Q
Sbjct: 152 CGSCWAFSTTGALEGANYLATGKLVSLSEQ 181
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/192 (27%), Positives = 97/192 (50%), Gaps = 33/192 (17%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ +++E+ + Y+S E ++RF IF+ N+ I+ + + + G+N+FAD+T+SEF
Sbjct: 38 YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEF-R 96
Query: 139 GL------SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
GL + ++ ++ +T A S++++ KG V +++DQ CGS
Sbjct: 97 GLYVGRLQRPAPFHEVGDIALVADT-----------ATSVDWRKKGGV-TEIKDQGDCGS 144
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLN 248
CWA SAVA +E + L+ LS+Q T + GG+M+ Y+
Sbjct: 145 CWAFSAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMD----------YAFQ 194
Query: 249 HAVLNVGYDNES 260
+ + N G ++S
Sbjct: 195 YMIRNGGITSQS 206
>gi|158519867|ref|NP_001103540.1| cathepsin W precursor [Bos taurus]
gi|158455042|gb|AAI13313.1| CTSW protein [Bos taurus]
gi|296471607|tpg|DAA13722.1| TPA: cathepsin W [Bos taurus]
Length = 272
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 70/141 (49%), Gaps = 5/141 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F +Y R Y + +E RR DIF NL + + GTA +GV +F+D+T+ EF
Sbjct: 42 FRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFVQ 101
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
S + + + + S ++ +++ G + P V+DQ C CWA +A
Sbjct: 102 LYGSQVAGEALGVSRKVGSEEWGESEP----QTCDWRKVGTISP-VRDQRNCNCCWAMAA 156
Query: 199 VACLESAYAIKHNELIELSKQ 219
+E+ +AIK +E+S Q
Sbjct: 157 AGNIEALWAIKFRHFVEVSVQ 177
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 74/143 (51%), Gaps = 8/143 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
F+++V +Y + Y S E RF++F++NL ID K TY G+N FAD+T EF
Sbjct: 66 FEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKK---VTTYWLGLNAFADLTHDEF 122
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ L Q E K+T + + + S++++ KG V V++Q CGSCWA
Sbjct: 123 K--ATYLGLRQPETKKTTDSRFRYGGVADDDVPASVDWRKKGAVT-DVKNQGQCGSCWAF 179
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S VA +E I L LS+Q
Sbjct: 180 STVAAVEGINQIVTGNLTSLSEQ 202
>gi|94420703|gb|ABF18679.1| cysteine protease [Medicago sativa]
Length = 350
Score = 75.1 bits (183), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 75/146 (51%), Gaps = 10/146 (6%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDS 134
H F F Y ++YD+ E++RRF IF NL+ I+ K G T GVN FAD T
Sbjct: 47 HAVSFARFANRYGKRYDTVDEMKRRFKIFSENLQLIESTNKKRLGY-TLGVNHFADWTWE 105
Query: 135 EF-NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF +H L + +N +T + + L +++ +G ++ +V+DQ CGSC
Sbjct: 106 EFRSHRLGA-----AQNCSATLK--GNHRITDVVLPAEKDWRKEG-IVSEVKDQGHCGSC 157
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
W S LESAYA + I LS+Q
Sbjct: 158 WTFSTTGALESAYAQAFGKNISLSEQ 183
>gi|145508365|ref|XP_001440132.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124407338|emb|CAK72735.1| unnamed protein product [Paramecium tetraurelia]
Length = 321
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 82/143 (57%), Gaps = 11/143 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
Q++++ ++Y ++Y + +E RF I++ N+ I+ + + + +N+F D+TD EF
Sbjct: 37 QYQEWQQKYNKRYPTQNEQIYRFSIYQQNIMKIEDFN-SQNNSYKQKINKFGDLTDQEFL 95
Query: 138 HGLSSLDW-EQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+L +++N++ E + + E +++ KGKV P ++DQ CGSCWA
Sbjct: 96 TIYLNLQMPARVKNIQKNEEPFL--------VQEEVDWVQKGKV-PAIKDQGDCGSCWAF 146
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SAV LE I+ NE+++LS+Q
Sbjct: 147 SAVGALEINTKIQFNEIVDLSEQ 169
>gi|343477225|emb|CCD11889.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 74/145 (51%), Gaps = 11/145 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT---ATYGVNRFADMTDS 134
QF F ++Y R Y +E RF +F+ N++ K E AT+GV RF+DM+
Sbjct: 40 QFAAFKQKYSRSYKDATEEAFRFRVFKQNMER----AKEEAAANPYATFGVTRFSDMSPE 95
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + LK + + ++ + +++++ KG V P V+DQ CGSCW
Sbjct: 96 EFRATYHNGAEYYAAALKRPRKVVNVSTGKA---PPAVDWRKKGAVTP-VKDQGACGSCW 151
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+ +E + + +EL LS+Q
Sbjct: 152 AFSAIGNIEGQWKVAGHELTSLSEQ 176
>gi|305434754|gb|ADM53739.1| cathepsin L2 precursor [Lepeophtheirus salmonis]
Length = 382
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 75/137 (54%), Gaps = 8/137 (5%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F+ FV+EY + Y + + + +F +NL+ I+ + + + T G+N F+D+TD EF
Sbjct: 35 EFESFVKEYSKSYHNRALRSLKLKVFVDNLREIEEHNANPKRTWDMGINEFSDLTDEEFE 94
Query: 138 ---HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
G S + + T + N L ES+++++KG V+ V++Q CGSCW
Sbjct: 95 SKYMGYSPMS----SSAGLVTRTVAPKQGNIKDLPESVDWREKG-VITDVKNQGSCGSCW 149
Query: 195 AHSAVACLESAYAIKHN 211
SAV +ES AI++N
Sbjct: 150 VFSAVEQIESYVAIENN 166
>gi|1163075|emb|CAA81061.1| cysteine proteinase [Trypanosoma congolense]
Length = 442
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 74/145 (51%), Gaps = 11/145 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT---ATYGVNRFADMTDS 134
QF F ++Y R Y +E RF +F+ N++ K E AT+GV RF+DM+
Sbjct: 35 QFAAFKQKYSRSYKDATEEAFRFRVFKQNMER----AKEEAAANPYATFGVTRFSDMSPE 90
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + LK + + ++ + +++++ KG V P V+DQ CGSCW
Sbjct: 91 EFRATYHNGAEYYAAALKRPRKVVNVSTGKA---PPAVDWRKKGAVTP-VKDQGACGSCW 146
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+ +E + + +EL LS+Q
Sbjct: 147 AFSAIGNIEGQWKVAGHELTSLSEQ 171
>gi|258618831|gb|ACV84238.1| cysteine proteinase L [Anisakis simplex]
Length = 411
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 51/154 (33%), Positives = 78/154 (50%), Gaps = 19/154 (12%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG--TATYGVNRFADMTDS 134
+QF DF+ Y R+Y E RF F NN+K Y K +QG +G+ RFAD ++
Sbjct: 102 DQFIDFMNVYGRKYHGYHETRERFQNFVNNMK---YIKKIQQGKQNVQFGITRFADWSEE 158
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSS---------NSYGLAESINYKDKGKVLPKVQ 185
E + S+ + N++ ++ ++ S G ES +++ K V+ ++
Sbjct: 159 E----MKSMTCGEEPNMEMRYDREYYDGSYEDEFTLYDGFGGRPESFDWRSK-NVVTDIK 213
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
DQ CGSCWA AV +ES AI N L+ LS+Q
Sbjct: 214 DQQRCGSCWAFGAVGVVESMNAIAKNPLVSLSEQ 247
>gi|291410711|ref|XP_002721635.1| PREDICTED: cathepsin H [Oryctolagus cuniculus]
Length = 333
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 84/162 (51%), Gaps = 17/162 (10%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G++ S +LE+F FK ++ ++ ++Y ++ E RR F N + I+ H G
Sbjct: 19 GADAFSANNLEKF-----HFKSWMSQHHKKYSAE-EYPRRLQTFVRNWRKIN---AHNNG 69
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ G+N+F+DM+ +E H W + +N +T Y + Y S++++ K
Sbjct: 70 NHTFQMGLNQFSDMSFAEIKH---KYLWTEPQNCSATKSNY-LRGTGPY--PSSVDWRKK 123
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 124 GNFVSPVKNQGACGSCWTFSTTGALESAVAIAGGKMLSLAEQ 165
>gi|33590494|gb|AAQ22984.1| cathepsin L-like cysteine proteinase precursor [Acanthoscelides
obtectus]
Length = 321
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 50/150 (33%), Positives = 86/150 (57%), Gaps = 9/150 (6%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGTATY--GVNRFA 129
L +++ F ++ R Y + E +RRF+IF+ NL+TI+ + ++ G T+ G+N+F
Sbjct: 17 LSEQEKWQQFKIQHGRTYRTLLEEKRRFEIFKFNLRTIEEHNERYHNGEETFEMGINQFG 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
DMT EF L+ L Q+ + + SF++ N + +++++++KG V +V+ Q
Sbjct: 77 DMTQEEFKRMLA-LQKPQMPLPRG--DEVSFDNVND--IPKTVDWREKGAV-TEVKKQGN 130
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E +K+ L LS Q
Sbjct: 131 CGSCWAFSAVGSIEGQVFLKNGSLESLSAQ 160
>gi|20147096|gb|AAM09951.1| 49 kDa cysteine proteinase Cysp1 [Cryptobia salmositica]
Length = 428
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 70/142 (49%), Gaps = 3/142 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F +F + R Y S E +RF+IF N+K + AT+G N FADMT EF
Sbjct: 10 FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNP-MATFGPNEFADMTSEEFQT 68
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSN-SYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
++ + T +F + + + I+++ KG V P V++Q CGSCW+ S
Sbjct: 69 RHNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQIDWRLKGAVTP-VKNQGACGSCWSFS 127
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+E +AI +L+ +S+Q
Sbjct: 128 TTGNIEGQHAIATGQLVAVSEQ 149
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 82/144 (56%), Gaps = 9/144 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F ++ ++ + Y+S E RF+IFR+NL ID T + + G+N FAD+++ EF
Sbjct: 48 FDSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDE-TNKKNNSYWLGLNGFADLSNDEFKK 106
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G + D+ +E+ + E +++ +Y +SI+++ KG V P V++Q CGSCWA
Sbjct: 107 KYVGSVAEDFTGLEHFDN--EDFTYKHVTNY--PQSIDWRAKGAVTP-VKNQGSCGSCWA 161
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +A +E I L+ELS+Q
Sbjct: 162 FSTIATVEGVNKIVTGNLLELSEQ 185
>gi|256077197|ref|XP_002574894.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230780|emb|CCD77197.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 419
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ F +Y +QY ++E E RF+IF++N+ Y E+G+A YGV ++D+T EF
Sbjct: 119 KYVQFKLKYRKQYH-ETEDEIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFA 177
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ W + +T + +N + ++ ++++KG V +V++Q +CGSCWA S
Sbjct: 178 RTHLTASWVVPSSRSNTPTSLGKEVNN---IPKNFDWREKGAV-TEVKNQGMCGSCWAFS 233
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+ES + K +L+ LS+Q
Sbjct: 234 TTGNVESQWFRKTGKLLSLSEQ 255
>gi|121531620|gb|ABM55495.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 264
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/146 (36%), Positives = 80/146 (54%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y + E + RF IF+ NL I ++ ++++G TY GV RFAD+T
Sbjct: 21 DQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFADLTH 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF L QI+N K + +SI++ +KG VL +V+DQ+ CGSC
Sbjct: 81 EEFKDILKG----QIKN-KPRLNATPTVFPEDLEVPDSIDWTEKGAVL-EVKDQNPCGSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA LE AI +N I LS+Q
Sbjct: 135 WAFSATGALEGQNAILNNVKISLSEQ 160
>gi|224285931|gb|ACN40679.1| unknown [Picea sitchensis]
Length = 366
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 76/141 (53%), Gaps = 8/141 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F+R Y ++Y E E RF +F++NL + K + A++GV +F+D+T F H
Sbjct: 57 FRHFIRRYGKKYSGPEEHEHRFGVFKSNLLRALEHQKLDP-RASHGVTKFSDLTQEGFRH 115
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L + L+ + +++ L E ++++KG V +V++Q CGSCWA S
Sbjct: 116 QYLGL---RAPPLRDAHDAPILPTND---LPEDFDWREKGAVT-EVKNQGSCGSCWAFST 168
Query: 199 VACLESAYAIKHNELIELSKQ 219
LE A +K EL+ LS+Q
Sbjct: 169 TGALEGANFLKTGELVSLSEQ 189
>gi|30142040|gb|AAN34825.1| cysteine proteinase [Leishmania amazonensis]
gi|30142042|gb|AAN34826.1| cysteine proteinase [Leishmania amazonensis]
gi|30142572|gb|AAP21894.1| cysteine proteinase [Leishmania amazonensis]
Length = 354
Score = 74.7 bits (182), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 79/142 (55%), Gaps = 5/142 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN-RFADMTDSEFN 137
+ F + + + + D+E RF+ F+ N++T Y+ + A Y V+ +FAD+T EF
Sbjct: 42 YGSFKKRHSKAFGGDAEEGHRFNAFKQNMQTA-YFLNTQNPHAHYDVSGKFADLTPQEFA 100
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ D+ +LK E + S G+ S++++DKG V P V++Q LCGSCWA S
Sbjct: 101 KLYLNPDY-YTSHLKDHKEDVHVDDSAPSGVM-SVDWRDKGAVTP-VKNQGLCGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+ +E +A + L+ LS+Q
Sbjct: 158 AIGNIEGQWAASGHSLVSLSEQ 179
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 85/160 (53%), Gaps = 9/160 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++++ + + Y++ E RF++F++NLK ID T + + GVN FAD+T EF +
Sbjct: 45 FEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDE-TNKKVTSYWLGVNEFADLTHQEFKN 103
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L ++E+ ++ F + L +S++++ KG V +V++Q CGSCWA S
Sbjct: 104 MYLGL---KVESSRTRQSPEEFTYKDVVDLPKSVDWRKKGAVT-RVKNQGSCGSCWAFST 159
Query: 199 VACLESAYAIKHNELIELSKQPPKTHGRFYK----GGVMN 234
VA +E I L LS+Q R Y GG+M+
Sbjct: 160 VAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMD 199
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 48/174 (27%), Positives = 86/174 (49%), Gaps = 18/174 (10%)
Query: 49 LILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLK 108
+++++ P+ +EE + L+ + F F ++ + Y + E + RF +F++NL+
Sbjct: 31 ILIRQVVPDVGEAEEE-----DNLLNAEHHFASFKAKFAKTYATKEEHDHRFGVFKSNLR 85
Query: 109 TIDYYTKHEQGTATYGVNRFADMTDSEFNH---GLSSLDWEQIENLKSTFETYSFNSSNS 165
+ K + +A +GV +F+D+T +EF GL L + T
Sbjct: 86 RARLHAKLDP-SAVHGVTKFSDLTPAEFRRQFLGLKPLRFPAHAQKAPILPTKD------ 138
Query: 166 YGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L + +++DKG V V+DQ CGSCW+ S LE A+ + EL+ LS+Q
Sbjct: 139 --LPKDFDWRDKGAVT-NVKDQGACGSCWSFSTTGALEGAHYLATGELVSLSEQ 189
>gi|118429527|gb|ABK91811.1| cathepsin F precursor [Clonorchis sinensis]
Length = 326
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 86/175 (49%), Gaps = 27/175 (15%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKT 90
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + + E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDLTPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNL 235
S + +E + K +L+ LS+Q PP+T+ K G + L
Sbjct: 143 SVIGNVEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKMGGLEL 197
>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
Length = 318
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 73/146 (50%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N F ++ EY++ Y E RF+IF++NLK ID T + T G+ F D+T+ EF
Sbjct: 46 NLFDSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDE-TNKKNNTYWLGLTSFTDLTNDEF 104
Query: 137 NH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
G +W E F + + SI+++ KG V P V++Q CGSC
Sbjct: 105 KEKYVGSIPENWSTTEEPNDK----EFIYDDVVNIPASIDWRQKGAVTP-VRNQGSCGSC 159
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
W S+VA +E I +L+ LS+Q
Sbjct: 160 WTFSSVAAVEGINKIVTGQLVSLSEQ 185
>gi|294883332|ref|XP_002770713.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873998|gb|EER02718.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 332
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 76/142 (53%), Gaps = 9/142 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F ++ + Y+S E +R IF+ NL+ I+ + + GVN AD+T EF
Sbjct: 28 FMGFKHKFGKNYESKEEEVKRNAIFQANLQHIEQVNAKDL-SYKLGVNEHADLTHEEFAA 86
Query: 139 -GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
LS+LD + + E N+ L S+++++K VL V+DQ CGSCWA S
Sbjct: 87 LKLSTLDTSTRRDDEFVVEV------NTTQLPTSVDWRNK-SVLTPVKDQEFCGSCWAFS 139
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+ LE+ YAI +L+ LS+Q
Sbjct: 140 AIGALEAQYAIATGKLLSLSEQ 161
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 76/138 (55%), Gaps = 6/138 (4%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y ++ E +R++IF+ N++ I+ + K G+N FAD+T+ EF ++
Sbjct: 40 WMAQYGRVYKNEVEKTKRYNIFKENVEYIESFNKAGTKPYKLGINAFADLTNKEF---IA 96
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
S + + + S+ F N + +++++ KG V P V+DQ CG CWA SAVA
Sbjct: 97 SRNGYILPHECSS--NTPFRYENVSAVPTTVDWRKKGAVTP-VKDQGQCGCCWAFSAVAA 153
Query: 202 LESAYAIKHNELIELSKQ 219
+E + LI LS+Q
Sbjct: 154 MEGITKLSTGNLISLSEQ 171
>gi|118429515|gb|ABK91805.1| cysteine proteinase 7 precursor [Clonorchis sinensis]
Length = 326
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 86/175 (49%), Gaps = 27/175 (15%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKT 90
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + + E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDLTPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNL 235
S + +E + K +L+ LS+Q PP+T+ K G + L
Sbjct: 143 SVIGNVEGQWFRKTGDLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKMGGLEL 197
>gi|221056030|ref|XP_002259153.1| P.knowlesi ortholog of falcipain [Plasmodium knowlesi strain H]
gi|193809224|emb|CAQ39926.1| P.knowlesi ortholog of falcipain [Plasmodium knowlesi strain H]
Length = 479
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/167 (30%), Positives = 81/167 (48%), Gaps = 33/167 (19%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L++ N F F++E+ ++Y + E+++R+ F NL I+ + E + G+NRF DM+
Sbjct: 153 LENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVSYKKGMNRFGDMS 212
Query: 133 DSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK-GKVLPK-------- 183
+E+ E T +T+ F SN + I+Y D + PK
Sbjct: 213 ------------FEEFEKKYLTLKTFDF-KSNGLKSTQLISYDDVINRYKPKDDKFDHTK 259
Query: 184 -----------VQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ CGSCWA S V +ES Y I+ NEL+ +S+Q
Sbjct: 260 YDWRLHRGVTPVKDQGDCGSCWAFSTVGVVESQYLIRKNELVSISEQ 306
>gi|121531616|gb|ABM55493.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 264
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 54/146 (36%), Positives = 80/146 (54%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y S E RF IF++NL I ++ ++++G TY GV RFAD+T
Sbjct: 21 DQWIAFKQTHGKTYKSLLEERTRFGIFQSNLMKIKEHNARYDKGEETYFLGVTRFADLTH 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF L QI+N T + + + +SI++ +KG VL ++DQ CGSC
Sbjct: 81 GEFKDFLR----RQIKNKPRLHATPTVFPED-LEVPDSIDWTEKGAVL-DIKDQEDCGSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA LE AI +N I LS+Q
Sbjct: 135 WAFSATGALEGQNAILNNVRIPLSEQ 160
>gi|74229834|gb|AAU14993.2| cysteine proteinase [Cryptobia salmositica]
Length = 443
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 70/142 (49%), Gaps = 3/142 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F +F + R Y S E +RF+IF N+K + AT+G N FADMT EF
Sbjct: 25 FGNFKAAHARNYASPDEERKRFEIFAGNMKKAAVLNRKNP-MATFGPNEFADMTSEEFQT 83
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSN-SYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
++ + T +F + + + I+++ KG V P V++Q CGSCW+ S
Sbjct: 84 RHNAARHYAAAKARPPKNTKTFTAEEIKAAVGQQIDWRLKGAVTP-VKNQGACGSCWSFS 142
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+E +AI +L+ +S+Q
Sbjct: 143 TTGNIEGQHAIATGQLVAVSEQ 164
>gi|29841177|gb|AAP06190.1| similar to GenBank Accession Number U07345 preprocathepsin L in
Schistosoma mansoni [Schistosoma japonicum]
Length = 356
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 47/154 (30%), Positives = 85/154 (55%), Gaps = 11/154 (7%)
Query: 67 FDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN 126
F+L E + G + F Y +QY +++ E+RF IF++NL Y E+G+A YGV
Sbjct: 147 FELPE--NVGEMYAQFKLTYRKQY-HETDNEKRFSIFKSNLLKAQLYQVLERGSAVYGVT 203
Query: 127 RFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG-LAESINYKDKGKVLPKVQ 185
++D+T EF+ + W +++ + + + G + + ++++KG V +V+
Sbjct: 204 PYSDLTTDEFSRTHLTAPW------RASSKRNTISPRREVGDIPNNFDWREKGAV-TEVK 256
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+Q +CGSCWA S +ES + K +L+ LS+Q
Sbjct: 257 NQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQ 290
>gi|357473651|ref|XP_003607110.1| Cysteine proteinase [Medicago truncatula]
gi|355508165|gb|AES89307.1| Cysteine proteinase [Medicago truncatula]
Length = 331
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 80/154 (51%), Gaps = 10/154 (6%)
Query: 69 LEEFLDHGN---QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGV 125
+++ +D G QF +F + + + Y S E + RF++F++NL + + +AT+GV
Sbjct: 35 IQQVVDKGGAEYQFNEFKQRFGKVYSSKDEHDYRFNVFKSNLHRAKRHGIMDP-SATHGV 93
Query: 126 NRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQ 185
RF+D+T EF + + L + S + L ++++KG V P V+
Sbjct: 94 TRFSDLTPREFRNSILGLKGVGLPRHAKAAPILS-----TENLPRDFDWREKGAVTP-VR 147
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+Q CGS W+ S + LE A+ + EL+ LS+Q
Sbjct: 148 NQGFCGSSWSFSTIGALEGAHFLSSGELVSLSEQ 181
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 49/165 (29%), Positives = 80/165 (48%), Gaps = 11/165 (6%)
Query: 67 FDLEEFLDHGN-----QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA 121
D+E + N ++ ++ E + Y+S E E RF+IF+ NL+ ID + +
Sbjct: 25 LDIENSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRIIDDHNADANRSY 84
Query: 122 TYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVL 181
+ G+NRFAD+TD E+ L ++ + + L + ++++ G V+
Sbjct: 85 SLGLNRFADLTDEEYRSTYLGLKMGPKTDVSN-----EYMPKVGEALPDYVDWRTVGAVV 139
Query: 182 PKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGR 226
V++Q LC SCWA SAV +E I LI LS+Q GR
Sbjct: 140 -GVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDCGR 183
>gi|155966155|gb|ABU41032.1| cysteine proteinase [Lepeophtheirus salmonis]
Length = 372
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/137 (32%), Positives = 75/137 (54%), Gaps = 8/137 (5%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F+ FV+EY + Y + + + +F +NL+ I+ + + + T G+N F+D+TD EF
Sbjct: 26 EFESFVKEYSKSYHNRALRSLKLKVFVDNLREIEEHNANPKRTWDMGINEFSDLTDEEFE 85
Query: 138 ---HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
G S + + T + N L ES+++++KG V+ V++Q CGSCW
Sbjct: 86 SKYMGYSPMS----SSAGLVTRTAAPKQGNIKDLPESVDWREKG-VITDVKNQGSCGSCW 140
Query: 195 AHSAVACLESAYAIKHN 211
SAV +ES AI++N
Sbjct: 141 VFSAVEQIESYVAIENN 157
>gi|297804580|ref|XP_002870174.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
gi|297316010|gb|EFH46433.1| hypothetical protein ARALYDRAFT_915142 [Arabidopsis lyrata subsp.
lyrata]
Length = 373
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 78/150 (52%), Gaps = 5/150 (3%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
E L+ + F F +YE+ Y + E + RF +F+ NL+ + +A +GV +F+
Sbjct: 46 EHLLNAEHHFSLFKSKYEKTYATQEEHDHRFRVFKANLRRARR-NQLLDPSAVHGVTQFS 104
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T EF L + + +T + + L +++++G V P V++Q +
Sbjct: 105 DLTPKEFRRKFLGL---KRRGFRLPTDTQTAPILPTSDLPTEFDWREQGAVTP-VKNQGM 160
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ SA+ LE A+ + EL+ LS+Q
Sbjct: 161 CGSCWSFSAIGALEGAHFLATKELVSLSEQ 190
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 54/190 (28%), Positives = 97/190 (51%), Gaps = 23/190 (12%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
+ +++ + R Y++ E ERRF++FR+NL+ +D + + G ++ G+NRFAD+T+
Sbjct: 45 MYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTND 104
Query: 135 EFNHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
E+ L Q E + + ++ L ES++++ KG V +++DQ CGSC
Sbjct: 105 EYRATYLGVRSRPQRERRLGD----RYLAGDNEDLPESVDWRAKGAV-AEIKDQGSCGSC 159
Query: 194 WAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNH 249
WA S +A +E I ++I LS+Q ++ + GG+M+ Y+
Sbjct: 160 WAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMD----------YAFEF 209
Query: 250 AVLNVGYDNE 259
+ N G D E
Sbjct: 210 IINNGGIDTE 219
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 74.7 bits (182), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 73/138 (52%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y + Y E E RF IF N+ ++ + + G+N+FAD+T+ EF ++
Sbjct: 42 WMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLGINQFADLTNEEF---VA 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
S + + S T +F N + +++++ KG V P V++Q CG CWA SAVA
Sbjct: 99 SRNKFKGHMCSSITRTTTFKYENVSAIPSTVDWRKKGAVTP-VKNQGQCGCCWAFSAVAA 157
Query: 202 LESAYAIKHNELIELSKQ 219
E + + +LI LS+Q
Sbjct: 158 TEGIHKLSTGKLISLSEQ 175
>gi|256077193|ref|XP_002574892.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230781|emb|CCD77198.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 457
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ F +Y +QY ++E E RF+IF++N+ Y E+G+A YGV ++D+T EF
Sbjct: 157 KYVQFKLKYRKQYH-ETEDEIRFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFA 215
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ W + +T + +N + ++ ++++KG V +V++Q +CGSCWA S
Sbjct: 216 RTHLTASWVVPSSRSNTPTSLGKEVNN---IPKNFDWREKGAV-TEVKNQGMCGSCWAFS 271
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+ES + K +L+ LS+Q
Sbjct: 272 TTGNVESQWFRKTGKLLSLSEQ 293
>gi|209731972|gb|ACI66855.1| Cathepsin H precursor [Salmo salar]
Length = 328
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 80/143 (55%), Gaps = 12/143 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT--ATYGVNRFADMTDSEF 136
FK ++ +Y + YD + E R IF N + IDY H +G T G+N+F+D+T +EF
Sbjct: 28 FKLWMSQYNKVYDME-EYYHRLQIFIENKRRIDY---HNEGNHKFTMGLNQFSDLTFAEF 83
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
S + +N +T ++ +S+ Y ES++++ KG + V++Q CGSCW
Sbjct: 84 R---KSFLLTEPQNCSATKGSH-VSSNGPY--PESVDWRKKGNYVTAVKNQGSCGSCWTF 137
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S CLES AI +L++LS+Q
Sbjct: 138 STTGCLESVTAIATGKLLQLSEQ 160
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 51/160 (31%), Positives = 85/160 (53%), Gaps = 9/160 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++++ + + Y++ E RF++F++NLK ID T + + GVN FAD+T EF +
Sbjct: 48 FEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDE-TNKKVTSYWLGVNEFADLTHQEFKN 106
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L ++E+ ++ F + L +S++++ KG V +V++Q CGSCWA S
Sbjct: 107 MYLGL---KVESSRTRQSPEEFTYKDVVDLPKSVDWRKKGAVT-RVKNQGSCGSCWAFST 162
Query: 199 VACLESAYAIKHNELIELSKQPPKTHGRFYK----GGVMN 234
VA +E I L LS+Q R Y GG+M+
Sbjct: 163 VAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMD 202
>gi|33333696|gb|AAQ11966.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 77/150 (51%), Gaps = 18/150 (12%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYG--VNRFA 129
LDHG ++ V E +RRF +F+ NL I ++ K+E+G ++ V +FA
Sbjct: 28 LDHGKTYRSLVEE-----------KRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
DMT EF L L + + L S + ++++++++G V P V+DQ
Sbjct: 77 DMTHEEF---LDLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTP-VKDQAN 132
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E + K+ L+ LS Q
Sbjct: 133 CGSCWAFSAVGAIEGQFFKKNGTLVSLSAQ 162
>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
Length = 363
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 76/150 (50%), Gaps = 6/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+ L+ + F F ++ + Y + E + RF +F++NL + K + TA +G+ +F+
Sbjct: 39 DHLLNAEHHFTSFKSKFSKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDP-TAEHGITKFS 97
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T SEF L + L+ + L E ++++KG V P V+DQ
Sbjct: 98 DLTASEFRRQFLGLK----KRLRLPAHAQKAPILPTTNLPEDFDWREKGAVTP-VKDQGS 152
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S LE A+ + +L+ LS+Q
Sbjct: 153 CGSCWAFSTTGALEGAHYLATGKLVSLSEQ 182
>gi|126021|sp|P25775.1|LMCPA_LEIME RecName: Full=Cysteine proteinase A; Flags: Precursor
gi|9573|emb|CAA44094.1| cysteine proteinase [Leishmania mexicana]
Length = 354
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 80/142 (56%), Gaps = 5/142 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN-RFADMTDSEFN 137
+ F + + + + D+E RF+ F+ N++T Y+ + A Y V+ +FAD+T EF
Sbjct: 42 YGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTA-YFLNTQNPHAHYDVSGKFADLTPQEFA 100
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ D+ +LK+ E + S G+ S++++DKG V P V++Q LCGSCWA S
Sbjct: 101 KLYLNPDYYA-RHLKNHKEDVHVDDSAPSGVM-SVDWRDKGAVTP-VKNQGLCGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+ +E +A + L+ LS+Q
Sbjct: 158 AIGNIEGQWAASGHSLVSLSEQ 179
>gi|226468424|emb|CAX69889.1| Temporarily Assigned Gene name [Schistosoma japonicum]
Length = 454
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 88/167 (52%), Gaps = 11/167 (6%)
Query: 54 SQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY 113
S P + E F+L E + G + F Y +QY +++ E+RF IF++NL Y
Sbjct: 134 SIPRMPQNLEYLGFELPE--NVGEMYAQFKLTYRKQYH-ETDNEKRFSIFKSNLLKAQLY 190
Query: 114 TKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG-LAESI 172
E+G+A YGV ++D+T EF+ + W +++ + + + G + +
Sbjct: 191 QVLERGSAVYGVTPYSDLTTDEFSRTHLTAPW------RASSKRNTISPRREVGDIPNNF 244
Query: 173 NYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+++ KG V +V++Q +CGSCWA S +ES + K +L+ LS+Q
Sbjct: 245 DWRKKGAV-TEVKNQGMCGSCWAFSTTGNIESQWFRKTGKLLSLSEQ 290
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 60/197 (30%), Positives = 92/197 (46%), Gaps = 43/197 (21%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
F ++ + R Y S SE +RRF IF++NL I + K E+ + G+N+F+D+T EF
Sbjct: 52 FHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEK-SYWLGLNKFSDLTHDEFRA 110
Query: 138 -----------HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
HGL + D E++ + E ++++ KG V V+D
Sbjct: 111 LYLGIRPAGRAHGLRNGDRFIYEDVVAE---------------EMVDWRKKGAV-SDVKD 154
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSK 242
Q CGSCWA SA+ +E AI ELI LS+Q + + GG+M+
Sbjct: 155 QGSCGSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMD-------- 206
Query: 243 GPYSLNHAVLNVGYDNE 259
Y+ + + N G D E
Sbjct: 207 --YAFDFIIKNGGIDTE 221
>gi|33333702|gb|AAQ11969.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 77/150 (51%), Gaps = 18/150 (12%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYG--VNRFA 129
LDHG ++ V E +RRF +F+ NL I ++ K+E+G ++ V +FA
Sbjct: 28 LDHGKTYRSLVEE-----------KRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
DMT EF L L + + L S + ++++++++G V P V+DQ
Sbjct: 77 DMTHEEF---LDLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTP-VKDQAN 132
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E + K+ L+ LS Q
Sbjct: 133 CGSCWAFSAVGAIEGQFFKKNGTLVSLSAQ 162
>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
Length = 363
Score = 74.3 bits (181), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 76/150 (50%), Gaps = 6/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+ L+ + F F ++ + Y + E + RF +F++NL + K + TA +G+ +F+
Sbjct: 39 DHLLNAEHHFTSFKSKFSKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDP-TAEHGITKFS 97
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T SEF L + L+ + L E ++++KG V P V+DQ
Sbjct: 98 DLTASEFRRQFLGLK----KRLRLPAHAQKAPILPTTNLPEDFDWREKGAVTP-VKDQGS 152
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S LE A+ + +L+ LS+Q
Sbjct: 153 CGSCWAFSTTGALEGAHYLATGKLVSLSEQ 182
>gi|33333708|gb|AAQ11972.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 77/150 (51%), Gaps = 18/150 (12%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYG--VNRFA 129
LDHG ++ V E +RRF +F+ NL I ++ K+E+G ++ V +FA
Sbjct: 28 LDHGKTYRSLVEE-----------KRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
DMT EF L L + + L S + ++++++++G V P V+DQ
Sbjct: 77 DMTHEEF---LDLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTP-VKDQAN 132
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E + K+ L+ LS Q
Sbjct: 133 CGSCWAFSAVGAIEGQFFKKNGTLVSLSAQ 162
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 54/188 (28%), Positives = 98/188 (52%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ ++ E+ + Y++ E ERR+ FR+NL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ L + K + + ++++ L ES++++ KG V +++DQ + GSCWA
Sbjct: 100 YRDTYLGLRNKPRRERKVSDR---YLAADNEALPESVDWRTKGAV-AEIKDQEVAGSCWA 155
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
SA+A +E I +LI LS+Q ++ GG+M+ Y+ + +
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFDFII 205
Query: 252 LNVGYDNE 259
N G D E
Sbjct: 206 NNGGIDTE 213
>gi|256077195|ref|XP_002574893.1| cathepsin F (C01 family) [Schistosoma mansoni]
gi|353230782|emb|CCD77199.1| cathepsin F (C01 family) [Schistosoma mansoni]
Length = 456
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 78/142 (54%), Gaps = 6/142 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ F +Y +QY EI RF+IF++N+ Y E+G+A YGV ++D+T EF
Sbjct: 157 KYVQFKLKYRKQYHETDEI--RFNIFKSNILKAQLYQVFERGSAIYGVTPYSDLTTDEFA 214
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ W + +T + +N + ++ ++++KG V +V++Q +CGSCWA S
Sbjct: 215 RTHLTASWVVPSSRSNTPTSLGKEVNN---IPKNFDWREKGAV-TEVKNQGMCGSCWAFS 270
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+ES + K +L+ LS+Q
Sbjct: 271 TTGNVESQWFRKTGKLLSLSEQ 292
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/142 (34%), Positives = 74/142 (52%), Gaps = 15/142 (10%)
Query: 86 YERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF---NHGLSS 142
+ R YD + E + R ++F NLK I+ + + GVN+F D T EF + GLS
Sbjct: 45 FSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSYKLGVNKFTDWTKEEFLATHTGLSG 104
Query: 143 LDWEQIENLKSTFE-----TYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ N+ S FE T ++N + S L + +++++G V P V+ Q CG CWA S
Sbjct: 105 I------NVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGAVTP-VKYQGECGGCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+A +E I LI LS+Q
Sbjct: 158 AIAAVEGLTKIARGNLISLSEQ 179
>gi|33622213|ref|NP_891858.1| cathepsin [Cryptophlebia leucotreta granulovirus]
gi|33569322|gb|AAQ21608.1| cathepsin [Cryptophlebia leucotreta granulovirus]
Length = 332
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 45/152 (29%), Positives = 82/152 (53%), Gaps = 20/152 (13%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F FV++Y + Y ++ E +FD F+NNL+ I+ + + A + +N+++D+ ++
Sbjct: 30 FDSFVKQYNKTYLTEEERMIKFDNFKNNLRIINEKNRGSK-HAVFDINKYSDLNKNDLLR 88
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYG-----------LAESINYKDKGKVLPKVQDQ 187
+ +N YSF + G L E+ +++DK V P V++Q
Sbjct: 89 HTTGFKLGLKKN-------YSFTTVKECGVVEIKEEPQVLLPETFDWRDKHGVTP-VKNQ 140
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+CGSCWA S + +ES Y IK++++I+LS+Q
Sbjct: 141 LICGSCWAFSTIGNIESLYNIKYDKVIDLSEQ 172
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/155 (32%), Positives = 81/155 (52%), Gaps = 10/155 (6%)
Query: 67 FDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--G 124
FD E L F+ ++ ++ + YDS +E ERR IF +NL+ + T +Y G
Sbjct: 48 FDAEATL----MFESWMVKHGKVYDSVAEKERRLTIFEDNLR---FITNRNAENLSYRLG 100
Query: 125 VNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
+NRFAD++ E+ D N + + +S+ L +S++++++G V +V
Sbjct: 101 LNRFADLSLHEYGEICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAV-TEV 159
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+DQ LC SCWA S V +E I EL+ LS+Q
Sbjct: 160 KDQGLCRSCWAFSTVGAVEGLNKIVTGELVTLSEQ 194
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 41/139 (29%), Positives = 74/139 (53%), Gaps = 2/139 (1%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ ++ R Y +E RR ++F+ N+ I+ + + GVN+FAD+T EF ++
Sbjct: 47 WMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMT 106
Query: 142 -SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
S + N + + + ++ L S++++ KG V +++DQ CG CWA SAVA
Sbjct: 107 NSKGFSTPNNGVRVSTGFKYENVSADALPASVDWRTKGAV-TRIKDQGQCGCCWAFSAVA 165
Query: 201 CLESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 166 AMEGFVKLSTGKLISLSEQ 184
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 48/142 (33%), Positives = 77/142 (54%), Gaps = 9/142 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+++ +++ Y R Y E E RFDI+++N++ I++Y NRFAD+T+ EF
Sbjct: 38 RYETWLKRYGRHYRDREEWEVRFDIYQSNVQYIEFYNSQNYSYKLID-NRFADITNEEFK 96
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ L + +++ F + L +SI+++ KG V V+DQ CGSCWA S
Sbjct: 97 S--TYLGYLPRFRVQTEFRYHKHGE-----LPKSIDWRKKGAV-THVKDQGRCGSCWAFS 148
Query: 198 AVACLESAYAIKHNELIELSKQ 219
AVA +E IK L+ LS+Q
Sbjct: 149 AVAAVEGINKIKTENLVSLSEQ 170
>gi|410960470|ref|XP_003986812.1| PREDICTED: pro-cathepsin H [Felis catus]
Length = 321
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 78/143 (54%), Gaps = 12/143 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
FK ++ +++++Y S+ E +RR F N + I + H G T+ G+N+F+DM+ +E
Sbjct: 21 FKSWMVQHQKRYSSE-EYQRRLQTFVGNWRRI---SAHNAGNHTFKMGLNQFSDMSFAEI 76
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
H W + +N +T Y + Y ++++ KGK + V++Q CGSCW
Sbjct: 77 KH---KYLWSEPQNCSATRGNY-LRGTGPY--PPFVDWRTKGKYVSPVKNQGGCGSCWTF 130
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S LESA AIK +L+ L++Q
Sbjct: 131 STTGALESAIAIKTGKLLSLAEQ 153
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 82/143 (57%), Gaps = 9/143 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQGTATY--GVNRFADMTDSE 135
+K+F +++ Y+ E RRF+IFR N+ I+ + K G +Y GVN+F D+ +E
Sbjct: 79 WKEFKILHDKSYEDHEEESRRFEIFRENVLRIEKHNKLFHLGKKSYYLGVNQFTDLEYAE 138
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
F + + L + N K + S S+N+ + +S++++ KG V KV++Q CGSCWA
Sbjct: 139 FVN-FNGLKMTNLNNTKCS----SHLSANNIVVPDSVDWRSKGYV-TKVKNQGACGSCWA 192
Query: 196 HSAVACLESAYAIKHNELIELSK 218
SA LE Y K+ +L+ LS+
Sbjct: 193 FSATGSLEGQYFRKNGKLVPLSE 215
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 91/166 (54%), Gaps = 15/166 (9%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSE 135
+++++ ++++ Y+ E ++RF IF++NL ID +H TY G+N+FADMT+ E
Sbjct: 38 MYEEWLVKHQKVYNGLREKDQRFQIFKDNLNFID---EHNAQNYTYIVGLNKFADMTNEE 94
Query: 136 FNH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
+ G S +I K T Y++NS + L ++++ KG + ++DQ CGS
Sbjct: 95 YRDMYLGTRSDIKRRIMKNKITGHRYAYNSGDR--LPVHVDWRLKGAI-THIKDQGSCGS 151
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMN 234
CWA S +A +E+ I +L+ LS+Q R + GG+M+
Sbjct: 152 CWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFNEGCNGGLMD 197
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 84/144 (58%), Gaps = 9/144 (6%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF- 136
+ ++ +Y R Y +D+E+E+RF IF NL+ I+ + + G +Y +N+F+D+T+ EF
Sbjct: 39 QQWMLQYGRSYTNDAEMEKRFKIFMENLEYIEKF-NNAPGNKSYKLDLNQFSDLTNEEFI 97
Query: 137 -NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+H +D + + S + S++ S++++++G V V++Q CGSCWA
Sbjct: 98 ASHTGLMIDPSKPSSSSKRASPASLDLSDT---PTSLDWREQGAV-TDVKNQGNCGSCWA 153
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
SAVA +E IK+ LI LS+Q
Sbjct: 154 FSAVAAVEGIVKIKNGNLISLSEQ 177
>gi|209882566|ref|XP_002142719.1| papain family cysteine protease [Cryptosporidium muris RN66]
gi|209558325|gb|EEA08370.1| papain family cysteine protease, putative [Cryptosporidium muris
RN66]
Length = 400
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 78/146 (53%), Gaps = 6/146 (4%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSE 135
NQF+DF ++Y+++Y + +E + R+ IFR N+ I + QG + +N + D+T E
Sbjct: 84 NQFEDFKQKYKKEYSNLTEEKYRYSIFRKNMNFIK--MSNNQGFSYVLEMNEYGDLTHEE 141
Query: 136 FNHGLSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
F H + + + + S N + +N+ D G V P V+DQ CGSCW
Sbjct: 142 FMHNFMGYHPQHKNKRFSDSHNILSSNKVENTSPPRFVNWVDAGCVNP-VRDQRYCGSCW 200
Query: 195 AHSAVACLESAYAIKHNE-LIELSKQ 219
A S V LESA + NE L++LS+Q
Sbjct: 201 AFSVVTSLESAVCAQKNEKLVKLSEQ 226
>gi|168018894|ref|XP_001761980.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162686697|gb|EDQ73084.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 369
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 78/144 (54%), Gaps = 11/144 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHE--QGTATYGVNRFADMTDSE 135
+F+ F++++ + Y S E E RF +F++NL KH+ TA++GV F+D+T+ E
Sbjct: 55 RFESFMKDFGKVYHSVEEYEHRFGVFKSNLLKA---LKHQALDPTASHGVTMFSDLTEEE 111
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
F L + L S + + + L + ++++KG V P V+DQ CGSCWA
Sbjct: 112 FTSKYLGLKRPSV--LSSAPQAPPLPTED---LPPNFDWREKGAVGP-VKDQGGCGSCWA 165
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E A+ + +L+ LS+Q
Sbjct: 166 FSTTGAVEGAHFLNSGKLVSLSEQ 189
>gi|66814630|ref|XP_641494.1| cysteine protease [Dictyostelium discoideum AX4]
gi|118121|sp|P04989.1|CYSP2_DICDI RecName: Full=Cysteine proteinase 2; AltName: Full=Prestalk
cathepsin; Flags: Precursor
gi|167860|gb|AAA33240.1| pst-cathepsin [Dictyostelium discoideum]
gi|1834417|emb|CAA27050.1| cysteine proteinase 2 [Dictyostelium discoideum]
gi|60469522|gb|EAL67513.1| cysteine protease [Dictyostelium discoideum AX4]
gi|225484|prf||1304284A cathepsin,prestalk
Length = 376
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 56/200 (28%), Positives = 93/200 (46%), Gaps = 42/200 (21%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
F ++ ++ RQY S SE R+ IF++N+ +D + G+N FAD+T+ E+
Sbjct: 36 FTEWTLKFNRQYSS-SEFSNRYSIFKSNMDYVDNWNSKGDSQTVLGLNNFADITNEEYRK 94
Query: 138 ---------HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
H + D ++ N+ E N +SI+++ K V P ++DQ
Sbjct: 95 TYLGTRVNAHSYNGYDGREVLNV----EDLQTN-------PKSIDWRTKNAVTP-IKDQG 142
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ-------PPKTHGRFYKGGVMNLPHMLCS 241
CGSCW+ S E A+A+K +L+ LS+Q P + G GG+MN
Sbjct: 143 QCGSCWSFSTTGSTEGAHALKTKKLVSLSEQNLVDCSGPEENFG--CDGGLMN------- 193
Query: 242 KGPYSLNHAVLNVGYDNEST 261
+ ++ + N G D ES+
Sbjct: 194 ---NAFDYIIKNKGIDTESS 210
>gi|118125|sp|P25784.1|CYSP3_HOMAM RecName: Full=Digestive cysteine proteinase 3; Flags: Precursor
Length = 321
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 74/141 (52%), Gaps = 10/141 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYGV--NRFADMTDSEFNH 138
F +Y R+Y E R +F+ N + I D+ K E G T+ V N+F DMT+ EFN
Sbjct: 23 FKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEFNA 82
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ K+ F ++ + +A ++++ K V P V+DQ CGSCWA SA
Sbjct: 83 VMKGYKKGSRGEPKAVF------TAEAGPMAADVDWRTKALVTP-VKDQEQCGSCWAFSA 135
Query: 199 VACLESAYAIKHNELIELSKQ 219
LE + +K++EL+ LS+Q
Sbjct: 136 TGALEGQHFLKNDELVSLSEQ 156
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 72/126 (57%), Gaps = 3/126 (2%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIEN-LKS 153
E ++RF++F+ N+ + + K ++ +N+FADMT+ EF H + + + L +
Sbjct: 53 EKDKRFNVFKANVHYVHNFNKKDK-PYKLKLNKFADMTNHEFRHHYAGSKIKHHRSFLGA 111
Query: 154 TFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNEL 213
+ +F +N + S++++ KG V P V+DQ CGSCWA S V +E IK NEL
Sbjct: 112 SRANGTFMYANVEDVPPSVDWRKKGAVTP-VKDQGKCGSCWAFSTVVAVEGINQIKTNEL 170
Query: 214 IELSKQ 219
+ LS+Q
Sbjct: 171 VSLSEQ 176
>gi|457756|emb|CAA82995.1| cysteine proteinase [Vicia sativa]
Length = 358
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 76/150 (50%), Gaps = 6/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+ L+ + F F ++ + Y + E + RF +F+ NL + K + TA +G+ +F+
Sbjct: 34 DHLLNAEHHFTSFKSKFSKSYATKEEHDYRFGVFKANLIKAKLHQKLDP-TAEHGITKFS 92
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T SEF L+ + L+ + L E ++++KG V P V+DQ
Sbjct: 93 DLTASEFRRQFLGLN----KRLRLPAHAQKAPILPTTNLPEDFDWREKGAVTP-VKDQGS 147
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S LE A+ + +L+ LS+Q
Sbjct: 148 CGSCWAFSTTGALEGAHYLATGKLVSLSEQ 177
>gi|401419663|ref|XP_003874321.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
gi|1706259|sp|P35591.2|CYSP1_LEIPI RecName: Full=Cysteine proteinase 1; AltName: Full=Amastigote
cysteine proteinase A-1; Flags: Precursor
gi|1220383|gb|AAA91859.1| cysteine proteinase [Leishmania pifanoi]
gi|322490556|emb|CBZ25817.1| cysteine peptidase A (CBA) [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 354
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 79/142 (55%), Gaps = 5/142 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN-RFADMTDSEFN 137
+ F + + + + D+E RF+ F+ N++T Y+ + A Y V+ +FAD+T EF
Sbjct: 42 YGSFKKRHGKAFGGDAEEGHRFNAFKQNMQTA-YFLNTQNPHAHYDVSGKFADLTPQEFA 100
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ D+ +LK E + S G+ S++++DKG V P V++Q LCGSCWA S
Sbjct: 101 KLYLNPDYYA-RHLKDHKEDVHVDDSAPSGVM-SVDWRDKGAVTP-VKNQGLCGSCWAFS 157
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+ +E +A + L+ LS+Q
Sbjct: 158 AIGNIEGQWAASGHSLVSLSEQ 179
>gi|414590229|tpg|DAA40800.1| TPA: putative cysteine protease family protein [Zea mays]
Length = 381
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 69/144 (47%), Gaps = 4/144 (2%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
QF FVR + R+Y E RR +F NL + + TA +GV F+D+T EF
Sbjct: 59 QFAAFVRRHGRRYSGPKEYARRLRVFAANLARAAAHQALDP-TARHGVTPFSDLTREEFE 117
Query: 138 HGLSSL-DWEQIENLKSTFETYSFNSSNSYG-LAESINYKDKGKVLPKVQDQHLCGSCWA 195
L+ L ++ L S S L S +++DKG V V+ Q CGSCWA
Sbjct: 118 ARLTGLRAGGDVQRLMSGVPAAPPASKEEVARLPASFDWRDKGAVT-GVKTQGACGSCWA 176
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E A + EL++LS+Q
Sbjct: 177 FSTTGAVEGANFLATGELVDLSEQ 200
>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 356
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 73/143 (51%), Gaps = 4/143 (2%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ ++ R Y E RR ++F N + +D + T T G+N+F+D+TD EF
Sbjct: 40 EEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFVQT 99
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYG---LAESINYKDKGKVLPKVQDQHLCGSCWAH 196
Q L+ E S ++ YG + ES++++ +G V V++Q CG CWA
Sbjct: 100 HLGYRGHQQGGLRPEEENVSKVAALGYGQADMPESVDWRAQGAVT-GVKNQGSCGCCWAF 158
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
+AVA E I LI +S+Q
Sbjct: 159 AAVAATEGLVKIATGNLISMSEQ 181
>gi|17543258|ref|NP_502836.1| Protein Y40H7A.10 [Caenorhabditis elegans]
gi|3880920|emb|CAA22062.1| Protein Y40H7A.10 [Caenorhabditis elegans]
Length = 343
Score = 74.3 bits (181), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 50/167 (29%), Positives = 86/167 (51%), Gaps = 19/167 (11%)
Query: 54 SQPNSYGSEEASTFDLEEFLD----------HGNQFKDFVREYERQYDSDSEIERRFDIF 103
++PN S + S DL++ L + N F++F+ +Y R+Y ++ EI +RF IF
Sbjct: 18 AKPNLLPSYQIS--DLDQILQRHHIPTPDVKYTNAFQNFLVKYLREYPNEYEIVKRFTIF 75
Query: 104 RNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSS 163
NL ++ Y K + G TY +N F+D+T+ E+ L + + E KS +
Sbjct: 76 SRNLDLVERYNKEDAGKVTYELNDFSDLTEEEWKKYLMTPKPDHSE--KSLKPKTLIDKK 133
Query: 164 NSYGLAESINYKDKGKV--LPKVQDQHLCGSCWAHSAVACLESAYAI 208
N L S+++++ + ++ Q CGSCWA + A +ESA +I
Sbjct: 134 N---LPNSVDWRNVNGTNHVTGIKYQGPCGSCWAFATAAAIESAVSI 177
>gi|145542871|ref|XP_001457122.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124424937|emb|CAK89725.1| unnamed protein product [Paramecium tetraurelia]
Length = 324
Score = 73.9 bits (180), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 83/157 (52%), Gaps = 9/157 (5%)
Query: 65 STFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH-EQGTATY 123
+T ++ + +D N + ++ +Y R+Y S + RF +F +NL I + E T T
Sbjct: 22 NTQEVSDEIDTANLYANWKMKYNRRYTSQRDEMYRFKVFSDNLNYIRAFQDSTESATYTL 81
Query: 124 GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKV-LP 182
+N+FADM+ EF SL + L ++ N++ Y AE +++ D KV P
Sbjct: 82 ELNQFADMSQQEFASTYLSLRVPKTAKLNAS------NANFQYKGAE-VDWTDNKKVKYP 134
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V++Q CGSCWA SAV LE I+ N+ ELS+Q
Sbjct: 135 AVKNQGSCGSCWAFSAVGALEINTDIELNKKYELSEQ 171
>gi|357473731|ref|XP_003607150.1| Cysteine proteinase [Medicago truncatula]
gi|355508205|gb|AES89347.1| Cysteine proteinase [Medicago truncatula]
Length = 326
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 79/154 (51%), Gaps = 10/154 (6%)
Query: 69 LEEFLDHG---NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGV 125
+++ +D G +QF +F + + + Y S E + RF++F++NL + + +AT+GV
Sbjct: 34 IQQVVDKGGAEHQFNEFKQRFGKVYSSKDEHDYRFNVFKSNLHRAKRHVIMDP-SATHGV 92
Query: 126 NRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQ 185
RF+D+T EF + + L + S S L ++++KG V P V+
Sbjct: 93 TRFSDLTPREFRNSILGLKGVGLPRHAKAAPILS-----SENLPRDFDWREKGAVTP-VR 146
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+Q CGS W+ S + LE A + EL+ LS Q
Sbjct: 147 NQGFCGSSWSFSTIGALEGANFLSTGELVSLSDQ 180
>gi|10441624|gb|AAG17127.1|AF190653_1 cathepsin L-like cysteine proteinase CAL1 [Diabrotica virgifera
virgifera]
Length = 322
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 51/150 (34%), Positives = 80/150 (53%), Gaps = 9/150 (6%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY--GVNRFA 129
L ++ F ++ + Y + E RF +F+ NLKTI ++ K+EQG Y VN+FA
Sbjct: 15 LSLNQHWESFKVQHGKVYKNPIEERVRFSVFQANLKTINEHNAKYEQGLVGYTMAVNQFA 74
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
DMT EF L + + + +K + + N+ + +S++++ KG VL V+DQ
Sbjct: 75 DMTPEEFKAKLG-MQAKNMPKIKKSRHVKNVNAE----VPDSVDWRQKGAVL-GVKDQGQ 128
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SA LE I + + LS+Q
Sbjct: 129 CGSCWAFSATGSLEGQNYIVNGKSEPLSEQ 158
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 73/141 (51%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++ + Y+S E RF+IF +NLK ID T + G+N FAD+T EF H
Sbjct: 49 FESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDE-TNKKVSNYWLGLNEFADLTHEEFKH 107
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
E E + + F + L +S++++ KG V P V++Q CGSCWA S
Sbjct: 108 KFLGFKGELAERKDES--SKEFGYRDFVDLPKSVDWRKKGAVAP-VKNQGQCGSCWAFST 164
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I L LS+Q
Sbjct: 165 VAAVEGINQIVTGNLTMLSEQ 185
>gi|160431607|sp|A0E358.2|CATL2_PARTE RecName: Full=Cathepsin L 2; Flags: Precursor
Length = 314
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 53/157 (33%), Positives = 83/157 (52%), Gaps = 9/157 (5%)
Query: 65 STFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH-EQGTATY 123
+T ++ + +D N + ++ +Y R+Y S + RF +F +NL I + E T T
Sbjct: 12 NTQEVSDEIDTANLYANWKMKYNRRYTSQRDEMYRFKVFSDNLNYIRAFQDSTESATYTL 71
Query: 124 GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKV-LP 182
+N+FADM+ EF SL + L ++ N++ Y AE +++ D KV P
Sbjct: 72 ELNQFADMSQQEFASTYLSLRVPKTAKLNAS------NANFQYKGAE-VDWTDNKKVKYP 124
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V++Q CGSCWA SAV LE I+ N+ ELS+Q
Sbjct: 125 AVKNQGSCGSCWAFSAVGALEINTDIELNKKYELSEQ 161
>gi|228245|prf||1801240C Cys protease 3
Length = 321
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 73/141 (51%), Gaps = 10/141 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYGV--NRFADMTDSEFNH 138
F +Y R+Y E R +F+ N + I D+ K E G T+ V N+F DMT+ EFN
Sbjct: 22 FKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEFNA 81
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ K+ F ++ +A ++++ K V P V+DQ CGSCWA SA
Sbjct: 82 VMKGYKKGSRGEPKAVF------TAEGRPMARDVDWRTKALVTP-VKDQEQCGSCWAFSA 134
Query: 199 VACLESAYAIKHNELIELSKQ 219
LE + +K++EL+ LS+Q
Sbjct: 135 TGALEGQHFLKNDELVSLSEQ 155
>gi|121531598|gb|ABM55484.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 326
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 80/146 (54%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y + E + RF IF+ NL I ++ ++++G TY GV RFAD+T
Sbjct: 21 DQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFADLTH 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF L QI+N K + +SI++ +KG VL +V+DQ+ CGSC
Sbjct: 81 EEFKDILKG----QIKN-KPRLNATPTVFPEDLEVPDSIDWTEKGAVL-EVKDQNPCGSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA L+ AI +N I LS+Q
Sbjct: 135 WAFSATGALKGQNAILNNVKISLSEQ 160
>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 77/150 (51%), Gaps = 18/150 (12%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYG--VNRFA 129
LDHG ++ V E +RRF +F+ NL I ++ K+E+G ++ V +FA
Sbjct: 28 LDHGKTYRSVVEE-----------KRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
DMT EF L L + + L S + ++++++++G V P V+DQ
Sbjct: 77 DMTHEEF---LDLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTP-VKDQAN 132
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E + K+ L+ LS Q
Sbjct: 133 CGSCWAFSAVGAIEGQFFKKNGTLVSLSAQ 162
>gi|343476707|emb|CCD12272.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 447
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 74/145 (51%), Gaps = 11/145 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT---ATYGVNRFADMTDS 134
QF F ++Y R Y +E RF +F+ N++ K E AT+GV RF+DM+
Sbjct: 40 QFAAFKQKYSRSYKDATEEAFRFRVFKQNMER----AKEEAAANPYATFGVTRFSDMSPE 95
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + LK + + ++ + E+++++ KG V P V+DQ CGSCW
Sbjct: 96 EFRATYHNGAEYYAAALKRPRKVVTVSTGKA---PEAVDWRKKGAVTP-VKDQGQCGSCW 151
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+ +E + + + L LS+Q
Sbjct: 152 AFSAIGNIEGQWKVTGHNLTSLSEQ 176
>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 80/152 (52%), Gaps = 22/152 (14%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYG--VNRFA 129
LDHG ++ V E +RRF +F+ NL I ++ K+E+G ++ V +FA
Sbjct: 28 LDHGKTYRSLVEE-----------KRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE--SINYKDKGKVLPKVQDQ 187
DMT EF L L + + L S F++S + E +++++++G V P +DQ
Sbjct: 77 DMTHEEF---LDLLKLQGVPALPSN--AVHFDNSEDIDMEEKDAVDWREEGAVTP-AKDQ 130
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E + K+ L+ LS Q
Sbjct: 131 ANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQ 162
>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 80/152 (52%), Gaps = 22/152 (14%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYG--VNRFA 129
LDHG ++ V E +RRF +F+ NL I ++ K+E+G ++ V +FA
Sbjct: 28 LDHGKTYRSLVEE-----------KRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE--SINYKDKGKVLPKVQDQ 187
DMT EF L L + + L S F++ + E +I+++++G V P V+DQ
Sbjct: 77 DMTHEEF---LDLLKLQGVPALPSN--AVHFDNFEDIDMEEKDAIDWREEGAVTP-VKDQ 130
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E + K+ L+ LS Q
Sbjct: 131 ANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQ 162
>gi|294885989|ref|XP_002771502.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875206|gb|EER03318.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 54/155 (34%), Positives = 79/155 (50%), Gaps = 10/155 (6%)
Query: 66 TFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGV 125
DLE G F F +++ + YD+ E +R IF +NL I+ + GV
Sbjct: 17 AVDLEA---AGLAFIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVNAQNL-SYKLGV 72
Query: 126 NRFADMTDSEFNH-GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
N + D+T EF LSS D E + F + ++ + L S++++ KG VL V
Sbjct: 73 NEYTDLTLEEFAALKLSSTDMS--EGMGDGFVAGAGPTTTT--LPTSVDWRKKG-VLNPV 127
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+DQ CGSCWA SA+ LE YAI +L+ LS+Q
Sbjct: 128 KDQGYCGSCWAFSAIGALEPRYAIATGKLLSLSEQ 162
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 79/147 (53%), Gaps = 10/147 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSE 135
+ F F Y + Y ++ E +RR+ IF+NNL I +T ++QG + + +N F D++ E
Sbjct: 115 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYI--HTHNQQGYSYSLKMNHFGDLSRDE 172
Query: 136 FNHGLSSLDWEQIENLKS---TFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
F L +++ NLKS T N S L ++++ +G V P V+DQ CGS
Sbjct: 173 FRR--KYLGFKKSRNLKSHHLGVATELLNVLPSE-LPAGVDWRSRGCVTP-VKDQRDCGS 228
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA S LE A+ K +L+ LS+Q
Sbjct: 229 CWAFSTTGALEGAHCAKTGKLVSLSEQ 255
>gi|11055|emb|CAA45129.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 320
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 74/141 (52%), Gaps = 10/141 (7%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYGV--NRFADMTDSEFNH 138
F +Y R+Y E R +F+ N + I D+ K E G T+ V N+F DMT+ EFN
Sbjct: 22 FKTQYGRKYGDAKEELYRQRVFQQNEQLIEDFNKKFENGEVTFKVAMNQFGDMTNEEFNA 81
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ K+ F ++ + +A ++++ K V P V+DQ CGSCWA SA
Sbjct: 82 VMKGYKKGSRGEPKAVF------TAEAGPMAADVDWRTKALVTP-VKDQEQCGSCWAFSA 134
Query: 199 VACLESAYAIKHNELIELSKQ 219
LE + +K++EL+ LS+Q
Sbjct: 135 TGALEGQHFLKNDELVSLSEQ 155
>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 49/150 (32%), Positives = 77/150 (51%), Gaps = 18/150 (12%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYG--VNRFA 129
LDHG ++ V E +RRF +F+ NL I ++ K+E+G ++ V +FA
Sbjct: 28 LDHGKTYRSVVEE-----------KRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
DMT EF L L + + L S + ++++++++G V P V+DQ
Sbjct: 77 DMTHEEF---LDLLKLQGVPALPSNAVHFDNFEDTDMEEKDAVDWREEGAVTP-VKDQAN 132
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E + K+ L+ LS Q
Sbjct: 133 CGSCWAFSAVGAIEGQFFKKNGTLVSLSAQ 162
>gi|356545108|ref|XP_003540987.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 365
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 74/144 (51%), Gaps = 13/144 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F +F R + + YDS+ E + R+ +F+ N++ + + +A +GV RF+D+T SEF +
Sbjct: 50 FLEFKRRFGKAYDSEDEHDYRYKVFKANMRRARRHQSLDP-SAAHGVTRFSDLTPSEFRN 108
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
GL + N T + L +++D G V P V++Q CGSCW+
Sbjct: 109 KVLGLRGVRLPLDANKAPILPTDN--------LPSDFDWRDHGAVTP-VKNQGSCGSCWS 159
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S LE A+ + EL+ LS+Q
Sbjct: 160 FSTTGALEGAHFLSTGELVSLSEQ 183
>gi|403258371|ref|XP_003921746.1| PREDICTED: pro-cathepsin H [Saimiri boliviensis boliviensis]
Length = 336
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 48/162 (29%), Positives = 79/162 (48%), Gaps = 16/162 (9%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E S LE+F FK ++ ++ + Y + E R F +N + I+ H G
Sbjct: 21 GAAELSVNSLEKF-----HFKSWMAKHHKTYSREEEYHHRLQTFASNWRKIN---AHNNG 72
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ VN+FADM+ +E W + +N +T Y + Y S++++ K
Sbjct: 73 NHTFKMAVNQFADMSFAEIKR---KYLWSEPQNCSATKSNY-LRGTGPY--PPSVDWRKK 126
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 127 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 168
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 48/153 (31%), Positives = 78/153 (50%), Gaps = 12/153 (7%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
+ ++ ++ E + Y+S E E RF+IF++NL+ ID + + + G+NRFAD+TD E+
Sbjct: 40 DMYESWLVEQGKSYNSLDEKEMRFEIFKDNLRIIDDHNADANRSFSLGLNRFADLTDEEY 99
Query: 137 NH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
G S ++ N + L ++++ G V+ V++Q LC SC
Sbjct: 100 RSTYLGFKSGPKAKVSN--------RYVPKVGDVLPNYVDWRTVGAVV-GVKNQGLCSSC 150
Query: 194 WAHSAVACLESAYAIKHNELIELSKQPPKTHGR 226
WA SAVA +E I L+ LS+Q GR
Sbjct: 151 WAFSAVAAVEGINKIMTGNLLSLSEQELVDCGR 183
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 69/140 (49%), Gaps = 1/140 (0%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
++++ EY + Y +E E+RF IF++N++ I+ + GVN AD+T EF
Sbjct: 39 ENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDS 98
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ L E +TF+ F N + E+I+++ KG V P CG WA S +
Sbjct: 99 RNGLK-RTYEFSTTTFKLNGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGRFWAFSTI 157
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E + I L+ LS+Q
Sbjct: 158 AATEGIHQISTGNLVSLSEQ 177
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 79/147 (53%), Gaps = 10/147 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG-TATYGVNRFADMTDSE 135
+ F F Y + Y ++ E +RR+ IF+NNL I +T ++QG + + +N F D++ E
Sbjct: 114 DAFSSFQAMYAKSYATEEEKQRRYAIFKNNLVYI--HTHNQQGYSYSLKMNHFGDLSRDE 171
Query: 136 FNHGLSSLDWEQIENLKS---TFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
F L +++ NLKS T N S L ++++ +G V P V+DQ CGS
Sbjct: 172 FRR--KYLGFKKSRNLKSHHLGVATELLNVLPSE-LPAGVDWRSRGCVTP-VKDQRDCGS 227
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA S LE A+ K +L+ LS+Q
Sbjct: 228 CWAFSTTGALEGAHCAKTGKLVSLSEQ 254
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 73.9 bits (180), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 51/164 (31%), Positives = 85/164 (51%), Gaps = 10/164 (6%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQ 118
GS+ S FDL + Q+ F +++QY+S++E R IF N + + K + Q
Sbjct: 13 GSQAVSFFDLVQ-----EQWGAFKVTHKKQYESETEERFRMKIFMENAHKVAKHNKLYAQ 67
Query: 119 GTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYK 175
G ++ GVN+++DM + EF H L+ + + E+ +F + L + I+++
Sbjct: 68 GLVSFKLGVNKYSDMLNHEFVHTLNGYNRSKTPLRSGELDESITFIPPANVELPKQIDWR 127
Query: 176 DKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G V P V+DQ CGSCW+ S LE + K +L+ LS+Q
Sbjct: 128 KLGAVTP-VKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQ 170
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 69/135 (51%), Gaps = 6/135 (4%)
Query: 86 YERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDW 145
Y + Y +E ERRF IF+NN++ I+ + VN+FAD T+ +F +
Sbjct: 45 YGKVYVDAAEKERRFKIFKNNVEYIESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRR 104
Query: 146 E-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLES 204
Q +K T SF N + +++++ KG V P ++DQ CGSCWA S VA E
Sbjct: 105 PFQTRPMKVT----SFKYENVTAVPATMDWRKKGAVTP-IKDQGQCGSCWAFSTVAATEG 159
Query: 205 AYAIKHNELIELSKQ 219
+ +L+ LS+Q
Sbjct: 160 INQLTTGKLVSLSEQ 174
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 79/147 (53%), Gaps = 6/147 (4%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++F+ ++ + R Y E +RRF+++R N++ ++ + G N+FAD+T+ EF
Sbjct: 29 DRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGY-KLADNKFADLTNEEF 87
Query: 137 NHGLSS----LDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
+ + QI N S SS+ L +S+++++KG V+ + + GS
Sbjct: 88 RAKMLGFRPHVTIPQISNTCSADIAMPGESSDDI-LPKSVDWRNKGAVINRWKICVDAGS 146
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA SAVA +E IK+ EL+ LS+Q
Sbjct: 147 CWAFSAVAAIEGINQIKNGELVSLSEQ 173
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 73.9 bits (180), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 49/163 (30%), Positives = 87/163 (53%), Gaps = 12/163 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ ++ + Y++ E E+RF IF++N ID + + G+NRFAD+T+ E+
Sbjct: 44 YESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNEEYRS 103
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G+ + D + K + ++ + S L ES+++++ G V V+DQ CGSCWA
Sbjct: 104 KYTGIRTKDSRK----KVSGKSQRYASLAGESLPESVDWREHGAV-ASVKDQGQCGSCWA 158
Query: 196 HSAVACLESAYAIKHNELIELSKQPPKTHGRFY----KGGVMN 234
S ++ +E I +LI LS+Q R Y GG+M+
Sbjct: 159 FSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMD 201
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 73.6 bits (179), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 71/144 (49%), Gaps = 12/144 (8%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ E+ R Y ++E RR +IFR N + ID + + + NRFAD+TD EF +
Sbjct: 50 WMAEHGRTYTDEAEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDEEFRAART 109
Query: 142 SLDWEQIENLKST------FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ +E +S + A+S++++ G V V+DQ CG CWA
Sbjct: 110 GFRPRPAPAAAAGSGGRFRYENFSLADA-----AQSVDWRAMGAV-TGVKDQGECGCCWA 163
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
SAVA +E I+ L+ LS+Q
Sbjct: 164 FSAVAAVEGLNKIRTGRLVSLSEQ 187
>gi|113120273|gb|ABI30276.1| VXH-C [Vasconcellea x heilbornii]
Length = 282
Score = 73.6 bits (179), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 76/144 (52%), Gaps = 9/144 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++++ Y S E RF+IF++NL ID T + + G+N FAD+T EF
Sbjct: 48 FESWMLKHDKVYKSMEEKINRFEIFKDNLMYIDE-TNKKNNSYWLGLNEFADLTHDEFKK 106
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G D+ IE F + ES++++ KG V P V+DQ+ CGSCWA
Sbjct: 107 KYVGSIPEDYTIIEQSDDG----EFPYKHVVDYPESVDWRQKGAVTP-VKDQNPCGSCWA 161
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S VA +E I +LI LS+Q
Sbjct: 162 FSTVATVEGINKIVTGKLISLSEQ 185
>gi|426369382|ref|XP_004051670.1| PREDICTED: cathepsin F [Gorilla gorilla gorilla]
Length = 517
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 67/129 (51%), Gaps = 8/129 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK+FV Y R Y+S E R +F NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 220 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 279
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L+SL E+ N ++ + + +++ KG V KV+DQ +CGSCWA S
Sbjct: 280 IYLNSLLREEPGNKMKQAKSVGDLAPPEW------DWRSKGAVT-KVKDQGMCGSCWAFS 332
Query: 198 AVACLESAY 206
+E +
Sbjct: 333 VTGNVEGQW 341
>gi|17978641|gb|AAL48319.1| vinckepain-2 [Plasmodium vinckei]
Length = 470
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 53/180 (29%), Positives = 86/180 (47%), Gaps = 20/180 (11%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F +F+++Y +QY+S E++ RF IF LK I+ + K + +N F+D+
Sbjct: 147 LEAVNIFYNFMKKYNKQYNSAEEMQERFYIFSEKLKKIEKHNKENKYMYKKAINSFSDLH 206
Query: 133 DSEFNHGL--------SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
EF S++D + + Y S S +++DK V+ V
Sbjct: 207 PEEFKMRFLNSKIKDDSAIDLRYLVPYSAALGKYK--SPTDKVNYRSFDWRDKD-VIIDV 263
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ-----PPKTHGRFYKGGVMNLPHML 239
+DQ C SCWA S + + YAI+ N+ I LS+Q P G +GG+ +P+ L
Sbjct: 264 KDQKKCASCWAFSVAGVVSAQYAIRQNKKISLSEQQLVDCAPNNFG--CEGGI--IPYAL 319
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 83/162 (51%), Gaps = 13/162 (8%)
Query: 62 EEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGT 120
EE + L+ DHG + Q ++RF+IF++NL+ ID + K++ T
Sbjct: 43 EEVRSIYLQWSADHGKTNNNNNGIINDQ-------DKRFNIFKDNLRFIDLHNEKNKNAT 95
Query: 121 ATYGVNRFADMTDSEFNH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
G+ +F D+T+ E+ G + +I K+ + YS + + + E+++++ K
Sbjct: 96 YKLGLTKFTDLTNEEYRSLYLGARTEPVRRIAKAKNVNQKYSA-AVDGKEVPETVDWRLK 154
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G V P ++DQ CGSCWA S A +E I ELI LS+Q
Sbjct: 155 GAVNP-IKDQGTCGSCWAFSTAAAVEGINKIVTGELISLSEQ 195
>gi|294901125|ref|XP_002777247.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
gi|239884778|gb|EER09063.1| cysteine proteinase, putative [Perkinsus marinus ATCC 50983]
Length = 214
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 79/153 (51%), Gaps = 10/153 (6%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
DLE G F F +++ + YD+ E +R IF +NL I+ + GVN
Sbjct: 19 DLEA---AGLAFIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVNAQNL-SYKLGVNE 74
Query: 128 FADMTDSEFNH-GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
+ D+T EF LSS D E + F + ++ + L S++++ KG VL V+D
Sbjct: 75 YTDLTLEEFAALKLSSTDMS--EGMGDGFVAGAGPTTTT--LPTSVDWRKKG-VLNPVKD 129
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA SA+ LE YAI +L+ LS+Q
Sbjct: 130 QGYCGSCWAFSAIGALEPRYAIATGKLLSLSEQ 162
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/166 (31%), Positives = 87/166 (52%), Gaps = 12/166 (7%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQ 118
G++ S FDL + Q+ F ++++QY SD+E + R IF N + K +E
Sbjct: 13 GAQAVSFFDLVQ-----EQWGTFKLQHKKQYKSDTEEKFRMKIFMENSHKVAKXNKLYEM 67
Query: 119 GTATYG--VNRFADMTDSEFNHGLSSLDWEQIENLKSTFET---YSFNSSNSYGLAESIN 173
G +Y +N++ADM EF H ++ + + L T E +F + + E+++
Sbjct: 68 GLVSYKLKINKYADMLHHEFVHTVNGFNRTKNTPLLGTSEDEQGATFIAPANVKFPENVD 127
Query: 174 YKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+++ G V V+DQ CGSCW+ SA LE + K N+L+ LS+Q
Sbjct: 128 WREHGAV-TXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQ 172
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 75/144 (52%), Gaps = 6/144 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF--- 136
+ ++ + R Y + E + RFD+F+ NLK I+ + K T GVN FAD T EF
Sbjct: 39 QQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGVNEFADWTKEEFIAT 98
Query: 137 NHGLSSLD-WEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ GL + E + ++++N S+ G E +++ +G V P V+ Q CG CWA
Sbjct: 99 HTGLKGFNGIPSSEFVDEMIPSWNWNVSDVAG-PEIKDWRYEGAVTP-VKYQGQCGCCWA 156
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S+VA +E I L+ LS+Q
Sbjct: 157 FSSVAAVEGLTKIVGGNLVSLSEQ 180
>gi|294889982|ref|XP_002773024.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239877727|gb|EER04840.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 306
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/155 (33%), Positives = 83/155 (53%), Gaps = 16/155 (10%)
Query: 71 EFLDHGN---QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
E LD G F F ++ R+Y+S E +R IF+ NL I+ T + + GVN
Sbjct: 17 ECLDKGATDLAFIGFQHKFGRKYESKEEEVKRNAIFQANLHHIEQ-TNAKNLSYKLGVNE 75
Query: 128 FADMTDSEF---NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
+AD+T EF G + + +NL S+++ LA S+++++KG VL +
Sbjct: 76 YADLTHEEFAALKLGTLKMSMRRDDNL--------LVSADTTQLATSVDWRNKG-VLTPI 126
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++Q CGSCWA S+ LE+ YAI ++L+ S+Q
Sbjct: 127 KNQGSCGSCWAFSSTGALEAQYAIATSKLLSFSEQ 161
>gi|261289781|ref|XP_002611752.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
gi|229297124|gb|EEN67762.1| hypothetical protein BRAFLDRAFT_284342 [Branchiostoma floridae]
Length = 327
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 75/141 (53%), Gaps = 5/141 (3%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH-EQGTATY--GVNRFADMTDSEFNH 138
F + Y R Y ++ E RR IF +NLKTI + + ++G T+ GVN++ADMT EF
Sbjct: 26 FKKAYNRVYAAEEEYARRL-IFEDNLKTIQMHNEEADRGLHTFRLGVNQYADMTHKEFLE 84
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ KST + + ++++++DKG V P V++Q CGSCWA S
Sbjct: 85 NVIGGCLLDTNTSKSTADHVHEYDPTLTDVPDTVDWRDKGYVTP-VKNQAQCGSCWAFST 143
Query: 199 VACLESAYAIKHNELIELSKQ 219
LE + N+L+ LS+Q
Sbjct: 144 TGSLEGQHFKATNKLVSLSEQ 164
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 72/141 (51%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ E+ + Y S E RF++FR NL ID +E + G+N FAD+T EF
Sbjct: 51 FESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQ-RNNEINSYWLGLNEFADLTHEEFKG 109
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L Q + + +F + L +S++++ KG V P V+DQ CGSCWA S
Sbjct: 110 RYLGLAKPQFSRKRQP--SANFRYRDITDLPKSVDWRKKGAVAP-VKDQGQCGSCWAFST 166
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I L LS+Q
Sbjct: 167 VAAVEGINQITTGNLSSLSEQ 187
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 74/133 (55%), Gaps = 8/133 (6%)
Query: 91 DSDSEIERRFDIFRNNLKTIDYYTKH----EQGTATYGVNRFADMTDSEFNHGLSSLDWE 146
+S E ERRF F +NL+ +D + E+G +NRFAD+T+ EF + +
Sbjct: 66 NSIPERERRFRAFWDNLRFVDAHNARAAAGEEGF-RLAMNRFADLTNDEFRAAYLGVKGQ 124
Query: 147 QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAY 206
+ + E Y + + L E++++++KG V P V++Q CGSCWA SA++ +ES
Sbjct: 125 RARPGRVVGERYRHDGAEE--LPEAVDWREKGAVAP-VKNQGQCGSCWAFSAISTVESIN 181
Query: 207 AIKHNELIELSKQ 219
I E++ LS+Q
Sbjct: 182 QIVTGEMVTLSEQ 194
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 77/154 (50%), Gaps = 17/154 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ +Y + Y+S E ERRF+IF+ L+ ID + + G+N+FAD T+ EF
Sbjct: 42 YESWLTKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYRVGLNQFADQTNEEF-- 99
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYG------LAESINYKDKGKVLPKVQDQHLCGS 192
Q L T + SN Y L + ++++ G V+ ++ Q CGS
Sbjct: 100 --------QSTYLGFTSGSNKMKVSNRYEPRVGQVLPDYVDWRSAGAVV-DIKSQGQCGS 150
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQPPKTHGR 226
CWA SA+A +E I +LI LS+Q GR
Sbjct: 151 CWAFSAIATVEGINKIVTGDLISLSEQELVDCGR 184
>gi|33333710|gb|AAQ11973.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 80/152 (52%), Gaps = 22/152 (14%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYG--VNRFA 129
LDHG ++ V E +RRF +F+ NL I ++ K+E+G ++ V +FA
Sbjct: 28 LDHGKTYRSLVEE-----------KRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE--SINYKDKGKVLPKVQDQ 187
DMT EF L L + + L S F++ + E +++++++G V P V+DQ
Sbjct: 77 DMTHEEF---LDLLKLQGVPALPSN--AVHFDNFEDIDMEEKDAVDWREEGAVTP-VKDQ 130
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E + K+ L+ LS Q
Sbjct: 131 ANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQ 162
>gi|389583697|dbj|GAB66431.1| vivapain [Plasmodium cynomolgi strain B]
Length = 487
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 79/155 (50%), Gaps = 9/155 (5%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ N F FV+E+ ++Y + E+++R+ F NL I + E G+N+F D++
Sbjct: 161 LESVNSFYLFVKEFGKKYKTADEMQQRYQSFVENLAKIKAHNSKENVLYRKGMNQFGDLS 220
Query: 133 DSEFNHG---LSSLDWEQIENLKSTFETY-----SFNSSNSYGLAESINYKDKGKVLPKV 184
EF L S D++ + Y + ++ E +++ V P V
Sbjct: 221 FEEFKKKFLTLKSFDFKTYGGKLKGVDKYEDVIIKYKPKDATFDREKYDWRLHKGVTP-V 279
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+DQ CGSCWA S+V +ES Y+I+ NEL+ LS+Q
Sbjct: 280 KDQGDCGSCWAFSSVGVVESQYSIRKNELVSLSEQ 314
>gi|307206026|gb|EFN84119.1| Cathepsin O [Harpegnathos saltator]
Length = 353
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 48/151 (31%), Positives = 77/151 (50%), Gaps = 11/151 (7%)
Query: 79 FKDFVREYERQYDSDS-EIERRFDIFRNNLKTIDYYT--KHEQGTATYGVNRFADMTDSE 135
F D+V Y + Y D E RFD F+ +L+ I+ + Q +A YG+ F+D+++ E
Sbjct: 36 FVDYVARYNKSYRHDPPEYNERFDRFQRSLRHIERMNGFRSSQESAYYGLTEFSDLSEDE 95
Query: 136 FNHGLSSLDWE---QIENLKSTFETYSFNSSN----SYGLAESINYKDKGKVLPKVQDQH 188
F D Q+ S + ++ N++N + I+++DKG V P +Q Q
Sbjct: 96 FVQRTLLPDLSSRGQMHKAASYYHRHTKNTNNRSERETNVPPKIDWRDKGVVGP-IQSQE 154
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+CG+CWA S + ES YA+K+ L S Q
Sbjct: 155 ICGACWAFSTIGVAESMYAMKNGTLYPFSVQ 185
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 90/187 (48%), Gaps = 17/187 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ ++ + Y++ E ERRF IF++NL+ I+ + + G+N+FAD+T+ E+
Sbjct: 48 YEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEYRA 107
Query: 139 GLSSLDWEQIENLKSTF----ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+N + + Y++ + L +++++KG V P ++DQ CGSCW
Sbjct: 108 MFLGTRTRGPKNKAAVVAKKTDRYAYRAGEE--LPAMVDWREKGAVTP-IKDQGQCGSCW 164
Query: 195 AHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLNHAVL 252
A S V +E I L LS+Q R Y +M C+ G Y+ V
Sbjct: 165 AFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGY--------NMGCNGGLMDYAFEFIVQ 216
Query: 253 NVGYDNE 259
N G D E
Sbjct: 217 NGGIDTE 223
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 73.6 bits (179), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 57/190 (30%), Positives = 98/190 (51%), Gaps = 22/190 (11%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSE 135
+++++ ++ + Y+ E ++RF+IF++NL ID +H TY G+N+FAD T+ E
Sbjct: 34 MYEEWLVKHHKVYNGLGEKDQRFEIFKDNLGFID---EHNAQNYTYKVGLNKFADTTNEE 90
Query: 136 FNH---GLSSLDWEQIENLK-STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
+ + G + + +K +T Y+FNS + L ++++ KG V ++DQ CG
Sbjct: 91 YRNMYLGTKNDAKRNVMKIKITTGHRYAFNSGDR--LPVHVDWRSKGAV-AHIKDQGSCG 147
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLNH 249
SCWA S +A +E+ I +L+ LS+Q R + G C+ G Y+
Sbjct: 148 SCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFNEG--------CNGGLMDYAFEF 199
Query: 250 AVLNVGYDNE 259
V N G D E
Sbjct: 200 IVENGGIDTE 209
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 80/154 (51%), Gaps = 6/154 (3%)
Query: 66 TFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGV 125
FD+E L F+ ++ ++ + YDS +E ERR IF++NL+ I G G+
Sbjct: 55 VFDVEASL----IFESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSENLGY-RLGL 109
Query: 126 NRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQ 185
NRFAD++ E+ D + N + + +S L +S++++++G V +V+
Sbjct: 110 NRFADLSLHEYKEICHGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAV-TEVK 168
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
DQ C SCWA S V +E I EL+ LS+Q
Sbjct: 169 DQGHCRSCWAFSTVGAVEGLNKIVTGELVTLSEQ 202
>gi|22653681|sp|Q9TST1.2|CATW_FELCA RecName: Full=Cathepsin W; Flags: Precursor
Length = 374
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 74/151 (49%), Gaps = 11/151 (7%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ F F +Y R Y + E RR DIF +NL + + GTA +GV F+D+T
Sbjct: 36 LELKQAFTLFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEEEDLGTAEFGVTPFSDLT 95
Query: 133 DSEFN--HGLSSLDWEQIENLKSTFETYSFNSSNSYG--LAESINYKDKGKVLPKVQDQH 188
+ EF +G +D E + + S +G + + +++ V+ V+ Q
Sbjct: 96 EEEFGRLYGHRRMDGEAPKVGREV-------GSEEWGESVPPTCDWRKLDGVISSVKKQE 148
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
C CWA +A +E+ +AIK+ + +ELS Q
Sbjct: 149 SCSCCWAMAAAGNIEALWAIKYRQSVELSVQ 179
>gi|7242888|dbj|BAA92495.1| cysteine protease [Vigna mungo]
Length = 364
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 81/153 (52%), Gaps = 13/153 (8%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+ L+ + F +F ++ + Y + E + RF +F++NL+ + + + +A +GV +F+
Sbjct: 41 DHLLNAEHHFSNFKAKFGKTYATKEEHDHRFGVFKSNLRRARLHAQLDP-SAVHGVTKFS 99
Query: 130 DMTDSEFNH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
D+T +EF GL L L + + +N+ L + +++DKG V V+D
Sbjct: 100 DLTAAEFQRQFLGLKPL------GLPANAQKAPILPTNN--LPKDFDWRDKGAVT-NVKD 150
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCW+ S LE A+ + EL+ LS+Q
Sbjct: 151 QGACGSCWSFSTTGALEGAHFLATGELVSLSEQ 183
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 77/144 (53%), Gaps = 8/144 (5%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY-GVNRFADMTDSEF---- 136
++ ++ R Y +E RRF+ FRNN+ I+ + + GVN+F D+T+ EF
Sbjct: 40 WMAQHGRVYKDGAEKARRFEAFRNNVVFIESFNAAGNRRKFWLGVNQFTDLTNDEFRATK 99
Query: 137 -NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
N G + + N S T+ +++ ++ L +++++ KG V P +++Q CG CWA
Sbjct: 100 TNKGFIKRNAAAV-NKASPTGTFRYSNVSADALPAAVDWRAKGAVTP-IKNQGQCGCCWA 157
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
SAVA E + +L+ LS+Q
Sbjct: 158 FSAVAATEGIVQLSTGKLVPLSEQ 181
>gi|356541074|ref|XP_003539008.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 363
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 72/144 (50%), Gaps = 13/144 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F DF R + + Y S E RF++F+ N++ + + +A +GV RF+D+T SEF +
Sbjct: 48 FLDFKRRFGKAYASQEEHNYRFEVFKANMRRARRHQSLDP-SAAHGVTRFSDLTASEFRN 106
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
GL + N T + L +++D G V P V++Q CGSCW+
Sbjct: 107 KVLGLRGVRLPSNANKAPILPTDN--------LPSDFDWRDHGAVTP-VKNQGSCGSCWS 157
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S LE A+ + EL+ LS+Q
Sbjct: 158 FSTTGALEGAHFLSTGELVSLSEQ 181
>gi|308476152|ref|XP_003100293.1| hypothetical protein CRE_21852 [Caenorhabditis remanei]
gi|308265817|gb|EFP09770.1| hypothetical protein CRE_21852 [Caenorhabditis remanei]
Length = 391
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 77/147 (52%), Gaps = 15/147 (10%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
F DF+ +Y+R+Y S E + R+ +F N+K + G VN F D T+ E
Sbjct: 89 FNDFILKYDRRYPSLEEFQYRYQVFLQNVKEFEAEEAKHFGL-DLDVNEFTDWTNEELQR 147
Query: 137 ----NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
N + + E++ S E+ SI+++D+GK+ P +++Q CGS
Sbjct: 148 IVYDNKNVKTDGSEEVRFEGSYLES-------GVKRPASIDWRDQGKLTP-IKNQGQCGS 199
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA + VA +E+ +AI+ N+L+ LS+Q
Sbjct: 200 CWAFATVAAVEAQHAIRKNQLVSLSEQ 226
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 73.6 bits (179), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 54/186 (29%), Positives = 96/186 (51%), Gaps = 19/186 (10%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
+++++ + + Y++ E E+RF +F++NL+ ID + E T G+N FAD+T+ E+
Sbjct: 52 YEEWLVKQGKVYNALGEREKRFQVFKDNLRFIDEHNS-ENRTYKLGLNGFADLTNEEYRS 110
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L + + L+ T + Y+ S L +S++++ +G V +V+DQ CGSCWA S
Sbjct: 111 TYLGARGGMKRNRLRKTSDRYAPRVGES--LPDSVDWRKEGAV-AEVKDQGSCGSCWAFS 167
Query: 198 AVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVLN 253
+A +E I +LI LS+Q ++ GG+M+ Y+ + N
Sbjct: 168 TIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFEFIINN 217
Query: 254 VGYDNE 259
G D E
Sbjct: 218 GGIDTE 223
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 72/141 (51%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ E+ + Y S E RF++FR NL ID +E + G+N FAD+T EF
Sbjct: 51 FESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQ-RNNEINSYWLGLNEFADLTHEEFKG 109
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L Q + + +F + L +S++++ KG V P V+DQ CGSCWA S
Sbjct: 110 RYLGLAKPQFSRKRQP--SANFRYRDITDLPKSVDWRKKGAVAP-VKDQGQCGSCWAFST 166
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I L LS+Q
Sbjct: 167 VAAVEGINQITTGNLSSLSEQ 187
>gi|8050826|gb|AAF71757.1| cysteine protease falcipain-3 [Plasmodium falciparum]
Length = 488
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 81/160 (50%), Gaps = 10/160 (6%)
Query: 69 LEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRF 128
L + L+ N F F++E ++Y++ E+++RF IF N + I+ + K G+N+F
Sbjct: 157 LMDNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKF 216
Query: 129 ADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSN---SYGLAE------SINYKDKGK 179
D++ EF +L S +Y N + Y A+ + +++ G
Sbjct: 217 GDLSPEEFRSKYLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGG 276
Query: 180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V P V+DQ LCGSCWA S+V +ES YAI+ L S+Q
Sbjct: 277 VTP-VKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQ 315
>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
Length = 347
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 53/147 (36%), Positives = 82/147 (55%), Gaps = 11/147 (7%)
Query: 77 NQFKDFVREYERQYD-SDSEIERRFDIFRNNLKTIDYYT-KHEQGTATY--GVNRFADMT 132
N ++ + + Y +QY+ + E+ RR I+ NLK + + +H G +Y +N +DMT
Sbjct: 41 NHWELWKKTYGKQYEEQNQEVTRRL-IWEKNLKFVTLHNLEHSMGLHSYDLSMNHLSDMT 99
Query: 133 DSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
E +SSL +I N S TY NS+ L +S++++DKG V +V+ Q CGS
Sbjct: 100 SEEVASLMSSL---RIPNQWSRNTTYRLNSNQK--LPDSVDWRDKGCV-TEVKYQGTCGS 153
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA SAV LE+ +K +L+ LS Q
Sbjct: 154 CWAFSAVGALEAQLKLKTGKLVSLSAQ 180
>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 80/152 (52%), Gaps = 22/152 (14%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYG--VNRFA 129
LDHG ++ V E +RRF +F+ NL I ++ K+E+G ++ V +FA
Sbjct: 28 LDHGKTYRSLVEE-----------KRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE--SINYKDKGKVLPKVQDQ 187
DMT EF L L + + L S F++ + E +++++++G V P V+DQ
Sbjct: 77 DMTHEEF---LDLLKLQGVPALPSN--AVHFDNFEDIDMEEKDAVDWREEGAVTP-VKDQ 130
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E + K+ L+ LS Q
Sbjct: 131 ANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQ 162
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 80/151 (52%), Gaps = 13/151 (8%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQ----GTATY--GVNRFADM 131
+++ F E+ +QY ++E R IF N I KH Q G ++ G+N++ADM
Sbjct: 27 EWQTFKLEHRKQYQDETEERFRLKIFNENKHKI---AKHNQLYAAGEVSFKMGLNKYADM 83
Query: 132 TDSEFNHGLSSLDW---EQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
EF+ ++ ++ +Q+ +TF +F S L +S+++++KG V V+DQ
Sbjct: 84 LHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAV-TGVKDQG 142
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S+ LE + K LI LS+Q
Sbjct: 143 HCGSCWAFSSTGALEGQHFRKTGTLISLSEQ 173
>gi|124803852|ref|XP_001347833.1| falcipain-3 [Plasmodium falciparum 3D7]
gi|9255922|gb|AAF86352.1|AF282974_1 cysteine protease falcipain-3 [Plasmodium falciparum]
gi|23496085|gb|AAN35746.1|AE014838_24 falcipain-3 [Plasmodium falciparum 3D7]
Length = 492
Score = 73.2 bits (178), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 81/160 (50%), Gaps = 10/160 (6%)
Query: 69 LEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRF 128
L + L+ N F F++E ++Y++ E+++RF IF N + I+ + K G+N+F
Sbjct: 161 LMDNLETVNLFYIFLKENNKKYETSEEMQKRFIIFSENYRKIELHNKKTNSLYKRGMNKF 220
Query: 129 ADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSN---SYGLAE------SINYKDKGK 179
D++ EF +L S +Y N + Y A+ + +++ G
Sbjct: 221 GDLSPEEFRSKYLNLKTHGPFKTLSPPVSYEANYEDVIKKYKPADAKLDRIAYDWRLHGG 280
Query: 180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V P V+DQ LCGSCWA S+V +ES YAI+ L S+Q
Sbjct: 281 VTP-VKDQALCGSCWAFSSVGSVESQYAIRKKALFLFSEQ 319
>gi|301784869|ref|XP_002927853.1| PREDICTED: cathepsin F-like [Ailuropoda melanoleuca]
Length = 394
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/141 (33%), Positives = 70/141 (49%), Gaps = 24/141 (17%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK+FV Y R Y+S E E R +F NN+ ++GTA YG+ +F+D+T+ EF
Sbjct: 97 FKEFVTTYNRTYESKEEAEWRMSVFSNNVMRAQKIQALDRGTAQYGITKFSDLTEEEFRT 156
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESI--------NYKDKGKVLPKVQDQHL 189
L+ L E N LA+SI ++++KG V +V+DQ +
Sbjct: 157 IYLNPLLRE--------------NRGKKMDLAKSIGDSAPPEWDWRNKGAVT-QVKDQGM 201
Query: 190 CGSCWAHSAVACLESAYAIKH 210
CGSCWA S +E + +K
Sbjct: 202 CGSCWAFSVTGNVEGQWFLKR 222
>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 80/152 (52%), Gaps = 22/152 (14%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYG--VNRFA 129
LDHG ++ V E +RRF +F+ NL I ++ K+E+G ++ V +FA
Sbjct: 28 LDHGKTYRSLVEE-----------KRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE--SINYKDKGKVLPKVQDQ 187
DMT EF L L + + L S F++ + E +++++++G V P V+DQ
Sbjct: 77 DMTHEEF---LDLLKLQGVPALPSN--AVHFDNFEDIDMEEKDAVDWREEGAVTP-VKDQ 130
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E + K+ L+ LS Q
Sbjct: 131 ANCGSCWAFSAVGAIEGQFFKKNGTLVSLSAQ 162
>gi|42564157|gb|AAS20590.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 322
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 76/146 (52%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y S E RF IF+NNL+TI+ + ++E+G TY V +FADMT
Sbjct: 21 DQWVAFKQTHGKTYKSLLEERTRFGIFQNNLRTIEKHNAEYEEGKVTYYMAVTQFADMTR 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF L L + NL +T + L E I++ +KG VLP ++Q C SC
Sbjct: 81 DEFRKKL-GLQNNRRPNLNATLRVF----PEDLELPEQIDWTEKGAVLP-AKNQGNCRSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA S LE AI + LS+Q
Sbjct: 135 WAFSTTGSLEGQNAIHNKVKTPLSEQ 160
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 78/144 (54%), Gaps = 11/144 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQGTATY--GVNRFADMTDSE 135
+++F ++ R+Y E R ++F +NL+ I+ + K +E+G TY +N+F+DMT+ +
Sbjct: 20 WEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEK 79
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
FN + K F S+++ + ++++ KG V P V+DQ CGSCWA
Sbjct: 80 FNAVMKGYK-------KGPRPAAVFTSTDAAPESTEVDWRTKGAVTP-VKDQGQCGSCWA 131
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E + +K L+ LS+Q
Sbjct: 132 FSTTGGIEGQHFLKTGRLVSLSEQ 155
>gi|294869083|ref|XP_002765753.1| Cysteine proteinase 3 precursor, putative [Perkinsus marinus ATCC
50983]
gi|239865917|gb|EEQ98470.1| Cysteine proteinase 3 precursor, putative [Perkinsus marinus ATCC
50983]
Length = 174
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 79/153 (51%), Gaps = 10/153 (6%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
DLE G F F +++ + YD+ E +R IF +NL I+ + GVN
Sbjct: 19 DLEA---AGLAFIGFQKKHGKSYDNKEEEMKRAAIFHDNLNYIEEVNAQNL-SYKLGVNE 74
Query: 128 FADMTDSEFNH-GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
+ D+T EF LSS D E + F + ++ + L S++++ KG VL V+D
Sbjct: 75 YTDLTLEEFAALKLSSTDMS--EGMGDGFVAGAGPTTTT--LPTSVDWRKKG-VLNPVKD 129
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA SA+ LE YAI +L+ LS+Q
Sbjct: 130 QGYCGSCWAFSAIGALEPRYAIATGKLLSLSEQ 162
>gi|171460937|ref|NP_001116343.1| cathepsin W precursor [Felis catus]
gi|6165261|emb|CAB59816.1| cysteine protease [Felis catus]
Length = 344
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/151 (29%), Positives = 74/151 (49%), Gaps = 11/151 (7%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ F F +Y R Y + E RR DIF +NL + + GTA +GV F+D+T
Sbjct: 36 LELKQAFTLFQIQYNRSYSNPEEYARRLDIFAHNLAQAQQLEEEDLGTAEFGVTPFSDLT 95
Query: 133 DSEFN--HGLSSLDWEQIENLKSTFETYSFNSSNSYG--LAESINYKDKGKVLPKVQDQH 188
+ EF +G +D E + + S +G + + +++ V+ V+ Q
Sbjct: 96 EEEFGRLYGHRRMDGEAPKVGREV-------GSEEWGESVPPTCDWRKLDGVISSVKKQE 148
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
C CWA +A +E+ +AIK+ + +ELS Q
Sbjct: 149 SCSCCWAMAAAGNIEALWAIKYRQSVELSVQ 179
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/143 (33%), Positives = 74/143 (51%), Gaps = 2/143 (1%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N FK + ++ + Y S E +R+ IF+ NL I T + G+ G+N+FAD+T EF
Sbjct: 43 NLFKSWSVKHRKIYVSPKEKLKRYGIFKQNLMHIAE-TNRKNGSYWLGLNQFADITHEEF 101
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
L T +F + + L S++++ KG V P V++Q CGSCWA
Sbjct: 102 KANHLGLKQGLSRMGAQTRTPTTFRYAAAANLPWSVDWRYKGAVTP-VKNQGKCGSCWAF 160
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S+VA +E I +L+ LS+Q
Sbjct: 161 SSVAAVEGINQIVTGKLVSLSEQ 183
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/149 (31%), Positives = 73/149 (48%), Gaps = 8/149 (5%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++F+ + + R Y S E RRF+++R N++ ID + T G N+FAD+T EF
Sbjct: 43 DRFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADLTGEEF 102
Query: 137 ------NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
H S++ + + + S S++++ KG V P C
Sbjct: 103 LARYAGGHTGSAI--TTAAEADGLWSSGGSDGSLEADPPASVDWRAKGAVTPVKNQGSQC 160
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
SCWA SAVA +ES Y IK +L+ LS+Q
Sbjct: 161 YSCWAFSAVATMESLYFIKTGKLVALSEQ 189
>gi|294883334|ref|XP_002770714.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239873999|gb|EER02719.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 330
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 81/159 (50%), Gaps = 10/159 (6%)
Query: 65 STFDLEEFLDHGN---QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA 121
S L + LD G F F ++ + Y+S E +R IF+ NL I+ + +
Sbjct: 11 SFLPLVKCLDEGTVELAFMGFQHKFGKNYESKEEEVKRNAIFQANLHLIEQ-VNAKNLSY 69
Query: 122 TYGVNRFADMTDSEFNH-GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKV 180
GVN +AD+T EF L +L E+ + F S+++ L S+++++K V
Sbjct: 70 KLGVNEYADLTHEEFAALKLGTLKMRPAEHASLSL----FVSADTTQLPTSVDWRNK-SV 124
Query: 181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L V+DQ CGSCWA SA LE+ YAI +L LS+Q
Sbjct: 125 LSPVKDQGSCGSCWAFSAAGALEAQYAIATGKLRPLSEQ 163
>gi|294885991|ref|XP_002771503.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
gi|239875207|gb|EER03319.1| thiolproteinase SmTP1, putative [Perkinsus marinus ATCC 50983]
Length = 337
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/142 (35%), Positives = 75/142 (52%), Gaps = 7/142 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F +++ + YD+ E +R IF +NL I+ + GVN + D+T EF
Sbjct: 27 FIGFQKKHGKSYDNKDEEMKRAAIFHDNLNYIEEVNAQNL-SYKLGVNEYTDLTLEEFAA 85
Query: 139 -GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
LSS D E + F + ++ + L S++++ KG VL V+DQ CGSCWA S
Sbjct: 86 LKLSSTDMS--EGMGDGFVAGAGPTTTT--LPTSVDWRKKG-VLNPVKDQGYCGSCWAFS 140
Query: 198 AVACLESAYAIKHNELIELSKQ 219
A+ LE YAI +L+ LS+Q
Sbjct: 141 AIGALEPRYAIATGKLLSLSEQ 162
>gi|195128637|ref|XP_002008768.1| GI13676 [Drosophila mojavensis]
gi|193920377|gb|EDW19244.1| GI13676 [Drosophila mojavensis]
Length = 324
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/152 (33%), Positives = 84/152 (55%), Gaps = 22/152 (14%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
+F+ F ++E+ Y+ E R +IF+ N +TID + ++ +G TY G+N+F+DM
Sbjct: 28 EFEAFKLKHEKSYEDIDEENLRMEIFKLNKETIDKHNARYARGLETYEMGINQFSDMLPE 87
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSS-------NSYGLAESINYKDKGKVLPKVQDQ 187
EF +QI + S T +F+SS ++ + E ++++ KG V +++Q
Sbjct: 88 EF---------KQI--MLSNMNTTNFDSSIDSIYLPHNIEITEEVDWRRKGAV-SAMKNQ 135
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S LES + IK +LI LS+Q
Sbjct: 136 GSCGSCWAFSVTGALESQHFIKTKKLISLSEQ 167
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/188 (27%), Positives = 96/188 (51%), Gaps = 21/188 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+ +++ + R Y++ ERR+ +FR+NL+ ID + + G ++ G+NRFAD+T+ E
Sbjct: 44 YAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ + K ++++++ L ES++++ KG V +V+DQ CG+CWA
Sbjct: 104 YPATYLGARTRPQRDRKLGAR---YHAADNEDLPESVDWRAKGAV-AEVKDQGSCGTCWA 159
Query: 196 HSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAV 251
S +A +E I +LI LS+Q ++ + GG+M+ Y+ +
Sbjct: 160 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMD----------YAFEFII 209
Query: 252 LNVGYDNE 259
N G D E
Sbjct: 210 NNGGIDTE 217
>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
Length = 355
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 78/144 (54%), Gaps = 3/144 (2%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
+ F++++ ++++ Y++ E E+RF IF+NNL+ ID + T G+N FAD+T++E+
Sbjct: 43 SMFEEWLVKHDKVYNALGEKEKRFQIFKNNLRFIDERNSLNR-TYKLGLNVFADLTNAEY 101
Query: 137 NHGLSSLDWEQIENLK-STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ W+ L T + + +S++++ +G V P C SCWA
Sbjct: 102 -RAMYLRTWDDGPRLDLDTPPRNRYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWA 160
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
+AV +ES IK +LI LS+Q
Sbjct: 161 FTAVGAVESLVKIKTGDLISLSEQ 184
>gi|6978721|ref|NP_037071.1| pro-cathepsin H precursor [Rattus norvegicus]
gi|115729|sp|P00786.1|CATH_RAT RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|55886|emb|CAA68699.1| cathepsin H pre-pro-peptide [Rattus norvegicus]
gi|55391460|gb|AAH85352.1| Cathepsin H [Rattus norvegicus]
gi|149018921|gb|EDL77562.1| cathepsin H, isoform CRA_a [Rattus norvegicus]
gi|226475|prf||1514114A cathepsin H
Length = 333
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/160 (28%), Positives = 81/160 (50%), Gaps = 13/160 (8%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
+ E + +E+F F +++++++ Y S E R +F NN + I + +
Sbjct: 19 ATAELTVNAIEKF-----HFTSWMKQHQKTYSSR-EYSHRLQVFANNWRKIQAHNQRNH- 71
Query: 120 TATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGK 179
T G+N+F+DM+ +E H W + +N +T Y + Y S++++ KG
Sbjct: 72 TFKMGLNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PSSMDWRKKGN 125
Query: 180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+ V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 VVSPVKNQGACGSCWTFSTTGALESAVAIASGKMMTLAEQ 165
>gi|307169691|gb|EFN62267.1| Cathepsin O [Camponotus floridanus]
Length = 358
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/161 (29%), Positives = 89/161 (55%), Gaps = 16/161 (9%)
Query: 73 LDHGNQFKDFVREYERQYDSDS-EIERRFDIFRNNLKTIDYYTKHE--QGTATYGVNRFA 129
++ F++++ +Y + Y +DS E ++RF+ F+ +L+ I+ + Q +A YG+ +F+
Sbjct: 30 VEDAKLFENYIVQYNKSYRNDSTEYKKRFECFQKSLRHIEKMNSFQSSQESAYYGLTKFS 89
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYS---FNSSNSYG--------LAESINYKDKG 178
D+++ EF D + N K T +Y F +S+++G + ++++++G
Sbjct: 90 DLSEDEFLQQTLLPDLS-LRNQKHTTASYYHQYFTNSSNHGKRAIIPPPIPSKVDWRNRG 148
Query: 179 KVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V P VQ Q CG+CWA S + +ES YAIK+ L S Q
Sbjct: 149 VVGP-VQYQDNCGACWAFSTIGVVESMYAIKNGTLYPFSVQ 188
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 73/150 (48%), Gaps = 14/150 (9%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++K ++ +Y R+Y D+E RF +F+ N + ID + G N+FAD+T EF
Sbjct: 58 RYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFA 117
Query: 138 HGLSSL--------DWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
+ L +QI S ++ ++ + ++++ +G V P V++Q
Sbjct: 118 AMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDV-----QVDWRQQGAVTP-VKNQGQ 171
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CG CWA SAV +E I L+ LS+Q
Sbjct: 172 CGCCWAFSAVGAMEGLIMITTGNLVSLSEQ 201
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 73.2 bits (178), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 50/152 (32%), Positives = 83/152 (54%), Gaps = 8/152 (5%)
Query: 72 FLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLK-TIDYYTKHEQGTATY--GVNRF 128
F D +K++ E+ ++Y SD E R I++ NL I + K++ G TY G+N+F
Sbjct: 21 FTDFDEDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIRHNLKYDLGHFTYDLGMNQF 80
Query: 129 ADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG-LAESINYKDKGKVLPKVQDQ 187
AD+ + EF ++ ++ + +F N+ G L ++++++ KG V P V+DQ
Sbjct: 81 ADLQNKEFVAMMTGF---RVNGTSKAAKGSTFLPPNNVGKLPKTVDWRTKGYVTP-VKDQ 136
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SA LE + K +L+ LS+Q
Sbjct: 137 GQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQ 168
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 67/132 (50%), Gaps = 4/132 (3%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
+ Y E E+RF IF N+ ++ + G+N+F D+T+ EF ++ + +
Sbjct: 144 KVYKDPREREKRFRIFNENVNYVEAFNNAANKPYKLGINQFXDLTNQEF---IAPRNRFK 200
Query: 148 IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYA 207
S T +F N + +++++ G V P V+DQ CG CWA SAVA E +A
Sbjct: 201 GHMCSSIIRTTTFKYENVTTVPSTVDWRQNGAVTP-VKDQGQCGCCWAFSAVAATEGIHA 259
Query: 208 IKHNELIELSKQ 219
+ +LI LS+Q
Sbjct: 260 LSGGKLISLSEQ 271
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/189 (28%), Positives = 91/189 (48%), Gaps = 14/189 (7%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N ++ ++ + + Y++ E E RF IF +NLK ID + + G+N+FAD+T+ E+
Sbjct: 34 NTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGLNQFADLTNEEY 93
Query: 137 NH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
G + +I ++ + + + ++++++G V P V++Q CGSC
Sbjct: 94 RSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAVSP-VKNQGGCGSC 152
Query: 194 WAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKGP--YSLNHAV 251
WA S VA +E I +LI LS+Q Y G C+ G Y+ V
Sbjct: 153 WAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSG--------CNGGSMDYAFQFIV 204
Query: 252 LNVGYDNES 260
N G D+ES
Sbjct: 205 SNGGIDSES 213
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 73/141 (51%), Gaps = 6/141 (4%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+ ++ + + Y +E ERRF+IF++N++ I+ + VN+FAD+T+ E
Sbjct: 39 EQWMETFGKVYADAAEKERRFEIFKDNVEYIESFNTAGNKPYKLSVNKFADLTNEELKVA 98
Query: 140 LSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ Q +K T SF N + +++++ KG V P ++DQ CGSCWA S
Sbjct: 99 RNGYRRPLQTRPMKVT----SFKYENVTAVPATMDWRKKGAVTP-IKDQGQCGSCWAFST 153
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA E + +L+ LS+Q
Sbjct: 154 VAATEGINQLTTGKLVSLSEQ 174
>gi|85068700|gb|ABC69430.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 76/144 (52%), Gaps = 11/144 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKT 90
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + + E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDLTPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQP 220
S + + + K L+ LS+QP
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQP 166
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/170 (32%), Positives = 86/170 (50%), Gaps = 19/170 (11%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG-LSSLDWEQIENLKS 153
E ERRF +F++NL+ ID + E + G+NRFAD+T+ E+ L + + L
Sbjct: 70 EKERRFQVFKDNLRFIDEHNS-ENRSYKVGLNRFADLTNEEYRSMYLGARSGAKRNRLSR 128
Query: 154 TFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNEL 213
+ Y +S L +S++++ +G V +V+DQ CGSCWA S +A +E I +L
Sbjct: 129 SSNRYLPRVGDS--LPDSVDWRKEGAV-AEVKDQGSCGSCWAFSTIAAVEGINKIVTGDL 185
Query: 214 IELSKQPPKTHGRFY----KGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE 259
I LS+Q R Y GG+M+ Y+ + N G D+E
Sbjct: 186 ISLSEQELVDCDRSYNEGCNGGLMD----------YAFQFIINNGGIDSE 225
>gi|110349475|gb|ABG73218.1| cathepsin L 2 precursor [Diaprepes abbreviatus]
Length = 348
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/160 (33%), Positives = 85/160 (53%), Gaps = 19/160 (11%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQGTATY--GVNRFADMTDS 134
Q++ F E+ + Y+S+SE E R +F NL I+ + K +E G ++Y +N D+T
Sbjct: 27 QWEQFKLEHGKVYESESENEYRQSVFMENLFQINEHNKLYEMGLSSYQMAMNHLGDLTKD 86
Query: 135 EFN--HGLSSLDWEQIENLKST------------FETYSFNSS-NSYGLAESINYKDKGK 179
EF + ++ Q ENL + F TY+ ++ + L I+++ KG
Sbjct: 87 EFMRIYTVNMPQLPQSENLSDSEPWLDLPQDLQGFVTYALPTNLDEVDLPTDIDWRQKGA 146
Query: 180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V P V++Q CGSCW+ SA LE+ + K N+LI LS+Q
Sbjct: 147 VTP-VKNQRNCGSCWSFSATGALEAQWFKKTNKLISLSEQ 185
>gi|118150|sp|P25804.1|CYSP_PEA RecName: Full=Cysteine proteinase 15A; AltName:
Full=Turgor-responsive protein 15A; Flags: Precursor
gi|20679|emb|CAA38242.1| unnamed protein product [Pisum sativum]
Length = 363
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 76/150 (50%), Gaps = 6/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+ L+ + F F ++ + Y + E + RF +F++NL + ++ TA +G+ +F+
Sbjct: 39 DHLLNAEHHFTSFKSKFSKSYATKEEHDYRFGVFKSNLIKAKLH-QNRDPTAEHGITKFS 97
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T SEF L + L+ + L E ++++KG V P V+DQ
Sbjct: 98 DLTASEFRRQFLGLK----KRLRLPAHAQKAPILPTTNLPEDFDWREKGAVTP-VKDQGS 152
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S LE A+ + +L+ LS+Q
Sbjct: 153 CGSCWAFSTTGALEGAHYLATGKLVSLSEQ 182
>gi|325185566|emb|CCA20049.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
Length = 371
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/148 (33%), Positives = 83/148 (56%), Gaps = 10/148 (6%)
Query: 79 FKDFVREYERQY----DSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADM 131
F ++ EY ++Y +++RF+ F+ N+K I+ + K++ G T+ G+N AD+
Sbjct: 39 FVEYATEYGKEYLNYNKDHGLVQKRFEAFQANMKRIEAHNAKYQAGEYTFELGLNEIADL 98
Query: 132 TDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
TD+E+ L + E+ +S +++ + + ES +++ V P V++Q CG
Sbjct: 99 TDTEYKQFLGYK--RRSESSESQNAGVYSDANLAEDVPESWDWRTHDAVTP-VKNQGQCG 155
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
SCWA SAVA LESAYAI L+ S+Q
Sbjct: 156 SCWAFSAVAALESAYAISTGTLVSFSEQ 183
>gi|146335582|gb|ABQ23400.1| cathepsin L isotype 3 [Trypanoplasma borreli]
Length = 442
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 73/143 (51%), Gaps = 5/143 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
F+DF + R Y S E +RF+IF N+K + AT+G N FADM+ EF
Sbjct: 25 FRDFKTTHARNYASADEERKRFEIFAANMKKAAELNRKNP-MATFGPNEFADMSSEEFQT 83
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
H + + +T++ N+ + + ++++ KG V P V++Q CGSCW+
Sbjct: 84 RHNAARHYAAVMARPPKNTKTFTEEEINA-AVGQKVDWRLKGAVTP-VKNQGSCGSCWSF 141
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E +AI +L+ LS+Q
Sbjct: 142 STTGNIEGQHAIATGQLVSLSEQ 164
>gi|296213765|ref|XP_002753411.1| PREDICTED: pro-cathepsin H [Callithrix jacchus]
Length = 336
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 80/162 (49%), Gaps = 16/162 (9%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E S LE+F FK ++ ++ + Y + E +R F +N + I+ H G
Sbjct: 21 GAAELSVNSLEKF-----HFKSWMAKHHKTYSREEEYHQRLQTFASNWRKIN---AHNNG 72
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ VN+F+DM+ +E W + +N +T Y + Y S++++ K
Sbjct: 73 NHTFKMAVNQFSDMSFAEIKR---KYLWSEPQNCSATKSNY-LRGTGPY--PPSVDWRKK 126
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 127 GHFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 168
>gi|410907221|ref|XP_003967090.1| PREDICTED: pro-cathepsin H-like [Takifugu rubripes]
Length = 324
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 79/145 (54%), Gaps = 15/145 (10%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYG--VNRFADMTDSE 135
+F+ ++ + + Y D ++R +F N + ID KH +G ++ +N+++DMT +E
Sbjct: 26 EFRSWMALHNKAYVKD--FDQRLQVFTENKRRID---KHNEGNHSFAMRLNQYSDMTFAE 80
Query: 136 F-NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
F H L W + +N +T +Y +S ESI+++ KG + V++Q CGSCW
Sbjct: 81 FRKHFL----WAEPQNCSATKGSYIQTNSPH---PESIDWRKKGNYVTPVKNQGSCGSCW 133
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
S CLES AI +L+ LS+Q
Sbjct: 134 TFSTTGCLESVTAINSGKLVPLSEQ 158
>gi|166235890|ref|NP_031827.2| pro-cathepsin H preproprotein [Mus musculus]
gi|341940309|sp|P49935.2|CATH_MOUSE RecName: Full=Pro-cathepsin H; AltName: Full=Cathepsin B3; AltName:
Full=Cathepsin BA; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|74151776|dbj|BAE29677.1| unnamed protein product [Mus musculus]
gi|74181999|dbj|BAE34071.1| unnamed protein product [Mus musculus]
gi|74211659|dbj|BAE29188.1| unnamed protein product [Mus musculus]
gi|74213518|dbj|BAE35569.1| unnamed protein product [Mus musculus]
gi|148688954|gb|EDL20901.1| cathepsin H, isoform CRA_b [Mus musculus]
Length = 333
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 74/141 (52%), Gaps = 8/141 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK +++++++ Y S E R +F NN + I + + T +N+F+DM+ +E H
Sbjct: 33 FKSWMKQHQKTYSS-VEYNHRLQMFANNWRKIQAHNQRNH-TFKMALNQFSDMSFAEIKH 90
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
W + +N +T Y + Y S++++ KG V+ V++Q CGSCW S
Sbjct: 91 ---KFLWSEPQNCSATKSNY-LRGTGPY--PSSMDWRKKGNVVSPVKNQGACGSCWTFST 144
Query: 199 VACLESAYAIKHNELIELSKQ 219
LESA AI +++ L++Q
Sbjct: 145 TGALESAVAIASGKMLSLAEQ 165
>gi|47086663|ref|NP_997853.1| cathepsin H precursor [Danio rerio]
gi|45709087|gb|AAH67615.1| Cathepsin H [Danio rerio]
Length = 330
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/144 (33%), Positives = 81/144 (56%), Gaps = 14/144 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
FK ++ +Y ++Y+ + E +R IF N K ID +H +G + G+N+F+DMT +EF
Sbjct: 30 FKSWMSQYNKKYEIN-EFYQRLQIFLENKKRID---QHNEGNHKFSMGLNQFSDMTFAEF 85
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGL-AESINYKDKGKVLPKVQDQHLCGSCWA 195
+ + +N +T N +S GL ++I+++ KG + V++Q CGSCW
Sbjct: 86 K---KTYLLTEPQNCSAT----RGNHVSSNGLYPDAIDWRTKGHYITDVKNQGPCGSCWT 138
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S CLES AI +L++L++Q
Sbjct: 139 FSTTGCLESVTAIATGKLLQLAEQ 162
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 45/133 (33%), Positives = 75/133 (56%), Gaps = 11/133 (8%)
Query: 94 SEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSS-------LDWE 146
+E+ RF +F+NN K + + + +N+FADM+D EF + SS L +
Sbjct: 55 NEMHNRFKVFKNNAKHV-FKVNLMGKSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAK 113
Query: 147 QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAY 206
+IE + + +N+ + SI+++ KG V +++Q CGSCWA +AVA +ES +
Sbjct: 114 KIEATGGRIGGFMYEHANN--IPSSIDWRKKGAV-NAIKNQGRCGSCWAFAAVAAVESIH 170
Query: 207 AIKHNELIELSKQ 219
IK NEL+ LS++
Sbjct: 171 QIKTNELVSLSEE 183
>gi|85068698|gb|ABC69429.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 84/175 (48%), Gaps = 27/175 (15%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKT 90
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + S E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDPSPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNL 235
S + + + K L+ LS+Q PP+T+ K G + L
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKMGGLEL 197
>gi|345783063|ref|XP_533219.3| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Canis lupus
familiaris]
Length = 490
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 69/142 (48%), Gaps = 25/142 (17%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK+FV Y R Y++ E E R +F NN+ ++GTA YG+ +F+D+T+ EF
Sbjct: 192 FKEFVTTYNRTYETKEEAEWRMSVFSNNMVRAQKIQALDRGTAQYGITKFSDLTEEEFRT 251
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESI---------NYKDKGKVLPKVQDQH 188
L+ L E N LA+SI +++ KG V KV+DQ
Sbjct: 252 IYLNPLLRE--------------NRGKKMRLAKSISDHAPPPEWDWRSKGAVT-KVKDQG 296
Query: 189 LCGSCWAHSAVACLESAYAIKH 210
+CGSCWA S +E + +K
Sbjct: 297 MCGSCWAFSVTGNVEGQWFLKE 318
>gi|225706914|gb|ACO09303.1| Cathepsin H precursor [Osmerus mordax]
Length = 328
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 78/144 (54%), Gaps = 14/144 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
FK ++ ++ +QYD + E R IF N I+ +H G Y G+N F+DMT EF
Sbjct: 30 FKSWMMQHNKQYDIE-EYYHRLQIFIENKMKIE---RHNGGNHKYRMGLNTFSDMTFDEF 85
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGL-AESINYKDKGKVLPKVQDQHLCGSCWA 195
SS + +N +T T+ +S GL +S++++ KG + V++Q CGSCW
Sbjct: 86 R---SSFLLTEPQNCSATKGTHV----SSKGLYPDSVDWRKKGNYVTNVKNQGPCGSCWT 138
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S CLES AI +L++LS+Q
Sbjct: 139 FSTTGCLESVTAISTGKLLQLSEQ 162
>gi|388509526|gb|AFK42829.1| unknown [Lotus japonicus]
Length = 333
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 51/158 (32%), Positives = 82/158 (51%), Gaps = 9/158 (5%)
Query: 64 ASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY 123
A TF++E LD + + F + +QY + EI RR N + +H+ G TY
Sbjct: 17 APTFNVE--LD--SHWALFKTTFGKQYSTAEEITRRLAWEANVAIIRQHNLEHDLGLHTY 72
Query: 124 --GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVL 181
G+N +AD+T++EFN ++ L + + TY + L S++++ KG V
Sbjct: 73 TLGLNNYADLTNAEFNQVMNGLRVNASQTKSANRRTYV--APVGVELPTSVDWRTKGYVT 130
Query: 182 PKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P ++DQ CGSCWA S+ LE + K +L+ LS+Q
Sbjct: 131 P-IKDQGQCGSCWAFSSTGSLEGQHFAKTGQLVSLSEQ 167
>gi|330376140|gb|AEC13302.1| cathepsin H [Gallus gallus]
Length = 329
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 44/143 (30%), Positives = 77/143 (53%), Gaps = 12/143 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
FK ++ ++ R+Y + E ERR +F N + I+ H G +++ +N+F+DMT +EF
Sbjct: 29 FKAWMLQHGRRYGA-GEYERRLRVFVGNKRHIE---GHNAGNSSFQMALNQFSDMTFAEF 84
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
W + +N +T + + E+++++ KG + V++Q CGSCW
Sbjct: 85 K---KLYLWSEPQNCSATRGNFLRSDGPC---PEAVDWRKKGNFVTPVKNQGPCGSCWTF 138
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S CLESA AI +L+ L++Q
Sbjct: 139 STTGCLESAIAIATGKLLSLAEQ 161
>gi|37788267|gb|AAO64473.1| cathepsin H precursor [Fundulus heteroclitus]
Length = 345
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 54/172 (31%), Positives = 81/172 (47%), Gaps = 22/172 (12%)
Query: 64 ASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG--TA 121
AS F L E + FK ++ +Y + YD + E RR IF N + ID KH +G +
Sbjct: 14 ASAFYLSERDEF--HFKSWMAQYNKAYDFN-EYYRRLQIFTENKRRID---KHNEGNHSF 67
Query: 122 TYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE----------- 170
T G+N+F+ MT +EF + + K + + + N + E
Sbjct: 68 TMGLNQFSGMTFNEFRKAFLMSEPQNCSATKGNYLSSNLNQFSGMTFNEFRKAFLMSEGP 127
Query: 171 ---SINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
SI+++ KG + V+ Q CGSCW S CLES AI +L+ LS+Q
Sbjct: 128 QPDSIDWRKKGNYITPVKTQGSCGSCWTFSTTGCLESVTAIATVKLVPLSEQ 179
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 72.8 bits (177), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 73/141 (51%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++ + Y+S E RF+IF +NLK ID T + G+N FAD+T EF H
Sbjct: 49 FESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDE-TNKKVSNYWLGLNEFADLTHEEFKH 107
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
E E + + F + L +S++++ KG V P V++Q CG+CWA S
Sbjct: 108 KFLGFKGELAERKDES--SKEFGYRDFVDLPKSVDWRKKGAVAP-VKNQGQCGNCWAFST 164
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I L LS+Q
Sbjct: 165 VAAVEGINQIVTGNLTMLSEQ 185
>gi|221056026|ref|XP_002259151.1| P.knowlesi ortholog of falcipain [Plasmodium knowlesi strain H]
gi|193809222|emb|CAQ39924.1| P.knowlesi ortholog of falcipain [Plasmodium knowlesi strain H]
Length = 477
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 42/153 (27%), Positives = 84/153 (54%), Gaps = 6/153 (3%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L++ N F F++E+ ++Y + E+++R+ F NL I+ + E G N+++DM+
Sbjct: 153 LENVNSFYLFIKEHGKKYQTPDEMQQRYLSFVENLAKINAHNNKENVLYKKGTNQYSDMS 212
Query: 133 DSEFNHGLSSLDWEQIENLKSTFETYSFNSS-NSYGLAESINYKDK-----GKVLPKVQD 186
EF + +L ++ + ++ +++ Y A+++ +K + ++++
Sbjct: 213 FEEFRKTMLTLRFDLTRKIGNSPHVSNYDDVLKKYKPADAVVDNEKYDWREHNAVSEIKN 272
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q LCGSCWA AV +ES YAI+ N+ I +S+Q
Sbjct: 273 QDLCGSCWAFGAVGAVESQYAIRKNQHILISEQ 305
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/141 (35%), Positives = 77/141 (54%), Gaps = 6/141 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ + + Y+S E RF++F+ NLK ID K E + G+N FAD++ EF
Sbjct: 47 FESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNK-EVTSYWLGLNEFADLSHEEFKS 105
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L + + KS+ E +S+ + L +SI+++ KG V P V++Q CGSCWA S
Sbjct: 106 KFLGL-YPEFPRKKSS-EDFSYR--DVVDLPKSIDWRKKGAVTP-VKNQGSCGSCWAFST 160
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I L LS+Q
Sbjct: 161 VAAVEGINQIVAGNLTSLSEQ 181
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 72/141 (51%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ E+ + Y S E RF++FR NL ID +E + G+N FAD+T EF
Sbjct: 51 FESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQ-RNNEINSYWLGLNEFADLTHEEFKG 109
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L Q + + +F + L +S++++ KG V P V+DQ CGSCWA S
Sbjct: 110 RYLGLAKPQFSRKRQP--SANFRYRDITDLPKSVDWRKKGAVAP-VKDQGQCGSCWAFST 166
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I L LS+Q
Sbjct: 167 VAAVEGINQITTGNLSSLSEQ 187
>gi|395742406|ref|XP_003777749.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pongo abelii]
Length = 490
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/129 (34%), Positives = 66/129 (51%), Gaps = 8/129 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK+FV Y R Y+S E R IF NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 193 FKNFVITYNRTYESKEEARWRLSIFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 252
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L+ L E+ N ++ + + +++ KG V KV+DQ +CGSCWA S
Sbjct: 253 IYLNPLLREEPSNKMKQAKSVGDLAPPEW------DWRSKGAVT-KVKDQGMCGSCWAFS 305
Query: 198 AVACLESAY 206
+E +
Sbjct: 306 VTGNVEGQW 314
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/118 (37%), Positives = 62/118 (52%), Gaps = 3/118 (2%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R Y D E ERRF IF+ NLK I+ + T G+N FAD+TD EF + +
Sbjct: 47 RTYQDDEEKERRFHIFKKNLKHIENFNNAFNRTYKLGLNHFADLTDEEFLATYTGYKMPK 106
Query: 148 IENLK--STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLE 203
+ +T T S + + ESI+++ +G V P V++Q CG CWA SA A +E
Sbjct: 107 VLPTANITTKTTQSSDVLYEANVPESIDWRTRGVVTP-VKNQGRCGCCWAFSAAAAVE 163
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 77/142 (54%), Gaps = 5/142 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ + E ++ Y + E RF+IF++NL ID T + + G+N FAD+T EF
Sbjct: 22 FESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDE-TNKKNSSYWLGLNEFADLTHDEFKA 80
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ SL + +S E + + Y ESI+++ KG V P V++Q+ CGSCWA S
Sbjct: 81 KYVGSLGEDSTIIEQSDDEEFPYKHVVDY--PESIDWRQKGAVTP-VKNQNPCGSCWAFS 137
Query: 198 AVACLESAYAIKHNELIELSKQ 219
VA +E I +LI LS+Q
Sbjct: 138 TVATVEGINKIVTGKLISLSEQ 159
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 87/163 (53%), Gaps = 13/163 (7%)
Query: 61 SEEASTFDLEEFLDHGNQFKDFVREYERQY-DSDSEIERRFDIFRNNLKTIDYYT-KHEQ 118
S A+ D + LDH + + + Y +QY + + E+ RR I+ NLK + + +H
Sbjct: 21 SYAAAPVDRDPALDH--HWNLWKKTYGKQYKEKNEEVARRL-IWEKNLKFVTLHNLEHSM 77
Query: 119 GTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD 176
G +Y G+N DMT E +SSL ++ + TY NS+ L +S+++++
Sbjct: 78 GMHSYDLGMNHLGDMTSEEVISLMSSL---RVPSQWPRNVTYKSNSNQK--LPDSVDWRE 132
Query: 177 KGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
KG V KV+ Q CG+CWA SAV LE+ +K +L+ LS Q
Sbjct: 133 KGCVT-KVKYQGACGACWAFSAVGALEAQLKLKTGKLVSLSAQ 174
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 86/174 (49%), Gaps = 8/174 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F + ++ + Y S E +R++IF+ NL+ I T G+ G+N FAD+ EF
Sbjct: 46 FTSWSVKHSKIYASPKEKVKRYEIFKRNLRHI-VETNRRNGSYWLGLNHFADIAHEEFKA 104
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L + +F +N+ L +++++ KG V P V++Q CGSCWA S
Sbjct: 105 SYLGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGAVTP-VKNQGECGSCWAFST 163
Query: 199 VACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMN--LPHMLCSKGPYS 246
VA +E I +L+ LS+Q T +GG+M+ +++ ++G Y+
Sbjct: 164 VAAVEGINQIVTGKLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYT 217
>gi|164605518|dbj|BAF98584.1| CM0216.500.nc [Lotus japonicus]
Length = 360
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 82/150 (54%), Gaps = 7/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+E L + F +F R + + Y ++ E RF++F++N+ + + +A +GV RF+
Sbjct: 36 DEGLGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-SAVHGVTRFS 94
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T EF H + L + L S ++ +++ L + ++++ G V P V++Q
Sbjct: 95 DLTPMEFRHSVLGL---RGVGLPSDADSAPILPTDN--LPKDFDWREHGAVTP-VKNQGS 148
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ SA LE A+ + EL+ LS+Q
Sbjct: 149 CGSCWSFSATGALEGAHFLSTGELVSLSEQ 178
>gi|155970232|gb|ABU41785.1| cysteine protease [Rosa x borboniana]
Length = 357
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 53/151 (35%), Positives = 76/151 (50%), Gaps = 20/151 (13%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDS 134
H F F YE++Y+S E+ RRF+IF N K I T + + GVNRFAD T
Sbjct: 54 HVRSFARFAYRYEKRYESVEEMGRRFEIFAENKKLIRS-TNRKGLSYKLGVNRFADWTWE 112
Query: 135 EFN-HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESI-----NYKDKGKVLPKVQDQH 188
EF H L + +N +T + ++ L +++ N++D+G V P V+DQ
Sbjct: 113 EFQRHRLGA-----AQNCSAT-------TKGNHKLTDAVPPLTKNWRDEGIVTP-VKDQG 159
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW S LE+AY + I S+Q
Sbjct: 160 HCGSCWTFSTTGALEAAYVQAFGKQISPSEQ 190
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/174 (28%), Positives = 86/174 (49%), Gaps = 8/174 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F + ++ + Y S E +R++IF+ NL+ I T G+ G+N FAD+ EF
Sbjct: 55 FTSWSVKHSKIYASPKEKVKRYEIFKRNLRHI-VETNRRNGSYWLGLNHFADIAHEEFKA 113
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L + +F +N+ L +++++ KG V P V++Q CGSCWA S
Sbjct: 114 SYLGLKPGLARRDAQPHGSTTFRYANAVNLPWAVDWRKKGAVTP-VKNQGECGSCWAFST 172
Query: 199 VACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMN--LPHMLCSKGPYS 246
VA +E I +L+ LS+Q T +GG+M+ +++ ++G Y+
Sbjct: 173 VAAVEGINQIVTGKLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYT 226
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/138 (31%), Positives = 74/138 (53%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +E RRF+IF+ N+ I+ + VN+FAD+T+ EF +
Sbjct: 40 WMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNH-KFWLSVNQFADLTNYEFRATKT 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + I + T+ + + + L +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 99 NKGF--IPSTVRVPTTFRYENVSIDTLPATVDWRTKGAVTP-IKDQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELIELSKQ 219
+E + +LI LS+Q
Sbjct: 156 MEGIVKLSTGKLISLSEQ 173
>gi|3023456|sp|Q26534.1|CATL_SCHMA RecName: Full=Cathepsin L; AltName: Full=SMCL1; Flags: Precursor
gi|555663|gb|AAC46485.1| preprocathepsin L [Schistosoma mansoni]
gi|1094710|prf||2106314A cathepsin L
Length = 319
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 79/142 (55%), Gaps = 5/142 (3%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ F +Y +QY ++E E RF+IF++N+ Y +G+A YGV ++D+T EF
Sbjct: 19 KYVQFKLKYRKQYH-ETEDEIRFNIFKSNILKAQLYQVFVRGSAIYGVTPYSDLTTDEFA 77
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ W + +T + +N + ++ ++++KG V +V++Q +CGSCWA S
Sbjct: 78 RTHLTASWVVPSSRSNTPTSLGKEVNN---IPKNFDWREKGAV-TEVKNQGMCGSCWAFS 133
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+ES + K +L+ LS+Q
Sbjct: 134 TTGNVESQWFRKTGKLLSLSEQ 155
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 83/159 (52%), Gaps = 11/159 (6%)
Query: 64 ASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY 123
A+T LE+ H + + ++ ++ + Y E E R+ IF+ N+K I+ + +
Sbjct: 25 ANTRTLEDASMH-ERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIEGFNNAGNKSHKL 83
Query: 124 GVNRFADMTDSEFN--HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVL 181
GVN+FAD+T+ EF + L W +I T +F + + +++++ KG V
Sbjct: 84 GVNQFADLTEEEFKAINKLKGYMWSKIS------RTSTFKYEHVTKVPATLDWRQKGAVT 137
Query: 182 PKVQDQHL-CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P ++ Q L CGSCWA +AVA E + ELI LS+Q
Sbjct: 138 P-IKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQ 175
>gi|121531618|gb|ABM55494.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 264
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 80/146 (54%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y S E RF IF++NL I ++ ++++G TY GV RFAD+T
Sbjct: 21 DQWIAFKQTHGKTYKSLLEERTRFGIFQSNLMKIKEHNARYDKGEETYFLGVTRFADLTH 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF L +I+N T + + + +SI++ +KG VL ++DQ CGSC
Sbjct: 81 GEFKDFLR----RRIKNKPRLHATPTVFPED-LEVPDSIDWTEKGAVL-DIKDQEDCGSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA LE A+ +N I LS+Q
Sbjct: 135 WAFSATGALEGQNAVLNNVRIPLSEQ 160
>gi|205364757|gb|ACI04578.1| cysteine protease-like protein [Robinia pseudoacacia]
Length = 335
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 76/153 (49%), Gaps = 13/153 (8%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+ L+ + F F ++ + Y + E + RF +F++N++ + K + +A +GV +F+
Sbjct: 13 DHVLNAEHHFSTFKSKFSKTYATKEEHDYRFGVFKSNVRRAKLHAKLDP-SAVHGVTKFS 71
Query: 130 DMTDSEFNH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
D+T SEF GL L + T+ L E +++DKG V V++
Sbjct: 72 DLTPSEFRRQFLGLKPLRLPEHAQKAPILPTHD--------LPEDFDWRDKGAVT-HVKN 122
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA S LE ++ + EL+ LS Q
Sbjct: 123 QGSCGSCWAFSTTGALEGSHFLATGELVSLSDQ 155
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 79/155 (50%), Gaps = 6/155 (3%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y D+E RRF++F+ N+ I+ + GVN+FAD+T+ EF +
Sbjct: 40 WMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNH-KFWLGVNQFADLTNDEFRSTKT 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + I + + + N L +++++ KG V P ++DQ CG CWA SAVA
Sbjct: 99 NKGF--IPSTTRVPTGFRNENVNIDALPATMDWRTKGVVTP-IKDQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELI--ELSKQPPKTHGRFYKGGVMN 234
+E + +LI L+K +GG+M+
Sbjct: 156 MEGIVKLSTGKLISHSLNKSLLTVMSMGCEGGLMD 190
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/148 (37%), Positives = 76/148 (51%), Gaps = 13/148 (8%)
Query: 75 HGNQFKDF-VREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA--TYGVNRFADM 131
H N FK ++ Y D E+ RRF IF +NL TI+ + + A T GVN FADM
Sbjct: 27 HWNAFKSTHLKSYR---DGQEELIRRF-IFEDNLHTIEEFNRVNASLAGFTLGVNEFADM 82
Query: 132 TDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
T++EF++ L L S FE SS+ L +++ KG V +V++Q CG
Sbjct: 83 TNTEFSNMLLGLGGRNKIAGDSVFE-----SSHVQDLPAEVDWTQKGYV-TEVKNQGQCG 136
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
SCWA S LE K +L+ LS+Q
Sbjct: 137 SCWAFSTTGSLEGQVFKKTGKLVSLSEQ 164
>gi|225431287|ref|XP_002275759.1| PREDICTED: cysteine proteinase RD19a isoform 1 [Vitis vinifera]
gi|297735094|emb|CBI17456.3| unnamed protein product [Vitis vinifera]
Length = 367
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 76/150 (50%), Gaps = 7/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
++ L +QF F ++ + Y + E + RF +F NL+ + + +A +GV RF+
Sbjct: 43 DDLLSAEHQFGLFKAKFGKTYSTVEEHDYRFSVFEANLRRARRHQLLDP-SAVHGVTRFS 101
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T EF D+ ++ L+ + + L +++D G V P V+DQ
Sbjct: 102 DLTPDEFRR-----DYLGLKPLRLPADAQKAPILPTNDLPTDFDWRDHGAVTP-VKDQGS 155
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ SA+ LE A+ + LI +S+Q
Sbjct: 156 CGSCWSFSAIGALEGAHFLTTGNLISMSEQ 185
>gi|52076120|dbj|BAD46633.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 369
Score = 72.4 bits (176), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/125 (36%), Positives = 67/125 (53%), Gaps = 4/125 (3%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKST 154
+IE RF+ F+ N + + + K E T G+N+FADMT EF + + L S
Sbjct: 65 DIESRFEAFKANARYVSEFNKKEGMTYELGLNKFADMTLEEFVAKYAGAKVDAAAALASV 124
Query: 155 FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELI 214
E + + + +++ G V P V+DQ CGSCWA S+V +ESAYAI +L+
Sbjct: 125 PEAEEEVVGD---VPAAWDWRQHGVVTP-VKDQGSCGSCWAFSSVGAVESAYAIATKKLL 180
Query: 215 ELSKQ 219
LS+Q
Sbjct: 181 RLSEQ 185
>gi|388521567|gb|AFK48845.1| unknown [Medicago truncatula]
Length = 343
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 77/153 (50%), Gaps = 10/153 (6%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
D+EE L F Y ++YD+ E++RRF IF NL+ I K G T GVN
Sbjct: 33 DMEEQLLQVIGESRFANRYGKRYDTVDEMKRRFKIFSENLQLIKSTNKKRLGY-TLGVNH 91
Query: 128 FADMTDSEF-NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
FAD T EF +H L + +N +T + + L +++ +G ++ +V+D
Sbjct: 92 FADWTWEEFRSHRLGA-----AQNCSATLK--GNHRITDVVLPAEKDWRKEG-IVSEVKD 143
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCW S LESAYA + I LS+Q
Sbjct: 144 QGHCGSCWTFSTTGALESAYAQAFGKNISLSEQ 176
>gi|71030048|ref|XP_764666.1| cysteine protease [Theileria parva strain Muguga]
gi|68351622|gb|EAN32383.1| cysteine protease, putative [Theileria parva]
Length = 612
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 76/146 (52%), Gaps = 2/146 (1%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
D ++FK F+ YE++Y + E + R+ FR+N I+ + + T G D +D
Sbjct: 175 DTISEFKSFISRYEKKYKDEDEYKTRYLNFRDNRIFIETHNSNHNKIFTMGYTSSTDSSD 234
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
E +SS+ ++ ++ + + +SS Y ++++KG +LP VQDQ CGSC
Sbjct: 235 EELGRAVSSISYKPTQDEIYSRASEEMSSSKKYP-GVIFDWREKGVILP-VQDQKECGSC 292
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA S L + AI ++L + SKQ
Sbjct: 293 WAVSMSDLLSTMMAISGHKLQDYSKQ 318
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 75/151 (49%), Gaps = 11/151 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F+ ++ + R Y E +RR +++R N++ ++ + G N+FAD+T+ EF
Sbjct: 53 RFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGY-RLADNKFADLTNEEFR 111
Query: 138 HGLSSLDWEQI------ENLKSTFETYSFNSSNSYG---LAESINYKDKGKVLPKVQDQH 188
+ + ST G L +S+++++KG V P V+ Q
Sbjct: 112 AKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAP-VKSQG 170
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAVA +E IK+ +L+ LS+Q
Sbjct: 171 DCGSCWAFSAVAAIEGINQIKNGKLVSLSEQ 201
>gi|38344381|emb|CAD40319.2| OSJNBb0054B09.3 [Oryza sativa Japonica Group]
gi|116309071|emb|CAH66180.1| OSIGBa0130O15.4 [Oryza sativa Indica Group]
gi|116309098|emb|CAH66205.1| OSIGBa0148D14.11 [Oryza sativa Indica Group]
Length = 381
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 50/160 (31%), Positives = 75/160 (46%), Gaps = 9/160 (5%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G EE + D E F F R + R Y E R +F NL+ + + +
Sbjct: 45 GEEEDAQLDAEA------HFASFERRFGRTYRDAGERAYRMSVFAANLRRARRHQRLDP- 97
Query: 120 TATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGK 179
TAT+GV +F+D+T EF L +E L E + + GL + ++++ G
Sbjct: 98 TATHGVTKFSDLTPGEFRDRFLGLRRPSLEGLVGG-EPHEAPILPTDGLPDDFDWREHGA 156
Query: 180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V P V+DQ CGSCW+ S LE A+ + +L LS+Q
Sbjct: 157 VGP-VKDQGSCGSCWSFSTSGALEGAHFLATGKLEVLSEQ 195
>gi|441611591|ref|XP_003273955.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Nomascus leucogenys]
Length = 548
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 43/129 (33%), Positives = 66/129 (51%), Gaps = 8/129 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK+FV Y R Y+S E R +F NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 257 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 316
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L+ L E+ N ++ + + +++ KG V KV+DQ +CGSCWA S
Sbjct: 317 IYLNPLLREEPGNKMKQAKSVGDLAPPEW------DWRSKGAVT-KVKDQGMCGSCWAFS 369
Query: 198 AVACLESAY 206
+E +
Sbjct: 370 VTGNVEGQW 378
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 41/140 (29%), Positives = 71/140 (50%), Gaps = 2/140 (1%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+ ++ +Y + Y +E+++RF IF NN++ I+ + +N AD T+ EF
Sbjct: 39 EQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNEEFMAS 98
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ L+ T +T F N + +++++ KG V ++DQ CG+CWA SAV
Sbjct: 99 HKGYKGSHWQGLRITTQT-PFKYENVTDIPWAVDWRQKGDV-TSIKDQAQCGNCWAFSAV 156
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E Y I L+ LS++
Sbjct: 157 AATEGIYQITTGNLVSLSEK 176
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 81/153 (52%), Gaps = 3/153 (1%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATYGVN 126
DL + + + ++ ++ R Y D+E RR ++FR+N+ I+ Q N
Sbjct: 29 DLVDAAAMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEEN 88
Query: 127 RFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
+FAD+T++EF + L ++ ++ + + ++ L S++++ KG V P V+D
Sbjct: 89 QFADLTNAEFRATRTGLRPSSSRGNRAP-TSFRYANVSTGDLPASVDWRGKGAVNP-VKD 146
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CG CWA SAVA +E A + +L+ LS+Q
Sbjct: 147 QGDCGCCWAFSAVAAMEGAVKLATGKLVSLSEQ 179
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/138 (31%), Positives = 72/138 (52%), Gaps = 4/138 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y R Y +E +F++F+ N ID + G+N+FAD+T+ EF +
Sbjct: 40 WMLQYGRVYKDAAEKASKFEVFKANAGFIDSFNAGNH-KFWLGINQFADITNKEFKATKT 98
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ + I N +S+ + + L SI+++ KG V P V+DQ CG CWA SAVA
Sbjct: 99 NKGF--ISNKVRAPTGFSYENVSFDALPASIDWRTKGAVTP-VKDQGQCGCCWAFSAVAA 155
Query: 202 LESAYAIKHNELIELSKQ 219
E + +L+ LS+Q
Sbjct: 156 TEGIVKLSTGKLVSLSEQ 173
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 40/140 (28%), Positives = 73/140 (52%), Gaps = 3/140 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ E+ + Y +E E+RF IF+ NL+ I+ + +N+F D T+ EF
Sbjct: 38 WMEEHGKFYKDAAEKEQRFQIFKENLEFIESFNAAGDNGFNLSINQFGDQTNDEFKANYL 97
Query: 142 SLDWEQI--ENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ + + + + E F N + +++++++G V P ++ QHLCGSCWA + V
Sbjct: 98 NGKKKPLIGVGIAAIEEESVFRYENVTEVPATMDWRERGAVTP-IKHQHLCGSCWAFATV 156
Query: 200 ACLESAYAIKHNELIELSKQ 219
A +E + I L+ LS+Q
Sbjct: 157 AAIEGIHQITTGRLVSLSEQ 176
>gi|30575714|gb|AAP33049.1| cysteine proteinase 1 [Clonorchis sinensis]
Length = 326
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 84/175 (48%), Gaps = 27/175 (15%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFTLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKT 90
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + + E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDLTPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNL 235
S + + + K L+ LS+Q PP+T+ K G + L
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKMGGLEL 197
>gi|118394988|ref|XP_001029851.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89284124|gb|EAR82188.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 330
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 74/147 (50%), Gaps = 9/147 (6%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
D FK F + Y ++Y S+ R IF+ NL+ I+ + K+++ A +G+ +FAD+T
Sbjct: 25 DIAAAFKKFTQTYNKKYSSEEHYNARLSIFKENLRRIELFNKNDE--AQHGITQFADLTH 82
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF Q+ N ++ SS + +I++ KG V P V++Q CGSC
Sbjct: 83 EEFADMYLGYK-PQLRNSQAKVSL----SSTPFTAPTAIDWTTKGAVTP-VKNQGSCGSC 136
Query: 194 WAHSAVACLESAYAIKHNE-LIELSKQ 219
WA S +E Y ++ + L S+Q
Sbjct: 137 WAFSTTGSIEGQYVLQLKQNLTSFSEQ 163
>gi|7239343|gb|AAF43193.1|AF228731_1 cathepsin L [Stylonychia lemnae]
Length = 340
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/142 (35%), Positives = 75/142 (52%), Gaps = 9/142 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTDSEFN 137
F F+ + + Y S E E R +++N+ I+ + GT+ T G N AD T E+
Sbjct: 42 FVHFMSRFSKAYKSKEEFEMRLQQYKSNIAFINNHNSQNDGTSFTLGPNHLADYTHDEYK 101
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
L + N K+ E YS + N + ESI++++KG V V+DQ CGSCWA S
Sbjct: 102 KMLGY----KPRN-KTGKEVYS--TPNLKDIPESIDWREKGAV-NAVKDQGQCGSCWAFS 153
Query: 198 AVACLESAYAIKHNELIELSKQ 219
+A LES Y I+ +L LS+Q
Sbjct: 154 TIASLESRYFIETGKLQSLSEQ 175
>gi|156368930|ref|XP_001627944.1| predicted protein [Nematostella vectensis]
gi|156214907|gb|EDO35881.1| predicted protein [Nematostella vectensis]
Length = 315
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 79/146 (54%), Gaps = 7/146 (4%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGV--NRFADMTDS 134
+ F +F +++++ Y+ DSE RR IFR+N++ Y + + Y + N FAD+TD
Sbjct: 21 DDFDEFRQQHDKVYEDDSEHRRRKHIFRHNVR---YIRSMNRRSLPYKLEPNHFADLTDD 77
Query: 135 EFNHGLSSLDWEQIENL-KSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF +LD E + K +S + + + ++++D G V P + Q CGSC
Sbjct: 78 EFKSYKDALDDESKDKEHKKRMHAKIKHSKRMFEVPDQLDWRDYGAVNP-AKGQGTCGSC 136
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA + +E+A+ I+ EL+ L++Q
Sbjct: 137 WAFATAGAVEAAHFIQKGELLNLAEQ 162
>gi|118350314|ref|XP_001008438.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89290205|gb|EAR88193.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 389
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/148 (31%), Positives = 76/148 (51%), Gaps = 19/148 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F F E+++ Y+ E +RRF+IFR NL I + E+GTA YG+ +F+DMT EF
Sbjct: 40 FSKFKAEHKKFYNFLEE-QRRFEIFRQNLDIISELNQVEEGTAEYGITQFSDMTTEEFKS 98
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAE-------SINYKDKGKVLPKVQDQHLCG 191
+ + ST+ +F S +G + S +++D G V P V++Q G
Sbjct: 99 QIL---------IPSTYAR-NFTGSRYHGFQKISQDAPTSYDWRDHGAVTP-VKNQGTVG 147
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
+CW S +E + + N L+ LS++
Sbjct: 148 TCWTFSTTGNIEGQWFLAGNPLVSLSEE 175
>gi|356552228|ref|XP_003544471.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 351
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 78/144 (54%), Gaps = 3/144 (2%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
+ F++++ ++++ Y++ E E+RF IF+NNL+ ID + T G+N FAD+T++E+
Sbjct: 43 SMFEEWLVKHDKVYNALGEKEKRFQIFKNNLRFIDERNSLNR-TYKLGLNVFADLTNAEY 101
Query: 137 NHGLSSLDWEQIENLK-STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ W+ L T + + +S++++ +G V P C SCWA
Sbjct: 102 -RAMYLRTWDDGPRLDLDTPPRNHYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWA 160
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
+AV +ES IK +LI LS+Q
Sbjct: 161 FTAVGAVESLVKIKTGDLISLSEQ 184
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 45/147 (30%), Positives = 81/147 (55%), Gaps = 15/147 (10%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQGTATY--GVNRFADMTDSE 135
+++F ++ R+Y E R ++F +NL+ I+ + K +E G TY +N+F+D+T+ E
Sbjct: 20 WEEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQFSDLTNDE 79
Query: 136 FNHGLSSLDWEQIENLKSTFE---TYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
FN ++ K++ F S+++ ++++ KG V V+DQ CGS
Sbjct: 80 FN--------SMMKGYKTSLRPKPVAVFTSTDAAPETTEVDWRTKGCVT-HVKDQGQCGS 130
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA SA LE + +K+ EL+ L++Q
Sbjct: 131 CWAFSATGSLEGQHFLKYGELVSLAEQ 157
>gi|426252044|ref|XP_004019728.1| PREDICTED: cathepsin W [Ovis aries]
Length = 375
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 67/137 (48%), Gaps = 4/137 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F +Y R Y + +E RR DIF NL + + GTA +GV +F+D+T+ EF
Sbjct: 42 FRLFQMQYNRSYPNPAEHARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFVQ 101
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
S + + + + S + ++++K + V++Q C CWA +A
Sbjct: 102 LYGSRVAGEALGVSRKVGSEEWGESQP----PTCDWRNKPNTISPVRNQRHCNCCWAMAA 157
Query: 199 VACLESAYAIKHNELIE 215
+E+ +AIK N +E
Sbjct: 158 AGNIEALWAIKFNRSVE 174
>gi|332252750|ref|XP_003275518.1| PREDICTED: pro-cathepsin H [Nomascus leucogenys]
Length = 335
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 82/162 (50%), Gaps = 17/162 (10%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E S LE+F FK ++ ++ + Y ++ E R +F +N + I+ H G
Sbjct: 21 GAAELSVNSLEKF-----HFKSWMSKHHKTYSTE-EYHHRLQMFASNWRKIN---AHNNG 71
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ +N+F+DM+ +E H W + +N +T Y + Y S++++ K
Sbjct: 72 NHTFKMALNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PPSMDWRKK 125
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 167
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 72.0 bits (175), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 72/141 (51%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+++V +Y + Y S E RRF++F++NL ID + E + G+N FAD+T EF
Sbjct: 72 FEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLTHDEFKA 131
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L ++ + Y + S++++ KG V +V++Q CGSCWA S
Sbjct: 132 TYLGLLPKRTSGGRF---RYGGVGDGGDEVPASVDWRKKGAVT-EVKNQGQCGSCWAFST 187
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I L LS+Q
Sbjct: 188 VAAVEGINQIVTGNLTSLSEQ 208
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 81/145 (55%), Gaps = 6/145 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIF-RNNLKTIDYYTKHEQGTATY--GVNRFADMTDS 134
+++ F +++ Y S+ E RF IF N+L + K+ +G +Y G+N+FAD+
Sbjct: 26 EWEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADLLPH 85
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF ++ +++ ST+ + + N L ++++++ KG V P V+DQ CGSCW
Sbjct: 86 EFVKMMNGYQGKRLAGRGSTYLPPA--NLNDSSLPKTVDWRKKGAVTP-VKDQGQCGSCW 142
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A S+ LE + +K +L+ LS+Q
Sbjct: 143 AFSSTGSLEGQHFLKTGKLVSLSEQ 167
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/163 (33%), Positives = 87/163 (53%), Gaps = 13/163 (7%)
Query: 61 SEEASTFDLEEFLDHGNQFKDFVREYERQY-DSDSEIERRFDIFRNNLKTIDYYT-KHEQ 118
S A+ D + LDH + + + Y +QY + + E+ RR I+ NLK + + +H
Sbjct: 9 SYAAAPVDRDPALDH--HWNLWKKTYGKQYKEKNEEVARRL-IWEKNLKFVTLHNLEHSM 65
Query: 119 GTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD 176
G +Y G+N DMT E +SSL ++ + TY NS+ L +S+++++
Sbjct: 66 GMHSYDLGMNHLGDMTSEEVISLMSSL---RVPSQWPRNVTYKSNSNQK--LPDSVDWRE 120
Query: 177 KGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
KG V KV+ Q CG+CWA SAV LE+ +K +L+ LS Q
Sbjct: 121 KGCVT-KVKYQGACGACWAFSAVGALEAQLKLKTGKLVSLSAQ 162
>gi|145505469|ref|XP_001438701.1| hypothetical protein [Paramecium tetraurelia strain d4-2]
gi|124405873|emb|CAK71304.1| unnamed protein product [Paramecium tetraurelia]
Length = 320
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 73/149 (48%), Gaps = 14/149 (9%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMT 132
L+ +F ++ ++ +QY S SE+ R ++ NL I + G +G +F D+T
Sbjct: 26 LELAQKFTNYQAQFNKQY-SGSELLYRLQVYEANLADIKARNQRV-GRQIFGETQFTDLT 83
Query: 133 DSEFNHGLSSLDW--EQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
D EF +L E+ E KS FE + A I+++ +G V P V+DQ C
Sbjct: 84 DEEFAAIYLTLKVVPEEFETQKSQFENVT---------ATPIDWRSRGAVTP-VKDQQAC 133
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA S LE + I +L LS+Q
Sbjct: 134 GSCWAFSTTGVLEGWFQINTGKLPNLSEQ 162
>gi|195025208|ref|XP_001986022.1| GH20767 [Drosophila grimshawi]
gi|193902022|gb|EDW00889.1| GH20767 [Drosophila grimshawi]
Length = 329
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 83/147 (56%), Gaps = 9/147 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGTATY--GVNRFADMTD 133
N++ + ++++Y S+ E R IF ++ + ID + ++ G TY GVN+F DM
Sbjct: 25 NEWDIYKVNHDKRYGSEDEELLRKLIFYDHKRMIDTHNERYAAGKETYEMGVNQFTDMLS 84
Query: 134 SEFNHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
SEF L SL+ I ++ S + + + + + +SI+++DKG V V+DQ C S
Sbjct: 85 SEFESAMLGSLN---ITDIASDIDII-YEAPKNLEIPKSIDWRDKGAV-TGVKDQLKCAS 139
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA SAV LE +K N+L+ LS+Q
Sbjct: 140 CWAFSAVGALEGQQFLKTNKLVALSEQ 166
>gi|449464688|ref|XP_004150061.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
gi|449519862|ref|XP_004166953.1| PREDICTED: cysteine proteinase RD19a-like [Cucumis sativus]
Length = 377
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/169 (30%), Positives = 85/169 (50%), Gaps = 6/169 (3%)
Query: 51 LQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI 110
+ R + G E S D + L + F F +++ + Y S E + RF +F+ NLK
Sbjct: 33 IIRQVVDDGGVNEGSNGD-DLLLGADHHFSVFKQKFGKSYASKEEHDHRFRVFKANLKRA 91
Query: 111 DYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE 170
+ + +AT+GV +F+D+T SEF L ++ L + ++ GL
Sbjct: 92 QRHQALDP-SATHGVTQFSDLTPSEFRRSFLGLRSRRL-GLPADANKAPILPTD--GLPT 147
Query: 171 SINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+++DKG V +V++Q CGSCW+ SA LE A + +L+ LS+Q
Sbjct: 148 DFDWRDKGAV-SEVKNQGSCGSCWSFSATGALEGANFLATGKLVSLSEQ 195
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 65/145 (44%), Gaps = 8/145 (5%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGV-------NRFADMTDS 134
++ E+ R Y E RR +IFR N + ID + A V NRFAD+TD
Sbjct: 46 WMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNRFADLTDE 105
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + L N S A S++++ G V V+DQ CG CW
Sbjct: 106 EFRAARTGLRRPAAVAGAVGGGFRYENFSLQADAAGSMDWRAMGAV-TGVKDQGSCGCCW 164
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SAVA +E I+ L+ LS+Q
Sbjct: 165 AFSAVAAMEGLTKIRTGRLVSLSEQ 189
>gi|118379122|ref|XP_001022728.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89304495|gb|EAS02483.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 330
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/182 (30%), Positives = 86/182 (47%), Gaps = 13/182 (7%)
Query: 58 SYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHE 117
+Y S E DLE D QF+ +++++ + E R +F N+ I + K
Sbjct: 19 TYLSLEKPHHDLESIQDVKQQFEKYLQQFGIVIKNAEERIYRLKVFIQNVAEIVAHNKLS 78
Query: 118 QGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T T G+N+FA MTD EF +L+ + K T F S++ + ES++++ +
Sbjct: 79 NKTYTQGINQFAHMTDEEFAQTYLTLE----DREKETLNIQQFQSND---IPESVDWRTQ 131
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPP-----KTHGRFYKGGV 232
G V +V++Q CGSC+A SA LE +A + L S Q K H GG
Sbjct: 132 GAV-TEVKNQGGCGSCYAFSAAGALEGLHAQQTGNLTSFSSQQIIDCSWKYHNHGCHGGF 190
Query: 233 MN 234
M+
Sbjct: 191 MD 192
>gi|113931178|ref|NP_001039033.1| cathepsin W [Xenopus (Silurana) tropicalis]
gi|89269052|emb|CAJ83515.1| cathepsin W [Xenopus (Silurana) tropicalis]
Length = 303
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 69/136 (50%), Gaps = 11/136 (8%)
Query: 85 EYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN-HGLSSL 143
+Y R Y + E + R IF NLK + E GTA YGV +F+D+TD EF+ + L +
Sbjct: 3 QYNRSYKTREEFKYRLRIFSENLKEASRLQREELGTAQYGVTKFSDLTDEEFSIYHLPTN 62
Query: 144 DWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLE 203
LK + E F +S + V+ K ++Q C SCWA +AVA +E
Sbjct: 63 ILPTPPILKQSEEVLPFPTSCDW---------RTQNVISKAKNQRTCHSCWAFAAVANIE 113
Query: 204 SAYAIKHNELIELSKQ 219
+ +AI + I LS+Q
Sbjct: 114 AQWAI-LGQTISLSEQ 128
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 72.0 bits (175), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/190 (28%), Positives = 98/190 (51%), Gaps = 23/190 (12%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
+ +++ + R Y++ E ERR+ +FR+NL+ ID + + G ++ G+NRFAD+T+
Sbjct: 43 MYAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTND 102
Query: 135 EFNHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
E+ L + Q E ++++++ L ES++++ KG V +V+DQ GSC
Sbjct: 103 EYRATYLGARTRPQRERKLGA----RYHAADNEDLPESVDWRAKGAV-AEVKDQGSYGSC 157
Query: 194 WAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNH 249
WA S +A +E I +LI LS+Q ++ + GG+M+ Y+
Sbjct: 158 WAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMD----------YAFEF 207
Query: 250 AVLNVGYDNE 259
+ N G D E
Sbjct: 208 IINNGGIDTE 217
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 77/151 (50%), Gaps = 13/151 (8%)
Query: 72 FLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADM 131
L+ + F F ++ + Y + E + RF IF+NNL + K + +A +GV RF+D+
Sbjct: 44 LLNAEHHFSAFKTKFGKTYATQEEHDHRFRIFKNNLLRAKSHQKLDP-SAVHGVTRFSDL 102
Query: 132 TDSEFNH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
T +EF GL L L S + +N L ++++ G V V++Q
Sbjct: 103 TPAEFRRQFLGLKPL------RLPSDAQKAPILPTND--LPTDFDWREHGAVT-GVKNQG 153
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ SAV LE A+ + EL+ LS+Q
Sbjct: 154 SCGSCWSFSAVGALEGAHFLSTGELVSLSEQ 184
>gi|355681647|gb|AER96812.1| cathepsin F [Mustela putorius furo]
Length = 408
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 70/134 (52%), Gaps = 10/134 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK+FV Y R Y+S E + R +F NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 112 FKEFVTTYNRTYESKEETQWRMSVFSNNMMRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 171
Query: 139 G-LSSLDWE-QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
L+ L E + +N++ + S +++ KG V KV++Q +CGSCWA
Sbjct: 172 IYLNPLLREYRGKNMR-------LDKSTGDSAPSEWDWRRKGAVT-KVKNQGMCGSCWAF 223
Query: 197 SAVACLESAYAIKH 210
S +E + +K
Sbjct: 224 SVTGNVEGQWFLKQ 237
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 72/141 (51%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+++V +Y + Y S E RRF++F++NL ID + E + G+N FAD+T EF
Sbjct: 86 FEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLTHDEFKA 145
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L ++ + Y + S++++ KG V +V++Q CGSCWA S
Sbjct: 146 TYLGLLPKRTSGGRF---RYGGVGDGGDEVPASVDWRKKGAVT-EVKNQGQCGSCWAFST 201
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I L LS+Q
Sbjct: 202 VAAVEGINQIVTGNLTSLSEQ 222
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/189 (29%), Positives = 95/189 (50%), Gaps = 21/189 (11%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSE 135
++ ++ ++ + Y++ E E+RF IF++NL+ ID +H TY G+NRFAD+T+ E
Sbjct: 48 MYEAWLVKHGKAYNALGEKEKRFGIFKDNLRFID---EHNSQNLTYRLGLNRFADLTNEE 104
Query: 136 FNHG-LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+ L K + ++ F + L + I+++ +G V+ V+DQ CGSCW
Sbjct: 105 YRSMYLGVKPGATRVTRKVSRKSDRFAARVGDALPDFIDWRKEGAVV-GVKDQGSCGSCW 163
Query: 195 AHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHA 250
A S +A +E I +LI LS+Q ++ GG+M+ Y+
Sbjct: 164 AFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFEFI 213
Query: 251 VLNVGYDNE 259
+ N G D+E
Sbjct: 214 INNGGIDSE 222
>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
Length = 401
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 77/149 (51%), Gaps = 5/149 (3%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFAD 130
L+ F +++R + + Y D + R F+I++ N + I ++ K +++ +N+F D
Sbjct: 89 LEEQRAFTEWMRTHRKSYHHDHFLPR-FEIWKTNNRWITHWNKKHANASSFTVAINQFGD 147
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
+T EFN + L E +N+ G+ ES +++ KG V+ +V+DQ +C
Sbjct: 148 LTSDEFNRLYNGLHVFSAPKASEKVER-PRQWANTAGIPESGDWRQKG-VVSRVKDQGMC 205
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA S E AI + L+ LS+Q
Sbjct: 206 GSCWAFSTTGSTEGINAITTSRLVPLSEQ 234
>gi|294939744|ref|XP_002782557.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239894295|gb|EER14352.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 221
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/162 (33%), Positives = 84/162 (51%), Gaps = 17/162 (10%)
Query: 65 STFDLEEFLDHGN---QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA 121
S L + L+ G F F ++ + Y+S E +R IFR +L I+ + +
Sbjct: 11 SILPLVKCLEEGTVELAFMGFQHKFGKNYESKEEEVKRNAIFRAHLHYIEQ-VNAKNLSY 69
Query: 122 TYGVNRFADMTDSEF---NHGLSS-LDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
GVN AD+T EF G SS + ++ +NL F S+++ L S++++ K
Sbjct: 70 KLGVNEHADLTHEEFAALKLGTSSKMSMKRDDNL--------FVSADTTQLLTSVDWRSK 121
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G VL ++DQ CGSCWA SA LE+ YAI +L+ LS+Q
Sbjct: 122 G-VLTPIKDQGPCGSCWAFSATGALEAQYAIATGKLLSLSEQ 162
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/141 (31%), Positives = 74/141 (52%), Gaps = 6/141 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
++ ++ +Y + Y+S E E R +IF+ NL+ ID + + T G+N+FAD+TD E+
Sbjct: 42 YESWLVKYGKSYNSLGEREMRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRS 101
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+LKS L + ++++ G V+ V++Q LC SCWA +
Sbjct: 102 TYLGFK----SSLKSKVSNRYMPQVGEV-LPDYVDWRTTGAVV-DVKNQGLCSSCWAFAT 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
+A +ES I +LI LS+Q
Sbjct: 156 IATVESINQIITGDLISLSEQ 176
>gi|414887428|tpg|DAA63442.1| TPA: hypothetical protein ZEAMMB73_713985 [Zea mays]
Length = 313
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 65/260 (25%), Positives = 105/260 (40%), Gaps = 66/260 (25%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ D+E+ L ++F+ + Y R Y + +E RRF+++R N++ I+ + +
Sbjct: 22 GAASGGRVDVEDMLMM-DRFRAWQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRAEL 80
Query: 120 TATYGVNRFADMTDSEF--NHGLSSL--DWEQIENLKSTFETYS------------FNSS 163
+ F D+T EF H +S+ E + T++ N +
Sbjct: 81 SYQLSETPFTDLTSEEFLATHTMSTRLHASEAARRHRELITTHAGPVSDGGRQWNRRNYT 140
Query: 164 NSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQP--- 220
+ ES++++ KG V V+DQ CG CW+ + VA +E + I+ +L+ LS+Q
Sbjct: 141 TDLDVPESVDWRTKGAV-TTVKDQGACGGCWSFATVAAIEGLHKIRTGQLVSLSEQEGKC 199
Query: 221 ---------PKTHGR-------------------------------FYKGGVMNLPHMLC 240
K GR YK GV + P C
Sbjct: 200 KLDKARNHVAKIRGRKLVDQNNEAALEVAVAQQPVAVGMNVHPIQQHYKSGVFHGP---C 256
Query: 241 SKGPYSLNHAVLNVGYDNES 260
P LNHAV VGY ES
Sbjct: 257 D--PEDLNHAVTMVGYGAES 274
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 71/133 (53%), Gaps = 6/133 (4%)
Query: 91 DSDSEIERRFDIFRNNLKTIDYYTKH----EQGTATYGVNRFADMTDSEFNHGLSSLDWE 146
+S ++ ERRF F +NL+ +D + E+G +NRFAD+T+ EF +
Sbjct: 68 NSIADRERRFSAFWDNLRFVDAHNARAAAGEEGF-RLAMNRFADLTNDEFRAAYLGVKGA 126
Query: 147 QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAY 206
N + + L E++++++KG V P V++Q CGSCWA SAV+ +ES
Sbjct: 127 AERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAP-VKNQGQCGSCWAFSAVSTVESIN 185
Query: 207 AIKHNELIELSKQ 219
I E++ LS+Q
Sbjct: 186 QIVTGEMVTLSEQ 198
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 71/133 (53%), Gaps = 6/133 (4%)
Query: 91 DSDSEIERRFDIFRNNLKTIDYYTKH----EQGTATYGVNRFADMTDSEFNHGLSSLDWE 146
+S ++ ERRF F +NL+ +D + E+G +NRFAD+T+ EF +
Sbjct: 68 NSIADRERRFSAFWDNLRFVDAHNARAAAGEEGF-RLAMNRFADLTNDEFRAAYLGVKGA 126
Query: 147 QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAY 206
N + + L E++++++KG V P V++Q CGSCWA SAV+ +ES
Sbjct: 127 AERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAP-VKNQGQCGSCWAFSAVSTVESIN 185
Query: 207 AIKHNELIELSKQ 219
I E++ LS+Q
Sbjct: 186 QIVTGEMVTLSEQ 198
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 42/129 (32%), Positives = 72/129 (55%), Gaps = 9/129 (6%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKST 154
E ++RF++F+ N+ + + K ++ +N+FADMT+ EF H + +I++ ++
Sbjct: 53 EKDKRFNVFKANVHYVHNFNKKDK-PYKLKLNKFADMTNHEFRHHYAG---SKIKHHRTF 108
Query: 155 FETYSFNSSNSYG----LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKH 210
N + Y + +++++ KG V P V+DQ CGSCWA S V +E IK
Sbjct: 109 LGASRANGTFMYAHEDSVPPTVDWRKKGAVTP-VKDQGKCGSCWAFSTVVAVEGINQIKT 167
Query: 211 NELIELSKQ 219
NEL+ LS+Q
Sbjct: 168 NELVSLSEQ 176
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 48/141 (34%), Positives = 73/141 (51%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ + ++ + Y+S E RF+IF +NLK ID T + G+N FAD+T EF +
Sbjct: 49 FESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDE-TNKKVSNYWLGLNEFADLTHEEFKN 107
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
E E + E + + + L +S++++ KG V P V++Q CGSCWA S
Sbjct: 108 KFLGFKGELAERKDESIEQFRYR--DFVDLPKSVDWRKKGAVSP-VKNQGQCGSCWAFST 164
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I L LS+Q
Sbjct: 165 VAAVEGINQIVTGNLTVLSEQ 185
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 78/149 (52%), Gaps = 5/149 (3%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ ++ + Y+S E ERRF+IF+ L+ ID + + G+N+FAD+T+ EF
Sbjct: 37 MYESWLIKHGKSYNSLGERERRFEIFKETLRFIDEHNADTSRSYKVGLNQFADLTNEEFR 96
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ L + + N Y L + ++++ +G V+ +++Q CGSCWA S
Sbjct: 97 S--TYLGFTRGSNKTKVSNRYEPRVGQV--LPDYVDWRSEGAVV-DIKNQGQCGSCWAFS 151
Query: 198 AVACLESAYAIKHNELIELSKQPPKTHGR 226
A+A +E I LI LS+Q GR
Sbjct: 152 AIAAVEGINKIVTGNLISLSEQELVDCGR 180
>gi|48374352|gb|AAT09103.1| digestive cysteine proteinase [Bigelowiella natans]
Length = 360
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 40/138 (28%), Positives = 76/138 (55%), Gaps = 5/138 (3%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F+ + +E+ + Y+ + ++ F N + I ++E G+A YG RF+DM+ +F
Sbjct: 23 KFEAWKKEFGKSYEEAGKEDKARLNFVENERIIQGLNENELGSAVYGHTRFSDMSPEQFR 82
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
++ + E + ++ + N+ + +S +++D + P V+DQ CGSCWA S
Sbjct: 83 AMMTPFKYHTDEAENAAYD----QNKNAVKVTDSFDWRDFNALTP-VKDQGGCGSCWAFS 137
Query: 198 AVACLESAYAIKHNELIE 215
A LESA+ IKHN+ ++
Sbjct: 138 ATQALESAHYIKHNDTLD 155
>gi|442736236|gb|AGC65593.1| cathepsin [Achaea janata granulovirus]
Length = 338
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 80/159 (50%), Gaps = 11/159 (6%)
Query: 67 FDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN 126
+DLE D F F+ +Y + Y S+ E +F++F+ NL T++ ++ AT+ +N
Sbjct: 25 YDLE---DAERLFDLFMIKYHKVYRSELERAAKFEVFKRNLATLNDKNDKDE-NATFDIN 80
Query: 127 RFADMTDSEFNHGLSSLDWEQIENL------KSTFETYSFNSSNSYGLAESINYKDKGKV 180
+ D + +E + N K T + L ES +++DK V
Sbjct: 81 AYTDRSRNELLRTQTGFQSNFARNASPFTQKKGMCITRVVAGTPPCLLPESFDWRDKNVV 140
Query: 181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P V+DQ CGSCWA +A+A ES YAIKH + ++ S+Q
Sbjct: 141 TP-VKDQLECGSCWAFTAIANFESQYAIKHGKHVDFSEQ 178
>gi|114658412|ref|XP_001153217.1| PREDICTED: pro-cathepsin H isoform 6 [Pan troglodytes]
gi|397478882|ref|XP_003810764.1| PREDICTED: pro-cathepsin H [Pan paniscus]
gi|12803323|gb|AAH02479.1| Cathepsin H [Homo sapiens]
gi|60655259|gb|AAX32193.1| cathepsin H [synthetic construct]
gi|123979560|gb|ABM81609.1| cathepsin H [synthetic construct]
gi|123994193|gb|ABM84698.1| cathepsin H [synthetic construct]
gi|189054474|dbj|BAG37247.1| unnamed protein product [Homo sapiens]
gi|410254318|gb|JAA15126.1| cathepsin H [Pan troglodytes]
gi|410294916|gb|JAA26058.1| cathepsin H [Pan troglodytes]
gi|410331109|gb|JAA34501.1| cathepsin H [Pan troglodytes]
Length = 335
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 81/162 (50%), Gaps = 17/162 (10%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E S LE+F FK ++ ++ + Y ++ E R F +N + I+ H G
Sbjct: 21 GAAELSVNSLEKF-----HFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN---AHNNG 71
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ +N+F+DM+ +E H W + +N +T Y + Y S++++ K
Sbjct: 72 NHTFKMALNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PPSVDWRKK 125
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 167
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 75/151 (49%), Gaps = 11/151 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F+ ++ + R Y E +RR +++R N++ ++ + G N+FAD+T+ EF
Sbjct: 32 RFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGY-RLADNKFADLTNEEFR 90
Query: 138 HGLSSLDWEQI------ENLKSTFETYSFNSSNSYG---LAESINYKDKGKVLPKVQDQH 188
+ + ST G L +S+++++KG V P V+ Q
Sbjct: 91 AKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAP-VKSQG 149
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAVA +E IK+ +L+ LS+Q
Sbjct: 150 DCGSCWAFSAVAAIEGINQIKNGKLVSLSEQ 180
>gi|255586666|ref|XP_002533962.1| cysteine protease, putative [Ricinus communis]
gi|223526059|gb|EEF28418.1| cysteine protease, putative [Ricinus communis]
Length = 417
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 79/158 (50%), Gaps = 7/158 (4%)
Query: 68 DLEEFLDH---GNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI--DYYTKHEQGTA- 121
DL E L F+ + ++ + Y E E+R + FR NLK + K G+A
Sbjct: 35 DLHELLSEERVKELFQQWKEKHRKVYKHVEEAEKRLENFRRNLKYVVEKNQKKKNLGSAH 94
Query: 122 TYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVL 181
T G+N+FADM++ EF S + I+ + T + S S++++ KG V
Sbjct: 95 TVGLNKFADMSNVEFRQKYLSKVKKPIKKRNNNLMTSRQRNLQSCVAPSSLDWRKKGVVT 154
Query: 182 PKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P V+DQ CGSCWA S+ +E AI +L+ LS+Q
Sbjct: 155 P-VKDQGDCGSCWAFSSTGAIEGINAIVTGDLVSLSEQ 191
>gi|115472081|ref|NP_001059639.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|27261016|dbj|BAC45132.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113611175|dbj|BAF21553.1| Os07g0480900 [Oryza sativa Japonica Group]
gi|215693312|dbj|BAG88694.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 376
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 72/144 (50%), Gaps = 4/144 (2%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
QF FVR + R+Y E RR +F NL + + TA +GV F+D+T EF
Sbjct: 47 QFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQALDP-TARHGVTPFSDLTREEFE 105
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNS--YGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
L+ L + ++++ + ++ GL S +++D+G V V+ Q CGSCWA
Sbjct: 106 ARLTGLAADVGDDVRRRPMPSAAPATEEEVSGLPASFDWRDRGAVT-DVKMQGACGSCWA 164
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E A + L++LS+Q
Sbjct: 165 FSTTGAVEGANFLATGNLLDLSEQ 188
>gi|61372279|gb|AAX43816.1| cathepsin H [synthetic construct]
Length = 336
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 81/162 (50%), Gaps = 17/162 (10%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E S LE+F FK ++ ++ + Y ++ E R F +N + I+ H G
Sbjct: 21 GAAELSVNSLEKF-----HFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN---AHNNG 71
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ +N+F+DM+ +E H W + +N +T Y + Y S++++ K
Sbjct: 72 NHTFKMALNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PPSVDWRKK 125
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 167
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 76/139 (54%), Gaps = 3/139 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATYGVNRFADMTDSEFNHGL 140
++ ++ R Y D+E RR ++FR+N+ I+ Q N+FAD+T++EF
Sbjct: 8 WMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFRATR 67
Query: 141 SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
+ L ++ ++ + + ++ L S++++ KG V P V+DQ CG CWA SAVA
Sbjct: 68 TGLRPSSSRGNRAP-TSFRYANVSTGDLPASVDWRGKGAVNP-VKDQGDCGCCWAFSAVA 125
Query: 201 CLESAYAIKHNELIELSKQ 219
+E A + +L+ LS+Q
Sbjct: 126 AMEGAVKLATGKLVSLSEQ 144
>gi|328870281|gb|EGG18656.1| hypothetical protein DFA_04151 [Dictyostelium fasciculatum]
Length = 347
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 64/225 (28%), Positives = 106/225 (47%), Gaps = 31/225 (13%)
Query: 46 VHNLILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRN 105
+ LI+ + S S EE QF++F +Y + Y+S E ++ F+N
Sbjct: 2 IKKLIVAILLLVALASARTSNLSFEE-----TQFREFQLKYNKHYES-HEFAQKLATFKN 55
Query: 106 NLKTI---DYYTKHEQGTATYGVNRFADMTDSEF-NHGLSSLDWEQIENLKSTFETYSFN 161
+LK I + K + +GVN+FAD++ EF N+ L+ E ++ ETY+ +
Sbjct: 56 SLKRIQELNDMAKRAKVDTEFGVNKFADLSKEEFANYYLNKGGMESTDS-----ETYAPD 110
Query: 162 SSNS--YGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
S+ L S +++ +G V P V+DQ CGSCW+ S +E + + N+L LS+Q
Sbjct: 111 YSDKEISNLPTSFDWRTQGAVTP-VKDQGQCGSCWSFSTTGNVEGQWFLAGNDLTGLSEQ 169
Query: 220 ---PPKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNEST 261
T GG+M P + ++ V N G D E++
Sbjct: 170 NLVDCSTKNDGCNGGLM----------PLAYDYIVENNGIDTEAS 204
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/140 (30%), Positives = 71/140 (50%), Gaps = 4/140 (2%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+ ++ Y + Y E E+RF +F+ N+ I+ + G+N+FAD+T EF
Sbjct: 40 EQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLGINQFADLTSEEF--- 96
Query: 140 LSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+ + S T +F N L +SI+++ KG V P +++Q CG CWA SA+
Sbjct: 97 IVPRNRFNGHTRSSNTRTTTFKYENVTVLPDSIDWRQKGAVTP-IKNQGSCGCCWAFSAI 155
Query: 200 ACLESAYAIKHNELIELSKQ 219
A E + I +L+ LS+Q
Sbjct: 156 AATEGIHKISTGKLVSLSEQ 175
>gi|121531602|gb|ABM55486.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 326
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 53/146 (36%), Positives = 77/146 (52%), Gaps = 9/146 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY--GVNRFADMTD 133
+Q+ F + + + Y + E RF IF+ NL I ++ + ++G TY GV RFAD+T
Sbjct: 21 DQWIAFKQTHGKTYKNLLEERTRFGIFQRNLIKIKEHNARCDKGEETYLLGVTRFADLTH 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF L QI+N K + +SI++ +KG VL +V+ Q+ CGSC
Sbjct: 81 EEFKDILKG----QIKN-KPRLNATPTVFPEDLEVPDSIDWTEKGAVL-EVKGQNPCGSC 134
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA LE AI +N I LS+Q
Sbjct: 135 WAFSATGALEGQNAILNNAKISLSEQ 160
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 69/144 (47%), Gaps = 3/144 (2%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++K ++ +Y R+Y D+E RF +F+ N + ID + G N+FAD+T EF
Sbjct: 58 RYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFA 117
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAES--INYKDKGKVLPKVQDQHLCGSCWA 195
+ L + F N L + ++++ +G V P V++Q CG CWA
Sbjct: 118 AMYTGLRKPAAVPSGAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTP-VKNQGQCGCCWA 176
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
SAV +E I L+ LS+Q
Sbjct: 177 FSAVGAMEGLIMITTGNLVSLSEQ 200
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 78/160 (48%), Gaps = 27/160 (16%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ +Y + Y+S E ERRF+IF+ L+ ID + + G+N+FAD+TD EF
Sbjct: 41 MYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEF- 99
Query: 138 HGLSSLDWEQIENLKSTFETYSFNS-----SNSYG------LAESINYKDKGKVLPKVQD 186
+ST+ ++ S SN Y L ++++ G V+ ++
Sbjct: 100 --------------RSTYLGFTSGSNKTKVSNRYEPRVGQVLPSYVDWRSAGAVV-DIKS 144
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGR 226
Q CG CWA SA+A +E I LI LS+Q GR
Sbjct: 145 QGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 78/141 (55%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++++ Y S E RF++F++NLK ID + E + G+N FAD+T EF
Sbjct: 150 FEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNR-EVTSYWLGLNEFADLTHEEFK- 207
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ L + + ++ + ++ L +S++++ KG V +V++Q CGSCWA S
Sbjct: 208 -ATYLGLAPPAPARESRGSFKYEDVSADDLPKSVDWRTKGAVT-EVKNQGQCGSCWAFST 265
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E AI L LS+Q
Sbjct: 266 VAAVEGINAIVTGNLTALSEQ 286
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 55/168 (32%), Positives = 89/168 (52%), Gaps = 12/168 (7%)
Query: 62 EEASTFDLEEFLDHG---NQFKDFVREYERQYDSDS----EIERRFDIFRNNLKTI-DYY 113
+EA + D L H + + +V+EY +++ D E R F++F+ NL I +
Sbjct: 7 KEALSADKSAALAHQKYLSAWSSWVKEYNKEHWVDPYSSPESTRAFEVFQKNLDMIMKHN 66
Query: 114 TKHEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAES 171
++ QG +Y G+N FA +T EF+ ++E K+ S S + S
Sbjct: 67 EEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGYGGAEVEQPKTRRAGKHERKSRSE-IPAS 125
Query: 172 INYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+++++KG V +V++Q CGSCWA SAVA LE A+ + ELI LS+Q
Sbjct: 126 VDWREKGAV-AEVKNQGACGSCWAFSAVAALEGAHFLNSGELISLSEQ 172
>gi|345798093|ref|XP_536212.3| PREDICTED: pro-cathepsin H [Canis lupus familiaris]
Length = 350
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 75/143 (52%), Gaps = 12/143 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
FK + +++++Y S+ E +R F N + I+ H G T+ G+N+F+DM +E
Sbjct: 49 FKSWAVQHQKKYSSE-EYLQRLQTFVGNWRKIN---AHNAGNHTFKMGLNQFSDMNFAEI 104
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
H W + +N +T Y + Y ++++ KGK + V++Q CGSCW
Sbjct: 105 KH---KYLWSEPQNCSATKGNY-LRGTGPY--PPFVDWRKKGKFVSPVKNQGSCGSCWTF 158
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S LESA AIK +L+ L++Q
Sbjct: 159 STTGALESAIAIKSGKLLSLAEQ 181
>gi|29710|emb|CAA34734.1| unnamed protein product [Homo sapiens]
Length = 335
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 81/162 (50%), Gaps = 17/162 (10%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E S LE+F FK ++ ++ + Y ++ E R F +N + I+ H G
Sbjct: 21 GAAELSVNSLEKF-----HFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN---AHNNG 71
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ +N+F+DM+ +E H W + +N +T Y + Y S++++ K
Sbjct: 72 NHTFKMALNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PPSVDWRKK 125
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 167
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 51/144 (35%), Positives = 77/144 (53%), Gaps = 10/144 (6%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ +++R Y++ E RF+IF++NL ID T + + G+N F D+T EF
Sbjct: 48 FESWMLKHDRVYNNIEEKIHRFEIFKDNLMYIDE-TNKKNNSYWLGLNEFVDLTHDEFKE 106
Query: 139 ---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G D+ IE +S E + + Y ESI+++DKG V P + CGSCWA
Sbjct: 107 KYVGSIGEDFVTIE--QSNDEEFPYKHVVDY--PESIDWRDKGAVTP--VKPNPCGSCWA 160
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S VA +E I +LI LS+Q
Sbjct: 161 FSTVATVEGINKIVTGKLISLSEQ 184
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 78/160 (48%), Gaps = 27/160 (16%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ +Y + Y+S E ERRF+IF+ L+ ID + + G+N+FAD+TD EF
Sbjct: 41 MYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEF- 99
Query: 138 HGLSSLDWEQIENLKSTFETYSFNS-----SNSYG------LAESINYKDKGKVLPKVQD 186
+ST+ ++ S SN Y L ++++ G V+ ++
Sbjct: 100 --------------RSTYLGFTSGSNKTKVSNRYEPRVGQVLPSYVDWRSAGAVV-DIKS 144
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGR 226
Q CG CWA SA+A +E I LI LS+Q GR
Sbjct: 145 QGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184
>gi|156354979|ref|XP_001623456.1| predicted protein [Nematostella vectensis]
gi|156210156|gb|EDO31356.1| predicted protein [Nematostella vectensis]
Length = 310
Score = 71.6 bits (174), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 80/145 (55%), Gaps = 10/145 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGV--NRFADMTDS 134
+ F +F +++++ Y+ DSE RR IFR+N++ I + + Y + N FAD+TD
Sbjct: 21 DDFDEFRQQHDKVYEDDSEHRRRKHIFRHNVRYIRSINRR---SLPYKLEPNHFADLTDD 77
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF +LD E + + + + S + + + +++++ G V P + Q CGSCW
Sbjct: 78 EFKSYKGALDDESNDRMHAKIK----RSKRMFEVPDQLDWRNYGAVNP-AKGQGTCGSCW 132
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A + +E+A+ I+ EL+ L++Q
Sbjct: 133 AFATAGAVEAAHFIQKGELLNLAEQ 157
>gi|37651368|ref|NP_932731.1| cathepsin [Choristoneura fumiferana DEF MNPV]
gi|82024252|sp|Q6VTL7.1|CATV_NPVCD RecName: Full=Viral cathepsin; Short=V-cath; AltName: Full=Cysteine
proteinase; Short=CP; Flags: Precursor
gi|37499277|gb|AAQ91676.1| cathepsin [Choristoneura fumiferana DEF MNPV]
Length = 324
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 82/149 (55%), Gaps = 5/149 (3%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
+ L + F+DF+ + + Y S SE RF IF++NL+ I ++ +A Y +N+F+D
Sbjct: 20 DLLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDT-SAQYEINKFSD 78
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
++ E + L ++N ++ E N G E +++ KV V++Q C
Sbjct: 79 LSKDETISKYTGLSLP-LQN-QNFCEVVVLNRPPDKGPLE-FDWRRLNKV-TSVKNQGTC 134
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
G+CWA + + LES +AIKH++LI LS+Q
Sbjct: 135 GACWAFATLGSLESQFAIKHDQLINLSEQ 163
>gi|355566270|gb|EHH22649.1| Cathepsin F [Macaca mulatta]
Length = 484
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 65/131 (49%), Gaps = 12/131 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK+FV Y R Y+S E R +F NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYG--LAESINYKDKGKVLPKVQDQHLCGSCWA 195
L+ L E+ N + S G +++ KG V KV+DQ +CGSCWA
Sbjct: 247 IYLNPLLREEPGN--------KMKQAKSVGDLAPPEWDWRSKGAVT-KVKDQGMCGSCWA 297
Query: 196 HSAVACLESAY 206
S +E +
Sbjct: 298 FSVTGNVEGQW 308
>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
Length = 333
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/145 (33%), Positives = 76/145 (52%), Gaps = 11/145 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQG--TATYGVNRFADMTDS 134
Q+ + E+ + Y + E RR ++ NLK I+ + ++ QG T T G+N F DMT+
Sbjct: 28 QWNQWTAEHGKVYSTGEESLRR-AVWEKNLKMIEQHNLEYSQGKHTFTMGMNAFGDMTNE 86
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
+F ++ Q N F+ + ES+++++KG V P V++QH CGSCW
Sbjct: 87 DFRQMMTGFQ-NQKYNKGEVFQ-----PPQPLEVPESVDWREKGYVTP-VKNQHRCGSCW 139
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA LE K +L+ LS+Q
Sbjct: 140 AFSATGALEGQMFRKTGKLVSLSEQ 164
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/155 (30%), Positives = 75/155 (48%), Gaps = 17/155 (10%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ +Y + Y+S E ERRF+IF+ L+ ID + + G+N+FAD+TD EF
Sbjct: 41 MYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFR 100
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYG------LAESINYKDKGKVLPKVQDQHLCG 191
L+ T + SN Y L ++++ G V+ ++ Q CG
Sbjct: 101 STY----------LRFTSGSNKTKVSNRYEPRVGQVLPSYVDWRSAGAVV-DIKSQGECG 149
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQPPKTHGR 226
CWA SA+A +E I LI LS+Q GR
Sbjct: 150 GCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 78/160 (48%), Gaps = 27/160 (16%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ +Y + Y+S E ERRF+IF+ L+ ID + + G+N+FAD+TD EF
Sbjct: 41 MYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEF- 99
Query: 138 HGLSSLDWEQIENLKSTFETYSFNS-----SNSYG------LAESINYKDKGKVLPKVQD 186
+ST+ ++ S SN Y L ++++ G V+ ++
Sbjct: 100 --------------RSTYLGFTSGSNKTKVSNRYEPRFGQVLPSYVDWRSAGAVV-DIKS 144
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGR 226
Q CG CWA SA+A +E I LI LS+Q GR
Sbjct: 145 QGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 44/133 (33%), Positives = 71/133 (53%), Gaps = 6/133 (4%)
Query: 91 DSDSEIERRFDIFRNNLKTIDYYTKH----EQGTATYGVNRFADMTDSEFNHGLSSLDWE 146
+S ++ ERRF F +NL+ +D + E+G +NRFAD+T+ EF +
Sbjct: 68 NSIADRERRFSAFWDNLRFVDAHNARAAAGEEGF-RLAMNRFADLTNDEFRAAYLGVKGA 126
Query: 147 QIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAY 206
N + + L E++++++KG V P V++Q CGSCWA SAV+ +ES
Sbjct: 127 AERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAP-VKNQGQCGSCWAFSAVSTVESIN 185
Query: 207 AIKHNELIELSKQ 219
I E++ LS+Q
Sbjct: 186 QIVTGEMVTLSEQ 198
>gi|13905172|gb|AAH06878.1| Cathepsin H [Mus musculus]
Length = 333
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 73/141 (51%), Gaps = 8/141 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK +++++++ Y S E R +F NN + I + + T +N+F+DM+ +E H
Sbjct: 33 FKSWMKQHQKTYSS-VEYNHRLQMFANNWRKIQAHNQRNH-TFKMALNQFSDMSFAEIKH 90
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
W + +N +T Y + Y S++++ KG V+ V +Q CGSCW S
Sbjct: 91 ---KFLWSEPQNCSATKSNY-LRGTGPY--PSSMDWRKKGNVVSPVINQGACGSCWTFST 144
Query: 199 VACLESAYAIKHNELIELSKQ 219
LESA AI +++ L++Q
Sbjct: 145 TGALESAVAIASGKMLSLAEQ 165
>gi|294874400|ref|XP_002766937.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239868312|gb|EEQ99654.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 347
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 80/152 (52%), Gaps = 13/152 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-N 137
F DF ++ ++Y+S E +R IF+ NL I+ + + T GVN +AD+T EF
Sbjct: 28 FTDFQHKFGKKYESKEEEMKRNAIFQANLHHIEQ-VNAQNLSYTLGVNEYADLTHEEFVA 86
Query: 138 HGLSSLDWEQIENLKSTFETYS----------FNSSNSYGLAESINYKDKGKVLPKVQDQ 187
+ L + ++K E + F S+++ L S++++ KG VL +++Q
Sbjct: 87 QKVGILKMDARRDVKFDVEGRTSCISHARLSLFVSADTTSLPTSVDWRSKG-VLTPIKNQ 145
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S+ LES YAI+ +L S+Q
Sbjct: 146 GACGSCWAFSSTGTLESKYAIETGQLRSFSEQ 177
>gi|291232495|ref|XP_002736191.1| PREDICTED: cysteine protease and A protease inhibitor,
putative-like [Saccoglossus kowalevskii]
Length = 367
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/154 (29%), Positives = 80/154 (51%), Gaps = 8/154 (5%)
Query: 73 LDHGNQFKDFVREYERQYDS-DSEIERRFDIFRNNLKTIDY---YTKHEQGTATYGVNRF 128
+D QFK+F+ ++ + Y + +E E RF +F+ +L I ++ TA YG+ +F
Sbjct: 37 IDEDVQFKEFILKHRKPYIAGTTEYEHRFRVFQQSLHRIRKRISLSRQLNDTAVYGITQF 96
Query: 129 ADMTDSEFNHGLSSLDWEQIENLKSTFETY--SFNSSN-SYGLAESINYKDKGKVLPKVQ 185
+D+T EF +L + + + + +FNSSN + + + +DK V V+
Sbjct: 97 SDLTPDEFQQMYLTLRPSKSSQIPVSLVQFPSAFNSSNVPPDMPKKYDLRDKSAV-SAVK 155
Query: 186 DQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
DQ CG CW+ S V +E+ + + ++ ELS Q
Sbjct: 156 DQGSCGGCWSFSTVQGMETKWVLNGGKMTELSVQ 189
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/130 (36%), Positives = 70/130 (53%), Gaps = 9/130 (6%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKST 154
E RRF++F+ N+K I + + + +N+F DMT+ EF S +I++ +S
Sbjct: 55 EKNRRFNVFKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFR---SKYAGSKIQHHRSQ 111
Query: 155 F----ETYSFNSSNSYGL-AESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIK 209
T SF N L A SI+++ KG V V+DQ CGSCWA S +A +E IK
Sbjct: 112 RGIQKNTGSFMYENVGSLPAASIDWRAKGAVT-GVKDQGQCGSCWAFSTIASVEGINQIK 170
Query: 210 HNELIELSKQ 219
EL+ LS+Q
Sbjct: 171 TGELVSLSEQ 180
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 78/160 (48%), Gaps = 27/160 (16%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ +Y + Y+S E ERRF+IF+ L+ ID + + G+N+FAD+TD EF
Sbjct: 41 MYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEF- 99
Query: 138 HGLSSLDWEQIENLKSTFETYSFNS-----SNSYG------LAESINYKDKGKVLPKVQD 186
+ST+ ++ S SN Y L ++++ G V+ ++
Sbjct: 100 --------------RSTYLGFTSGSNKTKVSNRYEPRVGQVLPSYVDWRSAGAVV-DIKS 144
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGR 226
Q CG CWA SA+A +E I LI LS+Q GR
Sbjct: 145 QGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184
>gi|21593213|gb|AAM65162.1| cysteine proteinase RD19A [Arabidopsis thaliana]
Length = 368
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 79/153 (51%), Gaps = 14/153 (9%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
+ L + F F R++ + Y S+ E + RF +F+ NL+ + K + +AT+GV +F+D
Sbjct: 43 QVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDP-SATHGVTQFSD 101
Query: 131 MTDSEFNH---GL-SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
+T SEF G+ S + N T + L E +++D G V P V++
Sbjct: 102 LTRSEFRKKHLGVRSGFKLPKDANKAPILPTEN--------LPEDFDWRDHGAVTP-VKN 152
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCW+ SA LE A + +L+ LS+Q
Sbjct: 153 QGSCGSCWSFSATGALEGANFLATGKLVSLSEQ 185
>gi|440804656|gb|ELR25533.1| cysteine proteinase precursor, putative [Acanthamoeba castellanii
str. Neff]
Length = 330
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 52/159 (32%), Positives = 75/159 (47%), Gaps = 15/159 (9%)
Query: 61 SEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT 120
+ EA T E+ QF+ F +Y + Y S+ E R IFR+NL ID G
Sbjct: 20 AAEAGTMTAEQ------QFRQFAAQYGKSYASE-EFGERLRIFRDNLDRIDALNSANTG- 71
Query: 121 ATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKV 180
A YGVN+FAD+T EF + L + K T + + L +++DKG V
Sbjct: 72 ARYGVNKFADLTPKEFKA--TYLKGARSAGQKKAAATAKLDMTGP--LPSQFDWRDKGAV 127
Query: 181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P +DQ CG WA S +ES + + +L+ L+ Q
Sbjct: 128 TP-TKDQGQCG--WAFSVTEAIESQWFLSGRKLVSLAPQ 163
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 52/187 (27%), Positives = 96/187 (51%), Gaps = 24/187 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
++++V ++ + +S +E +RRF+IF++NL+ ID +H +Y G+ +FAD+T+ E+
Sbjct: 42 YEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFID---EHNGKNLSYRLGLTKFADLTNDEY 98
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
S+ K+T + + + + ES++++ +G V +V+DQ CGSCWA
Sbjct: 99 R----SMYLGSRLKRKATKTSLRYEARVGDAIPESVDWRKEGAV-AEVKDQGSCGSCWAF 153
Query: 197 SAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVL 252
S + +E I +LI LS+Q ++ GG+M+ Y+ +
Sbjct: 154 STIGAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMD----------YAFEFIIK 203
Query: 253 NVGYDNE 259
N G D E
Sbjct: 204 NGGIDTE 210
>gi|291385469|ref|XP_002709277.1| PREDICTED: cathepsin F [Oryctolagus cuniculus]
Length = 460
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 67/137 (48%), Gaps = 16/137 (11%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
FK FVR Y R Y+S E + R +F +N+ ++GTA YG+ +F+D+T+ EF
Sbjct: 163 FKKFVRTYNRTYESKEEAQWRLSVFASNMVRAQKIQSLDRGTAQYGITKFSDLTEEEFRT 222
Query: 137 ---NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
N L S ++++ K + +++ KG V V+DQ +CGSC
Sbjct: 223 IYLNPLLRSEPGKKMQLAKPVEDPAP----------PQWDWRSKGAVT-NVKDQGMCGSC 271
Query: 194 WAHSAVACLESAYAIKH 210
WA S +E + +K
Sbjct: 272 WAFSVTGNVEGQWFLKR 288
>gi|225444726|ref|XP_002278624.1| PREDICTED: thiol protease aleurain-like isoform 1 [Vitis vinifera]
gi|147826441|emb|CAN62278.1| hypothetical protein VITISV_031382 [Vitis vinifera]
gi|297738562|emb|CBI27807.3| unnamed protein product [Vitis vinifera]
Length = 362
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 54/158 (34%), Positives = 77/158 (48%), Gaps = 10/158 (6%)
Query: 63 EASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTAT 122
E+S L H + F F Y + Y + EI+ RF+IF NLK I T + T
Sbjct: 47 ESSVLRLIGDTRHAHSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRS-TNRKGLPYT 105
Query: 123 YGVNRFADMTDSEFN-HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVL 181
VN+FAD T EF H L + +N +T + + L E+ ++++ G V
Sbjct: 106 LAVNQFADWTWEEFRRHRLGA-----AQNCSATLK--GNHKLTDVILPETKDWREDGIVS 158
Query: 182 PKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P ++DQ CGSCW S LE+AYA + I LS+Q
Sbjct: 159 P-IKDQGHCGSCWTFSTTGALEAAYAQAFGKGISLSEQ 195
>gi|56758090|gb|AAW27185.1| SJCHGC06231 protein [Schistosoma japonicum]
Length = 372
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 89/176 (50%), Gaps = 13/176 (7%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNN-LKTIDYYTKHEQGTATY--G 124
+LE + G +K F ++R Y + E +RF IF N +K +++ +++G ATY G
Sbjct: 51 NLELLSNIGAAWKFFKINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKMG 110
Query: 125 VNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
VN F D T+ E +I K + +F SS L + ++++ G V P V
Sbjct: 111 VNNFTDKTEYELRKLRGYRSACRIAKPKGS----TFISSEHAKLPDRVDWRRNGAVTP-V 165
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFY-----KGGVMNL 235
++Q CGSCWA S+ +E + K N L+ LS+Q + Y +GG+M+L
Sbjct: 166 KNQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDL 221
>gi|440907378|gb|ELR57532.1| Cathepsin W [Bos grunniens mutus]
Length = 382
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 68/141 (48%), Gaps = 5/141 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F +Y R Y + +E RR DIF NL + + GTA +GV +F+D+T+ EF
Sbjct: 42 FRLFQMQYNRSYPNPAEYARRLDIFAQNLAKAQRLQEEDLGTAEFGVTQFSDLTEEEFVQ 101
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
S + + + + S + +++ G + V+DQ C CWA +A
Sbjct: 102 LYGSQVAGEALGVSRKVGSEEWGESEP----RTCDWRKVGPI-SLVRDQRNCNCCWAMAA 156
Query: 199 VACLESAYAIKHNELIELSKQ 219
+E+ +AIK +E+S Q
Sbjct: 157 AGNIEALWAIKFRHFVEVSVQ 177
>gi|308447426|ref|XP_003087427.1| hypothetical protein CRE_22755 [Caenorhabditis remanei]
gi|308256596|gb|EFP00549.1| hypothetical protein CRE_22755 [Caenorhabditis remanei]
Length = 324
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 72/136 (52%), Gaps = 6/136 (4%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDS 134
+ N F+DF+ +Y R+Y ++ E+ +RF IF N+ ++ Y K + G TY +N F+D++D
Sbjct: 27 YTNAFQDFLVKYLREYKTEDELVKRFTIFSRNMDLVETYNKEDLGKVTYELNDFSDLSDK 86
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD-KGKV-LPKVQDQHLCGS 192
E+ L S S + ES+++++ KG + ++ Q CGS
Sbjct: 87 EWKTFLMS----PKPKSPSKSAAKPSPPKEKRVIPESVDWRNVKGNNHVTGIKYQGPCGS 142
Query: 193 CWAHSAVACLESAYAI 208
CWA + A +ESA +I
Sbjct: 143 CWAFATAAAIESAVSI 158
>gi|2511691|emb|CAB17075.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 365
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 77/150 (51%), Gaps = 7/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+ L+ + F F ++ + Y + E + RF +F++N++ + + + +A +GV +F+
Sbjct: 42 DHLLNAEHHFSTFKSKFGKTYATKEEHDHRFGVFKSNMRRARLHAQLDP-SAVHGVTKFS 100
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T +EF+ L ++ N+ L + +++DKG V V+DQ
Sbjct: 101 DLTPAEFHRKFLGLKPLRLPAHAQKAPILPTNN-----LPKDFDWRDKGAVT-NVKDQGS 154
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ S LE A+ + EL+ LS+Q
Sbjct: 155 CGSCWSFSTTGALEGAHFLATGELVSLSEQ 184
>gi|113120267|gb|ABI30273.1| VXH-B, partial [Vasconcellea x heilbornii]
Length = 266
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 57/180 (31%), Positives = 84/180 (46%), Gaps = 16/180 (8%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
N F ++ EY + Y E +F+IF++NLK ID T + T G+ F D+T+ EF
Sbjct: 46 NLFDSWMVEYGKVYKDIDEKIYKFEIFKDNLKYIDE-TNKKNNTYWLGLTSFTDLTNDEF 104
Query: 137 NH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
G S W E +S E + ++ + + SI+++ KG V P V+ Q CGSC
Sbjct: 105 KEKYVGSISESWSTTE--ESNDEGFIYD--DVVNIPASIDWRQKGAVTP-VRHQGSCGSC 159
Query: 194 WAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKGPYSLNHAVLN 253
W S+VA +E I L+ LS+Q R G P PY+L + N
Sbjct: 160 WTFSSVAAVEGINKIVTGRLVSLSEQELLDCERRSYGCRGGFP-------PYALQYVAQN 212
>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
Length = 365
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 41/150 (27%), Positives = 77/150 (51%), Gaps = 7/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+ L+ + F F ++ + Y + E + RF +F++N++ + + + +A +GV +F+
Sbjct: 42 DHLLNAEHHFSTFKAKFGKTYATKEEHDHRFGVFKSNMRRARLHAQLDP-SAVHGVTKFS 100
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T +EF+ L ++ N+ L + +++DKG V V+DQ
Sbjct: 101 DLTPAEFHRKFLGLKPLRLPAHAQKAPILPTNN-----LPKDFDWRDKGAVT-NVKDQGS 154
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ S LE A+ + EL+ LS+Q
Sbjct: 155 CGSCWSFSTTGALEGAHFLATGELVSLSEQ 184
>gi|18420375|ref|NP_568052.1| cysteine proteinase RD19a [Arabidopsis thaliana]
gi|1172872|sp|P43296.1|RD19A_ARATH RecName: Full=Cysteine proteinase RD19a; Short=RD19; Flags:
Precursor
gi|435618|dbj|BAA02373.1| thiol protease [Arabidopsis thaliana]
gi|4539328|emb|CAB38829.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|7270892|emb|CAB80572.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|19310552|gb|AAL85009.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|22136868|gb|AAM91778.1| putative cysteine proteinase RD19A [Arabidopsis thaliana]
gi|110740898|dbj|BAE98545.1| drought-inducible cysteine proteinase RD19A precursor [Arabidopsis
thaliana]
gi|332661616|gb|AEE87016.1| cysteine proteinase RD19a [Arabidopsis thaliana]
Length = 368
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 79/153 (51%), Gaps = 14/153 (9%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
+ L + F F R++ + Y S+ E + RF +F+ NL+ + K + +AT+GV +F+D
Sbjct: 43 QVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDP-SATHGVTQFSD 101
Query: 131 MTDSEFNH---GL-SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
+T SEF G+ S + N T + L E +++D G V P V++
Sbjct: 102 LTRSEFRKKHLGVRSGFKLPKDANKAPILPTEN--------LPEDFDWRDHGAVTP-VKN 152
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCW+ SA LE A + +L+ LS+Q
Sbjct: 153 QGSCGSCWSFSATGALEGANFLATGKLVSLSEQ 185
>gi|359484377|ref|XP_003633102.1| PREDICTED: thiol protease aleurain-like isoform 2 [Vitis vinifera]
Length = 318
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 80/168 (47%), Gaps = 10/168 (5%)
Query: 53 RSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDY 112
R +S E+S L H + F F Y + Y + EI+ RF+IF NLK I
Sbjct: 37 RLVSDSIRDLESSVLRLIGDTRHAHSFASFAHRYGKSYKTVDEIKLRFEIFSENLKLIRS 96
Query: 113 YTKHEQGTATYGVNRFADMTDSEFN-HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAES 171
T + T VN+FAD T EF H L + +N +T + + L E+
Sbjct: 97 -TNRKGLPYTLAVNQFADWTWEEFRRHRLGA-----AQNCSATLK--GNHKLTDVILPET 148
Query: 172 INYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++++ G V P ++DQ CGSCW S LE+AYA + I LS+Q
Sbjct: 149 KDWREDGIVSP-IKDQGHCGSCWTFSTTGALEAAYAQAFGKGISLSEQ 195
>gi|2351557|gb|AAB68595.1| cathepsin [Choristoneura fumiferana MNPV]
Length = 324
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 48/149 (32%), Positives = 82/149 (55%), Gaps = 5/149 (3%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
+ L + F+DF+ + + Y S SE RF IF++NL+ I ++ +A Y +N+F+D
Sbjct: 20 DLLKAPSYFEDFLHNFNKNYSSKSEKLHRFKIFQHNLEEIINKNLNDT-SAQYEINKFSD 78
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
++ E + L ++N ++ E N G E +++ KV V++Q C
Sbjct: 79 LSKDETISKYTGLSLP-LQN-QNFCEVVVLNRPPDKGPLE-FDWRRLNKV-TSVKNQGTC 134
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
G+CWA + + LES +AIKH++LI LS+Q
Sbjct: 135 GACWAFATLGSLESQFAIKHDQLINLSEQ 163
>gi|301104775|ref|XP_002901472.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|66270081|gb|AAY43370.1| cathepsin-like cysteine protease [Phytophthora infestans]
gi|262100947|gb|EEY58999.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 376
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 55/177 (31%), Positives = 91/177 (51%), Gaps = 22/177 (12%)
Query: 51 LQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSE----IERRFDIFRNN 106
L P+S + E T++ F D+ +YE+ Y +D+ ++ RF F N
Sbjct: 21 LTTDLPSSLTASEQKTWE---------AFVDYALDYEKSYRNDANDHDVVQLRFRSFATN 71
Query: 107 LKTIDYYTK-HEQG--TATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSF-NS 162
L+ I + + +E+G + T G+N AD+ D+E+ LS + + KS+ + +F
Sbjct: 72 LERIQTHNEAYERGEHSFTLGLNDLADLADAEYKQLLSY----RTRDSKSSSASETFVKP 127
Query: 163 SNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
N L + ++++ V P V++Q CGSCWA SAVA +E AYA+ L LS+Q
Sbjct: 128 ENVEDLPATWDWREHSTVTP-VKNQGQCGSCWAFSAVAAMECAYALSTGTLESLSEQ 183
>gi|426248750|ref|XP_004018122.1| PREDICTED: pro-cathepsin H [Ovis aries]
Length = 355
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 83/160 (51%), Gaps = 13/160 (8%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E + LE+F F+ ++ +++++Y S+ E R +F +NL+ I+ +
Sbjct: 41 GAAELAVNSLEKF-----HFQSWMVQHQKKYSSE-EYHHRLQVFASNLREINAHNARNH- 93
Query: 120 TATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGK 179
T G+N+F+DM+ +E W + +N +T Y + Y S+++++KG
Sbjct: 94 TFKMGLNQFSDMSFAELKR---KYLWSEPQNCSATKSNY-LRGTGPY--PPSMDWREKGN 147
Query: 180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ V++Q CGSCW S LESA AI +L L++Q
Sbjct: 148 FVTPVKNQGSCGSCWTFSTTGALESAVAIATGKLPFLAEQ 187
>gi|394331805|gb|AFN27125.1| cysteine protease [Leishmania major]
Length = 348
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 80/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKACADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|410045434|ref|XP_003313198.2| PREDICTED: LOW QUALITY PROTEIN: cathepsin F [Pan troglodytes]
Length = 548
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 64/133 (48%), Gaps = 16/133 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
FK+FV Y R Y+S E R +F NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 251 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 310
Query: 137 ---NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
N L +++ KS + +++ KG V KV+DQ +CGSC
Sbjct: 311 IYLNPLLRKEPGNKMKQAKSVGDLAP----------PEWDWRSKGAVT-KVKDQGMCGSC 359
Query: 194 WAHSAVACLESAY 206
WA S +E +
Sbjct: 360 WAFSVTGNVEGQW 372
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 78/160 (48%), Gaps = 27/160 (16%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++ ++ +Y + Y+S E ERRF+IF+ L+ ID + + G+N+FAD+TD EF
Sbjct: 41 MYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEF- 99
Query: 138 HGLSSLDWEQIENLKSTFETYSFNS-----SNSYG------LAESINYKDKGKVLPKVQD 186
+ST+ ++ S SN Y L ++++ G V+ ++
Sbjct: 100 --------------RSTYLGFTSGSNKTKVSNRYEPRVGQVLPSYVDWRSAGAVV-DIKS 144
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGR 226
Q CG CWA SA+A +E I LI LS+Q GR
Sbjct: 145 QGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGR 184
>gi|226477902|emb|CAX72658.1| Cathepsin L precursor [Schistosoma japonicum]
gi|226488903|emb|CAX74801.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 89/176 (50%), Gaps = 13/176 (7%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNN-LKTIDYYTKHEQGTATY--G 124
+LE + G +K F ++R Y + E +RF IF N +K +++ +++G ATY G
Sbjct: 51 NLELLSNIGAAWKFFKINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKMG 110
Query: 125 VNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
VN F D T+ E +I K + +F SS L + ++++ G V P V
Sbjct: 111 VNNFTDKTEYELRKLRGYRSACRIAKPKGS----TFISSEHAKLPDRVDWRRNGAVTP-V 165
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFY-----KGGVMNL 235
++Q CGSCWA S+ +E + K N L+ LS+Q + Y +GG+M+L
Sbjct: 166 KNQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDL 221
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 70/141 (49%), Gaps = 9/141 (6%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ E+ R Y ++E RR ++FR N + ID + + NRFAD+T EF +
Sbjct: 41 WMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVQEFRAART 100
Query: 142 SLDWEQIENLKST---FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
L + + +E +S + A+S++++ G V V+DQ G CWA SA
Sbjct: 101 GLRPRPAPSAGAGRFRYENFSLADA-----AQSVDWRAMGAVT-GVKDQGASGCCWAFSA 154
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E I+ L+ LS+Q
Sbjct: 155 VAAVEGLNKIRTGRLVSLSEQ 175
>gi|158148921|dbj|BAF81994.1| cysteine proteinase [Platycodon grandiflorus]
Length = 359
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 56/168 (33%), Positives = 81/168 (48%), Gaps = 12/168 (7%)
Query: 63 EASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTAT 122
EAS + H F F Y + Y++ E++RRF IF ++LK I + K + T
Sbjct: 44 EASVLQVIGQTRHSLAFARFAHRYGKSYETAEEMKRRFSIFVDSLKMIRSHNKKGL-SYT 102
Query: 123 YGVNRFADMTDSEF-NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAE-SINYKDKGKV 180
GVN FAD+T EF H L + +N +T + N + GL ++++ G V
Sbjct: 103 LGVNEFADLTWEEFRKHRLGA-----AQNCSATLKG---NHKLTNGLLPLKKDWREVGIV 154
Query: 181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFY 228
P V++Q CGSCW S LE+AY + I LS+Q R Y
Sbjct: 155 TP-VKNQGHCGSCWTFSTTGALEAAYVQAFGKAIFLSEQQLVDCARAY 201
>gi|157864845|ref|XP_001681131.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124425|emb|CAJ02281.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 80/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|7219908|gb|AAF40479.1| cystein protease [Clonorchis sinensis]
Length = 326
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 84/175 (48%), Gaps = 27/175 (15%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKT 90
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + + E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDLTPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNL 235
S + + + K L+ LS+Q PP+T+ K G + L
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKMGGLEL 197
>gi|226469954|emb|CAX70258.1| Cathepsin L precursor [Schistosoma japonicum]
Length = 372
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 55/176 (31%), Positives = 89/176 (50%), Gaps = 13/176 (7%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNN-LKTIDYYTKHEQGTATY--G 124
+LE + G +K F ++R Y + E +RF IF N +K +++ +++G ATY G
Sbjct: 51 NLELLSNIGAAWKFFKINFKRAYGNVMEETKRFLIFGTNFIKMMEHNRAYQEGKATYKMG 110
Query: 125 VNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
VN F D T+ E +I K + +F SS L + ++++ G V P V
Sbjct: 111 VNNFTDKTEYELRKLRGYRSACRIAKPKGS----TFISSEHAKLPDRVDWRRNGAVTP-V 165
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFY-----KGGVMNL 235
++Q CGSCWA S+ +E + K N L+ LS+Q + Y +GG+M+L
Sbjct: 166 KNQGQCGSCWAFSSTGAIEGQHYRKTNRLVNLSEQQLIDCSKSYGNNGCEGGLMDL 221
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 79/150 (52%), Gaps = 16/150 (10%)
Query: 72 FLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKH--EQGTATYGVNRFA 129
++HG + + + E++ RF +F +NL+ +D + + E G G+N+FA
Sbjct: 60 LVEHGRRVSNVLGEHDS----------RFRVFWDNLRFVDAHNERAGEHGF-RLGMNQFA 108
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T+ EF + + E Y + + L ES+++++KG V P V++Q
Sbjct: 109 DLTNDEFRAAYLGARIPAARSGNAVGEMYRHDGAEE--LPESVDWREKGAVAP-VKNQGQ 165
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV+ +ES I E++ LS+Q
Sbjct: 166 CGSCWAFSAVSSVESINQIVTGEMVTLSEQ 195
>gi|157864849|ref|XP_001681133.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124427|emb|CAJ02283.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 80/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 72/132 (54%), Gaps = 4/132 (3%)
Query: 89 QYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDW-EQ 147
Q D SE+E+R IF+NNL+ I+ + + G+N+++D+T EF + L +Q
Sbjct: 72 QNDKISELEKRKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGLKVSKQ 131
Query: 148 IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYA 207
+ + K FN ++ + + +++ +G V V+DQ CG CWA S VA +E A
Sbjct: 132 LSSSKMRSAAVPFNLNDD--VPTNFDWRQQGAV-TDVKDQGSCGCCWAFSVVAAVEGAVK 188
Query: 208 IKHNELIELSKQ 219
I ELI LS+Q
Sbjct: 189 INTGELISLSEQ 200
>gi|85068702|gb|ABC69431.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 84/175 (48%), Gaps = 27/175 (15%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKT 90
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + + E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDLTPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNL 235
S + + + K L+ LS+Q PP+T+ K G + L
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDGGCDGGYPPQTYTAIQKMGGLEL 197
>gi|1749812|emb|CAA90237.1| cysteine proteinase LmCPB1 [Leishmania mexicana]
Length = 359
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 79/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y++ +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREH-QARNPHAQFGITKFFDLSEAEFCA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + T + Y ++ + +++++++KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKRHTPQHYPKARADLSAVPDAVDWREKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +E + + +EL+ LS+Q
Sbjct: 156 VGNIEGQWYLAGHELVSLSEQ 176
>gi|328876826|gb|EGG25189.1| hypothetical protein DFA_03437 [Dictyostelium fasciculatum]
Length = 341
Score = 71.2 bits (173), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 42/146 (28%), Positives = 82/146 (56%), Gaps = 7/146 (4%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
D+ +FK ++ E+ + Y + E R F N+ +I+ + TAT+G+N+F+D++
Sbjct: 27 DYTTRFKTWMVEHNKMYHEEEEFYLRLSNFIRNIHSIEKMNRQYGRTATFGLNKFSDLSL 86
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF +++ + T ET+++ S+ + +++++ KG V P V++Q +CGSC
Sbjct: 87 DEFKKHYLMPNYK--PKARVTKETFNYPSN----IPATLDWRTKGYVTP-VKNQLMCGSC 139
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA +E+A + ++ LS+Q
Sbjct: 140 WAFSATEQIETANIMAGGQVEYLSEQ 165
>gi|394331743|gb|AFN27094.1| cysteine protease [Leishmania major]
Length = 348
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 80/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|7211745|gb|AAF40416.1|AF216785_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
gi|7381223|gb|AAF61442.1|AF138266_1 papain-like cysteine proteinase isoform III [Ipomoea batatas]
Length = 366
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 44/151 (29%), Positives = 76/151 (50%), Gaps = 10/151 (6%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQ--GTATYGVNRF 128
+ L+ + F F R + + Y SD E + R +F+ N++ +H+Q A +GV +F
Sbjct: 41 DLLNADHHFAVFKRRFGKAYASDEEHDYRLSVFKANMRRA---KRHQQLDPAAVHGVTQF 97
Query: 129 ADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
+D+T +EF L+ LK + + + L +++D+G V P V++Q
Sbjct: 98 SDLTPTEFRRKFLGLN----RRLKFPADAKTAPILPTDELPSDFDWRDRGAVTP-VKNQG 152
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ S LE A + +L+ LS+Q
Sbjct: 153 TCGSCWSFSTTGALEGANFLATGKLVSLSEQ 183
>gi|116786550|gb|ABK24153.1| unknown [Picea sitchensis]
Length = 394
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 46/153 (30%), Positives = 77/153 (50%), Gaps = 15/153 (9%)
Query: 72 FLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADM 131
L+ F FV+++ ++Y E RRF IF+ NL + K ++ A +G+N+F+D+
Sbjct: 68 LLNAEAHFAHFVKKFNKEYSGAEEHARRFSIFKKNLHKALRHQKLDR-DAIHGINKFSDL 126
Query: 132 TDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSY-----GLAESINYKDKGKVLPKVQD 186
T+ EF+ EQ L + + S + + L ++++ G V P V++
Sbjct: 127 TEEEFH--------EQYLGLTTPPRSLSQRTQPAPILPTDDLPPDFDWRELGAVTP-VKN 177
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCW S +E A +K +LI LS+Q
Sbjct: 178 QGACGSCWTFSTTGAMEGANFMKTGKLISLSEQ 210
>gi|157864851|ref|XP_001681134.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124428|emb|CAJ02284.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|378943050|gb|AFC76266.1| cathepsin L-like protease [Leishmania major]
gi|378943052|gb|AFC76267.1| cathepsin L-like protease [Leishmania major]
gi|378943054|gb|AFC76268.1| cathepsin L-like protease [Leishmania major]
gi|378943058|gb|AFC76270.1| cathepsin L-like protease [Leishmania major]
gi|394331737|gb|AFN27091.1| cysteine protease [Leishmania major]
gi|394331741|gb|AFN27093.1| cysteine protease [Leishmania major]
gi|394331747|gb|AFN27096.1| cysteine protease [Leishmania major]
gi|394331749|gb|AFN27097.1| cysteine protease [Leishmania major]
gi|394331751|gb|AFN27098.1| cysteine protease [Leishmania major]
gi|394331753|gb|AFN27099.1| cysteine protease [Leishmania major]
gi|394331755|gb|AFN27100.1| cysteine protease [Leishmania major]
gi|394331757|gb|AFN27101.1| cysteine protease [Leishmania major]
gi|394331759|gb|AFN27102.1| cysteine protease [Leishmania major]
gi|394331761|gb|AFN27103.1| cysteine protease [Leishmania major]
gi|394331763|gb|AFN27104.1| cysteine protease [Leishmania major]
gi|394331765|gb|AFN27105.1| cysteine protease [Leishmania major]
gi|394331767|gb|AFN27106.1| cysteine protease [Leishmania major]
gi|394331769|gb|AFN27107.1| cysteine protease [Leishmania major]
gi|394331771|gb|AFN27108.1| cysteine protease [Leishmania major]
gi|394331773|gb|AFN27109.1| cysteine protease [Leishmania major]
gi|394331775|gb|AFN27110.1| cysteine protease [Leishmania major]
gi|394331777|gb|AFN27111.1| cysteine protease [Leishmania major]
gi|394331779|gb|AFN27112.1| cysteine protease [Leishmania major]
gi|394331781|gb|AFN27113.1| cysteine protease [Leishmania major]
gi|394331783|gb|AFN27114.1| cysteine protease [Leishmania major]
gi|394331785|gb|AFN27115.1| cysteine protease [Leishmania major]
gi|394331787|gb|AFN27116.1| cysteine protease [Leishmania major]
gi|394331789|gb|AFN27117.1| cysteine protease [Leishmania major]
gi|394331791|gb|AFN27118.1| cysteine protease [Leishmania major]
gi|394331793|gb|AFN27119.1| cysteine protease [Leishmania major]
gi|394331795|gb|AFN27120.1| cysteine protease [Leishmania major]
gi|394331797|gb|AFN27121.1| cysteine protease [Leishmania major]
gi|394331799|gb|AFN27122.1| cysteine protease [Leishmania major]
gi|394331801|gb|AFN27123.1| cysteine protease [Leishmania major]
gi|394331803|gb|AFN27124.1| cysteine protease [Leishmania major]
Length = 348
Score = 71.2 bits (173), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 80/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|268581031|ref|XP_002645498.1| Hypothetical protein CBG22748 [Caenorhabditis briggsae]
Length = 379
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 77/141 (54%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F +F+ ++ R Y S E + R+ IF +N++ + + G + +N F D ++ E
Sbjct: 78 FDEFLYKFNRLYSSQEEYKYRYHIFVHNVREFEEEERKHPGL-DFDINEFTDWSEEELRK 136
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ +D + ++ K+ S+ SI+++D+GK+ P +++Q CGSCWA +
Sbjct: 137 MI--VDKKNVKEEKNAVRFEGSVLSSGIKRPASIDWRDQGKLTP-IKNQGQCGSCWAFAT 193
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E+ +AIK L+ LS+Q
Sbjct: 194 VAAIEAQHAIKKGILVSLSEQ 214
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 77/151 (50%), Gaps = 24/151 (15%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
F+ ++ ++ + Y + E RF+IF++NLK ID T + + G+N FADM++ EF
Sbjct: 48 FESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDE-TNKKNNSYWLGLNVFADMSNDEFKE 106
Query: 137 --------NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
N+ + L +E++ N + E ++++ KG V P V++Q
Sbjct: 107 KYTGSIAGNYTTTELSYEEVLN------------DGDVNIPEYVDWRQKGAVTP-VKNQG 153
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E I+ L E S+Q
Sbjct: 154 SCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQ 184
>gi|157864855|ref|XP_001681136.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124430|emb|CAJ02286.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 80/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|42794048|dbj|BAD11762.1| cahepsin L-like cysteine protease [Brugia malayi]
Length = 371
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 53/145 (36%), Positives = 75/145 (51%), Gaps = 15/145 (10%)
Query: 84 REYERQYDS-DSEIE---RRFDIFRNNLKTIDYYT-KHEQGTATY--GVNRFADMTDSEF 136
R Y+R+Y+ D EI RRF + N+K I+ + ++E+ TY +N ADM EF
Sbjct: 55 RLYKRKYNKRDEEINLEHRRFMTYLKNVKEIEKHNERYERNEETYELAINHLADMLPEEF 114
Query: 137 N--HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
HG S N K+T N L +SI+++ G V KV+DQ CGSCW
Sbjct: 115 RKLHGFQSRKITSKNNFKNTIRM-KINGP----LPKSIDWRTSGAV-TKVKDQGYCGSCW 168
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
SAV LE + ++ +L+ELS Q
Sbjct: 169 TFSAVGALEGQHFLQTGKLVELSMQ 193
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 75/149 (50%), Gaps = 12/149 (8%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGTATY--GVNRFAD 130
+H N +KD+ + ++Y E RR ++ NLK I+ + +H G TY G+N F D
Sbjct: 26 EHWNLWKDW---HSKKYHEKEEGWRRM-VWEKNLKKIELHNLEHSMGKHTYSLGMNHFGD 81
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
MT EF ++ + L+ + F N S++++DKG V P V+DQ C
Sbjct: 82 MTHEEFRQIMNGYKLKSQRKLRGSL----FMEPNFLEAPRSVDWRDKGYVTP-VKDQGQC 136
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA S +E + K L+ LS+Q
Sbjct: 137 GSCWAFSTTGAMEGQHFRKTGTLVSLSEQ 165
>gi|116242314|gb|ABJ89814.1| cysteine protease preprotein [Clonorchis sinensis]
Length = 326
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 84/175 (48%), Gaps = 27/175 (15%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFKT 90
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + + E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDLTPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNL 235
S + + + K L+ LS+Q PP+T+ K G + L
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKMGGLEL 197
>gi|157864843|ref|XP_001681130.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124424|emb|CAJ02280.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 348
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 80/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 42/138 (30%), Positives = 67/138 (48%), Gaps = 2/138 (1%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLS 141
++ +Y + Y +E E+RF IF NN++ I+ + +N AD T+ EF
Sbjct: 41 WMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNKPYKLSINHLADQTNEEFMASHK 100
Query: 142 SLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVAC 201
+ L+ T +T F N + +++++ KG ++DQ CG CWA SAVA
Sbjct: 101 GYKGSHWQGLRITTQT-PFKYENVTDIPWAVDWRQKGDAT-SIKDQGQCGICWAFSAVAA 158
Query: 202 LESAYAIKHNELIELSKQ 219
E Y I L+ LS+Q
Sbjct: 159 TEGIYQITTGNLVSLSEQ 176
>gi|71084304|gb|AAZ23597.1| cysteine protease [Leishmania aethiopica]
Length = 353
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 81/143 (56%), Gaps = 5/143 (3%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN-RFADMTDSEF 136
+ F + + + + D+E RRF+ F+ N++T + H A Y V+ +FAD+T EF
Sbjct: 40 HYGRFKKRHGKAFGEDAEEGRRFNAFKQNMQTAYFLNAHNP-HAHYDVSGKFADLTPQEF 98
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ ++ + K E + S G+ S+++++KG V P V++Q +CGSCWA
Sbjct: 99 AKLYLNPNYYA-RHGKDYKEHVHVDDSVRSGVM-SVDWREKGAVTP-VKNQGMCGSCWAF 155
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
SA+ +E +A+K++ L+ LS+Q
Sbjct: 156 SAIGNIEGQWALKNHSLVSLSEQ 178
>gi|85068704|gb|ABC69432.1| cysteine protease [Clonorchis sinensis]
Length = 326
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 51/175 (29%), Positives = 84/175 (48%), Gaps = 27/175 (15%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
+++F +Y++ Y +D + E RF+IF++NL + EQGTA YGV +F+D+T EF
Sbjct: 32 YEEFKLKYKKTYSNDDD-ELRFEIFKDNLLRAKRLQEMEQGTAQYGVTQFSDLTSEEFET 90
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ D + + E + ++ E ++++ G V P V DQ CGSCWA
Sbjct: 91 RYLRMRFDGPIVSEDLTPEEDVTMDN-------EKFDWREHGAVGP-VLDQGKCGSCWAF 142
Query: 197 SAVACLESAYAIKHNELIELSKQ----------------PPKTHGRFYKGGVMNL 235
S + + + K L+ LS+Q PP+T+ K G + L
Sbjct: 143 SVIGNVVGQWFRKTGHLLALSEQQLVDCDYLDDGCDGGYPPQTYTAIQKMGGLEL 197
>gi|16444924|dbj|BAB70669.1| cysteine proteinase [Daucus carota]
Length = 208
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 44/128 (34%), Positives = 68/128 (53%), Gaps = 6/128 (4%)
Query: 94 SEIERRFDIFRNNLKTIDYYTKHEQGTATYG--VNRFADMTDSEFNHGLSSLDWEQIENL 151
+E + RF++F+ N+K I K Q Y VN+FADMT EF + + +L
Sbjct: 54 TEKQIRFNVFKTNVKHIH---KVNQMNKPYKLEVNKFADMTYHEFRNSYGGSKVKHFRSL 110
Query: 152 KSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN 211
+ F N+ L S++++ G V P +++Q CGSCWA SA+ +E IK N
Sbjct: 111 RGDRARTGFMHENTKHLPSSVDWRKHGAVTP-IKNQGRCGSCWAFSAIVGVEGINKIKTN 169
Query: 212 ELIELSKQ 219
+L+ LS+Q
Sbjct: 170 QLVSLSEQ 177
>gi|378943060|gb|AFC76271.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 80/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|222637029|gb|EEE67161.1| hypothetical protein OsJ_24244 [Oryza sativa Japonica Group]
Length = 309
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 72/144 (50%), Gaps = 4/144 (2%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
QF FVR + R+Y E RR +F NL + + TA +GV F+D+T EF
Sbjct: 47 QFAAFVRRHGREYSGPEEYARRLRVFAANLARAAAHQALDP-TARHGVTPFSDLTREEFE 105
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNS--YGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
L+ L + ++++ + ++ GL S +++D+G V V+ Q CGSCWA
Sbjct: 106 ARLTGLAADVGDDVRRRPMPSAAPATEEEVSGLPASFDWRDRGAVT-DVKMQGACGSCWA 164
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S +E A + L++LS+Q
Sbjct: 165 FSTTGAVEGANFLATGNLLDLSEQ 188
>gi|1848231|gb|AAB48120.1| cathepsin L-like protease [Leishmania major]
Length = 443
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 80/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|33945878|emb|CAE45589.1| papain-like cysteine proteinase-like protein 2 [Lotus japonicus]
Length = 361
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 82/150 (54%), Gaps = 7/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+E L + F +F R + + Y ++ E RF++F++N+ + + +A +GV RF+
Sbjct: 36 DEGLGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-SAVHGVTRFS 94
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T EF H + L + L S ++ +++ L + ++++ G V P V++Q
Sbjct: 95 DLTPMEFRHSVLGL---RGVGLPSDADSAPILPTDN--LPKDFDWREHGAVTP-VKNQGS 148
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ SA LE A+ + +L+ LS+Q
Sbjct: 149 CGSCWSFSATGALEGAHFLSTGKLVSLSEQ 178
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 43/139 (30%), Positives = 76/139 (54%), Gaps = 3/139 (2%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATYGVNRFADMTDSEFNHGL 140
++ ++ R Y D+E RR ++FR+N+ I+ Q N+FAD+T++EF
Sbjct: 8 WMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTNAEFRATR 67
Query: 141 SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVA 200
+ L ++ ++ + + ++ L S++++ KG V P V+DQ CG CWA SAVA
Sbjct: 68 TGLRPSSSRGNRAP-TSFRYANVSTGDLPASVDWRGKGAVNP-VKDQGDCGCCWAFSAVA 125
Query: 201 CLESAYAIKHNELIELSKQ 219
+E A + +L+ LS+Q
Sbjct: 126 AMEGAVKLATGKLVSLSEQ 144
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 43/150 (28%), Positives = 73/150 (48%), Gaps = 8/150 (5%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
++ L ++K + +Y Y D+E E+ IF++N+ ID + + +NRFA
Sbjct: 30 DQSLTLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNVAYIDSFNAAGNKSYKLTINRFA 89
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+ + G ++E S+ F N + +++++ +G V P V++Q
Sbjct: 90 DLPTEPSDDGFKK---RKLEPTTSSL----FKYKNITDIPAAVDWRKRGAVTP-VKNQRE 141
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV LE I L+ LS+Q
Sbjct: 142 CGSCWAFSAVGALEGIQQITSGNLVSLSEQ 171
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 67/132 (50%), Gaps = 6/132 (4%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R Y + E RF IF+ N+K I+ K + G+N FAD+T EF + L+
Sbjct: 48 RVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLN--- 104
Query: 148 IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYA 207
I N S N + + ++++++ G V +V++Q CG CWA SAV LE AY
Sbjct: 105 IPN--SYLSPSPINDLSDDDMPSNLDWRESGAV-TQVKNQGQCGCCWAFSAVGSLEGAYK 161
Query: 208 IKHNELIELSKQ 219
I L+E S+Q
Sbjct: 162 IATGNLMEFSEQ 173
>gi|378943046|gb|AFC76264.1| cathepsin L-like protease [Leishmania major]
gi|378943056|gb|AFC76269.1| cathepsin L-like protease [Leishmania major]
gi|394331745|gb|AFN27095.1| cysteine protease [Leishmania major]
Length = 348
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 80/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 67/132 (50%), Gaps = 6/132 (4%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R Y + E RF IF+ N+K I+ K + G+N FAD+T EF + L+
Sbjct: 48 RVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLN--- 104
Query: 148 IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYA 207
I N S N + + ++++++ G V +V++Q CG CWA SAV LE AY
Sbjct: 105 IPN--SYLSPSPINDLSDDDMPSNLDWRESGAV-TQVKNQGQCGCCWAFSAVGSLEGAYK 161
Query: 208 IKHNELIELSKQ 219
I L+E S+Q
Sbjct: 162 IATGNLMEFSEQ 173
>gi|328866326|gb|EGG14711.1| hypothetical protein DFA_10969 [Dictyostelium fasciculatum]
Length = 369
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 56/196 (28%), Positives = 93/196 (47%), Gaps = 32/196 (16%)
Query: 58 SYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHE 117
+ GS+ + F E++ +FK +V ++E+ Y+S E RFDIF+ N+ I + +
Sbjct: 25 ALGSKPTALFSHEQYT---TEFKGWVGQFEKNYES-HEFLNRFDIFKKNMDYIKTWND-K 79
Query: 118 QGTATYGVNRFADMTDSEFNH-------------GLSSLDWEQIENLKSTFETYSFNSSN 164
+N AD+TD E+ GL+ D ++KS F N
Sbjct: 80 SVDHKLELNTLADLTDKEYQRLYLGTKVNGALRVGLNHADERDFGHIKSVFSNVKDN--- 136
Query: 165 SYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQPPKTH 224
+++++ +G V V++Q CGSCW+ S+ +E A+AIK E+I LS+Q
Sbjct: 137 -----PNVDWRKQGAV-SHVKNQGQCGSCWSFSSTGAIEGAHAIKTGEMISLSEQQLVDC 190
Query: 225 GRFY-----KGGVMNL 235
+ Y GG+M L
Sbjct: 191 SKRYGNNGCNGGLMTL 206
>gi|195382039|ref|XP_002049740.1| GJ20585 [Drosophila virilis]
gi|194144537|gb|EDW60933.1| GJ20585 [Drosophila virilis]
Length = 333
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 52/146 (35%), Positives = 78/146 (53%), Gaps = 9/146 (6%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDS 134
+++ F EYE++Y+S+ E R IF +N K ID + ++ G Y GVN+F DM
Sbjct: 26 EWETFKVEYEKRYESEDEELLRKLIFYDNKKAIDKHNIRYALGKEAYEMGVNQFTDMLPK 85
Query: 135 EFNHGLSSLDWEQIENLKSTFET-YSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF SL I +T + +++ + + SI+++ KG V V+DQ CGSC
Sbjct: 86 EFG----SLMLTSINLTDATSDIDIIYSAPENTEIPSSIDWRVKGAV-TSVKDQGKCGSC 140
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SAV LE +K +L+ LS Q
Sbjct: 141 WAFSAVGTLEGQQFLKTRQLMSLSTQ 166
>gi|402892718|ref|XP_003909556.1| PREDICTED: cathepsin F [Papio anubis]
Length = 460
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/135 (33%), Positives = 66/135 (48%), Gaps = 12/135 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK+FV Y R Y+S E R +F NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 163 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 222
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYG--LAESINYKDKGKVLPKVQDQHLCGSCWA 195
L+ L E+ N + S G +++ KG V KV+DQ +CGSCWA
Sbjct: 223 IYLNPLLREEPGN--------KMKQAKSVGDLAPPEWDWRSKGAVT-KVKDQGMCGSCWA 273
Query: 196 HSAVACLESAYAIKH 210
S +E + +
Sbjct: 274 FSVTGNVEGQWFLNQ 288
>gi|348500228|ref|XP_003437675.1| PREDICTED: pro-cathepsin H-like [Oreochromis niloticus]
Length = 276
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 60/216 (27%), Positives = 92/216 (42%), Gaps = 47/216 (21%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG--TATYGVNRFADMTDSEF 136
FK ++ E+ + Y ++ E R +F+ N +T++ +H G + T G+N+F+DMT EF
Sbjct: 30 FKQWISEHNKVYGTE-EYHHRLHVFKQNKRTVE---QHNAGNHSFTMGLNQFSDMTFEEF 85
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+K + + + E ++++ KG + V+DQ CGSCW
Sbjct: 86 KKFYLFTQPSTCSVIKGS------HVKRTGPYPEFVDWRMKGDFVTPVKDQGFCGSCWTF 139
Query: 197 SAVACLESAYAIKHNELIELSKQP--------------------------PKTHG----- 225
S CLES AI +LI+LS+Q P T
Sbjct: 140 STTGCLESVNAIATGKLIQLSEQQLLDCSRNFNVVKYDEKAMVDAVARLNPITSCFDVTA 199
Query: 226 --RFYKGGVMNLPHMLCSKGPYSLNHAVLNVGYDNE 259
+ YK GV + C +NHAVL VGY E
Sbjct: 200 EFKHYKEGVYSSTQ--CKNTTDKVNHAVLAVGYGTE 233
>gi|74927078|sp|Q86GF7.1|CRUST_PANBO RecName: Full=Crustapain; AltName: Full=NsCys; Flags: Precursor
gi|28971811|dbj|BAC65417.1| crustapain [Pandalus borealis]
Length = 323
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 47/155 (30%), Positives = 81/155 (52%), Gaps = 30/155 (19%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY--GVNRFADMTDS 134
++++F ++ ++Y + E R +F + LK I ++ ++++G TY +N F+D+T
Sbjct: 19 EWENFKTKFGKKYANSEEESHRMSVFMDKLKFIQEHNERYDKGEVTYWLKINNFSDLTHE 78
Query: 135 EF----------NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
E H LS L S+ + +A +++++KG V P V
Sbjct: 79 EVLATKTGMTRRRHPLSVLP----------------KSAPTTPMAADVDWRNKGAVTP-V 121
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+DQ CGSCWA SAVA LE A+ +K +L+ LS+Q
Sbjct: 122 KDQGQCGSCWAFSAVAALEGAHFLKTGDLVSLSEQ 156
>gi|294897727|ref|XP_002776051.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239882576|gb|EER07867.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 361
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 77/144 (53%), Gaps = 5/144 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
F F ++ ++Y+S E +R IF+ NL I+ + GVN + D+T EF
Sbjct: 31 FIGFQYKFGKKYESKEEEIKRNAIFQVNLHHIEQINARNL-SYKLGVNEYTDLTHEEFAA 89
Query: 137 -NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
G+ + + +N S + S+++ LA S+++++K VL ++DQ CGSCWA
Sbjct: 90 LKLGILKMSLRKDDNWISLANSSLLVSADTTQLAASVDWRNK-SVLTPIKDQGHCGSCWA 148
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S+ LE+ YAI +L+ LS+Q
Sbjct: 149 FSSTGALEAQYAIATGKLLSLSEQ 172
>gi|52546920|gb|AAU81593.1| cysteine proteinase [Petunia x hybrida]
Length = 210
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 54/161 (33%), Positives = 85/161 (52%), Gaps = 16/161 (9%)
Query: 83 VREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNH-- 138
R++ + Y+S E RF+IF+ NLK ID K + Y G+N F+D++ EF
Sbjct: 1 TRQHGKIYESIEEKLHRFEIFKENLKHIDERNKI---VSNYWLGLNEFSDLSHDEFKKMY 57
Query: 139 -GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
GL +D + + N K + + + + + L +S++++ KG V P V++Q CGSCWA S
Sbjct: 58 LGLK-VDHDLLNNKKQSQQDFEY--RDFVDLPKSVDWRKKGAVTP-VKNQGQCGSCWAFS 113
Query: 198 AVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMN 234
VA +E IK L LS+Q T+ GG+M+
Sbjct: 114 TVAAVEGINQIKTGNLTSLSEQELIDCDTTYNNGCNGGLMD 154
>gi|312377879|gb|EFR24605.1| hypothetical protein AND_10691 [Anopheles darlingi]
Length = 375
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 79/144 (54%), Gaps = 7/144 (4%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++F F ++++ Y SD E E R ++FR NL+ I + + +G T VN AD T+ E
Sbjct: 71 DEFSRFKGKHQKTYASDREHEHRLNVFRQNLRFIHSHNRANRGF-TVAVNHLADRTEDE- 128
Query: 137 NHGLSSLDWEQIENLKSTFETYSFN-SSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ SL + N+ + + + + +++ L +S +++ G V P V+DQ +CGSCW+
Sbjct: 129 ---MKSLRGFRSSNVYNGGQAFPYKPAAHMDDLPDSWDWRISGAVTP-VKDQSVCGSCWS 184
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
+ +E AY K +L+ S+Q
Sbjct: 185 FGTIGHIEGAYFRKTQKLVRFSQQ 208
>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
endopeptidase; AltName: Full=Papaya peptidase B;
AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
Precursor
gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
Length = 348
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 49/141 (34%), Positives = 73/141 (51%), Gaps = 5/141 (3%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F ++ ++ + Y + E RF+IF++NLK ID K G G+N F+D+++ EF
Sbjct: 48 FNSWMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYWL-GLNEFSDLSNDEFKE 106
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
E N E F + + L ES++++ KG V P V+ Q C SCWA S
Sbjct: 107 KYVGSLPEDYTNQPYDEE---FVNEDIVDLPESVDWRAKGAVTP-VKHQGYCESCWAFST 162
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E IK L+ELS+Q
Sbjct: 163 VATVEGINKIKTGNLVELSEQ 183
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 70.9 bits (172), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 54/155 (34%), Positives = 84/155 (54%), Gaps = 21/155 (13%)
Query: 73 LDHGNQFKDFVREYERQY-DSDSEIERRFDIFRNNLKTIDYYT-KHEQGTATY--GVNRF 128
LDH + + + Y +QY + + E+ RRF I+ NLK + + +H G +Y G+N
Sbjct: 33 LDH--HWDLWKKTYGKQYTEENEEVTRRF-IWEKNLKYVMLHNLEHSMGMHSYDLGMNHL 89
Query: 129 ADMTDSEFNHGLSSL----DWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
ADMT E +SSL W++ +F S+ + L +S++++DKG V +V
Sbjct: 90 ADMTSEEVMLLMSSLRVPSQWQR---------NVTFKSNPNQKLPDSMDWRDKGCV-TEV 139
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+ Q CGSCWA SAV LE+ +K +L+ LS Q
Sbjct: 140 KYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQ 174
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 75/146 (51%), Gaps = 8/146 (5%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKST 154
E RRF++F++N++ I + + ++ +NRF DMT EF +S +
Sbjct: 63 EKARRFNVFKDNVRLIHEFNRRDE-PYKLRLNRFGDMTADEFRRAYASSRVSHHRMFRGR 121
Query: 155 FETYS-FNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNEL 213
E S F + + L ++++++KG V V+DQ CGSCWA S +A +E AI+ + L
Sbjct: 122 GERRSGFMYAGARDLPAAVDWREKGAV-GAVKDQGQCGSCWAFSTIAAVEGINAIRTSNL 180
Query: 214 IELSKQ-----PPKTHGRFYKGGVMN 234
LS+Q KT GG+M+
Sbjct: 181 TALSEQQLVDCDTKTGNAGCDGGLMD 206
>gi|157864853|ref|XP_001681135.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|157864857|ref|XP_001681137.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124429|emb|CAJ02285.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124431|emb|CAJ02287.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 80/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|162460343|ref|NP_001105479.1| cysteine protease2 precursor [Zea mays]
gi|1491774|emb|CAA68192.1| cysteine protease [Zea mays]
Length = 360
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 72/142 (50%), Gaps = 6/142 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F F Y + Y+S +E+ +RF IF +L+ + T + + G+NRFADM+ EF
Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRS-TNRKGLSYRLGINRFADMSWEEFR 116
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ +N +T + + L E+ ++++ G V P V++Q CGSCW S
Sbjct: 117 ----ATRLGAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSP-VKNQGHCGSCWTFS 171
Query: 198 AVACLESAYAIKHNELIELSKQ 219
LE+AY + I LS+Q
Sbjct: 172 TTGALEAAYTQATGKPISLSEQ 193
>gi|224113123|ref|XP_002316398.1| predicted protein [Populus trichocarpa]
gi|222865438|gb|EEF02569.1| predicted protein [Populus trichocarpa]
Length = 327
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/143 (31%), Positives = 75/143 (52%), Gaps = 8/143 (5%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNL-KTIDYYTKHEQGTATYGVNRFADMTDSEF 136
+FK F++E+ ++Y + E RF IF NL + +++ TA +GV F D+T+ EF
Sbjct: 13 KFKMFIKEHNKEYATREEYVHRFGIFGKNLIRAVEHQALDP--TAIHGVTPFMDLTEEEF 70
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ + + E S + ++ GL +S ++++KG V V+ Q CGSCWA
Sbjct: 71 ERMYAGV----LGGGTVPVEKGSVSFMDASGLPDSFDWREKGAV-TDVKIQGSCGSCWAF 125
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S +E A I +L+ LS+Q
Sbjct: 126 STTGSVEGANFIATGKLLNLSEQ 148
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 46/143 (32%), Positives = 72/143 (50%), Gaps = 12/143 (8%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+ ++ EY + Y +E E+RF IF++N++ I+ + GVN AD+T EF
Sbjct: 39 EQWMAEYGKVYKDAAEKEKRFLIFKHNVEFIESFNAAANKPYKLGVNHLADLTVEEFKAS 98
Query: 140 LSSLDWEQIENLKSTFE--TYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC-GSCWAH 196
+ LK +E T F N + +I+++ KG V ++DQ C GSCWA
Sbjct: 99 RN--------GLKRPYELSTTPFKYENVTAIPAAIDWRTKGAVT-SIKDQGQCAGSCWAF 149
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
S VA E + I +L+ LS+Q
Sbjct: 150 STVAATEGIHQITTGKLVSLSEQ 172
>gi|194689248|gb|ACF78708.1| unknown [Zea mays]
gi|414885653|tpg|DAA61667.1| TPA: cysteine protease2 [Zea mays]
Length = 360
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 72/142 (50%), Gaps = 6/142 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F F Y + Y+S +E+ +RF IF +L+ + T + + G+NRFADM+ EF
Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRS-TNRKGLSYRLGINRFADMSWEEFR 116
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ +N +T + + L E+ ++++ G V P V++Q CGSCW S
Sbjct: 117 ----ATRLGAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSP-VKNQGHCGSCWTFS 171
Query: 198 AVACLESAYAIKHNELIELSKQ 219
LE+AY + I LS+Q
Sbjct: 172 TTGALEAAYTQATGKPISLSEQ 193
>gi|343477619|emb|CCD11596.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 74/145 (51%), Gaps = 11/145 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT---ATYGVNRFADMTDS 134
QF F ++Y R Y +E RF +F+ +++ K E AT+GV +F+DM+
Sbjct: 40 QFAAFKQKYSRSYKDATEEAFRFRMFKQSMER----AKEEAAANPYATFGVTQFSDMSPE 95
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + LK + + ++ + +I+++ KG V P V+DQ CGSCW
Sbjct: 96 EFRATYLNGAKYYAAALKRPRKVVTVSTGKA---PPAIDWRKKGAVTP-VKDQRKCGSCW 151
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+ +E + + +EL LS+Q
Sbjct: 152 AFSAIGNIEGQWKVAGHELTSLSEQ 176
>gi|328874928|gb|EGG23293.1| hypothetical protein DFA_05425 [Dictyostelium fasciculatum]
Length = 552
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 78/162 (48%), Gaps = 9/162 (5%)
Query: 66 TFDLEEFLDHGNQ--------FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHE 117
TFD ++H + F++F + + +QY+ + RF++++N L I +
Sbjct: 220 TFDPHHIVNHAKEKESNYQVSFEEFKKTHNKQYNHQHQHNHRFNLYKNRLHNIIRHNHKS 279
Query: 118 QGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T G+N F D T E + Q+++ + L S++++
Sbjct: 280 DKTFKMGMNHFGDKTVDELMGMTGQRNGMQLDSELYEKAEIHVPKVDLKDLPASVDWRQS 339
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G V P ++DQ +CGSCWA ++A LES + + +L+ELS+Q
Sbjct: 340 GCVSP-IKDQSICGSCWAFGSIAALESQNCVVNGQLVELSEQ 380
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 45/141 (31%), Positives = 79/141 (56%), Gaps = 4/141 (2%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++++ Y S E RF++F++NLK ID + E + G+N FAD+T EF
Sbjct: 44 FEKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINR-EVTSYWLGLNEFADLTHDEFK- 101
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ L +S+ ++ + + ++ L ++++++ KG V V++Q CGSCWA S
Sbjct: 102 -TTYLGLSPPPARRSSSRSFRYENVAAHDLPKAVDWRKKGAV-TDVKNQGQCGSCWAFST 159
Query: 199 VACLESAYAIKHNELIELSKQ 219
VA +E AI L LS+Q
Sbjct: 160 VAAVEGINAIVTGNLTALSEQ 180
>gi|449139100|gb|AGE89905.1| cathepsin-like cysteine proteinase [Spodoptera littoralis NPV]
Length = 336
Score = 70.9 bits (172), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/165 (29%), Positives = 86/165 (52%), Gaps = 8/165 (4%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F++++ ++Y + + + F F+ NL ++ + A YG+N+F+D+ F +
Sbjct: 33 FENFIKQHNKEYTTPDQRDDAFVNFKRNLVNMNA-MNNISNHAVYGINKFSDIDKITFAN 91
Query: 139 GLSSLDWEQIENLKSTFETYSFN-----SSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
+ L + S F+ Y + S ES +++ KV KV++Q +CGSC
Sbjct: 92 VHAGLVL-TLNATDSNFDPYRLCEFVTVAGPSARTPESFDWRKLHKV-TKVKEQGVCGSC 149
Query: 194 WAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHM 238
WA +A+ +ES YAI H+ LI+LS+Q R +G L H+
Sbjct: 150 WAFAAIGNIESQYAILHDSLIDLSEQQLLDCDRIDQGCDGGLMHL 194
>gi|195624522|gb|ACG34091.1| thiol protease aleurain precursor [Zea mays]
Length = 360
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 72/142 (50%), Gaps = 6/142 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F F Y + Y+S +E+ +RF IF +L+ + T + + G+NRFADM+ EF
Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRS-TNRKGLSYRLGINRFADMSWEEFR 116
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ +N +T + + L E+ ++++ G V P V++Q CGSCW S
Sbjct: 117 ----ATRLGAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSP-VKNQGHCGSCWTFS 171
Query: 198 AVACLESAYAIKHNELIELSKQ 219
LE+AY + I LS+Q
Sbjct: 172 TTGALEAAYTQATGKPISLSEQ 193
>gi|111073719|dbj|BAF02548.1| triticain gamma [Triticum aestivum]
Length = 365
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 46/147 (31%), Positives = 75/147 (51%), Gaps = 11/147 (7%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDS 134
H +F F Y + Y+S +E+ RRF IF +L+ + T + + G+NRF+DM+
Sbjct: 60 HALRFARFAVRYGKSYESAAEVRRRFRIFSESLEEVRS-TNRKGLSYRLGINRFSDMSWE 118
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFN--SSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
EF ++ ++ T + N ++ L E+ ++++ G V P V+DQ CGS
Sbjct: 119 EFQA-------TRLGAAQTCSATLAGNHLMRDAAALPETKDWREDGIVSP-VKDQSHCGS 170
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CW S LE+AY + I LS+Q
Sbjct: 171 CWTFSTTGALEAAYTQATGKNISLSEQ 197
>gi|340710428|ref|XP_003393792.1| PREDICTED: cathepsin O-like [Bombus terrestris]
Length = 355
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 52/153 (33%), Positives = 81/153 (52%), Gaps = 14/153 (9%)
Query: 79 FKDFVREYERQY-DSDSEIERRFDIFRNNLKTIDYYT--KHEQGTATYGVNRFADMTDSE 135
F+++V Y + Y ++ +E E RF FR +L+ I+ + Q +A YG+ F+DM++ E
Sbjct: 36 FQNYVMRYNKSYRNNPTEYEERFKRFRKSLRHIEKMNGLRPSQESAYYGLTEFSDMSEDE 95
Query: 136 FNHGLSSLDWEQIENLKSTFETYS-----FNSSNSYGLAESI----NYKDKGKVLPKVQD 186
F L+ L K E+Y S+N + SI +++DKG + P V+
Sbjct: 96 F-LSLTLLPDLSARGEKHANESYHRRHHLLQSTNRVKKSVSIPLRFDWRDKGVITP-VRS 153
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CG+CWA S + +ES YAIK+ L LS Q
Sbjct: 154 QGSCGACWAFSTIEVVESMYAIKNGTLYMLSVQ 186
>gi|33333714|gb|AAQ11975.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 323
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 46/150 (30%), Positives = 77/150 (51%), Gaps = 21/150 (14%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATYG--VNRFA 129
LDHG ++ V E +RRF +F+ NL I ++ K+E+G ++ V +FA
Sbjct: 28 LDHGKTYRSVVEE-----------KRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
DMT EF LD +++ + + + ++++++ +G V P V++Q
Sbjct: 77 DMTHEEF------LDLLKLQGVPALPSDAVYFEETDIEEKDAVDWRKEGAVTP-VKNQGH 129
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV +E + K+ L+ LS Q
Sbjct: 130 CGSCWAFSAVGAIEGQFFKKNGTLVSLSAQ 159
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 46/127 (36%), Positives = 71/127 (55%), Gaps = 9/127 (7%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLK 152
E +RRF+IF++NL+ ID H + +Y G+ RFAD+T+ E+ E+ + +
Sbjct: 61 EKDRRFEIFKDNLRFID---DHNKKNLSYRLGLTRFADLTNDEYRSKYLGAKMEK-KGER 116
Query: 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE 212
T + Y + L ESI+++ KG V +V+DQ CGSCWA S + +E I +
Sbjct: 117 RTSQRYEARVGDE--LPESIDWRKKGAV-AEVKDQGSCGSCWAFSTIGAVEGINQIVTGD 173
Query: 213 LIELSKQ 219
LI LS+Q
Sbjct: 174 LITLSEQ 180
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 50/162 (30%), Positives = 86/162 (53%), Gaps = 13/162 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
F+ ++ + + Y++ E RF++F++NLK ID K + Y G+N FAD++ EF
Sbjct: 47 FESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKV---VSNYWLGLNEFADLSHQEF 103
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ L + + +S+ E +++ + L +S++++ KG V P V++Q CGSCWA
Sbjct: 104 KNKYLGLKVDLSQRRESSEEEFTYRDVD---LPKSVDWRKKGAVTP-VKNQGQCGSCWAF 159
Query: 197 SAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMN 234
S VA +E I L LS+Q T+ GG+M+
Sbjct: 160 STVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMD 201
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 76/145 (52%), Gaps = 7/145 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIF-RNNLKTIDYYTKHEQGTATY--GVNRFADMTDS 134
Q++ F +++ Y+S E RF IF N+L + K+ +G +Y G+N+F D+
Sbjct: 26 QWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAH 85
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + ++ + + N S+ L +++++ KG V P V+DQ CGSCW
Sbjct: 86 EFAKIFNGYRGQRTSRGSTFMPPANVNDSS---LPSTVDWRKKGAVTP-VKDQGQCGSCW 141
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA LE + +K EL+ LS+Q
Sbjct: 142 AFSATGSLEGQHFLKDGELVSLSEQ 166
>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
gi|255645733|gb|ACU23360.1| unknown [Glycine max]
Length = 362
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 61/223 (27%), Positives = 100/223 (44%), Gaps = 35/223 (15%)
Query: 48 NLILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNL 107
+L + +Q + SEE F L F+ + +E++R+Y + E +RF IF++NL
Sbjct: 24 SLAMSSNQLEQFASEE-EVFQL---------FQAWQKEHKREYGNQEEKAKRFQIFQSNL 73
Query: 108 KTIDYYTKHEQGTAT---YGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSN 164
+ I+ + T G+N+FADM+ EF NL+S + + ++
Sbjct: 74 RYINEMNAKRKSPTTQHRLGLNKFADMSPEEFMKTYLKEIEMPYSNLESRKKLQKGDDAD 133
Query: 165 SYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQP---- 220
L S++++DKG V +V+DQ C S WA S +E I L+ LS Q
Sbjct: 134 CDNLPHSVDWRDKGAV-TEVRDQGKCQSHWAFSVTGAIEGINKIVTGNLVSLSVQQVVDC 192
Query: 221 -PKTHGRFYKGGVMNLPHMLCSKGPY--SLNHAVLNVGYDNES 260
P +HG C+ G Y + + + N G D E+
Sbjct: 193 DPASHG--------------CAGGFYFNAFGYVIENGGIDTEA 221
>gi|402875039|ref|XP_003901328.1| PREDICTED: pro-cathepsin H [Papio anubis]
Length = 335
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 81/162 (50%), Gaps = 17/162 (10%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E S LE+F FK ++ ++ + Y ++ E R F +N + I+ H G
Sbjct: 21 GAAELSVNSLEKF-----HFKSWMSKHHKTYSTE-EYHHRMQTFASNWRKIN---AHNNG 71
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ +N+F+DM+ +E H W + +N +T Y + Y S++++ K
Sbjct: 72 NHTFKMALNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PPSMDWRKK 125
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 167
>gi|403376395|gb|EJY88173.1| Cysteine protease-5 [Oxytricha trifallax]
Length = 401
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 49/154 (31%), Positives = 80/154 (51%), Gaps = 13/154 (8%)
Query: 69 LEEFLDHGNQ--FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVN 126
L+E +H Q F FV EY + Y + + + RFDIF N + I + ++E+ G+N
Sbjct: 60 LQESGNHETQQAFIQFVAEYGKTYATKNHLNSRFDIFAKNFEMIKSHNENEEKHYEMGIN 119
Query: 127 RFADMTDSEF---NHGLSSLDWEQIENLKSTFET-----YSFNSSNSYGLAESINYKDKG 178
+F+DMT EF H L + + L++ + S ++ E +++++ G
Sbjct: 120 KFSDMTHEEFLEHYHKQGVLIPSEEKRLEAHHANRHPSLQAMASDDNQAAPEKVDWREAG 179
Query: 179 KV-LPKVQDQHLCGSCWAHSAVACLESAYAIKHN 211
KV +P DQ CGSCWA + LES +AIK++
Sbjct: 180 KVSVPG--DQSSCGSCWAFTTATTLESLHAIKND 211
>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
Length = 336
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 84/159 (52%), Gaps = 13/159 (8%)
Query: 64 ASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGTAT 122
AS+ D++ DH N +K ++ + Y D E+ RR I+ NL+ I+ + ++ G T
Sbjct: 17 ASSIDIQ-LDDHWNSWKS---QHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHT 71
Query: 123 Y--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKV 180
+ G+N+F DMT+ EF H ++ + T + F + + + ++++ +G V
Sbjct: 72 FKMGMNQFGDMTNEEFRHAMNGYK----HDPNQTSQGPLFMEPSFFAAPQQVDWRQRGYV 127
Query: 181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P V+DQ CGSCW+ S+ LE K +LI +S+Q
Sbjct: 128 TP-VKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQ 165
>gi|33945877|emb|CAE45588.1| papain-like cysteine proteinase-like protein 1 [Lotus japonicus]
Length = 359
Score = 70.5 bits (171), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 82/150 (54%), Gaps = 7/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+E L + F +F R + + Y ++ E RF++F++N+ + + +A +GV +F+
Sbjct: 36 DEGLGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-SAVHGVTQFS 94
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T EF H + L + L S ++ +++ L + ++++ G V P V++Q
Sbjct: 95 DLTPMEFQHSVLGL---RGVGLPSDADSAPILPTDN--LPKDFDWREHGAVTP-VKNQGS 148
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ SA LE A+ + EL+ LS+Q
Sbjct: 149 CGSCWSFSATGALEGAHFLSTGELVSLSEQ 178
>gi|356565778|ref|XP_003551114.1| PREDICTED: thiol protease aleurain-like [Glycine max]
Length = 353
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 54/158 (34%), Positives = 77/158 (48%), Gaps = 10/158 (6%)
Query: 63 EASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTAT 122
E+ D+ H F F R + ++Y S EI RF IF +NLK I T T T
Sbjct: 38 ESQVLDVIGQSRHALSFARFARRHGKRYRSVDEIRNRFRIFSDNLKLIRS-TNRRSLTYT 96
Query: 123 YGVNRFADMTDSEFN-HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVL 181
GVN FAD T EF H L + +N +T + + L + +++ +G ++
Sbjct: 97 LGVNHFADWTWEEFTRHKLGAP-----QNCSATLK--GNHRLTDAVLPDEKDWRKEG-IV 148
Query: 182 PKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+V+DQ CGSCW S LE+AYA + I LS+Q
Sbjct: 149 SQVKDQGNCGSCWTFSTTGALEAAYAQAFGKNISLSEQ 186
>gi|381145561|gb|AFF59216.1| cathepsin L-like cysteine protease [Philasterides dicentrarchi]
Length = 345
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 46/144 (31%), Positives = 75/144 (52%), Gaps = 7/144 (4%)
Query: 79 FKDFVR---EYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSE 135
+K +V+ E+ + ++ +E E RF++F++N I + + G+ G+N FA M + E
Sbjct: 39 YKTYVQWKSEFNQNFNG-AEDEYRFNVFQSNYNYIQQFNSEQTGSLRLGMNVFAAMENVE 97
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWA 195
+ S L + SF++ N+ L SI++ KG V P V++Q CGSCWA
Sbjct: 98 YIAKFVSGITHHSTELN--IQEVSFDNVNAGDLPTSIDWVKKGAVAP-VENQGQCGSCWA 154
Query: 196 HSAVACLESAYAIKHNELIELSKQ 219
S LE YAI+ ++ LS Q
Sbjct: 155 FSTKEGLEGVYAIQSGSMVVLSAQ 178
>gi|294874412|ref|XP_002766943.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239868318|gb|EEQ99660.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 366
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 78/153 (50%), Gaps = 13/153 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-N 137
F DF ++ ++Y+S E +R IF+ NL I+ + + T GVN +AD+T EF
Sbjct: 28 FTDFQHKFGKKYESKEEEMKRNAIFQANLHHIEQ-VNAQNLSYTLGVNEYADLTHEEFVA 86
Query: 138 HGLSSLDWEQIENLKSTFETYS----------FNSSNSYGLAESINYKDK-GKVLPKVQD 186
+ L + ++K E + F S N+ L+ ++++D G VL +++
Sbjct: 87 QKVGILKMDARRDVKFDVEGRTSCISHARLSLFVSDNATELSAGVDWRDATGDVLTPIKN 146
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA S+ LES YAI +L S+Q
Sbjct: 147 QGACGSCWAFSSTGTLESLYAIGTGQLRSFSEQ 179
>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
Length = 336
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 48/159 (30%), Positives = 84/159 (52%), Gaps = 13/159 (8%)
Query: 64 ASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGTAT 122
AS+ D++ DH N +K ++ + Y D E+ RR I+ NL+ I+ + ++ G T
Sbjct: 17 ASSIDIQ-LDDHWNSWKS---QHGKSYHEDVEVGRRM-IWEENLRKIEQHNFEYSYGNHT 71
Query: 123 Y--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKV 180
+ G+N+F DMT+ EF H ++ + T + F + + + ++++ +G V
Sbjct: 72 FKMGMNQFGDMTNEEFRHAMNGYK----HDPNQTSQGPLFMEPSFFAAPQQVDWRQRGYV 127
Query: 181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P V+DQ CGSCW+ S+ LE K +LI +S+Q
Sbjct: 128 TP-VKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQ 165
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 74/145 (51%), Gaps = 14/145 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++ + Y S E RF+IF +NLK ID T + + G+N FAD++ EF
Sbjct: 47 FESWMSKHSKTYRSIEEKLHRFEIFLDNLKHIDE-TNKKVSSYWLGLNEFADLSHEEFK- 104
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYG----LAESINYKDKGKVLPKVQDQHLCGSCW 194
+ L+ F + SYG L ES++++ KG V P V++Q CGSCW
Sbjct: 105 -------SKYLGLRVEFPRKRSSRGFSYGDVEDLPESVDWRTKGAVTP-VKNQGSCGSCW 156
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A S VA +E I L LS+Q
Sbjct: 157 AFSTVAAVEGINQIVTGNLTSLSEQ 181
>gi|109082090|ref|XP_001108862.1| PREDICTED: cathepsin H isoform 2 [Macaca mulatta]
Length = 335
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 47/162 (29%), Positives = 81/162 (50%), Gaps = 17/162 (10%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E S LE+F FK ++ ++ + Y ++ E R F +N + I+ H G
Sbjct: 21 GAAELSVNSLEKF-----HFKSWMSKHHKTYSTE-EYHHRMQTFASNWRKIN---AHNNG 71
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ +N+F+DM+ +E H W + +N +T Y + Y S++++ K
Sbjct: 72 NHTFKMALNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PPSMDWRKK 125
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 167
>gi|1706261|sp|Q10717.1|CYSP2_MAIZE RecName: Full=Cysteine proteinase 2; Flags: Precursor
gi|644490|dbj|BAA08245.1| cysteine proteinase [Zea mays]
Length = 360
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 44/142 (30%), Positives = 72/142 (50%), Gaps = 6/142 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
+F F Y + Y+S +E+ +RF IF +L+ + T + + G+NRFADM+ EF
Sbjct: 58 RFARFAVRYGKSYESAAEVHKRFRIFSESLQLVRS-TNRKGLSYRLGINRFADMSWEEFR 116
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ +N +T + + L E+ ++++ G V P V++Q CGSCW S
Sbjct: 117 ----ATRLGAAQNCSATLTGNHRMRAAAVALPETKDWREDGIVSP-VKNQGHCGSCWTFS 171
Query: 198 AVACLESAYAIKHNELIELSKQ 219
LE+AY + I LS+Q
Sbjct: 172 TTGALEAAYTQATGKPISLSEQ 193
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 74/145 (51%), Gaps = 14/145 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++ + Y S E RF+IF +NLK ID T + + G+N FAD++ EF
Sbjct: 47 FESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDE-TNKKVSSYWLGLNEFADLSHEEFK- 104
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYG----LAESINYKDKGKVLPKVQDQHLCGSCW 194
+ L+ F + SYG L ES++++ KG V P V++Q CGSCW
Sbjct: 105 -------SKYLGLRVEFPRKRSSRGFSYGDVEDLPESVDWRTKGAVTP-VKNQGSCGSCW 156
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A S VA +E I L LS+Q
Sbjct: 157 AFSTVAAVEGINQIVTGNLTSLSEQ 181
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 51/159 (32%), Positives = 81/159 (50%), Gaps = 13/159 (8%)
Query: 64 ASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGTAT 122
A T D ++ DH +Q+K + + ++Y + E RR I+ NLK I+ + +H G T
Sbjct: 18 APTLD-QQLNDHWDQWKKW---HSKKYHATEEGWRRV-IWEKNLKKIEMHNLEHSMGIHT 72
Query: 123 Y--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKV 180
Y G+N F DMT EF ++ ++ + + F N + +++++KG V
Sbjct: 73 YRLGMNHFGDMTHEEFRQVMNGFKHKKDRRFRGSL----FMEPNFIEVPNKLDWREKGYV 128
Query: 181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P V+DQ CGSCWA S LE K +L+ LS+Q
Sbjct: 129 TP-VKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQ 166
>gi|454101|gb|AAA82966.1| cathepsin H prepropeptide [Mus musculus]
Length = 333
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 73/141 (51%), Gaps = 8/141 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK +++++++ Y S E R +F NN + I + + T +N+F+DM+ +E H
Sbjct: 33 FKSWMKQHQKTYSS-VEYNHRLQMFANNWRKIQAHNQRNH-TFKMALNQFSDMSFAEIKH 90
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
W + +N +T Y + Y S++++ KG V+ V++Q C SCW S
Sbjct: 91 ---KFLWSEPQNCSATKSNY-LRGTGPY--PSSMDWRKKGNVVSPVKNQGACASCWTFST 144
Query: 199 VACLESAYAIKHNELIELSKQ 219
LESA AI +++ L++Q
Sbjct: 145 TGALESAVAIASGKMLSLAEQ 165
>gi|156367164|ref|XP_001627289.1| predicted protein [Nematostella vectensis]
gi|156214194|gb|EDO35189.1| predicted protein [Nematostella vectensis]
Length = 514
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 46/160 (28%), Positives = 85/160 (53%), Gaps = 18/160 (11%)
Query: 69 LEEFLDHGN-------QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA 121
++EF+ +G ++ + ++ +QYDS+ E+ +R IFR+N++ I + +
Sbjct: 203 MQEFMSYGKVDFAIERMYRKYQGQHNKQYDSEHEVSKRKHIFRHNMRYIRSINR-KNLKY 261
Query: 122 TYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSS--NSYGLAESINYKDKGK 179
N F D+TD E+ D + +++ + + YS S + + ++++D G
Sbjct: 262 KLAPNHFVDLTDGEY-------DQHKGDSIITLYGPYSNMSHVLQRVDVPDELDWRDYGA 314
Query: 180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V P V+ Q +CGSC+A +AV +E AY +K +L ELS Q
Sbjct: 315 VSP-VRGQGICGSCYALAAVGAVEGAYFMKTGKLKELSAQ 353
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 71/146 (48%), Gaps = 14/146 (9%)
Query: 82 FVREYERQYDSDSEIERRFDIFRNNLKTIDYYT----KHEQGTATYGVNRFADMTDSEFN 137
++ ++ + Y + E RR ++FR N K ID + K G NRFAD+TD EF
Sbjct: 45 WMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDEFR 104
Query: 138 HGLSSLDWEQIENLKST----FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
+ + +E +S ++ +S++++ G V V+DQ CG C
Sbjct: 105 AARTGYQRPPAAVAGAGGGFLYENFSLAAA-----PQSMDWRAMGAV-TGVKDQGSCGCC 158
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SAVA +E I+ +L+ LS+Q
Sbjct: 159 WAFSAVAAVEGLAKIRTGQLVSLSEQ 184
>gi|26245873|gb|AAN77412.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 170
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 81/153 (52%), Gaps = 15/153 (9%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFA 129
L +Q+ F + + + Y S E RF IF++NL+ I+ + K+++G +Y GV FA
Sbjct: 17 LTDKDQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIE---NLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
D+T EF L QI+ N+++T + + +SI++ KG VL V+
Sbjct: 77 DLTHDEFKDKLR----RQIKTKPNVEATLAVFP----EGLEVPDSIDWTQKGAVL-GVKY 127
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA SA LE AI +N I LS+Q
Sbjct: 128 QGGCGSCWAFSATGALEGQNAIVNNVKIPLSEQ 160
>gi|378943048|gb|AFC76265.1| cathepsin L-like protease [Leishmania major]
Length = 348
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 79/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E +RR F NL+ + + + A +G+ +F D++++ F
Sbjct: 38 FEEFKRTYQRAYGTLTEEQRRLANFERNLELMREH-QARNPHARFGITKFFDLSEAVFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|403223173|dbj|BAM41304.1| cysteine protease precursor TacP [Theileria orientalis strain
Shintoku]
Length = 463
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 55/174 (31%), Positives = 85/174 (48%), Gaps = 31/174 (17%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ F +Y + + +D E RF +FRNN + HE T T VN F+D+T+ E N
Sbjct: 145 FEKFKADYNKVHATDDERRERFLVFRNNYLETLTHKGHE--TFTKSVNFFSDLTEEELNR 202
Query: 139 GLSSLDW-------EQIENLKSTFETYSFNSSNSYGLA---------------ESINYKD 176
++ E +E L S+ T N LA ESI+++
Sbjct: 203 LFPKIEVPKESSPSEHLERLMSSRSTDP-NFLAKLALAKGFQSPVKSLDGISGESIDWR- 260
Query: 177 KGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ-----PPKTHG 225
K + KV+DQ +CGSCWA ++V +ES Y I +++++LS+Q K+HG
Sbjct: 261 KANGVTKVKDQGMCGSCWAFASVGSVESLYKIHTDKVLDLSEQELVNCETKSHG 314
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 49/170 (28%), Positives = 78/170 (45%), Gaps = 20/170 (11%)
Query: 54 SQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY 113
S YG E + + ++HG +KD ++E RRF +F+ N +D
Sbjct: 38 SSSTGYGEEAMTARHEKWMVEHGRTYKD-----------EAEKARRFQVFKANAAFVDTS 86
Query: 114 TKHEQGTATY-GVNRFADMTDSEFNH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLA 169
G + +NRFADMT EF G L + + + +S +
Sbjct: 87 NAAAGGKKYHLAINRFADMTHDEFMARYTGFKPLPATGKKMPGFKYANVTLSSEDQ---- 142
Query: 170 ESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++++++ KG V V++Q CG CWA SAVA +E + I EL+ LS+Q
Sbjct: 143 QAVDWRKKGAVT-DVKNQQKCGCCWAFSAVAAIEGMHQINTGELVSLSEQ 191
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 79/149 (53%), Gaps = 6/149 (4%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
E D +++ ++ ++ R+Y + E +R F I+++N++ I+Y + + T N+FAD
Sbjct: 37 EMSDMEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINY-INAQNFSFTLTDNQFAD 95
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
MT+ E+ L + + SF S L S++++ G V P V++Q C
Sbjct: 96 MTNEEYKALYMGLGTSETSRKNQS----SFKRERSKVLPISVDWRKMGAVTP-VRNQGEC 150
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA S VA +E I+ +L+ LS+Q
Sbjct: 151 GSCWAFSTVAAVEGINKIRTGKLVSLSEQ 179
>gi|38683931|gb|AAR27011.1| cysteine protease [Periserrula leucophryna]
Length = 283
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 40/124 (32%), Positives = 65/124 (52%), Gaps = 12/124 (9%)
Query: 99 RFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWE---QIENLKSTF 155
RF IFR N+K I+ +E G A YGV +F+D+ + EF + W+ + + +++
Sbjct: 2 RFKIFRENMKKINTLNDNELGDAEYGVTQFSDLAEEEFRRYYLTPKWDLSHRPDLVRAKI 61
Query: 156 ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIE 215
S +++D V P V++Q +CGSCWA S +E +AI N+L+
Sbjct: 62 PDVD--------PPASFDWRDHNAVTP-VKNQGMCGSCWAFSTTENIEGQWAIHRNKLVS 112
Query: 216 LSKQ 219
LS+Q
Sbjct: 113 LSEQ 116
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 51/151 (33%), Positives = 81/151 (53%), Gaps = 13/151 (8%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGTATY--GVNRFA 129
LDH + + + Y +QY+ +E R I+ NLKT+ + +H G +Y G+N
Sbjct: 24 LDH--HWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYELGMNHLG 81
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTF-ETYSFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
DMT E +SSL + S + ++ SS + L +S+++++KG V +V+ Q
Sbjct: 82 DMTSEEVISSMSSL------RVPSQWPRNVTYKSSPNQKLPDSLDWREKGCV-TEVKYQG 134
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SAV LE+ +K +L+ LS Q
Sbjct: 135 ACGSCWAFSAVGALEAQVKLKTGKLVSLSAQ 165
>gi|229594208|ref|XP_001031647.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|225567000|gb|EAR83984.3| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 331
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 52/145 (35%), Positives = 72/145 (49%), Gaps = 13/145 (8%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN- 137
F F R + QY ++SE R +F NLK I+ + T VN+FAD+T EF
Sbjct: 36 FLKFKRSFNVQYHNESEESYRLSVFLENLKMIEKHNADSTRTYDQEVNQFADLTIEEFES 95
Query: 138 -HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+ + SL + +NL + N + S + I++ K VLP V++Q CGSCWA
Sbjct: 96 RYLMKSLPSQLNKNLA----VLNLNETAS----QPIDWTTK-NVLPGVKNQQQCGSCWAF 146
Query: 197 SAVACLESAYAI--KHNELIELSKQ 219
S LES Y I K N I S+Q
Sbjct: 147 STAGLLESVYNIHNKPNTPISFSEQ 171
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 45/149 (30%), Positives = 79/149 (53%), Gaps = 6/149 (4%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
E D +++ ++ ++ R+Y + E +R F I+++N++ I+Y + + T N+FAD
Sbjct: 33 EMSDMEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINY-INAQNFSFTLTDNQFAD 91
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
MT+ E+ L + + SF S L S++++ G V P V++Q C
Sbjct: 92 MTNEEYKALYMGLGTSETSRKNQS----SFKRERSKVLPISVDWRKMGAVTP-VRNQGEC 146
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA S VA +E I+ +L+ LS+Q
Sbjct: 147 GSCWAFSTVAAVEGINKIRTGKLVSLSEQ 175
>gi|332326581|gb|AEE42614.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 77/141 (54%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ + L LS+Q
Sbjct: 156 VGNIESQWAVAGHRLTALSEQ 176
>gi|357158628|ref|XP_003578189.1| PREDICTED: thiol protease aleurain-like [Brachypodium distachyon]
Length = 363
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 45/146 (30%), Positives = 74/146 (50%), Gaps = 9/146 (6%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA-TYGVNRFADMTD 133
H +F F Y + Y+S +E++RRF IF +L+ + + +++G + G+NR++DM+
Sbjct: 58 HALRFARFAVRYGKSYESAAEVQRRFRIFSESLEEVR--STNQKGLSYRLGINRYSDMSW 115
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF L+ N+ L E+ ++++ G V P V+DQ CGSC
Sbjct: 116 EEFQASRLGAAQTCSATLRGNHRMQDANA-----LPETKDWREDGIVSP-VKDQSHCGSC 169
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
W S LE+AY + I LS+Q
Sbjct: 170 WTFSTTGALEAAYTQATGKNISLSEQ 195
>gi|157864847|ref|XP_001681132.1| cathepsin L-like protease [Leishmania major strain Friedlin]
gi|68124426|emb|CAJ02282.1| cathepsin L-like protease [Leishmania major strain Friedlin]
Length = 443
Score = 70.5 bits (171), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 81/149 (54%), Gaps = 18/149 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFA- 95
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSY--------GLAESINYKDKGKVLPKVQDQHLC 190
+ N + F ++ Y + +++++++KG V P V++Q C
Sbjct: 96 -------ARYLNGAAYFAAVKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKNQGAC 147
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA SAV +ES +A+ ++L+ LS+Q
Sbjct: 148 GSCWAFSAVGNIESQWAVAGHKLVRLSEQ 176
>gi|394331735|gb|AFN27090.1| cysteine protease [Leishmania major]
Length = 348
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 79/141 (56%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y+R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYQRAYGTLTEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + ++++++ KG V P V++Q CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTP-VKNQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ ++L+ LS+Q
Sbjct: 156 VGNIESQWAVAGHKLVRLSEQ 176
>gi|391340505|ref|XP_003744580.1| PREDICTED: digestive cysteine proteinase 1-like [Metaseiulus
occidentalis]
Length = 469
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 77/146 (52%), Gaps = 11/146 (7%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEF 136
F+ F + + Y+ D R+ IF+ NL I+ + + + Y G+ +FADM+ +EF
Sbjct: 166 FEHFKEHFGKTYEGDEHALRQ-GIFQRNLAHIEKFNAEKAASRGYTLGITQFADMSTAEF 224
Query: 137 NH---GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
GL ++ I L+ + + L E+++++DKG V P V+DQ CGSC
Sbjct: 225 RQTYLGLR-MNASTIAKLRKLQREVVADDRD---LPEAVDWRDKGAVSP-VKDQGQCGSC 279
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA S +E + +K+ EL+ LS+Q
Sbjct: 280 WAFSTSGAIEGQHFLKNGELLSLSEQ 305
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 52/179 (29%), Positives = 90/179 (50%), Gaps = 15/179 (8%)
Query: 61 SEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQG 119
S ++S +D + L +F+ +++ + + Y E RF I+++N++ IDY + H
Sbjct: 27 SVDSSVYDPHKTLKQ--RFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPF 84
Query: 120 TATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGK 179
T NRFADMT+SEF L+ + K + + ++++++ +G
Sbjct: 85 KLTD--NRFADMTNSEFKAHFLGLNTSSLRLHKKQRPV----CDPAGNVPDAVDWRTQGA 138
Query: 180 VLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQP-----PKTHGRFYKGGVM 233
V P +++Q CG CWA SAVA +E IK L+ LS+Q T+ + GG+M
Sbjct: 139 VTP-IRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLM 196
>gi|156355026|ref|XP_001623478.1| predicted protein [Nematostella vectensis]
gi|156210181|gb|EDO31378.1| predicted protein [Nematostella vectensis]
Length = 306
Score = 70.5 bits (171), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 77/143 (53%), Gaps = 4/143 (2%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
+ F +F +++++ Y+ DSE RR IFR+N++ I + N FAD+TD EF
Sbjct: 3 DDFDEFRQQHDKMYEDDSEHCRRKHIFRHNVRYIRSMNRRSL-PHKLEPNHFADLTDDEF 61
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+LD E + + + +S + + + ++++D G V P + Q CGSCWA
Sbjct: 62 KSYKGALDDESKDVMNDHDDIK--HSKRMFEVPDQLDWRDYGAVNP-AKGQGTCGSCWAF 118
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
+ +E+A+ I+ EL+ L++Q
Sbjct: 119 ATAGAVEAAHFIQKGELLNLAEQ 141
>gi|6042196|ref|NP_003784.2| cathepsin F precursor [Homo sapiens]
gi|12643325|sp|Q9UBX1.1|CATF_HUMAN RecName: Full=Cathepsin F; Short=CATSF; Flags: Precursor
gi|4731642|gb|AAD26616.2|AF088886_1 cathepsin F precursor [Homo sapiens]
gi|5305722|gb|AAD41790.1|AF132894_1 cathepsin F [Homo sapiens]
gi|4826528|emb|CAB42883.1| cysteine proteinase [Homo sapiens]
gi|15079738|gb|AAH11682.1| Cathepsin F [Homo sapiens]
gi|22209085|gb|AAH36451.1| Cathepsin F [Homo sapiens]
gi|61363874|gb|AAX42458.1| cathepsin F [synthetic construct]
gi|123993139|gb|ABM84171.1| cathepsin F [synthetic construct]
gi|189053904|dbj|BAG36411.1| unnamed protein product [Homo sapiens]
Length = 484
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 64/133 (48%), Gaps = 16/133 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
FK+FV Y R Y+S E R +F NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
Query: 137 ---NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
N L +++ KS + +++ KG V KV+DQ +CGSC
Sbjct: 247 IYLNTLLRKEPGNKMKQAKSVGDLAP----------PEWDWRSKGAVT-KVKDQGMCGSC 295
Query: 194 WAHSAVACLESAY 206
WA S +E +
Sbjct: 296 WAFSVTGNVEGQW 308
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 42/118 (35%), Positives = 67/118 (56%), Gaps = 3/118 (2%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R Y ++E ERRF IF+NNL I+ + K T G+N+F+D+++ EF + +
Sbjct: 49 RTYHDNAEKERRFQIFKNNLDYIENFNKAFNKTYKLGLNKFSDLSEEEFVTTYNGYEMPT 108
Query: 148 IENLKSTFETYSFNSS--NSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLE 203
+T +F S+ N + ESI++++ G V+ V++Q CG CWA SAVA +E
Sbjct: 109 TLPTANTTVKPTFFSNYYNQDEVPESIDWRENG-VVTSVKNQGECGCCWAFSAVAAVE 165
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 41/129 (31%), Positives = 69/129 (53%), Gaps = 3/129 (2%)
Query: 92 SDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENL 151
+D + RRF++F+ N+K I K ++ +N+FADMT E H + L
Sbjct: 61 ADHDPARRFNVFKENVKYIHEANKKDR-PFRLALNKFADMTTDELRHSYAGSRVRHHRAL 119
Query: 152 KSTFETY-SFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKH 210
+F S++ L ++++++KG V ++DQ CGSCWA S +A +ES I+
Sbjct: 120 SGGRRAQGNFTYSDAENLPPAVDWREKGAVT-GIKDQGQCGSCWAFSTIAAVESINKIRT 178
Query: 211 NELIELSKQ 219
+L+ LS+Q
Sbjct: 179 GKLVSLSEQ 187
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 42/147 (28%), Positives = 72/147 (48%), Gaps = 10/147 (6%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++F + Y R Y + E +RRF ++R N++ I+ + T T G N+FAD+T+ EF
Sbjct: 55 DRFLRWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEEEF 114
Query: 137 NHGLSSLDWEQIENL----KSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
LD ++ + + + N S+ S++++ +G V P C S
Sbjct: 115 ------LDLYTMKGMPPVRRDAGKKQQANFSSVVDAPTSVDWRSRGAVTPIKNQGPSCSS 168
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA A +ES I+ +L+ LS+Q
Sbjct: 169 CWAFVTAATIESITQIRTGKLVSLSEQ 195
>gi|426379977|ref|XP_004056662.1| PREDICTED: pro-cathepsin H [Gorilla gorilla gorilla]
Length = 335
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 81/162 (50%), Gaps = 17/162 (10%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E S LE+F F+ ++ ++ + Y ++ E R F +N + I+ H G
Sbjct: 21 GAAELSVNSLEKFY-----FRSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN---AHNNG 71
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ +N+F+DM+ +E H W + +N +T Y + Y S++++ K
Sbjct: 72 NHTFKMALNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PPSVDWRKK 125
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 167
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 66/135 (48%), Gaps = 4/135 (2%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R Y + E RF IF+ N+K I+ K + G+N FAD+T EF + L+
Sbjct: 48 RVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPN 107
Query: 148 IENLKSTFETYSFNSSNSYG---LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLES 204
S + F N + ++++++ G V +V+ Q CG CWA SAV LE
Sbjct: 108 SYLSPSPMSSTEFKKINDLSDDDMPSNLDWRESGAV-TQVKHQGQCGCCWAFSAVGSLEG 166
Query: 205 AYAIKHNELIELSKQ 219
AY I +L+E S+Q
Sbjct: 167 AYKIATGKLMEFSEQ 181
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 45/128 (35%), Positives = 66/128 (51%), Gaps = 2/128 (1%)
Query: 92 SDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENL 151
S +E +RF++F+ NLK I + ++ +N+FADMT+ EF
Sbjct: 52 SLTEKNQRFNVFKENLKHIHKVNQKDR-PYKLRLNKFADMTNHEFLQHYGGSKVSHYRMF 110
Query: 152 KSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHN 211
+ F N+ L SI+++ +G V V+DQ CGSCWA S+VA +E IK
Sbjct: 111 HGSRRQTGFAHENTSNLPSSIDWRKQGAV-TGVKDQGKCGSCWAFSSVAAVEGINKIKTG 169
Query: 212 ELIELSKQ 219
ELI LS+Q
Sbjct: 170 ELISLSEQ 177
>gi|356509908|ref|XP_003523684.1| PREDICTED: cysteine proteinase RD19a-like [Glycine max]
Length = 366
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 49/172 (28%), Positives = 84/172 (48%), Gaps = 16/172 (9%)
Query: 48 NLILQRSQPNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNL 107
NL++++ P++ + L+ + F F ++ + Y + E + RF IF+NNL
Sbjct: 29 NLLIRQVVPDA---------EDHHLLNAEHHFSAFKTKFAKTYATQEEHDHRFRIFKNNL 79
Query: 108 KTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYG 167
+ K + +A +GV RF+D+T SEF L + L+ + +
Sbjct: 80 LRAKSHQKLDP-SAVHGVTRFSDLTPSEFRGQFLGL-----KPLRLPSDAQKAPILPTSD 133
Query: 168 LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
L +++D G V V++Q CGSCW+ SAV LE A+ + L+ LS+Q
Sbjct: 134 LPTDFDWRDHGAVT-GVKNQGSCGSCWSFSAVGALEGAHFLSTGGLVSLSEQ 184
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 42/133 (31%), Positives = 67/133 (50%), Gaps = 2/133 (1%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R Y + E RF IF+ N+K I+ K + G+N FAD+T EF + ++
Sbjct: 48 RVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGINIPS 107
Query: 148 IENLKSTFET-YSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAY 206
+ T + N + + ++++++ G V +V++Q CG CWA SAV LE AY
Sbjct: 108 YLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAV-TQVKNQGQCGCCWAFSAVGSLEGAY 166
Query: 207 AIKHNELIELSKQ 219
I L+E S+Q
Sbjct: 167 KIATGNLMEFSEQ 179
>gi|165969032|ref|YP_001650932.1| peptidase [Orgyia leucostigma NPV]
gi|164663528|gb|ABY65748.1| peptidase [Orgyia leucostigma NPV]
Length = 328
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 42/149 (28%), Positives = 82/149 (55%), Gaps = 5/149 (3%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
+ L + F+ FV Y++ Y+ D E +R+ IF++NL+ I+ + TA Y +N+F+D
Sbjct: 21 DLLKAPDYFESFVANYQKNYNDDLEKSKRYTIFKDNLEEINVKNRLND-TAVYRINKFSD 79
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
++ +E + L+ + +T + G + +++ + KV +++Q C
Sbjct: 80 LSKTEIISKYTGLNAP--SETTNFCKTIVLDQPPGKG-PLNFDWRQQNKV-TSIKNQGSC 135
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
G+CWA + +A +ES YAI+++ I LS+Q
Sbjct: 136 GACWAFATLASIESQYAIRNDRHINLSEQ 164
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 79/146 (54%), Gaps = 12/146 (8%)
Query: 97 ERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKS 153
E R ++F+ NL+ +D + ++G T+ G+NRFAD+T+ E+ D+ ++ S
Sbjct: 69 EYRLEVFKENLQFVDKHNAAADRGEHTFRLGMNRFADLTNEEYRTRFLR-DFSRLRRSAS 127
Query: 154 TFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNEL 213
+ + L +SI++++KG V+P V++Q CGSCWA S VA +E I +L
Sbjct: 128 GKISSRYRLREGDDLPDSIDWREKGAVVP-VKNQGGCGSCWAFSTVAAVEGINQIVTGDL 186
Query: 214 IELSKQ-----PPKTHGRFYKGGVMN 234
I LS+Q HG +GG MN
Sbjct: 187 ISLSEQQLVDCTTANHG--CRGGWMN 210
>gi|8347420|dbj|BAA96501.1| cysteine protease [Nicotiana tabacum]
Length = 360
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 73/149 (48%), Gaps = 16/149 (10%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDS 134
H F F Y ++Y+S EI++RF++F +NLK I + K + GVN F D+T
Sbjct: 57 HALSFARFAHRYGKRYESVEEIKQRFEVFLDNLKMIRSHNKKGL-SYKLGVNEFTDLTWD 115
Query: 135 EFNH----GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
EF + NLK T + L E+ ++++ G V P V++Q C
Sbjct: 116 EFRRDRLGAAQNCSATTKGNLKVT----------NVVLPETKDWREAGIVSP-VKNQGKC 164
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCW S LE+AY+ + I LS+Q
Sbjct: 165 GSCWTFSTTGALEAAYSQAFGKGISLSEQ 193
>gi|449521046|ref|XP_004167542.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like [Cucumis
sativus]
Length = 297
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 43/125 (34%), Positives = 70/125 (56%), Gaps = 4/125 (3%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKST 154
E+ +RF IF++N K + + H + +N+FAD++D EF+ S + L +
Sbjct: 56 EMHKRFKIFQDNAKHV-FRVNHMGKSLKLRLNQFADLSDDEFSMMYGS-NITHYNGLHAN 113
Query: 155 FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELI 214
F + + SI+++ KG V +++Q CGSCWA +AVA +ES + IK NEL+
Sbjct: 114 -RVGEFMYERAMNIPSSIDWRQKGAV-NAIKNQGHCGSCWAFAAVAAVESIHQIKTNELV 171
Query: 215 ELSKQ 219
LS+Q
Sbjct: 172 SLSEQ 176
>gi|294952897|ref|XP_002787504.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|294952899|ref|XP_002787505.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239902506|gb|EER19300.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
gi|239902507|gb|EER19301.1| cathepsin L, putative [Perkinsus marinus ATCC 50983]
Length = 289
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 74/148 (50%), Gaps = 13/148 (8%)
Query: 76 GNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTD 133
G F F ++Y + Y+S E +R IF++NL DY K +Y VN FAD+T
Sbjct: 24 GLAFLGFQKKYGKSYESGEEEIKRAAIFQDNL---DYIQKVNAQNLSYKLAVNEFADLTF 80
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSF--NSSNSYGLAESINYKDKGKVLPKVQDQHLCG 191
EF +++ IE E F S L S++++DK VL V+DQ CG
Sbjct: 81 EEF----AAVRLSSIET-HGEMERDGFVDESGTDPTLPSSVDWRDK-NVLTPVKDQGNCG 134
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
SCWA + LE+ +AI EL+ S+Q
Sbjct: 135 SCWAFAVTGALEAKHAIATGELLSFSEQ 162
>gi|260830531|ref|XP_002610214.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
gi|229295578|gb|EEN66224.1| hypothetical protein BRAFLDRAFT_216923 [Branchiostoma floridae]
Length = 274
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 40/121 (33%), Positives = 63/121 (52%), Gaps = 7/121 (5%)
Query: 99 RFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETY 158
R+ +F++NLK + E+GTA YGV +F D+T+ EF + W K+ +
Sbjct: 1 RYFVFQDNLKKAETLQDSERGTAKYGVTKFMDLTEEEFRRYYLTPVW------KAPAKPL 54
Query: 159 SFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSK 218
+ + +++D G V +V+DQ CGSCWA S +E +AIK L +LS+
Sbjct: 55 PPATIPKKDAPTAFDWRDHGAVT-EVKDQGQCGSCWAFSTTGNIEGQWAIKKGNLPDLSE 113
Query: 219 Q 219
Q
Sbjct: 114 Q 114
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 70.1 bits (170), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 71/127 (55%), Gaps = 9/127 (7%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLK 152
E +RRF+IF++NL+ +D +H + +Y G+ RFAD+T+ E+ E+ +
Sbjct: 68 EKDRRFEIFKDNLRFVD---EHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERR 124
Query: 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE 212
++ + + L ESI+++ KG V +V+DQ CGSCWA S + +E I +
Sbjct: 125 TSLR---YEARVGDELPESIDWRKKGAV-AEVKDQGGCGSCWAFSTIGAVEGINQIVTGD 180
Query: 213 LIELSKQ 219
LI LS+Q
Sbjct: 181 LITLSEQ 187
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 67/134 (50%), Gaps = 3/134 (2%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R Y + E RF IF+ N+K I+ K + G+N FAD+T EF + L+
Sbjct: 48 RVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPN 107
Query: 148 IENLKSTFETYSF--NSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESA 205
S + F N + + ++++++ G V +V++Q CG CWA SAV LE A
Sbjct: 108 SYVSPSPMSSTEFKINDLSDDDMPSNLDWRESGAV-TQVKNQGQCGCCWAFSAVGSLEGA 166
Query: 206 YAIKHNELIELSKQ 219
Y I L+E S+Q
Sbjct: 167 YKIATGNLMEFSEQ 180
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 71/127 (55%), Gaps = 9/127 (7%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLK 152
E +RRF+IF++NL+ +D +H + +Y G+ RFAD+T+ E+ E+ +
Sbjct: 68 EKDRRFEIFKDNLRFVD---EHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERR 124
Query: 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE 212
++ + + L ESI+++ KG V +V+DQ CGSCWA S + +E I +
Sbjct: 125 TSLR---YEARVGDELPESIDWRKKGAV-AEVKDQGGCGSCWAFSTIGAVEGINQIVTGD 180
Query: 213 LIELSKQ 219
LI LS+Q
Sbjct: 181 LITLSEQ 187
>gi|54696066|gb|AAV38405.1| cathepsin F [synthetic construct]
Length = 485
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 64/133 (48%), Gaps = 16/133 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
FK+FV Y R Y+S E R +F NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 187 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 246
Query: 137 ---NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
N L +++ KS + +++ KG V KV+DQ +CGSC
Sbjct: 247 IYLNTLLRKEPGNKMKQAKSVGDLAP----------PEWDWRSKGAVT-KVKDQGMCGSC 295
Query: 194 WAHSAVACLESAY 206
WA S +E +
Sbjct: 296 WAFSVTGNVEGQW 308
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 43/127 (33%), Positives = 71/127 (55%), Gaps = 9/127 (7%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLK 152
E +RRF+IF++NL+ +D +H + +Y G+ RFAD+T+ E+ E+ +
Sbjct: 68 EKDRRFEIFKDNLRFVD---EHNEKNLSYRLGLTRFADLTNDEYRSKYLGAKMEKKGERR 124
Query: 153 STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNE 212
++ + + L ESI+++ KG V +V+DQ CGSCWA S + +E I +
Sbjct: 125 TSLR---YEARVGDELPESIDWRKKGAV-AEVKDQGGCGSCWAFSTIGAVEGINQIVTGD 180
Query: 213 LIELSKQ 219
LI LS+Q
Sbjct: 181 LITLSEQ 187
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 83/151 (54%), Gaps = 7/151 (4%)
Query: 72 FLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLK-TIDYYTKHEQGTATY--GVNRF 128
F+D + + E+ ++Y SD E R I++ NL I + K++ G TY G+N+F
Sbjct: 21 FIDFDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIVIKHNLKYDLGHFTYDLGMNQF 80
Query: 129 ADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
AD+ + EF +S ++ + + K+T + SN + + ++++ KG V P V++Q
Sbjct: 81 ADLKNEEF---VSLMNGFRGNSSKATRGSTFLPPSNVFDMPTMVDWRTKGYVTP-VKNQL 136
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA SA LE + K +L+ LS+Q
Sbjct: 137 QCGSCWAFSATGSLEGQHFKKTGKLVSLSEQ 167
>gi|42564161|gb|AAS20592.1| digestive cysteine proteinase intestain [Leptinotarsa decemlineata]
Length = 326
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 54/153 (35%), Positives = 81/153 (52%), Gaps = 15/153 (9%)
Query: 73 LDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFA 129
L +Q+ F + + + Y S E RF IF++NL+ I+ + K+++G +Y GV FA
Sbjct: 17 LTDKDQWVAFKQTHGKTYKSLLEERTRFGIFQSNLRKIEEHNAKYDKGEESYFLGVTPFA 76
Query: 130 DMTDSEFNHGLSSLDWEQIE---NLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
D+T EF L QI+ N+++T + + +SI++ KG VL V+
Sbjct: 77 DLTHDEFKDKLR----RQIKTKPNVEATLAVFP----EGLEVPDSIDWTQKGAVL-DVKY 127
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCWA SA LE AI +N I LS+Q
Sbjct: 128 QGGCGSCWAFSATGALEGQNAIVNNVKIPLSEQ 160
>gi|394331824|gb|AFN27131.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + ++++++ KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ + L LS+Q
Sbjct: 156 VGSIESQWALAGHRLTALSEQ 176
>gi|394331820|gb|AFN27129.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + ++++++ KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ + L LS+Q
Sbjct: 156 VGSIESQWALAGHRLTALSEQ 176
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 74/149 (49%), Gaps = 18/149 (12%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTID-YYTKHEQGTATYGV--NRFADMTDS 134
Q+++F E+ R+Y S E R +F N + ID + + E G T+ + N+F DMT
Sbjct: 22 QWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSE 81
Query: 135 E----FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
E N L + LK+ ET L E ++++ KG V P V+DQ C
Sbjct: 82 EIVATMNGFLGAPTRRPAAVLKADDET----------LPEKVDWRTKGAVTP-VKDQKQC 130
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA S LE + +K +L+ LS+Q
Sbjct: 131 GSCWAFSTTGSLEGQHFLKDGKLVSLSEQ 159
>gi|156354961|ref|XP_001623447.1| predicted protein [Nematostella vectensis]
gi|156210147|gb|EDO31347.1| predicted protein [Nematostella vectensis]
Length = 294
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 77/143 (53%), Gaps = 4/143 (2%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
+ F +F +++++ Y+ DSE RR IFR+N++ I + N FAD+TD EF
Sbjct: 3 DDFDEFRQQHDKMYEDDSEHCRRKHIFRHNVRYIRSMNRRSL-PHKLEPNHFADLTDDEF 61
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
+LD E + + + +S + + + ++++D G V P + Q CGSCWA
Sbjct: 62 KSYKGALDDESKDVMNDHDDIK--HSKRMFEVPDQLDWRDYGAVNP-AKGQGTCGSCWAF 118
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
+ +E+A+ I+ EL+ L++Q
Sbjct: 119 ATAGAVEAAHFIQKGELLNLAEQ 141
>gi|334311632|ref|XP_001373241.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 328
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 48/137 (35%), Positives = 73/137 (53%), Gaps = 10/137 (7%)
Query: 86 YERQYDSDSEIERRFDIFRNNLKTI-DYYTKHEQGTATY--GVNRFADMTDSEFNHGLSS 142
Y + Y E RR ++ NLK I D+ ++G +Y G+N+F DMTD EF L+
Sbjct: 36 YGKNYSEKEESFRR-QVWEKNLKLINDHNRLFKEGKKSYFMGMNQFGDMTDKEFESRLNL 94
Query: 143 LDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACL 202
+I ++ T Y+F Y L +S++++ G V P +++Q CG+CWA S + L
Sbjct: 95 ----RIAPVR-TRRNYTFKRRIYYRLPKSVDWRTHGYVTP-IRNQGECGACWAFSTIGSL 148
Query: 203 ESAYAIKHNELIELSKQ 219
E K L+ELSKQ
Sbjct: 149 EGQLFRKTGRLVELSKQ 165
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 67/134 (50%), Gaps = 3/134 (2%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R Y + E RF IF+ N+K I+ K + G+N FAD+T EF + L+
Sbjct: 48 RVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPN 107
Query: 148 IENLKSTFETYSF--NSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESA 205
S + F N + + ++++++ G V +V++Q CG CWA SAV LE A
Sbjct: 108 SYLSPSPMPSTEFKINDLSDDDMPSNLDWRESGAV-TQVKNQGQCGCCWAFSAVGSLEGA 166
Query: 206 YAIKHNELIELSKQ 219
Y I L+E S+Q
Sbjct: 167 YKIATGNLMEFSEQ 180
>gi|164605519|dbj|BAF98585.1| CM0216.510.nc [Lotus japonicus]
Length = 360
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 44/149 (29%), Positives = 81/149 (54%), Gaps = 7/149 (4%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
E L + F +F R + + Y S+ E RF++F++N+ + + +A +GV RF+D
Sbjct: 37 EGLGAEHHFLEFKRRFGKVYVSEEEHGYRFNVFKSNMHRARRHQLLDP-SAVHGVTRFSD 95
Query: 131 MTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
+T EF H + L + L S ++ +++ L + ++++ G V P V++Q C
Sbjct: 96 LTPMEFRHSVLGL---RGVGLPSDADSAPILRTDN--LPKDFDWREHGAVTP-VKNQGSC 149
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
G+CW+ SA LE A+ + +L+ LS+Q
Sbjct: 150 GACWSFSATGALEGAHFLSTGKLVSLSEQ 178
>gi|388491952|gb|AFK34042.1| unknown [Lotus japonicus]
Length = 352
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 46/145 (31%), Positives = 72/145 (49%), Gaps = 8/145 (5%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDS 134
H F F +Y ++YDS EI+ RF IF NL+ I T ++ + G+N FAD++
Sbjct: 49 HAASFARFASKYGKRYDSVEEIQHRFRIFSENLELIKS-TNKKRLSYKLGLNHFADLSWD 107
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF +++ ++ T N + + + K ++ +V+DQ CGSCW
Sbjct: 108 EFRT-------QKLGAAQNCSATLIGNHKLTDAVLSAEKDWRKESIVSEVKDQAHCGSCW 160
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
S LE+AYA H + I LS+Q
Sbjct: 161 TFSTTGALEAAYAQAHGKNISLSEQ 185
>gi|363737841|ref|XP_001232765.2| PREDICTED: pro-cathepsin H [Gallus gallus]
Length = 327
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 42/137 (30%), Positives = 74/137 (54%), Gaps = 12/137 (8%)
Query: 85 EYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY--GVNRFADMTDSEFNHGLSS 142
++ R+Y++ E ERR +F N + I+ H G +++ +N+F+DMT +EF
Sbjct: 33 QHGRRYEA-GEYERRLRVFVGNKRHIE---GHNAGNSSFQMALNQFSDMTFAEFK---KL 85
Query: 143 LDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACL 202
W + +N +T + + E+++++ KG + V++Q CGSCW S CL
Sbjct: 86 YLWSEPQNCSATRGNFLRSDGPC---PEAVDWRKKGNFVTPVKNQGPCGSCWTFSTTGCL 142
Query: 203 ESAYAIKHNELIELSKQ 219
ESA AI +L+ L++Q
Sbjct: 143 ESAIAIATGKLLSLAEQ 159
>gi|332326587|gb|AEE42617.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 77/141 (54%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ + L LS+Q
Sbjct: 156 VGNIESQWAVAGHRLTALSEQ 176
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 50/149 (33%), Positives = 74/149 (49%), Gaps = 18/149 (12%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTID-YYTKHEQGTATYGV--NRFADMTDS 134
Q+++F E+ R+Y S E R +F N + ID + + E G T+ + N+F DMT
Sbjct: 21 QWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTSE 80
Query: 135 E----FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLC 190
E N L + LK+ ET L E ++++ KG V P V+DQ C
Sbjct: 81 EIVATMNGFLGAPTRRPAAVLKADDET----------LPEKVDWRTKGAVTP-VKDQKQC 129
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQ 219
GSCWA S LE + +K +L+ LS+Q
Sbjct: 130 GSCWAFSTTGSLEGQHFLKDGKLVSLSEQ 158
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 44/134 (32%), Positives = 67/134 (50%), Gaps = 3/134 (2%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQ 147
R Y + E RF IF+ N+K I+ K + G+N FAD+T EF + L+
Sbjct: 48 RVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPN 107
Query: 148 IENLKSTFETYSF--NSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESA 205
S + F N + + ++++++ G V +V++Q CG CWA SAV LE A
Sbjct: 108 SYLSPSPMSSTEFKINDISDDDMPSNLDWRESGAV-TQVKNQGQCGCCWAFSAVGSLEGA 166
Query: 206 YAIKHNELIELSKQ 219
Y I L+E S+Q
Sbjct: 167 YKIATGNLMEFSEQ 180
>gi|1134882|emb|CAA92583.1| cysteine protease [Pisum sativum]
Length = 350
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 51/147 (34%), Positives = 74/147 (50%), Gaps = 12/147 (8%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDS 134
H F F Y ++YDS E++ RF IF NL+ I K + GVN FAD T
Sbjct: 47 HAVSFARFANRYGKRYDSVDEMKLRFKIFSENLELIRSSNKRRL-SYKLGVNHFADWTWE 105
Query: 135 EF-NHGLSSLDWEQIENLKSTFE-TYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
EF +H L + +N +T + + +N L + +++ +G ++ V+DQ CGS
Sbjct: 106 EFRSHRLGA-----AQNCSATLKGNHKITDAN---LPDEKDWRKEG-IVSGVKDQGSCGS 156
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CW S LESAYA + I LS+Q
Sbjct: 157 CWTFSTTGALESAYAQAFGKNISLSEQ 183
>gi|23110955|ref|NP_004381.2| pro-cathepsin H preproprotein [Homo sapiens]
gi|288558851|sp|P09668.4|CATH_HUMAN RecName: Full=Pro-cathepsin H; Contains: RecName: Full=Cathepsin H
mini chain; Contains: RecName: Full=Cathepsin H;
Contains: RecName: Full=Cathepsin H heavy chain;
Contains: RecName: Full=Cathepsin H light chain; Flags:
Precursor
gi|119619549|gb|EAW99143.1| cathepsin H [Homo sapiens]
Length = 335
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 80/162 (49%), Gaps = 17/162 (10%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E LE+F FK ++ ++ + Y ++ E R F +N + I+ H G
Sbjct: 21 GAAELCVNSLEKF-----HFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN---AHNNG 71
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ +N+F+DM+ +E H W + +N +T Y + Y S++++ K
Sbjct: 72 NHTFKMALNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PPSVDWRKK 125
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 167
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 42/152 (27%), Positives = 81/152 (53%), Gaps = 4/152 (2%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
+L + L + ++++ +Y R Y +E ++F++F+ N + I+ + G+N+
Sbjct: 26 ELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAEFINSFNAGNH-KFWLGINQ 84
Query: 128 FADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQ 187
FAD+T+ EF ++ + I N + + + + L +I+++ KG V P ++DQ
Sbjct: 85 FADITNEEFKATKTNKGF--ISNKVRVPTGFMYENMSFDALPATIDWRTKGAVTP-IKDQ 141
Query: 188 HLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CG CWA SAVA +E + +L+ LS+Q
Sbjct: 142 GQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQ 173
>gi|308454071|ref|XP_003089699.1| hypothetical protein CRE_27946 [Caenorhabditis remanei]
gi|308269278|gb|EFP13231.1| hypothetical protein CRE_27946 [Caenorhabditis remanei]
Length = 316
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 42/136 (30%), Positives = 72/136 (52%), Gaps = 6/136 (4%)
Query: 75 HGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDS 134
+ N F+DF+ +Y R+Y ++ E+ +RF IF N+ ++ + K + G TY +N F+D++D
Sbjct: 19 YTNAFQDFLVKYLRKYKTEDELVKRFTIFSRNMDLVERFNKEDLGKVTYELNDFSDLSDE 78
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKD-KGKV-LPKVQDQHLCGS 192
E+ L S S + ESI++++ KG + ++ Q CGS
Sbjct: 79 EWKKFLMS----PKPKSPSKSAAKPSALKEKRVIPESIDWRNVKGNNHVTGIKYQGPCGS 134
Query: 193 CWAHSAVACLESAYAI 208
CWA + A +ESA +I
Sbjct: 135 CWAFATAAAIESAVSI 150
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 73/145 (50%), Gaps = 9/145 (6%)
Query: 80 KDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHG 139
+ ++ ++ R YD + E + R + NLK I+ + + GVN F D T EF
Sbjct: 40 QQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFTDWTKEEF--- 96
Query: 140 LSSLDWEQIENLKSTFETY-----SFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
L++ + N+ S FE ++N + S L + +++++G V P V+ Q CG CW
Sbjct: 97 LATYTGLRGVNVTSPFEVVNETKPAWNWTVSDVLGTNKDWRNEGAVTP-VKSQGECGGCW 155
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+A +E I LI LS+Q
Sbjct: 156 AFSAIAAVEGLTKIARGNLISLSEQ 180
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 70.1 bits (170), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 41/143 (28%), Positives = 69/143 (48%), Gaps = 1/143 (0%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF 136
++F + Y R Y + E +RRF ++R N++ I+ + T T G N+FAD+T+ EF
Sbjct: 47 DRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEEEF 106
Query: 137 NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
L ++ + +S+ + S++++ KG V P C SCWA
Sbjct: 107 LD-LYTMKGMPVRRDAGKKRANVSSSAAAVDAPTSVDWRSKGAVTPIKNQGPSCSSCWAF 165
Query: 197 SAVACLESAYAIKHNELIELSKQ 219
A +ES I +L+ LS+Q
Sbjct: 166 VTAATIESITKITTGKLVSLSEQ 188
>gi|394331816|gb|AFN27127.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 41/141 (29%), Positives = 76/141 (53%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYRRAYGTVAEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + ++++++ KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWRKKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ + L LS+Q
Sbjct: 156 VGSIESQWALAGHRLTALSEQ 176
>gi|60827884|gb|AAX36817.1| cathepsin H [synthetic construct]
Length = 336
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 80/162 (49%), Gaps = 17/162 (10%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E LE+F FK ++ ++ + Y ++ E R F +N + I+ H G
Sbjct: 21 GAAELCVNSLEKF-----HFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN---AHNNG 71
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ +N+F+DM+ +E H W + +N +T Y + Y S++++ K
Sbjct: 72 NHTFKMALNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PPSVDWRKK 125
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 167
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/148 (33%), Positives = 74/148 (50%), Gaps = 18/148 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYY-TKHEQGTATY--GVNRFADMTDSE 135
+K F+ Y+R Y SE ERRF IF NN I + + QG +Y G+N F+D TD E
Sbjct: 66 WKHFMTTYKRNYIDPSEHERRFKIFANNFVRISKHNVRFIQGQVSYTMGINEFSDKTDEE 125
Query: 136 FNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAES----INYKDKGKVLPKVQDQHLCG 191
+++ + + S + S +A I++++KG V P V++Q CG
Sbjct: 126 L---------KRLRCFRGSLNA-SRDGSKYITIAAPPPSEIDWRNKGAVTP-VKNQGNCG 174
Query: 192 SCWAHSAVACLESAYAIKHNELIELSKQ 219
SCWA SA +E + L+ LS+Q
Sbjct: 175 SCWAFSATGAIEGQNFLATGNLVSLSEQ 202
>gi|24417396|gb|AAN60308.1| unknown [Arabidopsis thaliana]
Length = 193
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 79/153 (51%), Gaps = 14/153 (9%)
Query: 71 EFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFAD 130
+ L + F F R++ + Y S+ E + RF +F+ NL+ + K + +AT+GV +F+D
Sbjct: 43 QVLTSEDHFSLFKRKFGKVYASNEEHDYRFSVFKANLRRARRHQKLDP-SATHGVTQFSD 101
Query: 131 MTDSEFNH---GL-SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
+T SEF G+ S + N T + L E +++D G V P V++
Sbjct: 102 LTRSEFRKKHLGVRSGFKLPKDANKAPILPTEN--------LPEDFDWRDHGAVTP-VKN 152
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CGSCW+ SA LE A + +L+ LS+Q
Sbjct: 153 QGSCGSCWSFSATGALEGANFLATGKLVSLSEQ 185
>gi|255538808|ref|XP_002510469.1| cysteine protease, putative [Ricinus communis]
gi|223551170|gb|EEF52656.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/155 (29%), Positives = 81/155 (52%), Gaps = 12/155 (7%)
Query: 68 DLEEFLDHG-NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHE--QGTATYG 124
D+E++L + F F ++ + Y + E + RF +F+ NL+ KH+ +A +G
Sbjct: 39 DVEDYLLSAQHHFTAFKAKFGKNYATQEEHDYRFKVFKANLRRAQ---KHQLMDPSAVHG 95
Query: 125 VNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKV 184
V +F+D+T EF + ++ L+ + + + G+ E +++D G V V
Sbjct: 96 VTKFSDLTPREFRR-----QYLGLKKLRLPADAHEAPILPTDGIPEDFDWRDHGAVT-NV 149
Query: 185 QDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++Q CGSCW+ SA LE A+ + EL+ LS+Q
Sbjct: 150 KNQGSCGSCWSFSAAGALEGAHFLATGELVSLSEQ 184
>gi|357438145|ref|XP_003589348.1| Cysteine proteinase [Medicago truncatula]
gi|355478396|gb|AES59599.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/150 (30%), Positives = 78/150 (52%), Gaps = 6/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+ L+ + F F ++ + Y + E + RF +F++NL + K + +A +G+ +F+
Sbjct: 42 DHILNAEHHFTSFKSKFSKNYATKEEHDYRFGVFKSNLIKAKLHQKLDP-SAQHGITKFS 100
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T SEF L+ + L + + +N+ L E ++++KG V P V+DQ
Sbjct: 101 DLTASEFRRQFLGLN--KRLRLPAHAQKAPILPTNN--LPEDFDWREKGAVTP-VKDQGS 155
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCWA S LE A + +L LS+Q
Sbjct: 156 CGSCWAFSTTGALEGANYLATGKLTSLSEQ 185
>gi|118360450|ref|XP_001013459.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89295226|gb|EAR93214.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 320
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/140 (35%), Positives = 76/140 (54%), Gaps = 17/140 (12%)
Query: 82 FVREYERQYDSDSEIER-RFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGL 140
F Y ++Y +D + E+ R ++F NLK ID ++ +G+ +F D+T EF
Sbjct: 46 FKNSYNKKY-ADPDFEQYRIEVFTENLKIIDSNCQN------FGITKFMDLTQEEFKQTY 98
Query: 141 SSLDWEQ-IENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAV 199
+L ++ IE + T FN SN G E I++ KG V P V+DQ CGSCW+ S
Sbjct: 99 LTLKTKKYIEEIPETV----FNDSN--GDIE-IDWTMKGAVTP-VKDQGKCGSCWSFSTT 150
Query: 200 ACLESAYAIKHNELIELSKQ 219
+E A+ + NEL+ LS+Q
Sbjct: 151 GAVEGAHFLSSNELVSLSEQ 170
>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 533
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 78/141 (55%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y++ +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 128 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREH-QARNPHAQFGITKFFDLSEAEFAA 186
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V+DQ CGSCWA SA
Sbjct: 187 RYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTP-VKDQGACGSCWAFSA 245
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +E + + +EL+ LS+Q
Sbjct: 246 VGNIEGQWYLAGHELVSLSEQ 266
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/157 (31%), Positives = 76/157 (48%), Gaps = 16/157 (10%)
Query: 69 LEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTA------T 122
L ++ G F F ++ + Y+S E RRF +F N ID+ +H A T
Sbjct: 20 LSLTVNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQN---IDFINRHNAEAARGVHTHT 76
Query: 123 YGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLP 182
VN+FAD+T+ E+ L E L + + N A S++++ KG V P
Sbjct: 77 VDVNQFADLTNEEYRQ--LYLRPYPTELLGRERQEVWLDGPN----AGSVDWRQKGAVTP 130
Query: 183 KVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
+++Q CGSCW+ S +E A+AI L+ LS+Q
Sbjct: 131 -IKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQ 166
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 50/159 (31%), Positives = 83/159 (52%), Gaps = 13/159 (8%)
Query: 64 ASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYT-KHEQGTAT 122
A T D ++ +H Q+K++ + ++Y E RR ++ NL+ I+ + +H GT T
Sbjct: 18 APTLD-KQLDNHWEQWKNW---HGKKYHEKEEGWRRM-VWEKNLQKIELHNLEHSMGTHT 72
Query: 123 Y--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKV 180
Y G+NRF DMT EF ++ ++ + + F N + S+++++KG V
Sbjct: 73 YRLGMNRFGDMTHEEFRQVMNGYKHKKERRFRGSL----FMEPNFLEVPNSLDWREKGYV 128
Query: 181 LPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
P V+DQ CGSCWA S +E K +L+ LS+Q
Sbjct: 129 TP-VKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQ 166
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 62/200 (31%), Positives = 93/200 (46%), Gaps = 39/200 (19%)
Query: 78 QFKDFVREYERQYDSDSEIE-RRFDIFRNNLK-----TIDYYTKHEQGTATYGVNRFADM 131
QF+ F + R Y S EIE R IFR NL+ IDY+ + T + VN F D+
Sbjct: 32 QFEQFKSTFGRVYPS-PEIELHRKSIFRANLQFILRHNIDYF--NGDSTFSVSVNNFTDL 88
Query: 132 TDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSY-GLAESINYKDKGKVLPKVQDQHLC 190
++ EF + L + S ++ N L ++++ KG V P +++Q C
Sbjct: 89 SNEEFRATFNGY-----RRLAAVSLADSVHADNDVEALPATVDWTTKGVVTP-IKNQQQC 142
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLP-------HMLCSKG 243
GSCWA SAVA +E +A+K +L+ LS+Q NL M CS G
Sbjct: 143 GSCWAFSAVASMEGQHALKTGKLVSLSEQ--------------NLVDCSAAEGDMGCSGG 188
Query: 244 --PYSLNHAVLNVGYDNEST 261
Y+ + + N G D E++
Sbjct: 189 WMDYAFKYVIQNRGIDTEAS 208
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 61/193 (31%), Positives = 93/193 (48%), Gaps = 25/193 (12%)
Query: 78 QFKDFVREYERQYDSDSEIE-RRFDIFRNNLK-----TIDYYTKHEQGTATYGVNRFADM 131
QF+ F + R Y S EIE R IFR NL+ IDY+ + T + VN F D+
Sbjct: 32 QFEQFKSTFGRVYPS-PEIELHRKSIFRANLQFILRHNIDYF--NGDSTFSVSVNNFTDL 88
Query: 132 TDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSY-GLAESINYKDKGKVLPKVQDQHLC 190
++ EF + L + S ++ N L ++++ KG V P +++Q C
Sbjct: 89 SNEEFRATFNGY-----RRLAAVSLADSVHADNDVEALPATVDWTTKGVVTP-IKNQQQC 142
Query: 191 GSCWAHSAVACLESAYAIKHNELIELSKQPPKTHGRFYKGGVMNLPHMLCSKG--PYSLN 248
GSCWA SAVA +E +A+K +L+ LS+Q +G M CS G Y+
Sbjct: 143 GSCWAFSAVASMEGQHALKTGKLVSLSEQ-NLVDCSAAEG------DMGCSGGWMDYAFK 195
Query: 249 HAVLNVGYDNEST 261
+ + N G D E++
Sbjct: 196 YVIQNRGIDTEAS 208
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 70.1 bits (170), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 52/151 (34%), Positives = 84/151 (55%), Gaps = 13/151 (8%)
Query: 73 LDHGNQFKDFVREYERQY-DSDSEIERRFDIFRNNLKTIDYYT-KHEQGTATY--GVNRF 128
LDH + + + Y RQY + + E+ RR I+ NLK++ + ++ G +Y G+N
Sbjct: 32 LDH--HWNLWKKTYGRQYQEKNEEVARRL-IWEKNLKSVMLHNLEYSMGMHSYDLGMNHL 88
Query: 129 ADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQH 188
ADMT E + +SSL ++ + TY NS+ L +S+++++KG V +V+ Q
Sbjct: 89 ADMTSEEVSSLMSSL---RVPSQWQANVTYKSNSNQK--LPDSVDWREKGCV-TEVKYQG 142
Query: 189 LCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CG+CWA SAV LE+ +K L+ LS Q
Sbjct: 143 ACGACWAFSAVGALEAQLKLKTGNLVSLSAQ 173
>gi|355751926|gb|EHH56046.1| Cathepsin F, partial [Macaca fascicularis]
Length = 381
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/131 (34%), Positives = 65/131 (49%), Gaps = 12/131 (9%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK+FV Y R Y+S E R +F NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 84 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 143
Query: 139 G-LSSLDWEQIENLKSTFETYSFNSSNSYG--LAESINYKDKGKVLPKVQDQHLCGSCWA 195
L+ L E+ N + S G +++ KG V KV+DQ +CGSCWA
Sbjct: 144 IYLNPLLREEPGN--------KMKQAKSVGDLAPPEWDWRSKGAVT-KVKDQGMCGSCWA 194
Query: 196 HSAVACLESAY 206
S +E +
Sbjct: 195 FSVTGNVEGQW 205
>gi|343470378|emb|CCD16903.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 445
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 45/145 (31%), Positives = 74/145 (51%), Gaps = 11/145 (7%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT---ATYGVNRFADMTDS 134
QF F ++Y R Y +E RF +F+ +++ K E AT+GV +F+DM+
Sbjct: 40 QFAAFKQKYSRSYKDATEEAFRFRMFKQSMER----AKEEAAANPYATFGVTQFSDMSPE 95
Query: 135 EFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCW 194
EF + LK + + ++ + +I+++ KG V P V+DQ CGSCW
Sbjct: 96 EFRATYLNGAKYYAAALKRPRKVVNVSTGKA---PPAIDWRKKGAVTP-VKDQGKCGSCW 151
Query: 195 AHSAVACLESAYAIKHNELIELSKQ 219
A SA+ +E + + +EL LS+Q
Sbjct: 152 AFSAIGNIEGQWKVAGHELTSLSEQ 176
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 78/147 (53%), Gaps = 15/147 (10%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F+ ++ ++++ Y + E RF+ F++NL ID T + + G+N FAD+T EF
Sbjct: 48 FESWMLKHDKVYKTIDEKIYRFETFKDNLMYIDE-TNKKNNSYWLGLNEFADLTHDEFKE 106
Query: 139 GL------SSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
S+ EQ ++++ F + + ESI+++ KG V P V++Q+ CGS
Sbjct: 107 KYVGSIPEDSMIIEQSDDVE-------FPNKHVVDYPESIDWRQKGAVTP-VKNQNPCGS 158
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CWA S VA +E I LI LS+Q
Sbjct: 159 CWAFSTVATVEGINKIVTGNLISLSEQ 185
>gi|16506813|gb|AAL23961.1|AF426247_1 cathepsin H [Homo sapiens]
Length = 335
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/162 (28%), Positives = 80/162 (49%), Gaps = 17/162 (10%)
Query: 60 GSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQG 119
G+ E LE+F FK ++ ++ + Y ++ E R F +N + I+ H G
Sbjct: 21 GAAELCVNSLEKF-----HFKSWMSKHRKTYSTE-EYHHRLQTFASNWRKIN---AHNNG 71
Query: 120 TATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDK 177
T+ +N+F+DM+ +E H W + +N +T Y + Y S++++ K
Sbjct: 72 NHTFKMALNQFSDMSFAEIKH---KYLWSEPQNCSATKSNY-LRGTGPY--PPSVDWRKK 125
Query: 178 GKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
G + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 126 GNFVSPVKNQGACGSCWTFSTTGALESAIAIATGKMLSLAEQ 167
>gi|332326591|gb|AEE42619.1| cysteine protease [Leishmania aethiopica]
Length = 443
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 78/141 (55%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y + +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYWRAYGTVAEEQQRLANFERNLELMREH-QARNPHARFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V+BQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKQHAGQHYRKARADLSAVPDAVDWREKGAVTP-VKBQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +ES +A+ + L+ LS+Q
Sbjct: 156 VGNIESQWAVAXHGLVRLSEQ 176
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/130 (35%), Positives = 70/130 (53%), Gaps = 9/130 (6%)
Query: 95 EIERRFDIFRNNLKTIDYYTKHEQGTATYGV--NRFADMTDSEFNHGLSS--LDWEQIEN 150
E RRF++FR N++ I + + G A Y + NRF DMT EF +S + ++ +
Sbjct: 62 EKARRFNVFRENVRLIHEFNR---GDAPYKLRLNRFGDMTADEFRRAYASSRVSHHRMFS 118
Query: 151 LKSTFETYSFNSSNSY-GLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIK 209
LK + S+ S + S++++ KG V V+DQ CGSCWA S +A +E AI+
Sbjct: 119 LKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTA-VKDQGQCGSCWAFSTIAAVEGINAIR 177
Query: 210 HNELIELSKQ 219
L LS+Q
Sbjct: 178 SKNLTSLSEQ 187
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/142 (30%), Positives = 78/142 (54%), Gaps = 5/142 (3%)
Query: 80 KDFVREYERQYDSDSE--IERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFN 137
++++ ++ R Y + E +RF++F+ N++ I+ + ++ T +N+FAD+T+ EF
Sbjct: 38 EEWMSQHGRVYADEQEDHKNKRFNVFKENVERIEEF--NDGKTFKLAINQFADLTNEEFR 95
Query: 138 HGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHS 197
+ + + + T T + S L S++++ KG V P V++Q CG CWA S
Sbjct: 96 ASYNGFKGPMVLSSQITKPTPFRYENVSSALPVSVDWRKKGAVTP-VKNQGQCGCCWAFS 154
Query: 198 AVACLESAYAIKHNELIELSKQ 219
AVA +E I +LI LS+Q
Sbjct: 155 AVAAIEGITQISTGKLISLSEQ 176
>gi|417399160|gb|JAA46608.1| Putative pro-cathepsin h [Desmodus rotundus]
Length = 336
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/141 (30%), Positives = 72/141 (51%), Gaps = 8/141 (5%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
FK ++ ++++ Y ++ E R F +N + I + T G+N F+DMT +EF
Sbjct: 36 FKSWMEQHQKTYSAE-EYRHRLQTFASNQRKIKEHNARNH-TFKMGINPFSDMTFAEFKR 93
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
W + +N +T Y Y S++++ KG+ + V++Q CGSCW S
Sbjct: 94 ---RYLWSEPQNCSATKSNY-LRGHGPY--PTSVDWRKKGRFVSPVKNQGGCGSCWTFST 147
Query: 199 VACLESAYAIKHNELIELSKQ 219
LESA AIK +++ LS+Q
Sbjct: 148 TGALESAIAIKTGKMLSLSEQ 168
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 48/156 (30%), Positives = 80/156 (51%), Gaps = 10/156 (6%)
Query: 66 TFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY-- 123
FD E L F+ ++ ++ + Y+S +E ERR IF +NL+ + T +Y
Sbjct: 47 VFDAEATL----MFESWMVKHGKVYESVAEKERRLTIFEDNLR---FITNRNAENLSYRL 99
Query: 124 GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPK 183
G+NRFAD++ E+ D N + + +S+ L +S++++++G V +
Sbjct: 100 GLNRFADLSLHEYAQICHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAV-TE 158
Query: 184 VQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
V+DQ C SCWA S V +E I EL+ LS+Q
Sbjct: 159 VKDQGQCRSCWAFSTVGAVEGLNKIVTGELVTLSEQ 194
>gi|156389068|ref|XP_001634814.1| predicted protein [Nematostella vectensis]
gi|156221901|gb|EDO42751.1| predicted protein [Nematostella vectensis]
Length = 276
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 42/121 (34%), Positives = 64/121 (52%), Gaps = 9/121 (7%)
Query: 100 FDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYS 159
IF +N++ K + GTA YG F+D+++ EF W K +E
Sbjct: 1 MKIFESNMRKAAKMQKMDSGTAQYGPTIFSDLSEEEFRKQKMMPGWG-----KPLYEMK- 54
Query: 160 FNSSNSYG-LAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSK 218
++ G + ES++++DKG V P V++Q CGSCWA S +E YAIK +L+ LS+
Sbjct: 55 -DAEIPLGDIPESVDWRDKGVVTP-VKNQGSCGSCWAFSTTGNIEGQYAIKTGKLVSLSE 112
Query: 219 Q 219
Q
Sbjct: 113 Q 113
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/126 (34%), Positives = 71/126 (56%), Gaps = 8/126 (6%)
Query: 98 RRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNHGLSSLDW----EQIENLKS 153
RRF++F++N+ + K ++ +N+FADMT+ EF + ++ +S
Sbjct: 56 RRFNVFKSNVLHVHETNKMDK-PYKLKLNKFADMTNHEFRSVYAGSKIHHHDRSLQGDRS 114
Query: 154 TFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNEL 213
+T+ + +N + S++++ KG V P V+DQ CGSCWA S VA +E IK NEL
Sbjct: 115 GSKTFMY--ANVESVPTSVDWRKKGAVAP-VKDQGQCGSCWAFSTVAAVEGINKIKTNEL 171
Query: 214 IELSKQ 219
+ LS+Q
Sbjct: 172 VSLSEQ 177
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 56/210 (26%), Positives = 99/210 (47%), Gaps = 18/210 (8%)
Query: 68 DLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNR 127
DLE F++++ +E+ Y++ E RF++F++NLK ID K + + G+N
Sbjct: 40 DLESHDKLIELFENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKKVK-SYWLGLNE 98
Query: 128 FADMTDSEFNHGLSSLDWEQIE-NLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQD 186
FAD++ EF L + + + + ++ +++ + + +S++++ KG V +V++
Sbjct: 99 FADLSHEEFKKMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAV-AEVKN 155
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSK 242
Q CGSCWA S VA +E I L LS+Q T+ GG+M+ K
Sbjct: 156 QGSCGSCWAFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVK 215
Query: 243 G---------PYSLNHAVLNVGYDNESTRT 263
PYS+ + D T T
Sbjct: 216 NGGLRKEEDYPYSMEEGTCEMQKDESETVT 245
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 54/200 (27%), Positives = 98/200 (49%), Gaps = 20/200 (10%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATY-GVNRFADMTDSEFN 137
F++++ +E+ Y++ E RF++F++NLK ID K +G + + G+N FAD++ EF
Sbjct: 51 FENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNK--KGKSYWLGLNEFADLSHEEFK 108
Query: 138 HGLSSLDWEQIE-NLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAH 196
L + + + + ++ +++ + + +S++++ KG V +V++Q CGSCWA
Sbjct: 109 KMYLGLKTDIVRRDEERSYAEFAYRDVEA--VPKSVDWRKKGAV-AEVKNQGSCGSCWAF 165
Query: 197 SAVACLESAYAIKHNELIELSKQP----PKTHGRFYKGGVMNLPHMLCSKG--------- 243
S VA +E I L LS+Q T+ GG+M+ K
Sbjct: 166 STVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDY 225
Query: 244 PYSLNHAVLNVGYDNESTRT 263
PYS+ + D T T
Sbjct: 226 PYSMEEGTCEMQKDESETVT 245
>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 503
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 78/141 (55%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y++ +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 98 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREH-QARNPHAQFGITKFFDLSEAEFAA 156
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V+DQ CGSCWA SA
Sbjct: 157 RYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTP-VKDQGACGSCWAFSA 215
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +E + + +EL+ LS+Q
Sbjct: 216 VGNIEGQWYLAGHELVSLSEQ 236
>gi|25956267|dbj|BAC41322.1| hypothetical protein [Lotus japonicus]
Length = 358
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 81/150 (54%), Gaps = 7/150 (4%)
Query: 70 EEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFA 129
+E L + F +F R + + Y ++ E RF++F++N+ + + +A +GV +F+
Sbjct: 36 DEGLGAEHHFLEFKRRFGKVYATEEEHGYRFNVFKSNMHRARRHQLLDP-SAVHGVTQFS 94
Query: 130 DMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHL 189
D+T EF H + L + L S ++ +++ L + +++ G V P V++Q
Sbjct: 95 DLTPMEFQHSVLGL---RGVGLPSDADSAPILPTDN--LPKDFDWRGHGAVTP-VKNQGS 148
Query: 190 CGSCWAHSAVACLESAYAIKHNELIELSKQ 219
CGSCW+ SA LE A+ + EL+ LS+Q
Sbjct: 149 CGSCWSFSATGALEGAHFLSTGELVSLSEQ 178
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 78/147 (53%), Gaps = 6/147 (4%)
Query: 78 QFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQGTATY--GVNRFADMTDS 134
Q+ F ++ + YDS++E R IF N + + K QG + G+N++ADM
Sbjct: 26 QWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNKYADMLHH 85
Query: 135 EFNHGLSSLDWEQIENLKST--FETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGS 192
EF L+ + + LK + + F S + L ++++++DKG V +V+DQ CGS
Sbjct: 86 EFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAV-TEVKDQGHCGS 144
Query: 193 CWAHSAVACLESAYAIKHNELIELSKQ 219
CW+ SA LE + K +L+ LS+Q
Sbjct: 145 CWSFSATGSLEGQHFRKTGKLVSLSEQ 171
>gi|350415610|ref|XP_003490694.1| PREDICTED: cathepsin O-like [Bombus impatiens]
Length = 355
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 53/153 (34%), Positives = 81/153 (52%), Gaps = 14/153 (9%)
Query: 79 FKDFVREYERQYDSD-SEIERRFDIFRNNLKTIDYYT--KHEQGTATYGVNRFADMTDSE 135
F+++V Y + Y +D +E E RF F +L+ I+ + Q +A YG+ F+DM++ E
Sbjct: 36 FQNYVMRYNKSYRNDPTEYEERFKRFLKSLRHIEKMNGLRPSQESAYYGLTEFSDMSEDE 95
Query: 136 FNHGLSSLDWEQIENLKSTFETYS-----FNSSNSYGLAESI----NYKDKGKVLPKVQD 186
F L+ L K E+Y S+N + SI +++DKG + P V++
Sbjct: 96 F-LSLTLLPDLPARGEKHVNESYHRRHHLLQSTNRVKKSVSIPLRFDWRDKGVITP-VRN 153
Query: 187 QHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
Q CG+CWA S V +ES YAIK+ L LS Q
Sbjct: 154 QGSCGACWAFSTVEVVESMYAIKNGTLHMLSVQ 186
>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 366
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 78/141 (55%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y++ +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREH-QARNPHAQFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +E + + +EL+ LS+Q
Sbjct: 156 VGNIEGQWYLAGHELVSLSEQ 176
>gi|301769893|ref|XP_002920368.1| PREDICTED: cathepsin L1-like [Ailuropoda melanoleuca]
Length = 503
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 46/137 (33%), Positives = 77/137 (56%), Gaps = 15/137 (10%)
Query: 88 RQYDSDSEIERRFDIFRNNLKTIDYYTK-HEQGTATY--GVNRFADMTDSEFNHGLSSLD 144
+ Y+ D E+ RR ++ N+K ID + + + QG ++ +N F D+T+ EF ++ L
Sbjct: 38 KLYNKDEEVWRRA-VWEKNMKMIDQHNEEYSQGKHSFILAMNAFGDLTNEEFKQVMNGL- 95
Query: 145 WEQIENLK--STFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSAVACL 202
+I+N + + F+ F + S S+++++KG V P V+DQ CGSCWA SA L
Sbjct: 96 --KIQNPREGNMFQLLPFAETPS-----SVDWREKGYVTP-VKDQGQCGSCWAFSATGAL 147
Query: 203 ESAYAIKHNELIELSKQ 219
E K +L+ LS+Q
Sbjct: 148 EGQMFRKTGKLVSLSEQ 164
>gi|156046107|gb|ABU42573.1| cathepsin H variant 2 [Sus scrofa]
Length = 321
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 47/166 (28%), Positives = 82/166 (49%), Gaps = 17/166 (10%)
Query: 56 PNSYGSEEASTFDLEEFLDHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTK 115
P + G+ + E+ FK ++ +++++Y S E R +F +N + ID
Sbjct: 17 PPACGASNLAVSSFEKL-----HFKSWMVQHQKKY-SLEEYHHRLQVFVSNWRKID---A 67
Query: 116 HEQGTATY--GVNRFADMTDSEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESIN 173
H G T+ G+N+F+DM+ E H W + +N +T Y + Y S++
Sbjct: 68 HNAGNHTFKLGLNQFSDMSFDEIRH---KYLWSEPQNCSATKGNY-LRGTGPY--PPSMD 121
Query: 174 YKDKGKVLPKVQDQHLCGSCWAHSAVACLESAYAIKHNELIELSKQ 219
++ KG + V++Q CGSCW S LESA AI +++ L++Q
Sbjct: 122 WRKKGNFVSPVKNQGSCGSCWTFSTTGALESAVAIATGKMLSLAEQ 167
>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
cysteine proteinase A-2; Flags: Precursor
gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
Length = 444
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 78/141 (55%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y++ +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREH-QARNPHAQFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +E + + +EL+ LS+Q
Sbjct: 156 VGNIEGQWYLAGHELVSLSEQ 176
>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 332
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 77/141 (54%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y++ +E ++R F NL+ + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNP-HAQFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +E + + +EL+ LS+Q
Sbjct: 156 VGNIEGQWYLAGHELVSLSEQ 176
>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
Length = 443
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 78/141 (55%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y++ +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREH-QARNPHAQFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +E + + +EL+ LS+Q
Sbjct: 156 VGNIEGQWYLAGHELVSLSEQ 176
>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
Length = 443
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 78/141 (55%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y++ +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREH-QARNPHAQFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +E + + +EL+ LS+Q
Sbjct: 156 VGNIEGQWYLAGHELVSLSEQ 176
>gi|397517049|ref|XP_003828732.1| PREDICTED: cathepsin F [Pan paniscus]
Length = 379
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 43/133 (32%), Positives = 64/133 (48%), Gaps = 16/133 (12%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEF-- 136
FK+FV Y R Y+S E R +F NN+ ++GTA YGV +F+D+T+ EF
Sbjct: 82 FKNFVITYNRTYESKEEARWRLSVFVNNMVRAQKIQALDRGTAQYGVTKFSDLTEEEFRT 141
Query: 137 ---NHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
N L +++ KS + +++ KG V KV+DQ +CGSC
Sbjct: 142 IYLNPLLRKEPGNKMKQAKSVGDL----------APPEWDWRSKGAVT-KVKDQGMCGSC 190
Query: 194 WAHSAVACLESAY 206
WA S +E +
Sbjct: 191 WAFSVTGNVEGQW 203
>gi|343476708|emb|CCD12273.1| unnamed protein product [Trypanosoma congolense IL3000]
Length = 363
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 74/146 (50%), Gaps = 11/146 (7%)
Query: 77 NQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGT---ATYGVNRFADMTD 133
QF F ++Y R Y +E RF +F+ N++ K E AT+GV RF+DM+
Sbjct: 39 QQFAAFKQKYSRSYKDATEEAFRFRVFKQNMER----AKEEAAANPYATFGVTRFSDMSP 94
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
EF + LK + + ++ + ++++++ KG V P V+D+ LC S
Sbjct: 95 EEFRATYHNGAEYYAAALKRPRKVVTVSTGKA---PDAVDWRKKGAVTP-VRDERLCDSS 150
Query: 194 WAHSAVACLESAYAIKHNELIELSKQ 219
WA SA+ +E + + +EL LS+Q
Sbjct: 151 WAFSAIGNIEGQWKVAGHELTSLSEQ 176
>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 443
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 78/141 (55%), Gaps = 2/141 (1%)
Query: 79 FKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTDSEFNH 138
F++F R Y R Y++ +E ++R F NL+ + + + A +G+ +F D++++EF
Sbjct: 38 FEEFKRTYGRAYETLAEEQQRLANFERNLELMREH-QARNPHAQFGITKFFDLSEAEFAA 96
Query: 139 GLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSCWAHSA 198
+ + + Y ++ + +++++++KG V P V+DQ CGSCWA SA
Sbjct: 97 RYLNGAAYFAAAKRHAAQHYRKARADLSAVPDAVDWREKGAVTP-VKDQGACGSCWAFSA 155
Query: 199 VACLESAYAIKHNELIELSKQ 219
V +E + + +EL+ LS+Q
Sbjct: 156 VGNIEGQWYLAGHELVSLSEQ 176
>gi|118388480|ref|XP_001027337.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89309107|gb|EAS07095.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 320
Score = 69.7 bits (169), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 75/158 (47%), Gaps = 29/158 (18%)
Query: 74 DHGNQFKDFVREYERQYDSDSEIERRFDIFRNNLKTIDYYTKHEQGTATYGVNRFADMTD 133
D FK F + Y ++Y + RF +F NL+ + + +T+GV +F D+T
Sbjct: 35 DPATLFKQFKQTYNKKYADPTFETYRFGVFTQNLEIV-------KTDSTFGVTQFMDLTP 87
Query: 134 SEFNHGLSSLDWEQIENLKSTFETYSFNSSNSYGLAESINYKDKGKVLPKVQDQHLCGSC 193
+EF +L E + ST E Y + G A +++ KGKV P V++Q CGSC
Sbjct: 88 AEFAQQFLTLH----EKVNST-EVY-----RAQGEATEVDWTAKGKVTP-VKNQGSCGSC 136
Query: 194 WAHSAVACLESAYAI-----------KHNELIELSKQP 220
WA S + +ESA I EL++ +K P
Sbjct: 137 WAFSTIGAVESALLIAGQGEQNTLNLAEQELVDCAKSP 174
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.132 0.409
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 4,364,666,989
Number of Sequences: 23463169
Number of extensions: 180054967
Number of successful extensions: 439338
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 2765
Number of HSP's successfully gapped in prelim test: 1434
Number of HSP's that attempted gapping in prelim test: 431030
Number of HSP's gapped (non-prelim): 6091
length of query: 263
length of database: 8,064,228,071
effective HSP length: 140
effective length of query: 123
effective length of database: 9,074,351,707
effective search space: 1116145259961
effective search space used: 1116145259961
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 75 (33.5 bits)