BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 044448
(308 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/330 (43%), Positives = 194/330 (58%), Gaps = 32/330 (9%)
Query: 2 SRTSHKTGN---IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------- 45
SR + +T N + A+HEQWM R Y D+ EK++RF+IFK N +
Sbjct: 39 SRATSRTLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYT 98
Query: 46 LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
L +NKFADLT ++F AS GYK P H S F+ N S + D +DW + GAVT
Sbjct: 99 LEVNKFADLTNDEFRASRNGYKKQPDSDSHV-VSGLFRYANVSAVP--DEVDWRKEGAVT 155
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAF 161
PVKDQG CCWAF+AVA +EG+NK+ G+LV+ S+ +LVDC GC +ENAF
Sbjct: 156 PVKDQGDCGCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAF 215
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
++I + + LA+E VYPY G +D C+ +++ I G++ V E+ L V+ Q
Sbjct: 216 QFIEKRKGLAAESVYPYTG-EDGICNTKKAAIPA--AKISGHEKVPANNEKALLQAVANQ 272
Query: 222 PVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
PVS+AIDA+ F FY GGVFTG CG +H +T VGYG T + YWL+KN WG +
Sbjct: 273 PVSIAIDASGYEFQFYSGGVFTGSCGTELDHAITAVGYGATMDG---TKYWLMKNSWGAS 329
Query: 280 WDEGGSMRIFR-GVGGSGLCNIAANAAYPL 308
W E G +RI R + GLC IA + +YP+
Sbjct: 330 WGENGYIRIKRDSLAKEGLCGIAMDPSYPV 359
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 144/326 (44%), Positives = 193/326 (59%), Gaps = 32/326 (9%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
SR+ H + +HE WMV++ R YKD +EKE RF+IF+ N EF L +
Sbjct: 26 SRSLHDAA-MNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEFIESFNKPGNRPYKLDI 84
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N+FADLT E+F AS GYK S + N+ + S +DW ++GAVTP+K
Sbjct: 85 NEFADLTNEEFKASRNGYKRSSNVGLSEKSSFRYGNVTAVPTS----MDWRQKGAVTPIK 140
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYI 164
DQG CCWAF+AVA +EG+ K+ TG+L++ S+ +LVDC T GC +++AFE+I
Sbjct: 141 DQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 200
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+Q L +E YPYQG D C+ + A I GY+ V +E+ L V+ QPVS
Sbjct: 201 KQNGGLTTEANYPYQG-TDGTCN--TNKAGNDAAKITGYEDVPANSEDALLKAVASQPVS 257
Query: 225 VAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAIDA + F FY GGVFTG CG +HGVT VGYGT+ + YWLVKN WGT+W E
Sbjct: 258 VAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDGTK----YWLVKNSWGTSWGE 313
Query: 283 GGSMRIFRGV-GGSGLCNIAANAAYP 307
G +R+ R + GLC IA ++YP
Sbjct: 314 DGYIRMERDIEAKEGLCGIAMQSSYP 339
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 254 bits (648), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 144/326 (44%), Positives = 192/326 (58%), Gaps = 27/326 (8%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
M R H+T + +HE WM E+ + YKD AEKE RF+IFK N EF L
Sbjct: 25 MPRKLHQTA-LRERHENWMAEYGKIYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLG 83
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N ADLT E+F S G K + + N FK N + + ++IDW +GAVTP+
Sbjct: 84 VNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENVTDIP--EAIDWRVKGAVTPI 141
Query: 108 KDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYI 164
KDQG C CWAF+ VA EG+ +I TG L++ S+ +LVDC +++ GC +E+ FE+I
Sbjct: 142 KDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSVDHGCDGGLMEDGFEFI 201
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ ++SE YPY D CD S + I+GY+ V +EE LQ V+ QPVS
Sbjct: 202 IKNGGISSEANYPYTAV-DGTCD--ASKEASPAAQIKGYETVPANSEEALQQAVANQPVS 258
Query: 225 VAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
V+IDA + F FY GVFTG CG +HGVT+VGYGTT +G YW+VKN WGT W E
Sbjct: 259 VSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTD--DGTHEYWIVKNSWGTQWGE 316
Query: 283 GGSMRIFRGVGG-SGLCNIAANAAYP 307
G +R+ RG+ GLC IA +A+YP
Sbjct: 317 EGYIRMQRGIDALEGLCGIAMDASYP 342
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 147/329 (44%), Positives = 196/329 (59%), Gaps = 36/329 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LR 47
SRT NI KHEQWMV + + YKD E+E R KIFK+N + L
Sbjct: 28 SRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLG 87
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVT 105
+N+FADLT E+F+AS +K H S+ +++ FK N+S S ++DW ++GAVT
Sbjct: 88 INQFADLTNEEFIASRNKFKG----HMCSSITKTSTFKYENASVPS---TVDWRKKGAVT 140
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAF 161
PVK+QG CCWAF+AVA EG++K+ TG+LV+ S+ +LVDC T GC +++AF
Sbjct: 141 PVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAF 200
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
++I Q L +E YPYQG D C ++S I GY+ V E+ LQ V+ Q
Sbjct: 201 KFIIQNHGLNTEAQYPYQGV-DGTCSANKASIHAV--TITGYEDVPANNEQALQKAVANQ 257
Query: 222 PVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
P+SVAIDA+ F FY GVFTG CG +HGVT VGYG + YWLVKN WGT+
Sbjct: 258 PISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDG---TKYWLVKNSWGTD 314
Query: 280 WDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
W E G +++ RGV + GLC IA A+YP
Sbjct: 315 WGEEGYIKMQRGVDAAEGLCGIAMEASYP 343
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 142/326 (43%), Positives = 189/326 (57%), Gaps = 31/326 (9%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
SR+ H + +HE WM ++ R YKD +EKE RF+IF+ N EF L +
Sbjct: 26 SRSLHDAA-MNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEFIESFNKLGNRPYKLDI 84
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N+FADLT E+F S GYK S + N+ + S +DW + GAVTP+K
Sbjct: 85 NEFADLTNEEFKVSKNGYKRSSGVGLTEKSSFRYANVTAVPTS----MDWRQNGAVTPIK 140
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYI 164
DQG CCWAF+AVA +EG+ K+ TG+L++ S+ +LVDC T GC +++AFE+I
Sbjct: 141 DQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGEDQGCEGGLMDDAFEFI 200
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+Q L +E YPYQG D C+ + A I GY+ V +E+ L V+ QPVS
Sbjct: 201 KQNGGLTTEANYPYQG-TDGTCN--TNKAGNDAAKITGYEDVPANSEDALLKAVASQPVS 257
Query: 225 VAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAIDA+ F FY GGVFTG CG +HGVT VGYGT+ + YWLVKN WGT+W E
Sbjct: 258 VAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDDG---TKYWLVKNSWGTSWGE 314
Query: 283 GGSMRIFRGV-GGSGLCNIAANAAYP 307
G +R+ R + GLC IA +YP
Sbjct: 315 DGYIRMERDIEAKEGLCGIAMQPSYP 340
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 156/329 (47%), Positives = 197/329 (59%), Gaps = 37/329 (11%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
SRT ++ +IA +HE+WM R Y D AEK+ R +IFK+N EF L L
Sbjct: 26 SRTLSES-SIATQHEEWMAMHDRVYADSAEKDRRQQIFKENLEFIEKHNNEGKKRYNLSL 84
Query: 49 NKFADLTREKFLASYTG--YKPPPTDHPHSNRSNWFKNLNSSKMSFYD---SIDWNERGA 103
N FADLT E+F+AS+TG YKPP S + N +L KMS D S+DW +RGA
Sbjct: 85 NSFADLTNEEFVASHTGALYKPPT--QLGSFKIN--HSLGFHKMSVGDIEASLDWRKRGA 140
Query: 104 VTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFE 162
V +K+QG CWAF+AVA VEG+N+I+ GQLV+ S+ LVDC++ +GC ++E AF+
Sbjct: 141 VNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSLSEQNLVDCASNDGCHGQYVEKAFD 200
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
YIR Y LA+E YPY C S S IRGYQ V P EE L V+ QP
Sbjct: 201 YIRDY-GLANEEEYPYV-ETVGTC----SGNSNPAIQIRGYQSVTPQNEEQLLTAVASQP 254
Query: 223 VSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSV ++A F FY GGVF+G CG NH VTIVGYG EAEG+ YWL++N WG +W
Sbjct: 255 VSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYG--EEAEGK--YWLIRNSWGKSW 310
Query: 281 DEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
EGG M++ R G GLC I A+YP
Sbjct: 311 GEGGYMKLMRDTGNPQGLCGINMQASYPF 339
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 248 bits (632), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 141/326 (43%), Positives = 194/326 (59%), Gaps = 31/326 (9%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
MSR H+ +++ +HEQWM ++ + YKD AEK+ R IFK N EF L
Sbjct: 25 MSRNLHEA-SMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLS 83
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N AD T E+F+AS+ GYK + S+ FK N + + ++DW + GAVT V
Sbjct: 84 INHLADQTNEEFVASHNGYK-----YKGSHSQTPFKYGNVTDIP--TAVDWRQNGAVTAV 136
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIR 165
KDQG CWAF+ VA EG+ +I TG L++ S+ +LVDC +++ GC +E+ FE+I
Sbjct: 137 KDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSVDHGCDGGLMEDGFEFII 196
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ ++SE YPY D CD S + I+GY+ V +EE LQ V+ QPVSV
Sbjct: 197 KNGGISSEANYPYTAV-DGTCD--ASKEASPAAQIKGYETVPANSEEALQQAVANQPVSV 253
Query: 226 AIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
+IDA + F FY GVFTG CG +HGVT+VGYGTT +G YW+VKN WGT W E
Sbjct: 254 SIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTD--DGTHEYWIVKNSWGTQWGEE 311
Query: 284 GSMRIFRGVGG-SGLCNIAANAAYPL 308
G +R+ RG+ GLC IA +A+YP+
Sbjct: 312 GYIRMQRGIDAQEGLCGIAMDASYPM 337
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 247 bits (630), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 142/328 (43%), Positives = 193/328 (58%), Gaps = 37/328 (11%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LR 47
S T + + A+HE+WMVE RTYKD+AEK RF++FK N F L
Sbjct: 39 SSTGYGEEAMTARHEKWMVEHGRTYKDEAEKARRFQVFKANAAFVDTSNAAAGGKKYHLA 98
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD--SIDWNERGAVT 105
+N+FAD+T ++F+A YTG+KP P + FK N + +S D ++DW ++GAVT
Sbjct: 99 INRFADMTHDEFMARYTGFKPLPAT---GKKMPGFKYANVT-LSSEDQQAVDWRKKGAVT 154
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNF---LENAF 161
VK+Q CCWAF+AVA +EG+++I TG+LV+ S+ QLVDCST +E+AF
Sbjct: 155 DVKNQQKCGCCWAFSAVAAIEGMHQINTGELVSLSEQQLVDCSTNGNNNGCGGGTMEDAF 214
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
+Y+ +A+E YPY Q C + + A+R YQ V E+ L V+ Q
Sbjct: 215 QYVIGNNGIATEAAYPYTAMQG-MCQNVQPAV-----AVRSYQQVPRDDEDALAAAVAGQ 268
Query: 222 PVSVAIDATWFNFYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
PVSVA+DA F FY GGV T CG NH VT VGYGT AE PYWL+KN+WG+ W
Sbjct: 269 PVSVAVDANNFQFYKGGVMTADSCGTNLNHAVTAVGYGT---AEDGTPYWLLKNQWGSTW 325
Query: 281 DEGGSMRIFRGVGGSGLCNIAANAAYPL 308
E G +R+ RGVG C +A +A+YP+
Sbjct: 326 GEEGYLRLQRGVGA---CGVAKDASYPV 350
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 246 bits (629), Expect = 8e-63, Method: Compositional matrix adjust.
Identities = 144/327 (44%), Positives = 192/327 (58%), Gaps = 32/327 (9%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LR 47
SRT I KHEQWMV + + YKD E+E R KIFK+N + L
Sbjct: 28 SRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLG 87
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N+FADLT E+F+AS +K +++ FK N+S S ++DW ++GAVTPV
Sbjct: 88 INQFADLTNEEFIASRNKFKGHMCSSI--TKTSTFKYENASVPS---TVDWRKKGAVTPV 142
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
K+QG CCWAF+AVA EG++K+ TG+LV+ S+ +LVDC T GC +++AF++
Sbjct: 143 KNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFKF 202
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I Q L +E YPYQG D C ++S I GY+ V E+ LQ V+ QP+
Sbjct: 203 IIQNHGLNTEAQYPYQGV-DGTCSANKASIHAV--TITGYEDVPANNEQALQKAVANQPI 259
Query: 224 SVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVAIDA+ F FY GVFTG CG +HGVT VGYG + YWLVKN WGT+W
Sbjct: 260 SVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGNDG---TKYWLVKNSWGTDWG 316
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +++ RGV + GLC IA A+YP
Sbjct: 317 EEGYIKMQRGVDAAEGLCGIAMEASYP 343
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/326 (42%), Positives = 190/326 (58%), Gaps = 29/326 (8%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
SR+ H+ ++ +H+ WM ++ R YK EKE RFKIFK+N EF L +
Sbjct: 26 SRSLHE-ASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENVEFIESFNNNGNKPYKLGI 84
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N F DLT E+F AS+ GY + H S R+ F+ N + + S+DW +GAVT +K
Sbjct: 85 NAFTDLTNEEFRASHNGYTMSMSSHQSSYRTKSFRYENVTAVP--PSLDWRTKGAVTHIK 142
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYI 164
DQG CCWAF+AVA +EG+ K+ TG L++ S+ +LVDC T GC +++AFE+I
Sbjct: 143 DQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGMDQGCEGGLMDDAFEFI 202
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ L +E YPY+G D C+ A+ I GY+ V EE L+ V+ QPVS
Sbjct: 203 IENNGLTTEANYPYEG-VDGSCN--TRKAANHAAKITGYENVPAYDEEALRKAVANQPVS 259
Query: 225 VAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAIDA + F Y G+FTG CG +HGVT+VGYGT+ + YWLVKN WGT+W E
Sbjct: 260 VAIDAGESAFQHYSSGIFTGDCGTELDHGVTVVGYGTSDDG---TKYWLVKNSWGTSWGE 316
Query: 283 GGSMRIFRGVGG-SGLCNIAANAAYP 307
G +R+ R + GLC IA +YP
Sbjct: 317 DGYIRMERDIDAKEGLCGIAMEPSYP 342
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 245 bits (626), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 139/326 (42%), Positives = 191/326 (58%), Gaps = 29/326 (8%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
M R H+T + +HE WM E+ + YKD AEKE RF+IFK N EF L
Sbjct: 25 MPRKLHQTA-LRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLG 83
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N ADLT E+F S G K + + N FK N + + ++IDW +GAVTP+
Sbjct: 84 VNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENVTDIP--EAIDWRVKGAVTPI 141
Query: 108 KDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYI 164
KDQG C CWAF+ +A EG+++I TG LV+ S+ +LVDC ++ +GC F+E+ FE+I
Sbjct: 142 KDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSVDDGCEGGFMEDGFEFI 201
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ + SE YPY+G D C+ + A+ I+GY+ V +EE LQ V+ QPVS
Sbjct: 202 IKNGGITSETNYPYKGV-DGTCN--TTIAASPVAQIKGYEIVPSYSEEALQKAVANQPVS 258
Query: 225 VAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
V+I AT F FY G++ G CG +HGVT VGYGT E YW+VKN WGT W E
Sbjct: 259 VSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT----ENGTDYWIVKNSWGTQWGE 314
Query: 283 GGSMRIFRGVGGS-GLCNIAANAAYP 307
G +R+ RG+ G+C IA +++YP
Sbjct: 315 KGYIRMHRGIAAKHGICGIALDSSYP 340
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 245 bits (625), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 130/317 (41%), Positives = 184/317 (58%), Gaps = 28/317 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------NKFADLTRE 57
++ A+HEQWM ++ R Y D AEK R ++FK N F+ L N+FAD+T +
Sbjct: 106 SMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAFIELVNAGNDKFSLEANQFADMTVD 165
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F A++TGYKP P + R+ FK N S + S+DW +GAVTP+KDQG CCW
Sbjct: 166 EFRAAHTGYKPVPANK---GRTTQFKYANVSLDALPASMDWRAKGAVTPIKDQGQCGCCW 222
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASE 173
AF+ VA+VEG+ K+ TG+L++ S+ +LVDC GC ++NAFE+I L +E
Sbjct: 223 AFSTVASVEGIVKLSTGKLISLSEQELVDCDVDGMDQGCEGGLMDNAFEFIIDNGGLTTE 282
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY G D C+ + S +I+GY+ V E L V+ QPVS+A+D
Sbjct: 283 GNYPYTGTDD-SCN--SNKESNDVASIKGYEDVPSNDETSLLKAVAAQPVSIAVDGGDNL 339
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GGV +G CG +HG+ VGYG T++ +WL+KN WGT+W E G +R+ R
Sbjct: 340 FRFYKGGVLSGACGTELDHGIAAVGYGITSDG---TKFWLMKNSWGTSWGEKGFIRMERD 396
Query: 292 VGG-SGLCNIAANAAYP 307
+ GLC +A +YP
Sbjct: 397 IADEEGLCGLAMQPSYP 413
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 244 bits (624), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 140/317 (44%), Positives = 190/317 (59%), Gaps = 27/317 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL-------------NKFADLTRE 57
++ KHE+WM +F ++YKD AEKE RF+IFK N EF+ L N FADLT E
Sbjct: 33 LSNKHEKWMTQFGKSYKDAAEKEKRFQIFKNNVEFIELFNAVGNKPFNLSINHFADLTNE 92
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F AS G K N + F+ N + S S+DW +RGAVTP+K+QGS CW
Sbjct: 93 EFKASLNGNKKLHDKFDILNETTSFRYHNVT--SVPASMDWRKRGAVTPIKNQGSCGSCW 150
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASEC 174
AF+ VA++EG+++I TG+LV+ S+ +L+DC N GC+ +LE+AF++I + +ASE
Sbjct: 151 AFSTVASIEGIHQITTGELVSLSEQELIDCVRGNSSGCSGGYLEDAFKFIAKKGGMASET 210
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WF 232
YPY+ D C + + S I+GY+ V +E L V+ QPVSV +DA F
Sbjct: 211 NYPYK-ETDEKCKFKKESK--HVAEIKGYEKVPSNSENDLLKAVANQPVSVYVDAGDYVF 267
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG+FTG CG +H VTIVGYG + + YWLVKN WGT W E G M++ R V
Sbjct: 268 QFYSGGIFTGKCGTDTDHVVTIVGYGVSLD---YTEYWLVKNSWGTGWGEKGYMKLKRNV 324
Query: 293 GG-SGLCNIAANAAYPL 308
GLC IA N +YP+
Sbjct: 325 DSKKGLCGIATNPSYPV 341
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 244 bits (622), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 185/314 (58%), Gaps = 33/314 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE WM ++ R YK EKE R IFK N EF L +N+FADLT E+F
Sbjct: 3 RHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEEFQ 62
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS GYK H S+ + F+ N S + ++DW ++GAVTP+KDQG CCWAF+
Sbjct: 63 ASRNGYKMSA--HLSSSSTKPFRYENVSAVP--STMDWRKKGAVTPIKDQGQCGCCWAFS 118
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA EG+ ++ TG+L++ S+ +LVDC T GC +++AF++I Q + L +E Y
Sbjct: 119 AVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEANY 178
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNF 234
PYQG D C+ +++A I GY+ V +E L V+ QPVSVAIDA + F F
Sbjct: 179 PYQG-ADGACNSGKAAAK-----ITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQF 232
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG CG +HGVT VGYG + + YWLVKN WGT+W E G +R+ R +
Sbjct: 233 YSSGVFTGDCGTDLDHGVTAVGYGMSDDG---TKYWLVKNSWGTSWGENGYIRMERDIDA 289
Query: 295 -SGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 290 QEGLCGIAMEASYP 303
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/327 (42%), Positives = 190/327 (58%), Gaps = 37/327 (11%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
M R H+T ++ +HEQWM E+ + YKD AEK+ RF+IFK N EF L
Sbjct: 25 MCRKLHET-SMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPYKLG 83
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N ADLT E+F AS G+K PH + FK N + + +IDW +GAVTP+
Sbjct: 84 VNHLADLTVEEFKASRNGFK-----RPHEFSTTTFKYENVTAIPA--AIDWRTKGAVTPI 136
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CWAF+ +A EG+++I TG+LV+ S+ +LVDC T GC ++E+ FE+
Sbjct: 137 KDQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFEF 196
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + + SE YPY+ D C+ A+ I+GY+ V P +E LQ V+ QPV
Sbjct: 197 IIKNGGITSETNYPYKAV-DGKCN----KATSPVAQIKGYEKVPPNSETALQKAVANQPV 251
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SV+IDA F FY G++ G CG +HGVT VGYGT + YW+VKN WGT W
Sbjct: 252 SVSIDADGAGFMFYSSGIYNGECGTELDHGVTAVGYGTANGTD----YWIVKNSWGTQWG 307
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ RG+ GLC IA +++YP
Sbjct: 308 EKGYVRMQRGIAAKHGLCGIALDSSYP 334
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 243 bits (619), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 185/315 (58%), Gaps = 28/315 (8%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKF 59
++HE+WM E R YKD+AEK R ++F+ N E L N+FADLT E+F
Sbjct: 36 SRHEKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVEEF 95
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A+ TG +P P + R F+ N S S+DW GAVT VKDQG+ CCWAF
Sbjct: 96 RAARTGLRPRPAPSAGAGR---FRYENFSLADAAQSVDWRAMGAVTGVKDQGACGCCWAF 152
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASECV 175
+AVA VEGLNKIRTG+LV+ S+ +LVDC GC ++NAF+++ + LASE
Sbjct: 153 SAVAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESG 212
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPYQGR D C S+A+ + +IRG++ V E L V+ QPVSVAI+ F
Sbjct: 213 YPYQGR-DGPCR--SSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDMAFR 269
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GV G CG NH +T VGYGT + YWL+KN WG +W EGG +RI RGV
Sbjct: 270 FYDSGVLGGACGTDLNHAITAVGYGTANDG---TRYWLMKNSWGASWGEGGYVRIRRGVR 326
Query: 294 GSGLCNIAANAAYPL 308
G G+C +A +YP+
Sbjct: 327 GEGVCGLAKLPSYPV 341
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 144/329 (43%), Positives = 197/329 (59%), Gaps = 36/329 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LR 47
SRT + G++ +HE+WM + + YKD E+E RFKIF +N ++ L
Sbjct: 27 SRTL-QDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTENMKYIEAFNNGDNNESYKLG 85
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVT 105
+N+FADLT E+F+AS +K H S+ R+ FK N S + ++DW ++GAVT
Sbjct: 86 INQFADLTNEEFVASRNKFKG----HMCSSIIRTTTFKYENVSAIP--STVDWRKKGAVT 139
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAF 161
PVK+QG CCWAF+AVA EG++K+ TG+LV+ S+ +LVDC T GC +++AF
Sbjct: 140 PVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAF 199
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
++I Q L +E YPYQG D C+ + AS + I GY+ V E+ LQ V+ Q
Sbjct: 200 KFIIQNHGLNTEAQYPYQGV-DGTCN--ANKASIQATTITGYEDVPANNEQALQKAVANQ 256
Query: 222 PVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
P+SVAIDA+ F FY GVFTG CG +HGVT VGYG + + YWLVKN WGT+
Sbjct: 257 PISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDG---TKYWLVKNSWGTD 313
Query: 280 WDEGGSMRIFRGV-GGSGLCNIAANAAYP 307
W E G + + RGV GLC IA A+YP
Sbjct: 314 WGEEGYIMMQRGVEAAEGLCGIAMQASYP 342
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 143/319 (44%), Positives = 183/319 (57%), Gaps = 36/319 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
KHEQWM F R Y D +EK RF+IFKKN +F L +N+F+DLT E+F
Sbjct: 34 KHEQWMSRFHRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNKTYTLDVNEFSDLTDEEFK 93
Query: 61 ASYTGYKPP------PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
A YTG P T H S ++N+ + +S+DW E GAVT VK Q
Sbjct: 94 ARYTGLVVPEGMTRMSTTDSHETVSFRYENVGETG----ESMDWREEGAVTSVKHQQQCG 149
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLAS 172
CCWAF+AVA VEG+ KI G+LV+ S+ QL+DCST N GC + AF+YI + Q + +
Sbjct: 150 CCWAFSAVAAVEGMTKIAKGELVSLSEQQLLDCSTENDGCDGGIMWKAFDYIVENQGITA 209
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPYQG Q C+ +A+ I GY+ V EE L VS+QPVSVAI+ + +
Sbjct: 210 EDNYPYQGAQQ-TCESNHVAAA----TISGYETVPQNDEEALLKAVSQQPVSVAIEGSGY 264
Query: 233 NFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F H GG+F G CG NH VTIVGYG + E YWL+KN WG +W E G MRI R
Sbjct: 265 EFIHYSGGIFNGECGTHLNHAVTIVGYGVSEEG---IKYWLLKNSWGESWGEDGYMRIMR 321
Query: 291 GVGG-SGLCNIAANAAYPL 308
V G+C +A+ A YP+
Sbjct: 322 DVDAPQGMCGLASLAYYPV 340
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 242 bits (618), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 142/327 (43%), Positives = 195/327 (59%), Gaps = 31/327 (9%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
+SRT H+ +++ +HE WM + RTYKD AEKE RFKIFK+N E+ L
Sbjct: 23 LSRTLHEV-SMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIESVNSAGNRRYKLS 81
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N+FAD T E+F AS GY + P S+ F+ N + + S+DW ++GAVTP+
Sbjct: 82 INEFADQTNEEFKASRNGYNM--SSRPRSSEITSFRYENVAAVP--SSMDWRKKGAVTPI 137
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CCWAF+AVA +EG+ +++TG+L++ S+ +LVDC T GC +++AFE+
Sbjct: 138 KDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEF 197
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I L +E YPY+G D C+ + A+ I+ Y+ V +E L V++ PV
Sbjct: 198 IIGNGGLTTEANYPYKG-VDATCN--KKKAASSAAKIKNYEDVPANSEAALLKAVAQHPV 254
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVAIDA + F FY GVFTG CG +HGVT VGYG T + YWLVKN WGT W
Sbjct: 255 SVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYGKTDDG---TKYWLVKNSWGTGWG 311
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G + + R +G GLC IA A+YP
Sbjct: 312 EDGYIWMERDIGADEGLCGIAMEASYP 338
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 187/315 (59%), Gaps = 29/315 (9%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKF 59
++HE+WM E R YKD+AEK R ++F+ N E L N+FADLT ++F
Sbjct: 36 SRHEKWMAEHGRAYKDEAEKARRLEVFRANAELIDSFNAAGTHSHRLATNRFADLTVQEF 95
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG-SYCCWAF 118
A+ TG +P P + R F+ N S S+DW GAVT VKDQG S CCWAF
Sbjct: 96 RAARTGLRPRPAPSAGAGR---FRYENFSLADAAQSVDWRAMGAVTGVKDQGASGCCWAF 152
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+AVA VEGLNKIRTG+LV+ S+ +LVDC GC ++NAF+++ + LASE
Sbjct: 153 SAVAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGLASESG 212
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPYQ R D C RSSA+ +IRG++ V E L V+ QPVSVAI+ F
Sbjct: 213 YPYQCR-DGPC---RSSAAAAAASIRGHEDVPRNNEAALAAAVAHQPVSVAINGEDMAFR 268
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GV G CG NH +T VGYGT A+G + YWL+KN WG +W EGG +RI RGV
Sbjct: 269 FYDSGVLGGACGTDLNHAITAVGYGTA--ADGTR-YWLMKNSWGASWGEGGYVRIRRGVR 325
Query: 294 GSGLCNIAANAAYPL 308
G G+C +A +YP+
Sbjct: 326 GEGVCGLAKLPSYPV 340
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 242 bits (617), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 143/329 (43%), Positives = 192/329 (58%), Gaps = 36/329 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------- 47
SRT I KHEQWMV + + YKD E+E R KIFK+N ++
Sbjct: 28 SRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYKLG 87
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVT 105
+N+FAD+T E+F+AS +K H S+ +++ FK N+S S ++DW ++GAVT
Sbjct: 88 INQFADITNEEFIASRNKFKG----HMCSSITKTSTFKYENASVPS---TVDWRKKGAVT 140
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAF 161
PVK+QG CCWAF+AVA EG++K+ TG+LV+ S+ +LVDC T GC +++AF
Sbjct: 141 PVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAF 200
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
++I Q L +E YPYQG D C +S I GY+ V E LQ V+ Q
Sbjct: 201 KFIIQNHGLHTEAQYPYQGV-DGTCSANETSTPA--ATIAGYEDVPANNENALQKAVANQ 257
Query: 222 PVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
P+SVAIDA+ F FY GVFTG CG +HGVT VGYG + + YWLVKN WG +
Sbjct: 258 PISVAIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISNDG---TKYWLVKNSWGND 314
Query: 280 WDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
W E G +R+ R V + GLC IA A+YP
Sbjct: 315 WGEEGYIRMQRSVDAAQGLCGIAMMASYP 343
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 241 bits (616), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 192/317 (60%), Gaps = 35/317 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKF 59
+H QWM ++ + YKD E+E RFKIFK+N + L +N+FADLT E+F
Sbjct: 38 RHGQWMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKLGINQFADLTNEEF 97
Query: 60 LASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+AS +K H S+ R+ FK N S + ++DW ++GAVTPVK+QG CCW
Sbjct: 98 IASRNKFKG----HMCSSIMRTTSFKYENVSGIP--STVDWRKKGAVTPVKNQGQCGCCW 151
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
AF+AVA EG++K+ TG+L++ S+ +LVDC T GC +++AF++I Q L++E
Sbjct: 152 AFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTE 211
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
YPY+G D C+ + AS + I GY+ V +E+ LQ V+ QP+SVAIDA+
Sbjct: 212 AQYPYEGV-DGTCN--ANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGSD 268
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GVFTG CG +HGVT VGYG + + YWLVKN WGT+W E G + + RG
Sbjct: 269 FQFYKSGVFTGACGTELDHGVTAVGYGVSNDG---TKYWLVKNSWGTDWGEEGYIMMQRG 325
Query: 292 V-GGSGLCNIAANAAYP 307
+ G+C IA A+YP
Sbjct: 326 IEAAEGICGIAMQASYP 342
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 141/312 (45%), Positives = 181/312 (58%), Gaps = 30/312 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM + R Y D EKE R+ IFK+N E L +NKFADLT E+F
Sbjct: 4 RHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFR 63
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A Y GYK + S+ F+ N S + S+DW GAVTPVKDQG+ CCWAF+
Sbjct: 64 AMYHGYKRQSSKLMSSS----FRYENLSDIP--TSMDWRNDGAVTPVKDQGTCGCCWAFS 117
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPY 178
VA +EG+ K++TG L++ S+ QLVDC+ N GC ++ AF+YI + L SE YPY
Sbjct: 118 TVAAIEGIIKLQTGNLISLSEQQLVDCTAGNKGCQGGLMDTAFQYIIRNGGLTSEDNYPY 177
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
QG D C A+ I GY+ V E L V++QPVSVA+D F FY
Sbjct: 178 QGV-DGTCS--SEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFYK 234
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS- 295
GVF G CG NHGVT +GYGT ++ YWLVKN WGT+W E G R+ RG+G S
Sbjct: 235 SGVFEGDCGTNLNHGVTAIGYGTDSDG---TDYWLVKNSWGTSWGESGYTRMQRGIGASE 291
Query: 296 GLCNIAANAAYP 307
GLC +A +A+YP
Sbjct: 292 GLCGVAMDASYP 303
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 241 bits (615), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 188/314 (59%), Gaps = 30/314 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+H QWM ++ + YKD E+E RFKIF +N + L +N+FADLT E+F+
Sbjct: 38 RHGQWMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLGINQFADLTNEEFV 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS +K R+ FK N S + ++DW ++GAVTPVK+QG CCWAF+
Sbjct: 98 ASRNKFKGHMCSSI--TRTTTFKYENVSAIP--STVDWRKKGAVTPVKNQGQCGCCWAFS 153
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA EG++K+ TG+L++ S+ +LVDC T GC +++AF++I Q L++E Y
Sbjct: 154 AVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQY 213
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY+G D C+ + AS + I GY+ V +E+ LQ V+ QP+SVAIDA+ F F
Sbjct: 214 PYEGV-DGTCN--ANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAIDASGSDFQF 270
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-G 293
Y GVFTG CG +HGVT VGYG + + YWLVKN WGT+W E G + + RGV
Sbjct: 271 YKSGVFTGSCGTELDHGVTAVGYGVSNDG---TKYWLVKNSWGTDWGEEGYIMMQRGVEA 327
Query: 294 GSGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 328 AEGLCGIAMQASYP 341
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 240 bits (612), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 139/330 (42%), Positives = 195/330 (59%), Gaps = 35/330 (10%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
++ S + ++ +HEQWM ++++ YKD E+E R KIF N ++
Sbjct: 26 VTSRSLQVDSMYERHEQWMSQYSKVYKDPQEREERHKIFTANVNYIEVFNNDANNKLYKL 85
Query: 48 -LNKFADLTREKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAV 104
+N+FADLT E+F+AS +K H S+ ++ FK N S + ++DW ++GAV
Sbjct: 86 GINQFADLTNEEFIASRNKFKG----HMCSSIAKTTTFKYENVSAIP--STVDWRKKGAV 139
Query: 105 TPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENA 160
TPVK+QG CCWAF+AVA EG+ K+ TG+LV+ S+ +LVDC T GC +++A
Sbjct: 140 TPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDA 199
Query: 161 FEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR 220
F++I Q L++E YPYQG D C+ + AS I GY+ V E+ LQ V+
Sbjct: 200 FKFIIQNHGLSTEAAYPYQGV-DGTCN--ANKASIHAATITGYEDVPANNEQALQKAVAN 256
Query: 221 QPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGT 278
QP+SVAIDA+ F FY GVF+G CG +HGVT VGYG + YWLVKN WGT
Sbjct: 257 QPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGYGVGNDG---TKYWLVKNSWGT 313
Query: 279 NWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
+W E G +R+ RGV + GLC IA A+YP
Sbjct: 314 DWGEEGYIRMQRGVDAAEGLCGIAMQASYP 343
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 240 bits (612), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 143/318 (44%), Positives = 186/318 (58%), Gaps = 34/318 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ +HEQWM ++ R YK +AEK RF IFK+N E+ L +N FADLT +
Sbjct: 33 MVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADLTNQ 92
Query: 58 KFLASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+F AS GYK PH SN F+ N S S ++DW +GAVTPVKDQG CC
Sbjct: 93 EFKASRNGYK-----LPHDCSSNTPFRYENVS--SVPTTVDWRTKGAVTPVKDQGQCGCC 145
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLAS 172
WAF+AVA +EG+ K+ TG L++ S+ +LVDC T GC +++AF +I + L +
Sbjct: 146 WAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSFIINNKGLTT 205
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPYQG D C +S +S I GY+ V +E L+ V+ QPVSVAIDA +
Sbjct: 206 ESNYPYQGT-DGSCK--KSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGS 262
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F FY GVFTG CG +HGVT VGYG AE YWLVKN WGT+W E G +R+ +
Sbjct: 263 DFQFYSSGVFTGECGTELDHGVTAVGYGI---AEDGSKYWLVKNSWGTSWGEKGYIRMQK 319
Query: 291 GV-GGSGLCNIAANAAYP 307
+ GLC IA ++YP
Sbjct: 320 DIEAKEGLCGIAMQSSYP 337
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 240 bits (612), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 130/314 (41%), Positives = 183/314 (58%), Gaps = 30/314 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HEQWM+++ R YKD+AEK +RF+IF N +F L +N+FAD T E+F
Sbjct: 56 RHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKFIEEFNKDGRQSYKLAVNEFADQTNEEFQ 115
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS GYK + P ++N+ + S+DW ++GAVTPVKDQG CWAF+
Sbjct: 116 ASRNGYKMAVSSRPSQTTLFRYENVTAVP----SSMDWRKKGAVTPVKDQGQCGSCWAFS 171
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
+A EG+ K++TG+L++ S+ +LVDC GC ++E+ FE+I + + +A E Y
Sbjct: 172 TIAATEGITKLKTGKLISLSEQELVDCDKTGEDQGCEGGYMEDGFEFIVKNKGIALEASY 231
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNF 234
PY D C+ + + I GY+ V +E L V+ QPVSV+IDA+ F F
Sbjct: 232 PYTA-ADGTCN--SKEEASRAAKISGYEKVPANSETALLKAVANQPVSVSIDASGVAFQF 288
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG CG +HGVT VGYG T++ YWLVKN WG +W + G + + RGV
Sbjct: 289 YSSGVFTGECGTDLDHGVTAVGYGKTSDG---TKYWLVKNSWGASWGDSGYIMMQRGVAA 345
Query: 295 S-GLCNIAANAAYP 307
GLC IA +A+YP
Sbjct: 346 KGGLCGIAMDASYP 359
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 240 bits (612), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 186/319 (58%), Gaps = 39/319 (12%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF---------------LRLNKFADLTREK 58
+H+QWM E RTY+D+AEK RF++FK N +F + LN+FAD+T ++
Sbjct: 50 RHQQWMAEHGRTYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDE 109
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD---SIDWNERGAVTPVKDQGSY-C 114
F+A YTG +P P + + FK N + D ++DW ++GAVT +K+QG C
Sbjct: 110 FMAMYTGLRPVPAG---AKKMAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGC 166
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLAS 172
CWAF AVA VEG+++I TG LV+ S+ Q++DC T NGC +++NAF+YI LA+
Sbjct: 167 CWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTEGNNGCNGGYIDNAFQYIAGNGGLAT 226
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY Q C + A AI GYQ V E L V+ QPVSVAIDA F
Sbjct: 227 EDAYPYTAAQ-AMCQSVQPVA-----AISGYQDVPSGDEAALAAAVANQPVSVAIDAHNF 280
Query: 233 NFYHGGVFTGPCGNTP---NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
Y GGV T +TP NH VT VGYGT AE PYWL+KN+WG NW EGG +R+
Sbjct: 281 QLYGGGVMTAASCSTPPNLNHAVTAVGYGT---AEDGTPYWLLKNQWGQNWGEGGYLRLE 337
Query: 290 RGVGGSGLCNIAANAAYPL 308
R G+ C +A A+YP+
Sbjct: 338 R---GANACGVAQQASYPV 353
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 239 bits (611), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 140/320 (43%), Positives = 183/320 (57%), Gaps = 33/320 (10%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-----------------LRLNKFADLT 55
++HE+WM + +TYKD+ EK R ++F+ N + L N+FADLT
Sbjct: 40 SRHEKWMAKHGKTYKDEEEKARRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLT 99
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++F A+ TGY+ PP + ++N S + S+DW GAVT VKDQGS C
Sbjct: 100 DDEFRAARTGYQRPPAAVAGAGGGFLYENF--SLAAAPQSMDWRAMGAVTGVKDQGSCGC 157
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLA 171
CWAF+AVA VEGL KIRTGQLV+ S+ +LVDC GC ++ AF+YI + LA
Sbjct: 158 CWAFSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLA 217
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
+E YPY+G R++A +IRG+Q V E L V+RQPVSVAI+
Sbjct: 218 AESSYPYRGVDGAC----RAAAGRAAASIRGFQDVPSNDEGALMAAVARQPVSVAINGAG 273
Query: 232 --FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
F FY GV G CG NH VT VGYGT ++ G YWL+KN WG +W EGG +RI
Sbjct: 274 YVFRFYDRGVLGGAGCGTELNHAVTAVGYGTASDGTG---YWLMKNSWGASWGEGGYVRI 330
Query: 289 FRGVGGSGLCNIAANAAYPL 308
RGVG G C IA A+YP+
Sbjct: 331 RRGVGREGACGIAQMASYPV 350
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 239 bits (611), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 182/321 (56%), Gaps = 20/321 (6%)
Query: 5 SHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKF 51
+ + G + A+H++WM E RTYKD AEK RF++FK N + L N+F
Sbjct: 32 ASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRF 91
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG 111
DLT +F A YTGY P T + +N + + + + + +DW ++GAVT VK+Q
Sbjct: 92 TDLTDAEFAAMYTGYNPANTMYAAANATTRLSSEDDQQPA---EVDWRQQGAVTGVKNQR 148
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRL 170
S CCWAF+ VA VEG+++I TG+LV+ S+ QL+DC+ GC L+NAF+Y+ +
Sbjct: 149 SCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCADNGGCTGGSLDNAFQYMANSGGV 208
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT 230
+E Y YQG Q SSASG I GYQ V P E L V+ QPVSVAI+ +
Sbjct: 209 TTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGS 268
Query: 231 --WFNFYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F Y GVFT CG +H V +VGYG + G YW++KN WGT W +GG M+
Sbjct: 269 GAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMK 328
Query: 288 IFRGVGGSGLCNIAANAAYPL 308
+ + VG G C +A +YP+
Sbjct: 329 LEKDVGSQGACGVAMAPSYPV 349
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 239 bits (611), Expect = 9e-61, Method: Compositional matrix adjust.
Identities = 140/323 (43%), Positives = 187/323 (57%), Gaps = 32/323 (9%)
Query: 5 SHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKF 51
S + ++ +HE+WM + R YKD EK+ R+KIF++N L +N+F
Sbjct: 28 SLQDASMRERHEEWMASYGRVYKDINEKQKRYKIFEENVALIESSNKDANKPYKLSVNQF 87
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG 111
ADLT E+F AS +K H S +S FK N S + ++DW +GAVTPVKDQG
Sbjct: 88 ADLTNEEFKASRNRFKG----HICSTKSTSFKYGNVSAVP--SAMDWRMKGAVTPVKDQG 141
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
CCWAF+AVA EG+ K+ TG+L++ S+ +LVDC T GC ++NAF +I+
Sbjct: 142 QCGCCWAFSAVAATEGITKLTTGELISLSEQELVDCDTSGVDQGCEGGLMDNAFTFIQHN 201
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
LASE YPY+G D C+ + + I G++ V +EE L + V+ QPVSVAI
Sbjct: 202 HGLASEANYPYKGV-DGTCNTNKQAIHA--AEINGFEDVPANSEEALLNAVAHQPVSVAI 258
Query: 228 DA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
DA + F FY GVF G CG +HGVT VGYGT+ + YWLVKN WGT W E G
Sbjct: 259 DAGGSGFQFYSKGVFIGACGTQLDHGVTAVGYGTSDDG---TKYWLVKNSWGTQWGEEGY 315
Query: 286 MRIFRGVGG-SGLCNIAANAAYP 307
+R+ R V GLC IA A+YP
Sbjct: 316 IRMQRDVDAKEGLCGIAMKASYP 338
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 239 bits (611), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 182/321 (56%), Gaps = 20/321 (6%)
Query: 5 SHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKF 51
+ + G + A+H++WM E RTYKD AEK RF++FK N + L N+F
Sbjct: 22 ASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANVDLIDRSNAAGNKRYRLATNRF 81
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG 111
DLT +F A YTGY P T + +N + + + + + +DW ++GAVT VK+Q
Sbjct: 82 TDLTDAEFAAMYTGYNPANTMYAAANATTRLSSEDDQQPA---EVDWRQQGAVTGVKNQR 138
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRL 170
S CCWAF+ VA VEG+++I TG+LV+ S+ QL+DC+ GC L+NAF+Y+ +
Sbjct: 139 SCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCADNGGCTGGSLDNAFQYMANSGGV 198
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT 230
+E Y YQG Q SSASG I GYQ V P E L V+ QPVSVAI+ +
Sbjct: 199 TTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNPNDEGSLAAAVASQPVSVAIEGS 258
Query: 231 --WFNFYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F Y GVFT CG +H V +VGYG + G YW++KN WGT W +GG M+
Sbjct: 259 GAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSGGGGYWIIKNSWGTTWGDGGYMK 318
Query: 288 IFRGVGGSGLCNIAANAAYPL 308
+ + VG G C +A +YP+
Sbjct: 319 LEKDVGSQGACGVAMAPSYPV 339
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 186/318 (58%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++A+HEQWM F + Y D AEKE RF+IFK N E+ L +NKFADLT
Sbjct: 33 SMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEYIESFNTAGNKPYKLSVNKFADLTN 92
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
E+ + GY+ P P S ++N+ + ++DW ++GAVTP+KDQG C
Sbjct: 93 EELKVARNGYRRPLQTRPMKVTSFKYENVTAVPA----TMDWRKKGAVTPIKDQGQCGSC 148
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
WAF+ VA EG+N++ TG+LV+ S+ +LVDC T GC +E+ FE+I + + +
Sbjct: 149 WAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGEDQGCEGGLMEDGFEFIIKNHGITT 208
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPYQ D C+ + ++ + I GY+ V +E L V+ QP+SV+IDA +
Sbjct: 209 EANYPYQA-ADGTCNSKKEAS--RIAKITGYESVPANSEAALLKAVASQPISVSIDAGGS 265
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F FY GVFTG CG +HGVT VGYG T++ YWLVKN WGT+W E G +R+ R
Sbjct: 266 DFQFYSSGVFTGQCGTELDHGVTAVGYGETSDG---TKYWLVKNSWGTSWGEEGYIRMQR 322
Query: 291 GV-GGSGLCNIAANAAYP 307
GLC IA +++YP
Sbjct: 323 DTEAEEGLCGIAMDSSYP 340
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 140/312 (44%), Positives = 179/312 (57%), Gaps = 30/312 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM + R Y D EKE R+ IFK+N E L +NKFADLT E+F
Sbjct: 39 RHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFR 98
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A Y GYK + S+ F+ N S + S+DW GAVTPVKDQG+ CCWAF+
Sbjct: 99 AMYHGYKRQSSKLMSSS----FRYENLSDIP--TSMDWRNDGAVTPVKDQGTCGCCWAFS 152
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPY 178
VA +EG+ K++TG L++ S+ QLVDC+ N GC ++ AF+YI + L SE YPY
Sbjct: 153 TVAAIEGIIKLQTGNLISLSEQQLVDCTAGNKGCQGGLMDTAFQYIIRNGGLTSEDNYPY 212
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
QG D C A+ I GY+ V E L V++QPVSV +D F FY
Sbjct: 213 QGV-DGTCS--SEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVGVDGGGNDFQFYK 269
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS- 295
GVF G CG NH VT +GYGT + YWLVKN WGT+W E G MR+ RG+G S
Sbjct: 270 SGVFNGDCGTQQNHAVTAIGYGTDIDG---TDYWLVKNSWGTSWGENGYMRMRRGIGSSE 326
Query: 296 GLCNIAANAAYP 307
GLC +A +A+YP
Sbjct: 327 GLCGVAMDASYP 338
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 190/326 (58%), Gaps = 29/326 (8%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
M R H+T + +HE WM E+ + YKD AEKE RF+IFK N EF L
Sbjct: 25 MPRKLHQTA-LRERHENWMAEYGKMYKDAAEKEKRFQIFKDNVEFIESFNAAGNKPYKLG 83
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N ADLT E+F S G K + + N FK N + + ++IDW +GAVTP+
Sbjct: 84 VNHLADLTLEEFKDSRNGLKRTYEFSTTTFKLNGFKYENVTDIP--EAIDWRVKGAVTPI 141
Query: 108 KDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYI 164
KDQG C WAF+ +A EG+++I TG LV+ S+ +LVDC ++ +GC F+E+ FE+I
Sbjct: 142 KDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLSEQELVDCDSVDDGCEGGFMEDGFEFI 201
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ + SE YPY+G D C+ + A+ I+GY+ V +EE L+ V+ QPVS
Sbjct: 202 IKNGGITSETNYPYKGV-DGTCN--TTIAASPVAQIKGYEIVPSYSEEALKKAVANQPVS 258
Query: 225 VAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
V+I AT F FY G++ G CG +HGVT VGYGT E YW+VKN WGT W E
Sbjct: 259 VSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT----ENGTDYWIVKNSWGTQWGE 314
Query: 283 GGSMRIFRGVGGS-GLCNIAANAAYP 307
G +R+ RG+ G+C IA +++YP
Sbjct: 315 KGYIRMHRGIAAKHGICGIALDSSYP 340
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 239 bits (610), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 140/325 (43%), Positives = 192/325 (59%), Gaps = 32/325 (9%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
MSR H+ +++ +HEQWM ++ + YKD AEK+ R IFK N EF L
Sbjct: 25 MSRNLHEA-SMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNRPYKLS 83
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N AD T E+F+AS+ GYK H S+ FK N + + +++DW E GAVT V
Sbjct: 84 INHLADQTNEEFVASHNGYK-----HKGSHSQTPFKYENVTGVP--NAVDWRENGAVTAV 136
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIR 165
KDQG CWAF+ VA EG+ +I T L++ S+ +LVDC +++ GC ++E FE+I
Sbjct: 137 KDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHGCDGGYMEGGFEFII 196
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ ++SE YPY D CD + ++ I+GY+ V +E+ LQ V+ QPVSV
Sbjct: 197 KNGGISSEANYPYTAV-DGTCDANKEASPA--AQIKGYETVPANSEDALQKAVANQPVSV 253
Query: 226 AIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
IDA + F FY GVFTG CG +HGVT VGYG+T +G Q YW+VKN WGT W E
Sbjct: 254 TIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTD--DGTQ-YWIVKNSWGTQWGEE 310
Query: 284 GSMRIFRGVGG-SGLCNIAANAAYP 307
G +R+ RG GLC IA +A+YP
Sbjct: 311 GYIRMQRGTDAQEGLCGIAMDASYP 335
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 143/317 (45%), Positives = 186/317 (58%), Gaps = 25/317 (7%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ ++HE+WM E RTY D+AEK R +IF+ N EF L N+FADLT E
Sbjct: 43 MVSRHEKWMAEHGRTYTDEAEKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDE 102
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F A+ TG++P P + F+ N S S+DW GAVT VKDQG CCW
Sbjct: 103 EFRAARTGFRPRPAPAAAAGSGGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCW 162
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASE 173
AF+AVA VEGLNKIRTG+LV+ S+ +LVDC GC +++AF++I + LASE
Sbjct: 163 AFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASE 222
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--W 231
YPYQG D C S+A+ + +IRG++ V E L V+ QPVSVAI+
Sbjct: 223 SGYPYQG-DDGSCR--SSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYA 279
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GV G CG NH +T VGYGT + YWL+KN WGT+W EGG +RI RG
Sbjct: 280 FRFYDSGVLGGECGTDLNHAITAVGYGTAADG---SKYWLMKNSWGTSWGEGGYVRIRRG 336
Query: 292 VGGSGLCNIAANAAYPL 308
V G G+C +A +YP+
Sbjct: 337 VRGEGVCGLAKLPSYPV 353
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 239 bits (609), Expect = 1e-60, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 186/322 (57%), Gaps = 39/322 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF---------------LRLNKFADLT 55
+ +H+QWM E RTY+D+AEK RF++FK N +F L LN+FAD+T
Sbjct: 47 MKVRHQQWMAEHGRTYRDEAEKAHRFQVFKANADFVDASNAAGDDKKSYRLELNEFADMT 106
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD---SIDWNERGAVTPVKDQGS 112
++F+A YTG +P P + + FK N + D ++DW ++GAVT +K+QG
Sbjct: 107 NDEFMAMYTGLRPVPAG---AKKMAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQ 163
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQR 169
CCWAF AVA VEG+++I TG LV+ S+ Q++DC T NGC +++NAF+YI
Sbjct: 164 CGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIVGNGG 223
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L +E YPY Q C + A AI GYQ V E L V+ QPVSVAIDA
Sbjct: 224 LGTEDAYPYTAAQ-AMCQSVQPVA-----AISGYQDVPSGDEAALAAAVANQPVSVAIDA 277
Query: 230 TWFNFYHGGVFTGPCGNTP---NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GGV T +TP NH VT VGYGT AE PYWL+KN+WG NW EGG +
Sbjct: 278 HNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGT---AEDGTPYWLLKNQWGQNWGEGGYL 334
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
R+ R G+ C +A A+YP+
Sbjct: 335 RLER---GANACGVAQQASYPV 353
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 239 bits (609), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 136/311 (43%), Positives = 186/311 (59%), Gaps = 28/311 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+ E+WM E+ R YKD EK RF+IFK N L +NKF D+T +F+
Sbjct: 36 RFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFV 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A YTG P + + F ++N S + SIDW + GAVT VKDQ CWAF+
Sbjct: 96 AQYTGGISRPLNIEKEPVVS-FDDVNISAVG--QSIDWRDYGAVTEVKDQNPCGSCWAFS 152
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A+ATVEG+ KI TG LV+ S+ +++DC+ NGC F++NA+++I +ASE YPYQ
Sbjct: 153 AIATVEGIYKIVTGYLVSLSEQEVLDCAVSNGCDGGFVDNAYDFIISNNGVASEADYPYQ 212
Query: 180 GRQ-DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YH 236
Q D + W +SA I GY YV+ E ++ V QP++ AIDA+ NF Y+
Sbjct: 213 AYQGDCAANSWPNSA-----YITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYN 267
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSG 296
GGVF+GPCG + NH +TI+GYG ++ G Q YW+VKN WG++W E G +R+ RGV SG
Sbjct: 268 GGVFSGPCGTSLNHAITIIGYG--QDSSGTQ-YWIVKNSWGSSWGERGYIRMARGVSSSG 324
Query: 297 LCNIAANAAYP 307
LC IA + YP
Sbjct: 325 LCGIAMDPLYP 335
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 143/327 (43%), Positives = 186/327 (56%), Gaps = 34/327 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
SR H+ + +HE+WM + + YKD EK RF+IFK N F L +
Sbjct: 27 SRELHEL-EMTGRHEKWMAKHGKVYKDDKEKLRRFQIFKSNVVFIESFNTAGNKSYMLGI 85
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
NKFADLT E+F A + GYK P S + FK N + + SIDW +GAVTP+K
Sbjct: 86 NKFADLTNEEFRAFWNGYKRPLG---ASRKITPFKYENVTALP--SSIDWRSKGAVTPIK 140
Query: 109 DQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
DQG C CWAF+AVA EG++K+RTG+LV+ S+ +LVDC GC + +AF++
Sbjct: 141 DQG-VCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDCDVKGQDKGCQGGLMVDAFKF 199
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I+++ + SE YPYQGR D CD + ++ + I GYQ V +E L V+ QPV
Sbjct: 200 IKRHGGMTSEANYPYQGR-DGKCDTKKEAS--RAVKITGYQAVPKNSEAALLKAVANQPV 256
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVAIDA F FY G+FTG CG NHGV VGYG + YW+VKN WGT W
Sbjct: 257 SVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRSNSG---SKYWIVKNSWGTEWG 313
Query: 282 EGGSMRIFRGV-GGSGLCNIAANAAYP 307
E G +R+ R V GLC IA +YP
Sbjct: 314 EKGYIRMKRDVRSKEGLCGIAMECSYP 340
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 238 bits (607), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 141/328 (42%), Positives = 189/328 (57%), Gaps = 38/328 (11%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
MSR H+T ++ +HEQWM E+ + YKD AEKE RF IFK N EF L
Sbjct: 25 MSRKLHET-SMRERHEQWMAEYGKVYKDAAEKEKRFLIFKHNVEFIESFNAAANKPYKLG 83
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N ADLT E+F AS G K P+ + FK N + + +IDW +GAVT +
Sbjct: 84 VNHLADLTVEEFKASRNGLK-----RPYELSTTPFKYENVTAIP--AAIDWRTKGAVTSI 136
Query: 108 KDQG--SYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFE 162
KDQG + CWAF+ VA EG+++I TG+LV+ S+ +LVDC T GC ++E+ FE
Sbjct: 137 KDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGFE 196
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I + + SE YPY+ D C+ A+ I+GY+ V P +E+ LQ V+ QP
Sbjct: 197 FIIKNGGITSEANYPYKAV-DGKCN----KATSPVAQIKGYEKVPPNSEKTLQKAVANQP 251
Query: 223 VSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSV+IDA F FY G++ G CG +HGVT VGYG + YWLVKN WGT W
Sbjct: 252 VSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGYGIANGTD----YWLVKNSWGTQW 307
Query: 281 DEGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ RGV GLC IA +++YP
Sbjct: 308 GEKGYVRMQRGVAAKHGLCGIALDSSYP 335
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 191/321 (59%), Gaps = 36/321 (11%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLT 55
+I +HEQWM + + YK+ E+E R +IF +N ++ L +N+FADLT
Sbjct: 34 SIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNKKPYKLGINQFADLT 93
Query: 56 REKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
E+F+AS +K H S+ R+ FK N+S S ++DW ++GAVTPVK+QG
Sbjct: 94 NEEFIASRNKFK----GHMCSSIIRTTTFKYENTSVPS---TVDWRKKGAVTPVKNQGQC 146
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQR 169
CCWAF+A+A EG++KI TG+LV+ S+ +LVDC T GC +++AF++I Q
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNG 206
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
+++E YPYQG D C +S S I GY+ V E LQ V+ QP+SVAIDA
Sbjct: 207 ISTEAGYPYQGV-DGTCKANEASTSA--ATITGYEDVPANNENALQKAVANQPISVAIDA 263
Query: 230 TW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
+ F FY GVFTG CG +HGVT VGYG + + YWLVKN WGT+W E G +R
Sbjct: 264 SGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDG---TKYWLVKNSWGTDWGEEGYIR 320
Query: 288 IFRGVGGS-GLCNIAANAAYP 307
+ R + + GLC IA A+YP
Sbjct: 321 MQRSIDAAEGLCGIAMQASYP 341
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 191/321 (59%), Gaps = 36/321 (11%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLT 55
+I +HEQWM + + YK+ E+E R +IF +N ++ L +N+FADLT
Sbjct: 34 SIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKYIEASNNAGNNKPYKLGINQFADLT 93
Query: 56 REKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
E+F+AS +K H S+ R+ FK N+S S ++DW ++GAVTPVK+QG
Sbjct: 94 NEEFIASRNKFK----GHMCSSIIRTTTFKYENTSVPS---TVDWRKKGAVTPVKNQGQC 146
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQR 169
CCWAF+A+A EG++KI TG+LV+ S+ +LVDC T GC +++AF++I Q
Sbjct: 147 GCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGVDQGCEGGLMDDAFKFIIQNNG 206
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
+++E YPYQG D C +S S I GY+ V E LQ V+ QP+SVAIDA
Sbjct: 207 ISTEAGYPYQGV-DGTCKANEASTSA--ATITGYEDVPANNENALQKAVANQPISVAIDA 263
Query: 230 TW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
+ F FY GVFTG CG +HGVT VGYG + + YWLVKN WGT+W E G +R
Sbjct: 264 SGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISNDG---TKYWLVKNSWGTDWGEEGYIR 320
Query: 288 IFRGVGGS-GLCNIAANAAYP 307
+ R + + GLC IA A+YP
Sbjct: 321 MQRSIDAAEGLCGIAMQASYP 341
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 238 bits (607), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 140/325 (43%), Positives = 192/325 (59%), Gaps = 32/325 (9%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
MSR H+ +++ +HEQWM ++ + YKD AEK+ R IFK N EF L
Sbjct: 25 MSRYLHEA-SMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKLG 83
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N AD T E+F+AS+ GYK H S+ FK N + + +++DW E GAVT V
Sbjct: 84 INHLADQTNEEFVASHNGYK-----HKASHSQTPFKYENVTGVP--NAVDWRENGAVTAV 136
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIR 165
KDQG CWAF+ VA EG+ +I T L++ S+ +LVDC +++ GC ++E FE+I
Sbjct: 137 KDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHGCDGGYMEGGFEFII 196
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ ++SE YPY D CD + ++ I+GY+ V +E+ LQ V+ QPVSV
Sbjct: 197 KNGGISSEANYPYTAV-DGTCDANKEASPA--AQIKGYETVPANSEDALQKAVANQPVSV 253
Query: 226 AIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
IDA + F FY GVFTG CG +HGVT VGYG+T +G Q YW+VKN WGT W E
Sbjct: 254 TIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTD--DGTQ-YWIVKNSWGTQWGEE 310
Query: 284 GSMRIFRGVGG-SGLCNIAANAAYP 307
G +R+ RG GLC IA +A+YP
Sbjct: 311 GYIRMQRGTDAQEGLCGIAMDASYP 335
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 238 bits (606), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 185/314 (58%), Gaps = 32/314 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM + R Y D EKE R+ IFK+N E L +NKFADLT E+F
Sbjct: 4 RHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNEEFR 63
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A + GYK + S+ F++ N S + S+DW + GAVTPVKDQG+ CCWAF+
Sbjct: 64 AMHHGYKRQSSKLMSSS----FRHENLSAIP--TSMDWRKAGAVTPVKDQGTCGCCWAFS 117
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+ K++TG+L++ S+ QLVDC GC ++NAF++I + L SE Y
Sbjct: 118 AVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTSEATY 177
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PYQG D C ++++ I GY+ V E L V++QPVSVA++ F F
Sbjct: 178 PYQG-VDGTCKSKKTASI--EAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQF 234
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVF G CG +H VT +GYGT ++ YWLVKN WGT+W E G MR+ RG+G
Sbjct: 235 YKSGVFKGDCGTYLDHAVTAIGYGTNSDG---TNYWLVKNSWGTSWGESGYMRMQRGIGA 291
Query: 295 -SGLCNIAANAAYP 307
GLC +A +A+YP
Sbjct: 292 REGLCGVAMDASYP 305
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 238 bits (606), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 136/327 (41%), Positives = 192/327 (58%), Gaps = 29/327 (8%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------L 48
SR + ++ A+H+QW+ + YKD EKEMRFKIFK+N E + +
Sbjct: 29 SRPINYEASMRARHDQWIAHHDKVYKDLNEKEMRFKIFKENVERIEAFNAGEDKGYKLGV 88
Query: 49 NKFADLTREKFLASYTGYKPP-PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
NKF+DLT EKF +TGYK P S F+ N + + ++DW ++GAVTP+
Sbjct: 89 NKFSDLTNEKFRVLHTGYKRSHPKVMSSSKPKTHFRYANVTDIP--PTMDWRKKGAVTPI 146
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQ CCWAF+AVA EGL++++TG+L+ S+ +LVDC GC+ L+ AF++
Sbjct: 147 KDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVEGEDEGCSGGLLDTAFDF 206
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + + L +E YPY+G +D C+ +S+ S I GY+ V +E+ L V+ QPV
Sbjct: 207 ILKNKGLTTEANYPYKG-EDGVCNKKKSALSA--AKIAGYEDVPANSEKALLQAVANQPV 263
Query: 224 SVAIDATWFN--FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVAID + F+ FY GVF+G C NH VT VGYG TT+ YW++KN WG+ W
Sbjct: 264 SVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDG---TKYWIIKNSWGSKWG 320
Query: 282 EGGSMRIFRGV-GGSGLCNIAANAAYP 307
+ G MRI R V GLC +A +A+YP
Sbjct: 321 DSGYMRIKRDVHEKEGLCGLAMDASYP 347
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 237 bits (605), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 184/314 (58%), Gaps = 32/314 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFL 60
+HE WMV++ R YKD EK R+KIFK N F L +N+FADLT E+F
Sbjct: 38 RHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFR 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS +K H S + FK N + + ++DW ++GAVTP+KDQG CWAF+
Sbjct: 98 ASRNRFKA----HICSTEATSFKYENVTAVP--STVDWRKKGAVTPIKDQGQCGSCWAFS 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+ ++ TG+L++ S+ +LVDC T GC+ +++AF++I Q L +E Y
Sbjct: 152 AVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANY 211
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNF 234
PY G D C+ R A+ I GY+ V E+ LQ V+ QP++VAIDA + F F
Sbjct: 212 PYAG-TDGTCN--RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQF 268
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-G 293
Y GVFTG CG +HGV+ VGYGT+ + YWLVKN WGT W E G +R+ R V
Sbjct: 269 YSSGVFTGQCGTELDHGVSAVGYGTSDDG---MKYWLVKNSWGTGWGEEGYIRMQRDVTA 325
Query: 294 GSGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 326 KEGLCGIAMQASYP 339
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 139/319 (43%), Positives = 181/319 (56%), Gaps = 36/319 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
KHEQWM F R Y D +EK RF+IF N +F L +N+F+DLT E+F
Sbjct: 34 KHEQWMSRFNRVYSDDSEKTSRFEIFTNNLKFVESINMNTNKTYTLDVNEFSDLTDEEFK 93
Query: 61 ASYTGYKPP------PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
A YTG P T H S ++N+ + +S+DW + GAVT VK Q
Sbjct: 94 ARYTGLVVPEGMTRISTTDSHETVSFRYENVGETG----ESMDWIQEGAVTSVKHQQQCG 149
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQRLAS 172
CCWAF+AVA VEG+ KI G+LV+ S+ QL+DCST NGC + AF+YI++ Q + +
Sbjct: 150 CCWAFSAVAAVEGMTKIANGELVSLSEQQLLDCSTENNGCGGGIMWKAFDYIKENQGITT 209
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPYQG Q C+ +A+ I GY+ V EE L VS+QPVSVAI+ + +
Sbjct: 210 EDNYPYQGAQQ-TCESNHLAAA----TISGYETVPQNDEEALLKAVSQQPVSVAIEGSGY 264
Query: 233 NFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F H GG+F G CG H VTIVGYG + E YWL+KN WG +W E G MRI R
Sbjct: 265 EFIHYSGGIFNGECGTQLTHAVTIVGYGVSEEG---IKYWLLKNSWGESWGENGYMRIMR 321
Query: 291 GVGG-SGLCNIAANAAYPL 308
V G+C +A+ A YP+
Sbjct: 322 DVDSPQGMCGLASLAYYPV 340
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 187/318 (58%), Gaps = 34/318 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+A +HEQWM ++ R YK++ EK R+ IFK+N E+ L +N FADLT +
Sbjct: 33 MAVRHEQWMAQYGRVYKNEVEKTKRYNIFKENVEYIESFNKAGTKPYKLGINAFADLTNK 92
Query: 58 KFLASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+F+AS GY PH SN F+ N S + ++DW ++GAVTPVKDQG CC
Sbjct: 93 EFIASRNGYI-----LPHECSSNTPFRYENVSAVP--TTVDWRKKGAVTPVKDQGQCGCC 145
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
WAF+AVA +EG+ K+ TG L++ S+ +LVDC GC +++AF +I + L +
Sbjct: 146 WAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFTFIINNKGLTT 205
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPYQG D C +S +S I GY+ V +E L+ V+ QPVSVAIDA +
Sbjct: 206 ESNYPYQGT-DGSCK--KSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGS 262
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F FY GVFTG CG +HGVT VGYG AE YWLVKN WGT+W E G +R+ +
Sbjct: 263 DFQFYSSGVFTGECGTELDHGVTAVGYGI---AEDGSKYWLVKNSWGTSWGEKGYIRMQK 319
Query: 291 GV-GGSGLCNIAANAAYP 307
+ GLC IA ++YP
Sbjct: 320 DIEAKEGLCGIAMQSSYP 337
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 236 bits (603), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 135/315 (42%), Positives = 186/315 (59%), Gaps = 31/315 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKF 59
+H QWM ++ + YKD E+E RFKIF +N + L +N+FADLT ++F
Sbjct: 37 RHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTLGVNQFADLTNDEF 96
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+S +K R++ FK N+S + S+DW ++GAVTPVK+QG CCWAF
Sbjct: 97 TSSRNKFKGHMCSSI--TRTSTFKYENASAIP--SSVDWRKKGAVTPVKNQGQCGCCWAF 152
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+AVA EG++K+ TG+L++ S+ +LVDC T GC +++AF++I Q L +E
Sbjct: 153 SAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEAN 212
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FN 233
YPYQG D C+ + S + I GY+ V E+ LQ V+ QP+SVAIDA+ F
Sbjct: 213 YPYQGV-DGTCNANKGSINAV--TITGYEDVPTNNEQALQKAVANQPISVAIDASGSDFQ 269
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GVFTG CG +HGVT VGYG + + YWLVKN WGT W E G + + RGV
Sbjct: 270 FYKSGVFTGSCGTELDHGVTAVGYGVSNDG---TKYWLVKNSWGTEWGEEGYIMMQRGVD 326
Query: 294 GS-GLCNIAANAAYP 307
+ GLC IA A+YP
Sbjct: 327 AAEGLCGIAMQASYP 341
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 236 bits (602), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 136/325 (41%), Positives = 191/325 (58%), Gaps = 34/325 (10%)
Query: 5 SHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKF 51
S + ++ +HEQWM + + YKD E+E RF+IFK+N + L +N+F
Sbjct: 29 SLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQF 88
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKD 109
ADLT E+F+A +K H S+ R+ FK N + + ++DW ++GAVTP+KD
Sbjct: 89 ADLTNEEFIAPRNRFK----GHMCSSIIRTTTFKYENVTAVP--STVDWRQKGAVTPIKD 142
Query: 110 QGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIR 165
QG CCWAF+AVA EG++ + +G+L++ S+ +LVDC T GC +++AF+++
Sbjct: 143 QGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVI 202
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
Q L +E YPY+G D C+ + A+ I GY+ V E+ LQ V+ QPVSV
Sbjct: 203 QNHGLNTEANYPYKGV-DGKCN--VNEAANDAATITGYEDVPANNEKALQKAVANQPVSV 259
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AIDA+ F FY GVFTG CG +HGVT VGYG + + YWLVKN WGT W E
Sbjct: 260 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDG---TEYWLVKNSWGTEWGEE 316
Query: 284 GSMRIFRGVGG-SGLCNIAANAAYP 307
G +R+ RGV GLC IA A+YP
Sbjct: 317 GYIRMQRGVNSEEGLCGIAMQASYP 341
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 135/311 (43%), Positives = 184/311 (59%), Gaps = 29/311 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+ E+WM E+ R YKD EK RF+IFK N L +NKF D+T +F+
Sbjct: 36 RFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFV 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
YTG P S F ++N S + SIDW + GAVT VKDQ CWAF+
Sbjct: 96 TQYTGVSLPLNFKREPVVS--FDDVNISAVG--QSIDWRDYGAVTEVKDQNPCGSCWAFS 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A+ATVEG+ KI TG LV+ S+ +++DC+ NGC F++NA+++I +ASE YPYQ
Sbjct: 152 AIATVEGIYKIVTGYLVSLSEQEVLDCAVSNGCDGGFVDNAYDFIISNNGVASEADYPYQ 211
Query: 180 GRQ-DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YH 236
+ D + W +SA I GY YV+ E ++ V QP++ AIDA+ NF Y+
Sbjct: 212 AYEGDCTANSWPNSA-----YITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYN 266
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSG 296
GGVF+GPCG + NH +TI+GYG ++ G Q YW+VKN WG++W E G +R+ RGV SG
Sbjct: 267 GGVFSGPCGTSLNHAITIIGYG--QDSSGTQ-YWIVKNSWGSSWGERGYVRMARGVSSSG 323
Query: 297 LCNIAANAAYP 307
LC IA + YP
Sbjct: 324 LCGIAMDPLYP 334
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 185/318 (58%), Gaps = 34/318 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ +HEQWM ++ R Y+++ EK RF IFK+N E+ L +N FADLT +
Sbjct: 35 MVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKPYKLGINAFADLTNQ 94
Query: 58 KFLASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+F AS GYK PH SN F+ N S S ++DW +GAVTPVKDQG CC
Sbjct: 95 EFKASRNGYK-----LPHDCSSNTPFRYENVS--SVPTTVDWRTKGAVTPVKDQGQCGCC 147
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
WAF+AVA +EG+ K+ TG L++ S+ +LVDC GC +++AF +I + L +
Sbjct: 148 WAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDAFSFIINNKGLTT 207
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPYQG D C +S +S I GY+ V +E L+ V+ QPVSVAIDA +
Sbjct: 208 ESNYPYQGT-DGSCK--KSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVAIDAGGS 264
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F FY GVFTG CG +HGVT VGYG AE YWLVKN WGT+W E G +R+ +
Sbjct: 265 DFQFYSSGVFTGECGTELDHGVTAVGYGI---AEDGSKYWLVKNSWGTSWGEKGYIRMQK 321
Query: 291 GV-GGSGLCNIAANAAYP 307
+ GLC IA ++YP
Sbjct: 322 DIEAKEGLCGIAMQSSYP 339
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 236 bits (602), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 136/325 (41%), Positives = 191/325 (58%), Gaps = 34/325 (10%)
Query: 5 SHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKF 51
S + ++ +HEQWM + + YKD E+E RF+IFK+N + L +N+F
Sbjct: 576 SLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQF 635
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKD 109
ADLT E+F+A +K H S+ R+ FK N + + ++DW ++GAVTP+KD
Sbjct: 636 ADLTNEEFIAPRNRFK----GHMCSSIIRTTTFKYENVTAVP--STVDWRQKGAVTPIKD 689
Query: 110 QGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIR 165
QG CCWAF+AVA EG++ + +G+L++ S+ +LVDC T GC +++AF+++
Sbjct: 690 QGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVI 749
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
Q L +E YPY+G D C+ + A+ I GY+ V E+ LQ V+ QPVSV
Sbjct: 750 QNHGLNTEANYPYKG-VDGKCN--ANEAANDVVTITGYEDVPANNEKALQKAVANQPVSV 806
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AIDA+ F FY GVFTG CG +HGVT VGYG + + YWLVKN WGT W E
Sbjct: 807 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDG---TEYWLVKNSWGTEWGEE 863
Query: 284 GSMRIFRGVGG-SGLCNIAANAAYP 307
G +R+ RGV GLC IA A+YP
Sbjct: 864 GYIRMQRGVDSEEGLCGIAMQASYP 888
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 236 bits (601), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 134/328 (40%), Positives = 193/328 (58%), Gaps = 29/328 (8%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
+SR + + A+H+QW+V + YKD EKE+RF+IFK+N E +
Sbjct: 28 LSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQIFKENVERIEAFNAGEDKGYKLG 87
Query: 48 LNKFADLTREKFLASYTGYK-PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
NKF+DLT E+F +TGYK P S F+ N + + ++DW ++GAVTP
Sbjct: 88 FNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHFRYTNVTDIP--PTMDWRKKGAVTP 145
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFE 162
+KDQ CCWAF+AVA +EGL++++TG+L+ S+ +LVDC GC+ L+ AF+
Sbjct: 146 IKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQELVDCDVEGEDEGCSGGLLDTAFD 205
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I + + L +E YPY+G +D C+ +S+ S I GY+ V +E+ L V+ QP
Sbjct: 206 FILKNKGLTTEVNYPYKG-EDGVCNKKKSALSA--AKITGYEDVPANSEKALLQAVANQP 262
Query: 223 VSVAIDATWFN--FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSVAID + F+ FY GVF+G C NH VT VGYG TT+ YW++KN WG+ W
Sbjct: 263 VSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTDG---TKYWIIKNSWGSKW 319
Query: 281 DEGGSMRIFRGV-GGSGLCNIAANAAYP 307
+ G MRI R V GLC +A +A+YP
Sbjct: 320 GDSGYMRIKRDVHEKEGLCGLAMDASYP 347
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 235 bits (600), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 132/327 (40%), Positives = 190/327 (58%), Gaps = 34/327 (10%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
MSR +++ ++ +HEQWM E+ + YKD EKE RF IFK N EF L
Sbjct: 26 MSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLS 85
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N ADLT ++F AS GYK D + S ++N+ + +++DW +GAVTP+
Sbjct: 86 VNHLADLTLDEFKASRNGYK--KIDREFATTSFKYENVT----AIPEAVDWRVKGAVTPI 139
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CWAF+ VA +EG+N+I TG+L++ S+ +LVDC T GC +E+ FE+
Sbjct: 140 KDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + + SE YPY+ D C+ ++ + I GY+ V +E L V+ QP+
Sbjct: 200 IIKNGGITSETNYPYKA-ADGSCN---TATTAPVAKITGYEKVPVNSEISLLKAVANQPI 255
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SV+IDA + F FY G++TG CG +HGVT VGYG+ + YW+VKN WGT W
Sbjct: 256 SVSIDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSANGTD----YWIVKNSWGTVWG 311
Query: 282 EGGSMRIFRGVGG-SGLCNIAANAAYP 307
E G +R+ RG+ GLC IA +++YP
Sbjct: 312 EKGYIRMQRGIADKEGLCGIAMDSSYP 338
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 141/319 (44%), Positives = 180/319 (56%), Gaps = 32/319 (10%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTRE 57
++A +H +WM RTYKD AEKE R IFK N E+ L N+FADLT E
Sbjct: 30 SMAERHVEWMARHGRTYKDAAEKEQRLGIFKSNVEYIESFNAGKRKYQLAANQFADLTHE 89
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--C 115
+F A +TG+KP T + N F++ S S DS+DW +GAVTPVKDQG C C
Sbjct: 90 EFKAMHTGFKPSGTGAKKAG--NGFRH--GSLSSVPDSVDWRSKGAVTPVKDQG-LCGSC 144
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
WAFT VA VEG+ KI TG+L++ S+ QLVDC GC ++ AFE+I + S
Sbjct: 145 WAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGGITS 204
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW- 231
E YPY+ Q C+ +AS I ++ V E+ L+ V+ QPVSV IDA
Sbjct: 205 EANYPYEEVQ-RLCN--AHNASFVVATIESHEDVPTNDEKALRKAVANQPVSVGIDAGSS 261
Query: 232 --FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y GGVF+G CG +H VT+VGYGTT++ YWL KN WG W E G +R+
Sbjct: 262 LDFQLYSGGVFSGECGTDLDHAVTVVGYGTTSDG---TKYWLAKNSWGETWGENGYIRME 318
Query: 290 RGVGG-SGLCNIAANAAYP 307
R V GLC IA A+YP
Sbjct: 319 RDVAAKEGLCGIAMQASYP 337
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 184/317 (58%), Gaps = 30/317 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
++A+HEQWM + + Y D AEKE RFKIFK N E+ L +NKFAD T E
Sbjct: 34 MSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYIESFNTAGNKPYKLSVNKFADQTNE 93
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
KF + GY+ P P S ++N+ + ++DW ++GAVTP+KDQG CW
Sbjct: 94 KFKGARNGYRRPFQTRPMKVTSFKYENVTAVPA----TMDWRKKGAVTPIKDQGQCGSCW 149
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
AF+ VA EG+N++ TG+LV+ S+ +LVDC GC +E+ FE+I + + +E
Sbjct: 150 AFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGEDQGCEGGLMEDGFEFIIKNHGITTE 209
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPYQ D C+ + ++ I GY+ V +E L VV+ QP+SV+IDA +
Sbjct: 210 ANYPYQA-ADGTCNSKKQAS--HIAKITGYESVPANSEAELLKVVANQPISVSIDAGGSD 266
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GVFTG CG +HGVT VGYG T++ YWLVKN W T+W E G +R+ R
Sbjct: 267 FQFYSSGVFTGKCGTELDHGVTAVGYGETSDG---TKYWLVKNSWXTSWGEEGYIRMQRD 323
Query: 292 VGG-SGLCNIAANAAYP 307
+ GLC IA +++YP
Sbjct: 324 IDAEEGLCGIAMDSSYP 340
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 132/327 (40%), Positives = 189/327 (57%), Gaps = 34/327 (10%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
MSR +++ ++ +HEQWM E+ + YKD EKE RF IFK N EF L
Sbjct: 26 MSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPYKLS 85
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N ADLT ++F AS GYK D + S ++N+ + +++DW +GAVTP+
Sbjct: 86 VNHLADLTLDEFKASRNGYK--KIDREFATTSFKYENVT----AIPEAVDWRVKGAVTPI 139
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CWAF+ VA +EG+N+I TG+L++ S+ +LVDC T GC +E+ FE+
Sbjct: 140 KDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + + SE YPY+ D C ++ + I GY+ V +E L V+ QP+
Sbjct: 200 IIKNGGITSETNYPYKA-ADGSC---SAATTAPVAKITGYEKVPVNSEISLLKAVANQPI 255
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SV+IDA + F FY G++TG CG +HGVT VGYG+ + YW+VKN WGT W
Sbjct: 256 SVSIDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSANGTD----YWIVKNSWGTVWG 311
Query: 282 EGGSMRIFRGVGG-SGLCNIAANAAYP 307
E G +R+ RG+ GLC IA +++YP
Sbjct: 312 EKGYIRMQRGIADKEGLCGIAMDSSYP 338
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Brachypodium distachyon]
Length = 338
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 178/316 (56%), Gaps = 27/316 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-----------HEF-LRLNKFADLTREK 58
IAA+HEQWM + R Y D AEK R ++FK N H+F L N+FAD+T+++
Sbjct: 29 IAARHEQWMARYGRVYSDVAEKARRLEVFKANVGFIESVNAGNHKFWLEANQFADITKDE 88
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
F A + GYK R+ F+ N S S+DW GAVTPVKDQG CCWA
Sbjct: 89 FRAMHKGYKMQVIGSKA--RATGFRYANVSIDDLPASVDWRANGAVTPVKDQGQCGCCWA 146
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+ VA++EG+ K+ TG+L++ S+ +LVDC GC ++NAFE+I L +E
Sbjct: 147 FSTVASMEGIVKVSTGKLISLSEQELVDCDVGMQNKGCGGGLMDNAFEFIVNNGGLDTEA 206
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY G D C+ + S +I+GY+ V E LQ V+ QPVS+A+D F
Sbjct: 207 DYPYTG-ADGTCN--SNKESNIAASIKGYEDVPANDEASLQKAVAAQPVSIAVDGGDDLF 263
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GGV TG CG +HGV VGYG + YWLVKN WGT+W E G +R+ R V
Sbjct: 264 RFYKGGVLTGACGTELDHGVAAVGYGVAGDG---TKYWLVKNSWGTSWGEDGFIRLERDV 320
Query: 293 GG-SGLCNIAANAAYP 307
+G+C +A +YP
Sbjct: 321 ADEAGMCGLAMKPSYP 336
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 235 bits (600), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 136/325 (41%), Positives = 191/325 (58%), Gaps = 34/325 (10%)
Query: 5 SHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKF 51
S + ++ +HEQWM + + YKD E+E RF+IFK+N + L +N+F
Sbjct: 47 SLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAFNNAANKRYKLAINQF 106
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKD 109
ADLT E+F+A +K H S+ R+ FK N + + ++DW ++GAVTP+KD
Sbjct: 107 ADLTNEEFIAPRNRFK----GHMCSSIIRTTTFKYENVTAVP--STVDWRQKGAVTPIKD 160
Query: 110 QGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIR 165
QG CCWAF+AVA EG++ + +G+L++ S+ +LVDC T GC +++AF+++
Sbjct: 161 QGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFVI 220
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
Q L +E YPY+G D C+ + A+ I GY+ V E+ LQ V+ QPVSV
Sbjct: 221 QNHGLNTEANYPYKGV-DGKCN--ANEAANDVVTITGYEDVPANNEKALQKAVANQPVSV 277
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AIDA+ F FY GVFTG CG +HGVT VGYG + + YWLVKN WGT W E
Sbjct: 278 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSNDG---TEYWLVKNSWGTEWGEE 334
Query: 284 GSMRIFRGVGG-SGLCNIAANAAYP 307
G +R+ RGV GLC IA A+YP
Sbjct: 335 GYIRMQRGVDSEEGLCGIAMQASYP 359
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 190/316 (60%), Gaps = 34/316 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFL 60
+HE+WM +A+ YKD E+E RFKIFK+N ++ +N+FADLT E+F+
Sbjct: 38 RHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEFI 97
Query: 61 ASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
A +K H S+ R+ FK N + + ++DW ++GAVTP+KDQG CCWA
Sbjct: 98 APRNRFK----GHMCSSITRTTTFKYENVTAIP--STVDWRQKGAVTPIKDQGQCGCCWA 151
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA EG++ + G+L++ S+ ++VDC T GCA F++ AF++I Q L +E
Sbjct: 152 FSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEP 211
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY+ D C+ +A+ I GY+ V E+ LQ V+ QPVSVAIDA+ F
Sbjct: 212 NYPYKAV-DGKCN--AKAAANHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDF 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTG CG +HGVT VGYG + A+G + YWLVKN WGT W E G +R+ RGV
Sbjct: 269 QFYQSGVFTGSCGTELDHGVTAVGYGVS--ADGTE-YWLVKNSWGTEWGEEGYIRMQRGV 325
Query: 293 GG-SGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 326 KAEEGLCGIAMMASYP 341
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 190/316 (60%), Gaps = 34/316 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFL 60
+HE+WM +A+ YKD E+E RFKIFK+N ++ +N+FADLT E+F+
Sbjct: 38 RHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEFI 97
Query: 61 ASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
A +K H S+ R+ FK N + + ++DW ++GAVTP+KDQG CCWA
Sbjct: 98 APRNRFK----GHMCSSITRTTTFKYENVTAIP--STVDWRQKGAVTPIKDQGQCGCCWA 151
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA EG++ + G+L++ S+ ++VDC T GCA F++ AF++I Q L +E
Sbjct: 152 FSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEP 211
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY+ D C+ +A+ I GY+ V E+ LQ V+ QPVSVAIDA+ F
Sbjct: 212 NYPYKAV-DGKCN--AKAAANHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDF 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTG CG +HGVT VGYG + A+G + YWLVKN WGT W E G +R+ RGV
Sbjct: 269 QFYQSGVFTGSCGTELDHGVTAVGYGVS--ADGTE-YWLVKNSWGTEWGEEGYIRMQRGV 325
Query: 293 GG-SGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 326 KAEEGLCGIAMMASYP 341
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 235 bits (599), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 137/326 (42%), Positives = 189/326 (57%), Gaps = 33/326 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRL 48
+R+ H+ ++ +HE WMV++ R YKD EK R+KIFK N F L +
Sbjct: 27 ARSLHE-ASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSI 85
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N+FADLT E+F AS +K H S + FK N + + ++DW ++GAVTP+K
Sbjct: 86 NEFADLTNEEFRASRNRFKA----HICSTEATSFKYENVTAVP--STVDWRKKGAVTPIK 139
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYI 164
DQG CWAF+AVA +EG+ ++ TG+L++ S+ +LVDC T GC+ +++AF++I
Sbjct: 140 DQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFI 199
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
Q L +E YPY G D C+ R A+ I GY+ V E+ LQ V+ QP++
Sbjct: 200 EQNHGLTTEANYPYAGT-DGTCN--RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIA 256
Query: 225 VAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAIDA+ F FY GVFTG CG +HGV VGYGT+ + YWLVKN W T W E
Sbjct: 257 VAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDG---MKYWLVKNSWSTGWGE 313
Query: 283 GGSMRIFRGV-GGSGLCNIAANAAYP 307
G +R+ R V GLC IA A+YP
Sbjct: 314 EGYIRMQRDVTAKEGLCGIAMQASYP 339
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 234 bits (598), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 129/315 (40%), Positives = 180/315 (57%), Gaps = 24/315 (7%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+AA+HE+WM + R YKD AEK R ++FK N F L +N+FADLT E
Sbjct: 40 MAARHERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSE 99
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F A+ T K T + S FK N S + S+DW +GAVT +KDQG CCW
Sbjct: 100 EFKATMTNSKGFSTPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCW 159
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASE 173
AF+AVA +EG+ K+ TG+L++ S+ +LVDC GC ++ AF++I L +E
Sbjct: 160 AFSAVAAMEGIVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAE 219
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY +D C ++A+ +IRGY+ V E L V+ QPVSVA+DA+ F
Sbjct: 220 ANYPYTA-EDGRCK--TTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDASKFQ 276
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GGV G CG + +HGVT++GYG ++ YWLVKN WGT W E G +R+ + +
Sbjct: 277 FYGGGVMAGECGTSLDHGVTVIGYGAASDG---TKYWLVKNSWGTTWGEAGYLRMEKDID 333
Query: 294 GS-GLCNIAANAAYP 307
G+C +A +YP
Sbjct: 334 DKRGMCGLAMQPSYP 348
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 132/317 (41%), Positives = 184/317 (58%), Gaps = 30/317 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
++A+HEQWM + + Y D AEKE RFKIFK N E+ L +NKFAD T E
Sbjct: 34 MSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYIESFNTAGNKPYKLSVNKFADQTNE 93
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
KF + GY+ P P S ++N+ + ++DW ++GAVT +KDQG CW
Sbjct: 94 KFKGARNGYRRPFQTRPMKVTSFKYENVTAVPA----TMDWRKKGAVTLIKDQGQCGSCW 149
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
AF+ VA EG+N++ TG+LV+ S+ +LVDC GC +E+ FE+I + + +E
Sbjct: 150 AFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGEDQGCEGGLMEDGFEFIIKNHGITTE 209
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPYQ D C+ + ++ I GY+ V +E L VV+ QP+SV+IDA +
Sbjct: 210 ANYPYQA-ADGTCNSKKQAS--HIAKITGYESVPANSEAELLKVVANQPISVSIDAGGSD 266
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GVFTG CG +HGVT VGYG T++ YWLVKN WGT+W E G +R+ R
Sbjct: 267 FQFYSSGVFTGKCGTELDHGVTAVGYGETSDG---TKYWLVKNSWGTSWGEEGYIRMQRD 323
Query: 292 VGG-SGLCNIAANAAYP 307
+ GLC IA +++YP
Sbjct: 324 IDTEEGLCGIAMDSSYP 340
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 234 bits (596), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 182/314 (57%), Gaps = 32/314 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFL 60
+HE WMV++ R YKD EK R+KIFK N F L +N+FADLT E+F
Sbjct: 38 RHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFR 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS +K H S + FK N + + ++DW ++GAVTP+KDQG CWAF+
Sbjct: 98 ASRNRFKA----HICSTEATSFKYENVTAVP--STVDWRKKGAVTPIKDQGQCGSCWAFS 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+ ++ TG+L++ S+ +LVDC T GC+ +++AF++I Q L +E Y
Sbjct: 152 AVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANY 211
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY G D C+ R A+ I GY+ V E+ LQ V+ QP++VAIDA+ F F
Sbjct: 212 PYAGT-DGTCN--RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDASGSEFQF 268
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG- 293
Y GVFTG CG +HGV VGYGT+ + YWLVKN W T W E G +R+ R V
Sbjct: 269 YSSGVFTGQCGTELDHGVAAVGYGTSDDG---MKYWLVKNSWSTGWGEEGYIRMQRDVTV 325
Query: 294 GSGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 326 KEGLCGIAMQASYP 339
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 233 bits (595), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 136/317 (42%), Positives = 189/317 (59%), Gaps = 34/317 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+H+QWM ++A+ Y D E E RF+IFK+N + L +N+F DLT E+F+
Sbjct: 38 RHQQWMGQYAKIYNDHQEWEKRFQIFKENVNYIETSNKEGGRFYKLGVNQFVDLTNEEFI 97
Query: 61 ASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
A +K H S+ R+N +K N + + ++DW ++GAVTPVKDQG CCWA
Sbjct: 98 APRNRFKG----HMCSSIIRTNTYKYENVTTVP--SNVDWRQKGAVTPVKDQGQCGCCWA 151
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA EG++++ TG+L++ S+ +LVDC T GC +++AF++I Q L +E
Sbjct: 152 FSAVAATEGIHQLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLDTEA 211
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPYQG D C+ + AS I Y+ V E+ LQ V+ QP+SVAIDA+ F
Sbjct: 212 KYPYQGV-DGTCN--ANEASINAATITSYEDVPTNNEQALQKAVANQPISVAIDASGSDF 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTG CG +HGVT VGYG + + YWLVKN WGT+W E G +R+ RGV
Sbjct: 269 QFYTSGVFTGSCGTELDHGVTAVGYGVSDDG---TKYWLVKNSWGTSWGEEGYIRMQRGV 325
Query: 293 GG-SGLCNIAANAAYPL 308
GLC IA A+YP+
Sbjct: 326 DAVEGLCGIAMQASYPI 342
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 233 bits (595), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 129/315 (40%), Positives = 179/315 (56%), Gaps = 24/315 (7%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+AA+HE+WM + R YKD AEK R ++FK N F L +N+FADLT E
Sbjct: 40 MAARHERWMAQHGRVYKDAAEKARRLEVFKANVAFIESFNAGGKNRYWLGVNQFADLTSE 99
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F A+ T K T + S FK N S + S+DW +GAVT +KDQG CCW
Sbjct: 100 EFKATMTNSKGFSTPNNGVRVSTGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCW 159
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASE 173
AF+AVA +EG K+ TG+L++ S+ +LVDC GC ++ AF++I L +E
Sbjct: 160 AFSAVAAMEGFVKLSTGKLISLSEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAE 219
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY +D C ++A+ +IRGY+ V E L V+ QPVSVA+DA+ F
Sbjct: 220 ANYPYTA-EDGRCK--TTAAADVAASIRGYEDVPANDEPSLMKAVAGQPVSVAVDASKFQ 276
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GGV G CG + +HGVT++GYG ++ YWLVKN WGT W E G +R+ + +
Sbjct: 277 FYGGGVMAGECGTSLDHGVTVIGYGAASDG---TKYWLVKNSWGTTWGEAGYLRMEKDID 333
Query: 294 GS-GLCNIAANAAYP 307
G+C +A +YP
Sbjct: 334 DKRGMCGLAMQPSYP 348
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 233 bits (594), Expect = 9e-59, Method: Compositional matrix adjust.
Identities = 133/327 (40%), Positives = 188/327 (57%), Gaps = 34/327 (10%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
MSR +++ ++ +HEQWM E + Y+D EKE RF IFK N EF L
Sbjct: 26 MSRKLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKLS 85
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N ADLT ++F AS GYK D + S ++N+ + ++DW +GAVTP+
Sbjct: 86 VNHLADLTLDEFKASRNGYK--KIDREFTTTSFKYENVT----AIPAAVDWRVKGAVTPI 139
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CWAF+ VA EG+N+I TG+LV+ S+ +LVDC T GC +E+ FE+
Sbjct: 140 KDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + + SE YPY+ D C+ ++ K I GY+ V +E+ L V+ QP+
Sbjct: 200 IIKNGGITSETNYPYKA-ADGSCNTATTTPVAK---ITGYEKVPVNSEKSLLKAVANQPI 255
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SV+IDA + F FY G++TG CG +HGVT VGYG+ + YW+VKN WGT W
Sbjct: 256 SVSIDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSANGTD----YWIVKNSWGTVWG 311
Query: 282 EGGSMRIFRGVGG-SGLCNIAANAAYP 307
E G +R+ RG+ GLC IA +++YP
Sbjct: 312 EKGYIRMQRGIAAKEGLCGIAMDSSYP 338
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 135/329 (41%), Positives = 190/329 (57%), Gaps = 35/329 (10%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
+S + + ++ +HEQWM + R YKD EKE RF IFK+N + L
Sbjct: 25 VSSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSIFKENVNYIEASNNAGDKPYKLG 84
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVT 105
+N+FADLT E+F+A+ +K H S+ R+ FK N + S ++DW + GAVT
Sbjct: 85 VNQFADLTNEEFIATRNKFK----GHMSSSITRTTTFKYENVTAPS---TVDWRQEGAVT 137
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAF 161
PVK+QG+ CCWAF+AVA EG++K+ TG LV+ S+ +LVDC T GC +++AF
Sbjct: 138 PVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQGGLMDDAF 197
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
++I Q L +E YPYQG D C+ + + I GY+ V E+ LQ V+ Q
Sbjct: 198 KFIIQNGGLNTEAQYPYQGV-DGTCN--TNEEATHVATITGYEDVPSNNEQALQQAVANQ 254
Query: 222 PVSVAIDATWFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
P+S+AIDA+ +F Y GVFTG CG +HGV +VGYG + + YWLVKN WG +
Sbjct: 255 PISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGT---KYWLVKNSWGAD 311
Query: 280 WDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
W E G +R+ R V GLC +A +YP
Sbjct: 312 WGEEGYIRMQRDVDAPEGLCGLAMQPSYP 340
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 136/316 (43%), Positives = 181/316 (57%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
E+FLA +TG P + P S FK + S ++DW E GAVT VK+QG C
Sbjct: 94 EEFLAKFTGLNIPNSYLSPSPMSSTEFKINDISDDDMPSNLDWRESGAVTQVKNQGQCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++IR+ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIRENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C N NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCANRINHAVTAIGYGTD---ENGQKYWLLKNSWGTSWGEKGFMKIIRDY 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKLSSYP 341
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 233 bits (593), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 137/317 (43%), Positives = 188/317 (59%), Gaps = 38/317 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HEQWM ++ + YKD EKE+R KIFK+N + L +N+FADLT E+F
Sbjct: 38 RHEQWMAQYGKVYKDSYEKELRSKIFKENVQRIEAFNNAGNKSYKLGINQFADLTNEEFK 97
Query: 61 AS--YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
A + G+ +S R+ FK + + S S+DW ++GAVTP+KDQG CCWA
Sbjct: 98 ARNRFKGHMCS-----NSTRTPTFKYEHVT--SVPASLDWRQKGAVTPIKDQGQCGCCWA 150
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA EG+ K+ TG+L++ S+ +LVDC T GC +++AF++I Q + L +E
Sbjct: 151 FSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGLNTEA 210
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
YPYQG D C+ ++A K A I+G++ V +E L V+ QP+SVAIDA+
Sbjct: 211 KYPYQGV-DATCN---ANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDASGSE 266
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GVFTG CG +HGVT VGYG+ +G YWLVKN WG W E G +R+ R
Sbjct: 267 FQFYSSGVFTGSCGTELDHGVTAVGYGS----DGGTKYWLVKNSWGEQWGEQGYIRMQRD 322
Query: 292 VGG-SGLCNIAANAAYP 307
V GLC A A+YP
Sbjct: 323 VAAEEGLCGFAMQASYP 339
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 232 bits (592), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 183/314 (58%), Gaps = 30/314 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HEQWM + + YKD E+E RF++FK+N + L +N+FADLT ++F+
Sbjct: 38 RHEQWMTRYGKVYKDPQEREKRFRVFKENVNYIEAFNNAANKSYKLGINQFADLTNKEFI 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A G+K + F+N+ ++ ++DW ++GAVTP+KDQG CCWAF+
Sbjct: 98 APRNGFKGHMCSSIIRTTTFKFENVTATP----STVDWRQKGAVTPIKDQGQCGCCWAFS 153
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA EG++ + G+L++ S+ +LVDC T GC +++AF++I Q L +E Y
Sbjct: 154 AVAATEGIHALSAGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLNTEANY 213
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY+G D C+ + A+ I GY+ V E LQ V+ QPVSVAIDA+ F F
Sbjct: 214 PYKGV-DGKCN--ANEAAKNAATITGYEDVPANNEMALQKAVANQPVSVAIDASGSDFQF 270
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG CG +HGVT VGYG + + YWLVKN WGT W E G +R+ RGV
Sbjct: 271 YKSGVFTGSCGTELDHGVTAVGYGVSDDG---TEYWLVKNSWGTEWGEEGYIRMQRGVDS 327
Query: 295 -SGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 328 EEGLCGIAMQASYP 341
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 178/319 (55%), Gaps = 30/319 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
KHEQWM F R Y D++EK RF IFKKN EF L +N+F+DLT E+F
Sbjct: 34 KHEQWMARFNRVYSDESEKRNRFNIFKKNLEFVQSFNMNKNITYKLDVNEFSDLTDEEFR 93
Query: 61 ASYTGYKPPP----TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC-- 114
A++TG P S+++ F+ N S +S+DW + GAVTPVK QG C
Sbjct: 94 ATHTGLVVPEEITGISTLSSDKTVPFRYGNVSDTG--ESMDWRQEGAVTPVKYQGR-CGG 150
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLAS 172
CWAF+AVA VEG+ KI G+LV+ S+ QL+DC T GC + AFEYI + Q + +
Sbjct: 151 CWAFSAVAAVEGITKITKGELVSLSEQQLLDCDTDYNQGCHGGIMSKAFEYIIKNQGITT 210
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT-- 230
E YPYQ Q S+S + I GY+ V EE L VS+QPVSV I+ T
Sbjct: 211 EDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVSQQPVSVGIEGTGA 270
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F Y GG+F G CG +H VTIVGYG + E YW+VKN WG W E G MRI R
Sbjct: 271 GFRHYSGGIFNGECGTDLHHAVTIVGYGMSEEG---TKYWVVKNSWGETWGEDGFMRIKR 327
Query: 291 GVGG-SGLCNIAANAAYPL 308
V G+C +A A YPL
Sbjct: 328 DVDAPQGMCGLAMLAFYPL 346
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 133/315 (42%), Positives = 179/315 (56%), Gaps = 25/315 (7%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGINEFADITS 93
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
E+FL +TG P P S FK + S ++DW E GAVT VK+QG CC
Sbjct: 94 EEFLTKFTGINIPSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCC 153
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASEC 174
WAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++SE
Sbjct: 154 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISSES 213
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FN 233
Y YQG+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 DYEYQGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDLQ 268
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R G
Sbjct: 269 FYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDSG 325
Query: 294 G-SGLCNIAANAAYP 307
G C+IA ++YP
Sbjct: 326 NPGGHCDIAKMSSYP 340
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 232 bits (592), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 137/324 (42%), Positives = 188/324 (58%), Gaps = 27/324 (8%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
SR H ++ +HEQWM ++ + YKD AE E RF IF+ N EF L +
Sbjct: 26 SRKLHDA-SMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNKPYKLSI 84
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N AD T E+F+AS+ GYK FK N + + + ++DW ++G T +K
Sbjct: 85 NHLADQTNEEFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPW--AVDWRQKGDATSIK 142
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQ 166
DQG CWAF+AVA EG+ +I TG LV+ S+ +LVDC +++ GC +E+ FE+I +
Sbjct: 143 DQGQCGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSVDHGCDGGLMEHGFEFIIK 202
Query: 167 YQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVA 226
++SE YPY CD + ++ G I+GY+ V EE LQ V+ QPVSV+
Sbjct: 203 NGGISSEANYPYTAVNGT-CDTNKEASPG--AQIKGYETVPVNCEEELQKAVANQPVSVS 259
Query: 227 IDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
IDA + F FY GVFTG CG +HGVT VGYG+T +G Q YW+VKN WGT W E G
Sbjct: 260 IDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTD--DGIQ-YWIVKNSWGTQWGEEG 316
Query: 285 SMRIFRGVGG-SGLCNIAANAAYP 307
+R+ RG+ GLC IA +A+YP
Sbjct: 317 YIRMLRGIDAQEGLCGIAMDASYP 340
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 190/316 (60%), Gaps = 34/316 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM +A+ YKD E+E RFKIFK+N + L +N+FADLT E+F+
Sbjct: 38 RHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLGINQFADLTNEEFI 97
Query: 61 ASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
A +K H S+ R+ FK N + + ++DW ++GAVTP+KDQG CCWA
Sbjct: 98 APRNKFK----GHMCSSITRTTTFKYENVTALP--STVDWRQKGAVTPIKDQGQCGCCWA 151
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA EG++ + +G+L++ S+ ++VDC T GCA F++ AF++I Q L +E
Sbjct: 152 FSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEA 211
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY+ D C+ + A+ I GY+ V E+ LQ V+ QPVSVAIDA+ F
Sbjct: 212 NYPYKAV-DGKCN--ANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDF 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTG CG +HGVT VGYG + A+G Q YWLVKN WGT W E G + + RGV
Sbjct: 269 QFYKTGVFTGSCGTQLDHGVTAVGYGVS--ADGTQ-YWLVKNSWGTEWGEEGYIMMQRGV 325
Query: 293 GG-SGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 326 KAQEGLCGIAMMASYP 341
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 134/306 (43%), Positives = 182/306 (59%), Gaps = 28/306 (9%)
Query: 19 MVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTG 65
M E+ R YKD EK RF+IFK N L +NKF D+T +F+A YTG
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
P + + F ++N S + SIDW + GAVT VKDQ CWAF+A+ATV
Sbjct: 61 GISRPLNIEKEPVVS-FDDVNISAVG--QSIDWRDYGAVTEVKDQNPCGSCWAFSAIATV 117
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQ-D 183
EG+ KI TG LV+ S+ +++DC+ NGC F++NA+++I +ASE YPYQ Q D
Sbjct: 118 EGIYKIVTGYLVSLSEQEVLDCAVSNGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGD 177
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YHGGVFT 241
+ W +SA I GY YV+ E ++ V QP++ AIDA+ NF Y+GGVF+
Sbjct: 178 CAANSWPNSA-----YITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFS 232
Query: 242 GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIA 301
GPCG + NH +TI+GYG ++ G Q YW+VKN WG++W E G +R+ RGV SGLC IA
Sbjct: 233 GPCGTSLNHAITIIGYG--QDSSGTQ-YWIVKNSWGSSWGERGYIRMARGVSSSGLCGIA 289
Query: 302 ANAAYP 307
+ YP
Sbjct: 290 MDPLYP 295
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 181/314 (57%), Gaps = 32/314 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFL 60
+HE WM ++ R YKD EK R+KIFK N F L +N+FADLT E+F
Sbjct: 38 RHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFR 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS +K H S + FK + + + ++DW ++GAVTP+KDQG CWAF+
Sbjct: 98 ASRNRFKA----HICSTEATSFKYEHVAAVP--STVDWRKKGAVTPIKDQGQCGSCWAFS 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+ ++ TG+L++ S+ +LVDC T GC +++AF++I Q LA+E Y
Sbjct: 152 AVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIEQNHGLATEANY 211
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY G D C+ R A+ I GY+ V E+ LQ V+ QP++VAIDA F F
Sbjct: 212 PYAG-TDGTCN--RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQF 268
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-G 293
Y GVFTG CG +HGV VGYGT+ + YWLVKN WGT W E G +R+ R V
Sbjct: 269 YSSGVFTGQCGTELDHGVAAVGYGTSDDG---MKYWLVKNSWGTGWGEVGYIRMQRDVTA 325
Query: 294 GSGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 326 KEGLCGIAMQASYP 339
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 232 bits (591), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 188/314 (59%), Gaps = 25/314 (7%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNKFADLTREKFL 60
+SRT H+ +++ +HE WM + RTYKD AEKE RFKIFK+N E++ + KF
Sbjct: 23 LSRTLHEV-SMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIE-------SVNKFK 74
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS GY + P S+ F+ N + + S+DW ++GAVTP+KDQG CCWAF+
Sbjct: 75 ASRNGYNM--SSRPRSSEITSFRYENVAAVP--SSMDWRKKGAVTPIKDQGQCGCCWAFS 130
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+ +++TG+L++ S+ +LVDC T GC +++AFE+I L +E Y
Sbjct: 131 AVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQGCGGGLMDSAFEFIIGNGGLTTEANY 190
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNF 234
PY+G D C+ + A+ I+ Y+ V +E L V++ PVSVAIDA + F F
Sbjct: 191 PYKG-VDATCN--KKKAASSAAKIKNYEDVPANSEAALLKAVAQHPVSVAIDAGGSDFQF 247
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG CG +HGVT VGYG T + YWLVKN WGT W E G + + R +G
Sbjct: 248 YSSGVFTGQCGTELDHGVTAVGYGKTDDG---TKYWLVKNSWGTGWGEDGYIWMERDIGA 304
Query: 295 S-GLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 305 DEGLCGIAMEASYP 318
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 231 bits (590), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 190/316 (60%), Gaps = 34/316 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM +A+ YKD E+E RFKIFK+N + L +N+FADLT E+F+
Sbjct: 38 RHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLGINQFADLTNEEFI 97
Query: 61 ASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
A +K H S+ R+ FK N + + ++DW ++GAVTP+KDQG CCWA
Sbjct: 98 APRNRFK----GHMCSSITRTTTFKYENVTALP--STVDWRQKGAVTPIKDQGQCGCCWA 151
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA EG++ + +G+L++ S+ ++VDC T GCA F++ AF++I Q L +E
Sbjct: 152 FSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNTEA 211
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY+ D C+ + A+ I GY+ V E+ LQ V+ QPVSVAIDA+ F
Sbjct: 212 NYPYKAV-DGKCN--ANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDF 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTG CG +HGVT VGYG + A+G Q YWLVKN WGT W E G + + RGV
Sbjct: 269 QFYKTGVFTGSCGTQLDHGVTAVGYGVS--ADGTQ-YWLVKNSWGTEWGEEGYIMMQRGV 325
Query: 293 GG-SGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 326 KAQEGLCGIAMMASYP 341
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 231 bits (590), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 136/328 (41%), Positives = 188/328 (57%), Gaps = 35/328 (10%)
Query: 1 MSRTSHKT-GNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------L 46
+SR H+T ++ +HEQWM ++ + YKD AEKE RF IFK N EF L
Sbjct: 26 ISRELHETETSLIERHEQWMAKYDKVYKDAAEKEKRFLIFKDNVEFIESFNAAGNKPYKL 85
Query: 47 RLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
+N ADLT E+F AS G K D+ S ++N+ + S+DW ++GAVTP
Sbjct: 86 GVNHLADLTIEEFKASRNGLK-RSYDYEVGTTSFKYENVT----AIPASVDWRKKGAVTP 140
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFE 162
+KDQG CWAF+ VA EG++KI TG+LV+ S+ +LVDC T GC ++E+ FE
Sbjct: 141 IKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQELVDCDRKGTDQGCEGGYMEDGFE 200
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I + + +E YPY+ D +A+ I+GY+ V +E+ L V+ QP
Sbjct: 201 FIIKNGGITTEANYPYKA-----VDGSCKNATAPAAQIKGYEKVPVNSEKALLKAVANQP 255
Query: 223 VSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSV+IDA F FY G+FTG CG +HGVT VGYG + YW+VKN WGT W
Sbjct: 256 VSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYGRANGTD----YWIVKNSWGTVW 311
Query: 281 DEGGSMRIFRGVGG-SGLCNIAANAAYP 307
E G +R+ RG+ GLC IA +++YP
Sbjct: 312 GEQGYIRMQRGIAAKEGLCGIAMDSSYP 339
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 183/311 (58%), Gaps = 26/311 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM ++ + Y D AEKE RF+IFK N +F L +N+FADL E+F
Sbjct: 36 RHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFK 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS + + + ++ F+ + +K+ ++DW +RGAVTP+KDQG+ CWAF+
Sbjct: 96 ASLINVQKKESGVETATETS-FRYESITKIPV--TMDWRKRGAVTPIKDQGNCGSCWAFS 152
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYP 177
VA +EG+++I TG+LV+ S+ +LVDC GC + E AFE++ + LASE YP
Sbjct: 153 TVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEGCNFGYKEEAFEFVAKNGGLASEISYP 212
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHG 237
Y+ + C + + I+GY+ V +E+ L V+ QPVSV IDA FY
Sbjct: 213 YKA-NNKTCMVKKETQG--VAQIKGYENVPSNSEKALLKAVANQPVSVYIDAGALQFYSS 269
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSG 296
G+FTG CG PNH VT++GYG +A G YWLVKN WGT W E G +++ R + G
Sbjct: 270 GIFTGKCGTAPNHAVTVIGYG---KARGGAKYWLVKNSWGTKWGEKGYIKMKRDIRAKEG 326
Query: 297 LCNIAANAAYP 307
LC IA NA+YP
Sbjct: 327 LCGIATNASYP 337
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 137/316 (43%), Positives = 189/316 (59%), Gaps = 34/316 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFL 60
+HE+WM +A+ YKD E+E RFKIFK+N ++ +N+FADLT E+F+
Sbjct: 38 RHEEWMGRYAKVYKDPQERERRFKIFKENVNYIEAFNNAANKPYTLGINQFADLTNEEFI 97
Query: 61 ASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
A +K H S+ R+ FK N + + ++DW ++GAVTP+KDQG CCWA
Sbjct: 98 APRNRFK----GHMCSSITRTTTFKYENVTAIP--STVDWRQKGAVTPIKDQGQCGCCWA 151
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA EG++ + G+L++ S+ ++VDC T GCA F++ AF++I Q L +E
Sbjct: 152 FSAVAATEGIHALSAGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFIIQNHGLNNEP 211
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY+ D C+ +A+ I GY+ V E+ LQ V+ QPVSVAIDA+ F
Sbjct: 212 NYPYKAV-DGKCN--AKAAANHVATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDF 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTG CG +HGVT VGYG + A+G + YWLVKN WGT W E G +R+ RGV
Sbjct: 269 QFYQSGVFTGSCGTELDHGVTAVGYGVS--ADGTE-YWLVKNSWGTEWGEEGYIRMQRGV 325
Query: 293 GG-SGLCNIAANAAYP 307
GL IA A+YP
Sbjct: 326 KAEEGLXGIAMMASYP 341
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 231 bits (589), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 181/314 (57%), Gaps = 32/314 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFL 60
+HE WM ++ R YKD EK R+KIFK N F L +N+FADLT E+F
Sbjct: 38 RHEDWMAQYGRVYKDAGEKSKRYKIFKDNVARIESFNKAMNKSYKLSINEFADLTNEEFR 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS +K H S + FK + + ++DW ++GAVTP+KDQG CWAF+
Sbjct: 98 ASRNRFKA----HICSTEATSFKYEHVXAVP--STVDWRKKGAVTPIKDQGQCGSCWAFS 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+ ++ TG+L++ S+ +LVDC T GC+ +++AF++I Q L +E Y
Sbjct: 152 AVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCSGGLMDDAFKFIEQNHGLTTEANY 211
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY G D C+ R A+ I GY+ V E+ LQ V+ QP++VAIDA F F
Sbjct: 212 PYAG-TDGTCN--RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGFEFQF 268
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG- 293
Y GVFTG CG +HGV+ VGYGT+ + YWLVKN WGT W E G +R+ R V
Sbjct: 269 YSSGVFTGQCGTELDHGVSAVGYGTSDDG---MKYWLVKNSWGTGWGEEGYIRMQRDVTE 325
Query: 294 GSGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 326 KEGLCGIAMQASYP 339
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 135/314 (42%), Positives = 179/314 (57%), Gaps = 32/314 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFL 60
+HE WM ++ R YKD EK R+KIFK N F L +N+FADLT E+F
Sbjct: 38 RHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFG 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
S +K H S + FK N + + +IDW ++GAVTP+KDQG CWAF+
Sbjct: 98 TSRNRFKA----HICSTEATSFKYENVTAVP--STIDWRKKGAVTPIKDQGQCGSCWAFS 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+ ++ TG+L++ S+ +LVDC T GC +++AF++I+Q L +E Y
Sbjct: 152 AVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFKFIKQNHGLTTEANY 211
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY G D C+ R A+ I GY+ V E+ LQ V QP++VAIDA F F
Sbjct: 212 PYAGT-DGTCN--RKKAAHPAAKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQF 268
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-G 293
Y GVFTG CG +HGV VGYGT+ + YWLVKN WGT W E G +R+ R V
Sbjct: 269 YSSGVFTGQCGTELDHGVAAVGYGTSDDG---MKYWLVKNSWGTGWGEEGYIRMQRDVTA 325
Query: 294 GSGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 326 KEGLCGIAMQASYP 339
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 231 bits (588), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 182/311 (58%), Gaps = 26/311 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM ++ + Y D AEKE RF+IFK N +F L +N+FADL E+F
Sbjct: 36 RHEKWMAQYGKLYTDAAEKEKRFQIFKNNVQFIESFNAAGDKPFNLSINQFADLHNEEFK 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS + + + ++ F+ + +K+ ++DW +RGAVTP+KDQG+ CWAF+
Sbjct: 96 ASLINVQKKESGVETATETS-FRYESITKIPV--TMDWRKRGAVTPIKDQGNCGSCWAFS 152
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYP 177
VA +EG+++I TG+LV+ S+ +LVDC GC + E AFE++ + LASE YP
Sbjct: 153 IVAAIEGIHQITTGKLVSLSEQELVDCVKGKSEGCNFGYKEEAFEFVAKNGGLASEISYP 212
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHG 237
Y+ + C + + I+GY+ V +E+ L V+ QPVSV IDA FY
Sbjct: 213 YKA-NNKTCMVKKETQG--VAQIKGYENVPSNSEKALLKAVANQPVSVYIDAGALQFYSS 269
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSG 296
G+FTG CG PNH T++GYG +A G YWLVKN WGT W E G +R+ R + G
Sbjct: 270 GIFTGKCGTAPNHAATVIGYG---KARGGAKYWLVKNSWGTKWGEKGYIRMKRDIRAKEG 326
Query: 297 LCNIAANAAYP 307
LC IA NA+YP
Sbjct: 327 LCGIATNASYP 337
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 230 bits (587), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 186/316 (58%), Gaps = 27/316 (8%)
Query: 12 AAKHEQWMVEFARTYKDQAE--KEMRFKIFKKN----HEF-------LRLNKFADLTREK 58
+ +HE+WM + R Y D+ E K RF +FK+N EF L +N+FADLT E+
Sbjct: 34 SMRHEEWMSQHGRVYADEQEDHKNKRFNVFKENVERIEEFNDGKTFKLAINQFADLTNEE 93
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
F ASY G+K P + F+ N S + S+DW ++GAVTPVK+QG CCWA
Sbjct: 94 FRASYNGFKGPMVLSSQITKPTPFRYENVSS-ALPVSVDWRKKGAVTPVKNQGQCGCCWA 152
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA +EG+ +I TG+L++ S+ +LVDC T +GC ++ AFE+I L +E
Sbjct: 153 FSAVAAIEGITQISTGKLISLSEQELVDCDTKGIDHGCEGGLMDTAFEFIINNGGLTTES 212
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY+G +D C++ +++ +I GY+ V E+ L V+ QPVSVAI+A + F
Sbjct: 213 NYPYKG-EDGTCNFNKTNPIAV--SITGYEDVPANDEQALMKAVAHQPVSVAIEAGGSDF 269
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTG CG +H VT VGYG E+E YW+VKN WGT W E G + + + +
Sbjct: 270 QFYSSGVFTGECGTELDHAVTAVGYG---ESEDGSKYWIVKNSWGTKWGESGYIEMQKDI 326
Query: 293 G-GSGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 327 KVKQGLCGIAMQASYP 342
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 230 bits (587), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 129/314 (41%), Positives = 180/314 (57%), Gaps = 30/314 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFL 60
+HEQWM + + YKD EKE RF++FK+N ++ +N+FADLT E+F+
Sbjct: 38 RHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLGINQFADLTSEEFI 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
+ + ++N+ DSIDW ++GAVTP+K+QGS CCWAF+
Sbjct: 98 VPRNRFNGHTRSSNTRTTTFKYENVTV----LPDSIDWRQKGAVTPIKNQGSCGCCWAFS 153
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
A+A EG++KI TG+LV+ S+ ++VDC T +GC +++ AF++I Q + +E Y
Sbjct: 154 AIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASY 213
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY+G D C+ + I GY+ V E+ LQ V+ QPVSVAIDA+ F F
Sbjct: 214 PYKGV-DGKCNIKEEAVHA--ATITGYEDVPINNEKALQKAVANQPVSVAIDASGADFQF 270
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y G+FTG CG +HGVT VGYG E YWLVKN WGT W E G + + RGV
Sbjct: 271 YKSGIFTGSCGTELDHGVTAVGYGENNEG---TKYWLVKNSWGTEWGEEGYIMMQRGVKA 327
Query: 295 -SGLCNIAANAAYP 307
G+C IA A+YP
Sbjct: 328 VEGICGIAMMASYP 341
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 230 bits (586), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 182/317 (57%), Gaps = 32/317 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ +H+QWM E RTYKD+AEK RF++FK N +F L +N+FAD+T +
Sbjct: 45 MKVRHQQWMAEHGRTYKDEAEKARRFQVFKANADFVDRSNAAGGKSYELAINEFADMTND 104
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F+A YTG KP P P ++NL S + ++DW ++GAVT +K+QG CCW
Sbjct: 105 EFVAMYTGLKPVPAG-PKKMAGFKYENLTLSDVD-QQAVDWRQKGAVTGIKNQGQCGCCW 162
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASEC 174
AF AVA VE +++I TG LV+ S+ Q++DC T NGC +++NAF+YI LA+E
Sbjct: 163 AFAAVAAVESIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIISNGGLATED 222
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-TWFN 233
YPY Q + + + I YQ V E L V+ QPV+VAIDA F
Sbjct: 223 AYPYAAAQGTCQSSVQPAVT-----ISSYQDVPSGDEAALAAAVANQPVAVAIDAHNNFQ 277
Query: 234 FYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
FY GV T TP NH VT VGY T AE PYWL+KN+WG NW EGG +R+ R
Sbjct: 278 FYSSGVLTADTCGTPSLNHAVTAVGYST---AEDGTPYWLLKNQWGQNWGEGGYLRVER- 333
Query: 292 VGGSGLCNIAANAAYPL 308
G+ C +A A+YP+
Sbjct: 334 --GTNACGVAQQASYPV 348
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 230 bits (586), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 134/321 (41%), Positives = 188/321 (58%), Gaps = 37/321 (11%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
++ +HEQWM ++ + Y D EKE+R IFK+N + + +N+FADLT
Sbjct: 34 SLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQRIEAFNNAGNKPYKLGINQFADLTN 93
Query: 57 EKFLAS--YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
E+F A + G+ +S R+ FK + S S S+DW ++GAVTP+KDQG
Sbjct: 94 EEFKARNRFKGHMCS-----NSTRTPTFKYEDVS--SVPASLDWRQKGAVTPIKDQGQCG 146
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRL 170
CCWAF+AVA EG+ K+ TG+L++ S+ +LVDC T GC +++AF++I Q + L
Sbjct: 147 CCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFIMQNKGL 206
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
+E YPYQG D C+ ++A K A I+G++ V +E L V+ QP+SVAIDA
Sbjct: 207 NTEAKYPYQGV-DATCN---ANAEAKDAASIKGFEDVPANSESALLKAVANQPISVAIDA 262
Query: 230 TW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
+ F FY G+FTG CG +HGVT VGYG + + YWLVKN WG W E G +R
Sbjct: 263 SGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVSDDG---TKYWLVKNSWGEQWGEEGYIR 319
Query: 288 IFRGVGG-SGLCNIAANAAYP 307
+ R V GLC IA A+YP
Sbjct: 320 MQRDVAAEEGLCGIAMQASYP 340
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 230 bits (586), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 185/314 (58%), Gaps = 33/314 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFL 60
+HEQWM ++ R YKD AEKE R+ IFK+N F L +N+FADL+ E+F
Sbjct: 38 RHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFK 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS +K H S ++ F+ N S + ++DW ++GAVTPVKDQG CCWAF+
Sbjct: 98 ASRNRFK----GHMCSPQAGPFRYENVSAVP--ATMDWRKKGAVTPVKDQGQCGCCWAFS 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+N++ TG+L++ S+ ++VDC T GC +++AF++I Q + L +E Y
Sbjct: 152 AVAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANY 211
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY G D C+ + + I G++ V +E L V++QPVSVAIDA F F
Sbjct: 212 PYTGT-DGTCNTQKEATHA--AKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQF 268
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y G+FTG CG +HGVT VGYG + + YWLVKN WG W E G +R+ + +
Sbjct: 269 YSSGIFTGSCGTQLDHGVTAVGYGISDGTK----YWLVKNSWGAQWGEEGYIRMQKDISA 324
Query: 295 -SGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 325 KEGLCGIAMQASYP 338
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 230 bits (586), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 147/317 (46%), Positives = 180/317 (56%), Gaps = 36/317 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
KHEQWM F+R Y+D+ EK+MR +FKKN +F L +N+FAD T E+FL
Sbjct: 38 KHEQWMARFSRVYRDELEKQMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFL 97
Query: 61 ASYTGYK---PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
A +TG K D S+RS W N S M S DW GAVTPVK QG CCW
Sbjct: 98 AIHTGLKGLSSKVVDETISSRS-W----NISDMVGV-SKDWRAEGAVTPVKYQGQCGCCW 151
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASEC 174
AF+AVA VEG+ KI G LV+ S+ QL+DC GC + +AF YI Q + +ASE
Sbjct: 152 AFSAVAAVEGVTKIAGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYIIQNRGIASEN 211
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF 234
Y YQG D C RSSA I G+Q V E+ L + VSRQPVSV++DA F
Sbjct: 212 DYSYQG-SDGRC---RSSAR-PAARISGFQTVPSNNEQALLEAVSRQPVSVSMDANGDGF 266
Query: 235 YH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
H GGV+ GPCG + NH VT VGYGT+ + YWL KN WG W E G +RI R V
Sbjct: 267 MHYSGGVYDGPCGTSSNHAVTFVGYGTSQDG---TKYWLAKNSWGETWGEKGYIRIRRDV 323
Query: 293 G-GSGLCNIAANAAYPL 308
G+C +A A YP+
Sbjct: 324 AWPQGMCGVAQYAFYPV 340
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 229 bits (585), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 134/316 (42%), Positives = 181/316 (57%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++SE
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFIKENGGISSE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GDPSGLCDIAKMSSYP 341
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 229 bits (585), Expect = 8e-58, Method: Compositional matrix adjust.
Identities = 134/316 (42%), Positives = 181/316 (57%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++SE
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFIKENGGISSE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GDPSGLCDIAKMSSYP 341
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 229 bits (585), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 189/316 (59%), Gaps = 34/316 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+H QWM +A+ YKD E+E RF+IFK+N + L +N+FADLT E+F+
Sbjct: 38 RHAQWMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLDINQFADLTNEEFI 97
Query: 61 ASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
A +K H S+ R+ FK N + + ++DW ++GAVTP+KDQG CCWA
Sbjct: 98 APRNRFK----GHMCSSITRTTTFKYENVTVIP--STVDWRQKGAVTPIKDQGQCGCCWA 151
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA EG++ + G+L++ S+ ++VDC T GCA F++ AF++I Q L +E
Sbjct: 152 FSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQDQGCAGGFMDGAFKFIIQNHGLNTEP 211
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY+ D C+ +A+ I GY+ V E+ LQ V+ QPVSVAIDA+ F
Sbjct: 212 NYPYKA-ADGKCN--AKAAANHAATITGYEDVPVNNEKALQKAVANQPVSVAIDASGSDF 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTG CG +HGVT VGYG + A+G + YWLVKN WGT W E G +R+ RGV
Sbjct: 269 QFYKSGVFTGSCGTELDHGVTAVGYGVS--ADGTE-YWLVKNSWGTEWGEEGYIRMQRGV 325
Query: 293 GG-SGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 326 KAEEGLCGIAMMASYP 341
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 181/316 (57%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK+QG C
Sbjct: 94 QEFLAKFTGLNIPNSYVSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C N NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCANRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGEDGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKVSSYP 341
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 229 bits (584), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 145/332 (43%), Positives = 180/332 (54%), Gaps = 31/332 (9%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------L 48
SR S + KHEQWM F R Y D+ EK RF IFKKN EF++ +
Sbjct: 22 SRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEFVQNFNMNNKITYKVDI 81
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNW-----FKNLNSSKMSFYDSIDWNERGA 103
N+F+DLT E+F A++TG P S S+ F+ N S +S+DW + GA
Sbjct: 82 NEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRYGNVSDNG--ESMDWRQEGA 139
Query: 104 VTPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLEN 159
VTPVK QG C CWAF+AVA VEG+ KI G+LV+ S+ QL+DC GC +
Sbjct: 140 VTPVKYQGR-CGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDRDYNQGCRGGIMSK 198
Query: 160 AFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS 219
AFEYI + Q + +E YPYQ Q S+S + I GY+ V EE L VS
Sbjct: 199 AFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVPMNNEEALLQAVS 258
Query: 220 RQPVSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
+QPVSV I+ T F Y GGVF G CG +H VTIVGYG + E YW+VKN WG
Sbjct: 259 QQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGMSEEG---TKYWVVKNSWG 315
Query: 278 TNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
W E G MRI R V G+C +A A YPL
Sbjct: 316 ETWGENGYMRIKRDVDAPQGMCGLAILAFYPL 347
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKTNDLSDDDMPSNLDWRESGAVTQVKHQGQCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYSGGTYDGSCADRINHAVTAIGYGTDEEG---QKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GDPSGLCDIAKMSSYP 341
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 185/317 (58%), Gaps = 32/317 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+A +HE+WM E+ R YKD AEK RF++FK N F L +N+FADLT E
Sbjct: 1 MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F A+ G+KP + + FK N S + ++DW +GAVTP+K+QG CCW
Sbjct: 61 EFKAN-KGFKPISAEEVPTTG---FKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCW 116
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASE 173
AF+A+A +EG+ K+ TG LV+ S+ + VDC T N GC +++NAFE++ + LA+E
Sbjct: 117 AFSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLATE 176
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--W 231
YPY+ D C SA+ I+G++ V P E L VV+ QPVSVA+DA+
Sbjct: 177 SSYPYK-VVDGKCKGGSKSAA----TIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRT 231
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y GGV TG CG +HG+ +GYG ++ YW++KN WGT W E G +R+ +
Sbjct: 232 FMLYSGGVMTGSCGTQLDHGIAAIGYGVESD---DTKYWILKNSWGTTWGEKGFLRMEKD 288
Query: 292 VGGS-GLCNIAANAAYP 307
+ G+C++A +YP
Sbjct: 289 ISDKRGMCDLAMKPSYP 305
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 228 bits (582), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 187/332 (56%), Gaps = 36/332 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
SR + +A H+QWM F+R Y D+ EK+MRF +FKKN +F L +
Sbjct: 34 SRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGV 93
Query: 49 NKFADLTREKFLASYTGYK----PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAV 104
N+FAD TRE+F+A++TG K P ++ +W N N S ++ ++ DW GAV
Sbjct: 94 NEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSW--NWNVSDVAGRETKDWRYEGAV 151
Query: 105 TPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAF 161
TPVK QG CCWAF++VA VEGL KI LV+ S+ QL+DC NGC + +AF
Sbjct: 152 TPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAF 211
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSR 220
YI + + +ASE YPYQ + C + +GK A IRG+Q V E L + VS+
Sbjct: 212 SYIIKNRGIASEASYPYQAAEG-TCRY-----NGKPSAWIRGFQTVPSNNERALLEAVSK 265
Query: 221 QPVSVAIDATWFNFYH--GGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
QPVSV+IDA F H GGV+ P CG NH VT VGYGT+ E YWL KN WG
Sbjct: 266 QPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEG---IKYWLAKNSWG 322
Query: 278 TNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W E G +RI R V G+C +A A YP+
Sbjct: 323 ETWGENGYIRIRRDVAWPQGMCGVAQYAFYPV 354
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 228 bits (581), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y YQG Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYQGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 180/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
E+FLA +TG P + P S FK + S ++DW E GAVT VK+QG C
Sbjct: 94 EEFLAKFTGLNIPNSYLSPSPMPSTEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQGKTAAVQISNYQVV-PEGETSLLQAVTKQPVSIGIAASHDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C N NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCANRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKMSSYP 341
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 155/331 (46%), Positives = 187/331 (56%), Gaps = 40/331 (12%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
M R +A KHEQWM RTY+D EKE RF IFKKN + +
Sbjct: 24 MPRPLIDEDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNLKHIENFNNAFNRTYKLG 83
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFY----DSIDWNERGA 103
LN FADLT E+FLA+YTGYK P P +N + K SS + + +SIDW RG
Sbjct: 84 LNHFADLTDEEFLATYTGYKMPKV-LPTANITT--KTTQSSDVLYEANVPESIDWRTRGV 140
Query: 104 VTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC-STLNGCAKNFLENAF 161
VTPVK+QG CCWAF+A A VEG+ G V+ S QL+DC NGC F++NAF
Sbjct: 141 VTPVKNQGRCGCCWAFSAAAAVEGI----IGNGVSLSAQQLLDCVPDSNGCNGGFMDNAF 196
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
YI Q Q LAS YPYQ ++ S I GY V PA EE L+ V+RQ
Sbjct: 197 RYIIQNQGLASATYYPYQLMREM------CRPSNNAARISGYVDVTPADEETLKSAVARQ 250
Query: 222 PVSVAIDATW---FNFYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
PVS A+DAT F +Y GG+F CG+T H +TIVGYGT+ AEG + YWL+KN WG
Sbjct: 251 PVSAAVDATSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTS--AEGTK-YWLIKNSWG 307
Query: 278 TNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
W EGG MR+ R VG G C IA A+YP
Sbjct: 308 EGWGEGGYMRLQRDVGSYGGACGIALRASYP 338
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 228 bits (581), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 142/336 (42%), Positives = 185/336 (55%), Gaps = 44/336 (13%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
SR + +A H+QWM F+R Y D+ EK+MRF +FKKN +F L +
Sbjct: 25 SRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGV 84
Query: 49 NKFADLTREKFLASYTGYK----PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAV 104
N+FAD T+E+F+A++TG K P ++ +W N N S ++ + DW GAV
Sbjct: 85 NEFADWTKEEFIATHTGLKGFNGIPSSEFVDEMIPSW--NWNVSDVAGPEIKDWRYEGAV 142
Query: 105 TPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAF 161
TPVK QG CCWAF++VA VEGL KI G LV+ S+ QL+DC NGC + +AF
Sbjct: 143 TPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLVSLSEQQLLDCDRERDNGCNGGIMSDAF 202
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA-----IRGYQYVQPATEEGLQD 216
YI + + +ASE YPYQ + + +Y A IRG+Q V E L +
Sbjct: 203 SYIIKNRGIASEASYPYQ----------ETEGTCRYNAKPSAWIRGFQTVPSNNERALLE 252
Query: 217 VVSRQPVSVAIDATWFNFYH--GGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVK 273
VSRQPVSV+IDA F H GGV+ P CG NH VT VGYGT+ E YWL K
Sbjct: 253 AVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAVTFVGYGTSPEG---IKYWLAK 309
Query: 274 NRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
N WG W E G +RI R V G+C +A A YP+
Sbjct: 310 NSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPV 345
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 128/328 (39%), Positives = 181/328 (55%), Gaps = 35/328 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
+R + + A+HEQWM ++ R YKD EK RF++FK N +F L +
Sbjct: 24 ARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKFIESFNAGGNRKFWLGV 83
Query: 49 NKFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
N+FADLT ++F A+ T G+KP P P F+ N S + SIDW +GAVTP
Sbjct: 84 NQFADLTNDEFRATKTNKGFKPSPVKVPTG-----FRYENVSVDALPASIDWRTKGAVTP 138
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFE 162
+KDQG CCWAF+AVA EG+ KI T +L++ S+ +LVDC GC +++AF+
Sbjct: 139 IKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVHGEDQGCEGGLMDDAFK 198
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I + L +E YPY D C +SA+ I+G++ V E L V+ QP
Sbjct: 199 FIIKNGGLTTESSYPYTA-TDGKCKSGTNSAAN----IKGFEDVPANDEAALMKAVANQP 253
Query: 223 VSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSVA+D F Y GGV TG CG +HG+ +GYG T++ YWL+KN WGT W
Sbjct: 254 VSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYGQTSDG---TKYWLLKNSWGTTW 310
Query: 281 DEGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ + + G+C +A +YP
Sbjct: 311 GENGYLRMEKDISDKRGMCGLAMEPSYP 338
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 180/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENIKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++SE
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFIKENGGISSE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKMSSYP 341
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 228 bits (580), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 187/332 (56%), Gaps = 36/332 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
SR + +A H+QWM F+R Y D+ EK+MRF +FKKN +F L +
Sbjct: 10 SRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTYKLGV 69
Query: 49 NKFADLTREKFLASYTGYK----PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAV 104
N+FAD TRE+F+A++TG K P ++ +W N N S ++ ++ DW GAV
Sbjct: 70 NEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSW--NWNVSDVAGRETKDWRYEGAV 127
Query: 105 TPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAF 161
TPVK QG CCWAF++VA VEGL KI LV+ S+ QL+DC NGC + +AF
Sbjct: 128 TPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMSDAF 187
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSR 220
YI + + +ASE YPYQ + C + +GK A IRG+Q V E L + VS+
Sbjct: 188 SYIIKNRGIASEASYPYQAAEG-TCRY-----NGKPSAWIRGFQTVPSNNERALLEAVSK 241
Query: 221 QPVSVAIDATWFNFYH--GGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
QPVSV+IDA F H GGV+ P CG NH VT VGYGT+ E YWL KN WG
Sbjct: 242 QPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPEG---IKYWLAKNSWG 298
Query: 278 TNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W E G +RI R V G+C +A A YP+
Sbjct: 299 ETWGENGYIRIRRDVAWPQGMCGVAQYAFYPV 330
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 179/320 (55%), Gaps = 30/320 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
++ KHEQWM F+R Y+D+ EK MR +FKKN +F L +N+FAD T
Sbjct: 34 SMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTN 93
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWF--KNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
E+FLA +TG K P + + N S M +S DW GAVTPVK QG
Sbjct: 94 EEFLAIHTGLKGLTEVSPSKVVAKTISSQTWNVSDM-VVESKDWRAEGAVTPVKYQGQCG 152
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLA 171
CCWAF+AVA VEG+ KI G LV+ S+ QL+DC GC + +AF Y+ Q + +A
Sbjct: 153 CCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDRGCDGGIMSDAFNYVVQNRGIA 212
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
SE Y YQG D C RS+A I G+Q V E L + VSRQPVSV++DAT
Sbjct: 213 SENDYSYQG-SDGGC---RSNAR-PAARISGFQTVPSNNERALLEAVSRQPVSVSMDATG 267
Query: 232 FNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F H GGV+ GPCG + NH VT VGYGT+ + YWL KN WG W E G +RI
Sbjct: 268 DGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDG---TKYWLAKNSWGETWGEKGYIRIR 324
Query: 290 RGVG-GSGLCNIAANAAYPL 308
R V G+C +A A YP+
Sbjct: 325 RDVAWPQGMCGVAQYAFYPV 344
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 228 bits (580), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 134/317 (42%), Positives = 180/317 (56%), Gaps = 27/317 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDS-IDWNERGAVTPVKDQGSY- 113
++FLA +TG P + P S FK +N + S +DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLSDDYMPSNLDWRESGAVTQVKHQGRCG 153
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLAS 172
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++
Sbjct: 154 CCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISR 213
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW- 231
E Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 ESDYEYLGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQD 268
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 LQFYAGGTYDGNCADRINHAVTAIGYGTDEEG---QKYWLLKNSWGTSWGENGYMKIIRD 325
Query: 292 VGG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 SGDPSGLCDIAKMSSYP 342
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 134/317 (42%), Positives = 180/317 (56%), Gaps = 27/317 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNS-SKMSFYDSIDWNERGAVTPVKDQGSY- 113
++FLA +TG P + P S FK +N S ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDLSDDDMPSNLDWRESGAVTQVKHQGQCG 153
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLAS 172
CCWAF+AV ++EG KI TG+L+ S+ +L+DC+T N GC F+ NAF++I + ++
Sbjct: 154 CCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISR 213
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW- 231
E Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 ESDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQD 268
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 LQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRD 325
Query: 292 VGG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 SGNPSGLCDIAKMSSYP 342
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 134/328 (40%), Positives = 188/328 (57%), Gaps = 36/328 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------L 48
+R H++ + +HE+WM + + YKD EK RF+IFK N EF+ +
Sbjct: 27 TRELHES-TMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEFIESSNAAGNNSYMLGI 85
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N+FADLT E+F AS+ GYK P S FK N + + + S+DW +GAVT +K
Sbjct: 86 NRFADLTNEEFRASWNGYKRPL---DASRIVTPFKYENVTALPY--SMDWRRKGAVTSIK 140
Query: 109 DQ---GSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFE 162
DQ GS CWAF+AVA EG++K+RTG+LV+ S+ +LVDC GC +E+AF+
Sbjct: 141 DQRECGS--CWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGEDKGCQGGLMEDAFK 198
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I++ + +E Y Y+GR D CD + ++ I GYQ V +E L V+ QP
Sbjct: 199 FIKRNGGITTEANYAYRGR-DGKCDTKKEAS--HVAKITGYQVVPENSEAALLKAVAHQP 255
Query: 223 VSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSV+IDA F FY G++ G CG+ NHGV VGYGT++ YW+VKN WG W
Sbjct: 256 VSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTSSSG---SKYWIVKNSWGPEW 312
Query: 281 DEGGSMRIFRGVGG-SGLCNIAANAAYP 307
E G +R+ R + GLC IA + +YP
Sbjct: 313 GERGYVRMKRDITSRKGLCGIAMDCSYP 340
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 227 bits (579), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 185/314 (58%), Gaps = 33/314 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFL 60
+HEQWM ++ R YKD E+ R+ IFK+N F L +N+FADLT E+F
Sbjct: 4 RHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFK 63
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS +K H S ++ F+ N S + ++DW + GAVTPVKDQG CCWAF+
Sbjct: 64 ASRNRFK----GHMCSPQAGPFRYENVSAVP--STVDWRKEGAVTPVKDQGQCGCCWAFS 117
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+NK+ TG+L++ S+ ++VDC T GC +++AF++I Q + L +E Y
Sbjct: 118 AVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANY 177
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNF 234
PY+G D C+ +S+ I G++ V +E L V++QPVSVAIDA + F F
Sbjct: 178 PYKGT-DGTCNTKKSAIHA--AKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQF 234
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y G+FTG C +HGVT VGYG + ++ YWLVKN WG W E G +R+ + +
Sbjct: 235 YSSGIFTGSCDTQLDHGVTAVGYGVSDGSK----YWLVKNSWGAQWGEEGYIRMQKDISA 290
Query: 295 -SGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 291 KEGLCGIAMQASYP 304
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GDPSGLCDIAKMSSYP 341
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GDPSGLCDIAKMSSYP 341
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 227 bits (579), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 143/329 (43%), Positives = 204/329 (62%), Gaps = 31/329 (9%)
Query: 1 MSRTSH-KTGNIAAK-HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------- 45
MSRT + +T ++ AK H+QWM+++ R+Y + AE E RFKIF +N E+
Sbjct: 22 MSRTLYDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFMENLEYIEKFNNAPGNKSY 81
Query: 46 -LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAV 104
L LN+F+DLT E+F+AS+TG P+ S++ +L+ S S+DW E+GAV
Sbjct: 82 KLDLNQFSDLTNEEFIASHTGLMIDPSKPSSSSKRASPASLDLSDTP--TSLDWREQGAV 139
Query: 105 TPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENA 160
T VK+QG+ CWAF+AVA VEG+ KI+ G L++ S+ QLVDC++ GC F++NA
Sbjct: 140 TDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCASNEQNQGCGGGFMDNA 199
Query: 161 FEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR 220
F YI + +ASE Y Y+G + + + + I GY+ V PA E+ L VS+
Sbjct: 200 FSYITE-NGIASENDYQYRGGAGTCQNNEMITPAAR---ISGYEDV-PAGEDQLLLAVSQ 254
Query: 221 QPVSVAIDA-TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
QPVSVAI F+ Y G+++GPCG++ NHGVT+VGYGT+ E +G + YWL+KN WG +
Sbjct: 255 QPVSVAIAVGQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEE-DGTK-YWLIKNSWGES 312
Query: 280 WDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
W E G MR+ R G S G C IA A++P
Sbjct: 313 WGENGYMRLLRESGQSEGHCGIAVKASHP 341
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 227 bits (578), Expect = 5e-57, Method: Compositional matrix adjust.
Identities = 127/317 (40%), Positives = 185/317 (58%), Gaps = 28/317 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
I A +E W+V+ ++Y EKE RF+IFK N + L LN+FADLT E
Sbjct: 40 IMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNE 99
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ + YTG + + S +S + +L + S +S+DW E GAV VKDQG CW
Sbjct: 100 EYRSKYTGIRTKDSRKKVSGKSQRYASL--AGESLPESVDWREHGAVASVKDQGQCGSCW 157
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASEC 174
AF+ ++ VEG+N+I TG+L+T S+ +LVDC S GC +++AF++I + S+
Sbjct: 158 AFSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDSDA 217
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY GR D CD +R +A K I Y+ V E+ LQ + QP+SVAI+A+ F
Sbjct: 218 DYPYTGR-DGQCDQYRKNA--KVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRDF 274
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY G+FTG CG +HGV +VGYGT E + YW+V+N WG +W E G +R+ RG+
Sbjct: 275 QFYDSGIFTGKCGTDLDHGVVVVGYGT----ENGKDYWIVRNSWGADWGEKGYLRMERGI 330
Query: 293 GG-SGLCNIAANAAYPL 308
+G+C I + +YP+
Sbjct: 331 SSKAGICGITSEPSYPV 347
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 227 bits (578), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 129/328 (39%), Positives = 181/328 (55%), Gaps = 35/328 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
+R ++ + A+HEQWM +++R YKD AEK RF++FK N +F L +
Sbjct: 24 ARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVKFIESFNTGGNRKFWLGI 83
Query: 49 NKFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
N+FADLT ++F + T G+KP S F+ N S + +IDW GAVTP
Sbjct: 84 NQFADLTNDEFRTTKTNKGFKP-----SLDKVSTGFRYENVSVDAIPATIDWRTNGAVTP 138
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFE 162
+KDQG CCWAF+AVA EG+ KI TG+L++ S+ +LVDC GC +++AF+
Sbjct: 139 IKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFK 198
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I + L +E YPY D C S S I+GY+ V E L V+ QP
Sbjct: 199 FIIKNGGLTTESNYPYTA-ADGKC----KSGSNSAANIKGYEDVPTNDEAALMKAVANQP 253
Query: 223 VSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSVA+D F FY GGV TG CG +HG+ +GYG T++ YWL+KN WGT W
Sbjct: 254 VSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDG---TKYWLMKNSWGTTW 310
Query: 281 DEGGSMRIFRGVGG-SGLCNIAANAAYP 307
E G +R+ + + G+C +A +YP
Sbjct: 311 GENGYLRMEKDISDKKGMCGLAMEPSYP 338
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 226 bits (577), Expect = 7e-57, Method: Compositional matrix adjust.
Identities = 134/317 (42%), Positives = 185/317 (58%), Gaps = 37/317 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HEQWM + YK EKE +++IF +N + L +N FADLT E+F
Sbjct: 37 RHEQWMATHGKVYKHSYEKEQKYQIFMENVQRIEAFNNAGXKPYKLGINHFADLTNEEFK 96
Query: 61 A--SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
A + G+ R+ F+ N + + S+DW ++GAVTP+KDQG CCWA
Sbjct: 97 AINRFKGHVCSK-----RTRTTTFRYENVTAVP--ASLDWRQKGAVTPIKDQGQCGCCWA 149
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA EG+ K+RTG+L++ S+ +LVDC T GC +++AF++I Q + LA+E
Sbjct: 150 FSAVAATEGITKLRTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKGLATEA 209
Query: 175 VYPYQGRQDYYCDWWRSSASGKY-GAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
+YPY+G D C+ + A G + G+I+GY+ V +E L V+ QPVSVAI+A+
Sbjct: 210 IYPYEGF-DGTCN---AKADGNHAGSIKGYEDVPANSESALLKAVANQPVSVAIEASGFK 265
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GGVFTG CG +HGVT VGYG + YWLVKN WG W E G +R+ R
Sbjct: 266 FQFYSGGVFTGSCGTNLDHGVTSVGYGVGDDG---TKYWLVKNSWGVKWGEKGYIRMQRD 322
Query: 292 VGG-SGLCNIAANAAYP 307
V GLC IA A+YP
Sbjct: 323 VAAKEGLCGIAMLASYP 339
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 132/329 (40%), Positives = 189/329 (57%), Gaps = 35/329 (10%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
+S + + ++ +HEQWM + + YKD EKE RF IF++N +++
Sbjct: 25 VSSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNIFQENVKYIEASNNAGNKPYKLG 84
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVT 105
+N+F DLT ++F+A+ +K H S+ R+ FK N + S ++DW + GAVT
Sbjct: 85 VNQFTDLTNKEFIATRNKFK----GHMSSSITRTTTFKYENVTAPS---TVDWRQEGAVT 137
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAF 161
PVK+QG+ CCWAF+AVA EG++K+ TG LV+ S+ +LVDC T GC +++AF
Sbjct: 138 PVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQGCQGGLMDDAF 197
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
++I Q L +E YPYQG D C+ + I GY+ V E+ LQ V+ Q
Sbjct: 198 KFIIQNGGLNTEAQYPYQGV-DGTCN--TNEEVTHVATITGYEDVPSNNEQALQQAVANQ 254
Query: 222 PVSVAIDATWFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
P+SVAIDA+ +F Y GVFTG CG +HGV +VGYG + + YWLVKN WG +
Sbjct: 255 PISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDDGT---KYWLVKNSWGED 311
Query: 280 WDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
W E G +R+ R V GLC IA +YP
Sbjct: 312 WGEEGYIRMQRDVEAPEGLCGIAMQPSYP 340
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 226 bits (577), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGHVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++SE
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFIKENGGISSE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKMSSYP 341
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 226 bits (576), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 134/316 (42%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---ENGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 140/320 (43%), Positives = 179/320 (55%), Gaps = 30/320 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
++ KHEQWM F+R Y+D+ EK MR +FKKN +F+ +N+FAD T
Sbjct: 34 SMVDKHEQWMARFSREYRDELEKNMRRDVFKKNLKFIENFNKKGNKSYKLGVNEFADWTN 93
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWF--KNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
E+FLA +TG K P + + N S M +S DW GAVTPVK QG
Sbjct: 94 EEFLAIHTGLKGLTEVSPSKVVAKTISSQTWNVSDM-VVESKDWRAEGAVTPVKYQGQCG 152
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLA 171
CCWAF+AVA VEG+ KI G LV+ S+ QL+DC C + +AF Y+ Q + +A
Sbjct: 153 CCWAFSAVAAVEGVAKIAGGNLVSLSEQQLLDCDREYDRDCDGGIMSDAFNYVVQNRGIA 212
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
SE Y YQG D C RS+A I G+Q V E L + VSRQPVSV++DAT
Sbjct: 213 SENDYSYQG-SDGGC---RSNAR-PAARISGFQTVPSNNERALLEAVSRQPVSVSMDATG 267
Query: 232 FNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F H GGV+ GPCG + NH VT VGYGT+ + YWL KN WG W+E G +RI
Sbjct: 268 DGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQDG---TKYWLAKNSWGETWEEKGYIRIR 324
Query: 290 RGVG-GSGLCNIAANAAYPL 308
R V G+C +A A YP+
Sbjct: 325 RDVAWPQGMCGVAQYAFYPV 344
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/326 (41%), Positives = 183/326 (56%), Gaps = 33/326 (10%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKF 51
+S + A +E W+V+ ++Y EKE RF+IFK N F+ LN+F
Sbjct: 35 SSRTDDEVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAESRTYKVGLNRF 94
Query: 52 ADLT----REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
ADLT R +L + TG + + S+R + + S DS+DW E+GAV V
Sbjct: 95 ADLTNDEYRSMYLGARTGSRRRLSTQKRSDRY-----VPVAGESLPDSVDWREKGAVVGV 149
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYI 164
KDQGS CWAF+ +A VEG+N+I TG L++ S+ +LVDC T GC ++ AFE+I
Sbjct: 150 KDQGSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFI 209
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ + +E YPY R D CD +R +A K I Y+ V E+ LQ V+ QPVS
Sbjct: 210 IKNGGIDTEEDYPYNAR-DGRCDQYRKNA--KVVTIDDYEDVPVNNEQALQKAVANQPVS 266
Query: 225 VAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAI+A+ F FY GVFTG CG +HGVT VGYGT E YW+VKN WG++W E
Sbjct: 267 VAIEASGMAFQFYESGVFTGNCGTALDHGVTAVGYGT----ENSVDYWIVKNSWGSSWGE 322
Query: 283 GGSMRIFRGVGGSGLCNIAANAAYPL 308
G +R+ R G +G C IA +YP+
Sbjct: 323 SGYIRMERNTGATGKCGIAVEPSYPI 348
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 125/327 (38%), Positives = 183/327 (55%), Gaps = 34/327 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-----------HEF-LRLN 49
+R +AA+HE+WM ++ R YKD AEK RF++FK N H+F L +N
Sbjct: 24 ARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVN 83
Query: 50 KFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+FADLT ++F ++ T G+ P T P R ++N+N + ++DW +G VTP+
Sbjct: 84 QFADLTNDEFRSTKTNKGFIPSTTRVPTGFR---YENVNIDALPA--TMDWRTKGVVTPI 138
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CCWAF+AVA +EG+ K+ TG+L++ S+ +LVDC GC +++AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + L +E YPY D S S +I+GY+ V E L V+ QPV
Sbjct: 199 IIKNGGLTTESNYPYAAADDKC-----KSVSNSVASIKGYEDVPANNEAALMKAVANQPV 253
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVA+D F FY GGV TG CG +HG+ +GYG ++ YWL+KN WGT W
Sbjct: 254 SVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDG---TKYWLLKNSWGTTWG 310
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ + + G+C +A +YP
Sbjct: 311 ENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDY 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKMSSYP 341
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/316 (42%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKMSSYP 341
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDY 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKMSSYP 341
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 180/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I Y+ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYKVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 180/318 (56%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP---PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++FLA +TG P + P S+ +L+ M ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMP--SNLDWRESGAVTQVKHQGRC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGIS 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 212 RESDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQ 266
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 267 DLQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 291 GVGG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 324 DSGNPSGLCDIAKMSSYP 341
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 226 bits (576), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 185/314 (58%), Gaps = 33/314 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFL 60
+HEQWM ++ R YKD E+ R+ IFK+N F L +N+FADLT E+F
Sbjct: 38 RHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFK 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS +K H S ++ F+ N S + ++DW + GAVTPVKDQG CCWAF+
Sbjct: 98 ASRNRFK----GHMCSPQAGPFRYENVSAVP--STVDWRKEGAVTPVKDQGQCGCCWAFS 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+NK+ TG+L++ S+ ++VDC T GC +++AF++I Q + L +E Y
Sbjct: 152 AVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANY 211
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNF 234
PY+G D C+ + A+ I G++ V +E L V++QPVSVAIDA + F F
Sbjct: 212 PYKGT-DGTCN--TNKAAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQF 268
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y G+FTG C +HGVT VGYG + ++ YWLVKN WG W E G +R+ + +
Sbjct: 269 YSSGIFTGSCDTQLDHGVTAVGYGVSDGSK----YWLVKNSWGAQWGEEGYIRMQKDISA 324
Query: 295 -SGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 325 KEGLCGIAMQASYP 338
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 180/318 (56%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP---PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++FLA +TG P + P S+ +L+ M ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMP--SNLDWRESGAVTQVKHQGRC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGIS 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 212 RESDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQ 266
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 267 DLQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 291 GVGG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 324 DSGDPSGLCDIAKMSSYP 341
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDY 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKMSSYP 341
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 124/304 (40%), Positives = 177/304 (58%), Gaps = 28/304 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-----------HEF-LRLNKFADLTREK 58
+ A+HE+WM ++ R Y D AEK RF++FK N H+F L N+FADLT ++
Sbjct: 37 MVARHEEWMAKYDRVYSDAAEKARRFEVFKANMALIESVNAGNHKFWLEANRFADLTDDE 96
Query: 59 FLASYTGYKPPPTDHPHSNRS----NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
F A++TGY+P RS FK N S S+DW +GAVTP+K+QG
Sbjct: 97 FRATWTGYRPKTAAASSKGRSRTATTGFKYANVSLDDVPASVDWRTKGAVTPIKNQGECG 156
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRL 170
CCWAF+AVA++EG+ K+ TG+LV+ S+ +LVDC GC +++AF++I L
Sbjct: 157 CCWAFSAVASMEGVVKLSTGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGNGGL 216
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA- 229
+E YPY D C+ + ASG +I+GY+ V E L+ V+ QPVSVA+D
Sbjct: 217 TTESRYPYTA-SDGTCN--SNEASGDAASIKGYEDVPANDEASLRKAVANQPVSVAVDGG 273
Query: 230 -TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
+ F FY GGV +G CG +HG+ VGYG ++ YW++KN WGT+W E G +R+
Sbjct: 274 DSHFRFYKGGVLSGACGTELDHGIAAVGYGVASDG---TKYWVMKNSWGTSWGEAGYIRM 330
Query: 289 FRGV 292
R +
Sbjct: 331 ERDI 334
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 123/327 (37%), Positives = 181/327 (55%), Gaps = 34/327 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLN 49
+R +AA+HE+WM ++ R Y+D AEK RF++FK N F L +N
Sbjct: 24 ARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGVN 83
Query: 50 KFADLTREKF--LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+FADLT ++F + + G+ P T P R ++N+N + ++DW +GAVTP+
Sbjct: 84 QFADLTNDEFRWMKTNKGFIPSTTRVPTGFR---YENVNIDALPA--TVDWRTKGAVTPI 138
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CCWAF+AVA +EG+ K+ TG+L++ S+ +LVDC GC +++AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + L +E YPY D S S +I+GY+ V E L V+ QPV
Sbjct: 199 IIKNGGLTTESNYPYAAADDKC-----KSVSNSVASIKGYEDVPANNEAALMKAVANQPV 253
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVA+D F FY GGV TG CG +HG+ +GYG ++ YWL+KN WGT W
Sbjct: 254 SVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDG---TKYWLLKNSWGTTWG 310
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ + + G+C +A +YP
Sbjct: 311 ENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 178/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/316 (42%), Positives = 178/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 226 bits (575), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 178/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+I ++YP
Sbjct: 326 GDPSGLCDITKMSSYP 341
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 135/327 (41%), Positives = 187/327 (57%), Gaps = 34/327 (10%)
Query: 1 MSRTSHKTGN-IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------L 46
MSR H+ ++ +HEQW ++ + YKD AEK+ R IFK N EF L
Sbjct: 25 MSRNLHEASXCMSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNVEFIESFNAAGNKPYKL 84
Query: 47 RLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
+N D T E+F+AS+ GYK H S+ FK N + + +++DW E GAV
Sbjct: 85 SINHLTDQTNEEFVASHNGYK-----HKGSHSQTPFKYENITGVP--NAVDWRENGAVXA 137
Query: 107 VKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEY 163
+KDQG C CWAF+ VAT EG+ +I T L++ S+ +LVDC +++ GC ++E FE+
Sbjct: 138 MKDQGQ-CGNCWAFSTVATTEGIYQITTSMLMSLSEQELVDCDSVDHGCDGGYMEGGFEF 196
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + ++SE YPY Y +S + + I+GY+ V +E+ LQ V+ QPV
Sbjct: 197 IXKNGGISSEANYPYTAVDGTYDANKEASPAAQ---IKGYETVPANSEDALQKAVANQPV 253
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SV ID + F F GVFTG CG +HGVT VGYG+T +G Q YW+VKN WGT W
Sbjct: 254 SVTIDVGGSAFQFNSSGVFTGQCGTQLDHGVTAVGYGSTD--DGTQ-YWIVKNSWGTQWG 310
Query: 282 EGGSMRIFRGVGG-SGLCNIAANAAYP 307
E G +R+ RG GLC IA +A+YP
Sbjct: 311 EEGYIRMQRGTDAQEGLCGIAMDASYP 337
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 225 bits (574), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 178/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPVSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 134/327 (40%), Positives = 189/327 (57%), Gaps = 36/327 (11%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
MSR H+T ++ +HE W+ + + YK AEKE F+IFK+N EF+
Sbjct: 25 MSRKLHET-SLREEHENWIARYGQVYKVAAEKET-FQIFKENVEFIESFNAAANKPYKLG 82
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N FADLT E+F G K H FK N + + +++DW E+GAVTP+
Sbjct: 83 VNLFADLTLEEFKDFRFGLKKT-----HEFSITPFKYENVTDIP--EALDWREKGAVTPI 135
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CWAF+ VA EG+++I TG LV+ + +LV C T GC ++E+ FE+
Sbjct: 136 KDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKGVDQGCEGGYMEDGFEF 195
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + + ++ YPY+G C+ + A+ I+GY+ V +EE LQ V+ QPV
Sbjct: 196 IIKNGGITTKANYPYKGVNGT-CN--TTIAASTVAQIKGYETVPSYSEEALQKAVANQPV 252
Query: 224 SVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SV+IDA F FY GG++TG CG +HGVT VGYGTT E + YW+VKN WGT WD
Sbjct: 253 SVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNETD----YWIVKNSWGTGWD 308
Query: 282 EGGSMRIFRGVG-GSGLCNIAANAAYP 307
E G +R+ RG+ GLC +A +++YP
Sbjct: 309 EKGFIRMQRGITVKHGLCGVALDSSYP 335
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCDGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---ENGQKYWLLKNSWGTSWGENGFMKIIRDY 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKMSSYP 341
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 183/319 (57%), Gaps = 30/319 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ ++HE+WM E RTY ++ EK R ++F+ N + L N+FADLT E
Sbjct: 40 MVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDE 99
Query: 58 KFLASYTGYK-PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+F A+ TG + PP + + F+ N S S+DW GAVT VKDQGS CC
Sbjct: 100 EFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCGCC 159
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
WAF+AVA VEGL KIRTG+LV+ S+ QLVDC GCA ++NAFEY+ L +
Sbjct: 160 WAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGGLTT 219
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPY+G D C S+AS IRGY+ V E L V+ QPVSVAI+ +
Sbjct: 220 ESSYPYRG-TDGSCRRSASAAS-----IRGYEDVPANNEAALMAAVAHQPVSVAINGGDS 273
Query: 231 WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GV G CG NH +T VGYGT ++ YW++KN WG +W EGG +RI
Sbjct: 274 VFRFYDSGVLGGSGCGTELNHAITAVGYGTASDG---TKYWIMKNSWGGSWGEGGYVRIR 330
Query: 290 RGVGGSGLCNIAANAAYPL 308
RGV G G+C +A A+YP+
Sbjct: 331 RGVRGEGVCGLAQLASYPV 349
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 126/319 (39%), Positives = 177/319 (55%), Gaps = 35/319 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ A+HEQWM +++R YKD +EK RF++FK N +F L +N+FADLT +
Sbjct: 126 MVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIESFNAGGNNKFWLGVNQFADLTND 185
Query: 58 KFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F ++ T G K P F+ N S + +IDW +GAVTP+KDQG C
Sbjct: 186 EFRSTKTNKGLKSSNMKIPTG-----FRYENVSADALPTTIDWRTKGAVTPIKDQGQCGC 240
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+AVA EG+ KI TG+LV+ ++ +LVDC GC +++AF++I + L
Sbjct: 241 CWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLT 300
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
+E YPY D C S S I+GY+ V E L V+ QPVSVA+D
Sbjct: 301 TESSYPYTA-ADGKC----KSGSNSAATIKGYEDVPANDEAALMKAVANQPVSVAVDGGD 355
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GGV TG CG +HG+ +GYG T++ YWL+KN WGT W E G +R+
Sbjct: 356 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDG---TKYWLMKNSWGTTWGENGYLRME 412
Query: 290 RGVGGS-GLCNIAANAAYP 307
+ + G+C +A +YP
Sbjct: 413 KDISDKRGMCGLAMEPSYP 431
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 139/326 (42%), Positives = 185/326 (56%), Gaps = 35/326 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRL 48
SR+ H+ ++ +HE WM + R YKD EKE RFKIFK N F L +
Sbjct: 27 SRSLHE-ASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSI 85
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N+FADLT E+F + +K H S + FK N + + +IDW ++GAVTP+K
Sbjct: 86 NEFADLTNEEFRSLRNRFKA----HICSEATT-FKYENVTAVP--STIDWRKKGAVTPIK 138
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYI 164
DQ CCWAF+AVA EG+ +I TG+L++ S+ +LVDC T GC+ +++AF +I
Sbjct: 139 DQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFI 198
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ LASE YPY+G D C+ + + I+GY+ V E+ LQ V+ QPV+
Sbjct: 199 K-IHGLASEATYPYEG-DDGTCNSKKEAHPA--AKIKGYEDVPANNEKALQKAVAHQPVA 254
Query: 225 VAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAIDA F FY GVFTG CG +HGV VGYG + YWLVKN WGT W E
Sbjct: 255 VAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIGDDG---MMYWLVKNSWGTGWGE 311
Query: 283 GGSMRIFRGV-GGSGLCNIAANAAYP 307
G +R+ R V GLC IA A+YP
Sbjct: 312 EGYIRMQRDVTAKEGLCGIAMQASYP 337
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 178/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFKKN +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKKNMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGQCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG+L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGKLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY G + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAEGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 225 bits (573), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 124/327 (37%), Positives = 181/327 (55%), Gaps = 34/327 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLN 49
+R +AA+HE+WM ++ R Y+D AEK RF++FK N F L +N
Sbjct: 24 ARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAFIESFNAGNHNFWLGVN 83
Query: 50 KFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+FADLT ++F + T G+ P T P R ++N+N + ++DW +GAVTP+
Sbjct: 84 QFADLTNDEFRWTKTNKGFIPSTTRVPTGFR---YENVNIDALPA--TVDWRTKGAVTPI 138
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CCWAF+AVA +EG+ K+ TG+L++ S+ +LVDC GC +++AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + L +E YPY D S S +I+GY+ V E L V+ QPV
Sbjct: 199 IIKNGGLTTESNYPYAAADDKC-----KSVSNSVASIKGYEDVPANNEAALMKAVANQPV 253
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVA+D F FY GGV TG CG +HG+ +GYG ++ YWL+KN WGT W
Sbjct: 254 SVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDG---TKYWLLKNSWGTTWG 310
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ + + G+C +A +YP
Sbjct: 311 ENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 187/313 (59%), Gaps = 32/313 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLA 61
H+QWM + R YK EK R IF++N ++++ +N+FADLT E+F
Sbjct: 39 HDQWMARYGRVYKTANEKNRRSTIFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTT 98
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
S +K H + +N F+ N + + ++DW ++GAVTP+K+QG CCWAF+A
Sbjct: 99 SRNKFK----SHVCATVTNVFRYENVTAVP--ATMDWRKKGAVTPIKNQGQCGCCWAFSA 152
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECVYP 177
VA +EG+ +++TG+L++ S+ +LVDC T GC ++ AF++I+Q L++E YP
Sbjct: 153 VAAMEGITQLKTGKLISLSEQELVDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYP 212
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFY 235
Y G D C+ + + I G++ V +E L V+ QP+SVAIDA+ F FY
Sbjct: 213 YSGT-DGTCN--ANKEANHAATITGHEDVPANSESALLKAVANQPISVAIDASGSDFQFY 269
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
GVFTG CG +HGVT VGYGT A+G + YWLVKN WGT+W E G +++ RGV +
Sbjct: 270 SSGVFTGECGTELDHGVTAVGYGTA--ADGTK-YWLVKNSWGTSWGEEGYIQMQRGVAAA 326
Query: 296 -GLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 327 EGLCGIAMQASYP 339
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 180/318 (56%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP---PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++FLA +TG P + P S+ +L+ M ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMP--SNLDWRESGAVTQVKHQGRC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGIS 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 212 RESDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQ 266
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 267 DLQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 291 GVGG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 324 DYGNPAGLCDIAKMSSYP 341
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 224 bits (572), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 180/318 (56%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP---PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++FLA +TG P + P S+ +L+ M ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTELKINDLSDDDMP--SNLDWRESGAVTQVKHQGRC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGIS 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 212 RESDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQ 266
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 267 DLQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 291 GVGG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 324 DYGNPAGLCDIAKMSSYP 341
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 224 bits (571), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 179/318 (56%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP---PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++FLA +TG P + P S+ +L+ M ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMP--SNLDWRESGAVTQVKHQGRC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGIS 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 212 RESDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQ 266
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 267 DLQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 291 GVGG-SGLCNIAANAAYP 307
G SGLC+I ++YP
Sbjct: 324 DSGDPSGLCDITKMSSYP 341
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 180/318 (56%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP---PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++FLA +TG P + P S+ +L+ M ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTELKINDLSDDDMP--SNLDWRESGAVTQVKHQGRC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGIS 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 212 RESDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQ 266
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 267 DLQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 291 GVGG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 324 DYGNPAGLCDIAKMSSYP 341
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I Y+ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYKVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDY 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 180/314 (57%), Gaps = 26/314 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKF 59
+H++WM + R Y D EK R+ +FK+N E L +N+FADLT ++F
Sbjct: 38 RHDEWMAKHGRVYADMKEKNNRYVVFKRNVERIERLNNVPAGRTFKLAVNQFADLTNDEF 97
Query: 60 LASYTGYKPPPTDHPHS-NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ YTGYK S +++ F+ N S + S+DW ++GAVTP+K+QG+ CCWA
Sbjct: 98 RSMYTGYKGGSVLSSQSGTKTSSFRYQNVSSGALPVSVDWRKKGAVTPIKNQGTCGCCWA 157
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVY 176
F+AVA +EG KI+ G+L++ S+ QLVDC T + GC+ ++ AFE+I L +E Y
Sbjct: 158 FSAVAAIEGATKIKKGKLISLSEQQLVDCDTNDFGCSGGLMDTAFEHIMATGGLTTESNY 217
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN--F 234
PY+G+ D C + + +I GY+ V E+ L V+ QPVS+ I+ F+ F
Sbjct: 218 PYKGK-DATCKIKNTKPTAT--SITGYEDVPVNDEKALMKAVAHQPVSIGIEGGGFDFQF 274
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-G 293
Y GVFTG C +H VT VGYG ++ YW++KN WGT W E G MRI + V
Sbjct: 275 YGSGVFTGECTTYLDHAVTAVGYGQSSNGS---KYWIIKNSWGTKWGESGYMRIKKDVKD 331
Query: 294 GSGLCNIAANAAYP 307
GLC +A A+YP
Sbjct: 332 KKGLCGLAMKASYP 345
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 179/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I Y+ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYKVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 176/317 (55%), Gaps = 30/317 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFK-----------KNHEF-LRLNKFADLTRE 57
++AA+HE WM ++ R YKD AEK +F++FK +NH+F L +N+FADLT E
Sbjct: 32 SMAARHETWMAQYGRVYKDAAEKAQKFEVFKANARFIDSFNAENHKFWLGINQFADLTNE 91
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F A+ T + S FK N + SIDW +GAVTPVKDQG CCW
Sbjct: 92 EFKATKTNKGFISN---KARVSTGFKYENLKIEALPTSIDWRTKGAVTPVKDQGQCGCCW 148
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
AF+AVA EG+ K+ TG+LV+ S+ +LVDC GC +++AF++I L E
Sbjct: 149 AFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNGGLTQE 208
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY +D C S S G I+ Y+ V E L V+ QPVSVA+D
Sbjct: 209 SSYPYDA-EDGKC----KSGSKSAGTIKSYEDVPANNEGALMKAVANQPVSVAVDGGDMT 263
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GGV TG CG +HG+ +GYG T++ +WL+KN WGT W E G +R+ +
Sbjct: 264 FQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDG---TKFWLMKNSWGTTWGENGFLRMEKD 320
Query: 292 VGG-SGLCNIAANAAYP 307
+ G+C +A +YP
Sbjct: 321 IADKKGMCGLAMEPSYP 337
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 224 bits (571), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 133/316 (42%), Positives = 178/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I Y+ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYKVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 224 bits (570), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 131/324 (40%), Positives = 189/324 (58%), Gaps = 33/324 (10%)
Query: 3 RTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFK-----------KNHEF-LRLNK 50
R+ + ++ +HEQWM + R YK+ AEK RF+IF+ +NH+F L +N+
Sbjct: 29 RSLGENKSMLERHEQWMAQHGRVYKNAAEKAHRFEIFRANVERIESFNAENHKFKLGVNQ 88
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F T KP S +S ++N+ + + +DW +GAVTP+KDQ
Sbjct: 89 FADLTNEEFKTRNT-LKPSKM---ASTKSFKYENVTAVPAT----MDWRTKGAVTPIKDQ 140
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQ 166
G CWAF+AVA EG+ K+ TG+L++ S+ ++VDC + GC +++AFEYI +
Sbjct: 141 GQCGSCWAFSAVAATEGITKLSTGKLISLSEQEVVDCDVTSDDQGCNGGEMDDAFEYIIK 200
Query: 167 YQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVA 226
+ + +E YPY+ D C+ A+ +I GY+ V +E L + QP++VA
Sbjct: 201 NKGITTEANYPYKA-ADGTCN--TKKAASHAASITGYEDVTVNSEAALLKAAANQPIAVA 257
Query: 227 IDATWFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
IDA F F Y GVFTG CG +HGVT+VGYG T++ YWLVKN WGT+W E G
Sbjct: 258 IDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVGYGATSDG---TKYWLVKNSWGTSWGEDG 314
Query: 285 SMRIFRGVGG-SGLCNIAANAAYP 307
+R+ R V GLC IA +A+YP
Sbjct: 315 YIRMERDVDAKEGLCGIAMDASYP 338
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 224 bits (570), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 180/318 (56%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP---PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++FLA +TG P + P S+ +L+ M ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTELKINDLSDDDMP--SNLDWIESGAVTQVKHQGRC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGIS 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 212 RESDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQ 266
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 267 DLQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 291 GVGG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 324 DYGNPAGLCDIAKMSSYP 341
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 184/317 (58%), Gaps = 33/317 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ +HE WMVE+ R YKD AEK RF+ FK N F L +N+FADLT E
Sbjct: 32 MVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFADLTTE 91
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F A+ G+KP P + FK N S + ++DW +GAVTP+K+QG CCW
Sbjct: 92 EFKAN-KGFKPTAEKVPTTG----FKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCW 146
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASE 173
AF+AVA +EG+ K+ TG L++ S+ +LVDC T + GC ++++AFE++ + LA+E
Sbjct: 147 AFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATE 206
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--W 231
YPY+ D C SA+ I+G++ V E L V+ QPVSVA+DA+
Sbjct: 207 SNYPYKA-VDGKCKGGSKSAA----TIKGHEDVPVNNEAALMKAVANQPVSVAVDASDRT 261
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y GGV TG CG +HG+ +GYG E++G + YW++KN WGT W E G +R+ +
Sbjct: 262 FMLYSGGVMTGSCGTELDHGIAAIGYG--MESDGTK-YWILKNSWGTTWGEKGFLRMEKD 318
Query: 292 VGGS-GLCNIAANAAYP 307
+ G+C +A +YP
Sbjct: 319 ITDKRGMCGLAMKPSYP 335
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 223 bits (569), Expect = 6e-56, Method: Compositional matrix adjust.
Identities = 135/328 (41%), Positives = 187/328 (57%), Gaps = 41/328 (12%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLN 49
S S + +I ++++WM ++ R YK + E E RF I++ N ++ L N
Sbjct: 6 SLGSSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAEN 65
Query: 50 KFADLTREKFLASYTGYKP---PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
FADLT E+F A+Y GYK P T + N N N +DW + GAVTP
Sbjct: 66 NFADLTNEEFKATYLGYKTVSIPDTCFRYGNMVNLPTN-----------VDWRQEGAVTP 114
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFE 162
+K+QG CWAF+AVA VEG+NKI+ G+L++ S+ +LVDC + GC ++ AFE
Sbjct: 115 IKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFE 174
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I++ L +E YPYQG + C+ + ++ +I GY+ V E+ L+ V+ QP
Sbjct: 175 FIKR-TGLTTEIEYPYQGAES-ACNEQKEKY--QFVSISGYEKVPVNDEKSLKAAVANQP 230
Query: 223 VSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSVAIDA F FY GG+F+G CGN NHGV IVGYG T+ Q YWLVKN WGT+W
Sbjct: 231 VSVAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETS----NQAYWLVKNSWGTDW 286
Query: 281 DEGGSMRIFR-GVGGSGLCNIAANAAYP 307
E G +R+ R G C IA A+YP
Sbjct: 287 GESGYIRMKRDSTDKQGTCGIAMMASYP 314
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 180/315 (57%), Gaps = 31/315 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
++FLA +TG P + S +L+ M ++DW E GAVT VK+QG CC
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSP----INDLSDDDMP--SNLDWRESGAVTQVKNQGQCGCC 147
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASEC 174
WAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 148 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRES 207
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FN 233
Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 208 DYEYLGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDLQ 262
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GG + G C N NH VT +GYGT E Q YWL+KN WGT+W E G M+I R G
Sbjct: 263 FYAGGTYDGSCANRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGEDGFMKIIRDSG 319
Query: 294 G-SGLCNIAANAAYP 307
+GLC+IA ++YP
Sbjct: 320 NPAGLCDIAKVSSYP 334
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 132/315 (41%), Positives = 180/315 (57%), Gaps = 31/315 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
++FLA +TG P + S +L+ M ++DW E GAVT VK+QG CC
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSP----INDLSDDDMP--SNLDWRESGAVTQVKNQGQCGCC 147
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASEC 174
WAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 148 WAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRES 207
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FN 233
Y Y G+Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 208 DYEYLGQQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDLQ 262
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GG + G C N NH VT +GYGT E Q YWL+KN WGT+W E G M+I R G
Sbjct: 263 FYAGGTYDGSCANRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGEDGFMKIIRDSG 319
Query: 294 G-SGLCNIAANAAYP 307
+GLC+IA ++YP
Sbjct: 320 NPAGLCDIAKVSSYP 334
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 223 bits (569), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 182/317 (57%), Gaps = 32/317 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ A +E W+VE ++Y EKEMRF+IFK+N L LN+FADLT E
Sbjct: 40 VMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDE 99
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG-SYCCW 116
++ ++Y G+K P + SN + + + + +DW GAV VKDQG CW
Sbjct: 100 EYRSTYLGFK----SGPKAKVSNRY--VPKVGVVLPNYVDWRTVGAVVGVKDQGLCSSCW 153
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASE 173
AF+AVA VEG+NKI TG L++ S+ +LVDC GC + ++ +AF++I + +E
Sbjct: 154 AFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQRTRGCNRGYMNDAFQFIIDNGGINTE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
YPY QD CDW+R + +Y I Y+ + E LQ+ V+ QP++V +++
Sbjct: 214 DNYPYTA-QDGQCDWYRKNQ--RYVTIDNYEQLPANNEWVLQNAVAYQPITVGLESEGGK 270
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G++TG CG +HGVTIVGYGT E YW+VKN WGTNW E G +RI R
Sbjct: 271 FKLYTSGIYTGYCGTAIDHGVTIVGYGT----ERGLDYWIVKNSWGTNWGENGYIRIQRN 326
Query: 292 VGGSGLCNIAANAAYPL 308
+GG+G C IA +YP+
Sbjct: 327 IGGAGKCGIAMVPSYPV 343
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 223 bits (568), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 179/318 (56%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP---PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++FLA +TG P + P S+ +L+ M ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMP--SNLDWRESGAVTQVKHQGRC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGIS 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 212 RESDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQ 266
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 267 DLQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 291 GVGG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 324 DSGNPAGLCDIAKMSSYP 341
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 223 bits (568), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 139/319 (43%), Positives = 182/319 (57%), Gaps = 30/319 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ ++HE+WM E RTY ++ EK R ++F+ N + L N+FADLT E
Sbjct: 40 MVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADLTDE 99
Query: 58 KFLASYTGYK-PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+F A+ TG + PP + + F+ N S S+DW GAVT VKDQGS CC
Sbjct: 100 EFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSCGCC 159
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
WAF+AVA VEGL KIRTG+LV+ S+ QLVDC GCA ++NAFEY+ L +
Sbjct: 160 WAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGGLTT 219
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPY+G D C S+AS IRGY+ V E L V+ QPVSVAI+ +
Sbjct: 220 ESSYPYRG-TDGSCRRSASAAS-----IRGYEDVPANNEAALMAAVAHQPVSVAINGGDS 273
Query: 231 WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GV G CG NH +T GYGT ++ YW++KN WG +W EGG +RI
Sbjct: 274 VFRFYDSGVLGGSGCGTELNHAITAAGYGTASDG---TKYWIMKNSWGGSWGEGGYVRIR 330
Query: 290 RGVGGSGLCNIAANAAYPL 308
RGV G G+C +A A+YP+
Sbjct: 331 RGVRGEGVCGLAQLASYPV 349
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 130/326 (39%), Positives = 179/326 (54%), Gaps = 47/326 (14%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKFL 60
+E+W R ++ EK RF FK+N F LRLN+F D+ E+F
Sbjct: 46 YERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGDMGPEEFR 104
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMS-------FYD-------SIDWNERGAVTP 106
+++ +R N + S + YD S+DW + GAVT
Sbjct: 105 STFA-----------DSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHGAVTA 153
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYI 164
VK+QG CWAF+ V VEG+N IRTG LV+ S+ +LVDC T NGC +ENAF++I
Sbjct: 154 VKNQGRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDTAENGCQGGLMENAFDFI 213
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ Y + +E YPY+ + CD R+ + +I G+Q V +E+ L V+RQPVS
Sbjct: 214 KSYGGITTESAYPYRA-SNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVS 272
Query: 225 VAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAIDA F FY GVFTG CG +HGV +VGYG + + +G PYW+VKN WG +W E
Sbjct: 273 VAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVS-DVDGT-PYWIVKNSWGPSWGE 330
Query: 283 GGSMRIFRGVGGSGLCNIAANAAYPL 308
GG +R+ RG G GLC IA A++P+
Sbjct: 331 GGYIRMQRGAGNGGLCGIAMEASFPI 356
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 180/318 (56%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP---PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++FLA +TG P + P S+ +L+ M ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMP--SNLDWRESGAVTQVKHQGRC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGIS 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E Y Y G+Q Y C RS I Y+ V P E L V++QPVS+ I A+
Sbjct: 212 RESDYEYLGQQ-YTC---RSQEKTAAVQISSYKVV-PEGETSLLQAVTKQPVSIGIAASQ 266
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 267 DLQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 291 GVGG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 324 DSGNPSGLCDIAKMSSYP 341
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 223 bits (568), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 178/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++E KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEVAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKMSSYP 341
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 130/327 (39%), Positives = 182/327 (55%), Gaps = 26/327 (7%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------L 46
+SR + KH++WM E RTY D EK R+ +FK+N E L
Sbjct: 24 LSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRNVERIERLNNVPAGRTFKL 83
Query: 47 RLNKFADLTREKFLASYTGYKPPPTDHPHS-NRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
+N+FADLT ++F YTGYK S +S F+ N + ++DW ++GAVT
Sbjct: 84 AVNQFADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNVFFGALPIAVDWRKKGAVT 143
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEY 163
P+K+QGS CCWAF+AVA +EG +I+ G+L++ S+ QLVDC T + GC+ ++ AFE+
Sbjct: 144 PIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCSGGLMDTAFEH 203
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I L +E YPY+G +D C + S +I GY+ V E L V+ QPV
Sbjct: 204 IMATGGLTTESNYPYKG-EDANCKIKSTKPSA--ASITGYEDVPVNDENALMKAVAHQPV 260
Query: 224 SVAIDATWFN--FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SV I+ F+ FY GVFTG C +H VT VGY +++ YW++KN WGT W
Sbjct: 261 SVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGY---SQSSAGSKYWIIKNSWGTKWG 317
Query: 282 EGGSMRIFRGV-GGSGLCNIAANAAYP 307
EGG MRI + + GLC +A A+YP
Sbjct: 318 EGGYMRIKKDIKDKEGLCGLAMKASYP 344
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 135/328 (41%), Positives = 187/328 (57%), Gaps = 41/328 (12%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLN 49
S S + +I ++++WM ++ R YK + E E RF I++ N ++ L N
Sbjct: 6 SLGSSCSSDIQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSMNHSHTLAEN 65
Query: 50 KFADLTREKFLASYTGYKP---PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
FADLT E+F A+Y GYK P T + N N N +DW + GAVTP
Sbjct: 66 NFADLTNEEFKATYLGYKTVSIPDTCFRYGNMVNLPTN-----------VDWRQEGAVTP 114
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFE 162
+K+QG CWAF+AVA VEG+NKI+ G+L++ S+ +LVDC + GC ++ AFE
Sbjct: 115 IKNQGQCGSCWAFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFE 174
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I++ L +E YPYQG + C+ + ++ +I GY+ V E+ L+ V+ QP
Sbjct: 175 FIKR-TGLTTEIEYPYQGAES-ACNEQKEKY--QFVSISGYEKVPVNDEKSLKAAVANQP 230
Query: 223 VSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSVAIDA F FY GG+F+G CGN NHGV IVGYG T+ Q YWLVKN WGT+W
Sbjct: 231 VSVAIDAEGNNFQFYSGGIFSGNCGNQLNHGVAIVGYGETS----NQAYWLVKNSWGTDW 286
Query: 281 DEGGSMRIFR-GVGGSGLCNIAANAAYP 307
E G +R+ R G C IA A+YP
Sbjct: 287 GESGYIRMKRDSTDRQGTCGIAMMASYP 314
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 178/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I++ ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I YQ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYQVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
F GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFCAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDY 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKMSSYP 341
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 178/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I Y+ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSQEKTAAVQISSYKVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDY 325
Query: 293 GG-SGLCNIAANAAYP 307
G +GLC+IA ++YP
Sbjct: 326 GNPAGLCDIAKMSSYP 341
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 178/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I Y+ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYKVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+I ++YP
Sbjct: 326 GDPSGLCDITKMSSYP 341
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 132/329 (40%), Positives = 184/329 (55%), Gaps = 30/329 (9%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
+SR + + +HE+WM ++ + YKD AEKE RF++FK N +F L
Sbjct: 21 ISRVMSRGLITSERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLS 80
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N+FADL E+F A + + + ++ F+ N +K+ ++DW +RGAVTP+
Sbjct: 81 INQFADLHDEEFKALLNNVQKKASRVETATETS-FRYENVTKIP--STMDWRKRGAVTPI 137
Query: 108 KDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEY 163
KDQG C CWAF VATVE L++I TG+LV+ S+ +LVDC GC ++ENAFE+
Sbjct: 138 KDQGYTCGSCWAFATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEF 197
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + SE YPY+G+ D C + + I GY+ V +E+ L V+ QPV
Sbjct: 198 IANKGGITSEAYYPYKGK-DRSCKVKKETHG--VARIIGYESVPSNSEKALLKAVANQPV 254
Query: 224 SVAID--ATWFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
SV ID A F FY G+F CG +H V +VGYG + YWLVKN W T W
Sbjct: 255 SVYIDAGAIAFKFYSSGIFEARNCGTHLDHAVAVVGYGKLRDG---TKYWLVKNSWSTAW 311
Query: 281 DEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
E G MRI R + GLC IA+NA+YP+
Sbjct: 312 GEKGYMRIKRDIRAKKGLCGIASNASYPI 340
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 223 bits (567), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 179/318 (56%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP---PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++FLA +TG P + P S+ +L+ M ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLSDDDMP--SNLDWRESGAVTQVKHQGRC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGIS 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E Y Y G Q Y C RS I Y+ V P E L V++QPVS+ I A+
Sbjct: 212 RESDYEYLGEQ-YTC---RSQEKTAAVQISSYKVV-PEGETSLLQAVTKQPVSIGIAASQ 266
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 267 DLQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 291 GVGG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 324 DSGNPSGLCDIAKMSSYP 341
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 132/321 (41%), Positives = 182/321 (56%), Gaps = 33/321 (10%)
Query: 7 KTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFAD 53
+ +I KHE+WM F R Y D EKE+R+KIFK+N + L +N+FAD
Sbjct: 31 QDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRIESFNKASEKSYKLGINQFAD 90
Query: 54 LTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
LT E+F S +K H S+++ F+ N + + S+DW + GAVT +KDQG
Sbjct: 91 LTNEEFKTSRNRFKG----HMCSSQAGPFRYENITAVP--SSMDWRKEGAVTAIKDQGQC 144
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQR 169
CWAF+AVA VEG+ ++ T +L++ S+ +LVDC T GC +++AF++I Q Q
Sbjct: 145 GSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAFKFIEQNQG 204
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L +E YPY+G D C+ + I G++ V E L V++QPVSVAIDA
Sbjct: 205 LTTEANYPYEG-SDGTCN--TKQEANHAAKINGFEDVPANNEGALMKAVAKQPVSVAIDA 261
Query: 230 TW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F FY G+FTG CG +HGV VGYG E+ G YWLVKN WGT W E G +R
Sbjct: 262 GGFEFQFYSSGIFTGDCGTELDHGVAAVGYG---ESNGMN-YWLVKNSWGTQWGEEGYIR 317
Query: 288 IFRGVGG-SGLCNIAANAAYP 307
+ + + GLC IA A+YP
Sbjct: 318 MQKDIDAKEGLCGIAMQASYP 338
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 222 bits (566), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 131/318 (41%), Positives = 179/318 (56%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKVERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP---PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++FLA +TG P + P S+ +L+ M ++DW E GAVT VK QG
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTELKINDLSDDDMP--SNLDWRESGAVTQVKHQGRC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CCWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++
Sbjct: 152 GCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGIS 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E Y Y G Q Y C RS I Y+ V P E L V++QPVS+ I A+
Sbjct: 212 RESDYEYLGEQ-YTC---RSQEKTAAVQISSYKVV-PEGETSLLQAVTKQPVSIGIAASQ 266
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 267 DLQFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIR 323
Query: 291 GVGG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 324 DSGNPSGLCDIAKMSSYP 341
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 176/322 (54%), Gaps = 38/322 (11%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADL 54
+ A+HEQWMV+ R YKD+ +K RF +FK N +F L +N+FADL
Sbjct: 37 MVARHEQWMVQHGRVYKDETDKAHRFLVFKANVKFIESFNAAAAAGNRKFWLGVNQFADL 96
Query: 55 TREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
T ++F A+ T G+ P P F+ N S + ++DW +GAVTP+KDQG
Sbjct: 97 TNDEFRATKTNKGFNPNVVKVPTG-----FRYQNLSIDALPQTVDWRTKGAVTPIKDQGQ 151
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQ 168
CCWAF+AVA EG+ KI TG+L + S+ +LVDC GC +++AF++I +
Sbjct: 152 CGCCWAFSAVAATEGIVKISTGKLTSLSEQELVDCDVHGEDQGCNGGEMDDAFKFIIKNG 211
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID 228
L +E YPY QD C S S I+GY+ V E L V+ QPVSVA+D
Sbjct: 212 GLTTESNYPYTA-QDGQC----KSGSNGAATIKGYEDVPANDEAALMKAVASQPVSVAVD 266
Query: 229 A--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F FY GGV TG CG +HG+ +GYG T++ YWL+KN WGT W E G +
Sbjct: 267 GGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGKTSDG---TKYWLMKNSWGTTWGENGFL 323
Query: 287 RIFRGVGG-SGLCNIAANAAYP 307
R+ + + G+C +A +YP
Sbjct: 324 RMEKDIADKKGMCGLAMQPSYP 345
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 127/317 (40%), Positives = 184/317 (58%), Gaps = 32/317 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ +HE WMVE+ R YKD AEK RF+ FK N F L +N+FADLT E
Sbjct: 32 MVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAFVESFNTNKKNKFWLGVNQFADLTTE 91
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F A+ G+KP + + FK N S + ++DW +GAVTP+K+QG CCW
Sbjct: 92 EFKAN-KGFKPISAEMVPTTG---FKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCW 147
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASE 173
AF+AVA +EG+ K+ TG L++ S+ +LVDC T + GC ++++AFE++ + LA+E
Sbjct: 148 AFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATE 207
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--W 231
YPY+ D C SA+ I+G++ V E L V+ QPVSVA+DA+
Sbjct: 208 SSYPYKA-VDGKCKGGSKSAA----TIKGHEDVPVNDEAALMKAVANQPVSVAVDASDRT 262
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y GGV TG CG +HG+ +GYG E++G + YW++KN WGT W E G +R+ +
Sbjct: 263 FMLYSGGVMTGSCGTELDHGIAAIGYG--VESDGTK-YWILKNSWGTTWGEKGFLRMEKD 319
Query: 292 VGGS-GLCNIAANAAYP 307
+ G+C +A +YP
Sbjct: 320 ISDKQGMCGLAMKPSYP 336
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 124/311 (39%), Positives = 170/311 (54%), Gaps = 23/311 (7%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+E+W R ++ EK RF FK+N F LRLN+F D+ RE+F +
Sbjct: 42 YERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDMGREEFRS 100
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
+ + + + S+DW ++GAVT VK+QG CWAF+
Sbjct: 101 GFADSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTAVKNQGRCGSCWAFST 160
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
V VEG+N IRTG LV+ S+ +L+DC T NGC +ENAFE+I+ + + +E YPY
Sbjct: 161 VVAVEGINAIRTGSLVSLSEQELIDCDTDENGCQGGLMENAFEFIKSHGGITTESAYPYH 220
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHG 237
+ CD R+ G+ AI G+Q V +E+ L V+ QPVSVAIDA FY
Sbjct: 221 A-SNGTCDGARAR-RGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQALQFYSE 278
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
GVFTG CG +HGV VGYG + + PYW+VKN WG +W EGG +R+ RG G GL
Sbjct: 279 GVFTGDCGTDLDHGVAAVGYGVSDDG---TPYWIVKNSWGPSWGEGGYIRMQRGTGNGGL 335
Query: 298 CNIAANAAYPL 308
C IA A++P+
Sbjct: 336 CGIAMEASFPI 346
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 178/319 (55%), Gaps = 34/319 (10%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFK-----------KNHEF-LRLNKFADLTRE 57
++ A+HE WM ++ R+YKD AEK+ +F++FK KNH+F L +N+FAD+T E
Sbjct: 32 SMVARHESWMSQYGRSYKDAAEKDRKFEVFKANAAFIDSFNAKNHKFWLGINQFADITNE 91
Query: 58 KFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F + T G+ S F N S + +IDW +GAVTPVKDQG C
Sbjct: 92 EFKVTKTNKGFISNKV-----RASTGFSYENVSIDALPATIDWRTKGAVTPVKDQGQCGC 146
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+AVA EG+ K+ TG+LV+ S+ +LVDC GC +++AF++I L
Sbjct: 147 CWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIITNGGLT 206
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
E YPY +D C S S G I+ Y+ V E L V+ QPVSVA+D
Sbjct: 207 QESSYPYDA-EDGKC----KSGSKSAGTIKSYEDVPANNEGALMKAVANQPVSVAVDGGD 261
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GGV TG CG +HG+ +GYG T++ YWL+KN WGT+W E G +R+
Sbjct: 262 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDG---TKYWLMKNSWGTSWGENGFLRME 318
Query: 290 RGVGG-SGLCNIAANAAYP 307
+ + G+C +A +YP
Sbjct: 319 KDIADKKGMCGLAMEPSYP 337
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 135/329 (41%), Positives = 187/329 (56%), Gaps = 38/329 (11%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
M+RT ++ KHE+WM F R Y D EKE+R+KIFK+N + L
Sbjct: 26 MARTLQDA-SMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRIESFNKASGKSYKLG 84
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFK--NLNSSKMSFYDSIDWNERGAVT 105
+N+FADLT E+F S +K H S+++ F+ NL ++ S+DW ++GAVT
Sbjct: 85 INQFADLTNEEFKTSRNRFKG----HMCSSQAGPFRYENLTAAP----SSMDWRKKGAVT 136
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAF 161
+KDQG CWAF+AVA VEG+ ++ T +L++ S+ +LVDC T GC +++AF
Sbjct: 137 AIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQGCQGGLMDDAF 196
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
++I Q Q L +E YPY+G D C+ + I G++ V E L V++Q
Sbjct: 197 KFIEQNQGLTTEANYPYEG-SDGTCN--TKQEANHAAKINGFEDVPANNEGALMKAVAKQ 253
Query: 222 PVSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
PVSVAIDA F FY G+FTG CG +HGV VGYG E+ G YWLVKN WGT
Sbjct: 254 PVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYG---ESNGMN-YWLVKNSWGTQ 309
Query: 280 WDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
W E G +R+ + + GLC IA A+YP
Sbjct: 310 WGEEGYIRMQKDIDAKEGLCGIAMQASYP 338
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 222 bits (565), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 128/316 (40%), Positives = 174/316 (55%), Gaps = 28/316 (8%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+E+W R ++ EK RF FK+N F LRLN+F D+ RE+F +
Sbjct: 88 YERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRS 146
Query: 62 SYTGYKPPPT---DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CW 116
++ + D P + + S S+DW + GAVT VKDQG +C CW
Sbjct: 147 TFADSRINDLRRQDSPAARAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQG-HCGSCW 205
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECV 175
AF+ V VEG+N IRTG L + S+ +L+DC T NGC +ENAFE+I+ + + +E
Sbjct: 206 AFSTVVAVEGINAIRTGSLASLSEQELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAA 265
Query: 176 YPYQGRQDYYCDWWRSS-ASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY+ + CD R+ G I G+Q V +E+ L V+ QPVSVA+DA F
Sbjct: 266 YPYRA-SNGTCDGDRARRGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAF 324
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTG CG +HGV VGYG + PYW+VKN WGT+W EGG +R+ RG
Sbjct: 325 QFYSEGVFTGDCGTDLDHGVAAVGYGVGDDG---TPYWIVKNSWGTSWGEGGYIRMQRGA 381
Query: 293 GGSGLCNIAANAAYPL 308
G GLC IA A++P+
Sbjct: 382 GNGGLCGIAMEASFPI 397
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 125/319 (39%), Positives = 179/319 (56%), Gaps = 34/319 (10%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTRE 57
++ A+HE WM+++ R YKD AEK +F++FK N EF L +N+FAD+T E
Sbjct: 32 SMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAEFINSFNAGNHKFWLGINQFADITNE 91
Query: 58 KFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F A+ T G+ P F N S + +IDW +GAVTP+KDQG C
Sbjct: 92 EFKATKTNKGFISNKVRVPTG-----FMYENMSFDALPATIDWRTKGAVTPIKDQGQCGC 146
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+AVA +EG+ K+ TG+LV+ S+ +LVDC GC +++AF++I + L
Sbjct: 147 CWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLT 206
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
E YPY D C SSA+ I+ Y+ V E L V+ QPVSVA+D
Sbjct: 207 QESNYPYDA-ADGKCKSGSSSAA----TIKSYEDVPANNEGALMKAVANQPVSVAVDGGD 261
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GGV TG CG +HG+ +GYGTT++ +W++KN WGT+W E G +R+
Sbjct: 262 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGTTSDG---TKFWIMKNSWGTSWGENGFLRME 318
Query: 290 RGVGG-SGLCNIAANAAYP 307
+ + G+C +A +YP
Sbjct: 319 KDIADKKGMCGLAMEPSYP 337
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 221 bits (564), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 177/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F L +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P + P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLSDDYMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC + NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGLMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G Q Y C RS I Y+ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGEQ-YTC---RSREKTAAVQISSYKVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGNCADQINHAVTAIGYGTDEEG---QKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GDPSGLCDIAKMSSYP 341
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 124/319 (38%), Positives = 175/319 (54%), Gaps = 35/319 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ A+HEQWM +++R YKD +EK RF++FK N +F L +N+FADLT +
Sbjct: 33 MVARHEQWMAQYSRVYKDASEKARRFEVFKANVKFIESFNAGGNNKFWLGVNQFADLTND 92
Query: 58 KF--LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F + + G+K P F+ N S + +IDW +GAVTP+KDQG C
Sbjct: 93 EFRSIKTNKGFKSSNMKIPTG-----FRYENVSVDALPTTIDWRTKGAVTPIKDQGQCGC 147
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+AVA EG+ KI TG+LV+ ++ +LVDC GC +++AF++I L
Sbjct: 148 CWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGEDQGCEGGLMDDAFKFIINNGGLT 207
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
+E YPY D C S S I+GY+ V E L V+ QPVSVA+D
Sbjct: 208 TESSYPYTA-ADGKC----KSGSNSAATIKGYEDVPANDEAALMKAVANQPVSVAVDGGD 262
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GV TG CG +HG+ +GYG T++ YWL+KN WGT W E G +R+
Sbjct: 263 MTFQFYSSGVMTGSCGTDLDHGIAAIGYGKTSDG---TKYWLMKNSWGTTWGENGYLRME 319
Query: 290 RGVGGS-GLCNIAANAAYP 307
+ + G+C +A +YP
Sbjct: 320 KDISDKRGMCGLAMEPSYP 338
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 221 bits (564), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 128/327 (39%), Positives = 184/327 (56%), Gaps = 34/327 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-----------HEF-LRLN 49
+R + A+HE+WM ++ R YKD EK RF+IFK N H+F L +N
Sbjct: 24 AREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKFWLSVN 83
Query: 50 KFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+FADLT +F A+ T G+ P P + F+ N S + ++DW +GAVTP+
Sbjct: 84 QFADLTNYEFRATKTNKGFIPSTVRVPTT-----FRYENVSIDTLPATVDWRTKGAVTPI 138
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CCWAF+AVA +EG+ K+ TG+L++ S+ +LVDC GC +++AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + L +E YPY D C+ +SA+ I+GY+ V E L V+ QPV
Sbjct: 199 IIKNGGLTTESKYPYTA-ADGKCNGGSNSAA----TIKGYEDVPANNEAALMKAVANQPV 253
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVA+D F FY GGV TG CG +HG+ +GYG + +G Q YWL+KN WGT W
Sbjct: 254 SVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQ-YWLLKNSWGTTWG 310
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ + + G+C +A +YP
Sbjct: 311 ENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 128/316 (40%), Positives = 174/316 (55%), Gaps = 28/316 (8%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+E+W R ++ EK RF FK+N F LRLN+F D+ RE+F +
Sbjct: 44 YERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRS 102
Query: 62 SYTGYKPPPT---DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CW 116
++ + D P + + S S+DW + GAVT VKDQG +C CW
Sbjct: 103 TFADSRINDLRRQDSPAARAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKDQG-HCGSCW 161
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECV 175
AF+ V VEG+N IRTG L + S+ +L+DC T NGC +ENAFE+I+ + + +E
Sbjct: 162 AFSTVVAVEGINAIRTGSLASLSEQELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAA 221
Query: 176 YPYQGRQDYYCDWWRSS-ASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY+ + CD R+ G I G+Q V +E+ L V+ QPVSVA+DA F
Sbjct: 222 YPYRA-SNGTCDGDRARRGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAF 280
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTG CG +HGV VGYG + PYW+VKN WGT+W EGG +R+ RG
Sbjct: 281 QFYSEGVFTGDCGTDLDHGVAAVGYGVGDDG---TPYWIVKNSWGTSWGEGGYIRMQRGA 337
Query: 293 GGSGLCNIAANAAYPL 308
G GLC IA A++P+
Sbjct: 338 GNGGLCGIAMEASFPI 353
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 144/330 (43%), Positives = 180/330 (54%), Gaps = 38/330 (11%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
M R IA KHEQWM RTY D AEKE RF+IFK N +++
Sbjct: 26 MPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQIFKNNLDYIENFNKAFNKTYKLG 85
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSN---RSNWFKNLNSSKMSFYDSIDWNERGAV 104
LNKF+DL+ E+F+ +Y GY+ P T P +N + +F N + +SIDW E G V
Sbjct: 86 LNKFSDLSEEEFVTTYNGYEMPTT-LPTANTTVKPTFFSNYYNQD-EVPESIDWRENGVV 143
Query: 105 TPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFE 162
T VK+QG CCWAF+AVA VEG+ G + S QL+DC N GC + AFE
Sbjct: 144 TSVKNQGECGCCWAFSAVAAVEGI----AGNGASLSAQQLLDCVGDNSGCGGGTMIKAFE 199
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
YI Q Q + S+ YPY+ Q+ C S S I GY+ V +EE L+ V++QP
Sbjct: 200 YIVQNQGIVSDTDYPYEQTQE-MC----RSGSNVAARITGYESV-IQSEEALKRAVAKQP 253
Query: 223 VSVAIDATW---FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGT 278
+SVAIDA+ F Y GVF+ CG H VT+VGYGTT E YWLVKN WG
Sbjct: 254 ISVAIDASSGPNFKSYISGVFSAEDCGTHLTHAVTLVGYGTT---EDGTKYWLVKNSWGE 310
Query: 279 NWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
W E G MR+ R VG G C IA A+YP
Sbjct: 311 EWGESGYMRLQRDVGAMEGPCGIAMQASYP 340
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 128/312 (41%), Positives = 178/312 (57%), Gaps = 28/312 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E W+V ++Y E+E RF+IFK N + L LNKFADLT E++ +
Sbjct: 46 ESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEYRSK 105
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
YTG K S +S + L S S +S+DW E GAV VKDQGS CWAF+ +
Sbjct: 106 YTGIKSKDLRKKVSAKSGRYATL--SGESLPESVDWRESGAVATVKDQGSCGSCWAFSTI 163
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+ VEG+N+I TG+L+T S+ +LVDC S GC ++ AFE+I + ++ YPY
Sbjct: 164 SAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINNGGIDTDVDYPYT 223
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
GR D CD +R +A K I Y+ V E L+ + QP+SVAI+A+ F FY
Sbjct: 224 GR-DGKCDQYRKNA--KVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYDS 280
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
G+FTG CG +HGV +VGYGT E + YW+V+N WG +W E G +R+ RG+ +G
Sbjct: 281 GIFTGKCGIALDHGVVVVGYGT----ENGKDYWIVRNSWGADWGENGYLRMERGISSKTG 336
Query: 297 LCNIAANAAYPL 308
+C IA +YP+
Sbjct: 337 ICGIAIEPSYPV 348
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 128/327 (39%), Positives = 184/327 (56%), Gaps = 34/327 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-----------HEF-LRLN 49
+R + A+HE+WM ++ R YKD EK RF+IFK N H+F L +N
Sbjct: 24 AREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKFWLGVN 83
Query: 50 KFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+FADLT +F A+ T G+ P P + F+ N S + ++DW +GAVTP+
Sbjct: 84 QFADLTNYEFRATKTNKGFIPSTVRVPTT-----FRYENVSIDTLPATVDWRTKGAVTPI 138
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CCWAF+AVA +EG+ K+ TG+L++ S+ +LVDC GC +++AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + L +E YPY D C+ +SA+ I+GY+ V E L V+ QPV
Sbjct: 199 IIKNGGLTTESKYPYTA-ADGKCNGGSNSAA----TIKGYEEVPANNEAALMKAVANQPV 253
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVA+D F FY GGV TG CG +HG+ +GYG + +G Q YWL+KN WGT W
Sbjct: 254 SVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQ-YWLLKNSWGTTWG 310
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ + + G+C +A +YP
Sbjct: 311 ENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 179/314 (57%), Gaps = 26/314 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKF 59
+H +WM + R Y D EK R+ +FK N E L +N+FADLT ++F
Sbjct: 37 RHIEWMTKHGRVYADVKEKSNRYVVFKSNVERIEHLNNIPAGRTFKLAVNQFADLTNDEF 96
Query: 60 LASYTGYKPPPTDHPHS-NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ YTG+K + S ++ F+ N S + S+DW +GAVTP+K+QGS CCWA
Sbjct: 97 RSMYTGFKGVSSLSSQSQTKTTSFRYQNVSSGALPISVDWRTKGAVTPIKNQGSCGCCWA 156
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVY 176
F+AVA +EG +I+ G+L++ S+ QLVDC T + GC ++ AFE+I L +E Y
Sbjct: 157 FSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCEGGLMDTAFEHIMATGGLTTESNY 216
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN--F 234
PY+G +D C+ +++ K +I GY+ V E+ L V+ QPVSV I+ F+ F
Sbjct: 217 PYKG-EDATCNSKKTNP--KATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQF 273
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG C +H VT +GYG +T YW++KN WGT W E G MRI + +
Sbjct: 274 YSSGVFTGECTTYLDHAVTAIGYGQSTNGS---KYWIIKNSWGTKWGESGYMRIQKDIKD 330
Query: 295 S-GLCNIAANAAYP 307
GLC +A A+YP
Sbjct: 331 KQGLCGLAMKASYP 344
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 221 bits (563), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 184/317 (58%), Gaps = 32/317 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ +HE WMVE+ R YKD AEK RF++FK N F L +N+FADLT E
Sbjct: 32 MVERHENWMVEYGRVYKDAAEKARRFEVFKDNVAFVESFNTNKNNKFWLGINQFADLTIE 91
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F A+ G+KP + + FK N S + ++DW +GAVTP+K+QG CCW
Sbjct: 92 EFKAN-KGFKPISAEKVPTTG---FKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCCW 147
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASE 173
AF+AVA +EG+ K+ TG L++ S+ +LVDC T + GC ++++AFE++ + LA+
Sbjct: 148 AFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATV 207
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--W 231
YPY+ D C SA+ I+G++ V E L V+ QPVSVA+DA+
Sbjct: 208 SSYPYKA-VDGKCKGGSKSAA----TIKGHEDVPVNDEAALMKAVANQPVSVAVDASDRT 262
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y GGV TG CG +HG+ +GYG E++G + YW++KN WGT W E G +R+ +
Sbjct: 263 FMLYSGGVMTGSCGTELDHGIAAIGYG--VESDGTK-YWILKNSWGTTWGEKGFLRMEKD 319
Query: 292 VGGS-GLCNIAANAAYP 307
+ G+C +A +YP
Sbjct: 320 ISDKQGMCGLAMKPSYP 336
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 177/319 (55%), Gaps = 34/319 (10%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-----------HEF-LRLNKFADLTRE 57
++ A+HE WM+++ R YKD AEK +F++FK N H+F L +N+FAD+T +
Sbjct: 32 SMVARHESWMLQYGRVYKDAAEKASKFEVFKANAGFIDSFNAGNHKFWLGINQFADITNK 91
Query: 58 KFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F A+ T G+ P F N S + SIDW +GAVTPVKDQG C
Sbjct: 92 EFKATKTNKGFISNKVRAPTG-----FSYENVSFDALPASIDWRTKGAVTPVKDQGQCGC 146
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+AVA EG+ K+ TG+LV+ S+ +LVDC GC +++AF++I L
Sbjct: 147 CWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGEDQGCEGGLMDDAFKFIISNGGLT 206
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
E YPY +D C S S G I+ Y+ V E L V+ QPVSVA+D
Sbjct: 207 QESSYPYDA-EDGKC----KSGSKSAGTIKSYEDVPANNEGALMKAVANQPVSVAVDGGD 261
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GGV TG CG +HG+ +GYG T++ YWL+KN WGT+W E G +R+
Sbjct: 262 MTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSDG---TKYWLMKNSWGTSWGENGFLRME 318
Query: 290 RGVGG-SGLCNIAANAAYP 307
+ + G+C +A +YP
Sbjct: 319 KDIADKKGMCGLAMEPSYP 337
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 221 bits (563), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 129/317 (40%), Positives = 178/317 (56%), Gaps = 32/317 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ A +E W+VE ++Y EKEMRF+IFK+N L LN+FADLT E
Sbjct: 38 VMAMYESWLVEHGKSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDE 97
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG-SYCCW 116
++ ++Y G K P ++ SN + + + D +DW GAV VK+QG CW
Sbjct: 98 EYRSTYLGLK----RGPKTDVSNQY--MPKVGDALPDYVDWRTVGAVVGVKNQGLCSSCW 151
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASE 173
AF+AVA VEG+NKI TG L++ S+ +LVDC GC + + +AF++I + +E
Sbjct: 152 AFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQITKGCNRGLMTDAFKFIINNGGINTE 211
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
YPY + D C+ S + KY I Y+ V E L+ V+ QPVSV +++
Sbjct: 212 NNYPYTAK-DGQCNL--SLKNQKYVTIDSYKNVPSNNEMALKKAVAYQPVSVGVESEGGK 268
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+FTG CG +HGVTIVGYGT E YW+VKN WGTNW E G +RI R
Sbjct: 269 FKLYTSGIFTGSCGTAVDHGVTIVGYGT----ERGMDYWIVKNSWGTNWGESGYIRIQRN 324
Query: 292 VGGSGLCNIAANAAYPL 308
+GG+G C IA +YP+
Sbjct: 325 IGGAGKCGIAKMPSYPV 341
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 129/317 (40%), Positives = 181/317 (57%), Gaps = 33/317 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE WM ++ + YKD AEK+ RF+IFK N F L +N+FADL E+F
Sbjct: 37 RHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLSINQFADLHDEEFK 96
Query: 61 ASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYCC 115
A T K + FK +K+ ++DW +RGAVTP+KDQ GS C
Sbjct: 97 ALLTNGNKKVRSVVGTATETETSFKYNRVTKL--LATMDWRKRGAVTPIKDQRRCGS--C 152
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASE 173
WAF+AVA +EG+++I T +LV+ S+ +LVDC GC ++E+AFE++ + +ASE
Sbjct: 153 WAFSAVAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFEFVAKKGGIASE 212
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY+G+ D C + + I+GY+ V +E+ LQ V+ QPVSV ++A
Sbjct: 213 SYYPYKGK-DKSCKVKKETHG--VSQIKGYEKVPSNSEKALQKAVAHQPVSVYVEAGGNA 269
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY G+FTG CG +H +T+VGYG ++ G YWLVKN WG W E G +R+ R
Sbjct: 270 FQFYSSGIFTGKCGTNTDHAITVVGYG---KSRGGTKYWLVKNSWGAGWGEKGYIRMKRD 326
Query: 292 V-GGSGLCNIAANAAYP 307
+ GLC IA NA YP
Sbjct: 327 IRAKEGLCGIAMNAFYP 343
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 221 bits (562), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 128/327 (39%), Positives = 184/327 (56%), Gaps = 34/327 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-----------HEF-LRLN 49
+R + A+HE+WM ++ R YKD EK RF+IFK N H+F L +N
Sbjct: 24 AREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAFIESFNAGNHKFWLGVN 83
Query: 50 KFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+FADLT +F A+ T G+ P P + F+ N S + ++DW +GAVTP+
Sbjct: 84 QFADLTNYEFRATKTNKGFIPSTVRVPTT-----FRYENVSIDTLPATVDWRTKGAVTPI 138
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CCWAF+AVA +EG+ K+ TG+L++ S+ +LVDC GC +++AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + L +E YPY D C+ +SA+ I+GY+ V E L V+ QPV
Sbjct: 199 IIKNGGLTTESKYPYTA-ADGKCNGGSNSAA----TIKGYEDVPANNEAALMKAVANQPV 253
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVA+D F FY GGV TG CG +HG+ +GYG + +G Q YWL+KN WGT W
Sbjct: 254 SVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQ-YWLLKNSWGTTWG 310
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ + + G+C +A +YP
Sbjct: 311 ENGFLRMEKDISDKRGMCGLAMEPSYP 337
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 221 bits (562), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 135/313 (43%), Positives = 178/313 (56%), Gaps = 32/313 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
HEQWMV+ + YK EK+ RF IFK+N + L LN FADLT +F+A
Sbjct: 39 HEQWMVQHGKVYKAAHEKQKRFGIFKENVNYIEAFNNVGNKSYKLGLNHFADLTNHEFIA 98
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
+ + + H + FK N S + ++DW + GAVTPVK+QG CCWAF+A
Sbjct: 99 ARNKFN----GYLHGSIITTFKYKNVSDVP--SAVDWRQEGAVTPVKNQGQCGCCWAFSA 152
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECVYP 177
VA+ EG++K+ TG LV+ S+ +LVDC T GC +++AFE+I Q L++E YP
Sbjct: 153 VASTEGIHKLTTGNLVSLSEQELVDCDTNGEDQGCEGGLMDDAFEFIIQNNGLSTEAEYP 212
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFY 235
YQG D C+ ++ I GY+ V E+ LQ V+ QPVSVAIDA+ F FY
Sbjct: 213 YQGV-DGTCN--KTEVGSSAATISGYENVPVNDEQALQKAVANQPVSVAIDASGSDFQFY 269
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
GVFTG CG +HGV +V E + YWLVKN WGT W E G +R+ RGV S
Sbjct: 270 KSGVFTGSCGTELDHGVAVV---GYGVGEDETEYWLVKNSWGTQWGEEGYIRMQRGVDAS 326
Query: 296 -GLCNIAANAAYP 307
GLC IA +YP
Sbjct: 327 EGLCGIAMQPSYP 339
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 180/314 (57%), Gaps = 26/314 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKF 59
+H +WM + R Y D E+ R+ +FK N E L +N+FADLT ++F
Sbjct: 37 RHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEF 96
Query: 60 LASYTGYKPPPTDHPHS-NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ YTG+K S + + F+ N S + S+DW ++GAVTP+K+QGS CCWA
Sbjct: 97 CSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWA 156
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVY 176
F+AVA +EG +I+ G+L++ S+ QLVDC T + GC ++ AFE+I+ L +E Y
Sbjct: 157 FSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCEGGLMDTAFEHIKATGGLTTESDY 216
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN--F 234
PY+G +D C+ +++ K +I GY+ V E+ L V+ QPVSV I+ F+ F
Sbjct: 217 PYKG-EDATCNSKKTNP--KATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQF 273
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG C +H VT +GYG +T YW++KN WGT W E G MRI + V
Sbjct: 274 YSSGVFTGECTTYLDHAVTAIGYGESTNG---SKYWIIKNSWGTKWGESGYMRIQKDVKD 330
Query: 295 S-GLCNIAANAAYP 307
GLC +A A+YP
Sbjct: 331 KQGLCGLAMKASYP 344
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 220 bits (561), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 130/315 (41%), Positives = 180/315 (57%), Gaps = 28/315 (8%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREK 58
+ KHE+WM ++ + YKD AEKE RF+IFK N F L +N+FADL + K
Sbjct: 35 SVKHEKWMAQYGKVYKDAAEKEKRFQIFKNNVHFIESFHAAGDKPFNLSINQFADLHKFK 94
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWA 117
L G K + FK + +++ S+DW +RGAVTP+KDQG+ CWA
Sbjct: 95 ALL-INGQKKEHNVRTATATEASFKYDSVTRIP--SSLDWRKRGAVTPIKDQGTCRSCWA 151
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECV 175
F+ VAT+EGL++I G+LV+ S+ +LVDC GC ++E+AFE+I + +ASE
Sbjct: 152 FSTVATIEGLHQITKGELVSLSEQELVDCVKGDSEGCYGGYVEDAFEFIAKKGGVASETH 211
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFN 233
YPY+G + C + + I+GY+ V +E+ L V+ QPVS ++A F
Sbjct: 212 YPYKG-VNKTCKVKKETHG--VVQIKGYEQVPSNSEKALLKAVAHQPVSAYVEAGGYAFQ 268
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
FY G+FTG CG +H VT+VGYG +A G YWLVKN WGT W E G +R+ R +
Sbjct: 269 FYSSGIFTGKCGTDIDHSVTVVGYG---KARGGNKYWLVKNSWGTEWGEKGYIRMKRDIR 325
Query: 293 GGSGLCNIAANAAYP 307
GLC IA A YP
Sbjct: 326 AKEGLCGIATGALYP 340
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 127/326 (38%), Positives = 186/326 (57%), Gaps = 32/326 (9%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
+R +A +HE+WM + R YKD AEK RF++FK N F L +
Sbjct: 28 ARELSDDAAMAERHERWMAVYGRVYKDAAEKARRFEVFKDNLAFVESFNADKKNKFWLGV 87
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N+FADLT E+F A+ G+KP + + FK N S + ++DW +GAVTP+K
Sbjct: 88 NQFADLTTEEFKAN-KGFKPISAEEVPTTG---FKYENLSVSALPTAVDWRTKGAVTPIK 143
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYI 164
+QG CCWAF+AVA +EG+ K+ T LV+ S+ +LVDC T + GC ++++AFE++
Sbjct: 144 NQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQELVDCDTHSMDEGCEGGWMDSAFEFV 203
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ LA+E YPY+ D C SA+ I+G++ V P E L V+ QPVS
Sbjct: 204 IKNGGLATESSYPYKA-VDGKCKGGSKSAA----TIKGHEDVPPNNEAALMKAVASQPVS 258
Query: 225 VAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VA+DA+ F Y GGV TG CG +HG+ +GYG E++G + YW++KN WGT W E
Sbjct: 259 VAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGYG--VESDGTK-YWILKNSWGTTWGE 315
Query: 283 GGSMRIFRGVGGS-GLCNIAANAAYP 307
+R+ + + G+C +A +YP
Sbjct: 316 KRFLRMEKDISDKQGMCGLAMKPSYP 341
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 220 bits (560), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 178/316 (56%), Gaps = 26/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+++ +HE WM R YKD+ EK RF IFK+N +F+ +N+FAD+T
Sbjct: 34 SVSERHELWMSRHGRVYKDEVEKGERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITS 93
Query: 57 EKFLASYTGYKPP-PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++FLA +TG P P S FK + S ++DW E GAVT VK QG C
Sbjct: 94 QEFLAKFTGLNIPNSYLSPSPLSSTEFKINDLSDDDMPSNLDWRESGAVTQVKHQGRCGC 153
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+AV ++EG KI TG L+ S+ +L+DC+T N GC F+ NAF++I + ++ E
Sbjct: 154 CWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTNNYGCNGGFMTNAFDFIIENGGISRE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-F 232
Y Y G+Q Y C RS I Y+ V P E L V++QPVS+ I A+
Sbjct: 214 SDYEYLGQQ-YTC---RSQEKTAAVQISSYKVV-PEGETSLLQAVTKQPVSIGIAASQDL 268
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GG + G C + NH VT +GYGT E Q YWL+KN WGT+W E G M+I R
Sbjct: 269 QFYAGGTYDGSCADRINHAVTAIGYGTD---EKGQKYWLLKNSWGTSWGENGFMKIIRDS 325
Query: 293 GG-SGLCNIAANAAYP 307
G SGLC+IA ++YP
Sbjct: 326 GNPSGLCDIAKMSSYP 341
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 220 bits (560), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 120/324 (37%), Positives = 179/324 (55%), Gaps = 31/324 (9%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-----------HEF-LRLN 49
+R +AA+HE+WM ++ R YKD AEK RF++FK N H+F L +N
Sbjct: 24 ARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVN 83
Query: 50 KFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+FADLT ++F ++ T G+ P T P F+N N + + ++DW +G VTP+
Sbjct: 84 QFADLTNDEFRSTKTNKGFIPSTTRVPTG-----FRNENVNIDALPATMDWRTKGVVTPI 138
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQ 166
KDQG CCWAF+AVA +EG+ K+ TG+L++ S ++ + GC +++AF++I +
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSLLTVMSMGCEGGLMDDAFKFIIK 198
Query: 167 YQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVA 226
L +E YPY D + S S +I+GY+ V E L V+ QPVSVA
Sbjct: 199 NGGLTTESNYPYAAVDDKF-----KSVSNSVASIKGYEDVPANNEAALMKAVANQPVSVA 253
Query: 227 IDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+D F FY GGV TG CG +HG+ +GYG ++ YWL+KN WG W E G
Sbjct: 254 VDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDG---TKYWLLKNSWGMTWGENG 310
Query: 285 SMRIFRGVGGS-GLCNIAANAAYP 307
+R+ + + G+C +A +YP
Sbjct: 311 FLRMEKDISDKRGMCGLAMEPSYP 334
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 180/314 (57%), Gaps = 26/314 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKF 59
+H +WM + R Y D E+ R+ +FK N E L +N+FADLT ++F
Sbjct: 37 RHIEWMTKHGRVYADVKEENNRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEF 96
Query: 60 LASYTGYKPPPTDHPHS-NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ YTG+K S + + F+ N S + S+DW ++GAVTP+K+QGS CCWA
Sbjct: 97 RSMYTGFKGVSALSSQSQTKMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWA 156
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVY 176
F+AVA +EG +I+ G+L++ S+ QLVDC T + GC ++ AFE+I+ L +E Y
Sbjct: 157 FSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCEGGLMDTAFEHIKATGGLTTESNY 216
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN--F 234
PY+G +D C+ +++ K +I GY+ V E+ L V+ QPVSV I+ F+ F
Sbjct: 217 PYKG-EDATCNSKKTNP--KATSITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQF 273
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG C +H VT +GYG +T YW++KN WGT W E G MRI + V
Sbjct: 274 YSSGVFTGECTTYLDHAVTAIGYGESTNGS---KYWIIKNSWGTKWGESGYMRIQKDVKD 330
Query: 295 S-GLCNIAANAAYP 307
GLC +A A+YP
Sbjct: 331 KQGLCGLAMKASYP 344
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 219 bits (559), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 175/329 (53%), Gaps = 38/329 (11%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------------LRLNK 50
+A++HE WM E RTY D EK R +IF+ N E L N+
Sbjct: 39 MASRHESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATNR 98
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F A+ TG + P ++N S + S+DW GAVT VKDQ
Sbjct: 99 FADLTDEEFRAARTGLRRPAAVAGAVGGGFRYENF-SLQADAAGSMDWRAMGAVTGVKDQ 157
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQ 166
GS CCWAF+AVA +EGL KIRTG+LV+ S+ QLVDC GC ++NAF+YI +
Sbjct: 158 GSCGCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQYISR 217
Query: 167 YQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVA 226
LASE YPY G C RS + +IRG++ V E L V+ QPVSVA
Sbjct: 218 QGGLASESAYPYSGEDGGSC---RSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVA 274
Query: 227 IDAT--WFNFYH----GGVFTGPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
I+ F FY G G C +T +H +T VGYG + G YWL+KN WG+
Sbjct: 275 INGGDYVFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMAGDGTG---YWLMKNSWGSG 331
Query: 280 WDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
W E G +RI RG G G+C +A A+YP+
Sbjct: 332 WGESGYVRIRRGSRGEGVCGLAKLASYPV 360
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 219 bits (557), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 127/318 (39%), Positives = 181/318 (56%), Gaps = 36/318 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN-------------HEFLRLNKFADLTREKFL 60
+HEQWM + + YKD EKE+R+KIF++N L +N+FADLT E+F
Sbjct: 38 RHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIEGFNNAGNKSHKLGVNQFADLTEEEFK 97
Query: 61 A--SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CW 116
A GY +R++ FK + +K+ ++DW ++GAVTP+K QG C CW
Sbjct: 98 AINKLKGYMWSKI-----SRTSTFKYEHVTKVP--ATLDWRQKGAVTPIKSQGLKCGSCW 150
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASE 173
AF AVA EG+ K+ TG+L++ S+ +L+DC T GC ++ AF++I Q + LA+E
Sbjct: 151 AFAAVAATEGITKLTTGELISLSEQELIDCDTNGDNGGCKWGIIQEAFKFIVQNKGLATE 210
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--W 231
YPYQ D C+ S +I+GY+ V E L + V+ QPVSV +D++
Sbjct: 211 ASYPYQAV-DGTCN--AKVESKHVASIKGYEDVPANNETALLNAVANQPVSVLVDSSDYD 267
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GV +G CG T +H VT+VGYG + + YWL+KN WG W E G +RI R
Sbjct: 268 FRFYSSGVLSGSCGTTFDHAVTVVGYGVSDDG---TKYWLIKNSWGVYWGEQGYIRIKRD 324
Query: 292 VGG-SGLCNIAANAAYPL 308
V G+C IA A+YP+
Sbjct: 325 VAAKEGMCGIAMQASYPI 342
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 218 bits (556), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 134/319 (42%), Positives = 177/319 (55%), Gaps = 30/319 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREK 58
+ A +E W+V+ + Y EKE RF IFK N F L LN+FADLT E+
Sbjct: 45 VMAMYEAWLVKHGKAYNALGEKEKRFGIFKDNLRFIDEHNSQNLTYRLGLNRFADLTNEE 104
Query: 59 FLASYTGYKPPPT--DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+ + Y G KP T S +S+ F + D IDW + GAV VKDQGS C
Sbjct: 105 YRSMYLGVKPGATRVTRKVSRKSDRFAARVGDALP--DFIDWRKEGAVVGVKDQGSCGSC 162
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+ +A VEG+N+I TG L++ S+ +LVDC T GC ++ AFE+I + SE
Sbjct: 163 WAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSE 222
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY+ D CD +R +A+ +I GY+ V E L+ V++QPVSVAI+A
Sbjct: 223 EDYPYRA-ADQKCDQYRKNAN--VVSIDGYEDVPENDEAALKKAVAKQPVSVAIEAGGRA 279
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y GVFTG CG + +HGV VGYGT E Q YW+V N WG NW E G +R+ R
Sbjct: 280 FQLYQSGVFTGKCGTSLDHGVAAVGYGT----ENGQDYWIVGNSWGKNWGEDGYIRMERN 335
Query: 292 VGG--SGLCNIAANAAYPL 308
+ G SG C IA +YP+
Sbjct: 336 LAGSSSGKCGIAIGPSYPI 354
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 218 bits (555), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 125/318 (39%), Positives = 172/318 (54%), Gaps = 32/318 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREK 58
+ +HEQWM +F R YKD EK RF++FK N F L +N+F DLT ++
Sbjct: 33 MVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAFIESFNAENRKFWLGVNQFTDLTNDE 92
Query: 59 FLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
F A+ T G K P FK N S + ++DW +G VTP+KDQG CC
Sbjct: 93 FRATKTNKGLKMSGGRAPTG-----FKYSNVSIDALPTAVDWRTKGVVTPIKDQGQCGCC 147
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
WAF+AV EG+ K+ TG+L++ S+ +LVDC GC +++AF++I + L +
Sbjct: 148 WAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHGVDQGCEGGEMDDAFKFIIKNGGLTT 207
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPY QD C S AS I+GY+ V E L V+ QPVSVA+D
Sbjct: 208 EANYPYTA-QDGQCK--TSIASNSVATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDV 264
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F Y GGV TG CG +HG+ +GYG T++ YWL+KN WGT W E G +R+ +
Sbjct: 265 IFQHYSGGVMTGSCGTDLDHGIAAIGYGMTSDG---TKYWLLKNSWGTTWGESGYLRMEK 321
Query: 291 GVGG-SGLCNIAANAAYP 307
+ SG+C +A +YP
Sbjct: 322 DISDKSGMCGLAMQPSYP 339
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 218 bits (554), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 173/316 (54%), Gaps = 28/316 (8%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+E+W R ++ EK RF FK+N F LRLN+F D+ RE+F +
Sbjct: 44 YERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRS 102
Query: 62 SYTGYKPPPT---DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CW 116
++ + D P + + S S+DW + GAVT VK QG +C CW
Sbjct: 103 TFADSRINDLRRQDSPAARAGAVPGFMYDSAADPPRSVDWRQEGAVTGVKVQG-HCGSCW 161
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECV 175
AF+ V VEG+N IRTG L + S+ +L+DC T NGC +ENAFE+I+ + + +E
Sbjct: 162 AFSTVVAVEGINAIRTGSLASLSEQELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAA 221
Query: 176 YPYQGRQDYYCDWWRSS-ASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY+ + CD R+ G I G+Q V +E+ L V+ QPVSVA+DA F
Sbjct: 222 YPYRA-SNGTCDGDRARRGGGVVVVIDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAF 280
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTG CG +HGV VGYG + PYW+VKN WGT+W EGG +R+ RG
Sbjct: 281 QFYSEGVFTGDCGTDLDHGVAAVGYGVGDDG---TPYWIVKNSWGTSWGEGGYIRMQRGA 337
Query: 293 GGSGLCNIAANAAYPL 308
G GLC IA A++P+
Sbjct: 338 GNGGLCGIAMEASFPI 353
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 133/318 (41%), Positives = 178/318 (55%), Gaps = 29/318 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ + +EQW+V+ + Y EKE RF+IFK N F L LN+FADLT E
Sbjct: 55 LMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNE 114
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ A Y G K P SN + K+ DS+DW + GAV PVKDQG CW
Sbjct: 115 EYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLP--DSVDWRKEGAVPPVKDQGGCGSCW 172
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQRLASEC 174
AF+A+ VEG+NKI TG+L++ S+ +LVDC T GC ++ AFE+I + S+
Sbjct: 173 AFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDE 232
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY+G D CD +R +A K +I Y+ V E L+ V+ QPVSVAI+ F
Sbjct: 233 DYPYRG-VDGRCDTYRKNA--KVVSIDDYEDVPAYDELALKKAVANQPVSVAIEGGGREF 289
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GVFTG CG +HGV VGYGT A+G YW+V+N WG++W E G +R+ R +
Sbjct: 290 QLYVSGVFTGRCGTALDHGVVAVGYGT---AKGHD-YWIVRNSWGSSWGEDGYIRLERNL 345
Query: 293 GG--SGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 346 ANSRSGKCGIAIEPSYPL 363
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 218 bits (554), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 182/317 (57%), Gaps = 37/317 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HEQWM + Y EKE +++ FK+N + L +N FADLT E+F
Sbjct: 39 RHEQWMAIHGKVYTHSYEKEQKYQTFKENVQRIEAFNHAGNKPYKLGINHFADLTNEEFK 98
Query: 61 A--SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
A + G+ + R ++N+ + + +DW + GAVTP+KDQG CCWA
Sbjct: 99 AINRFKGHVCSKITRTPTFR---YENMTAVPAT----LDWRQEGAVTPIKDQGQCGCCWA 151
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA EG+ K+ TG+L++ S+ +LVDC T GC +++AF++I Q + LA+E
Sbjct: 152 FSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFILQNKGLAAEA 211
Query: 175 VYPYQGRQDYYCDWWRSSASGKYG-AIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
+YPY+G D C+ + A G + +I+GY+ V +E L V+ QPVSVAI+A+
Sbjct: 212 IYPYEGV-DGTCN---AKAEGNHATSIKGYEDVPANSESALLKAVANQPVSVAIEASGFE 267
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GGVFTG CG +HGVT VGYG + + YWLVKN WG W + G +R+ R
Sbjct: 268 FQFYSGGVFTGSCGTNLDHGVTAVGYGVSDDG---TKYWLVKNSWGVKWGDKGYIRMQRD 324
Query: 292 VGG-SGLCNIAANAAYP 307
V GLC IA A+YP
Sbjct: 325 VAAKEGLCGIAMLASYP 341
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 180/316 (56%), Gaps = 34/316 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HEQWM + YKD E+E RF+IF +N + L +N+F DLT ++F+
Sbjct: 134 RHEQWMTRHGKVYKDPREREKRFRIFNENVNYVEAFNNAANKPYKLGINQFXDLTNQEFI 193
Query: 61 ASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
A +K H S+ R+ FK N + + ++DW + GAVTPVKDQG CCWA
Sbjct: 194 APRNRFK----GHMCSSIIRTTTFKYENVTTVP--STVDWRQNGAVTPVKDQGQCGCCWA 247
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA EG++ + G+L++ S+ +LVDC T GC +++A+++I Q L +E
Sbjct: 248 FSAVAATEGIHALSGGKLISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNHGLNTEA 307
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY+G D C+ + A+ I GY+ V E+ LQ V+ QPVSVAIDA+ F
Sbjct: 308 NYPYKGV-DGKCN--ANEAANHAATITGYEDVPANNEKALQKAVANQPVSVAIDASSSDF 364
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY G FTG CG +HGVT VGYG + YWLVKN WGT W E G +R+ RGV
Sbjct: 365 QFYKSGAFTGSCGTELDHGVTAVGYGVSDHG---TKYWLVKNSWGTEWGEEGYIRMQRGV 421
Query: 293 GG-SGLCNIAANAAYP 307
G+C IA A+YP
Sbjct: 422 DSEEGVCGIAMQASYP 437
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 128/316 (40%), Positives = 179/316 (56%), Gaps = 25/316 (7%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREK 58
IA+ +E W+V+ + Y EK++RF IFK N F L LN+FADLT E+
Sbjct: 39 IASLYETWLVKHGKNYNGLGEKQLRFNIFKDNLRFVDERNSENLSFKLGLNRFADLTNEE 98
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ + Y G +P S RS + + + +S+DW ++GAV +KDQGS CWA
Sbjct: 99 YRSVYLGTRPRSVAVARSGRSKSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWA 158
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+A VEG+N+I TG L++ S+ +LV+C T +GC ++ AFE+I + + + S+
Sbjct: 159 FSAIAAVEGVNQIVTGDLISLSEQELVECDTSYNDGCDGGLMDYAFEFIIKNEGIDSDED 218
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FN 233
YPY GR D CD R +A K I Y+ E+ LQ V+ QPVSVAI+ F
Sbjct: 219 YPYTGR-DGRCDTNRKNA--KVVTIDDYEDSPVYDEKSLQKAVANQPVSVAIEGGGRDFQ 275
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
Y GVFTG CG +HGV +VGYGT E YW+V+N WG W EGG +R+ R
Sbjct: 276 LYDSGVFTGKCGTALDHGVAVVGYGT----EDGLDYWIVRNSWGDTWGEGGYIRMQRNTK 331
Query: 294 -GSGLCNIAANAAYPL 308
SG+C IA +YP+
Sbjct: 332 LPSGICGIAIEPSYPI 347
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 217 bits (553), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 124/297 (41%), Positives = 168/297 (56%), Gaps = 25/297 (8%)
Query: 31 EKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTGYKPPPTDHPHSN 77
EK RF FK+N F L LN+F D+ RE+F +++ + S
Sbjct: 57 EKGRRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDMGREEFRSTFADSRINDLRRAESP 116
Query: 78 RSNWFKNLNSSKMS-FYDSIDWNERGAVTPVKDQGSYC--CWAFTAVATVEGLNKIRTGQ 134
+ ++ S+DW + GAVT VKDQG +C CWAF+ V +VEG+N IRTG
Sbjct: 117 AAPAVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQG-HCGSCWAFSTVVSVEGINAIRTGS 175
Query: 135 LVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSA 193
LV+ S+ +L+DC T NGC +ENAFE+I+ Y + +E YPY+ + CD RS
Sbjct: 176 LVSLSEQELIDCDTDENGCQGGLMENAFEFIKSYGGVTTESAYPYRA-SNGTCDSVRSR- 233
Query: 194 SGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHG 251
G+ +I G+Q V +E+ L V+ QPVSVAIDA F FY GVFTG CG +HG
Sbjct: 234 RGQIVSIDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHG 293
Query: 252 VTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
V VGYG + + YW+VKN WG +W EGG +R+ RG G GLC IA A++P+
Sbjct: 294 VAAVGYGVSDDGTA---YWIVKNSWGPSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 347
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 217 bits (552), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 137/327 (41%), Positives = 172/327 (52%), Gaps = 53/327 (16%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
M+R + KHEQWM RTY+D EKE RF+IFK N E+ L
Sbjct: 25 MARQLINEDALVEKHEQWMARHGRTYQDSEEKERRFQIFKSNLEYIDNFNKASNQTYQLG 84
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
LN FADL+ E+++A+YT K P + +SIDW + GAVTP+
Sbjct: 85 LNNFADLSHEEYVATYTARKMP--------------------VEVPESIDWRDHGAVTPI 124
Query: 108 KDQ-GSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIR 165
K+Q CCWAF+A A VEG+ + G V+ S QL+DC + N GC ++ NAF YI
Sbjct: 125 KNQYQCGCCWAFSAAAAVEGI--VANG--VSLSAQQLLDCVSDNQGCKGGWMNNAFNYII 180
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
Q Q +A E YPYQ Q S+ I G++ V P EE L V++QPVSV
Sbjct: 181 QNQGIALETDYPYQQMQQM------CSSRMAAAQISGFEDVTPKDEEALMRAVAKQPVSV 234
Query: 226 AIDATW---FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
IDAT F Y GVFT CGN +H VT+VGYGT+ E YWL KN WG W
Sbjct: 235 TIDATSNPNFKLYKEGVFTAAGCGNGHSHAVTLVGYGTS---EDGTKYWLAKNSWGETWG 291
Query: 282 EGGSMRIFRGVG-GSGLCNIAANAAYP 307
E G MR+ R +G G C IA A+YP
Sbjct: 292 ESGYMRLQRDIGLEGGPCGIALYASYP 318
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 177/317 (55%), Gaps = 32/317 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------NKFADLTREK 58
+ ++++W+ ++ R Y + E +RF I+ N +F+ NKFADLT ++
Sbjct: 42 MKVRYDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQNLSFKLTDNKFADLTNDE 101
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
F + Y GY+ S + +++ + D++DW E GAVTP+KDQG CWA
Sbjct: 102 FNSIYLGYQI------RSYKRRNLSHMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWA 155
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA VEG+NKI+TG LV+ S+ +LVDC GC F+E AF +I+ L +E
Sbjct: 156 FSAVAAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTEN 215
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF 234
YPY+G D C+ ++ I GY+ V E L+ VS+QPVSVAIDA+ + F
Sbjct: 216 DYPYKG-TDGSCE--KAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEF 272
Query: 235 --YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GVF+G CG NHGVTIVGYG Q YWLVKN WG W E G +R+ R
Sbjct: 273 QLYSEGVFSGYCGIQLNHGVTIVGYGDNN----GQKYWLVKNSWGKGWGESGYIRMKRDS 328
Query: 293 GGS-GLCNIAANAAYPL 308
+ G+C IA +YP+
Sbjct: 329 SDTKGMCGIAMEPSYPI 345
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 217 bits (552), Expect = 7e-54, Method: Compositional matrix adjust.
Identities = 129/328 (39%), Positives = 185/328 (56%), Gaps = 32/328 (9%)
Query: 2 SRTSHKTGN-IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
S++S +T + + A +E W+V+ ++Y EKE RF+IFK N F+
Sbjct: 36 SKSSWRTDDEVMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVG 95
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
LN+FADLT E++ ++Y G K P ++ + S +S+DW +GAV P+
Sbjct: 96 LNRFADLTNEEYRSTYLGAKS----KPKLSKVKSDRYAPRVGDSLPESVDWRAKGAVAPI 151
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYI 164
KDQGS CWAF+ V VEG+N+I TG+L+T S+ +LVDC S GC ++ FE+I
Sbjct: 152 KDQGSCGSCWAFSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMDYGFEFI 211
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ ++ YPY GR D CD +R +A K I Y+ V EE L+ V+ QPVS
Sbjct: 212 INNGGIDTDKDYPYLGR-DARCDQYRKNA--KVVTIDSYEDVPVNNEEALKKAVASQPVS 268
Query: 225 VAID--ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
V I+ F FY G+FTG CG +HGV +VGYGT E + YW+V+N WG++W E
Sbjct: 269 VGIEGGGRAFQFYDSGIFTGKCGTALDHGVNVVGYGT----EKGKDYWIVRNSWGSSWGE 324
Query: 283 GGSMRIFRGVGGS--GLCNIAANAAYPL 308
G +R+ R + G+ G C IA +YPL
Sbjct: 325 AGYIRMERNLAGTSVGKCGIAMEPSYPL 352
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 135/325 (41%), Positives = 178/325 (54%), Gaps = 29/325 (8%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNK 50
TS + + +EQW+V+ + Y EKE RF+IFK N F L LN+
Sbjct: 68 TSRSDEELMSMYEQWLVKHGKVYNALGEKEKRFQIFKDNLRFIDDHNSQEDRTYKLGLNR 127
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E++ A Y G K P SN + K+ +S+DW + GAV PVKDQ
Sbjct: 128 FADLTNEEYRAKYLGTKIDPNRRLGKTPSNRYAPRVGDKLP--ESVDWRKEGAVPPVKDQ 185
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQY 167
G CWAF+A+ VEG+NKI TG+L++ S+ +LVDC T GC ++ AFE+I
Sbjct: 186 GGCGSCWAFSAIGAVEGINKIVTGELISLSEQELVDCDTGYNEGCNGGLMDYAFEFIINN 245
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
+ SE YPY+G D CD +R +A K +I Y+ V E L+ V+ QPVSVAI
Sbjct: 246 GGIDSEEDYPYRG-VDGRCDTYRKNA--KVVSIDDYEDVPAYDELALKKAVANQPVSVAI 302
Query: 228 D--ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+ F Y GVFTG CG +HGV VGYGT A G YW+V+N WG +W E G
Sbjct: 303 EGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGT---ANGHD-YWIVRNSWGPSWGEDGY 358
Query: 286 MRIFRGVGG--SGLCNIAANAAYPL 308
+R+ R + SG C IA +YPL
Sbjct: 359 IRLERNLANSRSGKCGIAIEPSYPL 383
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 177/317 (55%), Gaps = 32/317 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ A +E W+VE ++Y EKEMRF+IFK+N L LN+FADLT E
Sbjct: 38 VMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRIIDDHNADANRSYSLGLNRFADLTDE 97
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG-SYCCW 116
++ ++Y G K P ++ SN + + + D +DW GAV VK+QG CW
Sbjct: 98 EYRSTYLGLKM----GPKTDVSNEY--MPKVGEALPDYVDWRTVGAVVGVKNQGLCSSCW 151
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASE 173
AF+AV VEG+NKI TG L++ S+ +LVDC GC + + +AF++I + +E
Sbjct: 152 AFSAVTAVEGINKIVTGNLISLSEQELVDCGRTQRTKGCNRGLMTDAFQFIINNGGINTE 211
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
YPY + D C+ S + KY I Y+ V E L+ V+ QPVSV +++
Sbjct: 212 DNYPYTAK-DGQCNL--SLKNQKYVTIDNYKNVPSNNEMALKKAVAYQPVSVGVESEGGK 268
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+FTG CG +HGVTIVGYGT E YW+VKN WGTNW E G +RI R
Sbjct: 269 FKLYTSGIFTGFCGTAVDHGVTIVGYGT----ERGMDYWIVKNSWGTNWGENGYIRIQRN 324
Query: 292 VGGSGLCNIAANAAYPL 308
+GG+G C IA +YP+
Sbjct: 325 IGGAGKCGIARMPSYPV 341
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 133/332 (40%), Positives = 189/332 (56%), Gaps = 45/332 (13%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFA 52
+K +I H+QWM++F+R Y D+ EK++R ++ +N +F+ +N+F
Sbjct: 30 YKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTENLKFIESFNNMGNQSYKLGVNEFT 89
Query: 53 DLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSI--------DWNERGAV 104
D T+E+FLA+YTG + P F+ +N +K ++ ++ DW GAV
Sbjct: 90 DWTKEEFLATYTGLRGVNVTSP-------FEVVNETKPAWNWTVSDVLGTNKDWRNEGAV 142
Query: 105 TPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENA 160
TPVK QG C CWAF+A+A VEGL KI G L++ S+ QL+DC+ NGC NA
Sbjct: 143 TPVKSQGE-CGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCTREQNNGCKGGTFVNA 201
Query: 161 FEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR 220
F YI +++ ++SE YPYQ ++ C RS+A IRG++ V E L + VSR
Sbjct: 202 FNYIIKHRGISSENEYPYQVKEG-PC---RSNARPAI-LIRGFENVPSNNERALLEAVSR 256
Query: 221 QPVSVAIDATWFNFYH--GGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
QPV+VAIDA+ F H GGV+ CG + NH VT+VGYGT+ E YWL KN WG
Sbjct: 257 QPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPEG---MKYWLAKNSWG 313
Query: 278 TNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W E G +RI R V G+C +A A+YP+
Sbjct: 314 KTWGENGYIRIRRDVEWPQGMCGVAQYASYPV 345
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 216 bits (551), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 120/321 (37%), Positives = 175/321 (54%), Gaps = 34/321 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLN 49
+R +AA+HE+WM ++ R YKD AEK RF++FK N F L +N
Sbjct: 24 ARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAFIESFNAGNHKFWLGVN 83
Query: 50 KFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+FADLT ++F + T G+ P T P R ++N+N + ++DW +G VTP+
Sbjct: 84 QFADLTNDEFRLTKTNKGFIPSTTRVPTGFR---YENVNIDALPA--TMDWRTKGVVTPI 138
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CCWAF+AVA +EG+ K+ TG+L++ S+ +LVDC GC +++AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 198
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + L +E YPY D S S +I+GY+ V E L V+ QPV
Sbjct: 199 IIKNGGLTTESNYPYAAADDKC-----KSVSNSVASIKGYEDVPANNEAALMKAVANQPV 253
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVA+D F FY GGV G CG +HG+ +GYG ++ YWL+KN WG W
Sbjct: 254 SVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKASDG---TKYWLLKNSWGMTWG 310
Query: 282 EGGSMRIFRGVGGS-GLCNIA 301
E G +R+ + + G+C +A
Sbjct: 311 ENGFLRMEKDISDKRGMCGLA 331
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 216 bits (551), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 134/327 (40%), Positives = 185/327 (56%), Gaps = 31/327 (9%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQA-EKEMRFKIFKKNHEF-------------L 46
+SR+ + + A +E W+VE ++Y EK+ RF+IFK N + L
Sbjct: 38 LSRSDEE---VMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKL 94
Query: 47 RLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
LN+FADLT E++ ++Y G K +S+ + + S DSIDW E+GAV
Sbjct: 95 GLNRFADLTNEEYRSTYLGAKTDARRRIAKTKSDR-RYAPKAGGSLPDSIDWREKGAVAE 153
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEY 163
VKDQGS CWAF+ +A VEG+N+I TG+L++ S+ +LVDC T GC ++ AFE+
Sbjct: 154 VKDQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNEGCNGGLMDYAFEF 213
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + + +E YPY GR CD R +A K +I GY+ V P E L++ V+ QPV
Sbjct: 214 IIKNGGIDTEADYPYTGRYG-RCDQTRKNA--KVVSIDGYEDVTPYDEAALKEAVAGQPV 270
Query: 224 SVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVAI+A F Y G+FTG CG +HGVT VGYGT E YW+VKN W +W
Sbjct: 271 SVAIEAGGRDFQLYSSGIFTGSCGTDLDHGVTAVGYGT----ENGVDYWIVKNSWAASWG 326
Query: 282 EGGSMRIFRGV-GGSGLCNIAANAAYP 307
E G +R+ R V +GLC IA +YP
Sbjct: 327 EKGYLRMQRNVKDKNGLCGIAIEPSYP 353
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 181/317 (57%), Gaps = 31/317 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTRE 57
+ A E W+VE+ ++Y EKE RF+IFK N F+ LN+F+DLT
Sbjct: 44 VIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTDA 103
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ + Y G K + +N S+ ++ ++ DS+DW ++GAV VK+QG+ CW
Sbjct: 104 EYSSIYLGTK---FNIRMTNVSDRYEPRVGDQLP--DSVDWRKKGAVLGVKNQGNCGSCW 158
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
F ++A VEG+NKI TG L++ S+ ++VDC NGC L A+++I + +E
Sbjct: 159 TFASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGGINTE 218
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI--DATW 231
YPY GR D CD ++ + KY I Y+ V E+ LQ V+ QPVSV I ++T
Sbjct: 219 ANYPYTGR-DGVCD--QNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNSTA 275
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+F GPCG +HGVTIVGYGT EG + YW+V+N WG NW E G +R+ R
Sbjct: 276 FKSYKSGIFNGPCGPRIDHGVTIVGYGT----EGGKDYWIVRNSWGPNWGESGYVRMQRN 331
Query: 292 VGGSGLCNIAANAAYPL 308
VGGSG C IA YP+
Sbjct: 332 VGGSGKCFIARAPVYPV 348
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 216 bits (549), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 128/324 (39%), Positives = 178/324 (54%), Gaps = 43/324 (13%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFK-----------KNHEF-LRLNKFADLTREK 58
+AA+HEQWM +F R YKD AEK R ++FK +NHEF L N+FADLT ++
Sbjct: 37 MAARHEQWMAQFGRVYKDPAEKAHRLEVFKANVAFIESFNAENHEFWLGANQFADLTNDE 96
Query: 59 FLASYT-------GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG 111
F AS T G + PT FK + S + S+DW +GAVTP+K+QG
Sbjct: 97 FRASKTNKGIKQGGVRDAPTG---------FKYSDVSIDALPASVDWRTKGAVTPIKNQG 147
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
CWAF+AVA EG+ K+ TG+LV+ S+ +LVDC GC ++++AF++I +
Sbjct: 148 QCGSCWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDVHGVDQGCMGGWMDDAFKFIIKN 207
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVA 226
L +E YPY G D +S+ + A I+GY+ V E L V+ QPVSV
Sbjct: 208 GGLTTEANYPYTGEDDKC----KSNETVNVAATIKGYEDVPANDESALMKAVAHQPVSVV 263
Query: 227 IDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+D F Y GGV TG CG +HG+ +GYG T+ YWL+KN WGT W E G
Sbjct: 264 VDGGDMTFQLYAGGVMTGSCGVEMDHGIAAIGYGATSNG---TKYWLMKNSWGTTWGEKG 320
Query: 285 SMRIFRGVGGS-GLCNIAANAAYP 307
+R+ + + G+C +A +YP
Sbjct: 321 FLRMAKDIPDKRGMCGLAMKPSYP 344
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 128/296 (43%), Positives = 170/296 (57%), Gaps = 26/296 (8%)
Query: 31 EKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDHPH-SN 77
EK RF +FK+N + LRLNKFAD+T +FL Y G K H S
Sbjct: 55 EKNQRFNVFKENLKHIHKVNQKDRPYKLRLNKFADMTNHEFLQHYGGSKVSHYRMFHGSR 114
Query: 78 RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLV 136
R F + N+S + SIDW ++GAVT VKDQG CWAF++VA VEG+NKI+TG+L+
Sbjct: 115 RQTGFAHENTSNLP--SSIDWRKQGAVTGVKDQGKCGSCWAFSSVAAVEGINKIKTGELI 172
Query: 137 TRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASG 195
+ S+ +LVDC+++N GC +E AF +I + L +E YPY+ + D YCD + +
Sbjct: 173 SLSEQELVDCNSVNHGCDGGLMEQAFSFIEKTGGLTTENNYPYRAK-DGYCD--SAKMNT 229
Query: 196 KYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVT 253
I GY+ V E L V+ QPVS+AIDA F FY GV+TG CG NHGV
Sbjct: 230 PMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQDFQFYSEGVYTGDCGTELNHGVA 289
Query: 254 IVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
+VGYG T + YW+VKN WG+ W E G +R+ R GLC I A+YP+
Sbjct: 290 LVGYGATQDG---TKYWIVKNSWGSEWGENGFIRMQRENDVEEGLCGITLEASYPI 342
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 130/313 (41%), Positives = 177/313 (56%), Gaps = 32/313 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+E W+VE ++Y EKEMRF+IFK N L LN+FADLT E++ +
Sbjct: 42 YESWLVEQGKSYNSLDEKEMRFEIFKDNLRIIDDHNADANRSFSLGLNRFADLTDEEYRS 101
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG-SYCCWAFTA 120
+Y G+K P + SN + + Y +DW GAV VK+QG CWAF+A
Sbjct: 102 TYLGFKS----GPKAKVSNRYVPKVGDVLPNY--VDWRTVGAVVGVKNQGLCSSCWAFSA 155
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDC---STLNGCAKNFLENAFEYIRQYQRLASECVYP 177
VA VEG+NKI TG L++ S+ +LVDC + GC + ++ +AF++I + +E YP
Sbjct: 156 VAAVEGINKIMTGNLLSLSEQELVDCGRTQSTRGCNRGYMTDAFQFIINNGGINTEDNYP 215
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFY 235
Y QD C+ R + KY I Y+ V E LQ+ V+ QPVSV +++ F Y
Sbjct: 216 YTA-QDGQCN--RYLQNQKYVTIDDYENVPSNNEWALQNAVAHQPVSVGLESEGGKFKLY 272
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
G+FT CG +HGVTIVGYGT E YW+VKN WGTNW E G +RI R +GG+
Sbjct: 273 TSGIFTQYCGTAIDHGVTIVGYGT----ERGLDYWIVKNSWGTNWGENGYIRIQRNIGGA 328
Query: 296 GLCNIAANAAYPL 308
G C IA A+YP+
Sbjct: 329 GKCGIARMASYPV 341
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 134/326 (41%), Positives = 185/326 (56%), Gaps = 30/326 (9%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
SR H ++ +HEQWM ++ + YKD AE + RF IF+ N EF L +
Sbjct: 26 SRKLHD-ASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNKPYKLSI 84
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N AD T E+F+AS+ GYK FK N + + + ++DW ++G VT +K
Sbjct: 85 NHLADQTNEEFMASHKGYKGSHWQGLRITTQTPFKYENVTDIPW--AVDWRQKGDVTSIK 142
Query: 109 DQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIR 165
DQ + C CWAF+AVA EG+ +I TG LV+ S+ +LVDC +++ GC +E+ FE+I
Sbjct: 143 DQ-AQCGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSVDHGCDGGLMEHGFEFII 201
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVS 224
+ ++SE YPY CD + + I GY+ V EE LQ V+ Q +S
Sbjct: 202 KNGGISSEANYPYTAVNGT-CD--TNKEASPVAQITGYETVPVNCEEELQKAVANQLTMS 258
Query: 225 VAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
V+IDA + F FY GVFTG CG +HGVT VGYG+T G Q YW+VKN WGT W E
Sbjct: 259 VSIDAGGSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTD--YGTQ-YWIVKNSWGTQWGE 315
Query: 283 GGSMRIFRGVGG-SGLCNIAANAAYP 307
G +R+ RG+ GLC IA +A+YP
Sbjct: 316 EGYIRMLRGIDAQEGLCGIAMDASYP 341
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/314 (39%), Positives = 172/314 (54%), Gaps = 25/314 (7%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFL-----------RL--NKFADLTREKFL 60
+ W R+Y E RF ++++N EF+ RL N+FADLT E+FL
Sbjct: 50 RFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADLTEEEFL 109
Query: 61 ASYTGY---KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--C 115
A+YTGY P D + + S ++ S+DW +GAV P K Q S C C
Sbjct: 110 ATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTCSSC 169
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNG-CAKNFLENAFEYIRQYQRLASEC 174
WAF AT+E LN I+TG+LV+ S+ QLVDC + +G C A++++ + L +E
Sbjct: 170 WAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGGLTTEA 229
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-TWFN 233
YPY R+ C+ R+ ++ I G+ V P E LQ V+RQPV+VAI+ +
Sbjct: 230 DYPYTARRGP-CN--RAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEVGSGMQ 286
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GGV+TGPCG H VT+VGYGT +A YW +KN WG +W E G +RI R VG
Sbjct: 287 FYKGGVYTGPCGTRLAHAVTVVGYGT--DASSGAKYWTIKNSWGQSWGERGYIRILRDVG 344
Query: 294 GSGLCNIAANAAYP 307
G GLC + + AYP
Sbjct: 345 GPGLCGVTLDIAYP 358
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 215 bits (547), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 125/317 (39%), Positives = 172/317 (54%), Gaps = 31/317 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTRE 57
+ A +E W+++ ++Y E+E RF+IFK+ F+ LN+FADLT E
Sbjct: 34 VKAMYESWLIKHGKSYNSLGERERRFEIFKETLRFIDEHNADTSRSYKVGLNQFADLTNE 93
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F ++Y G+ SNR D +DW GAV +K+QG CW
Sbjct: 94 EFRSTYLGFTRGSNKTKVSNRYE-----PRVGQVLPDYVDWRSEGAVVDIKNQGQCGSCW 148
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASE 173
AF+A+A VEG+NKI TG L++ S+ +LVDC + GC ++ + FE+I + +E
Sbjct: 149 AFSAIAAVEGINKIVTGNLISLSEQELVDCGRTQSTKGCDGGYMTDGFEFIINNGGINTE 208
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
YPY Q+ CD + + KY I Y+ V E LQ V+ QPVSVA+++
Sbjct: 209 ENYPYTA-QEGQCDL--NLQNEKYVTIDNYENVPYYNEWALQTAVAYQPVSVALESAGDA 265
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+FTGPCG +H VTIVGYGT EG YW+VKN W T W E G MRI R
Sbjct: 266 FQHYSSGIFTGPCGTATDHAVTIVGYGT----EGGIDYWIVKNSWDTTWGEEGYMRILRN 321
Query: 292 VGGSGLCNIAANAAYPL 308
VGG+G C IA +YP+
Sbjct: 322 VGGAGTCGIATMPSYPV 338
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 215 bits (547), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 125/318 (39%), Positives = 179/318 (56%), Gaps = 33/318 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ A++++WM ++ R YKD AEK RF++FK N EF L N+FADLT +
Sbjct: 55 MMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSK 114
Query: 58 KFLASYTGY-KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+F A YTG KP FK N +++ +DW ++GAVTPVK+QG CC
Sbjct: 115 EFAAMYTGLRKPAAVPSGAKQIPAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCC 174
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLAS 172
WAF+AV +EGL I TG LV+ S+ Q++DC + GC +++NAF+Y+ + +
Sbjct: 175 WAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTT 234
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID--AT 230
E YPY Q C + +A+ I G+Q + E L + V+ QPVSV +D ++
Sbjct: 235 EDAYPYSAVQG-TCQNVQPAAT-----ISGFQDLPSGDENALANAVANQPVSVGVDGGSS 288
Query: 231 WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GG++ G CG NH VT +GYG + +G Q YW++KN WGT W E G M++
Sbjct: 289 PFQFYQGGIYDGDGCGTDMNHAVTAIGYG--ADDQGTQ-YWILKNSWGTGWGENGFMQLQ 345
Query: 290 RGVGGSGLCNIAANAAYP 307
GVG C I+ A+YP
Sbjct: 346 MGVGA---CGISTMASYP 360
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 124/314 (39%), Positives = 170/314 (54%), Gaps = 25/314 (7%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+ W R+Y E RF ++++N EF L N+FADLT E+FL
Sbjct: 46 RFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFL 105
Query: 61 ASYTGY---KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--C 115
A+YTGY P D + + S ++ S+DW +GAV P K Q S C C
Sbjct: 106 ATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTCSSC 165
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNG-CAKNFLENAFEYIRQYQRLASEC 174
WAF AT+E LN I+TG+LV+ S+ QLVDC + +G C A++++ + L +E
Sbjct: 166 WAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGGLTTEA 225
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-TWFN 233
YPY R+ C+ R+ ++ I G+ V P E LQ V+RQPV+VAI+ +
Sbjct: 226 DYPYTARRGP-CN--RAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEVGSGMQ 282
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GGV+TGPCG H VT+VGYGT +A YW +KN WG +W E G +RI R VG
Sbjct: 283 FYKGGVYTGPCGTRLAHAVTVVGYGT--DASSGAKYWTIKNSWGQSWGERGYIRILRDVG 340
Query: 294 GSGLCNIAANAAYP 307
G GLC + + AYP
Sbjct: 341 GPGLCGVTLDIAYP 354
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 214 bits (546), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 127/300 (42%), Positives = 177/300 (59%), Gaps = 35/300 (11%)
Query: 31 EKEMRFKIFKKNHEF--------------LRLNKFADLTREKFLASYTGYKPPPTDHPHS 76
E+E R +IF KN + L +NKFADLT E+F+AS +K H S
Sbjct: 3 EREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIASRNKFK----GHMCS 58
Query: 77 N--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
+ R+ FK N+S + ++DW ++GAVTPVK+QG CWAF+AVA EG++++ TG
Sbjct: 59 SIIRTTTFKYENASAIP--STVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTG 116
Query: 134 QLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWR 190
+LV+ S+ +L+DC T GC +++AF++I Q L++E YPY+G D C+ +
Sbjct: 117 KLVSLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGV-DGTCNANK 175
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTP 248
+S I GY+ V E LQ V+ QP+SVAIDA+ F FY+ GVFTG CG
Sbjct: 176 ASIHAV--TITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTEL 233
Query: 249 NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
+HGVT VGYG + YWLVKN WG +W E G +R+ RG+ + GLC IA A+YP
Sbjct: 234 DHGVTAVGYGVGNDG---TKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYP 290
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 214 bits (545), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 124/314 (39%), Positives = 170/314 (54%), Gaps = 25/314 (7%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+ W R+Y E RF ++++N EF L N+FADLT E+FL
Sbjct: 50 RFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFL 109
Query: 61 ASYTGY---KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--C 115
A+YTGY P D + + S ++ S+DW +GAV P K Q S C C
Sbjct: 110 ATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTCSSC 169
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNG-CAKNFLENAFEYIRQYQRLASEC 174
WAF AT+E LN I+TG+LV+ S+ QLVDC + +G C A++++ + L +E
Sbjct: 170 WAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGGLTTEA 229
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-TWFN 233
YPY R+ C+ R+ ++ I G+ V P E LQ V+RQPV+VAI+ +
Sbjct: 230 DYPYTARRGP-CN--RAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEVGSGMQ 286
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GGV+TGPCG H VT+VGYGT +A YW +KN WG +W E G +RI R VG
Sbjct: 287 FYKGGVYTGPCGTRLAHAVTVVGYGT--DASSGAKYWTIKNSWGQSWGERGYIRILRDVG 344
Query: 294 GSGLCNIAANAAYP 307
G GLC + + AYP
Sbjct: 345 GPGLCGVTLDIAYP 358
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 130/317 (41%), Positives = 178/317 (56%), Gaps = 28/317 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREK 58
+ A +E W+V+ +TY EK+ RF+IFK N F L LNKFADLT E+
Sbjct: 48 VNALYESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHNSGDHTYKLGLNKFADLTNEE 107
Query: 59 FLASYTGYKPPPTDHPHSN-RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+ +YTG K S +S+ + + + Y +DW E+GAVT VKDQGS CW
Sbjct: 108 YRMTYTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEY--VDWREQGAVTDVKDQGSCGSCW 165
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASEC 174
AF+ +VEG+NKI TG L++ S+ +LV+C T GC ++ AFE+I + + +E
Sbjct: 166 AFSTTGSVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEE 225
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY G+ D CD ++ + K I Y+ V E L+ VS QPV+VAI+A F
Sbjct: 226 DYPYTGK-DGKCD--KNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDF 282
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY G+FTG CG +HGV GYGT E + YWLVKN WG W EGG +++ R +
Sbjct: 283 QFYTSGIFTGSCGTALDHGVLAAGYGT----EDGKDYWLVKNSWGAEWGEGGYLKMERNI 338
Query: 293 GG-SGLCNIAANAAYPL 308
SG C IA A+YP+
Sbjct: 339 ADKSGKCGIAMEASYPI 355
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 214 bits (545), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 133/309 (43%), Positives = 174/309 (56%), Gaps = 34/309 (11%)
Query: 19 MVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFLASYTG 65
M + R YKD EKE RFKIFK N F L +N+FADLT E+F +
Sbjct: 1 MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNR 60
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
+K H S + FK N + + +IDW ++GAVTP+KDQ CCWAF+AVA
Sbjct: 61 FKA----HICSEATT-FKYENVTAVP--STIDWRKKGAVTPIKDQQQCGCCWAFSAVAAT 113
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
EG+ +I TG+L++ S+ +LVDC T GC+ +++AF +I+ LASE YPY+G
Sbjct: 114 EGITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRFIK-IHGLASEATYPYEG- 171
Query: 182 QDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGV 239
D C+ + + I+GY+ V E+ LQ V+ QPV+VAIDA F FY GV
Sbjct: 172 DDGTCNSKKEAHPA--AKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGV 229
Query: 240 FTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLC 298
FTG CG +HGV VGYG + YWLVKN WGT W E G +R+ R V GLC
Sbjct: 230 FTGQCGTELDHGVAAVGYGIGDDG---MMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLC 286
Query: 299 NIAANAAYP 307
IA A+YP
Sbjct: 287 GIAMQASYP 295
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 128/316 (40%), Positives = 175/316 (55%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +WM RTY E+E R+++F+ N H F L LN+FADLT ++
Sbjct: 44 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ A+Y G + P R + + +S+DW +GAV VKDQGSY CWA
Sbjct: 104 YRATYLGART----RPQRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSYGSCWA 159
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+ +A VEG+N+I TG L++ S+ +LVDC T GC ++ AFE+I + +E
Sbjct: 160 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 219
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G D CD R +A K I Y+ V E+ LQ V+ QPVSVAI+A T F
Sbjct: 220 YPYKG-TDGRCDVNRKNA--KVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTQFQ 276
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGVT VGYGT E + YW+VKN WG++W E G +R+ R +
Sbjct: 277 LYSSGIFTGSCGTALDHGVTAVGYGT----ENGKDYWIVKNSWGSSWGESGYVRMERNIK 332
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 333 ASSGKCGIAVEPSYPL 348
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 175/321 (54%), Gaps = 33/321 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-------------HEF-LRLNKFADLTR 56
+A +HE+WM + R Y D AEK R ++F+ N H+F L N+FADLT
Sbjct: 36 MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 95
Query: 57 EKFLASYTGYKPPPTDHPHSNRS-NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F A+ TG +P + NR+ F+ N S S+DW +GAV PVKDQG C
Sbjct: 96 AEFRATRTGLRPSSS---RGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGC 152
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+AVA +EG K+ TG+LV+ S+ QLV C GC +++AF++I + LA
Sbjct: 153 CWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLA 212
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
+E YPY D C + A I+GY+ V E L V+ QPVSVAID
Sbjct: 213 AESDYPYTASDD-KCA--TAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGD 269
Query: 230 TWFNFYHGGVFTGP--CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F FY GGV +G C +H +T VGYG ++ YWL+KN WGT+W E G +R
Sbjct: 270 RHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDG---TKYWLMKNSWGTSWGEDGYVR 326
Query: 288 IFRGVGG-SGLCNIAANAAYP 307
+ RGV G+C +A A+YP
Sbjct: 327 MERGVADKEGVCGLAMMASYP 347
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 214 bits (544), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 128/317 (40%), Positives = 176/317 (55%), Gaps = 28/317 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREK 58
+ + +E+W+V+ + Y EK+ RF+IFK N F L LN+FADLT E+
Sbjct: 36 VNSLYEEWLVKHGKLYNALGEKDKRFQIFKDNLRFIDQQNAENRTYKLGLNRFADLTNEE 95
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ A Y G K P SN + + DS+DW + GAV PVKDQ S CWA
Sbjct: 96 YRARYLGTKIDPNRRLGRTPSNRYAPRVGETLP--DSVDWRKEGAVVPVKDQASCGSCWA 153
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQRLASECV 175
F+A+ VEG+NKI TG L++ S+ +LVDC T GC ++ AFE+I + + SE
Sbjct: 154 FSAIGAVEGINKIVTGDLISLSEQELVDCDTGYNMGCNGGLMDYAFEFIIKNGGIDSEED 213
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FN 233
YPY+G D CD +R +A K +I GY+ V E L+ V+ QPVSVA++ F
Sbjct: 214 YPYKG-VDGRCDEYRKNA--KVVSIDGYEDVNTYDELALKKAVANQPVSVAVEGGGREFQ 270
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
Y GVFTG CG +HGV VGYGT + +W+V+N WG +W E G +R+ R +G
Sbjct: 271 LYSSGVFTGRCGTALDHGVVAVGYGT----DNGHDFWIVRNSWGADWGEEGYIRLERNLG 326
Query: 294 G--SGLCNIAANAAYPL 308
SG C IA +YP+
Sbjct: 327 NSRSGKCGIAIEPSYPI 343
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 213 bits (543), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 126/304 (41%), Positives = 170/304 (55%), Gaps = 40/304 (13%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNR 78
EK RF +FK+N HEF L+LNKFAD+T +F ++Y G K N
Sbjct: 53 EKHKRFNVFKENVNFVHEFNKKDEPYKLKLNKFADMTNHEFRSTYAGSK--------VNH 104
Query: 79 SNWFKNLNSSKMSFY--------DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNK 129
F+ + SF S+DW ++GAVTP+KDQG CWAF+ V VEG+N
Sbjct: 105 HRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQCGSCWAFSTVVAVEGINH 164
Query: 130 IRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCD 187
I+T +LV+ S+ +LVDC T GC + AFE+I++ + +E YPY +D CD
Sbjct: 165 IKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGITTEQSYPYTA-EDGTCD 223
Query: 188 WWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCG 245
S + +I G++ V P E+ L + QP+SVAIDA + F FY GVF G CG
Sbjct: 224 --VSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSAFQFYSEGVFAGRCG 281
Query: 246 NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANA 304
+HGV IVGYGTT + YW+VKN WGT+W E G +R+ RG+ GLC IA A
Sbjct: 282 TDLDHGVAIVGYGTTLDG---TKYWIVKNSWGTDWGENGYIRMKRGISAKEGLCGIAVEA 338
Query: 305 AYPL 308
+YP+
Sbjct: 339 SYPI 342
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 213 bits (543), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 132/327 (40%), Positives = 180/327 (55%), Gaps = 35/327 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------RL--NKFADLTREKFLA 61
+ EQWM R Y D EK+ R +++++N E + RL NKFADLT E+F A
Sbjct: 53 RFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFADLTNEEFRA 112
Query: 62 SYTGYKPPPTD--HPHSNRSNWFKNLNSSKMS------FYDSIDWNERGAVTPVKDQGSY 113
G+ P + HS + + S M S+DW E+GAV PVK QG
Sbjct: 113 KMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVKSQGDC 172
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLA 171
CWAF+AVA +EG+N+I+ G+LV+ S+ +LVDC T GCA ++ AFE++ + + L
Sbjct: 173 GSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVMKNRGLT 232
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
+E YPYQG + C + S +I GY V P++E L + QPVSVA+DA
Sbjct: 233 TERNYPYQG-LNGACQTPKLKESAV--SISGYMNVTPSSEPDLLRAAAAQPVSVAVDAGS 289
Query: 232 F--NFYHGGVFTGPCGNTPNHGVTIVGYGTT---TEAEGQ----QPYWLVKNRWGTNWDE 282
F Y GGVFTGPC NHGVT+VGYG T T+ +G + YW+VKN WG W +
Sbjct: 290 FVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPEWGD 349
Query: 283 GGSMRIFRGVG-GSGLCNIAANAAYPL 308
G + + R SGLC IA +YP+
Sbjct: 350 AGYILMQREASVASGLCGIAMLPSYPV 376
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 177/318 (55%), Gaps = 28/318 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTRE 57
++ A +E W+ + ++Y EKE RF+IFK N F+ LN+FADLT E
Sbjct: 46 DVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNE 105
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ + Y G + + S+ + S +S+DW ++GAV VKDQGS CW
Sbjct: 106 EYRSMYLGTRTAAKRRSSNKISDRYAFRVGD--SLPESVDWRKKGAVVEVKDQGSCGSCW 163
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASEC 174
AF+ +A VEG+NKI TG L++ S+ +LVDC T GC ++ AFE+I + SE
Sbjct: 164 AFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEE 223
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY+ D CD +R +A K I GY+ V E+ L+ V+ QPVSVAI+A F
Sbjct: 224 DYPYKA-SDGRCDQYRKNA--KVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREF 280
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y G+FTG CG +HGVT VGYGT E YW+VKN WG +W E G +R+ R +
Sbjct: 281 QLYQSGIFTGRCGTALDHGVTAVGYGT----ENGVDYWIVKNSWGASWGEEGYIRMERDL 336
Query: 293 GGS--GLCNIAANAAYPL 308
S G C IA A+YP+
Sbjct: 337 ATSATGKCGIAMEASYPI 354
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 175/321 (54%), Gaps = 33/321 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-------------HEF-LRLNKFADLTR 56
+A +HE+WM + R Y D AEK R ++F+ N H+F L N+FADLT
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 57 EKFLASYTGYKPPPTDHPHSNRS-NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F A+ TG +P + NR+ F+ N S S+DW +GAV PVKDQG C
Sbjct: 61 AEFRATRTGLRPSSS---RGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGC 117
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+AVA +EG K+ TG+LV+ S+ QLV C GC +++AF++I + LA
Sbjct: 118 CWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLA 177
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
+E YPY D C + A I+GY+ V E L V+ QPVSVAID
Sbjct: 178 AESDYPYTASDD-KCA--TAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGD 234
Query: 230 TWFNFYHGGVFTGP--CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F FY GGV +G C +H +T VGYG ++ YWL+KN WGT+W E G +R
Sbjct: 235 RHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDG---TKYWLMKNSWGTSWGEDGYVR 291
Query: 288 IFRGVGG-SGLCNIAANAAYP 307
+ RGV G+C +A A+YP
Sbjct: 292 MERGVADKEGVCGLAMMASYP 312
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 213 bits (542), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 132/327 (40%), Positives = 179/327 (54%), Gaps = 35/327 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------RL--NKFADLTREKFLA 61
+ EQWM R Y D EK+ R +++++N E + RL NKFADLT E+F A
Sbjct: 32 RFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGNGYRLADNKFADLTNEEFRA 91
Query: 62 SYTGYKPPPTD--HPHSNRSNWFKNLNSSKMS------FYDSIDWNERGAVTPVKDQGSY 113
G+ P + HS + + S M S+DW E+GAV PVK QG
Sbjct: 92 KMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVKSQGDC 151
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLA 171
CWAF+AVA +EG+N+I+ G+LV+ S+ +LVDC T GCA ++ AFE++ + + L
Sbjct: 152 GSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVMKNRGLT 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
+E YPYQG C + S +I GY V P++E L + QPVSVA+DA
Sbjct: 212 TERNYPYQGLNG-ACQTPKLKESAV--SISGYMNVTPSSEPDLLRAAAAQPVSVAVDAGS 268
Query: 232 F--NFYHGGVFTGPCGNTPNHGVTIVGYGTT---TEAEGQ----QPYWLVKNRWGTNWDE 282
F Y GGVFTGPC NHGVT+VGYG T T+ +G + YW+VKN WG W +
Sbjct: 269 FVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPEWGD 328
Query: 283 GGSMRIFRGVG-GSGLCNIAANAAYPL 308
G + + R SGLC IA +YP+
Sbjct: 329 AGYILMQREASVASGLCGIAMLPSYPV 355
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 182/311 (58%), Gaps = 27/311 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E W+V+ ++Y EK+ RFKIF+ N ++ L LN+FAD+T E++
Sbjct: 51 ESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLNRFADITNEEYRTG 110
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
Y G K + + ++S+ + + S DSIDW E+GAVT VKDQGS CWAF+ +
Sbjct: 111 YLGAKRDASRNMVKSKSDRYAPVAGD--SLPDSIDWREKGAVTGVKDQGSCGSCWAFSTI 168
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCS-TLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N++ TG L++ S+ +LVDC +N GC + AF++I + + SE YPY
Sbjct: 169 AAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFIIKNGGIDSEEDYPYT 228
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YHG 237
G+ D CD +R + + K +I GY+ V E+ LQ V+ QPVSVAI+A ++F Y
Sbjct: 229 GK-DGKCDSYRQN-NAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAIEAGGYDFQLYSS 286
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSG 296
G+FTG CG +HGV VGYGT E YW+VKN WG W E G +R+ R V +G
Sbjct: 287 GIFTGSCGTDLDHGVAAVGYGT----ENGVDYWIVKNSWGDYWGEKGYVRMQRNVKAKTG 342
Query: 297 LCNIAANAAYP 307
LC IA A+YP
Sbjct: 343 LCGIAMEASYP 353
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 213 bits (542), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 129/311 (41%), Positives = 173/311 (55%), Gaps = 27/311 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E+W+ ++ + Y EK RF++FK N +L LN+FADLT ++F A+Y
Sbjct: 52 EKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTSYWLGLNEFADLTHDEFKATY 111
Query: 64 TGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
G PPPT + S F+ S +DW ++ AVT VK+QG CWAF+ V
Sbjct: 112 LGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTV 171
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N I TG L + S+ +L+DCST NGC ++ AF YI L +E YPY
Sbjct: 172 AAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTEEAYPY- 230
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
++ CD + +A I GY+ V E+ L ++ QPVSVAI+A+ F FY G
Sbjct: 231 AMEEGDCDEGKGAA---VVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSG 287
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSG 296
GVF GPCG +HGVT VGYGT+ Q Y +VKN WG +W E G +R+ RG G G G
Sbjct: 288 GVFDGPCGEQLDHGVTAVGYGTSK----GQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEG 343
Query: 297 LCNIAANAAYP 307
LC I A+YP
Sbjct: 344 LCGINKMASYP 354
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 213 bits (542), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 125/317 (39%), Positives = 176/317 (55%), Gaps = 26/317 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTRE 57
+ A +E+W+V + Y + +K+ RF++FK N F++ LNKFAD+T E
Sbjct: 34 VMAMYEEWLVRHQKGYNELGKKDKRFQVFKDNLGFIQEHNNNLNNTYKLGLNKFADMTNE 93
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ A Y G K +S + S++ +DW +GAV P+KDQGS CW
Sbjct: 94 EYRAMYLGTKSNAKRRLMKTKSTGHRYAFSARDRLPVHVDWRMKGAVAPIKDQGSCGSCW 153
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASEC 174
AF+ VATVE +NKI TG+ V+ S+ +LVDC GC ++ AFE+I Q + ++
Sbjct: 154 AFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNEGCNGGLMDYAFEFIIQNGGIDTDK 213
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WF 232
YPY+G D CD + +A K I GY+ V P E L+ V+ QPVSVAI+A+
Sbjct: 214 DYPYRGF-DGICDPTKKNA--KVVNIDGYEDVPPYDENALKKAVAHQPVSVAIEASGRAL 270
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GVFTG CG + +HGV +VGYG+ E YWLV+N WGT W E G ++ R V
Sbjct: 271 QLYQSGVFTGKCGTSLDHGVVVVGYGS----ENGVDYWLVRNSWGTGWGEDGYFKMQRNV 326
Query: 293 GGS-GLCNIAANAAYPL 308
S G C I A+YP+
Sbjct: 327 RTSTGKCGITMEASYPV 343
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 130/325 (40%), Positives = 182/325 (56%), Gaps = 28/325 (8%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
MSRT ++ + A H+QWM+++ RTY + +E E R KIFK+N E++
Sbjct: 20 MSRTLTESSVVEA-HQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENFNNVGNKSYKLG 78
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFK-NLNSSKMSFYDSIDWNERGAVTP 106
LN+++DLT E+F+AS+TG+K RS NLN + + DW E+G VT
Sbjct: 79 LNRYSDLTSEEFIASHTGFKVSDQLSDSKMRSVAIPFNLNDDVPT---NFDWREKGVVTD 135
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TLNGCAKNFLENAFEYI 164
VK+Q CCWAFTAVA VEG+ KI+ G L++ S+ QLVDC +GC AF+ I
Sbjct: 136 VKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDRQSSGCGGGDFVLAFDSI 195
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ + + E YPY+ C + + + I GY V E+ L V +QPVS
Sbjct: 196 IKSRGIVKEDDYPYKANDVQTCQLGQIPGAAQ---INGYFKVPANDEQQLLRAVLQQPVS 252
Query: 225 VAIDATW-FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
VAI ++ F+ Y GGV+ G CG NH VTI+GYG + E + YWL+KN WG W E
Sbjct: 253 VAISTSYDFHHYMGGVYEGSCGPKLNHAVTIIGYGVS---EAGKKYWLIKNSWGETWGEK 309
Query: 284 GSMRIFRGVGGS-GLCNIAANAAYP 307
G M++ R + G C+IA +AAYP
Sbjct: 310 GYMKVLRESSATGGQCSIAVHAAYP 334
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 128/325 (39%), Positives = 175/325 (53%), Gaps = 33/325 (10%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
T + A +E W++++ ++Y E E RF+IFK+ F+ LN+
Sbjct: 31 TQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQ 90
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F ++Y G+ SNR ++ + Y +DW GAV +K Q
Sbjct: 91 FADLTDEEFRSTYLGFTSGSNKTKVSNR---YEPRVGQVLPSY--VDWRSAGAVVDIKSQ 145
Query: 111 GSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIR 165
G C CWAF+A+ATVEG+NKI TG L++ S+ +L+DC GC ++ + F++I
Sbjct: 146 GE-CGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFII 204
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ +E YPY QD C+ + KY I Y+ V E LQ V+ QPVSV
Sbjct: 205 NNGGINTEENYPYTA-QDGECNL--DLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSV 261
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
A+DA F Y G+FTGPCG +H VTIVGYGT EG YW+VKN W T W E
Sbjct: 262 ALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT----EGGIDYWIVKNSWDTTWGEE 317
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G MRI R VGG+G C IA +YP+
Sbjct: 318 GYMRILRNVGGAGTCGIATMPSYPV 342
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 213 bits (541), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 175/321 (54%), Gaps = 33/321 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-------------HEF-LRLNKFADLTR 56
+A +HE+WM + R Y D AEK R ++F+ N H+F L N+FADLT
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 57 EKFLASYTGYKPPPTDHPHSNRS-NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F A+ TG +P + NR+ F+ N S S+DW +GAV PVKDQG C
Sbjct: 61 AEFRATRTGLRPSSS---RGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGC 117
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+AVA +EG K+ TG+LV+ S+ QLV C GC +++AF++I + LA
Sbjct: 118 CWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLA 177
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
+E YPY D C + A I+GY+ V E L V+ QPVSVAID
Sbjct: 178 AESDYPYTASDD-KCA--TAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGD 234
Query: 230 TWFNFYHGGVFTGP--CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F FY GGV +G C +H +T VGYG ++ YWL+KN WGT+W E G +R
Sbjct: 235 RHFQFYKGGVLSGAAGCATELDHAITAVGYGVASDG---TKYWLMKNSWGTSWGEDGYVR 291
Query: 288 IFRGVGG-SGLCNIAANAAYP 307
+ RGV G+C +A A+YP
Sbjct: 292 MERGVADKEGVCGLAMMASYP 312
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 212 bits (540), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 129/327 (39%), Positives = 172/327 (52%), Gaps = 27/327 (8%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
+ T ++ +E W+V+ + Y EKE RF+IFK N F L
Sbjct: 38 LQSTERTEAHMMKMYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLG 97
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
L KFADLT E++ A Y G K + + RS + + + +DW E+GAVT V
Sbjct: 98 LTKFADLTNEEYRAMYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEV 157
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYI 164
KDQG CWAF+ V +VEG+N+I TG L++ S+ +LVDC GC ++ AFE+I
Sbjct: 158 KDQGQCGSCWAFSTVGSVEGINQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFI 217
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ + SE YPY+ D CD R +A I GY+ V EE L+ V+ QPVS
Sbjct: 218 IKNGGIDSEADYPYRA-SDNMCDSNRKNA--HVVTIDGYEDVPENDEESLKKAVANQPVS 274
Query: 225 VAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAI+A F Y GVFTG CG +HGV VGYGT E YW+V+N WG W E
Sbjct: 275 VAIEAGGREFQLYQSGVFTGRCGTNLDHGVVAVGYGT----ENGIDYWIVRNSWGPKWGE 330
Query: 283 GGSMRIFRGVGG--SGLCNIAANAAYP 307
G +R+ R V +G C IA A+YP
Sbjct: 331 SGYIRMERNVASTDTGKCGIAMEASYP 357
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/325 (39%), Positives = 175/325 (53%), Gaps = 33/325 (10%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
T + A +E W++++ ++Y E E RF+IFK+ F+ LN+
Sbjct: 31 TQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQ 90
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F ++Y G+ SNR ++ + Y +DW GAV +K Q
Sbjct: 91 FADLTDEEFRSTYLGFTSGSNKTKVSNR---YEPRFGQVLPSY--VDWRSAGAVVDIKSQ 145
Query: 111 GSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIR 165
G C CWAF+A+ATVEG+NKI TG L++ S+ +L+DC GC ++ + F++I
Sbjct: 146 GE-CGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFII 204
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ +E YPY QD C+ + KY I Y+ V E LQ V+ QPVSV
Sbjct: 205 NNGGINTEENYPYTA-QDGECNL--DLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSV 261
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
A+DA F Y G+FTGPCG +H VTIVGYGT EG YW+VKN W T W E
Sbjct: 262 ALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT----EGGIDYWIVKNSWDTTWGEE 317
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G MRI R VGG+G C IA +YP+
Sbjct: 318 GYMRILRNVGGAGTCGIATMPSYPV 342
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 120/313 (38%), Positives = 178/313 (56%), Gaps = 27/313 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTRE 57
+ + E+WM E+ R YKD EK RF+IFK N + + +N+F D+T+
Sbjct: 6 MMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFTDMTKS 65
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F+A YTG P S F ++N S + SIDW + GAV VK+Q CW
Sbjct: 66 EFVAQYTGVSLPLNIEREPVVS--FDDVNISAVP--QSIDWRDYGAVNEVKNQNPCGSCW 121
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVY 176
AF A+ATVEG+ KI+TG LV+ S+ +++DC+ GC ++ A+++I + +E Y
Sbjct: 122 AFAAIATVEGIYKIKTGYLVSLSEQEVLDCAVSYGCKGGWVNKAYDFIISNNGVTTEENY 181
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFY 235
PYQ Q C+ +++ I GY YV+ E + VS QP++ IDA+ F +Y
Sbjct: 182 PYQAYQG-TCN---ANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYY 237
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GG 294
+GGVF+GPCG + NH +TI+GYG + YW+V+N WG++W EGG +R+ RGV
Sbjct: 238 NGGVFSGPCGTSLNHAITIIGYGQDSSG---TKYWIVRNSWGSSWGEGGYVRMARGVSSS 294
Query: 295 SGLCNIAANAAYP 307
SG C IA + +P
Sbjct: 295 SGACGIAMSPLFP 307
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 126/316 (39%), Positives = 175/316 (55%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +WM RTY E+E RF++F+ N H F L LN+FADLT ++
Sbjct: 46 YAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTNDE 105
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ A+Y G + P R + L +S+DW +GAV VKDQGS CWA
Sbjct: 106 YRATYLGVRS----RPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEVKDQGSCGSCWA 161
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+ +A VEG+N+I TG +++ S+ +LVDC T GC ++ AFE+I + +E
Sbjct: 162 FSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEED 221
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G D CD R +A K I Y+ V +E+ LQ V+ QP+SVAI+A F
Sbjct: 222 YPYKG-TDGRCDVNRKNA--KVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQ 278
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y+ G+FTG CG +HGVT VGYGT E + YW+VKN WG++W E G +R+ R +
Sbjct: 279 LYNSGIFTGTCGTALDHGVTAVGYGT----ENGKDYWIVKNSWGSSWGESGYVRMERNIK 334
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 335 ASSGKCGIAVEPSYPL 350
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 124/324 (38%), Positives = 177/324 (54%), Gaps = 26/324 (8%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
T++ + +E+W+V+ + Y EK+ RF++FK N F++ LNK
Sbjct: 29 TNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNK 88
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FAD+T E++ Y G K +S + S+ +DW +GAV P+KDQ
Sbjct: 89 FADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYSAGDQLPVHVDWRVKGAVAPIKDQ 148
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQY 167
GS CWAF+ VATVE +NKI TG+ V+ S+ +LVDC GC ++ AFE+I Q
Sbjct: 149 GSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNQGCNGGLMDYAFEFIIQN 208
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
+ ++ YPY+G D CD + +A K I GY+ V P E L+ V+RQPVS+AI
Sbjct: 209 GGIDTDKDYPYRGF-DGICDPTKKNA--KAVNIDGYEDVPPYDENALKKAVARQPVSIAI 265
Query: 228 DAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+A+ Y GVFTG CG + +HGV +VGYG+ E YWLV+N WGT W E G
Sbjct: 266 EASGRALQLYQSGVFTGECGTSLDHGVVVVGYGS----ENGVDYWLVRNSWGTGWGEDGY 321
Query: 286 MRIFRGV-GGSGLCNIAANAAYPL 308
++ R V +G C I A+YP+
Sbjct: 322 FKMQRNVRTPTGKCGITMEASYPV 345
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 212 bits (540), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 121/328 (36%), Positives = 175/328 (53%), Gaps = 45/328 (13%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
+R + + A+HEQWMV+++R YKD EK RF++FK N +F L +
Sbjct: 24 ARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGV 83
Query: 49 NKFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
N+FADLT ++F A+ T G+KP P P F+ N S + +IDW +GAVTP
Sbjct: 84 NQFADLTNDEFRATKTNKGFKPSPVKVPTG-----FRYENVSVDALPATIDWRTKGAVTP 138
Query: 107 VKDQGSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
+KDQG EG+ KI TG+L++ S+ +LVDC GC +++AF++
Sbjct: 139 IKDQGQ-----------CEGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFQF 187
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + L +E YPY D C S S ++G++ V E L V+ QPV
Sbjct: 188 IIKNGGLTTESSYPYTA-ADGKC----KSGSNSAATVKGFEDVPANDEAALMKAVANQPV 242
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVA+D F FY GGV TG CG +HG+ +GYG T++ YWL+KN WGT W
Sbjct: 243 SVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDG---TKYWLLKNSWGTTWG 299
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYPL 308
E G +R+ + + G+C +A +YP+
Sbjct: 300 ENGYLRMEKDISDKRGMCGLAMEPSYPI 327
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/325 (39%), Positives = 175/325 (53%), Gaps = 33/325 (10%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
T + A +E W++++ ++Y E E RF+IFK+ F+ LN+
Sbjct: 31 TQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQ 90
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F ++Y G+ SNR ++ + Y +DW GAV +K Q
Sbjct: 91 FADLTDEEFRSTYLGFTSGSNKTKVSNR---YEPRVGQVLPSY--VDWRSAGAVVDIKSQ 145
Query: 111 GSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIR 165
G C CWAF+A+ATVEG+NKI TG L++ S+ +L+DC GC ++ + F++I
Sbjct: 146 GE-CGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFII 204
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ +E YPY QD C+ + KY I Y+ V E LQ V+ QPVSV
Sbjct: 205 NNGGINTEENYPYTA-QDGECN--VDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSV 261
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
A+DA F Y G+FTGPCG +H VTIVGYGT EG YW+VKN W T W E
Sbjct: 262 ALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT----EGGIDYWIVKNSWDTTWGEE 317
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G MRI R VGG+G C IA +YP+
Sbjct: 318 GYMRILRNVGGAGTCGIATMPSYPV 342
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 178/313 (56%), Gaps = 39/313 (12%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFL 60
+HEQWM ++ R YKD AEKE R+ IFK+N F L +N+FADL+ E+F
Sbjct: 4 RHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNEEFK 63
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYCCWAFTA 120
AS +K H S ++ F+ N S + ++DW ++GAVTPVKDQG
Sbjct: 64 ASRNRFK----GHMCSPQAGPFRYENVSAVPA--TMDWRKKGAVTPVKDQGQ-------C 110
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYP 177
VA +EG+N++ TG+L++ S+ ++VDC T GC +++AF++I Q + L +E YP
Sbjct: 111 VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYP 170
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFY 235
Y G D C+ + + I G+Q V +E L V++QPVSVAIDA F FY
Sbjct: 171 YTGT-DGTCNTQKEVSHA--AKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFY 227
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG- 294
G+FTG CG +HGVT VGYG + + YWLVKN WG W E G +R+ + +
Sbjct: 228 SSGIFTGSCGTELDHGVTAVGYGGSDGTK----YWLVKNSWGAQWGEEGYIRMQKDISAK 283
Query: 295 SGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 284 EGLCGIAMQASYP 296
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 125/316 (39%), Positives = 175/316 (55%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +WM RTY E+E RF++F+ N H F L LN+FADLT ++
Sbjct: 46 YAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTNDE 105
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ A+Y G + P R + L +S+DW +GAV +KDQGS CWA
Sbjct: 106 YRATYLGVRS----RPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEIKDQGSCGSCWA 161
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+ +A VEG+N+I TG +++ S+ +LVDC T GC ++ AFE+I + +E
Sbjct: 162 FSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEED 221
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G D CD R +A K I Y+ V +E+ LQ V+ QP+SVAI+A F
Sbjct: 222 YPYKG-TDGRCDVNRKNA--KVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQ 278
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y+ G+FTG CG +HGVT VGYGT E + YW+VKN WG++W E G +R+ R +
Sbjct: 279 LYNSGIFTGTCGTALDHGVTAVGYGT----ENGKDYWIVKNSWGSSWGESGYVRMERNIK 334
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 335 ASSGKCGIAVEPSYPL 350
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 123/317 (38%), Positives = 180/317 (56%), Gaps = 26/317 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ ++ W+++ + Y E+E RF+IFK N F L LNKFADLT +
Sbjct: 41 VMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQ 100
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ A + G + P ++ + + + + DS+DW + GAV+PVKDQGS CW
Sbjct: 101 EYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSPVKDQGSCGSCW 160
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASEC 174
AF+ +ATVEG+NKI +G+LV+ S+ +LVDC S GC ++ AF++I + +E
Sbjct: 161 AFSTIATVEGINKIVSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFIMDNGGIDTEK 220
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY G + CD + +A K +I GY+ V P E L+ V+ QPVS+AI+A F
Sbjct: 221 DYPYLGFNN-QCDPTKKNA--KVVSIDGYEDV-PNNENALKKAVAHQPVSIAIEAGGRAF 276
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GVF G CG +HGV VGYGT + Q YW+V+N WG+NW E G +R+ R +
Sbjct: 277 QLYESGVFNGECGLALDHGVVAVGYGTD---DNGQDYWIVRNSWGSNWGENGYIRMERNI 333
Query: 293 -GGSGLCNIAANAAYPL 308
+G C IA A+YP+
Sbjct: 334 NANTGKCGIAMEASYPV 350
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 180/318 (56%), Gaps = 33/318 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTRE 57
+ A +E W+V++ ++Y E+EMR +IFK+N F+ LN+FADLT E
Sbjct: 38 VMALYESWLVKYGKSYNSLGEREMRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDE 97
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG-SYCCW 116
++ ++Y G+K S SN + + D +DW GAV VK+QG CW
Sbjct: 98 EYRSTYLGFK----SSLKSKVSNRY--MPQVGEVLPDYVDWRTTGAVVDVKNQGLCSSCW 151
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLN-GCAKNFLENAFEYIRQYQRLASE 173
AF +ATVE +N+I TG L++ S+ +LVDC + +N GC F+++A+E+I + +E
Sbjct: 152 AFATIATVESINQIITGDLISLSEQELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTE 211
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY G QD CD + + + Y I Y+ V P E ++ V+ QPVSVAIDA
Sbjct: 212 ENYPYIG-QDDQCDEPKKNQN--YVTIDSYEQVPPNDELAMKRAVAYQPVSVAIDAYCLG 268
Query: 232 FNFYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F FY G+FTG CG T NH VTI+GYGT E YW+VKN +GT W E G ++ R
Sbjct: 269 FRFYQSGIFTGGSCGTTLNHAVTIIGYGT----ENGIDYWIVKNSYGTQWGESGYGKVQR 324
Query: 291 GVGGSGLCNIAANAAYPL 308
VGG G C IA+ YP+
Sbjct: 325 NVGGEGRCGIASYPFYPV 342
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 212 bits (539), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/325 (39%), Positives = 175/325 (53%), Gaps = 33/325 (10%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
T + A +E W++++ ++Y E E RF+IFK+ F+ LN+
Sbjct: 31 TQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQ 90
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F ++Y G+ SNR ++ + Y +DW GAV +K Q
Sbjct: 91 FADLTDEEFRSTYLGFTSGSNKTKVSNR---YEPRVGQVLPSY--VDWRSAGAVVDIKSQ 145
Query: 111 GSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIR 165
G C CWAF+A+ATVEG+NKI TG L++ S+ +L+DC GC ++ + F++I
Sbjct: 146 GE-CGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFII 204
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ +E YPY QD C+ + KY I Y+ V E LQ V+ QPVSV
Sbjct: 205 NNGGINTEENYPYTA-QDGECNVELQNE--KYVTIDTYENVPYNNEWALQTAVTYQPVSV 261
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
A+DA F Y G+FTGPCG +H VTIVGYGT EG YW+VKN W T W E
Sbjct: 262 ALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT----EGGIDYWIVKNSWDTTWGEE 317
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G MRI R VGG+G C IA +YP+
Sbjct: 318 GYMRILRNVGGAGTCGIATMPSYPV 342
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 174/315 (55%), Gaps = 31/315 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREK 58
+E W+ + R EKE RF+IFK N F L LN+FAD+T E+
Sbjct: 50 YEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSFRLGLNRFADMTNEE 109
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ Y G +P H R + ++ +S+DW ++GAVT VKDQGS CWA
Sbjct: 110 YRTVYLGTRP--ASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVTTVKDQGSCGSCWA 167
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+ +A VEG+NKI TG L++ S+ +LVDC GC ++ AFE+I + +E
Sbjct: 168 FSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQGCNGGLMDYAFEFIINNGGIDTEED 227
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FN 233
YPY+ R D CD +R +A K +I GY+ V E+ LQ V+ QPVSVAI+A F
Sbjct: 228 YPYKAR-DGKCDQYRKNA--KVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQ 284
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
YH G+FTG CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R V
Sbjct: 285 LYHSGIFTGRCGTDLDHGVVAVGYGT----ENGKDYWIVRNSWGGDWGESGYIRMERNVN 340
Query: 294 GS-GLCNIAANAAYP 307
S G C IA ++YP
Sbjct: 341 ASTGKCGIAMESSYP 355
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 211 bits (538), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 122/312 (39%), Positives = 176/312 (56%), Gaps = 29/312 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLA 61
+E+W+VE + Y EKE RF+IFK N +F+ L +FADLT ++F A
Sbjct: 43 YERWLVENRKNYNGLGEKERRFEIFKDNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRA 102
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
Y K T P +K +S D+IDW +GAV PVKDQGS CWAF+A
Sbjct: 103 IYLRSKMERTRVPVKGEKYLYKVGDS----LPDAIDWRAKGAVNPVKDQGSCGSCWAFSA 158
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPY 178
+ VEG+N+I+TG+L++ S+ +LVDC T +GC ++ AF++I + + +E YPY
Sbjct: 159 IGAVEGINQIKTGELISLSEQELVDCDTSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPY 218
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYH 236
C+ + + + I GY+ V E+ L+ ++ QP+SVAI+A F Y
Sbjct: 219 IATDVNVCNSDKKNT--RVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYT 276
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GS 295
GVFTG CG + +HGV VGYG+ EG Q YW+V+N WG+NW E G ++ R + S
Sbjct: 277 SGVFTGTCGTSLDHGVVAVGYGS----EGGQDYWIVRNSWGSNWGESGYFKLERNIKESS 332
Query: 296 GLCNIAANAAYP 307
G C +A A+YP
Sbjct: 333 GKCGVAMMASYP 344
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 130/320 (40%), Positives = 180/320 (56%), Gaps = 29/320 (9%)
Query: 9 GNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLT 55
G+ A +E+WMV+ R Y EKE RF+IF+ N E+ L LN FAD+T
Sbjct: 28 GSFRALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMT 87
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++F A Y G K P ++ S F+ +++ + DW +GAV VK+QG+
Sbjct: 88 HDEFKALYFGTKVPLSNTIKSG----FRYEDATNLPL--DTDWRSKGAVATVKNQGACGS 141
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLAS 172
CWAF+ VA VEG+N+I TG+LV+ S+ +LVDC GC +++AFE+I Q L S
Sbjct: 142 CWAFSTVAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDS 201
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY+ CD R ++ I G++ V +E L V+ QPVSVAI+A+
Sbjct: 202 EADYPYKAVSG-SCDESRRNS--HVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGR 258
Query: 233 NF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEG-QQPYWLVKNRWGTNWDEGGSMRIF 289
NF Y GGV+TG CG +HGV VGYGT+ +G YW+V+N WG W E G +R+
Sbjct: 259 NFQLYSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQ 318
Query: 290 RGVGGS-GLCNIAANAAYPL 308
R V S G C IA A+YP+
Sbjct: 319 RNVASSRGKCGIAMMASYPV 338
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 170/314 (54%), Gaps = 39/314 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLAS 62
E+WM +F +TYK EKE RF IF+ N F+R +N+FADLT ++F+A+
Sbjct: 37 EEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVAT 96
Query: 63 YTGYKPP-PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
YTG KPP P + P W IDW RGAVT VKDQG+ CWAF A
Sbjct: 97 YTGAKPPHPKEAPRPVDPIWTPCC----------IDWRFRGAVTGVKDQGACGSCWAFAA 146
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VA +EGL KIRTGQL S+ +LVDC T NGC + AFE + + +E Y Y+
Sbjct: 147 VAAIEGLTKIRTGQLTPLSEQELVDCDTNSNGCGGGHTDRAFELVASKGGITAESDYRYE 206
Query: 180 GRQDYYC---DWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
G Q C D + A+ +I GY+ V P E L V+RQPV+V IDA+ F F
Sbjct: 207 GFQG-KCRVDDMLFNHAA----SIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQF 261
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG-VG 293
Y GVF GPCG + NH VT+VGY + + YWL KN WG W + G + + + V
Sbjct: 262 YKSGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQ 319
Query: 294 GSGLCNIAANAAYP 307
G C +A + YP
Sbjct: 320 PHGTCGLAVSPFYP 333
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 128/300 (42%), Positives = 171/300 (57%), Gaps = 32/300 (10%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTG----YKPPPTDHP 74
EK RF +FK N HE L+LNKF D+T E+F +Y G +
Sbjct: 53 EKAKRFNVFKHNVKHIHETNKKDKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEK 112
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
+ +S + N+N+ S+DW + GAVTPVK+QG CWAF+ V VEG+N+IRT
Sbjct: 113 KATKSFMYANVNT----LPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTK 168
Query: 134 QLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
+L + S+ +LVDC T GC ++ AFE+I++ L SE VYPY+ D CD +
Sbjct: 169 KLTSLSEQELVDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKA-SDETCDTNKE 227
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPN 249
+A +I G++ V +E+ L V+ QPVSVAIDA + F FY GVFTG CG N
Sbjct: 228 NAP--VVSIDGHEDVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELN 285
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
HGV +VGYGTT + YW+VKN WG W E G +R+ RG+ GLC IA A+YPL
Sbjct: 286 HGVAVVGYGTTIDG---TKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 130/314 (41%), Positives = 171/314 (54%), Gaps = 53/314 (16%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFL 60
+HE WMV++ R YKD EK R+KIFK N F L +N+FADLT E+F
Sbjct: 38 RHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFR 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
AS +K H S + FK N + + ++DW ++GAVTP+KDQG CWAF+
Sbjct: 98 ASRNRFKA----HICSTEATSFKYENVTAVP--STVDWRKKGAVTPIKDQGQCGSCWAFS 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+ ++ TG+L++ S+ +LVDC T GC Y
Sbjct: 152 AVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCTN---------------------Y 190
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNF 234
PY G D C+ R A+ I GY+ V E+ LQ V+ QP++VAIDA + F F
Sbjct: 191 PYAG-TDGTCN--RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIAVAIDAGGSEFQF 247
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-G 293
Y GVFTG CG +HGV+ VGYGT+ + YWLVKN WGT W E G +R+ R V
Sbjct: 248 YSSGVFTGQCGTELDHGVSAVGYGTSDDG---MKYWLVKNSWGTGWGEEGYIRMQRDVTA 304
Query: 294 GSGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 305 KEGLCGIAMQASYP 318
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 123/319 (38%), Positives = 181/319 (56%), Gaps = 34/319 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ A++++WM ++ R YKD AEK RF++FK N EF L N+FADLT +
Sbjct: 55 MMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSK 114
Query: 58 KFLASYTGYKPPPTDHPHSNR--SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F A YTG + P + + + K N +++ +DW ++GAVTPVK+QG C
Sbjct: 115 EFAAMYTGLRKPAAVPSGAKQIPAAGSKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGC 174
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLA 171
CWAF+AV +EGL I TG LV+ S+ Q++DC + GC +++NAF+Y+ +
Sbjct: 175 CWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVINNGGVT 234
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID--A 229
+E YPY Q C + +A+ I G+Q + E L + V+ QPVSV +D +
Sbjct: 235 TEDAYPYSAVQG-TCQNVQPAAT-----ISGFQDLPSGDENALANAVANQPVSVGVDGGS 288
Query: 230 TWFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
+ F FY GG++ G CG NH VT +GYG + +G Q YW++KN WGT W E G M++
Sbjct: 289 SPFQFYQGGIYDGDGCGTDMNHAVTAIGYG--ADDQGTQ-YWILKNSWGTGWGENGFMQL 345
Query: 289 FRGVGGSGLCNIAANAAYP 307
GVG C I+ A+YP
Sbjct: 346 QMGVGA---CGISTMASYP 361
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 211 bits (538), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 128/325 (39%), Positives = 175/325 (53%), Gaps = 33/325 (10%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
T + A +E W++++ ++Y E E RF+IFK+ F+ LN+
Sbjct: 31 TQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQ 90
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F ++Y G+ SNR ++ + Y +DW GAV +K Q
Sbjct: 91 FADLTDEEFRSTYLGFTSGSNKTKVSNR---YEPRVGQVLPSY--VDWRSAGAVVDIKSQ 145
Query: 111 GSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIR 165
G C CWAF+A+ATVEG+NKI TG L++ S+ +L+DC GC +++ + F +I
Sbjct: 146 GE-CGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGSYITDGFPFII 204
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ +E YPY QD C+ + KY I Y+ V E LQ V+ QPVSV
Sbjct: 205 NNGGINTEENYPYTA-QDGECN--VDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSV 261
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
A+DA F Y G+FTGPCG +H VTIVGYGT EG YW+VKN W T W E
Sbjct: 262 ALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT----EGGIDYWIVKNSWDTTWGEE 317
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G MRI R VGG+G C IA +YP+
Sbjct: 318 GYMRILRNVGGAGTCGIATMPSYPV 342
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 127/318 (39%), Positives = 176/318 (55%), Gaps = 28/318 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTRE 57
++ A +E W+ + ++Y EKE RF+IFK N F+ LN+FADLT E
Sbjct: 48 DVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAENRTYKVGLNRFADLTNE 107
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ + Y G + + S+ + S +S+DW ++GAV VKDQGS CW
Sbjct: 108 EYRSMYLGTRTAAKRRSSNKISDRYAFRVGD--SLPESVDWRKKGAVVEVKDQGSCGSCW 165
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASEC 174
AF+ +A VEG+NKI TG L++ S+ +LVDC T GC ++ AFE+I + SE
Sbjct: 166 AFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEE 225
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY+ D CD +R +A I GY+ V E+ L+ V+ QPVSVAI+A F
Sbjct: 226 DYPYKA-SDGRCDQYRKNAX--VVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREF 282
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y G+FTG CG +HGVT VGYGT E YW+VKN WG +W E G +R+ R +
Sbjct: 283 QLYQSGIFTGRCGTALDHGVTAVGYGT----ENGVDYWIVKNSWGASWGEEGYIRMERDL 338
Query: 293 GGS--GLCNIAANAAYPL 308
S G C IA A+YP+
Sbjct: 339 ATSATGKCGIAMEASYPI 356
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 135/317 (42%), Positives = 178/317 (56%), Gaps = 30/317 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLA 61
+ EQWM + R Y + EK+ RF+++K+N EF L NKFADLT E+F A
Sbjct: 118 RFEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNSGGHGYTLTDNKFADLTNEEFRA 177
Query: 62 SYTGYKPPPTDHPH-----SNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
G D SN N NS+ + +DW ++GAV VK+QGS C
Sbjct: 178 KMLGGLGADPDRRRRARHASNALELPGNDNSTDLP--KDVDWRKKGAVVEVKNQGSCGSC 235
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDC-STLNGCAKNFLENAFEYIRQYQRLASEC 174
WAF+AVA +EGLN+I+ G+LV+ S+ +LVDC + GCA F+ AFE++ L +E
Sbjct: 236 WAFSAVAAMEGLNQIKNGKLVSLSEQELVDCDAEAVGCAGGFMSWAFEFVMANHGLTTEA 295
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF 234
YPY+G C + + S +I GY V +E L V + QPVSVA+DA F F
Sbjct: 296 SYPYKGINGA-CQTAKLNESSV--SITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLF 352
Query: 235 --YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GGVF+GPC NHGVT+VGYG T +AE YW+VKN WG W E G M + R
Sbjct: 353 QLYAGGVFSGPCTAQINHGVTVVGYGETDKAE---KYWIVKNSWGPEWGEAGYMLMQRDA 409
Query: 293 G-GSGLCNIAANAAYPL 308
G +GLC IA A+YP+
Sbjct: 410 GVPTGLCGIAMLASYPV 426
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 126/318 (39%), Positives = 181/318 (56%), Gaps = 31/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------NKFADLTRE 57
++ ++E+W+V+ R YK++ E + F I++ N F+ N+FAD+T E
Sbjct: 40 DMEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQNFSFTLTDNQFADMTNE 99
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ A Y G T + + FK S + S+DW + GAVTPV++QG CW
Sbjct: 100 EYKALYMGLGTSETSRKNQSS---FKRERSKVLPI--SVDWRKMGAVTPVRNQGECGSCW 154
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC---STLNGCAKNFLENAFEYIRQYQRLASE 173
AF+ VA VEG+NKIRTG+LV+ S+ +L+DC S GC ++ NAF++I+Q + +
Sbjct: 155 AFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTA 214
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY G Q C+ + A+ I GY+ V P E+ LQ V++QPVSVAIDA +
Sbjct: 215 RNYPYIGEQG-ICN--KDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYE 271
Query: 234 F--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+F G CG NH VT++GYG E G++ YWLVKN WGT W E G R+ R
Sbjct: 272 FQLYSKGIFNGFCGKQLNHAVTVIGYG---EDNGKK-YWLVKNSWGTGWGEAGYARMIRD 327
Query: 292 V-GGSGLCNIAANAAYPL 308
G+C IA A+YP+
Sbjct: 328 SRDDEGICGIAMEASYPI 345
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 123/302 (40%), Positives = 165/302 (54%), Gaps = 31/302 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-----------HEF-LRLNKFADLTREK 58
+ KHEQWM +F R YKD EK RFK FK N H+F L +N+F DLT ++
Sbjct: 33 MVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAFIESFNTGNHKFWLGVNQFTDLTNDE 92
Query: 59 FLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
F A+ T G K P FK N S + ++DW +G VTP+KDQG CC
Sbjct: 93 FRATKTNKGLKRNGARAPTR-----FKYNNVSTDALPAAVDWRTKGVVTPIKDQGQCGCC 147
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
WAF+AVA EG+ K+ TG+LV+ S+ +LVDC GC ++NAF++I + L +
Sbjct: 148 WAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHGVDQGCEGGEMDNAFKFIIKNGGLTT 207
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPY QD C S+ S I+GY+ V E L V+ QPVSVA+D
Sbjct: 208 EANYPYTA-QDGQCK--TSTTSNSVATIKGYEDVPANDESSLMKAVANQPVSVAVDGGDV 264
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F Y GGV TG CG +HG+ +GYG T++ +WL+KN WGT W E G +R+ +
Sbjct: 265 IFQHYSGGVMTGSCGTDLDHGIVAIGYGMTSDG---TKFWLLKNSWGTTWGESGYLRMEK 321
Query: 291 GV 292
+
Sbjct: 322 DI 323
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 211 bits (537), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 179/318 (56%), Gaps = 28/318 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTRE 57
N+ +E+W + A + EK RF +FK N HE L+LNKFAD+T
Sbjct: 35 NLWDMYERWRHKVATNH---GEKLRRFNVFKSNVLHVHETNKMDKPYKLKLNKFADMTNH 91
Query: 58 KFLASYTGYKPPPTDHP-HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+F + Y G K D +RS + ++ S S+DW ++GAV PVKDQG C
Sbjct: 92 EFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAPVKDQGQCGSC 151
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASE 173
WAF+ VA VEG+NKI+T +LV+ S+ +LVDC TL GC ++ AF++I++ L E
Sbjct: 152 WAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDFIKKTGGLTRE 211
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY +D CD + + +I G++ V E+ L V+ QPV+VAIDA +
Sbjct: 212 DAYPYAA-EDGKCD--SNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDAGSSD 268
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GVFTG CG +HGV VGYGTT + YW+V+N WG+ W E G +R+ RG
Sbjct: 269 FQFYSEGVFTGKCGTQLDHGVAAVGYGTTLDG---TKYWIVRNSWGSEWGEKGYIRMERG 325
Query: 292 VGGS-GLCNIAANAAYPL 308
+ GLC IA A+YP+
Sbjct: 326 ISDKRGLCGIAMEASYPI 343
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 126/318 (39%), Positives = 181/318 (56%), Gaps = 31/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------NKFADLTRE 57
++ ++E+W+V+ R YK++ E + F I++ N F+ N+FAD+T E
Sbjct: 36 DMEKRYERWLVQHGRRYKNRDEWQRHFGIYQSNVRFINYINAQNFSFTLTDNQFADMTNE 95
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ A Y G T + + FK S + S+DW + GAVTPV++QG CW
Sbjct: 96 EYKALYMGLGTSETSRKNQSS---FKRERSKVLPI--SVDWRKMGAVTPVRNQGECGSCW 150
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC---STLNGCAKNFLENAFEYIRQYQRLASE 173
AF+ VA VEG+NKIRTG+LV+ S+ +L+DC S GC ++ NAF++I+Q + +
Sbjct: 151 AFSTVAAVEGINKIRTGKLVSLSEQELLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTA 210
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY G Q C+ + A+ I GY+ V P E+ LQ V++QPVSVAIDA +
Sbjct: 211 RNYPYIGEQG-ICN--KDKAANHVVKISGYETVPPNNEKILQAAVAKQPVSVAIDAGGYE 267
Query: 234 F--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+F G CG NH VT++GYG E G++ YWLVKN WGT W E G R+ R
Sbjct: 268 FQLYSKGIFNGFCGKQLNHAVTVIGYG---EDNGKK-YWLVKNSWGTGWGEAGYARMIRD 323
Query: 292 V-GGSGLCNIAANAAYPL 308
G+C IA A+YP+
Sbjct: 324 SRDDEGICGIAMEASYPI 341
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 211 bits (537), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 132/314 (42%), Positives = 170/314 (54%), Gaps = 39/314 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLAS 62
E+WM +F +TYK EKE RF IF+ N F+R +N+FADLT ++F+A+
Sbjct: 21 EEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVAT 80
Query: 63 YTGYKPP-PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
YTG KPP P + P W IDW RGAVT VKDQG+ CWAF A
Sbjct: 81 YTGAKPPHPKEAPRPVDPIWTPCC----------IDWRFRGAVTGVKDQGACGSCWAFAA 130
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VA +EGL KIRTGQL S+ +LVDC T NGC + AFE + + +E Y Y+
Sbjct: 131 VAAIEGLTKIRTGQLTPLSEQELVDCDTNSNGCGGGHTDRAFELVASKGGITAESDYRYE 190
Query: 180 GRQDYYC---DWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
G Q C D + A+ +I GY+ V P E L V+RQPV+V IDA+ F F
Sbjct: 191 GFQG-KCRVDDMLFNHAA----SIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQF 245
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG-VG 293
Y GVF GPCG + NH VT+VGY + + YWL KN WG W + G + + + V
Sbjct: 246 YKSGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQ 303
Query: 294 GSGLCNIAANAAYP 307
G C +A + YP
Sbjct: 304 PHGTCGLAVSPFYP 317
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 174/316 (55%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +WM RTY E+E R+++F+ N H F L LN+FADLT ++
Sbjct: 41 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 100
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ A+Y G + P R + + +S+DW +GAV VKDQGS CWA
Sbjct: 101 YRATYLGART----RPQRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWA 156
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+ +A VEG+N+I TG L++ S+ +LVDC T GC ++ AFE+I + +E
Sbjct: 157 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 216
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G D CD R +A K I Y+ V E+ LQ V+ QPVSVAI+A T F
Sbjct: 217 YPYKG-TDGRCDVNRKNA--KVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQ 273
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGVT VGYGT E + YW+VKN WG++W E G +R+ R +
Sbjct: 274 LYSSGIFTGSCGTALDHGVTAVGYGT----ENGKDYWIVKNSWGSSWGESGYVRMERNIK 329
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 330 ASSGKCGIAVEPSYPL 345
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 127/316 (40%), Positives = 174/316 (55%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +WM RTY E+E R+++F+ N H F L LN+FADLT ++
Sbjct: 46 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 105
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ A+Y G + P R + + +S+DW +GAV VKDQGS CWA
Sbjct: 106 YRATYLGART----RPQRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWA 161
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+ +A VEG+N+I TG L++ S+ +LVDC T GC ++ AFE+I + +E
Sbjct: 162 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 221
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G D CD R +A K I Y+ V E+ LQ V+ QPVSVAI+A T F
Sbjct: 222 YPYKG-TDGRCDVNRKNA--KVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQ 278
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGVT VGYGT E + YW+VKN WG++W E G +R+ R +
Sbjct: 279 LYSSGIFTGSCGTALDHGVTAVGYGT----ENGKDYWIVKNSWGSSWGESGYVRMERNIK 334
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 335 ASSGKCGIAVEPSYPL 350
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 127/326 (38%), Positives = 179/326 (54%), Gaps = 32/326 (9%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
++ H+ E+W+VE + Y EK+ RF+IF N +F L L
Sbjct: 24 AKADHRNPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYELGL 83
Query: 49 NKFADLTREKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+FADLT E+F A Y K T D S R + N+ D +DW +GAV PV
Sbjct: 84 TRFADLTNEEFRAIYLRSKMERTRDSVKSER--YLHNVGDK---LPDEVDWRAKGAVVPV 138
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYI 164
KDQGS CWAF+A+ VEG+N+I+TG+LV+ S+ +LVDC T NGC ++ AF++I
Sbjct: 139 KDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFI 198
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ +E YPY D C+ + + + I GY+ V P E L+ ++ QP+S
Sbjct: 199 ISNGGIDTEEDYPYTATDDNICNTDKKNT--RVVTIDGYEDV-PENENSLKKALANQPIS 255
Query: 225 VAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAI+A F Y GVFTG CG +HGV VGYGT+ EGQ YW+++N WG+NW E
Sbjct: 256 VAIEAGGRGFQLYKSGVFTGTCGTALDHGVVAVGYGTS---EGQD-YWIIRNSWGSNWGE 311
Query: 283 GGSMRIFRGV-GGSGLCNIAANAAYP 307
G +++ R + SG C +A A+YP
Sbjct: 312 SGYIKLQRNIKDSSGKCGVAMMASYP 337
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 211 bits (536), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 126/316 (39%), Positives = 177/316 (56%), Gaps = 27/316 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREK 58
+ A +E+W+V+ + Y E+E RF++FK N F L LN FADLT E+
Sbjct: 48 VMAIYEEWLVKQGKVYNALGEREKRFQVFKDNLRFIDEHNSENRTYKLGLNGFADLTNEE 107
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ ++Y G + + S+ + S DS+DW + GAV VKDQGS CWA
Sbjct: 108 YRSTYLGARGGMKRNRLRKTSDRYAPRVGE--SLPDSVDWRKEGAVAEVKDQGSCGSCWA 165
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+ +A VEG+NKI TG L++ S+ +LVDC T GC ++ AFE+I + +E
Sbjct: 166 FSTIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEED 225
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FN 233
YPY R D CD +R +A K I Y+ V +E LQ V+ QPVSVAI+A F
Sbjct: 226 YPYLAR-DGRCDTYRKNA--KVVTIDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQ 282
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY G+F+G CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R +
Sbjct: 283 FYASGIFSGRCGTQLDHGVAAVGYGT----ENGKDYWIVRNSWGKSWGENGYLRMARSIN 338
Query: 294 G-SGLCNIAANAAYPL 308
+G+C IA A+YP+
Sbjct: 339 SPTGICGIAMEASYPI 354
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 120/312 (38%), Positives = 175/312 (56%), Gaps = 29/312 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLA 61
+EQW+VE + Y EKE RF+IF N +++ L +FADLT ++F A
Sbjct: 43 YEQWLVENRKNYNGLGEKETRFEIFTDNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRA 102
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
Y K T P +K ++ D IDW +GAV PVKDQG+ CWAF+A
Sbjct: 103 IYLRSKMERTRVPVKGERYLYKVGDT----LPDQIDWRAKGAVNPVKDQGNCGSCWAFSA 158
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPY 178
+ VEG+N+I+TG+L++ S+ +LVDC T GC ++ AF++I + + +E YPY
Sbjct: 159 IGAVEGINQIKTGELISLSEQELVDCDTSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPY 218
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYH 236
D C+ + ++ + I GY+ V E+ L+ ++ QP+SVAI+A F Y
Sbjct: 219 TATDDNICNSDKKNS--RVVTIDGYEDVPQNDEKSLKKALANQPISVAIEAGGRAFQLYK 276
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GS 295
GVFTG CG + +HGV VGYG+ EG Q YW+V+N WG+NW E G ++ R + S
Sbjct: 277 SGVFTGTCGTSLDHGVVAVGYGS----EGGQDYWIVRNSWGSNWGESGYFKLERNIKESS 332
Query: 296 GLCNIAANAAYP 307
G C +A A+YP
Sbjct: 333 GKCGVAMMASYP 344
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 211 bits (536), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 120/310 (38%), Positives = 177/310 (57%), Gaps = 26/310 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+ E+WM E+ R YKD EK RF+IFK N L +N+F D+T+ +F+
Sbjct: 36 RFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSHNGNSYTLGINQFTDMTKSEFV 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A YTG P + + F ++N S + SIDW + GAV VK+Q CWAF
Sbjct: 96 AQYTGGISRPLNIEREPVVS-FDDVNISAVP--QSIDWRDYGAVNEVKNQNPCGSCWAFA 152
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A+ATVEG+ KI+TG LV+ S+ +++DC+ GC ++ A+++I + +E YPYQ
Sbjct: 153 AIATVEGIYKIKTGYLVSLSEQEVLDCAVSYGCKGGWVNKAYDFIISNNGVTTEENYPYQ 212
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFYHGG 238
Q C+ +++ I GY YV+ E + VS QP++ IDA+ F +Y+GG
Sbjct: 213 AYQG-TCN---ANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGG 268
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGL 297
VF+GPCG + NH +TI+GYG + YW+V+N WG++W EGG +R+ RGV SG
Sbjct: 269 VFSGPCGTSLNHAITIIGYGQDSSG---TKYWIVRNSWGSSWGEGGYVRMARGVSSSSGA 325
Query: 298 CNIAANAAYP 307
C IA + +P
Sbjct: 326 CGIAMSPLFP 335
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 177/318 (55%), Gaps = 32/318 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREK 58
+ A +E+W+V+ + Y EKE RF+IFK N F+ LN+FADLT E+
Sbjct: 47 VMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEE 106
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKM--SFYDSIDWNERGAVTPVKDQGSY-CC 115
F + Y G T H R + + ++ S DS+DW + GAV VKDQG C
Sbjct: 107 FRSMYLG-----TRTGHKKRLPKTSDRYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSC 161
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+ +A VEG+NKI TG L+ S+ +LVDC T GC ++ AFE+I + +E
Sbjct: 162 WAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTE 221
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY GR D CD +R +A K +I Y+ V E L+ V+ QPVSVAI+ N
Sbjct: 222 DDYPYLGR-DGRCDTYRKNA--KVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRN 278
Query: 234 F--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y+ GVFTG CG + +HGV VGYGT E + YW+V+N WG +W E G +R+ R
Sbjct: 279 FQLYNSGVFTGECGTSLDHGVAAVGYGT----EKGKDYWIVRNSWGKSWGESGYIRMERN 334
Query: 292 VGG-SGLCNIAANAAYPL 308
+ +G C IA +YP+
Sbjct: 335 IASPTGKCGIAIEPSYPI 352
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 170/314 (54%), Gaps = 39/314 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLAS 62
E+WM +F +TYK EKE RF IF+ N F+R +N+FADLT ++F+A+
Sbjct: 38 EEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVAT 97
Query: 63 YTGYKPP-PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
YTG KPP P + P W IDW RGAVT VKDQG+ CWAF A
Sbjct: 98 YTGAKPPHPKEAPRPVDPIWTPCC----------IDWRFRGAVTGVKDQGACGSCWAFAA 147
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VA +EGL KIRTGQL S+ +LVDC T NGC + AFE + + +E Y Y+
Sbjct: 148 VAAIEGLTKIRTGQLTPLSEQELVDCDTNSNGCGGGHTDRAFELVASKGGITAESDYRYE 207
Query: 180 GRQDYYC---DWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNF 234
G Q C D + A+ +I GY+ V P E L V+RQPV+V IDA+ F F
Sbjct: 208 GFQG-KCRVDDMLFNHAA----SIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQF 262
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-G 293
Y GVF GPCG + NH VT+VGY + + YW+ KN WG W + G + + + V
Sbjct: 263 YKSGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQ 320
Query: 294 GSGLCNIAANAAYP 307
G C +A + YP
Sbjct: 321 PHGTCGLAVSPFYP 334
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 210 bits (535), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 129/318 (40%), Positives = 177/318 (55%), Gaps = 32/318 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREK 58
+ A +E+W+V+ + Y EKE RF+IFK N F+ LN+FADLT E+
Sbjct: 38 VMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSENRTYTVGLNRFADLTNEE 97
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKM--SFYDSIDWNERGAVTPVKDQGSY-CC 115
F + Y G T H R + + ++ S DS+DW + GAV VKDQG C
Sbjct: 98 FRSMYLG-----TRTGHKKRLPKTSDRYAPRVGDSLPDSVDWRKEGAVAEVKDQGGCGSC 152
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+ +A VEG+NKI TG L+ S+ +LVDC T GC ++ AFE+I + +E
Sbjct: 153 WAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTE 212
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY GR D CD +R +A K +I Y+ V E L+ V+ QPVSVAI+ N
Sbjct: 213 DDYPYLGR-DGRCDTYRKNA--KVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRN 269
Query: 234 F--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y+ GVFTG CG + +HGV VGYGT E + YW+V+N WG +W E G +R+ R
Sbjct: 270 FQLYNSGVFTGECGTSLDHGVAAVGYGT----EKGKDYWIVRNSWGKSWGESGYIRMERN 325
Query: 292 VGG-SGLCNIAANAAYPL 308
+ +G C IA +YP+
Sbjct: 326 IASPTGKCGIAIEPSYPI 343
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 128/304 (42%), Positives = 168/304 (55%), Gaps = 33/304 (10%)
Query: 27 KDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKP----PP 70
+D +EK RF +FK+N HEF L LNKFAD+T ++F ++Y G K
Sbjct: 51 RDLSEKNKRFNVFKENAKFIHEFNKKDAPYKLGLNKFADMTNQEFRSTYAGSKIHHHRTQ 110
Query: 71 TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNK 129
P + S ++N++S S+DW +GAV PVKDQG CWAF+ +A+VEG+NK
Sbjct: 111 RGTPRATGSFMYENVHS----IPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINK 166
Query: 130 IRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCD 187
I+T QLV S QLVDC T GC ++ AFE+I+ + SE YPY Q
Sbjct: 167 IKTNQLVPLSGQQLVDCDTDQNEGCNGGLMDYAFEFIKSNGGITSESAYPYTAEQGSCA- 225
Query: 188 WWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGGVFTGPCG 245
S +S I GY+ V E L V+ Q VSVAI+A+ F FY GVFTG CG
Sbjct: 226 ---SESSAPVVTIDGYEDVPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCG 282
Query: 246 NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANA 304
N +HGV +VGYG T + YW+V+N WG W E G +R+ RG+ GLC IA
Sbjct: 283 NELDHGVAVVGYGATRDG---TKYWIVRNSWGAEWGEKGYIRMQRGIRARHGLCGIAMEP 339
Query: 305 AYPL 308
+YPL
Sbjct: 340 SYPL 343
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 210 bits (535), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 136/332 (40%), Positives = 180/332 (54%), Gaps = 54/332 (16%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFAD 53
HKT + + E+W+ ++ Y + E +RF I++ N + L N+FAD
Sbjct: 36 HKT--LKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFAD 93
Query: 54 LTREKFLASYTGY---------KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAV 104
+T +F A + G K P P N D++DW +GAV
Sbjct: 94 MTNSEFKAHFLGLNTSSLRLHKKQRPVCDPAGNVP--------------DAVDWRTQGAV 139
Query: 105 TPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS--TLN-GCAKNFLEN 159
TP+++QG C CWAF+AVA +EG+NKI+TG LV+ S+ QL+DC T N GC+ +E
Sbjct: 140 TPIRNQGK-CGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMET 198
Query: 160 AFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS 219
AFE+I+ LA+E YPY G + CD +S K I+GYQ V E LQ +
Sbjct: 199 AFEFIKTNGGLATETDYPYTGIEG-TCDQEKS--KNKVVTIQGYQKV-AQNEASLQIAAA 254
Query: 220 RQPVSVAIDATWFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
+QPVSV IDA F F Y GVFT CG NHGVT+VGYG EG Q YW+VKN WG
Sbjct: 255 QQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYG----VEGDQKYWIVKNSWG 310
Query: 278 TNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
T W E G +R+ RGV +G C IA A+YPL
Sbjct: 311 TGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 124/317 (39%), Positives = 177/317 (55%), Gaps = 27/317 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
++A +E W++E ++Y EK+ RF+IFK N + L L KFADLT E
Sbjct: 45 VSALYESWLIEHGKSYNALGEKDKRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNE 104
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ + Y G K D +++ + L S +SIDW E+G + VKDQGS CW
Sbjct: 105 EYRSIYLGTKSS-GDRKKLSKNKSDRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCW 163
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASEC 174
AF+AVA +E +N I TG L++ S+ +LVDC S GC ++ AFE++ + + +E
Sbjct: 164 AFSAVAAMESINAIVTGNLISLSEQELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEE 223
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF 234
YPY+ R CD +R +A K I Y+ V E+ LQ V+ QPVS+A++A +F
Sbjct: 224 DYPYKERNG-VCDQYRKNA--KVVKIDSYEDVPVNNEKALQKAVAHQPVSIALEAGGRDF 280
Query: 235 YH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
H G+FTG CG +HGV I GYGT E YW+V+N WG NW E G +R+ R V
Sbjct: 281 QHYKSGIFTGKCGTAVDHGVVIAGYGT----ENGMDYWIVRNSWGANWGENGYLRVQRNV 336
Query: 293 G-GSGLCNIAANAAYPL 308
SGLC +A +YP+
Sbjct: 337 ASSSGLCGLAIEPSYPV 353
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 130/330 (39%), Positives = 186/330 (56%), Gaps = 31/330 (9%)
Query: 1 MSRTSHKTGN-IAAKHEQWMVEFARTYKDQ---AEKEMRFKIFKKNHEFLR--------- 47
++++S +T + + A +E+W+V+ + + + EKE RF++FK N F+
Sbjct: 36 LTKSSWRTDDEVMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDEHNSENRSY 95
Query: 48 ---LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAV 104
LN+FADLT E++ + Y G + + S SN + L S DS+DW + GAV
Sbjct: 96 KVGLNRFADLTNEEYRSMYLGARSGAKRNRLSRSSNRY--LPRVGDSLPDSVDWRKEGAV 153
Query: 105 TPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAF 161
VKDQGS CWAF+ +A VEG+NKI TG L++ S+ +LVDC S GC ++ AF
Sbjct: 154 AEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCNGGLMDYAF 213
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
++I + SE YPY R D CD +R +A K I Y+ V E+ LQ V+ Q
Sbjct: 214 QFIINNGGIDSEEDYPYLAR-DGTCDTYRKNA--KVVTIDNYEDVPVNDEKALQKAVANQ 270
Query: 222 PVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
PVSVAI+A F FY G+FTG CG +HGV VGYGT E + YW+V+N WG +
Sbjct: 271 PVSVAIEAGGREFQFYQSGIFTGRCGTALDHGVAAVGYGT----ENGKDYWIVRNSWGKS 326
Query: 280 WDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W E G +R+ R + +G C IA +YP+
Sbjct: 327 WGESGYIRMERNIATATGKCGIAIEPSYPI 356
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 210 bits (534), Expect = 7e-52, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 171/318 (53%), Gaps = 45/318 (14%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ A+HEQWMV+++R YKD EK RF++FK N +F L +N+FADLT +
Sbjct: 1 MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTND 60
Query: 58 KFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYCC 115
+F A+ T G+KP P P F+ N S + +IDW +GAVTP+KDQG
Sbjct: 61 EFRATKTNKGFKPSPVKVPTG-----FRYENISVDALPATIDWRTKGAVTPIKDQGQ--- 112
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
EG+ KI TG+L++ S+ +LVDC GC +++AF++I + L +
Sbjct: 113 --------CEGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLTT 164
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPY D C S S ++G++ V E L V+ QPVSVA+D
Sbjct: 165 ESSYPYTA-ADGKC----KSGSNSVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDM 219
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F FY GGV TG CG +HG+ +GYG T++ YWL+KN WGT W E G +R+ +
Sbjct: 220 TFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDG---TKYWLLKNSWGTTWGENGYLRMEK 276
Query: 291 GVGGS-GLCNIAANAAYP 307
+ G+C +A +YP
Sbjct: 277 DISDKRGMCGLAMEPSYP 294
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 128/327 (39%), Positives = 180/327 (55%), Gaps = 29/327 (8%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
S+ + I +E W+ + + Y EK+ RF +FK N + L L
Sbjct: 31 SKDLREDDAIMELYELWLAQHKKAYNGLGEKQNRFSVFKDNFLYIHQHNNQGNPSYKLGL 90
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N+FADL+ E+F A+Y G K ++ S ++ + + +SIDW E+GAVT VK
Sbjct: 91 NQFADLSHEEFKATYLGAKLDTKKRLSNSPSPRYQYSDGEDLP--ESIDWREKGAVTAVK 148
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIR 165
DQGS CWAF+ VA VEG+N+I TG L + S+ +LVDC T GC ++ AF++I
Sbjct: 149 DQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFII 208
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
L SE YPY+ D CD +R +A I Y+ V E+ L+ + QP+SV
Sbjct: 209 NNGGLDSEDDYPYKAN-DGSCDAYRKNA--HVVTIDDYEDVPENDEKSLKKAAANQPISV 265
Query: 226 AIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AI+A+ F FY GVFT CG +HGVT+VGYG+ E YW+VKN WG +W E
Sbjct: 266 AIEASGRAFQFYESGVFTSTCGTQLDHGVTLVGYGS----ESGTDYWIVKNSWGKSWGEK 321
Query: 284 GSMRIFRGVGG--SGLCNIAANAAYPL 308
G +R+ R + G +G+C IA A+YPL
Sbjct: 322 GFIRLQRNIEGVSTGMCGIAMEASYPL 348
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 129/315 (40%), Positives = 171/315 (54%), Gaps = 31/315 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREK 58
+E W+ + R Y EKE RF+IFK N F L LN+FAD+T E+
Sbjct: 50 YEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLGLNRFADMTNEE 109
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ A Y G +P H R + ++ +S+DW +GAV VKDQGS CWA
Sbjct: 110 YRAVYLGTRP--AGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVKDQGSCGSCWA 167
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+ VA VEG+NKI TG L++ S+ +LVDC GC ++ FE+I + +E
Sbjct: 168 FSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQGCNGGLMDYGFEFIINNGGIDTEED 227
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FN 233
YPY R D CD +R +A K +I GY+ V E+ LQ V+ QPVSVAI+A F
Sbjct: 228 YPYTAR-DGKCDQYRKNA--KVVSIDGYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQ 284
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
YH G+FTG CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R V
Sbjct: 285 LYHSGIFTGRCGTDLDHGVVAVGYGT----ENGKDYWIVRNSWGGDWGESGYIRMERNVN 340
Query: 294 GS-GLCNIAANAAYP 307
S G C IA +YP
Sbjct: 341 TSTGKCGIAIEPSYP 355
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 126/296 (42%), Positives = 166/296 (56%), Gaps = 24/296 (8%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNR 78
EK RF +FK N HE L+LNKF D+T E+F +Y G R
Sbjct: 53 EKAKRFNVFKHNVKHIHETNKKENSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGER 112
Query: 79 SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVT 137
+ ++ + S+DW + GAVTPVK+QG CWAF+ V VEG+N+IRT +L +
Sbjct: 113 QTTKSFMYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTS 172
Query: 138 RSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASG 195
S+ +LVDC T GC ++ AFE+I++ L SE VYPY+ D CD + +A
Sbjct: 173 LSEQELVDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKA-SDETCDTNKENAP- 230
Query: 196 KYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVT 253
+I G++ V +E L V+ QPVSVAIDA + F FY GVFTG CG NHGV
Sbjct: 231 -VVSIDGHEDVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVA 289
Query: 254 IVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
+VGYGTT + YW+VKN WG W E G +R+ RG+ GLC IA A+YPL
Sbjct: 290 VVGYGTTIDG---TKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 210 bits (534), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 135/329 (41%), Positives = 177/329 (53%), Gaps = 34/329 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLN 49
S+ + I +E W+ E R Y EK+ RF +FK N HE L LN
Sbjct: 29 SKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHNQGNRSYKLGLN 88
Query: 50 KFADLTREKFLASYTGYKPPP---TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
+FADL+ E+F A+Y G K P S R + S +SIDW E+GAVT
Sbjct: 89 QFADLSHEEFKATYLGAKLDTKKRLSRPPSRRYQY-----SDGEDLPESIDWREKGAVTS 143
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEY 163
VKDQGS CWAF+ VA VEG+N+I TG L++ S+ +LVDC T GC ++ AFE+
Sbjct: 144 VKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEF 203
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I L SE YPY D CD +R +A I Y+ V E+ L+ + QP+
Sbjct: 204 IINNGGLDSEEDYPYTAY-DGSCDSYRKNA--HVVTIDDYEDVPENDEKSLKKAAANQPI 260
Query: 224 SVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVAI+A+ F FY GVFT CG +HGVT+VGYG+ E YW VKN WG +W
Sbjct: 261 SVAIEASGREFQFYDSGVFTSTCGTQLDHGVTLVGYGS----ESGTDYWTVKNSWGKSWG 316
Query: 282 EGGSMRIFRG--VGGSGLCNIAANAAYPL 308
E G +R+ R V +G+C IA A+YP+
Sbjct: 317 EEGFIRLQRNIEVASTGMCGIAMEASYPV 345
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 209 bits (533), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 131/326 (40%), Positives = 176/326 (53%), Gaps = 54/326 (16%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRL 48
+R+ H+ ++ +HE WMV++ R YKD EK R+KIFK N F L +
Sbjct: 27 ARSLHE-ASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSI 85
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N+FADLT E+F AS +K H S + FK N + + ++DW ++GAVTP+K
Sbjct: 86 NEFADLTNEEFRASRNRFKA----HICSTEATSFKYENVTAVP--STVDWRKKGAVTPIK 139
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYI 164
DQG CWAF+AVA +EG+ ++ TG+L++ S+ +LVDC T GC
Sbjct: 140 DQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCTN---------- 189
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
YPY G D C+ R A+ I GY+ V E+ LQ V+ QP++
Sbjct: 190 -----------YPYAG-TDGTCN--RKKAAHPAAKINGYEDVPANNEKALQKAVAHQPIA 235
Query: 225 VAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAIDA+ F FY GVFTG CG +HGV VGYGT+ + YWLVKN W T W E
Sbjct: 236 VAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDDG---MKYWLVKNSWSTGWGE 292
Query: 283 GGSMRIFRGV-GGSGLCNIAANAAYP 307
G +R+ R V GLC IA A+YP
Sbjct: 293 EGYIRMQRDVTAKEGLCGIAMQASYP 318
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 128/318 (40%), Positives = 176/318 (55%), Gaps = 31/318 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREK 58
+ A +E W+V+ + Y EKE RF++FK N F+ LN+FADLT E+
Sbjct: 38 VMAIYEDWLVKHGKAYNSLGEKERRFEVFKDNLRFIDEHNSENRTYRVGLNRFADLTNEE 97
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKM--SFYDSIDWNERGAVTPVKDQGSY-CC 115
+ + Y G N+ + + ++ S DS+DW + GAV VKDQGS C
Sbjct: 98 YRSMYLG----ALSGIRRNKLRKISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSC 153
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASE 173
WAF+AVA VEG+NKI TG L++ S+ +LVDC S GC ++ FE+I + SE
Sbjct: 154 WAFSAVAAVEGINKIVTGDLISLSEQELVDCDNSYNEGCNGGLMDYGFEFIINNGGIDSE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
YPY R D CD +R +A + +I Y+ V E LQ V+ QPVSVAI+A
Sbjct: 214 EDYPYLAR-DGRCDTYRKNA--RVVSIDSYEDVPVNNEAALQKAVANQPVSVAIEAGGRD 270
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y GVF+G CG +HGV VGYGT E Q YW+V+N WG +W E G +R+ R
Sbjct: 271 FQLYSSGVFSGRCGTALDHGVVAVGYGT----ENGQDYWIVRNSWGKSWGESGYLRMARN 326
Query: 292 V-GGSGLCNIAANAAYPL 308
+ +G+C IA A+YP+
Sbjct: 327 IRKPTGICGIAMEASYPI 344
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 131/314 (41%), Positives = 170/314 (54%), Gaps = 39/314 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLAS 62
E+WM +F +TYK EKE RF IF+ N F+R +N+FADLT ++F+A+
Sbjct: 21 EEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVAT 80
Query: 63 YTGYKPP-PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
YTG KPP P + P W IDW RGAVT VKDQG+ CWAF A
Sbjct: 81 YTGAKPPHPKEAPRPVDPIWTPCC----------IDWRFRGAVTGVKDQGACGSCWAFAA 130
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VA +EGL KIRTGQL S+ +LVDC T NGC + AFE + + +E Y Y+
Sbjct: 131 VAAIEGLTKIRTGQLTPLSEQELVDCDTNSNGCGGGHTDRAFELVASKGGITAESDYRYE 190
Query: 180 GRQDYYC---DWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
G Q C D + A+ +I GY+ V P E L V+RQPV+V IDA+ F F
Sbjct: 191 GFQG-KCRVDDMLFNHAA----SIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQF 245
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-G 293
Y GVF GPCG + NH VT+VGY + + YW+ KN WG W + G + + + V
Sbjct: 246 YKSGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQ 303
Query: 294 GSGLCNIAANAAYP 307
G C +A + YP
Sbjct: 304 PHGTCGLAVSPFYP 317
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 209 bits (533), Expect = 9e-52, Method: Compositional matrix adjust.
Identities = 121/327 (37%), Positives = 174/327 (53%), Gaps = 45/327 (13%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRL 48
+R + + A+HEQWMV+++R YKD EK RF++FK N +F L +
Sbjct: 24 ARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKFIESFNAGGNRKFWLGV 83
Query: 49 NKFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
N+FADLT ++F A+ T G+KP P S F+ N S + +IDW +GAVTP
Sbjct: 84 NQFADLTNDEFRATKTNKGFKPSPV-----KVSTGFRYENVSVDALPATIDWRTKGAVTP 138
Query: 107 VKDQGSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
+KDQG EG+ KI TG+L++ S+ +LVDC GC +++AF++
Sbjct: 139 IKDQGQ-----------CEGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKF 187
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + L +E YPY D C S S ++G++ V E L V+ QPV
Sbjct: 188 IIKNGGLTTESSYPYTA-ADGKC----KSGSNSAATVKGFEDVPANDEAALMKAVANQPV 242
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVA+D F FY GGV TG CG +HG+ +GYG T++ YWL+KN WGT W
Sbjct: 243 SVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGQTSDG---TKYWLLKNSWGTTWG 299
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ + + G+C +A +YP
Sbjct: 300 ENGYLRMEKDISDKRGMCGLAMEPSYP 326
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 130/322 (40%), Positives = 177/322 (54%), Gaps = 45/322 (13%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM ++ R YKD AEKE RF++FK N F L +N+FADL E+F
Sbjct: 36 RHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFK 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSF-YDS-------IDWNERGAVTPVKDQGS 112
A + +++W + S++ SF Y+S IDW +RGAVTP+KDQG
Sbjct: 96 ALLINVQ---------KKASWVE--TSTETSFRYESVTKIPATIDWRKRGAVTPIKDQGR 144
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+AVA EG+++I TG+LV S+ +LVDC GC ++++AFE+I +
Sbjct: 145 CGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGG 204
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
+ASE YPY+G + C + + I+GY+ V E+ L V+ QPVSV IDA
Sbjct: 205 IASETHYPYKG-VNKTCKVKKETHG--VAEIKGYEKVPSNNEKALLKAVANQPVSVYIDA 261
Query: 230 T--WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F +Y G+F CG PNH V +VGYG + YWLVKN WGT W E G +
Sbjct: 262 GTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDG---SKYWLVKNSWGTEWGERGYI 318
Query: 287 RIFRGV-GGSGLCNIAANAAYP 307
RI R + GLC IA YP
Sbjct: 319 RIKRDIRAKEGLCGIAKYPYYP 340
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 209 bits (533), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 120/296 (40%), Positives = 167/296 (56%), Gaps = 24/296 (8%)
Query: 30 AEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHSN 77
AEK+ RF +FK+N H+ L+LN FAD+T +FL Y G K
Sbjct: 54 AEKQERFNVFKENLKHIHKVNHKDRPYKLKLNSFADMTNHEFLQHYGGSKVSHYRVLRGQ 113
Query: 78 RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLV 136
R +++ S+DW + GAVT +KDQG CWAF+ VA VEG+NKI+TG+L+
Sbjct: 114 RQG-TGSMHEDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKIKTGELI 172
Query: 137 TRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASG 195
+ S+ +LVDC + N GC +E+AF +I+Q L SE YPY+ +++ CD + +
Sbjct: 173 SLSEQELVDCDSDNHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEE-PCD--SNKMNS 229
Query: 196 KYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVT 253
I GY+ V E L V+ QPV++A+DA FY +FTG CG NHGV
Sbjct: 230 PVVNIDGYEMVPENDENALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELNHGVA 289
Query: 254 IVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
+VGYGTT + YW+VKN WGT+W E G +R+ RG+ GLC I A+YP+
Sbjct: 290 LVGYGTTQDG---TKYWIVKNSWGTDWGEKGYIRMQRGIDAEEGLCGITMEASYPV 342
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 129/312 (41%), Positives = 168/312 (53%), Gaps = 35/312 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLAS 62
E+WM +F +TYK EKE RF IF+ N F+R +N+FADLT ++F+A+
Sbjct: 44 EEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVAT 103
Query: 63 YTGYKPP-PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
YTG KPP P + P W IDW RGAVT VKDQG+ CWAF A
Sbjct: 104 YTGAKPPHPKEAPRPVDPIWTPCC----------IDWRFRGAVTGVKDQGACGSCWAFAA 153
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VA +EGL KIRTGQL S+ +LVDC T NGC + AFE + + +E Y Y+
Sbjct: 154 VAAIEGLTKIRTGQLTPLSEQELVDCDTNSNGCGGGHTDRAFELVASKGGITAESDYRYE 213
Query: 180 GRQ-DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
G Q D + + + G GY+ V P E L V+RQPV+V IDA+ F FY
Sbjct: 214 GFQGKCRVDDMLFNHAARIG---GYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYK 270
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGS 295
GVF GPCG + NH VT+VGY + + YW+ KN WG W + G + + + V
Sbjct: 271 SGVFPGPCGASSNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPH 328
Query: 296 GLCNIAANAAYP 307
G C +A + YP
Sbjct: 329 GTCGLAVSPFYP 340
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 126/316 (39%), Positives = 174/316 (55%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +WM RTY +E R+++F+ N H F L LN+FADLT ++
Sbjct: 44 YAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ A+Y G + P +R + + +S+DW +GAV VKDQGS CWA
Sbjct: 104 YPATYLGART----RPQRDRKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGTCWA 159
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+ +A VEG+N+I TG L++ S+ +LVDC T GC ++ AFE+I + +E
Sbjct: 160 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 219
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G D CD R +A K I Y+ V E+ LQ V+ QPVSVAI+A T F
Sbjct: 220 YPYKG-TDGRCDVNRKNA--KVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQ 276
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGVT VGYGT E + YW+VKN WG++W E G +R+ R +
Sbjct: 277 LYSSGIFTGSCGTRLDHGVTAVGYGT----ENGKDYWIVKNSWGSSWGESGYVRMERNIK 332
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 333 ASSGKCGIAVEPSYPL 348
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 128/322 (39%), Positives = 180/322 (55%), Gaps = 33/322 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ KHEQWM E + YKD AEKE RF+IFK+N EF L +N+F D T +
Sbjct: 31 LLEKHEQWMEEHGKFYKDAAEKEQRFQIFKENLEFIESFNAAGDNGFNLSINQFGDQTND 90
Query: 58 KFLASYTGYKPPP---TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC 114
+F A+Y K P + F+ N +++ ++DW ERGAVTP+K Q C
Sbjct: 91 EFKANYLNGKKKPLIGVGIAAIEEESVFRYENVTEVP--ATMDWRERGAVTPIKHQ-HLC 147
Query: 115 --CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC---STLNGCAKNFLENAFEYIRQYQR 169
CWAF VA +EG+++I TG+LV+ S+ +LVDC +T +GC ++E+A ++I +
Sbjct: 148 GSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCVKTNTTDGCNGGYVEDACDFIVKKGG 207
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
+ SE YPY R D C+ + + + I+GY++V E+ L V+ QP++V I A
Sbjct: 208 ITSETNYPYT-RVDGKCNVRKGTYN--VAKIKGYEHVPANNEKALLKAVANQPIAVYIAA 264
Query: 230 T--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
T F FY G+ G CG +H VTIVGYGT+ + YWLVKN WGT W E G ++
Sbjct: 265 TKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDDG---VKYWLVKNSWGTKWGEKGYIK 321
Query: 288 IFRGV-GGSGLCNIAANAAYPL 308
I R V G C IA YP+
Sbjct: 322 IKRDVHAKEGSCGIAMVPTYPI 343
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 118/310 (38%), Positives = 176/310 (56%), Gaps = 27/310 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFL 60
+ E+WM E+ R YKD EK RF+IFK N + + +N+F D+T+ +F+
Sbjct: 36 RFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKHIETFNSRNENSYTLGINQFTDMTKSEFV 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A YTG P S F ++N S + SIDW + GAV VK+Q CW+F
Sbjct: 96 AQYTGVSLPLNIEREPVVS--FDDVNISAVP--QSIDWRDYGAVNEVKNQNPCGSCWSFA 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A+ATVEG+ KI+TG LV+ S+ +++DC+ GC ++ A+++I + +E YPY
Sbjct: 152 AIATVEGIYKIKTGYLVSLSEQEVLDCAVSYGCKGGWVNKAYDFIISNNGVTTEENYPYL 211
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFYHGG 238
Q C+ +++ I GY YV+ E + VS QP++ IDA+ F +Y+GG
Sbjct: 212 AYQG-TCN---ANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGG 267
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGL 297
VF+GPCG + NH +TI+GYG + YW+V+N WG++W EGG +R+ RGV SG+
Sbjct: 268 VFSGPCGTSLNHAITIIGYGQDSSG---TKYWIVRNSWGSSWGEGGYVRMARGVSSSSGV 324
Query: 298 CNIAANAAYP 307
C IA +P
Sbjct: 325 CGIAMAPLFP 334
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 209 bits (532), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 134/332 (40%), Positives = 179/332 (53%), Gaps = 54/332 (16%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFAD 53
HKT + + E+W+ ++ Y + E +RF I++ N + L N+FAD
Sbjct: 36 HKT--LKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNVQLIDYINSLHLPFKLTDNRFAD 93
Query: 54 LTREKFLASYTGY---------KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAV 104
+T +F A + G K P P N D++DW +GAV
Sbjct: 94 MTNSEFKAHFLGLNTSSLRLHKKQRPVCDPAGNVP--------------DAVDWRTQGAV 139
Query: 105 TPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS--TLN-GCAKNFLEN 159
TP+++QG C CWAF+AVA +EG+NKI+TG LV+ S+ QL+DC T N GC+ +E
Sbjct: 140 TPIRNQGK-CGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGTYNKGCSGGLMET 198
Query: 160 AFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS 219
AFE+I+ L +E YPY G + CD + A K I+GYQ V E LQ +
Sbjct: 199 AFEFIKSNGGLTTETDYPYTGIEG-TCD--QEKAKNKVVTIQGYQKV-AQNEASLQIAAA 254
Query: 220 RQPVSVAIDATWFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
+QPVSV IDA F F Y GVFT CG NHGVT+VGYG EG Q YW+VKN WG
Sbjct: 255 QQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTVVGYG----VEGDQKYWIVKNSWG 310
Query: 278 TNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
T W E G +R+ RG+ +G C IA A+YPL
Sbjct: 311 TGWGEEGYIRMERGISEDTGKCGIAMLASYPL 342
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 176/314 (56%), Gaps = 29/314 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM ++ R YKD AEKE RF++FK N F L +N+FADL E+F
Sbjct: 36 RHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFK 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A + + S +++ F+ + +K+ +IDW +RGAVTP+KDQG CWAF+
Sbjct: 96 ALLINVQKKASWVETSTQTS-FRYESVTKIP--ATIDWRKRGAVTPIKDQGRCGSCWAFS 152
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYP 177
AVA EG+++I TG+LV S+ +LVDC GC ++++AFE+I + +ASE YP
Sbjct: 153 AVAATEGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGGIASETHYP 212
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFY 235
Y+G + C + + I+GY+ V E+ L V+ QPVSV IDA F +Y
Sbjct: 213 YKG-VNKTCKVKKETHG--VAEIKGYEKVPSNNEKALLKAVANQPVSVYIDAGTHAFKYY 269
Query: 236 HGGVF-TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-G 293
G+F CG PNH V +VGYG + YWLVKN WGT W E G +RI R +
Sbjct: 270 SSGIFNVRNCGTDPNHAVAVVGYGKALDG---SKYWLVKNSWGTEWGERGYIRIKRDIRA 326
Query: 294 GSGLCNIAANAAYP 307
GLC IA YP
Sbjct: 327 KEGLCGIAKYPYYP 340
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 209 bits (531), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 124/317 (39%), Positives = 178/317 (56%), Gaps = 27/317 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREK 58
+ A +E W+V+ ++Y E+E RF+IFK N F+ LN+FADLT E+
Sbjct: 50 VMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAVNRTYKVGLNRFADLTNEE 109
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ + Y G + ++R + + + + +S+DW E+GAV PVKDQG+ CWA
Sbjct: 110 YRSRYLGRRDETRRGLRASRVSDRYSFRAGE-DLPESVDWREKGAVVPVKDQGNCGSCWA 168
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECV 175
F+ +A VEG+N+I TG L++ S+ +LVDC S GC ++ AFE+I + SE
Sbjct: 169 FSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEED 228
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+ D CD R +A + +I GY+ V E L+ V+ QPVSVAI+A F
Sbjct: 229 YPYRA-ADTTCDPNRKNA--RVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQ 285
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
Y GVFTG CG +HGV VGYGT E YW+V+N WG NW E G +++ R +
Sbjct: 286 LYQSGVFTGQCGTQLDHGVVAVGYGT----ENSVDYWIVRNSWGPNWGESGYIKLERNLA 341
Query: 294 G--SGLCNIAANAAYPL 308
G +G C IA +YP+
Sbjct: 342 GTETGKCGIAIEPSYPI 358
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/325 (39%), Positives = 174/325 (53%), Gaps = 33/325 (10%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
T + A +E W++++ ++Y E E RF+IFK+ F+ LN+
Sbjct: 31 TQRTNDEVKAMYESWLIKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYKVGLNQ 90
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F ++Y + SNR ++ + Y +DW GAV +K Q
Sbjct: 91 FADLTDEEFRSTYLRFTSGSNKTKVSNR---YEPRVGQVLPSY--VDWRSAGAVVDIKSQ 145
Query: 111 GSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIR 165
G C CWAF+A+ATVEG+NKI TG L++ S+ +L+DC GC ++ + F++I
Sbjct: 146 GE-CGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTRGCNGGYITDGFQFII 204
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ +E YPY QD C+ + KY I Y+ V E LQ V+ QPVSV
Sbjct: 205 NNGGINTEENYPYTA-QDGECN--VDLQNEKYVTIDTYENVPYNNEWALQTAVTYQPVSV 261
Query: 226 AIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
A+DA F Y G+FTGPCG +H VTIVGYGT EG YW+VKN W T W E
Sbjct: 262 ALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYGT----EGGIDYWIVKNSWDTTWGEE 317
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G MRI R VGG+G C IA +YP+
Sbjct: 318 GYMRILRNVGGAGTCGIATMPSYPV 342
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 177/322 (54%), Gaps = 30/322 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLA 61
+ EQWM+ R Y D EK+ RF+++++N E L NKFADLT E+F A
Sbjct: 31 RFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTNEEFRA 90
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNL--NSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
G++P T SN + + SS S+DW ++GAV VK+QG CWAF
Sbjct: 91 KMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGSCWAF 150
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYP 177
+AVA +EG+N+I+ G+LV+ S+ +LVDC GC ++ AFE++ L +E YP
Sbjct: 151 SAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLTTEASYP 210
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--Y 235
Y + C + + S AI GY+ V P++E L + QPVSVA+D F F Y
Sbjct: 211 YHA-ANGACQAAKLNQSAV--AIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQLY 267
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTT-------TEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
GV+TGPC NHGVT+VGYG + A+G + YW+VKN WG W + G + +
Sbjct: 268 GSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYILM 327
Query: 289 FRGVGG--SGLCNIAANAAYPL 308
R V G SGLC IA +YP+
Sbjct: 328 QRDVAGLASGLCGIALLPSYPV 349
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 126/310 (40%), Positives = 167/310 (53%), Gaps = 25/310 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E+W+ + + Y EK RF++FK N + +L LN+FADLT E+F A+Y
Sbjct: 151 EKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVTSYWLGLNEFADLTHEEFKATY 210
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G PP P FK + S S+DW +GAVT VK+QG CWAF+ VA
Sbjct: 211 LGLAPPA---PARESRGSFKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGSCWAFSTVA 267
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N I TG L S+ +L+DCS NGC ++ AF YI L +E YPY
Sbjct: 268 AVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMDYAFSYIASSGGLHTEEAYPYLM 327
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGG 238
+ D +S + I GY+ V E+ L ++ QPVSVAI+A+ F FY GG
Sbjct: 328 EEGSCGDGKKSESEAV--TISGYEDVPAHNEQALIKALAHQPVSVAIEASGRHFQFYSGG 385
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGL 297
VF GPCG +HGV VGYG + + +G Y +V+N WG W E G +R+ RG G G GL
Sbjct: 386 VFDGPCGTQLDHGVAAVGYG-SDKGKGHD-YIIVRNSWGAKWGEKGYIRMKRGTGKGEGL 443
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 444 CGINKMASYP 453
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 208 bits (530), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 124/317 (39%), Positives = 174/317 (54%), Gaps = 27/317 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ + W+V+ ++Y EKE RF+IFK N + L LN+FADLT E
Sbjct: 45 VMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADLTNE 104
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ A Y G K + P ++ + DSIDW E+GAV VKDQGS CW
Sbjct: 105 EYRAKYLGTKSRES-RPKLSKGPSDRYAPVEGEELPDSIDWREKGAVAAVKDQGSCGSCW 163
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASEC 174
AF+A+ VEG+N+I TG+L+T S+ +LVDC S GC ++ AF +I + + S+
Sbjct: 164 AFSAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMDYAFNFIIKNGGIDSDL 223
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF 234
YPY GR D C+ + +A K I Y+ V E+ LQ + QP+SVAI+A +F
Sbjct: 224 DYPYTGR-DGTCNQNKENA--KVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDF 280
Query: 235 --YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y G+FTG CG +HGV +VGYG+ E YW+V+N WG W E G +++ R V
Sbjct: 281 QLYVSGIFTGKCGTAVDHGVVVVGYGS----EEGMDYWIVRNSWGAAWGEAGYLKMQRNV 336
Query: 293 G-GSGLCNIAANAAYPL 308
G SGLC I +YP+
Sbjct: 337 GKSSGLCGITIEPSYPV 353
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 208 bits (529), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 127/314 (40%), Positives = 176/314 (56%), Gaps = 33/314 (10%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKFL 60
+WM E RTY E+E RF++F+ N H F L LN+FADLT E++
Sbjct: 44 EWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHSFRLGLNRFADLTNEEYR 103
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
+Y G + P S ++ ++ ++ +S+DW E+GAV VKDQG CWAF+
Sbjct: 104 DTYLGVRTKPV--RERRLSGRYQAADNEELP--ESVDWREKGAVAKVKDQGGCGSCWAFS 159
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYP 177
A+A VEG+N+I TG ++ S+ +LVDC T GC ++ AFE+I + SE YP
Sbjct: 160 AIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDSEEDYP 219
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFY 235
Y+ R D CD + +A K I GY+ V +E L+ V+ QP+SVAI+A F Y
Sbjct: 220 YKER-DNRCDANKKNA--KVVTIDGYEDVPVNSELSLKKAVANQPISVAIEAGGRAFQLY 276
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG- 294
G+FTG CG +HGVT VGYG+ E + YW+VKN WGT W E G +R+ R +
Sbjct: 277 KSGIFTGRCGTALDHGVTAVGYGS----ENGKDYWIVKNSWGTVWGEDGYVRLERNIKAT 332
Query: 295 SGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 333 SGKCGIAIEPSYPL 346
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 208 bits (529), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 126/319 (39%), Positives = 176/319 (55%), Gaps = 30/319 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREK 58
+ ++ WM + + Y EKE RF+IFK N +F+ LN+FADLT E+
Sbjct: 42 VMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQNRTYKVGLNRFADLTNEE 101
Query: 59 FLASYTGYKPPPTDH--PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+ A Y G + P N S + + + +S+DW E GAV PVKDQ S C
Sbjct: 102 YRAIYLGTRSDPKRRFAKLKNASPRYAVMPGEVLP--ESVDWRETGAVNPVKDQRSCGSC 159
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+ VA VEG+N+I TG+L++ S+ +LVDC T GC ++ AF++I + L +E
Sbjct: 160 WAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGLDTE 219
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY G D C+ S S K +I GY+ V P E+ LQ V+ QPVSVA++A
Sbjct: 220 KDYPYTGF-DGECNL--SGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRA 276
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
Y G+FTG CG +HG+ VGYGT E YW+V+N WG++W E G +R+ R
Sbjct: 277 LQLYVSGIFTGECGTALDHGIVAVGYGT----ENGTDYWIVRNSWGSSWGENGYIRMERN 332
Query: 292 VGG--SGLCNIAANAAYPL 308
+ SG C IA A+YP+
Sbjct: 333 MADAFSGKCGIAMEASYPI 351
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 177/322 (54%), Gaps = 30/322 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLA 61
+ EQWM+ R Y D EK+ RF+++++N E L NKFADLT E+F A
Sbjct: 30 RFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTNEEFRA 89
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNL--NSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
G++P T SN + + SS S+DW ++GAV VK+QG CWAF
Sbjct: 90 KMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCGSCWAF 149
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYP 177
+AVA +EG+N+I+ G+LV+ S+ +LVDC GC ++ AFE++ L +E YP
Sbjct: 150 SAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLTTEASYP 209
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--Y 235
Y + C + + S AI GY+ V P++E L + QPVSVA+D F F Y
Sbjct: 210 YHA-ANGACQAAKLNQSAV--AIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQLY 266
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTT-------TEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
GV+TGPC NHGVT+VGYG + A+G + YW+VKN WG W + G + +
Sbjct: 267 GSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYILM 326
Query: 289 FRGVGG--SGLCNIAANAAYPL 308
R V G SGLC IA +YP+
Sbjct: 327 QRDVAGLASGLCGIALLPSYPV 348
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 124/324 (38%), Positives = 169/324 (52%), Gaps = 31/324 (9%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
T + A +E W+ ++ ++Y E E RF+IFK+ F+ LN+
Sbjct: 31 TKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLRFIDEHNADTNRSYRVGLNQ 90
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FAD T E+F ++Y G+ SNR D +DW GAV +K Q
Sbjct: 91 FADQTNEEFQSTYLGFTSGSNKMKVSNRYE-----PRVGQVLPDYVDWRSAGAVVDIKSQ 145
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQ 166
G CWAF+A+ATVEG+NKI TG L++ S+ +LVDC GC + + F++I
Sbjct: 146 GQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRTQNTRGCDGGSITDGFQFIIN 205
Query: 167 YQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVA 226
+ +E YPY +D C+ + KY +I Y+ V E LQ V+ QPVSVA
Sbjct: 206 NGGINTEANYPYTA-EDGQCNL--DLQNEKYASIDTYENVPYNNEWALQTAVAYQPVSVA 262
Query: 227 IDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
++A F Y G+FTGPCG +H VTIVGYGT EG YW+VKN W T W E G
Sbjct: 263 LEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYGT----EGGIDYWIVKNSWDTTWGEEG 318
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
+RI R VGG+G C IA +YP+
Sbjct: 319 YIRILRNVGGAGTCGIATKPSYPV 342
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 165/314 (52%), Gaps = 46/314 (14%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
KHEQWM F R Y D +EK RF+IFKKN +F L +NKF+DLT E+F
Sbjct: 17 KHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFSDLTDEEFQ 76
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A Y G P S ++ F+ N S+ +S+DW GAVTPVKDQG CCWAF
Sbjct: 77 ARYMGLVPEGMT-GDSQKTVSFRYENVSETG--ESMDWRLEGAVTPVKDQGQCGCCWAFA 133
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASECVY 176
AVA VEG+ KI G+LV+ S+ QLVDCST N GC A++YI++ Q + SE Y
Sbjct: 134 AVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQGITSEENY 193
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYH 236
PYQ Q C S I GY+ V EE L VS+
Sbjct: 194 PYQAVQQ-TC----KSTDPAAATISGYEAVPKDDEEALLKAVSQH--------------- 233
Query: 237 GGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG- 294
G+F CG +H VTIVGYGT+ E YWL+KN WG +W E G MRI R V
Sbjct: 234 -GIFEDEYCGTDSHHAVTIVGYGTSEEG---IKYWLLKNSWGESWGENGYMRIKRDVDEP 289
Query: 295 SGLCNIAANAAYPL 308
G+C +A A YP+
Sbjct: 290 QGMCGLAHRAYYPV 303
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 128/316 (40%), Positives = 177/316 (56%), Gaps = 29/316 (9%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKF 59
A +E+WMV+ R Y EKE RF+IF+ N E+ L LN FAD+T ++F
Sbjct: 32 ALYEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEF 91
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A Y G K P ++ S F+ +++ + DW +GAV VK+QG+ CWAF
Sbjct: 92 KALYFGTKVPLSNTIKSG----FRYKDATNLPL--DTDWRSKGAVATVKNQGACGSCWAF 145
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVY 176
+ VA VEG+N+I TG+LV+ S+ +LVDC GC +++AFE+I Q L SE Y
Sbjct: 146 STVAAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADY 205
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF-- 234
PY+ CD R ++ I G++ V +E L V+ QPVSVAI+A+ NF
Sbjct: 206 PYKAVSG-SCDESRRNS--HVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQL 262
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEG-QQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
Y GGV+TG CG +HGV VGYGT+ +G YW+V+N WG W E G +R+ R V
Sbjct: 263 YSGGVYTGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVA 322
Query: 294 G-SGLCNIAANAAYPL 308
G C IA A+YP+
Sbjct: 323 SPRGKCGIAMMASYPV 338
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 127/315 (40%), Positives = 175/315 (55%), Gaps = 28/315 (8%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+E+W R + AEK RF FK N F L LN+F D+ + +F A
Sbjct: 46 YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDMDQAEFRA 104
Query: 62 SYTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
++ G + P S + LN S + S+DW ++GAVT VKDQG CWAF+
Sbjct: 105 TFVGDLRRDTPSKPPSVPGFMYAALNVSDLP--PSVDWRQKGAVTGVKDQGKCGSCWAFS 162
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYP 177
V +VEG+N IRTG LV+ S+ +L+DC T +GC ++NAFEYI+ L +E YP
Sbjct: 163 TVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLITEAAYP 222
Query: 178 YQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNF 234
Y+ + C+ R++ + I G+Q V +EE L V+ QPVSVA++A+ F F
Sbjct: 223 YRAARG-TCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMF 281
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG CG +HGV +VGYG AE + YW VKN WG +W E G +R+ + G
Sbjct: 282 YSEGVFTGECGTELDHGVAVVGYGV---AEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGA 338
Query: 295 S-GLCNIAANAAYPL 308
S GLC IA A+YP+
Sbjct: 339 SGGLCGIAMEASYPV 353
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 125/317 (39%), Positives = 178/317 (56%), Gaps = 32/317 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTRE 57
+ A E W+VE+ ++Y EKE RF+IFK N F+ LN+F+DLT E
Sbjct: 44 VMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTLE 103
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ + Y G K D +N S+ ++ ++ +SIDW ++GAV VK+QG+ CW
Sbjct: 104 EYSSIYLGTK---FDMRMTNVSDRYEPRVGDQLP--NSIDWRKKGAVLGVKNQGNCGSCW 158
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC---STLNGCAKNFLENAFEYIRQYQRLASE 173
F +A VE +N+I TG L++ S+ Q+VDC S NGC A+++I + +E
Sbjct: 159 TFAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDNGGINTE 218
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI--DATW 231
YPY+ QD CD ++ KY I Y+ V E+ LQ VS Q VSV I +++
Sbjct: 219 ANYPYKA-QDGECDEQKNQ---KYVTIDRYENVPRKNEKALQKAVSNQLVSVGIASNSSE 274
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+FTGPCG +H VTIVGYGT EG YW+V+N WG+NW E G +R+ R
Sbjct: 275 FKAYKSGIFTGPCGAKIDHAVTIVGYGT----EGGMDYWIVRNSWGSNWGENGYVRMQRN 330
Query: 292 VGGSGLCNIAANAAYPL 308
VG +G C IA + YP+
Sbjct: 331 VGNAGTCFIATSPNYPV 347
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 128/321 (39%), Positives = 181/321 (56%), Gaps = 30/321 (9%)
Query: 7 KTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFAD 53
K ++ +HE WMV R YKD EKE RFK FK+N EF L +NK+AD
Sbjct: 33 KELSMLERHENWMVHHGRVYKDDIEKEHRFKTFKENVEFIESFNKNGTQRYKLAVNKYAD 92
Query: 54 LTREKFLASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
LT E+F S+ G S + FK + +++ +S+DW +RG+VT VKDQG
Sbjct: 93 LTTEEFTTSFMGLDTSLLSQQESTATTTSFKYDSVTEVP--NSMDWRKRGSVTGVKDQGV 150
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ-- 168
CCWAF+A A +EG +I +L++ S+ QL+DCST N GC + A++++ Q
Sbjct: 151 CGCCWAFSAAAAIEGAYQIANNELISLSEQQLLDCSTQNKGCEGGLMTVAYDFLLQNNGG 210
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID 228
+ +E YPY+ Q+ C + +A I GY+ V P+ E L V QP+SV I
Sbjct: 211 GITTETNYPYEEAQN-VCKTEQPAAV----TINGYEVV-PSDESSLLKAVVNQPISVGIA 264
Query: 229 AT-WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
A F+ Y G++ G C + NH VT++GYG T+E +G + YW+VKN WG++W E G MR
Sbjct: 265 ANDEFHMYGSGIYDGSCNSRLNHAVTVIGYG-TSEEDGTK-YWIVKNSWGSDWGEEGYMR 322
Query: 288 IFRGVG-GSGLCNIAANAAYP 307
I R VG G C IA A++P
Sbjct: 323 IARDVGVDGGHCGIAKVASFP 343
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 208 bits (529), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 124/324 (38%), Positives = 174/324 (53%), Gaps = 26/324 (8%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKF 51
T + +E+W+V+ + Y EK+ RF+IFK N F+ LNKF
Sbjct: 28 TGRSNDEVMTMYEEWLVKHQKVYNGLREKDQRFQIFKDNLNFIDEHNAQNYTYIVGLNKF 87
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG 111
AD+T E++ Y G + N+ + +S +DW +GA+T +KDQG
Sbjct: 88 ADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRYAYNSGDRLPVHVDWRLKGAITHIKDQG 147
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQ 168
S CWAF+ +ATVE +NKI TG+LV+ S+ +LVDC GC ++ AFE+I
Sbjct: 148 SCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIGNG 207
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID 228
+ ++ YPY+G + CD R A K +I GY+ V E L+ V+ QPVSVAI+
Sbjct: 208 GIDTDQHYPYKGFEG-RCDPTRKKA--KIVSIDGYEDVPSNNENALKKAVAHQPVSVAIE 264
Query: 229 AT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
A+ Y GVFTG CG + +H V IVGYG+ E YWLV+N WGTNW E G
Sbjct: 265 ASGRALQLYQSGVFTGKCGTSLDHAVVIVGYGS----ENGLDYWLVRNSWGTNWGEDGYF 320
Query: 287 RIFRGVGG--SGLCNIAANAAYPL 308
++ R V G +G C IA A+YP+
Sbjct: 321 KMERNVKGTHTGKCGIAVEASYPV 344
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 207 bits (528), Expect = 3e-51, Method: Compositional matrix adjust.
Identities = 126/326 (38%), Positives = 179/326 (54%), Gaps = 29/326 (8%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LN 49
SRT + I A +W+ + + Y E+E RF+IFK N +F+ LN
Sbjct: 37 SRTDEEVMGIYA---EWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSENRSYKVGLN 93
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKD 109
+FADLT E++ + + G K ++S + +S+DW E GAV P+KD
Sbjct: 94 RFADLTNEEYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKD 153
Query: 110 QGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQ 166
QGS CWAF+ VA VEG+N+I TG+++ S+ +LVDC GC ++ AFE+I
Sbjct: 154 QGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCNGGLMDYAFEFIIN 213
Query: 167 YQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVA 226
+ +E YPY+G D CD R + K +I Y+ V P E L+ V+ QPVSVA
Sbjct: 214 NGGIDTEEDYPYRG-VDGTCDPERKNT--KVVSINDYEDVPPYDEMALKKAVAHQPVSVA 270
Query: 227 IDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
I+A+ F Y GVFTG CG +HGV +VGYGT A+ +W+V+N WGT+W E G
Sbjct: 271 IEASGRAFQLYLSGVFTGECGRALDHGVVVVGYGTDNGAD----HWIVRNSWGTSWGENG 326
Query: 285 SMRIFRGVGGS--GLCNIAANAAYPL 308
+R+ R V + G C IA A+YP+
Sbjct: 327 YIRMERNVVDNFGGKCGIAMQASYPI 352
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/324 (39%), Positives = 176/324 (54%), Gaps = 42/324 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTRE 57
+ ++E W+ E R Y EKE RF+IFK N F+ LN+FADLT E
Sbjct: 46 VKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADLTNE 105
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKM-------SFYDSIDWNERGAVTPVKDQ 110
++ Y G K R + K+ N S+ S+DW +RGAV P+K+Q
Sbjct: 106 EYRTMYLGTKSDA-------RRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQ 158
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQY 167
GS CWAF+ VA VEG+N+I TG+++T S+ +LVDC + +GC ++ AFE+I
Sbjct: 159 GSCGSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN 218
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
+ +E YPY+G + CD R + K +I GY+ V P E LQ V+ QPV VAI
Sbjct: 219 GGMDTEKHYPYRGVEG-RCDPVRKNY--KVVSIDGYEDV-PRNERALQKAVAHQPVCVAI 274
Query: 228 DAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+A+ F Y GVFTG CG +HGV +VGYG+ E YW+V+N WGT W E G
Sbjct: 275 EASGRAFQLYSSGVFTGECGEEVDHGVVVVGYGS----EDGVDYWIVRNSWGTKWGENGY 330
Query: 286 MRIFRGVGGS--GLCNIAANAAYP 307
+++ R V S G C I A+YP
Sbjct: 331 VKMERNVKKSHLGKCGIMTEASYP 354
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 124/327 (37%), Positives = 175/327 (53%), Gaps = 26/327 (7%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------L 48
MS ++ + +E+W+V+ + Y EKE RF++FK N F++ L
Sbjct: 22 MSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGL 81
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
NKFAD+T E++ A Y G + ++ + +S +DW +GAV P+K
Sbjct: 82 NKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIK 141
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIR 165
DQG+ CWAF+ VA VEG+N I TG+ V+ S+ +LVDC GC ++ AF++I
Sbjct: 142 DQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYDEGCNGGLMDYAFQFII 201
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
Q + +E YPYQG D CD ++ K I GY+ V E L+ VS QPVSV
Sbjct: 202 QNGGIDTEEDYPYQGI-DGTCD--QTKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVSV 258
Query: 226 AIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AI+A+ Y GVFTG CG +HGV +VGYGT E YWLV+N WGT W E
Sbjct: 259 AIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT----ENGVDYWLVRNSWGTGWGED 314
Query: 284 GSMRIFRGVGGS--GLCNIAANAAYPL 308
G ++ R V + G C IA + +YP+
Sbjct: 315 GYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/308 (41%), Positives = 170/308 (55%), Gaps = 27/308 (8%)
Query: 19 MVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASYTGY 66
+V + + Y EK RF++FK N +L LN+FADLT ++F A+Y G
Sbjct: 33 IVGYRKAYASFEEKVRRFEVFKDNLNHIDDINKKVTSYWLGLNEFADLTHDEFKATYLGL 92
Query: 67 KPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
PPPT + S F+ S +DW ++ AVT VK+QG CWAF+ VA V
Sbjct: 93 TPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSCWAFSTVAAV 152
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQ 182
EG+N I TG L + S+ +L+DCST NGC ++ AF YI L +E YPY +
Sbjct: 153 EGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTEEAYPY-AME 211
Query: 183 DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVF 240
+ CD + +A I GY+ V E+ L ++ QPVSVAI+A+ F FY GGVF
Sbjct: 212 EGDCDEGKGAA---VVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSGGVF 268
Query: 241 TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCN 299
GPCG +HGVT VGYGT+ Q Y +VKN WG +W E G +R+ RG G G GLC
Sbjct: 269 DGPCGEQLDHGVTAVGYGTSK----GQDYIIVKNSWGPHWGEKGYIRMKRGTGKGEGLCG 324
Query: 300 IAANAAYP 307
I A+YP
Sbjct: 325 INKMASYP 332
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 207 bits (528), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 128/300 (42%), Positives = 173/300 (57%), Gaps = 31/300 (10%)
Query: 31 EKEMRFKIF-----------KKNHEF-LRLNKFADLTREKFLASYTGYKPPP----TDHP 74
E+E RF +F KKN + L+LNKFADLT +F +YTG K
Sbjct: 53 EREKRFNVFRHNVMHVHNSNKKNRSYKLKLNKFADLTIHEFKNAYTGSKIKHHRMLQGPK 112
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
++ + + N SK+ S+DW ++GAVT +K+QG CWAF+ VA VEG+NKI+T
Sbjct: 113 RGSKQFMYDHENVSKLP--SSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTN 170
Query: 134 QLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
+LV+ S+ +LVDC T GC +E AFE+I++ + +E YPY+G D CD S
Sbjct: 171 KLVSLSEQELVDCDTNQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEG-IDGKCD--AS 227
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPN 249
+G I G++ V E L V+ QPVSVAIDA + F FY GVFTG CG N
Sbjct: 228 KDNGVLVTIDGHENVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELN 287
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
HGV VGYG+ +G + YW+V+N WGT W EGG ++I RG+ G C IA A+YP+
Sbjct: 288 HGVATVGYGS----QGGKKYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPI 343
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 121/310 (39%), Positives = 170/310 (54%), Gaps = 26/310 (8%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKFLASYTG 65
W+ + + Y EK RF+IFK N F+ L KFADLT +++ A + G
Sbjct: 31 WLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQNRTYKVGLTKFADLTNQEYRAMFLG 90
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
+ P +++ + + +S+DW +GAV P+KDQGS CWAF+ VA V
Sbjct: 91 TRSDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAVNPIKDQGSCGSCWAFSTVAAV 150
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQ 182
EG+N+I TG+L++ S+ +LVDC GC ++ AF++I L +E YPY G
Sbjct: 151 EGINQIVTGELISLSEQELVDCDRFYNAGCNGGLMDYAFQFIINNGGLDTEKDYPYLGND 210
Query: 183 DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGGVF 240
D CD R K +I G++ V P E+ LQ V+ QPVSVAI+A+ FY GVF
Sbjct: 211 D-TCD--RDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQPVSVAIEASGMALQFYQSGVF 267
Query: 241 TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--SGLC 298
TG CG +HGV +VGYGT E YWLV+N WGT W E G +++ R V +G C
Sbjct: 268 TGECGTALDHGVVVVGYGT----EKGLDYWLVRNSWGTEWGEHGYIKMQRNVRDTYTGRC 323
Query: 299 NIAANAAYPL 308
IA ++YP+
Sbjct: 324 GIAMESSYPV 333
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 126/310 (40%), Positives = 172/310 (55%), Gaps = 27/310 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN-HEF-----------LRLNKFADLTREKFLASY 63
E+W+ ++ + Y EK RF++FK N H L LN FADLT ++F A+Y
Sbjct: 67 EEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVTTYWLGLNAFADLTHDEFKATY 126
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G + P T +R F+ + S+DW ++GAVT VK+QG CWAF+ VA
Sbjct: 127 LGLRQPETKKTTDSR---FRYGGVADDDVPASVDWRKKGAVTDVKNQGQCGSCWAFSTVA 183
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +LVDCST NGC ++NAF YI L +E YPY
Sbjct: 184 AVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAFSYIASSGGLRTEEAYPYLM 243
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGG 238
+ D R + I GY+ V E+ L ++ QP+SVAI+A+ F FY GG
Sbjct: 244 EEGDCDDKARDGE--QVVTISGYEDVPANDEQALVKALAHQPLSVAIEASGRHFQFYSGG 301
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF GPCG+ +HGV VGYG++ Q Y +VKN WG++W E G +R+ RG G GL
Sbjct: 302 VFNGPCGSELDHGVAAVGYGSSK----GQDYIIVKNSWGSHWGEKGYIRMKRGTGKPEGL 357
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 358 CGINKMASYP 367
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 128/319 (40%), Positives = 175/319 (54%), Gaps = 39/319 (12%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFLA 61
+E+W A + +D +K+ RF +FK+N HEF L LNKF D+T ++F A
Sbjct: 38 YERWRSHHAVS-RDLDQKQKRFNVFKENVKFIHEFNKNKDVTFKLALNKFGDMTNQEFRA 96
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD------SIDWNERGAVTPVKDQGSY-C 114
Y G K H H S Y+ SIDW ERGAV VK+QG
Sbjct: 97 KYAGSKV----HHHRTMKGSRHGSGSGAKFMYENAVAPPSIDWRERGAVAAVKNQGQCGS 152
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLAS 172
CWAF+A+A VEG+N+I T +LV S+ +L+DC T GC+ ++ AFE+I+ + +
Sbjct: 153 CWAFSAIAAVEGINQIVTKELVPLSEQELIDCDTDQNQGCSGGLMDYAFEFIKNNGGITT 212
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT-- 230
E VYPYQ +D C + I GY+ V E+ L V+ QPV+VAI+A+
Sbjct: 213 EDVYPYQA-EDATC-----KKNSPAVVIDGYEDVPTNDEDALMKAVANQPVAVAIEASGY 266
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F FY GVFTG CG +HGV +VGYGTT + YW V+N WG +W E G +R+ R
Sbjct: 267 VFQFYSEGVFTGRCGTELDHGVAVVGYGTTQDG---TKYWTVRNSWGADWGESGYVRMQR 323
Query: 291 GVGGS-GLCNIAANAAYPL 308
G+ + GLC IA A+YP+
Sbjct: 324 GIKATHGLCGIAMQASYPI 342
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 127/301 (42%), Positives = 174/301 (57%), Gaps = 27/301 (8%)
Query: 27 KDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHP 74
+D +EK RF +FK N H+ L+LN FAD+T +F Y+ K
Sbjct: 51 RDLSEKRKRFNVFKANVHHIHKVNQKDKPYKLKLNSFADMTNHEFREFYSS-KVKHYRML 109
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
H +R+N ++ S S+DW ++GAVT VK+QG CWAF+ V VEG+NKI+TG
Sbjct: 110 HGSRAN-TGFMHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTG 168
Query: 134 QLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSS 192
QLV+ S+ +LVDC T N GC +ENA+E+I++ + +E +YPY+ R D CD + +
Sbjct: 169 QLVSLSEQELVDCETDNEGCNGGLMENAYEFIKKSGGITTERLYPYKAR-DGSCDSSKMN 227
Query: 193 ASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTG-PCGNTPN 249
A I G++ V E L V+ QPVSVAIDA+ FY GV+ G CGN +
Sbjct: 228 APAV--TIDGHEMVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELD 285
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS--GLCNIAANAAYP 307
HGV +VGYGT + YW+VKN WGT W E G +R+ RGV + G+C IA A+YP
Sbjct: 286 HGVAVVGYGTALDG---TKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYP 342
Query: 308 L 308
L
Sbjct: 343 L 343
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 125/315 (39%), Positives = 175/315 (55%), Gaps = 33/315 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +WM E RTY E+E RF++F+ N H F L LN+FADLT E+
Sbjct: 41 YAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHSFRLGLNRFADLTNEE 100
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ ++Y G + P D + + + N +++DW ++GAV +KDQG CWA
Sbjct: 101 YRSTYLGARTKP-DRERKLSARYQADDNEE---LPETVDWRKKGAVAAIKDQGGCGSCWA 156
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+A VEG+N+I TG ++ S+ +LVDC T GC ++ AFE+I + SE
Sbjct: 157 FSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEED 216
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+ R D CD + +A K I GY+ V +E+ LQ V+ QP+SVAI+A F
Sbjct: 217 YPYKER-DNRCDANKKNA--KVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQ 273
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGV VGYGT E + YWLV+N WGT W E G +R+ R +
Sbjct: 274 LYKSGIFTGTCGTALDHGVAAVGYGT----ENGKDYWLVRNSWGTVWGEDGYIRMERNIK 329
Query: 293 GGSGLCNIAANAAYP 307
SG C IA +YP
Sbjct: 330 ASSGKCGIAVEPSYP 344
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 207 bits (527), Expect = 5e-51, Method: Compositional matrix adjust.
Identities = 119/310 (38%), Positives = 170/310 (54%), Gaps = 27/310 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+ E+WM E+ R YKD EK +RF+IFK N L +N+F D+T +F+
Sbjct: 36 QFEEWMAEYGRVYKDNDEKMLRFQIFKNNVNHIETFNNRNGNSYTLGINQFTDMTNNEFV 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A YTG P S F +++ S S SIDW + GAVT VK+QG CWAF
Sbjct: 96 AQYTGLSLPLNIKREPVVS--FDDVDIS--SVPQSIDWRDSGAVTSVKNQGRCGSCWAFA 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
++ATVE + KI+ G LV+ S+ Q++DC+ GC ++ A+ +I + +AS +YPY+
Sbjct: 152 SIATVESIYKIKRGNLVSLSEQQVLDCAVSYGCKGGWINKAYSFIISNKGVASAAIYPYK 211
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFYHGG 238
+ C +++ I Y YVQ E + VS QP++ A+DA+ F Y G
Sbjct: 212 AAKG-TC---KTNGVPNSAYITRYTYVQRNNERNMMYAVSNQPIAAALDASGNFQHYKRG 267
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GL 297
VFTGPCG NH + I+GYG + + +W+V+N WG W EGG +R+ R V S GL
Sbjct: 268 VFTGPCGTRLNHAIVIIGYGQDSSG---KKFWIVRNSWGAGWGEGGYIRLARDVSSSFGL 324
Query: 298 CNIAANAAYP 307
C IA + YP
Sbjct: 325 CGIAMDPLYP 334
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 207 bits (526), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 127/315 (40%), Positives = 175/315 (55%), Gaps = 28/315 (8%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+E+W R + AEK RF FK N F L LN+F D+ + +F A
Sbjct: 46 YERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDMDQAEFRA 104
Query: 62 SYTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
++ G + P S + LN S + S+DW ++GAVT VKDQG CWAF+
Sbjct: 105 TFVGDLRRDTPAKPPSVPGFMYAALNVSDLP--PSVDWRQKGAVTGVKDQGKCGSCWAFS 162
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYP 177
V +VEG+N IRTG LV+ S+ +L+DC T +GC ++NAFEYI+ L +E YP
Sbjct: 163 TVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGGLITEAAYP 222
Query: 178 YQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNF 234
Y+ + C+ R++ + I G+Q V +EE L V+ QPVSVA++A+ F F
Sbjct: 223 YRAARG-TCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEASGKAFMF 281
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG CG +HGV +VGYG AE + YW VKN WG +W E G +R+ + G
Sbjct: 282 YSEGVFTGDCGTELDHGVAVVGYGV---AEDGKAYWTVKNSWGPSWGEQGYIRVEKDSGA 338
Query: 295 S-GLCNIAANAAYPL 308
S GLC IA A+YP+
Sbjct: 339 SGGLCGIAMEASYPV 353
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 206 bits (525), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 175/314 (55%), Gaps = 28/314 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKF 59
+H WM E R Y D EK R+ +FK+N E L +N+FADLT E+F
Sbjct: 36 RHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEF 95
Query: 60 LASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ YTGYK + +++ +++++S + S+DW ++GAVTP+KDQGS CWA
Sbjct: 96 RSMYTGYKGNSVLSSRTKPTSFRYQHVSSDALPI--SVDWRKKGAVTPIKDQGSCGSCWA 153
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVY 176
F+AVA +EG+ +I+ G+L++ S+ +LVDC T +GC ++ +AF Y L SE Y
Sbjct: 154 FSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDDGCMGGYMNSAFNYTMTTGGLTSESNY 213
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI--DATWFNF 234
PY+ D C+ ++ +I+G++ V E+ L V+ PVS+ I T F F
Sbjct: 214 PYK-STDGTCNINKTKQIAT--SIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQF 270
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVF+G C +HGV +VGYG ++ YW++KN WG W E G MRI +
Sbjct: 271 YSSGVFSGECSTHLDHGVAVVGYGKSSNGS---KYWILKNSWGPKWGERGYMRIKKDTKA 327
Query: 295 S-GLCNIAANAAYP 307
G C +A NA+YP
Sbjct: 328 KHGQCGLAMNASYP 341
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 206 bits (525), Expect = 7e-51, Method: Compositional matrix adjust.
Identities = 130/323 (40%), Positives = 178/323 (55%), Gaps = 45/323 (13%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM ++ R YKD AEKE RF++FK N F L +N+FADL E+F
Sbjct: 36 RHEKWMAQYGRVYKDAAEKEKRFQVFKNNVHFIESFNAAGDKPFNLSINQFADLNDEEFK 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSF-YDS-------IDWNERGAVTPVKDQGS 112
A + +++W + S++ SF Y+S ID +RGAVTP+KDQG
Sbjct: 96 ALLINVQ---------KKASWVE--TSTETSFRYESVTKIPATIDRRKRGAVTPIKDQGR 144
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+AVA EG+++I TG+LV S+ +LVDC GC ++++AFE+I +
Sbjct: 145 CGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCVKGESEGCIGGYVDDAFEFIAKKGG 204
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
+ASE YPY+G + C + + I+GY+ V E+ L V+ QPVSV IDA
Sbjct: 205 IASETHYPYKG-VNKTCKVKKETHG--VAEIKGYEKVPSNNEKALLKAVANQPVSVYIDA 261
Query: 230 --TWFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F +Y G+F CG PNH V +VGYG +A YWLVKN WGT W E G +
Sbjct: 262 GTHAFKYYSSGIFNARNCGTDPNHAVAVVGYG---KALDDSKYWLVKNSWGTEWGERGYI 318
Query: 287 RIFRGV-GGSGLCNIAANAAYPL 308
RI R + GLC IA YP+
Sbjct: 319 RIKRDIRAKEGLCGIAKYPYYPI 341
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 128/319 (40%), Positives = 177/319 (55%), Gaps = 31/319 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ + +E W+V+ + Y EKE RF IFK N F L LNKFADLT +
Sbjct: 56 LLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTND 115
Query: 58 KFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++ + Y K + + + RS+ F + + +S+DW +RGAV PVKDQG
Sbjct: 116 EYRSLYLSGKMMKRERKNEDGFRSDRFVFEDGDHLP--ESVDWRDRGAVAPVKDQGQCGS 173
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQRLAS 172
CWAF+ V VEG+NKI TG+L++ S+ +LVDC GC ++ AFE+I + + +
Sbjct: 174 CWAFSTVGAVEGINKIVTGELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDT 233
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPY+G D CD R +A K I GY+ V E+ L+ V+ QPVSVAI+A
Sbjct: 234 EDDYPYKG-VDGLCDQNRKNA--KVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGR 290
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F Y GVFTG CG +HGV VGYG+ E + YW+V+N WG +W E G +R+ R
Sbjct: 291 AFQLYESGVFTGQCGTELDHGVVAVGYGS----ENGKDYWIVRNSWGPDWGESGYIRLER 346
Query: 291 GVG--GSGLCNIAANAAYP 307
V +G C IA A+YP
Sbjct: 347 NVASTSTGKCGIAMQASYP 365
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 130/318 (40%), Positives = 170/318 (53%), Gaps = 40/318 (12%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKFLAS 62
+E W+VE + Y EKE RF+IFK N F+ LN+FADLT E++ A
Sbjct: 51 YEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSVDRSYKVGLNRFADLTNEEYKAM 110
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD------SIDWNERGAVTPVKDQGSY-CC 115
+ G K R N F S + F D ++DW E+GAV PVKDQG C
Sbjct: 111 FLGTK--------MERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQGQCGSC 162
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASE 173
WAF+ V VEG+N+I TG+L++ S+ +LVDC S GC ++ AFE+I + +E
Sbjct: 163 WAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDTE 222
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY+ D CD R +A K I GY+ V E L+ V+ QPVSVAI+A
Sbjct: 223 EDYPYKA-SDNICDPNRKNA--KVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGGRA 279
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y GVFTG CG +HGV VGYGT E YW+V+N WG+ W E G +R+ R
Sbjct: 280 FQLYKSGVFTGRCGTELDHGVVAVGYGT----ENGVNYWIVRNSWGSAWGESGYIRMERN 335
Query: 292 VGG--SGLCNIAANAAYP 307
V +G C IA +YP
Sbjct: 336 VANTKTGKCGIAIQPSYP 353
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 206 bits (525), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 124/322 (38%), Positives = 178/322 (55%), Gaps = 29/322 (9%)
Query: 10 NIAAKHEQW-MVEFARTY----KDQAEKEMRFKIFKKNHEF------------LRLNKFA 52
++A++ W + E R+Y +D EK RF +FK+N + L+LNKFA
Sbjct: 27 DLASEESLWDLYERWRSYHTVSRDLEEKNKRFNVFKENTKHVHKVNQMDKPYKLKLNKFA 86
Query: 53 DLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
D+T +F +SY G K +R ++ S+DW ++GAVT +KDQG
Sbjct: 87 DMTNHEFRSSYGGSKVKHYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGK 146
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+ V VEG+N+I+T +L++ S+ QL+DC S +GC +E+AFE+I++
Sbjct: 147 CGSCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGG 206
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
+ +E YPY+ + D CD + +A I G++ V E L V+ QPVSVAIDA
Sbjct: 207 ITTENNYPYKAK-DERCDMLKMNAP--VVTIDGHESVPVNDERALMKAVAHQPVSVAIDA 263
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
+ FY GVF G CG +HGV IVGYGTT + YW+VKN WG W E G +R
Sbjct: 264 GGSDLQFYSEGVFDGECGTELDHGVAIVGYGTTLDG---TKYWIVKNSWGAEWGEKGYIR 320
Query: 288 IFRGV-GGSGLCNIAANAAYPL 308
+ RG+ G C IA A+YP+
Sbjct: 321 MARGIQAAEGQCGIAMEASYPV 342
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 121/311 (38%), Positives = 173/311 (55%), Gaps = 27/311 (8%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLASYT 64
W+ + ++TY E+E RF+IFK N F+ L +FADLT E++ A +
Sbjct: 51 WLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGLTRFADLTNEEYRAKFL 110
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
G K P +++ + + +SIDW + GAV+ +KDQGS CWAF+ +A
Sbjct: 111 GTKSDPKRRLMKSKNPSQRYAFKAGDVLPESIDWRQSGAVSAIKDQGSCGSCWAFSTIAA 170
Query: 124 VEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
VEG+NKI TG+L++ S+ +LVDC S GC ++NAF++I + ++ YPYQ
Sbjct: 171 VEGVNKIVTGELISLSEQELVDCDRSYNAGCNGGLMDNAFQFIINNGGIDTDKDYPYQA- 229
Query: 182 QDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGGV 239
D CD + K I G++ V E LQ V+ QPVSVAI+A+ FY GV
Sbjct: 230 VDGKCD--TTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVSVAIEASGMALQFYQSGV 287
Query: 240 FTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--SGL 297
FTG CG+ +HGV IVGYGT E YWLV+N WG +W E G +++ R V +G
Sbjct: 288 FTGECGSALDHGVVIVGYGT----EDGIDYWLVRNSWGRDWGENGYIKMQRNVVDTFTGK 343
Query: 298 CNIAANAAYPL 308
C IA ++YP+
Sbjct: 344 CGIAMESSYPI 354
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 206 bits (525), Expect = 9e-51, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 179/316 (56%), Gaps = 39/316 (12%)
Query: 15 HEQWMVEF--ARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFL 60
+E+W + AR++ EK+ RF +FK+N ++ LRLN+F DLT +F
Sbjct: 44 YERWRSVYTSARSF---GEKQNRFHVFKENVKYINEVNKMDKPYKLRLNQFGDLTPSEFA 100
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAF 118
+Y K N S F N + SIDW +GAVTPVK+QG C CWAF
Sbjct: 101 RTYANSK---IIEGTRNESGGFMYEN---VEVPRSIDWRVKGAVTPVKNQGR-CGGCWAF 153
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
+A A VEG+N+I TGQL++ S+ QL+DC T N GC + AFEYI+Q + SE YP
Sbjct: 154 SAAAAVEGINQITTGQLISLSEQQLIDCDTQNSGCRGGTMGRAFEYIKQRGGITSEANYP 213
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN---- 233
Y+ Q C + +I GY ++ +E+ + +++ QPVSVA+DAT ++
Sbjct: 214 YKA-QAGMCK--NNLIQRPTVSIDGYYNIR-RSEDAVLKILAHQPVSVAVDATTWSSLDW 269
Query: 234 -FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFTGPCG NHGVT VGYGTT + YW++KN WG W E G MR+ RGV
Sbjct: 270 MFYFQGVFTGPCGTKLNHGVTAVGYGTTNDG---YDYWIIKNSWGETWGERGYMRMLRGV 326
Query: 293 GGSGLCNIAANAAYPL 308
GLC IA A++P+
Sbjct: 327 SPYGLCGIAMQASFPI 342
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/322 (38%), Positives = 178/322 (55%), Gaps = 29/322 (9%)
Query: 10 NIAAKHEQW-MVEFARTY----KDQAEKEMRFKIFKKNHEF------------LRLNKFA 52
++A++ W + E R+Y +D EK RF +FK+N + L+LNKFA
Sbjct: 29 DLASEESLWDLYERWRSYHTVSRDLEEKNKRFNVFKENTKHVHKVNQMDKPYKLKLNKFA 88
Query: 53 DLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
D+T +F +SY G K +R ++ S+DW ++GAVT +KDQG
Sbjct: 89 DMTNHEFRSSYGGSKVKHYRMLRGDRRGTGGFMHEKTTYLPPSVDWRKKGAVTGIKDQGK 148
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+ V VEG+N+I+T +L++ S+ QL+DC S +GC +E+AFE+I++
Sbjct: 149 CGSCWAFSTVVGVEGINQIKTKELLSLSEQQLIDCDRSDDHGCNGGLMESAFEFIKKNGG 208
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
+ +E YPY+ + D CD + +A I G++ V E L V+ QPVSVAIDA
Sbjct: 209 ITTENNYPYKAK-DERCDMLKMNAP--VVTIDGHESVPVNDERALMKAVAHQPVSVAIDA 265
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
+ FY GVF G CG +HGV IVGYGTT + YW+VKN WG W E G +R
Sbjct: 266 GGSDLQFYSEGVFDGECGTELDHGVAIVGYGTTLDG---TKYWIVKNSWGAEWGEKGYIR 322
Query: 288 IFRGV-GGSGLCNIAANAAYPL 308
+ RG+ G C IA A+YP+
Sbjct: 323 MARGIQAAEGQCGIAMEASYPV 344
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 121/324 (37%), Positives = 176/324 (54%), Gaps = 26/324 (8%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
T++ + +E+W+V+ + Y EK+ RF++FK N F++ LN+
Sbjct: 29 TNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFKDNLGFIQEHNNNQNNTYKLGLNQ 88
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FAD+T E++ Y G K +S + S+ +DW +GAV P+KDQ
Sbjct: 89 FADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYSAGDRLPVHVDWRVKGAVAPIKDQ 148
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQY 167
GS CWAF+ VATVE +NKI TG+ V+ S+ +LVDC GC ++ AFE+I Q
Sbjct: 149 GSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRAYNEGCNGGLMDYAFEFIIQN 208
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
+ ++ YPY+G D CD + +A K I G++ V P E L+ V+ QPVS+AI
Sbjct: 209 GGIDTDKDYPYRGF-DGICDPTKKNA--KVVNIDGFEDVPPYDENALKKAVAHQPVSIAI 265
Query: 228 DATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+A+ Y GVFTG CG + +HGV +VGYG +E YWLV+N WGT W E G
Sbjct: 266 EASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYG----SENGVDYWLVRNSWGTGWGEDGY 321
Query: 286 MRIFRGV-GGSGLCNIAANAAYPL 308
++ R V +G C I A+YP+
Sbjct: 322 FKMQRNVRTPTGKCGITMEASYPV 345
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 128/319 (40%), Positives = 170/319 (53%), Gaps = 37/319 (11%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFLA 61
+E+W E + EK RF FK N HE LRLN+F D+ RE+F A
Sbjct: 46 YERWQ-EHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRGGRGYRLRLNRFGDMGREEFRA 104
Query: 62 SYTGYKPPPTDHPHSNRSNWFKN------LNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++ G H + R + + ++DW +GAVT VKDQG
Sbjct: 105 TFAG------SHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGS 158
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLAS 172
CWAF+ V +VEG+N IRTG+LV+ S+ +L+DC T + GC +ENAFEYI+ + +
Sbjct: 159 CWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGITT 218
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPY+ + CD R+ + I G+Q V +E L V+ QPVSVAIDA
Sbjct: 219 ESAYPYRA-ANGTCDAVRARRA-PLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQ 276
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F FY GVF G CG +HGV +VGYG T + YW+VKN WGT W EGG +R+ R
Sbjct: 277 SFQFYSDGVFAGDCGTDLDHGVAVVGYGETNDG---TEYWIVKNSWGTAWGEGGYIRMQR 333
Query: 291 GVG-GSGLCNIAANAAYPL 308
G GLC IA A+YP+
Sbjct: 334 DSGYDGGLCGIAMEASYPV 352
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 125/326 (38%), Positives = 179/326 (54%), Gaps = 34/326 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------N 49
++ S + ++E W+ + R Y+D+ E E+RF I++ N +++ N
Sbjct: 26 TKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIYQSNVQYIEFYNSQNYSYKLIDN 85
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKD 109
+FAD+T E+F ++Y GY P F+ ++ SIDW ++GAVT VKD
Sbjct: 86 RFADITNEEFKSTYLGYLPRFRVQTE------FRYHKHGELP--KSIDWRKKGAVTHVKD 137
Query: 110 QGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC---STLNGCAKNFLENAFEYIR 165
QG CWAF+AVA VEG+NKI+T LV+ S+ QL+DC S GC + AF YI+
Sbjct: 138 QGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDCDIKSGNEGCEGGDMYIAFNYIK 197
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
++ +A+ YPY+GR D C+ +S A I GY+ V E+ L+ V+ QPVS+
Sbjct: 198 KHGGIATAKEYPYKGR-DGNCN--KSKAKNNAVTISGYESVPARNEKMLKAAVAHQPVSI 254
Query: 226 AIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
A DA F FY G+F+G CG NHG+TIVGYG E YW+VKN W +W E
Sbjct: 255 ATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYG----EENGDKYWIVKNSWANDWGES 310
Query: 284 GSMRIFRGV-GGSGLCNIAANAAYPL 308
G +R+ R G C IA +A YP+
Sbjct: 311 GYVRMKRDTKDKDGTCGIAMDATYPV 336
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 206 bits (524), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 125/327 (38%), Positives = 179/327 (54%), Gaps = 28/327 (8%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------L 48
+SR S G + ++ W+ + + Y E+E RF+IFK+N +F+ L
Sbjct: 23 LSRRSD--GEVREIYDLWLAKHGKAYNGIDEREKRFQIFKENLKFIDDHNSENRTYKVGL 80
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N FADLT E++ A Y G + PP ++ + ++ +S+DW RGAV PVK
Sbjct: 81 NMFADLTNEEYRALYLGTRSPPARRVMKAKTASRRYAVNNLDRLPESMDWRTRGAVAPVK 140
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIR 165
+QGS CWAF+ +A VEG+N+I TG+L++ S+ +LV C +GC ++ AF++I
Sbjct: 141 NQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKYNSGCNGGLMDYAFQFII 200
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
L +E YPY+ D CD R +A K +I Y+ V EE L+ V+ QPVSV
Sbjct: 201 DNGGLDTEEDYPYEAF-DGQCDPTRKNA--KVVSIDAYEDVPANDEESLKKAVAHQPVSV 257
Query: 226 AIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AI+A+ Y GVFTG CG+ +HGV VGYG E YWLV+N WGT+W E
Sbjct: 258 AIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYGK----ENGVDYWLVRNSWGTSWGED 313
Query: 284 GSMRIFRGVG--GSGLCNIAANAAYPL 308
G ++ R V G C IA A+YP+
Sbjct: 314 GYFKLERNVKHITEGKCGIAMQASYPV 340
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 124/315 (39%), Positives = 171/315 (54%), Gaps = 33/315 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +WM E TY E+E RF+ F+ N H F L LN+FADLT E+
Sbjct: 43 YAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFADLTNEE 102
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ ++Y G + P R + + +S+DW ++GAV VKDQG CWA
Sbjct: 103 YRSTYLGAR----TKPDRERKLSARYQAADNDELPESVDWRKKGAVGAVKDQGGCGSCWA 158
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+A VEG+N+I TG ++ S+ +LVDC T GC ++ AFE+I + SE
Sbjct: 159 FSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDSEED 218
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+ R D CD + +A K I GY+ V +E+ LQ V+ QP+SVAI+A F
Sbjct: 219 YPYKER-DNRCDANKKNA--KVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQ 275
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGV VGYGT E + YWLV+N WG+ W E G +R+ R +
Sbjct: 276 LYKSGIFTGTCGTALDHGVAAVGYGT----ENGKDYWLVRNSWGSVWGEDGYIRMERNIK 331
Query: 293 GGSGLCNIAANAAYP 307
SG C IA +YP
Sbjct: 332 ASSGKCGIAVEPSYP 346
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 206 bits (523), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 168/317 (52%), Gaps = 36/317 (11%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-----------LNKFADLTREKFLASY 63
+E+W E + EK RF FK N ++ LN+F D+ RE+F A++
Sbjct: 46 YERWQ-EHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPGYAPLNRFGDMGREEFRATF 104
Query: 64 TGYKPPPTDHPHSNRSNWFKN------LNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
G H + R + + ++DW +GAVT VKDQG CW
Sbjct: 105 AG------SHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGSCW 158
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASEC 174
AF+ V +VEG+N IRTG+LV+ S+ +L+DC T + GC +ENAFEYI+ + +E
Sbjct: 159 AFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGITTES 218
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY+ + CD R A G I G+Q V +E L V+ QPVSVAIDA F
Sbjct: 219 AYPYRA-ANGTCDAVR--ARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSF 275
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVF G CG +HGV +VGYG T + YW+VKN WGT W EGG +R+ R
Sbjct: 276 QFYSDGVFAGDCGTDLDHGVAVVGYGETNDG---TEYWIVKNSWGTAWGEGGYIRMQRDS 332
Query: 293 G-GSGLCNIAANAAYPL 308
G GLC IA A+YP+
Sbjct: 333 GYDGGLCGIAMEASYPV 349
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 173/316 (54%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +W E ++Y E+E R+ F+ N H F L LN+FADLT E+
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ +Y G + + P R + L + + +S+DW +GAV +KDQG CWA
Sbjct: 100 YRDTYLGLR----NKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+A VEG+N+I TG L++ S+ +LVDC T GC ++ AF++I + +E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDTEDD 215
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G+ D CD R +A K I Y+ V P +E LQ V+ QPVSVAI+A F
Sbjct: 216 YPYKGK-DERCDVNRKNA--KVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQ 272
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R +
Sbjct: 273 LYSSGIFTGKCGTALDHGVAAVGYGT----ENGKDYWIVRNSWGKSWGESGYVRMERNIK 328
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 329 ASSGKCGIAVEPSYPL 344
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 126/330 (38%), Positives = 179/330 (54%), Gaps = 34/330 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------L 48
+RT + N +E W+ +TY EKE RF+IF N +F+ L
Sbjct: 26 TRTDEEVRN---TYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVGL 82
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKN---LNSSKMSFYDSIDWNERGAVT 105
N+FADLT E++ + Y G K P + + ++M F +DW ERGAV+
Sbjct: 83 NQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEM-FPAKVDWRERGAVS 141
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFE 162
PVK+QG CWAF+ VA+VEG+NKI TG L++ S+ +LVDC +GC ++ AF+
Sbjct: 142 PVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYAFQ 201
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I + SE YPY+G CD R+ A K +I GY+ V P E+ L V+ QP
Sbjct: 202 FIVSNGGIDSESDYPYKG-VGAVCDPVRNKA--KIVSIDGYEDVPPMNEKALMKAVAHQP 258
Query: 223 VSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSV I+A+ F Y GV TG CG +HGV +VGYG+ E + YW+V+N WG W
Sbjct: 259 VSVGIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYGS----ENGKDYWIVRNSWGPEW 314
Query: 281 DEGGSMRIFRGVGGS--GLCNIAANAAYPL 308
E G +R+ R + + G+C I A+YP+
Sbjct: 315 GEDGYIRMERNMVDTPVGMCGITLMASYPI 344
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 131/316 (41%), Positives = 176/316 (55%), Gaps = 35/316 (11%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFLA 61
+E+W A + +D + + RF +FK+N HEF L LNKF D+T ++F +
Sbjct: 41 YEKWRAHHAVS-RDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDMTNQEFRS 99
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNS-SKMSFYD---SIDWNERGAVTPVKDQGSY-CCW 116
+Y G K DH + R K+ S F+D S+DW E+GAVT VKDQG CW
Sbjct: 100 TYAGSK---IDHHMTLRG--VKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGSCW 154
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECV 175
AF+ V VEG+N+I+T +LV+ S+ QLVDC T N GC ++ AF++I+ L+SE
Sbjct: 155 AFSTVVAVEGINQIKTNELVSLSEQQLVDCDTKNSGCNGGLMDYAFDFIKNNGGLSSEDS 214
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFN 233
YPY Q C S A+ I GYQ V E L V+ QPVSVAI+A+ F
Sbjct: 215 YPYLAEQK-SCG---SEANSAVVTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQ 270
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GVF+G CG +HGV VGYG + + YW+VKN WG W E G +R+ RG+
Sbjct: 271 FYSQGVFSGHCGTELDHGVAAVGYGVDDDG---KKYWIVKNSWGEGWGESGYIRMERGIK 327
Query: 294 GS-GLCNIAANAAYPL 308
G C IA A+YP+
Sbjct: 328 DKRGKCGIAMEASYPI 343
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 123/327 (37%), Positives = 174/327 (53%), Gaps = 26/327 (7%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------L 48
MS ++ + +E+W+V+ + Y EKE RF++FK N F++ L
Sbjct: 22 MSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFKDNLGFIQDHNAQNNTYTLGL 81
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
NKFAD+T +++ A Y G + ++ + +S +DW +GAV P+K
Sbjct: 82 NKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYNSGDQLPVHVDWRLKGAVGPIK 141
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIR 165
DQG+ CWAF+ VA VEG+N I TG+ V+ S+ +LVDC GC ++ AF++I
Sbjct: 142 DQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDCDREYDEGCNGGLMDYAFQFII 201
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
Q + +E YPYQG D CD + K I GY+ V E L+ VS QPVSV
Sbjct: 202 QNGGIDTEEDYPYQGI-DGTCD--ETKKKTKVVQIDGYEDVPSNNENALKKAVSHQPVSV 258
Query: 226 AIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AI+A+ Y GVFTG CG +HGV +VGYGT E YWLV+N WGT W E
Sbjct: 259 AIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT----ENGVDYWLVRNSWGTGWGED 314
Query: 284 GSMRIFRGVGGS--GLCNIAANAAYPL 308
G ++ R V + G C IA + +YP+
Sbjct: 315 GYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 173/316 (54%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +W E ++Y E+E R+ F+ N H F L LN+FADLT E+
Sbjct: 41 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 100
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ +Y G + + P R + L + + +S+DW +GAV +KDQG CWA
Sbjct: 101 YRDTYLGLR----NKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWA 156
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+A VEG+N+I TG L++ S+ +LVDC T GC ++ AF++I + +E
Sbjct: 157 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDTEDD 216
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G+ D CD R +A K I Y+ V P +E LQ V+ QPVSVAI+A F
Sbjct: 217 YPYKGK-DERCDVNRKNA--KVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQ 273
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R +
Sbjct: 274 LYSSGIFTGKCGTALDHGVAAVGYGT----ENGKDYWIVRNSWGKSWGESGYVRMERNIK 329
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 330 ASSGKCGIAVEPSYPL 345
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 168/317 (52%), Gaps = 36/317 (11%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-----------LNKFADLTREKFLASY 63
+E+W E + EK RF FK N ++ LN+F D+ RE+F A++
Sbjct: 46 YERWQ-EHHHVPRHHGEKHRRFGAFKDNVRYIHEHNKRAPGYPPLNRFGDMGREEFRATF 104
Query: 64 TGYKPPPTDHPHSNRSNWFKN------LNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
G H + R + + ++DW +GAVT VKDQG CW
Sbjct: 105 AG------SHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRKGAVTGVKDQGKCGSCW 158
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASEC 174
AF+ V +VEG+N IRTG+LV+ S+ +L+DC T + GC +ENAFEYI+ + +E
Sbjct: 159 AFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLMENAFEYIKHSGGITTES 218
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY+ + CD R A G I G+Q V +E L V+ QPVSVAIDA F
Sbjct: 219 AYPYRA-ANGTCDAVR--ARGGLVVIDGHQNVPANSEAALAKAVANQPVSVAIDAGDQSF 275
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVF G CG +HGV +VGYG T + YW+VKN WGT W EGG +R+ R
Sbjct: 276 QFYSDGVFAGDCGTDLDHGVAVVGYGETNDG---TEYWIVKNSWGTAWGEGGYIRMQRDS 332
Query: 293 G-GSGLCNIAANAAYPL 308
G GLC IA A+YP+
Sbjct: 333 GYDGGLCGIAMEASYPV 349
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 165/312 (52%), Gaps = 28/312 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E WMV+ + Y AEKE R IF+ N F L L FADL+ ++
Sbjct: 50 ESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVC 109
Query: 64 TGYKP-PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTA 120
G P PP +H S+ +K S+ S+DW GAVT VKDQG +C CWAF+
Sbjct: 110 HGADPRPPRNHVFMTSSDRYKT--SADDVLPKSVDWRNEGAVTEVKDQG-HCRSCWAFST 166
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
V VEGLNKI TG+LVT S+ L++C+ NGC LE A+E+I + L ++ YPY+
Sbjct: 167 VGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYK 226
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHG 237
+ CD R + K I GY+ + E L V+ QPV+ ID++ F Y
Sbjct: 227 A-VNGVCD-GRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYES 284
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF G CG NHGV +VGYGT E + YWLVKN G W E G M++ R + G
Sbjct: 285 GVFDGSCGTNLNHGVVVVGYGT----ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRG 340
Query: 297 LCNIAANAAYPL 308
LC IA A+YPL
Sbjct: 341 LCGIAMRASYPL 352
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 127/319 (39%), Positives = 172/319 (53%), Gaps = 40/319 (12%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+E W+V++ + Y EKE RF+IFK N +F L LNKFADL+ E++ A
Sbjct: 49 YEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADLSNEEYRA 108
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD------SIDWNERGAVTPVKDQGSY-C 114
+Y G + + S++ F D S+DW E+GAV PVKDQG
Sbjct: 109 AYLGTR-------MDGKRRLLGGPKSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGS 161
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLAS 172
CWAF+ V VEG+N+I TG L + S+ +LVDC + GC ++ AFE+I + + +
Sbjct: 162 CWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDT 221
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPY+ D CD R +A + I GY+ V E+ L+ V+ QPVSVAI+A
Sbjct: 222 EEDYPYKA-VDSMCDPNRKNA--RVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGR 278
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F Y GVFTG CG +HGV VGYGT E YW+V+N WG W E G +R+ R
Sbjct: 279 AFQLYQSGVFTGSCGTQLDHGVVAVGYGT----ENGVDYWVVRNSWGPAWGENGYIRMER 334
Query: 291 GVGG--SGLCNIAANAAYP 307
V +G C IA A+YP
Sbjct: 335 NVASTETGKCGIAMEASYP 353
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 205 bits (522), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 123/307 (40%), Positives = 173/307 (56%), Gaps = 35/307 (11%)
Query: 27 KDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFLASYTGYKPPPTDH 73
+D EK+ RF +FK+N H+F LRLNKFADLT +F ++Y G + +H
Sbjct: 49 RDLDEKQKRFNVFKENPRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSR---INH 105
Query: 74 PHSNR-------SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVE 125
S R +N F + S SIDW ++GAVT VKDQG CWAF+ VA VE
Sbjct: 106 HRSLRGSRRGGATNSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVE 165
Query: 126 GLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
G+N+I+T +L++ S+ +L+DC T NGC ++ AF++I++ ++SE YPY +D
Sbjct: 166 GINQIKTKKLLSLSEQELIDCDTDENNGCNGGLMDYAFDFIKKNGGISSEAEYPYAA-ED 224
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFT 241
YC + S +I G++ V E+ L V+ QPVS+AI+A+ F FY GVFT
Sbjct: 225 SYCATEKKS---HVVSIDGHEDVPANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGVFT 281
Query: 242 GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIA 301
G G +HGV IVGYG T + YW+V+N WG W E G +RI LC +A
Sbjct: 282 GRSGTELDHGVAIVGYGKTQQG---TKYWIVRNSWGAEWGEKGYIRISAASDSKRLCGLA 338
Query: 302 ANAAYPL 308
A+YP+
Sbjct: 339 MEASYPI 345
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 122/311 (39%), Positives = 163/311 (52%), Gaps = 26/311 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E WMV+ + Y+ AEKE R IF+ N F L LN+FADL+ ++
Sbjct: 57 ESWMVKHGKVYESVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYAQIC 116
Query: 64 TGYKP-PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWAFTAV 121
G P PP +H SN +K + + S+DW GAVT VKDQG CWAF+ V
Sbjct: 117 HGADPRPPRNHVFMTSSNRYKTSDGDVLP--KSVDWRNEGAVTEVKDQGQCRSCWAFSTV 174
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEGLNKI TG+LVT S+ L++C+ NGC +E A+E+I L ++ YPY+
Sbjct: 175 GAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKA 234
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGG 238
D R + K I GY+ + E L V+ QPV+ +D++ F Y G
Sbjct: 235 LNGVCND--RLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASG 292
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG NHGV +VGYGT E + YW+V+N G W E G M++ R + GL
Sbjct: 293 VFDGTCGTNLNHGVVVVGYGT----ENGRDYWIVRNSRGNTWGEAGYMKMARNIANPRGL 348
Query: 298 CNIAANAAYPL 308
C IA A+YPL
Sbjct: 349 CGIAMRASYPL 359
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 172/316 (54%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +W E + Y E+E R+ F+ N H F L LN+FADLT E+
Sbjct: 40 YAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ +Y G + + P R + L + + +S+DW +GAV +KDQG CWA
Sbjct: 100 YRDTYLGLR----NKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+A VEG+N+I TG L++ S+ +LVDC T GC ++ AF++I + +E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDTEDD 215
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G+ D CD R +A K I Y+ V P +E LQ V+ QPVSVAI+A F
Sbjct: 216 YPYKGK-DERCDVNRKNA--KVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRAFQ 272
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R +
Sbjct: 273 LYSSGIFTGKCGTALDHGVAAVGYGT----ENGKDYWIVRNSWGKSWGESGYVRMERNIK 328
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 329 ASSGKCGIAVEPSYPL 344
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 127/312 (40%), Positives = 165/312 (52%), Gaps = 28/312 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E WMV+ + Y AEKE R IF+ N F L L FADL+ ++
Sbjct: 43 ESWMVKHGKVYGSVAEKERRLTIFEDNLRFINNRNAENLSYRLGLTGFADLSLHEYKEVC 102
Query: 64 TGYKP-PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTA 120
G P PP +H S+ +K S+ S+DW GAVT VKDQG +C CWAF+
Sbjct: 103 HGADPRPPRNHVFMTSSDRYKT--SADDVLPKSVDWRNEGAVTEVKDQG-HCRSCWAFST 159
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
V VEGLNKI TG+LVT S+ L++C+ NGC LE A+E+I + L ++ YPY+
Sbjct: 160 VGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYK 219
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHG 237
+ CD R + K I GY+ + E L V+ QPV+ ID++ F Y
Sbjct: 220 A-VNGVCD-GRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYES 277
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF G CG NHGV +VGYGT E + YWLVKN G W E G M++ R + G
Sbjct: 278 GVFDGSCGTNLNHGVVVVGYGT----ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRG 333
Query: 297 LCNIAANAAYPL 308
LC IA A+YPL
Sbjct: 334 LCGIAMRASYPL 345
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 205 bits (521), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 177/316 (56%), Gaps = 40/316 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ +HE WMVE+ R YKD AEK RF++FK N F L +N+FADLT E
Sbjct: 32 MVERHENWMVEYGRVYKDAAEKARRFQVFKDNVAFVESFNTNKNNKFWLGVNQFADLTTE 91
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYCCWA 117
+F A+ G+KP P + FK N S + ++DW +GAVTP+K+QG
Sbjct: 92 EFKAN-KGFKPTAEKVPTTG----FKYENLSVSALPTAVDWRTKGAVTPIKNQGQ----- 141
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASEC 174
A +EG+ K+ TG L++ S+ +LVDC T + GC ++++AFE++ + LA+E
Sbjct: 142 ---CAAMEGIVKLSTGNLISLSEQELVDCDTHSMDEGCEGGWMDSAFEFVIKNGGLATES 198
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WF 232
YPY+ D C SA+ I+G++ V E L V+ QPVSVA+DA+ F
Sbjct: 199 NYPYKA-VDGKCKGGSKSAA----TIKGHEDVPVNNEAALMKAVANQPVSVAVDASDRTF 253
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GGV TG CG +HG+ +GYG E++G + YW++KN WGT W E G +R+ + +
Sbjct: 254 MLYSGGVMTGSCGTELDHGIAAIGYG--MESDGTK-YWILKNSWGTTWGEKGFLRMEKDI 310
Query: 293 GGS-GLCNIAANAAYP 307
G+C +A +YP
Sbjct: 311 TDKRGMCGLAMKPSYP 326
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 121/321 (37%), Positives = 174/321 (54%), Gaps = 30/321 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTR 56
+ +HEQWM + R YKD AEK RF+ F+ N F L +N+F DLT
Sbjct: 33 MVERHEQWMAQHGRVYKDGAEKARRFEAFRNNVVFIESFNAAGNRRKFWLGVNQFTDLTN 92
Query: 57 EKFLASYTGYKPPPTDHPHSNRSN---WFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
++F A+ T + N+++ F+ N S + ++DW +GAVTP+K+QG
Sbjct: 93 DEFRATKTNKGFIKRNAAAVNKASPTGTFRYSNVSADALPAAVDWRAKGAVTPIKNQGQC 152
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQR 169
CCWAF+AVA EG+ ++ TG+LV S+ +LVDC +GC +++AFE+I +
Sbjct: 153 GCCWAFSAVAATEGIVQLSTGKLVPLSEQELVDCDANGADHGCEGGEMDDAFEFIIKNGG 212
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L SE YPY QD C + S I+GY+ V E L V+ QPVSVA+D
Sbjct: 213 LTSETNYPYTA-QDGQCKAKNTINS--VATIKGYEDVPANDEASLMKAVAAQPVSVAVDG 269
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F Y GGV +G CG + +HG+ VGYG A+ +WL+KN WGT W E G +R
Sbjct: 270 GDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGA---ADDGTKFWLMKNSWGTTWGEDGYIR 326
Query: 288 IFRGVGGS-GLCNIAANAAYP 307
+ + V + G+C +A +YP
Sbjct: 327 MEKDVADAGGMCGLAMQPSYP 347
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 205 bits (521), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 121/271 (44%), Positives = 166/271 (61%), Gaps = 21/271 (7%)
Query: 46 LRLNKFADLTREKFLASYTGYKPPPTDHPHSN--RSNWFKNLNSSKMSFYDSIDWNERGA 103
L +NKFADLT E+F AS +K H S+ R+ FK N+S + ++DW ++GA
Sbjct: 12 LGINKFADLTNEEFKASRNKFKG----HMCSSIIRTTTFKYENASAIP--STVDWRKKGA 65
Query: 104 VTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLEN 159
VTPVK+QG CWAF+AVA EG++++ TG+LV+ S+ +L+DC T GC +++
Sbjct: 66 VTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGGLMDD 125
Query: 160 AFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS 219
AF++I Q L++E YPY+G D C+ + AS I GY+ V E LQ V+
Sbjct: 126 AFKFIIQNHGLSTEVQYPYEGV-DGTCN--TNEASIHAVTITGYEDVPANNELALQKAVA 182
Query: 220 RQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
QP+SVAIDA+ F FY+ GVFTG CG +HGVT VGYG + YWLVKN WG
Sbjct: 183 NQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGNDG---TKYWLVKNSWG 239
Query: 278 TNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
+W E G +R+ RG+ + GLC IA A+YP
Sbjct: 240 ADWGEEGYIRMQRGIDAAEGLCGIAMQASYP 270
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 129/314 (41%), Positives = 167/314 (53%), Gaps = 51/314 (16%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFL 60
+HE WM ++ R YKD EK R+KIFK N F L +N+FADLT E+F
Sbjct: 38 RHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIESFNKAMDKSYKLSINEFADLTNEEFG 97
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
S +K H S + FK N + + +IDW ++GAVTP+KDQG CWAF+
Sbjct: 98 TSRNRFKA----HICSTEATSFKYENVTAVP--STIDWRKKGAVTPIKDQGQCGSCWAFS 151
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AVA +EG+ ++ TG+L++ S+ +LVDC T GC + Y
Sbjct: 152 AVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGC-------------------NGANY 192
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY G D C+ R A+ I GY+ V E+ LQ V QP++VAIDA F F
Sbjct: 193 PYAGT-DGTCN--RKKAAHPAAKINGYEDVPANNEKALQKAVVHQPIAVAIDAGGFEFQF 249
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-G 293
Y GVFTG CG +HGV VGYGT+ + YWLVKN WGT W E G +R+ R V
Sbjct: 250 YSSGVFTGQCGTELDHGVAAVGYGTSDDG---MKYWLVKNSWGTGWGEEGYIRMQRDVTA 306
Query: 294 GSGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 307 KEGLCGIAMQASYP 320
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/322 (39%), Positives = 173/322 (53%), Gaps = 32/322 (9%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFAD 53
H I QW+ +R Y+ +EK RF+IFK+N + L LNKF+D
Sbjct: 40 HSDDAILDVFHQWLETHSRVYRSLSEKHHRFQIFKENFLYIHAHNKQQKSYWLGLNKFSD 99
Query: 54 LTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
LT ++F A Y G KP NR N + +DW +GAVT VKDQG+
Sbjct: 100 LTHQEFRAQYLGTKP-------VNRQRKEANFMYEDVEAEPKVDWRLKGAVTDVKDQGAC 152
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRL 170
CWAF+AV +VEG+N I+TG+LV+ S+ +LVDC GC ++ AFE+I + +
Sbjct: 153 GSCWAFSAVGSVEGVNAIKTGELVSLSEQELVDCDRKQNQGCNGGLMDYAFEFIIKNGGI 212
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT 230
+E YPY+ R D CD R ++ K I YQ V +E L +++ PVSVAI+A
Sbjct: 213 DTEKDYPYKAR-DGRCDEGRRNS--KVVVIDDYQDVPTQSESALMKALTKNPVSVAIEAG 269
Query: 231 WFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
+F Y GGVFTGPCG+ +HGV VGYGT + YW+VKN WG W E G +R+
Sbjct: 270 GRDFQHYQGGVFTGPCGSELDHGVLAVGYGTDDDGVN---YWIVKNSWGPGWGEKGYIRM 326
Query: 289 --FRGVGGSGLCNIAANAAYPL 308
F G C I A++P+
Sbjct: 327 ERFGSDSTDGKCGINIEASFPI 348
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 125/298 (41%), Positives = 166/298 (55%), Gaps = 28/298 (9%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPP--TDHPHS 76
EK RF +FK N H F L+LNKFAD+T +F Y G K T S
Sbjct: 53 EKHKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRQHYAGSKIKHHRTLLGAS 112
Query: 77 NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQL 135
+ F N + SIDW ++GAVTPVKDQG CWAF+ V VEG+N+I+T +L
Sbjct: 113 RANGTFMYANEDNVP--PSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKL 170
Query: 136 VTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSA 193
V+ S+ +LVDC T GC ++ AF++I++ + +E YPY+ D CD + +
Sbjct: 171 VSLSEQELVDCDTTENQGCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDK-CDIQKRNT 229
Query: 194 SGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGGVFTGPCGNTPNHG 251
+I G++ V P E+ L V+ QP+SVAIDA+ F FY GVFTG CG +HG
Sbjct: 230 P--VVSIDGHEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHG 287
Query: 252 VTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
V IVGYGTT + YW+VKN WG W E G +R+ R V GLC IA +YP+
Sbjct: 288 VAIVGYGTTVDG---TKYWIVKNSWGAGWGEKGYIRMQRKVDAEEGLCGIAMQPSYPI 342
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 127/324 (39%), Positives = 175/324 (54%), Gaps = 42/324 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTRE 57
+ ++E W+ E R Y EKE RF+IFK N F+ LN+FADLT E
Sbjct: 46 VKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFADLTNE 105
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKM-------SFYDSIDWNERGAVTPVKDQ 110
++ Y G K R + K+ N S+ S+DW +RGAV P+K+Q
Sbjct: 106 EYRTMYLGTKSDA-------RRRFVKSKNPSQRYASRPNELMPHSVDWRKRGAVAPIKNQ 158
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQY 167
GS CWAF+ VA V G+N+I TG+++T S+ +LVDC + +GC ++ AFE+I
Sbjct: 159 GSCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISN 218
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
+ +E YPY+G + CD R + K +I GY+ V P E LQ V+ QPV VAI
Sbjct: 219 GGMDTEKHYPYRGVEG-RCDPVRKNY--KVVSIDGYEDV-PRNERALQKAVAHQPVCVAI 274
Query: 228 DAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+A+ F Y GVFTG CG +HGV +VGYG+ E YW+V+N WGT W E G
Sbjct: 275 EASGRAFQLYSSGVFTGECGEEVDHGVVVVGYGS----EDGVDYWIVRNSWGTKWGENGY 330
Query: 286 MRIFRGVGGS--GLCNIAANAAYP 307
+++ R V S G C I A+YP
Sbjct: 331 VKMERNVKKSHLGKCGIMTEASYP 354
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 170/311 (54%), Gaps = 35/311 (11%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------NKFADLTREKFLASYTG 65
W+ R YK E+E+RF I++ N ++++ NKFADLT E+F ++Y G
Sbjct: 49 WVKRHGRKYKHNDEREVRFGIYQANVQYIQCKNAQKNSYNLTDNKFADLTNEEFQSTYMG 108
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVAT 123
H R + +L SK DW + GAVT + DQG C CWAF AVA
Sbjct: 109 LSTRLRSHNTGFRYDEHGDLPESK-------DWRKEGAVTEIMDQGQ-CGGCWAFAAVAA 160
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+NKI++G+L++ S+ +L+DC + GC +E A+ +I + L +E YPY+G
Sbjct: 161 VEGINKIKSGKLISLSEQELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEG 220
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
D C + A+ +I GY+ V E L+ + QPVSVAIDA F FY G
Sbjct: 221 V-DGTCKMEK--AAHYAASISGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEG 277
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG-VGGSGL 297
VF+G CG NHGVT+VGYG T YW+VKN WG +W E G +R+ R + G+
Sbjct: 278 VFSGICGKQLNHGVTVVGYGKET----INKYWIVKNSWGADWGESGYIRMKRDTLSKEGM 333
Query: 298 CNIAANAAYPL 308
C IA A+YPL
Sbjct: 334 CGIAMQASYPL 344
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 204 bits (520), Expect = 3e-50, Method: Compositional matrix adjust.
Identities = 129/323 (39%), Positives = 175/323 (54%), Gaps = 35/323 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQA--------EKEMRFKIFKKNHEF------------LRLNK 50
+ A + WM++ ++Y D A EK R+ IFK N F L LN
Sbjct: 53 LQALFDSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGENEKNQGYFLGLNA 112
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F A G + + S+ + ++ + DSIDW E+GAV VKDQ
Sbjct: 113 FADLTNEEFRAQRHGGRFDRSRERTSHEEFRYGSVQLKDLP--DSIDWREKGAVVGVKDQ 170
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQY 167
GS CWAF+AVA +EG+NK+ TG+LV+ S+ +LVDC GC ++ AF ++ +
Sbjct: 171 GSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKN 230
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
L +E YPY+G CD RS + K I GY+ V E L V+ QPVSVAI
Sbjct: 231 GGLDTEADYPYKGYG-TRCD--RSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAI 287
Query: 228 DA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
DA + FY G+FTG CG +HGVT VGYG E + YW++KN WG+NW E G
Sbjct: 288 DAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGK----EDGKAYWIIKNSWGSNWGEKGY 343
Query: 286 MRIFRGVG-GSGLCNIAANAAYP 307
+++ R G +GLC I A+YP
Sbjct: 344 VKMARNTGLAAGLCGINMEASYP 366
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 204 bits (519), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 122/318 (38%), Positives = 176/318 (55%), Gaps = 29/318 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
++A +E W++E ++Y EK+ RF+IFK N ++ L L KFADLT E
Sbjct: 45 VSALYESWLIEHGKSYNALGEKDKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNE 104
Query: 58 KFLASYTGYKPPPTDHPHS-NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
++ + Y G K S N+S+ + L S +S+DW ++G + VKDQGS C
Sbjct: 105 EYRSIYLGTKSSGDRRKLSKNKSDRY--LPKVGDSLPESVDWRDKGVLVGVKDQGSCGSC 162
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASE 173
WAF+AVA +E +N I TG L++ S+ +LVDC S GC ++ AFE++ + +E
Sbjct: 163 WAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTE 222
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY+ R D CD +R +A K I Y+ V E+ LQ V+ QPVS+AI+A +
Sbjct: 223 EDYPYKERND-VCDQYRKNA--KVVKIDSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRD 279
Query: 234 FYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
H G+FTG CG +HGV GYG+ E YW+V+N WG W E G +R+ R
Sbjct: 280 LQHYKSGIFTGKCGTAVDHGVVAAGYGS----ENGMDYWIVRNSWGAKWGEKGYLRVQRN 335
Query: 292 VG-GSGLCNIAANAAYPL 308
V SGLC +A +YP+
Sbjct: 336 VASSSGLCGLATEPSYPV 353
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 172/311 (55%), Gaps = 29/311 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE-------------FLRLNKFADLTREKFLAS 62
E+W+ ++ + Y EK RF++FK N +L LN FADLT ++F A+
Sbjct: 73 EEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLTHDEFKAT 132
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
Y G P T S + + S+DW ++GAVT VK+QG CWAF+ V
Sbjct: 133 YLGLLPKRT----SGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCGSCWAFSTV 188
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N+I TG L + S+ QLVDCST NGC+ ++NAF +I L SE YPY
Sbjct: 189 AAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGAGLRSEEAYPYL 248
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHG 237
++ CD R+ I GY+ V E+ L ++ QPVSVAI+A+ F FY G
Sbjct: 249 -MEEGDCD-DRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSG 306
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF GPCG+ +HGV VGYG++ Q Y +VKN WGT+W E G +R+ RG G G
Sbjct: 307 GVFDGPCGSELDHGVAAVGYGSSK----GQDYIIVKNSWGTHWGEKGYIRMKRGTGKPEG 362
Query: 297 LCNIAANAAYP 307
LC I A+YP
Sbjct: 363 LCGINKMASYP 373
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 204 bits (518), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 125/312 (40%), Positives = 167/312 (53%), Gaps = 28/312 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E W+V+ + Y AEKE R IFK N F L LN+FADL+ ++
Sbjct: 65 ESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSENLGYRLGLNRFADLSLHEYKEIC 124
Query: 64 TGYKP-PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTA 120
G P PP +H + S+ +K S+ S+DW GAVT VKDQG +C CWAF+
Sbjct: 125 HGADPKPPRNHVFMSSSDRYKT--SAGDVLPKSVDWRNEGAVTEVKDQG-HCRSCWAFST 181
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
V VEGLNKI TG+LVT S+ L++C+ NGC +E A+E+I L ++ YPY+
Sbjct: 182 VGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIVSNGGLGTDNDYPYK 241
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHG 237
+ CD R + K I GY+ + E L V+ QPV+ ID++ F Y
Sbjct: 242 A-VNGACD-GRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYES 299
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF G CG NHGV +VGYGT E + YW+V+N WG W E G M++ R + G
Sbjct: 300 GVFDGRCGTNLNHGVVVVGYGT----ENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRG 355
Query: 297 LCNIAANAAYPL 308
LC IA +YPL
Sbjct: 356 LCGIAMRVSYPL 367
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 121/304 (39%), Positives = 169/304 (55%), Gaps = 29/304 (9%)
Query: 28 DQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTGYKPP---PT 71
D E RF +F +N + L LNKFAD+T ++F +Y G +
Sbjct: 63 DDGEARRRFNVFVENARYIHEANRRGGRPFRLALNKFADMTTDEFRRTYAGSRARHHRSL 122
Query: 72 DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKI 130
F+ + + ++DW ERGAVT +KDQG CWAF+AVA VEG+NKI
Sbjct: 123 RGGRGGEGGSFRYGGDDEDNLPPAVDWRERGAVTGIKDQGQCGSCWAFSAVAAVEGVNKI 182
Query: 131 RTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDW 188
+TG+LVT S+ +LVDC T + GC ++ AF++I++ + +E YPY+ Q C+
Sbjct: 183 KTGRLVTLSEQELVDCDTGDNQGCDGGLMDYAFQFIKRNGGITTESNYPYRAEQG-RCN- 240
Query: 189 WRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGN 246
++ AS I GY+ V E LQ V+ QPV+VA++A+ F FY GVFTG CG
Sbjct: 241 -KAKASSHDVTIDGYEDVPANDESALQKAVANQPVAVAVEASGQDFQFYSEGVFTGECGT 299
Query: 247 TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG--GSGLCNIAANA 304
+HGV VGYG T + YW+VKN WG +W E G +R+ RGV +GLC IA A
Sbjct: 300 DLDHGVAAVGYGITRDG---TKYWIVKNSWGEDWGERGYIRMQRGVSSDSNGLCGIAMEA 356
Query: 305 AYPL 308
+YP+
Sbjct: 357 SYPV 360
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 120/292 (41%), Positives = 165/292 (56%), Gaps = 24/292 (8%)
Query: 35 RFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWF 82
RF +FK+N ++ L LNKFAD+T ++ SY G + R
Sbjct: 68 RFNVFKENVKYIHEANKKDRPFRLALNKFADMTTDELRHSYAGSRVRHHRALSGGRRAQG 127
Query: 83 KNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKH 141
S + ++DW E+GAVT +KDQG CWAF+ +A VE +NKIRTG+LV+ S+
Sbjct: 128 NFTYSDAENLPPAVDWREKGAVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLVSLSEQ 187
Query: 142 QLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA 199
+L+DC +N GC ++ AF++I++ + SE YPYQG+Q+ CD + + A
Sbjct: 188 ELMDCDNVNDQGCDGGLMDYAFQFIQKNGGVTSEANYPYQGQQN-TCDQAKENTHDV--A 244
Query: 200 IRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGY 257
I GY+ V E LQ V+ QPVSVAI+A+ F FY GVFTG C +HGV VGY
Sbjct: 245 IDGYEDVPANDESALQKAVAYQPVSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAVGY 304
Query: 258 GTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
GT + YW+VKN WG +W E G +R+ RGV GLC IA A+YP+
Sbjct: 305 GTARDG---TKYWIVKNSWGLDWGEKGYIRMQRGVSQAEGLCGIAMQASYPI 353
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 133/353 (37%), Positives = 180/353 (50%), Gaps = 58/353 (16%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFL-----------RL--NKFADLTREKFL 60
+ EQWM R Y D EK+ R +++++N + RL NKFADLT E+F
Sbjct: 31 RFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLTNEEFR 90
Query: 61 ASYTGY-KPPPTDHP--HSNRSNWFKNLNSSKMSFYD-----SIDWNERGAVTPVKDQGS 112
A G+ +PPP H+ + S Y S+DW E+GAV PVK+QG
Sbjct: 91 AKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSDELPKSVDWREKGAVAPVKNQGE 150
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRL 170
CWAF+AVA +EG+N+I+ G+LV+ S+ +LVDC T GCA ++ AFE++ L
Sbjct: 151 CGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVMNNSGL 210
Query: 171 ASECVYPYQGR----------QDYYCDWWRSSASGKYG---------------AIRGYQY 205
+E YPYQG + C S+ + G +I GY
Sbjct: 211 TTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESAVSISGYVN 270
Query: 206 VQPATEEGLQDVVSRQPVSVAIDATWF--NFYHGGVFTGPCGNTPNHGVTIVGYGTT--- 260
V ++E L + QPVSVA+DA F Y GGVFTGPC NHGVT+VGYG T
Sbjct: 271 VTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVVGYGETQRD 330
Query: 261 TEAEGQ----QPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
T+ +G Q YW+VKN WG W + G + + R SGLC IA +YP+
Sbjct: 331 TDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYPV 383
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 123/318 (38%), Positives = 174/318 (54%), Gaps = 37/318 (11%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +W E ++Y E+E R+ F+ N H F L LN+FADLT E+
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYCC 115
+ +Y G + + P R + L + + +S+DW +GAV +KDQ GS C
Sbjct: 100 YRDTYLGLR----NKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQEVAGS--C 153
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+A+A VEG+N+I TG L++ S+ +LVDC T GC ++ AF++I + +E
Sbjct: 154 WAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDTE 213
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY+G+ D CD R +A K I Y+ V P +E LQ V+ QPVSVAI+A
Sbjct: 214 DDYPYKGK-DERCDVNRKNA--KVVTIDSYEDVTPNSETSLQKAVANQPVSVAIEAGGRA 270
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+FTG CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R
Sbjct: 271 FQLYSSGIFTGKCGTALDHGVAAVGYGT----ENGKDYWIVRNSWGKSWGESGYVRMERN 326
Query: 292 V-GGSGLCNIAANAAYPL 308
+ SG C IA +YPL
Sbjct: 327 IKASSGKCGIAVEPSYPL 344
>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 356
Score = 204 bits (518), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 124/322 (38%), Positives = 174/322 (54%), Gaps = 35/322 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM +F R Y D EK R ++F N + L LNKF+DLT ++F+
Sbjct: 38 RHEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFV 97
Query: 61 ASYTGYKPPPTD--HPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
++ GY+ P + L + +S+DW +GAVT VK+QGS CCWA
Sbjct: 98 QTHLGYRGHQQGGLRPEEENVSKVAALGYGQADMPESVDWRAQGAVTGVKNQGSCGCCWA 157
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCS-------TLNGCAKNFLENAFEYIRQYQRL 170
F AVA EGL KI TG L++ S+ Q++DC+ N C +++A Y+ + L
Sbjct: 158 FAAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAASRGL 217
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG-LQDVVSRQPVSVAIDA 229
E Y Y G Q + +++ +G Q V +EG LQ +V+ QP++V+++A
Sbjct: 218 QPEAAYAYTGLQGACQSGFTPNSAASFGEP---QTVTLQGDEGRLQGLVAGQPIAVSVEA 274
Query: 230 TW-FNFYHGGVFTG---PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+ F Y GVFT CG NH VT+VGYG+ A+G Q YWLVKN+WGT+W EGG
Sbjct: 275 SDDFRHYMSGVFTAGTSSCGQRLNHAVTVVGYGS---ADGGQEYWLVKNQWGTSWGEGGY 331
Query: 286 MRIFRGVGGSGLCNIAANAAYP 307
MRI RG G C I+A A YP
Sbjct: 332 MRIARGNGAPN-CGISAYAYYP 352
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 203 bits (517), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 133/330 (40%), Positives = 180/330 (54%), Gaps = 33/330 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------L 48
SR + I H++WM+ F+R Y D+ EK+MR ++F +N +F+ +
Sbjct: 25 SRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSYKLGV 84
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHS--NRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
NKF D T+E+FLA++TG P N + N S + + DW GAVTP
Sbjct: 85 NKFTDWTKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDV-LGTTKDWRNEGAVTP 143
Query: 107 VKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFE 162
VK QG C CWAF+A+A VEGL KI G L++ S+ QL+DC+ NGC + AF
Sbjct: 144 VKYQGE-CGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEAFN 202
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
YI + ++SE YPYQ ++ C RS+ IRG++ V E L + VSRQP
Sbjct: 203 YIVKNGGVSSENAYPYQVKEG-PC---RSNDIPAI-VIRGFENVPSNNERALLEAVSRQP 257
Query: 223 VSVAIDA--TWFNFYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
V+V IDA T F Y GGV+ CG + NH VT+VGYGT+ E YWL KN WG
Sbjct: 258 VAVDIDASETGFIHYSGGVYNARDCGTSVNHAVTLVGYGTSQEG---IKYWLAKNSWGKT 314
Query: 280 WDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W E G +RI R V G+C +A A+YP+
Sbjct: 315 WGENGYIRIRRDVEWPQGMCGVAQYASYPV 344
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 128/311 (41%), Positives = 172/311 (55%), Gaps = 29/311 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE-------------FLRLNKFADLTREKFLAS 62
E+W+ ++ + Y EK RF++FK N +L LN FADLT ++F A+
Sbjct: 87 EEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLTHDEFKAT 146
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
Y G P T S + + S+DW ++GAVT VK+QG CWAF+ V
Sbjct: 147 YLGLLPKRT----SGGRFRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCGSCWAFSTV 202
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N+I TG L + S+ QLVDCST NGC+ ++NAF +I L SE YPY
Sbjct: 203 AAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGAGLRSEEAYPYL 262
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHG 237
++ CD R+ I GY+ V E+ L ++ QPVSVAI+A+ F FY G
Sbjct: 263 -MEEGDCD-DRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQFYSG 320
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF GPCG+ +HGV VGYG++ Q Y +VKN WGT+W E G +R+ RG G G
Sbjct: 321 GVFDGPCGSELDHGVAAVGYGSSK----GQDYIIVKNSWGTHWGEKGYIRMKRGTGKPEG 376
Query: 297 LCNIAANAAYP 307
LC I A+YP
Sbjct: 377 LCGINKMASYP 387
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 203 bits (517), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 126/320 (39%), Positives = 174/320 (54%), Gaps = 34/320 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+E+W R + AEK RF FK N F LRLN+F D+++ +F A
Sbjct: 46 YERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGDMSQAEFRA 104
Query: 62 SYTGYKP-------PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
++ G + P T P S + +N S + S+DW ++GAVT VK+QG
Sbjct: 105 TFAGSRVSDRRRDGPAT--PPSVPGFMYAAVNVSDLP--RSVDWRQKGAVTGVKNQGKCG 160
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLA 171
CWAF+ V +VEG+N IRTG+LV+ S+ +L+DC T +GC ++NAFEYI++ L
Sbjct: 161 SCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIKKNGGLT 220
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT- 230
+E YPY+ + +S I G+Q V +EE L V+ QPVSV IDA+
Sbjct: 221 TEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGIDASG 280
Query: 231 -WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GVFTG CG +HGV +VGYG AE + YW VKN WG +W E G +R+
Sbjct: 281 KAFMFYSEGVFTGECGTELDHGVAVVGYGV---AEDGKAYWTVKNSWGPSWGEKGYIRVE 337
Query: 290 RGVGGS-GLCNIAANAAYPL 308
+ G GLC IA A+Y +
Sbjct: 338 KDSGAEGGLCGIAMEASYAV 357
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 203 bits (517), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 121/314 (38%), Positives = 178/314 (56%), Gaps = 29/314 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKFL 60
+E W+ E R Y E++ RF++F N F L +N+FADLT ++F
Sbjct: 109 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 168
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A+Y G + P + + +++ ++ +S+DW E+GAV PVK+QG CWAF+
Sbjct: 169 AAYLGARIPASRRRGTAVGERYRHGGGAE-ELPESVDWREKGAVAPVKNQGQCGSCWAFS 227
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AV++VE +N+I TG++VT S+ +LV+CST +GC ++ AF++I + + +E Y
Sbjct: 228 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 287
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNF 234
PY+ D CD R +A K +I G++ V E+ LQ V+ QPVSVAI+A F
Sbjct: 288 PYKA-VDGKCDINRENA--KVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQL 344
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG C +HGV VGYGT E + YW+V+N WG W E G +R+ R V
Sbjct: 345 YKAGVFTGTCTTNLDHGVVAVGYGT----ENGKDYWIVRNSWGAKWGEDGYIRMERNVNA 400
Query: 295 -SGLCNIAANAAYP 307
+G C IA A+YP
Sbjct: 401 TTGKCGIAMMASYP 414
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 121/314 (38%), Positives = 178/314 (56%), Gaps = 29/314 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKFL 60
+E W+ E R Y E++ RF++F N F L +N+FADLT ++F
Sbjct: 52 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 111
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A+Y G + P + + +++ ++ +S+DW E+GAV PVK+QG CWAF+
Sbjct: 112 AAYLGARIPASRRRGTAVGERYRHGGGAE-ELPESVDWREKGAVAPVKNQGQCGSCWAFS 170
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AV++VE +N+I TG++VT S+ +LV+CST +GC ++ AF++I + + +E Y
Sbjct: 171 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 230
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY+ D CD R +A K +I G++ V E+ LQ V+ QPVSVAI+A F
Sbjct: 231 PYKA-VDGKCDINRENA--KVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQL 287
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG C +HGV VGYGT E + YW+V+N WG W E G +R+ R V
Sbjct: 288 YKAGVFTGTCTTNLDHGVVAVGYGT----ENGKDYWIVRNSWGAKWGEDGYIRMERNVNA 343
Query: 295 -SGLCNIAANAAYP 307
+G C IA A+YP
Sbjct: 344 TTGKCGIAMMASYP 357
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 203 bits (516), Expect = 8e-50, Method: Compositional matrix adjust.
Identities = 124/315 (39%), Positives = 175/315 (55%), Gaps = 33/315 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +WM E TY E+E RF+ F+ N H F L LN+FADLT E+
Sbjct: 43 YAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQHNAAADAGVHSFRLGLNRFADLTNEE 102
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ ++Y G + P S ++ ++ ++ +S+DW ++GAV VKDQG CWA
Sbjct: 103 YRSTYLGARTKP--DRERKLSARYQAADNDELP--ESVDWRKKGAVGAVKDQGGCGSCWA 158
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+A VEG+N+I TG ++ S+ +LVDC T GC ++ AFE+I + SE
Sbjct: 159 FSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDSEED 218
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+ R D CD + +A K I GY+ V +E+ LQ V+ QP+SVAI+A F
Sbjct: 219 YPYKER-DNRCDANKKNA--KVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQ 275
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGV VGYGT E + YWLV+N WG+ W E G +R+ R +
Sbjct: 276 LYKSGIFTGTCGTALDHGVAAVGYGT----ENGKDYWLVRNSWGSVWGEDGYIRMERNIK 331
Query: 293 GGSGLCNIAANAAYP 307
SG C IA +YP
Sbjct: 332 ASSGKCGIAVEPSYP 346
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 123/300 (41%), Positives = 166/300 (55%), Gaps = 30/300 (10%)
Query: 30 AEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDHPHSN 77
EK RF +FK N L+LNKFAD+T +F ++Y G K +HP
Sbjct: 54 GEKHKRFNVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSK---VNHPRMF 110
Query: 78 RSNWFKN---LNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
R +N + +S S+DW ++GAVT VKDQG CWAF+ V VEG+N+I+T
Sbjct: 111 RGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTN 170
Query: 134 QLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
+LV S+ +LVDC GC +E+AFE+I+Q + +E YPY+ Q+ CD S
Sbjct: 171 KLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKA-QEGTCD--AS 227
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPN 249
+ +I G++ V E+ L V+ QPVSVAIDA + F FY GVFTG C N
Sbjct: 228 KVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTDLN 287
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
HGV IVGYGTT + YW+V+N WG W E G +R+ R + GLC IA +YP+
Sbjct: 288 HGVAIVGYGTTVDGTN---YWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLPSYPI 344
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 162/311 (52%), Gaps = 26/311 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E WMV+ + Y AEKE R IF+ N F L LN+FADL+ ++
Sbjct: 57 ESWMVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAENLSYRLGLNRFADLSLHEYGEIC 116
Query: 64 TGYKP-PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG-SYCCWAFTAV 121
G P PP +H SN +K + + S+DW GAVT VKDQG CWAF+ V
Sbjct: 117 HGADPRPPRNHVFMTSSNRYKTSDGDVLP--KSVDWRNEGAVTEVKDQGLCRSCWAFSTV 174
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEGLNKI TG+LVT S+ L++C+ NGC +E A+E+I L ++ YPY+
Sbjct: 175 GAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKA 234
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGG 238
C+ R K I GY+ + E L V+ QPV+ +D++ F Y G
Sbjct: 235 LNG-VCE-GRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESG 292
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG NHGV +VGYGT E + YW+VKN G W E G M++ R + GL
Sbjct: 293 VFDGTCGTNLNHGVVVVGYGT----ENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGL 348
Query: 298 CNIAANAAYPL 308
C IA A+YPL
Sbjct: 349 CGIAMRASYPL 359
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 203 bits (516), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 175/321 (54%), Gaps = 38/321 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKF 59
++E+W + RTYKD EK RF++F+ N F L NKFADLT E+F
Sbjct: 48 RYEKWAADHGRTYKDSLEKARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEF 107
Query: 60 LASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC-CWA 117
A Y G P P S + + N+ +S + +I+W +RGAVT VK+Q CWA
Sbjct: 108 -AEYYGR---PFSTPVIGGSGFMYGNVRTSDVP--ANINWRDRGAVTQVKNQKDCASCWA 161
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+AVA VEG+++IR+ LV S QL+DCST +GC + ++ AF YI +A+E
Sbjct: 162 FSAVAAVEGIHQIRSHNLVALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAES 221
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WF 232
YPY+ R C R+S +IRG+QYV P E L V+ QPVSVA+D
Sbjct: 222 DYPYEDRALGTC---RASGKPVAASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVS 278
Query: 233 NFYHGGVFTG----PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
F+ GVF C NH +T VGYGT E YWL+KN WGT+W EGG M+I
Sbjct: 279 QFFSSGVFGAMQNETCTTDLNHAMTAVGYGTD---EHGTKYWLMKNSWGTDWGEGGYMKI 335
Query: 289 FRGVGG-SGLCNIAANAAYPL 308
R V +GLC +A +YP+
Sbjct: 336 ARDVASNTGLCGLAMQPSYPV 356
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 202 bits (515), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 123/318 (38%), Positives = 177/318 (55%), Gaps = 33/318 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREK 58
++ W + AR+Y E E R +IF+ N F L L +FADLT E+
Sbjct: 47 YQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRFADLTNEE 106
Query: 59 FLASYTGYKPPPTDHPHSNR--SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+ ++Y G + + ++ SN ++ +S + DSIDW ++GAV VKDQGS C
Sbjct: 107 YRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLP--DSIDWRDKGAVVDVKDQGSCGSC 164
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+ +A VEG+N I TG L++ S+ +LVDC T GC ++ AFE+I + ++
Sbjct: 165 WAFSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISNGGIDTD 224
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY GR D CD +R +A I Y+ V E+ LQ V+ QPVSVAI+A
Sbjct: 225 EDYPYTGR-DGSCDQYRKNA--HVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAGGRA 281
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+FTG CG +HGVT +GYG+ E + YW+VKN WG++W E G +R+ R
Sbjct: 282 FQLYESGIFTGYCGTELDHGVTAIGYGS----ENGKYYWIVKNSWGSDWGESGYIRMERN 337
Query: 292 V-GGSGLCNIAANAAYPL 308
+ +G C IA A+YP+
Sbjct: 338 INSATGKCGIAMEASYPI 355
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 118/327 (36%), Positives = 170/327 (51%), Gaps = 50/327 (15%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKK-----------NHEF-LRLN 49
+R +AA+HE+WM ++ R YKD AEK RF++FK NH+F L +N
Sbjct: 24 ARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAFIESFNAGNHKFWLGVN 83
Query: 50 KFADLTREKFLASYT--GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+FADLT ++F ++ T G+ P T P F+N N + + ++DW +G VTP+
Sbjct: 84 QFADLTNDEFRSTKTNKGFIPSTTRVPTG-----FRNENVNIDALPATMDWRTKGVVTPI 138
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEY 163
KDQG CCWAF+AVA +E +LVDC GC +++AF++
Sbjct: 139 KDQGQCGCCWAFSAVAAME----------------ELVDCDVHGEDQGCEGGLMDDAFKF 182
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I + L +E YPY D + S S +I+GY+ V E L V+ QPV
Sbjct: 183 IIKNGGLTTESNYPYAAVDDKF-----KSVSNSVASIKGYEDVPANNEAALMKAVANQPV 237
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVA+D F FY GGV TG CG +HG+ +GYG ++ YWL+KN WG W
Sbjct: 238 SVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASDG---TKYWLLKNSWGMTWG 294
Query: 282 EGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ + + G+C +A +YP
Sbjct: 295 ENGFLRMEKDISDKRGMCGLAMEPSYP 321
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 202 bits (514), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 126/329 (38%), Positives = 174/329 (52%), Gaps = 34/329 (10%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQ----AEKEMRFKIFKKNHEF------------ 45
+ TS + +E WMVE + +Q AEK+ RF+IFK N F
Sbjct: 37 TETSRSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTKNLSYK 96
Query: 46 LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
L L +FADLT E++ + Y G KP S+R + DS+DW + GAV
Sbjct: 97 LGLTRFADLTNEEYRSMYLGAKPTKRVLKTSDRYQ-----ARVGDALPDSVDWRKEGAVA 151
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFE 162
VKDQGS CWAF+ + VEG+NKI TG L++ S+ +LVDC T GC ++ AFE
Sbjct: 152 DVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFE 211
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I + + +E YPY+ D CD R +A K I Y+ V +E L+ ++ QP
Sbjct: 212 FIIKNGGIDTEADYPYKA-ADGRCDQNRKNA--KVVTIDSYEDVPENSEASLKKALAHQP 268
Query: 223 VSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
+SVAI+A F Y GVF G CG +HGV VGYGT E + YW+V+N WG W
Sbjct: 269 ISVAIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGT----ENGKDYWIVRNSWGNRW 324
Query: 281 DEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
E G +++ R + +G C IA A+YP+
Sbjct: 325 GESGYIKMARNIEAPTGKCGIAMEASYPI 353
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 171/316 (54%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +W E ++Y E+E R+ F+ N H F L LN+FADLT E+
Sbjct: 40 YAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ +Y G + + P R + L + + +S+DW +GAV +KDQG CWA
Sbjct: 100 YRDTYLGLR----NKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+A VE +N+I TG L++ S+ +LVDC T GC ++ AF++I + +E
Sbjct: 156 FSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDTEDD 215
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G+ D CD R +A K I Y+ V P +E LQ V QPVSVAI+A F
Sbjct: 216 YPYKGK-DERCDVNRKNA--KVVTIDSYEDVTPNSETSLQKAVRNQPVSVAIEAGGRAFQ 272
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R +
Sbjct: 273 LYSSGIFTGKCGTALDHGVAAVGYGT----ENGKDYWIVRNSWGKSWGESGYVRMERNIK 328
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YPL
Sbjct: 329 ASSGKCGIAVEPSYPL 344
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 125/316 (39%), Positives = 170/316 (53%), Gaps = 32/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLAS 62
+E+W + R +D EK RF +FK N HEF LRLN+F D+T ++F +
Sbjct: 48 YERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRRDEPYKLRLNRFGDMTADEFRRA 106
Query: 63 YTGYKPPPTDHPHSNRSNWFKN---LNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
Y + H R + + + ++DW E+GAV VKDQG CWAF
Sbjct: 107 YASSR---VSHHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAVKDQGQCGSCWAF 163
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASECV 175
+ +A VEG+N IRT L S+ QLVDC T GC ++NAF+YI ++ +A+
Sbjct: 164 STIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVAASSA 223
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+ RQ S+AS I GY+ V +E L+ V+ QPVSVAI+A + F
Sbjct: 224 YPYRARQSSCK---SSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHFQ 280
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GVF G CG +HGV VGYGTT + YW+V+N WG +W E G +R+ R V
Sbjct: 281 FYSEGVFAGKCGTELDHGVAAVGYGTTVDG---TKYWIVRNSWGADWGEKGYIRMKRDVS 337
Query: 294 G-SGLCNIAANAAYPL 308
GLC IA A+YP+
Sbjct: 338 AKEGLCGIAMEASYPI 353
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 180/316 (56%), Gaps = 33/316 (10%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREK 58
A ++ W+ E R+Y E E RF++F N F L +N+FADLT E+
Sbjct: 52 AAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEE 111
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
F A++ G K +R+ + + +S+DW E+GAV PVK+QG CWA
Sbjct: 112 FRATFLGAKVV-----ERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 166
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASEC 174
F+AV+TVE +N++ TG+++T S+ +LV+CST +GC +++AF++I + + +E
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 226
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY+ D CD R +A K +I G++ V E+ LQ V+ QPVSVAI+A F
Sbjct: 227 DYPYKA-VDGKCDINRENA--KVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREF 283
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
YH GVF+G CG + +HGV VGYGT + + YW+V+N WG W E G +R+ R +
Sbjct: 284 QLYHSGVFSGRCGTSLDHGVVAVGYGT----DNGKDYWIVRNSWGPKWGESGYVRMERNI 339
Query: 293 G-GSGLCNIAANAAYP 307
+G C IA A+YP
Sbjct: 340 NVTTGKCGIAMMASYP 355
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/302 (41%), Positives = 168/302 (55%), Gaps = 34/302 (11%)
Query: 30 AEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPP----TDH 73
+K RF +FK N L+LNKFAD+T +F ++Y G K D
Sbjct: 54 GDKHKRFNVFKANMMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRDM 113
Query: 74 PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVATVEGLNKIR 131
P N + ++ + S S +DW ++GAVT VKDQG +C CWAF+ V VEG+N+I+
Sbjct: 114 PRGNGTFMYEKVGSVPAS----VDWRKKGAVTDVKDQG-HCGSCWAFSTVVAVEGINQIK 168
Query: 132 TGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWW 189
T +LV+ S+ +LVDC T GC +E+AF++I+Q + +E YPY QD CD
Sbjct: 169 TNKLVSLSEQELVDCDTEENAGCNGGLMESAFQFIKQKGGITTESYYPYTA-QDGTCD-- 225
Query: 190 RSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNT 247
S A+ +I G++ V E L V+ QPVSVAIDA + F FY GVFTG C
Sbjct: 226 ASKANDLAVSIDGHENVPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTE 285
Query: 248 PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAY 306
NHGV IVGYG T + YW+V+N WG W E G +R+ R + GLC IA A+Y
Sbjct: 286 LNHGVAIVGYGATVDG---TSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASY 342
Query: 307 PL 308
P+
Sbjct: 343 PI 344
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 163/294 (55%), Gaps = 25/294 (8%)
Query: 32 KEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRS 79
+E RF +FK+N + L LNKFAD+T ++F +Y G + R
Sbjct: 60 EERRFNVFKQNARYVHEGNKRDMPFRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRR 119
Query: 80 NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTR 138
+ ++DW ++GAVT +KDQG CWAF+ + VEG+NKIRTG+LV+
Sbjct: 120 GDGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSL 179
Query: 139 SKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGK 196
S+ +L+DC +N GC ++ AF++I Q + +E YPYQG Q CD + +A
Sbjct: 180 SEQELMDCDNVNNQGCDGGLMDYAFQFI-QKNGITTESNYPYQGEQG-SCDQAKENAQAV 237
Query: 197 YGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTI 254
I GY+ V E LQ V+ QPVSVAIDA+ F FY GVFTG C +HGV
Sbjct: 238 --TIDGYEDVPANDESALQKAVAGQPVSVAIDASGQDFQFYSEGVFTGECSTDLDHGVAA 295
Query: 255 VGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
VGYG T + YW+VKN WG +W E G +R+ RGV + GLC IA A+YP
Sbjct: 296 VGYGATRDG---TKYWIVKNSWGEDWGEKGYIRMQRGVSQTEGLCGIAMQASYP 346
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/303 (40%), Positives = 166/303 (54%), Gaps = 38/303 (12%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDH----- 73
EK+ RF +FK N H L+LNKFAD+T +F +Y G K +H
Sbjct: 55 EKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSK---VNHHRMFR 111
Query: 74 --PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKI 130
P + + ++N + S +DW ++GAVT VKDQG CWAF+ V VEG+N+I
Sbjct: 112 GTPRVSGTFMYENFTKAPAS----VDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167
Query: 131 RTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDW 188
+T +LV S+ +L+DC GC +E AFEYI+Q + +E YPY D CD
Sbjct: 168 KTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTA-NDGSCDA 226
Query: 189 WRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGN 246
+ + +I G++ V E+ L V+ QPVSVAIDA + F FY GVFTG CG
Sbjct: 227 TKENVPAV--SIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGK 284
Query: 247 TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAA 305
NHGV IVGYGTT + YW+V+N WG W E G +R+ R V GLC IA A+
Sbjct: 285 ELNHGVAIVGYGTTVDGTN---YWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEAS 341
Query: 306 YPL 308
YP+
Sbjct: 342 YPV 344
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/317 (37%), Positives = 177/317 (55%), Gaps = 26/317 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+ ++ W+++ + Y E+E RF+IFK N F L LNKFADLT +
Sbjct: 42 VMGLYKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQ 101
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ A + G + P ++ + + + + DS++W + GAV+ VKDQGS CW
Sbjct: 102 EYRAKFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSCGSCW 161
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASEC 174
AF+A+A VEG+NKI +G+L++ S+ +LVDC S GC ++ AF++I + +E
Sbjct: 162 AFSAIAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDNGGIDTEK 221
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY G + CD + +A K +I GY+ V P E L+ V+ QPVS+AI+A F
Sbjct: 222 DYPYLGFNN-QCDPTKKNA--KVVSIDGYEDV-PNNENALKKAVAHQPVSIAIEAGGRAF 277
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GVF G CG +HGV VGYG+ Q YW+V+N WG NW E G +R+ R +
Sbjct: 278 QLYESGVFNGECGLALDHGVVAVGYGSDDNG---QDYWIVRNSWGGNWGENGYIRMERNI 334
Query: 293 -GGSGLCNIAANAAYPL 308
+G C IA A+YP+
Sbjct: 335 NANTGKCGIAMEASYPV 351
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 202 bits (514), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 122/300 (40%), Positives = 168/300 (56%), Gaps = 32/300 (10%)
Query: 31 EKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDH----P 74
EK+ RF +FK N L+LNKFAD+T +F +Y+G K P
Sbjct: 53 EKQKRFNVFKHNAMHVHNANKMDKPYKLKLNKFADMTNHEFRNTYSGSKVKHHRMFRGGP 112
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
N + ++ +++ S +DW ++GAVT VKDQG CWAF+ + VEG+N+I+T
Sbjct: 113 RGNGTFMYEKVDTVPAS----VDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTN 168
Query: 134 QLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
+LV+ S+ +LVDC T GC ++ AFE+I+Q + +E YPY+ D CD +
Sbjct: 169 KLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPYEA-YDGTCDVSKE 227
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPN 249
+A +I G++ V E L V+ QPVSVAIDA + F FY GVFTG CG +
Sbjct: 228 NAPAV--SIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELD 285
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
HGV IVGYGTT + YW VKN WG W E G +R+ RG+ GLC IA A+YP+
Sbjct: 286 HGVAIVGYGTTIDG---TKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPI 342
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/304 (39%), Positives = 168/304 (55%), Gaps = 29/304 (9%)
Query: 28 DQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTGYKPP---PT 71
D E RF +F +N + L LNKFAD+T ++F +Y G +
Sbjct: 63 DDGEARRRFNVFVENARYIHEANRRGGRPFRLALNKFADMTTDEFRRTYAGSRARHHRSL 122
Query: 72 DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKI 130
F+ + + ++DW ERGAVT +KDQG CWAF+ VA VEG+NKI
Sbjct: 123 SGGRGGEGGSFRYGGDDEDNLPPAVDWRERGAVTGIKDQGQCGSCWAFSTVAAVEGVNKI 182
Query: 131 RTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDW 188
+TG+LVT S+ +LVDC T + GC ++ AF++I++ + +E YPY+ Q C+
Sbjct: 183 KTGRLVTLSEQELVDCDTGDNQGCDGGLMDYAFQFIKRNGGITTESNYPYRAEQG-RCN- 240
Query: 189 WRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGN 246
++ AS I GY+ V E LQ V+ QPV+VA++A+ F FY GVFTG CG
Sbjct: 241 -KAKASSHDVTIDGYEDVPANDESALQKAVANQPVAVAVEASGQDFQFYSEGVFTGECGT 299
Query: 247 TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG--GSGLCNIAANA 304
+HGV VGYG T + YW+VKN WG +W E G +R+ RGV +GLC IA A
Sbjct: 300 DLDHGVAAVGYGITRDG---TKYWIVKNSWGEDWGERGYIRMQRGVSSDSNGLCGIAMEA 356
Query: 305 AYPL 308
+YP+
Sbjct: 357 SYPV 360
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 117/306 (38%), Positives = 166/306 (54%), Gaps = 35/306 (11%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNKFADLTREKFLASYT--GYKP 68
+ A+HEQWM +++R YKD +EK RFK FADLT +F + T G+K
Sbjct: 33 MVARHEQWMAQYSRVYKDASEKARRFK-------------FADLTNHEFRSVKTNKGFKS 79
Query: 69 PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGL 127
+ F+ N S + +IDW +G VTP+KDQG CC AF+AVA EG+
Sbjct: 80 S-----NMKILTGFRYENVSADALPTTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGI 134
Query: 128 NKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDY 184
KI TG+LV+ + +LVDC GC +++AF++I + L +E YPY D
Sbjct: 135 VKISTGKLVSLADQELVDCDVHGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTA-ADG 193
Query: 185 YCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTG 242
C+ +SA+ I+GY+ V E L ++ QPVSVA+D F FY GGV TG
Sbjct: 194 KCNSGSNSAA----TIKGYEDVPANDEAALMKAMANQPVSVAVDGGDMTFRFYSGGVMTG 249
Query: 243 PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIA 301
CG +HG+ +GYG T++ YWL+KN WGT W E G +R+ + + G+C +A
Sbjct: 250 SCGTDLDHGIAAIGYGKTSDG---TKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLA 306
Query: 302 ANAAYP 307
+YP
Sbjct: 307 MEPSYP 312
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/294 (40%), Positives = 164/294 (55%), Gaps = 25/294 (8%)
Query: 32 KEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRS 79
+E RF +FK+N + L LNKFAD+T ++F +Y G + R
Sbjct: 60 EERRFNVFKENARYVHEGNKRDRPFRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRR 119
Query: 80 NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTR 138
+ + ++DW ++GAVT +KDQG CWAF+ + VEG+NKIRTG+LV+
Sbjct: 120 GDGGFRYADADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSL 179
Query: 139 SKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGK 196
S+ +L+DC +N GC ++ AF++I Q + +E YPYQG Q CD + +A
Sbjct: 180 SEQELMDCDNVNNQGCEGGLMDYAFQFI-QKNGITTESNYPYQGEQG-SCDQAKENAQAV 237
Query: 197 YGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTI 254
I GY+ V E LQ V+ QPVSVAIDA+ F FY GVFTG C +HGV
Sbjct: 238 --TIDGYEDVPANDESALQKAVAGQPVSVAIDASGQDFQFYSEGVFTGECSTDLDHGVAA 295
Query: 255 VGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
VGYG T + YW+VKN WG +W E G +R+ RGV + GLC IA A+YP
Sbjct: 296 VGYGATRDG---TKYWIVKNSWGEDWGEKGYIRMQRGVSQTEGLCGIAMQASYP 346
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 173/314 (55%), Gaps = 28/314 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKF 59
+H +WM E R Y D EK R+ +FK+N E L +N+FADLT E+F
Sbjct: 37 RHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEF 96
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWA 117
+ YTG+K + ++ F+ N S + S+DW ++GAVTP+KDQG C CWA
Sbjct: 97 RSMYTGFKGNSVLSSRTKPTS-FRYQNVSSDALPVSVDWRKKGAVTPIKDQG-LCGSCWA 154
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVY 176
F+AVA +EG+ +I+ G+L++ S+ +LVDC T + GC ++ AF Y L SE Y
Sbjct: 155 FSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDGGCMGGLMDTAFNYTITIGGLTSESNY 214
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNF 234
PY+ + C++ ++ +I+G++ V E+ L V+ PVS+ I F F
Sbjct: 215 PYKS-TNGTCNFNKTKQIAT--SIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQF 271
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVF+G C +HGVT VGYG ++ YW++KN WG W E G MRI + +
Sbjct: 272 YSSGVFSGECTTHLDHGVTAVGYG---RSKNGLKYWILKNSWGPKWGERGYMRIKKDIKP 328
Query: 295 S-GLCNIAANAAYP 307
G C +A NA+YP
Sbjct: 329 KHGQCGLAMNASYP 342
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/317 (37%), Positives = 183/317 (57%), Gaps = 34/317 (10%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF---------------LRLNKFADLTRE 57
A ++ W+ E R+Y E+E RF++F N +F L +N+FADLT +
Sbjct: 47 AAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTND 106
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F +++ G K R +++ ++ +S+DW E+GAV PVK+QG CW
Sbjct: 107 EFRSTFLGAKVVERSRAAGER---YRHDGVEELP--ESVDWREKGAVAPVKNQGQCGSCW 161
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASE 173
AF+AV+TVE +N++ TG+++T S+ +LV+CST +GC +++AF++I + + +E
Sbjct: 162 AFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTE 221
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
YPY+ D CD R +A K +I G++ V E+ LQ V+ QPVSVAI+A
Sbjct: 222 DDYPYKA-VDGKCDINRENA--KVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGRE 278
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F YH GVF+G CG + +HGV VGYGT + + YW+V+N WG W E G +R+ R
Sbjct: 279 FQLYHSGVFSGRCGTSLDHGVVAVGYGT----DNGKDYWIVRNSWGPKWGESGYVRMERN 334
Query: 292 VGG-SGLCNIAANAAYP 307
+ +G C IA A+YP
Sbjct: 335 INATTGKCGIAMMASYP 351
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/304 (39%), Positives = 166/304 (54%), Gaps = 40/304 (13%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNR 78
EK RF +FK+N H+ L+LNKFAD+T +F + Y G K
Sbjct: 55 EKHKRFNVFKENVMHVHKTNKMGKPYKLKLNKFADMTNHEFRSVYAGSK--------VKH 106
Query: 79 SNWFKNLNSSKMSFY--------DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNK 129
F+ SF S+DW ++GAVT VKDQG CWAF+ + VEG+N
Sbjct: 107 HRMFRGTTRGNGSFMYGKVEKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINY 166
Query: 130 IRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCD 187
I+T +LV+ S+ +LVDC T GC +E AFE+I++ + + +E YPY+ +D +CD
Sbjct: 167 IKTNELVSLSEQELVDCDTTENQGCNGGLMEYAFEFIKKKRGITTESTYPYKA-EDGHCD 225
Query: 188 WWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCG 245
+ + +I GY+ V E+ L + QPVSVAIDA + F FY GVF G CG
Sbjct: 226 AAKENNPAV--SIDGYEKVPENDEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECG 283
Query: 246 NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANA 304
+HGV +VGYGTT + YW+V+N WG W E G +R+ RG+ GLC IA A
Sbjct: 284 TELDHGVAVVGYGTTLDG---TKYWIVRNSWGPEWGEKGYIRMQRGISDKEGLCGIAMEA 340
Query: 305 AYPL 308
+YP+
Sbjct: 341 SYPI 344
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/314 (38%), Positives = 177/314 (56%), Gaps = 29/314 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKFL 60
+E W+ E R Y E++ RF++F N F L +N+FADLT ++F
Sbjct: 49 YELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFADLTNDEFR 108
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A+Y G + P + +++ ++ +S+DW E+GAV PVK+QG CWAF+
Sbjct: 109 AAYLGARIPAARRRGTAVGERYRHGGGAE-ELPESVDWREKGAVAPVKNQGQCGSCWAFS 167
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AV++VE +N+I TG++VT S+ +LV+CST +GC ++ AF++I + + +E Y
Sbjct: 168 AVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNGGIDTEGDY 227
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY+ D CD R +A K +I G++ V E+ LQ V+ QPVSVAI+A F
Sbjct: 228 PYKA-VDGKCDINRENA--KVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQL 284
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVF+G C +HGV VGYGT E + YW+V+N WG W E G +R+ R V
Sbjct: 285 YKAGVFSGTCTTNLDHGVVAVGYGT----ENGKDYWIVRNSWGAKWGEDGYIRMERNVNA 340
Query: 295 -SGLCNIAANAAYP 307
+G C IA A+YP
Sbjct: 341 TTGKCGIAMMASYP 354
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/316 (39%), Positives = 172/316 (54%), Gaps = 28/316 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREK 58
+AA +E W+V + Y EKE RF+IFK N F+ L +FADLT E+
Sbjct: 58 VAALYESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRESRTYKVGLTRFADLTNEE 117
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ A + G + + +S + + D +DW ++GAV VKDQG CWA
Sbjct: 118 YRARFLGGRFSRKPRLSAAKSGRYAAALGDDLP--DDVDWRKKGAVATVKDQGQCGSCWA 175
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TLN-GCAKNFLENAFEYIRQYQRLASECV 175
F++VA VEG+N+I TG+L+ S+ +LVDC + N GC ++ AF++I + +E
Sbjct: 176 FSSVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGIDTEED 235
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+GR D CD R +A K I GY+ V E L+ V+ QPVSVAI+A F
Sbjct: 236 YPYKGR-DAACDPNRKNA--KVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQ 292
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
Y GVFTG CG +HGV VGYGT + YW+V+N WG +W E G +R+ R V
Sbjct: 293 LYQSGVFTGRCGTDLDHGVVAVGYGTDNGTD----YWIVRNSWGKDWGESGYIRLERNVA 348
Query: 294 G--SGLCNIAANAAYP 307
+G C IA +YP
Sbjct: 349 NITTGKCGIAVQPSYP 364
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 120/293 (40%), Positives = 162/293 (55%), Gaps = 25/293 (8%)
Query: 33 EMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSN 80
E RF +FK+N + L LNKFAD+T ++F +Y G + R
Sbjct: 61 ERRFNVFKQNARYVHEGNKRDMPFRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRG 120
Query: 81 WFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRS 139
+ ++DW ++GAVT +KDQG CWAF+ + VEG+NKIRTG+LV+ S
Sbjct: 121 DGGFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLS 180
Query: 140 KHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKY 197
+ +L+DC +N GC ++ AF++I Q + +E YPYQG Q CD + +A
Sbjct: 181 EQELMDCDNVNNQGCDGGLMDYAFQFI-QKNGITTESNYPYQGEQG-SCDQAKENAQAV- 237
Query: 198 GAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIV 255
I GY+ V E LQ V+ QPVSVAIDA+ F FY GVFTG C +HGV V
Sbjct: 238 -TIDGYEDVPANDESALQKAVAGQPVSVAIDASGQDFQFYSEGVFTGECSTDLDHGVAAV 296
Query: 256 GYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
GYG T + YW+VKN WG +W E G +R+ RGV + GLC IA A+YP
Sbjct: 297 GYGATRDG---TKYWIVKNSWGEDWGEKGYIRMQRGVSQTEGLCGIAMQASYP 346
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 202 bits (513), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 124/315 (39%), Positives = 175/315 (55%), Gaps = 33/315 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +WM E TY E+E RF+ F+ N H F L LN+FADLT E+
Sbjct: 42 YAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRFADLTNEE 101
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ ++Y G + P S ++ ++ ++ +S+DW ++GAV VKDQG CWA
Sbjct: 102 YRSTYLGARTKP--DRERKLSARYQAADNDELP--ESVDWRKKGAVGAVKDQGGCGSCWA 157
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+A VEG+N+I TG ++ S+ +LVDC T GC ++ AFE+I + SE
Sbjct: 158 FSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDSEED 217
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+ R D CD + +A K I GY+ V +E+ LQ V+ QP+SVAI+A F
Sbjct: 218 YPYKER-DNRCDANKKNA--KVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGRAFQ 274
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y G+FTG CG +HGV VGYGT E + YWLV+N WG+ W E G +R+ R +
Sbjct: 275 LYKSGIFTGTCGTALDHGVAAVGYGT----ENGKDYWLVRNSWGSVWGENGYIRMERNIK 330
Query: 293 GGSGLCNIAANAAYP 307
SG C IA +YP
Sbjct: 331 ASSGKCGIAVEPSYP 345
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 126/322 (39%), Positives = 175/322 (54%), Gaps = 35/322 (10%)
Query: 10 NIAAKHEQWMVEF--ARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLT 55
N+ +E+W + +R +E RF +FK+N + L LNKFAD+T
Sbjct: 35 NLRGLYERWRSHYTVSRRGLGADAEERRFNVFKENARYIHEGNKKDRPFRLALNKFADMT 94
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD----SIDWNERGAVTPVKDQG 111
++F +Y G + H + S + S + D ++DW ++GAVT +KDQG
Sbjct: 95 TDEFRRTYAGSRV----RHHLSLSGGRRGDGSFRYGDADNLPPAVDWRQKGAVTAIKDQG 150
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQ 168
CWAF+ + VEG+NKIRTG+LV+ S+ +L+DC +N GC ++ AF++I +
Sbjct: 151 QCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIHK-N 209
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID 228
+ +E YPYQG Q CD + A I GY+ V E LQ V+ QPVSVAID
Sbjct: 210 GITTESNYPYQGEQG-SCDLAKEKAHAV--TIDGYEDVPANDESALQKAVAGQPVSVAID 266
Query: 229 ATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
A+ F FY GVFTG C +HGV VGYGTT + YW+VKN WG +W E G +
Sbjct: 267 ASGNDFQFYSEGVFTGECSTDLDHGVAAVGYGTTRDG---TKYWIVKNSWGEDWGEKGYI 323
Query: 287 RIFRGVG-GSGLCNIAANAAYP 307
R+ RGV G C IA A+YP
Sbjct: 324 RMQRGVSQAEGQCGIAMQASYP 345
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 201 bits (512), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 119/310 (38%), Positives = 167/310 (53%), Gaps = 27/310 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+ E+WM E+ R Y D AEK RF+IFK N L +N+F D+T +FL
Sbjct: 9 RFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFTDMTNNEFL 68
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A YTG P S F +++ S + SIDW + GAVT VK+QGS CWAF+
Sbjct: 69 ARYTGASLPLNIERDPVVS--FDDVDISAVP--QSIDWRDYGAVTSVKNQGSCGSCWAFS 124
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A+ATVEG+ KI+ G L++ S+ +++DC+ GC ++ A+++I + S PY+
Sbjct: 125 AIATVEGIYKIKAGNLISLSEQEVLDCALSYGCDGGWVNKAYDFIISNNGVTSFANLPYK 184
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFYHGG 238
G Y + I GY YVQ E + V+ QP++ IDA F +Y G
Sbjct: 185 G----YKGPCNHNDLPNKAYITGYTYVQSNNERSMMIAVANQPIAALIDAGGDFQYYKSG 240
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GL 297
VFTG CG + NH +T++GYG T+ YW+VKN WGT+W E G +R+ R V GL
Sbjct: 241 VFTGSCGTSLNHAITVIGYGQTSSG---TKYWIVKNSWGTSWGERGYIRMARDVSSPYGL 297
Query: 298 CNIAANAAYP 307
C IA +P
Sbjct: 298 CGIAMAPLFP 307
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 128/323 (39%), Positives = 174/323 (53%), Gaps = 35/323 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQA--------EKEMRFKIFKKNHEF------------LRLNK 50
+ A + WM++ ++Y + A EK R+ IFK N F L LN
Sbjct: 53 LQALFDSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGENEKNQGYFLGLNA 112
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
FADLT E+F A G + + S + ++ + DSIDW E+GAV VKDQ
Sbjct: 113 FADLTNEEFRAQRHGGRFDRSRERTSYEEFRYGSVQLKDLP--DSIDWREKGAVVGVKDQ 170
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQY 167
GS CWAF+AVA +EG+NK+ TG+LV+ S+ +LVDC GC ++ AF ++ +
Sbjct: 171 GSCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKN 230
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
L +E YPY+G CD RS + K I GY+ V E L V+ QPVSVAI
Sbjct: 231 GGLDTEADYPYKGYG-TRCD--RSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAI 287
Query: 228 DA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
DA + FY G+FTG CG +HGVT VGYG E + YW++KN WG+NW E G
Sbjct: 288 DAGGSSMQFYRSGIFTGRCGTDLDHGVTNVGYGK----EDGKAYWIIKNSWGSNWGEKGY 343
Query: 286 MRIFRGVG-GSGLCNIAANAAYP 307
+++ R G +GLC I A+YP
Sbjct: 344 IKMARNTGLAAGLCGINMEASYP 366
>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
Length = 360
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 121/314 (38%), Positives = 169/314 (53%), Gaps = 26/314 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+ W R+Y E RF ++++N EF L N+FADLT E+FL
Sbjct: 50 RFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFL 109
Query: 61 ASYTGY---KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--C 115
A+YTGY P D + + S ++ S+DW +GAV P K Q S C C
Sbjct: 110 ATYTGYYAGDGPVDDSVITTGAGDVDASFSYRVDVPASVDWRAQGAVVPPKSQTSTCSSC 169
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASEC 174
WAF AT+E LN I+TG+LV+ S+ QLVDC + + GC A++++ + L +E
Sbjct: 170 WAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGGLTTEA 229
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-TWFN 233
YPY R+ C+ R+ ++ I G+ V P E LQ V+RQPV+VAI+ +
Sbjct: 230 DYPYTARRG-PCN--RAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEVGSGMQ 286
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GGV+TGPCG H VT+VGYG T+A YW +KN WG +W E G +RI R VG
Sbjct: 287 FYKGGVYTGPCGTRLAHAVTVVGYG--TDASSGAKYWTIKNSWGQSWGERGYIRILRDVG 344
Query: 294 GSGLCNIAANAAYP 307
G C +A+++ P
Sbjct: 345 GPRPC-VASHSISP 357
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 124/303 (40%), Positives = 166/303 (54%), Gaps = 38/303 (12%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDH----- 73
EK+ RF +FK N H L+LNKFAD+T +F +Y G K +H
Sbjct: 55 EKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGSK---VNHHRMFR 111
Query: 74 --PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKI 130
P + + ++N + S +DW ++GAVT VKDQG CWAF+ V VEG+N+I
Sbjct: 112 GTPRVSGTFMYENFTKAPAS----VDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167
Query: 131 RTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDW 188
+T +LV S+ +L+DC GC +E AFEYI+Q + +E YPY D CD
Sbjct: 168 KTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTA-NDGSCDA 226
Query: 189 WRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGN 246
+ + +I G++ V E+ L V+ QPVSVAIDA + F FY GVFTG CG
Sbjct: 227 TKENVPTV--SIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGK 284
Query: 247 TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAA 305
NHGV IVGYGTT + YW+V+N WG W E G +R+ R V GLC IA A+
Sbjct: 285 ELNHGVAIVGYGTTVDGTN---YWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEAS 341
Query: 306 YPL 308
YP+
Sbjct: 342 YPV 344
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 129/330 (39%), Positives = 184/330 (55%), Gaps = 36/330 (10%)
Query: 3 RTSHKTGNIAAKHEQWMVEFARTYK--DQAEKEMRFKIFKKNHEFLR------------L 48
R+ + NI +E+W V+ + D +EK+ RF+IFK N +F+ L
Sbjct: 44 RSDKEVKNI---YEEWRVKHGKLNNNIDGSEKDKRFEIFKDNLKFIDEHNAENRTYKVGL 100
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHS---NRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
N+FADL+ E++ + Y G K P + RSN + K+ S+DW +GAV
Sbjct: 101 NRFADLSNEEYRSRYLGTKIDPIGMMMARTKTRSNRYAPSVGDKLP--KSVDWRSQGAVV 158
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TLN-GCAKNFLENAFE 162
VKDQGS CWAF+ +A VEG+NKI TG+LV+ S+ +LVDC T+N GC +E AFE
Sbjct: 159 QVKDQGSCGSCWAFSTIAAVEGINKIVTGELVSLSEQELVDCDRTVNAGCDGGLMEYAFE 218
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I + S+ YPY+G D CD ++ +A + +I Y+ V E L+ V+ QP
Sbjct: 219 FIINNGGIDSDEDYPYRG-VDGKCDQYKKNA--RVVSIDDYEQVPAYDELALKKAVANQP 275
Query: 223 VSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
+SVAI+A F Y G+FTG CG +HGVT VGYGT E YW+V+N WG +W
Sbjct: 276 ISVAIEAGGREFQLYVSGIFTGKCGTALDHGVTAVGYGT----ENGVDYWIVRNSWGKSW 331
Query: 281 DEGGSMRIFRGVGGS--GLCNIAANAAYPL 308
E G +R+ R + S G C I ++YP+
Sbjct: 332 GESGYVRMERNLAASVAGKCGIVMQSSYPI 361
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 127/325 (39%), Positives = 181/325 (55%), Gaps = 35/325 (10%)
Query: 10 NIAAKHEQW----MVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLNKFAD 53
++ A +EQW MV ++Q +K F +FK+N HE L LNKFAD
Sbjct: 37 SLRALYEQWRSHYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKGRSFRLALNKFAD 96
Query: 54 LTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKM-----SFYDSIDWNERGAVTPVK 108
+T ++F +Y T H + S ++ + S M + ++DW +RGAVT +K
Sbjct: 97 MTTDEFRRAYAA--GSRTRHHRALSSGIRRHGDGSFMYAQAGNLPLAVDWRQRGAVTGIK 154
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIR 165
DQG CWAF+ +A VEG+NKIRTG+LV+ S+ +LVDC ++ GC ++ AF+YI+
Sbjct: 155 DQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAFQYIK 214
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ + +E YPY Q C+ ++ I GY+ V E+ LQ V+ QPVS+
Sbjct: 215 RNGGITTESNYPYLAEQ-RSCN--KAKERSHDVTIDGYEDVPANNEDALQKAVANQPVSI 271
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AI+A+ F FY GVFTG CG +HGV VGYG T + YW+VKN WG +W E
Sbjct: 272 AIEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGITRDG---TKYWIVKNSWGEDWGER 328
Query: 284 GSMRIFRGVGGS-GLCNIAANAAYP 307
G +R+ RG+ S GLC IA +YP
Sbjct: 329 GYIRMQRGISDSQGLCGIAMEPSYP 353
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 122/318 (38%), Positives = 176/318 (55%), Gaps = 31/318 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREK 58
+ + +E+W+V+ + Y EKE RF+IFK N F+ LN+F+DL+ E+
Sbjct: 48 VMSIYEEWLVKHGKVYNAVEEKEKRFQIFKDNLNFIEEHNAVNRTYKVGLNRFSDLSNEE 107
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CW 116
+ + Y G K P+ + + + +S+DW + GAV VK+Q S C CW
Sbjct: 108 YRSKYLGTKIDPSRMMARPSRRYSPRVADN---LPESVDWRKEGAVVRVKNQ-SECEGCW 163
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TLN-GCAKNFLENAFEYIRQYQRLASEC 174
AF+A+A VEG+NKI TG L S+ +L+DC T+N GC+ ++ AFE+I + +E
Sbjct: 164 AFSAIAAVEGINKIVTGNLTALSEQELLDCDRTVNAGCSGGLVDYAFEFIINNGGIDTEE 223
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YP+QG D CD ++ +A + I GY+ V E L+ V+ QPVSVAI+A F
Sbjct: 224 DYPFQG-ADGICDQYKINA--RAVTIDGYERVPAYDELALKKAVANQPVSVAIEAYGKEF 280
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y G+FTG CG + +HGVT VGYGT E YW+VKN WG NW E G + + R +
Sbjct: 281 QLYESGIFTGTCGTSIDHGVTAVGYGT----ENGIDYWIVKNSWGENWGEAGYVGMERNI 336
Query: 293 G--GSGLCNIAANAAYPL 308
+G C IA YP+
Sbjct: 337 AEDTAGKCGIAILTLYPI 354
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 201 bits (512), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 130/332 (39%), Positives = 174/332 (52%), Gaps = 34/332 (10%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
MS + + +E W+V+ + Y EKE RFKIFK N F L
Sbjct: 34 MSIIDYDESHTRHVYEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLG 93
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNS----SKMSFYDSIDWNERGA 103
LNKFADLT E++ A + G + T P + + K + + +DW E+GA
Sbjct: 94 LNKFADLTNEEYRAMFLGTR---TRGPKNKAAVVAKKTDRYAYRAGEELPAMVDWREKGA 150
Query: 104 VTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENA 160
VTP+KDQG CWAF+ V VEG+N+I TG L + S+ +LVDC GC ++ A
Sbjct: 151 VTPIKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYA 210
Query: 161 FEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR 220
FE+I Q + +E YPY + D CD R +A + I GY+ V E+ L V+
Sbjct: 211 FEFIVQNGGIDTEEDYPYHAK-DNTCDPNRKNA--RVVTIDGYEDVPTNDEKSLMKAVAN 267
Query: 221 QPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGT 278
QPVSVAI+A F Y GVFTG CG +HGV VGYGT E YWLV+N WG+
Sbjct: 268 QPVSVAIEAGGMEFQLYQSGVFTGRCGTNLDHGVVAVGYGT----ENGTDYWLVRNSWGS 323
Query: 279 NWDEGGSMRIFRGVGG--SGLCNIAANAAYPL 308
W E G +++ R V +G C IA A+YP+
Sbjct: 324 AWGENGYIKLERNVQNTETGKCGIAIEASYPI 355
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 123/303 (40%), Positives = 165/303 (54%), Gaps = 38/303 (12%)
Query: 31 EKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDH----- 73
EK+ RF +FK N L+LNKFAD+T +F +Y G K +H
Sbjct: 55 EKQKRFNVFKSNVMHVHNTNKMDKPYKLKLNKFADMTNHEFKTTYAGTK---VNHHRMFR 111
Query: 74 --PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKI 130
P + + ++N + S +DW ++GAVT VKDQG CWAF+ V VEG+N+I
Sbjct: 112 GTPRVSGTFMYENFTKAPAS----VDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167
Query: 131 RTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDW 188
+T +LV S+ +L+DC GC +E AFEYI+Q + +E YPY D CD
Sbjct: 168 KTNRLVPLSEQELIDCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTA-NDGSCDA 226
Query: 189 WRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGN 246
+ + +I G++ V E+ L V+ QPVSVAIDA + F FY GVFTG CG
Sbjct: 227 TKENVPTV--SIDGHETVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGK 284
Query: 247 TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAA 305
NHGV IVGYGTT + YW+V+N WG W E G +R+ R V GLC IA A+
Sbjct: 285 ELNHGVAIVGYGTTVDGTN---YWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEAS 341
Query: 306 YPL 308
YP+
Sbjct: 342 YPV 344
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 124/316 (39%), Positives = 173/316 (54%), Gaps = 33/316 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-----------------LRLNKFADLTREK 58
+ W+V+ + Y EKE RF IF+ N EF L LNKFADLT ++
Sbjct: 6 QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
F Y G K P + S +S+ + ++ +S+DW ++GAV+ VKDQG CWA
Sbjct: 66 FRRIYFGVKRP--EKAESVKSDRYAVKEGDELP--ESVDWRKKGAVSHVKDQGQCGSCWA 121
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+ VEG+NKI TG L+T S+ +LVDC T +GC ++ AF +I + ++
Sbjct: 122 FSAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKD 181
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FN 233
YPY+ D CD R +A K I G + V E+ LQ V+ QPV +AI+A F
Sbjct: 182 YPYKA-TDGSCDSNRKNA--KVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQ 238
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
Y GVFTG CG + +HGV VGYGTT + + YW+V+N WG +W E G +R+ R
Sbjct: 239 LYKSGVFTGSCGTSLDHGVVAVGYGTTDDG---KDYWIVRNSWGDDWGEDGYIRMERNTE 295
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YP+
Sbjct: 296 SKSGKCGIAIEPSYPV 311
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 128/315 (40%), Positives = 170/315 (53%), Gaps = 54/315 (17%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNKFADLTREKFLA 61
SR+ H+ ++ +HE WM + R YKD EKE RFKIFK N +A
Sbjct: 27 SRSLHEA-SMYERHEDWMARYGRMYKDANEKEKRFKIFKDN-----------------VA 68
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYCCWAF 118
T +K ++N+ + +IDW ++GAVTP+KDQ GS CWAF
Sbjct: 69 QATTFK--------------YENVTAVP----STIDWRKKGAVTPIKDQQQCGS--CWAF 108
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+AVA EG+ +I TG+L++ S+ +LVDC T GC+ ++AF +I LASE
Sbjct: 109 SAVAATEGITQITTGKLISLSEQELVDCDTGGENQGCSGGLXDDAFRFIX-IHGLASEAT 167
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FN 233
YPY+G D C+ + + I+GY+ V E+ LQ V+ QPV+VAIDA F
Sbjct: 168 YPYEG-DDGTCNSKKEAHPA--AKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQ 224
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
FY GVFTG CG +HGV VGYG + YWLVKN WGT W E G +R+ R V
Sbjct: 225 FYTSGVFTGQCGTELDHGVAAVGYGIGDDG---MXYWLVKNSWGTGWGEEGYIRMQRDVT 281
Query: 293 GGSGLCNIAANAAYP 307
GLC IA A+YP
Sbjct: 282 AKEGLCGIAMQASYP 296
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 201 bits (511), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 123/300 (41%), Positives = 170/300 (56%), Gaps = 32/300 (10%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDH----P 74
EK RF +F+ N H L+LNKFAD+T +F +Y K P
Sbjct: 53 EKRKRFNVFRANVLHVHNTNKMDKPYKLKLNKFADMTNHEFRTAYASSKVKHHTMFRGAP 112
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
N S + N++ S IDW ++GAVTPVKDQG CWAF+ + VEG+N I+T
Sbjct: 113 LGNGSFMYGNIDKVPAS----IDWRKKGAVTPVKDQGKCGSCWAFSTIVAVEGINFIKTN 168
Query: 134 QLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
+L++ S+ +LVDC+T +GC ++ AFE+I + + + +E YPY+ QD +CD +
Sbjct: 169 KLISLSEQELVDCNTGENHGCNGGLMDYAFEFITKQKGITTEANYPYRA-QDGHCD--AN 225
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPN 249
A+ +I G++ V E L V+ QPVSVAIDA + F FY GVFTG CG +
Sbjct: 226 KANQPAVSIDGHEDVLHNNENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGECGKELD 285
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
HGV IVGYGTT + YW+V+N WG W E G +R+ RG+ GLC IA A+YP+
Sbjct: 286 HGVAIVGYGTTVDG---TKYWIVRNSWGPEWGERGYIRMQRGISDRRGLCGIAMEASYPI 342
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 122/326 (37%), Positives = 172/326 (52%), Gaps = 41/326 (12%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+ +W E + Y E+E R+ F+ N H F L LN+FADLT E+
Sbjct: 40 YAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHSFRLGLNRFADLTNEE 99
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ +Y G + + P R + L + + +S+DW +GAV +KDQG CWA
Sbjct: 100 YRDTYLGLR----NKPRRERKVSDRYLAADNEALPESVDWRTKGAVAEIKDQGGCGSCWA 155
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+A+A VEG+N+I TG L++ S+ +LVDC T GC ++ AF++I + +E
Sbjct: 156 FSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFDFIINNGGIDTEDD 215
Query: 176 YPYQGRQDYYCDWWRSS----------ASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
YPY+G+ D CD R S + K I Y+ V P +E LQ V+ QPVSV
Sbjct: 216 YPYKGK-DERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSETSLQKAVANQPVSV 274
Query: 226 AIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AI+A F Y G+FTG CG +HGV VGYGT E + YW+V+N WG +W E
Sbjct: 275 AIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT----ENGKDYWIVRNSWGKSWGES 330
Query: 284 GSMRIFRGV-GGSGLCNIAANAAYPL 308
G +R+ R + SG C IA +YPL
Sbjct: 331 GYVRMERNIKASSGKCGIAVEPSYPL 356
>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
Length = 258
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 122/272 (44%), Positives = 160/272 (58%), Gaps = 24/272 (8%)
Query: 46 LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD---SIDWNERG 102
+ LN+FAD+T ++F+A YTG +P P + + FK N + D ++DW ++G
Sbjct: 1 MELNEFADMTNDEFMAMYTGLRPVPA---GAKKMAGFKYGNVTLSDADDDQQTVDWRQKG 57
Query: 103 AVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLEN 159
AVT +KDQ CCWAF AVA VEG+++I TG LV+ S+ Q++DC T NGC +++N
Sbjct: 58 AVTGIKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDN 117
Query: 160 AFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS 219
AF+YI LA+E YPY Q C + A AI GYQ V E L V+
Sbjct: 118 AFQYIVGNGGLATEDAYPYTAAQ-AMCQSVQPVA-----AISGYQDVPSGDEAALAAAVA 171
Query: 220 RQPVSVAIDATWFNFYHGGVFTGPCGNTP---NHGVTIVGYGTTTEAEGQQPYWLVKNRW 276
QPVSVAIDA F Y GGV T +TP NH VT VGYGT AE PYWL+KN+W
Sbjct: 172 NQPVSVAIDAHNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGT---AEDGTPYWLLKNQW 228
Query: 277 GTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
G NW EGG +R+ R G+ C +A A+YP+
Sbjct: 229 GQNWGEGGYLRLER---GANACGVAQQASYPV 257
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 201 bits (511), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 123/304 (40%), Positives = 168/304 (55%), Gaps = 38/304 (12%)
Query: 30 AEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDH---- 73
EK RF +FK N L+LNKFAD+T +F ++Y G K +H
Sbjct: 53 GEKHKRFNVFKANLMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSK---VNHHRMF 109
Query: 74 ---PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNK 129
PH N + ++ + +S S+DW ++GAVT VKDQG CWAF+ V VEG+N+
Sbjct: 110 RGTPHENGAFMYEKV----VSVPPSVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQ 165
Query: 130 IRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCD 187
I+T +LV S+ +LVDC GC +E+AFE+I+Q + +E YPY+ Q+ CD
Sbjct: 166 IKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKA-QEGTCD 224
Query: 188 WWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCG 245
S + +I G++ V E+ L V+ QPVSVAIDA + F FY GVFTG C
Sbjct: 225 --ASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCS 282
Query: 246 NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANA 304
NHGV IVGYGTT + YW+V+N WG W E G +R+ R + GLC IA
Sbjct: 283 TDLNHGVAIVGYGTTVDGTN---YWIVRNSWGPEWGEHGYIRMQRNISKKEGLCGIAMLP 339
Query: 305 AYPL 308
+YP+
Sbjct: 340 SYPI 343
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 174/320 (54%), Gaps = 36/320 (11%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFL 60
A +E+W A +D +K RF +FK N HEF LRLN+F D+T ++F
Sbjct: 47 ALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRRDEPYKLRLNRFGDMTADEFR 105
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD------SIDWNERGAVTPVKDQGSY- 113
Y G + H R + + S+ + D S+DW ++GAVT VKDQG
Sbjct: 106 RHYAGSR---VAHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAVTDVKDQGQCG 162
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLA 171
CWAF+ +A VEG+N I+T L + S+ QLVDC T GC ++ AF+YI ++ +A
Sbjct: 163 SCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVA 222
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
+E YPY+ RQ C + I GY+ V E L+ V+ QPVSVAI+A
Sbjct: 223 AEDAYPYRARQ-ASC----KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASG 277
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
+ F FY GVF+G CG +HGVT VGYG T A+G + YWLVKN WG W E G +R+
Sbjct: 278 SHFQFYSEGVFSGRCGTELDHGVTAVGYGVT--ADGTK-YWLVKNSWGPEWGEKGYIRMA 334
Query: 290 RGVGG-SGLCNIAANAAYPL 308
R V G C IA A+YP+
Sbjct: 335 RDVAAKEGHCGIAMEASYPV 354
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 121/313 (38%), Positives = 173/313 (55%), Gaps = 27/313 (8%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKFL 60
A +E+W+ + Y EKE RF+IFK N F+ LN+FADLT E++
Sbjct: 45 AIYEKWLTTHGKAYNAIGEKERRFEIFKDNLRFVDEHNAVAGSYRVGLNRFADLTNEEYR 104
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
+ + G + S +S+ + K+ S+DW E+GAV+PVKDQG CWAF+
Sbjct: 105 SMFLGGNMEMKERSASTKSDRYAFRAGDKLP--GSVDWREKGAVSPVKDQGQCGSCWAFS 162
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYP 177
++ VEG+N+I TG+L++ S+ +LVDC S GC ++ F++I + +E YP
Sbjct: 163 TISAVEGINQIVTGELISLSEQELVDCDKSYNMGCNGGLMDYGFQFIINNGGIDTEEDYP 222
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFY 235
Y+ D CD +R +A + +I GY+ V E L+ V+ QPVSVAI+A F Y
Sbjct: 223 YRA-VDGTCDQFRKNA--RVVSINGYEDVPEDDENSLKKAVANQPVSVAIEAGGRAFQLY 279
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG- 294
GVFTG CG +HGV VGYGT E YW V+N WG W E G +++ R +
Sbjct: 280 ESGVFTGHCGTNLDHGVVAVGYGT----ENGVDYWTVRNSWGPKWGENGYIKLERNINAT 335
Query: 295 SGLCNIAANAAYP 307
SG C IA+ A+YP
Sbjct: 336 SGKCGIASMASYP 348
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 172/311 (55%), Gaps = 28/311 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKF 59
+H WM E R Y D EK R+ +FK+N E L +N+FADLT E+F
Sbjct: 30 RHAAWMTEHGRVYADANEKNNRYVVFKRNVESIERLNEVQYGLTFKLAVNQFADLTNEEF 89
Query: 60 LASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ YTGYK + +++ +++++S + S+DW ++GAVTP+KDQGS CWA
Sbjct: 90 RSMYTGYKGNSVLSSRTKPTSFRYQHVSSDALPI--SVDWRKKGAVTPIKDQGSCGSCWA 147
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVY 176
F+AVA +EG+ +I+ G+L++ S+ +LVDC T +GC ++ +AF Y L SE Y
Sbjct: 148 FSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDDGCMGGYMNSAFNYTMTTGGLTSESNY 207
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI--DATWFNF 234
PY+ D C+ ++ +I+G++ V E+ L V+ PVS+ I T F F
Sbjct: 208 PYK-STDGTCNINKTKQIAT--SIKGFEDVPANDEKALMKAVAHHPVSIGIAGGGTGFQF 264
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVF+G C +HGV +VGYG ++ YW++KN WG W E G MRI +
Sbjct: 265 YSSGVFSGECSTHLDHGVAVVGYGKSSNG---SKYWILKNSWGPKWGERGYMRIKKDTKA 321
Query: 295 S-GLCNIAANA 304
G C +A NA
Sbjct: 322 KHGQCGLAMNA 332
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 201 bits (510), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 124/323 (38%), Positives = 176/323 (54%), Gaps = 30/323 (9%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFAD 53
H + QW+ +R Y +EK+ RF+IFK N + L LNKF+D
Sbjct: 43 HSDDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQEKSYWLGLNKFSD 102
Query: 54 LTREKFLASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
LT ++F A Y G +P H N + ++++ + +M +DW ++GAV+ VKDQGS
Sbjct: 103 LTHDEFRALYLGIRPAGRAHGLRNGDRFIYEDVVAEEM-----VDWRKKGAVSDVKDQGS 157
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS--TLNGCAKNFLENAFEYIRQYQR 169
CWAF+A+ +VEG+N I TG+L++ S+ +LVDC GC ++ AF++I +
Sbjct: 158 CGSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDYAFDFIIKNGG 217
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
+ +E YPY+ D CD R S K I YQ V +E L VS+ PVSVAI+A
Sbjct: 218 IDTEEDYPYKA-TDGQCDEARKETS-KVVVIDDYQDVPTKSESSLLKAVSKNPVSVAIEA 275
Query: 230 TWFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
+F Y GGVFTGPCG +HGV VGYGT + YW+VKN WG +W E G +R
Sbjct: 276 GGRDFQHYQGGVFTGPCGTDLDHGVLAVGYGTDDDGVN---YWIVKNSWGPSWGEKGYIR 332
Query: 288 IFR--GVGGSGLCNIAANAAYPL 308
+ R SG C I ++P+
Sbjct: 333 MERMGSNSTSGKCGINIEPSFPI 355
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 129/320 (40%), Positives = 175/320 (54%), Gaps = 40/320 (12%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFLA 61
+E+W +D EK RF +FK+N HEF L LNKF D+T ++F +
Sbjct: 40 YEKWRTHHT-VARDLDEKNRRFNVFKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRS 98
Query: 62 SYTGYKPPPTDHPHSNR-------SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
Y G K H S R S ++N+ S + SIDW +GAVT VKDQG
Sbjct: 99 KYAGSK---IQHHRSQRGIQKNTGSFMYENVGSLPAA---SIDWRAKGAVTGVKDQGQCG 152
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLA 171
CWAF+ +A+VEG+N+I+TG+LV+ S+ +LVDC T GC ++ AFE+I Q +
Sbjct: 153 SCWAFSTIASVEGINQIKTGELVSLSEQELVDCDTSYNEGCNGGLMDYAFEFI-QKNGIT 211
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT- 230
+E YPY QD C + + +I G+Q V E L V+ QP+SV+I+A+
Sbjct: 212 TEDSYPY-AEQDGTCA--SNLLNSPVVSIDGHQDVPANNENALMQAVANQPISVSIEASG 268
Query: 231 -WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GVFTG CG +HGV IVGYG T + YW+VKN WG W E G +R+
Sbjct: 269 YGFQFYSEGVFTGRCGTELDHGVAIVGYGATRDG---TKYWIVKNSWGEEWGESGYIRMQ 325
Query: 290 RGVGGS-GLCNIAANAAYPL 308
RG+ G C IA A+YP+
Sbjct: 326 RGISDKRGKCGIAMEASYPI 345
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 130/334 (38%), Positives = 179/334 (53%), Gaps = 44/334 (13%)
Query: 4 TSHKTG-NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNK 50
TS +T + +E+W+V+ ++Y EK+ RF+IFK N +F L L +
Sbjct: 43 TSKRTNKEVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTR 102
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFY---------DSIDWNER 101
FADLT E++ + + G K P NR K L SK + Y +S+DW +
Sbjct: 103 FADLTNEEYRSKFLGTKIDP------NRR--MKKLGGSKSNRYAPRVGDKLPESVDWRKE 154
Query: 102 GAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLE 158
GAV VKDQ S CWAF+A+A VEG+NKI TG L++ S+ +LVDC T GC ++
Sbjct: 155 GAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 214
Query: 159 NAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVV 218
AFE+I + SE YPY+ D CD R +A K I Y+ V E LQ V
Sbjct: 215 YAFEFIISNGGIDSEDDYPYKA-VDGRCDQNRKNA--KVVTIDDYEDVPAYDELALQKAV 271
Query: 219 SRQPVSVAID--ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRW 276
+ QP++VA++ F Y GVFTG CG +HGV VGYGT E + YW+V+N W
Sbjct: 272 ANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAAVGYGT----ENGKDYWIVRNSW 327
Query: 277 GTNWDEGGSMRIFRGVGGS--GLCNIAANAAYPL 308
G +W E G +R+ R + S G C IA +YP+
Sbjct: 328 GGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 125/301 (41%), Positives = 172/301 (57%), Gaps = 32/301 (10%)
Query: 30 AEKEMR-FKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHS 76
AE E R F +FK+N HE L LNKFAD+T ++F +Y G + H S
Sbjct: 56 AEAEARRFNVFKENVRYIHEANKKDRPFRLALNKFADMTTDEFRRTYAGSR---VRHHRS 112
Query: 77 NRSNWFKN----LNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIR 131
+ + + + ++DW ++GAVTP+KDQG CWAF+ + VEG+NKIR
Sbjct: 113 LSGGRRQGGGSFMYADAENLPAAVDWRQKGAVTPIKDQGQCGSCWAFSTIVAVEGINKIR 172
Query: 132 TGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWW 189
TG+LV+ S+ +L+DC+ +GC ++ AF++I+Q + +E YPYQG Q+ CD
Sbjct: 173 TGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQFIQQNGGITTEASYPYQGEQN-SCD-- 229
Query: 190 RSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNT 247
+S + +I GY+ V E LQ V+ QPVSVAIDA+ F FY GVFT G
Sbjct: 230 QSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAIDASGNDFQFYSEGVFTTDGGTD 289
Query: 248 PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAY 306
+HGV VGYGTT + YW+VKN WG +W E G +R+ RGV GLC IA A+Y
Sbjct: 290 LDHGVAAVGYGTTRDG---TKYWIVKNSWGEDWGEKGYIRMQRGVKQAEGLCGIAMEASY 346
Query: 307 P 307
P
Sbjct: 347 P 347
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 118/310 (38%), Positives = 169/310 (54%), Gaps = 26/310 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFL 60
+ E+WMVE+ R YKD EK RF+IFK N + +N+F D+T +F+
Sbjct: 36 RFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSRNENSYTLGINQFTDMTNNEFI 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A YTG P + + F +++ S + SIDW + GAVT VK+Q CWAF
Sbjct: 96 AQYTGGISRPLNIEREPVVS-FDDVDISAVP--QSIDWRDYGAVTSVKNQNPCGACWAFA 152
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A+ATVE + KI+ G L S+ Q++DC+ GC + AFE+I + +AS +YPY+
Sbjct: 153 AIATVESIYKIKKGILEPLSEQQVLDCAKGYGCKGGWEFRAFEFIISNKGVASGAIYPYK 212
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFYHGG 238
+ C +++ I GY V E + VS+QP++VA+DA F +Y G
Sbjct: 213 AAKG-TC---KTNGVPNSAYITGYARVPRNNESSMMYAVSKQPITVAVDANANFQYYKSG 268
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGL 297
VF GPCG + NH VT +GYG + + YW+VKN WG W E G +R+ R V SG+
Sbjct: 269 VFNGPCGTSLNHAVTAIGYGQDSNG---KKYWIVKNSWGARWGEAGYIRMARDVSSSSGI 325
Query: 298 CNIAANAAYP 307
C IA ++ YP
Sbjct: 326 CGIAIDSLYP 335
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 119/273 (43%), Positives = 160/273 (58%), Gaps = 25/273 (9%)
Query: 46 LRLNKFADLTREKFLASYTGYKPPPTDHPH----SNRSNWFKNLNSSKMSFYDSIDWNER 101
L +N+FADLT E+F+ P + H + R+ FK N + + DSIDW ++
Sbjct: 24 LGINQFADLTSEEFIV------PRNRFNGHMRFSNTRTTTFKYENVTVLP--DSIDWRQK 75
Query: 102 GAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFL 157
GAVTP+K+QGS CCWAF+A+A EG++KI TG+LV+ S+ ++VDC T +GC ++
Sbjct: 76 GAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYM 135
Query: 158 ENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDV 217
+ AF++I Q + +E YPY+G D C+ + I GY+ V E+ LQ
Sbjct: 136 DGAFKFIIQNHGINTEASYPYKGV-DGKCNIKEEAVHAT--TITGYEDVPINNEKALQKA 192
Query: 218 VSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNR 275
V+ QPVSVAIDA F FY G+FTG CG +HGVT VGYG E YWLVKN
Sbjct: 193 VANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYGENNEG---TKYWLVKNS 249
Query: 276 WGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
WGT W E G + RGV G+C IA A+YP
Sbjct: 250 WGTEWGEEGYTMMQRGVKAVEGICGIAMLASYP 282
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 200 bits (509), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 124/312 (39%), Positives = 166/312 (53%), Gaps = 28/312 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
+ WMV+ + Y AEKE R IF+ N F L L +FADL+ ++
Sbjct: 57 DSWMVKHGKVYGSVAEKERRLTIFEDNLRFISNRNAENLSYRLGLTQFADLSLHEYGEVC 116
Query: 64 TGYKP-PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTA 120
G P PP +H S+ +K S+ S+DW GAVT VKDQG +C CWAF+
Sbjct: 117 HGADPRPPRNHVFMTSSDRYKT--SAGDVLPKSVDWRNEGAVTEVKDQG-HCRSCWAFST 173
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
V VEGLNKI TG+LVT S+ L++C+ NGC +E A+E+I + L ++ YPY+
Sbjct: 174 VGAVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMKNGGLGTDNDYPYK 233
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHG 237
+ CD R + K I G++ + E L V+ QPV+ ID++ F Y
Sbjct: 234 A-VNGVCD-GRLKENNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYES 291
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF G CG NHGV +VGYGT E + YWLVKN G W E G M++ R + G
Sbjct: 292 GVFDGSCGTNLNHGVVVVGYGT----ENGRDYWLVKNSRGNTWGEAGYMKMARNIANPRG 347
Query: 297 LCNIAANAAYPL 308
LC IA A+YPL
Sbjct: 348 LCGIAMRASYPL 359
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 200 bits (508), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 130/311 (41%), Positives = 171/311 (54%), Gaps = 42/311 (13%)
Query: 29 QAEKEMRFKIFKKNHEF---------------------LRLNKFADLTREKFLASYTGYK 67
AEK RF FK N F LRLN+F D+ + +F +++ G
Sbjct: 56 HAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRLRLNRFGDMDQAEFRSTFAG-- 113
Query: 68 PPPTDHPHSNRSNWFKN-LNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVE 125
P H H+ + + + ++DW ++GAVT VKDQG CWAF+AVA+VE
Sbjct: 114 --PL-HRHTRPAQSIPGFIYDTVKDIPQAVDWRQKGAVTGVKDQGKCGSCWAFSAVASVE 170
Query: 126 GLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQ-RLASECVYPYQGR 181
GLN IRTG LV+ S+ +L+DC T NGC +E+AFE+I LA+E YPY
Sbjct: 171 GLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEFIAHSAGGLATEAAYPYHA- 229
Query: 182 QDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGV 239
+ C+ R S+ I G+Q V EE L V+ QPVSVAIDA F FY GV
Sbjct: 230 SNGTCNANRGSSVSVR--IDGHQSVPAGNEEALAKAVAHQPVSVAIDAGGQAFQFYSEGV 287
Query: 240 FTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR--GVGGSGL 297
FTG CG+ +HGV +VGYG E +G++ YW+VKN WG W E G +R+ R GV G GL
Sbjct: 288 FTGDCGSELDHGVAVVGYGVAEE-DGKE-YWIVKNSWGPGWGEHGYVRMQRDSGVDG-GL 344
Query: 298 CNIAANAAYPL 308
C IA A+YP+
Sbjct: 345 CGIAMEASYPV 355
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 200 bits (508), Expect = 8e-49, Method: Compositional matrix adjust.
Identities = 122/327 (37%), Positives = 191/327 (58%), Gaps = 32/327 (9%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------L 48
SR H+ ++ +HEQWM ++R YKD AE+E RF +FK N +F++ +
Sbjct: 23 SRPLHE-ASMYERHEQWMARYSRNYKDDAEEERRFXMFKDNVDFIQTFDTAGNMPNKLGV 81
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N AD+T E+F AS +K PP S ++ F++ N +++ ++DW ++ VT +K
Sbjct: 82 NALADMTHEEFRASGNTFKIPPNLGLRSETTS-FRHQNVTRIP--STMDWRKKRTVTHIK 138
Query: 109 DQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEY 163
+Q C CWAF+AVA +EG+ K++T + ++ S+ +LVDC GC +++AF++
Sbjct: 139 NQ-LQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDIFGSNIGCEGGCMDDAFKF 197
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I Q + L SE Y Y+G + +C+ + S + I Y+ + +E+ L VV+ QP+
Sbjct: 198 IIQNRGLNSEARYLYKGVEG-HCN--KKKESSRAARINDYENMPEFSEKALLKVVAHQPI 254
Query: 224 SVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
SVAIDA + F FY G+ T GN ++GVT GYG + A+G++ +WLVKN WGT+W
Sbjct: 255 SVAIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRS--ADGKK-HWLVKNSWGTDWG 311
Query: 282 EGGSMRIFRGVGG-SGLCNIAANAAYP 307
E G R+ RGV +GLC A+YP
Sbjct: 312 ENGYTRMERGVKATTGLCGFTMQASYP 338
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 130/334 (38%), Positives = 179/334 (53%), Gaps = 44/334 (13%)
Query: 4 TSHKTG-NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNK 50
TS +T + +E+W+V+ ++Y EK+ RF+IFK N +F L L +
Sbjct: 43 TSKRTNKEVLTMYEEWLVKHGKSYNGLGEKDKRFEIFKDNLKFIDEHNGLNSTYRLGLTR 102
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFY---------DSIDWNER 101
FADLT E++ + + G K P NR K L SK + Y +S+DW +
Sbjct: 103 FADLTNEEYRSKFLGTKIDP------NRR--MKKLGGSKSNRYAPRVGDKLPESVDWRKE 154
Query: 102 GAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLE 158
GAV VKDQ S CWAF+A+A VEG+NKI TG L++ S+ +LVDC T GC ++
Sbjct: 155 GAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMD 214
Query: 159 NAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVV 218
AFE+I + SE YPY+ D CD R +A K I Y+ V E LQ V
Sbjct: 215 YAFEFIISNGGIDSEDDYPYKA-VDGRCDQNRKNA--KVVTIDDYEDVPAYDELALQKAV 271
Query: 219 SRQPVSVAID--ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRW 276
+ QP++VA++ F Y GVFTG CG +HGV VGYGT E + YW+V+N W
Sbjct: 272 ANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAAVGYGT----ENGKDYWIVRNSW 327
Query: 277 GTNWDEGGSMRIFRGVGGS--GLCNIAANAAYPL 308
G +W E G +R+ R + S G C IA +YP+
Sbjct: 328 GGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 200 bits (508), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 123/323 (38%), Positives = 171/323 (52%), Gaps = 30/323 (9%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKF 51
+S ++ +E+W+V+ + EK+ RF+IFK N F L L KF
Sbjct: 31 SSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKF 90
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG 111
ADLT +++ + Y G + S R + +S+DW + GAV VKDQG
Sbjct: 91 ADLTNDEYRSMYLGSRLKRKATKSSLRYEV-----RVGDAIPESVDWRKEGAVAEVKDQG 145
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQ 168
S CWAF+ + VEG+NKI TG L+T S+ +LVDC T GC ++ AFE+I
Sbjct: 146 SCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNG 205
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID 228
+ +E YPY+G D CD R +A K I Y+ V +EE L+ +S QP+SVAI+
Sbjct: 206 GIDTEEDYPYKG-VDGRCDQTRKNA--KVVTIDLYEDVPANSEESLKKALSHQPISVAIE 262
Query: 229 --ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y G+F G CG +HGV VGYGT E + YW+VKN WGT+W E G +
Sbjct: 263 GGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT----ENGKDYWIVKNSWGTSWGESGYI 318
Query: 287 RIFRGVGGS-GLCNIAANAAYPL 308
R+ R + S G C IA +YP+
Sbjct: 319 RMERNIASSAGKCGIAVEPSYPI 341
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 126/304 (41%), Positives = 167/304 (54%), Gaps = 38/304 (12%)
Query: 30 AEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDH---- 73
+K RF +FK N H L+LNKFAD+T +F ++Y G K +H
Sbjct: 54 GDKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSK---VNHHRMF 110
Query: 74 ---PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNK 129
P N + ++ + S S+DW + GAVT VKDQG CWAF+ V VEG+N+
Sbjct: 111 QGTPRGNGTFMYEKVGS----VPPSVDWRKNGAVTGVKDQGQCGSCWAFSTVVAVEGINQ 166
Query: 130 IRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCD 187
I+T +LV+ S+ +LVDC T GC +E+AFE+I+Q + +E YPY QD CD
Sbjct: 167 IKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQKGGITTESNYPYTA-QDGTCD 225
Query: 188 WWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCG 245
S A+ +I G++ V E L V+ QPVSVAIDA + F FY GVFTG C
Sbjct: 226 --ASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCS 283
Query: 246 NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANA 304
NHGV IVGYGTT + YW V+N WG W E G +R+ R + GLC IA A
Sbjct: 284 TELNHGVAIVGYGTTVDGTN---YWTVRNSWGPEWGEQGYIRMQRSISKKEGLCGIAMMA 340
Query: 305 AYPL 308
+YP+
Sbjct: 341 SYPI 344
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 199 bits (507), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 124/318 (38%), Positives = 174/318 (54%), Gaps = 29/318 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
I +E W+ + + Y EK+ +F +FK N + L LN+FADL+ E
Sbjct: 40 IMELYELWLAQHKKAYNGLDEKQKKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHE 99
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F A+Y G K + S ++ S +SIDW E+GAVT VK+QGS CW
Sbjct: 100 EFKAAYLGTKLDAKKRLSRSPSPRYQY--SVGEDLPESIDWREKGAVTAVKNQGSCGSCW 157
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASEC 174
AF+ VA VEG+N+I TG L + S+ +LVDC T GC ++ AF++I L SE
Sbjct: 158 AFSTVAAVEGINQIVTGNLTSLSEQELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDSED 217
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WF 232
YPY+ CD +R +A I Y+ V E+ L+ + QP+SVAI+A+ F
Sbjct: 218 DYPYKANNG-SCDAYRKNA--HVVTIDDYEDVPENDEKSLKKAAANQPISVAIEASGRAF 274
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVFT CG +HGVT+VGYG+ E YWLVKN WG +W E G +++ R +
Sbjct: 275 QFYESGVFTSNCGTQLDHGVTLVGYGS----ESGIDYWLVKNSWGNSWGEKGFIKLQRNL 330
Query: 293 GG--SGLCNIAANAAYPL 308
G +G+C IA A+YP+
Sbjct: 331 EGASTGMCGIAMEASYPV 348
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 172/311 (55%), Gaps = 31/311 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
+++W+ E + Y E + RF+IFK+N + L LNKFADLT +F
Sbjct: 38 YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEFRG 97
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
Y G P P + +++ S+DW ++G VT +KDQG CWAF+A
Sbjct: 98 LYVGRLQRPA--PFHEVGDIALVADTAT-----SVDWRKKGGVTEIKDQGDCGSCWAFSA 150
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPY 178
VA VEGL + TG LV+ S+ +LVDC T GC ++ AF+Y+ + + S+ YPY
Sbjct: 151 VAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRNGGITSQSNYPY 210
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
+ + CD + I G+Q + P +EE L V+ QPVSVAI+A F Y
Sbjct: 211 RALRGA-CD--KDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYS 267
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSG 296
GVFTG CG+ +HGV IVGYGT +A G+Q YWLVKN WG+ W E G +R+ R G+G
Sbjct: 268 SGVFTGECGSNLDHGVAIVGYGT--DAGGRQ-YWLVKNSWGSGWGESGYVRMERQGPGAG 324
Query: 297 LCNIAANAAYP 307
+C I +A+YP
Sbjct: 325 VCGINLDASYP 335
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 125/300 (41%), Positives = 170/300 (56%), Gaps = 31/300 (10%)
Query: 31 EKEMRFKIF-----------KKNHEF-LRLNKFADLTREKFLASYTG----YKPPPTDHP 74
E+E RF +F KKN + L+LNKFADLT +F +YTG +
Sbjct: 53 EREKRFNVFRHNVMHVHNTNKKNRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPK 112
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
++ + + N SK+ S+DW ++GAVT +K+QG CWAF+ VA VEG+NKI+T
Sbjct: 113 RGSKQFMYDHENLSKLP--SSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTN 170
Query: 134 QLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
+LV+ S+ +LVDC T GC +E AFE+I++ + +E YPY+G D CD S
Sbjct: 171 KLVSLSEQELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEG-IDGKCD--AS 227
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPN 249
+G I G++ V E L V+ QPVSVAIDA + F FY GVFTG CG N
Sbjct: 228 KDNGVLVTIDGHEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELN 287
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
HGV VGYG+ E + YW+V+N WG W EGG ++I R + G C IA A+YP+
Sbjct: 288 HGVAAVGYGS----ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPI 343
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 199 bits (507), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 123/314 (39%), Positives = 173/314 (55%), Gaps = 27/314 (8%)
Query: 15 HEQWMVEFARTYK-DQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLA 61
+++W ++ T D E RF+IFK+N + L LNKFADL+ E+F A
Sbjct: 45 YDKWALQHRSTRSLDSDEHARRFEIFKENVKHIDSVNKKDGPYKLGLNKFADLSNEEFKA 104
Query: 62 SYTGYKPPPTDHPHSNR---SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ K +R S F NS ++ SIDW ++GAVTPVK+QG CWA
Sbjct: 105 MHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPA--SIDWRKKGAVTPVKNQGQCGSCWA 162
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVY 176
F+ +A+VEG+N I+TG+LV+ S+ QLVDCS N GC ++NAF+YI + +E Y
Sbjct: 163 FSTIASVEGINYIKTGKLVSLSEQQLVDCSKENAGCNGGLMDNAFQYIIDNGGIVTEDEY 222
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY C + + I G++ V E L+ V+ QPVS+AI+A+ F F
Sbjct: 223 PYTAEAG-ECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAIEASGHDFQF 281
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVFTG CG +HGV +VGYG + E YW+V+N WG W E G +R+ RG+
Sbjct: 282 YSTGVFTGKCGTELDHGVVVVGYGKSPEGIN---YWIVRNSWGPEWGEQGYIRMQRGIEA 338
Query: 295 S-GLCNIAANAAYP 307
+ G C I+ A+YP
Sbjct: 339 TEGKCGISMQASYP 352
>gi|115438530|ref|NP_001043562.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|11034572|dbj|BAB17096.1| cysteine proteinase-like [Oryza sativa Japonica Group]
gi|113533093|dbj|BAF05476.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|215697766|dbj|BAG91959.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 360
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 129/322 (40%), Positives = 174/322 (54%), Gaps = 27/322 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHE--------------FLRLNKFADLT 55
++AA+HE+WM F R Y D AEK R ++F N E L LN+F+DLT
Sbjct: 38 SMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDLT 97
Query: 56 REKFLASYTGYK--PPPTDHPHSNRS-NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
++F ++ GY PPP H H +R+ N + DS+DW RGAVT VK+Q S
Sbjct: 98 DDEFAQTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQRS 157
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRL 170
CWAF AVA EGL ++ TG LV+ S+ Q++DC+ N C+ + A YI L
Sbjct: 158 CGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDCTGGANTCSGGDVSAALRYIAASGGL 217
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG-LQDVVSRQPVSVAIDA 229
+E Y Y G+Q C +A A+ G ++ + +EG LQ + + QPV V ++A
Sbjct: 218 QTEAAYAYGGQQGA-CRAGGFAAPNSAAAVGGARWARLYGDEGALQALAAGQPVVVVVEA 276
Query: 230 TW--FNFYHGGVFTG--PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+ F Y GV+ G CG NH VT+VGYG + G+ YWLVKN+WGT W EGG
Sbjct: 277 SEPDFRHYRSGVYAGSAACGRRLNHAVTVVGYGAAADGGGE--YWLVKNQWGTWWGEGGY 334
Query: 286 MRIFRGVGGSGLCNIAANAAYP 307
MR+ RG G C IA A YP
Sbjct: 335 MRVARGGAAGGNCGIATYAFYP 356
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 123/323 (38%), Positives = 171/323 (52%), Gaps = 30/323 (9%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKF 51
+S ++ +E+W+V+ + EK+ RF+IFK N F L L KF
Sbjct: 37 SSRSDAEVSRLYEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKF 96
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG 111
ADLT +++ + Y G + S R + +S+DW + GAV VKDQG
Sbjct: 97 ADLTNDEYRSMYLGSRLKRKATKSSLRYEV-----RVGDAIPESVDWRKEGAVAEVKDQG 151
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQ 168
S CWAF+ + VEG+NKI TG L+T S+ +LVDC T GC ++ AFE+I
Sbjct: 152 SCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNG 211
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID 228
+ +E YPY+G D CD R +A K I Y+ V +EE L+ +S QP+SVAI+
Sbjct: 212 GIDTEEDYPYKG-VDGRCDQTRKNA--KVVTIDLYEDVPANSEESLKKALSHQPISVAIE 268
Query: 229 --ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y G+F G CG +HGV VGYGT E + YW+VKN WGT+W E G +
Sbjct: 269 GGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT----ENGKDYWIVKNSWGTSWGESGYI 324
Query: 287 RIFRGVGGS-GLCNIAANAAYPL 308
R+ R + S G C IA +YP+
Sbjct: 325 RMERNIASSAGKCGIAVEPSYPI 347
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 173/316 (54%), Gaps = 29/316 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIF-------------KKNHEF-LRLNKFADLTREKFL 60
+E W+V + Y EKE RF+IF + NH + L L +FADLT E++
Sbjct: 38 YEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYTLGLTRFADLTNEEYR 97
Query: 61 ASYTGYKPPPTDHPHSNRS-NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
++Y G KP +NR+ ++L+++ +DW E+GAV P+KDQG CWAF
Sbjct: 98 STYLGVKPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGAVAPIKDQGGCGSCWAF 157
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVY 176
+ VA VEG+N+I TG L+ S+ +LVDC T GC ++ AF++I + +E Y
Sbjct: 158 STVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYAFQFIISNGGIDTEEDY 217
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
PY+ R D CD R +A K +I Y+ V E L+ V+ QPVSVAI+ F
Sbjct: 218 PYKER-DGLCDPNRKNA--KVVSIDSYEDVLENDEHALKTAVAHQPVSVAIEGGGRSFQL 274
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-- 292
Y G+F G CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R +
Sbjct: 275 YKSGIFDGRCGIDLDHGVVAVGYGT----ESGKDYWIVRNSWGKSWGEAGYIRMERNLPS 330
Query: 293 GGSGLCNIAANAAYPL 308
SG C IA +YP+
Sbjct: 331 SSSGKCGIAIEPSYPI 346
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 121/310 (39%), Positives = 173/310 (55%), Gaps = 25/310 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E+W+ + + Y EK RF++FK N + +L LN+FADLT ++F A+Y
Sbjct: 50 EKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINREVTSYWLGLNEFADLTHDEFKAAY 109
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G P S+RS ++++++S + S+DW ++GAVT VK+QG CWAF+ VA
Sbjct: 110 LGLDAAPARRG-SSRSFRYEDVSASDLP--KSVDWRKKGAVTEVKNQGQCGSCWAFSTVA 166
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N I TG L S+ +L+DCS +GC ++ AF YI L +E YPY
Sbjct: 167 AVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTEEAYPYLM 226
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
+ D ++ + I GY+ V E+ L ++ QPVSVAI+A+ F FY GG
Sbjct: 227 EEGSCGDGKKAESEAV--TISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQFYSGG 284
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGL 297
VF GPCG +HGV VGYG + + +G Y +V+N WG W E G +R+ RG G GL
Sbjct: 285 VFDGPCGAQLDHGVAAVGYG-SDKGKGHD-YIIVRNSWGAQWGEKGYIRMKRGTSNGEGL 342
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 343 CGINKMASYP 352
>gi|125526835|gb|EAY74949.1| hypothetical protein OsI_02845 [Oryza sativa Indica Group]
Length = 360
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 129/322 (40%), Positives = 174/322 (54%), Gaps = 27/322 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHE--------------FLRLNKFADLT 55
++AA+HE+WM F R Y D AEK R ++F N E L LN+F+DLT
Sbjct: 38 SMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDLT 97
Query: 56 REKFLASYTGYK--PPPTDHPHSNRS-NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
++F ++ GY PPP H H +R+ N + DS+DW RGAVT VK+Q S
Sbjct: 98 DDEFARTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQRS 157
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRL 170
CWAF AVA EGL ++ TG LV+ S+ Q++DC+ N C+ + A YI L
Sbjct: 158 CGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDCTGGANTCSGGDVSAALRYIAASGGL 217
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG-LQDVVSRQPVSVAIDA 229
+E Y Y G+Q C +A A+ G ++ + +EG LQ + + QPV V ++A
Sbjct: 218 QTEAAYAYGGQQGA-CRAGGFAAPNSAAAVGGARWARLYGDEGALQALAAGQPVVVVVEA 276
Query: 230 TW--FNFYHGGVFTG--PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+ F Y GV+ G CG NH VT+VGYG + G+ YWLVKN+WGT W EGG
Sbjct: 277 SEPDFRHYRSGVYAGSAACGRRLNHAVTVVGYGAAADGGGE--YWLVKNQWGTWWGEGGY 334
Query: 286 MRIFRGVGGSGLCNIAANAAYP 307
MR+ RG G C IA A YP
Sbjct: 335 MRVARGGAAGGNCGIATYAFYP 356
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 199 bits (506), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 171/321 (53%), Gaps = 34/321 (10%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIF-----------KKNHEF-LRLNKFADLTRE 57
N+ +E+W + T + E RF +F KKN + L++N+FAD+T
Sbjct: 32 NVWKLYERWRDHHSVT-RASHEALKRFNVFRHNVLHVHRTNKKNKPYKLKVNRFADITHH 90
Query: 58 KFLASYTGYKPPPTDHPHSNR-----SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F +SY G H R S F N +++ S+DW E+GAVT VK+Q
Sbjct: 91 EFRSSYAGSN---VKHHRMLRGPKRGSGGFMYENVTRVP--SSVDWREKGAVTEVKNQQD 145
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQR 169
CWAF+ VA VEG+NKIRT +LV+ S+ +LVDC T GCA +E AFE+I+
Sbjct: 146 CGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQGCAGGLMEPAFEFIKNNGG 205
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
+ +E YPY +C S G+ I G+++V EE L V+ QPVSVAIDA
Sbjct: 206 IKTEETYPYDSNDVQFCR--AKSIDGETVTIDGHEHVPENDEEALLKAVAHQPVSVAIDA 263
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
+ F Y GVF G CG NHGV IVGYG T YW+V+N WG W EGG +R
Sbjct: 264 GSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNG---TKYWIVRNSWGPEWGEGGYVR 320
Query: 288 IFRGVG-GSGLCNIAANAAYP 307
I RG+ G C IA A+YP
Sbjct: 321 IERGISENEGRCGIAMEASYP 341
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 126/325 (38%), Positives = 174/325 (53%), Gaps = 35/325 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLA 61
+ EQWM+ R Y D EK+ RF+++++N E L NKFADLT E+F A
Sbjct: 30 RFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSNGYKLADNKFADLTNEEFRA 89
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNL--NSSKMSFYDSIDWNERGAVT----PVKDQGSYCC 115
G++P T SN + + SS S+DW +GAV D GS C
Sbjct: 90 KMLGFRPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAVINRWKICVDAGS--C 147
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASEC 174
WAF+AVA +EG+N+I+ G+LV+ S+ +LVDC GC ++ AFE++ L +E
Sbjct: 148 WAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLTTEA 207
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF 234
YPY + C + + S AI GY+ V P++E L + QPVSVA+D F F
Sbjct: 208 SYPYHA-ANGACQAAKLNQSAV--AIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMF 264
Query: 235 --YHGGVFTGPCGNTPNHGVTIVGYGTT-------TEAEGQQPYWLVKNRWGTNWDEGGS 285
Y GV+TGPC NHGVT+VGYG + A+G + YW+VKN WG W + G
Sbjct: 265 QLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGY 324
Query: 286 MRIFRGVGG--SGLCNIAANAAYPL 308
+ + R V G SGLC IA +YP+
Sbjct: 325 ILMQRDVAGLASGLCGIALLPSYPV 349
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 199 bits (505), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 121/280 (43%), Positives = 158/280 (56%), Gaps = 22/280 (7%)
Query: 40 KKNHEF-LRLNKFADLTREKFLASYTGYKPPPTDHPHSNR-----SNWFKNLNSSKMSFY 93
KKN + L++N+FAD+T +F +SY G H R S F N +++
Sbjct: 73 KKNKPYKLKINRFADITHHEFRSSYAGSN---VKHHRMLRGPKRGSGGFMYENVTRVP-- 127
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-- 150
S+DW E+GAVT VK+Q CWAF+ VA VEG+NKIRT +LV+ S+ +LVDC T
Sbjct: 128 SSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDCDTEENQ 187
Query: 151 GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT 210
GCA +E AFE+I+ + +E YPY +C +S G+ I G+++V
Sbjct: 188 GCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCR--ANSIGGETVTIDGHEHVPEND 245
Query: 211 EEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQP 268
EE L V+ QPVSVAIDA + F Y GVF G CG NHGV IVGYG T
Sbjct: 246 EEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYGETKNG---TK 302
Query: 269 YWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYP 307
YW+V+N WG W EGG +RI RG+ G C IA A+YP
Sbjct: 303 YWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYP 342
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 128/313 (40%), Positives = 172/313 (54%), Gaps = 36/313 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + ++TY+ EK RF+IF N + +L LN+FADL+ E+F + Y
Sbjct: 48 ESWMSKHSKTYRSIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKY 107
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G + + P S F + + +S+DW +GAVTPVK+QGS CWAF+ VA
Sbjct: 108 LGLR---VEFPRKRSSRGFSYGDVEDLP--ESVDWRTKGAVTPVKNQGSCGSCWAFSTVA 162
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYPY-- 178
VEG+N+I TG L + S+ +L+DC S NGC ++ AF+YI L E YPY
Sbjct: 163 AVEGINQIVTGNLTSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLM 222
Query: 179 -QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFY 235
+GR C R + I GY+ V E+ L +S QPVSVAI+A+ F FY
Sbjct: 223 EEGR----C--IREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFY 276
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG- 294
GG+FTG CG +HGVT VGYG++ EG Y +VKN WG W E G +R+ R G
Sbjct: 277 KGGIFTGRCGTQMDHGVTAVGYGSS---EGTD-YIIVKNSWGPKWGENGYIRMKRNTGKP 332
Query: 295 SGLCNIAANAAYP 307
GLC I A+YP
Sbjct: 333 EGLCGINQMASYP 345
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 124/298 (41%), Positives = 167/298 (56%), Gaps = 26/298 (8%)
Query: 28 DQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDHPH 75
D E RF+IFK+N ++ L LNKFADL+ E+F A Y G K
Sbjct: 60 DSEEHAERFEIFKENVKYIDSVNKKDSPYKLGLNKFADLSNEEFKAIYMGTKMD-LRGDR 118
Query: 76 SNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVATVEGLNKIRTG 133
+S F NS + SIDW ++GAV VK+QG +C CWAF+ VA+VEG+N I TG
Sbjct: 119 EVQSGSFMYQNSEPLPA--SIDWRQKGAVAAVKNQG-HCGSCWAFSTVASVEGINYITTG 175
Query: 134 QLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSS 192
LV+ S+ QLVDCST N GC ++ AF+YI + +E YPY + C + +
Sbjct: 176 NLVSLSEQQLVDCSTENSGCNGGLMDTAFQYIINNGGIVTEDNYPYTA-EATECSSTKIN 234
Query: 193 ASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNH 250
+ I G++ V E+ L++ V+ QPVSVAI+A+ F FY GVFTG CG +H
Sbjct: 235 SQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEASGQDFQFYSTGVFTGKCGTALDH 294
Query: 251 GVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYP 307
GV VGYGT+ E YW+V+N WG W E G +R+ +G+ G C IA A+YP
Sbjct: 295 GVVAVGYGTSPEGIN---YWIVRNSWGPKWGEEGYIRMQQGIEAAEGKCGIAMQASYP 349
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 198 bits (504), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 124/317 (39%), Positives = 170/317 (53%), Gaps = 33/317 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E++M ++ + Y EK RF++FK N +L LN+FADLT ++F A+Y
Sbjct: 53 EKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKKITGYWLGLNEFADLTHDEFKAAY 112
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G P +++ F+ S +DW ++GAVT VK+QG CWAF+ VA
Sbjct: 113 LGLTLTPARRNSNDQ--LFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQCGSCWAFSTVA 170
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N I TG L S+ +L+DC T NGC+ ++ AF YI L +E YPY
Sbjct: 171 AVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYAFSYIAANGGLHTEESYPYL- 229
Query: 181 RQDYYCDWWRSSASGKYGA-------IRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
++ C R S G I GY+ V E+ L ++ QPVSVAI+A+
Sbjct: 230 MEEGTC--RRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALAHQPVSVAIEASGRN 287
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GGVF GPCG +HGVT VGYGT ++ Y +VKN WG++W E G +R+ RG
Sbjct: 288 FQFYSGGVFDGPCGTRLDHGVTAVGYGTASKG---HDYIIVKNSWGSHWGEKGYIRMRRG 344
Query: 292 VGG-SGLCNIAANAAYP 307
G GLC I A+YP
Sbjct: 345 TGKHDGLCGINKMASYP 361
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 122/305 (40%), Positives = 164/305 (53%), Gaps = 40/305 (13%)
Query: 30 AEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHSN 77
EK RF +FK N H L+LNKFAD+T +F ++Y G K N
Sbjct: 54 GEKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKV--------N 105
Query: 78 RSNWFKNLNSSKMSFY--------DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLN 128
F+ +F S+DW ++GAVT VKDQG CWAF+ + VEG+N
Sbjct: 106 HHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGIN 165
Query: 129 KIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYC 186
+I+T +LV+ S+ +LVDC GC +E+AFE+I+Q + +E YPY+ Q+ C
Sbjct: 166 QIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYKA-QEGTC 224
Query: 187 DWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPC 244
D S + +I G++ V E L V+ QPVSVAIDA + F FY GVFTG C
Sbjct: 225 D--ESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDC 282
Query: 245 GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAAN 303
NHGV IVGYGTT + YW+V+N WG W E G +R+ R + GLC IA
Sbjct: 283 NTDLNHGVAIVGYGTTVDGTN---YWIVRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMM 339
Query: 304 AAYPL 308
A+YP+
Sbjct: 340 ASYPI 344
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 125/301 (41%), Positives = 169/301 (56%), Gaps = 34/301 (11%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHS-- 76
EK+ RF +FK N H F L+LNKFAD+T +F Y G K H S
Sbjct: 53 EKDKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRHHYAGSK---IKHHRSFL 109
Query: 77 --NRSN-WFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRT 132
+R+N F N + S+DW ++GAVTPVKDQG CWAF+ V VEG+N+I+T
Sbjct: 110 GASRANGTFMYANVEDVP--PSVDWRKKGAVTPVKDQGKCGSCWAFSTVVAVEGINQIKT 167
Query: 133 GQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWR 190
+LV+ S+ +LVDC T GC ++ AFE+I++ + +E YPY + CD +
Sbjct: 168 NELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTEENYPYMA-EGGECDIQK 226
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTP 248
++ +I GY+ V P E+ L V+ QPVSVAI A+ F FY GVFTG CG
Sbjct: 227 RNSP--VVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQFYSEGVFTGDCGTEL 284
Query: 249 NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
+HGV IVGYGTT + YW+V+N WG W E G +R+ R + GLC IA +YP
Sbjct: 285 DHGVAIVGYGTTLDG---TKYWIVRNSWGPEWGEKGYIRMQREIDAEEGLCGIAMQPSYP 341
Query: 308 L 308
+
Sbjct: 342 I 342
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 123/327 (37%), Positives = 173/327 (52%), Gaps = 34/327 (10%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQ----AEKEMRFKIFKKNHEF------------LR 47
+S + +E WMVE + +Q AEK+ RF+IFK N + L
Sbjct: 39 SSRSDAEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTKNLSYKLG 98
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
L +FADLT +++ + Y G KP S+R + DS+DW + GAV V
Sbjct: 99 LTRFADLTNDEYRSMYLGAKPVKRVLKTSDRYE-----ARVGDALPDSVDWRKEGAVADV 153
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYI 164
KDQGS CWAF+ + VEG+NKI TG L++ S+ +LVDC T GC ++ AFE+I
Sbjct: 154 KDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFI 213
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ + +E YPY+ D CD R +A K I Y+ V +E L+ ++ QP+S
Sbjct: 214 IKNGGIDTEADYPYKA-ADGRCDQNRKNA--KVVTIDSYEDVPENSEASLKKALAHQPIS 270
Query: 225 VAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAI+A F Y GVF G CG +HGV VGYGT E + YW+V+N WG W E
Sbjct: 271 VAIEAGGRAFQLYSSGVFDGICGTELDHGVVAVGYGT----ENGKDYWIVRNSWGNRWGE 326
Query: 283 GGSMRIFRGVGG-SGLCNIAANAAYPL 308
G +++ R + +G C IA A+YP+
Sbjct: 327 SGYIKMARNIAEPTGKCGIAMEASYPI 353
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 122/323 (37%), Positives = 172/323 (53%), Gaps = 30/323 (9%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKF 51
+S ++ +E+W+V+ + EK+ RF+IFK N F L L KF
Sbjct: 31 SSRSDVEVSRLYEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGKNLSYRLGLTKF 90
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG 111
ADLT +++ + Y G + S R + +S+DW + GAV VKDQG
Sbjct: 91 ADLTNDEYRSMYLGSRLKRKATKTSLRYE-----ARVGDAIPESVDWRKEGAVAEVKDQG 145
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQ 168
S CWAF+ + VEG+NKI TG L++ S+ +LVDC T GC ++ AFE+I +
Sbjct: 146 SCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNG 205
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID 228
+ +E YPY+G D CD R +A K I Y+ V +EE L+ +S QP+SVAI+
Sbjct: 206 GIDTEEDYPYKG-VDGRCDQTRKNA--KVVTIDSYEDVPANSEESLKKALSHQPISVAIE 262
Query: 229 --ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y G+F G CG +HGV VGYGT E + YW+VKN WGT+W E G +
Sbjct: 263 GGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT----ENGKDYWIVKNSWGTSWGESGYI 318
Query: 287 RIFRGVGGS-GLCNIAANAAYPL 308
R+ R + S G C IA +YP+
Sbjct: 319 RMERNIASSAGKCGIAVEPSYPI 341
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 125/330 (37%), Positives = 180/330 (54%), Gaps = 33/330 (10%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQ-AEKEMRFKIFKKNHEF-------------- 45
M+RT + + A +EQWM + + E + RF+ F N F
Sbjct: 41 MARTEAQ---VRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGYR 97
Query: 46 LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
L +N+FADLT +F A+Y + + + + + + + +DW ++GAV
Sbjct: 98 LGINRFADLTNAEFRAAYL---SAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVA 154
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAF 161
PVK+QG CWAF+AV VEG+N+I TG+LVT S+ +LVDCS GC +++AF
Sbjct: 155 PVKNQGQCGSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAF 214
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
+I + ++ YPY R D CD + S +I G++ V E+ LQ V+ Q
Sbjct: 215 AFIVGNGGIDTDKDYPYTAR-DGKCDVAKRSR--HVVSIDGFEGVPRNDEKSLQKAVAHQ 271
Query: 222 PVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
PV+VAI+A F Y GVFTG CG + +HGV VGYGT EA+G + YWLV+N WG +
Sbjct: 272 PVAVAIEAGGREFQLYQSGVFTGRCGTSLDHGVVAVGYGT--EADGGRDYWLVRNSWGAD 329
Query: 280 WDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
W EGG +R+ R VG +G C IA A+YP+
Sbjct: 330 WGEGGYIRMERNVGARAGKCGIAMEASYPV 359
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 198 bits (503), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 122/310 (39%), Positives = 166/310 (53%), Gaps = 30/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN------------HEFLRLNKFADLTREKFLASY 63
E W+ F R Y+ EK RF+IFK N + +L LN+FADL+ E+F Y
Sbjct: 48 ESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKKVRNYWLGLNEFADLSHEEFKNKY 107
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G KP + +K++ K S+DW ++GAVTPVK+QGS CWAF+ VA
Sbjct: 108 LGLKPDLSKRAQCPEEFTYKDVAIPK-----SVDWRKKGAVTPVKNQGSCGSCWAFSTVA 162
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +L+DC T NGC ++ AF YI L E YPY
Sbjct: 163 AVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGLHKEEDYPYI- 221
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ CD + + I GY V +EE L ++ QP+S+AI+A+ F FY GG
Sbjct: 222 MEEGTCDMRKEESDAV--TISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYSGG 279
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG +HGV VGYGT+ + Y +VKN WG W E G +R+ R G+
Sbjct: 280 VFDGHCGTELDHGVAAVGYGTSKGLD----YIIVKNSWGPKWGEKGYIRMKRKTSKPEGI 335
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 336 CGIYKMASYP 345
>gi|9502426|gb|AAF88125.1|AC021043_18 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 365
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 135/340 (39%), Positives = 179/340 (52%), Gaps = 49/340 (14%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+I H+QWM +F+R YKD++EKEMR K+FKKN +F+ +N+F D
Sbjct: 33 SIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNEFTDWKT 92
Query: 57 EKFLASYTGYKPPPTDHPHS-NRSNWFKNLNSSKMSFYD-SIDWNERGAVTPVKDQG--- 111
E+FLA++TG + T N++ +N N S + D S DW + GAVTPVK QG
Sbjct: 93 EEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEGAVTPVKYQGACP 152
Query: 112 ----------SYCCWAFTAVATV------EGLNKIRTGQLVTRSKHQLVDCSTLN--GCA 153
S +T + V EGL KI L+T S+ QL+DC GC
Sbjct: 153 EFPTKQIRRNSLVGKQYTKLLGVLSDWGDEGLTKISGKNLLTLSEQQLIDCDIEKNGGCN 212
Query: 154 KNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSA-SGKYGAIRGYQYVQPATEE 212
E AF+YI + ++ E YPYQ +++ C R++A + IRG+Q V E
Sbjct: 213 GGEFEEAFKYIIKNGGVSLETEYPYQVKKE-SC---RANARRAPHTQIRGFQMVPSHNER 268
Query: 213 GLQDVVSRQPVSVAID--ATWFNFYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPY 269
L + V RQPVSV ID A F Y GGV+ G CG NH VTIVGYGT + Y
Sbjct: 269 ALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTMSGLN----Y 324
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W++KN WG +W E G MRI R V G+C IA AAYP+
Sbjct: 325 WVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 364
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 117/313 (37%), Positives = 169/313 (53%), Gaps = 31/313 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFL 60
+ E+WM E+ R YKD EK RF+IFK N + +N+F D+T +F+
Sbjct: 36 RFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSRNGNSYTLGINQFTDMTNNEFV 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYCCWA 117
A YTG P S F +++ S + SIDW GAVT VK+ GS CWA
Sbjct: 96 AQYTGVSLPLNIEREPVVS--FDDVDISAVP--QSIDWRNYGAVTSVKNHIPCGS--CWA 149
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYP 177
F A+ATVE + KI+ G L++ S+ Q++DC+ GC ++ A+++I + +AS +YP
Sbjct: 150 FAAIATVESIYKIKRGYLISLSEQQVLDCAVSYGCDGGWVNKAYDFIISNKGVASAAIYP 209
Query: 178 YQGRQDY-YCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFY 235
Y+ Q C R + I GY VQ E + VS QP++ +I+A+ F Y
Sbjct: 210 YKASQGQGTC---RINGVPNSAYITGYTRVQSNNERSMMYAVSNQPIAASIEASGDFQHY 266
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GG 294
GVF+GPCG + NH +TI+GYG + + +W+V+N WG +W E G +R+ R V
Sbjct: 267 KRGVFSGPCGTSLNHAITIIGYGQDSSG---KKFWIVRNSWGASWGERGYIRMARDVSSS 323
Query: 295 SGLCNIAANAAYP 307
SGLC IA YP
Sbjct: 324 SGLCGIAIRPLYP 336
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 122/317 (38%), Positives = 179/317 (56%), Gaps = 31/317 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREK 58
+ +E+W+V+ + Y EK RF+IFK N F+ LN+F+D+T ++
Sbjct: 31 VMTMYEKWLVKHQKVYYGLGEKNQRFQIFKDNLIFIDEHNAPNHSYRVGLNEFSDITNKE 90
Query: 59 FLASYTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+ +Y + + ++ +K +++K+ S+DW RGA+TP+K+QGS CW
Sbjct: 91 YRDTYLSRWSNNNIKNKITSVRYAYKAGHNNKLPV--SVDW--RGALTPIKNQGSCGACW 146
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASEC 174
AF+AVA VE +NKI TG LV+ S+ +LVDC GC NA+ +I + L S+
Sbjct: 147 AFSAVAAVEAINKIVTGSLVSLSEQELVDCDRTKNKGCNGGNQVNAYRFIVENGGLDSQI 206
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY GRQ C+ ++ + K +I GY+ VQ +E L + V+ QPVSV I+A F
Sbjct: 207 DYPYLGRQS-TCN--QAKKNTKVVSINGYKNVQRNSESALMEAVANQPVSVGIEAYGKDF 263
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GVFTG CG + +H V +VGYG+ E + YWLVKN WGTNW E G ++I R +
Sbjct: 264 QLYQSGVFTGSCGTSLDHAVVVVGYGS----ENGKDYWLVKNSWGTNWGERGYLKIERNL 319
Query: 293 G--GSGLCNIAANAAYP 307
+G C IA +A YP
Sbjct: 320 KNTNTGKCGIAMDATYP 336
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 127/320 (39%), Positives = 172/320 (53%), Gaps = 35/320 (10%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFL 60
A +E+W A +D +K RF +FK N HEF LRLN+F D+T ++F
Sbjct: 154 ALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRRDEPYKLRLNRFGDMTADEFR 212
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD------SIDWNERGAVTPVKDQGSY- 113
Y G + H + ++S + D S+DW ++GAVT VKDQG
Sbjct: 213 RHYAGSRV--AHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKDQGQCG 270
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLA 171
CWAF+ +A VEG+N I+T L + S+ QLVDC T GC ++ AF+YI ++ +A
Sbjct: 271 SCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVA 330
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
+E YPY+ RQ C + I GY+ V E L+ V+ QPVSVAI+A
Sbjct: 331 AEDAYPYRARQ-ASC----KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASG 385
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
+ F FY GVF+G CG +HGV VGYG T A+G + YWLVKN WG W E G +R+
Sbjct: 386 SHFQFYSEGVFSGRCGTELDHGVAAVGYGVT--ADGTK-YWLVKNSWGPEWGEKGYIRMA 442
Query: 290 RGVGG-SGLCNIAANAAYPL 308
R V G C IA A+YP+
Sbjct: 443 RDVAAKEGHCGIAMEASYPV 462
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 121/332 (36%), Positives = 177/332 (53%), Gaps = 31/332 (9%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
+ +S + +E W+V+ + Y EKE RF IFK N EF+
Sbjct: 39 LPSSSRSDDEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFKVG 98
Query: 48 LNKFADLTREKFLASYTG----YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGA 103
LNKFADLT E+F + Y G P ++ + L +++DW + GA
Sbjct: 99 LNKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKNGA 158
Query: 104 VTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENA 160
V VKDQG CWAF+ +A VEG+N+I TG+L++ S+ +LVDC T +GC ++ A
Sbjct: 159 VAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNSGCDGGLMDYA 218
Query: 161 FEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR 220
+E+I + ++ YPY + D CD +R +A K I ++ V E+ LQ V+
Sbjct: 219 YEFIINNGGIDTDADYPYTAK-DGKCDQYRKNA--KVVTIDDFEDVPENDEKALQKAVAH 275
Query: 221 QPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGT 278
QPVSVAI+A + F FY GVFTG CG +HGV VGYG+ + + YW+V+N WG
Sbjct: 276 QPVSVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYGS----DDGKDYWIVRNSWGA 331
Query: 279 NWDEGGSMRIFRGVG--GSGLCNIAANAAYPL 308
+W E G +R+ R + +G C IA +YP+
Sbjct: 332 DWGESGYIRMERNLETVKTGKCGIAIEPSYPI 363
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 197 bits (502), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 132/329 (40%), Positives = 177/329 (53%), Gaps = 39/329 (11%)
Query: 3 RTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLN 49
RT +T I +E W+V+ R Y EKE RF+IFK N +F L LN
Sbjct: 16 RTEAETRRI---YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLN 72
Query: 50 KFADLTREKFLASYTGYKPPPTDH----PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
KFADL+ +++ + Y G + P S R FK + +++DW E+GAV
Sbjct: 73 KFADLSNDEYRSVYLGTRMDGKGRLLGGPKSERY-LFKEGDD----LPETVDWREKGAVA 127
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TLN-GCAKNFLENAFE 162
PVKDQG CWAF+ V VEG+N+I TG L + S+ +LVDC T N GC ++ AF+
Sbjct: 128 PVKDQGQCGSCWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFD 187
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I + + +E YPY+ D CD R +A + I GY+ V E+ L+ V+ QP
Sbjct: 188 FIIENGGIDTEEDYPYKA-IDSMCDPNRKNA--RVVTIDGYEDVPQNDEKSLKKAVANQP 244
Query: 223 VSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSVAI+A F Y GVFTG CG +HGV VGYGT E YW+V+N WG W
Sbjct: 245 VSVAIEAGGRGFQLYQSGVFTGSCGTQLDHGVVTVGYGT----EHGVDYWIVRNSWGPAW 300
Query: 281 DEGGSMRIFRGVGG--SGLCNIAANAAYP 307
E G +R+ R V +G C IA A+YP
Sbjct: 301 GENGYIRMERDVASTETGKCGIAMEASYP 329
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 169/310 (54%), Gaps = 26/310 (8%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFL 60
+ E+WMVE+ R YKD EK RF+IFK N + +N+F D+T +F+
Sbjct: 36 RFEEWMVEYGRVYKDNDEKMRRFQIFKNNVNHIETFNSRNKDSYTLGINQFTDMTNNEFV 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A YTG P + + F +++ S + SIDW + GAVT VK+Q CWAF
Sbjct: 96 AQYTGGISRPLNIEREPVVS-FDDVDISAVP--QSIDWRDYGAVTSVKNQNPCGACWAFA 152
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A+ATVE + KI+ G L S+ Q++DC+ GC + AFE+I + +AS +YPY+
Sbjct: 153 AIATVESIYKIKKGILEPLSEQQVLDCAKGYGCKGGWEFRAFEFIISNKGVASVAIYPYK 212
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFYHGG 238
+ C +++ I GY V E + VS+QP++VA+DA +Y+ G
Sbjct: 213 AAKG-TC---KTNGVPNSAYITGYARVPRNNESSMMYAVSKQPITVAVDANANSQYYNSG 268
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGL 297
VF GPCG + NH VT +GYG + + YW+VKN WG W E G +R+ R V SG+
Sbjct: 269 VFNGPCGTSLNHAVTAIGYGQDSNG---KKYWIVKNSWGARWGEAGYIRMARDVSSSSGI 325
Query: 298 CNIAANAAYP 307
C IA ++ YP
Sbjct: 326 CGIAIDSLYP 335
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 126/328 (38%), Positives = 179/328 (54%), Gaps = 39/328 (11%)
Query: 10 NIAAKHEQWMVEFARTY----KDQAEKEMRFKIFKKNHEF-------------LRLNKFA 52
++ A +E+W + R D+ ++ RF +FK+N + L LNKFA
Sbjct: 36 SLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALNKFA 95
Query: 53 DLTREKFLASYTGYKPPPTDHPHSN--RSNWFKNLN-----SSKMSFYDSIDWNERGAVT 105
D+T ++F +Y G + T H + + F + S + ++DW RGAVT
Sbjct: 96 DMTTDEFRRTYAGSR---TRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAVT 152
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFE 162
VKDQG CWAF+A+A VEG+NKI TG+LV+ S+ +LVDC ++ GC ++ AF+
Sbjct: 153 GVKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQ 212
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
YI++ + +E YPY Q C+ + + I GY+ V E+ LQ V+ QP
Sbjct: 213 YIQRNGGVTTESNYPYLAEQ-RSCNKAKERSHDV--TIDGYEDVPANNEDALQKAVASQP 269
Query: 223 VSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
V+VAI+A+ F FY GVFTG CG +HGV VGYGTT + YW VKN WG +W
Sbjct: 270 VAVAIEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDG---TKYWTVKNSWGEDW 326
Query: 281 DEGGSMRIFRGVGGS-GLCNIAANAAYP 307
E G +R+ RGV S GLC IA +YP
Sbjct: 327 GERGYIRMQRGVPDSRGLCGIAMEPSYP 354
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 124/328 (37%), Positives = 172/328 (52%), Gaps = 27/328 (8%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------L 48
M + + +E+W+V+ + Y EK+ RF+IFK N F+ L
Sbjct: 21 MDTSMRSNEEVMTMYEEWLVKHHKVYNGLGEKDQRFEIFKDNLGFIDEHNAQNYTYKVGL 80
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNR-SNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
NKFAD T E++ Y G K + + + + +S +DW +GAV +
Sbjct: 81 NKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTGHRYAFNSGDRLPVHVDWRSKGAVAHI 140
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYI 164
KDQGS CWAF+ +ATVE +NKI TG+LV+ S+ +LVDC GC ++ AFE+I
Sbjct: 141 KDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVDCDRAFNEGCNGGLMDYAFEFI 200
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ + +E YPY+G + CD R +A K +I GY+ V E L+ V QPVS
Sbjct: 201 VENGGIDTEQDYPYKGFEG-RCDPTRKNA--KVVSIDGYEDVPAYNENALKKAVFHQPVS 257
Query: 225 VAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAI+A Y GVFTG CG +HGV +VGYG E YWLV+N WGTNW E
Sbjct: 258 VAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYGF----ENGVDYWLVRNSWGTNWGE 313
Query: 283 GGSMRIFRGVG--GSGLCNIAANAAYPL 308
G ++ R V +G C IA A+YP+
Sbjct: 314 DGYFKLERNVKKINTGKCGIAMQASYPV 341
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 123/310 (39%), Positives = 168/310 (54%), Gaps = 30/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E W+ + Y EK RF++FK+N + +L LN+FADL+ E+F + +
Sbjct: 48 ESWISGHGKAYNSLEEKLHRFEVFKENLKHIDQRNKEVTSYWLGLNEFADLSHEEFKSKF 107
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G P + P S F + + SIDW ++GAVTPVK+QGS CWAF+ VA
Sbjct: 108 LGLYP---EFPRKKSSEDFSYRDVVDLP--KSIDWRKKGAVTPVKNQGSCGSCWAFSTVA 162
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I G L + S+ QL+DC T NGC ++ AFE+I L E YPY
Sbjct: 163 AVEGINQIVAGNLTSLSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYL- 221
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ CD R + I GY V E+ L ++ QP+SVAIDA+ F FY GG
Sbjct: 222 MEEGTCDEKREEM--EVVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYSGG 279
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF+GPCG +HGV VGYG+++ + Y +VKN WG W E G +R+ R G GL
Sbjct: 280 VFSGPCGTDLDHGVAAVGYGSSSGID----YIIVKNSWGPKWGERGYLRMKRNTGKPEGL 335
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 336 CGINKMASYP 345
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 197 bits (501), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 118/310 (38%), Positives = 169/310 (54%), Gaps = 26/310 (8%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKFLASYTG 65
W+ + + Y E+ RF+IFK N F+ L KFADLT E++ A + G
Sbjct: 7 WLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQNHTYKVGLTKFADLTNEEYRAMFLG 66
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
+ ++S + + +S+DW +GAV P+KDQGS CWAF+ VA V
Sbjct: 67 TRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAFSTVAAV 126
Query: 125 EGLNKIRTGQLVTRSKHQLVDCS-TLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQ 182
EG+N+I TG+L++ S+ +LVDC T N GC ++ AF++I L +E YPY G
Sbjct: 127 EGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTEKDYPYVGDD 186
Query: 183 DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGGVF 240
D + + +I G++ V P E+ LQ V+ QPVSVAI+A+ FY GVF
Sbjct: 187 DKCDKDKMKTKA---VSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQSGVF 243
Query: 241 TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--SGLC 298
TG CG +HGV +VGY + E YWLV+N WGT W E G +++ R VG +G C
Sbjct: 244 TGECGTALDHGVVVVGYAS----ENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGRC 299
Query: 299 NIAANAAYPL 308
IA ++YP+
Sbjct: 300 GIAMESSYPV 309
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 122/305 (40%), Positives = 163/305 (53%), Gaps = 40/305 (13%)
Query: 30 AEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHSN 77
EK RF +FK N H L+LNKFAD+T +F ++Y G K N
Sbjct: 54 GEKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSKV--------N 105
Query: 78 RSNWFKNLNSSKMSFY--------DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLN 128
F+ +F S+DW ++GAVT VKDQG CWAF+ + VEG+N
Sbjct: 106 HHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGIN 165
Query: 129 KIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYC 186
+I+T +LV+ S+ +LVDC GC +E+AFE+I+Q + +E YPY Q+ C
Sbjct: 166 QIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYTA-QEGTC 224
Query: 187 DWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPC 244
D S + +I G++ V E L V+ QPVSVAIDA + F FY GVFTG C
Sbjct: 225 D--ESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDC 282
Query: 245 GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAAN 303
NHGV IVGYGTT + YW+V+N WG W E G +R+ R + GLC IA
Sbjct: 283 NTDLNHGVAIVGYGTTVDGTN---YWIVRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMM 339
Query: 304 AAYPL 308
A+YP+
Sbjct: 340 ASYPI 344
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 197 bits (500), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 128/319 (40%), Positives = 172/319 (53%), Gaps = 39/319 (12%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLAS 62
+E+W + + +K RF +FK N H L+LNKFAD+T +F ++
Sbjct: 40 YERWR-SYRTVSRSLGDKHKRFNVFKANVMHVHNTNKMDKPYKLKLNKFADMTNHEFRST 98
Query: 63 YTGYKPPPTDH-------PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
Y G K +H P N + ++ + S S DW + GAVT VKDQG
Sbjct: 99 YAGSK---VNHHRMFQGTPRGNGTFMYEKVGSVP----PSADWRKNGAVTGVKDQGQCGS 151
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLAS 172
CWAF+ V VEG+N+I+T +LV+ S+ +LVDC T GC +E+AFE+I+Q + +
Sbjct: 152 CWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQKGGITT 211
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY QD CD S A+ +I G++ V E L V+ QPVSVAIDA F
Sbjct: 212 ESNYPYTA-QDGTCD--ASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAGGF 268
Query: 233 N--FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
+ FY GVFTG C NHGV IVGYGTT + YW V+N WG W E G +R+ R
Sbjct: 269 DFQFYFEGVFTGDCSTELNHGVAIVGYGTTVDGTN---YWTVRNSWGPEWGEQGYIRMQR 325
Query: 291 GV-GGSGLCNIAANAAYPL 308
+ GLC IA A+YP+
Sbjct: 326 SIFKKEGLCGIAMMASYPI 344
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 125/327 (38%), Positives = 179/327 (54%), Gaps = 32/327 (9%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKF 51
T + +E+W+V+ + Y EKE RF+IFK N F L LN+F
Sbjct: 36 TPRTNDQVLTMYEEWLVKHGKNYNALGEKEKRFEIFKDNLGFIDEHNSKNLSFRLGLNRF 95
Query: 52 ADLTREKFLASYTGYKPPPT--DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKD 109
ADLT E++ + G + P + ++++N + K+ +S+DW + GAV VKD
Sbjct: 96 ADLTNEEYRTRFLGTRINPNRRNRKVNSQTNRYATRVGDKLP--ESVDWRKEGAVVGVKD 153
Query: 110 QGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQ 166
QGS CWAF+A+A VEG+NK+ TG L++ S+ +LVDC T GC ++ AFE+I
Sbjct: 154 QGSCGSCWAFSAIAAVEGVNKLATGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIN 213
Query: 167 YQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG-LQDVVSRQPVSV 225
L E YPY+ D CD R +A K +I Y+ V PA +EG L+ V+ Q ++V
Sbjct: 214 MVALTPEEDYPYRA-IDGRCDQNRKNA--KVVSIDQYEDV-PAYDEGALKKAVANQVIAV 269
Query: 226 AID--ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
A++ F Y GVFTG CG +HGV VGYGT E + YW+V+N WG +W E
Sbjct: 270 AVEGGGREFQLYDSGVFTGRCGTALDHGVAAVGYGT----ENGKDYWIVRNSWGGSWGEA 325
Query: 284 GSMRIFRGVG--GSGLCNIAANAAYPL 308
G +R+ R + SG C IA +YP+
Sbjct: 326 GYIRLERNLATSKSGKCGIAIEPSYPI 352
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 127/313 (40%), Positives = 171/313 (54%), Gaps = 36/313 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + ++ Y+ EK RF+IF N + +L LN+FADL+ E+F + Y
Sbjct: 48 ESWMSKHSKAYRSIEEKLHRFEIFLDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKSKY 107
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G + + P S F + + +S+DW +GAVTPVK+QGS CWAF+ VA
Sbjct: 108 LGLR---VEFPRKRSSRGFSYGDVEDLP--ESVDWRTKGAVTPVKNQGSCGSCWAFSTVA 162
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYPY-- 178
VEG+N+I TG L + S+ +L+DC S NGC ++ AF+YI L E YPY
Sbjct: 163 AVEGINQIVTGNLTSLSEQELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLM 222
Query: 179 -QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFY 235
+GR C R + I GY+ V E+ L +S QPVSVAI+A+ F FY
Sbjct: 223 EEGR----C--IREKEQFEVVTISGYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFY 276
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG- 294
GG+FTG CG +HGVT VGYG++ EG Y +VKN WG W E G +R+ R G
Sbjct: 277 KGGIFTGRCGTQMDHGVTAVGYGSS---EGTD-YIIVKNSWGPKWGENGYIRMKRNTGKP 332
Query: 295 SGLCNIAANAAYP 307
GLC I A+YP
Sbjct: 333 EGLCGINQMASYP 345
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 197 bits (500), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 120/313 (38%), Positives = 165/313 (52%), Gaps = 26/313 (8%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLAS 62
+EQW+V+ + Y EK+ RF IFK N F L LN+FADLT E++ A
Sbjct: 4 YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNADNRTYKLGLNRFADLTNEEYRAR 63
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
Y G + P ++ + + +S+DW AV PVKDQG+ CWAF+ +
Sbjct: 64 YLGTRIDPNRRFVKTKTQSNRYAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWAFSTI 123
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VEG+NKI TG L++ S+ +LVDC T GC ++ A+E+I + SE YPY+
Sbjct: 124 GAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEEDYPYR 183
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID--ATWFNFYHG 237
D CD +R +A K I Y+ V E L+ V+ QPVSVAI+ F Y
Sbjct: 184 A-VDGTCDQYRKNA--KVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYVS 240
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--S 295
GVFTG CG +HGV VGYG+ YW+V+N WG +W E G +R+ R + S
Sbjct: 241 GVFTGRCGTALDHGVVAVGYGSVK----GHDYWIVRNSWGASWGEEGYVRLERNLAKSRS 296
Query: 296 GLCNIAANAAYPL 308
G C IA +YP+
Sbjct: 297 GKCGIAIEPSYPI 309
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 121/316 (38%), Positives = 179/316 (56%), Gaps = 33/316 (10%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREK 58
A ++ W+ E R+Y E E RF++F N F L +N+FADLT E+
Sbjct: 51 AAYDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEE 110
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
F A++ G K +R+ + + +S+DW E+GAV PVK+QG CWA
Sbjct: 111 FRATFLGAKVV-----ERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 165
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASEC 174
F+AV+TVE +N++ TG+++T S+ +LV+CST GC +++AF++I + + +E
Sbjct: 166 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTED 225
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY+ D CD R +A K +I G++ V E+ LQ V+ QPVSVAI+A F
Sbjct: 226 DYPYKA-VDGKCDINRENA--KVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREF 282
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
YH GVF+G CG + +HGV VGYGT + + YW+V+N WG W E G +R+ R +
Sbjct: 283 QLYHSGVFSGRCGTSLDHGVVAVGYGT----DNGKDYWIVRNSWGPKWGESGYVRMERNI 338
Query: 293 G-GSGLCNIAANAAYP 307
+G C IA A+YP
Sbjct: 339 NVTTGKCGIAMMASYP 354
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 123/303 (40%), Positives = 167/303 (55%), Gaps = 38/303 (12%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDH----- 73
EK RF +FK+N H L+LNKFAD+T +F ++Y G K +H
Sbjct: 55 EKHKRFNVFKENVMHVHNTNKMDKPYKLKLNKFADMTNHEFRSTYAGSK---VNHHKMFR 111
Query: 74 --PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKI 130
H N + ++ + S S +DW ++GAVT VKDQG CWAF+ V VEG+N+I
Sbjct: 112 GTQHGNGTFMYEKVGSVPAS----VDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQI 167
Query: 131 RTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDW 188
+T +LV+ S+ +LVDC GC +E+AFE+I+Q + +E YPY Q+ CD
Sbjct: 168 KTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGITTESNYPYTA-QEGTCD- 225
Query: 189 WRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGN 246
S + +I G++ V E L V+ QPVSVAIDA + F FY GV TG C
Sbjct: 226 -ASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVLTGDCNT 284
Query: 247 TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAA 305
NHGV IVGYGTT + YW+V+N WG W E G +R+ R + GLC IA A+
Sbjct: 285 DLNHGVAIVGYGTTVDGTN---YWIVRNSWGPEWGEQGYIRMQRNISKKEGLCGIAMMAS 341
Query: 306 YPL 308
YP+
Sbjct: 342 YPI 344
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 196 bits (499), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 125/316 (39%), Positives = 172/316 (54%), Gaps = 29/316 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQ-AEKEMRFKIFKKNHEF------------LRLNKFADLTRE 57
+ A ++QW + + + + AE E RF IFK N +F L LN FADLT E
Sbjct: 37 VMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNLKFIDEINAQNLPYRLGLNVFADLTNE 96
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ + Y G K + + + L DSIDW +GAV PVKDQGS CW
Sbjct: 97 EYRSRYLGGKFASGSRRNRTSNRYLPRLGDD---LPDSIDWRAKGAVAPVKDQGSCGSCW 153
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASEC 174
AF+ VA+VE +N+I TG L+ S+ +LVDC S GC ++ AFE+I + L +E
Sbjct: 154 AFSTVASVEAINQIVTGDLIALSEQELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEE 213
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID--ATWF 232
YPY G D C ++ +A K AI Y+ V E+ LQ VS+Q VSVAI+ F
Sbjct: 214 DYPYYGF-DSSCIQYKKNA--KVVAIDSYEDVPVNNEKALQKAVSKQVVSVAIEGGGRSF 270
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y G+FTG CG +HGV +VGYG+ EG YW+V+N WG +W E G +++ R +
Sbjct: 271 QLYQSGIFTGRCGTDLDHGVNVVGYGS----EGGVDYWIVRNSWGGSWGESGYVKMQRNI 326
Query: 293 GG-SGLCNIAANAAYP 307
+GLC IA +YP
Sbjct: 327 ASPTGLCGIAMEPSYP 342
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 121/310 (39%), Positives = 173/310 (55%), Gaps = 29/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN------------HEFLRLNKFADLTREKFLASY 63
E W+ + + Y+ EK +RF+IFK N + +L LN+F+DL+ E+F Y
Sbjct: 34 ESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKKVVNYWLGLNEFSDLSHEEFKNKY 93
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G K ++ ++ +K++ MS S+DW ++GAVT VK+QGS CWAF+ VA
Sbjct: 94 LGLKVDMSERRECSQEFNYKDV----MSIPKSVDWRKKGAVTDVKNQGSCGSCWAFSTVA 149
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +LVDC T N GC ++ AF YI L E YPY
Sbjct: 150 AVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAFSYIISNGGLHKEVDYPYI- 208
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ C+ + + + I GY V +EE L ++ QP+SVAI+A+ F FY GG
Sbjct: 209 MEEGTCEMRKEES--EVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRDFQFYSGG 266
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG +HGV VGYG+T + Y +VKN WG+ W E G +R+ R G +GL
Sbjct: 267 VFDGHCGTQLDHGVAAVGYGSTNGLD----YIIVKNSWGSKWGEKGYIRMKRNTGKPAGL 322
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 323 CGINKMASYP 332
>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 334
Score = 196 bits (499), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 131/321 (40%), Positives = 174/321 (54%), Gaps = 42/321 (13%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+I H+QWM +F+R YKD++EKEMR K+FKKN +F+ +N+F D
Sbjct: 33 SIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSYTLGVNEFTDWKT 92
Query: 57 EKFLASYTGYKPPPTDHPHS-NRSNWFKNLNSSKMSFYD-SIDWNERGAVTPVKDQGSYC 114
E+FLA++TG + T N++ +N N S + D S DW + GAVTPVK QG+ C
Sbjct: 93 EEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEGAVTPVKYQGA-C 151
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLAS 172
L KI L+T S+ QL+DC GC E AF+YI + ++
Sbjct: 152 -----------RLTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEFEEAFKYIIKNGGVSL 200
Query: 173 ECVYPYQGRQDYYCDWWRSSA-SGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID--A 229
E YPYQ +++ C R++A + IRG+Q V E L + V RQPVSV ID A
Sbjct: 201 ETEYPYQVKKE-SC---RANARRAPHTQIRGFQMVPSHNERALLEAVRRQPVSVLIDARA 256
Query: 230 TWFNFYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
F Y GGV+ G CG NH VTIVGYGT + YW++KN WG +W E G MRI
Sbjct: 257 DSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTMSGLN----YWVLKNSWGESWGENGYMRI 312
Query: 289 FRGVG-GSGLCNIAANAAYPL 308
R V G+C IA AAYP+
Sbjct: 313 RRDVEWPQGMCGIAQVAAYPV 333
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 196 bits (499), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 124/315 (39%), Positives = 171/315 (54%), Gaps = 28/315 (8%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFL 60
A +E+W A +D +K RF +FK+N H+F LRLN+F D+T ++F
Sbjct: 45 ALYERWRGRHA-VARDLGDKARRFNVFKENVRLIHDFNQRDEPYKLRLNRFGDMTADEFR 103
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKN-LNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
Y G + +R + + + S+DW ++GAVT VKDQG CWAF
Sbjct: 104 RHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKDQGQCGSCWAF 163
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVY 176
+ +A VEG+N I+T L + S+ QLVDC T GC ++ AF+YI ++ +A+E Y
Sbjct: 164 STIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAKHGGVAAEDAY 223
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNF 234
PY+ RQ +S A I GY+ V E L+ V+ QPVSVAI+A + F F
Sbjct: 224 PYKARQ---ASCKKSPAPAV--TIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQF 278
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVF G CG +HGVT VGYG A+G + YW+VKN WG W E G +R+ R V
Sbjct: 279 YSEGVFAGRCGTELDHGVTAVGYGVA--ADGTK-YWVVKNSWGPEWGEKGYIRMARDVAA 335
Query: 295 -SGLCNIAANAAYPL 308
G C IA A+YP+
Sbjct: 336 KEGHCGIAMEASYPV 350
>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 123/311 (39%), Positives = 165/311 (53%), Gaps = 33/311 (10%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+ QW R+Y E+ RF++++ N E+ L N+FADLT E+FL
Sbjct: 44 RFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADLTGEEFL 103
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAF 118
A Y G H S + + S + S+DW +GAVTPVK+QGS C CWAF
Sbjct: 104 ARYAG------GHTGSAITTAAEADGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAF 157
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNG-CAKNFLENAFEYIRQYQRLASECVYP 177
+AVAT+E L I+TG+LV S+ QLVDC +G C K + AF++I + + + YP
Sbjct: 158 SAVATMESLYFIKTGKLVALSEQQLVDCDKYDGGCNKGYYHRAFQWIMENGGITTAAQYP 217
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-TWFNFYH 236
Y+ + SA+ I G+ V E LQ V+RQP+ VAI+ FY
Sbjct: 218 YKAVRG------ACSAAKPAVTITGHLAVAK-NELALQSAVARQPIGVAIEVPISMQFYK 270
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSG 296
GVF+ CG +H V VGYG +A G + YWLVKN WG W E G +R+ R VGG G
Sbjct: 271 SGVFSAACGIQMSHAVVTVGYGA--DASGLK-YWLVKNSWGQTWGEAGYIRMRRDVGGGG 327
Query: 297 LCNIAANAAYP 307
LC IA + AYP
Sbjct: 328 LCGIALDTAYP 338
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 196 bits (498), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 112/316 (35%), Positives = 174/316 (55%), Gaps = 27/316 (8%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTRE 57
+I A+ E++ +F +Y + E+ R +F +N + L +N+FADLT E
Sbjct: 14 DIDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGHTYTLGVNQFADLTVE 73
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F +Y G+K P + + + + + S+DW+ +GAVTPVK+QG CW
Sbjct: 74 EFSKTYMGFKKPAQKY---GDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGSCW 130
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
+F+ ++EG N+I TG+LV+ S+ Q VDC+ GC +++AF+Y + L +E
Sbjct: 131 SFSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKY-AEANALCTE 189
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY+G D C S G++ GY+ V +E+ + V++QPVS+AI+A +
Sbjct: 190 QSYPYKGT-DGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKSV 248
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y GGV TG CG + +HGV VGYGT + + YW VKN WG+ W G + + RG
Sbjct: 249 FQLYSGGVLTGACGASLDHGVLAVGYGTLSGTD----YWKVKNSWGSTWGMSGYVLLQRG 304
Query: 292 VGGSGLCNIAANAAYP 307
GGSG C + + +YP
Sbjct: 305 KGGSGECGLLSEPSYP 320
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 196 bits (497), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 170/311 (54%), Gaps = 28/311 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKF 59
+H +WM E R Y D EK R+ +FK+N E L +N+FADLT E+F
Sbjct: 31 RHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQFADLTNEEF 90
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWA 117
+ YTG+K + ++ F+ N S + S+DW ++GAVTP+KDQG C CWA
Sbjct: 91 RSMYTGFKGNSVLSSRTKPTS-FRYQNVSSDALPVSVDWRKKGAVTPIKDQG-LCGSCWA 148
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVY 176
F+AVA +EG+ +I+ G+L++ S+ +LVDC T + GC ++ AF Y L SE Y
Sbjct: 149 FSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDGGCMGGLMDTAFNYTITIGGLTSESNY 208
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNF 234
PY+ + C++ ++ +I+G++ V E+ L V+ PVS+ I F F
Sbjct: 209 PYK-STNGTCNFNKTKQIAT--SIKGFEDVPANDEKALMKAVAHHPVSIGIAGGDIGFQF 265
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVF+G C +HGVT VGYG ++ YW++KN WG W E G MRI + +
Sbjct: 266 YSSGVFSGECTTHLDHGVTAVGYG---RSKNGLKYWILKNSWGPKWGERGYMRIKKDIKP 322
Query: 295 S-GLCNIAANA 304
G C +A NA
Sbjct: 323 KHGQCGLAMNA 333
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 116/313 (37%), Positives = 168/313 (53%), Gaps = 29/313 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLA 61
+EQW+VE + Y EKE RFKIFK N +F+ L +FADLT E+F A
Sbjct: 44 YEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRA 103
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
Y K T +K + D +DW GAV VKDQG+ CWAF+A
Sbjct: 104 IYLRKKMERTKDSVKTERYLYKEGDV----LPDEVDWRANGAVVSVKDQGNCGSCWAFSA 159
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASECVYP 177
V VEG+N+I TG+L++ S+ +LVDC GC + AFE+I + + ++ YP
Sbjct: 160 VGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGIETDQDYP 219
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFY 235
Y C+ +++ + + I GY+ V E+ L+ V+ QPVSVAI+A+ F Y
Sbjct: 220 YNANDLGLCNADKNNNT-RVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASSQAFQLY 278
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
GV TG CG + +HGV +VGYG+T+ + YW+++N WG NW + G +++ R +
Sbjct: 279 KSGVMTGTCGISLDHGVVVVGYGSTS----GEDYWIIRNSWGLNWGDSGYVKLQRNIDDP 334
Query: 296 -GLCNIAANAAYP 307
G C IA +YP
Sbjct: 335 FGKCGIAMMPSYP 347
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 122/303 (40%), Positives = 167/303 (55%), Gaps = 39/303 (12%)
Query: 31 EKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDH----- 73
EK RF +FK N L+LN+FAD+T +F + Y G K +H
Sbjct: 55 EKHNRFNVFKGNVMHVHSSNKMDKPYKLKLNRFADMTNHEFRSIYAGSK---VNHHRMFR 111
Query: 74 --PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKI 130
P N + ++N++ S+DW ++GAVT VKDQG CWAF+ + VEG+N+I
Sbjct: 112 GTPRGNGTFMYQNVDRVP----SSVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQI 167
Query: 131 RTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDW 188
+T +LV S+ +LVDC T GC +E+AFE+I+QY + + YPY+ + D CD
Sbjct: 168 KTHKLVPLSEQELVDCDTTQNQGCNGGLMESAFEFIKQYG-ITTASNYPYEAK-DGTCD- 224
Query: 189 WRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGN 246
S + +I G++ V E L V+ QPVSVAI+A F FY GVFTG CG
Sbjct: 225 -ASKVNEPAVSIDGHENVPVNNEAALLKAVAHQPVSVAIEAGGIDFQFYSEGVFTGNCGT 283
Query: 247 TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAA 305
+HGV IVGYGTT + YW VKN WG+ W E G +R+ R + GLC IA A+
Sbjct: 284 ALDHGVAIVGYGTTQDG---TKYWTVKNSWGSEWGEKGYIRMKRSISVKKGLCGIAMEAS 340
Query: 306 YPL 308
YP+
Sbjct: 341 YPI 343
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 122/300 (40%), Positives = 170/300 (56%), Gaps = 32/300 (10%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHS-- 76
EK+ RF +FK N H F L+LNKFAD+T +F Y G K H +
Sbjct: 53 EKDKRFNVFKANVHYVHNFNKKDKPYKLKLNKFADMTNHEFRHHYAGSK---IKHHRTFL 109
Query: 77 --NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
+R+N + + + S ++DW ++GAVTPVKDQG CWAF+ V VEG+N+I+T
Sbjct: 110 GASRANG-TFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGSCWAFSTVVAVEGINQIKTN 168
Query: 134 QLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
+LV+ S+ +LVDC T GC ++ AFE+I++ + +E YPY + CD +
Sbjct: 169 ELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTEENYPYMA-EGGECDIQKR 227
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPN 249
++ +I G++ V P E L V+ QPVSVAI A+ F FY GVFTG CG +
Sbjct: 228 NSP--VVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQFYSEGVFTGDCGTELD 285
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
HGV IVGYGTT + + YW+VKN WG W E G +R+ R + GLC IA +YP+
Sbjct: 286 HGVAIVGYGTTLD---RTKYWIVKNSWGPEWGEKGYIRMQREIDAEEGLCGIAMQPSYPI 342
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 195 bits (496), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 119/300 (39%), Positives = 172/300 (57%), Gaps = 33/300 (11%)
Query: 31 EKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLASYTGYKPPPTDHP 74
E+E RF+ F N F L +N+FADLT ++F A+Y G K P
Sbjct: 73 ERERRFRAFWDNLNFVDAHNARAAAGEEGYRLGMNRFADLTNDEFRAAYLGVKAQRA-RP 131
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
+++ + ++ +++DW E+GAV PVK+QG CWAF+AV+TVE +N+I TG
Sbjct: 132 GRMVGERYRHDGAEELP--EAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVTG 189
Query: 134 QLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWR 190
++VT S+ +LV+C T +GC +++AFE+I + + +E YPY+ D CD R
Sbjct: 190 EMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKA-IDGRCDVLR 248
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTP 248
+A K +I G++ V E+ LQ V+ QPVSVAI+A F YH GVF+G CG
Sbjct: 249 KNA--KVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQL 306
Query: 249 NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYP 307
+HGV VGYGT E + YW+V+N WG NW E G +R+ R + SG C IA ++YP
Sbjct: 307 DHGVVAVGYGT----ENGKDYWIVRNSWGPNWGESGYLRMERNINVTSGKCGIAMMSSYP 362
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 118/300 (39%), Positives = 172/300 (57%), Gaps = 33/300 (11%)
Query: 31 EKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLASYTGYKPPPTDHP 74
E+E RF+ F N F L +N+FADLT ++F A+Y G K P
Sbjct: 70 ERERRFRAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGQRA-RP 128
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
+++ + ++ +++DW E+GAV PVK+QG CWAF+A++TVE +N+I TG
Sbjct: 129 GRVVGERYRHDGAEELP--EAVDWREKGAVAPVKNQGQCGSCWAFSAISTVESINQIVTG 186
Query: 134 QLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWR 190
++VT S+ +LV+C T +GC +++AFE+I + + +E YPY+ D CD R
Sbjct: 187 EMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKA-IDGRCDVLR 245
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTP 248
+A K +I G++ V E+ LQ V+ QPVSVAI+A F YH GVF+G CG
Sbjct: 246 KNA--KVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTQL 303
Query: 249 NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYP 307
+HGV VGYGT E + YW+V+N WG NW E G +R+ R + SG C IA ++YP
Sbjct: 304 DHGVVAVGYGT----ENGKDYWIVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSYP 359
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 123/316 (38%), Positives = 165/316 (52%), Gaps = 28/316 (8%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
IA E W + +TY Q EK R K+F+ N++F L LN FADLT
Sbjct: 26 IAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHH 85
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F AS G + + +RSN + + S+DW + GAVT VKDQG+ CW
Sbjct: 86 EFKASRLGLSSAASASLNVDRSN--RQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACW 143
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASEC 174
+F+A +EG+NKI TG LV+ S+ +LVDC S NGC ++ AF+++ + +E
Sbjct: 144 SFSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEE 203
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WF 232
YPYQGR D C+ + I GY V E+ L V+ QPVSV I + F
Sbjct: 204 DYPYQGR-DRSCN--KEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAF 260
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y G+FTGPC + +H V IVGYG+ E YW+VKN WG+ W G M + R
Sbjct: 261 QLYSKGIFTGPCSTSLDHAVLIVGYGS----ENGVDYWIVKNSWGSYWGMDGYMHMQRNS 316
Query: 293 GGS-GLCNIAANAAYP 307
G S GLC I A+YP
Sbjct: 317 GSSRGLCGINMLASYP 332
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 194 bits (494), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 129/321 (40%), Positives = 177/321 (55%), Gaps = 31/321 (9%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKF 59
A +E+W +D AEK RF +F++N HEF LRLN+FADLT ++F
Sbjct: 47 ALYERWRARHT-VSRDLAEKSRRFNVFRENARLVHEFNLRRDAPYKLRLNRFADLTSDEF 105
Query: 60 LASYTGYKPPPTD--HPHSNRSNWFKNLNSSKMS----FYDSIDWNERGAVTPVKDQGSY 113
SY + P + +N + S + S+DW E+GAVT VKDQG
Sbjct: 106 RRSYASSRVSHHRMFKPRAANNNDDDDDKGSSFTHGGALPTSVDWREKGAVTGVKDQGQC 165
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRL 170
CWAF+ +A VEG+N IRT L + S+ QLVDC T GC +++AF YI ++ +
Sbjct: 166 GSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCDGGLMDDAFSYIAKHGGV 225
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA- 229
A+E YPY+ RQ C+ +++A+ +I GY+ V E L+ V+ QPV+VAI+A
Sbjct: 226 AAEKSYPYRARQSSSCNSKKAAAA--VVSIDGYEDVPRNDETALKKAVAAQPVAVAIEAG 283
Query: 230 -TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
+ F FY GVF G CG +HGV VGYG T + YW+VKN WG W E G +R+
Sbjct: 284 GSHFQFYSEGVFAGKCGTELDHGVAAVGYGVTVDG---TKYWIVKNSWGEEWGEKGYIRM 340
Query: 289 FRGVGG-SGLCNIAANAAYPL 308
R V GLC IA A+YP+
Sbjct: 341 KRDVADKEGLCGIAMEASYPV 361
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 194 bits (494), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 119/318 (37%), Positives = 173/318 (54%), Gaps = 39/318 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
++ +++ W +++ YKD AE+E +IFK N + L +N+FADL E
Sbjct: 35 LSERYKHWKIKYRVIYKDDAEEEKHIQIFKHNVAYIDSFNAAGNKSYKLTINRFADLPTE 94
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYC 114
+ K PT S+ FK N + + ++DW +RGAVTPVK+Q GS
Sbjct: 95 PSDDGFKKRKLEPTT------SSLFKYKNITDIPA--AVDWRKRGAVTPVKNQRECGS-- 144
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVD---CSTLNGCAKNFLENAFEYIRQYQRLA 171
CWAF+AV +EG+ +I +G LV+ S+ +LVD + NGC +L +AFE++ + +A
Sbjct: 145 CWAFSAVGALEGIQQITSGNLVSLSEQELVDRVRSNWTNGCNGGYLIDAFEFVLENGGIA 204
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT- 230
+E YPY+G + S + I+ Y+ V +E+ L VV+ QPVSV ID +
Sbjct: 205 TEASYPYRGVKGN-----NSKKVSRQVQIKSYEQVPRNSEDSLLKVVANQPVSVGIDISG 259
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY G+FTG CG PNH V IVGYGT+ + YWLVKN WG W E +R+ R
Sbjct: 260 MIRFYSSGIFTGECGTKPNHAVIIVGYGTSNDG---TKYWLVKNSWGIRWGEKRYIRMKR 316
Query: 291 GVGG-SGLCNIAANAAYP 307
+ GLC I +A+YP
Sbjct: 317 DIDAKEGLCGIPMDASYP 334
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 120/326 (36%), Positives = 176/326 (53%), Gaps = 32/326 (9%)
Query: 8 TGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKF 51
I A+ ++W+ + Y E+ R IF N EF LRLN
Sbjct: 63 VATIEARFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHL 122
Query: 52 ADLTREKF---LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
ADLTRE+F L K + P + +NW + ++ +++DW RGAVTPVK
Sbjct: 123 ADLTREEFKHMLGYDASKKRVESSSPPVDAANW----EYADVTPPETMDWVSRGAVTPVK 178
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYI 164
+QG CWAF+ V VEG+ ++TG L++ S+ +LV C+ + NGC ++N FE+I
Sbjct: 179 NQGQCGSCWAFSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWI 238
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ + + E + Y + D C+W++ + K +I G++ V E+ L+ VS+QPV+
Sbjct: 239 VENRGVDDEEDWGYLAK-DRRCNWFKKRRA-KAASIDGFKDVPRNDEDALKKAVSQQPVA 296
Query: 225 VAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
VAI+A F Y GGVF G CG +HGV +VGYG E+ G + YW VKN WG W E
Sbjct: 297 VAIEADHREFQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGE 356
Query: 283 GGSMRIFR-GVGGSGLCNIAANAAYP 307
G +RI R G+G +G C +A A+YP
Sbjct: 357 EGYIRIARGGMGPAGQCGVAMQASYP 382
>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
Length = 291
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 111/262 (42%), Positives = 154/262 (58%), Gaps = 25/262 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFL 60
+ E+WM E+ R YKD EK RF+IFK N + +NKF D+T +F+
Sbjct: 36 RFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFV 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A YTG P + + F ++N S + SIDW + GAVT VKDQ CWAF+
Sbjct: 96 AQYTGGISRPLNIEKEPVVS-FDDVNISAVG--QSIDWRDYGAVTEVKDQNPCGSCWAFS 152
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A+ATVEG+ KI TG LV+ S+ +++DC+ NGC F++NA+++I +ASE YPYQ
Sbjct: 153 AIATVEGIYKIVTGYLVSLSEQEVLDCAVSNGCDGGFVDNAYDFIISNNGVASEADYPYQ 212
Query: 180 GRQ-DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YH 236
Q D + W +SA I GY YV+ E ++ V QP++ AIDA+ NF Y+
Sbjct: 213 AYQGDCAANSWPNSA-----YITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYN 267
Query: 237 GGVFTGPCGNTPNHGVTIVGYG 258
GGVF+GPCG + NH +TI+GYG
Sbjct: 268 GGVFSGPCGTSLNHAITIIGYG 289
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 194 bits (493), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 118/322 (36%), Positives = 170/322 (52%), Gaps = 47/322 (14%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLA 61
+EQW+VE + Y EKE RFKIFK N +F+ L +FADLT E+F A
Sbjct: 44 YEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFADLTNEEFRA 103
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFY---------DSIDWNERGAVTPVKDQGS 112
Y R +N +S K Y D +DW GAV VKDQG+
Sbjct: 104 IYL-------------RKKMERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGN 150
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQ 168
CWAF+AV VEG+N+I TG+L++ S+ +LVDC GC + AFE+I +
Sbjct: 151 CGSCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNG 210
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID 228
+ ++ YPY C+ +++ + + I GY+ V E+ L+ V+ QPVSVAI+
Sbjct: 211 GIETDQDYPYNANDLGLCNADKNNNT-RVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIE 269
Query: 229 AT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
A+ F Y GV TG CG + +HGV +VGYG+T+ + YW+++N WG NW + G +
Sbjct: 270 ASSQAFQLYKSGVMTGTCGISLDHGVVVVGYGSTS----GEDYWIIRNSWGLNWGDSGYV 325
Query: 287 RIFRGVGGS-GLCNIAANAAYP 307
++ R + G C IA +YP
Sbjct: 326 KLQRNIDDPFGKCGIAMMPSYP 347
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 194 bits (493), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 120/311 (38%), Positives = 168/311 (54%), Gaps = 30/311 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFK--------KNHEF----LRLNKFADLTREKFLASY 63
E WM E ++ YK EK RF++F+ +N+E L LN+FADLT E+F Y
Sbjct: 52 ESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLTHEEFKGRY 111
Query: 64 TGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
G P +N+ ++++ S+DW ++GAV PVKDQG CWAF+ V
Sbjct: 112 LGLAKPQFSRKRQPSANFRYRDITD----LPKSVDWRKKGAVAPVKDQGQCGSCWAFSTV 167
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N+I TG L + S+ +L+DC T +GC ++ AF+YI L E YPY
Sbjct: 168 AAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDDYPYL 227
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
++ C + + I GY+ V +E L ++ QPVSVAI+A+ F FY G
Sbjct: 228 -MEEGICQEQKEDV--ERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYKG 284
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF G CG +HGV VGYG++ ++ Y +VKN WG W E G +R+ R G G
Sbjct: 285 GVFNGQCGTDLDHGVAAVGYGSSKGSD----YVIVKNSWGPRWGEKGFIRMKRNTGKPEG 340
Query: 297 LCNIAANAAYP 307
LC I A+YP
Sbjct: 341 LCGINKMASYP 351
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 194 bits (492), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 121/319 (37%), Positives = 170/319 (53%), Gaps = 30/319 (9%)
Query: 8 TGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFK--------KNHEF----LRLNKFADLT 55
T + E WM E ++ YK EK RF++F+ +N+E L LN+FADLT
Sbjct: 44 TDKLLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEINSYWLGLNEFADLT 103
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
E+F Y G P +N+ ++++ S+DW ++GAV PVKDQG
Sbjct: 104 HEEFKGRYLGLAKPQFSRKRQPSANFRYRDITD----LPKSVDWRKKGAVAPVKDQGQCG 159
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLA 171
CWAF+ VA VEG+N+I TG L + S+ +L+DC T +GC ++ AF+YI L
Sbjct: 160 SCWAFSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLH 219
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E YPY ++ C + + I GY+ V +E L ++ QPVSVAI+A+
Sbjct: 220 KEDDYPYL-MEEGICQEQKEDV--ERVTISGYEDVPENDDESLVKALAHQPVSVAIEASG 276
Query: 232 --FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GGVF G CG +HGV VGYG++ ++ Y +VKN WG W E G +R+
Sbjct: 277 RDFQFYKGGVFNGKCGTDLDHGVAAVGYGSSKGSD----YVIVKNSWGPRWGEKGFIRMK 332
Query: 290 RGVGG-SGLCNIAANAAYP 307
R G GLC I A+YP
Sbjct: 333 RNTGKPEGLCGINKMASYP 351
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 118/318 (37%), Positives = 170/318 (53%), Gaps = 31/318 (9%)
Query: 11 IAAKHEQWMVEF--ARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTR 56
+ + +E W+V+ A++ EK+ RF+IFK N F L L +FADLT
Sbjct: 46 VMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTN 105
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+++ + Y G K R + +SIDW ++GAV VKDQG C
Sbjct: 106 DEYRSKYLGAKM----EKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSC 161
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+ + VEG+N+I TG L+T S+ +LVDC T GC ++ AFE+I + + ++
Sbjct: 162 WAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTD 221
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY+G D CD R +A K I Y+ V +EE L+ V+ QP+S+AI+A
Sbjct: 222 KDYPYKG-VDGTCDQIRKNA--KVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRA 278
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+F G CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R
Sbjct: 279 FQLYDSGIFDGSCGTQLDHGVVAVGYGT----ENGKDYWIVRNSWGKSWGESGYLRMARN 334
Query: 292 VG-GSGLCNIAANAAYPL 308
+ SG C IA +YP+
Sbjct: 335 IASSSGKCGIAIEPSYPI 352
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 194 bits (492), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 122/314 (38%), Positives = 164/314 (52%), Gaps = 30/314 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+ QW R+Y E+ RF++++ N E+ L N+FADLT E+FL
Sbjct: 44 RFRQWQATHNRSYLSAEERLRRFEVYRTNVEYIDATNRRGGLTYELGENQFADLTGEEFL 103
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD---SIDWNERGAVTPVKDQGSYC--C 115
A Y G + + + S D S+DW +GAVTPVK+QGS C C
Sbjct: 104 ARYAGGHTGSAITTAAEADGLWSSGGSDGSLEADPPASVDWRAKGAVTPVKNQGSQCYSC 163
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNG-CAKNFLENAFEYIRQYQRLASEC 174
WAF+AVAT+E L I+TG+LV S+ QLVDC +G C K + AF++I + + +
Sbjct: 164 WAFSAVATMESLYFIKTGKLVALSEQQLVDCDKYDGGCNKGYYHRAFQWIMENGGITTAA 223
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-TWFN 233
YPY+ + SA+ I G+ V E LQ V+RQP+ VAI+
Sbjct: 224 QYPYKAVRG------ACSAAKPAVTITGHLAVAK-NELALQSAVARQPIGVAIEVPISMQ 276
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GVF+ CG +H V VGYG +A G + YWLVKN WG W E G +R+ R VG
Sbjct: 277 FYKSGVFSAACGIQMSHAVVTVGYGA--DASGLK-YWLVKNSWGQTWGEAGYIRMRRDVG 333
Query: 294 GSGLCNIAANAAYP 307
G GLC IA + AYP
Sbjct: 334 GGGLCGIALDTAYP 347
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 171/311 (54%), Gaps = 32/311 (10%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASYTG 65
W V+ ++ Y EK R++IFK+N +L LN FAD+ E+F ASY G
Sbjct: 58 WSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIAHEEFKASYLG 117
Query: 66 YKPPPT---DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
KP PH S F+ N+ + + ++DW ++GAVTPVK+QG CWAF+ V
Sbjct: 118 LKPGLARRDAQPHG--STTFRYANAVNLPW--AVDWRKKGAVTPVKNQGECGSCWAFSTV 173
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDC-STLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N+I TG+LV+ S+ +L+DC +T N GC ++ AF YI Q + +E YPY
Sbjct: 174 AAVEGINQIVTGKLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYL 233
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
++ YC K I GY+ V +E L ++ QPVSV I A F FY G
Sbjct: 234 -MEEGYCR--EKQPHSKVITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKG 290
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
G+F G CG P+H +T VGYG+ Q Y ++KN WG NW E G RI RG G G
Sbjct: 291 GIFDGECGIQPDHALTAVGYGSYY----GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEG 346
Query: 297 LCNIAANAAYP 307
+C+I A+YP
Sbjct: 347 VCDIYKIASYP 357
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 120/322 (37%), Positives = 178/322 (55%), Gaps = 34/322 (10%)
Query: 11 IAAKHEQWMVEFARTY----KDQAEKEMRFKIFKKNHEF--------------LRLNKFA 52
+ A ++ W+ E R Y + + E++ RF +F N F L +N+FA
Sbjct: 53 VRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGFRLGMNQFA 112
Query: 53 DLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
DLT ++F A+Y G P + + + ++ +S+DW E+GAV PVK+QG
Sbjct: 113 DLTNDEFRAAYLGAMVPAARRGAVVGERYRHDGAAEELP--ESVDWREKGAVAPVKNQGQ 170
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQ 168
CWAF+AV++VE +N+I TG++VT S+ +LV+CST +GC ++ AF++I +
Sbjct: 171 CGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKNG 230
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID 228
+ +E YPY+ D CD R +A + +I G++ V E+ LQ V+ QPVSVAI+
Sbjct: 231 GIDTEDDYPYRA-VDGKCDMNRKNA--RVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIE 287
Query: 229 ATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
A F Y GVF+G C +HGV VGYG AE + YW+V+N WG W E G +
Sbjct: 288 AGGREFQLYKSGVFSGSCTTNLDHGVVAVGYG----AENGKDYWIVRNSWGPKWGEAGYI 343
Query: 287 RIFRGVGGS-GLCNIAANAAYP 307
R+ R V S G C IA A+YP
Sbjct: 344 RMERNVNASTGKCGIAMMASYP 365
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 118/318 (37%), Positives = 170/318 (53%), Gaps = 31/318 (9%)
Query: 11 IAAKHEQWMVEF--ARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTR 56
+ + +E W+V+ A++ EK+ RF+IFK N F L L +FADLT
Sbjct: 46 VMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTN 105
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+++ + Y G K R + +SIDW ++GAV VKDQG C
Sbjct: 106 DEYRSKYLGAKM----EKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSC 161
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+ + VEG+N+I TG L+T S+ +LVDC T GC ++ AFE+I + + ++
Sbjct: 162 WAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTD 221
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY+G D CD R +A K I Y+ V +EE L+ V+ QP+S+AI+A
Sbjct: 222 KDYPYKG-VDGTCDQIRKNA--KVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRA 278
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+F G CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R
Sbjct: 279 FQLYDSGIFDGSCGTQLDHGVVAVGYGT----ENGKDYWIVRNSWGKSWGESGYLRMARN 334
Query: 292 VG-GSGLCNIAANAAYPL 308
+ SG C IA +YP+
Sbjct: 335 IASSSGKCGIAIEPSYPI 352
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 193 bits (491), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 118/318 (37%), Positives = 170/318 (53%), Gaps = 31/318 (9%)
Query: 11 IAAKHEQWMVEF--ARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTR 56
+ + +E W+V+ A++ EK+ RF+IFK N F L L +FADLT
Sbjct: 46 VMSIYEAWLVKHGKAQSQNSLVEKDRRFEIFKDNLRFVDEHNEKNLSYRLGLTRFADLTN 105
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+++ + Y G K R + +SIDW ++GAV VKDQG C
Sbjct: 106 DEYRSKYLGAKM----EKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSC 161
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+ + VEG+N+I TG L+T S+ +LVDC T GC ++ AFE+I + + ++
Sbjct: 162 WAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTD 221
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY+G D CD R +A K I Y+ V +EE L+ V+ QP+S+AI+A
Sbjct: 222 KDYPYKG-VDGTCDQIRKNA--KVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRA 278
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+F G CG +HGV VGYGT E + YW+V+N WG +W E G +R+ R
Sbjct: 279 FQLYDSGIFDGSCGTQLDHGVVAVGYGT----ENGKDYWIVRNSWGKSWGESGYLRMARN 334
Query: 292 VG-GSGLCNIAANAAYPL 308
+ SG C IA +YP+
Sbjct: 335 IASSSGKCGIAIEPSYPI 352
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 117/301 (38%), Positives = 168/301 (55%), Gaps = 31/301 (10%)
Query: 30 AEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLASYTGYKPPPTDH 73
A++E RF F N F L +N+FADLT ++F A+Y G K +
Sbjct: 71 ADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGA-AER 129
Query: 74 PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRT 132
+ R + + +++DW E+GAV PVK+QG CWAF+AV+TVE +N+I T
Sbjct: 130 NRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVT 189
Query: 133 GQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWW 189
G++VT S+ +LV+C +GC +++AFE+I + + +E YPY+ D CD
Sbjct: 190 GEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKA-VDGRCDVL 248
Query: 190 RSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNT 247
R +A K +I G++ V E+ LQ V+ PVSVAI+A F YH GVF+G CG
Sbjct: 249 RKNA--KVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQ 306
Query: 248 PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAY 306
+HGV VGYGT E + YW+V+N WG NW E G +R+ R + SG C IA ++Y
Sbjct: 307 LDHGVVAVGYGT----ENGKDYWIVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSY 362
Query: 307 P 307
P
Sbjct: 363 P 363
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 117/301 (38%), Positives = 168/301 (55%), Gaps = 31/301 (10%)
Query: 30 AEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLASYTGYKPPPTDH 73
A++E RF F N F L +N+FADLT ++F A+Y G K +
Sbjct: 71 ADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGA-AER 129
Query: 74 PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRT 132
+ R + + +++DW E+GAV PVK+QG CWAF+AV+TVE +N+I T
Sbjct: 130 NRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVT 189
Query: 133 GQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWW 189
G++VT S+ +LV+C +GC +++AFE+I + + +E YPY+ D CD
Sbjct: 190 GEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKA-VDGRCDVL 248
Query: 190 RSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNT 247
R +A K +I G++ V E+ LQ V+ PVSVAI+A F YH GVF+G CG
Sbjct: 249 RKNA--KVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQ 306
Query: 248 PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAY 306
+HGV VGYGT E + YW+V+N WG NW E G +R+ R + SG C IA ++Y
Sbjct: 307 LDHGVVAVGYGT----ENGKDYWIVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSY 362
Query: 307 P 307
P
Sbjct: 363 P 363
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 120/305 (39%), Positives = 165/305 (54%), Gaps = 40/305 (13%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNKFADLTREKFLASYTGYKPPPTD 72
A +E W+ + ++Y EKE RF+IFK N F+ +
Sbjct: 2 AVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFI------------------------DE 37
Query: 73 HPHSNRSNWFKNLNSSKM--SFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNK 129
H NR+ + + ++ S +S+DW ++GAV VKDQGS CWAF+ +A VEG+NK
Sbjct: 38 HNAENRTYKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINK 97
Query: 130 IRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCD 187
I TG L++ S+ +LVDC T GC ++ AFE+I + SE YPY+ D CD
Sbjct: 98 IVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKA-SDGRCD 156
Query: 188 WWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCG 245
+R +A K I GY+ V E+ L+ V+ QPVSVAI+A F Y G+FTG CG
Sbjct: 157 QYRKNA--KVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCG 214
Query: 246 NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS--GLCNIAAN 303
+HGVT VGYGT E YW+VKN WG +W E G +R+ R + S G C IA
Sbjct: 215 TALDHGVTAVGYGT----ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAME 270
Query: 304 AAYPL 308
A+YP+
Sbjct: 271 ASYPI 275
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 193 bits (490), Expect = 9e-47, Method: Compositional matrix adjust.
Identities = 119/312 (38%), Positives = 172/312 (55%), Gaps = 29/312 (9%)
Query: 16 EQWMVEFARTYKDQ-AEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLAS 62
+ WM + +TY + EKE RF+ FK N F L L +FADLT +++
Sbjct: 48 QMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDL 107
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWAFTAV 121
+ G P + ++R + L ++ +S+DW + GAV+ +KDQG+ CWAF+ V
Sbjct: 108 FPGSPKPKQRNLKTSRR--YVPLAGDQLP--ESVDWRQEGAVSEIKDQGTCNSCWAFSTV 163
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGC-AKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEGLNKI TG+L++ S+ +LVDC+ + NGC ++ AF+++ L SE YPYQ
Sbjct: 164 AAVEGLNKIVTGELISLSEQELVDCNLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQ 223
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID--ATWFNFYHG 237
G Q C+ + S S K I Y+ V E LQ V+ QPVSV +D + F Y
Sbjct: 224 GTQG-SCN-RKQSTSNKVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRS 281
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSG 296
++ GPCG +H + IVGYG+ E Q YW+V+N WGT W + G ++I R G
Sbjct: 282 CIYNGPCGTNLDHALVIVGYGS----ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKG 337
Query: 297 LCNIAANAAYPL 308
LC IA A+YP+
Sbjct: 338 LCGIAMLASYPI 349
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 193 bits (490), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 125/311 (40%), Positives = 171/311 (54%), Gaps = 32/311 (10%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASYTG 65
W V+ ++ Y EK R++IFK+N +L LN FAD+ E+F ASY G
Sbjct: 49 WSVKHSKIYASPKEKVKRYEIFKRNLRHIVETNRRNGSYWLGLNHFADIAHEEFKASYLG 108
Query: 66 YKPPPT---DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
KP PH S F+ N+ + + ++DW ++GAVTPVK+QG CWAF+ V
Sbjct: 109 LKPGLARRDAQPHG--STTFRYANAVNLPW--AVDWRKKGAVTPVKNQGECGSCWAFSTV 164
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDC-STLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N+I TG+LV+ S+ +L+DC +T N GC ++ AF YI Q + +E YPY
Sbjct: 165 AAVEGINQIVTGKLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYL 224
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
++ YC K I GY+ V +E L ++ QPVSV I A F FY G
Sbjct: 225 -MEEGYCR--EKQPHSKVITITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKG 281
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
G+F G CG P+H +T VGYG+ Q Y ++KN WG NW E G RI RG G G
Sbjct: 282 GIFDGECGIQPDHALTAVGYGSYY----GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEG 337
Query: 297 LCNIAANAAYP 307
+C+I A+YP
Sbjct: 338 VCDIYKIASYP 348
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 122/326 (37%), Positives = 180/326 (55%), Gaps = 32/326 (9%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQ-AEKEMRFKIFKKNHEF------------LRL 48
+R++ + G I + WM + +TY + EKE RF+ FK N F L L
Sbjct: 38 NRSNEEVGFI---FQMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGL 94
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
+FADLT +++ + G P + +R + L+ ++ +S+DW GAV+ +K
Sbjct: 95 TRFADLTVQEYRDLFPGSPKPKQRNLRISRR--YVPLDGDQLP--ESVDWRNEGAVSAIK 150
Query: 109 DQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGC-AKNFLENAFEYIR 165
DQG+ CWAF+ VA VEG+NKI TG+LV+ S+ +LVDC+ + NGC ++ AF+++
Sbjct: 151 DQGTCNSCWAFSTVAAVEGINKIVTGELVSLSEQELVDCNLVNNGCYGSGTMDAAFQFLI 210
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
L S+ YPYQG Q YC+ + S S K I Y+ V E LQ V+ QPVSV
Sbjct: 211 NNGGLDSDTDYPYQGSQG-YCN-RKESTSNKIITIDSYEDVPANDEISLQKAVAHQPVSV 268
Query: 226 AID--ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
+D + F Y G++ GPCG +H + IVGYG+ E Q YW+V+N WGT W +
Sbjct: 269 GVDKKSQEFMLYRSGIYNGPCGTDLDHALVIVGYGS----ENGQDYWIVRNSWGTTWGDA 324
Query: 284 GSMRIFRGVG-GSGLCNIAANAAYPL 308
G ++ R SG+C IA A+YP+
Sbjct: 325 GYAKMARNFEYPSGVCGIAMLASYPV 350
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 174/318 (54%), Gaps = 31/318 (9%)
Query: 11 IAAKHEQWMVEF--ARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTR 56
+ + +E W+V+ A+ EK+ RF+IFK N F L L +FADLT
Sbjct: 39 VMSIYEAWLVKHGKAQNQNSLVEKDRRFEIFKDNLRFIDDHNKKNLSYRLGLTRFADLTN 98
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+++ + Y G K + S ++ ++ +SIDW ++GAV VKDQGS C
Sbjct: 99 DEYRSKYLGAKMEKKGERRT--SQRYEARVGDELP--ESIDWRKKGAVAEVKDQGSCGSC 154
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+ + VEG+N+I TG L+T S+ +LVDC T GC ++ AFE+I + + ++
Sbjct: 155 WAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTD 214
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TW 231
YPY+G D CD R +A K I Y+ V +EE L+ V+ QPVSVAI+A
Sbjct: 215 KDYPYKG-VDGTCDQIRKNA--KVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIEAGGRA 271
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+F G CG +HGV VGYGT E + YW+V+N WG +W E G +++ R
Sbjct: 272 FQLYDSGIFDGTCGTQLDHGVVAVGYGT----ENGKDYWIVRNSWGKSWGESGYLKMARN 327
Query: 292 VG-GSGLCNIAANAAYPL 308
+ SG C IA +YP+
Sbjct: 328 IASSSGKCGIAIEPSYPI 345
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 123/309 (39%), Positives = 168/309 (54%), Gaps = 31/309 (10%)
Query: 19 MVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTG 65
+V+ + Y KE RF+IFK N F L LNKFADL+ E++ + + G
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
+ S+ FK ++ S+DW E+GAV PVKDQG CWAF+ VA V
Sbjct: 71 GRM--VRDRKGFESDRFKYGVGDELP--QSVDWREKGAVAPVKDQGQCGSCWAFSTVAAV 126
Query: 125 EGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQ 182
EG+N+I TG L++ S+ +LVDC GC F++ AFE+I + + +E YPY+G
Sbjct: 127 EGINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKG-V 185
Query: 183 DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVF 240
D CD R +A K I G++ V E+ L+ V+ QPVSVAI+A F Y G+F
Sbjct: 186 DGQCDQNRKNA--KVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIF 243
Query: 241 TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--SGLC 298
G CG +HGV VGYGT E + YW+V+N WG NW E G +R+ R V +G C
Sbjct: 244 NGLCGTDLDHGVVAVGYGT----EDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKC 299
Query: 299 NIAANAAYP 307
IA +YP
Sbjct: 300 GIAMQPSYP 308
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 116/311 (37%), Positives = 174/311 (55%), Gaps = 31/311 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + Y+ EK +RF++FK N + +L LN+FADL+ ++F Y
Sbjct: 48 ESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVSNYWLGLNEFADLSHQEFKNKY 107
Query: 64 TGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
G K + S+ + +++++ K S+DW ++GAVTPVK+QG CWAF+ V
Sbjct: 108 LGLKVDLSQRRESSEEEFTYRDVDLPK-----SVDWRKKGAVTPVKNQGQCGSCWAFSTV 162
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N+I TG L + S+ +L+DC T NGC ++ AF +I + L E YPY
Sbjct: 163 AAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEEDYPYI 222
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
++ C+ + + + I GY V E+ L ++ QP+SVAI+A+ F FY G
Sbjct: 223 -MEESTCEMKKEVS--EVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSG 279
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-G 296
GVF G CG+ +HGV+ VGYGT+ + Y +VKN WG W E G +R+ R +G S G
Sbjct: 280 GVFDGHCGSELDHGVSAVGYGTSKGLD----YIIVKNSWGAKWGEKGFIRMKRNIGKSEG 335
Query: 297 LCNIAANAAYP 307
+C + A+YP
Sbjct: 336 ICGLYKMASYP 346
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 167/310 (53%), Gaps = 28/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + Y+ EK +RF++FK N + +L LN+FADL+ ++F Y
Sbjct: 48 ESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVSNYWLGLNEFADLSHQEFKNKY 107
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G K + S+ F + S+DW ++GAVTPVK+QG CWAF+ VA
Sbjct: 108 LGLKVNLSQRRESSNEEEF---TYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVA 164
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +L+DC T NGC ++ AF +I Q L E YPY
Sbjct: 165 AVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKEDDYPYI- 223
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ C+ + + I GY V E+ L ++ QP+SVAI+A+ F FY GG
Sbjct: 224 MEESTCEMKKEET--QVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGG 281
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG+ +HGV+ VGYGT+ + Y +VKN WG W E G +R+ R +G G+
Sbjct: 282 VFDGHCGSDLDHGVSAVGYGTSKNLD----YIIVKNSWGAKWGEKGFIRMKRNIGKPEGI 337
Query: 298 CNIAANAAYP 307
C + A+YP
Sbjct: 338 CGLYKMASYP 347
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 167/310 (53%), Gaps = 28/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + Y+ EK +RF++FK N + +L LN+FADL+ ++F Y
Sbjct: 48 ESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIVSNYWLGLNEFADLSHQEFKNKY 107
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G K + S+ F + S+DW ++GAVTPVK+QG CWAF+ VA
Sbjct: 108 LGLKVDLSQRRESSNEEEF---TYRDVDLPKSVDWRKKGAVTPVKNQGQCGSCWAFSTVA 164
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +L+DC T NGC ++ AF +I Q L E YPY
Sbjct: 165 AVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEEDYPYI- 223
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ C+ + + I GY V E+ L ++ QP+SVAI+A+ F FY GG
Sbjct: 224 MEESTCEMKKEET--QVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYSGG 281
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG+ +HGV+ VGYGT+ + Y +VKN WG W E G +R+ R +G G+
Sbjct: 282 VFDGHCGSDLDHGVSAVGYGTSKNLD----YIIVKNSWGAKWGEKGFIRMKRDIGKPEGI 337
Query: 298 CNIAANAAYP 307
C + A+YP
Sbjct: 338 CGLYKMASYP 347
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 192 bits (489), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 122/312 (39%), Positives = 172/312 (55%), Gaps = 36/312 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E WM++ + YK+ EK RF+IFK N ++ L LN FAD++ ++F Y
Sbjct: 49 ESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKY 108
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAV 121
TG + + ++ + LN ++ + +DW ++GAVTPVK+QGS C CWAF+AV
Sbjct: 109 TG---SIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGS-CGSCWAFSAV 164
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQG 180
T+EG+ KIRTG L S+ +L+DC + GC + +A + + QY + YPY+G
Sbjct: 165 VTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQYG-IHYRNTYPYEG 223
Query: 181 RQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
Q YC RS G Y A G + VQP E L ++ QPVSV ++A F Y G
Sbjct: 224 VQR-YC---RSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRG 279
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-G 296
G+F GPCGN +H V VGYG Y L+KN WGT W E G +RI RG G S G
Sbjct: 280 GIFVGPCGNKVDHAVAAVGYGPN--------YILIKNSWGTGWGENGYIRIKRGTGNSYG 331
Query: 297 LCNIAANAAYPL 308
+C + ++ YP+
Sbjct: 332 VCGLYTSSFYPV 343
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 192 bits (488), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 119/331 (35%), Positives = 171/331 (51%), Gaps = 42/331 (12%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
M+ T N + + +F + Y+ E+ RF +F +N +F+
Sbjct: 16 MAEPLSLTVNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQNIDFINRHNAEAARGVHTH 75
Query: 48 ---LNKFADLTREKFLASYTGYKPPPTDHPHSNRSN-WFKNLNSSKMSFYDSIDWNERGA 103
+N+FADLT E++ Y +P PT+ R W N+ S+DW ++GA
Sbjct: 76 TVDVNQFADLTNEEYRQLY--LRPYPTELLGRERQEVWLDGPNAG------SVDWRQKGA 127
Query: 104 VTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLEN 159
VTP+K+QG CW+F+ +VEG + I TG LV+ S+ QLVDCS GC ++N
Sbjct: 128 VTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDN 187
Query: 160 AFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS 219
AF+YI L +E YPY R D CD +S S +I GY+ V E+ L V
Sbjct: 188 AFKYIISNGGLDTEQDYPYTAR-DGVCD--KSKESKHAVSISGYKDVPQNNEDQLAAAVE 244
Query: 220 RQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
+ PVSVAI+A F Y GVF+GPCG +HGV +VGY + YW+VKN WG
Sbjct: 245 KGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGYTSD--------YWIVKNSWG 296
Query: 278 TNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
+W + G + + RGV +G+C IA +YP+
Sbjct: 297 ASWGDQGYIMMKRGVSSAGICGIAMQPSYPI 327
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 123/321 (38%), Positives = 167/321 (52%), Gaps = 44/321 (13%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E+WM +F + Y EKE RF +F+ N F LR+N+FADLT ++F+++
Sbjct: 42 EEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEFVST 101
Query: 63 YTGYKPP-PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
+TG KPP P D P W IDW +GAVT VKDQG+ CWAF A
Sbjct: 102 HTGAKPPCPKDAPRGVDPIWLPCC----------IDWRYKGAVTDVKDQGACGSCWAFAA 151
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VA +EGL +IRTG+L S+ +LVDC T +GCA + AFE + + +E Y Y+
Sbjct: 152 VAAIEGLTQIRTGKLTPLSEQELVDCDTGSSGCAGGHTDRAFELVAAKGGITAESGYRYE 211
Query: 180 G-RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYH 236
G R D + + + G G++ V P E L V+RQPV+ IDA+ F FY
Sbjct: 212 GYRGKCRADDALFNHAARIG---GHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYG 268
Query: 237 GGVFTGPCGN---------TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
GVF GPCG+ T NH VT+VGY + + YW+ KN WG W E G +
Sbjct: 269 SGVFPGPCGSGSGAAAAAPTTNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWGEKGYIL 326
Query: 288 IFRGVGGS-GLCNIAANAAYP 307
+ + V G C +A + YP
Sbjct: 327 LEKDVASPHGTCGVAVSPFYP 347
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 119/319 (37%), Positives = 179/319 (56%), Gaps = 32/319 (10%)
Query: 11 IAAKHEQWMVEFARTYKD-QAEKEMRFKIFKKNHEF--------------LRLNKFADLT 55
+ A +E W+VE R + E + RF++F N F L +N+FADLT
Sbjct: 52 VRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFADLT 111
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++F A+Y G + P ++ +++ + ++ +S+DW E+GAV PVK+QG
Sbjct: 112 NDEFRAAYLGARIPAARSGNA-VGEMYRHDGAEELP--ESVDWREKGAVAPVKNQGQCGS 168
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+AV++VE +N+I TG++VT S+ +LV+CST +GC ++ AF +I + +
Sbjct: 169 CWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGGID 228
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
+E YPY+ D CD R +A K +I ++ V E+ LQ V+ QPVSVAI+A
Sbjct: 229 TEDDYPYKA-VDGKCDINRRNA--KVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEAGG 285
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y GVF+G C +HGV VGYGT E + YW+V+N WG W E G +R+
Sbjct: 286 RQFQLYKSGVFSGSCTTNLDHGVVAVGYGT----ENGKDYWIVRNSWGPKWGEAGYIRME 341
Query: 290 RGVGG-SGLCNIAANAAYP 307
R + +G C IA A+YP
Sbjct: 342 RNINATTGKCGIAMMASYP 360
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 192 bits (488), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 117/301 (38%), Positives = 168/301 (55%), Gaps = 31/301 (10%)
Query: 30 AEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLASYTGYKPPPTDH 73
A++E RF F N F L +N+FADLT ++F A+Y G K +
Sbjct: 71 ADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADLTNDEFRAAYLGVKGA-AER 129
Query: 74 PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRT 132
+ R + + +++DW E+GAV PVK+QG CWAF+AV+TVE +N+I T
Sbjct: 130 NRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSTVESINQIVT 189
Query: 133 GQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWW 189
G++VT S+ +LV+C +GC +++AFE+I + + +E YPY+ D CD
Sbjct: 190 GEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGIDTEDDYPYKA-VDGRCDVL 248
Query: 190 RSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNT 247
R +A K +I G++ V E+ LQ V+ PVSVAI+A F YH GVF+G CG
Sbjct: 249 RKNA--KVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREFQLYHSGVFSGRCGTQ 306
Query: 248 PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAY 306
+HGV VGYGT E + YW+V+N WG NW E G +R+ R + SG C IA ++Y
Sbjct: 307 LDHGVVAVGYGT----ENGKDYWIVRNSWGPNWGEAGYLRMERNINVTSGKCGIAMMSSY 362
Query: 307 P 307
P
Sbjct: 363 P 363
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 166/321 (51%), Gaps = 44/321 (13%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E+WM +F + Y EKE RF +F+ N F LR+N+FADLT ++F+++
Sbjct: 20 EEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEFVST 79
Query: 63 YTGYKPP-PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
+TG KPP P D P W IDW +GAVT VKDQG+ CWAF A
Sbjct: 80 HTGAKPPCPKDAPRGVDPIWLPCC----------IDWRYKGAVTDVKDQGACGSCWAFAA 129
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VA +EGL +IRTG+L S+ +LVDC T +GCA + AFE + + +E Y Y+
Sbjct: 130 VAAIEGLTQIRTGKLTPLSEQELVDCDTGSSGCAGGHTDRAFELVAAKGGITAESGYRYE 189
Query: 180 GRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
G Y A + A I G++ V P E L V+RQPV+ IDA+ F FY
Sbjct: 190 G---YRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYG 246
Query: 237 GGVFTGPCGN---------TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
GVF GPCG+ T NH VT+VGY + + YW+ KN WG W E G +
Sbjct: 247 SGVFPGPCGSGSGAAAAAPTTNHAVTLVGY--CQDGASGKKYWVAKNSWGKTWGEKGYIL 304
Query: 288 IFRGVGGS-GLCNIAANAAYP 307
+ + V G C +A + YP
Sbjct: 305 LEKDVASPHGTCGVAVSPFYP 325
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/303 (40%), Positives = 164/303 (54%), Gaps = 35/303 (11%)
Query: 30 AEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHSN 77
A + F +FK N HEF LRLN+F D+T ++F Y G + H
Sbjct: 64 ATRRAVFNVFKANVRLIHEFNRRDEPYKLRLNRFGDMTADEFRRHYAGSR---VAHHRMF 120
Query: 78 RSNWFKNLNSSKMSFYD------SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKI 130
R + + S+ + D S+DW ++GAVT VKDQG CWAF+ +A VEG+N I
Sbjct: 121 RGDRQGSSASASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAI 180
Query: 131 RTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDW 188
+T L + S+ QLVDC T GC ++ AF+YI ++ +A+E YPY+ RQ C
Sbjct: 181 KTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQ-ASC-- 237
Query: 189 WRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGN 246
+ I GY+ V E L+ V+ QPVSVAI+A + F FY GVF+G CG
Sbjct: 238 --KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGT 295
Query: 247 TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAA 305
+HGV VGYG T A+G + YWLVKN WG W E G +R+ R V G C IA A+
Sbjct: 296 ELDHGVAAVGYGVT--ADGTK-YWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEAS 352
Query: 306 YPL 308
YP+
Sbjct: 353 YPV 355
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 192 bits (487), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 120/317 (37%), Positives = 176/317 (55%), Gaps = 33/317 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREK 58
+ A+ E W+ + + YK EK RF++F++N +L LN+FADL+ E+
Sbjct: 400 LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLGLNEFADLSHEE 459
Query: 59 FLASYTGYKPPPTDHPHS-NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--C 115
F + Y G + + P S + S F+ + + + +S+DW ++GAVT VK+QG+ C C
Sbjct: 460 FKSKYLGLRA---EFPRSRDYSGEFRYRDVADLP--ESVDWRKKGAVTHVKNQGA-CGSC 513
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+ VA VEG+N+I TG L T S+ +L+DC T +GC ++ AF +I L E
Sbjct: 514 WAFSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKE 573
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
YPY ++ C+ + I GY+ V EE L ++ QP+SVAI+A+
Sbjct: 574 DDYPYL-MEEGTCEEQKEDVD--IVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRD 630
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY GGVF GPCG +HGV VGYG++ + Y +VKN WG W E G +R+ R
Sbjct: 631 FQFYSGGVFNGPCGTELDHGVAAVGYGSSKGLD----YIIVKNSWGPKWGEKGYIRMKRN 686
Query: 292 VGGS-GLCNIAANAAYP 307
G + GLC I A+YP
Sbjct: 687 TGKTEGLCGINKMASYP 703
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 121/309 (39%), Positives = 172/309 (55%), Gaps = 27/309 (8%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRL-------------NKFADLTREKFLASYT 64
W + R+Y E++ RF+++++N E + N+FADLT E+FL YT
Sbjct: 52 WQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEEEFLDLYT 111
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVA 122
P R+N + +++ + S+DW +GAVTP+K+QG C CWAF A
Sbjct: 112 MKGMPVRRDAGKKRAN--VSSSAAAVDAPTSVDWRSKGAVTPIKNQGPSCSSCWAFVTAA 169
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTLNG-CAKNFLENAFEYIRQYQRLASECVYPYQGR 181
T+E + KI TG+LV+ S+ +L+DC +G C + N + ++ Q L +E YPYQ R
Sbjct: 170 TIESITKITTGKLVSLSEQELIDCDPYDGGCNLGYFVNGYRWVIQNGGLTTEANYPYQAR 229
Query: 182 QDYYCDWWRSSASGKYGAIRGYQYVQ-PATEEGLQDVVSRQPVSVAID-ATWFNFYHGGV 239
+ Y C RS A+ I YVQ PA E LQ V++QPV+ AI+ FY GGV
Sbjct: 230 R-YACS--RSRAAQHAATIS--DYVQLPAGEGQLQQAVAQQPVAAAIEMGGSLQFYSGGV 284
Query: 240 FTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCN 299
F+G CG NH +T+VGYG ++ YWLVKN WG +W E G +R+ R VG GLC
Sbjct: 285 FSGQCGTRMNHAITVVGYGA--DSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRGGLCG 342
Query: 300 IAANAAYPL 308
IA + AYP+
Sbjct: 343 IALDLAYPV 351
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 123/310 (39%), Positives = 169/310 (54%), Gaps = 25/310 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E+W+ + + Y EK RF++FK N + L LN+FADLT ++F +Y
Sbjct: 45 EKWLAKHQKAYASFEEKLHRFEVFKDNLKLIDEINREVTSYWLGLNEFADLTHDEFKTTY 104
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G PPP + S F+ N + ++DW ++GAVT VK+QG CWAF+ VA
Sbjct: 105 LGLSPPPA---RRSSSRSFRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVA 161
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N I TG L S+ +L+DCS +GC ++ AF YI L +E YPY
Sbjct: 162 AVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYLM 221
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
+ D +S + +I GY+ V E+ L ++ QPVSVAI+A+ F FY GG
Sbjct: 222 EEGSCGDGKKSESEAV--SISGYEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSGG 279
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GL 297
VF GPCG +HGV VGYG + + +G Y +VKN WG W E G +R+ RG G S GL
Sbjct: 280 VFDGPCGAQLDHGVAAVGYG-SDKGKGHD-YIIVKNSWGGKWGEKGYIRMKRGTGKSEGL 337
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 338 CGINKMASYP 347
>gi|52076128|dbj|BAD46641.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|52076135|dbj|BAD46648.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 374
Score = 191 bits (486), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/310 (39%), Positives = 160/310 (51%), Gaps = 35/310 (11%)
Query: 27 KDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFLASYTGYKPPPTDH 73
+D A+K RF++FKKN H+F L LNKFADLT E+F A YTG P P
Sbjct: 58 RDLADKGSRFEVFKKNARYIHDFNRKKGMSYKLGLNKFADLTLEEFTAKYTGANPGPITG 117
Query: 74 PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRT 132
+ + L + + DW E GAVT VKDQG CWAF+ V VEG+N I T
Sbjct: 118 LKNGTGS--PPLAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAVEGINAIMT 175
Query: 133 GQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYY------- 185
G L+T S+ Q++DCS C+ + AF+Y +C P ++Y+
Sbjct: 176 GNLLTLSEQQVLDCSGAGDCSGGYTSYAFDYAVSNGITLDQCFSPPTTGENYFYYPAYEA 235
Query: 186 ----CDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDATW-FNFYHGGV 239
C + + A I Y +V P EE L Q V S+ PVSV I+A++ F Y GGV
Sbjct: 236 VQEPCRFDPNKA--PIVKIDSYSFVDPNDEEALKQAVYSQGPVSVLIEASYEFMIYQGGV 293
Query: 240 FTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLC 298
F+GPCG NH V +VGY E E PYW+VKN WG W E G +R+ R + G+C
Sbjct: 294 FSGPCGTELNHAVLVVGY---DETEDGTPYWIVKNSWGAGWGESGYIRMIRNIPAPEGIC 350
Query: 299 NIAANAAYPL 308
IA YP+
Sbjct: 351 GIAMYPIYPI 360
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 109/316 (34%), Positives = 169/316 (53%), Gaps = 41/316 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E W + ++Y EK R IF + L LNKF+DLT +F A
Sbjct: 42 EDWAAKHGKSYSSDLEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAM 101
Query: 63 YTG-YKPP------PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+ G +K P P + + S S S+DW ++GAVTP+KDQG
Sbjct: 102 HVGKFKRPRYQDRLPAEDEDVDVS-----------SLPTSLDWRQKGAVTPIKDQGDCGS 150
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+A+A++E + + T +LV+ S+ QL+DC T++ GC +E AF+++ + + +E
Sbjct: 151 CWAFSAIASIESAHFLATKELVSLSEQQLMDCDTVDAGCDGGLMETAFKFVVKNGGVTTE 210
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY G C+ + + K I G++ V + + L VS+ PV+V+I + N
Sbjct: 211 ASYPYTGSVGS-CNANKVAIINKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDEN 269
Query: 234 F--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+ +G CG++ +HGV ++GYGT EG PYW++KN WGT+W E G M+I R
Sbjct: 270 FQNYKSGILSGQCGDSLDHGVLLIGYGT----EGGMPYWIIKNSWGTSWGEDGFMKIERK 325
Query: 292 VGGSGLCNIAANAAYP 307
G G+C + +++YP
Sbjct: 326 -DGDGICGMNGDSSYP 340
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 117/295 (39%), Positives = 159/295 (53%), Gaps = 25/295 (8%)
Query: 31 EKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNR 78
EK+ RF +FK N L+LN+FAD+T +F A + R
Sbjct: 55 EKKKRFNVFKYNVNHINRVNQLGKPYKLKLNEFADMTNHEFKAGFDSKILHFRMLKGKRR 114
Query: 79 SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVT 137
F + ++ SIDW GAV P+K+QG CWAF+ + VEG+NKI+T QLV+
Sbjct: 115 QTPFTHAKTTDPP--PSIDWRTNGAVNPIKNQGRCGSCWAFSTIVGVEGINKIKTNQLVS 172
Query: 138 RSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGK 196
S+ +LVDC T GC +EN +E+I++ + +E +YPY R CD S +
Sbjct: 173 LSEQELVDCETDCEGCNGGLMENGYEFIKETGGVTTEQIYPYFARNG-RCDI--SKRNSP 229
Query: 197 YGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN--FYHGGVFTGPCGNTPNHGVTI 254
I G++ V E + V+ QPVS+AIDA N FY GVF G CG NHGV I
Sbjct: 230 VVKIDGFENVPANDESAMLRAVANQPVSIAIDAGGLNFQFYSQGVFNGACGTELNHGVAI 289
Query: 255 VGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
VGYGTT + YW+V+N WGT W E G +R+ RGV GLC +A +A+YP+
Sbjct: 290 VGYGTTQDGTN---YWIVRNSWGTGWGEQGYVRMQRGVNVPEGLCGLAMDASYPI 341
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 129/341 (37%), Positives = 171/341 (50%), Gaps = 45/341 (13%)
Query: 4 TSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN-HEF-----------LRLNKF 51
+SH++ +A E+W+ R Y EK RF++FK N H L LN+F
Sbjct: 50 SSHES--LAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKVSSYWLGLNEF 107
Query: 52 ADLTREKFLASYTGYKPP-----PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
ADLT ++F A+Y G + S S+DW +GAVT
Sbjct: 108 ADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSVDWRSKGAVTG 167
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEY 163
VK+QG CWAF+ VA VEG+N+I TG L S+ +L+DC T NGC ++ AF Y
Sbjct: 168 VKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCNGGLMDYAFSY 227
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGK--------------YGAIRGYQYVQPA 209
I L +E YPY ++ C RSS+S K I GY+ V
Sbjct: 228 IAHNGGLHTEEAYPYL-MEEGTCQ--RSSSSEKKWPGSSEDANDDAAVVTISGYEDVPRN 284
Query: 210 TEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQ 267
E+ L +++QPVSVAI+A+ F FY GGVF GPCG +HGV VGYGT +
Sbjct: 285 NEQALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTAAKG---H 341
Query: 268 PYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
Y +VKN WG +W E G +R+ RG G GLC I A+YP
Sbjct: 342 DYIIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMASYP 382
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 191 bits (486), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 120/349 (34%), Positives = 182/349 (52%), Gaps = 66/349 (18%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF---------------LRLNKFADLTRE 57
A ++ W+ E R+Y E+E RF++F N +F L +N+FADLT +
Sbjct: 47 AAYDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTND 106
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--- 114
+F A++ G K +R+ + + +S+DW E+GAV PVK+QG
Sbjct: 107 EFRATFLGAK-----FVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCVDRI 161
Query: 115 ------------------------------CWAFTAVATVEGLNKIRTGQLVTRSKHQLV 144
CWAF+AV+TVE +N++ TG+++T S+ +LV
Sbjct: 162 IVWNSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELV 221
Query: 145 DCST---LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIR 201
+CST +GC +++AF++I + + +E YPY+ D CD R +A K +I
Sbjct: 222 ECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKA-VDGKCDINRENA--KVVSID 278
Query: 202 GYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGT 259
G++ V E+ LQ V+ QPVSVAI+A F YH GVF+G CG + +HGV VGYGT
Sbjct: 279 GFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGT 338
Query: 260 TTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
+ + YW+V+N WG W E G +R+ R + +G C IA A+YP
Sbjct: 339 ----DNGKDYWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYP 383
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 191 bits (485), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 115/313 (36%), Positives = 167/313 (53%), Gaps = 41/313 (13%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLA 61
+E+W+VE + Y EKE R KIFK+N +F+ L +FADLT ++
Sbjct: 2 YERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDE--- 58
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
P D ++R L D IDW +GAV PVKDQG+ CWAF+A
Sbjct: 59 --------PKDFMKADRY-----LYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWAFSA 105
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASECVYP 177
V VEG+N+I+TG+L++ S +L+DC GC + AFE+I + S+ YP
Sbjct: 106 VGAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYP 165
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFY 235
Y C+ + + + + I GY+YV E+ L+ V+ QPV VAI+A+ F Y
Sbjct: 166 YTATDLGVCNADKKNNT-RVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLY 224
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
GVFTG CG +HGV +VGYGT++ + YW+++N WG NW E G +++ R + S
Sbjct: 225 KSGVFTGTCGIYLDHGVVVVGYGTSS----GEDYWIIRNSWGLNWGENGYVKLQRNIDDS 280
Query: 296 -GLCNIAANAAYP 307
G C +A +YP
Sbjct: 281 FGKCGVAMMPSYP 293
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 191 bits (485), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 120/312 (38%), Positives = 170/312 (54%), Gaps = 31/312 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E W+ F + Y+ EK +RF++FK N + +L LN+FADL+ E+F Y
Sbjct: 52 ENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADLSHEEFKKMY 111
Query: 64 TGYKPPPT--DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
G K D S ++++ + S +DW ++GAV VK+QGS CWAF+
Sbjct: 112 LGLKTDIVRRDEERSYAEFAYRDVEAVPKS----VDWRKKGAVAEVKNQGSCGSCWAFST 167
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPY 178
VA VEG+NKI TG L T S+ +L+DC T NGC ++ AFEYI + L E YPY
Sbjct: 168 VAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPY 227
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
++ C+ + + + I G+Q V E+ L ++ QP+SVAIDA+ F FY
Sbjct: 228 S-MEEGTCEMQKDES--ETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYS 284
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-S 295
GGVF G CG +HGV VGYG++ ++ Y +VKN WG W E G +R+ R G
Sbjct: 285 GGVFDGRCGVDLDHGVAAVGYGSSKGSD----YIIVKNSWGPKWGEKGYIRLKRNTGKPE 340
Query: 296 GLCNIAANAAYP 307
GLC I A++P
Sbjct: 341 GLCGINKMASFP 352
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 119/319 (37%), Positives = 178/319 (55%), Gaps = 36/319 (11%)
Query: 13 AKHEQWMVEFARTYKDQ--AEKEMRFKIFKKNHEF---------------LRLNKFADLT 55
A ++ W+ E + E E RF +F N +F L +N+FADLT
Sbjct: 50 AAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLT 109
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
E+F A++ G K +R+ + + +S+DW E+GAV PVK+QG
Sbjct: 110 NEEFRATFLGAKVA-----ERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGS 164
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLA 171
CWAF+AV+TVE +N++ TG+++T S+ +LV+CST +GC +++AF++I + +
Sbjct: 165 CWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGID 224
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
+E YPY+ D CD R +A K +I G++ V E+ LQ V+ QPVSVAI+A
Sbjct: 225 TEDDYPYKA-VDGKCDINRENA--KVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGG 281
Query: 232 --FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F YH GVF+G CG + +HGV VGYGT + + YW+V+N WG W E G +R+
Sbjct: 282 REFQLYHSGVFSGRCGTSLDHGVVAVGYGT----DNGKDYWIVRNSWGPKWGESGYVRME 337
Query: 290 RGVG-GSGLCNIAANAAYP 307
R + +G C IA A+YP
Sbjct: 338 RNINVTTGKCGIAMMASYP 356
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 168/310 (54%), Gaps = 30/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + + Y+ EK +RF+IFK N + +L LN+FADL+ ++F Y
Sbjct: 48 ESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKY 107
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G K + S +K++ K S+DW ++GAV PVK+QGS CWAF+ VA
Sbjct: 108 LGLKVDYSRRRESPEEFTYKDVELPK-----SVDWRKKGAVAPVKNQGSCGSCWAFSTVA 162
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +L+DC NGC ++ AF +I + L E YPY
Sbjct: 163 AVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI- 221
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ C+ + + I GY V E+ L ++ QP+SVAI+A+ F FY GG
Sbjct: 222 MEEGTCEMTKEET--EVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGG 279
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG+ +HGV VGYGT + Y +VKN WG+ W E G +R+ R +G G+
Sbjct: 280 VFDGHCGSDLDHGVAAVGYGTAKGVD----YIIVKNSWGSKWGEKGYIRMRRNIGKPEGI 335
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 336 CGIYKMASYP 345
>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 128/300 (42%), Positives = 167/300 (55%), Gaps = 26/300 (8%)
Query: 27 KDQAEKEMRFKIFKKN--HEF----------LRLNKFADLTREKFLASYTGYKPPPTDHP 74
++ EK RF +FK+N H F L+LNKFAD++ +F+ Y
Sbjct: 52 RNLKEKHKRFSVFKENVNHVFTVNQMDKPYKLKLNKFADMSNYEFVNFYARSNISHYRKL 111
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
H R + S+DW ERGAV VK+QG CWAF++VA VEG+NKI+T
Sbjct: 112 HERRRGAGGFMYEQDTDLPSSVDWRERGAVNAVKEQGRCGSCWAFSSVAAVEGINKIKTN 171
Query: 134 QLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSS 192
QL++ S+ +L+DC+ N GC F+E AF++I++ +A+E YPY G + C RSS
Sbjct: 172 QLLSLSEQELLDCNYRNKGCNGGFMEIAFDFIKRNGGIATENSYPYHGSRG-LC---RSS 227
Query: 193 -ASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPN 249
S I GY+ V P E+ L V+ QPVSVAIDA F FY GVF G CG N
Sbjct: 228 RISSPIVKIDGYESV-PENEDALMQAVANQPVSVAIDAAGRDFQFYSQGVFDGYCGTELN 286
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
HGV +GYGTT E YWLV+N WG W E G +R+ RGV GLC IA A+YP+
Sbjct: 287 HGVVAIGYGTT---EDGTDYWLVRNSWGVGWGEDGYVRMKRGVEQAEGLCGIAMEASYPI 343
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 125/320 (39%), Positives = 169/320 (52%), Gaps = 32/320 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREK 58
+A + W + + Y E+ RF ++K N E+++ L KFADLT E+
Sbjct: 41 LAGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQRHSEKNLSYWLGLTKFADLTNEE 100
Query: 59 FLASYTGYKPPPTDHPHSNR--SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
F YTG + + R + F+ NS SIDW E+GAVT VKDQGS C
Sbjct: 101 FRRQYTGTRIDRSRRLKKGRNATGSFRYANSEAPK---SIDWREKGAVTSVKDQGSCGSC 157
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASE 173
WAF+AV +VEG+N IRTG ++ S +LVDC GC ++ AF+++ Q + +E
Sbjct: 158 WAFSAVGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGGLMDYAFDFVIQNGGIDTE 217
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
YPYQG D CD + +A + I Y+ V EE L+ V+ QPVSVAI+A
Sbjct: 218 KDYPYQG-YDGRCDVNKMNA--RVVTIDSYEDVPENDEEALKKAVAGQPVSVAIEAGGRD 274
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y GGVFTG CG +HGV VGYG+ E YW+VKN WG W E G +R+ R
Sbjct: 275 FQLYSGGVFTGRCGTDLDHGVLAVGYGS----EKGLDYWIVKNSWGEYWGESGYLRMQRN 330
Query: 292 V---GGSGLCNIAANAAYPL 308
+ G GLC I +Y +
Sbjct: 331 LKDDNGYGLCGINIEPSYAV 350
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 191 bits (484), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 108/264 (40%), Positives = 155/264 (58%), Gaps = 12/264 (4%)
Query: 50 KFADLTREKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
+FA++T ++F + YTGYK +S F+ N S + ++DW ++GAVTP+K
Sbjct: 1 QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQ 166
+QGS CCWAF+AVA +EG +I+ G+L++ S+ QLVDC T + GC+ ++ AFE+I
Sbjct: 61 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCSGGLIDTAFEHIMA 120
Query: 167 YQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVA 226
L +E YPY+G +D C + S +I GY+ V E L V+ QPVSV
Sbjct: 121 TGGLTTESNYPYKG-EDATCKIKSTXPSA--ASITGYEDVPVNDENALMKAVAHQPVSVG 177
Query: 227 IDATWFN--FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
I+ F+ FY GVFTG C +H VT VGY +++ YW++KN WGT W EGG
Sbjct: 178 IEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGY---SQSSAGSKYWIIKNSWGTKWGEGG 234
Query: 285 SMRIFRGV-GGSGLCNIAANAAYP 307
MRI + + GLC +A A+YP
Sbjct: 235 YMRIKKDIKDKEGLCGLAMKASYP 258
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 190 bits (483), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 124/332 (37%), Positives = 175/332 (52%), Gaps = 36/332 (10%)
Query: 1 MSRTSHKTGNIAAKHEQ---WMVEFARTYKDQAEKEMRFKIFKKNHEFLR---------- 47
+ R + GN EQ W + + Y E R+ ++K N E+++
Sbjct: 29 LLRMTTDLGNERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQRHSEKNRSYW 88
Query: 48 --LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
L KFAD+T ++F YTG + + S R F+ +S +S+DW ++GAVT
Sbjct: 89 LGLTKFADITNDEFRRQYTGTRIDRS--KRSKRKTGFRYADSEAP---ESVDWRKKGAVT 143
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFE 162
VKDQGS CWAF+A+ +VEG+N IRTG+ V+ S+ +LVDC GC ++ AF+
Sbjct: 144 TVKDQGSCGSCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFD 203
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I + + +E YPY+G D CD + + I GY+ V EE L+ V+ QP
Sbjct: 204 FILENGGIDTENDYPYKGL-DGRCD--NNKKNAHVVTIDGYEDVPENDEEALKKAVAGQP 260
Query: 223 VSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VSVAI+A F Y GGVFTG CG +HGV VGYG+ EG YW+VKN WG W
Sbjct: 261 VSVAIEAGGRDFQLYSGGVFTGECGTDLDHGVLAVGYGS----EGSLDYWIVKNSWGEYW 316
Query: 281 DEGGSMRIFRGVGGS----GLCNIAANAAYPL 308
E G +R+ R + S GLC I +Y +
Sbjct: 317 GESGYLRMQRNIKDSNHQFGLCGINIEPSYAV 348
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 190 bits (483), Expect = 7e-46, Method: Compositional matrix adjust.
Identities = 117/326 (35%), Positives = 175/326 (53%), Gaps = 40/326 (12%)
Query: 10 NIAAKHEQWMVEFAR-TYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLT 55
+++ ++ W +F + + + RF+ FK+N + L LN+F+DLT
Sbjct: 8 DLSGEYASWCAKFGKECASSNSLGDHRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLT 67
Query: 56 REKFLASYTGYKPPPTDHP------HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKD 109
E+F + G +P D P S+ F+N++ S+DW + GAVT KD
Sbjct: 68 SEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVD-----LPASVDWRQHGAVTAPKD 122
Query: 110 QGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS--TLNGCAKNFLENAFEYIR 165
QGS C CWAF +EG+N+I TGQLV+ S+ +L+DC GC +ENA+++I
Sbjct: 123 QGS-CGGCWAFATTGAIEGINQIVTGQLVSLSEQELIDCDKKADKGCDGGLMENAYQFIV 181
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ L +E YPY + +C+ + ++ + AI GY+ + E+ L V++QPVSV
Sbjct: 182 ENGGLDTETDYPYHASES-HCNMKKLNS--RVVAIDGYKAIPEGDEQALLLAVAKQPVSV 238
Query: 226 AIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AI+ +F H GVFTG CG NHGV IVGYGT E YW+VKN W W +G
Sbjct: 239 AIEGASKDFQHYASGVFTGHCGEEINHGVLIVGYGT----EDGLDYWIVKNSWAATWGDG 294
Query: 284 GSMRIFRGVGG-SGLCNIAANAAYPL 308
G +++ R G GLC+I A+YP+
Sbjct: 295 GFVKMQRNTGKRGGLCSINTLASYPV 320
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 116/314 (36%), Positives = 168/314 (53%), Gaps = 38/314 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E W ++ +TY + EK R K+F++NH F L LN FADLT +F AS
Sbjct: 30 EAWCEQYGKTYSSEEEKASRLKVFEENHAFVTQHNSMANASYTLALNAFADLTHHEFKAS 89
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFY--DSIDWNERGAVTPVKDQGSYC--CWAF 118
G+ P R+ +++ + + ++DW + GAVT VKDQG+ C CW+F
Sbjct: 90 RLGFSP--------GRAQSIRSVGTPVQELHVPPAVDWRKSGAVTGVKDQGN-CGGCWSF 140
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVY 176
+ +EG+NKI TG LV+ S+ +LVDC S +GC ++ A++++ + Q + SE Y
Sbjct: 141 STTGAIEGINKIVTGSLVSLSEQELVDCDRSYNSGCEGGLMDYAYQFVIKNQGIDSEADY 200
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNF 234
PY G D C+ + I GY + P E+ L VV++QPVSV I + F
Sbjct: 201 PYVG-MDKPCN--KEKLKKHIVTIDGYTDIPPNDEKQLLQVVAKQPVSVGICGSEKTFQL 257
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG- 293
Y GV+TGPC +T +H V IVGYGT E +W+VKN WG +W G + + R G
Sbjct: 258 YSKGVYTGPCSSTLDHAVLIVGYGT----EDGVDFWIVKNSWGEHWGMRGYIHMLRNNGT 313
Query: 294 GSGLCNIAANAAYP 307
G+C I A+YP
Sbjct: 314 AEGICGINMLASYP 327
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 123/347 (35%), Positives = 180/347 (51%), Gaps = 47/347 (13%)
Query: 1 MSRT-SHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------- 48
M R+ S ++ + ++W + ++Y AE+ RF++ +N ++
Sbjct: 35 MERSMSTDDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAAGLT 94
Query: 49 -----NKFADLTREKFLASYTGYKP---PPTDHPHSNRSN-------------WFKNLNS 87
+ DLT ++F+A YT P P + + R+ + NL++
Sbjct: 95 YELGETAYTDLTNQEFMAMYTAPAPAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLST 154
Query: 88 SKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC 146
S + S+DW GAVTPVK+QG CWAF+ VA VEG+ +IRTG+LV+ S+ +LVDC
Sbjct: 155 SAPA---SVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDC 211
Query: 147 STL-NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQY 205
TL +GC A +I + +E YPY G D C+ R+ S +I G +
Sbjct: 212 DTLDDGCDGGISYRALRWIASNGGITTETDYPYTGTTD-ACN--RAKLSHNAVSIAGLRR 268
Query: 206 VQPATEEGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEA 263
V +E L + V+ QPV+V+I+A NF H GV+ GPCG NHGVT+VGYG EA
Sbjct: 269 VATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYG--QEA 326
Query: 264 EGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--SGLCNIAANAAYPL 308
G YW+VKN WG W + G +R+ + V G GLC IA +YPL
Sbjct: 327 AGGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 190 bits (482), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 119/327 (36%), Positives = 166/327 (50%), Gaps = 28/327 (8%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------L 48
+ + G + +EQW+VE + Y EKE RFKIFK N + + L
Sbjct: 28 TESQRNEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGL 87
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP-V 107
NKF+DLT ++F ASY G K +K + D +DW ERGAV P V
Sbjct: 88 NKFSDLTADEFQASYLGGKMEKKSLSDVAERYQYKEGDV----LPDEVDWRERGAVVPRV 143
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEY 163
K QG CWAF A VEG+N+I TG+LV+ S+ +L+DC N GCA AFE+
Sbjct: 144 KRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEF 203
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I++ + S+ VY Y G C + + I G++ V E L+ V+ QP+
Sbjct: 204 IKENGGIVSDEVYGYTGEDTAACKAIEMKTT-RVVTINGHEVVPVNDEMSLKKAVAYQPI 262
Query: 224 SVAIDATWFNFYHGGVFTGPCGNT-PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
SV I A + Y GV+ G C N +H V IVGYGT+++ + YWL++N WG W E
Sbjct: 263 SVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSD---EGDYWLIRNSWGPEWGE 319
Query: 283 GGSMRIFRGV-GGSGLCNIAANAAYPL 308
GG +R+ R +G C +A YP+
Sbjct: 320 GGYLRLQRNFHEPTGKCAVAVAPVYPI 346
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 125/328 (38%), Positives = 171/328 (52%), Gaps = 48/328 (14%)
Query: 3 RTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNK 50
RT + I +E W+ + + Y E E RF+IFK N +F+ L
Sbjct: 36 RTDEEVKEI---YELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNSENHTYKMGLTP 92
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDS-------IDWNERGA 103
+ DLT E+F A Y G + +D H + + +N S+ Y++ IDW ++GA
Sbjct: 93 YTDLTNEEFQAIYLGTR---SDTIHRLK----RTINISERYAYEAGDNLPEQIDWRKKGA 145
Query: 104 VTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAF 161
VTPVK+QG CWAF+ V+TVE +N+IRTG L++ S+ QLVDC+ N GC A+
Sbjct: 146 VTPVKNQGKCGSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKKNHGCKGGAFVYAY 205
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
+YI + +E YPY+ Q A+ K I GY+ V E L+ V+ Q
Sbjct: 206 QYIIDNGGIDTEANYPYKAVQG------PCRAAKKVVRIDGYKGVPHCNENALKKAVASQ 259
Query: 222 PVSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
P VAIDA+ F Y G+F+GPCG NHGV IVGY + YW+V+N WG
Sbjct: 260 PSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVVIVGY--------WKDYWIVRNSWGRY 311
Query: 280 WDEGGSMRIFRGVGGSGLCNIAANAAYP 307
W E G +R+ R VGG GLC IA YP
Sbjct: 312 WGEQGYIRMKR-VGGCGLCGIARLPYYP 338
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 190 bits (482), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 119/327 (36%), Positives = 166/327 (50%), Gaps = 28/327 (8%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------L 48
+ + G + +EQW+VE + Y EKE RFKIFK N + + L
Sbjct: 28 TESQRNEGGVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGL 87
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP-V 107
NKF+DLT ++F ASY G K +K + D +DW ERGAV P V
Sbjct: 88 NKFSDLTADEFQASYLGGKMEKKSLSDVAERYQYKEGDV----LPDEVDWRERGAVVPRV 143
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEY 163
K QG CWAF A VEG+N+I TG+LV+ S+ +L+DC N GCA AFE+
Sbjct: 144 KRQGECGSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEF 203
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
I++ + S+ VY Y G C + + I G++ V E L+ V+ QP+
Sbjct: 204 IKENGGIVSDEVYGYTGEDTAACKAIEMKTT-RVVTINGHEVVPVNDEMSLKKAVAYQPI 262
Query: 224 SVAIDATWFNFYHGGVFTGPCGNT-PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
SV I A + Y GV+ G C N +H V IVGYGT+++ + YWL++N WG W E
Sbjct: 263 SVMISAANMSDYKSGVYKGACSNLWGDHNVLIVGYGTSSD---EGDYWLIRNSWGPEWGE 319
Query: 283 GGSMRIFRGV-GGSGLCNIAANAAYPL 308
GG +R+ R +G C +A YP+
Sbjct: 320 GGYLRLQRNFHEPTGKCAVAVAPVYPI 346
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 189 bits (481), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 119/319 (37%), Positives = 177/319 (55%), Gaps = 36/319 (11%)
Query: 13 AKHEQWMVEFARTYKDQ--AEKEMRFKIFKKNHEF---------------LRLNKFADLT 55
A ++ W+ E + E E RF +F N +F L +N+FADLT
Sbjct: 49 AAYDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLT 108
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
E+F A++ G K +R+ + + +S+DW E+GAV PVK+QG
Sbjct: 109 NEEFRATFLGAKVA-----ERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGS 163
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLA 171
CWAF+AV+TVE +N++ TG+++T S+ +LV+CST +GC + +AF++I + +
Sbjct: 164 CWAFSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGID 223
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
+E YPY+ D CD R +A K +I G++ V E+ LQ V+ QPVSVAI+A
Sbjct: 224 TEDDYPYKA-VDGKCDINRENA--KVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGG 280
Query: 232 --FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F YH GVF+G CG + +HGV VGYGT + + YW+V+N WG W E G +R+
Sbjct: 281 REFQLYHSGVFSGRCGTSLDHGVVAVGYGT----DNGKDYWIVRNSWGPKWGESGYVRME 336
Query: 290 RGVG-GSGLCNIAANAAYP 307
R + +G C IA A+YP
Sbjct: 337 RNINVTTGKCGIAMMASYP 355
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 121/312 (38%), Positives = 166/312 (53%), Gaps = 29/312 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E W E +TY + +K RFKIF++N+EF L LN FADLT +F AS
Sbjct: 33 ESWTKEHGKTYTSKEDKLYRFKIFEENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFKAS 92
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
G T S R+ L+ SIDW ++GAV+ VKDQG+ CW+F+A
Sbjct: 93 RLGLSAFSTSGKLSRRN---FPLHDFVGDVPISIDWRKKGAVSQVKDQGNCGACWSFSAT 149
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+EG+NKI TG LV+ S+ +LVDC S NGC ++ A++++ + + +E YPYQ
Sbjct: 150 GAIEGINKIVTGSLVSLSEQELVDCDRSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQ 209
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHG 237
R+ C+ + I GY V E+ L V+ QPVSV I + F Y
Sbjct: 210 AREK-TCN--KEKLKRHVVTIDGYTDVPQNNEKELLKAVAAQPVSVGICGSERAFQLYSK 266
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-G 296
G+FTGPC + +H V IVGYG+ E YW+VKN WGT+W G M + R G S G
Sbjct: 267 GIFTGPCSTSLDHAVLIVGYGS----ENGVDYWIVKNSWGTHWGINGYMYMLRNSGNSQG 322
Query: 297 LCNIAANAAYPL 308
LC I A++P+
Sbjct: 323 LCGINMLASFPV 334
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 189 bits (481), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 120/346 (34%), Positives = 179/346 (51%), Gaps = 46/346 (13%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------ 48
M S+ ++ + ++W + ++Y AE+ RF+++ +N ++
Sbjct: 36 MGSMSNDDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAAGLTY 95
Query: 49 ----NKFADLTREKFLASYTG---YKPPPTDHPHSNRSN-------------WFKNLNSS 88
+ DLT ++F+A YT + P + + R+ + NL++S
Sbjct: 96 ELGETAYTDLTNQEFMAMYTAPALAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSAS 155
Query: 89 KMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS 147
+ S+DW GAVTPVK+QG CWAF+ VA VEG+ +IRTG+LV+ S+ +LVDC
Sbjct: 156 APA---SVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCD 212
Query: 148 TL-NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYV 206
TL +GC A +I + +E YPY G D C+ R+ S +I G + V
Sbjct: 213 TLDDGCDGGISYRALRWIASNGGITTEADYPYTGTTD-ACN--RAKLSHNAVSIAGLRRV 269
Query: 207 QPATEEGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAE 264
+E L + V+ QPV+V+I+A NF H GV+ GPCG NHGVT+VGYG EA
Sbjct: 270 ATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYG--QEAA 327
Query: 265 GQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--SGLCNIAANAAYPL 308
YW+VKN WG W + G +R+ + V G GLC IA +YPL
Sbjct: 328 AGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 120/318 (37%), Positives = 170/318 (53%), Gaps = 31/318 (9%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM ++ R Y D AEK R ++F N L LN F+DLT E+F
Sbjct: 40 RHERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEFA 99
Query: 61 ASYTGYKPPPTD---HPHSNRSNWFKNLNSSKM-SFYDSIDWNERGAVTPVKDQGSYC-- 114
++ GY+ P P + N+ +++ S DS+DW RGAVTPVK QG +C
Sbjct: 100 QTHLGYRHQPGPGGLRPEDSSPAAAVNVTDAQLQSTPDSVDWRARGAVTPVKHQG-HCGS 158
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TLNGCAKNFLENAFEYIRQYQRLASE 173
CWAF AVA EGL +I TG L++ S+ Q++DC+ + C ++ A YI L +E
Sbjct: 159 CWAFAAVAATEGLVQIATGNLISMSEQQVLDCTGGTSSCKSGYVNAALTYITASGGLQTE 218
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG-LQDVVSRQPVSVAIDATW- 231
Y Y Q S S A+ ++ +EG LQ +V+ QPV+VA++A
Sbjct: 219 AAYAYSAEQGACRSGGASPNSAA--AVGVHRSAMLNGDEGALQVLVAGQPVAVAVEAEPD 276
Query: 232 FNFYHGGVFTG--PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F+ Y GV+ G CG +H VT+VGYG + +G YW+VKN+WG W E G MR+
Sbjct: 277 FHHYKSGVYVGSPSCGQKLHHAVTVVGYGADGDGQG---YWVVKNQWGAGWGEVGYMRLT 333
Query: 290 RGVGGSGLCNIAANAAYP 307
RG GG+ C +A +A YP
Sbjct: 334 RGNGGNN-CGMATHAYYP 350
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 189 bits (480), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 120/320 (37%), Positives = 168/320 (52%), Gaps = 34/320 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQA----EKEMRFKIFKKNHEF------------LRLNKFADL 54
+A +E WM + + + EK+ RF+IFK N F L L +FADL
Sbjct: 45 VARIYEAWMEKHGKKAQSNGLVGEEKDQRFEIFKDNLRFIDEHNNKNLSYKLGLTRFADL 104
Query: 55 TREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
T E++ + Y G K S+R + DS+DW + GAV VKDQGS
Sbjct: 105 TNEEYRSIYLGAKSKKRVLKTSDRYQ-----PRVGDAIPDSVDWRKEGAVAAVKDQGSCG 159
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLA 171
CWAF+ + VEG+NKI TG L++ S+ +LVDC T GC ++ AFE+I + +
Sbjct: 160 SCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIKNGGID 219
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-- 229
+E YPY+ D CD R +A K I Y+ V E L+ ++ QP+SVAI+A
Sbjct: 220 TEEDYPYKA-ADGRCDQTRKNA--KVVTIDAYEDVPENNEAALKKTLANQPISVAIEAGG 276
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y GVF G CG +HGV VGYGT E + YW+V+N WG +W E G +++
Sbjct: 277 RAFQLYSSGVFDGICGTELDHGVVAVGYGT----ENGKDYWIVRNSWGGSWGESGYIKMA 332
Query: 290 RGVGG-SGLCNIAANAAYPL 308
R + +G C IA A+YP+
Sbjct: 333 RNIAEPTGKCGIAMEASYPI 352
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 116/326 (35%), Positives = 175/326 (53%), Gaps = 40/326 (12%)
Query: 10 NIAAKHEQWMVEFAR-TYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLT 55
+++ ++ W +F + + + RF+ FK+N + L LN+F+DLT
Sbjct: 8 DLSGEYASWCAKFGKECASSNSLGDRRFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLT 67
Query: 56 REKFLASYTGYKPPPTDHP------HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKD 109
E+F + G +P D P S+ F+N++ S+DW + GAVT KD
Sbjct: 68 SEEFRQRFLGLRPDLIDSPVLKMPRDSDIEEGFQNVD-----LPASVDWRKHGAVTAPKD 122
Query: 110 QGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS--TLNGCAKNFLENAFEYIR 165
QGS C CWAF +EG+N+I TGQL++ S+ +L+DC GC +ENA+++I
Sbjct: 123 QGS-CGGCWAFATTGAIEGINQIVTGQLMSLSEQELIDCDKKADKGCDGGLMENAYQFIV 181
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ L +E YPY + +C+ + ++ + AI GY+ + E+ L V++QPVSV
Sbjct: 182 ENGGLDTETDYPYHASES-HCNMKKLNS--RVVAIDGYEAIPDGDEQALLRAVAKQPVSV 238
Query: 226 AIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AI+ +F H GVFTG CG NHGV IVGYGT E YW+VKN W W +G
Sbjct: 239 AIEGASKDFQHYASGVFTGHCGEEINHGVLIVGYGT----EDGLDYWIVKNSWAATWGDG 294
Query: 284 GSMRIFRGVGG-SGLCNIAANAAYPL 308
G +++ R G GLC+I A+YP+
Sbjct: 295 GFVKMQRNTGKRGGLCSINTLASYPV 320
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 122/310 (39%), Positives = 171/310 (55%), Gaps = 29/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN------------HEFLRLNKFADLTREKFLASY 63
E W+ + + Y+ EK RF+IFK N + +L LN+FADL+ E+F Y
Sbjct: 34 ESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKKVVNYWLGLNEFADLSHEEFKNKY 93
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G ++ + +K+++S S+DW ++GAVT VK+QGS CWAF+ VA
Sbjct: 94 LGLNVDLSNRRECSEEFTYKDVSS----IPKSVDWRKKGAVTDVKNQGSCGSCWAFSTVA 149
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +LVDC T NGC ++ AF YI L E YPY
Sbjct: 150 AVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGGLMDYAFAYIISNGGLHKEEDYPYI- 208
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ C+ + A + I GY V +EE L ++ QP+SVAIDA+ F FY GG
Sbjct: 209 MEEGTCEMRK--AESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGRDFQFYSGG 266
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG +HGV VGYG+ A+G + +VKN WG+ W E G +R+ R G +GL
Sbjct: 267 VFDGHCGTELDHGVAAVGYGS---AKGLD-FIVVKNSWGSKWGEKGFIRMKRNTGKPAGL 322
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 323 CGINKMASYP 332
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 119/311 (38%), Positives = 173/311 (55%), Gaps = 31/311 (9%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL-------------NKFADLTREKFLASY 63
+W + R+Y E++ RF+++++N E + N+FADLT E+FL Y
Sbjct: 59 RWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEEEFLDLY 118
Query: 64 TGYKPPPT--DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFT 119
T PP D ++N+ SS + S+DW RGAVTP+K+QG C CWAF
Sbjct: 119 TMKGMPPVRRDAGKKQQANF-----SSVVDAPTSVDWRSRGAVTPIKNQGPSCSSCWAFV 173
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLNG-CAKNFLENAFEYIRQYQRLASECVYPY 178
AT+E + +IRTG+LV+ S+ +L+DC +G C + N ++++ Q L +E YPY
Sbjct: 174 TAATIESITQIRTGKLVSLSEQELIDCDPYDGGCNLGYFVNGYKWVIQNGGLTTEANYPY 233
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID-ATWFNFYHG 237
Q R+ Y C+ RS A + I Y+ + P E LQ V++QPV+ AI+ FY G
Sbjct: 234 QARR-YQCN--RSKAGQRAARISNYRQL-PQGEAQLQQAVAQQPVAAAIEMGGSLQFYSG 289
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
GV++G CG NH +T+VGYG + YWLVKN WG W E G +R+ + V GL
Sbjct: 290 GVWSGQCGTRMNHAITVVGYGADSSGV---KYWLVKNSWGQTWGERGYLRMRKDVRQGGL 346
Query: 298 CNIAANAAYPL 308
C IA + AYP+
Sbjct: 347 CGIALDLAYPI 357
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 189 bits (479), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 123/301 (40%), Positives = 164/301 (54%), Gaps = 35/301 (11%)
Query: 31 EKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNR 78
EK RF +FK N H L+LNKFAD+T +F Y K H
Sbjct: 55 EKHNRFNVFKANVMHVHNTNKLDKPYKLKLNKFADMTNYEFRRIYADSKVS-----HHRM 109
Query: 79 SNWFKNLNSSKM-----SFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRT 132
N N + M + SIDW ++GAVT VKDQG CWAF+ + VEG+N+I+T
Sbjct: 110 FRGMSNENGTFMYENVKNVPSSIDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKT 169
Query: 133 GQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWR 190
+LV+ S+ +LVDC T GC +E AFE+I+Q + +E YPY + D CD +
Sbjct: 170 QKLVSLSEQELVDCDTGGNEGCNGGLMEYAFEFIKQ-NGITTESNYPYAAK-DGTCDLKK 227
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN--FYHGGVFTGPCGNTP 248
+ +I GY+ V E L ++QPVSVAIDA +N FY GVF+G CG
Sbjct: 228 EDKAEV--SIDGYENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFSGHCGTDL 285
Query: 249 NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYP 307
NHGV +VGYG T + + YW+VKN WG+ W E G +R+ RG+ GLC IA A+YP
Sbjct: 286 NHGVAVVGYGVTQD---RTKYWIVKNSWGSEWGEQGYIRMQRGISHKEGLCGIAMEASYP 342
Query: 308 L 308
+
Sbjct: 343 I 343
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 122/316 (38%), Positives = 172/316 (54%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLA 61
HE+WM + + YKD AEKE +IF+ N EF L N+FADL E+F A
Sbjct: 32 HEKWMAQHGKVYKDAAEKERCLQIFENNMEFIESFDVCGDKSFNLSTNQFADLHDEEFKA 91
Query: 62 SYT-GYKPPPTDHPH-SNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWAF 118
T G+K +H + F+ N +K+ S+DW +RG VTP+KDQG CWAF
Sbjct: 92 LLTNGHKK---EHSLWTTTETLFRYDNVTKIP--ASMDWRKRGVVTPIKDQGKCLSCWAF 146
Query: 119 T-AVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECV 175
+ VAT+EGL++I T +LV S+ +LVD GC +++E+AF++I + R+ SE
Sbjct: 147 SLCVATIEGLHQIITSELVPLSEQELVDFVKGESEGCYGDYVEDAFKFITKKGRIESETH 206
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFN 233
YPY+G + C + + I+GY+ V +E L V+ Q VSV+++A + F
Sbjct: 207 YPYKGVNN-TCKVKKETHG--VAQIKGYKKVPSKSENALLKAVANQLVSVSVEARDSAFQ 263
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
FY G+FTG CG +H V + YG E+ YWL KN WGT W E G +RI +
Sbjct: 264 FYSSGIFTGKCGTDTDHRVALASYG---ESGDGTKYWLAKNSWGTEWGEKGYIRIKXDIP 320
Query: 293 GGSGLCNIAANAAYPL 308
GLC IA YP+
Sbjct: 321 AKEGLCGIAKYPYYPI 336
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 120/313 (38%), Positives = 162/313 (51%), Gaps = 42/313 (13%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLAS 62
+E+W + R +D EK RF +FK N HEF LRLN+F D+T ++ +
Sbjct: 48 YERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRRDEPYKLRLNRFGDMTADESAGA 106
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
Y + + F+ + GAV VKDQG CWAF+ +
Sbjct: 107 YASSRV--------SHHRMFRGRGEKAQRLH--------GAVGAVKDQGQCGSCWAFSTI 150
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASECVYPY 178
A VEG+N IRT L S+ QLVDC T GC ++NAF+YI ++ +A+ YPY
Sbjct: 151 AAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVAASSAYPY 210
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYH 236
+ RQ S+AS I GY+ V +E L+ V+ QPVSVAI+A + F FY
Sbjct: 211 RARQSSCK---SSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHFQFYS 267
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-S 295
GVF G CG +HGV VGYGTT + YW+V+N WG +W E G +R+ R V
Sbjct: 268 EGVFAGKCGTELDHGVAAVGYGTTVDG---TKYWIVRNSWGADWGEKGYIRMKRDVSAKE 324
Query: 296 GLCNIAANAAYPL 308
GLC IA A+YP+
Sbjct: 325 GLCGIAMEASYPI 337
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 119/318 (37%), Positives = 168/318 (52%), Gaps = 36/318 (11%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+++ + E W ++ YKD AE++ F+IFK N + L +N+F D
Sbjct: 37 SLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPI 96
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
E S G++ + + FK N + + ++DW +RGAVTP+K+QG C
Sbjct: 97 ED---SDDGFE----RTTTTTPTTTFKYENVTDIPA--TVDWRKRGAVTPIKNQGKCGSC 147
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLAS 172
WAF+AVA +EG+ KI +G LV+ S+ QLVDC GC + NAF++I + +A+
Sbjct: 148 WAFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIAT 207
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT-W 231
E YPY+ C S K I+ Y+ V +E+ L V+ QPVSV ID
Sbjct: 208 EANYPYKRVVKGTC----KKVSHKV-QIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRGM 262
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F FY G+FTG CG PNH +TIVGYGT+ + YWLVKN W W E G +RI R
Sbjct: 263 FKFYSSGIFTGECGTKPNHALTIVGYGTSKDG---IKYWLVKNSWSKRWGEKGYIRIKRD 319
Query: 292 VGG-SGLCNIAANAAYPL 308
+ GLC IA +YP+
Sbjct: 320 IDAKEGLCGIAMKPSYPI 337
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 166/310 (53%), Gaps = 30/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + Y++ EK +RF+IFK N + +L LN+FADL+ +F Y
Sbjct: 49 ESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHREFNNKY 108
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G K + S +K++ K S+DW ++GAV PVK+QGS CWAF+ VA
Sbjct: 109 LGLKVDYSRRRESPEEFTYKDVELPK-----SVDWRKKGAVAPVKNQGSCGSCWAFSTVA 163
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +L+DC NGC ++ AF +I + L E YPY
Sbjct: 164 AVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI- 222
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ C+ + + I GY V E+ L ++ QP+SVAI+A+ F FY GG
Sbjct: 223 MEEGTCEMTKEET--QVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGG 280
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG+ +HGV VGYGT + Y VKN WG+ W E G +R+ R +G G+
Sbjct: 281 VFDGHCGSDLDHGVAAVGYGTAKGVD----YITVKNSWGSKWGEKGYIRMRRNIGKPEGI 336
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 337 CGIYKMASYP 346
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 117/312 (37%), Positives = 169/312 (54%), Gaps = 30/312 (9%)
Query: 16 EQWMVEFARTYKDQ-AEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLAS 62
+ WM + +TY + EKE RF+ FK N F L L +FADLT +++
Sbjct: 48 QMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAKNLSYQLGLTRFADLTVQEYRDL 107
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWAFTAV 121
+ G P + ++R + L ++ +S+DW + GAV+ +KDQG+ CWAF+ V
Sbjct: 108 FPGSPKPKQRNLKTSRR--YVPLAGDQLP--ESVDWRQEGAVSEIKDQGTCNSCWAFSTV 163
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGC-AKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEGLNKI TG+L++ S+ +LVDC+ + NGC ++ AF+++ L SE YPYQ
Sbjct: 164 AAVEGLNKIVTGELISLSEQELVDCNLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQ 223
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID--ATWFNFYHG 237
G Q C+ R I Y+ V E LQ V+ QPVSV +D + F Y
Sbjct: 224 GTQG-SCN--RKQVHLLVITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRS 280
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSG 296
++ GPCG +H + IVGYG+ E Q YW+V+N WGT W + G ++I R G
Sbjct: 281 CIYNGPCGTNLDHALVIVGYGS----ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKG 336
Query: 297 LCNIAANAAYPL 308
LC IA A+YP+
Sbjct: 337 LCGIAMLASYPI 348
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 125/316 (39%), Positives = 169/316 (53%), Gaps = 35/316 (11%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLAS 62
+E+W T ++ EK RF +FK N L+LNKF D+T +F
Sbjct: 40 YERWRSHHTVT-RNLDEKHNRFNVFKANVMHVHNTNKLDKPYKLKLNKFGDMTNYEFRRI 98
Query: 63 YTGYKPPP----TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
Y K H N + ++N + SIDW +GAVT VKDQG CWA
Sbjct: 99 YADSKISHHRMFRGMSHENGTFMYEN----AVDVPSSIDWRNKGAVTGVKDQGQCGSCWA 154
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+ +A VEG+N+I+T +LV+ S+ QLVDC T GC +E AFE+I+Q + +E
Sbjct: 155 FSTIAAVEGINQIKTQKLVSLSEQQLVDCDTEENEGCNGGLMEYAFEFIKQ-NGITTESN 213
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN-- 233
YPY + D CD + K +I G++ V E L ++QPVSVAIDA +N
Sbjct: 214 YPYAAK-DGTCDVEKED---KAVSIDGHENVPINNEAALLKAAAKQPVSVAIDAGGYNFQ 269
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GVFTG C NHGV IVGYG T + + YW++KN WG+ W E G +R+ RG+
Sbjct: 270 FYSEGVFTGHCDTDLNHGVAIVGYGVTQD---RTKYWIMKNSWGSEWGEQGYIRMQRGIS 326
Query: 294 G-SGLCNIAANAAYPL 308
GLC IA A+YP+
Sbjct: 327 SREGLCGIAMEASYPI 342
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 188 bits (477), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 109/269 (40%), Positives = 150/269 (55%), Gaps = 28/269 (10%)
Query: 54 LTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFY--------DSIDWNERGAVT 105
+T +F ++Y G K N F+ + SF S+DW ++GAVT
Sbjct: 1 MTNHEFRSTYAGSK--------VNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVT 52
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFE 162
P+KDQG CWAF+ V VEG+N I+T +LV+ S+ +LVDC T GC + AFE
Sbjct: 53 PIKDQGQCGSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFE 112
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
+I++ + +E YPY +D CD S + +I G++ V P E+ L + QP
Sbjct: 113 FIKEKGGITTEQSYPYTA-EDGTCD--VSKVNSPVVSIDGHETVPPNNEDALLKAAANQP 169
Query: 223 VSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
+SVAIDA + F FY GVF G CG +HGV IVGYGTT + YW+VKN WGT+W
Sbjct: 170 ISVAIDAGGSAFQFYSEGVFAGRCGTDLDHGVAIVGYGTTLDG---TKYWIVKNSWGTDW 226
Query: 281 DEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
E G +R+ RG+ GLC IA A+YP+
Sbjct: 227 GENGYIRMKRGISAKEGLCGIAVEASYPI 255
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 117/301 (38%), Positives = 166/301 (55%), Gaps = 33/301 (10%)
Query: 31 EKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLASYTGYKPPPTDHP 74
E+E RF+ F N F L +N+FADLT ++F A+Y G K
Sbjct: 69 EEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFRAAYLGVKG--AGQR 126
Query: 75 HSNRSNWFKNLNSSKMS-FYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRT 132
S R+ + + +++DW E+GAV PVK+QG CWAF+AV+ VE +N++ T
Sbjct: 127 RSARAGVGERYRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFSAVSAVESINQLVT 186
Query: 133 GQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWW 189
G+LVT S+ +LV+C NGC +++AF++I + +E YPY+ D CD
Sbjct: 187 GELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINNGGIDTEDDYPYKA-LDGKCDIN 245
Query: 190 RSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNT 247
R +A K +I G++ V E+ LQ V+ QPVSVAI+A F YH GVFTG CG
Sbjct: 246 RRNA--KVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFTGRCGTE 303
Query: 248 PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAY 306
+HGV VGYGT E + YW+V+N WG W E G +R+ R + +G C IA ++Y
Sbjct: 304 LDHGVVAVGYGT----ENGKDYWIVRNSWGPKWGEAGYLRMERNINATTGKCGIAMMSSY 359
Query: 307 P 307
P
Sbjct: 360 P 360
>gi|222625810|gb|EEE59942.1| hypothetical protein OsJ_12596 [Oryza sativa Japonica Group]
Length = 213
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 110/220 (50%), Positives = 134/220 (60%), Gaps = 14/220 (6%)
Query: 96 IDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---G 151
+DW GAVT VKDQGS CCWAF+AVA VEGL KIRTGQLV+ S+ +LVDC G
Sbjct: 1 MDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLAKIRTGQLVSLSEQELVDCDVRGEDQG 60
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AF+YI + LA+E YPY+G R++A +IRG+Q V E
Sbjct: 61 CEGGLMDTAFQYIARRGGLAAESSYPYRGVDGAC----RAAAGRAAASIRGFQDVPSNDE 116
Query: 212 EGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQP 268
L V+RQPVSVAI+ F FY GV G CG NH VT VGYGT ++ G
Sbjct: 117 GALMAAVARQPVSVAINGAGYVFRFYDRGVLGGAGCGTELNHAVTAVGYGTASDGTG--- 173
Query: 269 YWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
YWL+KN WG +W EGG +RI RGVG G C IA A+YP+
Sbjct: 174 YWLMKNSWGASWGEGGYVRIRRGVGREGACGIAQMASYPV 213
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 117/301 (38%), Positives = 164/301 (54%), Gaps = 39/301 (12%)
Query: 33 EMRFKIFKKNHEF----------------LRLNKFADLTREKFLASYTGYKPPPTDHPHS 76
E R ++FK+N +F L +N+FADLT E++ + D
Sbjct: 69 EYRLEVFKENLQFVDKHNAAADRGEHTFRLGMNRFADLTNEEYRTRFL------RDFSRL 122
Query: 77 NRSNWFKNLNSSKM----SFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIR 131
RS K + ++ DSIDW E+GAV PVK+QG CWAF+ VA VEG+N+I
Sbjct: 123 RRSASGKISSRYRLREGDDLPDSIDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIV 182
Query: 132 TGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWR 190
TG L++ S+ QLVDC+T N GC ++ AF++I + SE YPY+G Q+ C+
Sbjct: 183 TGDLISLSEQQLVDCTTANHGCRGGWMNPAFQFIVNNGGINSEETYPYRG-QNGICN--- 238
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTP 248
S+ + +I Y+ V E+ LQ V+ QPVSV +DA F Y G+FTG C +
Sbjct: 239 STVNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISA 298
Query: 249 NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
NH +T+VGYGT E + Y VKN WG NW E G +R+ R +G +G C I A+YP
Sbjct: 299 NHALTVVGYGT----ENDKDYRTVKNSWGKNWGESGYIRVERNIGNPNGKCGITRFASYP 354
Query: 308 L 308
+
Sbjct: 355 V 355
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 187 bits (476), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 117/305 (38%), Positives = 163/305 (53%), Gaps = 40/305 (13%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNKFADLTREKFLASYTGYKPPPTD 72
A +E W+V+ ++Y E+E RF+IFK N F+ +
Sbjct: 2 AVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIE------------------------E 37
Query: 73 HPHSNRSNWFKNLNSSKM--SFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNK 129
H NR+ + S + +S+DW E+GAV PVKDQG+ CWAF+ +A VEG+N+
Sbjct: 38 HNAVNRTYKVGDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTIAAVEGINQ 97
Query: 130 IRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCD 187
I TG L++ S+ +LVDC S GC ++ AFE+I + SE YPY+ D CD
Sbjct: 98 IATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYRA-ADTTCD 156
Query: 188 WWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCG 245
R +A + +I GY+ V E L+ V+ QPVSVAI+A F Y GVFTG CG
Sbjct: 157 PNRKNA--RVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGVFTGQCG 214
Query: 246 NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--SGLCNIAAN 303
+HGV VGYGT E YW+V+N WG NW E G +++ R + G +G C IA
Sbjct: 215 TQLDHGVVAVGYGT----ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCGIAIE 270
Query: 304 AAYPL 308
+YP+
Sbjct: 271 PSYPI 275
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 124/324 (38%), Positives = 168/324 (51%), Gaps = 38/324 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-----------------LRLNKFADLTR 56
++E+WM E RTYKD EK RF++FK N F L NKFADLT
Sbjct: 19 RYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTNKFADLTE 78
Query: 57 EKFLASY-TGYKPPPTDHPHSNRSNWFKNLNSSKMS-FYDSIDWNERGAVTPVKDQG-SY 113
++F Y TG++ P S ++ + +S SIDW RGAVT VKDQ
Sbjct: 79 DEFRNIYVTGHRV--NYRPTSLVTDTVFKFGAVSLSDVPPSIDWRARGAVTSVKDQHLCA 136
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLA 171
CCWAF++ A VEG+++I TG V+ S QLVDCS C ++ A+EYI + L
Sbjct: 137 CCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYEYIARSGGLV 196
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
++ YPY+G C + A + I G+QYV E L V+ QPVSVA+D
Sbjct: 197 ADQDYPYEGHSG-TCRVYGKQAVAR---ISGFQYVPARNETALLLAVAHQPVSVALDGLS 252
Query: 232 FNFYH--GGVFTG---PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
H G+F PC NH +TIVGYGT E YWL+KN WG++W + G +
Sbjct: 253 RALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGTD---EHGTRYWLMKNSWGSDWGDKGYV 309
Query: 287 RIFRGVGG--SGLCNIAANAAYPL 308
+ R V +G+C +A A+YP+
Sbjct: 310 KFARDVASEINGVCGLALEASYPV 333
>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 371
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 115/321 (35%), Positives = 168/321 (52%), Gaps = 41/321 (12%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL-------------NKFADLTREKFLASY 63
+W RTY D E+ RF++++ N E++ N+FADLT E+FL+ Y
Sbjct: 61 RWQAAHNRTYGDAEERLRRFQVYRANIEYIEATNRRGGLTYELGENQFADLTSEEFLSMY 120
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMS----FYD---------SIDWNERGAVTPVKDQ 110
+ + +R++ L ++ ++ + D S DW +GAVTP K+Q
Sbjct: 121 A------SSYDAGDRADDEAALITTDVAGDGAWSDGDLEALPPPSWDWRAKGAVTPPKNQ 174
Query: 111 GSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNG-CAKNFLENAFEYIRQY 167
G C CWAF VAT+EGL I+TG+L++ S+ QLVDC +G C F ++ +
Sbjct: 175 GPTCSSCWAFVTVATIEGLTFIKTGKLISLSEQQLVDCDMYDGGCNTGSYSRGFRWVLEN 234
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
L +E YPY + C+ R+ ++ I G + P E +Q V+ QPV VAI
Sbjct: 235 GGLTTEAEYPYTAARGP-CN--RAKSAHHAAKITGQGRIPPQNELVMQKAVAGQPVGVAI 291
Query: 228 DA-TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
+ + FY GV++GPCG H VT+VGYG + + YW+VKN WG W E G +
Sbjct: 292 EVGSGMQFYKTGVYSGPCGTNLAHAVTVVGYGVDPASGAK--YWIVKNSWGQAWGERGFI 349
Query: 287 RIFRGVGGSGLCNIAANAAYP 307
R+ R VGG GLC IA + AYP
Sbjct: 350 RMRRDVGGPGLCGIALDVAYP 370
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 112/310 (36%), Positives = 169/310 (54%), Gaps = 33/310 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E W + ++Y EK R IF + L LNKF+DLT +F A+
Sbjct: 3 EGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 63 YTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
Y G +KPP + +R K+++ S S+DW + GAVTP+KDQG CWAF+A
Sbjct: 63 YVGKFKPPR----YQDRRP-AKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+A++E + + T +LV+ S+ QL+DC T++ GC F E+AF+++ + + +E YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYT 177
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YHG 237
G C+ + K I GY+ V + + L VS+ PV+V I + NF Y
Sbjct: 178 GFAGS-CN----ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
G+ +G C N+ +H V ++GYGT EG PYW++KN WGT+W E G MRI + G G+
Sbjct: 233 GILSGHCSNSRDHAVLVIGYGT----EGGMPYWIIKNSWGTSWGEDGFMRI-KKKDGEGM 287
Query: 298 CNIAANAAYP 307
C + ++YP
Sbjct: 288 CGMNGQSSYP 297
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 187 bits (475), Expect = 5e-45, Method: Compositional matrix adjust.
Identities = 112/310 (36%), Positives = 169/310 (54%), Gaps = 33/310 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E W + ++Y EK R IF + L LNKF+DLT +F A+
Sbjct: 3 EGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 63 YTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
Y G +KPP + +R K+++ S S+DW + GAVTP+KDQG CWAF+A
Sbjct: 63 YVGKFKPPR----YQDRRP-AKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+A++E + + T +LV+ S+ QL+DC T++ GC F E+AF+++ + + +E YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYT 177
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YHG 237
G C+ + K I GY+ V + + L VS+ PV+V I + NF Y
Sbjct: 178 GFAGS-CN----ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
G+ +G C N+ +H V ++GYGT EG PYW++KN WGT+W E G MRI + G G+
Sbjct: 233 GILSGHCSNSRDHAVLVIGYGT----EGGMPYWIIKNSWGTSWGEDGFMRI-KKEDGEGM 287
Query: 298 CNIAANAAYP 307
C + ++YP
Sbjct: 288 CGMNGQSSYP 297
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 187 bits (475), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 112/311 (36%), Positives = 164/311 (52%), Gaps = 27/311 (8%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYT 64
Q+ + + Y + E+ R+ IFK N + L++NKF DLT E+F Y
Sbjct: 91 QFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHNMQGYSYVLKMNKFGDLTLEEFRQRYL 150
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
GYK P P +++ + + + +DW +RG VT VKDQG CWAF+A
Sbjct: 151 GYKKPDLRTPPREVDTTLESVEDNDIPTH--VDWRQRGCVTSVKDQGDCGSCWAFSATGA 208
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
+EG+ +TG+LV S+ QLVDCS GC +E AFEY+ + + S YPY
Sbjct: 209 MEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYM- 267
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVAIDATW--FNFYHG 237
R+D C +SS I GY+ V +E+ ++ ++ R PVSVAI A F FY+
Sbjct: 268 RKDGVC---KSSQCTSVATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYD 324
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
G+F PCG +HGV +VGY + E GQ YW++KN WG W +GG M + G +G
Sbjct: 325 GIFDAPCGTNLDHGVLLVGY--SAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHKGPAGQ 382
Query: 298 CNIAANAAYPL 308
C + + ++P+
Sbjct: 383 CGVLLDGSFPV 393
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 187 bits (474), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 119/317 (37%), Positives = 168/317 (52%), Gaps = 39/317 (12%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFL 60
+W V+ K E R ++FK+N +F L +N+FADLT E++
Sbjct: 55 EWRVKNHPAEKYLDLNEYRLEVFKENLQFVDEHNAAADRGEHTFLLGMNRFADLTNEEYR 114
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKM----SFYDSIDWNERGAVTPVKDQGSY-CC 115
+ D RS K + ++ DSIDW E GAV PVK+QG C
Sbjct: 115 TRFL------RDFSRLRRSASGKISSRYRLREGDDLPDSIDWRENGAVVPVKNQGGCGSC 168
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASEC 174
WAF+ VA VEG+N+I TG L++ S+ QLVDC+T N GC ++ AF++I + SE
Sbjct: 169 WAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTANHGCRGGWMNPAFQFIVNNGGINSEE 228
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--F 232
YPY+G Q+ C+ S+ + +I Y+ V E+ LQ V+ QPVSV +DA F
Sbjct: 229 TYPYRG-QNGICN---STVNAPVVSIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDF 284
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y G+FTG C + NH +T+VGYGT E + +W+VKN WG NW E G +R R +
Sbjct: 285 QLYRSGIFTGSCNISANHALTVVGYGT----ENDKDFWIVKNSWGKNWGESGYIRAERNI 340
Query: 293 GG-SGLCNIAANAAYPL 308
+G C I A+YP+
Sbjct: 341 ENPNGKCGITRFASYPV 357
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 187 bits (474), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 115/318 (36%), Positives = 169/318 (53%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+I+ + W + +TY + E++ R +IFK NH+F L LN FADLT
Sbjct: 27 DISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTH 86
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+F AS G P ++ ++L S + DS+DW ++GAVT VKDQGS C
Sbjct: 87 HEFKASRLGLSVSA---PSVIMASKGQSLGGS-VKVPDSVDWRKKGAVTNVKDQGSCGAC 142
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASE 173
W+F+A +EG+N+I TG L++ S+ +L+DC S GC ++ AFE++ + + +E
Sbjct: 143 WSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTE 202
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--W 231
YPYQ R D C + K I Y V+ E+ L + V+ QPVSV I +
Sbjct: 203 KDYPYQER-DGTCK--KDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERA 259
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+F+GPC + +H V IVGYG+ + YW+VKN WG +W G M + R
Sbjct: 260 FQLYSSGIFSGPCSTSLDHAVLIVGYGSQNGVD----YWIVKNSWGKSWGMDGFMHMQRN 315
Query: 292 VGGS-GLCNIAANAAYPL 308
S G+C I A+YP+
Sbjct: 316 TENSDGVCGINMLASYPI 333
>gi|404312774|pdb|3TNX|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|404312775|pdb|3TNX|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 2.6 Angstroem Resolution
gi|428698029|pdb|3USV|A Chain A, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
gi|428698030|pdb|3USV|C Chain C, Structure Of The Precursor Of A Thermostable Variant Of
Papain At 3.8 A Resolution From A Crystal Soaked At Ph 4
Length = 363
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 118/311 (37%), Positives = 170/311 (54%), Gaps = 34/311 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E WM++ + YK+ EK RF+IFK N ++ L LN FAD++ ++F Y
Sbjct: 67 ESWMLKHNKIYKNIDEKIYRFEIFKDNLKYIDETNKKNNSYWLGLNVFADMSNDEFKEKY 126
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
TG + + ++ + LN ++ + +DW ++GAVTPVK+QGS WAF+AV+
Sbjct: 127 TG---SIAGNYTTTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSAWAFSAVS 183
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
T+E + KIRTG L S+ +L+DC + GC + +A + + QY + YPY+G
Sbjct: 184 TIESIIKIRTGNLNEYSEQELLDCDRRSYGCNGGYPWSALQLVAQYG-IHYRNTYPYEGV 242
Query: 182 QDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
Q Y C RS G Y A G + VQP E L ++ QPVSV ++A F Y GG
Sbjct: 243 QRY-C---RSREKGPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGG 298
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GL 297
+F GPCGN +H V VGYG Y L++N WGT W E G +RI RG G S G+
Sbjct: 299 IFVGPCGNKVDHAVAAVGYGPN--------YILIRNSWGTGWGENGYIRIKRGTGNSYGV 350
Query: 298 CNIAANAAYPL 308
C + ++ YP+
Sbjct: 351 CGLYTSSFYPV 361
>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 186 bits (473), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 127/300 (42%), Positives = 166/300 (55%), Gaps = 26/300 (8%)
Query: 27 KDQAEKEMRFKIFKKN--HEF----------LRLNKFADLTREKFLASYTGYKPPPTDHP 74
++ EK RF +FK+N H F L+LNKFAD++ +F+ Y
Sbjct: 52 RNLKEKHKRFSVFKENVNHVFTVNQMDKPYKLKLNKFADMSNYEFVNFYARSNISHYRKL 111
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
H R + S+D ERGAV VK+QG CWAF++VA VEG+NKI+T
Sbjct: 112 HERRRGAGGFMYEQDTDLPSSVDGRERGAVNAVKEQGRCGSCWAFSSVAAVEGINKIKTN 171
Query: 134 QLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSS 192
QL++ S+ +L+DC+ N GC F+E AF++I++ +A+E YPY G + C RSS
Sbjct: 172 QLLSLSEQELLDCNYRNKGCNGGFMEIAFDFIKRNGGIATENSYPYHGSRG-LC---RSS 227
Query: 193 -ASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPN 249
S I GY+ V P E+ L V+ QPVSVAIDA F FY GVF G CG N
Sbjct: 228 RISSPIVKIDGYESV-PENEDALMQAVANQPVSVAIDAAGRDFQFYSQGVFDGYCGTELN 286
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
HGV +GYGTT E YWLV+N WG W E G +R+ RGV GLC IA A+YP+
Sbjct: 287 HGVVAIGYGTT---EDGTDYWLVRNSWGVGWGEDGYVRMKRGVEQAEGLCGIAMEASYPI 343
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 126/317 (39%), Positives = 174/317 (54%), Gaps = 30/317 (9%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFL 60
A +E+W E +D EK RF +F++N HEF LRLN+F D+T ++F
Sbjct: 45 ALYERWR-EQHTVARDLGEKARRFNVFRENVRLIHEFNRGDAPYKLRLNRFGDMTADEFR 103
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD---SIDWNERGAVTPVKDQGSY-CCW 116
+Y + S + ++ S S D S+DW ++GAVT VKDQG CW
Sbjct: 104 RAYASSRVS-HHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQGQCGSCW 162
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASEC 174
AF+ +A VEG+N IR+ L + S+ QLVDC T + GC ++ AF+YI ++ +A+E
Sbjct: 163 AFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCNGGLMDYAFQYIAKHGGVAAED 222
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWF 232
YPY+ RQ C+ S+ I GY+ V E L+ V+ QPV+VAI+A + F
Sbjct: 223 AYPYKARQASSCNKKPSAVV----TIDGYEDVPANDETALKKAVAAQPVAVAIEASGSHF 278
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GVF G CG +HGV VGYGTT + YW+VKN WG W E G +R+ R V
Sbjct: 279 QFYSEGVFAGKCGTELDHGVAAVGYGTTVDG---TKYWIVKNSWGPEWGEKGYIRMKRDV 335
Query: 293 -GGSGLCNIAANAAYPL 308
GLC IA A+YP+
Sbjct: 336 KDKEGLCGIAMEASYPV 352
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 186 bits (473), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 169/310 (54%), Gaps = 33/310 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E W + ++Y +EK R IF + L LNKF+DLT +F A+
Sbjct: 3 EDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 63 YTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
Y G +K P + +R K+++ S S+DW + GAVTP+KDQG CWAF+A
Sbjct: 63 YVGKFKSPR----YQDRRP-AKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+A++E + + T +LV+ S+ QL+DC T++ GC F E+AF+++ + + +E YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYT 177
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YHG 237
G C+ + K I GY+ V + + L VS+ PV+V I + NF Y
Sbjct: 178 GFAGS-CN----ANKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
G+ +G C N+ +H V ++GYGT EG PYW++KN WGT+W E G M+I + G G+
Sbjct: 233 GILSGQCSNSRDHAVLVIGYGT----EGGMPYWIIKNSWGTSWGENGFMKI-KKKDGEGM 287
Query: 298 CNIAANAAYP 307
C + ++YP
Sbjct: 288 CGMNGQSSYP 297
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 109/316 (34%), Positives = 168/316 (53%), Gaps = 43/316 (13%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E W + ++Y EK R IF + L LNKF+DLT +F A
Sbjct: 38 EDWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAM 97
Query: 63 YTG-YKPP------PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+ G +K P P + + S S S+DW ++GAVTP+KDQG
Sbjct: 98 HVGKFKRPRYQDRLPAEDEDVDVS-----------SLPTSLDWRQKGAVTPIKDQGDCGS 146
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+A+A++E + + T +LV+ S+ QL+DC T++ GC +E AF+++ + + +E
Sbjct: 147 CWAFSAIASIESAHFLATKELVSLSEQQLMDCDTVDAGCDGGLMETAFKFVVKNGGVTTE 206
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY G C+ + A K I G++ V + + L VS+ PV+V+I + N
Sbjct: 207 AAYPYTGSVGS-CN--ANKAKNKVAEITGFKVVTEDSADALMKAVSKTPVTVSICGSDEN 263
Query: 234 F--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+ +G C ++ +HGV ++GYGT EG PYW++KN WGT+W E G M+I R
Sbjct: 264 FQNYKSGILSGKCDDSLDHGVLLIGYGT----EGGMPYWIIKNSWGTSWGEDGFMKIERK 319
Query: 292 VGGSGLCNIAANAAYP 307
G G+C + +++YP
Sbjct: 320 -DGDGMCGMNGDSSYP 334
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 121/304 (39%), Positives = 168/304 (55%), Gaps = 37/304 (12%)
Query: 30 AEKEMRFKIFKKNHEF---------------LRLNKFADLTREKFLASYTGYKPPPTDHP 74
E E RF++F N +F L +N+FADLT ++F A+Y G P
Sbjct: 85 GEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGMNRFADLTNDEFRAAYLGTTP------ 138
Query: 75 HSNRSNWFKNL--NSSKMSFYDSIDWNERGAV-TPVKDQGSY-CCWAFTAVATVEGLNKI 130
+ R + + + DS+DW ++GAV +PVK+QG CWAF+AVA VEG+NKI
Sbjct: 139 -AGRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKI 197
Query: 131 RTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCD 187
TG+LV+ S+ +LV+C+ +GC +++AF +I + L +E YPY D CD
Sbjct: 198 VTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTA-MDGKCD 256
Query: 188 WWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCG 245
+ S K +I G++ V E LQ V+ QPVSVAIDA F Y GVFTG CG
Sbjct: 257 LAKKSR--KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCG 314
Query: 246 NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANA 304
+ +HGV VGYG T+A YW V+N WG +W E G +R+ R V +G C IA A
Sbjct: 315 TSLDHGVVAVGYG--TDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMA 372
Query: 305 AYPL 308
+YP+
Sbjct: 373 SYPI 376
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 117/351 (33%), Positives = 177/351 (50%), Gaps = 52/351 (14%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------ 48
M ++ + + ++W + ++Y AE RF ++ +N ++
Sbjct: 38 MGSSTDDNSPMIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTY 97
Query: 49 ----NKFADLTREKFLASYTGYKPPPTDHPHSNRSNW---------------------FK 83
+ DLT ++F+A YT P P P + +
Sbjct: 98 ELGETAYTDLTNQEFMAMYTA-APSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVYV 156
Query: 84 NLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQ 142
NL+++ + S+DW GAVTPVK+QG CWAF+ VA VEG+ +IRTG+LV+ S+ +
Sbjct: 157 NLSTAAPA---SVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQE 213
Query: 143 LVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIR 201
LVDC TL+ GC A +I L +E YPY G D C+ R+ + +I
Sbjct: 214 LVDCDTLDAGCDGGISYRALRWITSNGGLTTEEDYPYTGTTD-ACN--RAKLAHNAASIA 270
Query: 202 GYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGT 259
G + V +E L + V+ QPV+V+I+A NF H GV+ GPCG + NHGVT+VGYG
Sbjct: 271 GLRRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYG- 329
Query: 260 TTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--SGLCNIAANAAYPL 308
E E YW++KN WG +W +GG +++ + V G GLC IA ++PL
Sbjct: 330 -QEEEDGDKYWIIKNSWGASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 186 bits (472), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 166/310 (53%), Gaps = 30/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + Y++ EK +RF+IFK N + +L L++FADL+ +F Y
Sbjct: 49 ESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLSEFADLSHREFNNKY 108
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G K + S +K++ K S+DW ++GAV PVK+QGS CWAF+ VA
Sbjct: 109 LGLKVDYSRRRESPEEFTYKDVELPK-----SVDWRKKGAVAPVKNQGSCGSCWAFSTVA 163
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +L+DC NGC ++ AF +I + L E YPY
Sbjct: 164 AVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI- 222
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ C+ + + I GY V E+ L ++ QP+SVAI+A+ F FY GG
Sbjct: 223 MEEGACEMTKEET--QVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGG 280
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG+ +HGV VGYGT + Y VKN WG+ W E G +R+ R +G G+
Sbjct: 281 VFDGHCGSDLDHGVAAVGYGTAKGVD----YITVKNSWGSKWGEKGYIRMRRNIGKPEGI 336
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 337 CGIYKMASYP 346
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 186 bits (471), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 115/318 (36%), Positives = 169/318 (53%), Gaps = 30/318 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+I+ + W + +TY + E++ R +IFK NH+F L LN FADLT
Sbjct: 27 DISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTH 86
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+F AS G P ++ ++L S + DS+DW ++GAVT VKDQGS C
Sbjct: 87 HEFKASRLGLSVSA---PSVIMASKGQSLGGS-VKVPDSVDWRKKGAVTNVKDQGSCGAC 142
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASE 173
W+F+A +EG+N+I TG L++ S+ +L+DC S GC ++ AFE++ + + +E
Sbjct: 143 WSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTE 202
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--W 231
YPYQ R D C + K I Y V+ E+ L + V+ QPVSV I +
Sbjct: 203 KDYPYQER-DGTCK--KDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERA 259
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+F+GPC + +H V IVGYG+ + YW+VKN WG +W G M + R
Sbjct: 260 FQLYSRGIFSGPCSTSLDHAVLIVGYGSQNGVD----YWIVKNSWGKSWGMDGFMHMQRN 315
Query: 292 VGGS-GLCNIAANAAYPL 308
S G+C I A+YP+
Sbjct: 316 TENSDGVCGINMLASYPI 333
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 186 bits (471), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 118/313 (37%), Positives = 166/313 (53%), Gaps = 28/313 (8%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKFLAS 62
+E+W+V+ + Y EK+ RF+IFK N F+ LNKFAD+ E++
Sbjct: 4 YEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQNYSYKVGLNKFADINNEEYRDM 63
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
Y G K + + + + + +DW +GAVT +KDQGS CWAF+ +
Sbjct: 64 YLGTKSDAKRRVMKTKITGHR-ITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFSTI 122
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
ATVE +NKI TG+ V+ S+ +LVDC GC ++ AFE+I + + ++ YPY
Sbjct: 123 ATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDYPYN 182
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHG 237
G + CD + +A K +I GY+ V P+ L+ V+ QPVSVAI Y
Sbjct: 183 GFER-KCDPTKKNA--KVVSIDGYEDV-PSYMNALKKAVAHQPVSVAIAGLGRALQLYQS 238
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF-RGVGGS- 295
GVFTG CG +HGV +VGYG+ E YWLV+N WGTNW E G +I R V
Sbjct: 239 GVFTGKCGTDLDHGVVVVGYGS----ENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLY 294
Query: 296 GLCNIAANAAYPL 308
C IA A+YP+
Sbjct: 295 RKCGIAMEASYPV 307
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 120/320 (37%), Positives = 172/320 (53%), Gaps = 38/320 (11%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------NKFADLTREK 58
+ + E+W+ + R YKD+ E E+RF I++ N E++ NKFADLT E+
Sbjct: 1 MRVRFERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQEXSYNLTDNKFADLTNEE 60
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
F++ Y G+ PH+ + +S DW + GAV+ +KDQG+ CWA
Sbjct: 61 FVSPYLGFGTRFL--PHTGF------MYHEHEDLPESKDWRKEGAVSDIKDQGNCGSCWA 112
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASEC 174
F+AVA VEG+NKI++G+LV+ S+ + DC + GC ++ AF +I++ L +
Sbjct: 113 FSAVAAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSK 172
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL---QDVVSRQPVSVAIDATW 231
YPY+G D C+ + A I G+ V PA +E + + + Q SVAIDA
Sbjct: 173 DYPYEG-VDGTCN--KEKALHHAANISGHVKV-PANDEAMLKAKAAAANQXESVAIDAGG 228
Query: 232 --FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y GVF+G CG NHGVTIVGYG T YW+VKN WG +W E G +R+
Sbjct: 229 HAFQLYLKGVFSGICGKQLNHGVTIVGYGKGT----SDKYWIVKNSWGADWGESGYIRMK 284
Query: 290 R-GVGGSGLCNIAANAAYPL 308
R +G C IA A+YPL
Sbjct: 285 RDAFDKAGTCGIAMQASYPL 304
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 120/301 (39%), Positives = 166/301 (55%), Gaps = 33/301 (10%)
Query: 31 EKEMRFKIFKKNHEF---------------LRLNKFADLTREKFLASYTGYKPPPTDHPH 75
E E RF++F N +F L +N+FADLT ++F A+Y G P H
Sbjct: 85 EYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGR-H 143
Query: 76 SNRSNWFKNLNSSKMSFYDSIDWNERGAVT-PVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
+ + + DS+DW ++GAV PVK+QG CWAF+AVA VEG+NKI TG
Sbjct: 144 VGEAYRHDGVEA----LPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTG 199
Query: 134 QLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWR 190
+LV+ S+ +LV+C+ +GC +++AF +I + L +E YPY D C+ +
Sbjct: 200 ELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTA-MDGKCNLAK 258
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTP 248
S K +I G++ V E LQ V+ QPVSVAIDA F Y GVFTG CG +
Sbjct: 259 KSR--KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSL 316
Query: 249 NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
+HGV VGYG T+A YW V+N WG +W E G +R+ R V +G C IA A+YP
Sbjct: 317 DHGVVAVGYG--TDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 374
Query: 308 L 308
+
Sbjct: 375 I 375
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 185 bits (470), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 121/319 (37%), Positives = 166/319 (52%), Gaps = 38/319 (11%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREK 58
+++W V+ DQ + R ++FK+N F L +N+FADLT E+
Sbjct: 52 YQEWRVKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEE 111
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMS----FYDSIDWNERGAVTPVKDQGSY- 113
+ A + D RS + N ++ DSIDW E+GAV VK+QG
Sbjct: 112 YRARFL------RDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKNQGRCG 165
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLAS 172
CWAF A+A VEG+N+I TG L++ S+ QLVDCST N GC + AF+YI + S
Sbjct: 166 SCWAFAAIAAVEGINQIVTGDLISLSEQQLVDCSTRNYGCEGGWPYRAFQYIINNGGVNS 225
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY G + + +I Y+ V E+ LQ + QP+SV IDA+
Sbjct: 226 EEHYPYTGTNGTC---NTTKENAHVVSIDSYRNVPSNDEKSLQKAAANQPISVGIDASGR 282
Query: 233 NF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
NF YH G+FTG C + NHGVT+VGYGT E YW+VKN WG NW G + + R
Sbjct: 283 NFQLYHSGIFTGSCNTSLNHGVTVVGYGT----ENGNDYWIVKNSWGENWGNSGYILMER 338
Query: 291 GVG-GSGLCNIAANAAYPL 308
+ SG C IA + +YP+
Sbjct: 339 NIAESSGKCGIAISPSYPI 357
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 119/313 (38%), Positives = 169/313 (53%), Gaps = 32/313 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E W+ F + Y+ EK +RF++FK N + +L LN+FADL+ E+F Y
Sbjct: 52 ENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKKVKSYWLGLNEFADLSHEEFKKMY 111
Query: 64 TGYKPPPT--DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
G K D S ++++ + S+DW ++GAV VK+QGS CWAF+
Sbjct: 112 LGLKTDIVRRDEERSYAEFAYRDVEAVP----KSVDWRKKGAVAEVKNQGSCGSCWAFST 167
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPY 178
VA VEG+NKI TG L T S+ +L+DC T NGC ++ AFEYI + L E YPY
Sbjct: 168 VAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPY 227
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
++ C+ + + + I G+Q V E+ L ++ QP+SVAIDA+ F FY
Sbjct: 228 S-MEEGTCEMQKDES--ETVTIDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYS 284
Query: 237 G-GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG- 294
G VF G CG +HGV VGYG++ ++ Y +VKN WG W E G +R+ R G
Sbjct: 285 GVSVFDGRCGVDLDHGVAAVGYGSSKGSD----YIIVKNSWGPKWGEKGYIRLKRNTGKP 340
Query: 295 SGLCNIAANAAYP 307
GLC I A++P
Sbjct: 341 EGLCGINKMASFP 353
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 185 bits (469), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 102/221 (46%), Positives = 138/221 (62%), Gaps = 11/221 (4%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NG 151
S+DW ++GAVT VKDQG CWAF+ V +VEG+N IRTG LV+ S+ +L+DC T +G
Sbjct: 7 SVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDG 66
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPAT 210
C ++NAFEYI+ L +E YPY+ + C+ R++ + I G+Q V +
Sbjct: 67 CQGGLMDNAFEYIKNNGGLITEAAYPYRAARG-TCNVARAAQNSPVVVHIDGHQDVPANS 125
Query: 211 EEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQP 268
EE L V+ QPVSVA++A+ F FY GVFTG CG +HGV +VGYG AE +
Sbjct: 126 EEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGV---AEDGKA 182
Query: 269 YWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
YW VKN WG +W E G +R+ + G S GLC IA A+YP+
Sbjct: 183 YWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPV 223
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 185 bits (469), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 163/310 (52%), Gaps = 30/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + Y+ EK RF IFK N + +L LN+FADL+ ++F Y
Sbjct: 48 ESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKY 107
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G K + S +K+ K S+DW ++GAVT VK+QGS CWAF+ VA
Sbjct: 108 LGLKVDYSRRRESPEEFTYKDFELPK-----SVDWRKKGAVTQVKNQGSCGSCWAFSTVA 162
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +L+DC NGC ++ AF +I + L E YPY
Sbjct: 163 AVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEEDYPYI- 221
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ C+ + + I GY V E+ L + QP+SVAI+A+ F FY GG
Sbjct: 222 MEEGTCEMTKEET--EVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYSGG 279
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG+ +HGV VGYGT+ Y +VKN WG+ W E G +R+ R +G G+
Sbjct: 280 VFDGHCGSDLDHGVAAVGYGTSKGVN----YIIVKNSWGSKWGEKGYIRMRRNIGKPEGI 335
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 336 CGIYKMASYP 345
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 184 bits (468), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 116/316 (36%), Positives = 168/316 (53%), Gaps = 33/316 (10%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREK 58
+E W E + ++ +R ++F+ N H F L L FADLT E+
Sbjct: 52 YEAWKSEHGHGHG--SDDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPFADLTLEE 109
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CW 116
+ G++ + ++ D+IDW E GAVT VK+Q C CW
Sbjct: 110 YRGRALGFRARRGGASRVGSGSSYRP-RPRGGDLPDAIDWRELGAVTGVKNQ-EQCGGCW 167
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECV 175
AF+AVA +EG+N+I TG LV+ S+ +++DC T + GC ++NAF+++ + +E
Sbjct: 168 AFSAVAAIEGINEIVTGNLVSLSEQEIIDCDTQDGGCNGGEMQNAFQFVINNGGIDTEAD 227
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFY 235
YPY G D CD R + + I G+ V E LQ+ V+ QPVSVAIDA+ F
Sbjct: 228 YPYLG-TDAACDANRVNE--RVVTIDGFVSVATENETALQEAVANQPVSVAIDASGRKFQ 284
Query: 236 H--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV- 292
H G+F GPCG +HGVT VGYG+ E + YW+VKN W ++W E G +RI R V
Sbjct: 285 HYTSGIFNGPCGTQLDHGVTAVGYGS----ENGKDYWIVKNSWSSSWGEAGYIRIRRNVA 340
Query: 293 GGSGLCNIAANAAYPL 308
+G C IA +A+YP+
Sbjct: 341 AATGKCGIAMDASYPV 356
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 116/310 (37%), Positives = 165/310 (53%), Gaps = 30/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E W+ + Y+ EK RF+IFK N + +L LN+FADL+ ++F Y
Sbjct: 49 ESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKY 108
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G K + S +K++ K S+DW ++GAVT VK+QGS CWAF+ VA
Sbjct: 109 LGLKVDYSRRRESPEEFTYKDVELPK-----SVDWRKKGAVTQVKNQGSCGSCWAFSTVA 163
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +L+DC NGC ++ AF +I + L E YPY
Sbjct: 164 AVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEEDYPYI- 222
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ C+ + + I GY V E+ L ++ QP+SVAI+A+ F FY GG
Sbjct: 223 MEEGTCEMAKEET--EVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYSGG 280
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF G CG+ +HGV VGYGT + Y VKN WG+ W E G +R+ R +G G+
Sbjct: 281 VFDGHCGSDLDHGVAAVGYGTAKGVD----YITVKNSWGSKWGEKGYIRMRRNIGKPEGI 336
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 337 CGIYKMASYP 346
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 184 bits (467), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 113/335 (33%), Positives = 168/335 (50%), Gaps = 42/335 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNK---------------FADLT 55
+A + +W E +RTY E+ R +++ +N ++ + DLT
Sbjct: 38 MAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTDLT 97
Query: 56 REKFLASYTGYKPPPTDHPHS----------------NRSNWFKNLNSSKMSFYDSIDWN 99
++F A YT PP +D W + + S+DW
Sbjct: 98 SDEFTAMYTSRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNESAGAPASVDWR 157
Query: 100 ERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFL 157
ERGAVT VK+QG CWAF+ VA +EG+++I+TG+L + S+ +LVDC L+ GC
Sbjct: 158 ERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCDKLDHGCNGGVS 217
Query: 158 ENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDV 217
A ++I + S+ YPY + D CD S +I G+Q V +E L +
Sbjct: 218 YRALQWITSNGGITSQDDYPYTAKDD-TCD--TKKLSHHAASISGFQRVATRSELSLTNA 274
Query: 218 VSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNR 275
V+ QPV+V+I+A NF H GV+ GPCG NHGVT+VGYG E G+ YW+VKN
Sbjct: 275 VAMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYG-EDEVTGES-YWIVKNS 332
Query: 276 WGTNWDEGGSMRIFRGV--GGSGLCNIAANAAYPL 308
WG W + G +R+ +G+ G+C IA ++PL
Sbjct: 333 WGEKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 121/311 (38%), Positives = 164/311 (52%), Gaps = 32/311 (10%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLR-----------LNKFADLTREKFLASYTGY 66
W + + Y D + RF ++K N ++R L KFADLT E+F YTG
Sbjct: 57 WAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHSETNRTYSLGLTKFADLTNEEFRRMYTGT 116
Query: 67 KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVE 125
+ + + R F+ +S +S+DW + GAVT VKDQGS CWAF+AV +VE
Sbjct: 117 RIDRS--RRAKRRTGFRYADSEAP---ESVDWRKNGAVTSVKDQGSCGSCWAFSAVGSVE 171
Query: 126 GLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
G+N IR G+ V+ S+ +LVDC GC ++ AF++I Q + +E YPY+G D
Sbjct: 172 GINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGIDTEKDYPYKGF-D 230
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFT 241
CD S + I GY+ V EE L+ V+ QPVSVAI+A F Y GVF+
Sbjct: 231 GRCD--NSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDFQLYAQGVFS 288
Query: 242 GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV----GGSGL 297
G CG +HGV VGYGT E YW+VKN WG W E G +R+ R + G GL
Sbjct: 289 GECGTDLDHGVLAVGYGT----EDGVDYWIVKNSWGEYWGESGYLRMKRNMKDSNDGPGL 344
Query: 298 CNIAANAAYPL 308
C I +Y +
Sbjct: 345 CGINIEPSYAV 355
>gi|219884655|gb|ACL52702.1| unknown [Zea mays]
gi|413916718|gb|AFW56650.1| thiol protease SEN102 [Zea mays]
Length = 349
Score = 184 bits (467), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 121/323 (37%), Positives = 173/323 (53%), Gaps = 38/323 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------NKFADLTREKFLA 61
+ + W E+ RTY E + RF ++ +N +F+ N+FADLT E+F
Sbjct: 36 RFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGENQFADLTEEEFKD 95
Query: 62 SYTGYKPPPTDHPHS--------NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
+Y P + NR+ N+++ +S+DW +GAVTPVK Q +
Sbjct: 96 TYLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAP--NSVDWRTKGAVTPVKSQ-QH 152
Query: 114 C--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFL---ENAFEYIRQYQ 168
C CWAF AVA++EG++KI+TG+LV+ S+ ++VDC +A E++ +
Sbjct: 153 CGSCWAFAAVASIEGVHKIKTGRLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTRNG 212
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAI 227
L +E YPY GRQ C S G + A IRG Q VQ E LQ V+ +PV+V+I
Sbjct: 213 GLTTESDYPYVGRQG-QC---MSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSI 268
Query: 228 DAT-WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
+A+ F FY G+F+GPC T NH VT+VGYG A G + YW+VKN WG W E G +
Sbjct: 269 NASRAFQFYKRGIFSGPCNTTRNHAVTVVGYGAN--ASGHK-YWIVKNSWGERWGEKGYV 325
Query: 287 RIFRGV-GGSGLCNIAANAAYPL 308
R+ RGV G+C IA Y +
Sbjct: 326 RMQRGVRAREGVCGIAIAPFYAV 348
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 120/301 (39%), Positives = 165/301 (54%), Gaps = 33/301 (10%)
Query: 31 EKEMRFKIFKKNHEF---------------LRLNKFADLTREKFLASYTGYKPPPTDHPH 75
E E RF++F N +F L +N+FADLT ++F A+Y G P H
Sbjct: 85 EYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEFRAAYLGTTPAGRGR-H 143
Query: 76 SNRSNWFKNLNSSKMSFYDSIDWNERGAVT-PVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
+ + DS+DW ++GAV PVK+QG CWAF+AVA VEG+NKI TG
Sbjct: 144 VGEAYRHDGVEV----LPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTG 199
Query: 134 QLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWR 190
+LV+ S+ +LV+C+ +GC +++AF +I + L +E YPY D C+ +
Sbjct: 200 ELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLDTEEDYPYTA-MDGKCNLAK 258
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTP 248
S K +I G++ V E LQ V+ QPVSVAIDA F Y GVFTG CG +
Sbjct: 259 KSR--KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTSL 316
Query: 249 NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
+HGV VGYG T+A YW V+N WG +W E G +R+ R V +G C IA A+YP
Sbjct: 317 DHGVVAVGYG--TDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 374
Query: 308 L 308
+
Sbjct: 375 I 375
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 184 bits (466), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 122/319 (38%), Positives = 169/319 (52%), Gaps = 40/319 (12%)
Query: 22 FARTYKDQAEKEMRFKIFKKNHEFLRL-------------NKFADLTREKFLASYTGYKP 68
+ RTY E+ RF+++++N +++ N+FADLT ++F A YT P
Sbjct: 47 YNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQEFRAMYT--MP 104
Query: 69 PPTD-HPHSNRSNWFKNLNSSKM-----SFYD---------SIDWNERGAVTPVKDQGSY 113
D P + R + + S+Y S+DW +GAVTPVKDQG
Sbjct: 105 ARVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVDWRSKGAVTPVKDQGGC 164
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFL-ENAFEYIRQYQRLA 171
CCWAF VAT+EGL+KI+TGQLV+ S+ +LVDC + L E A E++ L
Sbjct: 165 GCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDADDGCGGGLPEIAMEWVAHNGGLT 224
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-T 230
+E YPY G+ CD R AS I Q V+ +E L+ V+RQPV+VAI+A
Sbjct: 225 TEANYPYTGKAG-KCD--RGKASNHAAKIAAAQMVRANSEAELERAVARQPVAVAINAPD 281
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GV++GPC +H VT+VGYG + YW++KN W W E G R+ R
Sbjct: 282 SLMFYKSGVYSGPCTAEFDHAVTVVGYGADNKG---HKYWIIKNSWAETWGEKGYGRMQR 338
Query: 291 GVGG-SGLCNIAANAAYPL 308
GV GLC IA +A+YP+
Sbjct: 339 GVAAKEGLCGIATHASYPV 357
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 109/310 (35%), Positives = 169/310 (54%), Gaps = 33/310 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E W + ++Y EK R +F + L LNKF+DLT +F A+
Sbjct: 3 EDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 63 YTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
Y G +KPP + +R K+++ S S+DW + GAVTP+KDQG CWAF+A
Sbjct: 63 YVGKFKPPR----YQDRRP-AKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 117
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+A++E + + T +LV+ S+ QL+DC T++ GC F ++AF+++ + + +E YPY
Sbjct: 118 IASIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPDDAFKFVVENGGVTTEEAYPYT 177
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YHG 237
G C+ + K I GY+ V + + L VS+ PV+V I + NF Y
Sbjct: 178 GFAGS-CN----TNKNKVVEITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRS 232
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
G+ +G C N+ +H V ++GYGT EG PYW++KN WGT+W E G M+I + G G+
Sbjct: 233 GILSGQCCNSRDHAVLVIGYGT----EGGMPYWIIKNSWGTSWGEDGFMKI-KKKDGEGM 287
Query: 298 CNIAANAAYP 307
C + ++YP
Sbjct: 288 CGMNGQSSYP 297
>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 354
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 119/322 (36%), Positives = 165/322 (51%), Gaps = 30/322 (9%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+A++HE+WM F R YKD EK R ++F N L LN F+DLT
Sbjct: 34 VASRHERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTDH 93
Query: 58 KFLASYTGYK---PPPTD--HPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+FL + GY+ P P P + L DS+DW +GAVT +K+Q S
Sbjct: 94 EFLQQHLGYRHHQPGPGGLLRPEDQDMSKATALADYGQDVPDSVDWRAQGAVTEIKNQRS 153
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQRL 170
CWAF AVA EGL KI TG L++ S+ Q++DC+ N C + A Y+ L
Sbjct: 154 CGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGGGNTCDGGDINAALRYVAASGGL 213
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG-LQDVVSRQPVSVAIDA 229
E Y Y Q C +S + ++ G ++ + +EG L+ + + QPV+VA++A
Sbjct: 214 QPEAAYAYAA-QKGACRG--ASPANSAASVGGARFARLGGDEGALRGLAAGQPVAVALEA 270
Query: 230 TWFNFYH--GGVFTGP--CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+ +F H GV+ G CG NHGVT+VGYG E + YW+VKN+WGT W E G
Sbjct: 271 SEPDFRHYKSGVYAGSASCGRRLNHGVTVVGYGA--EDDSGDEYWVVKNQWGTLWGEKGY 328
Query: 286 MRIFRGVGGSGLCNIAANAAYP 307
MR+ RG C IA+ A YP
Sbjct: 329 MRVARGDVAGANCGIASYAYYP 350
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 184 bits (466), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 164/310 (52%), Gaps = 29/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E+W+ + Y+ EK RF++FK N + +L +N+FADLT ++F Y
Sbjct: 46 EEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMY 105
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G K + S +K++ + S+DW ++GAVT VK+QGS CWAF+ VA
Sbjct: 106 LGLKVESSRTRQSPEEFTYKDV----VDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVA 161
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+NKI G L + S+ +L+DC NGC ++ AF +I L E YPY
Sbjct: 162 AVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYL- 220
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
+ CD + I GY+ V E L ++ QP+SVAI+A+ F FY GG
Sbjct: 221 EVESTCD--NKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGG 278
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF GPCG +HGVT VGYG++ + Y +VKN WG W E G +R+ R G +GL
Sbjct: 279 VFDGPCGTQLDHGVTAVGYGSSKGVD----YIIVKNSWGPKWGEKGYIRMKRNTGKPAGL 334
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 335 CGINKMASYP 344
>gi|226503205|ref|NP_001150062.1| thiol protease SEN102 precursor [Zea mays]
gi|195636390|gb|ACG37663.1| thiol protease SEN102 precursor [Zea mays]
Length = 349
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 121/323 (37%), Positives = 172/323 (53%), Gaps = 38/323 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------NKFADLTREKFLA 61
+ + W E+ RTY E + RF ++ +N +F+ N+FADLT E+F
Sbjct: 36 RFQAWQAEYNRTYATPEEFQQRFMVYSENVKFIETMNQPGSSYELGENRFADLTEEEFKD 95
Query: 62 SYTGYKPPPTDHPHS--------NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
+Y P + NR+ N+++ +S+DW +GAVTPVK Q +
Sbjct: 96 TYLMKLDNVASSPEAMALTVDTMNRAGTSGGSNTNEAP--NSVDWRTKGAVTPVKSQ-QH 152
Query: 114 C--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFL---ENAFEYIRQYQ 168
C CWAF AVA++EG++KI+TG LV+ S+ ++VDC +A E++ +
Sbjct: 153 CGSCWAFAAVASIEGVHKIKTGLLVSLSEQEIVDCDRGGNNHGCHGGHSSSAMEWVTRNG 212
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAI 227
L +E YPY GRQ C S G + A IRG Q VQ E LQ V+ +PV+V+I
Sbjct: 213 GLTTESDYPYVGRQG-QC---MSDKLGHHAAKIRGRQAVQGKNEGALQHAVAGRPVAVSI 268
Query: 228 DAT-WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
+A+ F FY G+F+GPC T NH VT+VGYG A G + YW+VKN WG W E G +
Sbjct: 269 NASRAFQFYKRGIFSGPCNTTRNHAVTVVGYGAN--ASGHK-YWIVKNSWGERWGEKGYV 325
Query: 287 RIFRGV-GGSGLCNIAANAAYPL 308
R+ RGV G+C IA Y +
Sbjct: 326 RMQRGVRAREGVCGIAIAPFYAV 348
>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 350
Score = 183 bits (465), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 119/307 (38%), Positives = 161/307 (52%), Gaps = 39/307 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+AA+HEQWM +F R Y D EK R +F N + L LN+F+DLT
Sbjct: 36 VAARHEQWMAKFGRVYTDANEKARRQAVFGANARYVDAVNRAGNRTYTLGLNEFSDLTDN 95
Query: 58 KFLASYTGYKP--PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F ++ GY+ P T + + + SF DW +GAVT VK QG C
Sbjct: 96 EFAKTHLGYREFRPETANISKGVDPGYGLAGNIPKSF----DWRTKGAVTEVKSQGGCGC 151
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNG-CAKNFLENAFEYIRQYQRLASE 173
CWAF AVA EGL KI G L++ S+ Q++DC+T N C ++ +A Y+ L +E
Sbjct: 152 CWAFAAVAATEGLVKIAKGTLISMSEQQVLDCTTGNNTCKGGYMNDALSYVFASGGLQTE 211
Query: 174 CVYPYQG-----RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID 228
Y Y R+D + S +Y + G +++ LQ +V+RQPV VA++
Sbjct: 212 EDYEYNAEKGACRRDVTPNPATSVGHAEYMPLDGNEFL-------LQKLVARQPVVVAVE 264
Query: 229 A--TWFNFYHGGVFTG--PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
A T F Y GGVFTG CG +H T+VGYG G+Q YWLVKN+WGT+W E G
Sbjct: 265 AYGTDFKNYGGGVFTGSPSCGQNLDHFFTVVGYGFAD--GGKQMYWLVKNQWGTSWGESG 322
Query: 285 SMRIFRG 291
MRI RG
Sbjct: 323 YMRIARG 329
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 183 bits (464), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 114/312 (36%), Positives = 158/312 (50%), Gaps = 31/312 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
E W E ++Y Q E+ R K+F+ N++F L LN FADLT +F S
Sbjct: 30 ETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFKTS 89
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
G P + H N + SIDW +G VT VKDQGS CW+F+A
Sbjct: 90 RLGLSAAPLNLAHRNLE-----ITGVVGDIPASIDWRNKGVVTNVKDQGSCGACWSFSAT 144
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+EG+NKI TG LV+ S+ +L++C S +GC ++ AF+++ + +E YPY+
Sbjct: 145 GAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEEDYPYR 204
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHG 237
R D C+ + + I Y V E+ L V+ QPVSV I + F Y
Sbjct: 205 AR-DGTCN--KDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMYSK 261
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-G 296
G+FTGPC + +H V IVGYG+ E YW+VKN WGT W G M + R G S G
Sbjct: 262 GIFTGPCSTSLDHAVLIVGYGS----ENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQG 317
Query: 297 LCNIAANAAYPL 308
+C I A+YP+
Sbjct: 318 VCGINMLASYPV 329
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 121/325 (37%), Positives = 165/325 (50%), Gaps = 53/325 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------------LRLNKFADLTRE 57
E+W E ++TY + EK R K+F+ N+ F L LN FADLT
Sbjct: 34 EKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADLTHH 93
Query: 58 KFLASYTGYKPPPT----DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
+F + G P T P + +S ++ S IDW + GAVTPVKDQ S
Sbjct: 94 EFKTTRLGL--PLTLLRFKRPQNQQSRDLLHIPSQ-------IDWRQSGAVTPVKDQASC 144
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRL 170
CWAF+A +EG+NKI TG LV+ S+ +L+DC T +GC ++ A++++ + +
Sbjct: 145 GACWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGI 204
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYG----AIRGYQYVQPATEEGLQDVVSRQPVSVA 226
+E YPYQ RQ RS + K I Y V P+ EE L+ V S QPVSV
Sbjct: 205 DTEDDYPYQARQ-------RSCSKDKLKRRAVTIEDYVDVPPSEEEILKAVAS-QPVSVG 256
Query: 227 IDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
I + F Y G+FTGPC +H V IVGYG +E YW+VKN WG W G
Sbjct: 257 ICGSEREFQLYSKGIFTGPCSTFLDHAVLIVGYG----SENGVDYWIVKNSWGKYWGMNG 312
Query: 285 SMRIFRGVGGS-GLCNIAANAAYPL 308
+ + R G S G+C I A+YP+
Sbjct: 313 YIHMIRNSGNSKGICGINTLASYPV 337
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 183 bits (464), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 117/310 (37%), Positives = 164/310 (52%), Gaps = 29/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E+W+ + Y+ EK RF++FK N + +L +N+FADLT ++F Y
Sbjct: 49 EEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVTSYWLGVNEFADLTHQEFKNMY 108
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G K + S +K++ + S+DW ++GAVT VK+QGS CWAF+ VA
Sbjct: 109 LGLKVESSRTRQSPEEFTYKDV----VDLPKSVDWRKKGAVTRVKNQGSCGSCWAFSTVA 164
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+NKI G L + S+ +L+DC NGC ++ AF +I L E YPY
Sbjct: 165 AVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEEDYPYL- 223
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
+ CD + I GY+ V E L ++ QP+SVAI+A+ F FY GG
Sbjct: 224 EVESTCD--NKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYSGG 281
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF GPCG +HGVT VGYG++ + Y +VKN WG W E G +R+ R G +GL
Sbjct: 282 VFDGPCGTQLDHGVTAVGYGSSKGVD----YIIVKNSWGPKWGEKGYIRMKRNTGKPAGL 337
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 338 CGINKMASYP 347
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 120/301 (39%), Positives = 163/301 (54%), Gaps = 33/301 (10%)
Query: 31 EKEMRFKIFKKNHEF---------------LRLNKFADLTREKFLASYTGYKPPPTDHPH 75
E E RF++F N +F L +N+FADLT +F A+Y G P
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPA-----G 138
Query: 76 SNRSNWFKNLNSSKMSFYDSIDWNERGAVT-PVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
R + + DS+DW ++GAV PVK+QG CWAF+AVA VEG+NKI TG
Sbjct: 139 RGRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTG 198
Query: 134 QLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWR 190
+LV+ S+ +LV+C+ +GC +++AF +I + L +E YPY D C+ +
Sbjct: 199 ELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTA-MDGKCNLAK 257
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTP 248
S K +I G++ V E LQ V+ QPVSVAIDA F Y GVFTG CG
Sbjct: 258 RSR--KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNL 315
Query: 249 NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
+HGV VGYG T+A YW V+N WG +W E G +R+ R V +G C IA A+YP
Sbjct: 316 DHGVVAVGYG--TDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 373
Query: 308 L 308
+
Sbjct: 374 I 374
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 182 bits (463), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 120/301 (39%), Positives = 163/301 (54%), Gaps = 33/301 (10%)
Query: 31 EKEMRFKIFKKNHEF---------------LRLNKFADLTREKFLASYTGYKPPPTDHPH 75
E E RF++F N +F L +N+FADLT +F A+Y G P
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLGTTPA-----G 138
Query: 76 SNRSNWFKNLNSSKMSFYDSIDWNERGAVT-PVKDQGSY-CCWAFTAVATVEGLNKIRTG 133
R + + DS+DW ++GAV PVK+QG CWAF+AVA VEG+NKI TG
Sbjct: 139 RGRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTG 198
Query: 134 QLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWR 190
+LV+ S+ +LV+C+ +GC +++AF +I + L +E YPY D C+ +
Sbjct: 199 ELVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTA-MDGKCNLAK 257
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTP 248
S K +I G++ V E LQ V+ QPVSVAIDA F Y GVFTG CG
Sbjct: 258 RSR--KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNL 315
Query: 249 NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
+HGV VGYG T+A YW V+N WG +W E G +R+ R V +G C IA A+YP
Sbjct: 316 DHGVVAVGYG--TDAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYP 373
Query: 308 L 308
+
Sbjct: 374 I 374
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 182 bits (462), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 112/314 (35%), Positives = 163/314 (51%), Gaps = 32/314 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLAS 62
+ W +TY + E++ R +IFK NH+F L LN FADLT +F AS
Sbjct: 33 DDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEFKAS 92
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
G + +++ DS+DW ++GAVT VKDQGS CW+F+A
Sbjct: 93 RLGLSVSASSLIMASKGQSL----GGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFSAT 148
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+EG+N+I TG L++ S+ +L+DC S GC ++ AFE++ + + +E YPYQ
Sbjct: 149 GAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYPYQ 208
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYH- 236
R D C + K I Y V+ E+ L++ V+ QPVSV I + F Y
Sbjct: 209 ER-DGTCK--KDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSR 265
Query: 237 -GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
G+F+GPC + +H V IVGYG+ + YW+VKN WG +W G M + R G S
Sbjct: 266 VSGIFSGPCSTSLDHAVLIVGYGSQNGVD----YWIVKNSWGKSWGMDGFMHMQRNTGNS 321
Query: 296 -GLCNIAANAAYPL 308
G+C I A+YP+
Sbjct: 322 EGICGINMLASYPI 335
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 114/315 (36%), Positives = 163/315 (51%), Gaps = 50/315 (15%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREK 58
+ A+ E W+ + + YK EK RF++F++N +L LN+FADL+ E+
Sbjct: 45 LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVSSYWLGLNEFADLSHEE 104
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
F + P +S+DW ++GAVT VK+QG+ CWA
Sbjct: 105 FKSKDVADLP-------------------------ESVDWRKKGAVTHVKNQGACGSCWA 139
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECV 175
F+ VA VEG+N+I TG L T S+ +L+DC T +GC ++ AF +I L E
Sbjct: 140 FSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDD 199
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FN 233
YPY ++ C+ + I GY+ V EE L ++ QP+SVAI+A+ F
Sbjct: 200 YPYL-MEEGTCEEQKEDVD--IVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQ 256
Query: 234 FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
FY GGVF GPCG +HGV VGYG++ + Y +VKN WG W E G +R+ R G
Sbjct: 257 FYSGGVFNGPCGTELDHGVAAVGYGSSKGLD----YIIVKNSWGPKWGEKGYIRMKRNTG 312
Query: 294 GS-GLCNIAANAAYP 307
+ GLC I A+YP
Sbjct: 313 KTEGLCGINKMASYP 327
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 113/270 (41%), Positives = 150/270 (55%), Gaps = 20/270 (7%)
Query: 46 LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
+ LN+FADLT E+F ++Y G+ SNR ++ S + Y +DW GAV
Sbjct: 17 VGLNQFADLTGEEFRSTYLGFTGGSNKTKVSNR---YEPRVSQVLPSY--VDWRSAGAVV 71
Query: 106 PVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENA 160
+K QG C CWAF+A+ATVEG+NKI TG L++ S+ +L+ C GC ++ +
Sbjct: 72 DIKSQGE-CGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGTQNTRGCNGGYITDG 130
Query: 161 FEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR 220
F++I + + YPY QD C+ + KY I Y V E LQ V+
Sbjct: 131 FQFIINNGGINTGENYPYTA-QDGECNLDLQNE--KYVTIDTYGNVPYNNEWALQTAVTY 187
Query: 221 QPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGT 278
QPVSVA+DA F Y G+FTGPCG +H VTIVGYGT EG YW+V+N W T
Sbjct: 188 QPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT----EGGIDYWIVENSWDT 243
Query: 279 NWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
W E G MRI R VGG+G C IA +YP+
Sbjct: 244 TWGEEGYMRILRNVGGAGTCGIATMPSYPV 273
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 182 bits (462), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/323 (35%), Positives = 168/323 (52%), Gaps = 43/323 (13%)
Query: 17 QWMVEFARTYKDQA----EKEMRFKIFKKNHEF--------------LRLNKFADLTREK 58
QW E +T + +++ RF IFK N F L L KF DLT ++
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDE 110
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNS------SKMSFYDSIDWNERGAVTPVKDQGS 112
+ Y G + P + R KN+N + +++DW ++GAV P+KDQG+
Sbjct: 111 YRKLYLGARTEP-----ARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGT 165
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+ A VEG+NKI TG+L++ S+ +LVDC S GC ++ AF++I +
Sbjct: 166 CGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 225
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L +E YPY+G + ++S + +I GY+ V E L+ +S QPVSVAI+A
Sbjct: 226 LNTEKDYPYRGFGGKCNSFLKNS---RVVSIDGYEDVPTKDETALKKAISYQPVSVAIEA 282
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F Y G+FTG CG +H V VGYG+ E YW+V+N WG W E G +R
Sbjct: 283 GGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGS----ENGVDYWIVRNSWGPRWGEEGYIR 338
Query: 288 IFRGVGG--SGLCNIAANAAYPL 308
+ R + SG C IA A+YP+
Sbjct: 339 MERNLAASKSGKCGIAVEASYPV 361
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 116/323 (35%), Positives = 168/323 (52%), Gaps = 43/323 (13%)
Query: 17 QWMVEFARTYKDQA----EKEMRFKIFKKNHEF--------------LRLNKFADLTREK 58
QW E +T + +++ RF IFK N F L L KF DLT ++
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDE 110
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNS------SKMSFYDSIDWNERGAVTPVKDQGS 112
+ Y G + P + R KN+N + +++DW ++GAV P+KDQG+
Sbjct: 111 YRKLYLGARTEP-----ARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGT 165
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+ A VEG+NKI TG+L++ S+ +LVDC S GC ++ AF++I +
Sbjct: 166 CGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 225
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L +E YPY+G + ++S + +I GY+ V E L+ +S QPVSVAI+A
Sbjct: 226 LNTEKDYPYRGFGGKCNSFLKNS---RVVSIDGYEDVPTKDETALKKAISYQPVSVAIEA 282
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F Y G+FTG CG +H V VGYG+ E YW+V+N WG W E G +R
Sbjct: 283 GGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGS----ENGVDYWIVRNSWGPRWGEEGYIR 338
Query: 288 IFRGVGG--SGLCNIAANAAYPL 308
+ R + SG C IA A+YP+
Sbjct: 339 MERNLAASKSGKCGIAVEASYPV 361
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 117/308 (37%), Positives = 172/308 (55%), Gaps = 29/308 (9%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASYTG 65
W V+ ++ Y EK R+++FK+N + +L LN+FAD+ E+F ++Y G
Sbjct: 51 WSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRRNGSYWLGLNQFADVAHEEFKSTYLG 110
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
K D P + F+ NS + + S+DW ++GAVTPVK+QG CWAF+ VA V
Sbjct: 111 LKTG-MDGP-ARAPTAFRYENSVNLPW--SVDWRKKGAVTPVKNQGECGSCWAFSTVAAV 166
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQ 182
EG+N+I TG+L + S+ +L+DC T +GC F++ AF YI + ++ YPY +
Sbjct: 167 EGINQIATGKLESLSEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGIHTDDDYPYL-ME 225
Query: 183 DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVF 240
+ YC K I GY+ V +E L ++ QP+SV I A F FY GVF
Sbjct: 226 EGYCK--EKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQFYKRGVF 283
Query: 241 TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCN 299
G CG +H +T VGYG++ +GQ Y ++KN WG +W E G RI RG G G+C+
Sbjct: 284 EGSCGTELDHALTAVGYGSS---DGQD-YIIMKNSWGKSWGEQGYFRIKRGTGKPEGVCS 339
Query: 300 IAANAAYP 307
I + A+YP
Sbjct: 340 IYSMASYP 347
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 119/319 (37%), Positives = 167/319 (52%), Gaps = 38/319 (11%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREK 58
+++W + DQ + R ++FK+N F L +N+FADLT E+
Sbjct: 43 YQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYRLGMNRFADLTNEE 102
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMS----FYDSIDWNERGAVTPVKDQGSY- 113
+ A + D RS + N ++ DSIDW E+GAV VK QG
Sbjct: 103 YRARFL------RDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAVVAVKSQGRCG 156
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLAS 172
CWAF A+ATVEG+N+I TG L++ S+ QLVDCST N GC + AF+YI + S
Sbjct: 157 SCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTRNHGCEGGWPYRAFQYIINNGGVNS 216
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY G + + +I Y+ V E+ LQ V+ QP+SV I+A+
Sbjct: 217 EEHYPYTGTNGTC---NTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISVGINASGR 273
Query: 233 NF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
NF YH G+FTG C + NHGVT+VGYGT + YW+VKN WG +W + G + + R
Sbjct: 274 NFQLYHSGIFTGSCNTSLNHGVTVVGYGTVNGND----YWIVKNSWGESWGDSGYILMER 329
Query: 291 GVG-GSGLCNIAANAAYPL 308
+ SG C IA + +YP+
Sbjct: 330 NIAESSGKCGIAISPSYPI 348
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 182 bits (461), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 119/323 (36%), Positives = 168/323 (52%), Gaps = 43/323 (13%)
Query: 17 QWMVEFARTYKDQA----EKEMRFKIFKKNHEF--------------LRLNKFADLTREK 58
QW + +T + +++ RF IFK N F L L KF DLT E+
Sbjct: 51 QWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLTKFTDLTNEE 110
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD------SIDWNERGAVTPVKDQGS 112
+ + Y G + P R KN+N + D ++DW +GAV P+KDQG+
Sbjct: 111 YRSLYLGARTEPV-----RRIAKAKNVNQKYSAAVDGKEVPETVDWRLKGAVNPIKDQGT 165
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+ A VEG+NKI TG+L++ S+ +LVDC S GC ++ AF++I +
Sbjct: 166 CGSCWAFSTAAAVEGINKIVTGELISLSEQELVDCDNSYNQGCNGGLMDYAFQFIMKNGG 225
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L +E YPY+G C+ + +A K +I GY+ V E L+ +S QPVSVAI+A
Sbjct: 226 LKTEKDYPYRGFGG-KCNSFLKNA--KVVSIDGYEDVPTKDETALKRAISLQPVSVAIEA 282
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F Y G+FTG CG +H V VGYG+ E YW+V+N WG W E G +R
Sbjct: 283 GGRIFQHYQTGIFTGNCGTNLDHAVVAVGYGS----ENGVDYWIVRNSWGPRWGEEGYIR 338
Query: 288 IFRGVGG--SGLCNIAANAAYPL 308
+ R + SG C IA A+YP+
Sbjct: 339 MERNLASSKSGKCGIAVEASYPV 361
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 128/309 (41%), Positives = 172/309 (55%), Gaps = 33/309 (10%)
Query: 21 EFARTYKDQ-AEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLASYTGY 66
EF T D+ +E E R +IFK N E++ LN+++DLT ++FLAS+TG
Sbjct: 67 EFKATQNDKISELEKRKRIFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGL 126
Query: 67 KPPPTDHPHSNRSNWFK-NLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
K RS NLN + +D W ++GAVT VKDQGS CCWAF+ VA V
Sbjct: 127 KVSKQLSSSKMRSAAVPFNLNDDVPTNFD---WRQQGAVTDVKDQGSCGCCWAFSVVAAV 183
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ-GRQ 182
EG KI TG+L++ S+ QLVDC N GC +++AF+YI Q + + SE YPYQ G Q
Sbjct: 184 EGAVKINTGELISLSEQQLVDCDERNSGCHGGNMDSAFKYIIQ-KGIVSEADYPYQEGSQ 242
Query: 183 DYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDA-TWFNFYHGGVF 240
+ + K+ A I + V E+ L V++QPVSV I+ F Y G V+
Sbjct: 243 T-----CQLNDQMKFEAQITNFIDVPANDEQQLLQAVAQQPVSVGIEVGDEFQHYMGDVY 297
Query: 241 TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCN 299
+G CG + NH VT VGYG + E YWL+KN WG W E G M++ R G G C
Sbjct: 298 SGTCGQSMNHAVTAVGYGVS---EDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCG 354
Query: 300 IAANAAYPL 308
IAA+A+YP+
Sbjct: 355 IAAHASYPI 363
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 116/325 (35%), Positives = 165/325 (50%), Gaps = 29/325 (8%)
Query: 5 SHKT-GNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNK 50
SH+ + +E+W+VE + Y EKE RFKIFK N + + LN+
Sbjct: 30 SHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSYDRGLNQ 89
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP-VKD 109
F+DLT ++F ASY G K +K + D +DW ERGAV P VK
Sbjct: 90 FSDLTVDEFQASYLGGKIEKKSLSDVAERYQYKEGDI----LPDEVDWRERGAVVPRVKR 145
Query: 110 QGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIR 165
QG CWAF A VEG+N+I TG+L++ S+ +L+DC GCA AFE+I+
Sbjct: 146 QGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWAFEFIK 205
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ + ++ Y Y G C + + I G++ V E L+ VS QP+SV
Sbjct: 206 ENGGIVTDEDYGYTGDDTAACKAIEMKTT-RVVTINGHEVVPVNDEMSLKKAVSYQPISV 264
Query: 226 AIDATWFNFYHGGVFTGPCGNT-PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
I A + Y GV+ GPC N +H V IVGYGT+++ + YWL++N WG W EGG
Sbjct: 265 MISAANMSDYKSGVYKGPCSNLWGDHNVLIVGYGTSSD---EGDYWLIRNSWGPGWGEGG 321
Query: 285 SMRIFRGVGG-SGLCNIAANAAYPL 308
+R+ R +G C +A YP+
Sbjct: 322 YLRLQRNFNEPTGKCAVAVAPVYPI 346
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 181 bits (460), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 101/220 (45%), Positives = 134/220 (60%), Gaps = 12/220 (5%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NG 151
S+DW ++GAVT VKDQG CWAF+ + VEG+N+I+T +LV+ S+ +LVDC T G
Sbjct: 5 SVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQNQG 64
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AFE+I+Q + +E YPY+ D CD + +A +I G++ V E
Sbjct: 65 CNGGLMDYAFEFIKQRGGITTEANYPYEAY-DGTCDVSKENAPAV--SIDGHENVPENDE 121
Query: 212 EGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
L V+ QPVSVAIDA + F FY GVFTG CG +HGV IVGYGTT + Y
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTIDG---TKY 178
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
W VKN WG W E G +R+ RG+ GLC IA A+YP+
Sbjct: 179 WTVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPI 218
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 181 bits (459), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 119/312 (38%), Positives = 167/312 (53%), Gaps = 26/312 (8%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKN------------HEFLRLNKFADLTREKFLASYTG 65
W V+ + Y EK R++IFK+N +L LN+FAD+ E+F ASY G
Sbjct: 47 WSVKHGKLYASPTEKLERYEIFKQNLMHIAETNRKNGSYWLGLNQFADVAHEEFKASYLG 106
Query: 66 YK--PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
K P P + F+ ++ S S+DW +GAVTPVK+QG CWAF++VA
Sbjct: 107 LKRALPRAGAPQTRTPTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVA 166
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG+LV+ S+ +LVDC T +GC ++ AF Y+ Q + +E YPY
Sbjct: 167 AVEGINQIVTGKLVSLSEQELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYL- 225
Query: 181 RQDYYCDWWRSSASG-KYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
++ YC + G + G++ V +E L ++ QPVSV I A F FY G
Sbjct: 226 MEEGYCKEKQPCVLGITEQDLTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRG 285
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF G C +H +T VGYG++ Q Y +KN WG NW E G +RI G G G
Sbjct: 286 GVFDGACSVELDHALTAVGYGSSY----GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEG 341
Query: 297 LCNIAANAAYPL 308
+C I A+YP+
Sbjct: 342 VCGIYTMASYPV 353
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 112/330 (33%), Positives = 174/330 (52%), Gaps = 34/330 (10%)
Query: 3 RTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR--------------- 47
+ + + G+++ +W + +TY + EKE+R KIF NHEF++
Sbjct: 56 KATKEVGSLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFV 115
Query: 48 -LNKFADLTREKFLASYTGYKPPP-TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
LN ADLT+++F GY + S W + ++ + IDW GAVT
Sbjct: 116 GLNHLADLTKDEF-KKMLGYNAALRASRAPVDASTW----EYADVTPPEEIDWVASGAVT 170
Query: 106 PVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAF 161
PVK+Q C CWAF+ VEG+N I+TG+L++ S+ +L+ CST GC ++N F
Sbjct: 171 PVKNQ-KQCGSCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGF 229
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
E+I + + +E + Y +++ C ++R + AI G++ V E+ L VS+Q
Sbjct: 230 EWIVNNRGIDTEDGWEYVAKEEK-CGFFRRHH--RAVAIDGFKDVPSNDEDSLMKAVSQQ 286
Query: 222 PVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGT 278
PVSVAI+A F Y GGV++ CG +HGV +VGYG ++ + +W +KN WG
Sbjct: 287 PVSVAIEADHQSFQLYAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGP 346
Query: 279 NWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
W E G +RI +G G G C +A +YP
Sbjct: 347 AWGEDGYIRIAKGGSGVEGQCGVAMQPSYP 376
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 116/322 (36%), Positives = 167/322 (51%), Gaps = 42/322 (13%)
Query: 17 QWMVEFARTYKDQA----EKEMRFKIFKKNHEFLRLNK--------------FADLTREK 58
+W +E ++ + +++ RF IFK N F+ L+ FA+LT ++
Sbjct: 6 RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDE 65
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD------SIDWNERGAVTPVKDQGS 112
+ + Y G + P R KN+N + + ++DW ++GAV +KDQG+
Sbjct: 66 YRSLYLGARTEPV-----RRITKAKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGT 120
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+ A VEG+NKI TG+LV+ S+ +LVDC S GC ++ AF++I +
Sbjct: 121 CGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 180
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L +E YPY G ++S + I GY+ V E L+ VS QPVSVAIDA
Sbjct: 181 LNTEKDYPYHGTNGKCNSLLKNS---RVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDA 237
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F Y G+FTG CG +H V VGYG+ E YW+V+N WGT W E G +R
Sbjct: 238 GGRAFQHYQSGIFTGKCGTNMDHAVVAVGYGS----ENGVDYWIVRNSWGTRWGEDGYIR 293
Query: 288 IFRGVGG-SGLCNIAANAAYPL 308
+ R V SG C IA A+YP+
Sbjct: 294 MERNVASKSGKCGIAIEASYPV 315
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 121/303 (39%), Positives = 168/303 (55%), Gaps = 37/303 (12%)
Query: 31 EKEMRFKIFKKNHEF---------------LRLNKFADLTREKFLASYTGYKPPPTDHPH 75
E E RF++F N +F L +N+FADLT ++F A+Y G P
Sbjct: 86 EYERRFRVFWDNLKFVDAHNAHADGHGGFRLGMNRFADLTNDEFRAAYLGTTP------- 138
Query: 76 SNRSNWFKNL--NSSKMSFYDSIDWNERGAV-TPVKDQGSY-CCWAFTAVATVEGLNKIR 131
+ R + + + DS+DW ++GAV +PVK+QG CWAF+AVA VEG+NKI
Sbjct: 139 AGRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGSCWAFSAVAAVEGINKIV 198
Query: 132 TGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDW 188
TG+LV+ S+ +LV+C+ +GC +++AF +I + L +E YPY D CD
Sbjct: 199 TGELVSLSEQELVECARNGGNSGCNGGIMDDAFAFITRNGGLDTEEDYPYTA-MDGKCDL 257
Query: 189 WRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGN 246
+ S K +I G++ V E LQ V+ QPVSVAIDA F Y GVFTG CG
Sbjct: 258 AKKSR--KVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGT 315
Query: 247 TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAA 305
+ +HGV VGYG T+A YW V+N WG +W E G +R+ R V +G C IA A+
Sbjct: 316 SLDHGVVAVGYG--TDAATGTDYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMAS 373
Query: 306 YPL 308
YP+
Sbjct: 374 YPI 376
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 181 bits (458), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 116/322 (36%), Positives = 167/322 (51%), Gaps = 42/322 (13%)
Query: 17 QWMVEFARTYKDQA----EKEMRFKIFKKNHEFLRLNK--------------FADLTREK 58
+W +E ++ + +++ RF IFK N F+ L+ FA+LT ++
Sbjct: 6 RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDE 65
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD------SIDWNERGAVTPVKDQGS 112
+ + Y G + P R KN+N + + ++DW ++GAV +KDQG+
Sbjct: 66 YRSLYLGARTEPV-----RRITKAKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGT 120
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+ A VEG+NKI TG+LV+ S+ +LVDC S GC ++ AF++I +
Sbjct: 121 CGSCWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 180
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L +E YPY G ++S + I GY+ V E L+ VS QPVSVAIDA
Sbjct: 181 LNTEKDYPYHGTNGKCNSLLKNS---RVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDA 237
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F Y G+FTG CG +H V VGYG+ E YW+V+N WGT W E G +R
Sbjct: 238 GGRAFQHYQSGIFTGKCGTNMDHAVVAVGYGS----ENGVDYWIVRNSWGTRWGEDGYIR 293
Query: 288 IFRGVGG-SGLCNIAANAAYPL 308
+ R V SG C IA A+YP+
Sbjct: 294 MERNVASKSGKCGIAIEASYPV 315
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 114/325 (35%), Positives = 169/325 (52%), Gaps = 37/325 (11%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
+I+ + W + +TY + E++ R +IFK NH+F L LN FADLT
Sbjct: 25 DISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTH 84
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+F AS G P ++ ++L S + DS+DW ++GAVT VKDQGS C
Sbjct: 85 HEFKASRLGLSVSA---PSVIMASKGQSLGGS-VKVPDSVDWRKKGAVTNVKDQGSCGAC 140
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASE 173
W+F+A +EG+N+I TG L++ S+ +L+DC S GC ++ AFE++ + + +E
Sbjct: 141 WSFSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTE 200
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID----- 228
YPYQ R D C + K I Y V+ E+ L + V+ QPVSV I
Sbjct: 201 KDYPYQER-DGTCK--KDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERA 257
Query: 229 ----ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
++ F G+F+GPC + +H V IVGYG+ + YW+VKN WG +W G
Sbjct: 258 FQLYSSKFYLLMQGIFSGPCSTSLDHAVLIVGYGSQNGVD----YWIVKNSWGKSWGMDG 313
Query: 285 SMRIFRGVGGS-GLCNIAANAAYPL 308
M + R S G+C I A+YP+
Sbjct: 314 FMHMQRNTENSDGVCGINMLASYPI 338
>gi|164472556|gb|ABY58967.1| cathepsin L [Toxoplasma gondii]
Length = 421
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 164/307 (53%), Gaps = 29/307 (9%)
Query: 22 FARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPP 69
+A++Y + EK+ R+ IFK N + L++N F DL+R++F Y G+K
Sbjct: 123 YAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFRRKYLGFKKS 182
Query: 70 PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYCCWAFTAVATVEG 126
H + + LN +DW RG VTPVKDQ GS CWAF+ +EG
Sbjct: 183 RNLKSH-HLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGS--CWAFSTTGALEG 239
Query: 127 LNKIRTGQLVTRSKHQLVDCSTLNG---CAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
+ +TG+LV+ S+ +L+DCS G C+ + +AF+Y+ + SE YPY R D
Sbjct: 240 AHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLAR-D 298
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFT 241
C R+ + K I G++ V +E ++ +++ PVS+AI+A F FYH GVF
Sbjct: 299 EEC---RAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFD 355
Query: 242 GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIA 301
CG +HGV +VGYG T+ E ++ +W++KN WGT W G M + G G C +
Sbjct: 356 ASCGTDLDHGVLLVGYG--TDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLL 413
Query: 302 ANAAYPL 308
+A++P+
Sbjct: 414 LDASFPV 420
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 117/311 (37%), Positives = 166/311 (53%), Gaps = 31/311 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E W+V+ ++ Y+ EK RF+IF N + +L LN+FADLT E+F +
Sbjct: 50 ESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHKF 109
Query: 64 TGYKPPPTDHP-HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
G+K + S++ +++ + S+DW ++GAV PVK+QG CWAF+ V
Sbjct: 110 LGFKGELAERKDESSKEFGYRDF----VDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTV 165
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N+I TG L S+ +L+DC T NGC ++ AF Y+ + L E YPY
Sbjct: 166 AAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMR-SGLHKEEEYPYI 224
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
+ CD + S K I GY V E ++ QP+SVAI+A+ F FY G
Sbjct: 225 MSEG-TCD-EKKDVSEKV-TISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSG 281
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF G CG +HGV VGYGTT + Y +V+N WG W E G +R+ RG G G
Sbjct: 282 GVFDGHCGTELDHGVAAVGYGTTKGLD----YVIVRNSWGPKWGEKGYIRMKRGSGKPHG 337
Query: 297 LCNIAANAAYP 307
+C + A+YP
Sbjct: 338 MCGLYMMASYP 348
>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 358
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 116/327 (35%), Positives = 164/327 (50%), Gaps = 41/327 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTRE 57
+A++HE+WM F R+Y D EK R ++F N L LN+F+DLT
Sbjct: 38 MASRHERWMARFGRSYTDAGEKARRQEVFGANARHVDAVNRAGNRTYTLGLNQFSDLTDH 97
Query: 58 KFLASYTGYK----------PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+FL + GY P P + + +++ S+DW +GAVT +
Sbjct: 98 EFLQQHLGYGRHHGQRGLLLPEEEVMPKATALGYGQDMPY-------SVDWRAKGAVTEI 150
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TLNGCAKNFLENAFEYIR 165
K+Q S CWAF AVA EGL KI TG L++ S+ Q++DC+ + C ++ +A Y+
Sbjct: 151 KNQRSCGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGDRSSCDSGYISDALRYVV 210
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG-LQDVVSRQPVS 224
L E Y Y G Q C R + ++ G +EG LQ + +RQPV+
Sbjct: 211 TSGGLQREAAYAYTG-QKGACGSRRFARPNSAASVGGVHMATLNGDEGALQGLAARQPVA 269
Query: 225 VAIDATWFNFYH--GGVFTG--PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
V ++A+ +F H GV+ G CG NH +T+VGYGT G YWLVKN+WGT W
Sbjct: 270 VIVEASEPDFRHYSSGVYAGSASCGRELNHALTVVGYGTEN---GAGEYWLVKNQWGTWW 326
Query: 281 DEGGSMRIFRGVGGSGLCNIAANAAYP 307
E G MR+ R G C IA+ A YP
Sbjct: 327 GENGYMRVARRNGAGANCGIASVAFYP 353
>gi|237844793|ref|XP_002371694.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|50313163|gb|AAT74529.1| toxopain-2 [Toxoplasma gondii]
gi|89242977|gb|ABD64744.1| cathepsin L [Toxoplasma gondii]
gi|95007485|emb|CAJ20707.1| toxopain-2 [Toxoplasma gondii RH]
gi|211969358|gb|EEB04554.1| cathepsin L-like thiolproteinase, putative [Toxoplasma gondii ME49]
gi|221480879|gb|EEE19300.1| cysteine protease, putative [Toxoplasma gondii GT1]
gi|221501596|gb|EEE27366.1| cysteine protease, putative [Toxoplasma gondii VEG]
Length = 422
Score = 180 bits (457), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 107/307 (34%), Positives = 164/307 (53%), Gaps = 29/307 (9%)
Query: 22 FARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPP 69
+A++Y + EK+ R+ IFK N + L++N F DL+R++F Y G+K
Sbjct: 124 YAKSYATEEEKQRRYAIFKNNLVYIHTHNQQGYSYSLKMNHFGDLSRDEFRRKYLGFKKS 183
Query: 70 PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYCCWAFTAVATVEG 126
H + + LN +DW RG VTPVKDQ GS CWAF+ +EG
Sbjct: 184 RNLKSH-HLGVATELLNVLPSELPAGVDWRSRGCVTPVKDQRDCGS--CWAFSTTGALEG 240
Query: 127 LNKIRTGQLVTRSKHQLVDCSTLNG---CAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
+ +TG+LV+ S+ +L+DCS G C+ + +AF+Y+ + SE YPY R D
Sbjct: 241 AHCAKTGKLVSLSEQELMDCSRAEGNQSCSGGEMNDAFQYVLDSGGICSEDAYPYLAR-D 299
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFT 241
C R+ + K I G++ V +E ++ +++ PVS+AI+A F FYH GVF
Sbjct: 300 EEC---RAQSCEKVVKILGFKDVPRRSEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFD 356
Query: 242 GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIA 301
CG +HGV +VGYG T+ E ++ +W++KN WGT W G M + G G C +
Sbjct: 357 ASCGTDLDHGVLLVGYG--TDKESKKDFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLL 414
Query: 302 ANAAYPL 308
+A++P+
Sbjct: 415 LDASFPV 421
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 116/317 (36%), Positives = 179/317 (56%), Gaps = 42/317 (13%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF-----------LRLNKFADLTREKFLASYT 64
E W + Y +Q E + R +F +N + + +N+F+DLTR++F+ +Y
Sbjct: 26 EAWKSFHGKKYHNQGEDDFRHYVFLQNIKTIAAHNAKSTFKMAINEFSDLTRKEFVKTYN 85
Query: 65 GYK---PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
GY+ T+ P S + LN++ + +DW + G VTP+K+QG CWAF+
Sbjct: 86 GYRLSMKKSTNKP----STFMAPLNTNMPT---EVDWRKEGYVTPIKNQGRCGSCWAFST 138
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYP 177
++EG + +TG+LV+ S+ L+DCS +GC F+++AFEYI+ + +E YP
Sbjct: 139 TGSLEGQHFRKTGKLVSLSEQNLIDCSAAEGNDGCGGGFMDDAFEYIKLNNGIDTEASYP 198
Query: 178 YQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--FN 233
Y+GR D C + +++ GAI GY ++ +E+ L+ V+ P+SVAIDA+ F+
Sbjct: 199 YEGRDD-ICRYKKTNK----GAIDTGYMDIKQYSEDDLKAAVATVGPISVAIDASHKSFH 253
Query: 234 FYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
YH GV+ P C T +HGV +VGYGT E + YWLVKN WGT+W G +++ R
Sbjct: 254 MYHTGVYHEPECSQTVLDHGVLVVGYGT----ENGEDYWLVKNSWGTDWGMNGYIKMSRN 309
Query: 292 VGGSGLCNIAANAAYPL 308
S C IA NA+YPL
Sbjct: 310 R--SNNCGIATNASYPL 324
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 118/312 (37%), Positives = 167/312 (53%), Gaps = 33/312 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E W+V+ ++ Y+ EK RF+IF N + +L LN+FADLT E+F +
Sbjct: 50 ESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKHKF 109
Query: 64 TGYKPPPTDHP-HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTA 120
G+K + S++ +++ + S+DW ++GAV PVK+QG C CWAF+
Sbjct: 110 LGFKGELAERKDESSKEFGYRDF----VDLPKSVDWRKKGAVAPVKNQGQ-CGNCWAFST 164
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPY 178
VA VEG+N+I TG L S+ +L+DC T NGC ++ AF Y+ + L E YPY
Sbjct: 165 VAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMR-SGLHKEEEYPY 223
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
+ CD + S K I GY V E ++ QP+SVAI+A+ F FY
Sbjct: 224 IMSEG-TCD-EKKDVSEKV-TISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYS 280
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-S 295
GGVF G CG +HGV VGYGTT + Y +V+N WG W E G +R+ RG G
Sbjct: 281 GGVFDGHCGTELDHGVAAVGYGTTKGLD----YVIVRNSWGPKWGEKGYIRMKRGSGKPH 336
Query: 296 GLCNIAANAAYP 307
G+C + A+YP
Sbjct: 337 GMCGLYMMASYP 348
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 180 bits (457), Expect = 7e-43, Method: Compositional matrix adjust.
Identities = 115/323 (35%), Positives = 167/323 (51%), Gaps = 43/323 (13%)
Query: 17 QWMVEFARTYKDQA----EKEMRFKIFKKNHEF--------------LRLNKFADLTREK 58
QW E +T + +++ RF IFK N F L L KF DLT ++
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDE 110
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNS------SKMSFYDSIDWNERGAVTPVKDQGS 112
+ Y G + P + R KN+N + +++DW ++GAV P+KDQG+
Sbjct: 111 YRKLYLGARTEP-----ARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGT 165
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQR 169
CWAF+ A VEG+NKI TG+L++ S+ +LVDC S GC ++ AF++I +
Sbjct: 166 CGSCWAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGG 225
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L +E YPY+G + ++S + +I GY+ V E L+ +S QPV VAI+A
Sbjct: 226 LNTEKDYPYRGFGGKCNSFLKNS---RVVSIDGYEDVPTKDETALKKAISYQPVRVAIEA 282
Query: 230 --TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F Y G+FTG CG +H V VGYG+ E YW+V+N WG W E G +R
Sbjct: 283 GGRIFQHYQSGIFTGSCGTNLDHAVVAVGYGS----ENGVDYWIVRNSWGPRWGEEGYIR 338
Query: 288 IFRGVGG--SGLCNIAANAAYPL 308
+ R + SG C IA A+YP+
Sbjct: 339 MERNLAASKSGKCGIAVEASYPV 361
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 180 bits (456), Expect = 9e-43, Method: Compositional matrix adjust.
Identities = 118/313 (37%), Positives = 165/313 (52%), Gaps = 43/313 (13%)
Query: 31 EKEMRFKIFKKN---------------HEF-LRLNKFADLTREKFLASYTGYKPPPTDHP 74
E +R ++F+ N H F L L FADLT E++ G++ P
Sbjct: 74 EDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPFADLTLEEYRGRALGFRARHRGGP 133
Query: 75 HSNRSNWFKNLNSSKMSFY-------------DSIDWNERGAVTPVKDQGSYC--CWAFT 119
S R+ + + S + D+IDW + GAVT VK+Q C CWAF+
Sbjct: 134 -SARAAASRVGSGGTRSHHRRPRPRPRCGDLPDAIDWRQLGAVTDVKNQ-EQCGGCWAFS 191
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPY 178
AVA +EG+N I TG LV+ S+ +++DC T + GC +ENAF+++ + SE YP+
Sbjct: 192 AVAAIEGINAIVTGNLVSLSEQEIIDCDTQDSGCNGGQMENAFQFVIDNGGIDSEADYPF 251
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYH 236
D CD +++ K AI G+ V E LQ+ V+ QPVSVAIDA F Y
Sbjct: 252 IA-TDGTCDANKANDE-KVAAIDGFVEVASNNETALQEAVAIQPVSVAIDAGGRAFQHYS 309
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGS 295
G+F GPCG +HGVT+VGYG+ E + YW+VKN W +W E G +RI R V
Sbjct: 310 SGIFNGPCGTNLDHGVTVVGYGS----ENGKAYWIVKNSWSDSWGEAGYIRIRRNVFLPV 365
Query: 296 GLCNIAANAAYPL 308
G C IA +A+YP+
Sbjct: 366 GKCGIAMDASYPV 378
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 118/312 (37%), Positives = 172/312 (55%), Gaps = 28/312 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN------------HEFLRLNKFADLTREKFLASY 63
+ W V+ + Y EK R+ IFK+N +L LN+FAD+T E+F A++
Sbjct: 46 KSWSVKHRKIYVSPKEKLKRYGIFKQNLMHIAETNRKNGSYWLGLNQFADITHEEFKANH 105
Query: 64 TGYKPPPTDHPHSNRS-NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
G K + R+ F+ ++ + + S+DW +GAVTPVK+QG CWAF++V
Sbjct: 106 LGLKQGLSRMGAQTRTPTTFRYAAAANLPW--SVDWRYKGAVTPVKNQGKCGSCWAFSSV 163
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N+I TG+LV+ S+ +L+DC T+ +GC ++ AF YI Q + +E YPY
Sbjct: 164 AAVEGINQIVTGKLVSLSEQELMDCDTMLDHGCEGGLMDFAFAYIMGSQGIHAEDDYPYL 223
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
++ YC + A+ I GY+ V +E L ++ QPVSV I A F FY G
Sbjct: 224 -MEEGYCKEKQPYAN--VVTITGYEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYKG 280
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF G C + +H +T VGYG++ Q Y +KN WG NW E G +RI G G G
Sbjct: 281 GVFDGSCSDELDHALTAVGYGSSY----GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEG 336
Query: 297 LCNIAANAAYPL 308
+C I A+YP+
Sbjct: 337 VCGIYTMASYPV 348
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 179 bits (455), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 114/312 (36%), Positives = 166/312 (53%), Gaps = 37/312 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E W +E + YK+ EK RF+IFK N + L LN+FADLT ++F A Y
Sbjct: 23 ESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKKNSSYWLGLNEFADLTHDEFKAKY 82
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G + + F + + + +SIDW ++GAVTPVK+Q CWAF+ VA
Sbjct: 83 VGSLGEDSTIIEQSDDEEFPYKHV--VDYPESIDWRQKGAVTPVKNQNPCGSCWAFSTVA 140
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
TVEG+NKI TG+L++ S+ +L+DC + GC + + +Y+ + +E YPY+ +
Sbjct: 141 TVEGINKIVTGKLISLSEQELLDCDRRSHGCKGGYQTTSLQYVAD-NGVHTEKEYPYEKK 199
Query: 182 QDYYCDWWRSSASGKYGA---IRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYH 236
Q + A K G+ I GY+ V E L ++ QPVSV +++ F FY
Sbjct: 200 QG------KCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPVSVVVESKGRAFQFYK 253
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS- 295
GG+F GPCG +H VT VGYG + Y L+KN WG W E G +RI R G S
Sbjct: 254 GGIFEGPCGTKVDHAVTAVGYG--------KNYILIKNSWGPKWGEKGYIRIKRASGKSK 305
Query: 296 GLCNIAANAAYP 307
G C + +++ +P
Sbjct: 306 GTCGVYSSSYFP 317
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 179 bits (454), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 115/328 (35%), Positives = 170/328 (51%), Gaps = 42/328 (12%)
Query: 11 IAAKHE----QWMVEF-------ARTYKDQAEKEMRFKIFKKNHEF------------LR 47
+AA HE +M+ F + Y E +RF IFK N + L
Sbjct: 12 VAAGHEVPPPDYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALG 71
Query: 48 LNKFADLTREKFLASYTGYKPPP--TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
+N+F DLT+E+ ASYTG KP + P + + N + ++ S+DW +G VT
Sbjct: 72 VNEFTDLTQEELAASYTGLKPASLWSGLPRLSTHEY----NGAPLA--SSVDWTTQGVVT 125
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEY 163
PVK+QG CW+F+ +EG + TG LV+ S+ Q VDC T + GC +++NAF +
Sbjct: 126 PVKNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSEQQFVDCDTTDSGCNGGWMDNAFSF 185
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
++ + +E YPY D C+ G + GY V +E+ + V++QPV
Sbjct: 186 AKK-NSICTEGSYPYTAT-DGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPV 243
Query: 224 SVAIDATWFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
S+AI+A ++F Y GV T CG +HGV VGYG+ E YW VKN WG++W
Sbjct: 244 SIAIEADQYSFQLYSSGVLTASCGTRLDHGVLAVGYGS----EAGTDYWKVKNSWGSSWG 299
Query: 282 EGGSMRIFRGVGGSGLCNIAAN-AAYPL 308
E G +R+ RG GG+G C + A +YP+
Sbjct: 300 EQGYVRLQRGKGGAGECGLLAGPPSYPV 327
>gi|310942960|pdb|3P5W|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 101/221 (45%), Positives = 129/221 (58%), Gaps = 13/221 (5%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TL 149
D +DW GAV +KDQG CWAF+ +A VEG+NKI TG L++ S+ +LVDC
Sbjct: 3 DYVDWRSSGAVVDIKDQGQCGSCWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQNT 62
Query: 150 NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPA 209
GC F+ + F++I + +E YPY + C+ KY +I Y+ V
Sbjct: 63 RGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQ-CN--LDLQQEKYVSIDTYENVPYN 119
Query: 210 TEEGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQ 267
E LQ V+ QPVSVA++A +NF H G+FTGPCG +H VTIVGYGT EG
Sbjct: 120 NEWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGT----EGGI 175
Query: 268 PYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
YW+VKN WGT W E G MRI R VGG G C IA A+YP+
Sbjct: 176 DYWIVKNSWGTTWGEEGYMRIQRNVGGVGQCGIAKKASYPV 216
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 95/224 (42%), Positives = 139/224 (62%), Gaps = 13/224 (5%)
Query: 91 SFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL 149
+ DS+DW E+GAV P+KDQG CWAF+ +A+VEG+NKI TG L++ S+ +LVDC
Sbjct: 40 ALPDSVDWREKGAVVPIKDQGGCGSCWAFSTIASVEGINKIVTGDLISLSEQELVDCDKT 99
Query: 150 --NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQ 207
+GC ++ AF++I + +E YPY QD CD +R +A K +I Y+ V
Sbjct: 100 YNDGCNGGLMDYAFQFIIDNGGIDTEKDYPYT-EQDGRCDSYRKNA--KVVSINSYEDVP 156
Query: 208 PATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEG 265
E+ L+ + QP++VAID F Y+ G+FTG CG + +HGVT+VGYG+ E
Sbjct: 157 VNDEQALKKAAASQPIAVAIDGGGRSFQLYNSGIFTGKCGTSLDHGVTVVGYGS----ES 212
Query: 266 QQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
+ YW+V+N WG +W E G +R+ R + SG+C IA A+YP+
Sbjct: 213 GKDYWIVRNSWGESWGEKGYIRMARNIDSPSGICGIAMEASYPI 256
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 179 bits (453), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 115/339 (33%), Positives = 164/339 (48%), Gaps = 56/339 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNK-------------FADLTREKFLAS 62
++W E+ R+Y E+ R +++ +N ++ + DLT ++F+A
Sbjct: 53 QRWKAEYNRSYATPEEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTNDEFMAM 112
Query: 63 YTGYKPP----------------------PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNE 100
YT PP P D H +F + S+DW
Sbjct: 113 YTA--PPLRSAADDDDDAATTTIITTRAGPVDE-HQQPEVYFNESAGAPA----SVDWRA 165
Query: 101 RGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLE 158
GAVT VKDQG CWAF+ VA VEG+ KI+ G+LV+ S+ +LVDC TL+ GC
Sbjct: 166 SGAVTEVKDQGRCGSCWAFSTVAVVEGIQKIKKGKLVSLSEQELVDCDTLDSGCDGGVSY 225
Query: 159 NAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVV 218
A E+I + + YPY G CD R+ I G + V +E LQ+
Sbjct: 226 RALEWITANGGITTRDDYPYTGAAAAACD--RAKLGHHAATIAGLRRVATRSEASLQNAA 283
Query: 219 SRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYG-----TTTEAEGQQPYWL 271
+ QPV+V+I+A NF H GV+ GPCG NHGVT+VGYG A G + YW+
Sbjct: 284 AAQPVAVSIEAGGDNFQHYRKGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDK-YWI 342
Query: 272 VKNRWGTNWDEGGSMRIFRGVGG--SGLCNIAANAAYPL 308
+KN WG NW + G +++ + V G GLC IA ++PL
Sbjct: 343 IKNSWGKNWGDQGYIKMKKDVAGKPEGLCGIAIRPSFPL 381
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 128/345 (37%), Positives = 178/345 (51%), Gaps = 49/345 (14%)
Query: 4 TSHKTGNIAAKHEQWMVEFAR-TYKDQAEKEMRFKIFKKN-HEF-----------LRLNK 50
+SH++ +A E+W+ + Y EK RF++FK N H L LN+
Sbjct: 39 SSHES--LAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRKVSSYWLGLNE 96
Query: 51 FADLTREKFLASYTGYKPPPTD----HPHSNRSNW----------------FKNLNSSKM 90
FADLT ++F A+Y G P H H + + ++ ++++++
Sbjct: 97 FADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAARL 156
Query: 91 SFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL 149
S+DW +GAVT VK+QG CWAF+ VA VEG+N+I TG L S+ +LVDC T
Sbjct: 157 P--KSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTD 214
Query: 150 --NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQ 207
NGC ++ AF YI L +E YPY ++ C S+A I GY+ V
Sbjct: 215 GNNGCNGGLMDYAFSYIAHNGGLHTEEAYPYL-MEEGTCSRGSSAA---VVTISGYEDVP 270
Query: 208 PATEEGLQDVVSRQPVSVAIDATWFN--FYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEG 265
E+ L ++ QPVSVAI+A+ N FY GGVF GPCG +HGV VGYGT + G
Sbjct: 271 RNNEQALLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAAVGYGTAGKDNG 330
Query: 266 Q--QPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
Y +VKN WG +W E G +R+ RG G GLC I +YP
Sbjct: 331 HVVADYIIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYP 375
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 112/304 (36%), Positives = 161/304 (52%), Gaps = 33/304 (10%)
Query: 27 KDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKFLASYTGYKPPP 70
+++ ++ +R ++F+ N H F L L FADLT E++ G++
Sbjct: 80 QEEEDRRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPFADLTLEEYRGRVLGFRARG 139
Query: 71 TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVATVEGLN 128
S + + D+IDW + GAVT VKDQ C CWAF+AVA +EG+N
Sbjct: 140 RRSGARYGSGY----SVRGGDLPDAIDWRQLGAVTEVKDQ-QQCGGCWAFSAVAAIEGVN 194
Query: 129 KIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCD 187
I TG LV+ S+ +++DC + GC +ENAF ++ + +E YP+ G D CD
Sbjct: 195 AIATGNLVSLSEQEIIDCDAQDSGCDGGQMENAFRFVIGNGGIDTEADYPFIG-TDGTCD 253
Query: 188 WWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCG 245
+ K I G V E LQ+ V+ QPVSVAIDA+ F Y G+F GPCG
Sbjct: 254 ASKEKNE-KVATIDGLVEVASNNETALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCG 312
Query: 246 NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANA 304
+ +HGVT VGYG+ E + YW+VKN W +W E G +R+ R V +G C IA +A
Sbjct: 313 TSLDHGVTAVGYGS----ESGKDYWIVKNSWSASWGEAGYIRMRRNVPRPTGKCGIAMDA 368
Query: 305 AYPL 308
+YP+
Sbjct: 369 SYPV 372
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 178 bits (452), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 115/328 (35%), Positives = 170/328 (51%), Gaps = 42/328 (12%)
Query: 11 IAAKHE----QWMVEF-------ARTYKDQAEKEMRFKIFKKNHEF------------LR 47
+AA HE +M+ F + Y E +RF IFK N + L
Sbjct: 12 VAAGHEVPPPDYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNARNLTFALG 71
Query: 48 LNKFADLTREKFLASYTGYKPPP--TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
+N+F DLT+E+F ASYTG KP + P + + N + ++ S+DW +G VT
Sbjct: 72 VNEFTDLTQEEFAASYTGLKPASLWSGLPRLSTHEY----NGAPLA--SSVDWTTQGVVT 125
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEY 163
PVK+QG CW+F+ +EG + TG LV+ S+ Q DC T + GC +++NAF +
Sbjct: 126 PVKNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSEQQFEDCDTTDSGCNGGWMDNAFSF 185
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPV 223
++ + +E YPY D C+ G + GY V +E+ + V++QPV
Sbjct: 186 AKK-NSICTEGSYPYTAT-DGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPV 243
Query: 224 SVAIDATWFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
S+AI+A ++F Y GV T CG +HGV VGYG+ E YW VKN WG++W
Sbjct: 244 SIAIEADQYSFQLYSSGVLTASCGTRLDHGVLAVGYGS----EAGTDYWKVKNSWGSSWG 299
Query: 282 EGGSMRIFRGVGGSGLCNIAAN-AAYPL 308
E G +R+ RG GG+G C + A +YP+
Sbjct: 300 EQGYVRLQRGKGGAGECGLLAGPPSYPV 327
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 113/310 (36%), Positives = 163/310 (52%), Gaps = 29/310 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + + Y+ EK +RF+IFK N + +L LN+FADL+ ++F Y
Sbjct: 48 ESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVSNYWLGLNEFADLSHQEFKNKY 107
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G K + S +K++ K S+DW ++GAV PVK+QGS CWAF+ VA
Sbjct: 108 LGLKVDYSRRRESPEEFTYKDVELPK-----SVDWRKKGAVAPVKNQGSCGSCWAFSTVA 162
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +L+DC NGC ++ AF +I + L E YPY
Sbjct: 163 AVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEEDYPYI- 221
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ C+ + + I GY V E+ L ++ Q +SVAI+A+ F FY GG
Sbjct: 222 MEEGTCEMTKEET--EVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYSGG 279
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLC 298
VF G CG+ +HGV VGYGT + Y +VKN WG+ W E G +R+ + G
Sbjct: 280 VFDGHCGSDLDHGVAAVGYGTAKGVD----YIIVKNSWGSKWGEKGYIRMRGTLETRGNL 335
Query: 299 NIAANAAYPL 308
A+YPL
Sbjct: 336 RYLQMASYPL 345
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 178 bits (452), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 118/317 (37%), Positives = 168/317 (52%), Gaps = 37/317 (11%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------NKFADLTREK 58
+ ++E W+ ++ + Y+++ E E RF+I++ N +F+ + NKF DLT E+
Sbjct: 40 MRMRYESWLKKYGQKYRNKDEWEFRFEIYRANVQFIEVYNSQNYSYKLMDNKFVDLTNEE 99
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CW 116
F Y Y+P H + R + K+ + K IDW RGAVT +KDQG +C CW
Sbjct: 100 FRRMYLVYQPRS--HLQT-RFMYQKHGDLPK-----RIDWRTRGAVTXIKDQG-HCGSCW 150
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASE 173
+F+AVATVE +NKI+TG+LV+ S+ QL+DC N GC +E F +I + L ++
Sbjct: 151 SFSAVATVEDINKIKTGKLVSLSEQQLIDCDNRNGNEGCNGGHME-TFTFITKRGGLTTD 209
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--W 231
YPYQG D ++ AI GY+ + E L+ V+ QP SVA DA
Sbjct: 210 KNYPYQGSDG---DXNKAKVRNHAVAICGYENLPAHNENMLKAAVAHQPASVATDAGGYA 266
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G F+G CG NH +TIVGYG E + YWLVKN W + G +R+ R
Sbjct: 267 FQLYSKGTFSGSCGKDLNHRMTIVGYG----EENGEKYWLVKNSWANDXGVSGYIRMKRD 322
Query: 292 -VGGSGLCNIAANAAYP 307
G C A A+YP
Sbjct: 323 PKDKDGTCGTAMEASYP 339
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 123/319 (38%), Positives = 167/319 (52%), Gaps = 36/319 (11%)
Query: 11 IAAKHEQWMVEFARTYKDQ-AEKEMRFKIFKKNHEF------------LRLNKFADLTRE 57
+ A ++QW + + + + AE E RF IFK N +F L LN FADLT E
Sbjct: 37 VMALYDQWRAKHGKLHNNLGAEPENRFHIFKDNLKFIDEINAQNLPYRLGLNVFADLTNE 96
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
++ + Y G K + + + L DSIDW +GAV PVKDQGS CW
Sbjct: 97 EYRSRYLGGKFASGSRRNRTSNRYLPRLGDD---LPDSIDWRAKGAVAPVKDQGSCGSCW 153
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASEC 174
AF+ VA+VE +N+I TG L+ S+ +LVDC S GC ++ AFE+I + L +E
Sbjct: 154 AFSTVASVEAINQIVTGDLIALSEQELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEE 213
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT---- 230
YPY G + S K AI GY+ V E+ LQ VS+Q VSV A
Sbjct: 214 DYPYYG-------FDSSCIQYKKNAIDGYEDVPVNNEKALQKAVSKQVVSVVSVAIEGGG 266
Query: 231 -WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y G+FTG CG +HGV +VGYG+ EG YW+V+N WG +W E G +++
Sbjct: 267 RSFQLYQSGIFTGRCGTDLDHGVNVVGYGS----EGGVDYWIVRNSWGGSWGESGYVKMQ 322
Query: 290 RGVGG-SGLCNIAANAAYP 307
R + +GLC IA +YP
Sbjct: 323 RNIASPTGLCGIAMEPSYP 341
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 178 bits (451), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 96/233 (41%), Positives = 133/233 (57%), Gaps = 15/233 (6%)
Query: 82 FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSK 140
F+ N S + +IDW +GAVTP+KDQG CCWAF+AVA EG+ KI TG+LV+ ++
Sbjct: 7 FRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAE 66
Query: 141 HQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKY 197
+LVDC + GC +++AF++I + L +E YPY D C S S
Sbjct: 67 QELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTA-ADGKC----KSGSNSA 121
Query: 198 GAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIV 255
I+GY+ V E L V+ QPVSVA+D F FY GGV TG CG +HG+ +
Sbjct: 122 ATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAI 181
Query: 256 GYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
GYG T++ YWL+KN WGT W E G +R+ + + G+C +A +YP
Sbjct: 182 GYGKTSDG---TKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYP 231
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 110/310 (35%), Positives = 160/310 (51%), Gaps = 42/310 (13%)
Query: 21 EFARTYKDQAEKEMRFKIFKKNHEFLR----------------LNKFADLTREKFLASYT 64
+++++Y+ +A + R F+ N EF+ +N+FADLT ++F+A Y
Sbjct: 4 DYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALYV 63
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
K NR+ + + S DS+DW +GAVTP+K+QG CW+F+ +
Sbjct: 64 PSK--------FNRTMPYNTVYLPATS-EDSVDWRTKGAVTPIKNQGQCGSCWSFSTTGS 114
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
EG + I TG LV+ S+ QLVDCS GC +++AF+YI + L +E YPY
Sbjct: 115 TEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPYTA 174
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGG 238
QD C+ + + I Y V E+ L V++ PVSVAI+A + F Y G
Sbjct: 175 -QDGTCN--KEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSG 231
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLC 298
VF G CG +HGV +VGY YW+VKN WGT W G + + RGV SG+C
Sbjct: 232 VFDGNCGTNLDHGVLVVGY--------TDDYWIVKNSWGTTWGVEGYINMKRGVSASGIC 283
Query: 299 NIAANAAYPL 308
IA +YP+
Sbjct: 284 GIAMQPSYPI 293
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 177 bits (450), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 123/330 (37%), Positives = 160/330 (48%), Gaps = 45/330 (13%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKN--------HE-----FLRLNKFADLTREKF 59
A +E+W + +D EK RF +FK+N H+ L LN+F+D+T E+F
Sbjct: 46 ALYERWCAHY-NMARDHGEKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSDMTDEEF 104
Query: 60 LAS-YTGYKPPP--------------TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAV 104
S Y G P N K+ ++DW R AV
Sbjct: 105 NRSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWRGR-AV 163
Query: 105 TPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAF 161
T VKDQG C CWAF+A+A VEG+N IRT LV S+ QLVDC LN GC + AF
Sbjct: 164 TRVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCDKLNHGCNGGLMTTAF 223
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
++ + + + E YPY GR+ C + Y GYQ V L + V+ Q
Sbjct: 224 SFVVRNRGVVPEGAYPYMGREG-RCKHVMAPPVTIY----GYQRVPRFDANALMNAVAAQ 278
Query: 222 PVSVAIDATWFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
PVSVAI+A+ F F Y GGVF G CG H T VGYG A+ P+W+VKN WG
Sbjct: 279 PVSVAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYG----ADAGGPFWIVKNSWGPG 334
Query: 280 WDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W EGG +RI R G+C I +YP+
Sbjct: 335 WGEGGYVRISRNTPVRQGVCGILTENSYPV 364
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 120/291 (41%), Positives = 157/291 (53%), Gaps = 29/291 (9%)
Query: 38 IFKKNHEF--------LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSK 89
I K N E+ + LN FA LT E+F A Y GY + P + R+ K+ S+
Sbjct: 62 IMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGYGGAEVEQPKTRRAG--KHERKSR 119
Query: 90 MSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST 148
S+DW E+GAV VK+QG+ CWAF+AVA +EG + + +G+L++ S+ QLVDCS
Sbjct: 120 SEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGELISLSEQQLVDCSK 179
Query: 149 L---NGCAKNFLENAFEYIRQYQRLA--SECVYPYQGRQDYYCDWWRSSASGKYGAIRGY 203
+GCA +++NAFEY SE YPY+G D C + SA G I GY
Sbjct: 180 KFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKG-MDGKCKF---SADGVRATISGY 235
Query: 204 QYVQPATEEGLQDVVSR-QPVSVAIDA-TWFNFYHGGVF---TGPCGNTPNHGVTIVGYG 258
V+ E L D V+ PVSVAI A FY GVF G C NHGVT VGYG
Sbjct: 236 NDVKQGNETDLLDAVANVGPVSVAIHAGAALQFYLRGVFNGVAGTCFGPLNHGVTAVGYG 295
Query: 259 TTTEAEGQQ-PYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
T + G++ YW++KN WG W E G +R R G LC +A A+YPL
Sbjct: 296 TASLRFGRKMDYWIIKNSWGMGWGEKGFVRFAR---GKNLCGVANGASYPL 343
>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
Length = 355
Score = 177 bits (449), Expect = 5e-42, Method: Compositional matrix adjust.
Identities = 117/328 (35%), Positives = 169/328 (51%), Gaps = 36/328 (10%)
Query: 2 SRTSHKTGN-IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRL 48
R + +T + + + E+W+V+ + Y EKE RF+IFK N F L L
Sbjct: 31 DRATRRTDDEVMSMFEEWLVKHDKVYNALGEKEKRFQIFKNNLRFIDERNSLNRTYKLGL 90
Query: 49 NKFADLTREKFLASY--TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
N FADLT ++ A Y T P D R+ + + + S+DW + GAVTP
Sbjct: 91 NVFADLTNAEYRAMYLRTWDDGPRLDLDTPPRNRYVPRVGDT---IPKSVDWRKEGAVTP 147
Query: 107 VKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFE 162
VK+QG+ C CWAFTAV VE L KI+TG L++ S+ ++VDC+T + GC +++ +
Sbjct: 148 VKNQGATCNSCWAFTAVGAVESLVKIKTGDLISLSEQEVVDCTTSSSRGCGGGDIQHGYI 207
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
YIR+ ++ E YPY+G + CD S+ I G+ +V EE L+ ++ QP
Sbjct: 208 YIRK-NGISLEKDYPYRGDEG-KCD---SNKKNAIVTIDGHGWVPTQLEEALKQGIANQP 262
Query: 223 VSVAI--DATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
V+V I D F +Y GVF G CG NH + +VGYG AE YW+ KN + W
Sbjct: 263 VAVPIPADDYEFQYYTSGVFKGKCGTELNHALLLVGYG----AEKDGDYWIAKNSYSDKW 318
Query: 281 DEGGSMRIFRGVGGSGLCNIAANAAYPL 308
E G +RI R + C YP+
Sbjct: 319 GENGYIRIQRKL---STCKFGNGGYYPI 343
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 177 bits (448), Expect = 6e-42, Method: Compositional matrix adjust.
Identities = 112/311 (36%), Positives = 161/311 (51%), Gaps = 36/311 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E WM++ R Y + EK RF+IFK N + L LN+F DLT ++F Y
Sbjct: 49 ESWMLKHDRVYNNIEEKIHRFEIFKDNLMYIDETNKKNNSYWLGLNEFVDLTHDEFKEKY 108
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYCCWAFTAVAT 123
G D +SN + + + +SIDW ++GAVTPVK CWAF+ VAT
Sbjct: 109 VG--SIGEDFVTIEQSNDEEFPYKHVVDYPESIDWRDKGAVTPVKPNPCGSCWAFSTVAT 166
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQ 182
VEG+NKI TG+L++ S+ +L+DC +GC + + +Y+ + +E YPY+ +Q
Sbjct: 167 VEGINKIVTGKLISLSEQELLDCDRRSHGCKGGYQTTSLQYVVD-NGVHTEKEYPYEKKQ 225
Query: 183 DYYCDWWRSSASGKYGA---IRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHG 237
+ A K G I GY+ V E L ++ QPVSV +++ F Y G
Sbjct: 226 G------KCRAKEKKGTKVQITGYKRVPANDEISLIQAIANQPVSVLLESKGRAFQLYKG 279
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-G 296
G+F GPCG +H VT +GYG T Y L+KN WG NW E G ++I R G S G
Sbjct: 280 GIFNGPCGTKLDHAVTAIGYGKT--------YILIKNSWGPNWGEKGYLKIKRASGKSEG 331
Query: 297 LCNIAANAAYP 307
C + ++ +P
Sbjct: 332 TCGVYKSSYFP 342
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 177 bits (448), Expect = 7e-42, Method: Compositional matrix adjust.
Identities = 116/312 (37%), Positives = 170/312 (54%), Gaps = 31/312 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIF-----------KKNHEF-LRLNKFADLTREKFLASY 63
+ WM++ + Y+ EK RF+IF KKN+ + L LN FADL+ ++F Y
Sbjct: 49 DSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKY 108
Query: 64 TGYKPPP-TDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
G+ T H + ++ +K++ + + SIDW +GAVTPVK+QG+ CWAF+
Sbjct: 109 VGFVAEDFTGLEHFDNEDFTYKHVTN----YPQSIDWRAKGAVTPVKNQGACGSCWAFST 164
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+ATVEG+NKI TG L+ S+ +LVDC + GC + + +Y+ + + VYPYQ
Sbjct: 165 IATVEGINKIVTGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYVAN-NGVHTSKVYPYQ 223
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
+Q Y C + G I GY+ V E ++ QP+SV ++A F Y
Sbjct: 224 AKQ-YKCR--ATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKS 280
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-G 296
GVF GPCG +H VT VGYGT+ +G+ Y ++KN WG NW E G MR+ R G S G
Sbjct: 281 GVFDGPCGTKLDHAVTAVGYGTS---DGKN-YIIIKNSWGPNWGEKGYMRLKRQSGNSQG 336
Query: 297 LCNIAANAAYPL 308
C + ++ YP
Sbjct: 337 TCGVYKSSYYPF 348
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 177 bits (448), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 116/336 (34%), Positives = 168/336 (50%), Gaps = 44/336 (13%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL---------------NKFADL 54
+A + ++W E R Y + E+ R +++ +N ++ + DL
Sbjct: 48 TMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGETAYTDL 107
Query: 55 TREKFLASYTGYKPPPTDHPHSNRSNWFKNL----------------NSSKMSFYDSIDW 98
T ++F A YT P P H + + + N S S+DW
Sbjct: 108 TADEFTAMYT--SPSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAGAPASVDW 165
Query: 99 NERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNF 156
+GAVT VK+QG CWAF+ VA VEG+++IRTG L++ S+ +LVDC TL+ GC
Sbjct: 166 RAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDTLDYGCDGGV 225
Query: 157 LENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQD 216
+A E+I +A+E YPY G+ D C + AI G+ V +E L +
Sbjct: 226 SYHALEWIASNGGIATEADYPYTGK-DGAC--VANKLPLHAAAISGFARVATRSEPSLAN 282
Query: 217 VVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274
V+ QPV+V+I+A NF H GV+ GPCG NHGVT+VGYG + YW+VKN
Sbjct: 283 AVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVVGYGEEEGDGEK--YWIVKN 340
Query: 275 RWGTNWDEGGSMRIFRGVGG--SGLCNIAANAAYPL 308
WG W +GG R+ + V G GLC IA ++PL
Sbjct: 341 SWGKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 176 bits (447), Expect = 8e-42, Method: Compositional matrix adjust.
Identities = 114/311 (36%), Positives = 162/311 (52%), Gaps = 31/311 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E W+ + ++ Y+ EK RF+IF N + +L LN+FADLT E+F +
Sbjct: 50 ESWLAKHSKIYESLDEKLHRFEIFMDNLKHIDDTNKKVSNYWLGLNEFADLTHEEFKNKF 109
Query: 64 TGYK-PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
G K P S +++ + S+DW ++GAV PVK+QG CWAF+ V
Sbjct: 110 LGLKGELPERKDESIEEFSYRDF----VDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTV 165
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N+I TG L S+ +L+DC T NGC ++ AF Y+ + L E YPY
Sbjct: 166 AAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMR-SGLHKEEEYPYI 224
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
+ CD + + + I GY V E+ ++ QP+SVAI+A+ F FY G
Sbjct: 225 MSEG-TCDEKKDVS--ETVTISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSG 281
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF G CG +HGV VGYGTT + Y +V+N WG W E G +R+ R G G
Sbjct: 282 GVFDGHCGTELDHGVAAVGYGTTKGLD----YVIVRNSWGPKWGEKGYIRMKRKTGKPHG 337
Query: 297 LCNIAANAAYP 307
+C + A+YP
Sbjct: 338 MCGLYMMASYP 348
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 114/319 (35%), Positives = 161/319 (50%), Gaps = 33/319 (10%)
Query: 8 TGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADL 54
T N++ E W E ++Y EK R +F N+EF L LN +ADL
Sbjct: 22 TSNVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADL 81
Query: 55 TREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
T +F S G+ P N S DS+DW ++GAVT VKDQGS
Sbjct: 82 THHEFKVSRLGFSPAL-----RNFRPVLPQEPSLPRDVPDSLDWRKKGAVTAVKDQGSCG 136
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLA 171
CW+F+A +EG+N+I TG L++ S+ +L+DC S +GC ++ A++++ +
Sbjct: 137 ACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGID 196
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG-LQDVVSRQPVSVAIDAT 230
+E YPYQ R D C + I GY + P+ +EG L V+ QPVSV I +
Sbjct: 197 TENDYPYQAR-DGSCR--KDKLQRNVVTIDGYADI-PSNDEGKLLQAVAAQPVSVGICGS 252
Query: 231 --WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
F Y G+F+GPC + +H V IVGYG+ E YW+VKN WG +W G M +
Sbjct: 253 ERAFQLYSKGIFSGPCSTSLDHAVLIVGYGS----ENGVDYWIVKNSWGKSWGMDGYMHM 308
Query: 289 FRGVGGS-GLCNIAANAAY 306
R G S G+C I A+Y
Sbjct: 309 QRNSGNSEGVCGINKLASY 327
>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
Length = 322
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 124/321 (38%), Positives = 166/321 (51%), Gaps = 54/321 (16%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTR 56
+I H+QWM +F+R Y+D++EKEMR ++FKKN +F+ +N+F D T
Sbjct: 33 SIVDYHQQWMTQFSRVYQDESEKEMRLQVFKKNLKFIENFNNMGNQSYTVGVNEFTDWTI 92
Query: 57 EKFLASYTGYKPPPTDHPHS-NRSNWFKNLNSSKMSFYD-SIDWNERGAVTPVKDQGSYC 114
E+FLA++TG + T N + +N N S + D S DW + GAV PVK QG+ C
Sbjct: 93 EEFLATHTGLRVNVTTLSELFNETMPSRNWNISDIDIDDESKDWRDEGAVIPVKVQGA-C 151
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLAS 172
GL KI L+T S+ QL+DC T GC +E AF+YI + ++
Sbjct: 152 -----------GLTKISGKNLLTLSEQQLIDCDTEKNTGCDGGGIEEAFKYIIKNGGVSL 200
Query: 173 ECVYPYQGRQDYYCDWWRSSA-SGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID--A 229
E YPYQ ++ C R++A S IRG++ V E L + V RQPVSV ID A
Sbjct: 201 ETEYPYQVKKG-SC---RANARSATQTQIRGFEMVPSHNERALLEAVRRQPVSVLIDARA 256
Query: 230 TWFNFYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
F Y GGV+ G CG NH VT VGYGT ++ W E G MRI
Sbjct: 257 DSFKTYKGGVYAGLDCGTDVNHAVTFVGYGTMIQS----------------WGENGYMRI 300
Query: 289 FRGVG-GSGLCNIAANAAYPL 308
R V G+C IA AAYP+
Sbjct: 301 RRDVEWPQGMCGIAQVAAYPI 321
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 110/303 (36%), Positives = 161/303 (53%), Gaps = 30/303 (9%)
Query: 29 QAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKFLASYTGYKPPPTD 72
+ ++ +R ++F+ N H F L L FADLT +++ G++
Sbjct: 111 EEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFRLGLTPFADLTLDEYRGRVLGFRARARR 170
Query: 73 HPHS-NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVATVEGLNK 129
+ ++ D+IDW + GAVT VKDQ C CWAF+AVA +EG+N
Sbjct: 171 SGARYGHGHGYRARPRGGDLLPDAIDWRQLGAVTEVKDQ-QQCGGCWAFSAVAAIEGINA 229
Query: 130 IRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDW 188
I TG LV+ S+ +++DC + GC +ENAF ++ + +E YP+ G D CD
Sbjct: 230 IATGNLVSLSEQEIIDCDAQDSGCDGGQMENAFRFVIGNGGIDTEADYPFIG-TDGTCDA 288
Query: 189 WRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGGVFTGPCGN 246
+ + + K I G V E LQ+ V+ QPVSVAIDA+ F Y G+F GPCG
Sbjct: 289 SKEN-NEKVATIDGLVEVASNNETALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGT 347
Query: 247 TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAA 305
+ +HGVT VGYG+ E + YW+VKN W +W E G +R+ R V +G C IA +A+
Sbjct: 348 SLDHGVTAVGYGS----ESGKDYWIVKNSWSASWGEAGYIRMRRNVPRPTGKCGIAMDAS 403
Query: 306 YPL 308
YP+
Sbjct: 404 YPV 406
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 121/311 (38%), Positives = 160/311 (51%), Gaps = 40/311 (12%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRL-------------NKFADLTREKFLASYT 64
+M ++++ Y AE RF FK N E +RL N+FADL+ E+F Y
Sbjct: 45 FMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
GYK + SN NL+ + SIDW AVTP+KDQG CWAF+A +
Sbjct: 104 GYKHVEREFARSN------NLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGS 157
Query: 124 VEGLNKIRTGQ-LVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+EG ++ L + S+ QLVDCST GC ++ AFEYI + + +E YPY+
Sbjct: 158 IEGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAESAYPYK 217
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVV-SRQPVSVAIDA--TWFNFYH 236
G C + K I GY+ V E L + V + PVSVAI+A F FY
Sbjct: 218 GVGGL-CQ----KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYS 272
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSG 296
GVF+G CG+ +HGV VGYGTT G Q YW+VKN WGT+W E G +R+ R
Sbjct: 273 SGVFSGTCGHNLDHGVLAVGYGTT----GSQDYWIVKNSWGTSWGESGYIRMIR---NKN 325
Query: 297 LCNIAANAAYP 307
C IA +YP
Sbjct: 326 QCGIAIQPSYP 336
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 176 bits (446), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 121/311 (38%), Positives = 160/311 (51%), Gaps = 40/311 (12%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRL-------------NKFADLTREKFLASYT 64
+M ++++ Y AE RF FK N E +RL N+FADL+ E+F Y
Sbjct: 45 FMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
GYK + SN NL+ + SIDW AVTP+KDQG CWAF+A +
Sbjct: 104 GYKHVEREFARSN------NLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGS 157
Query: 124 VEGLNKIRTGQ-LVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+EG ++ L + S+ QLVDCST GC ++ AFEYI + + +E YPY+
Sbjct: 158 IEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYK 217
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVV-SRQPVSVAIDA--TWFNFYH 236
G C + K I GY+ V E L + V + PVSVAI+A F FY
Sbjct: 218 GVGGL-CQ----KSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYS 272
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSG 296
GVF+G CG+ +HGV VGYGTT G Q YW+VKN WGT+W E G +R+ R
Sbjct: 273 SGVFSGTCGHNLDHGVLAVGYGTT----GSQDYWIVKNSWGTSWGESGYIRMIR---NKN 325
Query: 297 LCNIAANAAYP 307
C IA +YP
Sbjct: 326 QCGIAIQPSYP 336
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 111/269 (41%), Positives = 145/269 (53%), Gaps = 38/269 (14%)
Query: 46 LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVT 105
L +N+FADLT E+F S +K H S + FK N + + + DW ++GAVT
Sbjct: 7 LSINEFADLTNEEFGTSRNRFKA----HICSTEATSFKYENVTAVP--STXDWRKKGAVT 60
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAF 161
P+KDQG CWAF+AVA +EG+ ++ TG+L++ S+ +LVDC T GC
Sbjct: 61 PIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXG------- 113
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
YPY G D C+ R A+ I GY+ V E+ LQ V+ Q
Sbjct: 114 ------------ANYPYAGT-DGTCN--RKKAAHPAAKINGYEDVPANNEKALQKAVAHQ 158
Query: 222 PVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
P++VAIDA F FY GVFTG CG +HGV VGYGT+ + YWLVKN WGT
Sbjct: 159 PIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDDG---MKYWLVKNSWGTG 215
Query: 280 WDEGGSMRIFRGV-GGSGLCNIAANAAYP 307
W E G +R+ R V GLC IA A+YP
Sbjct: 216 WGEEGYIRMQRDVTAKEGLCGIAMQASYP 244
>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
Length = 221
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 99/220 (45%), Positives = 132/220 (60%), Gaps = 13/220 (5%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-G 151
DSIDW E GAV PVK+QG CWAF+ VA VEG+N+I TG L++ S+ QLVDC+T N G
Sbjct: 5 DSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTANHG 64
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AF++I + SE YPY+G QD C+ S+ + +I Y+ V E
Sbjct: 65 CRGGWMNPAFQFIVNNGGINSEETYPYRG-QDGICN---STVNAPVVSIDSYENVPSHNE 120
Query: 212 EGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ LQ V+ QPVSV +DA F Y G+FTG C + NH +T+VGYGT E + +
Sbjct: 121 QSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGT----ENDKDF 176
Query: 270 WLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
W+VKN WG NW E G +R R + G C I A+YP+
Sbjct: 177 WIVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216
>gi|310942958|pdb|3P5U|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942959|pdb|3P5V|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
gi|310942961|pdb|3P5X|A Chain A, Actinidin From Actinidia Arguta Planch (Sarusashi)
Length = 220
Score = 175 bits (444), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 100/221 (45%), Positives = 128/221 (57%), Gaps = 13/221 (5%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TL 149
D +DW GAV +KDQG WAF+ +A VEG+NKI TG L++ S+ +LVDC
Sbjct: 3 DYVDWRSSGAVVDIKDQGQCGSXWAFSTIAAVEGINKIATGDLISLSEQELVDCGRTQNT 62
Query: 150 NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPA 209
GC F+ + F++I + +E YPY + C+ KY +I Y+ V
Sbjct: 63 RGCDGGFMTDGFQFIINNGGINTEANYPYTAEEGQ-CN--LDLQQEKYVSIDTYENVPYN 119
Query: 210 TEEGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQ 267
E LQ V+ QPVSVA++A +NF H G+FTGPCG +H VTIVGYGT EG
Sbjct: 120 NEWALQTAVAYQPVSVALEAAGYNFQHYSSGIFTGPCGTAVDHAVTIVGYGT----EGGI 175
Query: 268 PYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
YW+VKN WGT W E G MRI R VGG G C IA A+YP+
Sbjct: 176 DYWIVKNSWGTTWGEEGYMRIQRNVGGVGQCGIAKKASYPV 216
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 109/310 (35%), Positives = 150/310 (48%), Gaps = 28/310 (9%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTG 65
W+ R Y E E RF ++ N HE+ L + +ADL+++++ + G
Sbjct: 43 WVQTLKRAYASAEEYERRFDVWLDNLRFVHEYNAGHTSHWLSMGVYADLSQDEYRSKALG 102
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYCCWAFTAVA 122
Y H R +DW +GAVTPVK+Q GS CWAF+
Sbjct: 103 YNA----DLHEERPLRAAPFLYEGTVPPKEVDWVAKGAVTPVKNQLLCGS--CWAFSTTG 156
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG + I TG+L + S+ LVDC NGC ++ AFE+I + + +E YPY
Sbjct: 157 AVEGASAIATGKLASLSEQMLVDCDRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTA 216
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
+ D + I YQ V P E L V+ QPVSVAI+A F Y GG
Sbjct: 217 EEGMCQD---NKMRRHVVTIDDYQDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGG 273
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLC 298
VF CG +HGV +VGYGT + PYWLVKN WG W + G +R+ R +G G C
Sbjct: 274 VFDAECGTALDHGVLVVGYGTASNGTHHLPYWLVKNSWGAEWGDKGYIRLLRNLGEEGQC 333
Query: 299 NIAANAAYPL 308
+A A++P+
Sbjct: 334 GVAMQASFPI 343
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 97/236 (41%), Positives = 134/236 (56%), Gaps = 15/236 (6%)
Query: 79 SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVT 137
S F+ N S + +IDW GAVTP+KDQG CCWAF+AVA EG+ KI TG+L++
Sbjct: 3 STGFRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLIS 62
Query: 138 RSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSAS 194
S+ +LVDC GC +++AF++I + L +E YPY D C +SA+
Sbjct: 63 LSEQELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYT-AADGKCKSGSNSAA 121
Query: 195 GKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGV 252
I+GY+ V E L V+ QPVSVA+D F FY GGV TG CG +HG+
Sbjct: 122 N----IKGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGI 177
Query: 253 TIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
+GYG T++ YWL+KN WGT W E G +R+ + + G+C +A +YP
Sbjct: 178 AAIGYGKTSDG---TKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYP 230
>gi|194741252|ref|XP_001953103.1| GF17600 [Drosophila ananassae]
gi|190626162|gb|EDV41686.1| GF17600 [Drosophila ananassae]
Length = 333
Score = 175 bits (443), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 117/321 (36%), Positives = 169/321 (52%), Gaps = 42/321 (13%)
Query: 15 HEQWM---VEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLT 55
E+WM +E+ + Y+D+ E+++RFKIF N L +NKFADL
Sbjct: 27 EEEWMAFKLEYNKVYQDETEEQLRFKIFNYNKLLIARHNLKWAAGKVSFNLAVNKFADLL 86
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F G P + S S + +N ++ D++DW + G VTPVKDQGS
Sbjct: 87 DHEFQDLMLGKMSPSGSNFGS--STFLPPVN---LTLPDAVDWRKYGFVTPVKDQGSCGS 141
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENAFEYIRQYQRLASE 173
CWAF+ ++EG + +TGQL++ S+ L+DCS NGC +E AF YI+ + + +E
Sbjct: 142 CWAFSTTGSLEGQHFRKTGQLISLSEQNLIDCSPGNNGCKNGAVEYAFRYIQSNKGIDTE 201
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDATW- 231
YPY+ Q+ C + R + G+ + P E L Q V + P+SV I+++
Sbjct: 202 ISYPYEAAQN-QCRFRRDTIGATS---TGFVKLNPGDEMELAQAVATVGPISVLINSSLD 257
Query: 232 -FNFYHGGVFTGPCGNTPN---HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F FYH GV+ P N PN H V +VGYGT +WLVKN W T+W E G ++
Sbjct: 258 SFKFYHDGVYNDPSCN-PNKLTHAVLVVGYGTDDRG---GDFWLVKNSWSTHWGEQGYVK 313
Query: 288 IFRGVGGSGLCNIAANAAYPL 308
I R + LC IA+NA YPL
Sbjct: 314 IKR--NANNLCGIASNALYPL 332
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 114/313 (36%), Positives = 160/313 (51%), Gaps = 35/313 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + ++Y+ EK RF++F+ N + +L LN+FADL+ E+F Y
Sbjct: 49 ESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKY 108
Query: 64 TGYK---PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
G K P D P +L S +DW ++GAV VK+QG+ CWAF+
Sbjct: 109 LGLKIELPKRRDSPEEFSYKDVADLPKS-------VDWRKKGAVAHVKNQGACGSCWAFS 161
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYP 177
VA VEG+N+I TG L S+ +L+DC NGC ++ AF +I L E YP
Sbjct: 162 TVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYP 221
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFY 235
Y ++ C + + I GY V E+ ++ QP+SVAI+A+ F FY
Sbjct: 222 YV-MEEGTCGEKKEEL--EVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFY 278
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG- 294
GG+F G CG +HGV VGYGT+ + Y VKN WG+ W E G +R+ R VG
Sbjct: 279 SGGIFNGHCGTELDHGVAAVGYGTSKGVD----YITVKNSWGSKWGEKGYIRMKRNVGKP 334
Query: 295 SGLCNIAANAAYP 307
G+C I A+YP
Sbjct: 335 EGICGIYKMASYP 347
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 174 bits (442), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 115/312 (36%), Positives = 169/312 (54%), Gaps = 31/312 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIF-----------KKNHEF-LRLNKFADLTREKFLASY 63
+ WM++ + Y+ EK RF+IF KKN+ + L LN FADL+ ++F Y
Sbjct: 49 DSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKY 108
Query: 64 TGYKPPP-TDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
G+ T H + ++ +K++ + + SIDW +GAVTPVK+QG+ CWAF+
Sbjct: 109 VGFVAEDFTGLEHFDNEDFTYKHVTN----YPQSIDWRAKGAVTPVKNQGACGSCWAFST 164
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+ATVEG+NKI TG L+ S+ +LVDC + GC + + +Y+ + + VYPYQ
Sbjct: 165 IATVEGINKIVTGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYVAN-NGVHTSKVYPYQ 223
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
+Q Y C + G I GY+ V E ++ QP+S ++A F Y
Sbjct: 224 AKQ-YKCR--ATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKS 280
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-G 296
GVF GPCG +H VT VGYGT+ +G+ Y ++KN WG NW E G MR+ R G S G
Sbjct: 281 GVFDGPCGTKLDHAVTAVGYGTS---DGKN-YIIIKNSWGPNWGEKGYMRLKRQSGNSQG 336
Query: 297 LCNIAANAAYPL 308
C + ++ YP
Sbjct: 337 TCGVYKSSYYPF 348
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 174 bits (442), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 115/326 (35%), Positives = 164/326 (50%), Gaps = 42/326 (12%)
Query: 9 GNIAAKHEQWMVEFARTYKDQAEK-EMRFKIFKKNHEF------------LRLNKFADLT 55
GN A W+ + YKD E+ E +F ++ N EF L L FADLT
Sbjct: 42 GNPRAAFSDWVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHNEKDSTFKLGLTNFADLT 101
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD-----SIDWNERGAVTPVKDQ 110
+++ GY+P + S+ + D SIDW ++GAVT VK+Q
Sbjct: 102 HDEYRQHALGYRP-------ELKGTGLGTGKSTGFQYADYEAPPSIDWRKKGAVTDVKNQ 154
Query: 111 ---GSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIR 165
GS CWAF+ +VEG N I +G+LV+ S+ +LVDC +GC ++ AF +I
Sbjct: 155 QQCGS--CWAFSTTGSVEGANAIYSGELVSLSEQELVDCDVTQDHGCHGGLMDFAFSFII 212
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ + +E Y Y+ QD C+ + I Y+ V P E L+ + QP+SV
Sbjct: 213 RNGGIDTEKDYKYKA-QDGVCNIAKEKR--HVVTIDSYEDVPPNDESALKKAAANQPISV 269
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
AI+A F Y GGVF PCG +HGV +VGYG+ + YW+VKN WG W +
Sbjct: 270 AIEADQREFQLYAGGVFDAPCGTALDHGVLVVGYGSDNGTD----YWIVKNSWGDFWGDS 325
Query: 284 GSMRIFRGVGGS-GLCNIAANAAYPL 308
G +R+ RG+ S G C IA A+YP+
Sbjct: 326 GYIRLARGISNSAGQCGIAMQASYPI 351
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 174 bits (441), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 131/220 (59%), Gaps = 14/220 (6%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NG 151
S+DW ++GAVT VKDQG CWAF+ +A VEG+N I+T L + S+ QLVDC T G
Sbjct: 46 SVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAG 105
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AF+YI ++ +A+E YPY+ RQ C + I GY+ V E
Sbjct: 106 CNGGLMDYAFQYIAKHGGVAAEDAYPYRARQ-ASC----KKSPAPVVTIDGYEDVPANDE 160
Query: 212 EGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
L+ V+ QPVSVAI+A + F FY GVF+G CG +HGV VGYG T A+G + Y
Sbjct: 161 SALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVT--ADGTK-Y 217
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYPL 308
WLVKN WG W E G +R+ R V G C IA A+YP+
Sbjct: 218 WLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPV 257
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 113/329 (34%), Positives = 166/329 (50%), Gaps = 41/329 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------------------- 45
I A+ + W E + Y E+ R +F N F
Sbjct: 32 IEAQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPPSY 91
Query: 46 -LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAV 104
L LN FADLT E+F A+ G + P S + + L + D++DW + GAV
Sbjct: 92 TLALNAFADLTHEEFRAARLG-RIAPGAALRSRAAPVYWGLGGGA-AVPDALDWRKSGAV 149
Query: 105 TPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAF 161
T VKDQGS CW+F+A +EG+NKI+TG LV+ S+ +L+DC S +GC ++ A+
Sbjct: 150 TKVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAY 209
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
+++ + + +E YPY+ D C+ ++ + I GY V E+ L V++Q
Sbjct: 210 KFVIKNGGIDTEEDYPYR-EADGTCN--KNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQ 266
Query: 222 PVSVAI--DATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
PVSV I A F Y+ G+F GPC + +H V IVGYG+ EG + YW+VKN WG +
Sbjct: 267 PVSVGICGSARAFQLYYQGIFDGPCPTSLDHAVLIVGYGS----EGGKDYWIVKNSWGES 322
Query: 280 WDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
W G M + R G S G+C I A++P
Sbjct: 323 WGMKGYMHMHRNTGDSKGVCGINMMASFP 351
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 111/322 (34%), Positives = 162/322 (50%), Gaps = 35/322 (10%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF---------------------LRLNKF 51
A + W E + Y E+ R +F N F L LN F
Sbjct: 39 ALFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAF 98
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG 111
ADLT E+F A+ G S + ++ L+ + D++DW E GAVT VKDQG
Sbjct: 99 ADLTHEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQG 158
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQ 168
S CW+F+A +EG+NKI+TG LV+ S+ +L+DC S +GC ++ A++++ +
Sbjct: 159 SCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNG 218
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI- 227
+ +E YPY+ D C+ ++ + I GY V E+ L V++QPVSV I
Sbjct: 219 GIDTEEDYPYR-EADGTCN--KNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGIC 275
Query: 228 -DATWFNFY-HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
A F Y G+F GPC + +H V IVGYG+ EG + YW+VKN WG +W G
Sbjct: 276 GSARAFQLYSQQGIFDGPCPTSLDHAVLIVGYGS----EGGKDYWIVKNSWGESWGMKGY 331
Query: 286 MRIFRGVGGS-GLCNIAANAAY 306
M + R G S G+C I A++
Sbjct: 332 MHMHRNTGDSKGVCGINMMASF 353
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 174 bits (441), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 116/308 (37%), Positives = 154/308 (50%), Gaps = 32/308 (10%)
Query: 24 RTYKDQAEK-EMRFKIFKKN----HEF--------LRLNKFADLTREKFLASYTGYKPPP 70
R Y AE E RF I+ N HE+ L + +ADL+++++ + GY
Sbjct: 59 RAYASSAEVYERRFNIWLDNLRFAHEYNARHTSHWLSMGVYADLSQDEYRSKALGYNA-- 116
Query: 71 TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYCCWAFTAVATVEGL 127
H H R + +DW GAVTPVKDQ GS CWAF+ VEG
Sbjct: 117 --HLHKKRPLRAAPFLYKGTVPPEEVDWVAGGAVTPVKDQLLCGS--CWAFSTTGAVEGA 172
Query: 128 NKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYY 185
N I TG+LV+ S+ LVDC GC F+++AF++I + +E YPY+ +D
Sbjct: 173 NAIATGKLVSLSEQMLVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRA-EDGI 231
Query: 186 CDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGP 243
C R+ I GYQ V P E L V+ QPVSVAI+A F Y GGVF
Sbjct: 232 CQDNRTRR--HVVTIDGYQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAE 289
Query: 244 CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS---GLCNI 300
CG +H V +VGYGT + PYWLVKN WG W E G +R+ R +G G C +
Sbjct: 290 CGTALDHAVLVVGYGTASNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGL 349
Query: 301 AANAAYPL 308
A A++P+
Sbjct: 350 AMYASFPI 357
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 174 bits (440), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 115/313 (36%), Positives = 166/313 (53%), Gaps = 39/313 (12%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASYT 64
QW + + Y E+ +R+ I+K N L++N+F D+T +F A +
Sbjct: 29 QWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFILKMNQFGDMTNSEFKA-FN 87
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
GY H H N S + L + D++DW G VTPVKDQG CWAF+ +
Sbjct: 88 GY----LSHKHVNGSTF---LTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGS 140
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
+EG + +TG+LV+ S+ LVDCST NGC ++NAF YI++ + + SE YPY
Sbjct: 141 LEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCDGGLMDNAFTYIKENKGIDSEASYPYTA 200
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQD-VVSRQPVSVAIDATW--FNFYHG 237
+D C + +SS + G+ + E L++ V S P+SVAIDA+ F FY
Sbjct: 201 -EDGKCVFKKSSVA---ATDTGFVDIPEGNENKLKEAVASVGPISVAIDASHESFQFYSS 256
Query: 238 GVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
GV+ P C +T +HGV +VGYGT E + YWLVKN W T+W + G +++ R
Sbjct: 257 GVYNEPSCSSTELDHGVLVVGYGT----ESGKDYWLVKNSWNTSWGDKGYIKMRRNAKNQ 312
Query: 296 GLCNIAANAAYPL 308
C IA A+YPL
Sbjct: 313 --CGIATKASYPL 323
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 112/299 (37%), Positives = 156/299 (52%), Gaps = 36/299 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E WM++ + YK EK RF+ FK N + L LN+FADLT ++F Y
Sbjct: 49 ESWMLKHDKVYKTIDEKIYRFETFKDNLMYIDETNKKNNSYWLGLNEFADLTHDEFKEKY 108
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G P D +S+ + N + + +SIDW ++GAVTPVK+Q CWAF+ VA
Sbjct: 109 VG--SIPEDSMIIEQSDDVEFPNKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWAFSTVA 166
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
TVEG+NKI TG L++ S+ +L+DC + GC + + +Y+ + +E YPY+ +
Sbjct: 167 TVEGINKIVTGNLISLSEQELLDCDRRSHGCKGGYQTTSLKYVVD-NGVHTEKEYPYEKK 225
Query: 182 QDYYCDWWRSSASGKYGA---IRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
Q A K G I GY+ V E L +S QPVSV +++ F FY
Sbjct: 226 QG------NCRAKNKKGLKVYINGYKRVPSNDEISLIKTISIQPVSVLVESKGRPFQFYK 279
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
GGVF GPCG +H VT VGYG + Y L+KN WG W + G ++I R G S
Sbjct: 280 GGVFGGPCGTKLDHAVTAVGYG--------KDYILIKNSWGPKWGDKGYIKIKRASGQS 330
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 173 bits (439), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 112/321 (34%), Positives = 162/321 (50%), Gaps = 32/321 (9%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR---------------LNKFADL 54
+I +QW + Y+ AE E R++ FK+N +++ LNKFADL
Sbjct: 45 SIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAGKKTAALGHSVGLNKFADL 104
Query: 55 TREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
+ E+F Y P + S +W + N S+DW ++G VT VKDQG
Sbjct: 105 SNEEFKELYLSKVKKPINIKRSTARDW-RQRNLQTCDAPSSLDWRKKGVVTAVKDQGDCG 163
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLAS 172
CW+F+ +EG+N I TG L++ S+ +LVDC T N GC +++ AFE++ + +
Sbjct: 164 SCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVINNGGIDT 223
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY G D C+ + K +I GY V T+ L +QP+SV +D +
Sbjct: 224 EANYPYTG-VDGTCNTTKEEI--KVVSIDGYTDVD-ETDSALLCATVQQPISVGMDGSAL 279
Query: 233 NF--YHGGVFTGPCGNTPN---HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
+F Y GG++ G C + PN H V IVGYG+ E + YW+VKN WGT W G
Sbjct: 280 DFQLYTGGIYDGDCSDDPNDIDHAVLIVGYGS----ENGEDYWIVKNSWGTEWGMEGYFY 335
Query: 288 IFRGVGGS-GLCNIAANAAYP 307
I R G+C I A A+YP
Sbjct: 336 IKRNTDLPYGVCAINAEASYP 356
>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
Length = 214
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 99/219 (45%), Positives = 136/219 (62%), Gaps = 13/219 (5%)
Query: 95 SIDWNERGAVTPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--N 150
S+DW ++G VT +KDQG C CWAF+A+A VEGL + TG LV+ S+ +LVDC T
Sbjct: 1 SVDWRKKGGVTEIKDQGD-CGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQ 59
Query: 151 GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT 210
GC ++ AF+Y+ + + S+ YPY+ ++ CD + I G+Q + P +
Sbjct: 60 GCDGGMMDYAFQYMIRNGGITSQSNYPYRAQRGA-CD--KDKVKYHAATINGFQAIPPQS 116
Query: 211 EEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQP 268
EE L V+ QPVSVAI+A F Y GVFTG CG+ +HGV IVGYGT +A G+Q
Sbjct: 117 EELLLRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGT--DAGGRQ- 173
Query: 269 YWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
YWLVKN WG+ W E G +R+ R G+G+C I +A+YP
Sbjct: 174 YWLVKNSWGSGWGESGYVRMERQGPGAGVCGINLDASYP 212
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 109/323 (33%), Positives = 161/323 (49%), Gaps = 39/323 (12%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------------LRLNKFAD 53
A+ E W E + Y E+ R F +N F L LN FAD
Sbjct: 37 AQFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFAD 96
Query: 54 LTREKFLASYTG---YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
LT ++F A+ G P P P + + + + D++DW + GAVT VKDQ
Sbjct: 97 LTHDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVP----DALDWRQSGAVTKVKDQ 152
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQY 167
GS CW+F+A +EG+NKI TG L++ S+ +L+DC S GC + A++++ +
Sbjct: 153 GSCGACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKN 212
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
+ +E YP++ D C+ ++ I GY+ V + E+ L V++QP+SV I
Sbjct: 213 GGIDTEDDYPFR-EADGTCN--KNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGI 269
Query: 228 --DATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
A F Y G+F GPC + +H V IVGYG+ EG + YW+VKN WG W G
Sbjct: 270 CGSARAFQLYSQGIFDGPCPTSLDHAVLIVGYGS----EGGKDYWIVKNSWGERWGMKGY 325
Query: 286 MRIFRGVG-GSGLCNIAANAAYP 307
M + R G SG+C I A++P
Sbjct: 326 MHMHRNTGSSSGICGINMMASFP 348
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 112/308 (36%), Positives = 162/308 (52%), Gaps = 29/308 (9%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTG 65
WM+ + Y++ EK RF+IFK N + L LN+FADL+ ++F Y G
Sbjct: 51 WMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYRLGLNEFADLSNDEFNEKYVG 110
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
T + + +N ++ +++DW ++GAVTPV+ QGS CWAF+AVATV
Sbjct: 111 SLIDATIEQSYDE----EFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATV 166
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
EG+NKIRTG+LV S+ +LVDC + GC + A EY+ + + YPY+ +Q
Sbjct: 167 EGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAK-NGIHLRSKYPYKAKQG 225
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFT 241
G G VQP E L + +++QPVSV +++ F Y GG+F
Sbjct: 226 ---TCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFE 282
Query: 242 GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNI 300
GPCG +H VT V G + Y L+KN WGT W E G +RI R G S G+C +
Sbjct: 283 GPCGTKVDHAVTAV----GYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 338
Query: 301 AANAAYPL 308
++ YP+
Sbjct: 339 YKSSYYPI 346
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 173 bits (439), Expect = 8e-41, Method: Compositional matrix adjust.
Identities = 112/314 (35%), Positives = 172/314 (54%), Gaps = 29/314 (9%)
Query: 13 AKH-EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR--------LNKFADLTREKFLASY 63
AKH + ++ E ++ + E R KI K N ++ R +N+F D+ +F+++
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G+K D P S + + N S ++DW +GAVTPVK+QG CWAF+A
Sbjct: 92 NGFKRNYKDQPREG-STYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATG 150
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
++EG + ++G +V+ S+ LVDCST NGC ++NAF+YIR + + +E YPY
Sbjct: 151 SLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPYN 210
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW--FNFYH 236
G D C + +S+ G+ ++ +E L+ V+ P+SVAIDA+ F FY
Sbjct: 211 G-TDGTCHFKKSTVG---ATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYS 266
Query: 237 GGVFTGP-CGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
GV+ P C + + +HGV +VGYGT + YWLVKN WGT W + G +R+ R
Sbjct: 267 DGVYDEPECDSESLDHGVLVVGYGTLNGTD----YWLVKNSWGTTWGDEGYIRMSR--NK 320
Query: 295 SGLCNIAANAAYPL 308
C IA++A+YPL
Sbjct: 321 KNQCGIASSASYPL 334
>gi|326503122|dbj|BAJ99186.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326512552|dbj|BAJ99631.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 389
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 120/335 (35%), Positives = 172/335 (51%), Gaps = 46/335 (13%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-LNK---------------FADL 54
+ A+ WM R+Y +EK RFK+++ N ++ LN F DL
Sbjct: 56 MMARFHVWMTVQNRSYPTSSEKAHRFKVYRSNMRYIEALNAEATTSGFTYELGEGPFTDL 115
Query: 55 TREKFLASYTGYKPPPTDH------------PHSNRSNWFKNLNS-SKMSFYDSI--DWN 99
T E+F++ YTG K P DH H+ N + + + S I DW
Sbjct: 116 TDEEFISLYTG-KIPDDDHREDGVHDEQIITTHAGSVNGAEGVTVYANFSAGAPIRMDWR 174
Query: 100 ERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFL 157
+RGAVTPVKDQG CWAF VAT+EG++KI+ G+LV+ S+ QLVDC L+ GC +
Sbjct: 175 KRGAVTPVKDQGKCGSCWAFPTVATIEGIHKIKRGRLVSLSEQQLVDCDFLDGGCNGGWP 234
Query: 158 ENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDV 217
NAF++I Q + + Y Y+ + C R A+ I GY+ V+ +E + ++
Sbjct: 235 RNAFQWIIQNGGITTTSSYTYKAAEG-QCKGNRKPAA----KITGYRKVKSNSEVSMVNI 289
Query: 218 VSRQPV--SVAIDATWFNFYHGGVFTGPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKN 274
V+ QP+ S+ + F Y GG++ GPC + NH +TIVGYG +A G + YW+VKN
Sbjct: 290 VANQPIAASIVVHGGQFQHYKGGIYNGPCATSKLNHVITIVGYG--QQAYGAK-YWIVKN 346
Query: 275 RWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
WG W G M + RG G C IA +PL
Sbjct: 347 SWGAAWGNKGYMLMKRGTKNPLGQCGIAVRPIFPL 381
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 114/309 (36%), Positives = 163/309 (52%), Gaps = 48/309 (15%)
Query: 31 EKEMRFKIFKKN---------------HEF-LRLNKFADLTREKFLASYTGYKPPPTDHP 74
E+E R++ F+ N H F L LN+FA LT E++ A+Y G +
Sbjct: 57 EEEGRYEAFRDNLRYIDEHNAAADAGIHSFRLGLNRFAGLTNEEYRAAYLGLRL------ 110
Query: 75 HSNRSNWFKNLNSSKMSFY--------DSIDWNERGAVTPVKDQGSYC--CWAFTAVATV 124
RS +L + +S+DW E+GAV VKDQG C WAF+A+A V
Sbjct: 111 ---RSGAVGDLRKPSARYEAADGEALPESVDWREKGAVGKVKDQGRSCGSAWAFSAIAAV 167
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQ 182
E +N+I TG+L++ S+ +L+DC T GC +++AFE+I + ++ YPY+ R
Sbjct: 168 ESINQIVTGELISLSEQELMDCDTSYNAGCDGGLMDDAFEFIISNGGIDTDEDYPYKARN 227
Query: 183 DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVF 240
D CD + + K I Y+ ++ E+ LQ VS QPVSVAI+A F Y G+F
Sbjct: 228 D-SCDANKRNR--KAVTIDDYEDLR-MNEKSLQKAVSNQPVSVAIEAGGRDFQLYKSGIF 283
Query: 241 TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCN 299
TG CG +H TIVGYG+ E YW+VK +GT+W E G R+ R + SG C
Sbjct: 284 TGTCGTDLDHATTIVGYGS----ENGTDYWIVKESYGTSWGESGYARMERNIKETSGKCG 339
Query: 300 IAANAAYPL 308
IA +YP+
Sbjct: 340 IAMLPSYPV 348
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 118/321 (36%), Positives = 171/321 (53%), Gaps = 44/321 (13%)
Query: 16 EQWMVEFA---RTYKDQAEKEMRFKIFKKNHEFLR----------------LNKFADLTR 56
E+W V A +TYK+Q E+ R KIF N + + +N F DL
Sbjct: 25 EEWHVFKAMHGKTYKNQFEEMFRMKIFMDNKKKIEAHNAKYEQGEVSYKMMMNHFGDLMV 84
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+F A G+K P D + + N N K ++DW ++GAVTPVKDQG C
Sbjct: 85 HEFKALMNGFKMSP-DTKRNGELYFPSNSNLPK-----TVDWRQKGAVTPVKDQGQCGSC 138
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
W+F+A ++EG ++TG+LV+ S+ LVDCST NGC ++ AF+Y+ + + +
Sbjct: 139 WSFSATGSLEGQVFLKTGKLVSLSEQNLVDCSTSYGNNGCEGGLMDQAFQYVSDNKGIDT 198
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW 231
E YPY+ R++ C + ++ G +G+ + E+ LQ+ ++ P+SVAIDA
Sbjct: 199 EASYPYEAREN-TCRFKKNKVG---GTDKGHVDIPAGDEKALQNALATVGPISVAIDANH 254
Query: 232 --FNFYHGGVFTGP-CGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F FY GV+ P C + +HGV VGYGT E Q YWLVKN WG +W E G ++
Sbjct: 255 GSFQFYSKGVYNEPNCSSYDLDHGVLAVGYGT----ENGQDYWLVKNSWGPSWGENGYIK 310
Query: 288 IFRGVGGSGLCNIAANAAYPL 308
I R S C IA+ A+YPL
Sbjct: 311 IAR--NHSNHCGIASMASYPL 329
>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
Length = 245
Score = 173 bits (438), Expect = 9e-41, Method: Compositional matrix adjust.
Identities = 99/222 (44%), Positives = 134/222 (60%), Gaps = 14/222 (6%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--N 150
+S+DW E GAV PVKDQ S CWAF+ VA VEG+N+I TG+L++ S+ +LVDC T
Sbjct: 8 ESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDM 67
Query: 151 GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT 210
GC ++ AF++I + L +E YPY G D C+ S S K +I GY+ V P
Sbjct: 68 GCNGGLMDYAFDFIIKNGGLDTEKDYPYTGF-DGECNL--SGKSSKVVSIDGYEDVPPFD 124
Query: 211 EEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQP 268
E+ LQ V+ QPVSVA++A Y G+FTG CG +HG+ VGYGT E
Sbjct: 125 EKALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGT----ENGTD 180
Query: 269 YWLVKNRWGTNWDEGGSMRIFRGVGG--SGLCNIAANAAYPL 308
YW+V+N WG++W E G +R+ R + SG C IA A+YP+
Sbjct: 181 YWIVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPI 222
>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 360
Score = 173 bits (438), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 160/318 (50%), Gaps = 33/318 (10%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRL-------------NKFADLTREKFLASYT 64
W ++Y+ E+ RF++++ N E++ N+FADLTRE+F+A +T
Sbjct: 45 WQATHNQSYRSAEERLRRFQVYRDNVEYIETTNRRGDLTYQLGENQFADLTREEFIARFT 104
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDS-----------IDWNERGAVTPVKDQGSY 113
Y + + + S +DW +GAV P K Q S
Sbjct: 105 SYNGDDDRTGDDDSVITTAAVGGGDPDLWSSGGDDVSLDPPSVDWRAKGAVVPPKSQSSS 164
Query: 114 CC--WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNG-CAKNFLENAFEYIRQYQRL 170
C WAF AVAT+E L+ I+TG+LV S+ QLVDC +G C + AF ++ Q L
Sbjct: 165 CSSSWAFVAVATIESLHAIKTGKLVALSEQQLVDCDQYDGGCNRGTFRRAFHWVIQNGGL 224
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID-A 229
+E YPY Q C+ +S AI G+ V + E ++ V+ QPV+ AI+
Sbjct: 225 TTEAEYPYTAAQGT-CNSAKSDH--HVAAISGHASVPGSNELAMKHAVATQPVAAAIELG 281
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
+ FY GV++GPCG H VT+VGYG E+ G + YW+VKN WG W E G +R+
Sbjct: 282 SDMQFYKSGVYSGPCGARLEHAVTVVGYGAD-ESTGDK-YWIVKNSWGQTWGERGYIRMQ 339
Query: 290 RGVGGSGLCNIAANAAYP 307
R + G GLC I + AYP
Sbjct: 340 RKILGPGLCGIMLDVAYP 357
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 161/307 (52%), Gaps = 29/307 (9%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTG 65
WM+ + Y++ EK RF+IFK N + L LN+FADL+ ++F Y G
Sbjct: 51 WMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKYVG 110
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
T + + +N ++ +++DW ++GAVTPV+ QGS CWAF+AVATV
Sbjct: 111 SLIDATIEQSYDE----EFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATV 166
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
EG+NKIRTG+LV S+ +LVDC + GC + A EY+ + + YPY+ +Q
Sbjct: 167 EGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAK-NGIHLRSKYPYKAKQG 225
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFT 241
G G VQP E L + +++QPVSV +++ F Y GG+F
Sbjct: 226 ---TCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFE 282
Query: 242 GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNI 300
GPCG +H VT V G + Y L+KN WGT W E G +RI R G S G+C +
Sbjct: 283 GPCGTKVDHAVTAV----GYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 338
Query: 301 AANAAYP 307
++ YP
Sbjct: 339 YKSSYYP 345
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 172 bits (437), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 160/306 (52%), Gaps = 27/306 (8%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFL 60
A+ E W E R+Y E+ R F N F L LN FADLT ++F
Sbjct: 36 AQFEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEFR 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A+ G P + + ++ + D++DW + GAVT VKDQGS CW+F+
Sbjct: 96 AARLGRLA--AAGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFS 153
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYP 177
A +EG+NKI+TG L++ S+ +L+DC S +GC ++ A++++ + + +E YP
Sbjct: 154 ATGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYP 213
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI--DATWFNFY 235
Y+ D C+ ++ + I GY+ V E+ L V++QPVSV I A F Y
Sbjct: 214 YR-ETDGTCN--KNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLY 270
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
G+F GPC + +H + IVGYG+ EG + YW+VKN WG +W G M + R G S
Sbjct: 271 SKGIFDGPCPTSLDHAILIVGYGS----EGGKDYWIVKNSWGESWGMKGYMYMHRNTGNS 326
Query: 296 -GLCNI 300
G+C I
Sbjct: 327 NGVCGI 332
>gi|283898066|emb|CBI99501.1| cysteine peptidase precursor [Bromelia hieronymi]
Length = 230
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 91/216 (42%), Positives = 133/216 (61%), Gaps = 10/216 (4%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCA 153
SIDW + GAVT VK+QG CW+F+A+ATVEG+ KI+TG LV+ S+ +++DC+ +GC
Sbjct: 5 SIDWRDYGAVTSVKNQGRCGSCWSFSAIATVEGIYKIKTGNLVSLSEQEVLDCAVSHGCK 64
Query: 154 KNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG 213
+++ A+ +I + S YPY+G Q C S + Y I GY+YVQ E
Sbjct: 65 GGWVDKAYNFIISNNGVTSAAYYPYKGYQG-TCG-ANSVPNAAY--ITGYKYVQRNNERS 120
Query: 214 LQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWL 271
+ +S QP++ IDA+ F +Y GGV++GPCG + NH +T++GYG + YW+
Sbjct: 121 MMYALSNQPIAALIDASGKNFQYYKGGVYSGPCGTSLNHAITVIGYGQDSSG---IKYWI 177
Query: 272 VKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
VKN WGT+W E G +R+ R V SG+C IA +P
Sbjct: 178 VKNSWGTSWGERGYIRMARDVSSSGICGIAMAPLFP 213
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 106/306 (34%), Positives = 160/306 (52%), Gaps = 26/306 (8%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFL 60
A+ E W E R+Y E+ R F N F L LN FADLT ++F
Sbjct: 36 AQFEAWCAEHGRSYATPGERAARLAAFADNAAFVAAHNGAPASYALALNAFADLTHDEFR 95
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A+ + P + + ++ + D++DW + GAVT VKDQGS CW+F+
Sbjct: 96 AARL-GRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFS 154
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECVYP 177
A +EG+NKI+TG L++ S+ +L+DC S +GC ++ A++++ + + +E YP
Sbjct: 155 ATGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYP 214
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI--DATWFNFY 235
Y+ D C+ ++ + I GY+ V E+ L V++QPVSV I A F Y
Sbjct: 215 YR-ETDGTCN--KNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLY 271
Query: 236 HGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
G+F GPC + +H + IVGYG+ EG + YW+VKN WG +W G M + R G S
Sbjct: 272 SKGIFDGPCPTSLDHAILIVGYGS----EGGKDYWIVKNSWGESWGMKGYMYMHRNTGNS 327
Query: 296 -GLCNI 300
G+C I
Sbjct: 328 NGVCGI 333
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 172 bits (436), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 117/315 (37%), Positives = 169/315 (53%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIF-----------KKNHEF-LRLNKFADLTREKFLASY 63
+ WM++ + Y+ EK RF+IF KKN+ + L LN FADL+ ++F Y
Sbjct: 49 DSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKY 108
Query: 64 TGYKPPP-TDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
G T H + ++ +K++ + + SIDW +GAVTPVK+QGS CWAF+
Sbjct: 109 VGSVAEDFTGLEHFDNEDFTYKHVTN----YPQSIDWRAKGAVTPVKNQGSCGSCWAFST 164
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCS-TLNGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+ATVEG+NKI TG L+ S+ +LVDC +GC + + +Y+ S+ VYPYQ
Sbjct: 165 IATVEGVNKIVTGNLLELSEQELVDCDKNSHGCKGGYQTTSLQYVADNGVHTSK-VYPYQ 223
Query: 180 GRQDYYCDWWRSSASGKYG---AIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNF 234
+ + A+ K G I GY+ V E ++ QP+SV ++A F
Sbjct: 224 AKA------MQCRATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSVLVEAGGKPFQL 277
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GVF GPCG +H VT VGYGT ++G+ Y ++KN WG NW E G MR+ R G
Sbjct: 278 YKSGVFDGPCGTKLDHAVTAVGYGT---SDGKN-YIIIKNSWGPNWGEKGYMRLKRQSGN 333
Query: 295 S-GLCNIAANAAYPL 308
S G C + ++ YP
Sbjct: 334 SQGTCGVYKSSYYPF 348
>gi|326493706|dbj|BAJ85314.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 169/336 (50%), Gaps = 55/336 (16%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN---------------------------H 43
+ + +WM+++++ Y + E+EMRF++FK N H
Sbjct: 44 VRERFSKWMIKYSKHYSCKQEEEMRFQVFKNNTNSIGQLDRQNPNPGVGGALGPSGSQVH 103
Query: 44 EF--LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD----SID 97
F + +N+F DL+ + + YTG + F+ + + + ++ +D
Sbjct: 104 TFQKVSMNRFGDLSPREVIQQYTGLN-----------TTSFRTASPTYLPYHSFKPCCVD 152
Query: 98 WNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKN 155
W GAVT VK QG+ CWAF AVA +EG+NKIRTG+LV+ S+ LVDC T++ GC
Sbjct: 153 WRSSGAVTGVKHQGTCGSCWAFAAVAAIEGMNKIRTGELVSLSEQVLVDCDTVSTGCGGG 212
Query: 156 FLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQ 215
++A + + SE YPY G Q CD + + +I+G++ V E L
Sbjct: 213 HSDSAMALVAARGGITSEERYPYAGFQG-KCDVDKLMFDHQ-ASIKGFKAVPSNNEAQLA 270
Query: 216 DVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQ-QPYWLV 272
V+ QPV+V IDA + F FY GG++ GPC NH VTIVGY E G+ YW+
Sbjct: 271 IAVAMQPVTVYIDASGSAFQFYSGGIYRGPCSANVNHAVTIVGY---CEGPGEGNKYWIA 327
Query: 273 KNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYP 307
KN W +W E G + + + V +G C +A + YP
Sbjct: 328 KNSWSNDWGEQGYVYLAKDVAWSTGTCGLATSPFYP 363
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/312 (37%), Positives = 163/312 (52%), Gaps = 38/312 (12%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASYTG 65
W + + Y ++E+ +R+ I+K N LR+N F D+T +F A G
Sbjct: 30 WKMAHNKAYSHESEENVRYAIWKDNMNRITEYNSKSKNVILRMNHFGDMTNTEFRAKMNG 89
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
H H N S + L S + D++DW G VTPVK+QG CWAF++ +
Sbjct: 90 LLL----HKHQNGSTF---LVPSHTAAPDAVDWRSEGYVTPVKNQGQCGSCWAFSSTGAL 142
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
EG + +TG+LV+ S+ LVDCST NGC ++NAF YI+ + +E YPY+G
Sbjct: 143 EGQHFKKTGRLVSLSEQNLVDCSTDYGNNGCNGGLMDNAFSYIKANGGIDTETGYPYEG- 201
Query: 182 QDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDATW--FNFYHGG 238
QD C + +SS G+ + E+ L Q V + PVSVAIDA+ F FYH G
Sbjct: 202 QDGTCRYSKSSIGAD---DTGFVDIPEGDEDALKQAVATVGPVSVAIDASHMSFQFYHSG 258
Query: 239 VFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSG 296
V+ P C + +HGV +VGYGT + + YWLVKN WGT W G + + R
Sbjct: 259 VYDEPQCSPSALDHGVLVVGYGT----DNGKDYWLVKNSWGTGWGTEGYIYMSR--NNQN 312
Query: 297 LCNIAANAAYPL 308
C IA+ A+YPL
Sbjct: 313 QCGIASKASYPL 324
>gi|125606655|gb|EAZ45691.1| hypothetical protein OsJ_30364 [Oryza sativa Japonica Group]
Length = 326
Score = 171 bits (434), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 116/299 (38%), Positives = 147/299 (49%), Gaps = 44/299 (14%)
Query: 27 KDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFLASYTGYKPPPTDH 73
+D A+K RF++FKKN H+F L LNKFADLT E+F A YTG P P
Sbjct: 41 RDLADKGSRFEVFKKNARYIHDFNRKKGMSYKLGLNKFADLTLEEFTAKYTGANPGPITG 100
Query: 74 PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRT 132
+ + L + + DW E GAVT VKDQG CWAF+ V VEG+N+I T
Sbjct: 101 LKNGTGS--PPLAAVAGDAPPAWDWREHGAVTRVKDQGPCGSCWAFSVVEAVEGINEIMT 158
Query: 133 GQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSS 192
G +T S+ Q T EN F Y Y+ + C +
Sbjct: 159 GNFLTLSEQQCFSPPTTG-------ENYF-YYPAYEAVQEPCRF--------------DP 196
Query: 193 ASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDATW-FNFYHGGVFTGPCGNTPNH 250
I Y +V P EE L Q V S+ PVSV I+A++ F Y GGVF+GPCG NH
Sbjct: 197 NKAPIVKIDSYSFVDPNDEEALKQAVYSQGPVSVLIEASYEFMIYQGGVFSGPCGTELNH 256
Query: 251 GVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
V +VGY E E PYW+VKN WG W E G +R+ R + G+C IA YP+
Sbjct: 257 AVLVVGY---DETEDGTPYWIVKNSWGAGWGESGYIRMIRNIPAPEGICGIAMYPIYPI 312
>gi|157829826|pdb|1AEC|A Chain A, Crystal Structure Of Actinidin-E-64 Complex+
Length = 218
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 128/220 (58%), Gaps = 15/220 (6%)
Query: 96 IDWNERGAVTPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLN 150
+DW GAV +K QG C CWAF+A+ATVEG+NKI TG L++ S+ +L+DC
Sbjct: 5 VDWRSAGAVVDIKSQGE-CGGCWAFSAIATVEGINKIVTGVLISLSEQELIDCGRTQNTR 63
Query: 151 GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT 210
GC ++ + F++I + +E YPY QD C+ + KY I Y+ V
Sbjct: 64 GCNGGYITDGFQFIINNGGINTEENYPYT-AQDGECN--VDLQNEKYVTIDTYENVPYNN 120
Query: 211 EEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQP 268
E LQ V+ QPVSVA+DA F Y G+FTGPCG +H VTIVGYGT EG
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYGT----EGGID 176
Query: 269 YWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
YW+VKN W T W E G MRI R VGG+G C IA +YP+
Sbjct: 177 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 216
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 171 bits (433), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 114/313 (36%), Positives = 165/313 (52%), Gaps = 39/313 (12%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASYT 64
QW + + Y E+ +R+ I+K N L++N+F D+T +F A +
Sbjct: 29 QWKMYHNKVYSHDGEETVRYTIWKDNERRIREHNLKGGDFLLKMNQFGDMTNSEFKA-FN 87
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
GY H H N S + L + D++DW G VTPVKDQG CWAF+ +
Sbjct: 88 GY----LSHKHVNGSTF---LTPNNFVAPDTVDWRNEGYVTPVKDQGQCGSCWAFSTTGS 140
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
+EG + +TG+LV+ S+ LVDCST NGC ++NAF YI++ + + SE YPY
Sbjct: 141 LEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENKGIDSEASYPYTA 200
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQD-VVSRQPVSVAIDATW--FNFYHG 237
+D C + + S + G+ + E L++ V S P+SVAIDA+ F FY
Sbjct: 201 -EDGKCVFKKPSVA---ATDTGFVDLPEGNENKLKEAVASVGPISVAIDASHESFQFYSS 256
Query: 238 GVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
GV+ P C +T +HGV +VGYGT E + YWLVKN W T+W + G +++ R
Sbjct: 257 GVYNEPSCSSTELDHGVLVVGYGT----ESGKDYWLVKNSWNTSWGDKGYIKMRRNAKNQ 312
Query: 296 GLCNIAANAAYPL 308
C IA A+YPL
Sbjct: 313 --CGIATKASYPL 323
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 115/316 (36%), Positives = 162/316 (51%), Gaps = 38/316 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
+ W +Y E+ R I++ N +F L +NKFADLT +F A Y
Sbjct: 23 DSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEGHSYKLAVNKFADLTYPEFAAKY 82
Query: 64 TGYKPPPTDHPHS-NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
G + T+ S S + + +S DS+DW G VTP+KDQG CW+F+
Sbjct: 83 LGLRFDATNATKSFAASTYLPRM----VSLPDSVDWRTAGIVTPIKDQGQCGSCWSFSTT 138
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASECVYPY 178
+VEG + +TGQLV+ S+ LVDCS+ GC ++ AF+YI + +E YPY
Sbjct: 139 GSVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPY 198
Query: 179 QGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW--FNF 234
QD C + S GA + YQ + +E LQ+ V+ P+SVAIDA+ F F
Sbjct: 199 TA-QDGTCQF----NSANVGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQF 253
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ P ++ +HGV VGYGT+ G YWLVKN WGT+W + G + + R
Sbjct: 254 YSSGVYNEPACSSSQLDHGVLAVGYGTS----GSSDYWLVKNSWGTSWGQSGYIWMTR-- 307
Query: 293 GGSGLCNIAANAAYPL 308
+ C IA A+YPL
Sbjct: 308 NSNNQCGIATAASYPL 323
>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
Length = 329
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 118/316 (37%), Positives = 161/316 (50%), Gaps = 39/316 (12%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFL 60
W ++TY + E+ R +I+++N L +N D+TRE+ L
Sbjct: 28 MWKKNHSKTYTSELEELGRREIWERNLRLITVHNLEASLGMHTYDLGMNHMGDMTREEIL 87
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
+ G + P + RS+ F + S+ +S DS+DW E+G VT VK+QGS CWAF+
Sbjct: 88 QMFAGTRVRPN---LTRRSSPF--VASAGISVPDSVDWREKGYVTEVKNQGSCGSCWAFS 142
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
A +EG K TGQ+ + S LVDCS+ GC F+ AF+Y+ + S+ Y
Sbjct: 143 AAGALEGQLKRTTGQVKSLSPQNLVDCSSKYGNKGCNGGFMTQAFQYVIDDGGIDSDEAY 202
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDAT--WFN 233
PY D C R S + Y YV EE L Q V + P+SVAIDAT F
Sbjct: 203 PYTA-MDGQC---RYDQSQRAANCSSYNYVSEGDEEALKQAVATIGPISVAIDATRPMFI 258
Query: 234 FYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
YH GV++ P C NHGV +VGYG+ + YWLVKN WGT + +GG +RI R
Sbjct: 259 LYHSGVYSDPTCTQNVNHGVLVVGYGSLN----GEDYWLVKNSWGTRFGDGGYIRIARNK 314
Query: 293 GGSGLCNIAANAAYPL 308
G +C IA A YPL
Sbjct: 315 G--NMCGIANYACYPL 328
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 115/310 (37%), Positives = 156/310 (50%), Gaps = 55/310 (17%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E WM + +TY+ EK R ++FK N +L LN+FADL+ E+F
Sbjct: 48 ESWMSKHGKTYESIEEKLHRLEVFKDNLMHIDRRNRDVTTYWLALNEFADLSHEEF---- 103
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
SK++ I E+GAV PVK+QGS CWAF+ VA
Sbjct: 104 -----------------------KSKLA---QIRRLEKGAVAPVKNQGSCGSCWAFSTVA 137
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG+N+I TG L + S+ +L+DC T +GC ++ AF+YI L E YPY
Sbjct: 138 AVEGINQIVTGNLTSLSEQELIDCDTSFNSGCNGGLMDYAFDYIVNNGGLHKEEDYPYL- 196
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
++ CD R + I GY V EE L ++ QP+S+AI+A+ F FY G
Sbjct: 197 MEEGTCDEKREEM--EVVTISGYHDVPENNEESLLKALAHQPLSIAIEASGRDFQFYGRG 254
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VF GPCG +HGV VGYG++ + Y +VKN WG W E G +R+ R G GL
Sbjct: 255 VFNGPCGTDLDHGVAAVGYGSSKGLD----YIIVKNSWGPKWGEKGYIRMKRNTGKPEGL 310
Query: 298 CNIAANAAYP 307
C I A+YP
Sbjct: 311 CGINKMASYP 320
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 108/274 (39%), Positives = 151/274 (55%), Gaps = 17/274 (6%)
Query: 43 HEF-LRLNKFADLTREKFLASYT-GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNE 100
H F L L +FADLT E++ A G + S + L ++ D++DW E
Sbjct: 106 HGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVVGSRRYLPLAGEQLP--DAVDWRE 163
Query: 101 RGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFL 157
RGAV VKDQG CWAF+AVA VEG+NKI TG L++ S+ +L+DC GC +
Sbjct: 164 RGAVAEVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDCDKFQDQGCDGGLM 223
Query: 158 ENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDV 217
+NAF ++ + + +E YP+ G D CD + + +I ++ V E LQ
Sbjct: 224 DNAFVFMIKNGGIDTEADYPFTG-HDGTCDLKLKNT--RVVSIDSFERVPINYERALQKA 280
Query: 218 VSRQPVSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNR 275
V+ QPVS +I+A+ F Y G+F G CG +HGVT+VGYG+ EG + YW+VKN
Sbjct: 281 VAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGS----EGGKDYWIVKNS 336
Query: 276 WGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
WGT W E G +R+ R V +G C IA YP+
Sbjct: 337 WGTQWGEAGYVRMARNVRVRAGKCGIAMEPLYPV 370
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 171 bits (433), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 114/312 (36%), Positives = 168/312 (53%), Gaps = 31/312 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIF-----------KKNHEF-LRLNKFADLTREKFLASY 63
+ WM++ + Y+ EK RF+IF KKN+ + L LN FADL+ ++F Y
Sbjct: 49 DSWMLKHNKIYESIDEKIYRFEIFRDNLMYIDETNKKNNSYWLGLNGFADLSNDEFKKKY 108
Query: 64 TGYKPPP-TDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
G+ T H + ++ +K++ + + SIDW +GAVTPVK+QG+ CWAF+
Sbjct: 109 VGFVAEDFTGLEHFDNEDFTYKHVTN----YPQSIDWRAKGAVTPVKNQGACGSCWAFST 164
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+ATVEG+NKI TG L+ S+ +LVDC + GC + + +Y+ + + VYP Q
Sbjct: 165 IATVEGINKIVTGNLLELSEQELVDCDKHSYGCKGGYQTTSLQYVAN-NGVHTSKVYPCQ 223
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
+Q Y C + G I GY+ V E ++ QP+S ++A F Y
Sbjct: 224 AKQ-YKCR--ATDKPGPKVKITGYKRVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKS 280
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-G 296
GVF GPCG +H VT VGYGT ++G+ Y ++KN WG NW E G MR+ R G S G
Sbjct: 281 GVFDGPCGTKLDHAVTAVGYGT---SDGKN-YIIIKNSWGPNWGEKGYMRLKRQSGNSQG 336
Query: 297 LCNIAANAAYPL 308
C + ++ YP
Sbjct: 337 TCGVYKSSYYPF 348
>gi|1709576|sp|P05994.3|PAPA4_CARPA RecName: Full=Papaya proteinase 4; AltName: Full=Glycyl
endopeptidase; AltName: Full=Papaya peptidase B;
AltName: Full=Papaya proteinase IV; Short=PPIV; Flags:
Precursor
gi|953176|emb|CAA54974.1| proteinase IV [Carica papaya]
Length = 348
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 111/310 (35%), Positives = 161/310 (51%), Gaps = 33/310 (10%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTG 65
WM++ + YK+ EK RF+IFK N ++ L LN+F+DL+ ++F Y G
Sbjct: 51 WMLKHNKNYKNVDEKLYRFEIFKDNLKYIDERNKMINGYWLGLNEFSDLSNDEFKEKYVG 110
Query: 66 YKPPP-TDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVA 122
P T+ P+ +N + +S+DW +GAVTPVK QG YC CWAF+ VA
Sbjct: 111 SLPEDYTNQPYDEEF-----VNEDIVDLPESVDWRAKGAVTPVKHQG-YCESCWAFSTVA 164
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
TVEG+NKI+TG LV S+ +LVDC + GC + + + +Y+ Q + YPY +
Sbjct: 165 TVEGINKIKTGNLVELSEQELVDCDKQSYGCNRGYQSTSLQYVAQ-NGIHLRAKYPYIAK 223
Query: 182 QDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YHGGV 239
Q + G G VQ E L + ++ QPVSV +++ +F Y GG+
Sbjct: 224 QQ---TCRANQVGGPKVKTNGVGRVQSNNEGSLLNAIAHQPVSVVVESAGRDFQNYKGGI 280
Query: 240 FTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLC 298
F G CG +H VT V G + Y L+KN WG W E G +RI R G S G+C
Sbjct: 281 FEGSCGTKVDHAVTAV----GYGKSGGKGYILIKNSWGPGWGENGYIRIRRASGNSPGVC 336
Query: 299 NIAANAAYPL 308
+ ++ YP+
Sbjct: 337 GVYRSSYYPI 346
>gi|351629617|gb|AEQ54772.1| KDEL-tailed cysteine proteinase CP4, partial [Coffea canephora]
Length = 215
Score = 171 bits (432), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 94/200 (47%), Positives = 125/200 (62%), Gaps = 12/200 (6%)
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF+ V VEG+NKI+TGQLV+ S+ +LVDC T N GC +ENA+E+I++ + +E
Sbjct: 6 CWAFSTVVGVEGINKIKTGQLVSLSEQELVDCETDNEGCNGGLMENAYEFIKKSGGITTE 65
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-- 231
+YPY+ R D CD + +A I G++ V E L V+ QPVSVAIDA+
Sbjct: 66 RLYPYKAR-DGSCDSSKMNAPAV--TIDGHEMVPANDENALMKAVANQPVSVAIDASGSD 122
Query: 232 FNFYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GV+TG CGN +HGV +VGYGT + YW+VKN WGT W E G +R+ R
Sbjct: 123 MQFYSEGVYTGDSCGNELDHGVAVVGYGTALDG---TKYWIVKNSWGTGWGEQGYIRMQR 179
Query: 291 GVGGS--GLCNIAANAAYPL 308
GV + G+C IA A+YPL
Sbjct: 180 GVDAAEGGVCGIAMEASYPL 199
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 160/322 (49%), Gaps = 39/322 (12%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------------LRLNKFAD 53
A+ E W E + Y E+ R F +N F L LN FAD
Sbjct: 37 AQFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFAD 96
Query: 54 LTREKFLASYTG---YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
LT ++F A+ G P P P + + + + D++DW + GAVT VKDQ
Sbjct: 97 LTHDEFRAARLGRLAVGPGPLGAPSPSDGGFEGRVGAVP----DALDWRQSGAVTKVKDQ 152
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQY 167
GS CW+F+A +EG+NKI TG L++ S+ +L+DC S GC + A++++ +
Sbjct: 153 GSCGACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKN 212
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
+ +E YP++ D C+ ++ I GY+ V + E+ L V++QP+SV I
Sbjct: 213 GGIDTEDDYPFR-EADGTCN--KNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGI 269
Query: 228 --DATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
A F Y G+F GPC + +H V IVGYG+ EG + YW+VKN WG W G
Sbjct: 270 CGSARAFQLYSQGIFDGPCPTSLDHAVLIVGYGS----EGGKDYWIVKNSWGERWGMKGY 325
Query: 286 MRIFRGVG-GSGLCNIAANAAY 306
M + R G SG+C I A++
Sbjct: 326 MHMHRNTGSSSGICGINMMASF 347
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 111/316 (35%), Positives = 164/316 (51%), Gaps = 34/316 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR----------------LNKFADLTREKF 59
+QW + + Y+ E E RF+ FK N +++ LNKFAD++ E+F
Sbjct: 50 QQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLNKFADMSNEEF 109
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+Y P + + N + + S S+DW G VT VKDQGS CWAF
Sbjct: 110 RKAYLSKVKKPINKGITLSRNMRRKVQSCDAP--SSLDWRNYGVVTAVKDQGSCGSCWAF 167
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++ +EG+N + TG L++ S+ +LV+C T N GC +++ AFE++ + SE YP
Sbjct: 168 SSTGAMEGINALVTGDLISLSEQELVECDTSNYGCEGGYMDYAFEWVINNGGIDSESDYP 227
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID--ATWFNFY 235
Y G D C+ + K +I GYQ V+ ++ L V++QPVSV ID A F Y
Sbjct: 228 YTG-VDGTCN--TTKEETKVVSIDGYQDVEQ-SDSALLCAVAQQPVSVGIDGSAIDFQLY 283
Query: 236 HGGVFTGPCGNTP---NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
GG++ G C + P +H V IVGYG+ E + YW+VKN WGT+W G + R
Sbjct: 284 TGGIYDGSCSDDPDDIDHAVLIVGYGS----EDSEEYWIVKNSWGTSWGIDGYFYLKRDT 339
Query: 293 GGS-GLCNIAANAAYP 307
G+C + A A+YP
Sbjct: 340 DLPYGVCAVNAMASYP 355
>gi|125526836|gb|EAY74950.1| hypothetical protein OsI_02846 [Oryza sativa Indica Group]
Length = 359
Score = 170 bits (431), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 118/333 (35%), Positives = 163/333 (48%), Gaps = 43/333 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHE-------------FLRLNKFADLTRE 57
+AA+H WM RTY D AEK RF++F+ N E L L FADLT +
Sbjct: 34 MAARHRCWMARVGRTYADAAEKARRFEVFRANAERIDAANRAGDLTYTLGLTPFADLTAD 93
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKM--------SFYDSIDWNERGAVTPVKD 109
+F A + D P + R + + ++K + + S DW + GAVTPV+D
Sbjct: 94 EFRARHL-MPDADVDEPATARVLFEQEEKAAKQHLPPSRPPAVWGSKDWRDLGAVTPVQD 152
Query: 110 QGSY---CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TLNGCAKNFLENAFEYIR 165
Q CWAF AVA EGL KI TG + S Q++DC+ N C + A YI
Sbjct: 153 QDKNNCNSCWAFAAVAATEGLIKIETGNVTPLSAQQVLDCTGGDNTCKGGHIHEALRYIA 212
Query: 166 QYQ---RLASECVY-PYQGRQDY-YCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR 220
RL+++ Y PY G + +S+S IRG Q V P ++ L+ V R
Sbjct: 213 TASAGGRLSTDTSYRPYDGEKGTCAAGSGSASSSSVAVVIRGVQKVTPHDKDALRAAVER 272
Query: 221 QPVSVAIDAT---WFNFYHGGVFTGP--CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNR 275
QPV+ +D++ + F G V+ G CG NH V +VGYGT ++ PYWL+KN
Sbjct: 273 QPVAADMDSSDPEFRGFKGGRVYRGSAGCGKKRNHAVAVVGYGTASDG---TPYWLLKNS 329
Query: 276 WGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
WGT+W E G MRI C +++ AYP
Sbjct: 330 WGTDWGENGYMRI----AVDADCGVSSRPAYPF 358
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 170 bits (430), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 110/307 (35%), Positives = 158/307 (51%), Gaps = 29/307 (9%)
Query: 19 MVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASYTGY 66
M + ++Y+ EK RF++F+ N + +L LN+FADL+ E+F Y G
Sbjct: 1 MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVSSYWLGLNEFADLSHEEFKRKYLGL 60
Query: 67 KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVE 125
K S +K++ S+DW ++GAV VK+QG+ CWAF+ VA VE
Sbjct: 61 KIELPKRRDSPEEFSYKDV----ADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116
Query: 126 GLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
G+N+I TG L S+ +L+DC NGC ++ AF +I L E YPY ++
Sbjct: 117 GINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYV-MEE 175
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGGVFT 241
C + + I GY V E+ ++ QP+SVAI+A+ F FY GG+F
Sbjct: 176 GTCGEKKEEL--EVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFN 233
Query: 242 GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNI 300
G CG +HGV VGYGT+ + Y VKN WG+ W E G +R+ R VG G+C I
Sbjct: 234 GHCGTELDHGVAAVGYGTSKGVD----YITVKNSWGSKWGEKGYIRMKRNVGKPEGICGI 289
Query: 301 AANAAYP 307
A+YP
Sbjct: 290 YKMASYP 296
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 180/324 (55%), Gaps = 40/324 (12%)
Query: 15 HEQW---MVEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLT 55
EQW ++ ++ Y + E+ R KIF +N H+ L LNK+AD+
Sbjct: 24 QEQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAKHNKLFSQGFVKFKLGLNKYADML 83
Query: 56 REKFLASYTGY-KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC 114
+F+++ G+ K S+ ++ + ++ + + D++DW ++GAVT VKDQG +C
Sbjct: 84 HHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVKLPDTVDWRDKGAVTEVKDQG-HC 142
Query: 115 --CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQR 169
CW+F+A ++EG + +TG+LV+ S+ LVDCS NGC ++NAF YI+
Sbjct: 143 GSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRYGNNGCNGGLMDNAFRYIKDNGG 202
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAID 228
+ +E YPY +D C +++ SG +G+ ++ A E+ L+ V+ PVS+AID
Sbjct: 203 IDTEKSYPYLA-EDEKCH-YKAQNSG--ATDKGFVDIEEANEDDLKAAVATVGPVSIAID 258
Query: 229 ATW--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
A+ F Y GV++ P C + +HGV +VGYGT+ + Q YWLVKN WG +W G
Sbjct: 259 ASHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTSDDG---QDYWLVKNSWGPSWGLNG 315
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
+++ R +C +A+ A+YPL
Sbjct: 316 YIKMAR--NQDNMCGVASQASYPL 337
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 110/339 (32%), Positives = 164/339 (48%), Gaps = 52/339 (15%)
Query: 6 HKTGNIAAKHE----QWMVEFARTYKDQAEKEMRFKIFKKNHEF---------------- 45
+ GN++A +E W E + Y E+ R F N F
Sbjct: 29 EREGNLSAAYEPLFEAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNA 88
Query: 46 -----LRLNKFADLTREKFLAS------YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD 94
L LN FADLT +F A+ G + PP++ + + + +
Sbjct: 89 APSYTLALNAFADLTHAEFRAARLGRLAVGGARAPPSEGGFAG--------SVGVGAVPE 140
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNG 151
++DW + GAVT VKDQGS CW+F+A +EG+NKI+TG L++ S+ +L+DC S G
Sbjct: 141 ALDWRQSGAVTKVKDQGSCGACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAG 200
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ A+ ++ + + +E YPY+ D C+ ++ I GY V E
Sbjct: 201 CGGGLMDYAYRFVIKNGGIDTEDDYPYR-EADGTCN--KNKLKRHVVTIDGYSDVPANKE 257
Query: 212 EGLQDVVSRQPVSVAI--DATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ L V++QP+SV I A F Y G+F GPC + +H V IVGYG+ EG + Y
Sbjct: 258 DSLLQAVAQQPISVGICGSARAFQLYSQGIFDGPCPTSLDHAVLIVGYGS----EGGKDY 313
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYP 307
W+VKN WG W G M + R G SG+C I A++P
Sbjct: 314 WIVKNSWGERWGMKGYMHMHRNTGSSSGICGINMMASFP 352
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 169 bits (429), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 118/330 (35%), Positives = 170/330 (51%), Gaps = 49/330 (14%)
Query: 16 EQWMV---EFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTR 56
E+W + E + Y + E++ R KIF N + L LNK++D+
Sbjct: 25 EEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQKITKHNTKYQRGEVGYKLGLNKYSDMLH 84
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSS------KMSFYDSIDWNERGAVTPVKDQ 110
+F+ ++ G+ PH +N +L S + +DW + GAVTPVKDQ
Sbjct: 85 HEFINTFNGFNKSIIP-PHLRSNNGKTHLKGSFFIPPANVKLPKHVDWVKLGAVTPVKDQ 143
Query: 111 GSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIR 165
G +C CWAF+A +EGL+ +T LV+ S+ L+DCST NGC ++ AF+Y+R
Sbjct: 144 G-HCGSCWAFSATGALEGLHFRKTKVLVSLSEQNLIDCSTEEGNNGCNGGLMDQAFQYVR 202
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIR-GYQYVQPATEEGLQDVVSRQ-PV 223
+ +E YPY+G D C + ++ GAI GY V E+ L+ V+ PV
Sbjct: 203 INGGIDTERSYPYEGNNDV-CRYEPENS----GAIDTGYTDVPLGDEDALKSAVATVGPV 257
Query: 224 SVAIDATW--FNFYHGGVFTGP-CGNTP---NHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
SVAIDA+ F Y GV+ P C N P +HGV +VGYGT + E QQ YWLVKN WG
Sbjct: 258 SVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVGYGT--DEETQQDYWLVKNSWG 315
Query: 278 TNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
+W E G +++ R C IA ++P
Sbjct: 316 DSWGENGYIKMARNADNQ--CGIATQPSFP 343
>gi|242072388|ref|XP_002446130.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
gi|241937313|gb|EES10458.1| hypothetical protein SORBIDRAFT_06g002130 [Sorghum bicolor]
Length = 276
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 101/276 (36%), Positives = 149/276 (53%), Gaps = 33/276 (11%)
Query: 36 FKIFKKNHEFLRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDS 95
F K N +L +N+FADLT E+F A+ G+KP + + FK N S + +
Sbjct: 28 FNANKNNKFWLGVNQFADLTTEEFKAN-KGFKPTSAEKVPTTG---FKYENLSVSALPTA 83
Query: 96 IDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAK 154
+DW +GAVTP+K+QG CCWAF+AVA +EG+ K+ TG L++ SK +LVDC T
Sbjct: 84 VDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSKQELVDCDT------ 137
Query: 155 NFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL 214
+ ++ E Y+ + +C S I+G++ V E L
Sbjct: 138 HSMDEGCEVQLPYKAVDGKC----------------KGGSKSAATIKGHEDVPVNNEAAL 181
Query: 215 QDVVSRQPVSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLV 272
V+ QPVSVA+DA+ F Y GGV TG CG +HG+ +GYG E++G + YW++
Sbjct: 182 MKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYG--MESDGTK-YWIL 238
Query: 273 KNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
KN WGT W E G +R+ + + G+C +A +YP
Sbjct: 239 KNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYP 274
>gi|413953051|gb|AFW85700.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 359
Score = 169 bits (428), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 118/336 (35%), Positives = 172/336 (51%), Gaps = 51/336 (15%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL--------------NKFADLTR 56
+ + + W E+ RTY E + RF ++ +N F++ N+F DLT
Sbjct: 36 LLERFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTE 95
Query: 57 EKFLASYT---GYKPPPTDHPHSNRSNWFKNLNSSKMSFYD-------SIDWNERGAVTP 106
E+F +Y +PP + ++++ MS D S+DW +GAVTP
Sbjct: 96 EEFKDTYLMKLDEQPPAAEA----MPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTP 151
Query: 107 VKDQ---GSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENA 160
VK+Q GS CWAF VA++EG+++I+TG+LV+ S+ ++VDC +GC + +A
Sbjct: 152 VKNQQQCGS--CWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSA 209
Query: 161 FEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYG----AIRGYQYVQPATEEGLQD 216
E++ + L +E YPY G Q R SGK G IRGYQ VQ E L+
Sbjct: 210 MEWVTRNGGLTTESDYPYVGSQ-------RQCMSGKLGHHAARIRGYQAVQRKNEAELER 262
Query: 217 VVSRQPVSVAIDAT-WFNFYHGGVFTGPCGNTP-NHGVTIV-GYGTTTEAEGQQPYWLVK 273
V+ +PV+V IDA+ F FY GVF+GPC T NH VT+V +++ G + YW+VK
Sbjct: 263 AVAGRPVAVVIDASRAFQFYKRGVFSGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVK 322
Query: 274 NRWGTNWDEGG-SMRIFRGVGGSGLCNIAANAAYPL 308
N WG W E G R G+C IA YP+
Sbjct: 323 NSWGQRWGENGYVRMARRVRAREGMCAIAIEPYYPV 358
>gi|414591039|tpg|DAA41610.1| TPA: hypothetical protein ZEAMMB73_356414 [Zea mays]
Length = 376
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 118/311 (37%), Positives = 154/311 (49%), Gaps = 34/311 (10%)
Query: 27 KDQAEKEMRFKIFKKNH----EF---------LRLNKFADLTREKFLASYTGYKPPPTDH 73
+D EK+ RF+ FK N EF L LNKFADLT+E+F++ YTG K ++
Sbjct: 56 RDLREKQSRFEAFKANARHIGEFNKRKDVPYKLGLNKFADLTQEEFVSKYTGAKVVDSEA 115
Query: 74 PH--------SNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
S+ L +S D+ DW + GAVT VKDQG CWAF+AV V
Sbjct: 116 AARLASGVRVSSSDESPPQLAASVGDAPDAWDWRDHGAVTAVKDQGQCGSCWAFSAVGAV 175
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTLNGCA-KNFLENAFEYIRQYQRLASEC-----VYPY 178
E +N I TG L+T S+ Q++DCS C + A Y +C Y
Sbjct: 176 ESVNAIVTGNLLTLSEQQMLDCSGAGDCTYGGYTYYAMLYAISNGLTLDQCGKTPYYQRY 235
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHGG 238
+Q C + + I + A E L+ V +QPVSV IDA +Y G
Sbjct: 236 DAQQHLPCRF--DAKKPPVVKIDSMYVMNNADEAALKRAVYKQPVSVLIDAGGIGYYSEG 293
Query: 239 VFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGL 297
VFTGPCG + NH V +VGYG T A+G + YW+VKN WG +W E G R+ R VG GL
Sbjct: 294 VFTGPCGTSLNHAVLLVGYGAT--ADGTK-YWIVKNSWGADWGEKGYFRLKRDVGTQGGL 350
Query: 298 CNIAANAAYPL 308
C I YP+
Sbjct: 351 CGITMYPIYPI 361
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 169 bits (428), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 108/292 (36%), Positives = 148/292 (50%), Gaps = 42/292 (14%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR----------------LNKFADLT 55
A + + F + Y+ E+ RF IF N F+ +N+FADLT
Sbjct: 17 AMSFDDFKTTFEKQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLT 76
Query: 56 REKFLASYTGYKPPPTDHPHSNRSN-WFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
E++ Y +P PT+ R W N+ S+DW ++GAVTP+K+QG
Sbjct: 77 NEEYRQLY--LRPYPTELLGRERQEVWLDGPNAG------SVDWRQKGAVTPIKNQGQCG 128
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRL 170
CW+F+ +VEG + I TG LV+ S+ QLVDCS GC ++NAF+YI L
Sbjct: 129 SCWSFSTTGSVEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGL 188
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT 230
+E YPY R D CD +S S +I GY+ V E+ L V + PVSVAI+A
Sbjct: 189 DTEQDYPYTAR-DGVCD--KSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEAD 245
Query: 231 W--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
F Y GVF+GPCG +HGV +VGY + YW+VKN WG +W
Sbjct: 246 QQSFQMYSSGVFSGPCGTNLDHGVLVVGYTSD--------YWIVKNSWGASW 289
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 113/318 (35%), Positives = 164/318 (51%), Gaps = 41/318 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + V + YK+Q E+ R KIF N + +++N F DL +
Sbjct: 28 ETFKVVHGKNYKNQFEEMFRRKIFMNNKKRIEAHNAKYEQGEVSYKMKMNHFGDLMSHEI 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A G+K P N K S S+DW ++GAVTPVKDQG CW+F
Sbjct: 88 KALMNGFKMTP------NTKREGKIYFPSNDKLPKSVDWRQKGAVTPVKDQGQCGSCWSF 141
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+A ++EG ++ G+LV+ S+ L+DCS NGC ++ AF+Y+ + + +E
Sbjct: 142 SATGSLEGQIFLKKGKLVSLSEQNLMDCSKEYGNNGCEGGLMDKAFQYVSDNKGIDTESS 201
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--F 232
YPY+ R DY C + + G +GY + E+ LQ+ ++ P+SVAIDA+ F
Sbjct: 202 YPYEAR-DYACRFKKDKVGG---TDKGYVDIPEGDEKALQNALATVGPISVAIDASHESF 257
Query: 233 NFYHGGVFTGP-CGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
+FY GV+ P C + +HGV VGYGT E Q YWLVKN WG +W E G ++I R
Sbjct: 258 HFYSEGVYNEPYCSSYDLDHGVLAVGYGT----ENGQDYWLVKNSWGPSWGESGYIKIAR 313
Query: 291 GVGGSGLCNIAANAAYPL 308
S C IA+ A+YP+
Sbjct: 314 --NHSNHCGIASMASYPI 329
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 112/320 (35%), Positives = 161/320 (50%), Gaps = 37/320 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-----------------LNKFADLTREK 58
E+WM + + Y EK R+ F N F+R +N FADL+ E+
Sbjct: 52 ERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADLSNEE 111
Query: 59 FLASYTGY---KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
F Y+ K R+ + + S+DW +RGAVT VK+QG
Sbjct: 112 FREVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPA--SLDWRKRGAVTAVKNQGDCGS 169
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAF++ +EG+N I TG+L++ S+ +LVDC T N GC +++ AFE++ + SE
Sbjct: 170 CWAFSSTGAMEGINAITTGELISLSEQELVDCDTTNEGCDGGYMDYAFEWVINNGGIDSE 229
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY G+ D C+ + K +I GY+ V +E L +QPVSV ID + +
Sbjct: 230 ANYPYTGQADSVCNTTKEEI--KVVSIDGYEDV-ATSESALLCAAVQQPVSVGIDGSSLD 286
Query: 234 F--YHGGVFTGPCGNTP---NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
F Y GG++ G C P +H V +VGYG +G YW+VKN WGT+W G + I
Sbjct: 287 FQLYAGGIYDGDCSGNPDDIDHAVLVVGYGQ----QGGTDYWIVKNSWGTDWGMQGYIYI 342
Query: 289 FRGVGGS-GLCNIAANAAYP 307
R G G+C I A A+YP
Sbjct: 343 RRNTGLPYGVCAIDAMASYP 362
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 118/331 (35%), Positives = 172/331 (51%), Gaps = 52/331 (15%)
Query: 9 GNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFA 52
G + A+ EQ+ F R Y + R IF+ N +F+ +N F
Sbjct: 27 GELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFT 86
Query: 53 DLTREKFLASYTGYK----PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
DL+ E+F A++ GY+ D H++ N + L ++ +DW +G VTP+K
Sbjct: 87 DLSNEEFRATFNGYRRLAAVSLADSVHAD--NDVEALPAT-------VDWTTKGVVTPIK 137
Query: 109 DQ---GSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFE 162
+Q GS CWAF+AVA++EG + ++TG+LV+ S+ LVDCS GC+ +++ AF+
Sbjct: 138 NQQQCGS--CWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFK 195
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQD-VVSRQ 221
Y+ Q + + +E YPY+ D C++ R+S I + V+ E LQ+ V S
Sbjct: 196 YVIQNRGIDTEASYPYKAI-DESCEFKRNSVG---ATIHSFVDVKTGDESALQNAVASIG 251
Query: 222 PVSVAIDATW--FNFYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
P+SVAIDA F FY GV+ P +T +HGVT VGYGT A PYW VKN WG
Sbjct: 252 PISVAIDAAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGA----PYWKVKNSWG 307
Query: 278 TNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
T+W G IF C IA A+YP+
Sbjct: 308 TSWGRKG--YIFMSRNKQNQCGIATKASYPV 336
>gi|313118760|gb|ADR32292.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 169 bits (427), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 94/220 (42%), Positives = 130/220 (59%), Gaps = 13/220 (5%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNG 151
S+DW ++G + VKDQGS CWAF+AVA +E +N I TG L++ S+ +LVDC S G
Sbjct: 4 SVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYNEG 63
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AFE++ + SE YPY+ R D CD +R +A K I Y+ V E
Sbjct: 64 CDGGLMDYAFEFVINNGGIDSEEDYPYKERND-VCDQYRKNA--KVVKIDSYEDVPVNNE 120
Query: 212 EGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ LQ V+ QPVS+A++A +F H G+FTG CG +HGV GYGT E Y
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT----ENGMDY 176
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W+V+N WG NW E G +R+ R + SGLC +A +YP+
Sbjct: 177 WIVRNSWGANWGEKGYLRVQRNIASSSGLCGLATEPSYPV 216
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 166/311 (53%), Gaps = 39/311 (12%)
Query: 24 RTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLASYTGYK 67
+ Y ++ E+ R KIF +N + L+LN AD+ ++ Y G+
Sbjct: 36 KEYDNELEESYRKKIFLENKKRIEKHNSRYKQGKVSFKLKLNHLADMLIHEYSDVYLGFN 95
Query: 68 PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVATVE 125
+ ++N+ + + + ++ +DW +GAVTPVK+QG +C CWAF+ +E
Sbjct: 96 K--SSKANNNKLQSYTFIPPAHVTLNKEVDWRTKGAVTPVKNQG-HCGSCWAFSTTGALE 152
Query: 126 GLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQ 182
G N +TG+LV+ S+ LVDCS NGC ++NAF+YI++ + +E YPY+G +
Sbjct: 153 GQNFRKTGKLVSLSEQNLVDCSGSYGNNGCEGGLMDNAFQYIKENHGIDTEKSYPYEG-E 211
Query: 183 DYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDATW--FNFYHGGV 239
D C + ++S G+ + EE L Q V + P+SVAIDA+ F FY GV
Sbjct: 212 DETCRFRKTSIGA---TDSGFVDITQGDEEALMQAVATIGPISVAIDASHQSFQFYSEGV 268
Query: 240 FTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
+ P ++ N HGV +VGYG E Q YWLVKN WGT W +GG +++ R +
Sbjct: 269 YYEPECSSENLDHGVLVVGYGV----EDNQKYWLVKNSWGTQWGDGGYIKMARDQDNN-- 322
Query: 298 CNIAANAAYPL 308
C IA A+YPL
Sbjct: 323 CGIATQASYPL 333
>gi|413953046|gb|AFW85695.1| thiol protease SEN102 [Zea mays]
Length = 382
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 121/333 (36%), Positives = 167/333 (50%), Gaps = 46/333 (13%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL--------------NKFADLTR 56
+ + + W E+ RTY E + RF I+ +N F++ N+F DLT
Sbjct: 60 LLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTE 119
Query: 57 EKFLASY---------TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
E+F +Y PPT S N N++ + +S+DW +GAVT V
Sbjct: 120 EEFKDTYLMKLDEQPPAAEAMPPTVGTMSTAG--MSNGNNTGEA-PNSVDWRTKGAVTRV 176
Query: 108 KDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFE 162
KDQ C CWAF VA++EG+++I+TG+LV+ S+ ++VDC NGC +A E
Sbjct: 177 KDQ-QQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAME 235
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYG----AIRGYQYVQPATEEGLQDVV 218
++ + L +E YPY G Q R SGK G IRGYQ VQ E L+ V
Sbjct: 236 WVTRNGGLTTESDYPYVGSQ-------RQCMSGKLGHHAARIRGYQAVQRNNEAELERAV 288
Query: 219 SRQPVSVAIDAT-WFNFYHGGVFTGPCGNTPNHGVTIV-GYGTTTEAEGQQPYWLVKNRW 276
+ QPV+V +DA+ F FY GVF+GPC T + V V GYG+T G + YW+VKN W
Sbjct: 289 AGQPVAVFVDASRAFQFYKSGVFSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSW 348
Query: 277 GTNWDEGG-SMRIFRGVGGSGLCNIAANAAYPL 308
G W E G R G+C IA YP+
Sbjct: 349 GQGWGENGYVRMARRVRAREGMCAIAIEPYYPV 381
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 168 bits (426), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 117/331 (35%), Positives = 173/331 (52%), Gaps = 52/331 (15%)
Query: 9 GNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFA 52
G + A+ EQ+ F R Y + R IF+ N +F+ +N F
Sbjct: 27 GELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGDSTFSVSVNNFT 86
Query: 53 DLTREKFLASYTGYK----PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
DL+ E+F A++ GY+ D H++ N + L ++ +DW +G VTP+K
Sbjct: 87 DLSNEEFRATFNGYRRLAAVSLADSVHAD--NDVEALPAT-------VDWTTKGVVTPIK 137
Query: 109 DQ---GSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFE 162
+Q GS CWAF+AVA++EG + ++TG+LV+ S+ LVDCS GC+ +++ AF+
Sbjct: 138 NQQQCGS--CWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMDYAFK 195
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQD-VVSRQ 221
Y+ Q + + +E YPY+ D C++ R+S I + V+ E LQ+ V S
Sbjct: 196 YVIQNRGIDTEASYPYKAI-DESCEFKRNSIG---ATIHSFVDVKTGDESALQNAVASIG 251
Query: 222 PVSVAIDATW--FNFYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
P+SVAIDA+ F FY GV+ P +T +HGVT VGYGT PYW VKN WG
Sbjct: 252 PISVAIDASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTLNGV----PYWKVKNSWG 307
Query: 278 TNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
T+W + G IF C IA A+YP+
Sbjct: 308 TSWGQKG--YIFMSRNKQNQCGIATKASYPV 336
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 113/318 (35%), Positives = 167/318 (52%), Gaps = 41/318 (12%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKN------HEFLR----------LNKFADLTR 56
K + + ++ +TYK+Q E+ RF IFK N H L +N+F D+T+
Sbjct: 23 VKFQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHNVLYEQGLVSYKKGINRFTDMTQ 82
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
E+F A T + PH N + + ++ DSIDW +G VT VKDQG+ C
Sbjct: 83 EEFRAFLT---LSSSKKPHFNTTEHVL----TGLAVPDSIDWRTKGQVTGVKDQGNCGSC 135
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LN-GCAKNFLENAFEYIRQYQRLASE 173
WAF+ + E + G+LV+ S+ QLVDCST +N GC +L+ F Y++ + L +E
Sbjct: 136 WAFSVTGSTEAAYYRKAGKLVSLSEQQLVDCSTDINAGCNGGYLDETFTYVKS-KGLEAE 194
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATWF 232
YPY+G D C + SAS + G++ ++ E L D V PVSVAIDAT+
Sbjct: 195 STYPYKG-TDGSCKY---SASKVVTKVSGHKSLKSEDENALLDAVGNVGPVSVAIDATYL 250
Query: 233 NFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
+ Y G++ C + NHGV +VGYGT+ + YW+VKN WG ++ E G R+ R
Sbjct: 251 SSYESGIYEDDWCSPSELNHGVLVVGYGTSN----GKKYWIVKNSWGGSFGESGYFRLLR 306
Query: 291 GVGGSGLCNIAANAAYPL 308
G C +A + YP+
Sbjct: 307 ---GKNECGVAEDTVYPI 321
>gi|115438534|ref|NP_001043563.1| Os01g0613800 [Oryza sativa Japonica Group]
gi|11034574|dbj|BAB17098.1| cysteine proteinase-like [Oryza sativa Japonica Group]
gi|113533094|dbj|BAF05477.1| Os01g0613800 [Oryza sativa Japonica Group]
gi|125571165|gb|EAZ12680.1| hypothetical protein OsJ_02595 [Oryza sativa Japonica Group]
gi|215766821|dbj|BAG99049.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 359
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 117/333 (35%), Positives = 162/333 (48%), Gaps = 43/333 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHE-------------FLRLNKFADLTRE 57
+AA+H WM RTY D AEK RF++F+ N E L L FADLT +
Sbjct: 34 MAARHRCWMARVGRTYADAAEKARRFEVFRANAERIDAANRAGDLTYTLGLTPFADLTAD 93
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKM--------SFYDSIDWNERGAVTPVKD 109
+F A + D P + R + + ++K + + S DW + GAVTPV+D
Sbjct: 94 EFRARHL-MPDADVDEPATARVLFEQEEKAAKQHLPPSRPPAVWGSKDWRDLGAVTPVQD 152
Query: 110 QGS---YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TLNGCAKNFLENAFEYIR 165
QG CWAF VA EGL KI TG + S Q++DC+ N C + A YI
Sbjct: 153 QGKNNCNSCWAFAVVAATEGLIKIETGNVTPLSAQQVLDCTGGDNTCKGGHIHEALRYIA 212
Query: 166 QYQ---RLASECVY-PYQGRQDY-YCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR 220
RL+++ Y PY G + +S+S IRG Q V P ++ L+ V R
Sbjct: 213 TASAGGRLSTDKSYRPYDGEKGTCAAGSGSASSSSVAVVIRGVQKVTPHDKDALRAAVER 272
Query: 221 QPVSVAIDAT---WFNFYHGGVFTGP--CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNR 275
QPV+ +D++ + F G V+ G CG NH V +VGYGT ++ PYWL+KN
Sbjct: 273 QPVAADMDSSDPEFRGFKGGRVYRGSAGCGKKRNHAVAVVGYGTASDG---TPYWLLKNS 329
Query: 276 WGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
W T+W E G MRI C +++ AYP
Sbjct: 330 WATDWGENGYMRI----AVDADCGVSSRPAYPF 358
>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 113/327 (34%), Positives = 165/327 (50%), Gaps = 53/327 (16%)
Query: 7 KTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFAD 53
+ ++ +HEQ M + + YKD ++ FK+N ++ +N+FA
Sbjct: 31 QDASMXERHEQRMTRYGKVYKDPPKRX-----FKENVNYIEACNNAANKPYKRGINQFAP 85
Query: 54 LTREK-----FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
R K + T +K F+N+ ++ ++D ++GAVTP+K
Sbjct: 86 RNRFKGHMCSSIIRITTFK--------------FENVTATP----STVDCRQKGAVTPIK 127
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYI 164
DQG CCWAF+AVA EG++ + G+L++ S+ +LVDC T GC +++AF++I
Sbjct: 128 DQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDXGCEGGLMDDAFKFI 187
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG-LQDVVSRQPV 223
Q L P D C+ ++ + I GY+ V E+ LQ V+ PV
Sbjct: 188 IQNHGLKHXSQLPLYMGVDGKCNANEAAKN-AATIITGYEDVPANNEKAHLQKAVANNPV 246
Query: 224 SVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
S AIDA+ F FY GVFTG CG +HGVT VGYG + + YWLVKN WGT W
Sbjct: 247 SEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDDG---TEYWLVKNSWGTEWG 303
Query: 282 EGGSMRIFRGVGG-SGLCNIAANAAYP 307
E G +R+ RGV LC IA A+YP
Sbjct: 304 EEGYIRMQRGVDSEEALCGIAVQASYP 330
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 118/321 (36%), Positives = 167/321 (52%), Gaps = 55/321 (17%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTG 65
WM + R Y + E R++ FK+N +F L L KFADLT E++ Y G
Sbjct: 36 WMRKHDRAYSHE-EFTDRYQAFKENMDFIHKWNSQESDTVLGLTKFADLTNEEYKKHYLG 94
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSK--MSFY-----DSIDWNERGAVTPVKDQGSY-CCWA 117
K N KNLN+++ + F+ DSIDW E+GAV+ VKDQG CW+
Sbjct: 95 IKV-----------NVKKNLNAAQKGLKFFKFTGPDSIDWREKGAVSQVKDQGQCGSCWS 143
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+ VEG ++I++G +V+ S+ LVDCS GC + NAFEYI +A+E
Sbjct: 144 FSTTGAVEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIATES 203
Query: 175 VYPY---QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
YPY QGR C + +S I GY+ + E+ L +++QPVSVAIDA+
Sbjct: 204 SYPYTAAQGR----CKFTKSMNGAN---IIGYKEIPQGEEDSLTAALAKQPVSVAIDASH 256
Query: 232 FNF--YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
+F Y GV+ P ++ +HGV VGYGT EG+ Y+++KN WG W + G
Sbjct: 257 MSFQLYSSGVYDEPACSSEALDHGVLAVGYGTL---EGKD-YYIIKNSWGPTWGQDG--Y 310
Query: 288 IFRGVGGSGLCNIAANAAYPL 308
IF C +A A+YP+
Sbjct: 311 IFMSRNAQNQCGVATMASYPI 331
>gi|2098464|pdb|1PCI|A Chain A, Procaricain
gi|2098465|pdb|1PCI|B Chain B, Procaricain
gi|2098466|pdb|1PCI|C Chain C, Procaricain
Length = 322
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 112/307 (36%), Positives = 161/307 (52%), Gaps = 29/307 (9%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTG 65
WM+ + Y++ EK RF+IFK N + L LN+FADL+ ++F Y G
Sbjct: 25 WMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKKNNSYWLGLNEFADLSNDEFNEKYVG 84
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
T + + +N ++ +++DW ++GAVTPV+ QGS CWAF+AVATV
Sbjct: 85 SLIDATIEQSYDE----EFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATV 140
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
EG+NKIRTG+LV S+ +LVDC + GC + A EY+ + + YPY+ +Q
Sbjct: 141 EGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAK-NGIHLRSKYPYKAKQG 199
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFT 241
C G G VQP E L + +++QPVSV +++ F Y GG+F
Sbjct: 200 -TCR--AKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFE 256
Query: 242 GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNI 300
GPCG + VT V G + Y L+KN WGT W E G +RI R G S G+C +
Sbjct: 257 GPCGTKVDGAVTAV----GYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGL 312
Query: 301 AANAAYP 307
++ YP
Sbjct: 313 YKSSYYP 319
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 168 bits (425), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 111/324 (34%), Positives = 171/324 (52%), Gaps = 41/324 (12%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTR 56
E+W +E +TY+D+ E+ R KIF +N H+ + +NK+AD+
Sbjct: 25 EEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHNQRYATGEVTFKMAVNKYADMLH 84
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNW--FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC 114
+F + G+ ++ ++ ++ + + S+DW E+GAVT VKDQG +C
Sbjct: 85 HEFRETMNGFNYTLHKELRASDPSFTGITFISPAHVKLPKSVDWREKGAVTAVKDQG-HC 143
Query: 115 --CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQR 169
CWAF++ +EG + +TG LV+ S+ LVDCS NGC ++NAF YI+
Sbjct: 144 GSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAID 228
+ +E YPY+G D C + + S RG+ + E+ + + V+ PVSVAID
Sbjct: 204 IDTEKSYPYEGIDD-SCHFNKDSVG---ATDRGFADIPQGNEKKMAEAVATIGPVSVAID 259
Query: 229 ATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
A+ F FY G++ P N+ N HGV +VGYGT E + YWLVKN WGT W + G
Sbjct: 260 ASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGTD---ESGKDYWLVKNSWGTTWGDKG 316
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
+++ R C IA+ ++YPL
Sbjct: 317 FIKMARNEDNQ--CGIASASSYPL 338
>gi|297727243|ref|NP_001175985.1| Os09g0564600 [Oryza sativa Japonica Group]
gi|52076124|dbj|BAD46637.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|255679140|dbj|BAH94713.1| Os09g0564600 [Oryza sativa Japonica Group]
Length = 369
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 115/322 (35%), Positives = 161/322 (50%), Gaps = 38/322 (11%)
Query: 15 HEQWMVEFARTYKDQAEKEM---RFKIFKKN----HEF---------LRLNKFADLTREK 58
+E+W +A + +D +M RF+ FK N +EF L LNKF+D++ E+
Sbjct: 43 YERWRRVYASSSQDLPSSDMMKSRFEAFKANARQVNEFNKKEGMSYTLGLNKFSDMSYEE 102
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
F A YTG P S+ L + + DW + AVTPVKDQG CWA
Sbjct: 103 FAAKYTGGMPGSIADDRSSAGAVSCKLREKNVPL--TWDWRDSRAVTPVKDQGPCGSCWA 160
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYP 177
F+ V VE +NKIRTG L+T S+ Q++DCS C + ++AF +I + +
Sbjct: 161 FSVVGAVESINKIRTGILLTLSEQQVLDCSGAGDCVFGYPKDAFNHI-----VNTGVSLD 215
Query: 178 YQGRQDYYCDWWRSSASGKYG-------AIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT 230
+G+ YY + ++ I G + Q E L+ V QPVSV I +
Sbjct: 216 SRGKPPYYPPYEAQKKQCRFDLEKPPFVKIDGICFAQSGDETALKLAVLSQPVSVIIQIS 275
Query: 231 -WFNFYHGGVFTGPCG--NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F+ YHGGVF GPCG NH V +VGYG TT+ YW+VKN WG W E G +R
Sbjct: 276 DRFHSYHGGVFDGPCGTETKDNHVVLVVGYGVTTD---NIKYWIVKNSWGEGWGESGYIR 332
Query: 288 IFRGV-GGSGLCNIAANAAYPL 308
+ R + +G+C I A YP+
Sbjct: 333 MKRDITDKNGICGITTWAMYPV 354
>gi|226531284|ref|NP_001147086.1| thiol protease SEN102 precursor [Zea mays]
gi|195607128|gb|ACG25394.1| thiol protease SEN102 precursor [Zea mays]
Length = 356
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 122/333 (36%), Positives = 169/333 (50%), Gaps = 46/333 (13%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL--------------NKFADLTR 56
+ + + W E+ RTY E + RF I+ +N F++ N+F DLT
Sbjct: 34 LLERFKAWQAEYNRTYATPEEFQQRFMIYSENVRFIKTMNQLSTGSSYELGENQFTDLTE 93
Query: 57 EKFLASYT---GYKPP------PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
E+F +Y +PP PT S N N++ + +S+DW +GAVT V
Sbjct: 94 EEFKDTYLMKLDEQPPAAEAMGPTVGTMSTAG--MSNGNNTGEA-PNSVDWRTKGAVTRV 150
Query: 108 KDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFE 162
KDQ C CWAF VA++EG+++I+TG+LV+ S+ ++VDC NGC +A E
Sbjct: 151 KDQ-QQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDNGCRGGSPRSAME 209
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYG----AIRGYQYVQPATEEGLQDVV 218
++ + L +E YPY G Q R SGK G IRGYQ VQ E L+ V
Sbjct: 210 WVTRNGGLTTESDYPYVGSQ-------RQCMSGKLGHHAARIRGYQAVQRNNEAELERAV 262
Query: 219 SRQPVSVAIDAT-WFNFYHGGVFTGPCGNTPNHGVTIV-GYGTTTEAEGQQPYWLVKNRW 276
+ +PV+V IDA+ F FY GVF+GPC T + V V GYG+T G + YW+VKN W
Sbjct: 263 AERPVAVFIDASRAFQFYKSGVFSGPCDTTTVNHVVTVVGYGSTGSDSGGRKYWIVKNSW 322
Query: 277 GTNWDEGG-SMRIFRGVGGSGLCNIAANAAYPL 308
G W E G R G+C IA YP+
Sbjct: 323 GQGWGENGYVRMARRVRAREGMCAIAIEPYYPV 355
>gi|414887427|tpg|DAA63441.1| TPA: hypothetical protein ZEAMMB73_713985 [Zea mays]
Length = 355
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 111/312 (35%), Positives = 159/312 (50%), Gaps = 41/312 (13%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNK-------------FADLTREKFLASYT 64
W + R+Y AE+ RF+++++N E + F DLT E+FLA++T
Sbjct: 43 WQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRAELSYQLSETPFTDLTSEEFLATHT 102
Query: 65 G------------YKPPPTDH--PHSNRS-NWFKNLNSSKMSFYDSIDWNERGAVTPVKD 109
++ T H P S+ W + ++ + +S+DW +GAVT VKD
Sbjct: 103 MSTRLHASEAARRHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESVDWRTKGAVTTVKD 162
Query: 110 QGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIR 165
QG+ C CW+F VA +EGL+KIRTGQLV+ S+ +++DCS+ NGC A +++
Sbjct: 163 QGA-CGGCWSFATVAAIEGLHKIRTGQLVSLSEQEVLDCSSPPNNGCHGGNPAAAIDWVS 221
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
L +E YPY+GRQ C A IRG + V E L+ V++QPV+V
Sbjct: 222 ANGGLTTESDYPYEGRQG-KCKL--DKARNHVAKIRGRKLVDQNNEAALEVAVAQQPVAV 278
Query: 226 AIDATWF-NFYHGGVFTGPCG-NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
++ Y GVF GPC NH VT+VGYG + G + YW+VKN WG W E
Sbjct: 279 GMNVHPIQQHYKSGVFHGPCDPEDLNHAVTMVGYGAES---GGRKYWIVKNSWGEKWGEK 335
Query: 284 GSMRIFRGVGGS 295
G R F G S
Sbjct: 336 GYFRGFASRGAS 347
>gi|357153071|ref|XP_003576329.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 398
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 113/348 (32%), Positives = 162/348 (46%), Gaps = 49/348 (14%)
Query: 3 RTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL-------------- 48
R H + + + WM R+Y E RF+++K N ++
Sbjct: 50 RDKHNDLLMMGRFQGWMAAQGRSYWTAEETARRFEVYKSNVRYIEAVNAEAATTGLTFEL 109
Query: 49 --NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD------------ 94
F DLT E+F A Y G PPP + + + + ++ + D
Sbjct: 110 GEGPFTDLTHEEFSALYNGSMPPPEEEEGDDIQEEDEQVIATVVDGVDVNVAVHTNLSAG 169
Query: 95 --------SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVD 145
S DW + GAVTP+KDQG CWAF VAT+EG +KI G LV+ S+ QL+D
Sbjct: 170 GPRPWPPRSRDWRKHGAVTPIKDQGRCGSCWAFPTVATIEGKHKIVRGNLVSLSEQQLID 229
Query: 146 CSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQ 204
C N GC F+ A+ +IR+ L + YPY+G + I G++
Sbjct: 230 CDYTNSGCKGGFVIRAYRWIRKIGGLTTSSAYPYKGARGKCM-----KRRRAAARIAGWR 284
Query: 205 YVQPATEEGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTP-NHGVTIVGYGTTT 261
V+ +E L + V+ QPV+V I A+ NF H G+ GPC NH VT+VGYG
Sbjct: 285 SVRSRSEVALVNAVAGQPVAVYISASGKNFQHYKKGILNGPCDTARLNHAVTVVGYG--R 342
Query: 262 EAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
+A+ YW+VKN WGT W + G + + RG G C IA + +PL
Sbjct: 343 QADTGAKYWIVKNSWGTTWGQEGYILMKRGTRNPRGQCGIATSPVFPL 390
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 167 bits (424), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 112/313 (35%), Positives = 163/313 (52%), Gaps = 39/313 (12%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASYT 64
+W + + Y E+ +R+ I+K N L +N+F D+T +F +
Sbjct: 29 RWKMAHNKAYSHDGEETVRYTIWKDNERRIREHNLQGGDFLLEMNQFGDMTNNEF-KDFN 87
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
GY H H + S + L + DS+DW G VTPVKDQG CWAF+ +
Sbjct: 88 GY----LSHKHVSGSTF---LTPNSFVAPDSVDWRNEGYVTPVKDQGQCGSCWAFSTTGS 140
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
+EG N +TG+LV+ S+ LVDCST NGC ++NAF YI++ + SE YPY
Sbjct: 141 LEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCNGGLMDNAFTYIKENNGIDSEASYPYTA 200
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQD-VVSRQPVSVAIDATWFN--FYHG 237
+ D C + + + + G+ + E L++ V S P+SVAIDA+ F+ FY
Sbjct: 201 K-DGKCAFTKPNVA---ATDTGFVDIPSGDENKLKEAVASVGPISVAIDASHFSFQFYRK 256
Query: 238 GVFTG-PCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
GV+ C +T +HGV +VGYGT E + YWLVKN W T+W + G +++ R
Sbjct: 257 GVYNERKCSSTELDHGVLVVGYGT----ESGKDYWLVKNSWNTSWGDKGYIKMSRNAKNQ 312
Query: 296 GLCNIAANAAYPL 308
C IA NA+YPL
Sbjct: 313 --CGIATNASYPL 323
>gi|293334761|ref|NP_001168296.1| uncharacterized protein LOC100382061 [Zea mays]
gi|223947281|gb|ACN27724.1| unknown [Zea mays]
Length = 322
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 111/312 (35%), Positives = 159/312 (50%), Gaps = 41/312 (13%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNK-------------FADLTREKFLASYT 64
W + R+Y AE+ RF+++++N E + F DLT E+FLA++T
Sbjct: 10 WQATYNRSYLTAAERLRRFEVYRQNMELIEATNRRAELSYQLSETPFTDLTSEEFLATHT 69
Query: 65 ------------GYKPPPTDH--PHSNRS-NWFKNLNSSKMSFYDSIDWNERGAVTPVKD 109
++ T H P S+ W + ++ + +S+DW +GAVT VKD
Sbjct: 70 MSTRLHASEAARRHRELITTHAGPVSDGGRQWNRRNYTTDLDVPESVDWRTKGAVTTVKD 129
Query: 110 QGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIR 165
QG+ C CW+F VA +EGL+KIRTGQLV+ S+ +++DCS+ NGC A +++
Sbjct: 130 QGA-CGGCWSFATVAAIEGLHKIRTGQLVSLSEQEVLDCSSPPNNGCHGGNPAAAIDWVS 188
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
L +E YPY+GRQ C A IRG + V E L+ V++QPV+V
Sbjct: 189 ANGGLTTESDYPYEGRQG-KCKL--DKARNHVAKIRGRKLVDQNNEAALEVAVAQQPVAV 245
Query: 226 AIDATWF-NFYHGGVFTGPCG-NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
++ Y GVF GPC NH VT+VGYG + G + YW+VKN WG W E
Sbjct: 246 GMNVHPIQQHYKSGVFHGPCDPEDLNHAVTMVGYGAES---GGRKYWIVKNSWGEKWGEK 302
Query: 284 GSMRIFRGVGGS 295
G R F G S
Sbjct: 303 GYFRGFASRGAS 314
>gi|359806985|ref|NP_001241331.1| uncharacterized protein LOC100811719 precursor [Glycine max]
gi|255645733|gb|ACU23360.1| unknown [Glycine max]
Length = 362
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 118/323 (36%), Positives = 163/323 (50%), Gaps = 46/323 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
+ W E R Y +Q EK RF+IF+ N + L LNKFAD++ E+F
Sbjct: 46 QAWQKEHKREYGNQEEKAKRFQIFQSNLRYINEMNAKRKSPTTQHRLGLNKFADMSPEEF 105
Query: 60 LASYTGYKPPPTDHPHSN---RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYCC- 115
+ +Y + P+SN R K ++ + S+DW ++GAVT V+DQG C
Sbjct: 106 MKTYL----KEIEMPYSNLESRKKLQKGDDADCDNLPHSVDWRDKGAVTEVRDQGK--CQ 159
Query: 116 --WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLAS 172
WAF+ +EG+NKI TG LV+ S Q+VDC + GCA F NAF Y+ + + +
Sbjct: 160 SHWAFSVTGAIEGINKIVTGNLVSLSVQQVVDCDPASHGCAGGFYFNAFGYVIENGGIDT 219
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY Q+ C +++A+ V P EE L VS+QPVSV+IDAT
Sbjct: 220 EAHYPYTA-QNGTC---KANANKVVSIDNLLVVVGP--EEALLCRVSKQPVSVSIDATGL 273
Query: 233 NFYHGGVFTGP-CGNTPNHGV---TIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
FY GGV+ G C IVGYG+ G + YW+VKN WG +W E G + I
Sbjct: 274 QFYAGGVYGGENCSKNSTKATLVCLIVGYGSV----GGEDYWIVKNSWGKDWGEEGYLLI 329
Query: 289 FRGVGGS---GLCNIAANAAYPL 308
R V G+C I A +P+
Sbjct: 330 KRNVSDEWPYGVCAINAAPGFPI 352
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 167 bits (424), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 110/296 (37%), Positives = 158/296 (53%), Gaps = 30/296 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E +V+ ++ Y+ EK RF+IF N + +L LN+FADLT E+F +
Sbjct: 50 ESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVSNYWLGLNEFADLTHEEFKNKF 109
Query: 64 TGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
G+K + + + +++ + S+DW ++GAV+PVK+QG CWAF+ V
Sbjct: 110 LGFKGELAERKDESIEQFRYRDF----VDLPKSVDWRKKGAVSPVKNQGQCGSCWAFSTV 165
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
A VEG+N+I TG L S+ +L+DC T NGC ++ AF Y+ + L E YPY
Sbjct: 166 AAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMDYAFAYVTR-NGLHKEEEYPYI 224
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
+ CD R AS K I GY V E+ ++ QP+SVAI+A+ F FY G
Sbjct: 225 MSEG-TCDEKR-DASEKV-TISGYHDVPRNNEDSFLKALANQPISVAIEASGRDFQFYSG 281
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
GVF G CG +HGV VGYGT+ + Y +V+N WG W E G +R+ R G
Sbjct: 282 GVFDGHCGTELDHGVAAVGYGTSKGLD----YVIVRNSWGPKWGEKGYIRMKRNTG 333
>gi|313507179|pdb|2ACT|A Chain A, Crystallographic Refinement Of The Structure Of Actinidin
At 1.7 Angstroms Resolution By Fast Fourier
Least-Squares Methods
Length = 220
Score = 167 bits (423), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 97/220 (44%), Positives = 126/220 (57%), Gaps = 15/220 (6%)
Query: 96 IDWNERGAVTPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLN 150
+DW GAV +K QG C WAF+A+ATVEG+NKI +G L++ S+ +L+DC
Sbjct: 5 VDWRSAGAVVDIKSQGE-CGGXWAFSAIATVEGINKITSGSLISLSEQELIDCGRTQNTR 63
Query: 151 GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT 210
GC ++ + F++I + +E YPY QD CD + KY I Y+ V
Sbjct: 64 GCDGGYITDGFQFIINDGGINTEENYPYT-AQDGDCD--VALQDQKYVTIDTYENVPYNN 120
Query: 211 EEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQP 268
E LQ V+ QPVSVA+DA F Y G+FTGPCG +H + IVGYGT EG
Sbjct: 121 EWALQTAVTYQPVSVALDAAGDAFKQYASGIFTGPCGTAVDHAIVIVGYGT----EGGVD 176
Query: 269 YWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
YW+VKN W T W E G MRI R VGG+G C IA +YP+
Sbjct: 177 YWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 216
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 108/323 (33%), Positives = 164/323 (50%), Gaps = 38/323 (11%)
Query: 15 HEQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLT 55
EQW V+ + Y+ + E+ R KIF K N F L +NK+ DL
Sbjct: 24 QEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVAKHNKLFEQGLYPYKLAMNKYGDLL 83
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC- 114
+F+ G+ T + + + + D++DW + GAVTPVKDQG +C
Sbjct: 84 HHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVDIPDTVDWRQEGAVTPVKDQG-HCG 142
Query: 115 -CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRL 170
CW+F+A +EG + +T +LV+ S+ LVDCS+ NGC ++NAF YI+ +
Sbjct: 143 SCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRFGNNGCNGGLMDNAFRYIKNNGGI 202
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA 229
+E YPY G + + R SA + +G+ + E+ L+ V+ P+S+AIDA
Sbjct: 203 DTEAAYPYMGEDEKF----RYSAKNRGATDKGFVDIPSGDEDKLKAAVATVGPISIAIDA 258
Query: 230 TW--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+ F Y GV++ P C +T +HGV +VGYGT + + YWLVKN WG W G
Sbjct: 259 SHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGT--DEKTGMDYWLVKNSWGDTWGLDGY 316
Query: 286 MRIFRGVGGSGLCNIAANAAYPL 308
+++ R C +A A+YPL
Sbjct: 317 IKMARNQDNQ--CGVATQASYPL 337
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 107/310 (34%), Positives = 162/310 (52%), Gaps = 33/310 (10%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKFLASYTG 65
W ++Y D E+ R I+++N E ++ +N DLT ++F Y G
Sbjct: 30 WKSYHGKSYSDVHEERTRMAIWQQNLEKIKRHNAEDHSYKMAMNHLGDLTEDEFRYFYLG 89
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
+ H +S + W + S + S+DW+++G VT VK+QG CWAF+ +V
Sbjct: 90 VRA----HHNSTKRGWATYMPPSNVKIPSSVDWSQKGYVTGVKNQGQCGSCWAFSTTGSV 145
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
EG + +TG LV+ S+ L+DCS NGC ++NAF YI + +E YPY G+
Sbjct: 146 EGQHFRKTGSLVSLSEQNLIDCSGSYGNNGCQGGLMDNAFRYIESNGGIDTESSYPYLGQ 205
Query: 182 QDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATWFNFYHGGVF 240
Q C + S + + GYQ + +E+ LQ V+ PVSVA+DA+ + FY GV+
Sbjct: 206 QG-SCHFSSSHVGAR---VTGYQDIPQGSEQALQSAVATVGPVSVAVDASQWQFYSSGVY 261
Query: 241 TGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLC 298
P C +T +HGV ++GYG Q YWLVKN WG +W G + + R + C
Sbjct: 262 DNPYCSSTQLDHGVLVIGYGNYNG----QDYWLVKNSWGYSWGVEGYIMMSR--NKNNQC 315
Query: 299 NIAANAAYPL 308
IA++A+YPL
Sbjct: 316 GIASSASYPL 325
>gi|198427748|ref|XP_002130282.1| PREDICTED: similar to predicted protein [Ciona intestinalis]
Length = 340
Score = 167 bits (422), Expect = 6e-39, Method: Compositional matrix adjust.
Identities = 107/306 (34%), Positives = 168/306 (54%), Gaps = 22/306 (7%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNKFADLTREKFLASYTGYKPPPTDH 73
+ EQ M + + +E M++ + +K++ L +N++ DLT E+F + GY+
Sbjct: 45 EEEQKMATWFNNWNKISEHNMQYSLKQKSYR-LEMNEYGDLTSEEFSSMMNGYRNDIRLK 103
Query: 74 PHSNRSNWFKNLNS--SKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKI 130
S + + NL S S++ +DW + G VTPVK+QG CW+F+A ++EG +K
Sbjct: 104 RKSTGGSTYLNLLSFGSQIQLPTLVDWRKHGLVTPVKNQGQCGSCWSFSATGSLEGQHKK 163
Query: 131 RTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCD 187
+TG+LV+ S+ L+DCST +GC ++ AF+YI+ + +E YPY+ + D C
Sbjct: 164 KTGKLVSLSEQNLIDCSTPEGNDGCNGGLMDQAFKYIKIQGGIDTEAYYPYEAKDD-TC- 221
Query: 188 WWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNFYHGGVFT-GP 243
R + + G+ ++ EE L++ + P+SVAIDA T F FY GV++
Sbjct: 222 --RFNITDSGATDTGFVDIKSGDEEMLKEAAATVGPISVAIDASHTSFQFYSNGVYSETA 279
Query: 244 CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAA 302
C +T +HGV +VGYGT E + YWLVKN WG W E G +++ R C IA
Sbjct: 280 CSSTMLDHGVLVVGYGT----ENGKDYWLVKNSWGEGWGEAGYIKMSRNADNQ--CGIAT 333
Query: 303 NAAYPL 308
A+YPL
Sbjct: 334 QASYPL 339
>gi|413953050|gb|AFW85699.1| hypothetical protein ZEAMMB73_033873 [Zea mays]
Length = 361
Score = 166 bits (421), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 116/334 (34%), Positives = 169/334 (50%), Gaps = 49/334 (14%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL--------------NKFADLTR 56
+ + + W E+ RTY E + RF ++ +N F++ N+F DLT
Sbjct: 36 LLERFKAWQAEYNRTYATPEEFQQRFMVYSENLRFIKTMNQLSTGSSYELGENQFTDLTE 95
Query: 57 EKFLASYT---GYKPPPTDHPHSNRSNWFKNLNSSKMSFYD-------SIDWNERGAVTP 106
E+F +Y +PP + ++++ MS D S+DW +GAVTP
Sbjct: 96 EEFKDTYLMKLDEQPPAAEA----MPPIVGTMSTAGMSNGDNTGEAPNSVDWRTKGAVTP 151
Query: 107 VKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAF 161
VK+Q C CWAF VA++EG+++I+TG+LV+ S+ ++VDC +GC + +A
Sbjct: 152 VKNQ-QQCGSCWAFATVASIEGVHQIKTGRLVSLSEQEIVDCDRGGNDHGCRGGYPRSAM 210
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYG----AIRGYQYVQPATEEGLQDV 217
E++ + L +E YPY G Q R SGK G IRGYQ VQ E L+
Sbjct: 211 EWVTRNGGLTTESDYPYVGSQ-------RQCMSGKLGHHAARIRGYQAVQRKNEAELERA 263
Query: 218 VSRQPVSVAIDAT-WFNFYHGGVFTGPCGNTP-NHGVTIV-GYGTTTEAEGQQPYWLVKN 274
V+ +PV+V IDA+ F FY GVF+GPC T NH VT+V +++ G + YW+VKN
Sbjct: 264 VAGRPVAVVIDASRAFQFYKRGVFSGPCNTTTVNHAVTVVGYGSAGSDSGGGRKYWIVKN 323
Query: 275 RWGTNWDEGG-SMRIFRGVGGSGLCNIAANAAYP 307
WG W E G R G+C IA P
Sbjct: 324 SWGQRWGENGYVRMARRVRAREGMCAIAIEPLLP 357
>gi|313118762|gb|ADR32293.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 129/220 (58%), Gaps = 13/220 (5%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNG 151
S+DW ++G + VKDQGS CWAF+AVA +E +N I TG L++ S+ +LVDC S G
Sbjct: 4 SVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYNEG 63
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AFE++ + SE YPY+ R D CD +R +A K I Y+ V E
Sbjct: 64 CDGGLMDYAFEFVINNGGIDSEEDYPYKERND-VCDQYRKNA--KVVKIDSYEDVPVNNE 120
Query: 212 EGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ LQ V+ QPVS+A++A +F H G+FTG CG +HGV GYGT E Y
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT----ENGMDY 176
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W+V+N WG W E G +R+ R + SGLC +A +YP+
Sbjct: 177 WIVRNSWGAKWGEKGYLRVQRNIARSSGLCGLATEPSYPV 216
>gi|313118766|gb|ADR32295.1| C14 cysteine protease [Solanum demissum]
gi|313118774|gb|ADR32299.1| C14 cysteine protease [Solanum verrucosum]
gi|313118776|gb|ADR32300.1| C14 cysteine protease [Solanum verrucosum]
gi|313118778|gb|ADR32301.1| C14 cysteine protease [Solanum verrucosum]
Length = 217
Score = 166 bits (421), Expect = 9e-39, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 129/220 (58%), Gaps = 13/220 (5%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNG 151
S+DW ++G + VKDQGS CWAF+AVA +E +N I TG L++ S+ +LVDC S G
Sbjct: 4 SVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYNEG 63
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AFE++ + SE YPY+ R D CD +R +A K I Y+ V E
Sbjct: 64 CDGGLMDYAFEFVINNGGIDSEEDYPYKERND-VCDQYRKNA--KVVKIDSYEDVPVNNE 120
Query: 212 EGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ LQ V+ QPVS+A++A +F H G+FTG CG +HGV GYGT E Y
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT----ENGMDY 176
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W+V+N WG W E G +R+ R + SGLC +A +YP+
Sbjct: 177 WIVRNSWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPV 216
>gi|161172356|pdb|3BCN|A Chain A, Crystal Structure Of A Papain-Like Cysteine Protease
Ervatamin-A Complexed With Irreversible Inhibitor E-64
gi|161172357|pdb|3BCN|B Chain B, Crystal Structure Of A Papain-Like Cysteine Protease
Ervatamin-A Complexed With Irreversible Inhibitor E-64
Length = 209
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 96/218 (44%), Positives = 126/218 (57%), Gaps = 19/218 (8%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-G 151
+ +DW +GAV P+K+QG CWAF+ V TVE +N+IRTG L++ S+ QLVDCS N G
Sbjct: 3 EHVDWRAKGAVIPLKNQGKCGSCWAFSTVTTVESINQIRTGNLISLSEQQLVDCSKKNHG 62
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C + + A++YI + +E YPY+ Q A+ K I G + V E
Sbjct: 63 CKGGYFDRAYQYIIANGGIDTEANYPYKAFQG------PCRAAKKVVRIDGCKGVPQCNE 116
Query: 212 EGLQDVVSRQPVSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
L++ V+ QP VAIDA+ F Y GG+FTGPCG NHGV IVGYG + Y
Sbjct: 117 NALKNAVASQPSVVAIDASSKQFQHYKGGIFTGPCGTKLNHGVVIVGYG--------KDY 168
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
W+V+N WG +W E G R+ R VGG GLC IA YP
Sbjct: 169 WIVRNSWGRHWGEQGYTRMKR-VGGCGLCGIARLPFYP 205
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 102/285 (35%), Positives = 151/285 (52%), Gaps = 30/285 (10%)
Query: 22 FARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYTGYKPP 69
+ ++Y + E + R+ IFK N + L++N F DL+RE+F Y GY
Sbjct: 126 YGKSYATEEETQKRYAIFKNNLAYIHTHNQQGYSYSLKMNHFGDLSREEFRRKYLGYNKS 185
Query: 70 PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYCCWAFTAVATVEG 126
+ +N + L S ++DW E+G VTPVKDQ GS CWAF+A +EG
Sbjct: 186 -RNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGS--CWAFSATGALEG 242
Query: 127 LNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
+ +TG+L++ S+ +LVDCS GC+ + +AF+Y+ L SE YPY R D
Sbjct: 243 AHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPYLAR-D 301
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFT 241
C A K I G++ V +E ++ ++ PVS+AI+A F FYH GVF
Sbjct: 302 GEC----KRACKKVVTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGVFD 357
Query: 242 GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
CG +HGV +VGYG T+ E ++ +W++KN WG+ W G M
Sbjct: 358 ASCGTDLDHGVLLVGYG--TDKETKKDFWIMKNSWGSGWGRDGYM 400
>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 400
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 103/288 (35%), Positives = 154/288 (53%), Gaps = 27/288 (9%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREK 58
+ +HE+WM ++ + Y+D AE E RF+IFK N +F +R+N+F DL E+
Sbjct: 112 SERHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFNVAGDKPFNIRINQFPDLHDEE 171
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
F A + + + F+ S + ++D ++G VTP+KDQG CWA
Sbjct: 172 FKALLINGQRKVSGVETATEETSFR-YGSVVTNIPATMDGRKKGVVTPIKDQGIIGSCWA 230
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNGCAKNFLENAFEYIRQYQRLASECV 175
+AVA +EG+++I T +L+ SK +LVD GC ++E+AFE+I + + SE
Sbjct: 231 LSAVAAIEGIHQITTSKLMFLSKQKLVDSVKGESEGCIGGYVEDAFEFIVKKGGILSETH 290
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID--ATWFN 233
YPY+G C + + S + I+GY+ V ++ L VV+ QPVSV ID A F
Sbjct: 291 YPYKGVNX--CKVEKETHSVAH--IKGYEKVPSNNKKALLKVVANQPVSVYIDVGAHAFK 346
Query: 234 FYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
+Y +F CG+ PNH V +VGYG + YW VKN WGT W
Sbjct: 347 YYSSEIFNARNCGSDPNHVVAVVGYGKALDG---AKYWPVKNSWGTEW 391
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 110/326 (33%), Positives = 167/326 (51%), Gaps = 44/326 (13%)
Query: 11 IAAKHEQWM---VEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKF 51
I + +QW E R Y E+ R +F++N +F L++N+F
Sbjct: 15 IPSLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQF 74
Query: 52 ADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG 111
D+T E+ +A+ G+ PT P + L + + + +DW +GAVTPVKDQ
Sbjct: 75 GDMTSEEIVATMNGFLGAPTRRPAAV-------LKADDETLPEKVDWRTKGAVTPVKDQK 127
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
CWAF+ ++EG + ++ G+LV+ S+ LVDCS GC ++ AF YI+
Sbjct: 128 QCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKAN 187
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVA 226
+ + +E YPY+ QD C R AS GY V+ +E L+ V+ P+SV
Sbjct: 188 KGIDTEDSYPYEA-QDGKC---RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVG 243
Query: 227 IDATW--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
IDA+ F+FYH GV+ C +T +HGV VGYG+ E +WLVKN W T+W +
Sbjct: 244 IDASQSTFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSD---ENGGDFWLVKNSWNTSWGD 300
Query: 283 GGSMRIFRGVGGSGLCNIAANAAYPL 308
G +++ R + C IA+ A+YPL
Sbjct: 301 KGYIKMSRNRNNN--CGIASQASYPL 324
>gi|115436422|ref|NP_001042969.1| Os01g0347500 [Oryza sativa Japonica Group]
gi|115436426|ref|NP_001042971.1| Os01g0348000 [Oryza sativa Japonica Group]
gi|15290194|dbj|BAB63883.1| putative SAG12 protein [Oryza sativa Japonica Group]
gi|15290200|dbj|BAB63889.1| putative SAG12 protein [Oryza sativa Japonica Group]
gi|21104809|dbj|BAB93394.1| putative SAG12 protein [Oryza sativa Japonica Group]
gi|113532500|dbj|BAF04883.1| Os01g0347500 [Oryza sativa Japonica Group]
gi|113532502|dbj|BAF04885.1| Os01g0348000 [Oryza sativa Japonica Group]
gi|125570283|gb|EAZ11798.1| hypothetical protein OsJ_01672 [Oryza sativa Japonica Group]
Length = 361
Score = 166 bits (420), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 117/334 (35%), Positives = 164/334 (49%), Gaps = 66/334 (19%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHEFLR--------------------------LNK 50
QWM ++A+ Y E+E R++++K N F+ +N+
Sbjct: 49 QWMAKYAKHYSCPEEQEKRYQVWKGNTNFIGAFRSQTQLSSGVGAFAPQTITDSVVGMNR 108
Query: 51 FADLTREKFLASYTGY------KPPPTD-HPHSNRSNWFKNLNSSKMSFYDSIDWNERGA 103
F DLT +F+ +TG+ PPPT PHS + +DW GA
Sbjct: 109 FGDLTSTEFVQQFTGFNASGFHSPPPTPISPHSWQPC--------------CVDWRSSGA 154
Query: 104 VTPVKDQGSYC-CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAF 161
VT VK QG+ CWAF + A +EGL+KI+TG+LV+ S+ +VDC T + GC+ + A
Sbjct: 155 VTGVKFQGNCASCWAFASAAAIEGLHKIKTGELVSLSEQVMVDCDTGSFGCSGGHSDTAL 214
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCD----WWRSSASGKYGAIRGYQYVQPATEEGLQDV 217
+ + SE YPY G Q CD + SAS + G+ V P E L
Sbjct: 215 NLVASRGGITSEEKYPYTGVQG-SCDVGKLLFDHSAS-----VSGFAAVPPNDERQLALA 268
Query: 218 VSRQPVSVAIDATW--FNFYHGGVFTGPCG-NTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274
V+RQPV+V IDA+ F FY GGV+ GPC + NH VTIVGY E G + YW+ KN
Sbjct: 269 VARQPVTVYIDASAQEFQFYKGGVYKGPCNPGSVNHAVTIVGY---CENFGGEKYWIAKN 325
Query: 275 RWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYP 307
W +W E G + + + V G C +A + YP
Sbjct: 326 SWSNDWGEQGYVYLAKDVWWPQGTCGLATSPFYP 359
>gi|326431661|gb|EGD77231.1| cysteine protease [Salpingoeca sp. ATCC 50818]
Length = 347
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 109/332 (32%), Positives = 162/332 (48%), Gaps = 38/332 (11%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
M+ S A E++ ++ + Y+ E+ R IF+++ +F+
Sbjct: 17 MTTVSAAPTPSAMTFEEFKDKYNKVYESAEEEARRAAIFQESLDFIEKHNAEAAAGMHTY 76
Query: 48 ---LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDS------IDW 98
+N+FADLTRE+F + P D + +L+ + DS IDW
Sbjct: 77 LVGVNEFADLTREEFRQHHVTRLP--FDDDKRDPVTATLHLDEHAVHAADSNGDSSGIDW 134
Query: 99 NERGAVTPVKDQGSYCCWA-FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFL 157
+RGAVTPV++QG A F AV VEG++ I +G LV S Q++DCS GC+ L
Sbjct: 135 RKRGAVTPVRNQGQCGNPAIFAAVEAVEGMHAISSGNLVELSTQQVIDCSGTPGCSGGSL 194
Query: 158 ENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDV 217
+ F+YI + L S YP G C+ ++ + + GY V P E L
Sbjct: 195 VSFFKYIARNGGLDSAADYPTSGAGGQ-CN--KAKEARHVAKVGGYSVVPPRNETKLAAA 251
Query: 218 VSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNR 275
V + PV+VAI+A F Y GV++GPCG +H V +VGY YW+VKN
Sbjct: 252 VFKMPVAVAIEADTPSFQMYTSGVYSGPCGTQLDHAVLVVGY--------TDEYWIVKNS 303
Query: 276 WGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
WG +W + G + + RGVG +G+C I +A YP
Sbjct: 304 WGASWGDQGYIMMKRGVGAAGICGITLDAMYP 335
>gi|148224682|ref|NP_001086670.1| cathepsin S [Xenopus laevis]
gi|50418223|gb|AAH77285.1| Ctss-prov protein [Xenopus laevis]
Length = 320
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 110/316 (34%), Positives = 162/316 (51%), Gaps = 40/316 (12%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKFLA 61
W + + Y+D++E +R ++KN H + L +N AD+T E+ +
Sbjct: 17 WKNKHTKEYEDESEDLLRRITWEKNLNTVNMHNLEYSMGMHTYELGMNHLADMTSEEIKS 76
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKM--SFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
TG PP HS R F + +S + DSIDW E+G V+ VK+QG CWAF
Sbjct: 77 KMTGLILPP----HSERKATFSSQKNSTLGGKVPDSIDWREKGCVSEVKNQGGCGSCWAF 132
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+AV +EG ++TG++V+ S LVDCS+ GC+ F+ AF+Y+ + S+
Sbjct: 133 SAVGALEGQLMLKTGKIVSLSPQNLVDCSSKYGNKGCSGGFMTRAFQYVIDNNGIDSDTY 192
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT--WF 232
YPY D C + +GK + Y+ + P TE+ L+ + P+SVAID T F
Sbjct: 193 YPYHA-MDEKCHY---ELAGKASSCVKYREIVPGTEDNLKQALGNIGPISVAIDGTRPTF 248
Query: 233 NFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
Y GV++ P C NHGV VGYGT Q +WL+KN WGT + + G +RI R
Sbjct: 249 FLYKSGVYSDPSCSQEVNHGVLAVGYGTLN----GQDFWLLKNSWGTKYGDQGYVRIAR- 303
Query: 292 VGGSGLCNIAANAAYP 307
LC +A+ +YP
Sbjct: 304 -NKENLCGVASYTSYP 318
>gi|46576373|sp|P83654.1|ERVC_TABDI RecName: Full=Ervatamin-C; Short=ERV-C
gi|46014979|pdb|1O0E|A Chain A, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
Protease Ervatamin C
gi|46014980|pdb|1O0E|B Chain B, 1.9 Angstrom Crystal Structure Of A Plant Cysteine
Protease Ervatamin C
Length = 208
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 97/218 (44%), Positives = 127/218 (58%), Gaps = 19/218 (8%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-G 151
+ IDW ++GAVTPVK+QGS CWAF+ V+TVE +N+IRTG L++ S+ +LVDC N G
Sbjct: 3 EQIDWRKKGAVTPVKNQGSCGSCWAFSTVSTVESINQIRTGNLISLSEQELVDCDKKNHG 62
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C A++YI + ++ YPY+ Q A+ K +I GY V E
Sbjct: 63 CLGGAFVFAYQYIINNGGIDTQANYPYKAVQG------PCQAASKVVSIDGYNGVPFCNE 116
Query: 212 EGLQDVVSRQPVSVAIDATWFNF--YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
L+ V+ QP +VAIDA+ F Y G+F+GPCG NHGVTIVGY Q Y
Sbjct: 117 XALKQAVAVQPSTVAIDASSAQFQQYSSGIFSGPCGTKLNHGVTIVGY--------QANY 168
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
W+V+N WG W E G +R+ R VGG GLC IA YP
Sbjct: 169 WIVRNSWGRYWGEKGYIRMLR-VGGCGLCGIARLPYYP 205
>gi|118145|sp|P20721.1|CYSPL_SOLLC RecName: Full=Low-temperature-induced cysteine proteinase; Flags:
Precursor
gi|806314|gb|AAA66308.1| thiol protease, partial [Solanum lycopersicum]
Length = 346
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 96/224 (42%), Positives = 132/224 (58%), Gaps = 13/224 (5%)
Query: 91 SFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--S 147
S +SIDW E+G + VKDQGS CWAF+AVA +E +N I TG L++ S+ +LVDC S
Sbjct: 17 SLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDRS 76
Query: 148 TLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQ 207
GC ++ AFE++ + + +E YPY+ R CD +R +A K I Y+ V
Sbjct: 77 YNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNG-VCDQYRKNA--KVVKIDSYEDVP 133
Query: 208 PATEEGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEG 265
E+ LQ V+ QPVS+A++A +F H G+FTG CG +HGV I GYGT E
Sbjct: 134 VNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT----EN 189
Query: 266 QQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
YW+V+N WG N E G +R+ R V SGLC +A +YP+
Sbjct: 190 GMDYWIVRNSWGANCRENGYLRVQRNVSSSSGLCGLAIEPSYPV 233
>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 129/220 (58%), Gaps = 13/220 (5%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNG 151
S+DW ++G + VKDQGS CWAF+AVA +E +N I TG L++ S+ +LVDC S G
Sbjct: 4 SVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGDLISLSEQELVDCDKSYNQG 63
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AFE++ + +E YPY+ R D CD +R +A K I Y+ V E
Sbjct: 64 CDGGLMDYAFEFVINNGGIDTEEDYPYKERND-VCDQYRKNA--KVVKIDSYEDVPVNNE 120
Query: 212 EGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ LQ V+ QPVS+A++A +F H G+FTG CG +HGV GYGT E Y
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT----ENGMDY 176
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W+V+N WG W E G +R+ R + SGLC +A +YP+
Sbjct: 177 WIVRNSWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPV 216
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 111/320 (34%), Positives = 167/320 (52%), Gaps = 34/320 (10%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTR 56
N+ E + E + Y+ E+ MR IF++NH+F L +N F DLT
Sbjct: 76 NLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFIEDHNSKKEFDFYLGMNHFGDLTN 135
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
+++ Y GY+ P +++++ + D IDW ++G VTPVK+QG C
Sbjct: 136 KEYRERYLGYRRPENT---PSKASYIFSRAEKIEDVPDQIDWRDQGFVTPVKNQGQCGSC 192
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
WAF+AV ++EG + TG+LV+ S+ LVDCST +GC +++ AFEY++ + +
Sbjct: 193 WAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNSGCNGGWMDQAFEYVKDNHGIDT 252
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDAT- 230
E YPY G D C + S ++G+ V+ EE L+ V PVSVAIDA+
Sbjct: 253 EDSYPYVG-TDGSCHFKNKSIG---ATLKGFMDVKEGDEEALRQAVGVAGPVSVAIDASS 308
Query: 231 -WFNFYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F FY GGV+ P +T +HGV +VGYG + + +W+VKN WG W G +
Sbjct: 309 MLFQFYRGGVYNVPWCSTSELDHGVLVVGYGKQFQG---KDFWMVKNSWGVGWGIYGYIE 365
Query: 288 IFRGVGGSGLCNIAANAAYP 307
+ R G C IA+ A+ P
Sbjct: 366 MSRNKGNQ--CGIASKASIP 383
>gi|442539990|gb|AGC54590.1| bromelain, partial [Ananas comosus]
Length = 241
Score = 165 bits (418), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 95/231 (41%), Positives = 139/231 (60%), Gaps = 16/231 (6%)
Query: 82 FKNLNSSKMSFYDSIDWNERGAVTPVKDQ---GSYCCWAFTAVATVEGLNKIRTGQLVTR 138
F ++N S + SIDW + GAV VK+Q GS CWAF A+ATVEG+ KI+TG LV+
Sbjct: 5 FDDVNISAVP--QSIDWRDYGAVNEVKNQNPCGS--CWAFAAIATVEGIYKIKTGYLVSL 60
Query: 139 SKHQLVDCSTLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYG 198
S+ +++DC+ GC ++ A+++I + +E YPYQ Q C+ +++
Sbjct: 61 SEQEVLDCAVSYGCKGGWVNKAYDFIISNNGVTTEENYPYQAYQG-TCN---ANSFPNSA 116
Query: 199 AIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFYHGGVFTGPCGNTPNHGVTIVGY 257
I GY YV+ E + VS QP++ IDA+ F +Y+GGVF+GPCG + NH +TI+GY
Sbjct: 117 YITGYSYVRRNDERSMMYAVSNQPIAALIDASENFQYYNGGVFSGPCGTSLNHAITIIGY 176
Query: 258 GTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYP 307
G + YW+V N WG++W EGG +R+ RGV SG C IA + +P
Sbjct: 177 GQDSSG---TKYWIVGNSWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFP 224
>gi|326514800|dbj|BAJ99761.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 291
Score = 165 bits (417), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 100/220 (45%), Positives = 132/220 (60%), Gaps = 13/220 (5%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--G 151
S+DW ++GAVT VKDQG CWAF+ +A VEG+N IRT L + S+ QLVDC T + G
Sbjct: 64 SVDWRQKGAVTAVKDQGQCGSCWAFSTIAAVEGINAIRTKNLTSLSEQQLVDCDTKSNAG 123
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AF+YI ++ +A+E YPY+ RQ C+ S+ I GY+ V E
Sbjct: 124 CNGGLMDYAFQYIAKHGGVAAEDAYPYKARQASSCNKKPSAVV----TIDGYEDVPANDE 179
Query: 212 EGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
L+ V+ QPV+VAI+A+ F FY GVF G CG +HGV VGYGTT + Y
Sbjct: 180 TALKKAVAAQPVAVAIEASGSHFQFYSEGVFAGKCGTELDHGVAAVGYGTTVDG---TKY 236
Query: 270 WLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
W+VKN WG W E G +R+ R V GLC IA A+YP+
Sbjct: 237 WIVKNSWGPEWGEKGYIRMKRDVEDKEGLCGIAMEASYPV 276
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 109/321 (33%), Positives = 165/321 (51%), Gaps = 44/321 (13%)
Query: 16 EQWM---VEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTR 56
+QW E R Y E+ R +F++N +F L++N+F D+T
Sbjct: 21 QQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTS 80
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
E+ +A+ G+ PT P + L + + + +DW +GAVTPVKDQ C
Sbjct: 81 EEIVATMNGFLGAPTRRPAAV-------LKADDETLPEKVDWRTKGAVTPVKDQKQCGSC 133
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLAS 172
WAF+ ++EG + ++ G+LV+ S+ LVDCS GC ++ AF YI+ + + +
Sbjct: 134 WAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGIDT 193
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW 231
E YPY+ QD C R AS GY V+ +E L+ V+ P+SV IDA+
Sbjct: 194 EDSYPYEA-QDGKC---RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQ 249
Query: 232 --FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F+FYH GV+ C +T +HGV VGYG+ E +WLVKN W T+W + G ++
Sbjct: 250 STFHFYHTGVYHDDHCSSTMLDHGVLAVGYGSD---ENGGDFWLVKNSWNTSWGDKGYIK 306
Query: 288 IFRGVGGSGLCNIAANAAYPL 308
+ R + C IA+ A+YPL
Sbjct: 307 MSRNRNNN--CGIASQASYPL 325
>gi|345309264|ref|XP_001507503.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 335
Score = 165 bits (417), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 114/316 (36%), Positives = 157/316 (49%), Gaps = 35/316 (11%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKFL 60
+W V + Y +AE+ R ++KN H + L +N F D T E+
Sbjct: 30 RWKVLHGKNYSVEAEEVFRRAAWEKNVRVIERHNEEMSQGKHSYRLAMNHFGDQTNEELH 89
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFY--DSIDWNERGAVTPVKDQGSYC--CW 116
G++P D + RS + SK S+ + +DW +G VTPVK+QG C CW
Sbjct: 90 ERLNGFRP---DLGGALRSGREQARFRSKTSWEGPEEVDWRTKGYVTPVKNQG-LCGSCW 145
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNG---CAKNFLENAFEYIRQYQRLASE 173
AF+A +E L TG++V+ S+ LVDCS G C AFEY+R + +E
Sbjct: 146 AFSATGALEALVFKTTGKMVSLSEQNLVDCSWRQGNVGCRGGQYIGAFEYVRANGGIDAE 205
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDATWF 232
+YPY GR D C R S GK G Y V E+ L Q V + PVSVA+DA F
Sbjct: 206 DLYPYLGRDDISC---RYSLQGKAGNCTSYMVVDQDNEQALEQAVATVGPVSVAVDARPF 262
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FYH G + C NH + VGYGT+ E G Q YW++KN W W E G MR+ +G
Sbjct: 263 FFYHSG--SSRCTQKVNHAMLAVGYGTSKEPGGGQDYWILKNSWSERWGEQGYMRLLKGA 320
Query: 293 GGSGLCNIAANAAYPL 308
C +A+ A++P+
Sbjct: 321 NNH--CGVASVASFPV 334
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 164 bits (416), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 167/320 (52%), Gaps = 36/320 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E + +E ++ Y + E+ R KIF +N H + L +NK+ D+ +F
Sbjct: 30 EAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDMLHHEF 89
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNL--NSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+++ G++ T +NR+ + ++DW +GAVTP+KDQG CW
Sbjct: 90 VSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQGQCGSCW 149
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
AF+A +EG +TGQLV+ S+ LVDCS NGC ++NAFEY+++ + +E
Sbjct: 150 AFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKENGGIDTE 209
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW- 231
YPY +D C + +A + +G+ V+ +E L+ V+ PVSVAIDA+
Sbjct: 210 ESYPYDA-EDEKCHYNPRAAGAE---DKGFVDVREGSEHALKKAVATVGPVSVAIDASHE 265
Query: 232 -FNFYHGGVFTGP-CG-NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
F FY GV+ P C +HGV +VGYG + YWLVKN WGT W + G +++
Sbjct: 266 SFQFYSHGVYIEPECSPEMLDHGVLVVGYGIDDDG---TDYWLVKNSWGTTWGDQGYVKM 322
Query: 289 FRGVGGSGLCNIAANAAYPL 308
R C IA++A++PL
Sbjct: 323 ARNR--DNQCGIASSASFPL 340
>gi|405971604|gb|EKC36431.1| Cathepsin L [Crassostrea gigas]
Length = 384
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 113/321 (35%), Positives = 173/321 (53%), Gaps = 46/321 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN------HE----------FLRLNKFADLTREKF 59
+++ + ++Y+D E+ RF+IF++N H +L +N+F DL +F
Sbjct: 80 KEFKILHDKSYEDHEEESRRFEIFRENVLRIEKHNKLFHLGKKSYYLGVNQFTDLEYAEF 139
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ ++ G K + N + +L+++ + DS+DW +G VT VK+QG+ CWAF
Sbjct: 140 V-NFNGLK-----MTNLNNTKCSSHLSANNIVVPDSVDWRSKGYVTKVKNQGACGSCWAF 193
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+A ++EG + G+LV S+ QLVDCS GC F+ENAF+Y++ + SE
Sbjct: 194 SATGSLEGQYFRKNGKLVPLSESQLVDCSGSFGNEGCNGGFMENAFKYVKSVGGIESESD 253
Query: 176 YPYQGRQDYYCDWWRSSASGK---YGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA-- 229
YPY+ RQ R+ A K + G V+ +E L++VVS PVSVAIDA
Sbjct: 254 YPYKARQ-------RTCAFDKTKVIATVSGCVDVESGSESSLKEVVSEVGPVSVAIDAGH 306
Query: 230 TWFNFYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
+ F Y GGV+ P +T NHGV VGYGT+ + + YW+VKN WG W G ++
Sbjct: 307 SSFQLYAGGVYDEPLCSTSRLNHGVLCVGYGTSLQG---KDYWIVKNSWGVRWGVEGYIK 363
Query: 288 IFRGVGGSGLCNIAANAAYPL 308
+ R + C IA+ A+YPL
Sbjct: 364 MSR--NKNNQCGIASEASYPL 382
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 108/314 (34%), Positives = 169/314 (53%), Gaps = 29/314 (9%)
Query: 13 AKH-EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR--------LNKFADLTREKFLASY 63
AKH + ++ E ++ + E R KI K N ++ R +N+F D+ +F+++
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G+K D P S + + N S ++DW +GAVTPVK+QG CWAF+A
Sbjct: 92 NGFKRNYKDQPREG-STYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSATG 150
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
++EG + ++G +V+ S+ LV CST NGC +++AF+YIR + + +E YPY
Sbjct: 151 SLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPYN 210
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW--FNFYH 236
G D C + +S+ G+ ++ +E L+ V+ P+SVAIDA+ F FY
Sbjct: 211 G-TDGTCHFKKSTVG---ATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYS 266
Query: 237 GGVFTGP-CGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
GV+ P C + + +HGV +VGYGT + YW VKN WGT W + G +R+ R
Sbjct: 267 DGVYDEPECDSESLDHGVLVVGYGTLNGTD----YWFVKNSWGTTWGDEGYIRMSR--NK 320
Query: 295 SGLCNIAANAAYPL 308
C IA++A+ PL
Sbjct: 321 KNQCGIASSASIPL 334
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 164 bits (416), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 111/312 (35%), Positives = 157/312 (50%), Gaps = 40/312 (12%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYT 64
WM + R+Y E +++ FK N +F L L +FADLT E++ Y
Sbjct: 36 WMKKHDRSYHHH-EFNNKYQAFKDNMDFIHNWNTNKNSKTVLGLTQFADLTNEEYRKIYL 94
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
G K H N N + DSIDW +GAV+ VKDQG CW+F+ +
Sbjct: 95 GTKVNVAPEKH--------NFNMIHFTGPDSIDWRTKGAVSHVKDQGQCGSCWSFSTTGS 146
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG ++I+TG +VT S+ LVDCS NGC + NAF++I +A+E YPY
Sbjct: 147 VEGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFIMSQGGVATEDSYPYNA 206
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHGG 238
Q C + +S I GY+ + +E LQ +++QPVS+AIDA+ F Y G
Sbjct: 207 VQG-KCKFTKSMVGAN---ISGYKEITQGSELELQAALTKQPVSIAIDASQQSFQLYKSG 262
Query: 239 VFTGP-CGN-TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSG 296
V+ P C + +HGV VGYGT E + Y++VKN W +W + G IF
Sbjct: 263 VYDEPECSSYQLDHGVLAVGYGT----ENGKDYYIVKNSWADSWGQDG--YIFMSRNAKN 316
Query: 297 LCNIAANAAYPL 308
C +A A+YP+
Sbjct: 317 QCGVATMASYPI 328
>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
Length = 396
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 163/332 (49%), Gaps = 35/332 (10%)
Query: 7 KTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNK 50
+ I + W+V++ + + E+ R KIF +N+ F+ +NK
Sbjct: 64 RESKIEDAFDAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVEMNK 123
Query: 51 FADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL-NSSKMSFYDSIDWNERGAVTPVKD 109
FA TRE++ G+K S + +L + +SIDW + G +T K+
Sbjct: 124 FAAHTREEY-RKMLGFKKSLRRKKDSGEAAKDVSLWEYEGVEAPESIDWVDEGVITTPKN 182
Query: 110 QGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIR 165
QGS CWAF+A+ VEG+N IRTG+LV+ S+ +LV C+ GC ++NAFE+I
Sbjct: 183 QGSCGSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAFEWIV 242
Query: 166 QYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSV 225
+ + SE Y Y+ D D +I G+ V E L+ VS+QPVSV
Sbjct: 243 ENGGVDSEKQYQYKASFD---DCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVSV 299
Query: 226 AIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAE------GQQPYWLVKNRW 276
AI+A F Y GGV+ CG +HGV +VGYG + + YW +KN W
Sbjct: 300 AIEADQRSFQLYGGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNSW 359
Query: 277 GTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
W EGG +RI R V SG+C +A A+YP
Sbjct: 360 SEQWGEGGYIRIARDVESPSGMCGVAEMASYP 391
>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
Length = 430
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 117/351 (33%), Positives = 170/351 (48%), Gaps = 53/351 (15%)
Query: 3 RTSHKTGN---IAAKHEQWMVE--FARTYKDQAEKEMRFKIFKKNHEFLR---------- 47
R +H + N +A E+W E R +D E R F +N ++
Sbjct: 83 RDAHASSNANALARHFERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGE 142
Query: 48 ------LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFY-------- 93
LN A TRE++ A GYKP + S + + ++ K+ Y
Sbjct: 143 VSHWVGLNSLAATTREEYRA-LLGYKP---ELRSSGDAEMLEATSTDKVEQYKASWEYAS 198
Query: 94 ----DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST 148
++IDW E GAVTP K+QG CWAF+ VEG+ KIRTG+LV+ S+ ++V CS
Sbjct: 199 VDPPEAIDWVELGAVTPPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSK 258
Query: 149 LN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQ 207
N GC ++ AF +I + + SE YPY + C+ W+ I G++ V
Sbjct: 259 QNMGCNGGLMDYAFRWIVKNGGIDSEFQYPYSA-EALACNRWKLQL--HVATIDGFKDVP 315
Query: 208 PATEEGLQDVVSRQPVSVAI--DATWFNFYHGGVF-TGPCGNTPNHGVTIVGYG------ 258
P E+ L+ VS+QPVS+AI D F Y GGV+ + CG+ +HGV +VGYG
Sbjct: 316 PGDEKELEKAVSQQPVSIAIEADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHH 375
Query: 259 -TTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAANAAYP 307
T + + +W VKN WG W EGG +R+ R + +G C I +YP
Sbjct: 376 NATKHHKRHRHFWKVKNSWGGTWGEGGFIRMARRISDETGQCGITTAPSYP 426
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 112/322 (34%), Positives = 164/322 (50%), Gaps = 45/322 (13%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTR 56
+QW E R Y E+ R +F++N +F L++N+F D+T
Sbjct: 22 QQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFGDMTS 81
Query: 57 EKFLASYTGYKPPPTDHPHSN-RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
E+F A+ G+ P+ P + R++ + L +DW +GAVTPVKDQ
Sbjct: 82 EEFTATMNGFLNVPSRRPTAILRADPDETLPKE-------VDWRTKGAVTPVKDQKQCGS 134
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+ ++EG + ++ G+LV+ S+ LVDCS GC ++ AF YI+ + +
Sbjct: 135 CWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKGID 194
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT 230
+E YPY+ QD C R AS GY V+ +E L+ V+ P+SVAIDA+
Sbjct: 195 TEDSYPYEA-QDGKC---RFDASNVGATDTGYVDVEHGSESALKKAVATIGPISVAIDAS 250
Query: 231 W--FNFYHGGVF--TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F FYH GV+ G +HGV VGYG T + E YWLVKN W T+W G +
Sbjct: 251 QPSFQFYHDGVYYEEGCSSTMLDHGVLAVGYGETEKGEA---YWLVKNSWNTSWGNKGYI 307
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
++ R + C IA+ A+YPL
Sbjct: 308 QMSRDKKNN--CGIASQASYPL 327
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 161/322 (50%), Gaps = 37/322 (11%)
Query: 15 HEQW---MVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLT 55
EQW + + Y+ + E+ R KIF +N L +NK+AD+
Sbjct: 24 QEQWGAFKMTHNKQYQSETEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADML 83
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F+ G+ + + L + + IDW ++GAVTPVKDQG
Sbjct: 84 HHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGS 143
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CW+F+A ++EG + ++G+LV+ S+ LVDCS NGC ++NAF YI+ +
Sbjct: 144 CWSFSATGSLEGQHFRQSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGID 203
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT 230
+E YPY+ +D C + K RGY ++ E+ LQ V+ PVSVAIDA+
Sbjct: 204 TEQAYPYKA-EDEKCHY---KPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAIDAS 259
Query: 231 W--FNFYHGGVFTGP--CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GGV+ P + +HGV +VGYGT + YWLVKN WG +W + G +
Sbjct: 260 HQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDG---TDYWLVKNSWGKSWGDQGYI 316
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
++ R + C IA A+YPL
Sbjct: 317 KMARNRNNN--CGIATEASYPL 336
>gi|66475996|ref|XP_627814.1| cryptopain - cysteine proteinase secreted, possible transmembrane
domain near N-terminus [Cryptosporidium parvum Iowa II]
gi|32399065|emb|CAD98305.1| cryptopain precursor [Cryptosporidium parvum]
gi|46229218|gb|EAK90067.1| cryptopain - cysteine proteinase secreted, possible transmembrane
domain near N-terminus [Cryptosporidium parvum Iowa II]
gi|76160841|gb|ABA40395.1| cryptopain-1 [Cryptosporidium parvum]
Length = 401
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 105/319 (32%), Positives = 164/319 (51%), Gaps = 35/319 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E++ ++ + Y E+ RF+I+K+N F L +N+F DL++E+F+A +
Sbjct: 87 EEFKKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMARF 146
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFY--DSIDWNERGAVTPVKDQGSY-CCWAFTA 120
TGY D +S+ + + S+ F +SI+W E G V P+++Q + CWAF+A
Sbjct: 147 TGYIKDSKDDERVFKSSRV-SASESEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSA 205
Query: 121 VATVEGLNKIRTGQ-LVTRSKHQLVDCSTLNG---CAKNFLENAFEYIRQYQRLASECVY 176
VA +EG +T + L + S+ Q VDCS NG C + AF+Y + + L + Y
Sbjct: 206 VAALEGATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTNDDY 265
Query: 177 PYQGRQ----DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAI--DA 229
PY + D +C+ + ++ Y+YV P L+ +++ P+SVAI D
Sbjct: 266 PYFAEEKTCMDSFCENYIEIP------VKAYKYVFPRNINALKTALAKYGPISVAIQADQ 319
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
T F FY GVF PCG NHGV +VGY + + + YWLV+N WG W E G +++
Sbjct: 320 TPFQFYKSGVFDAPCGTKVNHGVVLVGYD--MDEDTNKEYWLVRNSWGEAWGEKGYIKLA 377
Query: 290 RGVGGSGLCNIAANAAYPL 308
G G C I YP+
Sbjct: 378 LHSGKKGTCGILVEPVYPV 396
>gi|32394728|gb|AAM96000.1| cathepsin L precursor [Metapenaeus ensis]
Length = 322
Score = 164 bits (415), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 107/318 (33%), Positives = 164/318 (51%), Gaps = 41/318 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
+ + V++ R Y E R +F++N +F L++N+F D+T E+F
Sbjct: 20 QDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEF 79
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A+ G+ PT HP L + + +DW +GAVTPVKDQ CWAF
Sbjct: 80 AATMNGFLNVPTRHP-------VAILEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAF 132
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+ ++EG + ++ G+LV+ S+ LVDCS GC ++ AF+YI++ + + +E
Sbjct: 133 STTGSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEES 192
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW--F 232
YPY+ QD C R +S G+ + E L V+ P+SVAIDA+ F
Sbjct: 193 YPYEA-QDGKC---RFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSF 248
Query: 233 NFYHGGV-FTGPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FYH GV + C +T +HGV +GYG T + + YWLVKN W T+W + G +++ R
Sbjct: 249 QFYHQGVYYEKECSSTMLDHGVLAIGYGETDDG---KEYWLVKNSWNTSWGDKGFIQMSR 305
Query: 291 GVGGSGLCNIAANAAYPL 308
+ C IA+ A+YPL
Sbjct: 306 NKKNN--CGIASQASYPL 321
>gi|46576360|sp|P60994.1|ERVB_TABDI RecName: Full=Ervatamin-B; Short=ERV-B
gi|30749291|pdb|1IWD|A Chain A, Proposed Amino Acid Sequence And The 1.63 Angstrom X-ray
Crystal Structure Of A Plant Cysteine Protease Ervatamin
B: Insight Into The Structural Basis Of Its Stability
And Substrate Specificity
Length = 215
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 97/219 (44%), Positives = 129/219 (58%), Gaps = 18/219 (8%)
Query: 96 IDWNERGAVTPVKDQ---GSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-G 151
+DW +GAV +K+Q GS CWAF+AVA VE +NKIRTGQL++ S+ +LVDC T + G
Sbjct: 5 VDWRSKGAVNSIKNQKQCGS--CWAFSAVAAVESINKIRTGQLISLSEQELVDCDTASHG 62
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ NAF+YI + ++ YPY Q C +R + +I G+Q V E
Sbjct: 63 CNGGWMNNAFQYIITNGGIDTQQNYPYSAVQG-SCKPYRL----RVVSINGFQRVTRNNE 117
Query: 212 EGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
LQ V+ QPVSV ++A F Y G+FTGPCG NHGV IVGYGT + + Y
Sbjct: 118 SALQSAVASQPVSVTVEAAGAPFQHYSSGIFTGPCGTAQNHGVVIVGYGT----QSGKNY 173
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
W+V+N WG NW G + + R V S GLC IA +YP
Sbjct: 174 WIVRNSWGQNWGNQGYIWMERNVASSAGLCGIAQLPSYP 212
>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 92/220 (41%), Positives = 129/220 (58%), Gaps = 13/220 (5%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNG 151
S+DW ++G + VKDQGS CWAF+AVA +E +N I TG L++ S+ +LVDC S G
Sbjct: 4 SVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYNEG 63
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AFE++ + +E YPY+ R CD +R +A K I Y+ V E
Sbjct: 64 CDGGLMDYAFEFVINNGGIDTEEDYPYKERNG-VCDQYRKNA--KVVTIDSYEDVPVNNE 120
Query: 212 EGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ LQ V+ QPVS+A++A +F H G+FTG CG +HGV + GYGT E Y
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVVAGYGT----ENGMDY 176
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W+V+N WG W E G +R+ R V SGLC +A +YP+
Sbjct: 177 WIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLAIEPSYPV 216
>gi|413933048|gb|AFW67599.1| hypothetical protein ZEAMMB73_513726 [Zea mays]
Length = 205
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 96/200 (48%), Positives = 124/200 (62%), Gaps = 11/200 (5%)
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRL 170
CCWAF+AVA VEGLNKIRTG+LV+ S+ +LVDC GC ++NAF+++ + L
Sbjct: 12 CCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCDVSGVDQGCDGGLMDNAFQFVARRGGL 71
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA- 229
ASE YPYQGR D C S+A+ + +IRG++ V E L V+ QPVSVAI+
Sbjct: 72 ASESGYPYQGR-DGPCR--SSAAAARAASIRGHEDVPRNNEAALAAAVANQPVSVAINGE 128
Query: 230 -TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
F FY GV G CG NH +T VGYGT + YWL+KN WG +W EGG +RI
Sbjct: 129 DMAFRFYDSGVLGGACGTDLNHAITAVGYGTANDG---TRYWLMKNSWGASWGEGGYVRI 185
Query: 289 FRGVGGSGLCNIAANAAYPL 308
RGV G G+C +A +YP+
Sbjct: 186 RRGVRGEGVCGLAKLPSYPV 205
>gi|18202414|sp|P82473.1|CPGP1_ZINOF RecName: Full=Zingipain-1; AltName: Full=Cysteine proteinase GP-I
Length = 221
Score = 164 bits (415), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 98/220 (44%), Positives = 127/220 (57%), Gaps = 13/220 (5%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-G 151
DSIDW E+GAV PVK+QG CWAF A+A VEG+N+I TG L++ S+ QLVDCST N G
Sbjct: 5 DSIDWREKGAVVPVKNQGGCGSCWAFDAIAAVEGINQIVTGDLISLSEQQLVDCSTRNHG 64
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C + AF+YI + SE YPY G + CD + + +I Y+ V E
Sbjct: 65 CEGGWPYRAFQYIINNGGINSEEHYPYTG-TNGTCD---TKENAHVVSIDSYRNVPSNDE 120
Query: 212 EGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ LQ V+ QPVSV +DA F Y G+FTG C + NH T+ G E E + Y
Sbjct: 121 KSLQKAVANQPVSVTMDAAGRDFQLYRNGIFTGSCNISANHYRTVGG----RETENDKDY 176
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W VKN WG NW E G +R+ R + SG C IA + +YP+
Sbjct: 177 WTVKNSWGKNWGESGYIRVERNIAESSGKCGIAISPSYPI 216
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 110/319 (34%), Positives = 161/319 (50%), Gaps = 27/319 (8%)
Query: 9 GNIAAKHEQWMVEFARTY-KDQAEKEMRFKIFKKNHEF------------LRLNKFADLT 55
N A +QWM+++ + Y D E E RF ++ +N + L LN FADLT
Sbjct: 39 ANPMAAFQQWMMQYTKAYANDIKELETRFSVWLENLNYILAYNARTTSHWLHLNAFADLT 98
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++F + GY + +S+ F N IDW ++GAVT VK+QG
Sbjct: 99 TDEF-RNRLGYDFKARQASNRLQSSPFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGS 157
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLAS 172
CWAF +VEG+N I TG+L + S+ +LVDC T GC+ ++ A+++I + L +
Sbjct: 158 CWAFATTGSVEGINAIVTGELASLSEQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDT 217
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI--DAT 230
E YPY +D C + + + I GY + E L+ + QP++VAI DA
Sbjct: 218 EDDYPYTA-EDGVC--VAAKKNRRVVTIDGYVDIPENDEVALKKAAAHQPIAVAIEADAK 274
Query: 231 WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y GGV+ P CG + NHGV +VGYG YW+VKN WG W + G +R+
Sbjct: 275 SFQLYGGGVYDDPTCGTSLNHGVLVVGYGKDPHFGN---YWIVKNSWGPEWGDNGYIRLR 331
Query: 290 RGVGG-SGLCNIAANAAYP 307
G G+C IA ++P
Sbjct: 332 MGAEDVQGMCGIAMAPSFP 350
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 164 bits (414), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 114/325 (35%), Positives = 169/325 (52%), Gaps = 43/325 (13%)
Query: 16 EQWM---VEFARTYKDQAEKEMRFKIFKKN-HEFLR---------------LNKFADLTR 56
E+W +E + Y+D+ E+ R KIF +N H+ + LNK+AD+
Sbjct: 26 EEWQTFKLEHRKQYQDETEERFRLKIFNENKHKIAKHNQLYAAGEVSFKMGLNKYADMLH 85
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNW--FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC 114
+F + G+ ++ + + ++ + S+DW +GAVT VKDQG +C
Sbjct: 86 HEFHETMNGFNYTLHKQLRASDATFTGVTFISPEHVKLPQSVDWRNKGAVTGVKDQG-HC 144
Query: 115 --CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQR 169
CWAF++ +EG + +TG L++ S+ LVDCST NGC ++NAF YI+
Sbjct: 145 GSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 204
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGL-QDVVSRQPVSVAI 227
+ +E YPY+G D C + + G GA RG+ + E+ L Q V + PVSVAI
Sbjct: 205 IDTEKSYPYEGIDD-SCHFNK----GTIGATDRGFTDIPQGDEKKLAQAVATIGPVSVAI 259
Query: 228 DATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
DA+ F FY GV+ P + N HGV +VGYGT E + YWLVKN WGT W +
Sbjct: 260 DASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGTD---ENGKDYWLVKNSWGTTWGDK 316
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G +++ R C IA ++YPL
Sbjct: 317 GFIKMAR--NDDNQCGIATASSYPL 339
>gi|91092022|ref|XP_970951.1| PREDICTED: similar to cathepsin l [Tribolium castaneum]
gi|270001246|gb|EEZ97693.1| cathepsin L precursor [Tribolium castaneum]
Length = 343
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 117/324 (36%), Positives = 168/324 (51%), Gaps = 41/324 (12%)
Query: 15 HEQWM---VEFARTYKDQAEKEMRFKIFKKN-HEFLR---------------LNKFADLT 55
E+WM + + ++Y E+ R +IF +N H+ R LN FAD+
Sbjct: 30 QEEWMAFKLTYNKSYASPEEENFRREIFIENRHKIARFNQEYGRGQWSFVQQLNNFADML 89
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC- 114
+F + G+ + +S+ F + S+ + F D +DW E GAVTPVK+QGS
Sbjct: 90 HHEFHRTLNGFNRTLSARVGIPQSSTF--IPSANVIFPDYVDWREVGAVTPVKNQGSCAG 147
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLA 171
CWAF+A +EG N +TG+LV S L+DCST +GC+ + A+EY+R +
Sbjct: 148 CWAFSAAGALEGHNFRKTGRLVELSPQNLIDCSTNYGNDGCSGGLMNPAYEYVRTNPGID 207
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA- 229
+E YPY+ R C +R G Y GY + E+GL+ ++ PVS A+DA
Sbjct: 208 TEDSYPYEARNG-PCR-FRPETVGAY--CTGYVDIAEGDEQGLEAAIATLGPVSAAMDAG 263
Query: 230 -TWFNFYHGGVFTGP-CGNTP---NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
F FY G++ P CGN P NH V +VGYG TE GQ+ YWLVKN +G W GG
Sbjct: 264 RQSFQFYSDGIYYDPQCGNRPDDVNHAVLVVGYG--TEPNGQK-YWLVKNSYGPQWGIGG 320
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
+++ + C IA A+YPL
Sbjct: 321 YVKLAKDANNH--CGIAIQASYPL 342
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 102/264 (38%), Positives = 145/264 (54%), Gaps = 26/264 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASY 63
E W+ F + Y+ EK +RF++FK N + +L LN+FADL+ E+F Y
Sbjct: 52 ENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKGKSYWLGLNEFADLSHEEFKKMY 111
Query: 64 TGYKPPPT--DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
G K D S ++++ + S +DW ++GAV VK+QGS CWAF+
Sbjct: 112 LGLKTDIVRRDEERSYAEFAYRDVEAVPKS----VDWRKKGAVAEVKNQGSCGSCWAFST 167
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYPY 178
VA VEG+NKI TG L T S+ +L+DC T NGC ++ AFEYI + L E YPY
Sbjct: 168 VAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEEDYPY 227
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYH 236
++ C+ + + + I G+Q V E+ L ++ QP+SVAIDA+ F FY
Sbjct: 228 S-MEEGTCEMQKDES--ETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFYS 284
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTT 260
GGVF G CG +HGV VGYG++
Sbjct: 285 GGVFDGRCGVDLDHGVAAVGYGSS 308
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 108/322 (33%), Positives = 160/322 (49%), Gaps = 37/322 (11%)
Query: 15 HEQW---MVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLT 55
EQW + + Y+ E+ R KIF +N L +NK+AD+
Sbjct: 24 QEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADML 83
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F+ G+ + + L + + IDW ++GAVTPVKDQG
Sbjct: 84 HHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGS 143
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CW+F+A ++EG + ++G+LV+ S+ LVDCS NGC ++NAF YI+ +
Sbjct: 144 CWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGID 203
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT 230
+E YPY+ +D C + K RGY ++ E+ LQ V+ PVSVAIDA+
Sbjct: 204 TEQAYPYKA-EDEKCHY---KPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAIDAS 259
Query: 231 W--FNFYHGGVFTGP--CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GGV+ P + +HGV +VGYGT + YWLVKN WG +W + G +
Sbjct: 260 HQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGTEDDG---TDYWLVKNSWGKSWGDQGYI 316
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
++ R + C IA A+YPL
Sbjct: 317 KMARNRDNN--CGIATEASYPL 336
>gi|32394730|gb|AAM96001.1| cathepsin L precursor [Metapenaeus ensis]
Length = 306
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 107/318 (33%), Positives = 164/318 (51%), Gaps = 41/318 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
+ + V++ R Y E R +F++N +F L++N+F D+T E+F
Sbjct: 4 QDFKVQYGRHYGTAREDLYRQSVFEQNQQFIEDHNAKFENGEVTFTLKMNQFGDMTSEEF 63
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A+ G+ PT HP L + + +DW +GAVTPVKDQ CWAF
Sbjct: 64 AATMNGFLNVPTRHP-------VAILEADDETLPKHVDWRTKGAVTPVKDQKQCGSCWAF 116
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+ ++EG + ++ G+LV+ S+ LVDCS GC ++ AF+YI++ + + +E
Sbjct: 117 STTGSLEGQHFLKDGKLVSLSEQNLVDCSGKFGNMGCCGGLMDQAFKYIKENKGIDTEES 176
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW--F 232
YPY+ QD C R +S G+ + E L V+ P+SVAIDA+ F
Sbjct: 177 YPYEA-QDGKC---RFDSSNVGATDTGFVDIAHGEENSLMKAVANIGPISVAIDASHPSF 232
Query: 233 NFYHGGV-FTGPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FYH GV + C +T +HGV +GYG T + + YWLVKN W T+W + G +++ R
Sbjct: 233 QFYHQGVYYEKECSSTMLDHGVLAIGYGETDDG---KEYWLVKNSWNTSWGDKGFIQMSR 289
Query: 291 GVGGSGLCNIAANAAYPL 308
+ C IA+ A+YPL
Sbjct: 290 NKKNN--CGIASQASYPL 305
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 113/315 (35%), Positives = 165/315 (52%), Gaps = 34/315 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN------------HEF-LRLNKFADLTREKFLAS 62
E W F ++Y D E+ R +++ N H + L +N FADLT E+F
Sbjct: 31 EAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKRF 90
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
Y G K + P SN S+ F ++ + DS+DW G VTPVKDQG CW+F+
Sbjct: 91 YLGTKVD-LNRPRSNFSSTFI-PTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWSFSTT 148
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPY 178
+VEG + +TGQLV+ S+ LVDCS GC +++AF+YI + + +E YPY
Sbjct: 149 GSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEASYPY 208
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT--WFNFY 235
+ D C + +A+ + +Q + +E LQ+ V+ PVSVAIDA+ F Y
Sbjct: 209 TAK-DGTCKF---NAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQLY 264
Query: 236 HGGVFT-GPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
GV+ C +T +HGV GYGT+ PYWLVKN WG++W + G + + R
Sbjct: 265 TSGVYNEKKCSSTSLDHGVLAAGYGTSNGT----PYWLVKNSWGSSWGQAGYIWMSRNAN 320
Query: 294 GSGLCNIAANAAYPL 308
C IA +A+YP+
Sbjct: 321 NQ--CGIATSASYPI 333
>gi|125564726|gb|EAZ10106.1| hypothetical protein OsI_32416 [Oryza sativa Indica Group]
Length = 349
Score = 164 bits (414), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 113/300 (37%), Positives = 152/300 (50%), Gaps = 30/300 (10%)
Query: 28 DQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASYTGYKPPPTDHP 74
D AE E RF+ FK N + L LNKFAD+T E+F+A YTG K D
Sbjct: 42 DVAETESRFEAFKANARYVSEFNKKEGMTYKLGLNKFADMTLEEFVAKYTGTK---VDAA 98
Query: 75 HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTG 133
R+ + S DW + GAVTP ++QG+ CWAF+AV VEG N I TG
Sbjct: 99 AMARAPQAEEELELAGDVAASWDWRQHGAVTPAREQGTCESCWAFSAVGAVEGANAIATG 158
Query: 134 QLVTRSKHQLVDCSTLNGCAKNFLENAFEYIRQY---QRLASECVYPYQGRQDYYCDWWR 190
+LVT S+ Q++DCS C + F + Y Q ++ YP +D C R
Sbjct: 159 KLVTLSEQQVLDCSGAGDCIGG--GSYFPVLHGYAVKQGISPAGSYPPYEAKDRACR--R 214
Query: 191 SSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNFYHGGVFTGPCGNTPN 249
++ + + G V PA+E L+ V R PV+V+I+AT Y GV++GPCG T N
Sbjct: 215 NTPAVPVVKMDGAVDV-PASEAALKRSVYRAPVAVSIEATQSLQLYKEGVYSGPCGTTVN 273
Query: 250 HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV-GGSGLCNIAANAAYPL 308
HGV +VGYG T + YW++KN WG W + G + R V GLC IA Y +
Sbjct: 274 HGVLVVGYGVTRD---NIKYWIIKNSWGKEWGDNGFGHMKRDVIAKEGLCGIAMYGVYSV 330
>gi|313118764|gb|ADR32294.1| C14 cysteine protease [Solanum stoloniferum]
Length = 217
Score = 164 bits (414), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 93/220 (42%), Positives = 129/220 (58%), Gaps = 13/220 (5%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC--STLNG 151
S+DW ++G + VKDQGS CWAF+AVA +E +N I TG L++ S+ +LVDC S G
Sbjct: 4 SVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYNQG 63
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C ++ AFE++ + SE YPY+ R CD +R +A K I Y+ V E
Sbjct: 64 CDGGLMDYAFEFVINNGGIDSEEDYPYKERNG-VCDQYRKNA--KVVVIDSYEDVPVNNE 120
Query: 212 EGLQDVVSRQPVSVAIDATWFNFYH--GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ LQ V+ QPVS+A++A +F H G+FTG CG +HGV GYGT E Y
Sbjct: 121 KALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT----ENGLDY 176
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
W+V+N WG +W E G +R+ R V SGLC +A +YP+
Sbjct: 177 WIVRNSWGADWGEKGYLRVQRNVASSSGLCGLAIEPSYPV 216
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 163 bits (413), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 113/323 (34%), Positives = 170/323 (52%), Gaps = 41/323 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + E ++ Y+ E+ R KIF +N + L +NK+ D+ +F
Sbjct: 30 ESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDMLHHEF 89
Query: 60 LASYTGYKPPPTDHPH-SNRSNWFKNLN----SSKMSFYDSIDWNERGAVTPVKDQGSY- 113
+ G++ + + +NR F+ + + S+DW E+GAVT VKDQGS
Sbjct: 90 VNMMNGFRANTSGAGYKANRG--FQGAHFVEPPEDVVMPKSVDWREKGAVTEVKDQGSCG 147
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRL 170
CWAF+A +EG + +TG LV+ S+ LVDCS+ NGC ++NAF+YI+ +
Sbjct: 148 SCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIKVNGGI 207
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDA 229
+E YPY+ +D C + ++A RG+ V+ E L+ ++ PVSVAIDA
Sbjct: 208 DTEKSYPYEA-EDEPCRYNPANAGAD---DRGFVDVREGNENALKKAIATIGPVSVAIDA 263
Query: 230 TW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+ F FY GV++ P + N HGV VGYGTT E Q YWLVKN W +W + G
Sbjct: 264 SQDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTT---EDGQDYWLVKNSWSKSWGDQGY 320
Query: 286 MRIFRGVGGSGLCNIAANAAYPL 308
++I R + +C IA+ A+YPL
Sbjct: 321 IKIARNQ--NNMCGIASAASYPL 341
>gi|356549192|ref|XP_003542981.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 517
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 168/315 (53%), Gaps = 36/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF---------------LRLNKFADLTREKFL 60
++W E + Y+ ++++RF+ FK+N ++ L LN+FAD++ E+F
Sbjct: 51 QRWKEENKKIYRSPDQEKLRFENFKRNLKYIAEKNSKRISPYGQSLGLNRFADMSNEEFK 110
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG-SYCCWAFT 119
+ +T P S R+ +S + + Y S+DW ++G VT VKDQG CCWAF+
Sbjct: 111 SKFTS----KVKKPFSKRNGLSGKDHSCEDAPY-SLDWRKKGVVTAVKDQGYCGCCWAFS 165
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPY 178
+ +EG+N I +G L++ S+ +LVDC N GC ++ AFE++ + +E YPY
Sbjct: 166 STGAIEGINAIVSGDLISLSEPELVDCDRTNDGCDGGHMDYAFEWVMHNGGIDTETNYPY 225
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID-ATW-FNFYH 236
G D C+ + K I GY V+ + L V +QP+S ID ++W F Y
Sbjct: 226 SG-ADGTCNVAKEET--KVIGIDGYYNVEQSDRSLLCATV-KQPISAGIDGSSWDFQLYI 281
Query: 237 GGVFTGPCGNTP---NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
GG++ G C + P +H + +VGYG+ EG + YW+VKN WGT+W G + I R
Sbjct: 282 GGIYDGDCSSDPDDIDHAILVVGYGS----EGDEDYWIVKNSWGTSWGMEGYIYIRRNTN 337
Query: 294 GS-GLCNIAANAAYP 307
G+C I A+YP
Sbjct: 338 LKYGVCAINYMASYP 352
>gi|33333704|gb|AAQ11970.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 111/324 (34%), Positives = 164/324 (50%), Gaps = 54/324 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFADLTREKF 59
+Q+ ++ +TY+ E++ RF +F+KN + ++ +FAD+T E+F
Sbjct: 24 QQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEF 83
Query: 60 L--ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
L G P++ H F N M D+IDW E GAVTPVKDQ + CW
Sbjct: 84 LDLLKLQGVPALPSNAVH------FDNFEDIDMEEKDAIDWREEGAVTPVKDQANCGSCW 137
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLAS 172
AF+AV +EG + G LV+ S +LVDC+T NGC + AF+++ Q + + +
Sbjct: 138 AFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFV-QDEGIQT 196
Query: 173 ECVYPYQGRQDYYCDWWRSSA--SGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDA 229
E YPY+GR RSS SG+Y + YV P E+ + + V ++ PV+VAI+A
Sbjct: 197 EESYPYEGR--------RSSCKKSGEY-VTKVKTYVFPLDEQEMARTVAAKGPVAVAIEA 247
Query: 230 TWFNFYHGGVFTGPC-----GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+ +FY G+ C NHGV +VGYG+ E YW+VKN WG +W E G
Sbjct: 248 SQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGS----ENGVDYWIVKNSWGADWGEKG 303
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
R+ + V C I YP+
Sbjct: 304 YFRLKKDVKA---CGIGTYNTYPV 324
>gi|67605684|ref|XP_666697.1| cryptopain precursor [Cryptosporidium hominis TU502]
gi|54657738|gb|EAL36466.1| cryptopain precursor [Cryptosporidium hominis]
Length = 401
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 105/321 (32%), Positives = 166/321 (51%), Gaps = 39/321 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E++ ++ +TY E+ RF+I+K+N F L +N+F DL++E+F+A +
Sbjct: 87 EEFKKKYNKTYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMARF 146
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFY----DSIDWNERGAVTPVKDQGSY-CCWAF 118
TGY D +S+ +++S++ +SI+W E G V P+++Q + CWAF
Sbjct: 147 TGYIKDSKDDERVFKSS---RVSASELEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAF 203
Query: 119 TAVATVEGLNKIRTGQ-LVTRSKHQLVDCSTLNG---CAKNFLENAFEYIRQYQRLASEC 174
+AVA +EG +T + L + S+ Q VDCS NG C + AF+Y + + L +
Sbjct: 204 SAVAALEGATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTND 263
Query: 175 VYPYQGRQ----DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAI-- 227
YPY + D +C+ + ++ Y+YV P L+ +++ P+SVAI
Sbjct: 264 DYPYFAEEKTCMDSFCENYIEIP------VKAYKYVFPRNINTLKTALAKYGPISVAIQA 317
Query: 228 DATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
D T F FY GVF PCG NHGV +VGY + + + YWLV+N WG W E G ++
Sbjct: 318 DQTPFQFYKSGVFDAPCGTKVNHGVVLVGYD--MDEDTNKEYWLVRNSWGEAWGEKGYIK 375
Query: 288 IFRGVGGSGLCNIAANAAYPL 308
+ G G C I YP+
Sbjct: 376 LALHSGKKGTCGILVEPVYPV 396
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 111/316 (35%), Positives = 161/316 (50%), Gaps = 37/316 (11%)
Query: 20 VEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTREKFLASY 63
+E + YK+ E+ R KIF N H+ L++NK+ D+ +F+ +
Sbjct: 33 MEHNKVYKNDVEERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTL 92
Query: 64 TGYKPPPTDHPHSNRSNWFKN-LNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTA 120
G+ S R + + + + ++DW E GAVTPVKDQG +C CW+F+A
Sbjct: 93 NGFNKSINTQLRSERLPIAASFIEPANVVLPKTVDWREHGAVTPVKDQG-HCGSCWSFSA 151
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYP 177
+EG + RTG L+ S+ L+DCS NGC ++ AF+YI+ + L +E YP
Sbjct: 152 TGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYP 211
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW--FNF 234
Y+ D C R +A+ GY + E+ L+ V+ PVSVAIDA+ F F
Sbjct: 212 YEAEND-KC---RYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQF 267
Query: 235 YHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ P ++ N HGV VGYGT E Q YWLVKN WG W + G +++ R
Sbjct: 268 YSEGVYYEPECSSENLDHGVLAVGYGTD---ENGQDYWLVKNSWGETWGDNGYIKMAR-- 322
Query: 293 GGSGLCNIAANAAYPL 308
C IA+ A+YPL
Sbjct: 323 NKLNHCGIASTASYPL 338
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 163 bits (413), Expect = 8e-38, Method: Compositional matrix adjust.
Identities = 109/322 (33%), Positives = 161/322 (50%), Gaps = 37/322 (11%)
Query: 15 HEQW---MVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLT 55
EQW + + Y+ E+ R KIF +N L +NK+AD+
Sbjct: 24 QEQWGAFKMTHNKQYQSDTEERFRMKIFMENSHTVAKHNKLYAQGLVSFKLGINKYADML 83
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+F+ G+ + + L + + IDW ++GAVTPVKDQG
Sbjct: 84 HHEFVQVLNGFNRTKSGLRSGESDDSVTFLPPANVQLPGQIDWRDKGAVTPVKDQGQCGS 143
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CW+F+A ++EG + ++G+LV+ S+ LVDCS NGC ++NAF YI+ +
Sbjct: 144 CWSFSATGSLEGQHFRKSGKLVSLSEQNLVDCSEKFGNNGCNGGLMDNAFRYIKANGGID 203
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT 230
+E YPY+ +D C + K RGY ++ E+ LQ V+ PVSVAIDA+
Sbjct: 204 TEQAYPYKA-EDEKCHY---KPKNKGATDRGYVDIESGNEDKLQSAVATVGPVSVAIDAS 259
Query: 231 W--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GGV+ P C + +HGV +VGYGT + YWLVKN WG +W + G +
Sbjct: 260 HQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGTEDDG---TDYWLVKNSWGKSWGDQGYI 316
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
++ R + C IA A+YPL
Sbjct: 317 KMARNRDNN--CGIATEASYPL 336
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 113/325 (34%), Positives = 166/325 (51%), Gaps = 42/325 (12%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W ++ + Y + E+ +R KI+ K N F LR+NK+ DL
Sbjct: 25 EEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLH 84
Query: 57 EKFLASYTGYKPPPTDHPH---SNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
E+F+ + G+ P + + + ++DW E+GAVTPVKDQG +
Sbjct: 85 EEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQG-H 143
Query: 114 C--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQ 168
C CW+F+A +EG + +TG+LV+ S+ LVDCST NGC ++ AF+YI+
Sbjct: 144 CGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDNG 203
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAI 227
+ +E YPY+ D C + A G +G+ + E+ L ++ PVSVAI
Sbjct: 204 GIDTEKAYPYEAIDD-TCH-YNPKAVG--ATDKGFVDIPQGDEKALMKAIATAGPVSVAI 259
Query: 228 DATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
DA+ F FY GV+ P ++ N HGV VGYGT+ E E YWLVKN WGT W +
Sbjct: 260 DASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGE---DYWLVKNSWGTTWGDQ 316
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G +++ R C IA A+YPL
Sbjct: 317 GYVKMARNRDNH--CGIATAASYPL 339
>gi|351694420|gb|EHA97338.1| Cathepsin K [Heterocephalus glaber]
Length = 329
Score = 163 bits (413), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 164/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E W + + Y + ++ R I++KN ++ L +N D+T E+
Sbjct: 27 ELWKKTYQKQYNGKVDELSRRLIWEKNLKYISIHNLEASLGVHTYELSMNHLGDMTNEEV 86
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP H HSN + + + DS+D+ ++G VTPVK+QG CWAF
Sbjct: 87 VQKMTGLKVPPA-HSHSNDTLYIPDWEGRAP---DSVDYRKKGYVTPVKNQGQCGSCWAF 142
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y++Q + + SE YP
Sbjct: 143 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQQNRGIDSEDAYP 202
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ V E+ L+ V+R P+SVAIDA T F F
Sbjct: 203 YVG-QDESCMY---NPTGKAAKCRGYREVPVGNEKALKRAVARVGPISVAIDASLTSFQF 258
Query: 235 YHGGVFTGPC--GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ G+ NH V VGYG + +W++KN WG NW G + + R
Sbjct: 259 YSKGVYYDESCDGDNLNHAVLAVGYGI----QRGHKHWILKNSWGENWGNKGYVLLARNK 314
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 315 NNT--CGIANLASFP 327
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 163 bits (412), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 107/319 (33%), Positives = 169/319 (52%), Gaps = 41/319 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE----------------FLRLNKFADLTREKF 59
E W + + Y E+++R KIF +N F+++N + DL +F
Sbjct: 30 ESWKLTHQKGYDSSVEEKLRLKIFMENSLRISRHNAEAIQGRHTYFMKMNHYGDLLHHEF 89
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSK-MSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+A GY ++N++ SK ++ + +DW E GAVTPVK+QG CW+
Sbjct: 90 VAMVNGY-------IYNNKTTLGGTFIPSKNINLPEHVDWREEGAVTPVKNQGQCGSCWS 142
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+A ++EG + +TG+L++ S+ LVDCS NGC ++ AF+YI+ + +E
Sbjct: 143 FSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYGNNGCEGGLMDYAFKYIQDNNGIDTEA 202
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW-- 231
YPY+G D +C + K G+ G+ ++ +E+ LQ ++ P+SVAIDA+
Sbjct: 203 SYPYEGI-DGHCHY---DPKNKGGSDIGFVDIKKGSEKDLQKALATVGPISVAIDASHMS 258
Query: 232 FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GV++ + N HGV VGYGT E G+ YWLVKN W W E G +++
Sbjct: 259 FQFYSHGVYSEKKCSPENLDHGVLAVGYGTD-EVTGED-YWLVKNSWSEKWGEDGYIKMA 316
Query: 290 RGVGGSGLCNIAANAAYPL 308
R +C IA++A+YP+
Sbjct: 317 R--NKDNMCGIASSASYPV 333
>gi|147903593|ref|NP_001080822.1| cathepsin S precursor [Xenopus laevis]
gi|33417128|gb|AAH56059.1| Ctss-a protein [Xenopus laevis]
Length = 333
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/316 (34%), Positives = 160/316 (50%), Gaps = 40/316 (12%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLA 61
W ++ Y+D+ E R ++KN +F L +N AD+T E+ +
Sbjct: 30 WKNTHSKEYEDETEDLQRRITWEKNLDFVNMHNLEYSMGMHTYELGMNHLADMTSEEMKS 89
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKM--SFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
TG PP HS R F + + DSIDW ++G V+ VK+QG CWAF
Sbjct: 90 KLTGLILPP----HSERKAKFSSQRNGTFGGKVRDSIDWRDKGCVSDVKNQGGCGSCWAF 145
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+AV +EG ++TG+LV+ S LVDC++ GC+ F+ +AF+Y+ + S+
Sbjct: 146 SAVGALEGQLMLKTGKLVSLSPQNLVDCASKYGNKGCSGGFMTSAFQYVIDNNGIDSDSY 205
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVV-SRQPVSVAIDAT--WF 232
YPY D C + +GK + Y + P TE+ L+ + + P+SVAID T F
Sbjct: 206 YPYHA-MDEKCHY---ELAGKASSCVKYTEIVPGTEDNLKQALGTIGPISVAIDGTRPTF 261
Query: 233 NFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
Y GV++ P C NHGV +GYGT Q +WL+KN WGT + + G +RI R
Sbjct: 262 FLYKSGVYSDPSCSQEVNHGVLAIGYGTLN----GQDFWLLKNSWGTYYGDKGFVRIARN 317
Query: 292 VGGSGLCNIAANAAYP 307
G LC +A+ +YP
Sbjct: 318 KG--NLCGVASYTSYP 331
>gi|113120265|gb|ABI30272.1| VXH-A, partial [Vasconcellea x heilbornii]
Length = 318
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 107/286 (37%), Positives = 147/286 (51%), Gaps = 44/286 (15%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
+ WMVE+ + YKD EK RF+IFK N ++ L L F DLT ++F Y
Sbjct: 49 DSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKY 108
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G P + + SN + + ++ SIDW ++GAVTPV++QGS CW F++VA
Sbjct: 109 VG--SIPENWSTTEESNDKEFIYDDVVNIPASIDWRQKGAVTPVRNQGSCGSCWTFSSVA 166
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYI-------RQYQRLASEC 174
VEG+NKI TGQLV+ S+ +L+DC + GC F A +Y+ RQY
Sbjct: 167 AVEGINKIVTGQLVSLSEQELLDCERRSYGCRGGFPPYALQYVANSGIHLRQY------- 219
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT--WF 232
YPY+G Q + A G G VQ E+ L ++ QPVS+ ++A F
Sbjct: 220 -YPYEGVQR---QCRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEAKGRAF 275
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGT 278
Y GG+F GPCG + +H V VGYG Y L+KN WGT
Sbjct: 276 QNYRGGIFAGPCGTSIDHAVAAVGYGNG--------YILIKNSWGT 313
>gi|530736|emb|CAA56915.1| cathepsin l [Nephrops norvegicus]
gi|1582621|prf||2119193B cathepsin L-related Cys protease
Length = 313
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/317 (34%), Positives = 164/317 (51%), Gaps = 43/317 (13%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR----------------LNKFADLTREKF 59
E + ++ R Y D E+ R ++F++N + + +N+F D+T E+F
Sbjct: 13 EHFKTQYGRKYGDAKEELYRQRVFQQNEQLVEAFNKKFENGEVTFKVAMNQFGDMTNEEF 72
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A GYK P + + +DW +GAVTPVKDQG CWAF
Sbjct: 73 NAVMKGYKKGSRGEPTTV-------FTAEGRPMAADVDWRTKGAVTPVKDQGQCGSCWAF 125
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+A ++EG + ++ +LV+ S+ +LVDCST +GC ++ +AF+YI+ + +E
Sbjct: 126 SATGSLEGQHFLKNNELVSLSEQELVDCSTEYGNDGCGGGWMTSAFDYIKDNGGIDTESS 185
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATWFN- 233
YPY+ QD C R A+ G+ VQ TEE L + VS P+SVAIDA+ F+
Sbjct: 186 YPYEA-QDRSC---RFDANSIGATCTGFVEVQ-HTEEALHEAVSDIGPISVAIDASHFSF 240
Query: 234 -FYHGGV-FTGPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GV + C T +HGV VGYGT E + YWLVKN WG+ W + G +++ R
Sbjct: 241 QFYSSGVYYEKKCSPTNLDHGVLAVGYGT----ESTEDYWLVKNSWGSGWGDAGYIKMSR 296
Query: 291 GVGGSGLCNIAANAAYP 307
+ C IA+ +YP
Sbjct: 297 NRDNN--CGIASEPSYP 311
>gi|417409774|gb|JAA51378.1| Putative cathepsin k, partial [Desmodus rotundus]
Length = 331
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 166/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
EQW + + Y + ++ R I++KN H + L +N D+T E+
Sbjct: 29 EQWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEV 88
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ H SN + + + DSID+ ++G VTPVK+QG CWAF
Sbjct: 89 VQKMTGLKVPPS-HSRSNDTRYVPDWEGK---VPDSIDYRKKGYVTPVKNQGQCGSCWAF 144
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF Y+++ Q + SE YP
Sbjct: 145 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFHYVQKNQGIDSEDAYP 204
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R P+SVAIDA T F F
Sbjct: 205 YVG-QDESCMY---NPTGKAAKCRGYKEIPEGNEKALKRAVARVGPISVAIDASLTSFQF 260
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + ++ +W++KN WG +W G + + R
Sbjct: 261 YSKGVYYDKNCNSDNLNHAVLAVGYGI----QKRKKHWIIKNSWGESWGNKGYILMARNK 316
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 317 NNA--CGIANLASFP 329
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 101/257 (39%), Positives = 141/257 (54%), Gaps = 16/257 (6%)
Query: 43 HEF-LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNER 101
H F L L +FADLT E++ A + + L + D++DW ER
Sbjct: 115 HGFRLGLTRFADLTLEEYRARLL-LGSRGRNGTAVGVVGRRRYLPLAGEQLPDAVDWRER 173
Query: 102 GAVTPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFL 157
GAV VKDQG C CWAF+AVA VEG+NKI TG L++ S+ +L+DC GC +
Sbjct: 174 GAVAEVKDQGQ-CGGCWAFSAVAAVEGINKIVTGSLISLSEQELIDCDKFQDQGCDGGLM 232
Query: 158 ENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDV 217
+NAF ++ + + +E YP+ G D CD + + +I ++ V E LQ
Sbjct: 233 DNAFVFMIKNGGIDTEADYPFTG-HDGTCDLKLKNT--RVVSIDSFERVPINYERALQKA 289
Query: 218 VSRQPVSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNR 275
V+ QPVS +I+A+ F Y G+F G CG +HGVT+VGYG+ EG + YW+VKN
Sbjct: 290 VAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGS----EGGKDYWIVKNS 345
Query: 276 WGTNWDEGGSMRIFRGV 292
WGT W E G +R+ R V
Sbjct: 346 WGTQWGEAGYVRMARNV 362
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 163/324 (50%), Gaps = 39/324 (12%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADL 54
+ A +E+W+V+ + Y EK RF+IFK N + L LN+FADL
Sbjct: 30 VIALYEEWLVKHQKLYSSLGEKIKRFEIFKDNLRYIDQQNHYNKVNHMNFTLGLNQFADL 89
Query: 55 TREKFLASYTG----YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
T ++F + Y G Y+ + +P+ + L + DS+DW E+G V P+++Q
Sbjct: 90 TLDEFSSIYLGTSVDYEQIISSNPNHDDVEE-DILKEDVVELPDSVDWREKGVVFPIRNQ 148
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ 168
G CW F+AVA++E LN I+ G ++ S+ +L+DC T++ GC NAF Y+ +
Sbjct: 149 GKCGSCWTFSAVASIETLNGIKKGHMIALSEQELLDCETISQGCKGGHYNNAFAYVAK-N 207
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG---LQDVVSRQPVSV 225
+ SE YPY RQ + K I GY+ V P G V+V
Sbjct: 208 GITSEEKYPYIFRQG------QCYQKEKVVKISGYKRV-PRNNGGQLQSAVAQQVVSVAV 260
Query: 226 AIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
++ F FY G+F+G CG +H V IVGYG+ +G YW+++N WGTNW E G
Sbjct: 261 KCESKDFQFYDRGIFSGACGPILDHAVNIVGYGS----KGGANYWIMRNSWGTNWGENGY 316
Query: 286 MRIFRGVGG-SGLCNIAANAAYPL 308
MRI + G C IA +YP+
Sbjct: 317 MRIQKNSKHYEGHCGIAMQPSYPV 340
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 163 bits (412), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 113/329 (34%), Positives = 173/329 (52%), Gaps = 46/329 (13%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTR 56
E+W ++ + Y ++E+ +R KI+ +N H+ LR+NK+ADL
Sbjct: 25 EEWNAFKLQHRKKYDSESEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLH 84
Query: 57 EKFLASYTGY-KPPPTDHPHSNRSNWFKN------LNSSKMSFYDSIDWNERGAVTPVKD 109
E+F+ + G+ + R + + + +IDW E+GAVTPVKD
Sbjct: 85 EEFVHTLNGFNRSAAAGSKLLGREQLMTIEEPITWIEPANVDVPTTIDWREKGAVTPVKD 144
Query: 110 QGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYI 164
QG +C CW+F+A +EG + +TG+LV+ S+ LVDCST NGC ++NAF+Y+
Sbjct: 145 QG-HCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYV 203
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PV 223
+ + + +E YPY+ D C + A G +G+ + E+ L+ ++ PV
Sbjct: 204 KDNKGIDTEKAYPYEAIDD-ECH-YNPKAIG--ATDKGFVDIPQGDEKALKKALATVGPV 259
Query: 224 SVAIDATW--FNFYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
SVAIDA+ F FY GV+ P ++ +HGV VGYGTT + E YWLVKN WGT
Sbjct: 260 SVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTEDGE---DYWLVKNSWGTT 316
Query: 280 WDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
W + G +++ R C IA A+YPL
Sbjct: 317 WGDQGYVKMAR--NRENHCGIATTASYPL 343
>gi|33333708|gb|AAQ11972.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/324 (34%), Positives = 166/324 (51%), Gaps = 54/324 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFADLTREKF 59
+Q+ ++ +TY+ E++ RF +F+KN + ++ +FAD+T E+F
Sbjct: 24 QQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEF 83
Query: 60 L--ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
L G P++ H F N + M D++DW E GAVTPVKDQ + CW
Sbjct: 84 LDLLKLQGVPALPSNAVH------FDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCW 137
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLAS 172
AF+AV +EG + G LV+ S +LVDC+T NGC + AF+++ Q + + +
Sbjct: 138 AFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFV-QDEGIQT 196
Query: 173 ECVYPYQGRQDYYCDWWRSSA--SGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDA 229
E YPY+GR RSS SG Y + YV P E+ + + V ++ PV+VAI+A
Sbjct: 197 EESYPYEGR--------RSSCKKSGDY-VTKVKTYVFPLDEQEMARTVAAKGPVAVAIEA 247
Query: 230 TWFNFYHGGVF--TGPCGN---TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+ +FY G+ T C N NHGV +VGYG+ E YW+VKN WG +W E G
Sbjct: 248 SQLSFYDKGIVDETCRCSNKREDLNHGVLVVGYGS----ENGVDYWIVKNSWGADWGEKG 303
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
R+ + V C I YP+
Sbjct: 304 YFRLKKDVKA---CGIGYYNTYPI 324
>gi|357127811|ref|XP_003565571.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 364
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/334 (34%), Positives = 165/334 (49%), Gaps = 64/334 (19%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------------------RLN 49
W ++++TY E+E RF +F+ N + +N
Sbjct: 49 WQAKYSKTYPSHEEQEKRFGVFRGNINNIGAFSAAQTTTTAVVGSFGAPQTVTTVRVGMN 108
Query: 50 KFADLTREKFLASYTGYK-------PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERG 102
+F DL + L +TG+ P PT P+ +R +DW G
Sbjct: 109 RFGDLQPSEVLEQFTGFNSTVVLKTPKPTRLPYHSRKPC-------------CVDWRSSG 155
Query: 103 AVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLENA 160
AVT VK QGS CWAF AVA +EG+NKIRTG LV+ S+ QLVDC +GCA + A
Sbjct: 156 AVTGVKFQGSCLSCWAFAAVAAIEGMNKIRTGTLVSLSEQQLVDCDKGSSGCAGGRTDTA 215
Query: 161 FEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVS 219
+ + + + SE YPY G C+ ++ AI +G++ V P E L V+
Sbjct: 216 LDLVAKRGGITSEEKYPYGGFNG-KCN--VDKLLFEHAAIVKGFKAVPPNDEHQLALAVA 272
Query: 220 RQPVSVAIDA-TW-FNFYHGGVFTGPCGNTP---NHGVTIVGYGTTTEAEGQQPYWLVKN 274
+QPV+V +DA TW F FY GG+F GPC P NH VTIVGY E G++ +W+ KN
Sbjct: 273 QQPVTVYVDASTWEFQFYSGGIFRGPCSTDPARVNHAVTIVGY---CEDFGEK-FWIAKN 328
Query: 275 RWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYP 307
W +W + G + + + V +G C++A++ YP
Sbjct: 329 SWSNDWGDQGYIYLAKDVAWPTGTCSLASSPFYP 362
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/336 (33%), Positives = 171/336 (50%), Gaps = 42/336 (12%)
Query: 4 TSHKTGNIAAKHEQWM---VEFARTYKDQAEKEMRFKIFKKN-HEF-------------- 45
T H +++WM +E + YK E+ R KIF N H+
Sbjct: 14 TVHAVSFFELVNQEWMTFKMEHKKAYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSY 73
Query: 46 -LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKN-LNSSKMSFYDSIDWNERGA 103
L++NK+ D+ +F+ G+ S R + + + ++ +DW + GA
Sbjct: 74 KLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASFIEPANVALPKKVDWRKEGA 133
Query: 104 VTPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLE 158
VTPVKDQG +C CW+F+A +EG + RTG LV+ S+ L+DCS NGC ++
Sbjct: 134 VTPVKDQG-HCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMD 192
Query: 159 NAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIR-GYQYVQPATEEGLQDV 217
AF+YI+ + L +E YPY+ D C + +++ GAI GY + E+ L+
Sbjct: 193 QAFQYIKDNKGLDTEASYPYEAEND-KCRYNPANS----GAIDVGYIDIPTGNEKLLKAA 247
Query: 218 VSR-QPVSVAIDATW--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLV 272
V+ PVSVAIDA+ F FY GV+ P C + +HGV ++GYGT E + YWLV
Sbjct: 248 VATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTN---ENGEDYWLV 304
Query: 273 KNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
KN WG W G +++ R C IA++A+YPL
Sbjct: 305 KNSWGETWGNNGYIKMAR--NKLNHCGIASSASYPL 338
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 111/316 (35%), Positives = 161/316 (50%), Gaps = 37/316 (11%)
Query: 20 VEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTREKFLASY 63
+E + YK+ E+ R KIF N H+ L++NK+ D+ +F+ +
Sbjct: 33 MEHNKVYKNDIEERFRMKIFMDNKHKIAKHNGNYEMKKVSYKLKMNKYGDMLHHEFVNTL 92
Query: 64 TGYKPPPTDHPHSNRSNWFKN-LNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTA 120
G+ S R + + + + ++DW E GAVTPVKDQG +C CW+F+A
Sbjct: 93 NGFNKSINTQLRSERLPIGASFIEPANVVLPKTVDWREHGAVTPVKDQG-HCGSCWSFSA 151
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYP 177
+EG + RTG L+ S+ L+DCS NGC ++ AF+YI+ + L +E YP
Sbjct: 152 TGALEGQHFRRTGILIPLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYP 211
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW--FNF 234
Y+ D C R +A+ GY + E+ L+ V+ PVSVAIDA+ F F
Sbjct: 212 YEAEND-KC---RYNAANSGARDVGYVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQF 267
Query: 235 YHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ P ++ N HGV VGYGT E Q YWLVKN WG W + G +++ R
Sbjct: 268 YSEGVYYEPECSSENLDHGVLAVGYGTD---ENGQDYWLVKNSWGETWGDNGYIKMAR-- 322
Query: 293 GGSGLCNIAANAAYPL 308
C IA+ A+YPL
Sbjct: 323 NKLNHCGIASTASYPL 338
>gi|33333698|gb|AAQ11967.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 164/324 (50%), Gaps = 54/324 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFADLTREKF 59
+Q+ ++ +TY+ E++ RF +F+KN + ++ +FAD+T E+F
Sbjct: 24 QQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEF 83
Query: 60 L--ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
L G P++ H F N + M D++DW E GAVTPVKDQ + CW
Sbjct: 84 LDLLKLQGVPALPSNAVH------FDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCW 137
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLAS 172
AF+AV +EG + G LV+ S +LVDC+T NGC + AF+++ Q + + +
Sbjct: 138 AFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFV-QDEGIQT 196
Query: 173 ECVYPYQGRQDYYCDWWRSSA--SGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDA 229
E YPY+GR RSS SG Y + YV P E+ + + V ++ PV+VAI+A
Sbjct: 197 EESYPYEGR--------RSSCKKSGDY-VTKVKTYVFPLDEQEMARTVAAKGPVAVAIEA 247
Query: 230 TWFNFYHGGVFTGPC-----GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+ +FY G+ C NHGV +VGYG+ E YW+VKN WG +W E G
Sbjct: 248 SQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGYGS----ENGVDYWIVKNSWGADWGEKG 303
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
R+ + V C I YP+
Sbjct: 304 YFRLKKDVKA---CGIGYYNTYPI 324
>gi|357518983|ref|XP_003629780.1| Cysteine proteinase [Medicago truncatula]
gi|355523802|gb|AET04256.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 108/300 (36%), Positives = 147/300 (49%), Gaps = 18/300 (6%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNKFADLTREKFLASYTGYKPPPTDHPH 75
+ W E R Y + E+ M K + L LNKFAD++ E+F +Y P
Sbjct: 68 QMWKKEHGRDYANSEEENMNAKRKSQTQHRLSLNKFADMSPEEFSKTYL---PKIEMQVP 124
Query: 76 SNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYCC---WAFTAVATVEGLNKIRT 132
SNR N + + S+DW E+GAVT V+DQG C WAF+ +EGLNKI T
Sbjct: 125 SNRDNAKLKDDDDCENLPTSVDWREKGAVTEVRDQGD--CQSHWAFSVTGAIEGLNKIVT 182
Query: 133 GQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRS 191
G L+ S +LVDC + GCA F NAF Y+ + + +E YPY + C
Sbjct: 183 GNLINLSAQELVDCDPASKGCAGGFYFNAFGYVIENGGIDTEANYPYLAKNGT-C----K 237
Query: 192 SASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHGGVFTGPCGNTPNHG 251
+ K +I V TEE L S+QPVSV++DAT FY GGV+ G +
Sbjct: 238 ENANKVVSIDNL-LVLDGTEEALLCRTSKQPVSVSLDATGLQFYAGGVYGGENCKKESRN 296
Query: 252 VTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS---GLCNIAANAAYPL 308
+VG ++ + YW+VKN WG +W E G + I R V G+C I A YP+
Sbjct: 297 ANLVGLIVGYDSVNGEDYWIVKNSWGKDWGEKGYLFIKRNVFEDWPFGVCAINAAVGYPV 356
>gi|33333702|gb|AAQ11969.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 165/324 (50%), Gaps = 54/324 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFADLTREKF 59
+Q+ ++ +TY+ E++ RF +F+KN + ++ +FAD+T E+F
Sbjct: 24 QQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEF 83
Query: 60 L--ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
L G P++ H F N + M D++DW E GAVTPVKDQ + CW
Sbjct: 84 LDLLKLQGVPALPSNAVH------FDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCW 137
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLAS 172
AF+AV +EG + G LV+ S +LVDC+T NGC + AF+++ Q + + +
Sbjct: 138 AFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFV-QDEGIQT 196
Query: 173 ECVYPYQGRQDYYCDWWRSSA--SGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDA 229
E YPY+GR RSS SG+Y + YV P E+ + + V ++ PV+VAI+A
Sbjct: 197 EESYPYEGR--------RSSCKKSGEY-VTKVKTYVFPLDEQEMARTVAAKGPVAVAIEA 247
Query: 230 TWFNFYHGGVFTGPC-----GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+ +FY G+ C NHGV +VGYG+ E YW+VKN WG +W E G
Sbjct: 248 SQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGS----ENGVDYWIVKNSWGADWGEKG 303
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
R+ + V C I YP+
Sbjct: 304 YFRLKKDVKA---CGIGYYNPYPI 324
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 117/338 (34%), Positives = 172/338 (50%), Gaps = 43/338 (12%)
Query: 2 SRTSHKTGNIAAKHEQWM---VEFARTYKDQAEKEMRFKIFKKN-HEF------------ 45
SRT H +++WM +E + YK E+ R KIF N H+
Sbjct: 19 SRT-HAVSFFELVNQEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKV 77
Query: 46 ---LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKN-LNSSKMSFYDSIDWNER 101
L++NK+ D+ +F+ G+ S R + + + + +DW +
Sbjct: 78 SYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVVLPKKVDWRKE 137
Query: 102 GAVTPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNF 156
GAVTPVKDQG +C CW+F+A +EG + RTG LV+ S+ L+DCS NGC
Sbjct: 138 GAVTPVKDQG-HCGSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGL 196
Query: 157 LENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIR-GYQYVQPATEEGLQ 215
++ AF+YI+ + L +E YPY+ D C + +++ GAI GY + E+ L+
Sbjct: 197 MDQAFQYIKDNKGLDTEASYPYEAEND-KCRYNPANS----GAIDVGYIDIPTGDEKLLK 251
Query: 216 DVVSR-QPVSVAIDATW--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYW 270
V+ PVSVAIDA+ F FY GV+ P C + +HGV ++GYGT E Q YW
Sbjct: 252 AAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYGTN---ENGQDYW 308
Query: 271 LVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
LVKN WG W G +++ R C IA++A+YPL
Sbjct: 309 LVKNSWGETWGNNGYIKMAR--NKLNHCGIASSASYPL 344
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 162 bits (411), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/321 (35%), Positives = 169/321 (52%), Gaps = 44/321 (13%)
Query: 19 MVEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTREKFLAS 62
++E + Y D+ E+ R KIF +N H+ L +NK+AD+ +F
Sbjct: 109 VLEHRKNYLDETEERFRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQL 168
Query: 63 YTGYKPPPTDHPHSNRSNW-FKN---LNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CW 116
G+ T H ++ FK ++ ++ S+DW ++GAVT VKDQG +C CW
Sbjct: 169 MNGFNY--TLHKELRAADESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQG-HCGSCW 225
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
AF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF YI+ + +E
Sbjct: 226 AFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTE 285
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSR-QPVSVAIDATW 231
YPY+ D C + + G GA RG+ + E+ L + V+ PVSVAIDA+
Sbjct: 286 KSYPYEALDD-SCHFNK----GTIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASH 340
Query: 232 --FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
F FY GV+ P + N HGV +VG+GT E Q YWLVKN WGT W + G ++
Sbjct: 341 ESFQFYSEGVYVEPACDAQNLDHGVLVVGFGTD---ESGQDYWLVKNSWGTTWGDKGFIK 397
Query: 288 IFRGVGGSGLCNIAANAAYPL 308
+ R C IA+ ++YPL
Sbjct: 398 MLRNKDNQ--CGIASASSYPL 416
>gi|113120271|gb|ABI30275.1| VS-A [Vasconcellea stipulata]
Length = 318
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/289 (37%), Positives = 146/289 (50%), Gaps = 50/289 (17%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
+ WMVE+ + YKD EK RF+IFK N ++ L L F DLT ++F Y
Sbjct: 49 DSWMVEYDKVYKDIDEKIYRFEIFKDNLKYIDETNKKNNTYWLGLTSFTDLTNDEFKEKY 108
Query: 64 TGYKP---PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
G P T+ P+ + +N SIDW ++GAVTPV++QGS CW F+
Sbjct: 109 VGSIPENWSTTEEPNDKEFIYDDVVNIPA-----SIDWRQKGAVTPVRNQGSCGSCWTFS 163
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYI-------RQYQRLA 171
+VA VEG+NKI TGQLV+ S+ +L+DC + GC F A +Y+ RQY
Sbjct: 164 SVAAVEGINKIVTGQLVSLSEQELLDCERRSYGCRGGFPPYALQYVANSGIHLRQY---- 219
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDAT- 230
YPY+G Q + A G G VQ E+ L ++ QPVS+ ++A
Sbjct: 220 ----YPYEGVQR---QCRAAQAKGPKVKTDGVGRVQRNNEQALIQRIAIQPVSIVVEAKG 272
Query: 231 -WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGT 278
F Y GG+F GPCG + +H V VGYG Y L+KN WGT
Sbjct: 273 RAFQNYRGGIFAGPCGTSIDHAVAAVGYGNG--------YILIKNSWGT 313
>gi|33333696|gb|AAQ11966.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 162 bits (411), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 112/324 (34%), Positives = 166/324 (51%), Gaps = 54/324 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFADLTREKF 59
+Q+ ++ +TY+ E++ RF +F+KN + ++ +FAD+T E+F
Sbjct: 24 QQFKLDHGKTYRSLVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEF 83
Query: 60 L--ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
L G P++ H F N + M D++DW E GAVTPVKDQ + CW
Sbjct: 84 LDLLKLQGVPALPSNAVH------FDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCW 137
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLAS 172
AF+AV +EG + G LV+ S +LVDC+T NGC + AF+++ Q + + +
Sbjct: 138 AFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFV-QDEGIQT 196
Query: 173 ECVYPYQGRQDYYCDWWRSSA--SGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDA 229
E YPY+GR RSS SG Y + YV P E+ + + V ++ PV+VAI+A
Sbjct: 197 EESYPYEGR--------RSSCKKSGDY-VTKVKTYVFPLDEQEMARTVAAKGPVAVAIEA 247
Query: 230 TWFNFYHGGVF--TGPCGNTP---NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+ +FY G+ T C N NHGV +VGYG+ E YW+VKN WG +W E G
Sbjct: 248 SQLSFYDKGIVDETCRCSNKREDLNHGVLVVGYGS----ENGVDYWIVKNSWGADWGEKG 303
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
R+ + V C I YP+
Sbjct: 304 YFRLKKDVKA---CGIGYYNPYPI 324
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 160/320 (50%), Gaps = 25/320 (7%)
Query: 9 GNIAAKHEQWMVEFARTY-KDQAEKEMRFKIFKKNHEF------------LRLNKFADLT 55
N ++W +R+Y D AE E RFK++ +N E+ L LN ADL+
Sbjct: 7 ANPLGAFKEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNARTTSHWLTLNHLADLS 66
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
++ + G+ + ++ F+ + + +IDW ++ AV VK+QG
Sbjct: 67 TPEYKSKLLGFDNQARVARNKLKTG-FRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGS 125
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLAS 172
CWAF +VEG+N I TG LV+ S+ +LVDC T GC+ ++ A+ +I + + + +
Sbjct: 126 CWAFATTGSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKGINT 185
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI--DAT 230
E YPY D CD + + I Y+ V E L+ + QPV+VAI DA
Sbjct: 186 EEDYPYTA-MDGQCD--VAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAK 242
Query: 231 WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y GGV+ P CG + NHGV +VGYG G YW+VKN WG W + G +R+
Sbjct: 243 SFQLYGGGVYDDPTCGTSLNHGVLVVGYGKDVTGSGSN-YWIVKNSWGAEWGDAGYIRLK 301
Query: 290 RG-VGGSGLCNIAANAAYPL 308
G GLC IA +YP+
Sbjct: 302 MGSTDAEGLCGIAMAPSYPV 321
>gi|33333694|gb|AAQ11965.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 164/324 (50%), Gaps = 54/324 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFADLTREKF 59
+Q+ ++ +TY+ E++ RF +F+KN + ++ +FAD+T E+F
Sbjct: 24 QQFKLDHGKTYRSVVEEKRRFSVFQKNLIDIQEHNKKYERGEVSFAKKVTQFADMTHEEF 83
Query: 60 L--ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
L G P++ H F N + M D++DW E GAVTPVKDQ + CW
Sbjct: 84 LDLLKLQGVPALPSNAVH------FDNFEDTDMEEKDAVDWREEGAVTPVKDQANCGSCW 137
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLAS 172
AF+AV +EG + G LV+ S +LVDC+T NGC + AF+++ Q + + +
Sbjct: 138 AFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEEYGNNGCRGGLMGQAFDFV-QDEGIQT 196
Query: 173 ECVYPYQGRQDYYCDWWRSSA--SGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDA 229
E YPY+GR RSS SG Y + YV P E+ + + V ++ PV+VAI+A
Sbjct: 197 EESYPYEGR--------RSSCKKSGDY-VTKVKTYVFPLDEQEMARTVAAKGPVAVAIEA 247
Query: 230 TWFNFYHGGVFTGPC-----GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+ +FY G+ C NHGV +VGYG+ E YW+VKN WG +W E G
Sbjct: 248 SQLSFYDKGIVDEKCRCSNKREDLNHGVLVVGYGS----ENGVDYWIVKNSWGADWGEKG 303
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
R+ + V C I YP+
Sbjct: 304 YFRLKKDVKA---CGIDYYNTYPI 324
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 105/259 (40%), Positives = 152/259 (58%), Gaps = 32/259 (12%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LR 47
M+RT + ++ +HEQWM +AR YKD EK+MR+KIFK+N + L
Sbjct: 26 MARTLQE-ASMYERHEQWMASYARVYKDANEKQMRYKIFKENVQRIDSFNSESDKSYKLA 84
Query: 48 LNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
+N+FADLT E+F + G+K H S ++ F+ N + + SIDW ++GAVT +
Sbjct: 85 VNQFADLTNEEFKSLRNGFK----GHMCSAQAGHFRYENVTAVP--ASIDWRKKGAVTQI 138
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEY 163
K+QG CWAF+AVA VEG+ +I+TG+L++ S+ +LVDC T + GC +++AF++
Sbjct: 139 KEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQELVDCDTNSEDQGCQGGLMDDAFKF 198
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQP 222
I Q+ LASE YPY D C ++ K A I GY+ V E L++ V+ QP
Sbjct: 199 IEQHG-LASEATYPYDA-ADSTC---KTKEEAKPSAKITGYEDVPANDEAALKNAVANQP 253
Query: 223 VSVAIDAT--WFNFYHGGV 239
VSVAIDA F FY G+
Sbjct: 254 VSVAIDAGGFEFQFYSSGI 272
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/325 (34%), Positives = 165/325 (50%), Gaps = 41/325 (12%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W +E ++ Y + E + R KI+ K N F LR NK+AD+
Sbjct: 25 EEWSAFKLEHSKRYDSEVEDKFRMKIYLENKHRIAKHNQRFEQGAVSYKLRPNKYADMLS 84
Query: 57 EKFLASYTGY----KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F+ G+ K P H S + + +++ D +DW ++GAVT VKDQG
Sbjct: 85 HEFVHVMNGFNKTLKHPKAVHGKGRESRPATFIAPAHVTYPDHVDWRKKGAVTEVKDQGK 144
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQ 168
CWAF+ +EG + +TG LV+ S+ L+DCS NGC ++NAF+YI+
Sbjct: 145 CGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNG 204
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAI 227
+ +E YPY+G D R +A G+ + EE L Q V + PVSVAI
Sbjct: 205 GIDTEKAYPYEGVDDK----CRYNAKNSGADDVGFVDIPQGDEEKLMQAVATVGPVSVAI 260
Query: 228 DATW--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
DA+ F FY GV+ C +T +HGV +VGYGT + YWLVKN WG W +
Sbjct: 261 DASQESFQFYSDGVYYDENCSSTDLDHGVMVVGYGTDEQG---GDYWLVKNSWGRTWGDL 317
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G +++ R + C IA++A+YPL
Sbjct: 318 GYIKMAR--NKNNHCGIASSASYPL 340
>gi|33333712|gb|AAQ11974.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 162 bits (410), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 164/324 (50%), Gaps = 54/324 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFADLTREKF 59
+Q+ ++ +TY+ E++ RF +F+KN + ++ +FAD+T E+F
Sbjct: 24 QQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEF 83
Query: 60 L--ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
L G P++ H F N M D++DW E GAVTPVKDQ + CW
Sbjct: 84 LDLLKLQGVPALPSNAVH------FDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCW 137
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLAS 172
AF+AV +EG + G LV+ S +LVDC+T NGC + AF+++ Q + + +
Sbjct: 138 AFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFV-QDEGIQT 196
Query: 173 ECVYPYQGRQDYYCDWWRSSA--SGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDA 229
E YPY+GR RSS SG+Y + YV P E+ + + V ++ PV+VAI+A
Sbjct: 197 EESYPYEGR--------RSSCKKSGEY-VTKVKTYVFPLDEQEMARTVAAKGPVAVAIEA 247
Query: 230 TWFNFYHGGVFTGPC-----GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+ +FY G+ C NHGV +VGYG+ E YW+VKN WG +W E G
Sbjct: 248 SQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGS----ENGVDYWIVKNSWGADWGEKG 303
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
R+ + V C I YP+
Sbjct: 304 YFRLKKDVKA---CGIGYYNTYPI 324
>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
sativus]
Length = 235
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 95/227 (41%), Positives = 132/227 (58%), Gaps = 14/227 (6%)
Query: 89 KMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC- 146
K + +++DW ++GAV +K+QG+ CWAF+ A VEG+NKI TG+L++ S+ +LVDC
Sbjct: 1 KEALPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDCD 60
Query: 147 -STLNGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQY 205
S GC ++ AF++I + L +E YPY+G D C+ ++ K I GY+
Sbjct: 61 KSYNQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRG-SDGKCNSLLKNS--KVVTIDGYED 117
Query: 206 VQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEA 263
V E L+ VS QPVSVAIDA F Y G+FTG CG +H V VGYG+
Sbjct: 118 VPTNDETALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYGS---- 173
Query: 264 EGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG--SGLCNIAANAAYPL 308
E YW+V+N WG W E G +RI R + SG C IA A+YP+
Sbjct: 174 ENGVDYWIVRNSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPV 220
>gi|33333706|gb|AAQ11971.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/324 (33%), Positives = 164/324 (50%), Gaps = 54/324 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFADLTREKF 59
+Q+ ++ +TY+ E++ RF +F+KN + ++ +FAD+T E+F
Sbjct: 24 QQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEF 83
Query: 60 L--ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
L G P++ H F N M D++DW E GAVTPVKDQ + CW
Sbjct: 84 LDLLKLQGVPALPSNAVH------FDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCW 137
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLAS 172
AF+AV +EG + G LV+ S +LVDC+T NGC + AF+++ Q + + +
Sbjct: 138 AFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFV-QDEGIQT 196
Query: 173 ECVYPYQGRQDYYCDWWRSSA--SGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDA 229
E YPY+GR RSS SG+Y + YV P E+ + + V ++ PV+VAI+A
Sbjct: 197 EESYPYEGR--------RSSCKKSGEY-VTKVKTYVFPLDEQEMARTVAAKGPVAVAIEA 247
Query: 230 TWFNFYHGGVFTGPC-----GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+ +FY G+ C NHGV +VGYG+ E YW+VKN WG +W E G
Sbjct: 248 SQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGS----ENGVDYWIVKNSWGADWGEKG 303
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
R+ + V C I YP+
Sbjct: 304 YFRLKKDVKA---CGIGYYNTYPI 324
>gi|414875906|tpg|DAA53037.1| TPA: hypothetical protein ZEAMMB73_586844 [Zea mays]
Length = 1039
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 89/199 (44%), Positives = 120/199 (60%), Gaps = 12/199 (6%)
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLAS 172
CWAF+ +A VEG+N+I TG L++ S+ +LVDC T GC ++ AFE+I + +
Sbjct: 715 CWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDT 774
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--T 230
E YPY+G D CD R +A K I Y+ V E+ LQ V+ QPVSVAI+A T
Sbjct: 775 EKDYPYKG-TDGRCDVNRKNA--KVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGT 831
Query: 231 WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F Y G+FTG CG +HGVT+VGYGT E + YW++KN WG++W E G +R+ R
Sbjct: 832 TFQLYSSGIFTGSCGTALDHGVTVVGYGT----ENGKDYWIMKNSWGSSWGESGYVRMER 887
Query: 291 GV-GGSGLCNIAANAAYPL 308
+ SG C IA +YPL
Sbjct: 888 NIKASSGKCGIAVEPSYPL 906
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 97/252 (38%), Positives = 137/252 (54%), Gaps = 28/252 (11%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKFL 60
+WM RTY E+E RF++F+ N H F L LN+FADLT +++
Sbjct: 48 EWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHSFRLGLNRFADLTNDEYR 107
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A+Y G + P R + L +S+DW +GAV VKDQGS CWAF+
Sbjct: 108 ATYLGVRS----RPQRERRLGDRYLAGDNEDLPESVDWRAKGAVAEVKDQGSCGSCWAFS 163
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKNFLENAFEYIRQYQRLASECVYP 177
+A VEG+N+I TG +++ S+ +LVDC T GC ++ AFE+I + +E YP
Sbjct: 164 TIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEEDYP 223
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFY 235
Y+G D CD R +A K I Y+ V +E+ LQ V+ QP+SVAI+A F Y
Sbjct: 224 YKG-TDGRCDVNRKNA--KVVTIDSYEDVPANSEKSLQKAVANQPISVAIEAGGRAFQLY 280
Query: 236 HGGVFTGPCGNT 247
+ G+FTG CGN+
Sbjct: 281 NSGIFTGTCGNS 292
>gi|340370270|ref|XP_003383669.1| PREDICTED: cathepsin L1-like [Amphimedon queenslandica]
Length = 326
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/319 (34%), Positives = 171/319 (53%), Gaps = 39/319 (12%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR--------------LNKFADLTRE 57
A + + W V++ + Y+ + + R I++ N +F+ +N+FADL
Sbjct: 20 AQEFQDWKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAG 79
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F Y G P P + N + FK + +S D++DW E+GAVT VK+QG CW
Sbjct: 80 EFANIYNGLLPRPASY---NSTKLFKK---TGVSVGDTVDWREKGAVTEVKNQGKCGSCW 133
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
+F++ ++EG + ++TG L + S+ QL+DCST +GC ++N+F Y+ SE
Sbjct: 134 SFSSTGSLEGQHFLKTGTLSSLSEQQLMDCSTSFGNHGCKGGLMDNSFRYLETVAGDMSE 193
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW- 231
+YPY +D +C + S A K GY+ + E+ L++ V+ P+SVAIDA
Sbjct: 194 EMYPYTA-EDGFCRYRSSEAIAK---DTGYKDIPRGDEDALKEAVATVGPISVAIDAGHR 249
Query: 232 -FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
F YH G++ P C +T +HGV VGYGT EG++ YWLVKN WG +W G + +
Sbjct: 250 SFQLYHEGIYYEPACSSTKLDHGVLAVGYGT---GEGEE-YWLVKNSWGPSWGNEGYVMM 305
Query: 289 FRGVGGSGLCNIAANAAYP 307
R + C IA A+YP
Sbjct: 306 SRNRENN--CGIATQASYP 322
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 162 bits (409), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 113/327 (34%), Positives = 171/327 (52%), Gaps = 47/327 (14%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTR 56
E+W +E + Y+D+ E+ R KIF +N H+ + +NK+AD+
Sbjct: 27 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLH 86
Query: 57 EKFLASYTGYKPPPTDHPH-SNRSNWFKN---LNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F ++ G+ T H N FK ++ ++ +DW +GAVT VKDQG
Sbjct: 87 HEFYSTMNGFNY--TLHKQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQG- 143
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CWAF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF YI+
Sbjct: 144 HCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 203
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSR-QPVSV 225
+ +E YPY+ D C + + G GA RG+ + E+ + + V+ PV+V
Sbjct: 204 GGIDTEKSYPYEAIDD-SCHFNK----GTIGATDRGFVDIPQGNEKKMAEAVATIGPVAV 258
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
AIDA+ F FY GV+ P + N HGV +VG+GT E Q YWLVKN WGT W
Sbjct: 259 AIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGTD---ESGQDYWLVKNSWGTTWG 315
Query: 282 EGGSMRIFRGVGGSGLCNIAANAAYPL 308
+ G +++ R C IA+ ++YPL
Sbjct: 316 DKGFIKMLR--NKENQCGIASASSYPL 340
>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 117/347 (33%), Positives = 166/347 (47%), Gaps = 65/347 (18%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREK 58
+ + ++W+ Y+D+ E E+RF I++ N E+ L NKFADLT E+
Sbjct: 1 MKVRFDRWLKXNGXNYEDKEEWEIRFVIYQANVEYIGCKKSQKNSYNLTDNKFADLTNEE 60
Query: 59 FLASYTGYKPPPTDHPHSN-RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS----- 112
F+++Y G+ PH+ + + NL SK DW + GAVT +KDQG+
Sbjct: 61 FVSTYLGFATRLI--PHTRFKYHEHGNLPXSK-------DWRKEGAVTDIKDQGNCGKHS 111
Query: 113 -------------------------YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS 147
WAF+ VA VE +NKI++G+LV+ S+ +LVD
Sbjct: 112 TWFSPEISHNLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYD 171
Query: 148 TLN---GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQ 204
N GC ++ F +I++ L + YPY+G D C+ + A I GY+
Sbjct: 172 VANKNQGCEGGLMDTTFAFIKKNGGLTTSKDYPYEG-VDGSCN--KEKALHHAVNISGYE 228
Query: 205 YVQPATEEGLQDVVSRQPVSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTE 262
E L+ + QP+SVAIDA F Y GVF+G CG NHGVTIVGY T
Sbjct: 229 RAPSKDEAMLKVAAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGT- 287
Query: 263 AEGQQPYWLVKNRWGTNWDEGGSMRIFR-GVGGSGLCNIAANAAYPL 308
Y VKN G +W E G +R+ R +G C IA A+YPL
Sbjct: 288 ---FDKYRTVKNSXGADWGESGYIRMKRDAFDKAGTCGIAMKASYPL 331
>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
Length = 289
Score = 162 bits (409), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 93/221 (42%), Positives = 129/221 (58%), Gaps = 13/221 (5%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--N 150
+S+DW + GAV VKDQGS CWAF+ + VEG+NKI TG L++ S+ +LVDC T
Sbjct: 5 ESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQ 64
Query: 151 GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT 210
GC ++ AFE+I + + +E YPY+ D CD R +A K I Y+ V
Sbjct: 65 GCNGGLMDYAFEFIIKNGGIDTEEDYPYKA-ADGRCDQNRKNA--KVVTIDAYEDVPENN 121
Query: 211 EEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQP 268
E L+ ++ QP+SVAI+A F Y GVF G CG +HGV VGYGT E +
Sbjct: 122 EAALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGT----ENGKD 177
Query: 269 YWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
YW+V+N WG +W E G +++ R + +G C IA A+YP+
Sbjct: 178 YWIVRNSWGGSWGESGYIKMARNIAEATGKCGIAMEASYPI 218
>gi|13365804|dbj|BAB39242.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|14164527|dbj|BAB55776.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 357
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 119/326 (36%), Positives = 162/326 (49%), Gaps = 49/326 (15%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLAS 62
E+WM +F +TYK EKE RF +F+ N F+R +N+FADLT +F+A+
Sbjct: 45 EEWMAKFGKTYKCHGEKEHRFAVFRDNVRFIRSYRPEATYDSAVRINQFADLTNGEFVAT 104
Query: 63 YTGYKPPPT---------DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
YTG K PP + P W IDW +GAVT VKDQG+
Sbjct: 105 YTGVKQPPPATHPHPHPEEAPRPVDPIWMPC----------CIDWRFKGAVTGVKDQGAC 154
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFL----ENAFEYIRQYQ 168
WAF AVA +EGL KIRTGQL S+ +LVDC G + + AF+ +
Sbjct: 155 GSSWAFAAVAAMEGLMKIRTGQLTPLSEQELVDCVDGGGDSDGCGGGHTDAAFQLVVDKG 214
Query: 169 RLASECVYPYQG-RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
+ +E Y Y+G + D + + + G GY+ V PA E L V+RQPV+ +
Sbjct: 215 GITAESEYRYEGYKGRCRVDDMLFNHAARVG---GYRAVPPADERQLATAVARQPVTAYV 271
Query: 228 DAT--WFNFYHGGVFTGPCGNT---PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
DA+ F FY GVF GP G PNH VT+VGY + + YW+ KN WG W +
Sbjct: 272 DASGPAFQFYGSGVFPGPRGTAAPKPNHAVTLVGY--CQDGASGKKYWIAKNSWGKTWGQ 329
Query: 283 GGSMRIFRGVGGS-GLCNIAANAAYP 307
G + + + V G C +A + YP
Sbjct: 330 QGYILLEKDVASPHGTCGLAVSPFYP 355
>gi|45822207|emb|CAE47500.1| cathepsin L-like proteinase [Diabrotica virgifera virgifera]
Length = 326
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 111/317 (35%), Positives = 165/317 (52%), Gaps = 43/317 (13%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFK------KNHEF----------LRLNKFADLTREKFL 60
Q+ V ++Y++ E++ RF IF+ +NH L + KFADLT ++F
Sbjct: 25 QFKVRNNKSYRNYIEEQKRFTIFQGSLRKIENHNDKYDHGLSTFKLGVTKFADLTEKEFS 84
Query: 61 ASYTGYKPPPTDHPHSNRS-NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ + P S K+L S DW E+GAVT VKDQGS CW+F
Sbjct: 85 DMLGISRSTKSSRPRVIHSLTPVKDLPSK-------FDWREKGAVTEVKDQGSCGSCWSF 137
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVY 176
+ TVEG ++TG+LV+ S+ LVDC+ + GC+ +++ A EYI + SE Y
Sbjct: 138 STTGTVEGAYFLKTGKLVSLSEQNLVDCAKEDCYGCSGGYMDKALEYIETAGGIMSENDY 197
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQD-VVSRQPVSVAIDATW-FNF 234
PY+G D C + S + K I + Y++ E+ L++ V+++ P+SVAIDA++ F
Sbjct: 198 PYEGIDD-KCRFDSSKVAAK---ISNFTYIKKNDEDDLKNAVIAKGPISVAIDASFNFQL 253
Query: 235 YHGGVFTGPCG----NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
Y G+ N+ NHGV +VGYGT E +Q YW+VKN WG +W G + + R
Sbjct: 254 YDSGILDDSSCYSDFNSLNHGVLVVGYGT----EKEQDYWIVKNSWGADWGMDGYIWMSR 309
Query: 291 GVGGSGLCNIAANAAYP 307
C IA +A YP
Sbjct: 310 NKNNQ--CGIATDATYP 324
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 104/314 (33%), Positives = 153/314 (48%), Gaps = 37/314 (11%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLR---------------LNKFADLTREKFLAS 62
W + + YK E E R FK+N +++ LNKFADL+ E+F
Sbjct: 53 WKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNKFADLSNEEFREM 112
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
Y P + + ++ S+DW +G VT VKDQG CW+F+
Sbjct: 113 YLSKVKKPITIEEKRKHRHLQTCDAPS-----SLDWRNKGVVTAVKDQGDCGSCWSFSTT 167
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+E +N I TG L++ S+ +LVDC T N GC +++AF+++ + +E YPY
Sbjct: 168 GAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGNGGIDTEADYPYT 227
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YHG 237
G D C+ + K +I GY V P ++ L +QP+SV +D + +F Y G
Sbjct: 228 G-VDGTCNTAKEEK--KVVSIEGYVDVDP-SDSALLCATVQQPISVGMDGSALDFQLYTG 283
Query: 238 GVFTGPCGNTPN---HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
G++ G C PN H + IVGYG+ E + YW+VKN WGT W G I R
Sbjct: 284 GIYDGDCSGDPNDIDHAILIVGYGS----ENDEDYWIVKNSWGTEWGMEGYFYIRRNTSK 339
Query: 295 S-GLCNIAANAAYP 307
G+C I A+A+YP
Sbjct: 340 PYGVCAINADASYP 353
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 171/326 (52%), Gaps = 45/326 (13%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W ++ + Y + E+ +R KI+ K N F LR+NK+ADL
Sbjct: 25 EEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLH 84
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKN----LNSSKMSFYDSIDWNERGAVTPVKDQGS 112
E+F+ + G+ TD S + + + + + ++DW ++GAVTPVKDQG
Sbjct: 85 EEFVQTVNGFNR--TDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQG- 141
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CW+F+A +EG + +TG+LV+ S+ LVDCS NGC ++ AF+YI+
Sbjct: 142 HCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDN 201
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVA 226
+ +E YPY+ D C + A G +GY + EE L+ ++ PVS+A
Sbjct: 202 GGIDTEKSYPYEAIDD-TCH-FNPKAVG--ATDKGYVDIPQGDEEALKKALATVGPVSIA 257
Query: 227 IDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
IDA+ F FY GV+ P ++ N HGV VGYGT+ E E YWLVKN WGT W +
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGE---DYWLVKNSWGTTWGD 314
Query: 283 GGSMRIFRGVGGSGLCNIAANAAYPL 308
G +++ R C +A A+YPL
Sbjct: 315 QGYVKMARNRDNH--CGVATCASYPL 338
>gi|150261413|pdb|2PNS|A Chain A, 1.9 Angstrom Resolution Crystal Structure Of A Plant
Cysteine Protease Ervatamin-C Refinement With Cdna
Derived Amino Acid Sequence
gi|150261414|pdb|2PNS|B Chain B, 1.9 Angstrom Resolution Crystal Structure Of A Plant
Cysteine Protease Ervatamin-C Refinement With Cdna
Derived Amino Acid Sequence
gi|166007115|pdb|2PRE|A Chain A, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
Complexed With Irreversible Inhibitor E-64 At 2.7 A
Resolution
gi|166007116|pdb|2PRE|B Chain B, Crystal Structure Of Plant Cysteine Protease Ervatamin-C
Complexed With Irreversible Inhibitor E-64 At 2.7 A
Resolution
Length = 208
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 96/218 (44%), Positives = 125/218 (57%), Gaps = 19/218 (8%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-G 151
+ IDW ++GAVTPVK+QG CWAF+ V+TVE +N+IRTG L++ S+ QLVDC+ N G
Sbjct: 3 EQIDWRKKGAVTPVKNQGKCGSCWAFSTVSTVESINQIRTGNLISLSEQQLVDCNKKNHG 62
Query: 152 CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATE 211
C A++YI + +E YPY+ Q A+ K I GY+ V E
Sbjct: 63 CKGGAFVYAYQYIIDNGGIDTEANYPYKAVQG------PCRAAKKVVRIDGYKGVPHCNE 116
Query: 212 EGLQDVVSRQPVSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
L+ V+ QP VAIDA+ F Y G+F+GPCG NHGV IVGY + Y
Sbjct: 117 NALKKAVASQPSVVAIDASSKQFQHYKSGIFSGPCGTKLNHGVVIVGY--------WKDY 168
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
W+V+N WG W E G +R+ R VGG GLC IA YP
Sbjct: 169 WIVRNSWGRYWGEQGYIRMKR-VGGCGLCGIARLPYYP 205
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 115/322 (35%), Positives = 162/322 (50%), Gaps = 48/322 (14%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEFLR-LNKFADLTR 56
A + + ++ +TYK+QAE+ RF IF++N H + + +NKFAD+TR
Sbjct: 24 AHFQSFKLKHGKTYKNQAEETKRFAIFRENLRKIEAHNAEYKQGIHSYTQGINKFADMTR 83
Query: 57 EKF---LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
+F LA+ KP ++ +S +SIDW R VTP+KDQ
Sbjct: 84 AEFKAMLATQVKTKPSIVATKTFQLAD--------GVSVPESIDWRSRNVVTPIKDQAQ- 134
Query: 114 C--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LN-GCAKNFLENAFEYIRQYQR 169
C CWAF V + EG + TG+L S+ QLVDC+T LN GC +L++ F YI Q
Sbjct: 135 CGSCWAFAVVGSTEGAYALSTGKLTRFSEQQLVDCTTDLNYGCDGGYLDDTFPYI-QTNG 193
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVV-SRQPVSVAID 228
L E YPY G D YC + S K + Y V PA E+ L + V + PV++AI+
Sbjct: 194 LELESDYPYTGY-DGYCSYESSKVVTK---VSSYVSV-PANEQALLEAVGTAGPVAIAIN 248
Query: 229 ATWFNFYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
A FY G+ + +HGV VGY ++E + YWL+KN WG +W E G
Sbjct: 249 ADDLQFYFSGIIDDKYCDPEYLDHGVLAVGY----DSENGRDYWLIKNSWGADWGESGYF 304
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
R R G +C + +A YPL
Sbjct: 305 RFLR---GQNICGVKEDAVYPL 323
>gi|5381317|gb|AAD42940.1|AF091366_1 cryptopain precursor [Cryptosporidium parvum]
Length = 401
Score = 161 bits (408), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 104/319 (32%), Positives = 163/319 (51%), Gaps = 35/319 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E++ ++ + Y E+ RF+I+K+N F L +N+F DL++E+F+A +
Sbjct: 87 EEFKKKYHKVYSSMEEENQRFEIYKQNMNFIKTTNSQGFSYVLEMNEFGDLSKEEFMARF 146
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFY--DSIDWNERGAVTPVKDQGSY-CCWAFTA 120
TGY D +S+ + + S+ F +SI+W E G V P+++Q + CWAF+A
Sbjct: 147 TGYIKDSKDDERVFKSSRV-SASESEEEFVPPNSINWVEAGCVNPIRNQKNCGSCWAFSA 205
Query: 121 VATVEGLNKIRTGQ-LVTRSKHQLVDCSTLNG---CAKNFLENAFEYIRQYQRLASECVY 176
VA +EG +T + L + S+ Q VDCS NG C + AF+Y + + L + Y
Sbjct: 206 VAALEGATCAQTNRGLPSLSEQQFVDCSKQNGNFGCDGGTMGLAFQYAIKNKYLCTNDDY 265
Query: 177 PYQGRQ----DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAI--DA 229
PY + D +C+ + ++ Y+YV P L+ +++ P+SVAI D
Sbjct: 266 PYFAEEKTCMDSFCENYIEIP------VKAYKYVFPRNINALKTALAKYGPISVAIQADQ 319
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
T F FY GVF PCG NHGV +V Y + + + YWLV+N WG W E G +++
Sbjct: 320 TPFQFYKSGVFDAPCGTKVNHGVVLVEYD--MDEDTNKEYWLVRNSWGEAWGEKGYIKLA 377
Query: 290 RGVGGSGLCNIAANAAYPL 308
G G C I YP+
Sbjct: 378 LHSGKKGTCGILVEPVYPV 396
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 171/326 (52%), Gaps = 45/326 (13%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W ++ + Y + E+ +R KI+ K N F LR+NK+ADL
Sbjct: 25 EEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLH 84
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKN----LNSSKMSFYDSIDWNERGAVTPVKDQGS 112
E+F+ + G+ TD S + + + + + ++DW ++GAVTPVKDQG
Sbjct: 85 EEFVQTVNGFNR--TDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQG- 141
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CW+F+A +EG + +TG+LV+ S+ LVDCS NGC ++ AF+YI+
Sbjct: 142 HCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDN 201
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVA 226
+ +E YPY+ D C + A G +GY + EE L+ ++ PVS+A
Sbjct: 202 GGIDTEKSYPYEAIDD-TCH-FNPKAVG--ATDKGYVDIPQGDEEALKKALATVGPVSIA 257
Query: 227 IDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
IDA+ F FY GV+ P ++ N HGV VGYGT+ E E YWLVKN WGT W +
Sbjct: 258 IDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEEGE---DYWLVKNSWGTTWGD 314
Query: 283 GGSMRIFRGVGGSGLCNIAANAAYPL 308
G +++ R C +A A+YPL
Sbjct: 315 QGYVKMAR--NHDNHCGVATCASYPL 338
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 110/312 (35%), Positives = 161/312 (51%), Gaps = 43/312 (13%)
Query: 23 ARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLASYTGY 66
+ Y++Q E+ R K+F N + +++N DL +F A G+
Sbjct: 21 GKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMNHLGDLMVHEFKALMNGF 80
Query: 67 KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVATV 124
K P N K S + S+DW +RGAVTPVKDQG +C CW+F+A ++
Sbjct: 81 KKTP------NAERNGKIYVPSNENLPKSVDWRQRGAVTPVKDQG-HCGSCWSFSATGSL 133
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
EG ++TG+LV+ S+ LVDCS +GC + AF+Y+R + + +E YPY+ R
Sbjct: 134 EGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYVRDNKGIDTEASYPYEAR 193
Query: 182 QDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--FNFYHGG 238
++ C + G +GY + A+E+ LQ V+ P+SV IDA+ F FY G
Sbjct: 194 EN-NCRFKEDKVG---GTDKGYVDILEASEKDLQSAVATVGPISVRIDASHESFQFYSEG 249
Query: 239 VFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSG 296
V+ C + +HGV VGYGT E Q YWLVKN WG +W E G ++I R
Sbjct: 250 VYKEQYCSPSQLDHGVLTVGYGT----ENGQDYWLVKNSWGPSWGESGYIKIAR--NHKN 303
Query: 297 LCNIAANAAYPL 308
C IA+ A+YP+
Sbjct: 304 HCGIASMASYPV 315
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 109/312 (34%), Positives = 168/312 (53%), Gaps = 37/312 (11%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASYT 64
+WM + +++Y ++ E R+ ++++N + FL +NKF DLT +F +
Sbjct: 32 EWMRDNSKSYSNE-EFVFRWNVWRENQQLIEEHNRSNKTSFLAMNKFGDLTNAEFNKLFK 90
Query: 65 GYKPPPTDHP-HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G D+ H+N++ K + + +S DW ++GAVT VK+QG CW+F+
Sbjct: 91 GL---AFDYSFHANKAAAEKAVPAPGLS--ADFDWRQKGAVTHVKNQGQCGSCWSFSTTG 145
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+ EG N ++TG+L + S+ L+DCS NGC ++ AFEYI + + +E YPYQ
Sbjct: 146 STEGANFLKTGRLTSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGIDTEASYPYQ 205
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
Q Y C + + + G++ Y V E L + V+ +P SVAIDA+ F FY G
Sbjct: 206 TAQ-YTCQY---NPANSGGSLTSYTDVSSGDENALLNAVATEPTSVAIDASHNSFQFYSG 261
Query: 238 GV-FTGPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
GV + C +T +HGV VG+GT E Q YWLVKN WG +W G +++ R S
Sbjct: 262 GVYYESACSSTQLDHGVLAVGWGT----EDGQDYWLVKNSWGADWGLAGYIKMARNR--S 315
Query: 296 GLCNIAANAAYP 307
C IA +A+YP
Sbjct: 316 NNCGIATSASYP 327
>gi|355681653|gb|AER96814.1| cathepsin K [Mustela putorius furo]
Length = 329
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 166/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E W + + Y ++ ++ R I++KN H + L +N D+T E+
Sbjct: 28 ELWKKTYGKQYNNKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEV 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ H SN S + + S DSID+ ++G VTPVK+QG CWAF
Sbjct: 88 VQKMTGLKVPPS-HSRSNDSLYIPDWESRAP---DSIDYRKKGYVTPVKNQGQCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 144 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 203
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK +GY+ + E+ L+ V+R P+SVAIDA T F F
Sbjct: 204 YVG-QDESCMY---NPTGKAAKCKGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQF 259
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + +W++KN WG NW G + + R
Sbjct: 260 YSKGVYYDENCNSDNLNHAVLAVGYGV----QKGNKHWIIKNSWGENWGNKGYILMARNK 315
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 316 NNA--CGIANLASFP 328
>gi|297596679|ref|NP_001042926.2| Os01g0330200 [Oryza sativa Japonica Group]
gi|125570198|gb|EAZ11713.1| hypothetical protein OsJ_01575 [Oryza sativa Japonica Group]
gi|255673185|dbj|BAF04840.2| Os01g0330200 [Oryza sativa Japonica Group]
Length = 337
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 118/326 (36%), Positives = 161/326 (49%), Gaps = 49/326 (15%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLAS 62
E+WM +F +TYK EKE RF +F+ N F+R +N+FADLT +F+A+
Sbjct: 25 EEWMAKFGKTYKCHGEKEHRFAVFRDNVRFIRSYRPEATYDSAVRINQFADLTNGEFVAT 84
Query: 63 YTGYKPPPT---------DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
YTG K PP + P W IDW +GAVT VKDQG+
Sbjct: 85 YTGVKQPPPATHPHPHPEEAPRPVDPIWMPCC----------IDWRFKGAVTGVKDQGAC 134
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLV----DCSTLNGCAKNFLENAFEYIRQYQ 168
WAF AVA +EGL KIRTGQL S+ +LV +GC + AF+ +
Sbjct: 135 GSSWAFAAVAAMEGLMKIRTGQLTPLSEQELVDCVDGGGDSDGCGGGHTDAAFQLVVDKG 194
Query: 169 RLASECVYPYQG-RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAI 227
+ +E Y Y+G + D + + + G GY+ V PA E L V+RQPV+ +
Sbjct: 195 GITAESEYRYEGYKGRCRVDDMLFNHAARVG---GYRAVPPADERQLATAVARQPVTAYV 251
Query: 228 DAT--WFNFYHGGVFTGPCGNT---PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
DA+ F FY GVF GP G PNH VT+VGY + + YW+ KN WG W +
Sbjct: 252 DASGPAFQFYGSGVFPGPRGTAAPKPNHAVTLVGY--CQDGASGKKYWIAKNSWGKTWGQ 309
Query: 283 GGSMRIFRGVGGS-GLCNIAANAAYP 307
G + + + V G C +A + YP
Sbjct: 310 QGYILLEKDVASPHGTCGLAVSPFYP 335
>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
Length = 319
Score = 161 bits (407), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 112/291 (38%), Positives = 150/291 (51%), Gaps = 37/291 (12%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRL-------------NKFADLTREKFLASYT 64
+M ++++ Y AE RF FK + E +RL N+FADL+ E+F Y
Sbjct: 45 FMKQYSKAYS-HAEFSSRFNQFKASVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
G K + SN NL+ + SIDW AVTP+KDQG CWAF+A +
Sbjct: 104 GCKHVEREFARSN------NLHQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGS 157
Query: 124 VEGLNKIRTGQ-LVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+EG ++ L + S+ QLVDCST GC ++ AFEYI + + +E YPY+
Sbjct: 158 IEGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYK 217
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATE-EGLQDVVSRQPVSVAIDA--TWFNFYH 236
G C + K I G++ V E L V + PVSVAI+A F FY
Sbjct: 218 GVGGL-CQ----KSCTKVVTISGHKDVASGDEASSLNAVGTVGPVSVAIEADQAGFQFYS 272
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR 287
GVF+G CG+ +HGV VGYGTT G Q YW+VKN WGT+W E G +R
Sbjct: 273 SGVFSGTCGHNLDHGVLAVGYGTT----GSQDYWIVKNSWGTSWGESGYIR 319
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 112/327 (34%), Positives = 171/327 (52%), Gaps = 46/327 (14%)
Query: 15 HEQWM---VEFARTYKDQAEKEMRFKIFKKN----------HEF------LRLNKFADLT 55
+++W+ +E + YK +AE+ +R KI+ KN +E L++NK+ D+
Sbjct: 25 NQEWINFKMEHKKCYKHEAEERLRMKIYMKNKLQIAQHNCDYELKKVTYRLKINKYGDML 84
Query: 56 REKFLASYTGYKPPPTDHPHSNRSNWFKN----LNSSKMSFYDSIDWNERGAVTPVKDQG 111
+F GY H+ R+ + + +DW + GAVT VKDQG
Sbjct: 85 NHEFKNMLNGYNRTIN---HTLRNERLPVGAAFIEPCNVELPKMVDWRKCGAVTEVKDQG 141
Query: 112 SYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQ 166
+C CWAF+A ++EG + RTG LV+ S+ L+DCS NGC ++ AF YI+
Sbjct: 142 -HCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDCSGSYGNNGCNGGLMDQAFSYIKD 200
Query: 167 YQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSV 225
+ L +E YPY+G D RSS + G + + E+ L+ V+ PVSV
Sbjct: 201 NKGLDTEKTYPYEGEDDKCRYDKRSSGASDVGFVD----IPVGDEQKLKAAVATVGPVSV 256
Query: 226 AIDATW--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
AIDA+ F FY G++ P C +T +HGV +VGYGT E + YW+VKN WG +W
Sbjct: 257 AIDASHQSFQFYSDGIYFEPECSSTNLDHGVLVVGYGTDEEG---RDYWIVKNSWGESWG 313
Query: 282 EGGSMRIFRGVGGSGLCNIAANAAYPL 308
E G +++ R + C IA++A+YP+
Sbjct: 314 EKGYIKMARNIDNH--CGIASSASYPI 338
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 116/327 (35%), Positives = 171/327 (52%), Gaps = 47/327 (14%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W +E + Y+D+ E+ R KIF K N F L +NK+ADL
Sbjct: 57 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 116
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNW-FKN---LNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F G+ T H ++ FK ++ + ++ S+DW +GAVT VKDQG
Sbjct: 117 HEFRQLMNGFNY--TLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQG- 173
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CWAF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF YI+
Sbjct: 174 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 233
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSRQ-PVSV 225
+ +E YPY+ D C + + G GA RG+ + E+ + + V+ PVSV
Sbjct: 234 GGIDTEKSYPYEAIDD-SCHFNK----GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSV 288
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
AIDA+ F FY GV+ P + N HGV +VG+GT E + YWLVKN WGT W
Sbjct: 289 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTD---ESGEDYWLVKNSWGTTWG 345
Query: 282 EGGSMRIFRGVGGSGLCNIAANAAYPL 308
+ G +++ R C IA+ ++YPL
Sbjct: 346 DKGFIKMLR--NKENQCGIASASSYPL 370
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 168/326 (51%), Gaps = 45/326 (13%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTR 56
E+W +E + Y+D E+ R KIF +N H+ L +NK+ADL
Sbjct: 27 EEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADLLH 86
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKN---LNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
+F G+ S + FK ++ + ++ S+DW +GAVT VKDQG +
Sbjct: 87 HEFRQLMNGFNYTLHKQLRSTDDS-FKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQG-H 144
Query: 114 C--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQ 168
C CWAF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF YI+
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSRQ-PVSVA 226
+ +E YPY+ D C + + G GA RG+ + E+ + + V+ PV+VA
Sbjct: 205 GIDTEKSYPYEAIDD-SCHFNK----GAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVA 259
Query: 227 IDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
IDA+ F FY GV+ P + N HGV +VGYGT E YWLVKN WGT W +
Sbjct: 260 IDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGTD---ESGDDYWLVKNSWGTTWGD 316
Query: 283 GGSMRIFRGVGGSGLCNIAANAAYPL 308
G +++ R C IA+ ++YPL
Sbjct: 317 KGFIKMLRNKDNQ--CGIASASSYPL 340
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 112/327 (34%), Positives = 171/327 (52%), Gaps = 47/327 (14%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTR 56
E+W +E + Y+D+ E+ R KIF +N H+ + +NK+AD+
Sbjct: 27 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQLWATGAVSFKMAVNKYADMLH 86
Query: 57 EKFLASYTGYKPPPTDHPH-SNRSNWFKN---LNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F ++ G+ T H N FK ++ ++ +DW +GAVT VKDQG
Sbjct: 87 HEFYSTMNGFNY--TLHKQLRNADESFKGVTFISPEHVTLPKQVDWRTKGAVTDVKDQG- 143
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CWAF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF YI+
Sbjct: 144 HCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 203
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSR-QPVSV 225
+ +E YPY+ D C + + G GA RG+ + E+ + + V+ PV+V
Sbjct: 204 GGIDTEKSYPYEAIDD-SCHFNK----GSIGATDRGFVDIPQGNEKKMAEAVATIGPVAV 258
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
AIDA+ F FY GV+ P + N HGV +VG+GT E + YWLVKN WGT W
Sbjct: 259 AIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGTD---ESGEDYWLVKNSWGTTWG 315
Query: 282 EGGSMRIFRGVGGSGLCNIAANAAYPL 308
+ G +++ R C IA+ ++YPL
Sbjct: 316 DKGFIKMLR--NKENQCGIASASSYPL 340
>gi|356545071|ref|XP_003540969.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 317
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 112/312 (35%), Positives = 156/312 (50%), Gaps = 66/312 (21%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFL 60
+HE+WM + + YKD E+E RF+IFK+N + L +N+FADL E+F+
Sbjct: 21 RHEEWMSRYGKVYKDPWEREKRFRIFKENMNYIETSKNAAIKPYKLVINQFADLNNEEFI 80
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAF 118
A N FK + ++ AVTPVKDQG +C CWAF
Sbjct: 81 AP----------------QNIFKGMIICRLL---------SRAVTPVKDQG-HCGFCWAF 114
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYI-----RQYQRL 170
VA+ EG+ + G+L++ S+ +LVDC T GC + +++AF ++ L
Sbjct: 115 YDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCEGDLMDDAFFMAVTLSNSSFKIL 174
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA- 229
S C G+ C+ + I G + V E+ LQ VV+ QPVS+AIDA
Sbjct: 175 ESRCQLGVDGK----CN--ANEEVNPATTITGXEDVPANNEKALQKVVANQPVSIAIDAC 228
Query: 230 -TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
+ F FY GVFTG CG +HGVTIVGYG + +G Q YWLVKN W T W+
Sbjct: 229 DSDFQFYKRGVFTGSCGTELDHGVTIVGYGVS--HDGTQ-YWLVKNSWETEWNSN----- 280
Query: 289 FRGVGGSGLCNI 300
R +G L N+
Sbjct: 281 -RAIGVGVLENV 291
>gi|326495544|dbj|BAJ85868.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 123/336 (36%), Positives = 168/336 (50%), Gaps = 47/336 (13%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFL--------RL------NKFADLTR 56
+ + +WM RTY AEK RF+ +++N + + RL N+F DLT
Sbjct: 41 MLGRFHRWMSSHRRTYPSAAEKLRRFEAYRRNVDLIDASNRDAERLGYELGENEFTDLTN 100
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSK----------MSFYD---SIDWNERGA 103
E+F+ Y G + + + + SSK M+ D DW E GA
Sbjct: 101 EEFMTRYVGGAGAGGGLITTLAGDVVEGVVSSKNTVEGDGNLTMTTSDPPRQFDWREHGA 160
Query: 104 VTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLEN 159
VTP K QG+ CCWAF A ATVE LNKI G+LV S +LVDCST + C + ++
Sbjct: 161 VTPAKQQGACGCCWAFAAAATVESLNKINGGELVDLSVQELVDCSTGVFSSPCGYGWPKS 220
Query: 160 AFEYIRQYQRLASECVYPY---QGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT-EEGLQ 215
A ++I+ L +E YPY +GR + + A+ + G I G Q VQP + E+ L
Sbjct: 221 ALQWIKSKGGLLTEAEYPYVAKRGRCEVH------DAARRIGKITGVQDVQPGSNEDALA 274
Query: 216 DVVSRQPVSVAID--ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVK 273
V R PV+V ID + Y GV+ GPC + NH VT+VGYG T E YW+ K
Sbjct: 275 LAVLRTPVTVQIDGSGSVLQNYKSGVYKGPCTTSQNHVVTVVGYGVTGAGE---EYWIAK 331
Query: 274 NRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
N WG W + G + RG G GLC +A AYP+
Sbjct: 332 NSWGQTWGQNGFFFMRRGADGPRGLCGMAMYGAYPV 367
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 160 bits (406), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 116/327 (35%), Positives = 171/327 (52%), Gaps = 47/327 (14%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W +E + Y+D+ E+ R KIF K N F L +NK+ADL
Sbjct: 61 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 120
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNW-FKN---LNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F G+ T H ++ FK ++ + ++ S+DW +GAVT VKDQG
Sbjct: 121 HEFRQLMNGFNY--TLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQG- 177
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CWAF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF YI+
Sbjct: 178 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 237
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSRQ-PVSV 225
+ +E YPY+ D C + + G GA RG+ + E+ + + V+ PVSV
Sbjct: 238 GGIDTEKSYPYEAIDD-SCHFNK----GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSV 292
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
AIDA+ F FY GV+ P + N HGV +VG+GT E + YWLVKN WGT W
Sbjct: 293 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTD---ESGEDYWLVKNSWGTTWG 349
Query: 282 EGGSMRIFRGVGGSGLCNIAANAAYPL 308
+ G +++ R C IA+ ++YPL
Sbjct: 350 DKGFIKMLR--NKENQCGIASASSYPL 374
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 112/313 (35%), Positives = 165/313 (52%), Gaps = 40/313 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
+ WMV+ ++Y + E R+ IF+ N +F L LN ADLT +++ Y
Sbjct: 33 QNWMVKHQKSYTND-EFGSRYTIFQDNMDFVTKWNQKGSDTILGLNSMADLTNQEYQRIY 91
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAV 121
G K T N +++ + S +DW GAVT VK+QG C C++F+
Sbjct: 92 LGTK---TTVKKPNLIIGVTDVSKAPAS----VDWRANGAVTAVKNQGQ-CGGCYSFSTT 143
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPY 178
+VEG+++I + QLV+ S+ Q++DCS NGC + N+FEYI L +E YPY
Sbjct: 144 GSVEGIHEITSKQLVSLSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPY 203
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
+G C + +++ I GY+ V+ +E LQ V+ QPVSVAIDA+ F Y
Sbjct: 204 EGVVG-KCKFNKANIG---ATITGYKNVKSGSESDLQTAVAAQPVSVAIDASQNSFQLYS 259
Query: 237 GGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
GV+ P C +T +HGV VGYG+ + Q YW+VKN WG +W E G + + R
Sbjct: 260 SGVYYEPACSSTQLDHGVLAVGYGS----QSGQDYWIVKNSWGADWGEKGFILMARNKHN 315
Query: 295 SGLCNIAANAAYP 307
+ C IA A+YP
Sbjct: 316 N--CGIATMASYP 326
>gi|194352764|emb|CAQ00110.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 406
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 116/343 (33%), Positives = 168/343 (48%), Gaps = 62/343 (18%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------NKFADLTREKFLA 61
WM R+Y EK RF++++ N F+ F DLT E+F+
Sbjct: 66 WMTVHNRSYSTAGEKARRFEVYRSNMRFIEAVNAEAATSGLTYELGEGPFTDLTNEEFME 125
Query: 62 SYTG-------------YKPPPTDHPHS-------NRSNWFKNLNSSKMSFYDSIDWNER 101
YTG + T H S + + N ++S + SIDW +R
Sbjct: 126 LYTGQILEDDQSEDGDDDEQIITTHAGSIDGLGTHKGATVYANFSASAPT---SIDWRKR 182
Query: 102 GAVTPVKDQ---GSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFL 157
G VTPVK+Q GS CWAF VAT+EG++KI+ G LV+ S+ QL+DC L NGC +
Sbjct: 183 GVVTPVKNQKQCGS--CWAFPTVATIEGIHKIKRGTLVSLSEQQLIDCDYLDNGCKGGLV 240
Query: 158 ENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDV 217
AF++I++ + S Y Y+ + C R A+ I G++ V+ +E L +
Sbjct: 241 TRAFQWIKKNGGITSTSSYKYKAVRG-RCLRNRKPAA----KIVGFRKVKSNSEVSLMNA 295
Query: 218 VSRQPVSV--AIDATWFNFYHGGVFTGPCGNTP-NHGVTIVGYGTTTE-----AEGQQP- 268
V+ QPV+V + ++ F+ Y GG++ GPC T NH VT+VGYG + P
Sbjct: 296 VANQPVAVSISSHSSHFHHYKGGIYNGPCSTTKLNHAVTVVGYGQQQQNGADSVHASAPG 355
Query: 269 --YWLVKNRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
YW+VKN WGT W + G + + RG SG C IA +PL
Sbjct: 356 AKYWIVKNSWGTTWGDKGYILMKRGTKHSSGQCGIATRPVFPL 398
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 160 bits (406), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 116/327 (35%), Positives = 171/327 (52%), Gaps = 47/327 (14%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W +E + Y+D+ E+ R KIF K N F L +NK+ADL
Sbjct: 27 EEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 86
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNW-FKN---LNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F G+ T H ++ FK ++ + ++ S+DW +GAVT VKDQG
Sbjct: 87 HEFRQLMNGFNY--TLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQG- 143
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CWAF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF YI+
Sbjct: 144 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 203
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSRQ-PVSV 225
+ +E YPY+ D C + + G GA RG+ + E+ + + V+ PVSV
Sbjct: 204 GGIDTEKSYPYEAIDD-SCHFNK----GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSV 258
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
AIDA+ F FY GV+ P + N HGV +VG+GT E + YWLVKN WGT W
Sbjct: 259 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTD---ESGEDYWLVKNSWGTTWG 315
Query: 282 EGGSMRIFRGVGGSGLCNIAANAAYPL 308
+ G +++ R C IA+ ++YPL
Sbjct: 316 DKGFIKMLR--NKENQCGIASASSYPL 340
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 160 bits (405), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 107/318 (33%), Positives = 164/318 (51%), Gaps = 39/318 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + ++Y+ E+ +RFKIF +N L +N+F DL +F
Sbjct: 28 EAFKTTHKKSYESHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ GY+ T S S + N + S ++DW ++GAVTPVKDQG CWAF
Sbjct: 88 AKIFNGYRGQRT----SRGSTFMPPANVNDSSLPSTVDWRKKGAVTPVKDQGQCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+A ++EG + ++ G+LV+ S+ LVDCS NGC ++NAF+YI+ + +E
Sbjct: 144 SATGSLEGQHFLKDGELVSLSEQNLVDCSQSFGNNGCEGGLMDNAFKYIKANDGIDAEES 203
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDA--TWF 232
YPY+ D C + + G+ ++ +E+ L+ V+ P+SVAIDA + F
Sbjct: 204 YPYEAMDD-KCRFKKEDVG---ATDTGFVDIEGGSEDDLKKAVATVGPISVAIDAGHSSF 259
Query: 233 NFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
Y GV+ P C + +HGV VGYG +G++ YWLVKN WG +W + G + + R
Sbjct: 260 QLYSEGVYDEPECSSEELDHGVLAVGYGVK---DGKK-YWLVKNSWGGSWGDNGYILMSR 315
Query: 291 GVGGSGLCNIAANAAYPL 308
C IA+ A+YPL
Sbjct: 316 DKNNQ--CGIASAASYPL 331
>gi|300122868|emb|CBK23875.2| unnamed protein product [Blastocystis hominis]
Length = 316
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 110/296 (37%), Positives = 156/296 (52%), Gaps = 40/296 (13%)
Query: 30 AEKEMRFKIFKKN-----------HEF-LRLNKFADLTREKFLAS-YTGYKPPPTDHPHS 76
+E+E R K+ N H F L + FAD+T +F S G P +H +
Sbjct: 41 SEREYRKKVLAYNMDWIEKFNSDEHSFTLGMTPFADMTNTEFATSKLCGCMKKPLNHKQA 100
Query: 77 NRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQL 135
+ LN+ + +SIDW E+GAVTPVK+QGS CWAF+A +EG N + TG+L
Sbjct: 101 ------RVLNNMAV---ESIDWREKGAVTPVKNQGSCGSCWAFSATGALEGGNFVATGKL 151
Query: 136 VTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSAS 194
V+ S+ QLVDC T + GC F++ AFEY+ + + L +E YPY + + D +S
Sbjct: 152 VSLSEQQLVDCDTEDAGCGGGFMDTAFEYVMK-KGLCTEEDYPYHAKDEDCKDDQCTSVI 210
Query: 195 GKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YHGGVF-TGPCGNTPNHG 251
+I GY+ V L+ +++ PVSVAI A F F Y GGV + CG + NHG
Sbjct: 211 ----SITGYEDVPANDGVALKQALTKAPVSVAIQADSFVFQMYTGGVLDSDMCGTSLNHG 266
Query: 252 VTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
V VGY + Y +VKN WG +W + G ++I G G+C I A+YP
Sbjct: 267 VLAVGYA--------KEYIIVKNSWGASWGDKGYVKIAHRDQGEGICGINMAASYP 314
>gi|224106333|ref|XP_002333699.1| predicted protein [Populus trichocarpa]
gi|222837985|gb|EEE76350.1| predicted protein [Populus trichocarpa]
Length = 197
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 89/198 (44%), Positives = 116/198 (58%), Gaps = 10/198 (5%)
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLAS 172
CCWAF+AVA +EG+ K++TG L++ SK QLV+ N GC ++ AF+YI + + L S
Sbjct: 4 CCWAFSAVAAIEGIIKLKTGNLISLSKQQLVNRDVGNKGCHGGLMDTAFQYIIRNEGLTS 63
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW- 231
E YPYQG D C A+ I G + E L V++QPVSV +D
Sbjct: 64 EDNYPYQGV-DGTCS--SEKAASIAAEITGDENAPKNNENALLQAVAKQPVSVGVDGGGN 120
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F FY GVF G CG NH VT +GYGT ++ YWLVKN WGT+W E G R+ R
Sbjct: 121 DFQFYKSGVFNGDCGTQQNHAVTAIGYGTDSDG---TDYWLVKNSWGTSWGESGYTRMQR 177
Query: 291 GVGGS-GLCNIAANAAYP 307
G+G S GLC +A +A+YP
Sbjct: 178 GIGASEGLCGVAMDASYP 195
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 162/315 (51%), Gaps = 35/315 (11%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFL 60
QW E + Y E+ R I++KN + L +N+FADL E+F+
Sbjct: 30 QWKNEHGKRYLSDEEEASRKLIWEKNLDIVIKHNLKYDLGHFTYALGMNQFADLQNEEFV 89
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A TG++ T + S + + N K+ ++DW +G VTPVKDQG CWAF+
Sbjct: 90 AMMTGFRVNGTSKA-AKGSTFLPSNNVDKLP--KTVDWRTKGYVTPVKDQGQCGSCWAFS 146
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPY 178
A ++EG +TG+LV+ S+ LVDCS N GC F++ AF+YI + +E Y Y
Sbjct: 147 ATGSLEGQQFKKTGKLVSLSEQNLVDCSYRNYGCHGGFMDRAFQYIIDAGGIDTEATYSY 206
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT--WFNFY 235
+ D C + +++ + GY V +E+ LQ V+ P+SVAIDA+ +F FY
Sbjct: 207 RAV-DGNCHFKKANVG---ATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHKFFKFY 262
Query: 236 HGGVFTGP-CGNT-PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
GV+ P C T H V +VGYGTT++ YW+VKN W W G + + R
Sbjct: 263 KSGVYNEPGCSTTRLGHAVLVVGYGTTSDG---TDYWIVKNSWAKTWGMNGYLWMSRNKD 319
Query: 294 GSGLCNIAANAAYPL 308
C IA+ A+YP+
Sbjct: 320 NQ--CGIASEASYPM 332
>gi|209882566|ref|XP_002142719.1| papain family cysteine protease [Cryptosporidium muris RN66]
gi|209558325|gb|EEA08370.1| papain family cysteine protease, putative [Cryptosporidium muris
RN66]
Length = 400
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 104/316 (32%), Positives = 157/316 (49%), Gaps = 28/316 (8%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E + ++ + Y + E++ R+ IF+KN F L +N++ DLT E+F+ ++
Sbjct: 87 EDFKQKYKKEYSNLTEEKYRYSIFRKNMNFIKMSNNQGFSYVLEMNEYGDLTHEEFMHNF 146
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAV 121
GY P + S+ N + S ++W + G V PV+DQ YC CWAF+ V
Sbjct: 147 MGYHPQHKNKRFSDSHNILSSNKVENTSPPRFVNWVDAGCVNPVRDQ-RYCGSCWAFSVV 205
Query: 122 ATVE-GLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASECVYP 177
++E + + +LV S+ Q VDC+ N GC L+ AF+Y+ ++Q L +E YP
Sbjct: 206 TSLESAVCAQKNEKLVKLSEQQFVDCTRNNGNFGCDGGSLDLAFQYVMEHQYLCTEEEYP 265
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAI--DATWFNF 234
Y + C + +Y + Y+ V P L+ V++ P+SVAI D F F
Sbjct: 266 YIANEK-SCKFSNCKNPIRY-ILDSYRNVVPNNINALKVAVAKYGPISVAIQADQAPFQF 323
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMR--IFRGV 292
Y GVF PCG NH V +VGY + + YWLV+N WG NW E G ++ I G
Sbjct: 324 YKKGVFDAPCGTDVNHAVVLVGYD--LDIYSGKEYWLVRNSWGENWGENGYIKLAIQAGK 381
Query: 293 GGSGLCNIAANAAYPL 308
G G C I YP+
Sbjct: 382 KGKGTCGILMEPIYPV 397
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 110/321 (34%), Positives = 162/321 (50%), Gaps = 52/321 (16%)
Query: 3 RTSHKTGNIAAKHEQWMVEFARTYKDQ-AEKEMRFKIFKKNHEF------------LRLN 49
R++ + G I + WM + +TY + +KE RF+ FK N F L L
Sbjct: 36 RSNEEVGFI---FQTWMSKHGKTYTNALGDKEQRFQNFKDNLRFIDQHNAKNLSYRLGLT 92
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKD 109
+FADLT +++ ++G P ++ + L ++ S+DW ++GAV+ +KD
Sbjct: 93 QFADLTVQEYQDLFSGR--PIQKQKALRVTHRYVPLAEDQLP--QSVDWRQKGAVSEIKD 148
Query: 110 QGSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ 168
QG TVE +NKI TG+L++ S+ +LVDCS N GC +++AF+++
Sbjct: 149 QGR---------CTVESINKIVTGELISLSEQELVDCSIDNHGCNGGLMDSAFQFLINNN 199
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAID 228
L + YPYQ Q Y C+ + S K I GY+ V E LQ V+ QP
Sbjct: 200 GLEYQSDYPYQAVQGY-CNH-NQNTSKKVIKIDGYEDVPANNENSLQKAVAHQP------ 251
Query: 229 ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
G++TGPCG +H V IVGYGT E Q YW+V+N WGT W E G +I
Sbjct: 252 ---------GIYTGPCGTDLDHAVVIVGYGT----ENGQDYWIVRNSWGTVWGEAGYAKI 298
Query: 289 FRGV-GGSGLCNIAANAAYPL 308
R +G+C IA A+YP+
Sbjct: 299 ARNFENPTGVCGIAMVASYPI 319
>gi|340368358|ref|XP_003382719.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 329
Score = 160 bits (405), Expect = 7e-37, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 162/315 (51%), Gaps = 36/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR--------------LNKFADLTREKFLA 61
+ W V++ + Y+ + + R I++ N +F+ +N+FADL +F
Sbjct: 24 QDWKVKYNKAYETKETELARQVIWESNKKFVENHNANSDKFGFTVAMNEFADLGAGEFAN 83
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
Y G P P P N +N FK S + DS+DW + GAVT VK+QG CWAF+A
Sbjct: 84 IYNGIIPHP---PSYNNTNTFKRTVRSTFALADSVDWRKSGAVTGVKNQGKCGACWAFSA 140
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYP 177
+EG + I TG L++ S+ QL+DCS+ NGC ++NAF Y+ +E YP
Sbjct: 141 TGALEGQHFINTGTLISLSEQQLMDCSSSFGNNGCKGGLMDNAFRYLETVAGDMTEEAYP 200
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDA--TWFNF 234
Y C + S A K Y+ + E+ LQ+ V+ P+SV+I++ + F
Sbjct: 201 YLAEVG-TCRYNSSEAKVKNTV---YKDIPEGDEDALQEAVATIGPISVSINSEHSSFQL 256
Query: 235 YHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ P C ++ +HGV ++GYGT+ + YWLVKN WGTNW G + + R
Sbjct: 257 YDQGVYYEPTCSSSKLDHGVLVIGYGTSDNND----YWLVKNSWGTNWGMDGYIMMSRNK 312
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A+YP
Sbjct: 313 ENN--CGIATRASYP 325
>gi|307103885|gb|EFN52142.1| hypothetical protein CHLNCDRAFT_139276 [Chlorella variabilis]
Length = 388
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 109/311 (35%), Positives = 156/311 (50%), Gaps = 37/311 (11%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHE------------FLRLNKFADLTREKFLASYT 64
QW + R+YK +E R +F +N + L LN+FADLT E+F A++
Sbjct: 48 QWQMTHGRSYKSASEARKRQAVFVENAKHVAEQNARNSGLVLALNQFADLTLEEFAATHL 107
Query: 65 GYKPPPTD-HPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAV 121
GY P + H+ S + + N ++DW ++ AVTPVK+Q + C CWAF+A
Sbjct: 108 GYNPSLREGKEHTTTSFQYADAND----LPSTVDWRKKNAVTPVKNQ-AMCGSCWAFSAT 162
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VEG+N IRTG+LV+ S+ QLVDC + GC ++ AF+YI + + SE Y Y
Sbjct: 163 GAVEGINAIRTGKLVSLSEQQLVDCDSEKDLGCGGGLMDFAFDYITKNGGIDSEDDYSYW 222
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHGGV 239
G C R A I G++ V E L+ ++ QPVS+ YH GV
Sbjct: 223 GY-GLICQ-RRKEADRHVVTIDGFEDVPKNDGEALKKAIAHQPVSL---------YHSGV 271
Query: 240 F-TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI-FRGVGGSGL 297
C NHGV VGY + +G P++++KN WG W E G R+ + SG
Sbjct: 272 VGDDACCQDLNHGVLAVGYDDGS--KGGTPHYVIKNSWGEGWGEQGFFRLAAKSSEASGA 329
Query: 298 CNIAANAAYPL 308
C + A+YPL
Sbjct: 330 CGVYKAASYPL 340
>gi|62955291|ref|NP_001017661.1| cathepsin S, b.2 precursor [Danio rerio]
gi|62204682|gb|AAH93339.1| Cathepsin S, b.2 [Danio rerio]
gi|182891354|gb|AAI64362.1| Ctssb.2 protein [Danio rerio]
Length = 330
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 112/329 (34%), Positives = 165/329 (50%), Gaps = 41/329 (12%)
Query: 5 SHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRL 48
+H N+ E W + + Y + E+ R +++++N E L +
Sbjct: 17 AHFNKNLDQHWELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAI 76
Query: 49 NKFADLTREKFLASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPV 107
N AD+T E+ L + + PP P + + ++SS D++DW ++G VT V
Sbjct: 77 NHMADMTTEEILQTLAVTRVPPGFKRPTA------EYVSSSFAVVPDTLDWRDKGYVTSV 130
Query: 108 KDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEY 163
K+QG+ CWAF++V +EG TG+LV S LVDCS+ GC ++ AF+Y
Sbjct: 131 KNQGACGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQY 190
Query: 164 IRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QP 222
+ + SE YPYQG Q R S + Y++V E+ L++ ++ P
Sbjct: 191 VIDNGGIDSESSYPYQGTQGS----CRYDPSQRAANCTSYKFVSQGDEQALKEALANIGP 246
Query: 223 VSVAIDAT--WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTN 279
VSVAIDAT F FY GV+ P C NHGV VGYGT + Q YWLVKN WG
Sbjct: 247 VSVAIDATRPQFIFYRSGVYDDPSCTQKVNHGVLAVGYGTLS----GQDYWLVKNSWGAG 302
Query: 280 WDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
+ +GG +RI R + +C IA+ A YP+
Sbjct: 303 FGDGGYIRIAR--NKNNMCGIASEACYPI 329
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 160 bits (404), Expect = 8e-37, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 167/326 (51%), Gaps = 45/326 (13%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W +E + Y+D E+ R KIF K N F L +NK+ADL
Sbjct: 27 EEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 86
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKN---LNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
+F G+ + + FK ++ + ++ S+DW +GAVT VKDQG +
Sbjct: 87 HEFRQLMNGFNYTLHKQLRATDDS-FKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQG-H 144
Query: 114 C--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQ 168
C CWAF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF YI+
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 169 RLASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSRQ-PVSVA 226
+ +E YPY+ D C + + G GA RG+ + E+ + + V+ PVSVA
Sbjct: 205 GIDTEKSYPYEAIDD-SCHFNK----GTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVA 259
Query: 227 IDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
IDA+ F FY GV+ P + N HGV +VG+GT E YWLVKN WGT W +
Sbjct: 260 IDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTD---ESGDDYWLVKNSWGTTWGD 316
Query: 283 GGSMRIFRGVGGSGLCNIAANAAYPL 308
G +++ R C IA+ ++YPL
Sbjct: 317 KGFIKMLRNKDNQ--CGIASASSYPL 340
>gi|358347416|ref|XP_003637753.1| Cysteine proteinase [Medicago truncatula]
gi|355503688|gb|AES84891.1| Cysteine proteinase [Medicago truncatula]
Length = 323
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 119/340 (35%), Positives = 172/340 (50%), Gaps = 75/340 (22%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------- 47
MSRT ++ A HEQWM +F RTY D EKE RFKIF KN E++
Sbjct: 20 MSRTLLESSIAAKTHEQWMKDFGRTYADDVEKEKRFKIFAKNLEYIENFNRAGNETYELG 79
Query: 48 LNKFADLTREKFLASYT--GYKPPPTDHPHSNRSNWF--------KNLNSSKMSFYDSID 97
LN+F DLT+++F + YT K ++ + F +L + +SID
Sbjct: 80 LNQFLDLTKKEFTSKYTCANLKGKLESSMVASVAALFNVSKISTNNSLKGKRKPIPESID 139
Query: 98 WNERGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNF 156
W E GAVT VK QG+ CWAF +A VEG+ +I+ +LV+ S A
Sbjct: 140 WREGGAVTSVKRQGACASCWAFATLAAVEGIVQIKNRELVSLS-------------ASGI 186
Query: 157 LENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQD 216
++ A++YI++ +ASE YPY ++ GK +IR + EE L +
Sbjct: 187 VKFAYDYIKK-NEIASEADYPYTEKE------------GKCLSIR-------SGEENLLE 226
Query: 217 VVSRQPVSVAIDATWFNF--YHGGVF-TGPCGNTPN----HGVTIVGYGTTTEAEGQQPY 269
VV++QPV+V I AT NF Y GG+F +GPCG + H VT++G+ Y
Sbjct: 227 VVAQQPVTVLI-ATNENFVNYKGGIFGSGPCGPIESLQLTHAVTVIGF--------TNEY 277
Query: 270 WLVKNRWGTNWDEGGSMRIFR-GVGGSGLCNIAANAA-YP 307
WL+KN +G +W E G M++ R G +C ++ A+ YP
Sbjct: 278 WLIKNSYGESWGEKGYMKLKRKGDSHHTVCGLSMTASIYP 317
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 103/318 (32%), Positives = 168/318 (52%), Gaps = 38/318 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE----------------FLRLNKFADLTREKF 59
E W + +TY E+++R KI+ +N ++++N + DL +F
Sbjct: 31 ESWKLMHGKTYSSSIEEKLRLKIYMENSLKISRHNSEALNGIHPYYMKMNHYGDLLHHEF 90
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+A GY+ + S + N N + +DW E GAVTPVK+QG CW+F
Sbjct: 91 VAMVNGYQY--ANKTASLGGTYIPNKN---IQLPTHVDWREEGAVTPVKNQGQCGSCWSF 145
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+A +EG + +TG+L++ S+ LVDCS NGC ++ AF YIR + + +E
Sbjct: 146 SATGALEGQDFRKTGKLISLSEQNLVDCSRKFGNNGCEGGLMDFAFTYIRDNKGIDTEAS 205
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS-RQPVSVAIDATW--F 232
YPY+G D +C + + K G+ G+ ++ +E+ L+ V+ P+SVAIDA+ F
Sbjct: 206 YPYEGI-DGHCHY---NPKNKGGSDIGFVDIKKGSEKDLKKAVAGVGPISVAIDASHMSF 261
Query: 233 NFYHGGVFT-GPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GV+ C + +HGV +VG+G T++ + YWLVKN W W + G +++ R
Sbjct: 262 QFYSHGVYVESKCSSEELDHGVLVVGFG--TDSVSGEDYWLVKNSWSEKWGDQGYIKMAR 319
Query: 291 GVGGSGLCNIAANAAYPL 308
+C IA++A+YP+
Sbjct: 320 --NKENMCGIASSASYPV 335
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 105/310 (33%), Positives = 159/310 (51%), Gaps = 38/310 (12%)
Query: 24 RTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLASYTGYK 67
+TYK E+ +RFKIF +N F L +N+FADL +F+ GY+
Sbjct: 36 KTYKSNVEELLRFKIFTENSLFIAKHNVKYAKGLVSYKLGINQFADLLPHEFVKMMNGYQ 95
Query: 68 PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEG 126
S + N + S ++DW ++GAVTPVKDQG CWAF++ ++EG
Sbjct: 96 GK---RLAGRGSTYLPPANLNDSSLPKTVDWRKKGAVTPVKDQGQCGSCWAFSSTGSLEG 152
Query: 127 LNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
+ ++TG+LV+ S+ LVDCS+ GC ++N+F YI+ + +E YPY+ +D
Sbjct: 153 QHFLKTGKLVSLSEQNLVDCSSAYGNQGCNGGLMDNSFNYIKANGGIDTEDSYPYEA-ED 211
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--FNFYHGGVF 240
C + + G+ ++ +E+ LQ V+ PVSVAIDA+ F Y GV+
Sbjct: 212 GDCRYKKEDVG---ATDTGFVDIKEGSEKDLQKAVATVGPVSVAIDASQQSFQLYSEGVY 268
Query: 241 TGP--CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLC 298
P + +HGV VGYG + + YWLVKN W W + G + + R C
Sbjct: 269 DEPNCSSESLDHGVLAVGYGV----KNGKKYWLVKNSWAETWGQDGYILMSRDKNNQ--C 322
Query: 299 NIAANAAYPL 308
IA++A+YPL
Sbjct: 323 GIASSASYPL 332
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 163/322 (50%), Gaps = 47/322 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + +TY+ E+ +RFKIF +N L +N+F DL +F
Sbjct: 28 EAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87
Query: 60 LASYTGYKPPPTDHPHSNR----SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+ G+ H R S + N + S ++DW ++GAVTPVKDQG
Sbjct: 88 ARIFNGH--------HGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGS 139
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+A ++EG + ++ G+LV+ S+ LVDCS NGC +E+AF+YI+ +
Sbjct: 140 CWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGID 199
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDAT 230
+E YPY+ D C + + GY ++ +E+ L+ V+ P+SVAIDA+
Sbjct: 200 TEKSYPYEAV-DGECRFKKEDVG---ATDTGYVEIKAGSEDDLKKAVATVGPISVAIDAS 255
Query: 231 W--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GV+ P C + +HGV +VGYG +G + YWLVKN W +W + G +
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV----KGGKKYWLVKNSWAESWGDQGYI 311
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
+ R + C IA+ A+YPL
Sbjct: 312 LMSR--DNNNQCGIASQASYPL 331
>gi|307200028|gb|EFN80374.1| Putative cysteine proteinase CG12163 [Harpegnathos saltator]
Length = 1032
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 103/312 (33%), Positives = 160/312 (51%), Gaps = 31/312 (9%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL-------------NKFADLTREKFLAS 62
E ++ + RTY + E+ +R IF++N +RL N+FAD++ E+F A
Sbjct: 728 ENFVNTYNRTYATEEERNLRLSIFRENLGIIRLLRKNEQGTGQYGVNQFADVSTEEFHAF 787
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTA 120
Y G +P + + + + +S DW ++GAVTPVK+QG C CWAF+
Sbjct: 788 YLGLRP----DLRTENNIPLRQAEIPDIELPNSFDWRQKGAVTPVKNQG-MCGSCWAFSV 842
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VEG I+ +L++ S+ +LVDC L+ GC +NA+ I + L E YPY+
Sbjct: 843 TGNVEGQYAIKHNKLLSLSEQELVDCDDLDEGCNGGLPDNAYRAIEKLGGLELESDYPYE 902
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHGGV 239
++ C + ++ A + G+ + + Q +V+ P+S+ I+A FY GGV
Sbjct: 903 A-ENERCHFKKNMAKVQVGSAVN---ITSNETQIAQWLVANGPISIGINANAMQFYMGGV 958
Query: 240 ---FTGPCG-NTPNHGVTIVGYGTTTEA--EGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
F C +HGV IVGYGT+ + PYW+VKN WG W E G R++RG G
Sbjct: 959 SHPFKFLCNPKNLDHGVLIVGYGTSNYPLFHKKLPYWIVKNSWGDRWGEQGYYRVYRGDG 1018
Query: 294 GSGLCNIAANAA 305
GL +A++A
Sbjct: 1019 TCGLNTMASSAV 1030
>gi|344275468|ref|XP_003409534.1| PREDICTED: cathepsin K-like [Loxodonta africana]
Length = 329
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 109/315 (34%), Positives = 165/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E W + + Y + ++ R I++KN ++ L +N D+T E+
Sbjct: 27 ELWKKTYGKQYNSKVDEISRRLIWEKNLKYISIHNLEASLGAHTYELAMNHLGDMTSEEV 86
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+D +R+N + + DSID+ ++G VTPVK+QG CWAF
Sbjct: 87 VQKMTGLKVPPSD----SRNNDTLYIPDWEGRAPDSIDYRKKGYVTPVKNQGQCGSCWAF 142
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 143 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 202
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R PVSVAIDA T F F
Sbjct: 203 YVG-QDESCMY---NPTGKAAKCRGYREIPVGNEKALKRAVARVGPVSVAIDASLTSFQF 258
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + +W++KN WG NW G + + R
Sbjct: 259 YSKGVYYDESCNSDNLNHAVLAVGYGI----QKGNKHWIIKNSWGENWGNKGYILMARNK 314
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 315 NNA--CGIANLASFP 327
>gi|118140100|gb|ABK63481.1| cathepsin S [Channa argus]
Length = 335
Score = 160 bits (404), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 109/319 (34%), Positives = 160/319 (50%), Gaps = 45/319 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
+ W + Y+++ E R ++++KN +F L +N+ DLT+E+
Sbjct: 35 QMWKKTHNKMYQNEVEDAHRRELWEKNLKFISMHNLEASMGIHTYELGMNQMGDLTQEEI 94
Query: 60 LASYTGYKPPPTDH--PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
L +Y +PP H P + +S ++ ++DW + G VT VK+QGS CW
Sbjct: 95 LKTYATLRPPTDVHRTPFTRKSG---------VAAPGAMDWRDLGCVTSVKNQGSCGSCW 145
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
AF+AV +EG TG+LV S LVDCS +GC F+ NAF+Y+ + Q + SE
Sbjct: 146 AFSAVGALEGQLAKTTGKLVDLSPQNLVDCSGKYGNHGCDGGFMTNAFQYVIENQGIESE 205
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT-- 230
YPY G + C + ++ Y ++ EE L++ ++ P+SVAIDA+
Sbjct: 206 ASYPYIGLEQ-QCHYNPEESAAN---CSQYHFLPEKDEEALKEAIATIGPISVAIDASKP 261
Query: 231 WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GV+ P C NHGV VGYGT + Q WLVKN WGT + + G +R+
Sbjct: 262 TFTFYSSGVYDDPTCSEVINHGVLAVGYGT----QSTQDSWLVKNSWGTYFGDSGYIRMS 317
Query: 290 RGVGGSGLCNIAANAAYPL 308
R G C IA YPL
Sbjct: 318 RNKGNQ--CGIALYGCYPL 334
>gi|194352776|emb|CAQ00116.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 335
Score = 160 bits (404), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 111/319 (34%), Positives = 157/319 (49%), Gaps = 42/319 (13%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFL-- 60
+E+W F D EK MRF IFK+N F L LN FAD T +
Sbjct: 17 YERWCA-FNEVAHDPDEKSMRFSIFKQNVRFIHENNRGDTRFKLGLNIFADRTHAELPNV 75
Query: 61 ---ASYTGYKPPPTDH-PHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC-- 114
+ T + P D+ PH+ +N D +DW ++ AVT VK QG YC
Sbjct: 76 EADCTSTSHLPDDIDYMPHTAVTN---------GDLPDRVDWRDKNAVTSVKKQGDYCGS 126
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASE 173
CWAFTAV VEG+ I+TG+L S L+DC N GC + AF++I++ +A+E
Sbjct: 127 CWAFTAVGAVEGITAIKTGKLEDLSPQMLIDCDKDNRGCRCGMVWRAFDFIKK-NGIATE 185
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY G + + C + +S ++ + ++ V + E L V+ QPV+V I +
Sbjct: 186 RAYPYDGIE-HRC-YMKSDGLSRFASTERFRVVY-SNERALMAAVAVQPVTVDIGVDMYF 242
Query: 234 FYHG---GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
Y+ GV+TGPC T H V +VGY Q+ YW++KN WG W G M + R
Sbjct: 243 HYYSEDMGVYTGPCNKTTTHTVLVVGYDIDA---FQRKYWILKNSWGRKWGHEGYMYMAR 299
Query: 291 GVGG-SGLCNIAANAAYPL 308
GG GLC+I + P+
Sbjct: 300 DEGGPQGLCSILSFPLIPV 318
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 163/322 (50%), Gaps = 47/322 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + +TY+ E+ +RFKIF +N L +N+F DL +F
Sbjct: 28 EAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87
Query: 60 LASYTGYKPPPTDHPHSNR----SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+ G+ H R S + N + S +DW ++GAVTPVKDQG
Sbjct: 88 ARIFNGH--------HGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGS 139
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+A ++EG + ++ G+LV+ S+ LVDCS NGC +E+AF+YI++ +
Sbjct: 140 CWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKENDGID 199
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDAT 230
+E YPY+ D C + + GY ++ +E+ L+ V+ P+SVAIDA+
Sbjct: 200 TEKSYPYEAV-DGECRFKKEDVG---ATDTGYVEIKAGSEDDLKKAVATVGPISVAIDAS 255
Query: 231 W--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GV+ P C + +HGV +VGYG +G + YWLVKN W +W + G +
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV----KGGKKYWLVKNSWAESWGDQGYI 311
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
+ R + C IA+ A+YPL
Sbjct: 312 LMSR--DNNNQCGIASQASYPL 331
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 112/313 (35%), Positives = 158/313 (50%), Gaps = 42/313 (13%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
+ WMV+ ++Y + E R+ +F+ N + L LN ADLT E+F Y
Sbjct: 33 QNWMVKHQKSYTND-EFGSRYSVFQDNMDIVAKWNQKGSNTILGLNVMADLTNEEFKKLY 91
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAV 121
G K T + K S+DW GAVT VK+QG C C+AF+
Sbjct: 92 LGTKANVT---------YKKKTLVGVSGLPASVDWRANGAVTAVKNQGQ-CGGCYAFSTT 141
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPY 178
+VEG+++I + QLV S+ Q++DCS NGC + N+FEYI L +E YPY
Sbjct: 142 GSVEGIHEITSQQLVPLSEQQILDCSGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPY 201
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYH 236
G C + + + I GY+ V+ +E LQ V+ QPVSVAIDA+ F Y
Sbjct: 202 TGEVG-KCKFNKKNIG---ATITGYKNVESGSESDLQTAVAAQPVSVAIDASQSSFQLYA 257
Query: 237 GGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
GV+ P C +T +HGV VGYG+ + Q YW+VKN WG +W E G + + R
Sbjct: 258 SGVYYEPECSSTQLDHGVLAVGYGS----QSGQDYWIVKNSWGADWGENGFILMARNKDN 313
Query: 295 SGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 314 N--CGIATMASFP 324
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/326 (34%), Positives = 168/326 (51%), Gaps = 45/326 (13%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTR 56
EQW ++ + YK E++ R KIF +N H+ L++NK+AD+
Sbjct: 25 EQWGTFKLQHKKQYKSDTEEKFRMKIFMENSHKVAKXNKLYEMGLVSYKLKINKYADMLH 84
Query: 57 EKFLASYTGYK----PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F+ + G+ P + F + + + F +++DW E GAVT VKDQG
Sbjct: 85 HEFVHTVNGFNRTKNTPLLGTSEDEQGATF--IAPANVKFPENVDWREHGAVTXVKDQG- 141
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CW+F+A +EG + +T +LV+ S+ LVDCST +GC ++NAF+Y++
Sbjct: 142 HCGSCWSFSATGALEGQHFRKTNKLVSLSEQNLVDCSTKFGNDGCNGGLMDNAFKYVKYN 201
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVA 226
+ +E YPY D C + SG RG+ + EE L V+ PVSVA
Sbjct: 202 HGIDTEASYPYHA-DDEKCH-YNPKTSG--ATDRGFVDIPTGDEEKLMAAVATVGPVSVA 257
Query: 227 IDATW--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
IDA+ F Y GV+ P C + +HGV +VGYGT E Q YW+VKN WG +W E
Sbjct: 258 IDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYGTD---ENGQDYWIVKNSWGESWGE 314
Query: 283 GGSMRIFRGVGGSGLCNIAANAAYPL 308
G +++ R + C IA A+YPL
Sbjct: 315 QGYIKMARNRDNN--CGIATQASYPL 338
>gi|410904751|ref|XP_003965855.1| PREDICTED: cathepsin K-like [Takifugu rubripes]
Length = 331
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 153/315 (48%), Gaps = 37/315 (11%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKFL 60
QW + R Y Q E+E+R +++KN H + L +N D+T E+ L
Sbjct: 30 QWKLTHRREYATQGEEEIRRAVWEKNMNVIDAHNQEAALGMHSYELGMNHLGDMTSEEVL 89
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
TG P D + N L++S +D+ ++G VT VKDQG CWAF+
Sbjct: 90 EKMTGLLVPLND-----QRNVTMALSNSIERLPKHLDYRKKGIVTAVKDQGQCGSCWAFS 144
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPY 178
+ +EG+ +TG+LV S LVDC N GC ++ NAF Y+ + + SE YPY
Sbjct: 145 SAGALEGMQAKKTGKLVDLSPQNLVDCVKENDGCGGGYMTNAFRYVATNRGIDSEASYPY 204
Query: 179 QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--FNFY 235
Q+ C + SGK Y+ V E+ L + + P++V IDAT F Y
Sbjct: 205 VA-QEQSCQY---KESGKAAECSSYEEVPQGNEKQLAYALFKHGPIAVGIDATLSTFQLY 260
Query: 236 HGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
GV+ P N NH V +VGYG + Q YW+VKN W TNW GG + + R G
Sbjct: 261 SKGVYYDPNCNPENINHAVLLVGYGVNSRG---QHYWIVKNSWSTNWGNGGYVLMARNRG 317
Query: 294 GSGLCNIAANAAYPL 308
LC IA A+YPL
Sbjct: 318 --NLCGIANLASYPL 330
>gi|33333700|gb|AAQ11968.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 163/324 (50%), Gaps = 54/324 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFADLTREKF 59
+Q+ ++ +TY+ E++ RF +F+KN + ++ +FAD+T E+F
Sbjct: 24 QQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEF 83
Query: 60 L--ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
L G P++ H F N M D++DW E GAVTP KDQ + CW
Sbjct: 84 LDLLKLQGVPALPSNAVH------FDNSEDIDMEEKDAVDWREEGAVTPAKDQANCGSCW 137
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLAS 172
AF+AV +EG + G LV+ S +LVDC+T NGC + AF+++ Q + + +
Sbjct: 138 AFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFV-QDEGIQT 196
Query: 173 ECVYPYQGRQDYYCDWWRSSA--SGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDA 229
E YPY+GR RSS SG+Y + YV P E+ + + V ++ PV+VAI+A
Sbjct: 197 EESYPYEGR--------RSSCKKSGEY-VTKVKTYVFPLDEQEMARTVAAKGPVAVAIEA 247
Query: 230 TWFNFYHGGVFTGPC-----GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+ +FY G+ C NHGV +VGYG+ E YW+VKN WG +W E G
Sbjct: 248 SQLSFYDKGIVDERCRCSNKREDLNHGVLVVGYGS----ENGVDYWIVKNSWGADWGEKG 303
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
R+ + V C I YP+
Sbjct: 304 YFRLKKDVKA---CGIGYYNTYPI 324
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 108/326 (33%), Positives = 169/326 (51%), Gaps = 50/326 (15%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTR 56
A+ + + V + Y+ + R KIF +N L++N+F D+
Sbjct: 30 AEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLIARHNIKHAKGETTYKLKMNQFGDMLH 89
Query: 57 EKFLASYTGYKPPPTDHPHSNR----SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F+++ G SNR S W + +S S+DW E+GAVTPVK+QG
Sbjct: 90 HEFVSTMNGL-------LRSNRTYFGSTW---IEPESVSLPKSVDWREKGAVTPVKNQG- 138
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CW+F+ +EG +TG+LV+ S+ L+DCST NGC ++NAF YI++
Sbjct: 139 HCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDNAFTYIKEN 198
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVA 226
+ +E YPY+G+Q C + + ++G+ G+ + E L + + + PVSVA
Sbjct: 199 HGIDTEESYPYEGKQG-KCRYHKEDSAGR---DTGFVDIPSGNERALAKALATIGPVSVA 254
Query: 227 IDATW--FNFYHGGVFTGP-C-GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
IDA+ F FYH GV+ P C ++ +HGV VGYGTT + Q Y+++KN WG W +
Sbjct: 255 IDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDG---QDYYIIKNSWGERWGQ 311
Query: 283 GGSMRIFRGVGGSGLCNIAANAAYPL 308
G + + R C +A A+YPL
Sbjct: 312 EGYVLMARNSKNE--CGVATQASYPL 335
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 104/319 (32%), Positives = 166/319 (52%), Gaps = 39/319 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHE----------------FLRLNKFADLTREKF 59
E + + + YK E+ +R IF+ N++ F+ +N+F DL ++
Sbjct: 21 EAFKLTHGKQYKSPDEENVRRAIFRDNNQMIKEHNQEAAMGRRSYFMGMNQFGDLAHSEY 80
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWA 117
L G P + + N F++ + + D++DW ++GAVTP+KDQG +C CWA
Sbjct: 81 LELVVGPGLLPLNLSTPSE-NVFES--TPGLQVDDTVDWRQKGAVTPIKDQG-HCGSCWA 136
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASEC 174
F+ ++EG + ++TG+LV+ S+ L+DCS GC ++ AF YI+ + +E
Sbjct: 137 FSTTGSLEGQHFMKTGKLVSLSEQNLLDCSRRFGNKGCEGGLMDQAFRYIKSNGGIDTEE 196
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDATW-- 231
YPY + + CD +++S SG + Y ++ E L Q V + PVSVAIDA+
Sbjct: 197 CYPYMAKDEKVCD-YKTSCSG--ATLSSYTDIKAMDEMALMQAVGTVGPVSVAIDASHKS 253
Query: 232 FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
FY G++ P C T +HGV VGYG+ + YWLVKN WG+ W + G +++
Sbjct: 254 LRFYKSGIYDEPECSRTKLDHGVLAVGYGSMDGMD----YWLVKNSWGSAWGDMGYVKMT 309
Query: 290 RGVGGSGLCNIAANAAYPL 308
R C IA A+YP+
Sbjct: 310 RNKNNQ--CGIATKASYPV 326
>gi|357139514|ref|XP_003571326.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 363
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 114/339 (33%), Positives = 158/339 (46%), Gaps = 70/339 (20%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHEFLR----------------------------L 48
+W ++++ Y E+E RF +F+ N + +
Sbjct: 45 KWQAKYSKRYPSHEEQEKRFGVFRDNSNSIGAFSAPQTTTSAVVGSFGAPQTVTTVRVGM 104
Query: 49 NKFADLTREKFLASYTGYK--------PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNE 100
N+F DL + L +TG+ PPPT PH +R +DW
Sbjct: 105 NRFGDLQPREVLDQFTGFNNTAAVLKTPPPTRLPHHSRKPC-------------CVDWRS 151
Query: 101 RGAVTPVKDQGS-YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST-LNGCAKNFLE 158
GAVT VK QGS CWAF AVA +EG+NKIRTG LV+ S+ QLVDC +GCA +
Sbjct: 152 SGAVTGVKFQGSCQSCWAFAAVAAIEGMNKIRTGTLVSLSEQQLVDCDNGSSGCAGGRTD 211
Query: 159 NAFEYIRQYQRLASECVYPY---QGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQ 215
A + + + + S Y Y GR + A A+ G++ V P E L
Sbjct: 212 TALDLVARRGGITSGERYAYGGFNGRCKVDKLLFDHGA-----AVGGFKAVPPNDEHQLA 266
Query: 216 DVVSRQPVSVAIDA-TW-FNFYHGGVFTGPCGNTP---NHGVTIVGYGTTTEAEGQQPYW 270
V+RQPV+ +DA TW F FY GG+F GPC P NH VTIVGY E G + +W
Sbjct: 267 MAVARQPVTAYVDASTWEFQFYSGGIFRGPCSGDPARVNHAVTIVGY---CEEFGDK-FW 322
Query: 271 LVKNRWGTNWDEGGSMRIFRGVGGS--GLCNIAANAAYP 307
+ KN W +W + G + + + V S G C +A + YP
Sbjct: 323 IAKNSWSDDWGDQGYILLAKDVLSSPNGTCGLATSPFYP 361
>gi|242079875|ref|XP_002444706.1| hypothetical protein SORBIDRAFT_07g026400 [Sorghum bicolor]
gi|241941056|gb|EES14201.1| hypothetical protein SORBIDRAFT_07g026400 [Sorghum bicolor]
Length = 374
Score = 159 bits (403), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 117/336 (34%), Positives = 162/336 (48%), Gaps = 56/336 (16%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFLA 61
+E+W +A + D AEK+ RF FK N +EF L LN+F+ LT E+F +
Sbjct: 50 YERWCSVYAGS-SDLAEKQRRFDAFKMNARQINEFNKREDESYKLALNQFSGLTEEEFNS 108
Query: 62 S-YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSID-------------------WNER 101
YTG P N ++ +S MS D D W
Sbjct: 109 GMYTGALPE-----LDAGGNISSSVGTSGMSMTDDNDDKLLVSAGGNDDKVPAKWDWRRH 163
Query: 102 GAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCAKNFLENA 160
GAVTPVK+QG CWAF+ V +VEG+N I+TG+L T S+ +++DCS C +
Sbjct: 164 GAVTPVKNQGQCGSCWAFSMVGSVEGINAIKTGKLQTLSEQEVLDCSGAGTCKGGNTYKS 223
Query: 161 FEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA------IRGYQYVQPATEEGL 214
F++ + +QG YY + ++ I G + ++ E L
Sbjct: 224 FDHA-----MRPGLALDHQGNPPYYPAYVAEKKKCRFNPNKPVVKINGKRMMRNTNEAEL 278
Query: 215 QDVVSRQPVSVAIDATW-FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVK 273
VS+QPVSV ++A+ F+ Y GVFTGPCG NH V +VGYGTT YW+VK
Sbjct: 279 LLRVSKQPVSVVVEASQAFSRYSKGVFTGPCGTNLNHAVLVVGYGTTPNGIN---YWIVK 335
Query: 274 NRWGTNWDEGGSMRIFRGVG-GSGLCNIAANAAYPL 308
N WG W E G +R+ R VG +GLC I YP+
Sbjct: 336 NSWGKGWGENGYIRMKRNVGTKAGLCGIYMMPMYPI 371
>gi|195379496|ref|XP_002048514.1| GJ14012 [Drosophila virilis]
gi|194155672|gb|EDW70856.1| GJ14012 [Drosophila virilis]
Length = 327
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 104/319 (32%), Positives = 163/319 (51%), Gaps = 37/319 (11%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNKFADLTREKFLASYTGYKPPP 70
+A++ E + VE+ ++Y+D E+++R +IFK N + + D E++ A Y+
Sbjct: 25 LASEFESFKVEYEKSYEDDGEEQLRMQIFKDNKQLI------DRHNERYAAGEETYEMGV 78
Query: 71 ---TDHPHSN-RSNWFKNLNSSKMS-------------FYDSIDWNERGAVTPVKDQGSY 113
TD + R NLN S + +DW E+GAVTPVK+QG
Sbjct: 79 NQFTDMLATEFRKIMLVNLNISDFTSSIEYIYSPANAEIPSQVDWREKGAVTPVKNQGRC 138
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQR 169
CWAF+A +EG + I+T QL+ S+ L+DCS+ +GC + A Y+R +
Sbjct: 139 GSCWAFSAAGALEGQHFIQTKQLIPLSEQNLLDCSSRYNNHGCGGGWPAAALMYVRDNRG 198
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
+ ++ YPY+G C + R S S + + + A V ++ PVSVA+DA
Sbjct: 199 MDNDRAYPYEGHVGR-CRFRRYSVSATVTQVMQVRRDEVALANA---VATKGPVSVAVDA 254
Query: 230 TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
T+F Y GGV++ C NH + +VGYG+ +WL+KN WG W E G MR+
Sbjct: 255 TYFQHYRGGVYSHRCRQQANHAMLVVGYGSDQRG---GDFWLIKNSWG-GWGEQGYMRLA 310
Query: 290 RGVGGSGLCNIAANAAYPL 308
R G LC++A+ A +P+
Sbjct: 311 RNQG--NLCHVASYAVFPI 327
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 106/318 (33%), Positives = 160/318 (50%), Gaps = 39/318 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + +TY+ E+ +RFKIF +N L +N+F DL +F
Sbjct: 28 EAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ GY S S + N + S ++DW ++GAVTPVKDQG CWAF
Sbjct: 88 ARIFNGYHGSR----KSGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+ ++EG + ++ G+LV+ S+ LVDCS NGC +E+AF+YI+ + +E
Sbjct: 144 STTGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKS 203
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--F 232
YPY+ D C + + GY ++ E+ L+ V+ P+SVAIDA+ F
Sbjct: 204 YPYEAV-DGECRFKKEDVG---ATDTGYVEIKAGCEDDLKKAVATVGPISVAIDASHSSF 259
Query: 233 NFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
Y GV+ P C + +HGV +VGYG +G + YWLVKN W +W + G + + R
Sbjct: 260 QLYSEGVYDEPECSSEDLDHGVLVVGYGV----KGGKKYWLVKNSWAESWGDQGYILMSR 315
Query: 291 GVGGSGLCNIAANAAYPL 308
+ C IA+ A+YPL
Sbjct: 316 --DNNNQCGIASQASYPL 331
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 159 bits (402), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 116/327 (35%), Positives = 169/327 (51%), Gaps = 47/327 (14%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W +E + Y+D E+ R KIF K N F L +NK+ADL
Sbjct: 27 EEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 86
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNW-FKN---LNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F G+ T H ++ FK ++ + ++ S+DW +GAVT VKDQG
Sbjct: 87 HEFRQLMNGFNY--TLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQG- 143
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CWAF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF YI+
Sbjct: 144 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 203
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSRQ-PVSV 225
+ +E YPY+ D C + + G GA RG+ + E+ + + V+ PVSV
Sbjct: 204 GGIDTEKSYPYEAIDD-SCHFNK----GTIGATDRGFTDIPQGDEKKMAEAVATVGPVSV 258
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
AIDA+ F FY GV+ P + N HGV +VG+GT E YWLVKN WGT W
Sbjct: 259 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTD---ESGDDYWLVKNSWGTTWG 315
Query: 282 EGGSMRIFRGVGGSGLCNIAANAAYPL 308
+ G +++ R C IA+ ++YPL
Sbjct: 316 DKGFIKMLR--NKENQCGIASASSYPL 340
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/326 (33%), Positives = 169/326 (51%), Gaps = 50/326 (15%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTR 56
A+ + + V + Y+ + R KIF +N L++N+F D+
Sbjct: 25 AEWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLIARHNIKHAKGETTYKLKMNQFGDMLH 84
Query: 57 EKFLASYTGYKPPPTDHPHSNR----SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F+++ G SNR S W + +S S+DW E+GAVTPVK+QG
Sbjct: 85 HEFVSTMNGL-------LRSNRTYFGSTW---IEPESVSLPKSVDWREKGAVTPVKNQG- 133
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CW+F+ +EG +TG+LV+ S+ L+DCST NGC ++NAF YI++
Sbjct: 134 HCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSYGNNGCGGGLMDNAFTYIKEN 193
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVA 226
+ +E YPY+G+Q C + + ++G+ G+ + E L + + + PVSVA
Sbjct: 194 HGIDTEESYPYEGKQG-KCRYHKEDSAGR---DTGFVDIPSGNERALAKALATIGPVSVA 249
Query: 227 IDATW--FNFYHGGVFTGP-C-GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
IDA+ F FYH GV+ P C ++ +HGV VGYGTT + Q Y+++KN WG W +
Sbjct: 250 IDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDDG---QDYYIIKNSWGERWGQ 306
Query: 283 GGSMRIFRGVGGSGLCNIAANAAYPL 308
G + + R C +A A+YPL
Sbjct: 307 EGYVLMAR--NSKNECGVATQASYPL 330
>gi|301767944|ref|XP_002919404.1| PREDICTED: cathepsin K-like [Ailuropoda melanoleuca]
gi|281352889|gb|EFB28473.1| hypothetical protein PANDA_008011 [Ailuropoda melanoleuca]
Length = 330
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/315 (34%), Positives = 165/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E W + + Y + ++ R I++KN H + L +N D+T E+
Sbjct: 28 ELWKKTYGKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEV 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ H +N + + + S DSID+ ++G VTPVK+QG CWAF
Sbjct: 88 VQKMTGLKVPPS-HSRNNDTLYIPDWESRAP---DSIDYRKKGYVTPVKNQGQCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 144 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 203
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R P+SVAIDA T F F
Sbjct: 204 YVG-QDESCMY---NPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQF 259
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + +W++KN WG NW G + + R
Sbjct: 260 YSKGVYYDENCNSDNLNHAVLAVGYGI----QKGNKHWIIKNSWGENWGNKGYILMARNK 315
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 316 NNA--CGIANLASFP 328
>gi|47523662|ref|NP_999467.1| cathepsin K precursor [Sus scrofa]
gi|15213940|sp|Q9GLE3.1|CATK_PIG RecName: Full=Cathepsin K; Flags: Precursor
gi|10048286|gb|AAG12340.1|AF292030_1 cathepsin K precursor [Sus scrofa]
Length = 330
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 165/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E W + + Y + ++ R I++KN H + L +N D+T E+
Sbjct: 28 ELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEV 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ H SN + + + DSID+ ++G VTPVK+QG CWAF
Sbjct: 88 VQKMTGLKVPPS-HSRSNDTLYIPDWEGRTP---DSIDYRKKGYVTPVKNQGQCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 144 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 203
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R PVSVAIDA T F F
Sbjct: 204 YVG-QDENCMY---NPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQF 259
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + + +W++KN WG NW G + + R
Sbjct: 260 YSKGVYYDENCNSDNLNHAVLAVGYGI----QKGKKHWIIKNSWGENWGNKGYILMARNK 315
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 316 NNA--CGIANLASFP 328
>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
Length = 320
Score = 159 bits (402), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 102/296 (34%), Positives = 148/296 (50%), Gaps = 51/296 (17%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLAS 62
E W + ++Y EK R IF ++ LNKF+DLT +F A+
Sbjct: 42 EDWAAKHGKSYSSDWEKARRMTIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRAN 101
Query: 63 YTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
Y G +KPP + +R K+++ S S+DW + GAVTP+KDQG CWAF+A
Sbjct: 102 YVGKFKPPR----YQDRRP-AKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSA 156
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+A++E + + T QLV+ S+ QL+DC T++ GC E YPY
Sbjct: 157 IASIESAHFLATNQLVSLSEQQLIDCDTVDEGC-------------------QEEAYPYT 197
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF--YHG 237
G C+ + K I G+ V + L VS+ PV+V I + NF Y
Sbjct: 198 GLAG-SCN----ANKNKVAEITGFNVVTKDKADALMKAVSKTPVTVGICGSDQNFQNYRS 252
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
G+ +G C N+ +H V ++GYGT EG PYW++KN WGT+W E G M+I + G
Sbjct: 253 GILSGQCCNSRDHVVLVIGYGT----EGGMPYWIIKNSWGTSWGEDGFMKIEKKDG 304
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 111/324 (34%), Positives = 164/324 (50%), Gaps = 41/324 (12%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTR 56
E+W +E + Y D E+ R KIF +N L LNK+AD+
Sbjct: 27 EEWNTFKLEHRKNYADSTEETFRMKIFNENKHHIAKHNQRYATGEVSYKLALNKYADMLH 86
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNW--FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC 114
+F + G+ S ++ ++ + ++DW +GAVT VKDQG +C
Sbjct: 87 HEFRETMNGFNYTLHKQLRSTDESFTGVTFISPEHVKLPTAVDWRTKGAVTEVKDQG-HC 145
Query: 115 --CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQR 169
CWAF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF Y++
Sbjct: 146 GSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYVKDNGG 205
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAID 228
+ +E Y Y+G D C + ++S RG+ + E+ L Q V + PVSVAID
Sbjct: 206 IDTEKSYAYEGIDD-SCHFDKNSIG---ATDRGFADIPQGNEKKLAQAVATIGPVSVAID 261
Query: 229 ATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
A+ F FY GV+ P + N HGV +VGYG TE +G YWLVKN WGT W + G
Sbjct: 262 ASQQSFQFYSEGVYDEPNCSAENLDHGVLVVGYG--TEKDGSD-YWLVKNSWGTTWGDKG 318
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
+++ R C IA+ ++YPL
Sbjct: 319 FIKMSR--NKENQCGIASASSYPL 340
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 535
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 156/312 (50%), Gaps = 34/312 (10%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKFLASY 63
WM + ++ D E R + + N + L N+F+ ++ E+F
Sbjct: 32 WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAV 121
TGY P + ++ NL S + DS+DW ++G VTPVK+QG C CWAF+
Sbjct: 92 TGYVMP-EGYLEQRLASRVDNL-WSDVQVPDSVDWQDKGGVTPVKNQG-MCGSCWAFSTT 148
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VEG + +G+LV+ S+ +LVDC GC +++AF +I + SE Y Y+
Sbjct: 149 GAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYK 208
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
+ D K I G+Q V P E L+ V++QPVSVAI+A F FY
Sbjct: 209 AKAQVCRD------CEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKS 262
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF CG +HGV VGYG+ E Q +W VKN WG++W E G +R+ R G +G
Sbjct: 263 GVFNLTCGTRLDHGVLAVGYGS----ENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAG 318
Query: 297 LCNIAANAAYPL 308
C IA+ +YP
Sbjct: 319 QCGIASVPSYPF 330
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/318 (34%), Positives = 160/318 (50%), Gaps = 47/318 (14%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLA 61
W + + YK++ E+ R I++KN +F L +N D+T E+ ++
Sbjct: 40 WKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMTSEEVIS 99
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ + P S W +N+ ++S DS+DW E+G VT VK QG+ CWA
Sbjct: 100 LMSSLRVP---------SQWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWA 150
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLASE 173
F+AV +E K++TG+LV+ S LVDCST GC F+ AF+YI + SE
Sbjct: 151 FSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSE 210
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--T 230
YPY+ D C R + + Y + +E+ L++ V+ + PVSVAIDA +
Sbjct: 211 ASYPYKA-TDGKC---RYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHS 266
Query: 231 WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y GV+ P C NHGV +VGYG + YWLVKN WG N+ + G +R+
Sbjct: 267 SFFLYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKD----YWLVKNSWGLNFGDQGYIRMA 322
Query: 290 RGVGGSGLCNIAANAAYP 307
R G C IA+ +YP
Sbjct: 323 RNSGNH--CGIASYPSYP 338
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 115/327 (35%), Positives = 170/327 (51%), Gaps = 47/327 (14%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W +E + Y+D E+ R KIF K N F L +NK+ADL
Sbjct: 27 EEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADLLH 86
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNW-FKN---LNSSKMSFYDSIDWNERGAVTPVKDQGS 112
+F G+ T H ++ FK ++ + ++ S+DW +GAVT VKDQG
Sbjct: 87 HEFRQLMNGFNY--TLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQG- 143
Query: 113 YC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
+C CWAF++ +EG + ++G LV+ S+ LVDCST NGC ++NAF YI+
Sbjct: 144 HCGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDN 203
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVVSRQ-PVSV 225
+ +E YPY+ D C + + G GA RG+ + E+ + + V+ PV+V
Sbjct: 204 GGIDTEKSYPYEAIDD-SCHFNK----GTIGATDRGFTDIPQGDEKKMAEAVATVGPVAV 258
Query: 226 AIDATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
AIDA+ F FY GV+ P + N HGV +VG+GT E + YWLVKN WGT W
Sbjct: 259 AIDASHESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGTD---ESGEDYWLVKNSWGTTWG 315
Query: 282 EGGSMRIFRGVGGSGLCNIAANAAYPL 308
+ G +++ R C IA+ ++YPL
Sbjct: 316 DKGFIKMLR--NKENQCGIASASSYPL 340
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/326 (33%), Positives = 169/326 (51%), Gaps = 41/326 (12%)
Query: 9 GNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHE----------------FLRLNKFA 52
G + + E W + ++Y+ E+++R KI +N ++++N +
Sbjct: 21 GVVLSDWESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYG 80
Query: 53 DLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSK-MSFYDSIDWNERGAVTPVKDQG 111
DL +F+A GY+ + N+++ + SK + +DW E GAVTPVK+QG
Sbjct: 81 DLLHHEFVAMVNGYE-------YVNKTSLGGSFIPSKNVKLPTHVDWREDGAVTPVKNQG 133
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQY 167
CWAF++ ++EG +TG+L+ S+ LVDCS NGC ++ AF YIR
Sbjct: 134 QCGSCWAFSSTGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDN 193
Query: 168 QRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVA 226
+ + +E YPY+G C + S K + G+ V+ +EE L + V S PVSVA
Sbjct: 194 KGIDTEGSYPYEGVGGR-CHY---DPSKKGSSDIGFVDVKKGSEEELLKAVASVGPVSVA 249
Query: 227 IDATW--FNFY-HGGVFTGPCG-NTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDE 282
IDA+ F FY HG F C +HGV +VGYGT + + YWLVKN W NW +
Sbjct: 250 IDASHMSFQFYSHGVYFESKCSPENLDHGVLVVGYGT--DENSGEDYWLVKNSWSENWGD 307
Query: 283 GGSMRIFRGVGGSGLCNIAANAAYPL 308
G +++ R +C IA++A+YP+
Sbjct: 308 QGYIKMAR--NKKNMCGIASSASYPV 331
>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
Length = 328
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/316 (35%), Positives = 160/316 (50%), Gaps = 42/316 (13%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
+ W + Y+ QAE+ R ++KN H + L +N D+T E
Sbjct: 29 QLWKKAHGKEYRHQAEEGQRRATWEKNLRLVMLHNLEHSLGLHSYQLGMNHMGDMTSEDV 88
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A TG + P + H+ S + + + D++DW E+G VT VK+QG+ CWAF
Sbjct: 89 AALLTGLRVP---YGHNQTSTYRRRGGAP-----DAMDWREKGCVTEVKNQGACGACWAF 140
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+AV +E K++TG+LV+ S LVDCS + GC F+ AF+YI + SE
Sbjct: 141 SAVGALEAQVKLKTGKLVSLSAQNLVDCSMMYGNKGCGGGFMTRAFQYIIDNNGIDSEES 200
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--F 232
YPY Q+ C + + S + Y + A E L+D V+ PVSVAIDAT F
Sbjct: 201 YPYMA-QNGTCQY---NVSTRAATCSKYVELPYADEAALKDAVANVGPVSVAIDATQPTF 256
Query: 233 NFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
Y GV+ P C NHGV +VGYGT E + +WLVKN WG + +GG +R+ R
Sbjct: 257 FLYRSGVYDDPRCTQEVNHGVLVVGYGTLNEKD----FWLVKNSWGERFGDGGYIRMSR- 311
Query: 292 VGGSGLCNIAANAAYP 307
+ C IA+ A+YP
Sbjct: 312 -NHANHCGIASYASYP 326
>gi|54020908|ref|NP_001005695.1| cathepsin S precursor [Xenopus (Silurana) tropicalis]
gi|49522293|gb|AAH75261.1| cathepsin S [Xenopus (Silurana) tropicalis]
Length = 333
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/316 (34%), Positives = 154/316 (48%), Gaps = 40/316 (12%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLA 61
W + Y+D+ E R ++KN L +N AD+T E+ +
Sbjct: 30 WKNTHNKDYEDEIEDLQRRITWEKNLNLVNMHNLEYSMGMHTYELGMNHLADMTSEEIKS 89
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMS--FYDSIDWNERGAVTPVKDQGSY-CCWAF 118
TG PP S R F + +S DSIDW ++G V+ VK+QG CWAF
Sbjct: 90 KLTGLILPP----QSERQATFSSQKNSTFGGKVPDSIDWRDKGCVSDVKNQGGCGSCWAF 145
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+AV +EG ++TG+LV+ S LVDCS+ GC F+ AF+Y+ + + S+
Sbjct: 146 SAVGALEGQLMLKTGKLVSLSPQNLVDCSSKYGNKGCGGGFMTQAFQYVIDNKGIDSDSY 205
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVV-SRQPVSVAIDAT--WF 232
YPY D C + +GK Y + P TE+ L+ + S P+SVAID T F
Sbjct: 206 YPYHA-MDEKCHY---DPTGKASTCAKYTEIVPGTEDNLKQALGSIGPISVAIDGTRPSF 261
Query: 233 NFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
Y GV++ P C + NHGV VGYG Q +WL+KN WGT + + G +RI R
Sbjct: 262 FLYRSGVYSDPTCSHEVNHGVLAVGYGNLN----GQDFWLLKNSWGTKYGDQGYVRIARN 317
Query: 292 VGGSGLCNIAANAAYP 307
G LC +A+ YP
Sbjct: 318 KG--NLCGVASYTCYP 331
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 109/318 (34%), Positives = 160/318 (50%), Gaps = 47/318 (14%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLA 61
W + + YK++ E+ R I++KN +F L +N D+T E+ ++
Sbjct: 28 WKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHLGDMTSEEVIS 87
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ + P S W +N+ ++S DS+DW E+G VT VK QG+ CWA
Sbjct: 88 LMSSLRVP---------SQWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGACGACWA 138
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLASE 173
F+AV +E K++TG+LV+ S LVDCST GC F+ AF+YI + SE
Sbjct: 139 FSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDNNGIDSE 198
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--T 230
YPY+ D C R + + Y + +E+ L++ V+ + PVSVAIDA +
Sbjct: 199 ASYPYKA-TDGKC---RYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDARHS 254
Query: 231 WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y GV+ P C NHGV +VGYG + YWLVKN WG N+ + G +R+
Sbjct: 255 SFFLYRSGVYYDPSCTQNVNHGVLVVGYGNLNGKD----YWLVKNSWGLNFGDQGYIRMA 310
Query: 290 RGVGGSGLCNIAANAAYP 307
R G C IA+ +YP
Sbjct: 311 RNSGNH--CGIASYPSYP 326
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 110/314 (35%), Positives = 157/314 (50%), Gaps = 34/314 (10%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHEF-------------LRLNKFADLTREKFLASY 63
+W R Y E+ +R +I+ N E L +N+F DL +F A Y
Sbjct: 23 EWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAAKY 82
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G + + S S+ + +S DS+DW G VTPVK+QG CW+F+
Sbjct: 83 LGVRFNGVNATKSFASSTYL---PRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTG 139
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+VEG + +TG LV+ S+ LVDCS+ GC +++AFEYI + + +E YPY
Sbjct: 140 SVEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYT 199
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATWFN--FYH 236
C + +A+ + YQ + +E LQ+ V+ PVSVAIDA+ N FY
Sbjct: 200 ATTG-TCKF---NAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYF 255
Query: 237 GGVFT-GPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
GV+ C T +HGV VGYGT+TE + YWLVKN WG W + G + + R
Sbjct: 256 TGVYNEKKCSTTQLDHGVLAVGYGTSTEG---KDYWLVKNSWGATWGKAGYIWMSRNADN 312
Query: 295 SGLCNIAANAAYPL 308
C IA +A+YPL
Sbjct: 313 Q--CGIATSASYPL 324
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 159 bits (401), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 156/312 (50%), Gaps = 34/312 (10%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKFLASY 63
WM + ++ D E R + + N + L N+F+ ++ E+F
Sbjct: 32 WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAV 121
TGY P + ++ NL S + DS+DW ++G VTPVK+QG C CWAF+
Sbjct: 92 TGYVMP-EGYLEQRLASRVDNL-WSDVQVPDSVDWQDKGGVTPVKNQG-MCGSCWAFSTT 148
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPYQ 179
VEG + +G+LV+ S+ +LVDC GC +++AF +I + SE Y Y+
Sbjct: 149 GAVEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYK 208
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW--FNFYHG 237
+ D K I G+Q V P E L+ V++QPVSVAI+A F FY
Sbjct: 209 AKAQVCRD------CEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKS 262
Query: 238 GVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SG 296
GVF CG +HGV VGYG+ E Q +W VKN WG++W E G +R+ R G +G
Sbjct: 263 GVFNLTCGTRLDHGVLAVGYGS----ENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAG 318
Query: 297 LCNIAANAAYPL 308
C IA+ +YP
Sbjct: 319 QCGIASVPSYPF 330
>gi|224809458|ref|NP_001019580.2| cathepsin S, b.1 precursor [Danio rerio]
gi|63101450|gb|AAH95788.1| Cathepsin S, b.1 [Danio rerio]
gi|77748418|gb|AAI07613.1| Cathepsin S, b.1 [Danio rerio]
Length = 330
Score = 158 bits (400), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/328 (34%), Positives = 161/328 (49%), Gaps = 39/328 (11%)
Query: 5 SHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRL 48
+H N+ E W + + Y + E+ R +++++N + L +
Sbjct: 17 AHFNTNLDQHWELWKKTYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMGMHSYDLSM 76
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N DLT E+ L + T P + + SS + DS+DW E+G V+ VK
Sbjct: 77 NHMGDLTTEEILQTLA-----LTHVPSGFKRQIANIVGSSGDAVPDSLDWREKGYVSSVK 131
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYI 164
QG+ CWAF++V +EG K TG+LV S LVDCS+ GC F+ +AF+Y+
Sbjct: 132 MQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYV 191
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPV 223
+AS+ YPY+G Q A+ Y +V+ E L Q V S P+
Sbjct: 192 IDNGGIASDSAYPYRGVQQQCSYSSSQRAAN----CTKYYFVRQGDENALKQAVASVGPI 247
Query: 224 SVAIDAT--WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
SVAIDAT F YH GV+ P C NH V +VGYGT + Q YWLVKN WGT +
Sbjct: 248 SVAIDATRPQFVLYHSGVYNDPTCSKRVNHAVLVVGYGTLS----GQDYWLVKNSWGTRF 303
Query: 281 DEGGSMRIFRGVGGSGLCNIAANAAYPL 308
+GG +R+ R + +C IA+ A YP+
Sbjct: 304 GDGGYIRMAR--NKNNMCGIASYACYPV 329
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 105/318 (33%), Positives = 162/318 (50%), Gaps = 39/318 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + +TY+ E+ +RFKIF +N L +N+F DL +F
Sbjct: 28 EAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ G++ + S + N + S ++DW ++GAVTPVKDQG CWAF
Sbjct: 88 ARIFNGHRGTR----KTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+A ++EG + ++ G+LV+ S+ LVDCS NGC +E+AF+YI+ + +E
Sbjct: 144 SATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGIDTEKS 203
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--F 232
YPY+ D C + + GY ++ +E L+ V+ P+SVAIDA+ F
Sbjct: 204 YPYEAV-DGECRFKKEDVG---ATDTGYVEIKAGSEVDLKKAVATVGPISVAIDASHSSF 259
Query: 233 NFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
Y GV+ P C + +HGV +VGYG +G + YWLVKN W +W + G + + R
Sbjct: 260 QLYSEGVYDEPECSSEDLDHGVLVVGYGV----KGGKKYWLVKNSWAESWGDQGYILMSR 315
Query: 291 GVGGSGLCNIAANAAYPL 308
+ C IA+ A+YPL
Sbjct: 316 --DNNNQCGIASQASYPL 331
>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 113/305 (37%), Positives = 159/305 (52%), Gaps = 25/305 (8%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-LNKFADLTREKFLASYTGYKPPPTDH 73
H Q M +++ KD + +FK+N ++ N AD ++ + + K H
Sbjct: 39 HGQRMTRYSKVDKDPPDX-----VFKENVNYIEACNNAADKPYKRDINQFAP-KKRFKGH 92
Query: 74 PHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKI 130
S+ R FK N + ++D ++ AVTP+KDQG C WA +AVA EG++ +
Sbjct: 93 MCSSIIRITTFKFENVTATP--STVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHAL 150
Query: 131 RTGQLVTRSKHQ-LVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYC 186
G+L+ S Q LVDC T C +++AF++I Q L +E YPY+G D C
Sbjct: 151 XAGKLILLSSEQELVDCDTKGVDQDCQGGLMDDAFKFIIQNHGLNTEANYPYKGV-DGKC 209
Query: 187 DWWRSSASGKYGAIRGYQYVQPATEEG-LQDVVSRQPVSVAIDATW--FNFYHGGVFTGP 243
+ + + + I GY+ V E+ LQ V+ PVSVAIDA+ F FY GVFTG
Sbjct: 210 NAYEADKNAAT-IITGYEDVPANNEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGS 268
Query: 244 CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-SGLCNIAA 302
CG +HGVT VGYG + + YWLVKN GT W E G +R+ RGV LC IA
Sbjct: 269 CGTELDHGVTAVGYGVSDDG---TEYWLVKNSRGTEWGEEGYIRMQRGVDSEEALCGIAV 325
Query: 303 NAAYP 307
A+YP
Sbjct: 326 QASYP 330
>gi|194352772|emb|CAQ00114.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 123/336 (36%), Positives = 167/336 (49%), Gaps = 47/336 (13%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFL--------RL------NKFADLTR 56
+ + +WM RTY AEK RF+ +++N + + RL N+F DLT
Sbjct: 41 MLGRFHRWMSWHGRTYPSAAEKLRRFEAYRRNVDLIDASNRDAERLGYELGENEFTDLTN 100
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSK----------MSFYD---SIDWNERGA 103
E+F+ Y G + + + + SSK M+ D DW E GA
Sbjct: 101 EEFMTRYIGGAGAGGGLITTLAGDVVEGVVSSKNTIEGDGNLTMTTSDPPRQFDWREHGA 160
Query: 104 VTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLEN 159
VTP K QG+ CCWAF A ATVE LNKI G+LV S +LVDCST + C + ++
Sbjct: 161 VTPAKQQGACGCCWAFAAAATVESLNKINGGELVDLSVQELVDCSTGVFSSPCGYGWPKS 220
Query: 160 AFEYIRQYQRLASECVYPY---QGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT-EEGLQ 215
A ++I+ L +E YPY +GR + A+ + G I G Q VQP + E+ L
Sbjct: 221 ALQWIKSKGGLLTEAEYPYVAKRGRCKVH------DAARRIGKITGVQDVQPGSNEDALA 274
Query: 216 DVVSRQPVSVAID--ATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVK 273
V R PV+V ID + Y GV+ GPC + NH VT+VGYG T E YW+ K
Sbjct: 275 LAVLRTPVTVQIDGSGSVLQNYKSGVYKGPCTTSQNHVVTVVGYGVTGAGE---EYWIAK 331
Query: 274 NRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
N WG W + G + RG G GLC +A AYP+
Sbjct: 332 NSWGQTWGQNGFFFMRRGADGPRGLCGMAMYGAYPV 367
>gi|326508044|dbj|BAJ86765.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 124/336 (36%), Positives = 166/336 (49%), Gaps = 47/336 (13%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFL--------RL------NKFADLTR 56
+ + +WM RTY AEK RF+ +++N + + RL N+F DLT
Sbjct: 41 MLGRFHRWMSWHGRTYPSAAEKLRRFEAYRRNVDLIDASNRDAERLGYELGENEFTDLTN 100
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSK----------MSFYD---SIDWNERGA 103
E+F+ Y G + + + + SSK MS D DW E GA
Sbjct: 101 EEFMTRYIGGAGAGGGLITTLAGDVVEGVVSSKNTIEGGGNLTMSTSDPPRQFDWREHGA 160
Query: 104 VTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLEN 159
VTP K QG+ CCWAF A ATVE LNKI G+LV S +LVDCST + C + ++
Sbjct: 161 VTPAKQQGACGCCWAFAAAATVESLNKINGGELVDLSVQELVDCSTGVFSSPCGYGWPKS 220
Query: 160 AFEYIRQYQRLASECVYPY---QGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT-EEGLQ 215
A ++I+ L +E YPY +GR + A+ + G I G Q VQP + E L
Sbjct: 221 ALQWIKSKGGLLTEAEYPYVAKRGRCTVH------DAARRIGKITGVQDVQPGSNENALA 274
Query: 216 DVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVK 273
V R PV+V ID + Y GV+ GPC + NH VT+VGYG T E YW+ K
Sbjct: 275 LAVLRTPVTVQIDGSGSVLQNYKSGVYKGPCTTSQNHVVTVVGYGVTGAGE---EYWIAK 331
Query: 274 NRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
N WG W + G + RG G GLC +A AYP+
Sbjct: 332 NSWGQTWGQNGFFFMRRGADGPRGLCGMAMYGAYPV 367
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 164/330 (49%), Gaps = 47/330 (14%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W +++ YK++ E+ R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++ + P S W +N+ ++S DS+DW E+G VT
Sbjct: 79 HLGDMTGEEVISLMGSLRVP---------SQWQRNVTYRSNSNQKLPDSVDWREKGCVTE 129
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCST GC F+ AF
Sbjct: 130 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAF 189
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
+YI + SE YPY+ C R + + Y + +E+ L++ V+ +
Sbjct: 190 QYIIDNNGIDSEASYPYKAMNG-KC---RYDSKKRAATCSKYTELPFGSEDALKEAVANK 245
Query: 222 -PVSVAIDATWFNF--YHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
PVSVAIDA+ ++F Y GV+ P C NHGV +VGYG + YWLVKN WG
Sbjct: 246 GPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKD----YWLVKNSWG 301
Query: 278 TNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
N+ + G +R+ R G C IA+ +YP
Sbjct: 302 LNFGDQGYIRMARNSGNH--CGIASYPSYP 329
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 164/330 (49%), Gaps = 47/330 (14%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W +++ YK++ E+ R I++KN +F+ L N
Sbjct: 27 HKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 86
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++ + P S W +N+ ++S DS+DW E+G VT
Sbjct: 87 HLGDMTGEEVISLMGSLRVP---------SQWQRNVTYRSNSNQKLPDSVDWREKGCVTE 137
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCST GC F+ AF
Sbjct: 138 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAF 197
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
+YI + SE YPY+ C R + + Y + +E+ L++ V+ +
Sbjct: 198 QYIIDNNGIDSEASYPYKAMNG-KC---RYDSKKRAATCSKYTELPFGSEDALKEAVANK 253
Query: 222 -PVSVAIDATWFNF--YHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
PVSVAIDA+ ++F Y GV+ P C NHGV +VGYG + YWLVKN WG
Sbjct: 254 GPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNLNGKD----YWLVKNSWG 309
Query: 278 TNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
N+ + G +R+ R G C IA+ +YP
Sbjct: 310 LNFGDQGYIRMARNSGNH--CGIASYPSYP 337
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 107/324 (33%), Positives = 166/324 (51%), Gaps = 41/324 (12%)
Query: 16 EQWM---VEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTR 56
E+W +E + Y D+ E+ R KIF +N H+ + +NK+AD+
Sbjct: 25 EEWQTFKLEHRKNYVDETEERFRLKIFNENKHKIAKHNQRYASGEVSFKMAVNKYADMLH 84
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWF--KNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC 114
+F + G+ ++ ++ ++ + S+DW +GAVT VKDQG +C
Sbjct: 85 HEFHTTMNGFNYTLHKQLRASDPSFVGVTFISPEHVKIPKSVDWRSKGAVTEVKDQG-HC 143
Query: 115 --CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQR 169
CWAF++ +EG + + G L++ S+ LVDCST NGC ++NAF YI+
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAID 228
+ +E YPY+G D C + +++ RG + E+ + + V+ PVSVAID
Sbjct: 204 IDTEKSYPYEGIDD-SCHFNKATIG---ATDRGSVDIPQGDEKKMAEAVATIGPVSVAID 259
Query: 229 ATW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
A+ F FY G++ P + N HGV +VGYGT E Q YWLVKN WGT W + G
Sbjct: 260 ASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGTD---ESGQDYWLVKNSWGTTWGDKG 316
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
+++ R C IA+ ++YPL
Sbjct: 317 FIKMARNADNQ--CGIASASSYPL 338
>gi|33333710|gb|AAQ11973.1| putative gut cathepsin L-like cysteine protease [Callosobruchus
maculatus]
Length = 326
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 109/324 (33%), Positives = 163/324 (50%), Gaps = 54/324 (16%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFL----------------RLNKFADLTREKF 59
+Q+ ++ +TY+ E++ RF +F+KN + ++ +FAD+T E+F
Sbjct: 24 QQFKLDHGKTYRSLVEEKRRFSVFQKNLVDIQEHNKKYERGEESFAKKVTQFADMTHEEF 83
Query: 60 L--ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
L G P++ H F N M D++DW E GAVTPVKDQ + CW
Sbjct: 84 LDLLKLQGVPALPSNAVH------FDNFEDIDMEEKDAVDWREEGAVTPVKDQANCGSCW 137
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLAS 172
AF+AV +EG + G LV+ S +LVDC+T NGC + AF+++ Q + + +
Sbjct: 138 AFSAVGAIEGQFFKKNGTLVSLSAQELVDCATEDYGNNGCKGGLMGQAFDFV-QDEGIQT 196
Query: 173 ECVYPYQGRQDYYCDWWRSSA--SGKYGAIRGYQYVQPATEEGL-QDVVSRQPVSVAIDA 229
E YPY+GR RSS SG+Y + YV P E+ + + V ++ PV+VAI+A
Sbjct: 197 EESYPYEGR--------RSSCKKSGEY-VTKVKTYVFPLDEQEMARTVAAKGPVAVAIEA 247
Query: 230 TWFNFYHGGVFTGPC-----GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
+ +FY G+ C N GV +VGYG+ E YW+VKN WG +W E G
Sbjct: 248 SQLSFYDKGIVDERCRCSNKREDLNPGVLVVGYGS----ENGVDYWIVKNSWGADWGEKG 303
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
R+ + V C I YP+
Sbjct: 304 YFRLKKDVKA---CGIGYYNTYPI 324
>gi|149751227|ref|XP_001490649.1| PREDICTED: cathepsin K-like [Equus caballus]
Length = 329
Score = 158 bits (400), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 164/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E W + + Y + ++ R I++KN H + L +N D+T E+
Sbjct: 27 ELWKKTYRKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEV 86
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ H SN + + + DSID+ ++G VTPVK+QG CWAF
Sbjct: 87 VQKMTGLKVPPS-HTRSNDTLYIPDWEGRAP---DSIDYRKKGYVTPVKNQGQCGSCWAF 142
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 143 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 202
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R PVSVAIDA T F F
Sbjct: 203 YVG-QDESCMY---NPTGKAAKCRGYREIPQGNEKALKRAVARVGPVSVAIDASLTSFQF 258
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + +W++KN WG NW G + + R
Sbjct: 259 YSRGVYYDENCNSDNLNHAVLAVGYGI----QKGNKHWIIKNSWGENWGNKGYILMARNK 314
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 315 NNA--CGIANMASFP 327
>gi|260234113|dbj|BAI44279.1| cysteine proteinase inhibitor precursor [Manduca sexta]
gi|261336196|dbj|BAH59606.2| cysteine proteinase inhibitor precursor [Manduca sexta]
Length = 2676
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 112/318 (35%), Positives = 160/318 (50%), Gaps = 38/318 (11%)
Query: 16 EQWMVEFARTYK-----DQAEKEMRFKIFKKN----HEFLR---------LNKFADLTRE 57
E EF TYK D+ + RF+IFK+N HE + +FADLT E
Sbjct: 2368 EHLFYEFLSTYKPEYIDDRHQMRQRFEIFKENVRKMHELNTHERGTATYGVTRFADLTYE 2427
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+F + G K D P+ + F+ ++ DS DW + GAVT VKDQGS CW
Sbjct: 2428 EFSTKHMGMKASLRD-PNQVQ---FRKAVIPNVTAPDSFDWRDHGAVTGVKDQGSCGSCW 2483
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECV 175
AF+ +EG K++TG LV+ S+ +LVDC L+ GC +NA+ I Q L SE
Sbjct: 2484 AFSVTGNIEGQWKMKTGDLVSLSEQELVDCDKLDQGCNGGLPDNAYRAIEQLGGLESEDD 2543
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFY 235
YPY+G D C + ++ A + I G + + + +V P+S+ I+A FY
Sbjct: 2544 YPYEGSDDK-CSFNKTLARVQ---ISGAVNITSNETDMAKWLVKHGPISIGINANAMQFY 2599
Query: 236 HGGV------FTGPCGNTPNHGVTIVGYGTTTEA--EGQQPYWLVKNRWGTNWDEGGSMR 287
GG+ P + +HGV IVGYG PYW++KN WGT+W E G R
Sbjct: 2600 MGGISHPWRMLCNP--SNLDHGVLIVGYGAKDYPLFHKHLPYWIIKNSWGTSWGEQGYYR 2657
Query: 288 IFRGVGGSGLCNIAANAA 305
++RG G G+ +A++A
Sbjct: 2658 VYRGDGTCGVNQMASSAV 2675
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 110/315 (34%), Positives = 163/315 (51%), Gaps = 36/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN-----------HEF-LRLNKFADLTREKFLASY 63
+ W + + Y E+ R I++ N H F L +N DLT+++F Y
Sbjct: 29 QAWKLFHTKKYTTVTEEGARKAIWRDNLKKIQKHNAEGHSFTLAMNHLGDLTQDEFRYFY 88
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
TG + +++ S + L S + D++DW + G VTPVK+QG CWAF+
Sbjct: 89 TGMRSHYSNYTKKQGSAF---LAPSHVQVPDTVDWRKEGYVTPVKNQGQCGSCWAFSTTG 145
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
++EG N +TG+LV+ S+ LVDCST NGC ++ AF+YI++ + +E YPY+
Sbjct: 146 SLEGQNFKKTGKLVSLSEQNLVDCSTAYGNNGCQGGLMDYAFKYIKENGGIDTEESYPYE 205
Query: 180 GRQDYYCDWWRSSASGKYGAI-RGYQYVQPATEEGLQDVV-SRQPVSVAIDA--TWFNFY 235
R D C + +S+ GA+ G+ V EE L+ + P+SVAIDA F FY
Sbjct: 206 ARND-RCRFQKSNI----GAVDTGFVDVTHGDEEALKTAAGTVGPISVAIDAGHMSFQFY 260
Query: 236 HGGVF--TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVG 293
H GV+ G + +HGV +VGYGT ++ YWLVKN WG W G + + R
Sbjct: 261 HSGVYNNAGCSSTSLDHGVLVVGYGTYQGSD----YWLVKNSWGERWGMEGYIMMSRNKN 316
Query: 294 GSGLCNIAANAAYPL 308
C +A A+YPL
Sbjct: 317 NQ--CGVATQASYPL 329
>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
Length = 331
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 114/334 (34%), Positives = 167/334 (50%), Gaps = 55/334 (16%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W + + YK++ E+ +R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL----NSSKMSFYDSIDWNERGAVT 105
D+T E+ ++ + + P S W +N+ N ++M DS+DW E+G VT
Sbjct: 79 HLGDMTSEEVMSLMSSLRVP---------SQWQRNITYKSNPNQM-LPDSVDWREKGCVT 128
Query: 106 PVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENA 160
VK QGS CWAF+AV +E K++TG+LV+ S LVDCST GC F+ A
Sbjct: 129 EVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRA 188
Query: 161 FEYIRQYQRLASECVYPYQGRQDYYCDW---WRSSASGKYGAIRGYQYVQPATEEGLQDV 217
F+YI + S+ YPY+ D C + +R++ KY + E+ L++V
Sbjct: 189 FQYIIDNNGIDSDASYPYKA-TDQKCQYDSKYRAATCSKYTEL------PYGREDVLKEV 241
Query: 218 VSRQ-PVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVK 273
V+ + PVSV +DA+ F Y GV+ P C NHGV +VGYG E YWLVK
Sbjct: 242 VANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVLNGKE----YWLVK 297
Query: 274 NRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
N WG N+ E G +R+ R G C IA+ +YP
Sbjct: 298 NSWGRNFGEEGYIRMARNKGNH--CGIASFPSYP 329
>gi|67678376|gb|AAH96862.1| Cathepsin S, b.2 [Danio rerio]
Length = 330
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 110/318 (34%), Positives = 161/318 (50%), Gaps = 41/318 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E W + + Y + E+ R +++++N E L +N AD+T E+
Sbjct: 28 ELWKKKHVKLYSCEDEEVGRRELWERNLELIAIHNLEASMGMHSYDLAINHMADMTTEEI 87
Query: 60 LASYTGYKPPPT-DHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
L + + PP P + + ++SS D++DW ++G VT VK+QG+ CWA
Sbjct: 88 LQTLAVTRVPPGFKRPTA------EYVSSSFAVVPDTLDWRDKGYVTSVKNQGACGSCWA 141
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASEC 174
F++V +EG TG+LV S LVDCS+ GC ++ AF+Y+ + SE
Sbjct: 142 FSSVGALEGQLMKTTGKLVDLSPQNLVDCSSKYGNLGCNGGYMSQAFQYVIDNGGIDSES 201
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT--W 231
YPYQG Q R S + Y++V E+ L++ ++ PVSVAIDAT
Sbjct: 202 SYPYQGTQGS----CRYDPSQRAANCTSYKFVSQGDEQALKEALANIGPVSVAIDATRPQ 257
Query: 232 FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F FY GV+ P C NHGV VGYGT + Q YWLVKN WG + +GG +RI R
Sbjct: 258 FIFYRSGVYDDPSCTQKVNHGVLAVGYGTLS----GQDYWLVKNSWGAGFGDGGYIRIAR 313
Query: 291 GVGGSGLCNIAANAAYPL 308
+ +C IA+ A YP+
Sbjct: 314 --NKNNMCGIASEACYPI 329
>gi|77404197|ref|NP_001029168.1| cathepsin K precursor [Canis lupus familiaris]
gi|122056102|sp|Q3ZKN1.1|CATK_CANFA RecName: Full=Cathepsin K; Flags: Precursor
gi|58047562|gb|AAW65150.1| cathepsin K [Canis lupus familiaris]
Length = 330
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 165/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
+ W + + Y + ++ R I++KN H + L +N D+T E+
Sbjct: 28 DLWKKTYRKQYNSKVDELSRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEV 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ H SN + + + S DS+D+ ++G VTPVK+QG CWAF
Sbjct: 88 VQKMTGLKVPPS-HSRSNDTLYIPDWESRAP---DSVDYRKKGYVTPVKNQGQCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 144 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 203
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R P+SVAIDA T F F
Sbjct: 204 YVG-QDESCMY---NPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQF 259
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + +W++KN WG NW G + + R
Sbjct: 260 YSKGVYYDENCNSDNLNHAVLAVGYGI----QKGNKHWIIKNSWGENWGNKGYILMARNK 315
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 316 NNA--CGIANLASFP 328
>gi|356564325|ref|XP_003550405.1| PREDICTED: cysteine proteinase 15A [Glycine max]
Length = 370
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 108/321 (33%), Positives = 161/321 (50%), Gaps = 43/321 (13%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRLN------------KFADLTREKFLASYTG 65
+ +FA+TY + E + RF +FK N RL+ KF+DLT +F + G
Sbjct: 59 FKAKFAKTYATKEEHDHRFGVFKSNLRRARLHAKLDPSAVHGVTKFSDLTPAEFRRQFLG 118
Query: 66 YKPPPTDHP-HSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
KP P H+ ++ + K DW ++GAVT VKDQG+ CW+F+
Sbjct: 119 LKP--LRFPAHAQKAPILPTKDLPK-----DFDWRDKGAVTNVKDQGACGSCWSFSTTGA 171
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTL----------NGCAKNFLENAFEYIRQYQRLASE 173
+EG + + TG+LV+ S+ QLVDC + +GC + NAFEYI Q + E
Sbjct: 172 LEGAHYLATGELVSLSEQQLVDCDHVCDPEEYGACDSGCNGGLMNNAFEYILQSGGVQKE 231
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFN 233
YPY GR D C + ++ + + Y V E+ ++V P++VAI+A +
Sbjct: 232 KDYPYTGR-DGTCKFDKTKVAA---TVSNYSVVSLDEEQIAANLVKNGPLAVAINAVFMQ 287
Query: 234 FYHGGVFTGP--CGNTPNHGVTIVGYGTTTEAE---GQQPYWLVKNRWGTNWDEGGSMRI 288
Y GGV + P CG +HGV +VGYG A +PYW++KN WG +W E G +I
Sbjct: 288 TYVGGV-SCPYICGKHLDHGVLLVGYGEGAYAPIRFKNKPYWIIKNSWGESWGENGYYKI 346
Query: 289 FRGVGGSGLCNIAANAA--YP 307
RG G+ ++ + A YP
Sbjct: 347 CRGRNVCGVDSMVSTVAAIYP 367
>gi|225707912|gb|ACO09802.1| Cathepsin K precursor [Osmerus mordax]
Length = 331
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 153/316 (48%), Gaps = 37/316 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E W + Y E+ +R I++KN L +N D+T E+
Sbjct: 29 ENWKTTHNKEYNGLDEEGIRRAIWEKNMRMIEAHNQEAALGMHSYELGMNNLGDMTSEEV 88
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
G + P + +R N F N+ + SID+ +G VTPVK+QGS CWAF
Sbjct: 89 AEKMMGLQVPL----NRDRGNTFVPDNTVE-RLPKSIDYRRKGMVTPVKNQGSCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG TG+LV S LVDC T N GC ++ NAF Y+R Q + SE YP
Sbjct: 144 SSVGALEGQLMKTTGKLVDLSPQNLVDCVTENNGCGGGYMTNAFNYVRDNQGIDSEAAYP 203
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--FNF 234
Y G QD C + + SG + RGY+ + E L V++ PVSV IDAT F F
Sbjct: 204 YIG-QDETCAY---NVSGMTASCRGYKEIPEGNERALTVAVAKVGPVSVGIDATLSTFQF 259
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N NH V VGYG T + + YW+VKN W +W G + + R
Sbjct: 260 YQKGVYYDRNCNKDDINHAVLAVGYGVTPKG---KKYWIVKNSWSESWGNKGYILMARNR 316
Query: 293 GGSGLCNIAANAAYPL 308
G LC IA A+YP+
Sbjct: 317 G--NLCGIANLASYPI 330
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 158 bits (399), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 106/322 (32%), Positives = 163/322 (50%), Gaps = 47/322 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + +TY+ E+ +RFKIF ++ L +N+F DL +F
Sbjct: 28 EAFKTTHKKTYQSHMEELLRFKIFTESSLIIARHNAKYAKGLVSYKLGMNQFGDLLAHEF 87
Query: 60 LASYTGYKPPPTDHPHSNR----SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+ G+ H R S + N + S ++DW ++GAVTPVKDQG
Sbjct: 88 ARIFNGH--------HGTRKTGGSTFLPPANVNDSSLPKAVDWRKKGAVTPVKDQGQCGS 139
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+A ++EG + ++ G+LV+ S+ LVDCS NGC +E+AF+YI+ +
Sbjct: 140 CWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGID 199
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDAT 230
+E YPY+ D C + + GY ++ +E+ L+ V+ P+SVAIDA+
Sbjct: 200 TEKSYPYEAV-DGECRFKKEDVG---ATDTGYVEIKAGSEDDLKKAVATVGPISVAIDAS 255
Query: 231 W--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GV+ P C + +HGV +VGYG +G + YWLVKN W +W + G +
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV----KGGKKYWLVKNSWAESWGDQGYI 311
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
+ R + C IA+ A+YPL
Sbjct: 312 LMSR--DNNNQCGIASQASYPL 331
>gi|414591548|tpg|DAA42119.1| TPA: hypothetical protein ZEAMMB73_388689, partial [Zea mays]
Length = 229
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 89/198 (44%), Positives = 120/198 (60%), Gaps = 11/198 (5%)
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLAS 172
CWAF+A+A VEG+NKI TG+LV+ S+ +LVDC ++ GC ++ AF+YI++ + +
Sbjct: 15 CWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQYIQRNGGVTT 74
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW- 231
E YPY Q C+ ++ I GY+ V E+ LQ V+ QPV+VAI+A+
Sbjct: 75 ESNYPYLAEQ-RSCN--KAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVAIEASGQ 131
Query: 232 -FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
F FY GVFTG CG +HGV VGYGTT + YW VKN WG +W E G +R+ R
Sbjct: 132 DFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGDG---TKYWTVKNSWGEDWGERGYIRMQR 188
Query: 291 GVGGS-GLCNIAANAAYP 307
GV S GLC IA +YP
Sbjct: 189 GVPDSRGLCGIAMEPSYP 206
>gi|312985015|gb|ACX54787.2| cysteine protease [Arachis diogoi]
Length = 360
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 106/331 (32%), Positives = 164/331 (49%), Gaps = 45/331 (13%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLN------------KFAD 53
H N + +F ++Y Q E + RF +F+ N +L+ KF+D
Sbjct: 35 HHMLNAEHHFTTFKTKFGKSYATQEEHDYRFGVFRANLRRAKLHAKLDPSAEHGVTKFSD 94
Query: 54 LTREKFLASYTGYKP---PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ 110
LT E+F Y G KP P T +N L +S + ++ DW ++GAVTPVK+Q
Sbjct: 95 LTPEEFKRQYLGLKPLRLPST-------ANKAPILPTSDLP--ENFDWRDKGAVTPVKNQ 145
Query: 111 GSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----------NGCAKNFLEN 159
GS CWAF+ +EG + + TG+LV+ S+ QLVDC + GC + N
Sbjct: 146 GSCGSCWAFSTTGALEGAHYLSTGELVSLSEQQLVDCDHVCDPEEYGACDAGCNGGLMNN 205
Query: 160 AFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS 219
AF+YI Q + +E YPY GR D C + +S + + + V ++ ++V
Sbjct: 206 AFDYILQAGGVQTEKDYPYSGR-DETCKFDKSKVA---ATVANFSVVSLDEDQIAANLVK 261
Query: 220 RQPVSVAIDATWFNFYHGGVFTGP--CGNTPNHGVTIVGYGTTTEAE---GQQPYWLVKN 274
P++V I+A + Y GGV + P CG +HGV +VGYG A +P+W++KN
Sbjct: 262 HGPLAVGINAIFMQTYIGGV-SCPYICGKNLDHGVLLVGYGAAGYAPIRFKDKPFWIIKN 320
Query: 275 RWGTNWDEGGSMRIFRGVGGSGLCNIAANAA 305
WG +W E G +I RG G+ ++ ++
Sbjct: 321 SWGESWGEDGYYKICRGKNVCGVDSMVSSVV 351
>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
Length = 331
Score = 158 bits (399), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 112/333 (33%), Positives = 166/333 (49%), Gaps = 53/333 (15%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W + + YK++ E+ +R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++ + + P S W +N+ +++ DS+DW E+G VT
Sbjct: 79 HLGDMTSEEVMSLMSSLRVP---------SQWQRNITYKSNANQILPDSVDWREKGCVTE 129
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCST GC F+ AF
Sbjct: 130 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAF 189
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDW---WRSSASGKYGAIRGYQYVQPATEEGLQDVV 218
+YI + S+ YPY+ D C + +R++ KY + E+ L++VV
Sbjct: 190 QYIIDNNGIDSDASYPYKA-TDQKCQYDSKYRAATCSKYTEL------PYGREDVLKEVV 242
Query: 219 SRQ-PVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274
+ + PVSV +DA+ F Y GV+ P C NHGV +VGYG E YWLVKN
Sbjct: 243 ANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVLNGKE----YWLVKN 298
Query: 275 RWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
WG N+ E G +R+ R G C IA+ +YP
Sbjct: 299 SWGRNFGEEGYIRMARNKGNH--CGIASFPSYP 329
>gi|312306194|gb|ADQ73946.1| cathepsin L [Paralithodes camtschaticus]
Length = 324
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 107/317 (33%), Positives = 160/317 (50%), Gaps = 42/317 (13%)
Query: 17 QWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFL 60
Q+ V++ R Y E+ R ++ +N EF L +N+F D+T E+
Sbjct: 24 QFKVQYGRQYATAQEERYRSSVYDQNMEFIEAHNEQYTNGEVTYMLAINQFGDMTNEEIN 83
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
A G P ++ S L + +DW +GAVTPVKDQ + CWAF+
Sbjct: 84 AVMNGLLP-------ASESRGVAVLGGRDDTLPAEVDWRTKGAVTPVKDQKACGSCWAFS 136
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
A ++EG + ++ G+LV+ S+ LVDCST +GC ++ AF YI+ + +E Y
Sbjct: 137 ATGSLEGQHFLKDGKLVSLSEQNLVDCSTKQGDHGCGGGLMDFAFTYIKDNGGIDTEASY 196
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDA--TWFN 233
PY+ D C + +++ + GY V+ +E+ LQ V+ P+SVAIDA + F+
Sbjct: 197 PYEAT-DGKCQYNPANSG---ATVTGYVDVEHDSEDALQKAVATIGPISVAIDASRSTFH 252
Query: 234 FYHGGV-FTGPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
FYH GV + C +T +HGV VGYGT + YWLVKN W W G + + R
Sbjct: 253 FYHKGVYYDKECSSTSLDHGVLAVGYGTQDGTD----YWLVKNSWNITWGNHGFIEMSRN 308
Query: 292 VGGSGLCNIAANAAYPL 308
+ C IA A+YPL
Sbjct: 309 RNNN--CGIATQASYPL 323
>gi|410968296|ref|XP_003990643.1| PREDICTED: cathepsin K [Felis catus]
Length = 330
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 109/315 (34%), Positives = 165/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E W + + Y ++ ++ R I++KN H + L +N D+T E+
Sbjct: 28 ELWKKTYGKQYNNKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEV 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ SN + + + S DSID+ ++G VTPVK+QG CWAF
Sbjct: 88 VQKMTGLKVPPS-RSRSNDTLYIPDWESRAP---DSIDYRKKGYVTPVKNQGQCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 144 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 203
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R P+SVAIDA T F F
Sbjct: 204 YVG-QDESCMY---NPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVAIDASLTSFQF 259
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + +W++KN WG NW G + + R
Sbjct: 260 YSKGVYYDENCNSDNLNHAVLAVGYGI----QKGNKHWIIKNSWGENWGNKGYILMARNK 315
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 316 NNA--CGIANLASFP 328
>gi|332376957|gb|AEE63618.1| unknown [Dendroctonus ponderosae]
Length = 318
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 110/322 (34%), Positives = 161/322 (50%), Gaps = 47/322 (14%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN------HEFL----------RLNKFAD 53
N+ + + + ++ +++Y +Q E+ R IF +N H L +N+F D
Sbjct: 20 NVGSTFQSFKLKHSKSYSNQVEEAKRLAIFTENLRDIEEHNALYAAGLVSYNKSVNQFTD 79
Query: 54 LTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
LT ++F A T + P + R+ + ++DW +G VT VKDQG
Sbjct: 80 LTIDEFKAYLTLHSKPTLNTVPYVRTG---------LQVPTTLDWRSQGYVTGVKDQGD- 129
Query: 114 C--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCST--LNGCAKNFLENAFEYIRQYQR 169
C CWAF+ V + EG TG+LV+ S+ QL+DC+T +GC +LE F Y++Q
Sbjct: 130 CGSCWAFSVVGSTEGAYYKSTGKLVSLSEQQLIDCTTNVNDGCDGGYLEETFPYVQQ-TG 188
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVV-SRQPVSVAID 228
L SE YPY GR D C S K +YV E L + V S PVSVA+D
Sbjct: 189 LVSESSYPYTGR-DGNCRISESDVVTKVS-----KYVLLGGEADLLEAVGSVGPVSVAMD 242
Query: 229 ATWFNFYHGGVFTGPCGN--TPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
AT+ Y GV+ + + NHGV +VGYGT + + YWL+KN WG W E G +
Sbjct: 243 ATYIYSYASGVYESSLCSLYSLNHGVLVVGYGT----QDGKDYWLIKNSWGNTWGEQGYL 298
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
++ R G+ C IA + YP+
Sbjct: 299 KLLR---GTNECGIAEDDVYPI 317
>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
Length = 331
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 112/333 (33%), Positives = 166/333 (49%), Gaps = 53/333 (15%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W + + YK++ E+ +R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++ + + P S W +N+ +++ DS+DW E+G VT
Sbjct: 79 HLGDMTSEEVMSLMSSLRVP---------SQWQRNITYKSNANQILPDSVDWREKGCVTE 129
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCST GC F+ AF
Sbjct: 130 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTRAF 189
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDW---WRSSASGKYGAIRGYQYVQPATEEGLQDVV 218
+YI + S+ YPY+ D C + +R++ KY + E+ L++VV
Sbjct: 190 QYIIDNNGIDSDASYPYKA-TDQKCQYDSKYRAATCSKYTEL------PYGREDVLKEVV 242
Query: 219 SRQ-PVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274
+ + PVSV +DA+ F Y GV+ P C NHGV +VGYG E YWLVKN
Sbjct: 243 ANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVLNGKE----YWLVKN 298
Query: 275 RWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
WG N+ E G +R+ R G C IA+ +YP
Sbjct: 299 SWGRNFGEEGYIRMARNKGNH--CGIASFPSYP 329
>gi|118412468|gb|ABK81670.1| fastuosain precursor [Bromelia fastuosa]
Length = 220
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 89/216 (41%), Positives = 127/216 (58%), Gaps = 10/216 (4%)
Query: 95 SIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGCA 153
SIDW + GAVT VK+QGS CWAF+A+ATVEG+ KI+ G L++ S+ +++DC+ GC
Sbjct: 8 SIDWRDYGAVTSVKNQGSCGSCWAFSAIATVEGIYKIKAGNLISLSEQEVLDCALSYGCD 67
Query: 154 KNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG 213
++ A+++I + S PY+G + C+ + K I GY YVQ E
Sbjct: 68 GGWVNKAYDFIISNNGVTSFANLPYKGYKG-PCN--HNDLPNK-AYITGYTYVQSNNERS 123
Query: 214 LQDVVSRQPVSVAIDATW-FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLV 272
+ V+ QP++ IDA F +Y GVFTG CG + NH +T++GYG T+ YW+V
Sbjct: 124 MMIAVANQPIAALIDAGGDFQYYKSGVFTGSCGTSLNHAITVIGYGQTSSGT---KYWIV 180
Query: 273 KNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
KN WGT+W E G +R+ R V GLC IA +P
Sbjct: 181 KNSWGTSWGERGYIRMARDVSSPYGLCGIAMAPLFP 216
>gi|401416322|ref|XP_003872656.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488880|emb|CBZ24130.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 366
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 150/300 (50%), Gaps = 27/300 (9%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKF 59
AA E++ + R Y+ AE++ R F++N E +R + KF DL+ +F
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEF 94
Query: 60 LASY---TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
A Y Y H + ++ + + D++DW E+GAVTPVKDQG+ C
Sbjct: 95 AARYLNGAAYFAAAKRHA----AQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 150
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ--RLAS 172
WAF+AV +EG + +LV+ S+ QLV C +N GC+ + AF+++ Q L +
Sbjct: 151 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCSGGLMLQAFDWLLQNTNGHLYT 210
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY Y + SS I G+ + + + + P+++A+DA+ F
Sbjct: 211 EDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSF 270
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV T G NHGV +VGY T G+ PYW++KN WG +W E G +R+ GV
Sbjct: 271 MSYKSGVLTACIGKQLNHGVLLVGYDMT----GEVPYWVIKNSWGGDWGEQGYVRVVMGV 326
>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
Length = 328
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 95/222 (42%), Positives = 127/222 (57%), Gaps = 14/222 (6%)
Query: 94 DSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--N 150
+S+DW + GAV VKDQ S CWAF+A+A VEG+NKI TG L++ S+ +LVDC T
Sbjct: 26 ESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSYNE 85
Query: 151 GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPAT 210
GC ++ AFE+I + SE YPY+ D CD R +A K I Y+ V
Sbjct: 86 GCNGGLMDYAFEFIISNGGIDSEDDYPYKA-VDGRCDQNRKNA--KVVTIDDYEDVPAYD 142
Query: 211 EEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQP 268
E LQ V+ QP++VA++ F Y GV TG CG +HGV VGYGT E +
Sbjct: 143 ELALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGTALDHGVAAVGYGT----ENGKD 198
Query: 269 YWLVKNRWGTNWDEGGSMRIFRGVGGS--GLCNIAANAAYPL 308
YW+V+N WG +W E G +R+ R + S G C IA +YP+
Sbjct: 199 YWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 240
>gi|401430387|ref|XP_003886572.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491640|emb|CBZ40951.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 332
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 150/300 (50%), Gaps = 27/300 (9%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKF 59
AA E++ + R Y+ AE++ R F++N E +R + KF DL+ +F
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEF 94
Query: 60 LASY---TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
A Y Y H + ++ + + D++DW E+GAVTPVKDQG+ C
Sbjct: 95 AARYLNGAAYFAAAKRHAAQH----YRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 150
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ--RLAS 172
WAF+AV +EG + +LV+ S+ QLV C +N GC+ + AF+++ Q L +
Sbjct: 151 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCSGGLMLQAFDWLLQNTNGHLHT 210
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY Y + SS I G+ + + + + P+++A+DA+ F
Sbjct: 211 EDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSF 270
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV T G NHGV +VGY T G+ PYW++KN WG +W E G +R+ GV
Sbjct: 271 MSYKSGVLTACIGKQLNHGVLLVGYDMT----GEVPYWVIKNSWGGDWGEQGYVRVVMGV 326
>gi|261289789|ref|XP_002611756.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
gi|229297128|gb|EEN67766.1| hypothetical protein BRAFLDRAFT_236363 [Branchiostoma floridae]
Length = 308
Score = 157 bits (398), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 107/313 (34%), Positives = 159/313 (50%), Gaps = 38/313 (12%)
Query: 22 FARTYKDQAEKEMRFKIFKKNHE----------------FLRLNKFADLTREKFLASYTG 65
+ Y +E+ R IF++N + F+++NKF DLT E+F G
Sbjct: 7 IGKQYNSLSEENARHSIFEENSKIVKQHNEEAAMGKHTFFMKMNKFGDLTTEEFRMIVIG 66
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVAT 123
++ F++L K+ D++DW ++GAVT VK+Q C CWAF+A +
Sbjct: 67 SGFMQSNKTQQAEGGVFESLPGLKVD--DTVDWRQKGAVTKVKNQ-EQCGSCWAFSATGS 123
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASECVYPYQG 180
+EG + ++T LV+ S+ LVDCS GC ++ AF+YI+ + +E Y Y+G
Sbjct: 124 LEGQHFLKTNNLVSLSEQNLVDCSRREGNKGCKGGSMDQAFKYIKMNGGIDTEECYSYRG 183
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDA--TWFNFYHG 237
R + C ++SS SG + Y ++ E L VS P+SVAIDA F YH
Sbjct: 184 RDESMCR-YKSSCSG--ATLSSYTDIKTGDEMALMQAVSTVGPISVAIDAGHKSFQLYHH 240
Query: 238 GVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS 295
GV+ P C +T +HGV VGYG++ ++ YWLVKN WGT W G + + R
Sbjct: 241 GVYDEPKCSSTHLDHGVLAVGYGSSNGSD----YWLVKNSWGTEWGMEGYIMMSRNKHNQ 296
Query: 296 GLCNIAANAAYPL 308
C IA A YP+
Sbjct: 297 --CGIATRAIYPV 307
>gi|431896622|gb|ELK06034.1| Cathepsin K [Pteropus alecto]
Length = 330
Score = 157 bits (398), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 165/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E W + + Y + ++ R I++KN H + L +N D+T E+
Sbjct: 28 ELWKKSYGKQYDSKVDETSRRLIWEKNLKHISIHNLEAALGVHTYELAMNHLGDMTSEEV 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ +RSN + + DS+D+ ++G VTPVK+QG CWAF
Sbjct: 88 VQKMTGLKVPPS----RSRSNDTLYIPDWEGRAPDSVDYRKKGYVTPVKNQGQCGSCWAF 143
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 144 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 203
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R P+SVAIDA T F F
Sbjct: 204 YVG-QDESCMY---NPTGKAAKCRGYKEIPEGNEKALKRAVARVGPISVAIDASLTSFQF 259
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + + +W++KN WG NW G + + R
Sbjct: 260 YRKGVYYDENCNSDNLNHAVLAVGYGI----QKGRKHWIIKNSWGENWGNKGYVLMARNK 315
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 316 NNA--CGIANLASFP 328
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 162/322 (50%), Gaps = 47/322 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + +TY+ E+ +RFKIF +N L +N+F DL +F
Sbjct: 28 EAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87
Query: 60 LASYTGYKPPPTDHPHSNR----SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+ G+ H R S++ N + S +DW ++GAVTPVKDQG
Sbjct: 88 ARIFNGH--------HGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGS 139
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+A ++EG + ++ G+LV+ S+ LVDCS NGC +E+AF+YI+ +
Sbjct: 140 CWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGID 199
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDAT 230
+E YPY+ D C + + GY ++ +E L+ V+ P+SVAIDA+
Sbjct: 200 TEKSYPYEAV-DGECRFKKEDVG---ATDTGYVEIKAGSEVDLKKAVATVGPISVAIDAS 255
Query: 231 W--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GV+ P C + +HGV +VGYG +G + YWLVKN W +W + G +
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV----KGGKKYWLVKNSWAESWGDQGYI 311
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
+ R + C IA+ A+YPL
Sbjct: 312 LMSR--DNNNQCGIASQASYPL 331
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 162/322 (50%), Gaps = 47/322 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + +TY+ E+ +RFKIF +N L +N+F DL +F
Sbjct: 28 EAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87
Query: 60 LASYTGYKPPPTDHPHSNR----SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+ G+ H R S++ N + S +DW ++GAVTPVKDQG
Sbjct: 88 ARIFNGH--------HGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGS 139
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+A ++EG + ++ G+LV+ S+ LVDCS NGC +E+AF+YI+ +
Sbjct: 140 CWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGID 199
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDAT 230
+E YPY+ D C + + GY ++ +E L+ V+ P+SVAIDA+
Sbjct: 200 TEKSYPYEAV-DGECRFKKEDVG---ATDTGYVEIKAGSEVDLKKAVATVGPISVAIDAS 255
Query: 231 W--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GV+ P C + +HGV +VGYG +G + YWLVKN W +W + G +
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV----KGGKKYWLVKNSWAESWGDQGYI 311
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
+ R + C IA+ A+YPL
Sbjct: 312 LMSR--DNNNQCGIASQASYPL 331
>gi|113208365|dbj|BAF03553.1| cysteine proteinase CP2 [Phaseolus vulgaris]
Length = 365
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 104/302 (34%), Positives = 152/302 (50%), Gaps = 39/302 (12%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRLN------------KFADLTREKFLASYTG 65
+ +F +TY + E + RF +FK N RL+ KF+DLT +F + G
Sbjct: 54 FKAKFGKTYATKEEHDHRFGVFKSNMRRARLHAQLDPSAVHGVTKFSDLTPAEFHRKFLG 113
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
KP H+ ++ N K DW ++GAVT VKDQGS CW+F+ +
Sbjct: 114 LKPLRLP-AHAQKAPILPTNNLPK-----DFDWRDKGAVTNVKDQGSCGSCWSFSTTGAL 167
Query: 125 EGLNKIRTGQLVTRSKHQLVDC----------STLNGCAKNFLENAFEYIRQYQRLASEC 174
EG + + TG+LV+ S+ QLVDC S +GC + NAFEY+ + E
Sbjct: 168 EGAHFLATGELVSLSEQQLVDCDHVCDPEEYGSCDSGCNGGLMNNAFEYLIGSGGVQREK 227
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNF 234
YPY GR D C + +S + ++ Y + E+ ++V P++VAI+A +
Sbjct: 228 DYPYTGR-DGTCKFDKSKIAA---SVSNYSVISLDEEQIAANLVKNGPLAVAINAVYMQT 283
Query: 235 YHGGVFTGP--CGNTPNHGVTIVGYGTTTEAE---GQQPYWLVKNRWGTNWDEGGSMRIF 289
Y GGV + P CG +HGV +VGYG A ++PYW++KN WG NW E G +I
Sbjct: 284 YVGGV-SCPYICGKHLDHGVLLVGYGEGAYAPIRFKEKPYWIIKNSWGENWGENGYYKIC 342
Query: 290 RG 291
RG
Sbjct: 343 RG 344
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 157 bits (397), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 107/322 (33%), Positives = 162/322 (50%), Gaps = 47/322 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + +TY+ E+ +RFKIF +N L +N+F DL +F
Sbjct: 28 EAFKTTHKKTYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87
Query: 60 LASYTGYKPPPTDHPHSNR----SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+ G+ H R S++ N + S +DW ++GAVTPVKDQG
Sbjct: 88 ARIFNGH--------HGTRKTGGSSFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGS 139
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+A ++EG + ++ G+LV+ S+ LVDCS NGC +E+AF+YI+ +
Sbjct: 140 CWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGID 199
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDAT 230
+E YPY+ D C + + GY ++ +E L+ V+ P+SVAIDA+
Sbjct: 200 TEKSYPYKAV-DGECRFKKEDVG---ATDTGYVEIKAGSEVDLKKAVATVGPISVAIDAS 255
Query: 231 W--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GV+ P C + +HGV +VGYG +G + YWLVKN W +W + G +
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV----KGGKKYWLVKNSWAESWGDQGYI 311
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
+ R + C IA+ A+YPL
Sbjct: 312 LMSR--DNNNQCGIASQASYPL 331
>gi|348586441|ref|XP_003478977.1| PREDICTED: cathepsin K-like [Cavia porcellus]
Length = 329
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 106/315 (33%), Positives = 164/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E W + + Y + ++ R I++KN ++ L +N D+T E+
Sbjct: 27 ELWKKTYRKQYNGKVDEISRRIIWEKNLKYISIHNLEASLGVHTYELSMNHLGDMTSEEV 86
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ H HSN + + + DS+D+ ++G VTPVK+QG CWAF
Sbjct: 87 VQKMTGLKVPPS-HSHSNDTLYIPDWEGRAP---DSVDYRKKGYVTPVKNQGQCGSCWAF 142
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 143 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQENRGIDSEDAYP 202
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--FNF 234
Y G Q+ C + + +GK RGY+ + E+ L+ V+R PVSVAIDA+ F F
Sbjct: 203 YVG-QEESCMY---NPTGKAAKCRGYREIPVGNEKALKRAVARVGPVSVAIDASLSSFQF 258
Query: 235 YHGGVFTGPC--GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ G NH + VGYG + +W++KN WG NW G + + R
Sbjct: 259 YSKGVYYDESCNGEDLNHALLAVGYGM----QRGNKHWILKNSWGENWGNKGYVLLARNK 314
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 315 NNA--CGIANLASFP 327
>gi|125606653|gb|EAZ45689.1| hypothetical protein OsJ_30362 [Oryza sativa Japonica Group]
Length = 359
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 118/314 (37%), Positives = 155/314 (49%), Gaps = 29/314 (9%)
Query: 15 HEQWMVEFARTYKDQAEKEMRFKIFKKN----HEF---------LRLNKFADLTREKFLA 61
+++W T +D AEK+ RF+ FK N +EF L LN+FAD+T ++F+A
Sbjct: 30 YQRWSRVHGLTSRDLAEKQGRFEAFKANARHVNEFNKKEGMTYKLALNRFADMTLQEFVA 89
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQ-GSYCCWAFTA 120
Y G K ++ + + S DW E GAVT VKDQ G CWAF+A
Sbjct: 90 KYAGAKVDAAAAALASVAE-VEEEELVVGDVPASWDWREHGAVTAVKDQDGCGSCWAFSA 148
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLEN--AFEYIRQYQRLASECV 175
V VE +N I TG L+T S+ Q++DCS NG N + + A E +
Sbjct: 149 VGAVESINAIATGNLLTLSEQQVLDCSGDGDCNGGWPNLVLSGYAVEQGIALDNIGDPAY 208
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA-TWFNF 234
YP + C R+ A G V ++E L+ V QPVSV I+A T F
Sbjct: 209 YPPYVAKKMAC---RTVAGKPVVKTDGTLQV-ASSETALKQSVYGQPVSVLIEADTNFQL 264
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GV++GPCG NH V VGYG T YW+VKN W T W E G +R+ R VGG
Sbjct: 265 YKSGVYSGPCGTRINHAVLAVGYGVTLN---NTKYWIVKNSWNTTWGESGYIRMKRDVGG 321
Query: 295 S-GLCNIAANAAYP 307
+ GLC IA YP
Sbjct: 322 NKGLCGIAMYGIYP 335
>gi|194352766|emb|CAQ00111.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 384
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 108/349 (30%), Positives = 164/349 (46%), Gaps = 78/349 (22%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRL-------------NKFADLTREKFLASYT 64
WM R+Y EK RF++++ N EF+ F DLT ++F+A Y+
Sbjct: 55 WMAAHGRSYPTVEEKLRRFEVYRSNMEFIEAANRDSRMSYSLGETPFTDLTHDEFMAMYS 114
Query: 65 GYKPPPTDHPHSNRSNWFK-------------------------NLNSSKMSFYDSIDWN 99
+ + S W + NLN + + S+DW
Sbjct: 115 S---------NDDSSEWEEATVITTRAGPVHEGTAAVEEPPRRTNLNVTAV-LPPSVDWR 164
Query: 100 ERGAVTPVKDQGSYC--CWAFTAVATVEGLNKIRTG-QLVTRSKHQLVDCSTLN-GCAKN 155
+G VTP K+QG+ C CWAFT+VAT+E I TG S+ QLVDCSTL+ GC +
Sbjct: 165 AKGVVTPAKNQGATCFSCWAFTSVATMESAQAISTGGSPPVLSEQQLVDCSTLHHGCGRG 224
Query: 156 FLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQY---VQPATEE 212
++++AF+++ + +E YPY G+ + +GK A+R Y P E
Sbjct: 225 WMDDAFKWVIMNGGITTEAAYPYTGKAG-------NCQTGKPVAVRLRSYKKVTPPGNEA 277
Query: 213 GLQDVVSRQPVSVAIDAT--WFNFYHGGVFT-----------GPCGNTPNHGVTIVGYGT 259
GL++ V++QPV+V+ D + F Y GGV+ G C NH + +VGYGT
Sbjct: 278 GLKEAVAQQPVAVSFDYSDPCFQHYIGGVYNAGCSRSGVYIKGACKTAQNHAMALVGYGT 337
Query: 260 TTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
+ +G + YW+ KN W W + G + + R GLC +A YP+
Sbjct: 338 --KPDGTK-YWIGKNSWTAKWGDKGFIYLLRDSPPLGLCGLAKLPVYPI 383
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 109/337 (32%), Positives = 162/337 (48%), Gaps = 58/337 (17%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASY 63
E W+ F + Y D +E + RF IFK N +F L LN ADLT ++ Y
Sbjct: 182 ENWIDRFEKKY-DVSEFKKRFSIFKSNMDFVHSWNSKNSQTVLGLNHLADLTNLEYRQFY 240
Query: 64 TGYKPP-----PTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
G P +H SN + F + ++DW ++GAV+P+KDQG CW+
Sbjct: 241 LGTHKKAVLGTPGNHEVSNLQSVFGD--------SATVDWRQKGAVSPIKDQGQCGSCWS 292
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASEC 174
F+ +VEG ++I++G +V S+ LVDCST GC ++ AFEYI + +E
Sbjct: 293 FSTTGSVEGAHQIKSGNMVELSEQNLVDCSTSEGNMGCNGGLMDYAFEYIITNNGIDTES 352
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW-- 231
YPY C + ++++ I Y+ + +E L D V PVSVAIDA+
Sbjct: 353 SYPYTASSGTTCKYNKANSG---ATISSYKNITAGSESDLADAVKNAGPVSVAIDASHNS 409
Query: 232 FNFY-HGGVFTGPCGNTP-NHGVTIVGYGTTT------------------EAEGQQPYWL 271
F Y HG + C + +HGV +VGYG+ T + + + YW+
Sbjct: 410 FQLYSHGIYYDASCSSVNLDHGVLVVGYGSGTPDSDSRVHKGSQVRVKVPKTDDTKNYWI 469
Query: 272 VKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
VKN WGT+W + G I+ C IA+ A+YP+
Sbjct: 470 VKNSWGTSWGDKG--FIYMSKDRDNNCGIASCASYPI 504
>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
Length = 331
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 111/334 (33%), Positives = 165/334 (49%), Gaps = 55/334 (16%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W + + YK++ E+ +R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++ + + P S W +N+ ++ DS+DW E+G VT
Sbjct: 79 HLGDMTSEEVMSLMSSLRVP---------SQWQRNITYKSNPNQILPDSVDWREKGCVTE 129
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCST GC F+ AF
Sbjct: 130 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAF 189
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDW---WRSSASGKYGAIRGYQYVQPATEEGL--QD 216
+YI + + S+ YPY+ D C + +R++ KY + P + E + +
Sbjct: 190 QYIIDNKGIDSDASYPYKA-MDQKCQYDSKYRAATCSKYTEL-------PYSREDVLKEA 241
Query: 217 VVSRQPVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVK 273
V ++ PVSV +DA+ F Y GV+ P C NHGV +VGYG E YWLVK
Sbjct: 242 VANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKE----YWLVK 297
Query: 274 NRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
N WG N+ E G +R+ R G C IA+ +YP
Sbjct: 298 NSWGRNFGEEGYIRMARNKGNH--CGIASFPSYP 329
>gi|34809608|pdb|1KHP|A Chain A, Monoclinic Form Of Papain/zlfg-dam Covalent Complex
gi|34809610|pdb|1KHQ|A Chain A, Orthorhombic Form Of PapainZLFG-Dam Covalent Complex
gi|157833552|pdb|1PPN|A Chain A, Structure Of Monoclinic Papain At 1.60 Angstroms
Resolution
gi|222143126|pdb|3E1Z|B Chain B, Crystal Structure Of The Parasite Protesase Inhibitor
Chagasin In Complex With Papain
Length = 212
Score = 157 bits (397), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 95/219 (43%), Positives = 126/219 (57%), Gaps = 19/219 (8%)
Query: 96 IDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCA 153
+DW ++GAVTPVK+QGS CWAF+AV T+EG+ KIRTG L S+ +L+DC + GC
Sbjct: 5 VDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNEYSEQELLDCDRRSYGCN 64
Query: 154 KNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEE 212
+ +A + + QY + YPY+G Q Y RS G Y A G + VQP E
Sbjct: 65 GGYPWSALQLVAQYG-IHYRNTYPYEGVQRY----CRSREKGPYAAKTDGVRQVQPYNEG 119
Query: 213 GLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYW 270
L ++ QPVSV ++A F Y GG+F GPCGN +H V VGYG Y
Sbjct: 120 ALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGPN--------YI 171
Query: 271 LVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
L+KN WGT W E G +RI RG G S G+C + ++ YP+
Sbjct: 172 LIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 210
>gi|18141287|gb|AAL60581.1|AF454959_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 368
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 100/299 (33%), Positives = 147/299 (49%), Gaps = 37/299 (12%)
Query: 21 EFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKFLASYTGYKP 68
+F + Y + E + RF +FK N R + +F+DLTR +F + G K
Sbjct: 57 KFGKVYASREEHDYRFSVFKSNLRRARRHQKLDPSARHGVTQFSDLTRSEFKRKHLGVKG 116
Query: 69 PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGL 127
+N++ N + DW ERGAVTPVK+QGS CW+F+A +EG
Sbjct: 117 GFKLPKDANKAPILPTEN-----LPEEFDWRERGAVTPVKNQGSCGSCWSFSATGALEGA 171
Query: 128 NKIRTGQLVTRSKHQLVDC----------STLNGCAKNFLENAFEYIRQYQRLASECVYP 177
N + TG+LV+ S+ QLVDC S +GC + +AFEY + L E YP
Sbjct: 172 NFLATGKLVSLSEQQLVDCDHECDPEEAGSCDSGCNGGLMNSAFEYTLKTGGLMREEDYP 231
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHG 237
Y G+ C +S ++ + + E+ ++V P++VAI+A + Y G
Sbjct: 232 YTGKDGATCKLDKSKI---VASVSNFSVISIDEEQIAANLVKNGPLAVAINAAYMQTYIG 288
Query: 238 GVFTGP--CGNTPNHGVTIVGYGTTTEAEG---QQPYWLVKNRWGTNWDEGGSMRIFRG 291
GV + P C NHGV +VGYG+ A ++PYW++KN WG W E G +I RG
Sbjct: 289 GV-SCPYICMRRLNHGVLLVGYGSAGYAPARFKEKPYWIIKNSWGETWGEDGFYKICRG 346
>gi|332220191|ref|XP_003259241.1| PREDICTED: cathepsin K [Nomascus leucogenys]
Length = 329
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 164/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E W + Y ++ ++ R I++KN ++ L +N D+T E+
Sbjct: 27 ELWKKTHRKQYNNKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEV 86
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ H SN + + + DS+D+ ++G VTPVK+QG CWAF
Sbjct: 87 VQKMTGLKVPPS-HSRSNDTLYIPDWEGRAP---DSVDYRKKGYVTPVKNQGQCGSCWAF 142
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 143 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENDGCGGGYMTNAFQYVQKNRGIDSEDAYP 202
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G Q+ C + + +GK RGY+ + E+ L+ V+R PVSVAIDA T F F
Sbjct: 203 YVG-QEESCMY---NPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQF 258
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + +W++KN WG NW G + + R
Sbjct: 259 YSKGVYYDESCNSDNLNHAVLAVGYGI----QKGNKHWIIKNSWGENWGNKGYILMARNK 314
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 315 NNA--CGIANLASFP 327
>gi|332026794|gb|EGI66903.1| Putative cysteine proteinase [Acromyrmex echinatior]
Length = 774
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 101/309 (32%), Positives = 151/309 (48%), Gaps = 30/309 (9%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLASYT 64
+M + RTY E+ +RFKIF++N F+ +N FAD+++++F Y
Sbjct: 473 FMTTYNRTYSS-LERNLRFKIFRENLNFIEELRETEQGTGIYGVNMFADMSQKEFRTRYL 531
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
G +P S + S DW ++G VTPVK+QG CWAF+
Sbjct: 532 GLRP----DLQSENEIPLPKAEIPDIDLPSSFDWRQKGVVTPVKNQGQCGSCWAFSVTGN 587
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQGRQ 182
VEG I+ GQL++ S+ +LVDC L+ GC +NA+ I Q L E YPY+
Sbjct: 588 VEGQYAIKHGQLLSLSEQELVDCDHLDEGCNGGLPDNAYRAIEQLGGLELESDYPYEAEN 647
Query: 183 DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHGGV--- 239
+ C + ++ + + + + Q +V P+++ I+A FY GGV
Sbjct: 648 EK-CHFKQNLVKVELASAVN---ITSNETQIAQWLVQNGPIAIGINANAMQFYMGGVSHP 703
Query: 240 FTGPCG-NTPNHGVTIVGYGTTTEA--EGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSG 296
C N NHGV IVGYGT+ PYW++KN WG +W E G R++RG G G
Sbjct: 704 LKILCNPNNLNHGVLIVGYGTSRYPLFHKNLPYWIIKNSWGKSWGEQGYYRVYRGDGTCG 763
Query: 297 LCNIAANAA 305
L +A++A
Sbjct: 764 LNTMASSAV 772
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 108/323 (33%), Positives = 162/323 (50%), Gaps = 40/323 (12%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTR 56
E+W V + Y + E+ R KIF +N H+ L +NK+ D+
Sbjct: 26 EEWNTFKVTHRKAYDSKIEESFRMKIFMENWHKIALHNQKYELNEVSYKLGMNKYGDMLH 85
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC- 114
+F+ + G+ + + R + + + + S+DW GAVTP+KDQG +C
Sbjct: 86 HEFINTLNGFNKSVSAQLRAQRRPIGSRFIEPANVEIPSSVDWRTHGAVTPIKDQG-HCG 144
Query: 115 -CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRL 170
CW+F+A +EG + TG+LV+ S+ L+DCS NGC ++ AF+YI+ L
Sbjct: 145 SCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGRYGNNGCNGGLMDQAFQYIKDNHGL 204
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDA 229
+E YPY+ D C R + GY + E+ L+ V+ PVSVAIDA
Sbjct: 205 DTEISYPYEAEND-KC---RYNPRNNGATDSGYVDIPEGNEKKLKAAVATIGPVSVAIDA 260
Query: 230 TW--FNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+ F FY GV+ P ++ N HGV +VGYGT + Q YWLVKN WG W + G
Sbjct: 261 SAESFQFYREGVYYEPRCSSENLDHGVLVVGYGTD---DNDQDYWLVKNSWGVTWGDEGY 317
Query: 286 MRIFRGVGGSGLCNIAANAAYPL 308
+++ R C IA++A+YPL
Sbjct: 318 IKMAR--NKDNHCGIASSASYPL 338
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 157 bits (396), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 159/310 (51%), Gaps = 37/310 (11%)
Query: 24 RTYKDQAEKEMRFKIFKKN------HEFL----------RLNKFADLTREKFLASYTGYK 67
+ Y Q E++ R KI+ +N H L +NKF DL +F + GY+
Sbjct: 40 KEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQ 99
Query: 68 PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEG 126
+ + + F + + + +S+DW E+GA+TPVKDQG CWAF++ +EG
Sbjct: 100 HKKQNSSRAEST--FTFMEPANVEVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEG 157
Query: 127 LNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
+TG+L++ S+ L+DCS GC ++ AF+YI+ + + +E YPY+ D
Sbjct: 158 QTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDD 217
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--FNFYHGGVF 240
C R + + RG+ + E+ L+ V+ PVSVAIDA+ F FY GV+
Sbjct: 218 -VC---RYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVY 273
Query: 241 TGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLC 298
P ++ +HGV +VGYG+ + + YWLVKN W +W + G ++I R C
Sbjct: 274 YEPSCDSDDLDHGVLVVGYGS----DNGKDYWLVKNSWSEHWGDEGYIKIARNRKNH--C 327
Query: 299 NIAANAAYPL 308
+A A+YPL
Sbjct: 328 GVATAASYPL 337
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 161/310 (51%), Gaps = 37/310 (11%)
Query: 24 RTYKDQAEKEMRFKIFKKN------HEFL----------RLNKFADLTREKFLASYTGYK 67
+ Y Q E++ R KI+ +N H L +NKF DL +F + GY+
Sbjct: 36 KEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQ 95
Query: 68 PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEG 126
+ + + F + + ++ +S+DW E+GA+TPVKDQG CWAF++ +EG
Sbjct: 96 HKKQNSSRAEST--FTFMEPANVTVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEG 153
Query: 127 LNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
+TG+LV+ S+ L+DCS GC ++ AF+YI+ + + +E YPY+ D
Sbjct: 154 QTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDD 213
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--FNFYHGGVF 240
C R + + RG+ + E+ L+ V+ PVSVAIDA+ F FY GV+
Sbjct: 214 -VC---RYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVY 269
Query: 241 TGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLC 298
P ++ +HGV +VGYG+ + + YWLVKN W +W + G +++ R C
Sbjct: 270 YEPSCDSDDLDHGVLVVGYGS----DNGKDYWLVKNSWSEHWGDEGYIKMARNRKNH--C 323
Query: 299 NIAANAAYPL 308
+A+ A+YPL
Sbjct: 324 GVASAASYPL 333
>gi|395856027|ref|XP_003800444.1| PREDICTED: cathepsin K [Otolemur garnettii]
Length = 329
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 107/315 (33%), Positives = 162/315 (51%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E W + Y + ++ R I++KN ++ L +N D+T E+
Sbjct: 27 ELWKKTHRKEYDSKVDEISRRLIWEKNLKYISIHNLEASLGVHTYELAMNHLGDMTSEEV 86
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ HSN + + + DSID+ ++G VTPVK+QG CWAF
Sbjct: 87 VQKMTGLKVPPS-RSHSNDTLYIPDWEGRAP---DSIDYRKKGYVTPVKNQGQCGSCWAF 142
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 143 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSDNDGCGGGYMTNAFQYVQKNRGIDSEDAYP 202
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R P+SV IDA T F F
Sbjct: 203 YVG-QDESCMY---NPTGKAAKCRGYREIPEGNEKALKRAVARVGPISVGIDASLTSFQF 258
Query: 235 YHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ N+ NH V VGYG + +W++KN WG NW G + + R
Sbjct: 259 YSKGVYYDESCNSDNVNHAVLAVGYGI----QKGNKHWIIKNSWGENWGNKGYILMARNK 314
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 315 NNA--CGIANLASFP 327
>gi|340370276|ref|XP_003383672.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 105/315 (33%), Positives = 168/315 (53%), Gaps = 39/315 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR--------------LNKFADLTREKFLA 61
+ W V++ + Y+ + + R I++ N +F+ +N+FADL +F
Sbjct: 25 QDWKVKYNKVYETKETELERQIIWESNKKFVENHNANSDKFGFTVAMNEFADLDAGEFGR 84
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
+ G P P+ + N +N +K S + D++DW E+GAVTP+K+QG CW+F++
Sbjct: 85 IFNGLLPRPSSY---NSTNIYK---PSGVKVPDTVDWKEKGAVTPIKNQGQCGSCWSFSS 138
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYP 177
++EG + I TG LV+ S+ QL+DCST +GC ++N+F Y++ +E YP
Sbjct: 139 TGSLEGQHFINTGTLVSLSEQQLMDCSTKYGNHGCNGGLMDNSFRYLKSVAGDETEDNYP 198
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW--FNF 234
Y ++ C R +S + Y + E+ L+D V+ P+SVAIDA+ F
Sbjct: 199 YTA-ENGVC---RYDSSLAVVTDKSYVDIPQGDEDSLKDAVANVGPISVAIDASHSSFQL 254
Query: 235 YHGGV-FTGPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y+ GV + C +T +HGV +GYGT E + YWLVKN WGT+W G +++ R
Sbjct: 255 YNSGVYYASTCSSTQLDHGVLAIGYGT----EDGKDYWLVKNSWGTSWGMEGYIKMSRNR 310
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A+YP
Sbjct: 311 NNN--CGIATQASYP 323
>gi|461905|sp|Q05094.1|CYSP2_LEIPI RecName: Full=Cysteine proteinase 2; AltName: Full=Amastigote
cysteine proteinase A-2; Flags: Precursor
gi|159298|gb|AAA29229.1| cysteine proteinase [Leishmania pifanoi]
Length = 444
Score = 157 bits (396), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 100/301 (33%), Positives = 151/301 (50%), Gaps = 28/301 (9%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKF 59
AA E++ + R Y+ AE++ R F++N E +R + KF DL+ +F
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEF 94
Query: 60 LASY---TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
A Y Y H + ++ + + D++DW E+GAVTPVKDQG+ C
Sbjct: 95 AARYLNGAAYFAAAKRHA----AQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 150
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ--RLAS 172
WAF+AV +EG + +LV+ S+ QLV C +N GC + AF+++ Q L +
Sbjct: 151 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHT 210
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEEGLQDVVSRQPVSVAIDATW 231
E YPY Y + SS GA I G+ + + + + P+++A+DA+
Sbjct: 211 EDSYPYVSGNGYVPECSNSSEELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASS 270
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y GV T G NHGV +VGY T G+ PYW++KN WG +W E G +R+ G
Sbjct: 271 FMSYKSGVLTACIGKQLNHGVLLVGYDMT----GEVPYWVIKNSWGGDWGEQGYVRVVMG 326
Query: 292 V 292
V
Sbjct: 327 V 327
>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
Length = 331
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 111/335 (33%), Positives = 166/335 (49%), Gaps = 48/335 (14%)
Query: 1 MSRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------- 45
M+R HK + + W +++ YK++ E+ R I++KN +F
Sbjct: 15 MARL-HKDPTLDNHWDLWKKTYSKQYKEKNEEVARRLIWEKNLKFVMLHNLEHSMGMHSY 73
Query: 46 -LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNER 101
L +N D+T E+ ++ + + P S W +N+ ++ DS+DW E+
Sbjct: 74 DLSMNHLGDMTSEEVMSLMSSLRVP---------SQWQRNVTFKSNPNQKLPDSLDWREK 124
Query: 102 GAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS----TLNGCAKNF 156
G VT VK QGS CWAF+AV +E K++TG+LV+ S LVDCS + GC F
Sbjct: 125 GCVTDVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGEKYSNKGCNGGF 184
Query: 157 LENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQD 216
+ AF+YI + SE YPY+ D C + + Y + +E+ L++
Sbjct: 185 MTRAFQYIIDNNGIDSEASYPYKA-TDGKCQY---DPKNRAATCSKYTELPYGSEDALKE 240
Query: 217 VVSRQ-PVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLV 272
V+ + PVSV IDA+ F Y GV+ P C + NHGV +VGYG + YWLV
Sbjct: 241 AVANKGPVSVGIDASRPSFFLYKSGVYYDPSCTDNVNHGVLVVGYGNLNGKD----YWLV 296
Query: 273 KNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
KN WG N+ E G +R+ R G C IA+ +YP
Sbjct: 297 KNSWGLNFGEQGYIRMARNSGNH--CGIASFPSYP 329
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 103/318 (32%), Positives = 165/318 (51%), Gaps = 41/318 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
+Q+ + + Y+ E R ++++N EF L +N+F D+T E+
Sbjct: 23 QQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTTEEI 82
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A+ G+ P R ++ L D++DW ++GAVTPVKDQ + CWAF
Sbjct: 83 NAAMNGFLSAGKKVP---RGTMYQPLVDE---LPDTVDWRDKGAVTPVKDQKACGSCWAF 136
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASECV 175
+A ++EG + + TG+LV+ S+ LVDCS GC ++NAF YI+ + +E
Sbjct: 137 SATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDTEES 196
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWF 232
YPY+ + C R ++ + Y +Q +E+ LQ V+ + PVSVAIDA + F
Sbjct: 197 YPYEAKNG-PC---RFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTF 252
Query: 233 NFYHGGV-FTGPCGNT-PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
+FY G+ + C ++ +HGV VGYGT ++ YWLVKN W W + G +++ R
Sbjct: 253 HFYSRGIYYDEKCSSSFLDHGVLAVGYGTDDSSD----YWLVKNSWNETWGDSGYIKMSR 308
Query: 291 GVGGSGLCNIAANAAYPL 308
+ C IA+ A+YP+
Sbjct: 309 NRNNN--CGIASQASYPV 324
>gi|359806140|ref|NP_001241450.1| uncharacterized protein LOC100778716 precursor [Glycine max]
gi|255639509|gb|ACU20049.1| unknown [Glycine max]
Length = 366
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 108/328 (32%), Positives = 159/328 (48%), Gaps = 38/328 (11%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN------HEFLR------LNKFAD 53
H N + +F +TY Q E + RF+IFK N H+ L + +F+D
Sbjct: 42 HHLLNAEHHFSAFKTKFGKTYATQEEHDHRFRIFKNNLLRAKSHQKLDPSAVHGVTRFSD 101
Query: 54 LTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY 113
LT +F + G KP P + N F DW E GAVT VK+QGS
Sbjct: 102 LTPAEFRRQFLGLKP--LRLPSDAQKAPILPTNDLPTDF----DWREHGAVTGVKNQGSC 155
Query: 114 -CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC----------STLNGCAKNFLENAFE 162
CW+F+AV +EG + + TG+LV+ S+ QLVDC + +GC + AFE
Sbjct: 156 GSCWSFSAVGALEGAHFLSTGELVSLSEQQLVDCDHECDPEERGACDSGCNGGLMTTAFE 215
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQP 222
Y Q L E YPY GR C + +S + ++ + V E+ ++V P
Sbjct: 216 YTLQAGGLMREKDYPYTGRDRGPCKFDKSKVA---ASVANFSVVSLDEEQIAANLVQNGP 272
Query: 223 VSVAIDATWFNFYHGGVFTGP--CGNTPNHGVTIVGYGTTTEAE---GQQPYWLVKNRWG 277
++V I+A + Y GGV + P CG +HGV +VGYG+ A ++PYW++KN WG
Sbjct: 273 LAVGINAVFMQTYIGGV-SCPYICGKHLDHGVLLVGYGSGAYAPIRFKEKPYWIIKNSWG 331
Query: 278 TNWDEGGSMRIFRGVGGSGLCNIAANAA 305
+W E G +I RG G+ ++ + A
Sbjct: 332 ESWGEEGYYKICRGRNVCGVDSMVSTVA 359
>gi|63101996|gb|AAH95694.1| Cathepsin S, b.1 [Danio rerio]
Length = 330
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 111/328 (33%), Positives = 161/328 (49%), Gaps = 39/328 (11%)
Query: 5 SHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRL 48
+H N+ E W + + Y + E+ R +++++N + L +
Sbjct: 17 AHFNTNLDQHWELWKKTYGKIYTTEVEEFGRRQLWERNLQLITVHNLEASMGMHSYDLSM 76
Query: 49 NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVK 108
N DLT E+ L + T P + + SS + DS+DW E+G V+ VK
Sbjct: 77 NHMGDLTTEEILQTLA-----LTHVPSGFKRQIANIVGSSGDAVPDSLDWREKGYVSSVK 131
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYI 164
QG+ CWAF++V +EG K TG+LV S LVDCS+ GC F+ +AF+Y+
Sbjct: 132 MQGACGSCWAFSSVGALEGQLKKTTGKLVDLSPQNLVDCSSKYGNKGCNGGFMSDAFQYV 191
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQPV 223
+AS+ YPY+G Q A+ Y +V+ E L Q V S P+
Sbjct: 192 IDNGGIASDSAYPYRGVQQQCSYSSSQRAAN----CTKYYFVRQGDENALKQAVASVGPI 247
Query: 224 SVAIDAT--WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
SVAIDAT F YH GV+ P C NH V +VGYGT + Q +WLVKN WGT +
Sbjct: 248 SVAIDATRPQFVLYHSGVYNDPTCSKRVNHAVLVVGYGTLS----GQDHWLVKNSWGTRF 303
Query: 281 DEGGSMRIFRGVGGSGLCNIAANAAYPL 308
+GG +R+ R + +C IA+ A YP+
Sbjct: 304 GDGGYIRMAR--NKNNMCGIASYACYPV 329
>gi|226821425|gb|ACO82388.1| cathepsin S [Lutjanus argentimaculatus]
Length = 337
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 114/325 (35%), Positives = 161/325 (49%), Gaps = 45/325 (13%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADL 54
+ A + W + Y+++ E+ R ++++KN H + L +N D+
Sbjct: 30 LDAHWDLWKKTHEKKYQNEVEEFSRRRLWEKNLMLITMHNLEASMGLHTYELGMNHMGDM 89
Query: 55 TREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY- 113
T E+ S+ PP TD + SS D++DW E+G VT VK QGS
Sbjct: 90 TPEEIWQSFATLTPP-TDIQRAPS----PFAGSSGADIPDTMDWREKGCVTSVKTQGSCG 144
Query: 114 CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRL 170
CWAF+AV +EG +TG+LV S LVDCST +GC F+++AF+Y+ Q +
Sbjct: 145 SCWAFSAVGALEGQLAKKTGKLVDLSPQNLVDCSTKYGNHGCNGGFMDHAFQYVIDNQGI 204
Query: 171 ASECVYPYQGRQD--YYCDWWRSSASGKYGAIRGYQYVQPATEEGL--QDVVSRQPVSVA 226
S+ YPY GR D +Y +R++ Y + P +EG Q + + P+SVA
Sbjct: 205 DSDASYPYTGRSDQCHYNPSYRAANCSSYNFL-------PEGDEGALKQALATIGPISVA 257
Query: 227 IDAT--WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEG 283
IDAT F FY GV+ P C NHGV VGYGT Q YWLVKN WGT + +
Sbjct: 258 IDATRPRFIFYRSGVYNDPSCSQEVNHGVLAVGYGTLN----GQDYWLVKNSWGTKFGDQ 313
Query: 284 GSMRIFRGVGGSGLCNIAANAAYPL 308
G +R+ R C IA YP+
Sbjct: 314 GYIRMARNQNDQ--CGIAMYGCYPI 336
>gi|348513249|ref|XP_003444155.1| PREDICTED: cathepsin K-like [Oreochromis niloticus]
Length = 330
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 112/319 (35%), Positives = 165/319 (51%), Gaps = 44/319 (13%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E+W + + Y +Q E + R +++KN H F L LN AD+T E+
Sbjct: 27 EEWKTKHGKVYDNQTEIDFRRAVWEKNVHLVLRHNQEASAGKHSFTLGLNHLADMTAEEI 86
Query: 60 LASYTGYKPPPTDHPHSNRSNW-FKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CW 116
G K T N +N F++++ S + ++DW + G V PV++QG C CW
Sbjct: 87 NEKLNGLKLEET----VNFTNGTFEDVSDSPLPV--NVDWRKEGLVGPVRNQG-LCGSCW 139
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLENAFEYIRQYQRLASE 173
AF+++ +EG K RTG LV+ S LVDCST + GC ++ A+ Y+ + + SE
Sbjct: 140 AFSSLGALEGQLKKRTGTLVSLSPQNLVDCSTQDGNLGCRGGYITKAYSYVIRNGGVDSE 199
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVV-SRQPVSVAIDATW- 231
YPY+ ++ C R S G+ G + + E+ LQ V+ S P+SVA++A
Sbjct: 200 SFYPYE-HKNGKC---RYSVQGRAGYCSKFSILPEGDEKMLQKVLASVGPISVAVNAMLE 255
Query: 232 -FNFYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
F+ Y GG++ P N NH V +VGYGT + Q YWLVKN WGT W EGG +R+
Sbjct: 256 SFHMYSGGLYNVPSCNPKLINHAVLLVGYGT----DAGQDYWLVKNSWGTAWGEGGYIRL 311
Query: 289 FRGVGGSGLCNIAANAAYP 307
R + LC IA+ YP
Sbjct: 312 AR--NKNNLCGIASFPVYP 328
>gi|9542|emb|CAA78443.1| cysteine proteinase [Leishmania mexicana]
Length = 443
Score = 156 bits (395), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 150/300 (50%), Gaps = 27/300 (9%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKF 59
AA E++ + R Y+ AE++ R F++N E +R + KF DL+ +F
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEF 94
Query: 60 LASY---TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
A Y Y H + ++ + + D++DW E+GAVTPVKDQG+ C
Sbjct: 95 AARYLNGAAYFAAAKRHA----AQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 150
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQ--RLAS 172
WAF+AV +EG + +LV+ S+ QLV C + NGC+ + AF+++ Q L +
Sbjct: 151 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMDNGCSGGLMLQAFDWLLQNTNGHLHT 210
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY Y + SS I G+ + + + + P+++A+DA+ F
Sbjct: 211 EDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSF 270
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV T G NHGV +VGY T G+ PYW++KN WG +W E G +R+ GV
Sbjct: 271 MSYKSGVLTACIGKQLNHGVLLVGYDMT----GEVPYWVIKNSWGGDWGEQGYVRVVMGV 326
>gi|15705865|gb|AAL05851.1|AF411121_1 cysteine proteinase precursor [Sandersonia aurantiaca]
Length = 360
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 157/321 (48%), Gaps = 37/321 (11%)
Query: 13 AKHEQWMVEFARTYKDQAEKEMRFKIFKKN------HEFLR------LNKFADLTREKFL 60
A ++ + ++Y D+AE RF +FK N H+ L + +FADLT +F
Sbjct: 43 AHFSSFLSRYGKSYADEAEHAYRFSVFKSNLRRARRHQRLDPTAVHGVTRFADLTPSEFR 102
Query: 61 ASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
+Y G + P ++ + N F DW + GAVTPVK+QGS CW+F+
Sbjct: 103 RTYLGLRRRPRTAGSTHDAPILPT-NELPADF----DWRDHGAVTPVKNQGSCGSCWSFS 157
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDC----------STLNGCAKNFLENAFEYIRQYQR 169
A +EG N + TG LV+ S+ QLVDC S GC + AFEYI +
Sbjct: 158 AAGALEGANYLSTGNLVSLSEQQLVDCDHECDSSEPDSCDQGCNGGLMTTAFEYILKSGG 217
Query: 170 LASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA 229
L E YPY G C + ++ S + V ++ ++V P++V I+A
Sbjct: 218 LEREADYPYTGTDRGTCKFNKAKISA---VASNFSVVSIDEDQIAANLVKHGPLAVGINA 274
Query: 230 TWFNFYHGGVFTGP--CGNTPNHGVTIVGYGTTTEAE---GQQPYWLVKNRWGTNWDEGG 284
+ Y GGV + P CG +HGV +VGYG+ A ++PYW++KN WG NW E G
Sbjct: 275 VFMQTYVGGV-SCPYICGKHLDHGVLLVGYGSAGFAPIRFKEKPYWIIKNSWGENWGENG 333
Query: 285 SMRIFRGVGGSGLCNIAANAA 305
+I RG G+ ++ ++ +
Sbjct: 334 YYKICRGRNVCGVDSMVSSVS 354
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 105/318 (33%), Positives = 161/318 (50%), Gaps = 38/318 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + ++Y+ E+ +RFKIF +N L +N+F DL +F
Sbjct: 28 EAFKATHKKSYQSNMEELLRFKIFSENSLLVARHNEKYARGLVSYKLGMNQFGDLLPHEF 87
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ GY+ T S + N + S S+DW E+GAVTPVK+QG CWAF
Sbjct: 88 ARMFNGYRGART---AGRGSTFLPPANVNYSSLPQSMDWREKGAVTPVKNQGQCGSCWAF 144
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+ ++EG + ++TG LV+ S+ LVDCS +GC ++NAF+YI+ + +E
Sbjct: 145 STTGSLEGQHFLKTGVLVSLSEQNLVDCSETFGNHGCEGGLMDNAFQYIKANGGIDTEKS 204
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--F 232
YPY+ +D C + + + G+ ++ +E+ L+ V+ PVSVAIDA+ F
Sbjct: 205 YPYEA-EDGECRFKKQNVG---ATDTGFVDIEQGSEDDLKKAVATVGPVSVAIDASHSSF 260
Query: 233 NFYHGGVF--TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
Y GV+ T +HGV +VGYG E + YWLVKN W +W + G +++ R
Sbjct: 261 QLYSEGVYDETECSSEQLDHGVLVVGYGV----EDGKKYWLVKNSWAESWGDNGYIKMSR 316
Query: 291 GVGGSGLCNIAANAAYPL 308
C IA+ A+YPL
Sbjct: 317 DKDNQ--CGIASAASYPL 332
>gi|130502110|ref|NP_001076110.1| cathepsin K precursor [Oryctolagus cuniculus]
gi|1168794|sp|P43236.1|CATK_RABIT RecName: Full=Cathepsin K; AltName: Full=Protein OC-2; Flags:
Precursor
gi|454187|dbj|BAA03125.1| OC-2 protein [Oryctolagus cuniculus]
Length = 329
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/315 (34%), Positives = 164/315 (52%), Gaps = 37/315 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E W +++ Y + ++ R I++KN H + L +N D+T E+
Sbjct: 27 ELWKKTYSKQYNSKVDEISRRLIWEKNLKHISIHNLEASLGVHTYELAMNHLGDMTSEEV 86
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ TG K PP+ HSN + + + DSID+ ++G VTPVK+QG CWAF
Sbjct: 87 VQKMTGLKVPPS-RSHSNDTLYIPDWEGRTP---DSIDYRKKGYVTPVKNQGQCGSCWAF 142
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
++V +EG K +TG+L+ S LVDC + N GC ++ NAF+Y+++ + + SE YP
Sbjct: 143 SSVGALEGQLKKKTGKLLNLSPQNLVDCVSENYGCGGGYMTNAFQYVQRNRGIDSEDAYP 202
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--TWFNF 234
Y G QD C + + +GK RGY+ + E+ L+ V+R PVSVAIDA T F F
Sbjct: 203 YVG-QDESCMY---NPTGKAAKCRGYREIPEGNEKALKRAVARVGPVSVAIDASLTSFQF 258
Query: 235 YHGGVF--TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ + NH V VGYG + +W++KN WG +W G + + R
Sbjct: 259 YSKGVYYDENCSSDNVNHAVLAVGYGI----QKGNKHWIIKNSWGESWGNKGYILMARNK 314
Query: 293 GGSGLCNIAANAAYP 307
+ C IA A++P
Sbjct: 315 NNA--CGIANLASFP 327
>gi|310975577|gb|ADP55137.1| cathepsin S [Miichthys miiuy]
Length = 338
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 114/319 (35%), Positives = 154/319 (48%), Gaps = 42/319 (13%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E W +TY++ E E R ++++KN H + L +N DLT E+
Sbjct: 35 ELWKKMHGKTYRNYVEDESRRELWEKNLVLITMHNLEASMGLHTYKLSMNHMGDLTPEEI 94
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+ S+ PP TD + +S + D++DW E+G VT VK QG+ CWAF
Sbjct: 95 MQSFATLTPP-TDIQRAPS----PFAGTSGAAVPDTMDWREKGCVTSVKMQGACGSCWAF 149
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+A +EG TG+LV S LVDCST +GC F+ AF+Y+ + S+
Sbjct: 150 SAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNHGCNGGFMHKAFQYVIDNHGIDSDAA 209
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQY-VQPATEEGL--QDVVSRQPVSVAIDA--T 230
YPY GRQ C + S K+ A QY P +EG Q + + P+SVAIDA
Sbjct: 210 YPYTGRQSQECHY-----SPKFRAANCSQYSFLPEGDEGALKQALATIGPISVAIDARRP 264
Query: 231 WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F FY GV+ P C NHGV VGYGT Q YWLVKN WG + + G +R+
Sbjct: 265 RFAFYSSGVYDDPSCSQDVNHGVLAVGYGTLN----GQDYWLVKNSWGQTFGDNGYIRMA 320
Query: 290 RGVGGSGLCNIAANAAYPL 308
R C IA YP+
Sbjct: 321 RNKNDQ--CGIARYGCYPI 337
>gi|116666824|pdb|2BDZ|A Chain A, Mexicain From Jacaratia Mexicana
gi|116666825|pdb|2BDZ|B Chain B, Mexicain From Jacaratia Mexicana
gi|116666826|pdb|2BDZ|C Chain C, Mexicain From Jacaratia Mexicana
gi|116666827|pdb|2BDZ|D Chain D, Mexicain From Jacaratia Mexicana
Length = 214
Score = 156 bits (395), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 93/225 (41%), Positives = 128/225 (56%), Gaps = 23/225 (10%)
Query: 92 FYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN 150
+ +SIDW E+GAVTPVK+Q CWAF+ VAT+EG+NKI TGQL++ S+ +L+DC +
Sbjct: 1 YPESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCERRS 60
Query: 151 -GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA---IRGYQYV 206
GC + + +Y+ + +E YPY+ +Q R A K G I GY+YV
Sbjct: 61 HGCDGGYQTTSLQYVVD-NGVHTEREYPYEKKQG------RCRAKDKKGPKVYITGYKYV 113
Query: 207 QPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAE 264
E L ++ QPVSV D+ F FY GG++ GPCG +H VT VGYG T
Sbjct: 114 PANDEISLIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGKT---- 169
Query: 265 GQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
Y L+KN WG NW E G +RI R G S G C + ++ +P+
Sbjct: 170 ----YLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFFPI 210
>gi|121531600|gb|ABM55485.1| digestive cysteine protease intestain [Leptinotarsa decemlineata]
Length = 326
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 113/324 (34%), Positives = 162/324 (50%), Gaps = 42/324 (12%)
Query: 12 AAKHEQWMV---EFARTYKDQAEKEMRFKIFKKN------HE----------FLRLNKFA 52
+ +QW+ +TYK+ E++ RF IF++N H L + +FA
Sbjct: 17 STNEDQWIAFKQTHGKTYKNLLEEKTRFGIFQRNLIKIKEHNARYDKGEETYLLGVTRFA 76
Query: 53 DLTREKFLASYTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQG 111
DLT E+F G K P R N + + DSIDW E+GAV VKDQ
Sbjct: 77 DLTHEEFKDILKGQIKNKP-------RLNATPTVFPEDLEVPDSIDWTEKGAVLEVKDQN 129
Query: 112 SY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL--NGCAKN--FLENAFEYIRQ 166
CWAF+A +EG N I ++ S+ QL+DCS NG K + AFEY+R
Sbjct: 130 PCGSCWAFSATGALEGQNAILNNVKISLSEQQLLDCSAAYGNGNCKEGGDMSAAFEYVRD 189
Query: 167 YQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVV-SRQPVSV 225
Y + SE YPY R+ C + AS I+GY+ V +EEGL+ V + P+S+
Sbjct: 190 YG-IQSEKSYPYI-RKQTECQY---DASKTILKIKGYKNV-TTSEEGLRKAVGAIGPISI 243
Query: 226 AIDATWFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
A+++ Y+ G+ +G C + +HGV +VGYG ++ G+ +W VKN WG W E G
Sbjct: 244 AMNSDPLQLYYSGIISGKGCSHDLDHGVLVVGYGKASQWSGETKFWRVKNSWGKIWGENG 303
Query: 285 SMRIFRGVGGSGLCNIAANAAYPL 308
RI R + LC IA + YP+
Sbjct: 304 YFRIKR--DANNLCGIADDPTYPV 325
>gi|312100382|gb|ADQ27799.1| mitogenic proteinase [Vasconcellea cundinamarcensis]
Length = 214
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 89/222 (40%), Positives = 130/222 (58%), Gaps = 17/222 (7%)
Query: 92 FYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN 150
+ +SIDW ++GAVTPVKDQ CWAF+ VATVEG+NKI TG+L++ S+ +L+DC +
Sbjct: 1 YPESIDWRQKGAVTPVKDQNPCGSCWAFSTVATVEGINKIVTGKLISLSEQELLDCDRRS 60
Query: 151 -GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPA 209
GC + + +Y+ + +E YPY+ +Q + G I GY+ V P
Sbjct: 61 HGCNGGYQTTSLQYVVD-NGVHTEYEYPYEKKQG---NCRAKDKKGLKVQITGYKRVPPN 116
Query: 210 TEEGLQDVVSRQPVSVAIDAT--WFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQ 267
E L V++ QPVSV I++ F+FY GG++ GPCG +H VT +GYG +
Sbjct: 117 DEISLIKVIANQPVSVLIESKDRSFHFYRGGIYKGPCGTRLDHAVTAIGYG--------K 168
Query: 268 PYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
Y L+KN WG NW E G +RI R G S G+C + ++ +P+
Sbjct: 169 DYILIKNSWGPNWGEKGYIRIKRASGKSEGICGVYKSSYFPI 210
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 113/316 (35%), Positives = 154/316 (48%), Gaps = 41/316 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------------LRLNKFADLTREKFLA 61
E W E + Y D E+ R+KI++ N + L +NKF DL +F
Sbjct: 23 EDWKNEHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLESHEFAE 82
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD-SIDWNERGAVTPVKDQGSY-CCWAFT 119
+ GY RSN K + D ++DW +GAVT VK+QG CWAF+
Sbjct: 83 MFNGYMMQA-------RSNSTKVFVADPNYKADPTVDWRTKGAVTGVKNQGQCGSCWAFS 135
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
++EG + ++TG+LV+ S+ LVDCS GC ++ AFEYI++ + +E Y
Sbjct: 136 TTGSLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYIKKNGGIDTEASY 195
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW--FN 233
PYQ D C R AS GY ++ E L V + PVSVAIDA+ F
Sbjct: 196 PYQA-HDERC---RFKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSFQ 251
Query: 234 FYHGGV-FTGPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
Y GV + C T +HGV +GYGT EG YWLVKN WGT+W G + + R
Sbjct: 252 LYRSGVYYERECSQTALDHGVLAIGYGT----EGGSDYWLVKNSWGTDWGMEGYIMMSRN 307
Query: 292 VGGSGLCNIAANAAYP 307
+ C IA A+YP
Sbjct: 308 RNNN--CGIATEASYP 321
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 106/314 (33%), Positives = 162/314 (51%), Gaps = 28/314 (8%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF--------LRLNKFADLTREKFLASY 63
A +++ E Y+ + E R KI + N ++ L +N+F DL +F+++
Sbjct: 55 ALHGKEYHSETEEYYRLKIYMENRLKIARHNEKYANNKASYKLAMNEFGDLLHHEFVSTR 114
Query: 64 TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVA 122
G+K P S + + ++DW ++GAVTPVK+QG CWAF+
Sbjct: 115 NGFKRNYRSTPREG-SFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTG 173
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
++EG + +TG++V+ S+ LVDCS NGC ++NAF+YI+ + +E YPY
Sbjct: 174 SLEGQHFRKTGRMVSLSEQNLVDCSGKFGNNGCEGGLMDNAFKYIKANGGIDTELSYPYN 233
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW--FNFYH 236
G D C + +S G+ + E+ L+ V+ PVSVAIDA+ F FY
Sbjct: 234 G-TDGICHFEKSDVG---ATDTGFVDIPEGNEQLLKKAVATVGPVSVAIDASHESFQFYS 289
Query: 237 GGVFTGP--CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
GV+ P + +HGV +VGYGT + Q YWLVKN WGT W + G + + R
Sbjct: 290 QGVYDEPECSSESLDHGVLVVGYGT----KDGQDYWLVKNSWGTTWGDDGYIYMTR--NK 343
Query: 295 SGLCNIAANAAYPL 308
C IA++A+YPL
Sbjct: 344 ENQCGIASSASYPL 357
>gi|301612003|ref|XP_002935514.1| PREDICTED: cathepsin K-like [Xenopus (Silurana) tropicalis]
Length = 331
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 103/323 (31%), Positives = 165/323 (51%), Gaps = 37/323 (11%)
Query: 9 GNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR----------------LNKFA 52
G + ++ E W + + Y ++ + MR I++KN +R +NKF
Sbjct: 22 GTLDSEWEIWKTTYHKHYDNKIHELMRRLIWEKNLNIIRSHNLEFTQGLHTYELGMNKFG 81
Query: 53 DLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGS 112
D+T E+ + TG K H +N + + + +SID+ ++G VTP++DQG
Sbjct: 82 DMTSEEVVRMMTGLKV----HTGMGPTNLTSDEDEASQRIPNSIDYRKKGYVTPIRDQGE 137
Query: 113 Y-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRL 170
CWAF+ V +EG +TG+LV S LVDC N GC ++ AF+Y+++ + +
Sbjct: 138 CGSCWAFSTVGALEGQLMKKTGKLVGISPQNLVDCVKDNFGCGGGYMTTAFKYVKKNKGI 197
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDA 229
SE YPY G D C + + SG+ I+G++ V+ +E L+ V P+SV IDA
Sbjct: 198 DSEEAYPYVG-MDQKCKY---NVSGRAAEIKGFKEVKKGSETALKKAVGLVGPISVGIDA 253
Query: 230 ---TWFNFYHGGVFTGPC-GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
T+F + G + C G++ NH V VGYG + + YW++KN WG +W G
Sbjct: 254 GLDTFFLYKKGIYYDKSCDGDSINHAVLAVGYGKQKKGK----YWIIKNSWGEDWGNKGY 309
Query: 286 MRIFRGVGGSGLCNIAANAAYPL 308
+ + R G + C IA A+YP+
Sbjct: 310 ILMAREKGNA--CGIANLASYPV 330
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 102/311 (32%), Positives = 159/311 (51%), Gaps = 36/311 (11%)
Query: 23 ARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLASYTGY 66
+ Y+ + E+ R KI+ +N L +N++ D+ +F+++ G+
Sbjct: 37 GKEYQSETEEYYRLKIYMENRMMIARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNGF 96
Query: 67 KPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVE 125
+ P S + + ++DW ++GAVTPVK+QG CWAF+ ++E
Sbjct: 97 RRDYRSKPRQG-SFYIEPEGIEDKHLPKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLE 155
Query: 126 GLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQ 182
G + ++G +V+ S+ LVDCST NGC ++NAF+YI+ + +E YPY G
Sbjct: 156 GQHFRKSGDMVSLSEQNLVDCSTAFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNG-T 214
Query: 183 DYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATW--FNFYHGGV 239
D C + +S G+ + E L+ V+ P+SVAIDA+ F FY GV
Sbjct: 215 DGTCHFKKSDVG---ATDTGFVDIPEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGV 271
Query: 240 FTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
+ P ++ N HGV +VGYGT + Q YWLVKN WGT W +GG + + R
Sbjct: 272 YDEPECSSENLDHGVLVVGYGTKDD----QDYWLVKNSWGTTWGDGGYIYMTRNKDNQ-- 325
Query: 298 CNIAANAAYPL 308
C IA++A+YPL
Sbjct: 326 CGIASSASYPL 336
>gi|428170119|gb|EKX39047.1| hypothetical protein GUITHDRAFT_154556 [Guillardia theta CCMP2712]
Length = 352
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 111/332 (33%), Positives = 162/332 (48%), Gaps = 29/332 (8%)
Query: 2 SRTSHKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL------------- 48
S+TS I W +F + Y D AE RF +FK N E +R
Sbjct: 22 SKTSSVDDEIHLAFISWKNKFEKVY-DGAEHLARFAVFKANMEIIRAHNALYELGEETFS 80
Query: 49 ---NKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLN--SSKMSFYDSIDWNERGA 103
N+FAD+T E+F + GYKP N KN S+ + +IDW + A
Sbjct: 81 MAANQFADMTAEEFKRTVLGYKPELKGKRLLQGLNSGKNCTHRSNNSTRPKAIDWRTKSA 140
Query: 104 VTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENA 160
VTPVK+QG CW+F+ VEG + L++ S+ +LV C T + GC ++NA
Sbjct: 141 VTPVKNQGQCGSCWSFSTTGAVEGAWVVAGHPLISLSEEELVQCDTKSDQGCNGGLMDNA 200
Query: 161 FEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR 220
+ +I Q +A+E VYPY + S K +I + ++P E L+ + +
Sbjct: 201 YAWIIQNGGIAAEDVYPYISGNGTTGVCHVAFLSKKVASISDWCDLKPEDESDLELALVQ 260
Query: 221 QPVSVAIDA--TWFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
QPV+VAI+A + F FY+GGV CG +HGV VGYG + + + YW+VKN WG
Sbjct: 261 QPVAVAIEADQSSFQFYNGGVLPAKKCGTKLDHGVLAVGYGY--DKKHKMHYWIVKNSWG 318
Query: 278 TNWDEGGSMRIFRGVGGS--GLCNIAANAAYP 307
W + G +R+ + + C IA A+YP
Sbjct: 319 AEWGDEGYIRLEKMPKKTKHSACGIAKAASYP 350
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 108/312 (34%), Positives = 159/312 (50%), Gaps = 34/312 (10%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKN-----------HEF-LRLNKFADLTREKFLASYTG 65
W + Y ++ E+ MR I++ N H F L +N D+T + + G
Sbjct: 32 WKSFHGKEYPNKNEETMRNFIWQNNLKKIVTHNEGKHSFKLAMNHLGDMTSLEISQTLLG 91
Query: 66 YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATV 124
K + + N + DSIDW +G VTPVK+QG CWAF+ +
Sbjct: 92 LKLKKHAESQPKGATFLPPAN---VKVVDSIDWRSKGYVTPVKNQGQCGSCWAFSTTGAL 148
Query: 125 EGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGR 181
EG + +TG+LV+ S+ LVDCS NGC ++NAF+YI++ + +E YPY +
Sbjct: 149 EGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCEGGLMDNAFQYIKENGGIDTEKSYPYLAK 208
Query: 182 QDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQD-VVSRQPVSVAIDATW--FNFYHGG 238
D C + +S+ K G+ + E LQ + S P+S+AIDA+ F+FYH G
Sbjct: 209 -DGVCHYNKSAIGAK---DTGFVDIPTGDENALQQALASVGPISIAIDASQSTFHFYHQG 264
Query: 239 VFTGP-CGNT-PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSG 296
V+ P C +T +HGV VGYGT + + YWLVKN WG +W E G ++I R
Sbjct: 265 VYDDPDCSSTRLDHGVLAVGYGT----DDGKDYWLVKNSWGPSWGEEGYIKIAR--NDHD 318
Query: 297 LCNIAANAAYPL 308
C +A+ A+YPL
Sbjct: 319 KCGVASKASYPL 330
>gi|118401108|ref|XP_001032875.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila]
gi|89287220|gb|EAR85212.1| Papain family cysteine protease containing protein [Tetrahymena
thermophila SB210]
Length = 360
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 113/327 (34%), Positives = 171/327 (52%), Gaps = 43/327 (13%)
Query: 10 NIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNH-------EFLR-----LNKFADLTRE 57
+I + + V++A+TYKD E++ RF +F N+ +FL +N+FADLT E
Sbjct: 40 SIERAFKNFKVKYAKTYKDDTEEQYRFSVFTNNYVEIYRHNKFLVFSKVGVNQFADLTHE 99
Query: 58 KFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYD----SIDWNERGAVTPVKDQ-GS 112
+F A YTG H HS + N N D S DW ++GA+TPVK Q G
Sbjct: 100 EFKALYTG-------HKHSKDDDDDDNKNKQPHLPTDNLPASFDWRDKGAITPVKVQNGC 152
Query: 113 YCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRL 170
CWAF+ V ++EGL ++TG+L + S Q++DC ++ GC E AF I+ +
Sbjct: 153 GGCWAFSTVQSIEGLYFLKTGKLESLSTQQVIDCCRIDESGCLGGDPEPAFRCIQNNGGI 212
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA- 229
+E YPY +Q C + + + G GY V P+ + ++ + QP+S+ +++
Sbjct: 213 MTETEYPYIAKQQ-SCKFDEDKPTFQIG---GYIDV-PSDQSQVKAALLIQPLSICLNSS 267
Query: 230 -TWFNFYHGGVFT----GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGG 284
T F +Y GV T GP + P+H + +VGYG + E + YWL+KN+WGT W E G
Sbjct: 268 DTSFKYYKSGVITECEDGPY-DGPDHCLLLVGYG--HDEELKVDYWLIKNQWGTTWGEEG 324
Query: 285 SMRIFRGVG---GSGLCNIAANAAYPL 308
+RI R G G C + A YP+
Sbjct: 325 YVRIIRDDNDHKGPGKCFVVAEVRYPI 351
>gi|118119|sp|P13277.2|CYSP1_HOMAM RecName: Full=Digestive cysteine proteinase 1; Flags: Precursor
gi|11051|emb|CAA45127.1| cysteine proteinase preproenzyme [Homarus americanus]
gi|228243|prf||1801240A Cys protease 1
Length = 322
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/318 (34%), Positives = 166/318 (52%), Gaps = 44/318 (13%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E++ +F R Y D E+ R +F N ++ L +N+F+D+T EKF
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYERGEVTYNLAINQFSDMTNEKF 80
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A GYK P P + F + +++ S +DW +GAVTPVKDQG CWAF
Sbjct: 81 NAVMKGYKKGP--RPAA----VFTSTDAAPES--TEVDWRTKGAVTPVKDQGQCGSCWAF 132
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN----GCAKNFLENAFEYIRQYQRLASEC 174
+ +EG + ++TG+LV+ S+ QLVDC+ + GC ++E A Y+R + +E
Sbjct: 133 STTGGIEGQHFLKTGRLVSLSEQQLVDCAGGSYYNQGCNGGWVERAIMYVRDNGGVDTES 192
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATWFN 233
YPY+ R D C R +++ GY + +E L+ P+SVAIDA+ +
Sbjct: 193 SYPYEAR-DNTC---RFNSNTIGATCTGYVGIAQGSESALKTATRDIGPISVAIDASHRS 248
Query: 234 F--YHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y+ GV+ P C ++ +H V VGYG+ EG Q +WLVKN W T+W E G +++
Sbjct: 249 FQSYYTGVYYEPSCSSSQLDHAVLAVGYGS----EGGQDFWLVKNSWATSWGESGYIKMA 304
Query: 290 RGVGGSGLCNIAANAAYP 307
R + C IA +A YP
Sbjct: 305 RNRNNN--CGIATDACYP 320
>gi|413953048|gb|AFW85697.1| hypothetical protein ZEAMMB73_051316 [Zea mays]
Length = 298
Score = 156 bits (394), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 110/286 (38%), Positives = 153/286 (53%), Gaps = 35/286 (12%)
Query: 46 LRLNKFADLTREKFLASYT---GYKPPPTDHPHSNRSNWFKNLNSSKMSFYD-------S 95
L N+F DLT E+F +Y +PP + ++++ MS D S
Sbjct: 24 LGENQFTDLTEEEFKDTYLMKLDEQPPAAEAMPPT----VGTMSTAGMSNGDNTGEAPNS 79
Query: 96 IDWNERGAVTPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLN 150
+DW +GAVTPVK+Q C CWAF VA++EG+++I+TG+LV+ S+ Q+VDC +
Sbjct: 80 VDWRTKGAVTPVKNQ-QQCGSCWAFATVASIEGVHQIKTGRLVSLSEQQIVDCDRGGNDH 138
Query: 151 GCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYG----AIRGYQYV 206
GC + +A E++ + L +E YPY G Q R SGK G IRGYQ V
Sbjct: 139 GCHGGYPRSAMEWVTRNGGLTTESDYPYVGSQ-------RQCMSGKLGHQAARIRGYQAV 191
Query: 207 QPATEEGLQDVVSRQPVSVAIDAT-WFNFYHGGVFTGPCGNTP-NHGVTIV-GYGTTTEA 263
Q E L+ V+ +PV+V IDA+ F FY GVF+GPC T NH VT+V T +++
Sbjct: 192 QRKNEAELERAVAGRPVAVVIDASRAFQFYKRGVFSGPCNTTTVNHAVTVVGYGSTGSDS 251
Query: 264 EGQQPYWLVKNRWGTNWDEGG-SMRIFRGVGGSGLCNIAANAAYPL 308
G + YW+VKN WG W E G R G+C IA YP+
Sbjct: 252 GGGRKYWIVKNSWGQRWGENGYVRMARRVRAREGMCAIAIEPYYPV 297
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 160/310 (51%), Gaps = 37/310 (11%)
Query: 24 RTYKDQAEKEMRFKIFKKN------HEFL----------RLNKFADLTREKFLASYTGYK 67
+ Y Q E+++R KI+ +N H L +NKF DL +F + GY+
Sbjct: 40 KEYPSQLEEKLRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQ 99
Query: 68 PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEG 126
+ + + F + + + +S+DW E+GA+TPVKDQG CWAF++ +EG
Sbjct: 100 HKKQNSSRAEST--FTFMEPANVEVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEG 157
Query: 127 LNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
+TG+LV+ S+ L+DCS GC ++ AF+YI+ + + +E YPY+ +D
Sbjct: 158 QTFRKTGKLVSLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEA-ED 216
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--FNFYHGGVF 240
C R + + RG+ + E+ L+ V+ PVSVAIDA+ F FY G +
Sbjct: 217 GVC---RYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGXY 273
Query: 241 TGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLC 298
P ++ +HGV +VGYG+ + + YWLVKN W +W + G ++I R C
Sbjct: 274 YEPSCDSDDLDHGVLVVGYGS----DNGEDYWLVKNSWSEHWGDEGYIKIARNRKNH--C 327
Query: 299 NIAANAAYPL 308
+A A+YPL
Sbjct: 328 GVATAASYPL 337
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 155 bits (393), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 106/322 (32%), Positives = 161/322 (50%), Gaps = 47/322 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + ++Y+ E+ +RFKIF +N L +N+F DL +F
Sbjct: 28 EAFKTTHKKSYQSHMEELLRFKIFTENSLIIAKHNAKYAKGLVSYKLGMNQFGDLLAHEF 87
Query: 60 LASYTGYKPPPTDHPHSNR----SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+ G+ H R S + N + S +DW ++GAVTPVKDQG
Sbjct: 88 ARIFNGH--------HGTRKTGGSTFLPPANVNDSSLPKVVDWRKKGAVTPVKDQGQCGS 139
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+A ++EG + ++ G+LV+ S+ LVDCS NGC +E+AF+YI+ +
Sbjct: 140 CWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSFGNNGCEGGLMEDAFKYIKANDGID 199
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDAT 230
+E YPY+ D C + + GY ++ +E L+ V+ P+SVAIDA+
Sbjct: 200 TEKSYPYEAV-DGECRFKKEDVG---ATDTGYVEIKAGSEVDLKKAVATVGPISVAIDAS 255
Query: 231 W--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GV+ P C + +HGV +VGYG +G + YWLVKN W +W + G +
Sbjct: 256 HSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV----KGGKKYWLVKNSWAESWGDQGYI 311
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
+ R + C IA+ A+YPL
Sbjct: 312 LMSR--DNNNQCGIASQASYPL 331
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/310 (32%), Positives = 160/310 (51%), Gaps = 37/310 (11%)
Query: 24 RTYKDQAEKEMRFKIFKKN------HEFL----------RLNKFADLTREKFLASYTGYK 67
+ Y Q E++ R KI+ +N H L +NKF DL +F + GY+
Sbjct: 40 KEYPSQLEEKFRMKIYLENKHKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQ 99
Query: 68 PPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEG 126
+ + + F + + + +S+DW +GA+TPVKDQG CWAF++ +EG
Sbjct: 100 HKKQNSSRAEST--FTFMEPANVEVPESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEG 157
Query: 127 LNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQD 183
+TG+L++ S+ L+DCS GC ++ AF+YI+ + + +E YPY+ +D
Sbjct: 158 QTFRKTGKLISLSEQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEA-ED 216
Query: 184 YYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--FNFYHGGVF 240
C R + + RG+ ++ E+ L+ V+ PVSVAIDA+ F FY GV+
Sbjct: 217 NVC---RYNPRNRGAIDRGFVHIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVY 273
Query: 241 TGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGLC 298
P ++ +HGV +VGYG+ + + YWLVKN W +W + G ++I R C
Sbjct: 274 YEPSCDSDDLDHGVLVVGYGS----DNGKDYWLVKNSWSEHWGDEGYIKIARNRKNH--C 327
Query: 299 NIAANAAYPL 308
IA A+YPL
Sbjct: 328 GIATAASYPL 337
>gi|403333364|gb|EJY65772.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/311 (35%), Positives = 160/311 (51%), Gaps = 41/311 (13%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHE-------------FLRLNKFADLTREKFLASYT 64
++ ++ ++Y + E + R K+FK+N L LNKFAD T ++
Sbjct: 46 FVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNARNDVTYRLGLNKFADYTEAEY-KRLL 104
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
G+ +P + K L + K D ++W E+GAVTPVKDQG CW+F+A
Sbjct: 105 GFGGQKNKNPRN-----IKVLGAPKN---DGVNWVEQGAVTPVKDQGQCGSCWSFSATGA 156
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
+EG KI+ G L + S+ QLVDCS GC +++ AF+Y+ Q L +E YPY+
Sbjct: 157 MEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQ-TALETEDQYPYEA 215
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGG 238
D C R+S++G + + V P L+ + + PVSVAI+A F FY GG
Sbjct: 216 VDD-TC---RASSAGVV-KVDSFVDVTPNNVNELKAALDKGPVSVAIEADQMVFQFYSGG 270
Query: 239 VFT-GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
V CG T +HGV VGYG E Q Y+LVKN WG +W E G ++I +
Sbjct: 271 VINDASCGTTLDHGVLAVGYGN----ESGQDYFLVKNSWGASWGEEGYVKI--AASPDNI 324
Query: 298 CNIAANAAYPL 308
C I + A+YP+
Sbjct: 325 CGILSQASYPI 335
>gi|1730100|sp|P36400.2|LMCPB_LEIME RecName: Full=Cysteine proteinase B; Flags: Precursor
gi|899313|emb|CAA90236.1| LmCPb2.8 [Leishmania mexicana]
Length = 443
Score = 155 bits (393), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 149/300 (49%), Gaps = 27/300 (9%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKF 59
AA E++ + R Y+ AE++ R F++N E +R + KF DL+ +F
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEF 94
Query: 60 LASY---TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
A Y Y H + ++ + + D++DW E+GAVTPVKDQG+ C
Sbjct: 95 AARYLNGAAYFAAAKRHA----AQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 150
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ--RLAS 172
WAF+AV +EG + +LV+ S+ QLV C +N GC + AF+++ Q L +
Sbjct: 151 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHT 210
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY Y + SS I G+ + + + + P+++A+DA+ F
Sbjct: 211 EDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSF 270
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV T G NHGV +VGY T G+ PYW++KN WG +W E G +R+ GV
Sbjct: 271 MSYKSGVLTACIGKQLNHGVLLVGYDMT----GEVPYWVIKNSWGGDWGEQGYVRVVMGV 326
>gi|356514419|ref|XP_003525903.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 343
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 114/319 (35%), Positives = 156/319 (48%), Gaps = 57/319 (17%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREK 58
+ + +E+ + + + Y E E RF+I K+N +F+ LN+FAD +R
Sbjct: 48 VMSIYEEXLAKHGKVYNAIDEMEERFQISKENLKFVEQHNAGNRTYKVGLNRFADRSRMM 107
Query: 59 FLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CW 116
S + Y P +D+ +S+DW + GAV VK Q S C C
Sbjct: 108 TRPS-SRYAPRVSDN------------------LSESVDWRKEGAVVRVKTQ-SECESCR 147
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TLN-GCAKNFLENAFEYIRQYQRLASEC 174
FT +A VEG+NKI TG L L DC T+N GC+ + A E+I + +E
Sbjct: 148 TFTVIAAVEGINKIVTGNLTA-----LSDCDRTVNAGCSGGLADYALEFIINNGGIDTEE 202
Query: 175 VYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVA-IDA--TW 231
YP+QG CD + K A+ GY+ V E L+ V+ QPVSVA I+A
Sbjct: 203 DYPFQGAVGI-CDQY------KINAVDGYERVPAYDELALKKAVANQPVSVAYIEAYGKE 255
Query: 232 FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRG 291
F Y G+FTG CG + +HGVT VGYGT E YW+VKN WG NW E G +R+ R
Sbjct: 256 FQLYESGIFTGKCGTSIDHGVTAVGYGT----ENGIDYWIVKNSWGENWGEAGYVRMERN 311
Query: 292 VG--GSGLCNIAANAAYPL 308
+G C IA YP+
Sbjct: 312 TAEDTAGKCGIAILTLYPI 330
>gi|27413319|gb|AAO11786.1| pre-pro cysteine proteinase [Vicia faba]
Length = 363
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/314 (33%), Positives = 156/314 (49%), Gaps = 39/314 (12%)
Query: 21 EFARTYKDQAEKEMRFKIFKKN------HEFLR------LNKFADLTREKFLASYTGYKP 68
+F+++Y + E + RF +FK N H+ L + KF+DLT +F + G K
Sbjct: 54 KFSKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQFLGLKK 113
Query: 69 PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGL 127
H+ ++ N + DW E+GAVTPVKDQGS CWAF+ +EG
Sbjct: 114 RLRLPAHAQKAPILPTTN-----LPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGA 168
Query: 128 NKIRTGQLVTRSKHQLVDC----------STLNGCAKNFLENAFEYIRQYQRLASECVYP 177
+ + TG+LV+ S+ QLVDC S +GC + NAFEY+ Q + E Y
Sbjct: 169 HYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYA 228
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHG 237
Y GR D C + +S ++ + V E+ ++V P++V I+A W Y
Sbjct: 229 YTGR-DGSCKFDKSKV---VASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMS 284
Query: 238 GVFTGP--CGNT-PNHGVTIVGYGTTTEAE---GQQPYWLVKNRWGTNWDEGGSMRIFRG 291
GV + P C + +HGV +VG+G A ++PYW+VKN WG NW E G +I RG
Sbjct: 285 GV-SCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKICRG 343
Query: 292 VGGSGLCNIAANAA 305
G+ ++ + A
Sbjct: 344 RNVCGVDSMVSTVA 357
>gi|1401242|gb|AAB67878.1| pre-pro-cysteine proteinase [Vicia faba]
Length = 363
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 104/314 (33%), Positives = 156/314 (49%), Gaps = 39/314 (12%)
Query: 21 EFARTYKDQAEKEMRFKIFKKN------HEFLR------LNKFADLTREKFLASYTGYKP 68
+F+++Y + E + RF +FK N H+ L + KF+DLT +F + G K
Sbjct: 54 KFSKSYSTKEEHDYRFGVFKSNLIKAKLHQKLDPTAEHGITKFSDLTASEFRRQFLGLKK 113
Query: 69 PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGL 127
H+ ++ N + DW E+GAVTPVKDQGS CWAF+ +EG
Sbjct: 114 RLRLPAHAQKAPILPTTN-----LPEDFDWREKGAVTPVKDQGSCGSCWAFSTTGALEGA 168
Query: 128 NKIRTGQLVTRSKHQLVDC----------STLNGCAKNFLENAFEYIRQYQRLASECVYP 177
+ + TG+LV+ S+ QLVDC S +GC + NAFEY+ Q + E Y
Sbjct: 169 HYLATGKLVSLSEQQLVDCDHVCDPEQAGSCDSGCNGGLMNNAFEYLLQSGGVVQEKDYA 228
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHG 237
Y GR D C + +S ++ + V E+ ++V P++V I+A W Y
Sbjct: 229 YTGR-DGSCKFDKSKV---VASVSNFSVVSLDEEQIAANLVKNGPLAVGINAAWMQTYMS 284
Query: 238 GVFTGP--CGNT-PNHGVTIVGYGTTTEAE---GQQPYWLVKNRWGTNWDEGGSMRIFRG 291
GV + P C + +HGV +VG+G A ++PYW+VKN WG NW E G +I RG
Sbjct: 285 GV-SCPYVCAKSRLDHGVLLVGFGKGAYAPIRLKEKPYWIVKNSWGQNWGEQGYYKICRG 343
Query: 292 VGGSGLCNIAANAA 305
G+ ++ + A
Sbjct: 344 RNVCGVDSMVSTVA 357
>gi|42407296|dbj|BAD10859.1| cysteine protease [Aster tripolium]
Length = 363
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 101/306 (33%), Positives = 154/306 (50%), Gaps = 39/306 (12%)
Query: 21 EFARTYKDQAEKEMRFKIFKKN------HEFLR------LNKFADLTREKFLASYTGYKP 68
+F RTY + E E R +FK N H+ L + KF+DLT +F Y G K
Sbjct: 56 KFGRTYDTEEEHEYRLTVFKSNLRRAKRHQVLDPTAKHGVTKFSDLTPSEFRKKYLGLKS 115
Query: 69 PPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGL 127
+N++ N + DW ++GAVTPVK+QGS CW+F+ +EG
Sbjct: 116 KLKLPADANKAPILPTSNLPQ-----DFDWRDKGAVTPVKNQGSCGSCWSFSTTGALEGS 170
Query: 128 NKIRTGQLVTRSKHQLVDC----------STLNGCAKNFLENAFEYIRQYQRLASECVYP 177
+ ++TG+LV+ S+ QLVDC S +GC + NAFEYI + L E YP
Sbjct: 171 HFLQTGELVSLSEQQLVDCDHECDPAEYNSCDSGCNGGLMNNAFEYILKAGGLQKEADYP 230
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWFNFYHG 237
Y GR D C + +S + ++ + V ++ ++V+ P+++ I+A W Y G
Sbjct: 231 YTGR-DGTCKFDKSKIA---ASVANFSVVSTDEDQIAANLVTNGPLAIGINAAWMQTYIG 286
Query: 238 GVFTGP--CGNTP-NHGVTIVGYGTTTEAE---GQQPYWLVKNRWGTNWDEGGSMRIFRG 291
V + P C T +HGV +VGYG+ A ++PYW++KN WG +W E G ++ G
Sbjct: 287 QV-SCPYICSKTKMDHGVLLVGYGSAGYAPLRFKEKPYWIIKNSWGEDWGEDGYYKLCSG 345
Query: 292 VGGSGL 297
G+
Sbjct: 346 YNACGM 351
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/330 (33%), Positives = 163/330 (49%), Gaps = 47/330 (14%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLN 49
H+ + + W + + Y+++ E+ R I++KN H + L +N
Sbjct: 19 HRDPTLDHHWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNLEHSMGMHSYELGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++S + + P S W +N+ +S DS+DW E+G VT
Sbjct: 79 HLGDMTSEEVISSMSSLRVP---------SQWPRNVTYKSSPNQKLPDSLDWREKGCVTE 129
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QG+ CWAF+AV +E K++TG+LV+ S LVDCST+ GC F+ AF
Sbjct: 130 VKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGCNGGFMTEAF 189
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ 221
+YI + SE YPY+ D C + + Y + +EE L++ V+ +
Sbjct: 190 QYIIDNNGIDSEASYPYKA-MDGRCQY---DVKNRAATCSRYIELPFGSEEALKEAVANK 245
Query: 222 -PVSVAIDA--TWFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWG 277
PVSV IDA T F Y GV+ P C NHGV +VGYG+ + YWLVKN WG
Sbjct: 246 GPVSVGIDAKQTSFFLYKTGVYYDPSCTQNVNHGVLVVGYGSLNGKD----YWLVKNSWG 301
Query: 278 TNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
N+ + G +R+ R G C IA +YP
Sbjct: 302 LNFGDQGYIRMARNSGNH--CGIANFPSYP 329
>gi|157278117|ref|NP_001098157.1| cathepsin S precursor [Oryzias latipes]
gi|50251130|dbj|BAD27582.1| cathepsin S [Oryzias latipes]
Length = 327
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 111/316 (35%), Positives = 159/316 (50%), Gaps = 42/316 (13%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKFLA 61
W +++TY + E+ R +I+++N E L +N DLT E+ +A
Sbjct: 28 WKKTYSKTYSHEIEEFGRRRIWEENLEMISVHNLEVSLGLHSYELAMNHLGDLTIEELIA 87
Query: 62 SYTG-YKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFT 119
S TG P + H + L S +S+DW E G VT VK QG CWAF+
Sbjct: 88 SLTGTVAPVGLERIHYD-------LVKINTSVPESVDWREGGLVTSVKTQGRCGSCWAFS 140
Query: 120 AVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVY 176
AV +EG K TG L + S LVDCST GC F+ NAF+Y+ + Q ++S+ Y
Sbjct: 141 AVGALEGQLKKTTGILTSLSPQNLVDCSTKYGNYGCKGGFMSNAFQYVIKNQGISSDAAY 200
Query: 177 PYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQ-DVVSRQPVSVAIDATW--FN 233
PY G++D C + + + GY ++ E L+ V + P+SVAIDA+ F
Sbjct: 201 PYIGKRD-KCKY---DSKHRAANCTGYNFLPKGDEFALKVGVATIGPISVAIDASRPKFL 256
Query: 234 FYHGGVFTG-PCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
FY GV+ C + NHGV +VGYGT E + YWLVKN WG + +GG +++ R
Sbjct: 257 FYRHGVYKDHSCSHNVNHGVLVVGYGT----ENGEDYWLVKNSWGERYGDGGYIKMARNR 312
Query: 293 GGSGLCNIAANAAYPL 308
C IA A +P+
Sbjct: 313 RNQ--CGIALYACFPV 326
>gi|401416324|ref|XP_003872657.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|322488881|emb|CBZ24131.1| putative cathepsin L-like protease [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 443
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 98/300 (32%), Positives = 149/300 (49%), Gaps = 27/300 (9%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKF 59
AA E++ + R Y+ AE++ R F++N E +R + KF DL+ +F
Sbjct: 35 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEF 94
Query: 60 LASY---TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
A Y Y H + ++ + + D++DW E+GAVTPVKDQG+ C
Sbjct: 95 AARYLNGAAYFAAAKRHA----AQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 150
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ--RLAS 172
WAF+AV +EG + +LV+ S+ QLV C +N GC + AF+++ Q L +
Sbjct: 151 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHT 210
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY Y + SS I G+ + + + + P+++A+DA+ F
Sbjct: 211 EDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSF 270
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV T G NHGV +VGY T G+ PYW++KN WG +W E G +R+ GV
Sbjct: 271 MSYKSGVLTACIGKQLNHGVLLVGYDMT----GEVPYWVIKNSWGGDWGEQGYVRVVMGV 326
>gi|356552228|ref|XP_003544471.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 351
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/327 (33%), Positives = 164/327 (50%), Gaps = 38/327 (11%)
Query: 2 SRTSHKTGN-IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEF------------LRL 48
R + +T + + + E+W+V+ + Y EKE RF+IFK N F L L
Sbjct: 31 DRATRRTDDEVMSMFEEWLVKHDKVYNALGEKEKRFQIFKNNLRFIDERNSLNRTYKLGL 90
Query: 49 NKFADLTREKFLASY--TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTP 106
N FADLT ++ A Y T P D R+++ + + S+DW + GAVTP
Sbjct: 91 NVFADLTNAEYRAMYLRTWDDGPRLDLDTPPRNHYVPRVGDT---IPKSVDWRKEGAVTP 147
Query: 107 VKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFE 162
VK+QG+ C CWAFTAV VE L KI+TG L++ S+ ++VDC+T + GC +++ +
Sbjct: 148 VKNQGATCNSCWAFTAVGAVESLVKIKTGDLISLSEQEVVDCTTSSSRGCGGGDIQHGYI 207
Query: 163 YIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDVVSRQ 221
YIR+ ++ E YPY+G + CD S+ I G+ +V EE L + +
Sbjct: 208 YIRK-NGISLEKDYPYRGDEG-KCD---SNKKNAIVTIDGHGWVPTQLEEALNRALFCYC 262
Query: 222 PVSVAIDATWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWD 281
+ +D F GVF G CG NH + +VGYGT + + YW+ KN + W
Sbjct: 263 AYFLYVDKF---FLCQGVFKGKCGTELNHALLLVGYGTEKDGD----YWIAKNSYSDKWG 315
Query: 282 EGGSMRIFRGVGGSGLCNIAANAAYPL 308
E G +RI R + C YP+
Sbjct: 316 ENGYIRIQRKL---STCKFGNGGYYPI 339
>gi|59798093|sp|P84346.1|MEX1_JACME RecName: Full=Mexicain
Length = 214
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 93/225 (41%), Positives = 128/225 (56%), Gaps = 23/225 (10%)
Query: 92 FYDSIDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS-TL 149
+ +SIDW E+GAVTPVK+Q CWAF+ VAT+EG+NKI TGQL++ S+ +L+DC
Sbjct: 1 YPESIDWREKGAVTPVKNQNPCGSCWAFSTVATIEGINKIITGQLISLSEQELLDCEYRS 60
Query: 150 NGCAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA---IRGYQYV 206
+GC + + +Y+ + +E YPY+ +Q R A K G I GY+YV
Sbjct: 61 HGCDGGYQTPSLQYVVD-NGVHTEREYPYEKKQG------RCRAKDKKGPKVYITGYKYV 113
Query: 207 QPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAE 264
E L ++ QPVSV D+ F FY GG++ GPCG +H VT VGYG T
Sbjct: 114 PANDEISLIQAIANQPVSVVTDSRGRGFQFYKGGIYEGPCGTNTDHAVTAVGYGKT---- 169
Query: 265 GQQPYWLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
Y L+KN WG NW E G +RI R G S G C + ++ +P+
Sbjct: 170 ----YLLLKNSWGPNWGEKGYIRIKRASGRSKGTCGVYTSSFFPI 210
>gi|159485468|ref|XP_001700766.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158281265|gb|EDP07020.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 498
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 113/314 (35%), Positives = 159/314 (50%), Gaps = 30/314 (9%)
Query: 18 WMVEFARTYKDQA-EKEMRFKIFKKNHEF------------LRLNKFADLTREKFLASYT 64
W + ARTY + + E R +F N L LN++AD T E+F A
Sbjct: 43 WATQHARTYSEGSPEYTRRLGVFADNVRAIAEQNRRNTGITLALNEYADETWEEFAAKRL 102
Query: 65 GYKPPPTDHPHSNRSNWFKNLNS---SKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTA 120
G K + + +S +++ ++DW + AVT VK+QG CWAF+A
Sbjct: 103 GLKISQEQLKAREARSSSSSSSSWRYAQVQTPAAVDWRAKNAVTQVKNQGQCGSCWAFSA 162
Query: 121 VATVEGLNKIRTGQLVTRSKHQLVDCSTLN--GCAKNFLENAFEYIRQYQRLASECVYPY 178
V ++EG N + TGQLV S+ QLVDC T + GC+ +++AF+Y+ + +E Y Y
Sbjct: 163 VGSIEGANALATGQLVALSEQQLVDCDTASNMGCSGGLMDDAFKYVLDNGGIDTEEDYSY 222
Query: 179 QGRQDYYCDWW---RSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATW-FNF 234
Y +W R +I GY+ V P +E L V+ QPV+VAI A+ F
Sbjct: 223 W--SGYGFGFWCNKRKQTDRPAVSIDGYEDV-PTSEPALLKAVAGQPVAVAICASANMQF 279
Query: 235 YHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
Y GV C NHGV VGY T+ +A QPYW+VKN WG +W E G R+ G G
Sbjct: 280 YSSGVINSCCEGL-NHGVLAVGYDTSDKA---QPYWIVKNSWGGSWGEQGYFRLKMGEGP 335
Query: 295 SGLCNIAANAAYPL 308
GLC IA+ A+Y +
Sbjct: 336 KGLCGIASAASYAV 349
>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
Length = 331
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 112/333 (33%), Positives = 164/333 (49%), Gaps = 53/333 (15%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W + + YK++ E+ +R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++ + + P S W +N+ ++ DS+DW E+G VT
Sbjct: 79 HLGDMTSEEVMSLTSSLRVP---------SQWQRNITYKSNPNRILPDSVDWREKGCVTE 129
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LVT S LVDCST GC F+ AF
Sbjct: 130 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCSTEKYGNKGCNGGFMTTAF 189
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDW---WRSSASGKYGAIRGYQYVQPATEEGLQDVV 218
+YI + + S+ YPY+ D C + +R++ KY + E+ L++ V
Sbjct: 190 QYIIDNKGIDSDASYPYKA-MDQKCQYDSKYRAATCSKYTEL------PYGREDVLKEAV 242
Query: 219 SRQ-PVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274
+ + PVSV +DA F Y GV+ P C NHGV +VGYG E YWLVKN
Sbjct: 243 ANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKE----YWLVKN 298
Query: 275 RWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
WG N+ E G +R+ R G C IA+ +YP
Sbjct: 299 SWGHNFGEEGYIRMARNKGNH--CGIASFPSYP 329
>gi|261289787|ref|XP_002611755.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
gi|229297127|gb|EEN67765.1| hypothetical protein BRAFLDRAFT_284339 [Branchiostoma floridae]
Length = 327
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 97/275 (35%), Positives = 152/275 (55%), Gaps = 24/275 (8%)
Query: 45 FLRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAV 104
F+R+NKF D+T E+F G ++ F++L K++ D++DW ++GAV
Sbjct: 65 FMRMNKFGDMTNEEFQMLVIGSGLLYSNKTQQTEGGVFESLPGLKVN--DTVDWRQKGAV 122
Query: 105 TPVKDQ---GSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN---GCAKNFLE 158
T VK+Q GS CWAF+ ++EG + +++G LV+ S+ LVDCS GC ++
Sbjct: 123 TKVKNQEQCGS--CWAFSTTGSLEGQHFLKSGTLVSLSEQNLVDCSRKEGNKGCQGGLMD 180
Query: 159 NAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGL-QDV 217
AF+YI+ + +E YPY+G+ + C+ ++SS SG + Y ++ E+ L Q
Sbjct: 181 QAFKYIKTNGGIDTEECYPYKGKNERKCE-YKSSCSG--ATLSSYVDIKTGDEDALMQAS 237
Query: 218 VSRQPVSVAIDATW--FNFYHGGVF-TGPCGNTP-NHGVTIVGYGTTTEAEGQQPYWLVK 273
+ P+SV IDA+ F Y GV+ C + +HGV +VGYGT +G++ YWLVK
Sbjct: 238 ATIGPISVGIDASHPSFQLYDHGVYHEKRCSSKKLDHGVLVVGYGT----DGEKDYWLVK 293
Query: 274 NRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
N WG W G +++ R C IA A+YP+
Sbjct: 294 NSWGEEWGMEGYIKMSRNKDNQ--CGIATQASYPV 326
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 110/316 (34%), Positives = 161/316 (50%), Gaps = 35/316 (11%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
++W E + Y E+ R I++KN + L +N+FADL ++F
Sbjct: 29 KEWKNEHGKRYLSDEEEASRRLIWQKNLDIVIRHNLKYDLGHFTYDLGMNQFADLQNKEF 88
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
+A TG++ T + S + N K+ ++DW +G VTPVKDQG CWAF
Sbjct: 89 VAMMTGFRVNGTSKA-AKGSTFLPPNNVGKLP--KTVDWRTKGYVTPVKDQGQCGSCWAF 145
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYP 177
+A ++EG + +TG+LV+ S+ LVDCS N GC ++ AF+YI + +E YP
Sbjct: 146 SATGSLEGQHFKKTGKLVSLSEQNLVDCSDKNYGCNGGLMDRAFQYIIDAGGIDTEESYP 205
Query: 178 YQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDATWFNF-- 234
Y D C + ++ + GY V +E+ LQ V+ P+SVAIDA+ F+F
Sbjct: 206 YIA-MDGNCHFKTANVG---ATVTGYTDVTSGSEKALQKAVAHIGPISVAIDASHFSFQL 261
Query: 235 YHGGVFTGP-CGNT-PNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV+ P C +T +HGV VGYGTT + YW+VKN W W G + + R
Sbjct: 262 YQSGVYNEPGCSSTLLDHGVLAVGYGTTIDG---TDYWIVKNSWAETWGMNGYIWMSRNK 318
Query: 293 GGSGLCNIAANAAYPL 308
C IA A+YPL
Sbjct: 319 DNQ--CGIATQASYPL 332
>gi|118123|sp|P25782.1|CYSP2_HOMAM RecName: Full=Digestive cysteine proteinase 2; Flags: Precursor
gi|11053|emb|CAA45128.1| cysteine proteinase preproenzyme [Homarus americanus]
Length = 323
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 106/318 (33%), Positives = 161/318 (50%), Gaps = 41/318 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + ++ R Y D E R IF++N ++ L +NKF D+T E+F
Sbjct: 21 EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A G P + S ++ + + +DW +GAVTPVKDQG CWAF
Sbjct: 81 NAVMKGNIP----RRSAPVSVFYPKKETGPQA--TEVDWRTKGAVTPVKDQGQCGSCWAF 134
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECV 175
+ ++EG + ++TG L++ ++ QLVDCS GC ++ +AF+YI+ + +E
Sbjct: 135 STTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAA 194
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDA--TWF 232
YPY+ R D C + +S + G+ + +E GLQ V P+SV IDA + F
Sbjct: 195 YPYEAR-DGSCRFDSNSVA---ATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSF 250
Query: 233 NFYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GV+ P + +H V VGYG+ EG Q +WLVKN W T+W + G +++ R
Sbjct: 251 QFYSSGVYYEPSCSPSYLDHAVLAVGYGS----EGGQDFWLVKNSWATSWGDAGYIKMSR 306
Query: 291 GVGGSGLCNIAANAAYPL 308
+ C IA A+YPL
Sbjct: 307 NRNNN--CGIATVASYPL 322
>gi|33242880|gb|AAQ01144.1| cathepsin [Branchiostoma lanceolatum]
Length = 334
Score = 155 bits (392), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 108/318 (33%), Positives = 153/318 (48%), Gaps = 34/318 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKF 59
E W ++ + Y+ +AE+ R IF+KN H + L +NKF D+ E+F
Sbjct: 25 EMWKLQHGKQYETEAEEYSRRFIFEKNTVKIAEHNIRASLGMHSYTLAMNKFGDMHHEEF 84
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
G P N + S+DW V+ VKDQG CWAF
Sbjct: 85 HQRIMGGCLKIVKKPLLGSE---VGDNDDNGTLPKSVDWRNSHMVSEVKDQGECGSCWAF 141
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECV 175
+ ++EG + +TG+LV S+ QLVDCS GC ++ AF+YI+ L +E
Sbjct: 142 STTGSLEGQHSNKTGKLVDLSEQQLVDCSKDFGNQGCGGGLMDQAFQYIKANGGLDTEES 201
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDA--TWF 232
YPY D C + SS + GY+ V+ + E L+ V+ PVSVAIDA F
Sbjct: 202 YPYTATDDKPCKFDNSSVG---ATLIGYKDVKSSNEHALKRAVATVGPVSVAIDAGHESF 258
Query: 233 NFYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GV+ P +T +HGV +VGYG + Q +W+VKN WG NW + G + + R
Sbjct: 259 QFYSSGVYDEPQCSTEQLDHGVLVVGYGAMND-NSHQAFWIVKNSWGPNWGDQGYIMMSR 317
Query: 291 GVGGSGLCNIAANAAYPL 308
C IA +A+YPL
Sbjct: 318 NKNNQ--CGIATSASYPL 333
>gi|157833554|pdb|1PPP|A Chain A, Crystal Structure Of Papain-E64-C Complex. Binding
Diversity Of E64-C To Papain S2 And S3 Subsites
Length = 212
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 94/219 (42%), Positives = 125/219 (57%), Gaps = 19/219 (8%)
Query: 96 IDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCA 153
+DW ++GAVTPVK+QGS CWAF+AV T+EG+ KIRTG L S+ +L+DC + GC
Sbjct: 5 VDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCDRRSYGCN 64
Query: 154 KNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEE 212
+ +A + + QY + YPY+G Q Y RS G Y A G + VQP +
Sbjct: 65 GGYPWSALQLVAQYG-IHYRNTYPYEGVQRY----CRSREKGPYAAKTDGVRQVQPYNQG 119
Query: 213 GLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYW 270
L ++ QPVSV + A F Y GG+F GPCGN +H V VGYG Y
Sbjct: 120 ALLYSIANQPVSVVLQAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGPN--------YI 171
Query: 271 LVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
L+KN WGT W E G +RI RG G S G+C + ++ YP+
Sbjct: 172 LIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 210
>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 117/312 (37%), Positives = 158/312 (50%), Gaps = 36/312 (11%)
Query: 14 KHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-LNKFADLTREKFLASYTGYKPPPTD 72
+HEQ M +++ YKD E F N ++ N AD + + + PP +
Sbjct: 38 RHEQRMTRYSKVYKDPPES------FXGNVNYIEACNNAADKPYKXGINQF-----PPRN 86
Query: 73 ----HPHSN--RSNWFKNLNSSKMSFYDSIDWNERGAVTP--VKDQGSY-CCWAFTAVAT 123
H S+ R FK N + ++D ++GAVTP VKDQG C WA +AVA
Sbjct: 87 RFKGHMCSSIIRITTFKFENVTATP--STVDCRQKGAVTPYTVKDQGQCGCFWALSAVAA 144
Query: 124 VEGLNKIRTGQLVTRSKH-QLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
EG++ + G+L+ S +LVDC T GC ++AF++I Q L +E YPY+
Sbjct: 145 TEGIHALXAGKLILLSXEPELVDCDTKGVDQGCEGGLTDDAFKFIIQNHGLNTEANYPYK 204
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG-LQDVVSRQPVSVAIDATW--FNFYH 236
G D C+ + + I GY V E+ LQ V+ PVSVAIDA+ F FY
Sbjct: 205 GV-DGKCNANEADKNAAT-IITGYDDVPANNEKAHLQKAVANNPVSVAIDASGSDFQFYK 262
Query: 237 GGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG-S 295
GVFTG CG +HGVT VGYG + + YWLVKN G W E G +R+ RGV
Sbjct: 263 SGVFTGSCGTELDHGVTAVGYGVSDDG---TEYWLVKNSRGPEWGEEGYIRMQRGVDSEE 319
Query: 296 GLCNIAANAAYP 307
LC IA A+YP
Sbjct: 320 ALCGIAVQASYP 331
>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
Length = 344
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 112/328 (34%), Positives = 169/328 (51%), Gaps = 44/328 (13%)
Query: 16 EQW---MVEFARTYKDQAEKEMRFKIF--------KKNHEF--------LRLNKFADLTR 56
E+W +E ++ Y + E + R KI+ K N F L+ NK+AD+
Sbjct: 25 EEWNAFKMEHSKQYDSEVEDKFRMKIYVENKHRIAKHNQRFEQRLVSYKLKPNKYADMLH 84
Query: 57 EKFLASYTGYKPPPTDHPHSNRSNWFKN--------LNSSKMSFYDSIDWNERGAVTPVK 108
+F+ + G+ H N++ K + + +S+ D +DW ++GAVT VK
Sbjct: 85 HEFVHTMNGFNKTAK-HGGRNKAVHSKGRDGRAATFIAPAHVSYPDHVDWRKKGAVTDVK 143
Query: 109 DQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYI 164
DQG CWAF+ +EG + +TG LV+ S+ LVDCS NGC ++NAF+YI
Sbjct: 144 DQGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVDCSAAYGNNGCNGGLMDNAFKYI 203
Query: 165 RQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVS 224
+ + +E YPY+ D ++S + G + Q E+ +Q V + P+S
Sbjct: 204 KDNGGIDTEKSYPYEAVDDKCRYNPKNSGADDVGFV---DIPQGDEEKLMQAVATVGPIS 260
Query: 225 VAIDATW--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNW 280
VAIDA+ F FY GV+ C +T +HGV +VGYG TE EG YWLVKN WG +W
Sbjct: 261 VAIDASQETFQFYSKGVYYDENCSSTDLDHGVMVVGYG--TEEEGGD-YWLVKNSWGRSW 317
Query: 281 DEGGSMRIFRGVGGSGLCNIAANAAYPL 308
E G +++ + C IA++A+YPL
Sbjct: 318 GELGYIKMAH--NKNNHCGIASSASYPL 343
>gi|403368476|gb|EJY84073.1| Cathepsin L [Oxytricha trifallax]
Length = 338
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 110/311 (35%), Positives = 160/311 (51%), Gaps = 41/311 (13%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHE-------------FLRLNKFADLTREKFLASYT 64
++ ++ ++Y + E + R K+FK+N L LNKFAD T ++
Sbjct: 46 FVAKYGKSYGTKEEYDFRSKLFKQNLAKVSMNNVRNDVTYRLGLNKFADYTEAEY-KRLL 104
Query: 65 GYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAVAT 123
G+ +P + K L + K D ++W E+GAVTPVKDQG CW+F+A
Sbjct: 105 GFGGQKNKNPRN-----IKVLGAPKN---DGVNWVEQGAVTPVKDQGQCGSCWSFSATGA 156
Query: 124 VEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQG 180
+EG KI+ G L + S+ QLVDCS GC +++ AF+Y+ Q L +E YPY+
Sbjct: 157 MEGHAKIQFGTLYSLSEQQLVDCSQAEGNEGCGGGWMDQAFQYVEQ-TALETEDQYPYEA 215
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDA--TWFNFYHGG 238
D C R+S++G + + V P L+ + + PVSVAI+A F FY GG
Sbjct: 216 VDD-TC---RASSAGVV-KVDSFVDVTPNNVNELKAALDKGPVSVAIEADQMVFQFYSGG 270
Query: 239 VFT-GPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGGSGL 297
V CG T +HGV VGYG E Q Y+LVKN WG +W E G ++I +
Sbjct: 271 VINDASCGTTLDHGVLAVGYGN----ESGQDYFLVKNSWGASWGEEGYVKI--AASPDNI 324
Query: 298 CNIAANAAYPL 308
C I + A+YP+
Sbjct: 325 CGILSQASYPI 335
>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
Length = 330
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 109/321 (33%), Positives = 163/321 (50%), Gaps = 54/321 (16%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------NKFADLTREKFLA 61
W + + YK++ E+ +R I++KN +F+ L N D+T E+ ++
Sbjct: 31 WKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMS 90
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNL----NSSKMSFYDSIDWNERGAVTPVKDQGSY-CCW 116
+ + P S W +N+ N ++M DS+DW E+G VT VK QGS CW
Sbjct: 91 LMSSLRVP---------SQWQRNITYKSNPNQM-LPDSVDWREKGCVTEVKYQGSCGACW 140
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASE 173
AF+AV +E K++TG+LV+ S LVDCS GC F+ AF+YI + + SE
Sbjct: 141 AFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEKYGNKGCNGGFMTEAFQYIIDNKGIDSE 200
Query: 174 CVYPYQGRQDYYCDW---WRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA 229
YPY+ D C + +R++ KY + E+ L++ V+ + PV V +DA
Sbjct: 201 ASYPYKA-MDQKCQYDSKYRAATCSKYTEL------PYGREDVLKEAVANKGPVCVGVDA 253
Query: 230 TW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
+ F Y GV+ P C NHGV ++GYG E YWLVKN WG+N+ E G +
Sbjct: 254 SHSSFFLYRSGVYYDPACTQNVNHGVLVIGYGDLNGEE----YWLVKNSWGSNFGERGYI 309
Query: 287 RIFRGVGGSGLCNIAANAAYP 307
R+ R G C IA+ +YP
Sbjct: 310 RMARNKGNH--CGIASYPSYP 328
>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
Length = 331
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 111/333 (33%), Positives = 164/333 (49%), Gaps = 53/333 (15%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W + + YK++ E+ +R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++ + + P S W +N+ ++ DS+DW E+G VT
Sbjct: 79 HLGDMTSEEVMSLMSSLRVP---------SQWQRNITYKSNPNRILPDSVDWREKGCVTE 129
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCST GC F+ AF
Sbjct: 130 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAF 189
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDW---WRSSASGKYGAIRGYQYVQPATEEGLQDVV 218
+YI + + S+ YPY+ D C + +R++ KY + E+ L++ V
Sbjct: 190 QYIIDNKGIDSDASYPYKA-MDLKCQYDSKYRAATCSKYTEL------PYGREDVLKEAV 242
Query: 219 SRQ-PVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274
+ + PVSV +DA F Y GV+ P C NHGV +VGYG E YWLVKN
Sbjct: 243 ANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKE----YWLVKN 298
Query: 275 RWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
WG N+ E G +R+ R G C IA+ +YP
Sbjct: 299 SWGHNFGEEGYIRMARNKGNH--CGIASFPSYP 329
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 104/322 (32%), Positives = 161/322 (50%), Gaps = 47/322 (14%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + ++Y+ + E+ +R+KIF +N L +N+F DL +F
Sbjct: 8 EAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPHEF 67
Query: 60 LASYTGYKPPPTDHPHSNR----SNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-C 114
+ GY H R S + N + S ++DW ++GAVTPVKDQG
Sbjct: 68 AKMFNGY--------HGERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGS 119
Query: 115 CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLA 171
CWAF+A ++EG + +++G+LV+ S+ L+DCS GC ++NAF+YI+ +
Sbjct: 120 CWAFSATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGID 179
Query: 172 SECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT 230
+E YPY+ D C + + G+ +Q +E+ LQ V+ P+SVAIDA+
Sbjct: 180 TEESYPYEA-MDGDCRFKKEDVG---ATDTGFVDIQQGSEDDLQKAVATVGPISVAIDAS 235
Query: 231 W--FNFYHGGVFTGP-CGNTP-NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSM 286
F Y GV+ P C + +HGV VGYG + + YWLVKN W W + G +
Sbjct: 236 HSSFQLYSEGVYDEPNCSSEELDHGVLAVGYGV----KNGKKYWLVKNSWAETWGDNGYI 291
Query: 287 RIFRGVGGSGLCNIAANAAYPL 308
+ R C IA++A+YPL
Sbjct: 292 LMSRDKDNQ--CGIASSASYPL 311
>gi|401430288|ref|XP_003886537.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
gi|356491333|emb|CBZ40988.1| unnamed protein product [Leishmania mexicana MHOM/GT/2001/U1103]
Length = 533
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 99/305 (32%), Positives = 150/305 (49%), Gaps = 27/305 (8%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKF 59
AA E++ + R Y+ AE++ R F++N E +R + KF DL+ +F
Sbjct: 125 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEF 184
Query: 60 LASY---TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
A Y Y H + ++ + + D++DW E+GAVTPVKDQG+ C
Sbjct: 185 AARYLNGAAYFAAAKRH----AAQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 240
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ--RLAS 172
WAF+AV +EG + +LV+ S+ QLV C +N GC + AF+++ Q L +
Sbjct: 241 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLHT 300
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY Y + SS I G+ + + + + P+++A+DA+ F
Sbjct: 301 EDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSF 360
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV T G NHGV +VGY T G+ PYW++KN WG +W E G +R+ GV
Sbjct: 361 MSYKSGVLTACIGKQLNHGVLLVGYDMT----GEVPYWVIKNSWGGDWGEQGYVRVVMGV 416
Query: 293 GGSGL 297
L
Sbjct: 417 NACLL 421
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 112/332 (33%), Positives = 166/332 (50%), Gaps = 51/332 (15%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + + W + + YK++ E+ +R I++KN +F+ L N
Sbjct: 30 HKDPTLDRHWDLWKKTYGKQYKEKNEEGVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 89
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ A + + P S W +N+ ++ DS+DW ++G VT
Sbjct: 90 HLGDMTSEEVTALMSSLRVP---------SQWQRNVTYKSNPNQKLPDSVDWRDKGCVTD 140
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCS----TLNGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCS + GC F+ AF
Sbjct: 141 VKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSVGKYSNRGCNGGFMTEAF 200
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQ--PATEEGLQDVVS 219
+YI + SE YPY+ D C + KY A +Y + +E+ L++ V+
Sbjct: 201 QYIIDNNGIESEASYPYKA-MDGKCQY-----DSKYRAATCSRYTELPEDSEDALKEAVA 254
Query: 220 RQ-PVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNR 275
+ PVSVAIDA+ F Y GV+ P C NHGV +VGYG + YWLVKN
Sbjct: 255 NKGPVSVAIDASHPSFFLYRSGVYYDPACTLHVNHGVLVVGYGNLNGKD----YWLVKNS 310
Query: 276 WGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
WG ++ + G +R+ R G C IA+ A+YP
Sbjct: 311 WGLHFGDQGYIRMARNSGNH--CGIASYASYP 340
>gi|443181|pdb|1PIP|A Chain A, Crystal Structure Of
Papain-Succinyl-Gln-Val-Val-Ala-Ala-P- Nitroanilide
Complex At 1.7 Angstroms Resolution: Noncovalent Binding
Mode Of A Common Sequence Of Endogenous Thiol Protease
Inhibitors
gi|443194|pdb|1POP|A Chain A, X-Ray Crystallographic Structure Of A Papain-Leupeptin
Complex
gi|10120627|pdb|1CVZ|A Chain A, Crystal Structure Analysis Of Papain With
Clik148(Cathepsin L Specific Inhibitor)
gi|157830422|pdb|1BP4|A Chain A, Use Of Papain As A Model For The Structure-Based Design Of
Cathepsin K Inhibitors. Crystal Structures Of Two Papain
Inhibitor Complexes Demonstrate Binding To S'-Subsites.
gi|157830437|pdb|1BQI|A Chain A, Use Of Papain As A Model For The Structure-Based Design Of
Cathepsin K Inhibitors. Crystal Structures Of Two Papain
Inhibitor Complexes Demonstrate Binding To S'-Subsites.
gi|157833459|pdb|1PE6|A Chain A, Refined X-Ray Structure Of Papain(Dot)e-64-C Complex At
2.1-Angstroms Resolution
gi|157833550|pdb|1PPD|A Chain A, Restrained Least-Squares Refinement Of The Sulfhydryl
Protease Papain To 2.0 Angstroms
gi|157835640|pdb|2PAD|A Chain A, Binding Of Chloromethyl Ketone Substrate Analogues To
Crystalline Papain
gi|157836979|pdb|4PAD|A Chain A, Binding Of Chloromethyl Ketone Substrate Analogues To
Crystalline Papain
gi|157837114|pdb|6PAD|A Chain A, Binding Of Chloromethyl Ketone Substrate Analogues To
Crystalline Papain
gi|157879620|pdb|1PAD|A Chain A, Binding Of Chloromethyl Ketone Substrate Analogues To
Crystalline Papain
gi|157884465|pdb|5PAD|A Chain A, Binding Of Chloromethyl Ketone Substrate Analogues To
Crystalline Papain
Length = 212
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 94/219 (42%), Positives = 125/219 (57%), Gaps = 19/219 (8%)
Query: 96 IDWNERGAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCA 153
+DW ++GAVTPVK+QGS CWAF+AV T+EG+ KIRTG L S+ +L+DC + GC
Sbjct: 5 VDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIRTGNLNQYSEQELLDCDRRSYGCN 64
Query: 154 KNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGA-IRGYQYVQPATEE 212
+ +A + + QY + YPY+G Q Y RS G Y A G + VQP +
Sbjct: 65 GGYPWSALQLVAQYG-IHYRNTYPYEGVQRY----CRSREKGPYAAKTDGVRQVQPYNQG 119
Query: 213 GLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYW 270
L ++ QPVSV + A F Y GG+F GPCGN +H V VGYG Y
Sbjct: 120 ALLYSIANQPVSVVLQAAGKDFQLYRGGIFVGPCGNKVDHAVAAVGYGPN--------YI 171
Query: 271 LVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYPL 308
L+KN WGT W E G +RI RG G S G+C + ++ YP+
Sbjct: 172 LIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 210
>gi|114559418|ref|XP_001171268.1| PREDICTED: cathepsin S isoform 3 [Pan troglodytes]
gi|397492866|ref|XP_003817341.1| PREDICTED: cathepsin S isoform 1 [Pan paniscus]
gi|410225070|gb|JAA09754.1| cathepsin S [Pan troglodytes]
gi|410251608|gb|JAA13771.1| cathepsin S [Pan troglodytes]
gi|410328325|gb|JAA33109.1| cathepsin S [Pan troglodytes]
gi|410328327|gb|JAA33110.1| cathepsin S [Pan troglodytes]
Length = 331
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 111/333 (33%), Positives = 164/333 (49%), Gaps = 53/333 (15%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W + + YK++ E+ +R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++ + + P S W +N+ ++ DS+DW E+G VT
Sbjct: 79 HLGDMTSEEVMSLMSSLRVP---------SQWQRNITYKSNPNQILPDSVDWREKGCVTE 129
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCST GC F+ AF
Sbjct: 130 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAF 189
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDW---WRSSASGKYGAIRGYQYVQPATEEGLQDVV 218
+YI + + S+ YPY+ D C + +R++ KY + E+ L++ V
Sbjct: 190 QYIIDNKGIDSDASYPYKA-TDQKCQYDSKYRAATCSKYTEL------PYGREDVLKEAV 242
Query: 219 SRQ-PVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274
+ + PVSV +DA F Y GV+ P C NHGV +VGYG E YWLVKN
Sbjct: 243 ANKGPVSVGVDALHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKE----YWLVKN 298
Query: 275 RWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
WG N+ E G +R+ R G C IA+ +YP
Sbjct: 299 SWGHNFGEEGYIRMARNKGNH--CGIASFPSYP 329
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/314 (33%), Positives = 160/314 (50%), Gaps = 45/314 (14%)
Query: 24 RTYKDQAEKEMRFKIFKKN------HEFL----------RLNKFADLTREKFLASYTGYK 67
+ Y Q E++ R KI+ +N H L +NKF DL +F + GY+
Sbjct: 36 KEYPSQLEEKFRMKIYLENKHKVAKHNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQ 95
Query: 68 PPPTDHPHSNRS---NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC--CWAFTAVA 122
H N S + F + + + +S+DW E+GA+TPVKDQG C CWAF++
Sbjct: 96 -----HKKQNSSRAESTFTFMEPANVEVPESVDWREKGAITPVKDQGQ-CGPCWAFSSTG 149
Query: 123 TVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRLASECVYPYQ 179
+EG +TG+LV+ + L+DCS GC ++ AF+YI+ + + +E YPY+
Sbjct: 150 ALEGQTFRKTGKLVSLREQNLIDCSGKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYE 209
Query: 180 GRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDATW--FNFYH 236
D C R + + RG+ + E+ L+ V+ PVSVAIDA+ F FY
Sbjct: 210 AEDD-VC---RYNPRNRGAVDRGFVDIPSGEEDKLKAAVATVGPVSVAIDASHESFQFYS 265
Query: 237 GGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGVGG 294
GV+ P ++ +HGV +VGYG+ + + YWLVKN W +W + G ++I R
Sbjct: 266 KGVYYEPSCDSDDLDHGVLVVGYGS----DNGKDYWLVKNSWSEHWGDQGYIKIARNRKN 321
Query: 295 SGLCNIAANAAYPL 308
C +A A+YPL
Sbjct: 322 H--CGVATAASYPL 333
>gi|24638018|sp|P83443.1|MDO1_PSEMR RecName: Full=Macrodontain-1; AltName: Full=Macrodontain I
Length = 213
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 89/219 (40%), Positives = 132/219 (60%), Gaps = 19/219 (8%)
Query: 95 SIDWNERGAVTPVKDQGSYC--CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNGC 152
SIDW + GAV VK+QG C CWAF A+ATVEG+ KIR G LV S+ +++DC+ GC
Sbjct: 5 SIDWRDYGAVNEVKNQGP-CGGCWAFAAIATVEGIYKIRKGNLVYLSEQEVLDCAVSYGC 63
Query: 153 AKNFLENAFEYIRQYQRLASECVYPYQGRQDYY-CDWWRSSASGKYGAIRGYQYVQPATE 211
++ A+++I + ++ YPY+ Q +++ +SA I GY YV+ E
Sbjct: 64 KGGWVNRAYDFIISNNGVTTDENYPYRAYQGTCNANYFPNSAY-----ITGYSYVRRNDE 118
Query: 212 EGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPY 269
+ VS QP++ IDA+ F +Y GGV++GPCG + NH +TI+GY G+ Y
Sbjct: 119 SHMMYAVSNQPIAALIDASGDNFQYYKGGVYSGPCGFSLNHAITIIGY-------GRDSY 171
Query: 270 WLVKNRWGTNWDEGGSMRIFRGVGGS-GLCNIAANAAYP 307
W+V+N WG++W +GG +RI R V S G+C IA + +P
Sbjct: 172 WIVRNSWGSSWGQGGYVRIRRDVSHSGGVCGIAMSPLFP 210
>gi|15593246|gb|AAL02220.1|AF410880_1 cysteine protease CP7 precursor [Frankliniella occidentalis]
Length = 333
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 108/320 (33%), Positives = 165/320 (51%), Gaps = 33/320 (10%)
Query: 11 IAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRLNKFADLTREKFLASYTGYKPPP 70
I A E + A+TY + AE+ R K+FK+N +R+ K D + GY
Sbjct: 24 IQAHWESFKATHAKTYANAAEEAYRAKVFKENA--IRIAKHNDRFASGEVTFKVGYNQYA 81
Query: 71 TDHPH--SNRSNWFKNLNSSKMSFYDS-----------IDWNERGAVTPVKDQGSY-CCW 116
H H + + N +++ +F + +DW +GAVTP+KDQG CW
Sbjct: 82 DMHTHEVTEKLNGYRSGLKQASAFVHTASNDSWPWSKKVDWRSKGAVTPIKDQGQCGSCW 141
Query: 117 AFTAVATVEGLNKIRTGQLVTRSKHQLVDCS---TLNGCAKNFLENAFEYIRQYQRLASE 173
+F+A ++EG ++ LV+ S+ LVDCS GC +++AFEY++ Y + +E
Sbjct: 142 SFSATGSLEGQLFLKNKNLVSLSEQNLVDCSWDFGNEGCNGGLMDSAFEYVKSYGGIDTE 201
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDAT-W 231
YPY +D C + A+ G GY+ VQ +E L+D V + PVSVAIDA+ W
Sbjct: 202 ESYPYTA-EDGTCLY---KAANNAGVNTGYKDVQAKSESALRDAVEKVGPVSVAIDASNW 257
Query: 232 -FNFYHGGVFTGPC--GNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRI 288
F Y G++ P ++ +HGV VGYG+ + +W+VKN WGT+W E G +++
Sbjct: 258 SFQMYTSGIYYEPACSSDSLDHGVLAVGYGSEWP---NKEFWIVKNSWGTSWGEEGYIKM 314
Query: 289 FRGVGGSGLCNIAANAAYPL 308
R + C IA A+YPL
Sbjct: 315 ARNKKNN--CGIATEASYPL 332
>gi|401430350|ref|XP_003886559.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
gi|356491516|emb|CBZ40966.1| unnamed protein product, partial [Leishmania mexicana
MHOM/GT/2001/U1103]
Length = 503
Score = 155 bits (391), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 99/305 (32%), Positives = 150/305 (49%), Gaps = 27/305 (8%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKF 59
AA E++ + R Y+ AE++ R F++N E +R + KF DL+ +F
Sbjct: 95 AALFEEFKRTYGRAYETLAEEQQRLANFERNLELMREHQARNPHAQFGITKFFDLSEAEF 154
Query: 60 LASY---TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
A Y Y H + ++ + + D++DW E+GAVTPVKDQG+ C
Sbjct: 155 AARYLNGAAYFAAAKRHA----AQHYRKARADLSAVPDAVDWREKGAVTPVKDQGACGSC 210
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQ--RLAS 172
WAF+AV +EG + +LV+ S+ QLV C +N GC + AF+++ Q L +
Sbjct: 211 WAFSAVGNIEGQWYLAGHELVSLSEQQLVSCDDMNDGCDGGLMLQAFDWLLQNTNGHLYT 270
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY Y + SS I G+ + + + + P+++A+DA+ F
Sbjct: 271 EDSYPYVSGNGYVPECSNSSELVVGAQIDGHVLIGSSEKAMAAWLAKNGPIAIALDASSF 330
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV T G NHGV +VGY T G+ PYW++KN WG +W E G +R+ GV
Sbjct: 331 MSYKSGVLTACIGKQLNHGVLLVGYDMT----GEVPYWVIKNSWGGDWGEQGYVRVVMGV 386
Query: 293 GGSGL 297
L
Sbjct: 387 NACLL 391
>gi|394331816|gb|AFN27127.1| cysteine protease [Leishmania tropica]
Length = 443
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 101/315 (32%), Positives = 151/315 (47%), Gaps = 27/315 (8%)
Query: 12 AAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR------------LNKFADLTREKF 59
AA E++ + R Y AE++ R F++N E +R + KF DL+ +F
Sbjct: 35 AALFEEFKRTYRRAYGTVAEEQQRLANFERNLELMREHQARNPHARFGITKFFDLSEAEF 94
Query: 60 LASY---TGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CC 115
A Y Y H + ++ + + D++DW ++GAVTPVKDQG+ C
Sbjct: 95 AARYLNGAAYFAAAKQHAGQH----YRKARADLSAVPDAVDWRKKGAVTPVKDQGACGSC 150
Query: 116 WAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL-NGCAKNFLENAFEYIRQYQR--LAS 172
WAF+AV ++E + +L S+ QLV C NGCA + AFE++ + + +
Sbjct: 151 WAFSAVGSIESQWALAGHRLTALSEQQLVSCDDKDNGCAGGLMLQAFEWLLRNMNGTMFT 210
Query: 173 ECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQPVSVAIDATWF 232
E YPY Y + SS I GY ++ + + P+S+A+DA+ F
Sbjct: 211 EDSYPYVSSTGYVPECSNSSQLVPGARIDGYLTIESSETVMAAWLAKNGPISIAVDASSF 270
Query: 233 NFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
Y GV T G+ NHGV +VGY T G+ PYW++KN WG NW E G +R+ GV
Sbjct: 271 MSYQSGVLTSCAGDALNHGVLLVGYNRT----GEVPYWVIKNSWGENWGENGYVRVTMGV 326
Query: 293 GGSGLCNIAANAAYP 307
L +A P
Sbjct: 327 NACLLTEYPVSAHVP 341
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 111/318 (34%), Positives = 158/318 (49%), Gaps = 47/318 (14%)
Query: 18 WMVEFARTYKDQAEKEMRFKIFKKN---------------HEF-LRLNKFADLTREKFLA 61
W + R Y+++ E+ R I++KN H + L +N AD+T E+ +
Sbjct: 39 WKKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMGMHSYDLGMNHLADMTSEEVSS 98
Query: 62 SYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWA 117
+ + P S W N+ ++S DS+DW E+G VT VK QG+ CWA
Sbjct: 99 LMSSLRVP---------SQWQANVTYKSNSNQKLPDSVDWREKGCVTEVKYQGACGACWA 149
Query: 118 FTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAFEYIRQYQRLASE 173
F+AV +E K++TG LV+ S LVDCST GC F+ AF+YI + SE
Sbjct: 150 FSAVGALEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGFMTKAFQYIIDNNGIDSE 209
Query: 174 CVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA--T 230
YPY+ D C R + + Y + +E+ L++ V+ + PVSVAIDA +
Sbjct: 210 VSYPYKA-MDGNC---RYDSKHRAATCSKYTELPFGSEDALKEAVANKGPVSVAIDAKHS 265
Query: 231 WFNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIF 289
F Y GV+ P C NHGV +VGYG + YWLVKN WG N+ E G +R+
Sbjct: 266 SFFLYKSGVYYDPSCTQNVNHGVLVVGYGNLN----GRDYWLVKNSWGLNFGEQGYIRMA 321
Query: 290 RGVGGSGLCNIAANAAYP 307
R G C IA+ +YP
Sbjct: 322 RNSGNH--CGIASYPSYP 337
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 105/322 (32%), Positives = 162/322 (50%), Gaps = 40/322 (12%)
Query: 16 EQWM---VEFARTYKDQAEKEMRFKIFKKN-HEF---------------LRLNKFADLTR 56
E+W +E + Y + E+ R KIF +N H+ L LNK+AD+
Sbjct: 25 EEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIAKHNQLYAQGKVSFKLGLNKYADMLH 84
Query: 57 EKFLASYTGYKPPPTDHPHSNRS-NWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSYC- 114
+F + GY + N ++ + + ++DW + GAVT VKDQG +C
Sbjct: 85 HEFKETMNGYNHTMRKELRAQEGFNGITYISPANVQVPKAVDWRQHGAVTSVKDQG-HCG 143
Query: 115 -CWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL---NGCAKNFLENAFEYIRQYQRL 170
CW+F++ ++EG + + G LV+ S+ LVDCST NGC ++NAF YI+ +
Sbjct: 144 SCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGV 203
Query: 171 ASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSRQ-PVSVAIDA 229
+E YPY+G D C + +++ G+ + EE + V+ PV+VAIDA
Sbjct: 204 DTEKSYPYEGIDD-SCHFNKATVG---ATDTGFVDIPQGDEEAMMKAVATMGPVAVAIDA 259
Query: 230 T--WFNFYHGGVFTGPCGNTPN--HGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGS 285
+ F Y GV+ P ++ N HGV +VGYGT + Q YWLVKN WGT W + G
Sbjct: 260 SNESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGTDKDG---QDYWLVKNSWGTTWGDQGY 316
Query: 286 MRIFRGVGGSGLCNIAANAAYP 307
+++ R C IA +++P
Sbjct: 317 IKMARNQDNQ--CGIATASSFP 336
>gi|350421176|ref|XP_003492760.1| PREDICTED: hypothetical protein LOC100745708 [Bombus impatiens]
Length = 884
Score = 154 bits (390), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 153/313 (48%), Gaps = 33/313 (10%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEFLR-------------LNKFADLTREKFLAS 62
E ++ +F +TY EK RFKIFK+N + + + FADLT ++F A
Sbjct: 580 EAFIKKFGKTYNSADEKLDRFKIFKQNLKIIEELQTFERGTAEYGVTMFADLTPKEFKAR 639
Query: 63 YTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAFTAV 121
Y G +P + H N +S DW + VTPVKDQG CWAF+
Sbjct: 640 YLGLRP---ELKHENEIP-LPEAEIPDVSLPLKFDWRDHSVVTPVKDQGQCGSCWAFSVT 695
Query: 122 ATVEGLNKIRTGQLVTRSKHQLVDCSTLN-GCAKNFLENAFEYIRQYQRLASECVYPYQG 180
VEG I+ QL++ S+ +LVDC +L+ GC +ENA++ I + L E YPY
Sbjct: 696 GNVEGQYAIKHNQLLSLSEQELVDCDSLDEGCNGGDMENAYKAIERLGGLELESDYPYDA 755
Query: 181 RQDYYCDWWRSSASGKYGAIRGYQYVQPATEEG--LQDVVSRQPVSVAIDATWFNFYHGG 238
+ D C + ++ A ++ V ++E Q +V P+SV I+A FY GG
Sbjct: 756 K-DEKCHFLQNKAK-----VQVVSAVNITSDEKRMAQWLVKNGPISVGINANAMQFYFGG 809
Query: 239 V---FTGPCG-NTPNHGVTIVGYGTTTEA--EGQQPYWLVKNRWGTNWDEGGSMRIFRGV 292
V C +HGV IVGYG + + PYW++KN WG W E G R++RG
Sbjct: 810 VSHPLNFLCNPKNLDHGVLIVGYGISKYPLFHKELPYWIIKNSWGPRWGERGYYRVYRGD 869
Query: 293 GGSGLCNIAANAA 305
G G+ +A +A
Sbjct: 870 GTCGVNTMATSAV 882
>gi|228244|prf||1801240B Cys protease 2
Length = 323
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 106/318 (33%), Positives = 161/318 (50%), Gaps = 41/318 (12%)
Query: 16 EQWMVEFARTYKDQAEKEMRFKIFKKNHEF----------------LRLNKFADLTREKF 59
E + ++ R Y D E R IF++N ++ L +NKF D+T E+F
Sbjct: 21 EHFKGKYGRQYVDAEEDSYRRVIFEQNQKYIEEFNKKYENGEVTFNLAMNKFGDMTLEEF 80
Query: 60 LASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNERGAVTPVKDQGSY-CCWAF 118
A G P + S ++ + + +DW +GAVTPVKDQG CWAF
Sbjct: 81 NAVMKGNIP----RRSAPVSVFYPKKETGPQA--TEVDWRTKGAVTPVKDQGQCGSCWAF 134
Query: 119 TAVATVEGLNKIRTGQLVTRSKHQLVDCST---LNGCAKNFLENAFEYIRQYQRLASECV 175
+ ++EG + ++TG L++ ++ QLVDCS GC ++ +AF+YI+ + +E
Sbjct: 135 STTGSLEGQHFLKTGSLISLAEQQLVDCSRPYGPQGCNGGWMNDAFDYIKANNGIDTEAS 194
Query: 176 YPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVSR-QPVSVAIDA--TWF 232
YPY+ R D C + +S + G+ + +E GLQ V P+SV IDA + F
Sbjct: 195 YPYEAR-DGSCRFDSNSVA---ATCSGHTNIASGSETGLQQAVRDIGPISVTIDAAHSSF 250
Query: 233 NFYHGGVFTGPCGNTP--NHGVTIVGYGTTTEAEGQQPYWLVKNRWGTNWDEGGSMRIFR 290
FY GV+ P + +H V VGYG+ EG Q +WLVKN W T+W + G +++ R
Sbjct: 251 QFYSSGVYYEPSCSPSYLDHAVLAVGYGS----EGGQDFWLVKNSWATSWGDAGYIKMSR 306
Query: 291 GVGGSGLCNIAANAAYPL 308
+ C IA A+YPL
Sbjct: 307 NRNNN--CGIATVASYPL 322
>gi|45384464|ref|NP_990302.1| cathepsin K precursor [Gallus gallus]
gi|25089842|sp|Q90686.1|CATK_CHICK RecName: Full=Cathepsin K; AltName: Full=JTAP-1; Flags: Precursor
gi|1017831|gb|AAC59739.1| JTAP-1 [Gallus gallus]
Length = 334
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 105/273 (38%), Positives = 149/273 (54%), Gaps = 22/273 (8%)
Query: 43 HEF-LRLNKFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNLNSSKMSFYDSIDWNER 101
H F L +N D+T E+ + + TG + P P N + + + +S + ++DW +
Sbjct: 74 HSFQLAMNYLGDMTSEEVVRTMTGLRVP-RSRPRPNGTLYVPDWSSRAPA---AVDWRRK 129
Query: 102 GAVTPVKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDC-STLNGCAKNFLEN 159
G VTPVKDQG CWAF++V +EG K RTG+L++ S LV C S NGC ++ N
Sbjct: 130 GYVTPVKDQGQCGSCWAFSSVGALEGQLKRRTGKLLSLSPQNLVYCVSNNNGCGGGYMTN 189
Query: 160 AFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPATEEGLQDVVS 219
AFEY+R + + SE YPY G QD C + S +GK RGY+ + E+ L+ V+
Sbjct: 190 AFEYVRLNRGIDSEDAYPYIG-QDESCMY---SPTGKAAKCRGYREIPEDNEKALKRAVA 245
Query: 220 R-QPVSVAIDATW--FNFYHGGVF--TGPCGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274
R PVSV IDA+ F FY GV+ TG NH V VGYG A+ +W++KN
Sbjct: 246 RIGPVSVGIDASLPSFQFYSRGVYYDTGCNPENINHAVLAVGYG----AQKGTKHWIIKN 301
Query: 275 RWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
WGT W G + + R + + C IA A++P
Sbjct: 302 SWGTEWGNKGYVLLARNMKQT--CGIANLASFP 332
>gi|215261455|pdb|3F75|A Chain A, Activated Toxoplasma Gondii Cathepsin L (Tgcpl) In Complex
With Its Propeptide
Length = 224
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 85/221 (38%), Positives = 127/221 (57%), Gaps = 16/221 (7%)
Query: 96 IDWNERGAVTPVKDQ---GSYCCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTLNG- 151
+DW RG VTPVKDQ GS CWAF+ +EG + +TG+LV+ S+ +L+DCS G
Sbjct: 11 VDWRSRGCVTPVKDQRDCGS--CWAFSTTGALEGAHCAKTGKLVSLSEQELMDCSRAEGN 68
Query: 152 --CAKNFLENAFEYIRQYQRLASECVYPYQGRQDYYCDWWRSSASGKYGAIRGYQYVQPA 209
C+ + +AF+Y+ + SE YPY R D C R+ + K I G++ V
Sbjct: 69 QSCSGGEMNDAFQYVLDSGGICSEDAYPYLAR-DEEC---RAQSCEKVVKILGFKDVPRR 124
Query: 210 TEEGLQDVVSRQPVSVAIDATW--FNFYHGGVFTGPCGNTPNHGVTIVGYGTTTEAEGQQ 267
+E ++ +++ PVS+AI+A F FYH GVF CG +HGV +VGYGT E+ ++
Sbjct: 125 SEAAMKAALAKSPVSIAIEADQMPFQFYHEGVFDASCGTDLDHGVLLVGYGTDKES--KK 182
Query: 268 PYWLVKNRWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYPL 308
+W++KN WGT W G M + G G C + +A++P+
Sbjct: 183 DFWIMKNSWGTGWGRDGYMYMAMHKGEEGQCGLLLDASFPV 223
>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
Length = 331
Score = 154 bits (390), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 111/333 (33%), Positives = 164/333 (49%), Gaps = 53/333 (15%)
Query: 6 HKTGNIAAKHEQWMVEFARTYKDQAEKEMRFKIFKKNHEFLRL----------------N 49
HK + W + + YK++ E+ +R I++KN +F+ L N
Sbjct: 19 HKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMN 78
Query: 50 KFADLTREKFLASYTGYKPPPTDHPHSNRSNWFKNL---NSSKMSFYDSIDWNERGAVTP 106
D+T E+ ++ + + P S W +N+ ++ DS+DW E+G VT
Sbjct: 79 HLGDMTSEEVMSLMSSLRVP---------SQWQRNITYKSNPNRILPDSVDWREKGCVTE 129
Query: 107 VKDQGSY-CCWAFTAVATVEGLNKIRTGQLVTRSKHQLVDCSTL----NGCAKNFLENAF 161
VK QGS CWAF+AV +E K++TG+LV+ S LVDCST GC F+ AF
Sbjct: 130 VKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAF 189
Query: 162 EYIRQYQRLASECVYPYQGRQDYYCDW---WRSSASGKYGAIRGYQYVQPATEEGLQDVV 218
+YI + + S+ YPY+ D C + +R++ KY + E+ L++ V
Sbjct: 190 QYIIDNKGIDSDASYPYKA-MDQKCQYDSKYRAATCSKYTEL------PYGREDVLKEAV 242
Query: 219 SRQ-PVSVAIDATW--FNFYHGGVFTGP-CGNTPNHGVTIVGYGTTTEAEGQQPYWLVKN 274
+ + PVSV +DA F Y GV+ P C NHGV +VGYG E YWLVKN
Sbjct: 243 ANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGDLNGKE----YWLVKN 298
Query: 275 RWGTNWDEGGSMRIFRGVGGSGLCNIAANAAYP 307
WG N+ E G +R+ R G C IA+ +YP
Sbjct: 299 SWGHNFGEEGYIRMARNKGNH--CGIASFPSYP 329
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.319 0.133 0.434
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,202,982,530
Number of Sequences: 23463169
Number of extensions: 220193398
Number of successful extensions: 486420
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4046
Number of HSP's successfully gapped in prelim test: 2814
Number of HSP's that attempted gapping in prelim test: 460666
Number of HSP's gapped (non-prelim): 7522
length of query: 308
length of database: 8,064,228,071
effective HSP length: 142
effective length of query: 166
effective length of database: 9,027,425,369
effective search space: 1498552611254
effective search space used: 1498552611254
T: 11
A: 40
X1: 16 ( 7.4 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.8 bits)
S2: 76 (33.9 bits)