BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019112
(346 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 197/339 (58%), Positives = 242/339 (71%), Gaps = 13/339 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
MFV +++V SQ S RS+H+ ++ E+HE WM ++GR YKD EK R IF+ N+E+
Sbjct: 10 MFVALLVVGLWVSQAWS-RSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE NK GNR YKL NEF+DLTNEEF+AS GY R S + S S+F+Y NVT VP
Sbjct: 69 IESFNKPGNRPYKLDINEFADLTNEEFKASRNGYKR---SSNVGLSEKSSFRYGNVTAVP 125
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DN 190
TS+DWR+KGAVT IK+QG CG CWAFSAVAA+EGIT+++ GKLI LSEQ+LVDC T ++
Sbjct: 126 TSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GGLMD AFE+I +N GL TEA+YPYQ GTC+ K AA I YED+P E
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
ALL+AV QPVSV ++ASG AF+FY GV +CG DHGV VG+GT+ DG KYWL
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTS---DGTKYWL 302
Query: 311 IKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+KNSWG +WGE GYIR+ RD EGLCGIA ++SYP A
Sbjct: 303 VKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYPTA 341
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 397 bits (1020), Expect = e-108, Method: Compositional matrix adjust.
Identities = 196/339 (57%), Positives = 241/339 (71%), Gaps = 12/339 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
MFV +++V ASQ S RS+H+ ++ E+HE WMA++GR YKD EK R IF+ N+E+
Sbjct: 10 MFVALLVVGLWASQAWS-RSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE NK GNR YKL NEF+DLTNEEF+ S GY R S + S+F+Y NVT VP
Sbjct: 69 IESFNKLGNRPYKLDINEFADLTNEEFKVSKNGYKR---SSGVGLTEKSSFRYANVTAVP 125
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DN 190
TS+DWR+ GAVT IK+QG CG CWAFSAVAA+EGIT+++ GKLI LSEQ+LVDC T ++
Sbjct: 126 TSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GGLMD AFE+I +N GL TEA+YPYQ GTC+ K AA I YED+P E
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
ALL+AV QPVSV ++ASG AF+FY GV +CG DHGV VG+GT+ +DG KYWL
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTS--DDGTKYWL 303
Query: 311 IKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+KNSWG +WGE GYIR+ RD EGLCGIA + SYP A
Sbjct: 304 VKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYPTA 342
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 191/339 (56%), Positives = 240/339 (70%), Gaps = 10/339 (2%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ + ++LV ASQ S RS+HE S+ +H+ WM Q+GR YK +EK R IFK+N+E+
Sbjct: 10 VLMAMLLVTLWASQSWS-RSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N GN+ YKLG N F+DLTNEEFRAS+ GY + S + S R +F+Y+NVT VP
Sbjct: 69 IESFNNNGNKPYKLGINAFTDLTNEEFRASHNGYTMSMSS-HQSSYRTKSFRYENVTAVP 127
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--N 190
S+DWR KGAVTHIK+QG CG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC T +
Sbjct: 128 PSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGMD 187
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GGLMD AFE+IIEN GL TEA+YPY+ G+C+ +K AA I YE++P DE
Sbjct: 188 QGCEGGLMDDAFEFIIENNGLTTEANYPYEGVDGSCNTRKAANHAAKITGYENVPAYDEE 247
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
AL +AV QPVSV ++A AF+ Y G+ +CG DHGV VVG+GT+ +DG KYWL
Sbjct: 248 ALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGTELDHGVTVVGYGTS--DDGTKYWL 305
Query: 311 IKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+KNSWG +WGE GYIR+ RD EGLCGIA E SYP A
Sbjct: 306 VKNSWGTSWGEDGYIRMERDIDAKEGLCGIAMEPSYPTA 344
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 184/337 (54%), Positives = 239/337 (70%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++ V+ + + RS+HE S+ E+HE WM Q+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEFRAS + + S + ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICS-----TEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNG 192
+DWR+KGAVT IK+QG CGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
CSGGLMD AF++I +N GL TEA+YPY GTC+++K AA I YED+P +E AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QP++V ++ASG F+FY GV +CG DHGVA VG+GT+ +DG KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTS--DDGMKYWLVK 304
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
NSW WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 305 NSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 183/337 (54%), Positives = 238/337 (70%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++ V+ + + R +HE S+ E+HE WM Q+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARXLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEFRAS + + S + ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICS-----TEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNG 192
+DWR+KGAVT IK+QG CGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
CSGGLMD AF++I +N GL TEA+YPY GTC+++K AA I YED+P +E AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QP++V ++ASG F+FY GV +CG DHGVA VG+GT+ +DG KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTS--DDGMKYWLVK 304
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
NSW WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 305 NSWSTGWGEEGYIRMQRDVTVKEGLCGIAMQASYPTA 341
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/337 (54%), Positives = 235/337 (69%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++LV S + R++ + S+ E+HEQWMAQ+G+ YKD EK +R IFK+N++ IE
Sbjct: 12 LTLLLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKENVQRIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
N GN++YKLG N+F+DLTNEEF+A NR + S+R TFKY++VT VP S
Sbjct: 72 AFNNAGNKSYKLGINQFADLTNEEFKAR----NRFKGHMCSNSTRTPTFKYEHVTSVPAS 127
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNG 192
+DWR+KGAVT IK+QG CG CWAFSAVAA EGIT+++ GKLI LSEQ+LVDC T + G
Sbjct: 128 LDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQG 187
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C GGLMD AF++I++NKGL TEA YPYQ TC+ E AA+I +ED+P E AL
Sbjct: 188 CEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESAL 247
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
L+AV QP+SV ++ASG F+FY GV CG DHGV VG+G+ + G KYWL+K
Sbjct: 248 LKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELDHGVTAVGYGS---DGGTKYWLVK 304
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
NSWGE WGE GYIR+ RD EGLCG A +ASYP A
Sbjct: 305 NSWGEQWGEQGYIRMQRDVAAEEGLCGFAMQASYPTA 341
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/313 (59%), Positives = 232/313 (74%), Gaps = 14/313 (4%)
Query: 39 VEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
+E+HE WMAQ+GR YK +EK RL IFK N+E+IE NK G + YKL NEF+DLTNEE
Sbjct: 1 MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEE 60
Query: 99 FRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAF 158
F+AS GY + +S S++P F+Y+NV+ VP+++DWR+KGAVT IK+QG CG CWAF
Sbjct: 61 FQASRNGY-KMSAHLSSSSTKP--FRYENVSAVPSTMDWRKKGAVTPIKDQGQCGCCWAF 117
Query: 159 SAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEAD 216
SAVAA EGITQ++ GKLI LSEQ+LVDC T ++ GC+GGLMD AF++II+NKGL TEA+
Sbjct: 118 SAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEAN 177
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPYQ G C+ K AAA I YED+P E ALL+AV QPVSV ++A G AF+FY
Sbjct: 178 YPYQGADGACNSGK---AAAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYS 234
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV +CG + DHGV VG+G + +DG KYWL+KNSWG +WGE+GYIR+ RD EG
Sbjct: 235 SGVFTGDCGTDLDHGVTAVGYGMS--DDGTKYWLVKNSWGTSWGENGYIRMERDIDAQEG 292
Query: 333 LCGIATEASYPVA 345
LCGIA EASYP A
Sbjct: 293 LCGIAMEASYPTA 305
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 182/337 (54%), Positives = 239/337 (70%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++ V+ + + R++HE S+ E+HE WM Q+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEFRAS + + S + ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICS-----TEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNG 192
+DWR+KGAVT IK+QG CGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
CSGGLMD AF++I +N GL TEA+YPY GTC+++K AA I YED+P +E AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QP++V ++A G F+FY GV +CG DHGV+ VG+GT+ +DG KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTS--DDGMKYWLVK 304
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
NSWG WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 305 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 180/338 (53%), Positives = 231/338 (68%), Gaps = 11/338 (3%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F IL++ + V+ R + EPS+ +HEQWM G+ Y D EK R IFK N+EYI
Sbjct: 10 FFAFILILGMWAYEVASRELQEPSMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEYI 69
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N GN+ YKL N+F+DLTNEE + + GY RP+ + + + ++FKY+NVT VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADLTNEELKVARNGYRRPLQT---RPMKVTSFKYENVTAVPA 126
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNN 191
++DWR+KGAVT IK+QG CGSCWAFS VAA EGI Q+T GKL+ LSEQ+LVDC T ++
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGEDQ 186
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLM+ FE+II+N G+ TEA+YPYQ GTC+ +KE + A I YE +P E A
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKEASRIAKITGYESVPANSEAA 246
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
LL+AV QP+SV ++A G F+FY GV +CG DHGV VG+G E DG KYWL+
Sbjct: 247 LLKAVASQPISVSIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYG--ETSDGTKYWLV 304
Query: 312 KNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KNSWG +WGE GYIR+ RD EGLCGIA ++SYP A
Sbjct: 305 KNSWGTSWGEEGYIRMQRDTEAEEGLCGIAMDSSYPTA 342
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 183/339 (53%), Positives = 241/339 (71%), Gaps = 12/339 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M +IL+ A Q S R++ E S+ E+HEQWM Q+GR YKDE EK++R IF N+++
Sbjct: 29 MIAALILLGAWACQATS-RTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKF 87
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE+ NK+G ++YKL NEF+D TNEEF+AS GY +VS + S+ + F+Y+NVT VP
Sbjct: 88 IEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKM---AVSSRPSQTTLFRYENVTAVP 144
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC--STDN 190
+S+DWR+KGAVT +K+QG CGSCWAFS +AA EGIT++ GKLI LSEQ+LVDC + ++
Sbjct: 145 SSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKTGED 204
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GG M+ FE+I++NKG+A EA YPY GTC+ ++E + AA I YE +P E
Sbjct: 205 QGCEGGYMEDGFEFIVKNKGIALEASYPYTAADGTCNSKEEASRAAKISGYEKVPANSET 264
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
ALL+AV QPVSV ++ASG AF+FY GV ECG + DHGV VG+G + DG KYWL
Sbjct: 265 ALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYG--KTSDGTKYWL 322
Query: 311 IKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
+KNSWG +WG+SGYI + R GLCGIA +ASYP A
Sbjct: 323 VKNSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYPTA 361
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 379 bits (974), Expect = e-103, Method: Compositional matrix adjust.
Identities = 182/337 (54%), Positives = 238/337 (70%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++ V+ + R++HE S+ E+HE WMAQ+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASHAKARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK N++YKL NEF+DLTNEEFRAS + + S + ++FKY++V VP++
Sbjct: 72 SFNKAMNKSYKLSINEFADLTNEEFRASRNRFKAHICS-----TEATSFKYEHVXAVPST 126
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNG 192
+DWR+KGAVT IK+QG CGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
CSGGLMD AF++I +N GL TEA+YPY GTC+++K AA I YED+P +E AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QP++V ++A G F+FY GV +CG DHGV+ VG+GT+ +DG KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSAVGYGTS--DDGMKYWLVK 304
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
NSWG WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 305 NSWGTGWGEEGYIRMQRDVTEKEGLCGIAMQASYPTA 341
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 189/349 (54%), Positives = 248/349 (71%), Gaps = 20/349 (5%)
Query: 10 IIPMFVIIILVIT-CASQVVSGRS---MHEPSIVEKHEQWMAQHGRTYKDELE--KAMRL 63
++ +F+ + LV++ C S ++G S + E S+ +HE+WM+QHGR Y DE E K R
Sbjct: 3 LLQIFLFVALVLSFCFSIQLAGLSRPLLDEDSM--RHEEWMSQHGRVYADEQEDHKNKRF 60
Query: 64 TIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF 123
+FK+N+E IE+ N +T+KL N+F+DLTNEEFRASY G+ P+ +S Q ++P+ F
Sbjct: 61 NVFKENVERIEEFND--GKTFKLAINQFADLTNEEFRASYNGFKGPM-VLSSQITKPTPF 117
Query: 124 KYQNVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
+Y+NV+ +P S+DWR+KGAVT +KNQG CG CWAFSAVAA+EGITQI+ GKLI LSEQ+
Sbjct: 118 RYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLSEQE 177
Query: 183 LVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
LVDC T ++GC GGLMD AFE+II N GL TE++YPY+ E GTC+ K A +I
Sbjct: 178 LVDCDTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKGEDGTCNFNKTNPIAVSITG 237
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
YED+P DE AL++AV QPVSV +EA G F+FY GV ECG DH V VG+G
Sbjct: 238 YEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVGYG-- 295
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
E EDG+KYW++KNSWG WGESGYI + +D +GLCGIA +ASYP A
Sbjct: 296 ESEDGSKYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYPTA 344
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 181/337 (53%), Positives = 234/337 (69%), Gaps = 12/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++LV + + R++ + S+ E+HEQWM Q+G+ Y D EK +R IFK+N++ IE
Sbjct: 12 LALLLVFGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQRIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
N GN+ YKLG N+F+DLTNEEF+A NR + S+R TFKY++V+ VP S
Sbjct: 72 AFNNAGNKPYKLGINQFADLTNEEFKAR----NRFKGHMCSNSTRTPTFKYEDVSSVPAS 127
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNG 192
+DWR+KGAVT IK+QG CG CWAFSAVAA EGIT+++ GKLI LSEQ+LVDC T + G
Sbjct: 128 LDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQG 187
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C GGLMD AF++I++NKGL TEA YPYQ TC+ E AA+I +ED+P E AL
Sbjct: 188 CEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESAL 247
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
L+AV QP+SV ++ASG F+FY G+ CG DHGV VG+G + +DG KYWL+K
Sbjct: 248 LKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVS--DDGTKYWLVK 305
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
NSWGE WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 306 NSWGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYPTA 342
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1 [Vitis vinifera]
Length = 341
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 182/337 (54%), Positives = 238/337 (70%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++ V+ + + R++HE S+ E+HE WMAQ+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEF S + + S + ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFGTSRNRFKAHICS-----TEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNG 192
IDWR+KGAVT IK+QG CGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 IDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C+GGLMD AF++I +N GL TEA+YPY GTC+++K AA I YED+P +E AL
Sbjct: 187 CNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QP++V ++A G F+FY GV +CG DHGVA VG+GT+ +DG KYWL+K
Sbjct: 247 QKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTS--DDGMKYWLVK 304
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
NSWG WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 305 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/337 (56%), Positives = 240/337 (71%), Gaps = 12/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++++ ASQ +S R++HE S+ E+HE WM +GRTYKD EK R IFK+N+EYIE
Sbjct: 10 ITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIE 68
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
N GNR YKL NEF+D TNEEF+AS GYN S +SS ++F+Y+NV VP+S
Sbjct: 69 SVNSAGNRRYKLSINEFADQTNEEFKASRNGYNM---SSRPRSSEITSFRYENVAAVPSS 125
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNG 192
+DWR+KGAVT IK+QG CG CWAFSAVAA+EG+TQ+ G+LI LSEQ+LVDC T ++ G
Sbjct: 126 MDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQG 185
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C GGLMD AFE+II N GL TEA+YPY+ TC+K+K ++AA I YED+P E AL
Sbjct: 186 CGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSEAAL 245
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
L+AV + PVSV ++A G F+FY GV +CG DHGV VG+G + +DG KYWL+K
Sbjct: 246 LKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYG--KTDDGTKYWLVK 303
Query: 313 NSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
NSWG WGE GYI + R DEGLCGIA EASYP A
Sbjct: 304 NSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 340
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 186/352 (52%), Positives = 241/352 (68%), Gaps = 18/352 (5%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ +K + + + +F I +L + + + RS++E S+ E H+QWMA++GR YK EK
Sbjct: 3 LTIKHQCTPLALLFTIGVL-----ASLAAARSLNEASMTETHDQWMARYGRVYKTANEKN 57
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R TIF++NL+YI+ NK N+ YKLG NEF+DLTNEEF S + V + +
Sbjct: 58 RRSTIFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTTSRNKFKSHVCA-----TVT 112
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+ F+Y+NVT VP ++DWR+KGAVT IKNQG CG CWAFSAVAA+EGITQ+ GKLI LSE
Sbjct: 113 NVFRYENVTAVPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSE 172
Query: 181 QQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q+LVDC T+ + GC GGLMD AF++I +N GL+TE +YPY GTC+ KE AATI
Sbjct: 173 QELVDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATI 232
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
+ED+P E ALL+AV QP+SV ++ASG F+FY GV ECG DHGV VG+G
Sbjct: 233 TGHEDVPANSESALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYG 292
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVAM 346
TA DG KYWL+KNSWG +WGE GYI++ R EGLCGIA +ASYP A
Sbjct: 293 TA--ADGTKYWLVKNSWGTSWGEEGYIQMQRGVAAAEGLCGIAMQASYPTAF 342
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 180/338 (53%), Positives = 233/338 (68%), Gaps = 14/338 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ +++L+ C SQV+S R++HE S+ E+HEQWM ++G+ YKD EK RL IFK N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N GN+ YKL N +D TNEEF AS+ GY + S + FKY NVTD+P
Sbjct: 69 IESFNAAGNKPYKLSINHLADQTNEEFVASHNGYKY------KGSHSQTPFKYGNVTDIP 122
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNG 192
T++DWR+ GAVT +K+QG CGSCWAFS VAA EGI QI+ G L+ LSEQ+LVDC + ++G
Sbjct: 123 TAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSVDHG 182
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C GGLM+ FE+II+N G+++EA+YPY GTCD KE + AA I YE +P E AL
Sbjct: 183 CDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEAL 242
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA-KYWLI 311
QAV QPVSV ++A G F+FY GV +CG DHGV VVG+GT +DG +YW++
Sbjct: 243 QQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTT--DDGTHEYWIV 300
Query: 312 KNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
KNSWG WGE GYIR+ R EGLCGIA +ASYP+
Sbjct: 301 KNSWGTQWGEEGYIRMQRGIDAQEGLCGIAMDASYPMG 338
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 337
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 179/337 (53%), Positives = 230/337 (68%), Gaps = 13/337 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ +++L+ C SQV+S R +HE S+ E+HEQWM ++G+ YKD EK RL IFK N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N GN+ YKLG N +D TNEEF AS+ GY + S + FKY+NVT VP
Sbjct: 69 IESFNAAGNKPYKLGINHLADQTNEEFVASHNGYKH------KASHSQTPFKYENVTGVP 122
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNG 192
++DWRE GAVT +K+QG CGSCWAFS VAA EGI QIT L+ LSEQ+LVDC + ++G
Sbjct: 123 NAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHG 182
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C GG M+ FE+II+N G+++EA+YPY GTCD KE + AA I YE +P E AL
Sbjct: 183 CDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDAL 242
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QPVSV ++A G AF+FY GV +CG DHGV VG+G+ +DG +YW++K
Sbjct: 243 QKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGST--DDGTQYWIVK 300
Query: 313 NSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
NSWG WGE GYIR+ R EGLCGIA +ASYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 179/337 (53%), Positives = 230/337 (68%), Gaps = 13/337 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ +++L+ C SQV+S R++HE S+ E+HEQWM ++G+ YKD EK RL IFK N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N GNR YKL N +D TNEEF AS+ GY + S + FKY+NVT VP
Sbjct: 69 IESFNAAGNRPYKLSINHLADQTNEEFVASHNGYKH------KGSHSQTPFKYENVTGVP 122
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNG 192
++DWRE GAVT +K+QG CGSCWAFS VAA EGI QIT L+ LSEQ+LVDC + ++G
Sbjct: 123 NAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHG 182
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C GG M+ FE+II+N G+++EA+YPY GTCD KE + AA I YE +P E AL
Sbjct: 183 CDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDAL 242
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QPVSV ++A G AF+FY GV +CG DHGV VG+G+ +DG +YW++K
Sbjct: 243 QKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGST--DDGTQYWIVK 300
Query: 313 NSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
NSWG WGE GYIR+ R EGLCGIA +ASYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 376 bits (965), Expect = e-102, Method: Compositional matrix adjust.
Identities = 188/348 (54%), Positives = 244/348 (70%), Gaps = 18/348 (5%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSMHE-PSIVEKHEQWMAQHGRTYKDELEKAMRL 63
F+ ++P ++I+ I ASQ +GRS+ E S++E+HEQWMAQHGR YK+ EKA R
Sbjct: 4 FKTVKLLPALALLIVAI-WASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAHRF 62
Query: 64 TIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF 123
IF+ N+E IE N E N +KLG N+F+DLTNEEF+ R S+ +S S F
Sbjct: 63 EIFRANVERIESFNAE-NHKFKLGVNQFADLTNEEFK------TRNTLKPSKMASTKS-F 114
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
KY+NVT VP ++DWR KGAVT IK+QG CGSCWAFSAVAA EGIT+++ GKLI LSEQ++
Sbjct: 115 KYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISLSEQEV 174
Query: 184 VDC--STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
VDC ++D+ GC+GG MD AFEYII+NKG+ TEA+YPY+ GTC+ +K + AA+I Y
Sbjct: 175 VDCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKAADGTCNTKKAASHAASITGY 234
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
ED+ E ALL+A QP++V ++A AF+ Y GV +CG + DHGV +VG+G
Sbjct: 235 EDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVGYGAT- 293
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
DG KYWL+KNSWG +WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 294 -SDGTKYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYPTA 340
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 182/341 (53%), Positives = 237/341 (69%), Gaps = 13/341 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ + + + LV ++ + + R++ + + +HEQWMAQ+GR YK+E+EK R IFK+N+
Sbjct: 6 LKLLIALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRYNIFKENV 65
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
EYIE NK G + YKLG N F+DLTN+EF AS GY P + S + F+Y+NV+
Sbjct: 66 EYIESFNKAGTKPYKLGINAFADLTNKEFIASRNGYILP-----HECSSNTPFRYENVSA 120
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
VPT++DWR+KGAVT +K+QG CG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC
Sbjct: 121 VPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKG 180
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
+ GC GGLMD AF +II NKGL TE++YPYQ G+C K K +AA I YED+P
Sbjct: 181 IDQGCEGGLMDDAFTFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANS 240
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL +AV QPVSV ++A G F+FY GV ECG DHGV VG+G A EDG+KY
Sbjct: 241 ESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIA--EDGSKY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG +WGE GYIR+ +D EGLCGIA ++SYP A
Sbjct: 299 WLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQSSYPSA 339
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 178/314 (56%), Positives = 223/314 (71%), Gaps = 15/314 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++HE+WMAQHGR Y D EK R IFK+N+E IE N +R YKLG N+F+DLTNE
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 98 EFRASYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSC 155
EFRA Y GY RQSS+ S+F+Y+N++D+PTS+DWR GAVT +K+QG CG C
Sbjct: 61 EFRAMYHGY-------KRQSSKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCC 113
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEA 215
WAFS VAA+EGI ++ G LI LSEQQLVDC+ N GC GGLMD AF+YII N GL +E
Sbjct: 114 WAFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAGNKGCQGGLMDTAFQYIIRNGGLTSED 173
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
+YPYQ GTC +K + A I YED+P+ +E+ALLQAV KQPVSV V+ G FRFY
Sbjct: 174 NYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFY 233
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DE 331
K GV +CG N +HGV +G+GT + DG YWL+KNSWG +WGESGY R+ R E
Sbjct: 234 KSGVFEGDCGTNLNHGVTAIGYGT--DSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASE 291
Query: 332 GLCGIATEASYPVA 345
GLCG+A +ASYP +
Sbjct: 292 GLCGVAMDASYPTS 305
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 183/338 (54%), Positives = 239/338 (70%), Gaps = 14/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
++ + ASQ + R++ E S+ E+HE WMAQ+GR YKD EK+ R IFK N+ I
Sbjct: 12 LALLFFLAAWASQATA-RNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARI 70
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E NK +++YKL NEF+DLTNEEFRAS + + S + ++FKY++V VP+
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICS-----TEATSFKYEHVAAVPS 125
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNN 191
++DWR+KGAVT IK+QG CGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++
Sbjct: 126 TVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC+GGLMD AF++I +N GLATEA+YPY GTC+++K AA I YED+P +E A
Sbjct: 186 GCNGGLMDDAFKFIEQNHGLATEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 245
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
L +AV QP++V ++A G F+FY GV +CG DHGVA VG+GT+ +DG KYWL+
Sbjct: 246 LQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTS--DDGMKYWLV 303
Query: 312 KNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KNSWG WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 304 KNSWGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 181/345 (52%), Positives = 238/345 (68%), Gaps = 15/345 (4%)
Query: 9 FIIPMFVIIILVI--TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIF 66
F+ F ++++V ASQ+ + RS+ + S+ E+HE+WMA +GR YKD EK R IF
Sbjct: 3 FVSQCFCLVVMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRYKIF 62
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
++N+ IE +NK+ N+ YKL N+F+DLTNEEF+AS + + S ++ ++FKY
Sbjct: 63 EENVALIESSNKDANKPYKLSVNQFADLTNEEFKASRNRFKGHICS-----TKSTSFKYG 117
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
NV+ VP+++DWR KGAVT +K+QG CG CWAFSAVAA EGIT++T G+LI LSEQ+LVDC
Sbjct: 118 NVSAVPSAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELISLSEQELVDC 177
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
T + GC GGLMD AF +I N GLA+EA+YPY+ GTC+ K+ AA I +ED+
Sbjct: 178 DTSGVDQGCEGGLMDNAFTFIQHNHGLASEANYPYKGVDGTCNTNKQAIHAAEINGFEDV 237
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
P E ALL AV QPVSV ++A G F+FY +GV CG DHGV VG+GT+ +D
Sbjct: 238 PANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIGACGTQLDHGVTAVGYGTS--DD 295
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
G KYWL+KNSWG WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 296 GTKYWLVKNSWGTQWGEEGYIRMQRDVDAKEGLCGIAMKASYPTA 340
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 180/340 (52%), Positives = 236/340 (69%), Gaps = 16/340 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS-IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
+F+ +L++ + ++ R + E ++++HE+WMAQHGR Y D EK R IFK+N+E
Sbjct: 10 IFLPFLLILAAWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIE 69
Query: 72 YIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSR--PSTFKYQNVT 129
IE N +R YKLG N+F+DLTNEEFRA Y GY RQSS+ S+F+Y+N++
Sbjct: 70 RIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYHGY-------KRQSSKLMSSSFRYENLS 122
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
D+PTS+DWR GAVT +K+QG CG CWAFS VAA+EGI ++ G LI LSEQQLVDC+
Sbjct: 123 DIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAG 182
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
N GC GGLMD AF+YII N GL +E +YPYQ GTC +K + A I YED+P+ +E
Sbjct: 183 NKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNE 242
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
+ALLQAV KQPVSV V+ G F+FYK GV N +CG +H V +G+GT + DG YW
Sbjct: 243 NALLQAVAKQPVSVGVDGGGNDFQFYKSGVFNGDCGTQQNHAVTAIGYGT--DIDGTDYW 300
Query: 310 LIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
L+KNSWG +WGE+GY+R+ R EGLCG+A +ASYP A
Sbjct: 301 LVKNSWGTSWGENGYMRMRRGIGSSEGLCGVAMDASYPTA 340
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 180/328 (54%), Positives = 231/328 (70%), Gaps = 13/328 (3%)
Query: 24 ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT 83
++ + + R++ + +V +HEQWMAQ+GR Y++E+EK R IFK+N+EYIE NK G +
Sbjct: 21 SAYLATSRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKP 80
Query: 84 YKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV 143
YKLG N F+DLTN+EF+AS GY P S + F+Y+NV+ VPT++DWR KGAV
Sbjct: 81 YKLGINAFADLTNQEFKASRNGYKLP-----HDCSSNTPFRYENVSSVPTTVDWRTKGAV 135
Query: 144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKA 201
T +K+QG CG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC + GC GGLMD A
Sbjct: 136 TPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDA 195
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F +II NKGL TE++YPYQ G+C K K +AA I YED+P E AL +AV QPV
Sbjct: 196 FSFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPV 255
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
SV ++A G F+FY GV ECG DHGV VG+G A EDG+KYWL+KNSWG +WGE
Sbjct: 256 SVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIA--EDGSKYWLVKNSWGTSWGE 313
Query: 322 SGYIRILRD----EGLCGIATEASYPVA 345
GYIR+ +D EGLCGIA ++SYP A
Sbjct: 314 KGYIRMQKDIEAKEGLCGIAMQSSYPSA 341
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 373 bits (957), Expect = e-101, Method: Compositional matrix adjust.
Identities = 181/328 (55%), Positives = 229/328 (69%), Gaps = 13/328 (3%)
Query: 24 ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT 83
++ + + R++ + +V +HEQWMAQ+GR YK E EK R IFK+N+EYIE NK G +
Sbjct: 19 SAYLATSRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKP 78
Query: 84 YKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV 143
YKLG N F+DLTN+EF+AS GY P S + F+Y+NV+ VPT++DWR KGAV
Sbjct: 79 YKLGINAFADLTNQEFKASRNGYKLP-----HDCSSNTPFRYENVSSVPTTVDWRTKGAV 133
Query: 144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKA 201
T +K+QG CG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC + GC GGLMD A
Sbjct: 134 TPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDA 193
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F +II NKGL TE++YPYQ G+C K K +AA I YED+P E AL +AV QPV
Sbjct: 194 FSFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPV 253
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
SV ++A G F+FY GV ECG DHGV VG+G A EDG+KYWL+KNSWG +WGE
Sbjct: 254 SVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIA--EDGSKYWLVKNSWGTSWGE 311
Query: 322 SGYIRILRD----EGLCGIATEASYPVA 345
GYIR+ +D EGLCGIA ++SYP A
Sbjct: 312 KGYIRMQKDIEAKEGLCGIAMQSSYPSA 339
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 180/345 (52%), Positives = 237/345 (68%), Gaps = 14/345 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIF 66
K I+P+ + +L + CA Q S R +HE + +HE+WMA+HG+ YKD+ EK R IF
Sbjct: 6 KGKILPIALFFVLAM-CADQAAS-RELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIF 63
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
K N+ +IE N GN++Y LG N+F+DLTNEEFRA + GY RP+ + S + + FKY+
Sbjct: 64 KSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAFWNGYKRPLGA----SRKITPFKYE 119
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
NVT +P+SIDWR KGAVT IK+QG CGSCWAFSAVAA EGI ++ GKL+ LSEQ+LVDC
Sbjct: 120 NVTALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDC 179
Query: 187 ST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
+ GC GGLM AF++I + G+ +EA+YPYQ G CD +KE + A I Y+ +
Sbjct: 180 DVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAV 239
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
PK E ALL+AV QPVSV ++A +F+FY+ G+ CG + +HGVA VG+G +
Sbjct: 240 PKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRSNS-- 297
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
G+KYW++KNSWG WGE GYIR+ RD EGLCGIA E SYP A
Sbjct: 298 GSKYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMECSYPTA 342
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/338 (53%), Positives = 238/338 (70%), Gaps = 13/338 (3%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F +++ + A QV S R++ + S+ E+HEQWMA++GR YKD EK R +IFK+N+ YI
Sbjct: 12 FALVLCLGLWAFQV-SSRTLQDASMQERHEQWMARYGRVYKDLQEKEKRFSIFKENVNYI 70
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E +N G++ YKLG N+F+DLTNEEF A+ N+ +S +R +TFKY+NVT P+
Sbjct: 71 EASNNAGDKPYKLGVNQFADLTNEEFIATR---NKFKGHMSSSITRTTTFKYENVT-APS 126
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NN 191
++DWR++GAVT +KNQG CG CWAFSAVAA EGI +++ G L+ LSEQ+LVDC T +
Sbjct: 127 TVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQ 186
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLMD AF++II+N GL TEA YPYQ GTC+ +E ATI YED+P +E A
Sbjct: 187 GCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITGYEDVPSNNEQA 246
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
L QAV QP+S+ ++ASG F+ Y+ GV CG DHGVAVVG+G + +DG KYWL+
Sbjct: 247 LQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVS--DDGTKYWLV 304
Query: 312 KNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KNSWG WGE GYIR+ RD EGLCG+A + SYP A
Sbjct: 305 KNSWGADWGEEGYIRMQRDVDAPEGLCGLAMQPSYPTA 342
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 180/343 (52%), Positives = 242/343 (70%), Gaps = 13/343 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
I F++ I++ + S S + E S +EKHEQWM++ R Y D+ EK R IFK+NL
Sbjct: 4 IIFFLLAIILSSRTSGATSRGGLFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNL 63
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS----TFKYQ 126
+++E N N+TY L NEFSDLT+EEF+A YTG P ++R S+ S +F+Y+
Sbjct: 64 KFVESFNMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVP-EGMTRMSTTDSHETVSFRYE 122
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
NV + S+DWRE+GAVT +K+Q CG CWAFSAVAAVEG+T+I G+L+ LSEQQL+DC
Sbjct: 123 NVGETGESMDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSLSEQQLLDC 182
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
ST+N+GC GG+M KAF+YI+EN+G+ E +YPYQ Q TC+ AAATI YE +P+
Sbjct: 183 STENDGCDGGIMWKAFDYIVENQGITAEDNYPYQGAQQTCESN--HVAAATISGYETVPQ 240
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
DE ALL+AV++QPVSV +E SG F Y G+ N ECG + +H V +VG+G +EE G
Sbjct: 241 NDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTHLNHAVTIVGYGVSEE--GI 298
Query: 307 KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KYWL+KNSWGE+WGE GY+RI+RD +G+CG+A+ A YPVA
Sbjct: 299 KYWLLKNSWGESWGEDGYMRIMRDVDAPQGMCGLASLAYYPVA 341
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 182/338 (53%), Positives = 239/338 (70%), Gaps = 13/338 (3%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F +++ + A QV S R++ + S+ E+HEQWMA++G+ YKD EK R IF++N++YI
Sbjct: 12 FALVLCLGLWAFQV-SSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNIFQENVKYI 70
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E +N GN+ YKLG N+F+DLTN+EF A+ N+ +S +R +TFKY+NVT P+
Sbjct: 71 EASNNAGNKPYKLGVNQFTDLTNKEFIATR---NKFKGHMSSSITRTTTFKYENVT-APS 126
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NN 191
++DWR++GAVT +KNQG CG CWAFSAVAA EGI +++ G L+ LSEQ+LVDC T +
Sbjct: 127 TVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQ 186
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLMD AF++II+N GL TEA YPYQ GTC+ +E ATI YED+P +E A
Sbjct: 187 GCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPSNNEQA 246
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
L QAV QP+SV ++ASG F+ Y+ GV CG DHGVAVVG+G + +DG KYWL+
Sbjct: 247 LQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVS--DDGTKYWLV 304
Query: 312 KNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KNSWGE WGE GYIR+ RD EGLCGIA + SYP A
Sbjct: 305 KNSWGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYPTA 342
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 183/338 (54%), Positives = 241/338 (71%), Gaps = 13/338 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++L+ + + R++ + S+ E+HEQWMAQHG+ YKD EK +R IF+QN++ IE
Sbjct: 12 LALLLLFGFWAFSANTRTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
N GN+++KLG N+F+DLT EEF+A N+ + + SR STFKY++VT VP +
Sbjct: 72 GFNNAGNKSHKLGVNQFADLTEEEFKA----INKLKGYMWSKISRTSTFKYEHVTKVPAT 127
Query: 135 IDWREKGAVTHIKNQG-HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNN 191
+DWR+KGAVT IK+QG CGSCWAF+AVAA EGIT++T G+LI LSEQ+L+DC T DN
Sbjct: 128 LDWRQKGAVTPIKSQGLKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGDNG 187
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC G++ +AF++I++NKGLATEA YPYQ GTC+ + E A+I YED+P +E A
Sbjct: 188 GCKWGIIQEAFKFIVQNKGLATEASYPYQAVDGTCNAKVESKHVASIKGYEDVPANNETA 247
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
LL AV QPVSV V++S FRFY GVL+ CG DH V VVG+G + +DG KYWLI
Sbjct: 248 LLNAVANQPVSVLVDSSDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVS--DDGTKYWLI 305
Query: 312 KNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KNSWG WGE GYIRI RD EG+CGIA +ASYP+A
Sbjct: 306 KNSWGVYWGEQGYIRIKRDVAAKEGMCGIAMQASYPIA 343
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 191/341 (56%), Positives = 234/341 (68%), Gaps = 17/341 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ M ++ IL ASQ S RS+HE S+ E+HE WMA++GR YKD EK R IFK N+
Sbjct: 10 VSMALLFILA-AWASQATS-RSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV 67
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
IE NK ++TYKL NEF+DLTNEEFR+ + + S +TFKY+NVT
Sbjct: 68 ARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNRFKAHI------CSEATTFKYENVTA 121
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
VP++IDWR+KGAVT IK+Q CG CWAFSAVAA EGITQIT GKLI LSEQ+LVDC T
Sbjct: 122 VPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGG 181
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
+N GCSGGLMD AF + I+ GLA+EA YPY+ + GTC+ +KE AA I YED+P +
Sbjct: 182 ENQGCSGGLMDDAFRF-IKIHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANN 240
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL +AV QPV+V ++A G F+FY GV +CG DHGVA VG+G +DG Y
Sbjct: 241 EKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIG--DDGMMY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 299 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 339
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 186/349 (53%), Positives = 235/349 (67%), Gaps = 14/349 (4%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+ F+K F + + +I CA + + R++ + + E+HEQWMA HG+ YK EK +
Sbjct: 1 MAFKKLFHCTLALFLIFAF-CAFEA-NARTLEDAPMRERHEQWMATHGKVYKHSYEKEQK 58
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
IF +N++ IE N G + YKLG N F+DLTNEEF+A NR V + +R +T
Sbjct: 59 YQIFMENVQRIEAFNNAGXKPYKLGINHFADLTNEEFKA----INRFKGHVCSKRTRTTT 114
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
F+Y+NVT VP S+DWR+KGAVT IK+QG CG CWAFSAVAA EGIT++ GKLI LSEQ+
Sbjct: 115 FRYENVTAVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQE 174
Query: 183 LVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
LVDC T + GC GGLMD AF++I++NKGLATEA YPY+ GTC+ + + A +I
Sbjct: 175 LVDCDTKGVDQGCEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGNHAGSIKG 234
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
YED+P E ALL+AV QPVSV +EASG F+FY GV CG N DHGV VG+G
Sbjct: 235 YEDVPANSESALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVG 294
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+DG KYWL+KNSWG WGE GYIR+ RD EGLCGIA ASYP A
Sbjct: 295 --DDGTKYWLVKNSWGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYPSA 341
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 182/342 (53%), Positives = 233/342 (68%), Gaps = 11/342 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
+ +F I + V SQV S R + +E S+ +H+QW+A H + YKD EK MR IFK+N
Sbjct: 12 LALFFIFLGVWR--SQVASSRPINYEASMRARHDQWIAHHDKVYKDLNEKEMRFKIFKEN 69
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+E IE N ++ YKLG N+FSDLTNE+FR +TGY R P V S + F+Y NVT
Sbjct: 70 VERIEAFNAGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRSHPKVMSSSKPKTHFRYANVT 129
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
D+P ++DWR+KGAVT IK+Q CG CWAFSAVAA EG+ Q+ GKLI LSEQ+LVDC
Sbjct: 130 DIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVE 189
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
++ GCSGGL+D AF++I++NKGL TEA+YPY+ E G C+K+K +AA I YED+P
Sbjct: 190 GEDEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVCNKKKSALSAAKIAGYEDVPAN 249
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E ALLQAV QPVSV ++ S F+FY GV + C +H V VG+G DG K
Sbjct: 250 SEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGAT--TDGTK 307
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
YW+IKNSWG WG+SGY+RI RD EGLCG+A +ASYP A
Sbjct: 308 YWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 183/343 (53%), Positives = 234/343 (68%), Gaps = 16/343 (4%)
Query: 13 MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQ 68
++ I + ++ C A QV S R++ + S+ E+H+QWM Q+ + Y D E R IFK+
Sbjct: 7 LYYISLALLMCLGLWAVQVTS-RTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIFKE 65
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
N+ YIE +NKEG R YKLG N+F DLTNEEF A NR + R +T+KY+NV
Sbjct: 66 NVNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAPR---NRFKGHMCSSIIRTNTYKYENV 122
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
T VP+++DWR+KGAVT +K+QG CG CWAFSAVAA EGI Q++ GKLI LSEQ+LVDC T
Sbjct: 123 TTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDT 182
Query: 189 D--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
+ GC GGLMD AF++II+N GL TEA YPYQ GTC+ + AATI YED+P
Sbjct: 183 KGVDQGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVDGTCNANEASINAATITSYEDVPT 242
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
+E AL +AV QP+SV ++ASG F+FY GV CG DHGV VG+G + +DG
Sbjct: 243 NNEQALQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVS--DDGT 300
Query: 307 KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KYWL+KNSWG +WGE GYIR+ R EGLCGIA +ASYP+A
Sbjct: 301 KYWLVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASYPIA 343
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 369 bits (946), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 182/343 (53%), Positives = 239/343 (69%), Gaps = 13/343 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
I F++ IL+ + S V S + E S VEKHEQWM++ R Y D+ EK R IF NL
Sbjct: 4 IVFFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNL 63
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS----TFKYQ 126
+++E N N+TY L NEFSDLT+EEF+A YTG P ++R S+ S +F+Y+
Sbjct: 64 KFVESINMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVP-EGMTRISTTDSHETVSFRYE 122
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
NV + S+DW ++GAVT +K+Q CG CWAFSAVAAVEG+T+I G+L+ LSEQQL+DC
Sbjct: 123 NVGETGESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDC 182
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
ST+NNGC GG+M KAF+YI EN+G+ TE +YPYQ Q TC+ AAATI YE +P+
Sbjct: 183 STENNGCGGGIMWKAFDYIKENQGITTEDNYPYQGAQQTCESN--HLAAATISGYETVPQ 240
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
DE ALL+AV++QPVSV +E SG F Y G+ N ECG H V +VG+G +EE G
Sbjct: 241 NDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEE--GI 298
Query: 307 KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KYWL+KNSWGE+WGE+GY+RI+RD +G+CG+A+ A YPVA
Sbjct: 299 KYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYPVA 341
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 369 bits (946), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 172/339 (50%), Positives = 234/339 (69%), Gaps = 12/339 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ + + V+ + S R +HE ++VE+HE+WMA+HG+ YKD+ EK R IFK N+E+
Sbjct: 10 LLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEF 69
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE +N GN +Y LG N F+DLTNEEFRAS+ GY RP+ + S + FKY+NVT +P
Sbjct: 70 IESSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDA----SRIVTPFKYENVTALP 125
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DN 190
S+DWR KGAVT IK+Q CGSCWAFSAVAA EG+ ++ GKL+ LSEQ+LVDC ++
Sbjct: 126 YSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGED 185
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GGLM+ AF++I N G+ TEA+Y Y+ G CD +KE + A I Y+ +P+ E
Sbjct: 186 KGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKITGYQVVPENSEA 245
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
ALL+AV QPVSV ++A +F+FY+ G+ CG + +HGVA VG+GT+ G+KYW+
Sbjct: 246 ALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTS--SSGSKYWI 303
Query: 311 IKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+KNSWG WGE GY+R+ RD +GLCGIA + SYP A
Sbjct: 304 VKNSWGPEWGERGYVRMKRDITSRKGLCGIAMDCSYPTA 342
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 367 bits (943), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 174/338 (51%), Positives = 227/338 (67%), Gaps = 11/338 (3%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F IL++ + V+ R + E + +HEQWMA +G+ Y D EK R IFK N+EYI
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N GN+ YKL N+F+D TNE+F+ + GY RP + + + ++FKY+NVT VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT---RPMKVTSFKYENVTAVPA 126
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNN 191
++DWR+KGAVT IK+QG CGSCWAFS VAA EGI Q+T GKL+ LSEQ+LVDC ++
Sbjct: 127 TMDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGEDQ 186
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLM+ FE+II+N G+ TEA+YPYQ GTC+ +K+ + A I YE +P E
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
LL+ V QP+SV ++A G F+FY GV +CG DHGV VG+G E DG KYWL+
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYG--ETSDGTKYWLV 304
Query: 312 KNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KNSWG +WGE GYIR+ RD EGLCGIA ++SYP A
Sbjct: 305 KNSWGTSWGEEGYIRMQRDIDTEEGLCGIAMDSSYPTA 342
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 367 bits (943), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 186/352 (52%), Positives = 238/352 (67%), Gaps = 14/352 (3%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M K + + I + I L + CA QV S RS+ S+ E+HEQWM+Q+ + YKD E+
Sbjct: 1 MASKNQLYYSIALTFIFCLGL-CAIQVTS-RSLQVDSMYERHEQWMSQYSKVYKDPQERE 58
Query: 61 MRLTIFKQNLEYIEKANKEGN-RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSR 119
R IF N+ YIE N + N + YKLG N+F+DLTNEEF AS N+ + ++
Sbjct: 59 ERHKIFTANVNYIEVFNNDANNKLYKLGINQFADLTNEEFIASR---NKFKGHMCSSIAK 115
Query: 120 PSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELS 179
+TFKY+NV+ +P+++DWR+KGAVT +KNQG CG CWAFSAVAA EGIT+++ GKL+ LS
Sbjct: 116 TTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLS 175
Query: 180 EQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
EQ+LVDC T + GC GGLMD AF++II+N GL+TEA YPYQ GTC+ K AAT
Sbjct: 176 EQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVDGTCNANKASIHAAT 235
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I YED+P +E AL +AV QP+SV ++ASG F+FYK GV + CG DHGV VG+
Sbjct: 236 ITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGY 295
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
G DG KYWL+KNSWG WGE GYIR+ R EGLCGIA +ASYP A
Sbjct: 296 GVG--NDGTKYWLVKNSWGTDWGEEGYIRMQRGVDAAEGLCGIAMQASYPTA 345
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 367 bits (942), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 179/341 (52%), Positives = 232/341 (68%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
F+I IL TCA ++ R + + S+V +HEQWMA++GR Y D EKA RL +FK N+ +
Sbjct: 82 FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + L N+F+D+T +EFRA++TGY +PVP+ R + FKY NV+
Sbjct: 142 IELVNA-GNDKFSLEANQFADMTVDEFRAAHTGY-KPVPA---NKGRTTQFKYANVSLDA 196
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P S+DWR KGAVT IK+QG CG CWAFS VA+VEGI +++ GKLI LSEQ+LVDC D
Sbjct: 197 LPASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDG 256
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
+ GC GGLMD AFE+II+N GL TE +YPY +C+ KE A+I YED+P D
Sbjct: 257 MDQGCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVPSND 316
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E +LL+AV QPVS+ V+ FRFYK GVL+ CG DHG+A VG+G DG K+
Sbjct: 317 ETSLLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGIT--SDGTKF 374
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG +WGE G+IR+ RD EGLCG+A + SYP A
Sbjct: 375 WLMKNSWGTSWGEKGFIRMERDIADEEGLCGLAMQPSYPTA 415
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 367 bits (941), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 180/351 (51%), Positives = 239/351 (68%), Gaps = 17/351 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M L + FI + ++ V+ + R++ + S+ E+HEQWMAQ+GR YKD+ EK
Sbjct: 1 MRLTKQSQFIC---LALLFVLGAWPSKSAARTLQDVSMYERHEQWMAQYGRVYKDDAEKE 57
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK+N+ I+ N + ++YKLG N+F+DL+NEEF+AS NR + + P
Sbjct: 58 TRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKASR---NRFKGHMCSPQAGP 114
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
F+Y+NV+ VP ++DWR+KGAVT +K+QG CG CWAFSAVAA+EGI Q+T GKLI LSE
Sbjct: 115 --FRYENVSAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSE 172
Query: 181 QQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q++VDC T ++ GC+GGLMD AF++I +NKGL TEA+YPY GTC+ QKE AA I
Sbjct: 173 QEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTCNTQKEATHAAKI 232
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
+ED+P E AL++AV KQPVSV ++A G F+FY G+ CG DHGV VG+G
Sbjct: 233 TGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYG 292
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+ DG KYWL+KNSWG WGE GYIR+ +D EGLCGIA +ASYP A
Sbjct: 293 IS---DGTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPSA 340
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 367 bits (941), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 177/338 (52%), Positives = 236/338 (69%), Gaps = 15/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+I L+ SQ ++ R++ + S+ EKHE+WM++ GR Y D EK +R IFK+N++ I
Sbjct: 12 LALIFLLGALVSQAMA-RTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRI 70
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E NK ++YKLG N+F+DLTNEEF+ S + + S S+ F+Y+N+T P+
Sbjct: 71 ESFNKASGKSYKLGINQFADLTNEEFKTSRNRFKGHMCS-----SQAGPFRYENLTAAPS 125
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNN 191
S+DWR+KGAVT IK+QG CGSCWAFSAVAAVEGITQ+ KLI LSEQ+LVDC T ++
Sbjct: 126 SMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQ 185
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLMD AF++I +N+GL TEA+YPY+ GTC+ ++E AA I +ED+P +E A
Sbjct: 186 GCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGA 245
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
L++AV KQPVSV ++A G F+FY G+ +CG DHGVA VG+G E +G YWL+
Sbjct: 246 LMKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYG---ESNGMNYWLV 302
Query: 312 KNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KNSWG WGE GYIR+ +D EGLCGIA +ASYP A
Sbjct: 303 KNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 181/340 (53%), Positives = 234/340 (68%), Gaps = 18/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
F + +L I V+ R++ + SI E+HEQWM +G+ YK+ E+ RL IF +NL+Y
Sbjct: 15 FFCLGLLAIQ-----VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKY 69
Query: 73 IEKANKEGN-RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
IE +N GN + YKLG N+F+DLTNEEF AS N+ + R +TFKY+N T V
Sbjct: 70 IEASNNAGNNKPYKLGINQFADLTNEEFIASR---NKFKGHMCSSIIRTTTFKYEN-TSV 125
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-- 189
P+++DWR+KGAVT +KNQG CG CWAFSA+AA EGI +I+ GKL+ LSEQ+LVDC T+
Sbjct: 126 PSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGV 185
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
+ GC GGLMD AF++II+N G++TEA YPYQ GTC + +AATI YED+P +E
Sbjct: 186 DQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNE 245
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
+AL +AV QP+SV ++ASG F+FYK GV CG DHGV VG+G + DG KYW
Sbjct: 246 NALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGIS--NDGTKYW 303
Query: 310 LIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
L+KNSWG WGE GYIR+ R EGLCGIA +ASYP A
Sbjct: 304 LVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 181/340 (53%), Positives = 234/340 (68%), Gaps = 18/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
F + +L I V+ R++ + SI E+HEQWM +G+ YK+ E+ RL IF +NL+Y
Sbjct: 15 FFCLGLLAIQ-----VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKY 69
Query: 73 IEKANKEGNRT-YKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
IE +N GN+ YKLG N+F+DLTNEEF AS N+ + R +TFKY+N T V
Sbjct: 70 IEASNNAGNKKPYKLGINQFADLTNEEFIASR---NKFKGHMCSSIIRTTTFKYEN-TSV 125
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-- 189
P+++DWR+KGAVT +KNQG CG CWAFSA+AA EGI +I+ GKL+ LSEQ+LVDC T+
Sbjct: 126 PSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGV 185
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
+ GC GGLMD AF++II+N G++TEA YPYQ GTC + +AATI YED+P +E
Sbjct: 186 DQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNE 245
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
+AL +AV QP+SV ++ASG F+FYK GV CG DHGV VG+G + DG KYW
Sbjct: 246 NALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGIS--NDGTKYW 303
Query: 310 LIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
L+KNSWG WGE GYIR+ R EGLCGIA +ASYP A
Sbjct: 304 LVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 173/338 (51%), Positives = 226/338 (66%), Gaps = 11/338 (3%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F IL++ + V+ R + E + +HEQWMA +G+ Y D EK R IFK N+EYI
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N GN+ YKL N+F+D TNE+F+ + GY RP + + + ++FKY+NVT VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT---RPMKVTSFKYENVTAVPA 126
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNN 191
++DWR+KGAVT IK+QG CGSCWAFS VAA EGI Q+T GKL+ LSEQ+LVDC ++
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGEDQ 186
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLM+ FE+II+N G+ TEA+YPYQ GTC+ +K+ + A I YE +P E
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
LL+ V QP+SV ++A G F+FY GV +CG DHGV VG+G E DG KYWL+
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYG--ETSDGTKYWLV 304
Query: 312 KNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KNSW +WGE GYIR+ RD EGLCGIA ++SYP A
Sbjct: 305 KNSWXTSWGEEGYIRMQRDIDAEEGLCGIAMDSSYPTA 342
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 177/338 (52%), Positives = 235/338 (69%), Gaps = 15/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+I + ASQ ++ R++ + SI EKHE+WM + R Y D EK +R IFK+N++ I
Sbjct: 12 LALIFFLGALASQAIA-RTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRI 70
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E NK ++YKLG N+F+DLTNEEF+ S + + S S+ F+Y+N+T VP+
Sbjct: 71 ESFNKASEKSYKLGINQFADLTNEEFKTSRNRFKGHMCS-----SQAGPFRYENITAVPS 125
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNN 191
S+DWR++GAVT IK+QG CGSCWAFSAVAAVEGITQ+ KLI LSEQ+LVDC T ++
Sbjct: 126 SMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQ 185
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLMD AF++I +N+GL TEA+YPY+ GTC+ ++E AA I +ED+P +E A
Sbjct: 186 GCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGA 245
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
L++AV KQPVSV ++A G F+FY G+ +CG DHGVA VG+G E +G YWL+
Sbjct: 246 LMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYG---ESNGMNYWLV 302
Query: 312 KNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KNSWG WGE GYIR+ +D EGLCGIA +ASYP A
Sbjct: 303 KNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 172/324 (53%), Positives = 225/324 (69%), Gaps = 11/324 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLG 87
V+ R++ + S+ E+H QWM+Q+G+ YKD E+ R IF +N+ Y+E +N + ++YKLG
Sbjct: 25 VTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLG 84
Query: 88 TNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F+DLTNEEF AS N+ + +R +TFKY+NV+ +P+++DWR+KGAVT +K
Sbjct: 85 INQFADLTNEEFVASR---NKFKGHMCSSITRTTTFKYENVSAIPSTVDWRKKGAVTPVK 141
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYI 205
NQG CG CWAFSAVAA EGI +++ GKLI LSEQ+LVDC T + GC GGLMD AF++I
Sbjct: 142 NQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFI 201
Query: 206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
I+N GL+TEA YPY+ GTC+ K A TI YED+P E AL +AV QP+SV +
Sbjct: 202 IQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAI 261
Query: 266 EASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
+ASG F+FYK GV CG DHGV VG+G + DG KYWL+KNSWG WGE GYI
Sbjct: 262 DASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVS--NDGTKYWLVKNSWGTDWGEEGYI 319
Query: 326 RILRD----EGLCGIATEASYPVA 345
+ R EGLCGIA +ASYP A
Sbjct: 320 MMQRGVEAAEGLCGIAMQASYPTA 343
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 365 bits (936), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 186/345 (53%), Positives = 240/345 (69%), Gaps = 14/345 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F++ I + S S S+ E S +EKHEQWMA+ R Y DE EK R IFK+NLE+
Sbjct: 6 IFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEF 65
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRP--VPSVSRQSSRPST--FKYQNV 128
++ N TYK+ NEFSDLT+EEFRA++TG P + +S SS +T F+Y NV
Sbjct: 66 VQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRYGNV 125
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+D S+DWR++GAVT +K QG CG CWAFSAVAAVEGIT+IT G+L+ LSEQQL+DC
Sbjct: 126 SDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDR 185
Query: 189 D-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAA---AATIGKYEDL 244
D N GC GG+M KAFEYII+N+G+ TE +YPYQ+ Q TC ++ AATI YE +
Sbjct: 186 DYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETV 245
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
P +E ALLQAV++QPVSV +E +G AFR Y GV N ECG + H V +VG+G +EE
Sbjct: 246 PMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGMSEE-- 303
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
G KYW++KNSWGETWGE+GY+RI RD +G+CG+A A YP+A
Sbjct: 304 GTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 364 bits (934), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 184/344 (53%), Positives = 238/344 (69%), Gaps = 13/344 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F++ I + S S + E S +EKHEQWMA+ R Y DE EK R IFK+NLE+
Sbjct: 6 IFILTIFLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKRNRFNIFKKNLEF 65
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRP--VPSVSRQSSRPST-FKYQNVT 129
++ N N TYKL NEFSDLT+EEFRA++TG P + +S SS + F+Y NV+
Sbjct: 66 VQSFNMNKNITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSSDKTVPFRYGNVS 125
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
D S+DWR++GAVT +K QG CG CWAFSAVAAVEGIT+IT G+L+ LSEQQL+DC TD
Sbjct: 126 DTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDTD 185
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAA---AATIGKYEDLP 245
N GC GG+M KAFEYII+N+G+ TE +YPYQ+ Q TC ++ AATI YE +P
Sbjct: 186 YNQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVP 245
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+E ALLQAV++QPVSV +E +G FR Y G+ N ECG + H V +VG+G +EE G
Sbjct: 246 MNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVTIVGYGMSEE--G 303
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KYW++KNSWGETWGE G++RI RD +G+CG+A A YP+A
Sbjct: 304 TKYWVVKNSWGETWGEDGFMRIKRDVDAPQGMCGLAMLAFYPLA 347
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 363 bits (933), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 179/345 (51%), Positives = 232/345 (67%), Gaps = 11/345 (3%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIF 66
K+ + + ++L + + V+ RS+ + S+ E+HEQWM ++G+ YKD E+ R IF
Sbjct: 4 KNHFCHISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIF 63
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
K+N+ YIE N N+ YKL N+F+DLTNEEF A NR + R +TFKY+
Sbjct: 64 KENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPR---NRFKGHMCSSIIRTTTFKYE 120
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
NVT VP+++DWR+KGAVT IK+QG CG CWAFSAVAA EGI +T GKLI LSEQ+LVDC
Sbjct: 121 NVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDC 180
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
T + GC GGLMD AF+++I+N GL TEA+YPY+ G C+ + AATI YED+
Sbjct: 181 DTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNVNEAANDAATITGYEDV 240
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
P +E AL +AV QPVSV ++ASG F+FYK GV CG DHGV VG+G + D
Sbjct: 241 PANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVS--ND 298
Query: 305 GAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
G +YWL+KNSWG WGE GYIR+ R +EGLCGIA +ASYP A
Sbjct: 299 GTEYWLVKNSWGTEWGEEGYIRMQRGVNSEEGLCGIAMQASYPTA 343
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 363 bits (931), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 180/349 (51%), Positives = 232/349 (66%), Gaps = 12/349 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+ F+K + + LV + + R++ + + E+HEQWMA HG+ Y EK +
Sbjct: 1 MAFKKVLFQYFTLALCLVFAFCAFEGNARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQK 60
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
FK+N++ IE N GN+ YKLG N F+DLTNEEF+A NR V + +R T
Sbjct: 61 YQTFKENVQRIEAFNHAGNKPYKLGINHFADLTNEEFKA----INRFKGHVCSKITRTPT 116
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
F+Y+N+T VP ++DWR++GAVT IK+QG CG CWAFSAVAA EGIT+++ GKLI LSEQ+
Sbjct: 117 FRYENMTAVPATLDWRQEGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQE 176
Query: 183 LVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
LVDC T + GC GGLMD AF++I++NKGLA EA YPY+ GTC+ + E A +I
Sbjct: 177 LVDCDTKGVDQGCEGGLMDDAFKFILQNKGLAAEAIYPYEGVDGTCNAKAEGNHATSIKG 236
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
YED+P E ALL+AV QPVSV +EASG F+FY GV CG N DHGV VG+G +
Sbjct: 237 YEDVPANSESALLKAVANQPVSVAIEASGFEFQFYSGGVFTGSCGTNLDHGVTAVGYGVS 296
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+DG KYWL+KNSWG WG+ GYIR+ RD EGLCGIA ASYP A
Sbjct: 297 --DDGTKYWLVKNSWGVKWGDKGYIRMQRDVAAKEGLCGIAMLASYPNA 343
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 363 bits (931), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 181/351 (51%), Positives = 235/351 (66%), Gaps = 13/351 (3%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K ++ +F+ + + I SQV+ R +H+ ++ E+HE WMA++G+ YKD EK
Sbjct: 1 MAFTGQKQHMLALFLFLAVGI---SQVMP-RKLHQTALRERHENWMAEYGKIYKDAAEKE 56
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK N+E+IE N GN+ YKLG N +DLT EEF+ S G R S + +
Sbjct: 57 KRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTY-EFSTTTFKL 115
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQG-HCGSCWAFSAVAAVEGITQITGGKLIELS 179
+ FKY+NVTD+P +IDWR KGAVT IK+QG CGSCWAFS VAA EGI QI+ G L+ LS
Sbjct: 116 NGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLS 175
Query: 180 EQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
EQ+LVDC + ++GC GGLM+ FE+II+N G+++EA+YPY GTCD KE + AA I
Sbjct: 176 EQELVDCDSVDHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIK 235
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
YE +P E AL QAV QPVSV ++A G F+FY GV +CG DHGV VVG+GT
Sbjct: 236 GYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGT 295
Query: 300 AEEEDGA-KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+DG +YW++KNSWG WGE GYIR+ R EGLCGIA +ASYP A
Sbjct: 296 T--DDGTHEYWIVKNSWGTQWGEEGYIRMQRGIDALEGLCGIAMDASYPTA 344
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 176/335 (52%), Positives = 226/335 (67%), Gaps = 11/335 (3%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
++L + + V+ RS+ + S+ E+HEQWM ++G+ YKD E+ R IFK+N+ YIE
Sbjct: 561 MLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAF 620
Query: 77 NKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
N N+ YKL N+F+DLTNEEF A NR + R +TFKY+NVT VP+++D
Sbjct: 621 NNAANKRYKLAINQFADLTNEEFIAPR---NRFKGHMCSSIIRTTTFKYENVTAVPSTVD 677
Query: 137 WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCS 194
WR+KGAVT IK+QG CG CWAFSAVAA EGI +T GKLI LSEQ+LVDC T + GC
Sbjct: 678 WRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCE 737
Query: 195 GGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQ 254
GGLMD AF+++I+N GL TEA+YPY+ G C+ + TI YED+P +E AL +
Sbjct: 738 GGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANNEKALQK 797
Query: 255 AVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNS 314
AV QPVSV ++ASG F+FYK GV CG DHGV VG+G + DG +YWL+KNS
Sbjct: 798 AVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVS--NDGTEYWLVKNS 855
Query: 315 WGETWGESGYIRILR----DEGLCGIATEASYPVA 345
WG WGE GYIR+ R +EGLCGIA +ASYP A
Sbjct: 856 WGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 890
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 180/326 (55%), Positives = 225/326 (69%), Gaps = 14/326 (4%)
Query: 28 VSGRSMHEPSIV-EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGN-RTYK 85
V+ R++ + SI+ EKHEQWM +G+ YKD E+ RL IFK+N+ YIE +N GN + YK
Sbjct: 26 VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85
Query: 86 LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
LG N+F+DLTNEEF AS N+ + ++ STFKY+N + VP+++DWR+KGAVT
Sbjct: 86 LGINQFADLTNEEFIASR---NKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGAVTP 141
Query: 146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFE 203
+KNQG CG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC T + GC GGLMD AF+
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201
Query: 204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSV 263
+II+N GL TEA YPYQ GTC K A TI YED+P +E AL +AV QP+SV
Sbjct: 202 FIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISV 261
Query: 264 CVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
++ASG F+FYK GV CG DHGV VG+G DG KYWL+KNSWG WGE G
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVG--NDGTKYWLVKNSWGTDWGEEG 319
Query: 324 YIRILRD----EGLCGIATEASYPVA 345
YI++ R EGLCGIA EASYP A
Sbjct: 320 YIKMQRGVDAAEGLCGIAMEASYPTA 345
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 361 bits (927), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 179/341 (52%), Positives = 233/341 (68%), Gaps = 10/341 (2%)
Query: 11 IPMFVIIILVIT-CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
I +F+I+ LV + C S +S E + +KH++WMA+HGRTY D EK R +FK+N
Sbjct: 6 IKIFLIVSLVSSFCFSTTLSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRN 65
Query: 70 LEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
+E IE+ N RT+KL N+F+DLTN+EFR YTGY S+ ++ ++F+YQNV
Sbjct: 66 VERIERLNNVPAGRTFKLAVNQFADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNV 125
Query: 129 T--DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+P ++DWR+KGAVT IKNQG CG CWAFSAVAA+EG TQI GKLI LSEQQLVDC
Sbjct: 126 FFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDC 185
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
T++ GCSGGLMD AFE+I+ GL TE++YPY+ E C + K +AA+I YED+P
Sbjct: 186 DTNDFGCSGGLMDTAFEHIMATGGLTTESNYPYKGEDANCKIKSTKPSAASITGYEDVPV 245
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
DE+AL++AV QPVSV +E G F+FY GV EC DH V VG+ ++ G+
Sbjct: 246 NDENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGY--SQSSAGS 303
Query: 307 KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
KYW+IKNSWG WGE GY+RI +D EGLCG+A +ASYP
Sbjct: 304 KYWIIKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYP 344
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 177/345 (51%), Positives = 230/345 (66%), Gaps = 11/345 (3%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIF 66
K+ + + ++L + + V+ RS+ + S+ E+HEQWM ++G+ YKD E+ R IF
Sbjct: 22 KNHFCHISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIF 81
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
K+N+ YIE N N+ YKL N+F+DLTNEEF A NR + R +TFKY+
Sbjct: 82 KENVNYIEAFNNAANKRYKLAINQFADLTNEEFIAPR---NRFKGHMCSSIIRTTTFKYE 138
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
NVT VP+++DWR+KGAVT IK+QG CG CWAFSAVAA EGI +T GKLI LSEQ+LVDC
Sbjct: 139 NVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDC 198
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
T + GC GGLMD AF+++I+N GL TEA+YPY+ G C+ + TI YED+
Sbjct: 199 DTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDV 258
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
P +E AL +AV QPVSV ++ASG F+FYK GV CG DHGV VG+G + D
Sbjct: 259 PANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVS--ND 316
Query: 305 GAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
G +YWL+KNSWG WGE GYIR+ R +EGLCGIA +ASYP A
Sbjct: 317 GTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 361
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 360 bits (925), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 178/326 (54%), Positives = 227/326 (69%), Gaps = 14/326 (4%)
Query: 28 VSGRSMHEPSIV-EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGN-RTYK 85
V+ R++ + SI+ EKHEQWM +G+ YKD E+ RL IFK+N+ YIE +N GN + YK
Sbjct: 26 VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85
Query: 86 LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
LG N+F+D+TNEEF AS N+ + ++ STFKY+N + VP+++DWR+KGAVT
Sbjct: 86 LGINQFADITNEEFIASR---NKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGAVTP 141
Query: 146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFE 203
+KNQG CG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC T + GC GGLMD AF+
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201
Query: 204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSV 263
+II+N GL TEA YPYQ GTC + AATI YED+P +E+AL +AV QP+SV
Sbjct: 202 FIIQNHGLHTEAQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISV 261
Query: 264 CVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
++ASG F+FYK GV CG DHGV VG+G + DG KYWL+KNSWG WGE G
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGIS--NDGTKYWLVKNSWGNDWGEEG 319
Query: 324 YIRILRD----EGLCGIATEASYPVA 345
YIR+ R +GLCGIA ASYP A
Sbjct: 320 YIRMQRSVDAAQGLCGIAMMASYPTA 345
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 360 bits (925), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 173/343 (50%), Positives = 234/343 (68%), Gaps = 11/343 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIV--EKHEQWMAQHGRTYKDELEKAMRLTIFKQ 68
I +F+I+ L+ + + R + + ++ ++H++WMA+HGR Y D EK R +FK+
Sbjct: 6 IQIFLIVSLISSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKR 65
Query: 69 NLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+E IE+ N RT+KL N+F+DLTN+EFR+ YTGY S+ ++ S+F+YQN
Sbjct: 66 NVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQN 125
Query: 128 VTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
V+ +P S+DWR+KGAVT IKNQG CG CWAFSAVAA+EG T+I GKLI LSEQQLVD
Sbjct: 126 VSSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVD 185
Query: 186 CSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
C T++ GCSGGLMD AFE+I+ GL TE++YPY+ + TC + K A +I YED+P
Sbjct: 186 CDTNDFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKNTKPTATSITGYEDVP 245
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
DE AL++AV QPVS+ +E G F+FY GV EC DH V VG+G + +G
Sbjct: 246 VNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHAVTAVGYG--QSSNG 303
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+KYW+IKNSWG WGESGY+RI +D +GLCG+A +ASYP
Sbjct: 304 SKYWIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYPT 346
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 176/346 (50%), Positives = 232/346 (67%), Gaps = 13/346 (3%)
Query: 11 IPMFVIIILVITC----ASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTI 65
+ ++ + L C +SQV R + +E ++ +H+QW+ H + YKD EK +R I
Sbjct: 6 LSQYLCLALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQI 65
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKY 125
FK+N+E IE N ++ YKLG N+FSDLTNEEFR +TGY R P V S + F+Y
Sbjct: 66 FKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHFRY 125
Query: 126 QNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
NVTD+P ++DWR+KGAVT IK+Q CG CWAFSAVAA+EG+ Q+ G+LI LSEQ+LVD
Sbjct: 126 TNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQELVD 185
Query: 186 CST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
C ++ GCSGGL+D AF++I++NKGL TE +YPY+ E G C+K+K +AA I YED
Sbjct: 186 CDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYED 245
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P E ALLQAV QPVSV ++ S F+FY GV + C +H V VG+G
Sbjct: 246 VPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGAT--T 303
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
DG KYW+IKNSWG WG+SGY+RI RD EGLCG+A +ASYP A
Sbjct: 304 DGTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 181/329 (55%), Positives = 223/329 (67%), Gaps = 13/329 (3%)
Query: 24 ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGN-R 82
A QV S + +I EKHEQWM +G+ YKD E+ RL IFK+N+ YIE +N GN +
Sbjct: 23 AIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNK 82
Query: 83 TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
YKLG N+F+DLTNEEF AS N+ + ++ STFKY+N + VP+++DWR+KGA
Sbjct: 83 LYKLGINQFADLTNEEFIASR---NKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGA 138
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDK 200
VT +KNQG CG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC T + GC GGLMD
Sbjct: 139 VTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDD 198
Query: 201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQP 260
AF++II+N GL TEA YPYQ GTC K A TI YED+P +E AL +AV QP
Sbjct: 199 AFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQP 258
Query: 261 VSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
+SV ++ASG F+FYK GV CG DHGV VG+G DG KYWL+KNSWG WG
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVG--NDGTKYWLVKNSWGTDWG 316
Query: 321 ESGYIRILRD----EGLCGIATEASYPVA 345
E GYI++ R EGLCGIA EASYP A
Sbjct: 317 EEGYIKMQRGVDAAEGLCGIAMEASYPTA 345
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 360 bits (924), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 173/349 (49%), Positives = 240/349 (68%), Gaps = 15/349 (4%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
++F K F ++ ++ S+ + R++ + + E+HEQWM Q+GR YKD+ E+A R
Sbjct: 1 MRFTKQFQFVCLALLFILGAWPSKSTA-RTLLDAPMYERHEQWMTQYGRVYKDDNERATR 59
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
+IFK+N+ I+ N + ++YKLG N+F+DLTNEEF+AS NR + + P
Sbjct: 60 YSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKASR---NRFKGHMCSPQAGP-- 114
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
F+Y+NV+ VP+++DWR++GAVT +K+QG CG CWAFSAVAA+EGI ++T GKLI LSEQ+
Sbjct: 115 FRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQE 174
Query: 183 LVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
+VDC T ++ GC+GGLMD AF++I +NKGL TEA+YPY+ GTC+ K AA I
Sbjct: 175 VVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTCNTNKAAIHAAKITG 234
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
+ED+P E AL++AV KQPVSV ++A G F+FY G+ C DHGV VG+G +
Sbjct: 235 FEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVS 294
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
DG+KYWL+KNSWG WGE GYIR+ +D EGLCGIA +ASYP A
Sbjct: 295 ---DGSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPTA 340
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 360 bits (923), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 175/337 (51%), Positives = 233/337 (69%), Gaps = 11/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++L +T + V+ R++ + S+ E+HEQWM ++G+ YKD E+ R +FK+N+ YIE
Sbjct: 12 LAMLLCMTFLAFQVTCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENVNYIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
N N++YKLG N+F+DLTN+EF A G+ + S R +TFK++NVT P++
Sbjct: 72 AFNNAANKSYKLGINQFADLTNKEFIAPRNGFKGHMCS---SIIRTTTFKFENVTATPST 128
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNG 192
+DWR+KGAVT IK+QG CG CWAFSAVAA EGI ++ GKLI LSEQ+LVDC T + G
Sbjct: 129 VDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDQG 188
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C GGLMD AF++II+N GL TEA+YPY+ G C+ + AATI YED+P +E AL
Sbjct: 189 CEGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEAAKNAATITGYEDVPANNEMAL 248
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QPVSV ++ASG F+FYK GV CG DHGV VG+G + +DG +YWL+K
Sbjct: 249 QKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVS--DDGTEYWLVK 306
Query: 313 NSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
NSWG WGE GYIR+ R +EGLCGIA +ASYP A
Sbjct: 307 NSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 343
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 359 bits (922), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 175/316 (55%), Positives = 221/316 (69%), Gaps = 17/316 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++HE+WMAQHGR Y D EK R IFK+N+E IE N +R YKLG N+F+DLTNE
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 98 EFRASYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSC 155
EFRA + GY R QSS+ S+F+++N++ +PTS+DWR+ GAVT +K+QG CG C
Sbjct: 61 EFRAMHHGYKR-------QSSKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCC 113
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLAT 213
WAFSAVAA+EGI ++ GKLI LSEQQLVDC + GC GGLMD AF++I+ N GL +
Sbjct: 114 WAFSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTS 173
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
EA YPYQ GTC +K + A I YED+P +E+ALLQAV KQPVSV VE G F+
Sbjct: 174 EATYPYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQ 233
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--- 330
FYK GV +CG DH V +G+GT DG YWL+KNSWG +WGESGY+R+ R
Sbjct: 234 FYKSGVFKGDCGTYLDHAVTAIGYGT--NSDGTNYWLVKNSWGTSWGESGYMRMQRGIGA 291
Query: 331 -EGLCGIATEASYPVA 345
EGLCG+A +ASYP A
Sbjct: 292 REGLCGVAMDASYPTA 307
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 173/327 (52%), Positives = 223/327 (68%), Gaps = 8/327 (2%)
Query: 23 CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR 82
C SQV S R +H+ S+ E+HEQWM ++G+ YKD E R IF+ N+E+IE N GN+
Sbjct: 20 CTSQVKS-RKLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNK 78
Query: 83 TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
YKL N +D TNEEF AS+ GY R +++ + FKY+NVTD+P ++DWR+KG
Sbjct: 79 PYKLSINHLADQTNEEFMASHKGYKGSHWQGLRITTQ-TPFKYENVTDIPWAVDWRQKGD 137
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAF 202
T IK+QG CG CWAFSAVAA EGI QIT G L+ LSEQ+LVDC + ++GC GGLM+ F
Sbjct: 138 ATSIKDQGQCGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSVDHGCDGGLMEHGF 197
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
E+II+N G+++EA+YPY GTCD KE + A I YE +P E L +AV QPVS
Sbjct: 198 EFIIKNGGISSEANYPYTAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVANQPVS 257
Query: 263 VCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
V ++A G AF+FY GV +CG DHGV VG+G+ +DG +YW++KNSWG WGE
Sbjct: 258 VSIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGST--DDGIQYWIVKNSWGTQWGEE 315
Query: 323 GYIRILR----DEGLCGIATEASYPVA 345
GYIR+LR EGLCGIA +ASYP A
Sbjct: 316 GYIRMLRGIDAQEGLCGIAMDASYPTA 342
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 177/344 (51%), Positives = 233/344 (67%), Gaps = 17/344 (4%)
Query: 13 MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQ 68
++ I + ++ C A QV S R++ + S+ E+H QWM+Q+G+ YKD E+ R IFK+
Sbjct: 7 LYHISLALLFCLGLFAIQVTS-RTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFKE 65
Query: 69 NLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+ YIE N + ++YKLG N+F+DLTNEEF AS N+ + R ++FKY+N
Sbjct: 66 NVNYIETFNNADDTKSYKLGINQFADLTNEEFIASR---NKFKGHMCSSIMRTTSFKYEN 122
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
V+ +P+++DWR+KGAVT +KNQG CG CWAFSAVAA EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 VSGIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCD 182
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
T + GC GGLMD AF++II+N GL+TEA YPY+ GTC+ K A TI YED+P
Sbjct: 183 TKGVDQGCEGGLMDDAFKFIIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVP 242
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
E AL +AV QP+SV ++ASG F+FYK GV CG DHGV VG+G + DG
Sbjct: 243 ANSEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGACGTELDHGVTAVGYGVS--NDG 300
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KYWL+KNSWG WGE GYI + R EG+CGIA +ASYP A
Sbjct: 301 TKYWLVKNSWGTDWGEEGYIMMQRGIEAAEGICGIAMQASYPTA 344
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 178/344 (51%), Positives = 232/344 (67%), Gaps = 17/344 (4%)
Query: 13 MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQ 68
++ I + ++ C A QV S R++ + S+ E+HE+WM +G+ YKD E+ R IF +
Sbjct: 7 LYHISLALVFCLGLWAIQVTS-RTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTE 65
Query: 69 NLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
N++YIE N + N +YKLG N+F+DLTNEEF AS N+ + R +TFKY+N
Sbjct: 66 NMKYIEAFNNGDNNESYKLGINQFADLTNEEFVASR---NKFKGHMCSSIIRTTTFKYEN 122
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
V+ +P+++DWR+KGAVT +KNQG CG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC
Sbjct: 123 VSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCD 182
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
T + GC GGLMD AF++II+N GL TEA YPYQ GTC+ K A TI YED+P
Sbjct: 183 TKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCNANKASIQATTITGYEDVP 242
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+E AL +AV QP+SV ++ASG F+FYK GV CG DHGV VG+G + DG
Sbjct: 243 ANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVS--NDG 300
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KYWL+KNSWG WGE GYI + R EGLCGIA +ASYP A
Sbjct: 301 TKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA 344
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 173/325 (53%), Positives = 223/325 (68%), Gaps = 13/325 (4%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK-EGNRTYKL 86
V+ R++ + + E+H QWM+Q+G+ YKD E+ R IF +N+ YIE NK + N+ Y L
Sbjct: 25 VTSRTLQD-DMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTL 83
Query: 87 GTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
G N+F+DLTN+EF +S N+ + +R STFKY+N + +P+S+DWR+KGAVT +
Sbjct: 84 GVNQFADLTNDEFTSSR---NKFKGHMCSSITRTSTFKYENASAIPSSVDWRKKGAVTPV 140
Query: 147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEY 204
KNQG CG CWAFSAVAA EGI +++ GKLI LSEQ+LVDC T + GC GGLMD AF++
Sbjct: 141 KNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKF 200
Query: 205 IIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVC 264
II+N GL TEA+YPYQ GTC+ K A TI YED+P +E AL +AV QP+SV
Sbjct: 201 IIQNHGLNTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPISVA 260
Query: 265 VEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
++ASG F+FYK GV CG DHGV VG+G + DG KYWL+KNSWG WGE GY
Sbjct: 261 IDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVS--NDGTKYWLVKNSWGTEWGEEGY 318
Query: 325 IRILRD----EGLCGIATEASYPVA 345
I + R EGLCGIA +ASYP A
Sbjct: 319 IMMQRGVDAAEGLCGIAMQASYPTA 343
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 173/347 (49%), Positives = 233/347 (67%), Gaps = 16/347 (4%)
Query: 11 IPMFVIIILVITC---ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
IP ++ +V+ C S V+S R + + ++VE+HEQWMAQHGR YKD EKA R F+
Sbjct: 3 IPKVFLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFR 62
Query: 68 QNLEYIEKANKEGNR-TYKLGTNEFSDLTNEEFRASYT--GYNRPVPSVSRQSSRPSTFK 124
N+ +IE N GNR + LG N+F+DLTN+EFRA+ T G+ + + ++S TF+
Sbjct: 63 NNVVFIESFNAAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTFR 122
Query: 125 YQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
Y NV+ +P ++DWR KGAVT IKNQG CG CWAFSAVAA EGI Q++ GKL+ LSEQ+
Sbjct: 123 YSNVSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQE 182
Query: 183 LVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
LVDC + ++GC GG MD AFE+II+N GL +E +YPY + G C + + ATI
Sbjct: 183 LVDCDANGADHGCEGGEMDDAFEFIIKNGGLTSETNYPYTAQDGQCKAKNTINSVATIKG 242
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
YED+P DE +L++AV QPVSV V+ F+ Y GVL+ CG + DHG+ VG+G A
Sbjct: 243 YEDVPANDEASLMKAVAAQPVSVAVDGGDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAA 302
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
+DG K+WL+KNSWG TWGE GYIR+ +D G+CG+A + SYP
Sbjct: 303 --DDGTKFWLMKNSWGTTWGEDGYIRMEKDVADAGGMCGLAMQPSYP 347
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 174/343 (50%), Positives = 232/343 (67%), Gaps = 12/343 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEK-HEQWMAQHGRTYKDELEKAMRLTIFKQN 69
+ +F+ + + + + R + I++K H +WM +HGR Y D EK+ R +FK N
Sbjct: 6 MQIFLFVAIFSSFYFSISLSRPLDNELIMQKRHIEWMTKHGRVYADVKEKSNRYVVFKSN 65
Query: 70 LEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQS-SRPSTFKYQN 127
+E IE N RT+KL N+F+DLTN+EFR+ YTG+ + V S+S QS ++ ++F+YQN
Sbjct: 66 VERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGF-KGVSSLSSQSQTKTTSFRYQN 124
Query: 128 VTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
V+ +P S+DWR KGAVT IKNQG CG CWAFSAVAA+EG TQI GKLI LSEQQLVD
Sbjct: 125 VSSGALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVD 184
Query: 186 CSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
C T++ GC GGLMD AFE+I+ GL TE++YPY+ E TC+ +K A +I YED+P
Sbjct: 185 CDTNDFGCEGGLMDTAFEHIMATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVP 244
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
DE AL++AV QPVSV +E G F+FY GV EC DH V +G+G + +G
Sbjct: 245 VNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYG--QSTNG 302
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+KYW+IKNSWG WGESGY+RI +D +GLCG+A +ASYP
Sbjct: 303 SKYWIIKNSWGTKWGESGYMRIQKDIKDKQGLCGLAMKASYPT 345
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 168/323 (52%), Positives = 220/323 (68%), Gaps = 10/323 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLG 87
+ R++++P+++ +HEQWMA HGR Y DE EK +R IFK N+ YI+ N +++Y L
Sbjct: 41 ATSRTLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLE 100
Query: 88 TNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F+DLTN+EFRAS GY + S S S F+Y NV+ VP +DWR++GAVT +K
Sbjct: 101 VNKFADLTNDEFRASRNGYKKQPDSDSHVVS--GLFRYANVSAVPDEVDWRKEGAVTPVK 158
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYI 205
+QG CG CWAFSAVAA+EGI ++ GKL+ LSEQ+LVDC D + GC GGLM+ AF++I
Sbjct: 159 DQGDCGCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFI 218
Query: 206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
+ KGLA E+ YPY E G C+ +K AA I +E +P +E ALLQAV QPVS+ +
Sbjct: 219 EKRKGLAAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAI 278
Query: 266 EASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
+ASG F+FY GV CG DH + VG+G DG KYWL+KNSWG +WGE+GYI
Sbjct: 279 DASGYEFQFYSGGVFTGSCGTELDHAITAVGYGAT--MDGTKYWLMKNSWGASWGENGYI 336
Query: 326 RILRD----EGLCGIATEASYPV 344
RI RD EGLCGIA + SYPV
Sbjct: 337 RIKRDSLAKEGLCGIAMDPSYPV 359
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 353 bits (907), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 179/352 (50%), Positives = 229/352 (65%), Gaps = 22/352 (6%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K + I +F+++ L I Q++S R +HE S+ E+HEQWMA++G+ YKD EK
Sbjct: 1 MAFTSQKQYTIALFLLLALGI---PQMMS-RKLHETSMRERHEQWMAEYGKVYKDAAEKE 56
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK N+E+IE N N+ YKLG N +DLT EEF+AS G RP S+ P
Sbjct: 57 KRFLIFKHNVEFIESFNAAANKPYKLGVNHLADLTVEEFKASRNGLKRPY----ELSTTP 112
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHC-GSCWAFSAVAAVEGITQITGGKLIELS 179
FKY+NVT +P +IDWR KGAVT IK+QG C GSCWAFS VAA EGI QIT GKL+ LS
Sbjct: 113 --FKYENVTAIPAAIDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLS 170
Query: 180 EQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
EQ+LVDC T + GC GG M+ FE+II+N G+ +EA+YPY+ G C+K + A
Sbjct: 171 EQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSEANYPYKAVDGKCNKA--TSPVAQ 228
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I YE +P E L +AV QPVSV ++A+G+ F FY G+ N ECG DHGV VG+
Sbjct: 229 IKGYEKVPPNSEKTLQKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGY 288
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
G A +G YWL+KNSWG WGE GY+R+ R GLCGIA ++SYP A
Sbjct: 289 GIA---NGTDYWLVKNSWGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYPTA 337
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 353 bits (906), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 180/335 (53%), Positives = 231/335 (68%), Gaps = 13/335 (3%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
+ L + S + R++ + E HEQWM QHG+ YK EK R IFK+N+ YIE
Sbjct: 14 LFLCLGLLSFQATSRTLQNDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNYIEAF 73
Query: 77 NKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
N GN++YKLG N F+DLTN EF A+ +N + S +TFKY+NV+DVP+++D
Sbjct: 74 NNVGNKSYKLGLNHFADLTNHEFIAARNKFNGYL-----HGSIITTFKYKNVSDVPSAVD 128
Query: 137 WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCS 194
WR++GAVT +KNQG CG CWAFSAVA+ EGI ++T G L+ LSEQ+LVDC T+ + GC
Sbjct: 129 WRQEGAVTPVKNQGQCGCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTNGEDQGCE 188
Query: 195 GGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQ 254
GGLMD AFE+II+N GL+TEA+YPYQ GTC+K + ++AATI YE++P DE AL +
Sbjct: 189 GGLMDDAFEFIIQNNGLSTEAEYPYQGVDGTCNKTEVGSSAATISGYENVPVNDEQALQK 248
Query: 255 AVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNS 314
AV QPVSV ++ASG F+FYK GV CG DHGVAVVG+G E+E +YWL+KNS
Sbjct: 249 AVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVAVVGYGVGEDE--TEYWLVKNS 306
Query: 315 WGETWGESGYIRILR----DEGLCGIATEASYPVA 345
WG WGE GYIR+ R EGLCGIA + SYP A
Sbjct: 307 WGTQWGEEGYIRMQRGVDASEGLCGIAMQPSYPTA 341
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 353 bits (906), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 168/340 (49%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS E S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY YQ EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 353 bits (905), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 168/314 (53%), Positives = 225/314 (71%), Gaps = 14/314 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+ E+HEQWM Q+GR YKD+ E+A R +IFK+N+ I+ N + ++YKLG N+F+DLTNE
Sbjct: 1 MYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNE 60
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+AS NR + + P F+Y+NV+ VP+++DWR++GAVT +K+QG CG CWA
Sbjct: 61 EFKASR---NRFKGHMCSPQAGP--FRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWA 115
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEA 215
FSAVAA+EGI ++T GKLI LSEQ++VDC T ++ GC+GGLMD AF++I +NKGL TEA
Sbjct: 116 FSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEA 175
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
+YPY+ GTC+ +K AA I +ED+P E AL++AV KQPVSV ++A G F+FY
Sbjct: 176 NYPYKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFY 235
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----E 331
G+ C DHGV VG+G + DG+KYWL+KNSWG WGE GYIR+ +D E
Sbjct: 236 SSGIFTGSCDTQLDHGVTAVGYGVS---DGSKYWLVKNSWGAQWGEEGYIRMQKDISAKE 292
Query: 332 GLCGIATEASYPVA 345
GLCGIA +ASYP A
Sbjct: 293 GLCGIAMQASYPTA 306
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 179/352 (50%), Positives = 233/352 (66%), Gaps = 15/352 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M LK + F+ + I C S +S +E + ++H +WM +HGR Y D E+
Sbjct: 1 MALKHMQIFLF----VAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEEN 56
Query: 61 MRLTIFKQNLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQS-S 118
R +FK N+E IE N RT+KL N+F+DLTN+EFR+ YTG+ + V ++S QS +
Sbjct: 57 NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGF-KGVSALSSQSQT 115
Query: 119 RPSTFKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
+ S F+YQNV+ +P S+DWR+KGAVT IKNQG CG CWAFSAVAA+EG TQI GKLI
Sbjct: 116 KMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLI 175
Query: 177 ELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAA 236
LSEQQLVDC T++ GC GGLMD AFE+I GL TE++YPY+ E TC+ +K A
Sbjct: 176 SLSEQQLVDCDTNDFGCEGGLMDTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNPKAT 235
Query: 237 TIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVG 296
+I YED+P DE AL++AV QPVSV +E G F+FY GV EC DH V +G
Sbjct: 236 SITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIG 295
Query: 297 FGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+G E +G+KYW+IKNSWG WGESGY+RI +D +GLCG+A +ASYP
Sbjct: 296 YG--ESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 351 bits (901), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 167/340 (49%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKKN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I GKL+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAEGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 172/324 (53%), Positives = 222/324 (68%), Gaps = 11/324 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLG 87
V+ R++ + S+ E+HE+WMA++ + YKD E+ R IFK+N+ YIE N N+ YKLG
Sbjct: 25 VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLG 84
Query: 88 TNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F+DLTNEEF A NR + +R +TFKY+NVT +P+++DWR+KGAVT IK
Sbjct: 85 INQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTPIK 141
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYI 205
+QG CG CWAFSAVAA EGI + GKLI LSEQ++VDC T ++ GC+GG MD AF++I
Sbjct: 142 DQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFI 201
Query: 206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
I+N GL TEA+YPY+ G C+ + AATI YED+P +E AL +AV QPVSV +
Sbjct: 202 IQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAI 261
Query: 266 EASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
+ASG F+FYK GV CG DHGV VG+G + DG +YWL+KNSWG WGE GYI
Sbjct: 262 DASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVS--ADGTQYWLVKNSWGTEWGEEGYI 319
Query: 326 RILR----DEGLCGIATEASYPVA 345
+ R EGLCGIA ASYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 167/340 (49%), Positives = 236/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GGLM AF++IIEN G++ E+DY Y EQ TC + +EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTC-RSREKTAAVQISSYKVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT EE G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADQINHAVTAIGYGTDEE--GQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 179/351 (50%), Positives = 232/351 (66%), Gaps = 15/351 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M LK + F+ + I C S +S +E + ++H +WM +HGR Y D E+
Sbjct: 1 MALKHMQIFLF----VAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEEN 56
Query: 61 MRLTIFKQNLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQS-S 118
R +FK N+E IE N RT+KL N+F+DLTN+EF + YTG+ + V ++S QS +
Sbjct: 57 NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGF-KGVSALSSQSQT 115
Query: 119 RPSTFKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
+ S F+YQNV+ +P S+DWR+KGAVT IKNQG CG CWAFSAVAA+EG TQI GKLI
Sbjct: 116 KMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLI 175
Query: 177 ELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAA 236
LSEQQLVDC T++ GC GGLMD AFE+I GL TE+DYPY+ E TC+ +K A
Sbjct: 176 SLSEQQLVDCDTNDFGCEGGLMDTAFEHIKATGGLTTESDYPYKGEDATCNSKKTNPKAT 235
Query: 237 TIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVG 296
+I YED+P DE AL++AV QPVSV +E G F+FY GV EC DH V +G
Sbjct: 236 SITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIG 295
Query: 297 FGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
+G E +G+KYW+IKNSWG WGESGY+RI +D +GLCG+A +ASYP
Sbjct: 296 YG--ESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYP 344
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 171/327 (52%), Positives = 217/327 (66%), Gaps = 18/327 (5%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
SQV+ R +HE S+ E+HEQWM ++G+ YKD EK R IFK N+E+IE N +GN+ Y
Sbjct: 22 SQVMC-RKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPY 80
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
KLG N +DLT EEF+AS G+ RP +TFKY+NVT +P +IDWR KGAVT
Sbjct: 81 KLGVNHLADLTVEEFKASRNGFKRP------HEFSTTTFKYENVTAIPAAIDWRTKGAVT 134
Query: 145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAF 202
IK+QG CGSCWAFS +AA EGI QIT GKL+ LSEQ+LVDC T + GC GG M+ F
Sbjct: 135 PIKDQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGF 194
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
E+II+N G+ +E +YPY+ G C+K + A I YE +P E AL +AV QPVS
Sbjct: 195 EFIIKNGGITSETNYPYKAVDGKCNKA--TSPVAQIKGYEKVPPNSETALQKAVANQPVS 252
Query: 263 VCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
V ++A G F FY G+ N ECG DHGV VG+GTA +G YW++KNSWG WGE
Sbjct: 253 VSIDADGAGFMFYSSGIYNGECGTELDHGVTAVGYGTA---NGTDYWIVKNSWGTQWGEK 309
Query: 323 GYIRILR----DEGLCGIATEASYPVA 345
GY+R+ R GLCGIA ++SYP +
Sbjct: 310 GYVRMQRGIAAKHGLCGIALDSSYPTS 336
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 350 bits (897), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 174/320 (54%), Positives = 221/320 (69%), Gaps = 13/320 (4%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
+ + S+ E+H +WMA+HGRTYKD EK RL IFK N+EYIE N G R Y+L N+F+
Sbjct: 26 LGDASMAERHVEWMARHGRTYKDAAEKEQRLGIFKSNVEYIESFNA-GKRKYQLAANQFA 84
Query: 93 DLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
DLT+EEF+A +TG+ PS + + F++ +++ VP S+DWR KGAVT +K+QG C
Sbjct: 85 DLTHEEFKAMHTGFK---PSGTGAKKAGNGFRHGSLSSVPDSVDWRSKGAVTPVKDQGLC 141
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKG 210
GSCWAF+ VAAVEGIT+I GKLI LSEQQLVDC + GC GG MD AFE+I+ N G
Sbjct: 142 GSCWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGG 201
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA-SG 269
+ +EA+YPY++ Q C+ ATI +ED+P DE AL +AV QPVSV ++A S
Sbjct: 202 ITSEANYPYEEVQRLCNAHNASFVVATIESHEDVPTNDEKALRKAVANQPVSVGIDAGSS 261
Query: 270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
F+ Y GV + ECG + DH V VVG+GT DG KYWL KNSWGETWGE+GYIR+ R
Sbjct: 262 LDFQLYSGGVFSGECGTDLDHAVTVVGYGTT--SDGTKYWLAKNSWGETWGENGYIRMER 319
Query: 330 D----EGLCGIATEASYPVA 345
D EGLCGIA +ASYP A
Sbjct: 320 DVAAKEGLCGIAMQASYPTA 339
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 173/342 (50%), Positives = 222/342 (64%), Gaps = 16/342 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
F+ ++V T A + R + + I +HEQWMA++GR Y D EKA RL +FK N+
Sbjct: 3 FLFALVVCTFALGALGARDLADDDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKANVG 62
Query: 72 YIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT-- 129
+IE N GN + L N+F+D+T +EFRA + GY V +R + F+Y NV+
Sbjct: 63 FIESVNA-GNHKFWLEANQFADITKDEFRAMHKGYKMQVIG---SKARATGFRYANVSID 118
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
D+P S+DWR GAVT +K+QG CG CWAFS VA++EGI +++ GKLI LSEQ+LVDC
Sbjct: 119 DLPASVDWRANGAVTPVKDQGQCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVG 178
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC GGLMD AFE+I+ N GL TEADYPY GTC+ KE AA+I YED+P
Sbjct: 179 MQNKGCGGGLMDNAFEFIVNNGGLDTEADYPYTGADGTCNSNKESNIAASIKGYEDVPAN 238
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
DE +L +AV QPVS+ V+ FRFYK GVL CG DHGVA VG+G A DG K
Sbjct: 239 DEASLQKAVAAQPVSIAVDGGDDLFRFYKGGVLTGACGTELDHGVAAVGYGVA--GDGTK 296
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
YWL+KNSWG +WGE G+IR+ RD G+CG+A + SYP A
Sbjct: 297 YWLVKNSWGTSWGEDGFIRLERDVADEAGMCGLAMKPSYPTA 338
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 175/348 (50%), Positives = 231/348 (66%), Gaps = 21/348 (6%)
Query: 6 EKSFIIPMFVIIILVITCASQVVSGRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRL 63
+K +I+ +F+++ + I S+V+S R +HE S++E+HEQWMA++ + YKD EK R
Sbjct: 7 QKQYILALFLLLAVGI---SRVIS-RELHETETSLIERHEQWMAKYDKVYKDAAEKEKRF 62
Query: 64 TIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF 123
IFK N+E+IE N GN+ YKLG N +DLT EEF+AS G R ++F
Sbjct: 63 LIFKDNVEFIESFNAAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYD----YEVGTTSF 118
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
KY+NVT +P S+DWR+KGAVT IK+QG CGSCWAFS VAA EGI +I+ GKL+ LSEQ+L
Sbjct: 119 KYENVTAIPASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQEL 178
Query: 184 VDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
VDC + GC GG M+ FE+II+N G+ TEA+YPY+ G+C + A AA I Y
Sbjct: 179 VDCDRKGTDQGCEGGYMEDGFEFIIKNGGITTEANYPYKAVDGSC--KNATAPAAQIKGY 236
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
E +P E ALL+AV QPVSV ++A+ +F FY G+ ECG DHGV VG+G A
Sbjct: 237 EKVPVNSEKALLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYGRA- 295
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
+G YW++KNSWG WGE GYIR+ R EGLCGIA ++SYP A
Sbjct: 296 --NGTDYWIVKNSWGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYPTA 341
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 350 bits (897), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 172/342 (50%), Positives = 230/342 (67%), Gaps = 13/342 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ +F I+ + + + HEPS +EKHEQWMA+ R Y+DELEK MR +FK+N
Sbjct: 7 LVTIFTILFTTFSISQATSRTVTFHEPSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
L++IE NK+GN++YKLG NEF+D TNEEF A +TG V ++ ++ N++
Sbjct: 67 LKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLSSKVVDETISSRSW---NIS 123
Query: 130 D-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
D V S DWR +GAVT +K QG CG CWAFSAVAAVEG+T+I GG L+ LSEQQL+DC
Sbjct: 124 DMVGVSKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDCDR 183
Query: 189 D-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
+ + GC GG+M AF YII+N+G+A+E DY YQ G C + AA I ++ +P
Sbjct: 184 EYDRGCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRC--RSSARPAARISGFQTVPSN 241
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
+E ALL+AV++QPVSV ++A+G F Y GV + CG + +H V VG+GT+ +DG K
Sbjct: 242 NEQALLEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTS--QDGTK 299
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
YWL KNSWGETWGE GYIRI RD +G+CG+A A YPVA
Sbjct: 300 YWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 341
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 349 bits (896), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 169/342 (49%), Positives = 228/342 (66%), Gaps = 14/342 (4%)
Query: 13 MFVIIILVITCASQV---VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
+ I + ++ C+ + V+ R++ + S+ E+HE+WM ++ + YKD E+ R IFK+N
Sbjct: 7 FYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ YIE N N+ Y LG N+F+DLTNEEF A NR + +R +TFKY+NVT
Sbjct: 67 VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVT 123
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P+++DWR+KGAVT IK+QG CG CWAFSAVAA EGI ++ GKLI LSEQ++VDC T
Sbjct: 124 AIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTK 183
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
++ GC+GG MD AF++II+N GL E +YPY+ G C+ + ATI YED+P
Sbjct: 184 GEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVN 243
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
+E AL +AV QPVSV ++ASG F+FY+ GV CG DHGV VG+G + DG +
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVS--ADGTE 301
Query: 308 YWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
YWL+KNSWG WGE GYIR+ R +EGLCGIA ASYP A
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 169/342 (49%), Positives = 228/342 (66%), Gaps = 14/342 (4%)
Query: 13 MFVIIILVITCASQV---VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
+ I + ++ C+ + V+ R++ + S+ E+HE+WM ++ + YKD E+ R IFK+N
Sbjct: 7 FYQISLALLFCSGFLTFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ YIE N N+ Y LG N+F+DLTNEEF A NR + +R +TFKY+NVT
Sbjct: 67 VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVT 123
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P+++DWR+KGAVT IK+QG CG CWAFSAVAA EGI ++ GKLI LSEQ++VDC T
Sbjct: 124 AIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTK 183
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
++ GC+GG MD AF++II+N GL E +YPY+ G C+ + ATI YED+P
Sbjct: 184 GEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVN 243
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
+E AL +AV QPVSV ++ASG F+FY+ GV CG DHGV VG+G + DG +
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVS--ADGTE 301
Query: 308 YWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
YWL+KNSWG WGE GYIR+ R +EGLCGIA ASYP A
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 166/340 (48%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 169/339 (49%), Positives = 223/339 (65%), Gaps = 15/339 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ +++L+ C SQV+S R++HE S + E+HEQW ++G+ YKD EK RL IFK N+
Sbjct: 10 ILALVLLLPICISQVMS-RNLHEASXCMSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNV 68
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+IE N GN+ YKL N +D TNEEF AS+ GY + S + FKY+N+T
Sbjct: 69 EFIESFNAAGNKPYKLSINHLTDQTNEEFVASHNGYKH------KGSHSQTPFKYENITG 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
VP ++DWRE GAV +K+QG CG+CWAFS VA EGI QIT L+ LSEQ+LVDC + +
Sbjct: 123 VPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIYQITTSMLMSLSEQELVDCDSVD 182
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
+GC GG M+ FE+I +N G+++EA+YPY GT D KE + AA I YE +P E
Sbjct: 183 HGCDGGYMEGGFEFIXKNGGISSEANYPYTAVDGTYDANKEASPAAQIKGYETVPANSED 242
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
AL +AV QPVSV ++ G AF+F GV +CG DHGV VG+G+ +DG +YW+
Sbjct: 243 ALQKAVANQPVSVTIDVGGSAFQFNSSGVFTGQCGTQLDHGVTAVGYGST--DDGTQYWI 300
Query: 311 IKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
+KNSWG WGE GYIR+ R EGLCGIA +ASYP A
Sbjct: 301 VKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 339
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 173/342 (50%), Positives = 229/342 (66%), Gaps = 14/342 (4%)
Query: 13 MFVIIILVITCASQV---VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ I + ++ C + V+ R++ + S+ E+H QWMA++ + YKD E+ R IFK+N
Sbjct: 7 LYHISLALLFCMGFLAFQVTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ YIE N N++YKL N+F+DLTNEEF A NR + +R +TFKY+NVT
Sbjct: 67 VNYIETFNSADNKSYKLDINQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVT 123
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P+++DWR+KGAVT IK+QG CG CWAFSAVAA EGI + GKLI LSEQ++VDC T
Sbjct: 124 VIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTK 183
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
+ GC+GG MD AF++II+N GL TE +YPY+ G C+ + AATI YED+P
Sbjct: 184 GQDQGCAGGFMDGAFKFIIQNHGLNTEPNYPYKAADGKCNAKAAANHAATITGYEDVPVN 243
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
+E AL +AV QPVSV ++ASG F+FYK GV CG DHGV VG+G + DG +
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVS--ADGTE 301
Query: 308 YWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
YWL+KNSWG WGE GYIR+ R +EGLCGIA ASYP A
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 178/353 (50%), Positives = 233/353 (66%), Gaps = 28/353 (7%)
Query: 12 PMFVIIILVITC------ASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLT 64
P+ + I+ I C + V + R + + ++ +HE+WMAQHGR YKD EKA RL
Sbjct: 7 PLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLE 66
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT---GYNRPVPSVSRQSSRPS 121
+FK N+ +IE N G Y LG N+F+DLT+EEF+A+ T G++ P V R S
Sbjct: 67 VFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGV-----RVS 121
Query: 122 T-FKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
T FKY+NV+ +P S+DWR KGAVT IK+QG CG CWAFSAVAA+EGI +++ GKLI L
Sbjct: 122 TGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISL 181
Query: 179 SEQQLVDCSTDNN--GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAA 236
SEQ+LVDC D N GC GG +D AF++I+ N GL EA+YPY E G C AA
Sbjct: 182 SEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAA 241
Query: 237 TIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVG 296
+I YED+P DE +L++AV QPVSV V+AS F+FY GV+ ECG + DHGV V+G
Sbjct: 242 SIRGYEDVPANDEPSLMKAVAGQPVSVAVDAS--KFQFYGGGVMAGECGTSLDHGVTVIG 299
Query: 297 FGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+G A DG KYWL+KNSWG TWGE+GY+R+ +D G+CG+A + SYP A
Sbjct: 300 YGAA--SDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPTA 350
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 348 bits (894), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKTNDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y +Q TC + +EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTC-RSQEKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT EE G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYSGGTYDGSCADRINHAVTAIGYGTDEE--GQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y +Q TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E+G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DENGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 166/340 (48%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 172/324 (53%), Positives = 223/324 (68%), Gaps = 11/324 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLG 87
V+ R++ + S+ E+HEQWMA++G+ YKD EK R +FK+N+ YIE N N+ YKLG
Sbjct: 25 VASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLG 84
Query: 88 TNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F+DLT+EEF +N ++R +TFKY+NVT +P SIDWR+KGAVT IK
Sbjct: 85 INQFADLTSEEFIVPRNRFN---GHTRSSNTRTTTFKYENVTVLPDSIDWRQKGAVTPIK 141
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYI 205
NQG CG CWAFSA+AA EGI +I+ GKL+ LSEQ++VDC T ++GC GG MD AF++I
Sbjct: 142 NQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAFKFI 201
Query: 206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
I+N G+ TEA YPY+ G C+ ++E AATI YED+P +E AL +AV QPVSV +
Sbjct: 202 IQNHGINTEASYPYKGVDGKCNIKEEAVHAATITGYEDVPINNEKALQKAVANQPVSVAI 261
Query: 266 EASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
+ASG F+FYK G+ CG DHGV VG+G E +G KYWL+KNSWG WGE GYI
Sbjct: 262 DASGADFQFYKSGIFTGSCGTELDHGVTAVGYG--ENNEGTKYWLVKNSWGTEWGEEGYI 319
Query: 326 RILRD----EGLCGIATEASYPVA 345
+ R EG+CGIA ASYP A
Sbjct: 320 MMQRGVKAVEGICGIAMMASYPTA 343
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 168/341 (49%), Positives = 233/341 (68%), Gaps = 12/341 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK N
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDL 126
Query: 130 ---DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
D+P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I GKL+E SEQ+L+DC
Sbjct: 127 SDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDC 186
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
+T+N GC+GG M AF++IIEN G++ E+DY Y EQ TC Q EK AA I Y+ +P+
Sbjct: 187 TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPE 245
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
G E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G
Sbjct: 246 G-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQ 301
Query: 307 KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
KYWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 KYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 342
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 170/324 (52%), Positives = 222/324 (68%), Gaps = 11/324 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLG 87
V+ R++ + S+ E+HE+WMA++ + YKD E+ R IFK+N+ YIE N ++ YKLG
Sbjct: 25 VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLG 84
Query: 88 TNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F+DLTNEEF A N+ + +R +TFKY+NVT +P+++DWR+KGAVT IK
Sbjct: 85 INQFADLTNEEFIAPR---NKFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTPIK 141
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYI 205
+QG CG CWAFSAVAA EGI + GKLI LSEQ++VDC T ++ GC+GG MD AF++I
Sbjct: 142 DQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFI 201
Query: 206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
I+N GL TEA+YPY+ G C+ + AATI YED+P +E AL +AV QPVSV +
Sbjct: 202 IQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAI 261
Query: 266 EASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
+ASG F+FYK GV CG DHGV VG+G + DG +YWL+KNSWG WGE GYI
Sbjct: 262 DASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVS--ADGTQYWLVKNSWGTEWGEEGYI 319
Query: 326 RILR----DEGLCGIATEASYPVA 345
+ R EGLCGIA ASYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 166/340 (48%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC GG M AF++IIEN G++ E+DY Y +Q TC + +EK AA I Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTC-RSQEKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E+G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DENGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 168/320 (52%), Positives = 217/320 (67%), Gaps = 11/320 (3%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEF 91
++ + S+ E+HEQWM +HG+ YKD E+ R IF +N+ Y+E N N+ YKLG N+F
Sbjct: 125 TLQDASMYERHEQWMTRHGKVYKDPREREKRFRIFNENVNYVEAFNNAANKPYKLGINQF 184
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
DLTN+EF A NR + R +TFKY+NVT VP+++DWR+ GAVT +K+QG
Sbjct: 185 XDLTNQEFIAPR---NRFKGHMCSSIIRTTTFKYENVTTVPSTVDWRQNGAVTPVKDQGQ 241
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CG CWAFSAVAA EGI ++GGKLI LSEQ+LVDC T + GC GGLMD A+++II+N
Sbjct: 242 CGCCWAFSAVAATEGIHALSGGKLISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNH 301
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
GL TEA+YPY+ G C+ + AATI YED+P +E AL +AV QPVSV ++AS
Sbjct: 302 GLNTEANYPYKGVDGKCNANEAANHAATITGYEDVPANNEKALQKAVANQPVSVAIDASS 361
Query: 270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
F+FYK G CG DHGV VG+G ++ G KYWL+KNSWG WGE GYIR+ R
Sbjct: 362 SDFQFYKSGAFTGSCGTELDHGVTAVGYGVSDH--GTKYWLVKNSWGTEWGEEGYIRMQR 419
Query: 330 ----DEGLCGIATEASYPVA 345
+EG+CGIA +ASYP A
Sbjct: 420 GVDSEEGVCGIAMQASYPTA 439
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 347 bits (891), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y +Q TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC GG M AF++IIEN G++ E+DY Y +Q TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS E S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC GG M AF++I EN G+++E+DY Y +Q TC + +EK AA I Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTC-RSQEKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 166/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VIT + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 347 bits (889), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS E S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC GG M AF++I EN G+++E+DY Y +Q TC + +EK AA I Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTC-RSQEKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 167/341 (48%), Positives = 231/341 (67%), Gaps = 11/341 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
I +F+I+ LV + + R + E ++ ++H WM +HGR Y D EK R +FK+N+
Sbjct: 6 IQIFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNV 65
Query: 71 EYIEKANK-EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
E IE+ N+ + T+KL N+F+DLTNEEFR+ YTGY SV ++P++F+YQ+V+
Sbjct: 66 ESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGN--SVLSSRTKPTSFRYQHVS 123
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P S+DWR+KGAVT IK+QG CGSCWAFSAVAA+EG+ QI GKLI LSEQ+LVDC
Sbjct: 124 SDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCD 183
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+++GC GG M+ AF Y + GL +E++YPY+ GTC+ K K A +I +ED+P
Sbjct: 184 TNDDGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPAN 243
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
DE AL++AV PVS+ + G F+FY GV + EC + DHGVAVVG+G + +G+K
Sbjct: 244 DEKALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYG--KSSNGSK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YW++KNSWG WGE GY+RI +D G CG+A ASYP
Sbjct: 302 YWILKNSWGPKWGERGYMRIKKDTKAKHGQCGLAMNASYPT 342
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 346 bits (888), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 238/355 (67%), Gaps = 30/355 (8%)
Query: 11 IPMFVIIIL----VITCASQVVSGRSM---HEPSIVEKHEQWMAQHGRTYKDELEKAMRL 63
IP +++ + V C++ V++ R + E ++V +HEQWM QHGR YKDE +KA R
Sbjct: 3 IPKALLLAILGCGVCLCSAAVLAARELGGDDELAMVARHEQWMVQHGRVYKDETDKAHRF 62
Query: 64 TIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYT--GYNRPVPSVSRQSS 118
+FK N+++IE N GNR + LG N+F+DLTN+EFRA+ T G+N V V
Sbjct: 63 LVFKANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEFRATKTNKGFNPNVVKV----- 117
Query: 119 RPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
P+ F+YQN++ +P ++DWR KGAVT IK+QG CG CWAFSAVAA EGI +I+ GKL
Sbjct: 118 -PTGFRYQNLSIDALPQTVDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLT 176
Query: 177 ELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAA 234
LSEQ+LVDC ++ GC+GG MD AF++II+N GL TE++YPY + G C +
Sbjct: 177 SLSEQELVDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTTESNYPYTAQDGQC--KSGSNG 234
Query: 235 AATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAV 294
AATI YED+P DE AL++AV QPVSV V+ F+FY GV+ CG + DHG+A
Sbjct: 235 AATIKGYEDVPANDEAALMKAVASQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAA 294
Query: 295 VGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+G+G + DG KYWL+KNSWG TWGE+G++R+ +D +G+CG+A + SYP A
Sbjct: 295 IGYG--KTSDGTKYWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMQPSYPTA 347
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 346 bits (888), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 346 bits (888), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 169/328 (51%), Positives = 222/328 (67%), Gaps = 9/328 (2%)
Query: 23 CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR 82
C SQV S R +H+ S+ E+HEQWM ++G+ YKD E R IF+ N+E+IE N GN+
Sbjct: 20 CTSQVKS-RKLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNK 78
Query: 83 TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
YKL N +D TNEEF AS+ GY R +++ + FKY+NVTD+P ++DWR+KG
Sbjct: 79 PYKLSINHLADQTNEEFMASHKGYKGSHWQGLRITTQ-TPFKYENVTDIPWAVDWRQKGD 137
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAF 202
VT IK+Q CG+CWAFSAVAA EGI QIT G L+ LSE++LVDC + ++GC GGLM+ F
Sbjct: 138 VTSIKDQAQCGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSVDHGCDGGLMEHGF 197
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PV 261
E+II+N G+++EA+YPY GTCD KE + A I YE +P E L +AV Q +
Sbjct: 198 EFIIKNGGISSEANYPYTAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVANQLTM 257
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
SV ++A G AF+FY GV +CG DHGV VG+G+ + G +YW++KNSWG WGE
Sbjct: 258 SVSIDAGGSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTDY--GTQYWIVKNSWGTQWGE 315
Query: 322 SGYIRILR----DEGLCGIATEASYPVA 345
GYIR+LR EGLCGIA +ASYP A
Sbjct: 316 EGYIRMLRGIDAQEGLCGIAMDASYPTA 343
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 346 bits (888), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 346 bits (887), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + K +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 346 bits (887), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T+EEF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTEFKINDIS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +KNQG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y +Q TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C + +H V +G+GT +E+G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT--DENGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y +Q TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 167/327 (51%), Positives = 221/327 (67%), Gaps = 15/327 (4%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
+ V+S + PS+ E+HEQWM+++G+ YKD +EK R IFK N+E+IE N N+ Y
Sbjct: 23 TNVMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPY 82
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
KL N +DLT +EF+AS GY + + R+ + S FKY+NVT +P ++DWR KGAVT
Sbjct: 83 KLSVNHLADLTLDEFKASRNGYKK----IDREFATTS-FKYENVTAIPEAVDWRVKGAVT 137
Query: 145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAF 202
IK+QG CGSCWAFS VAA+EGI QIT GKLI LSEQ+LVDC T ++ GC GGLM+ F
Sbjct: 138 PIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGF 197
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
E+II+N G+ +E +YPY+ G+C+ A A I YE +P E +LL+AV QP+S
Sbjct: 198 EFIIKNGGITSETNYPYKAADGSCNTAT-TAPVAKITGYEKVPVNSEISLLKAVANQPIS 256
Query: 263 VCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
V ++AS +F FY G+ ECG DHGV VG+G+A +G YW++KNSWG WGE
Sbjct: 257 VSIDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA---NGTDYWIVKNSWGTVWGEK 313
Query: 323 GYIRILR----DEGLCGIATEASYPVA 345
GYIR+ R EGLCGIA ++SYP A
Sbjct: 314 GYIRMQRGIADKEGLCGIAMDSSYPTA 340
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 176/351 (50%), Positives = 231/351 (65%), Gaps = 28/351 (7%)
Query: 12 PMFVIIILVITC------ASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLT 64
P+ + I+ I C + V + R + + ++ +HE+WMAQHGR YKD EKA RL
Sbjct: 7 PLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLE 66
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT---GYNRPVPSVSRQSSRPS 121
+FK N+ +IE N G Y LG N+F+DLT+EEF+A+ T G++ P V R S
Sbjct: 67 VFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGV-----RVS 121
Query: 122 T-FKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
T FKY+NV+ +P S+DWR KGAVT IK+QG CG CWAFSAVAA+EG +++ GKLI L
Sbjct: 122 TGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVKLSTGKLISL 181
Query: 179 SEQQLVDCSTDNN--GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAA 236
SEQ+LVDC D N GC GG +D AF++I+ N GL EA+YPY E G C AA
Sbjct: 182 SEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAA 241
Query: 237 TIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVG 296
+I YED+P DE +L++AV QPVSV V+AS F+FY GV+ ECG + DHGV V+G
Sbjct: 242 SIRGYEDVPANDEPSLMKAVAGQPVSVAVDAS--KFQFYGGGVMAGECGTSLDHGVTVIG 299
Query: 297 FGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
+G A DG KYWL+KNSWG TWGE+GY+R+ +D G+CG+A + SYP
Sbjct: 300 YGAA--SDGTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYP 348
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y +Q TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC I +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 168/342 (49%), Positives = 227/342 (66%), Gaps = 14/342 (4%)
Query: 13 MFVIIILVITCASQV---VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
+ I + ++ C+ + V+ R++ + S+ E+HE+WM ++ + YKD E+ R IFK+N
Sbjct: 7 FYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ YIE N N+ Y LG N+F+DLTNEEF A NR + +R +TFKY+NVT
Sbjct: 67 VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVT 123
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P+++DWR+KGAVT IK+QG CG CWAFSAVAA EGI ++ GKLI LSEQ++VDC T
Sbjct: 124 AIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTK 183
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
++ GC+GG MD AF++II+N GL E +YPY+ G C+ + ATI YED+P
Sbjct: 184 GEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVN 243
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
+E AL +AV QPVSV ++ASG F+FY+ GV CG DHGV VG+G + DG +
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVS--ADGTE 301
Query: 308 YWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
YWL+KNSWG WGE GYIR+ R +EGL GIA ASYP A
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVKAEEGLXGIAMMASYPTA 343
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 168/341 (49%), Positives = 234/341 (68%), Gaps = 12/341 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN-V 128
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK N +
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDL 126
Query: 129 TD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC
Sbjct: 127 SDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDC 186
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
+T+N GC+GG M AF++IIEN G++ E+DY Y +Q TC Q EK AA I Y+ +P+
Sbjct: 187 TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPE 245
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
G E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT EE G
Sbjct: 246 G-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADRINHAVTAIGYGTDEE--GQ 301
Query: 307 KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
KYWL+KNSWG +WGE+GY++I+RD GLC IA +SYP
Sbjct: 302 KYWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 183/350 (52%), Positives = 226/350 (64%), Gaps = 37/350 (10%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M L EK I + V+ T ASQ ++ + ++E ++VEKHEQWMA+HGRTY+D EK
Sbjct: 1 MALSLEKKLAIALLVVFS---TWASQAMARQLINEDALVEKHEQWMARHGRTYQDSEEKE 57
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK NLEYI+ NK N+TY+LG N F+DL++EE+ A+YT PV
Sbjct: 58 RRFQIFKSNLEYIDNFNKASNQTYQLGLNNFADLSHEEYVATYTARKMPV---------- 107
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+VP SIDWR+ GAVT IKNQ CG CWAFSA AAVEGI + G + LS
Sbjct: 108 ---------EVPESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGI--VANG--VSLSA 154
Query: 181 QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
QQL+DC +DN GC GG M+ AF YII+N+G+A E DYPYQQ Q C + AAA I
Sbjct: 155 QQLLDCVSDNQGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQMCSS---RMAAAQISG 211
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEA-SGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFG 298
+ED+ DE AL++AV KQPVSV ++A S F+ YK GV A CG+ H V +VG+G
Sbjct: 212 FEDVTPKDEEALMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVTLVGYG 271
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGL----CGIATEASYPV 344
T+ EDG KYWL KNSWGETWGESGY+R+ RD GL CGIA ASYP
Sbjct: 272 TS--EDGTKYWLAKNSWGETWGESGYMRLQRDIGLEGGPCGIALYASYPT 319
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 170/351 (48%), Positives = 233/351 (66%), Gaps = 21/351 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSM---------HEPSIVEKHEQWMAQHGRTYKDELEKA 60
I+ +F ++ L S + S+ + +I+E +E W+AQH + Y EK
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGEKQ 62
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R ++FK N YI + N +GN +YKLG N+F+DL++EEF+A+Y G + + R S+ P
Sbjct: 63 NRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLG--AKLDTKKRLSNSP 120
Query: 121 ST-FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELS 179
S ++Y + D+P SIDWREKGAVT +K+QG CGSCWAFS VAAVEGI QI G L LS
Sbjct: 121 SPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180
Query: 180 EQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
EQ+LVDC T N GC+GGLMD AF++II N GL +E DYPY+ G+CD ++ A TI
Sbjct: 181 EQELVDCDTSYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYRKNAHVVTI 240
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YED+P+ DE +L +A QP+SV +EASG+AF+FY+ GV + CG DHGV +VG+G
Sbjct: 241 DDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVTLVGYG 300
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
+ E G YW++KNSWG++WGE G+IR+ R+ G+CGIA EASYP+
Sbjct: 301 S---ESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGIAMEASYPL 348
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 179/348 (51%), Positives = 238/348 (68%), Gaps = 17/348 (4%)
Query: 10 IIPMFV-IIILVITC-ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
I+ MFV + IL ++ SQ S + HEP + E H+QWM + R Y DELEK MR +FK
Sbjct: 4 ILFMFVSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFK 63
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN--RPVPSVSRQSSRPSTFKY 125
+NL++IEK NK+G+RTYKLG NEF+D T EEF A++TG +PS ++ +
Sbjct: 64 KNLKFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIPSSEFVDEMIPSWNW 123
Query: 126 QNVTDV--PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
NV+DV P DWR +GAVT +K QG CG CWAFS+VAAVEG+T+I GG L+ LSEQQL
Sbjct: 124 -NVSDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLVSLSEQQL 182
Query: 184 VDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
+DC + +NGC+GG+M AF YII+N+G+A+EA YPYQ+ +GTC + +A I ++
Sbjct: 183 LDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQETEGTC--RYNAKPSAWIRGFQ 240
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAE 301
+P +E ALL+AV++QPVSV ++A G F Y GV + CG + +H V VG+GT+
Sbjct: 241 TVPSNNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAVTFVGYGTSP 300
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
E G KYWL KNSWGETWGE+GYIRI RD +G+CG+A A YPVA
Sbjct: 301 E--GIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 346
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y +Q TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 IKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC GG M AF++I EN G+++E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 167/327 (51%), Positives = 220/327 (67%), Gaps = 15/327 (4%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
+ V+S + PS+ E+HEQWM+++G+ YKD +EK R IFK N+E+IE N N+ Y
Sbjct: 23 TNVMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPY 82
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
KL N +DLT +EF+AS GY + + R+ + S FKY+NVT +P ++DWR KGAVT
Sbjct: 83 KLSVNHLADLTLDEFKASRNGYKK----IDREFATTS-FKYENVTAIPEAVDWRVKGAVT 137
Query: 145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAF 202
IK+QG CGSCWAFS VAA+EGI QIT GKLI LSEQ+LVDC T ++ GC GGLM+ F
Sbjct: 138 PIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGF 197
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
E+II+N G+ +E +YPY+ G+C A A I YE +P E +LL+AV QP+S
Sbjct: 198 EFIIKNGGITSETNYPYKAADGSCSAAT-TAPVAKITGYEKVPVNSEISLLKAVANQPIS 256
Query: 263 VCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
V ++AS +F FY G+ ECG DHGV VG+G+A +G YW++KNSWG WGE
Sbjct: 257 VSIDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA---NGTDYWIVKNSWGTVWGEK 313
Query: 323 GYIRILR----DEGLCGIATEASYPVA 345
GYIR+ R EGLCGIA ++SYP A
Sbjct: 314 GYIRMQRGIADKEGLCGIAMDSSYPTA 340
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 169/350 (48%), Positives = 227/350 (64%), Gaps = 13/350 (3%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K ++ +F+ + + I SQV+ R +H+ ++ E+HE WMA++G+ YKD EK
Sbjct: 1 MAFTGQKQHMLALFLFLAVGI---SQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKE 56
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK N+E+IE N GN+ YKLG N +DLT EEF+ S G R S + +
Sbjct: 57 KRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTY-EFSTTTFKL 115
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQG-HCGSCWAFSAVAAVEGITQITGGKLIELS 179
+ FKY+NVTD+P +IDWR KGAVT IK+QG CGSCWAFS +AA EGI QI+ G L+ LS
Sbjct: 116 NGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLS 175
Query: 180 EQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
EQ+LVDC + ++GC GG M+ FE+II+N G+ +E +YPY+ GTC+ + A I
Sbjct: 176 EQELVDCDSVDDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIK 235
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
YE +P E AL +AV QPVSV + A+ F FY G+ N ECG + DHGV VG+GT
Sbjct: 236 GYEIVPSYSEEALQKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT 295
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
E+G YW++KNSWG WGE GYIR+ R G+CGIA ++SYP A
Sbjct: 296 ---ENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 169/350 (48%), Positives = 239/350 (68%), Gaps = 17/350 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K + ++ + + + VI+ + + RS + S+ E+HE WM++HGR YKDE+EK
Sbjct: 1 MAMKID---LMSILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKG 57
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK+N+++IE NK GN +YKLG NEF+D+T+EEF +TG N +PS S
Sbjct: 58 ERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGIN--IPSYLSPSPMS 115
Query: 121 ST-FKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIE 177
ST FK +++D +P+++DWRE GAVT +KNQG CG CWAFSAV ++EG +I G L+E
Sbjct: 116 STEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLME 175
Query: 178 LSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
SEQ+L+DC+T+N GC+GG M AF++I EN G+++E+DY YQ +Q TC Q EK AA
Sbjct: 176 FSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCRSQ-EKTAAVQ 234
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I Y+ +P+G E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+
Sbjct: 235 ISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGY 292
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
GT +E G KYWL+KNSWG +WGE+G+++I+RD G C IA +SYP
Sbjct: 293 GT--DEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPGGHCDIAKMSSYP 340
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y +Q TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++IIEN G++ E+DY Y +Q TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 163/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HG YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC GG M AF++I EN G+++E+DY Y EQ TC + +EK AA I Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTC-RSQEKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 167/338 (49%), Positives = 226/338 (66%), Gaps = 16/338 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
II C + + + + +V +HEQWMAQ+ R YKD EKA R +FK N+++IE
Sbjct: 103 AIIGFAFFCGAAMAARDLSDDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIE 162
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VP 132
N GN + LG N+F+DLTN+EFR++ T N+ + S + + P+ F+Y+NV+ +P
Sbjct: 163 SFNAGGNNKFWLGVNQFADLTNDEFRSTKT--NKGLKSSNMKI--PTGFRYENVSADALP 218
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DN 190
T+IDWR KGAVT IK+QG CG CWAFSAVAA EGI +I+ GKL+ L+EQ+LVDC ++
Sbjct: 219 TTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGED 278
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GGLMD AF++II+N GL TE+ YPY G C + +AATI YED+P DE
Sbjct: 279 QGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATIKGYEDVPANDEA 336
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
AL++AV QPVSV V+ F+FY GV+ CG + DHG+A +G+G + DG KYWL
Sbjct: 337 ALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--KTSDGTKYWL 394
Query: 311 IKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+KNSWG TWGE+GY+R+ +D G+CG+A E SYP
Sbjct: 395 MKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 432
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC I +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 231/340 (67%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P VS + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +KNQG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y +Q TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
YWL+KNSWG +WGE G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 172/351 (49%), Positives = 232/351 (66%), Gaps = 21/351 (5%)
Query: 10 IIPMFVIIILVITCAS------QVVSGRS---MHEPSIVEKHEQWMAQHGRTYKDELEKA 60
I+ +F ++ L S ++S S + + +I+E +E W+AQH + Y EK
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDEKQ 62
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
+ ++FK N YI + N +GN +YKLG N+F+DL++EEF+A+Y G + + R S P
Sbjct: 63 KKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKAAYLG--TKLDAKKRLSRSP 120
Query: 121 ST-FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELS 179
S ++Y D+P SIDWREKGAVT +KNQG CGSCWAFS VAAVEGI QI G L LS
Sbjct: 121 SPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180
Query: 180 EQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
EQ+LVDC T N GC+GGLMD AF++II N GL +E DYPY+ G+CD ++ A TI
Sbjct: 181 EQELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNGSCDAYRKNAHVVTI 240
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YED+P+ DE +L +A QP+SV +EASG+AF+FY+ GV + CG DHGV +VG+G
Sbjct: 241 DDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVTLVGYG 300
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
+ E G YWL+KNSWG +WGE G+I++ R+ G+CGIA EASYPV
Sbjct: 301 S---ESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPV 348
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 183/343 (53%), Positives = 239/343 (69%), Gaps = 16/343 (4%)
Query: 12 PMFVIIILVITCASQVVSGRSMHE--PSIVEK-HEQWMAQHGRTYKDELEKAMRLTIFKQ 68
P+ + ++ CA +S R++++ S+V K H+QWM Q+GR+Y ++ E R IF +
Sbjct: 6 PIIALCTMLWACAYTAMS-RTLYDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFME 64
Query: 69 NLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
NLEYIEK N GN++YKL N+FSDLTNEEF AS+TG S S R S +
Sbjct: 65 NLEYIEKFNNAPGNKSYKLDLNQFSDLTNEEFIASHTGLMIDPSKPSSSSKRASPASL-D 123
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
++D PTS+DWRE+GAVT +KNQG+CGSCWAFSAVAAVEGI +I G LI LSEQQLVDC+
Sbjct: 124 LSDTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCA 183
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
++ N GC GG MD AF YI EN G+A+E DY Y+ GTC + AA I YED+P
Sbjct: 184 SNEQNQGCGGGFMDNAFSYITEN-GIASENDYQYRGGAGTCQNNEMITPAARISGYEDVP 242
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
G++ LL AV++QPVSV + A GQ+F YK G+ + CG + +HGV +VG+GT+ EEDG
Sbjct: 243 AGEDQLLL-AVSQQPVSVAI-AVGQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTS-EEDG 299
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
KYWLIKNSWGE+WGE+GY+R+LR+ EG CGIA +AS+P
Sbjct: 300 TKYWLIKNSWGESWGENGYMRLLRESGQSEGHCGIAVKASHPT 342
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 167/339 (49%), Positives = 225/339 (66%), Gaps = 15/339 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ I+ + C+S V+S R + + ++VE+HEQWMA+ R YKD EKA R +FK N+ +
Sbjct: 8 LLAIVGCICLCSSAVLSARELGDTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N E NR + LG N+F+DLTN+EFRA+ T N+ + ++ P+ FKY NV+
Sbjct: 68 IESFNAE-NRKFWLGVNQFTDLTNDEFRATKT--NKGLKMSGGRA--PTGFKYSNVSIDA 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+PT++DWR KG VT IK+QG CG CWAFSAV A EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPTAVDWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
+ GC GG MD AF++II+N GL TEA+YPY + G C + ATI YED+P D
Sbjct: 183 VDQGCEGGEMDDAFKFIIKNGGLTTEANYPYTAQDGQCKTSIASNSVATIKGYEDVPAND 242
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E +L++AV QPVSV V+ F+ Y GV+ CG + DHG+A +G+G DG KY
Sbjct: 243 ESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIAAIGYGMT--SDGTKY 300
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
WL+KNSWG TWGESGY+R+ +D G+CG+A + SYP
Sbjct: 301 WLLKNSWGTTWGESGYLRMEKDISDKSGMCGLAMQPSYP 339
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (881), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + K +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 343 bits (881), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 165/312 (52%), Positives = 220/312 (70%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++H + YK EK R +F++NL +I++ N E N +Y LG NEF+DLT+E
Sbjct: 47 LLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G +P S RQ S + F+Y+++TD+P S+DWR+KGAV +K+QG CGSCWA
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPS--ANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWA 163
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QIT G L LSEQ+L+DC T N+GC+GGLMD AF+YII GL E D
Sbjct: 164 FSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDD 223
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+G C +QKE TI YED+P+ D+ +L++A+ QPVSV +EASG+ F+FYK
Sbjct: 224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV N +CG + DHGVA VG+G+++ G+ Y ++KNSWG WGE G+IR+ R+ EG
Sbjct: 284 GGVFNGQCGTDLDHGVAAVGYGSSK---GSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEG 340
Query: 333 LCGIATEASYPV 344
LCGI ASYP
Sbjct: 341 LCGINKMASYPT 352
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + K +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 167/341 (48%), Positives = 234/341 (68%), Gaps = 13/341 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST-FKYQNV 128
+++IE NK GN +YKLG NEF+D+T+EEF A +TG N P +S S PST FK ++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLS-PSPMPSTEFKINDL 125
Query: 129 TD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+D +P+++DWRE GAVT +KNQG CG CWAFSAV ++EG +I G L+E SEQ+L+DC
Sbjct: 126 SDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDC 185
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
+T+N GC+GG M AF++IIEN G++ E+DY Y +Q TC Q K AA I Y+ +P+
Sbjct: 186 TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQG-KTAAVQISNYQVVPE 244
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
G E +LLQAVTKQPVS+ + AS +FY G + C + +H V +G+GT +E G
Sbjct: 245 G-ETSLLQAVTKQPVSIGIAAS-HDLQFYAGGTYDGSCANRINHAVTAIGYGT--DEKGQ 300
Query: 307 KYWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
KYWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 301 KYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 163/311 (52%), Positives = 215/311 (69%), Gaps = 12/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ + E W+++HG+ YK EK R +F++NL +I++ NKE + +Y LG NEF+DL++E
Sbjct: 400 LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHE 458
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF++ Y G P R F+Y++V D+P S+DWR+KGAVTH+KNQG CGSCWA
Sbjct: 459 EFKSKYLGLRAEFP---RSRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCWA 515
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF +I N GL E D
Sbjct: 516 FSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDD 575
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTC++QKE TI YED+P+ DE +LL+A+ QP+SV +EASG+ F+FY
Sbjct: 576 YPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYS 635
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV N CG DHGVA VG+G+++ G Y ++KNSWG WGE GYIR+ R+ EG
Sbjct: 636 GGVFNGPCGTELDHGVAAVGYGSSK---GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEG 692
Query: 333 LCGIATEASYP 343
LCGI ASYP
Sbjct: 693 LCGINKMASYP 703
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 233/347 (67%), Gaps = 14/347 (4%)
Query: 8 SFIIPMFVIIILVITCASQ---VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLT 64
+F + +++L+ + S +V+ R++ E S++E+HE WM HGR YKD++EK R
Sbjct: 4 NFFLKNITVVLLLFSILSLYPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHRFK 63
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFK 124
FK+N+E+IE NK G + YKL N+++DLT EEF S+ G + + S ++ ++FK
Sbjct: 64 TFKENVEFIESFNKNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTSFK 123
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
Y +VT+VP S+DWR++G+VT +K+QG CG CWAFSA AA+EG QI +LI LSEQQL+
Sbjct: 124 YDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQLL 183
Query: 185 DCSTDNNGCSGGLMDKAFEYIIENK--GLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
DCST N GC GGLM A++++++N G+ TE +YPY++ Q C + E+ AA TI YE
Sbjct: 184 DCSTQNKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVC--KTEQPAAVTINGYE 241
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
+P DE +LL+AV QP+SV + A+ + F Y G+ + C +H V V+G+GT+ E
Sbjct: 242 VVPS-DESSLLKAVVNQPISVGIAANDE-FHMYGSGIYDGSCNSRLNHAVTVIGYGTS-E 298
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRDEGL----CGIATEASYPVA 345
EDG KYW++KNSWG WGE GY+RI RD G+ CGIA AS+P A
Sbjct: 299 EDGTKYWIVKNSWGSDWGEEGYMRIARDVGVDGGHCGIAKVASFPTA 345
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 343 bits (879), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 166/340 (48%), Positives = 225/340 (66%), Gaps = 16/340 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ I+ C + + + + ++V +HEQWMAQ+ R YKD EKA R +FK N+++
Sbjct: 8 ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKANVKF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR+ T N+ S + + P+ F+Y+NV+
Sbjct: 68 IESFNAGGNNKFWLGVNQFADLTNDEFRSIKT--NKGFKSSNMK--IPTGFRYENVSVDA 123
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+PT+IDWR KGAVT IK+QG CG CWAFSAVAA EGI +I+ GKL+ L+EQ+LVDC
Sbjct: 124 LPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHG 183
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
++ GC GGLMD AF++II N GL TE+ YPY G C + +AATI YED+P D
Sbjct: 184 EDQGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKC--KSGSNSAATIKGYEDVPAND 241
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+ F+FY GV+ CG + DHG+A +G+G + DG KY
Sbjct: 242 EAALMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYG--KTSDGTKY 299
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
WL+KNSWG TWGE+GY+R+ +D G+CG+A E SYP
Sbjct: 300 WLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 339
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (879), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (879), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 167/341 (48%), Positives = 224/341 (65%), Gaps = 18/341 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ I+ L + C + + + + ++V +HEQWMAQ+ R YKD EKA R +FK N+++
Sbjct: 8 ILAILGLALFCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN-RPVPSVSRQSSRPSTFKYQNVT-- 129
IE N GNR + LG N+F+DLTN+EFRA+ T +P P P+ F+Y+NV+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSP-----VKVPTGFRYENVSVD 122
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P SIDWR KGAVT IK+QG CG CWAFSAVAA EGI +I+ KLI LSEQ+LVDC
Sbjct: 123 ALPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVH 182
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
++ GC GGLMD AF++II+N GL TE+ YPY G C + +AA I +ED+P
Sbjct: 183 GEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTATDGKC--KSGTNSAANIKGFEDVPAN 240
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
DE AL++AV QPVSV V+ F+ Y GV+ CG + DHG+A +G+G + DG K
Sbjct: 241 DEAALMKAVANQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGTK 298
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YWL+KNSWG TWGE+GY+R+ +D G+CG+A E SYP
Sbjct: 299 YWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 339
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 170/327 (51%), Positives = 224/327 (68%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYK 85
++S + + E +I+E +E W+A+H R Y EK R ++FK N YI + N +GNR+YK
Sbjct: 26 IISSKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHN-QGNRSYK 84
Query: 86 LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ--NVTDVPTSIDWREKGAV 143
LG N+F+DL++EEF+A+Y G ++ SRP + +YQ + D+P SIDWREKGAV
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKL---DTKKRLSRPPSRRYQYSDGEDLPESIDWREKGAV 141
Query: 144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAF 202
T +K+QG CGSCWAFS VAAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AF
Sbjct: 142 TSVKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 201
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
E+II N GL +E DYPY G+CD ++ A TI YED+P+ DE +L +A QP+S
Sbjct: 202 EFIINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPIS 261
Query: 263 VCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
V +EASG+ F+FY GV + CG DHGV +VG+G+ E G YW +KNSWG++WGE
Sbjct: 262 VAIEASGREFQFYDSGVFTSTCGTQLDHGVTLVGYGS---ESGTDYWTVKNSWGKSWGEE 318
Query: 323 GYIRILRD-----EGLCGIATEASYPV 344
G+IR+ R+ G+CGIA EASYPV
Sbjct: 319 GFIRLQRNIEVASTGMCGIAMEASYPV 345
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 164/340 (48%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++E +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 175/306 (57%), Positives = 212/306 (69%), Gaps = 15/306 (4%)
Query: 46 MAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG 105
MA++GR YKD EK R IFK N+ IE NK ++TYKL NEF+DLTNEEFR+
Sbjct: 1 MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNR 60
Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVE 165
+ + S +TFKY+NVT VP++IDWR+KGAVT IK+Q CG CWAFSAVAA E
Sbjct: 61 FKAHI------CSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATE 114
Query: 166 GITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQ 223
GITQIT GKLI LSEQ+LVDC T +N GCSGGLMD AF + I+ GLA+EA YPY+ +
Sbjct: 115 GITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRF-IKIHGLASEATYPYEGDD 173
Query: 224 GTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE 283
GTC+ +KE AA I YED+P +E AL +AV QPV+V ++A G F+FY GV +
Sbjct: 174 GTCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQ 233
Query: 284 CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATE 339
CG DHGVA VG+G +DG YWL+KNSWG WGE GYIR+ RD EGLCGIA +
Sbjct: 234 CGTELDHGVAAVGYGIG--DDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQ 291
Query: 340 ASYPVA 345
ASYP A
Sbjct: 292 ASYPTA 297
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 168/341 (49%), Positives = 228/341 (66%), Gaps = 18/341 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS-IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
+ ++ C + ++ R ++E S +V +HEQWMAQ+ R YKD EKA R +FK N++
Sbjct: 8 ILAVLSFAFFCGA-ALAARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVK 66
Query: 72 YIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT-- 129
+IE N GNR + LG N+F+DLTN+EFR + T PS+ + S+ F+Y+NV+
Sbjct: 67 FIESFNTGGNRKFWLGINQFADLTNDEFRTTKTNKGFK-PSLDKVST---GFRYENVSVD 122
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P +IDWR GAVT IK+QG CG CWAFSAVAA EGI +I+ GKLI LSEQ+LVDC
Sbjct: 123 AIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCDVH 182
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
++ GC GGLMD AF++II+N GL TE++YPY G C + +AA I YED+P
Sbjct: 183 GEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKC--KSGSNSAANIKGYEDVPTN 240
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
DE AL++AV QPVSV V+ F+FY GV+ CG + DHG+A +G+G + DG K
Sbjct: 241 DEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--KTSDGTK 298
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YWL+KNSWG TWGE+GY+R+ +D +G+CG+A E SYP
Sbjct: 299 YWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYPT 339
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 342 bits (877), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 165/312 (52%), Positives = 220/312 (70%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++H + YK EK R +F++NL +I++ N E N +Y LG NEF+DLT+E
Sbjct: 47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G +P S RQ S + F+Y+++TD+P S+DWR+KGAV +K+QG CGSCWA
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPS--ANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWA 163
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QIT G L LSEQ+L+DC T N+GC+GGLMD AF+YII GL E D
Sbjct: 164 FSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDD 223
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+G C +QKE TI YED+P+ D+ +L++A+ QPVSV +EASG+ F+FYK
Sbjct: 224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV N +CG + DHGVA VG+G+++ G+ Y ++KNSWG WGE G+IR+ R+ EG
Sbjct: 284 GGVFNGKCGTDLDHGVAAVGYGSSK---GSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEG 340
Query: 333 LCGIATEASYPV 344
LCGI ASYP
Sbjct: 341 LCGINKMASYPT 352
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 169/337 (50%), Positives = 222/337 (65%), Gaps = 34/337 (10%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++ V+ + + RS+HE S+ E+HE WM Q+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEFRAS + + S + ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICS-----TEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNG 192
+DWR+KGAVT IK+QG CGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C+ +YPY GTC+++K AA I YED+P +E AL
Sbjct: 187 CT---------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 225
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QP++V ++ASG F+FY GV +CG DHGVA VG+GT+ +DG KYWL+K
Sbjct: 226 QKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTS--DDGMKYWLVK 283
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
NSW WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 284 NSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 176/347 (50%), Positives = 233/347 (67%), Gaps = 19/347 (5%)
Query: 13 MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQ 68
+F+++ L I SQ S + HEP + E H+QWM + R Y DELEK MR +FK+
Sbjct: 14 LFMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKK 73
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN--RPVPSVSRQSSRPSTFKYQ 126
NL++IEK NK+G+RTYKLG NEF+D T EEF A++TG +PS ++ +
Sbjct: 74 NLKFIEKFNKKGDRTYKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW- 132
Query: 127 NVTDVP--TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
NV+DV + DWR +GAVT +K QG CG CWAFS+VAAVEG+T+I G L+ LSEQQL+
Sbjct: 133 NVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLL 192
Query: 185 DCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
DC + +NGC+GG+M AF YII+N+G+A+EA YPYQ +GTC + +A I ++
Sbjct: 193 DCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQAAEGTC--RYNGKPSAWIRGFQT 250
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEE 302
+P +E ALL+AV+KQPVSV ++A G F Y GV + CG N +H V VG+GT+ E
Sbjct: 251 VPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPE 310
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
G KYWL KNSWGETWGE+GYIRI RD +G+CG+A A YPVA
Sbjct: 311 --GIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 170/344 (49%), Positives = 226/344 (65%), Gaps = 10/344 (2%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
SF ++I+ LV+ + V R + E E+HE+WMAQ+GR YKD EK R +FK
Sbjct: 3 SFSQNHYLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFK 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+ +IE N G++ + L N+F+DL +EEF+A + V ++S ++F+Y++
Sbjct: 63 NNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWV--ETSTETSFRYES 120
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC- 186
VT +P +IDWR++GAVT IK+QG CGSCWAFSAVAA EGI QIT GKL+ LSEQ+LVDC
Sbjct: 121 VTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCV 180
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
++ GC GG +D AFE+I + G+A+E YPY+ TC +KE A I YE +P
Sbjct: 181 KGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPS 240
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDG 305
+E ALL+AV QPVSV ++A AF++Y G+ NA CG + +H VAVVG+G A DG
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKA--LDG 298
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+KYWL+KNSWG WGE GYIRI RD EGLCGIA YP A
Sbjct: 299 SKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 183/347 (52%), Positives = 226/347 (65%), Gaps = 19/347 (5%)
Query: 8 SFIIPMFVIIILVI--TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTI 65
S I VI +L+I T SQ + ++ +I EKHEQWMA+HGRTY D EK R I
Sbjct: 4 SLQITKLVITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQI 63
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF-- 123
FK NL+YIE NK N+TYKLG N+FSDL+ EEF +Y GY P + ++ TF
Sbjct: 64 FKNNLDYIENFNKAFNKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTANTTVKPTFFS 123
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
Y N +VP SIDWRE G VT +KNQG CG CWAFSAVAAVEGI G LS QQL
Sbjct: 124 NYYNQDEVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGI----AGNGASLSAQQL 179
Query: 184 VDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
+DC DN+GC GG M KAFEYI++N+G+ ++ DYPY+Q Q C + AA I YE
Sbjct: 180 LDCVGDNSGCGGGTMIKAFEYIVQNQGIVSDTDYPYEQTQEMC--RSGSNVAARITGYES 237
Query: 244 LPKGDEHALLQAVTKQPVSVCVEA-SGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAE 301
+ + +E AL +AV KQP+SV ++A SG F+ Y GV +AE CG + H V +VG+GT
Sbjct: 238 VIQSEE-ALKRAVAKQPISVAIDASSGPNFKSYISGVFSAEDCGTHLTHAVTLVGYGTT- 295
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
EDG KYWL+KNSWGE WGESGY+R+ RD EG CGIA +ASYP
Sbjct: 296 -EDGTKYWLVKNSWGEEWGESGYMRLQRDVGAMEGPCGIAMQASYPT 341
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 166/347 (47%), Positives = 233/347 (67%), Gaps = 17/347 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K + ++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK
Sbjct: 1 MAMKID---LMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKG 57
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK+N+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S P
Sbjct: 58 ERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS-----P 112
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
S + D+P+++DWRE GAVT +KNQG CG CWAFSAV ++EG +I G L+E SE
Sbjct: 113 SPINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 172
Query: 181 QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
Q+L+DC+T+N GC+GG M AF++I EN G++ E+DY Y +Q TC Q EK AA I
Sbjct: 173 QELLDCTTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISS 231
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
Y+ +P+G E +LLQAVTKQPVS+ + AS Q +FY G + C + +H V +G+GT
Sbjct: 232 YQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT- 288
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
+E G KYWL+KNSWG +WGE G+++I+RD GLC IA +SYP
Sbjct: 289 -DEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 170/337 (50%), Positives = 223/337 (66%), Gaps = 32/337 (9%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++ V+ + + R++HE S+ E+HE WMAQ+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEF S + + S + ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFGTSRNRFKAHICS-----TEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNG 192
IDWR+KGAVT IK+QG CGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 IDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C+G A+YPY GTC+++K AA I YED+P +E AL
Sbjct: 187 CNG-------------------ANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 227
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QP++V ++A G F+FY GV +CG DHGVA VG+GT+ +DG KYWL+K
Sbjct: 228 QKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTS--DDGMKYWLVK 285
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
NSWG WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 286 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 322
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 163/340 (47%), Positives = 231/340 (67%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC I +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 168/343 (48%), Positives = 227/343 (66%), Gaps = 11/343 (3%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+FIIPMF +I V+S R + EP + KHE+WM Q G++YKD EK R IFK
Sbjct: 6 NFIIPMF--LIFTTWMLPYVMSSRVL-EPYLSNKHEKWMTQFGKSYKDAAEKEKRFQIFK 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+E+IE N GN+ + L N F+DLTNEEF+AS G N+ + + ++F+Y N
Sbjct: 63 NNVEFIELFNAVGNKPFNLSINHFADLTNEEFKASLNG-NKKLHDKFDILNETTSFRYHN 121
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
VT VP S+DWR++GAVT IKNQG CGSCWAFS VA++EGI QIT G+L+ LSEQ+L+DC
Sbjct: 122 VTSVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLSEQELIDCV 181
Query: 188 TDN-NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N +GCSGG ++ AF++I + G+A+E +YPY++ C +KE A I YE +P
Sbjct: 182 RGNSSGCSGGYLEDAFKFIAKKGGMASETNYPYKETDEKCKFKKESKHVAEIKGYEKVPS 241
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
E+ LL+AV QPVSV V+A F+FY G+ +CG + DH V +VG+G + D
Sbjct: 242 NSENDLLKAVANQPVSVYVDAGDYVFQFYSGGIFTGKCGTDTDHVVTIVGYGVS--LDYT 299
Query: 307 KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+YWL+KNSWG WGE GY+++ R+ +GLCGIAT SYPVA
Sbjct: 300 EYWLVKNSWGTGWGEKGYMKLKRNVDSKKGLCGIATNPSYPVA 342
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 165/325 (50%), Positives = 217/325 (66%), Gaps = 16/325 (4%)
Query: 28 VSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKL 86
V R ++E S+ E+HEQWM +HG+ Y+D +EK R IFK N+E+IE N N+ YKL
Sbjct: 25 VMSRKLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKL 84
Query: 87 GTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
N +DLT +EF+AS GY + + R+ + S FKY+NVT +P ++DWR KGAVT I
Sbjct: 85 SVNHLADLTLDEFKASRNGYKK----IDREFTTTS-FKYENVTAIPAAVDWRVKGAVTPI 139
Query: 147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEY 204
K+QG CGSCWAFS VAA EGI QIT GKL+ LSEQ+LVDC T ++ GC GGLM+ FE+
Sbjct: 140 KDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 205 IIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVC 264
II+N G+ +E +YPY+ G+C+ A G YE +P E +LL+AV QP+SV
Sbjct: 200 IIKNGGITSETNYPYKAADGSCNTATTTPVAKITG-YEKVPVNSEKSLLKAVANQPISVS 258
Query: 265 VEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
++AS +F FY G+ ECG DHGV VG+G+A +G YW++KNSWG WGE GY
Sbjct: 259 IDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA---NGTDYWIVKNSWGTVWGEKGY 315
Query: 325 IRILR----DEGLCGIATEASYPVA 345
IR+ R EGLCGIA ++SYP A
Sbjct: 316 IRMQRGIAAKEGLCGIAMDSSYPTA 340
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 164/338 (48%), Positives = 229/338 (67%), Gaps = 14/338 (4%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S PS +
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS-----PSPINDLSDD 121
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
D+P+++DWRE GAVT +KNQG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+T+
Sbjct: 122 DMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN 181
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
N GC+GG M AF++I EN G++ E+DY Y +Q TC Q EK AA I Y+ +P+G E
Sbjct: 182 NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG-E 239
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
+LLQAVTKQPVS+ + AS Q +FY G + C + +H V +G+GT +E G KYW
Sbjct: 240 TSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT--DEKGQKYW 296
Query: 310 LIKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
L+KNSWG +WGE G+++I+RD GLC IA +SYP
Sbjct: 297 LLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 176/349 (50%), Positives = 234/349 (67%), Gaps = 22/349 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH-------EPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+I ++++ + VV R + E ++ +H+QWMA+HGRTYKDE EKA R
Sbjct: 10 MITFTAAALMILAVMTMVVEARDLSTSTGGYGEEAMKVRHQQWMAEHGRTYKDEAEKARR 69
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
+FK N ++++++N G ++Y+L NEF+D+TN+EF A YTG +PVP+ + + +
Sbjct: 70 FQVFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAMYTGL-KPVPAGPK---KMAG 125
Query: 123 FKYQNVT--DVP-TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELS 179
FKY+N+T DV ++DWR+KGAVT IKNQG CG CWAF+AVAAVE I QIT G L+ LS
Sbjct: 126 FKYENLTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIHQITTGNLVSLS 185
Query: 180 EQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
EQQ++DC TD NNGC+GG +D AF+YII N GLATE YPY QGTC Q A TI
Sbjct: 186 EQQVLDCDTDGNNGCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTC--QSSVQPAVTI 243
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGD-NCDHGVAVVG 296
Y+D+P GDE AL AV QPV+V ++A F+FY GVL A+ CG + +H V VG
Sbjct: 244 SSYQDVPSGDEAALAAAVANQPVAVAIDAHNN-FQFYSSGVLTADTCGTPSLNHAVTAVG 302
Query: 297 FGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPVA 345
+ TA EDG YWL+KN WG+ WGE GY+R+ R CG+A +ASYPVA
Sbjct: 303 YSTA--EDGTPYWLLKNQWGQNWGEGGYLRVERGTNACGVAQQASYPVA 349
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 169/344 (49%), Positives = 226/344 (65%), Gaps = 10/344 (2%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
SF ++I+ LV++ + V R + E E+HE+WMAQ+GR YKD EK R +FK
Sbjct: 3 SFSQNHYLILFLVLSVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFK 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+ +IE N G++ + L N+F+DL +EEF+A + V ++S ++F+Y++
Sbjct: 63 NNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWV--ETSTQTSFRYES 120
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC- 186
VT +P +IDWR++GAVT IK+QG CGSCWAFSAVAA EGI QIT GKL+ LSEQ+LVDC
Sbjct: 121 VTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCV 180
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
++ GC GG +D AFE+I + G+A+E YPY+ TC +KE A I YE +P
Sbjct: 181 KGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPS 240
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDG 305
+E ALL+AV QPVSV ++A AF++Y G+ N CG + +H VAVVG+G A DG
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAVAVVGYGKA--LDG 298
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+KYWL+KNSWG WGE GYIRI RD EGLCGIA YP A
Sbjct: 299 SKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 167/342 (48%), Positives = 228/342 (66%), Gaps = 12/342 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
I +F+I+ LV + + + R + E ++ ++H +WM +HGR Y D EK R +FK+N
Sbjct: 6 IQIFLIVSLVSSFSLSITLSRPLLDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRN 65
Query: 70 LEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
+E IE+ N + T+KL N+F+DLTNEEFR+ YTG+ SV ++P++F+YQNV
Sbjct: 66 VERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGN--SVLSSRTKPTSFRYQNV 123
Query: 129 TD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+ +P S+DWR+KGAVT IK+QG CGSCWAFSAVAA+EG+ QI GKLI LSEQ+LVDC
Sbjct: 124 SSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDC 183
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
T++ GC GGLMD AF Y I GL +E++YPY+ GTC+ K K A +I +ED+P
Sbjct: 184 DTNDGGCMGGLMDTAFNYTITIGGLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPA 243
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
DE AL++AV PVS+ + F+FY GV + EC + DHGV VG+G ++G
Sbjct: 244 NDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSGECTTHLDHGVTAVGYG--RSKNGL 301
Query: 307 KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
KYW++KNSWG WGE GY+RI +D G CG+A ASYP
Sbjct: 302 KYWILKNSWGPKWGERGYMRIKKDIKPKHGQCGLAMNASYPT 343
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 163/340 (47%), Positives = 231/340 (67%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +F G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFCAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 175/348 (50%), Positives = 233/348 (66%), Gaps = 15/348 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTI 65
S ++ + V+IIL + R++ E S+V+KHEQWMA+ R Y+DELEK MR +
Sbjct: 3 SIMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKY 125
FK+NL++IE NK+GN++YKLG NEF+D TNEEF A +TG + + VS T
Sbjct: 63 FKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGL-KGLTEVSPSKVVAKTISS 121
Query: 126 Q--NVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
Q NV+D V S DWR +GAVT +K QG CG CWAFSAVAAVEG+ +I GG L+ LSEQQ
Sbjct: 122 QTWNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQ 181
Query: 183 LVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
L+DC + + GC GG+M AF Y+++N+G+A+E DY YQ G C + AA I +
Sbjct: 182 LLDCDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGC--RSNARPAARISGF 239
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
+ +P +E ALL+AV++QPVSV ++A+G F Y GV + CG + +H V VG+GT+
Sbjct: 240 QTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTS- 298
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+DG KYWL KNSWGETWGE GYIRI RD +G+CG+A A YPVA
Sbjct: 299 -QDGTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 163/340 (47%), Positives = 231/340 (67%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + K +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D +P+++DW E GAVT +K+QG CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
T+N GC+GG M AF++I EN G++ E+DY Y EQ TC Q EK AA I Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LLQAVTKQPVS+ + AS Q +FY G + C D +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG +WGE+G+++I+RD GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 167/337 (49%), Positives = 222/337 (65%), Gaps = 34/337 (10%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++ V+ + + R++HE S+ E+HE WM Q+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEFRAS + + S + ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFRASRNRFKAHICS-----TEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNG 192
+DWR+KGAVT IK+QG CGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C+ +YPY GTC+++K AA I YED+P +E AL
Sbjct: 187 CT---------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 225
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QP++V ++A G F+FY GV +CG DHGV+ VG+GT+ +DG KYWL+K
Sbjct: 226 QKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTS--DDGMKYWLVK 283
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
NSWG WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 284 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 339 bits (870), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 177/337 (52%), Positives = 226/337 (67%), Gaps = 32/337 (9%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++++ ASQ +S R++HE S+ E+HE WM +GRTYKD EK R IFK+N+EYIE
Sbjct: 10 ITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIE 68
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK F+AS GYN S +SS ++F+Y+NV VP+S
Sbjct: 69 SVNK--------------------FKASRNGYNM---SSRPRSSEITSFRYENVAAVPSS 105
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNG 192
+DWR+KGAVT IK+QG CG CWAFSAVAA+EG+TQ+ G+LI LSEQ+LVDC T ++ G
Sbjct: 106 MDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQG 165
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C GGLMD AFE+II N GL TEA+YPY+ TC+K+K ++AA I YED+P E AL
Sbjct: 166 CGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSEAAL 225
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
L+AV + PVSV ++A G F+FY GV +CG DHGV VG+G + +DG KYWL+K
Sbjct: 226 LKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYG--KTDDGTKYWLVK 283
Query: 313 NSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
NSWG WGE GYI + R DEGLCGIA EASYP A
Sbjct: 284 NSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 320
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 166/341 (48%), Positives = 228/341 (66%), Gaps = 17/341 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F I+ + C++ + + ++V +HE+WM Q+GR YKD EKA R IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN EFRA+ T +PS R P+TF+Y+NV+
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGF-IPSTVRV---PTTFRYENVSIDT 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+P ++DWR KGAVT IK+QG CG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE+ YPY G C+ +AATI YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSN--SAATIKGYEDVPANN 240
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+ F+FY GV+ CG + DHG+ +G+G ++ DG +Y
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG TWGE+G++R+ +D G+CG+A E SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 167/350 (47%), Positives = 225/350 (64%), Gaps = 13/350 (3%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K ++ +F+ + + I SQV+ R +H+ ++ E+HE WMA++G+ YKD EK
Sbjct: 1 MAFTGQKQHMLALFLFLAVGI---SQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKE 56
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK N+E+IE N GN+ YKLG N +DLT EEF+ S G R S + +
Sbjct: 57 KRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTY-EFSTTTFKL 115
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQG-HCGSCWAFSAVAAVEGITQITGGKLIELS 179
+ FKY+NVTD+P +IDWR KGAVT IK+QG CG WAFS +AA EGI QI+ G L+ LS
Sbjct: 116 NGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLS 175
Query: 180 EQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
EQ+LVDC + ++GC GG M+ FE+II+N G+ +E +YPY+ GTC+ + A I
Sbjct: 176 EQELVDCDSVDDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIK 235
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
YE +P E AL +AV QPVSV + A+ F FY G+ N ECG + DHGV VG+GT
Sbjct: 236 GYEIVPSYSEEALKKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT 295
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
E+G YW++KNSWG WGE GYIR+ R G+CGIA ++SYP A
Sbjct: 296 ---ENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 163/339 (48%), Positives = 225/339 (66%), Gaps = 20/339 (5%)
Query: 16 IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
++ ++ CAS V++ R + + ++VE+HE WM ++GR YKD EKA R FK N+ +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAF 66
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN--VTD 130
+E N + LG N+F+DLT EEF+A N+ +S + + FKY+N V+
Sbjct: 67 VESFNTNKKNKFWLGVNQFADLTTEEFKA-----NKGFKPISAEMVPTTGFKYENLSVSA 121
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+PT++DWR KGAVT IKNQG CG CWAFSAVAA+EGI +++ G LI LSEQ+LVDC T
Sbjct: 122 LPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHS 181
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
+ GC GG MD AFE++I+N GLATE+ YPY+ G C + +AATI +ED+P D
Sbjct: 182 MDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSK--SAATIKGHEDVPVND 239
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+AS + F Y GV+ CG DHG+A +G+G E DG KY
Sbjct: 240 EAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGV--ESDGTKY 297
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
W++KNSWG TWGE G++R+ +D +G+CG+A + SYP
Sbjct: 298 WILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 163/339 (48%), Positives = 226/339 (66%), Gaps = 20/339 (5%)
Query: 16 IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
++ ++ CAS V++ R + + ++VE+HE WM ++GR YKD EKA R +FK N+ +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVFKDNVAF 66
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN--VTD 130
+E N N + LG N+F+DLT EEF+A N+ +S + + FKY+N V+
Sbjct: 67 VESFNTNKNNKFWLGINQFADLTIEEFKA-----NKGFKPISAEKVPTTGFKYENLSVSA 121
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+PT++DWR KGAVT IKNQG CG CWAFSAVAA+EGI +++ G LI LSEQ+LVDC T
Sbjct: 122 LPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHS 181
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
+ GC GG MD AFE++I+N GLAT + YPY+ G C + +AATI +ED+P D
Sbjct: 182 MDEGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVDGKCKGGSK--SAATIKGHEDVPVND 239
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+AS + F Y GV+ CG DHG+A +G+G E DG KY
Sbjct: 240 EAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGV--ESDGTKY 297
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
W++KNSWG TWGE G++R+ +D +G+CG+A + SYP
Sbjct: 298 WILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 173/331 (52%), Positives = 226/331 (68%), Gaps = 15/331 (4%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
SQ S + HEP + E H+QWM + R Y DELEK MR +FK+NL++IEK NK+G+RTY
Sbjct: 6 SQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTY 65
Query: 85 KLGTNEFSDLTNEEFRASYTGYN--RPVPSVSRQSSRPSTFKYQNVTDVP--TSIDWREK 140
KLG NEF+D T EEF A++TG +PS ++ + NV+DV + DWR +
Sbjct: 66 KLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW-NVSDVAGRETKDWRYE 124
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMD 199
GAVT +K QG CG CWAFS+VAAVEG+T+I G L+ LSEQQL+DC + +NGC+GG+M
Sbjct: 125 GAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMS 184
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ 259
AF YII+N+G+A+EA YPYQ +GTC + +A I ++ +P +E ALL+AV+KQ
Sbjct: 185 DAFSYIIKNRGIASEASYPYQAAEGTC--RYNGKPSAWIRGFQTVPSNNERALLEAVSKQ 242
Query: 260 PVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
PVSV ++A G F Y GV + CG N +H V VG+GT+ E G KYWL KNSWGET
Sbjct: 243 PVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPE--GIKYWLAKNSWGET 300
Query: 319 WGESGYIRILRD----EGLCGIATEASYPVA 345
WGE+GYIRI RD +G+CG+A A YPVA
Sbjct: 301 WGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 331
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 337 bits (865), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 163/335 (48%), Positives = 227/335 (67%), Gaps = 11/335 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F+I+ LV + + R + E ++ ++H WM +HGR Y D EK R +FK+N+E
Sbjct: 2 IFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVES 61
Query: 73 IEKANK-EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD- 130
IE+ N+ + T+KL N+F+DLTNEEFR+ YTGY SV ++P++F+YQ+V+
Sbjct: 62 IERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGN--SVLSSRTKPTSFRYQHVSSD 119
Query: 131 -VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P S+DWR+KGAVT IK+QG CGSCWAFSAVAA+EG+ QI GKLI LSEQ+LVDC T+
Sbjct: 120 ALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN 179
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
++GC GG M+ AF Y + GL +E++YPY+ GTC+ K K A +I +ED+P DE
Sbjct: 180 DDGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDE 239
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
AL++AV PVS+ + G F+FY GV + EC + DHGVAVVG+G + +G+KYW
Sbjct: 240 KALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYG--KSSNGSKYW 297
Query: 310 LIKNSWGETWGESGYIRILRD----EGLCGIATEA 340
++KNSWG WGE GY+RI +D G CG+A A
Sbjct: 298 ILKNSWGPKWGERGYMRIKKDTKAKHGQCGLAMNA 332
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 168/351 (47%), Positives = 228/351 (64%), Gaps = 13/351 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQV-VSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEK 59
+ KS I +F II +V + A + + R+ + P I +E W+ +HG+ Y EK
Sbjct: 1 MSTSKSTIFLLFSIIFIVSSSALDLSIIDRAFNRPDDEIASLYETWLVKHGKNYNGLGEK 60
Query: 60 AMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQS-S 118
+R IFK NL ++++ N E N ++KLG N F+DLTNEE+R+ Y G +V+R S
Sbjct: 61 QLRFNIFKDNLRFVDERNSE-NLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRS 119
Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
+ + ++ +P S+DWR+KGAV IK+QG CGSCWAFSA+AAVEG+ QI G LI L
Sbjct: 120 KSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLISL 179
Query: 179 SEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
SEQ+LV+C T N+GC GGLMD AFE+II+N+G+ ++ DYPY G CD ++ A T
Sbjct: 180 SEQELVECDTSYNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRDGRCDTNRKNAKVVT 239
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I YED P DE +L +AV QPVSV +E G+ F+ Y GV +CG DHGVAVVG+
Sbjct: 240 IDDYEDSPVYDEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHGVAVVGY 299
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
GT EDG YW+++NSWG+TWGE GYIR+ R+ G+CGIA E SYP+
Sbjct: 300 GT---EDGLDYWIVRNSWGDTWGEGGYIRMQRNTKLPSGICGIAIEPSYPI 347
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 165/341 (48%), Positives = 227/341 (66%), Gaps = 17/341 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F I+ + C++ + + ++V +HE+WM Q+GR YKD EKA R IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + L N+F+DLTN EFRA+ T +PS R P+TF+Y+NV+
Sbjct: 68 IESFNA-GNHKFWLSVNQFADLTNYEFRATKTNKGF-IPSTVRV---PTTFRYENVSIDT 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+P ++DWR KGAVT IK+QG CG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE+ YPY G C+ +AATI YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSN--SAATIKGYEDVPANN 240
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+ F+FY GV+ CG + DHG+ +G+G ++ DG +Y
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG TWGE+G++R+ +D G+CG+A E SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 165/341 (48%), Positives = 228/341 (66%), Gaps = 17/341 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F I+ + C++ + + ++V +HE+WM Q+GR YKD EKA R IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN EFRA+ T +PS R P+TF+Y+NV+
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGF-IPSTVRV---PTTFRYENVSIDT 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+P ++DWR KGAVT IK+QG CG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE+ YPY G C+ +AATI YE++P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSN--SAATIKGYEEVPANN 240
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+ F+FY GV+ CG + DHG+ +G+G ++ DG +Y
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG TWGE+G++R+ +D G+CG+A E SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 171/353 (48%), Positives = 229/353 (64%), Gaps = 18/353 (5%)
Query: 1 MVLKFEKSFIIPMFVIIIL--VITCASQVVSGRSMHEPSI---VEKHEQWMAQHGRTYKD 55
M L K+ + F + + V+ +V H S+ VE E W++ HG+ Y
Sbjct: 1 MALSVLKTSFLTFFASLFVCSVLAHDFSIVGYSPEHLTSVDKLVELFESWISGHGKAYNS 60
Query: 56 ELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR 115
EK R +FK+NL++I++ NKE +Y LG NEF+DL++EEF++ + G P R
Sbjct: 61 LEEKLHRFEVFKENLKHIDQRNKEVT-SYWLGLNEFADLSHEEFKSKFLGL---YPEFPR 116
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKL 175
+ S F Y++V D+P SIDWR+KGAVT +KNQG CGSCWAFS VAAVEGI QI G L
Sbjct: 117 KKSS-EDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNL 175
Query: 176 IELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAA 234
LSEQQL+DC T NNGC+GGLMD AFE+I+ N GL E DYPY E+GTCD+++E+
Sbjct: 176 TSLSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYLMEEGTCDEKREEME 235
Query: 235 AATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAV 294
TI Y D+P+ DE +LL+A+ QP+SV ++ASG+ F+FY GV + CG + DHGVA
Sbjct: 236 VVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYSGGVFSGPCGTDLDHGVAA 295
Query: 295 VGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
VG+G++ G Y ++KNSWG WGE GY+R+ R+ EGLCGI ASYP
Sbjct: 296 VGYGSSS---GIDYIIVKNSWGPKWGERGYLRMKRNTGKPEGLCGINKMASYP 345
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 336 bits (862), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 163/342 (47%), Positives = 230/342 (67%), Gaps = 19/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR Y+D+ EKA R +FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRP-VPSVSRQSSRPSTFKYQNVT-- 129
IE N GN + LG N+F+DLTN+EFR +T N+ +PS +R P+ F+Y+NV
Sbjct: 68 IESFNA-GNHNFWLGVNQFADLTNDEFR--WTKTNKGFIPSTTRV---PTGFRYENVNID 121
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P ++DWR KGAVT IK+QG CG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 122 ALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVH 181
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
++ GC GGLMD AF++II+N GL TE++YPY C + + A+I YED+P
Sbjct: 182 GEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKGYEDVPAN 239
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
+E AL++AV QPVSV V+ F+FYK GV+ CG + DHG+ +G+G A DG K
Sbjct: 240 NEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKA--SDGTK 297
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
YWL+KNSWG TWGE+G++R+ +D G+CG+A E SYP A
Sbjct: 298 YWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 166/343 (48%), Positives = 225/343 (65%), Gaps = 16/343 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQ 68
+ + I+ + C++ V++ R + + ++ +HEQWMAQ GR YKD EKA RL +FK
Sbjct: 8 LLLVAIVGCLCLCSTAVLAARELGDADNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKA 67
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
N+ +IE N E N + LG N+F+DLTN+EFRAS T N+ + + + P+ FKY +V
Sbjct: 68 NVAFIESFNAE-NHEFWLGANQFADLTNDEFRASKT--NKGIKQGGVRDA-PTGFKYSDV 123
Query: 129 T--DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+ +P S+DWR KGAVT IKNQG CGSCWAFSAVAA EG+ +++ GKL+ LSEQ+LVDC
Sbjct: 124 SIDALPASVDWRTKGAVTPIKNQGQCGSCWAFSAVAATEGVVKLSTGKLVSLSEQELVDC 183
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
+ GC GG MD AF++II+N GL TEA+YPY E C + AATI YED+
Sbjct: 184 DVHGVDQGCMGGWMDDAFKFIIKNGGLTTEANYPYTGEDDKCKSNETVNVAATIKGYEDV 243
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
P DE AL++AV QPVSV V+ F+ Y GV+ CG DHG+A +G+G +
Sbjct: 244 PANDESALMKAVAHQPVSVVVDGGDMTFQLYAGGVMTGSCGVEMDHGIAAIGYGAT--SN 301
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
G KYWL+KNSWG TWGE G++R+ +D G+CG+A + SYP
Sbjct: 302 GTKYWLMKNSWGTTWGEKGFLRMAKDIPDKRGMCGLAMKPSYP 344
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 162/341 (47%), Positives = 228/341 (66%), Gaps = 17/341 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR YKD+ EKA R +FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR++ T +PS +R P+ F+Y+NV
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGF-IPSTTRV---PTGFRYENVNIDA 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+P ++DWR KG VT IK+QG CG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE++YPY C + + A+I YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKGYEDVPANN 240
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+ F+FYK GV+ CG + DHG+ +G+G A DG KY
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKA--SDGTKY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG TWGE+G++R+ +D G+CG+A E SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 168/344 (48%), Positives = 225/344 (65%), Gaps = 10/344 (2%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
SF ++I+ LV+ + V R + E E+HE+WMAQ+GR YKD EK R +FK
Sbjct: 3 SFSQNHYLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFK 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+ +IE N G++ + L N+F+DL +EEF+A + V ++S ++F+Y++
Sbjct: 63 NNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWV--ETSTETSFRYES 120
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC- 186
VT +P +ID R++GAVT IK+QG CGSCWAFSAVAA EGI QIT GKL+ LSEQ+LVDC
Sbjct: 121 VTKIPATIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCV 180
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
++ GC GG +D AFE+I + G+A+E YPY+ TC +KE A I YE +P
Sbjct: 181 KGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPS 240
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDG 305
+E ALL+AV QPVSV ++A AF++Y G+ NA CG + +H VAVVG+G A D
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKA--LDD 298
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+KYWL+KNSWG WGE GYIRI RD EGLCGIA YP+A
Sbjct: 299 SKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPIA 342
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 162/341 (47%), Positives = 227/341 (66%), Gaps = 17/341 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR Y+D+ EKA R +FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR T +PS +R P+ F+Y+NV
Sbjct: 68 IESFNA-GNHNFWLGVNQFADLTNDEFRWMKTNKGF-IPSTTRV---PTGFRYENVNIDA 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+P ++DWR KGAVT IK+QG CG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE++YPY C + + A+I YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKGYEDVPANN 240
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+ F+FYK GV+ CG + DHG+ +G+G A DG KY
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKA--SDGTKY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG TWGE+G++R+ +D G+CG+A E SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 160/310 (51%), Positives = 209/310 (67%), Gaps = 12/310 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+WM HGR Y EK R IF+ N EYIE+ N++ N+TY LG N F+D+T++EF+A
Sbjct: 34 YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKA 93
Query: 102 SYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
Y G P+ + + S F+Y++ T++P DWR KGAV +KNQG CGSCWAFS V
Sbjct: 94 LYFGTKVPLSNTIK-----SGFRYKDATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTV 148
Query: 162 AAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
AAVEG+ QI G+L+ LSEQ+LVDC N GC+GGLMD AFE+II+N GL +EADYPY+
Sbjct: 149 AAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYK 208
Query: 221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL 280
G+CD+ + + TI +ED+P E LL+AV QPVSV +EASG+ F+ Y GV
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVY 268
Query: 281 NAECGDNCDHGVAVVGFGTAEEEDGA--KYWLIKNSWGETWGESGYIRILRD----EGLC 334
CG DHGV VG+GT++ DG YW+++NSWG+ WGESGYIR+ R+ G C
Sbjct: 269 TGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASPRGKC 328
Query: 335 GIATEASYPV 344
GIA ASYPV
Sbjct: 329 GIAMMASYPV 338
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 160/310 (51%), Positives = 209/310 (67%), Gaps = 12/310 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+WM HGR Y EK R IF+ N EYIE+ N++ N+TY LG N F+D+T++EF+A
Sbjct: 34 YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKA 93
Query: 102 SYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
Y G P+ + + S F+Y++ T++P DWR KGAV +KNQG CGSCWAFS V
Sbjct: 94 LYFGTKVPLSNTIK-----SGFRYEDATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTV 148
Query: 162 AAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
AAVEG+ QI G+L+ LSEQ+LVDC N GC+GGLMD AFE+II+N GL +EADYPY+
Sbjct: 149 AAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYK 208
Query: 221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL 280
G+CD+ + + TI +ED+P E LL+AV QPVSV +EASG+ F+ Y GV
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVY 268
Query: 281 NAECGDNCDHGVAVVGFGTAEEEDGA--KYWLIKNSWGETWGESGYIRILRD----EGLC 334
CG DHGV VG+GT++ DG YW+++NSWG+ WGESGYIR+ R+ G C
Sbjct: 269 TGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASSRGKC 328
Query: 335 GIATEASYPV 344
GIA ASYPV
Sbjct: 329 GIAMMASYPV 338
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 334 bits (856), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 173/348 (49%), Positives = 231/348 (66%), Gaps = 15/348 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTI 65
S ++ + V+IIL + R++ E S+V+KHEQWMA+ R Y+DELEK MR +
Sbjct: 3 SIMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKY 125
FK+NL++IE NK+GN++YKLG NEF+D TNEEF A +TG + + VS T
Sbjct: 63 FKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGL-KGLTEVSPSKVVAKTISS 121
Query: 126 Q--NVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
Q NV+D V S DWR +GAVT +K QG CG CWAFSAVAAVEG+ +I GG L+ LSEQQ
Sbjct: 122 QTWNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQ 181
Query: 183 LVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
L+DC + + C GG+M AF Y+++N+G+A+E DY YQ G C + AA I +
Sbjct: 182 LLDCDREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGC--RSNARPAARISGF 239
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
+ +P +E ALL+AV++QPVSV ++A+G F Y GV + CG + +H V VG+GT+
Sbjct: 240 QTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTS- 298
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+DG KYWL KNSWGETW E GYIRI RD +G+CG+A A YPVA
Sbjct: 299 -QDGTKYWLAKNSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 334 bits (856), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 162/342 (47%), Positives = 227/342 (66%), Gaps = 11/342 (3%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIF 66
+SF ++I+ L++T + V R + E E+HE+WMAQ+G+ Y D EK R IF
Sbjct: 2 RSFSQNHYLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIF 61
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
K N+++IE N G++ + L N+F+DL NEEF+AS + V +++ ++F+Y+
Sbjct: 62 KNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGV--ETATETSFRYE 119
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
++T +P ++DWR++GAVT IK+QG+CGSCWAFS VAA+EGI QIT GKL+ LSEQ+LVDC
Sbjct: 120 SITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLSEQELVDC 179
Query: 187 -STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ GC+ G ++AFE++ +N GLA+E YPY+ TC +KE A I YE++P
Sbjct: 180 VKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVP 239
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
E ALL+AV QPVSV ++A A +FY G+ +CG +H V V+G+G A G
Sbjct: 240 SNSEKALLKAVANQPVSVYIDAG--ALQFYSSGIFTGKCGTAPNHAVTVIGYGKA--RGG 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
AKYWL+KNSWG WGE GYI++ RD EGLCGIAT ASYP
Sbjct: 296 AKYWLVKNSWGTKWGEKGYIKMKRDIRAKEGLCGIATNASYP 337
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 160/314 (50%), Positives = 213/314 (67%), Gaps = 8/314 (2%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ +E+W A H + +D + R +FK+N+++I + N++ + TYKL N+F D+
Sbjct: 34 EESLWSLYEKWRAHHAVS-RDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDM 92
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
TN+EFR++Y G R F Y+ D+PTS+DWREKGAVT +K+QG CGS
Sbjct: 93 TNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGS 152
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATE 214
CWAFS V AVEGI QI +L+ LSEQQLVDC T N+GC+GGLMD AF++I N GL++E
Sbjct: 153 CWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTKNSGCNGGLMDYAFDFIKNNGGLSSE 212
Query: 215 ADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
YPY EQ +C + +A TI Y+D+P+ +E AL++AV QPVSV +EASG AF+F
Sbjct: 213 DSYPYLAEQKSCGSEA-NSAVVTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQF 271
Query: 275 YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----D 330
Y +GV + CG DHGVA VG+G ++DG KYW++KNSWGE WGESGYIR+ R
Sbjct: 272 YSQGVFSGHCGTELDHGVAAVGYGV--DDDGKKYWIVKNSWGEGWGESGYIRMERGIKDK 329
Query: 331 EGLCGIATEASYPV 344
G CGIA EASYP+
Sbjct: 330 RGKCGIAMEASYPI 343
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 333 bits (855), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 162/342 (47%), Positives = 226/342 (66%), Gaps = 11/342 (3%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIF 66
+SF ++I+ L++T + V R + E E+HE+WMAQ+G+ Y D EK R IF
Sbjct: 2 RSFSQNHYLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIF 61
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
K N+++IE N G++ + L N+F+DL NEEF+AS + V +++ ++F+Y+
Sbjct: 62 KNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGV--ETATETSFRYE 119
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
++T +P ++DWR++GAVT IK+QG+CGSCWAFS VAA+EGI QIT GKL+ LSEQ+LVDC
Sbjct: 120 SITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDC 179
Query: 187 -STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ GC+ G ++AFE++ +N GLA+E YPY+ TC +KE A I YE++P
Sbjct: 180 VKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVP 239
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
E ALL+AV QPVSV ++A A +FY G+ +CG +H V+G+G A G
Sbjct: 240 SNSEKALLKAVANQPVSVYIDAG--ALQFYSSGIFTGKCGTAPNHAATVIGYGKA--RGG 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
AKYWL+KNSWG WGE GYIR+ RD EGLCGIAT ASYP
Sbjct: 296 AKYWLVKNSWGTKWGEKGYIRMKRDIRAKEGLCGIATNASYP 337
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 333 bits (853), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 164/341 (48%), Positives = 226/341 (66%), Gaps = 23/341 (6%)
Query: 16 IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
++ ++ CAS V++ R + + ++VE+HE WM ++GR YKD EKA R FK N+ +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAF 66
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST-FKYQN--VT 129
+E N + LG N+F+DLT EEF+A+ G+ V P+T FKY+N V+
Sbjct: 67 VESFNTNKKNKFWLGVNQFADLTTEEFKAN-KGFKPTAEKV------PTTGFKYENLSVS 119
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+PT++DWR KGAVT IKNQG CG CWAFSAVAA+EGI +++ G LI LSEQ+LVDC T
Sbjct: 120 ALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTH 179
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
+ GC GG MD AFE++I+N GLATE++YPY+ G C + +AATI +ED+P
Sbjct: 180 SMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSK--SAATIKGHEDVPVN 237
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
+E AL++AV QPVSV V+AS + F Y GV+ CG DHG+A +G+G E DG K
Sbjct: 238 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGM--ESDGTK 295
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YW++KNSWG TWGE G++R+ +D G+CG+A + SYP
Sbjct: 296 YWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 336
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 164/344 (47%), Positives = 229/344 (66%), Gaps = 19/344 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
I+ +F I+ L S V+S R ++EKHEQWM +HG+ YKD EK R IFK+N
Sbjct: 12 ILTLFFILTL---WTSLVISSR------LLEKHEQWMEEHGKFYKDAAEKEQRFQIFKEN 62
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASY-TGYNRPVPSVSRQS-SRPSTFKYQN 127
LE+IE N G+ + L N+F D TN+EF+A+Y G +P+ V + S F+Y+N
Sbjct: 63 LEFIESFNAAGDNGFNLSINQFGDQTNDEFKANYLNGKKKPLIGVGIAAIEEESVFRYEN 122
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
VT+VP ++DWRE+GAVT IK+Q CGSCWAF+ VAA+EGI QIT G+L+ LSEQ+LVDC
Sbjct: 123 VTEVPATMDWRERGAVTPIKHQHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCV 182
Query: 188 TDN--NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
N +GC+GG ++ A ++I++ G+ +E +YPY + G C+ +K A I YE +P
Sbjct: 183 KTNTTDGCNGGYVEDACDFIVKKGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYEHVP 242
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+E ALL+AV QP++V + A+ +AF+FY G+L +CG + DH V +VG+GT+ +DG
Sbjct: 243 ANNEKALLKAVANQPIAVYIAATKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTS--DDG 300
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KYWL+KNSWG WGE GYI+I RD EG CGIA +YP+
Sbjct: 301 VKYWLVKNSWGTKWGEKGYIKIKRDVHAKEGSCGIAMVPTYPIV 344
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 165/341 (48%), Positives = 227/341 (66%), Gaps = 19/341 (5%)
Query: 15 VIIILVITC-ASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
++ IL C S V++ R +++ S+ +HE WMAQ+GR YKD EKA + +FK N +
Sbjct: 8 ILAILGCLCFCSSVLAARELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFKANARF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN--VTD 130
I+ N E N + LG N+F+DLTNEEF+A+ T N+ +S ++ + FKY+N +
Sbjct: 68 IDSFNAE-NHKFWLGINQFADLTNEEFKATKT--NKGF--ISNKARVSTGFKYENLKIEA 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+PTSIDWR KGAVT +K+QG CG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC
Sbjct: 123 LPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHG 182
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
++ GC GGLMD AF++II N GL E+ YPY E G C + +A TI YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKC--KSGSKSAGTIKSYEDVPANN 240
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+ F+FY GV+ CG + DHG+A +G+G DG K+
Sbjct: 241 EGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVT--SDGTKF 298
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG TWGE+G++R+ +D +G+CG+A E SYP A
Sbjct: 299 WLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 158/315 (50%), Positives = 213/315 (67%), Gaps = 11/315 (3%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK-EGNRTYKLGTNEF 91
+ E ++ ++H +WM +HGR Y D EK R +FK+N+E IE+ N + T+KL N+F
Sbjct: 23 LDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQF 82
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKNQ 149
+DLTNEEFR+ YTG+ SV ++P++F+YQNV+ +P S+DWR+KGAVT IK+Q
Sbjct: 83 ADLTNEEFRSMYTGFKGN--SVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQ 140
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENK 209
G CGSCWAFSAVAA+EG+ QI GKLI LSEQ+LVDC T++ GC GGLMD AF Y I
Sbjct: 141 GLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDGGCMGGLMDTAFNYTITIG 200
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
GL +E++YPY+ GTC+ K K A +I +ED+P DE AL++AV PVS+ +
Sbjct: 201 GLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGD 260
Query: 270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
F+FY GV + EC + DHGV VG+G ++G KYW++KNSWG WGE GY+RI +
Sbjct: 261 IGFQFYSSGVFSGECTTHLDHGVTAVGYG--RSKNGLKYWILKNSWGPKWGERGYMRIKK 318
Query: 330 D----EGLCGIATEA 340
D G CG+A A
Sbjct: 319 DIKPKHGQCGLAMNA 333
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 169/345 (48%), Positives = 222/345 (64%), Gaps = 16/345 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSI---VEKHEQWMAQHGRTYKDELEKAMRLT 64
+ I+ + I I +V H S+ +E E WM++H +TY+ EK R
Sbjct: 10 TLILSATLFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRSIEEKLHRFE 69
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFK 124
IF NL++I++ NK+ + +Y LG NEF+DL++EEF++ Y G P ++SSR F
Sbjct: 70 IFLDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPR--KRSSR--GFS 124
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
Y +V D+P S+DWR KGAVT +KNQG CGSCWAFS VAAVEGI QI G L LSEQ+L+
Sbjct: 125 YGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELI 184
Query: 185 DCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
DC NNGC GGLMD AF+YI+ N GL E DYPY E+G C ++KE+ TI YED
Sbjct: 185 DCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVTISGYED 244
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P DE +LL+A++ QPVSV +EAS + F+FYK G+ CG DHGV VG+G++E
Sbjct: 245 VPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGYGSSE-- 302
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
G Y ++KNSWG WGE+GYIR+ R+ EGLCGI ASYP
Sbjct: 303 -GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYPT 346
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 163/340 (47%), Positives = 225/340 (66%), Gaps = 15/340 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ ++ ++ ++ +S RS E + E ++ W+A+HG+ Y E+ R IFK+NL++
Sbjct: 8 LALLSFFFLSISASALSRRSDGE--VREIYDLWLAKHGKAYNGIDEREKRFQIFKENLKF 65
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKY--QNVTD 130
I+ N E NRTYK+G N F+DLTNEE+RA Y G P P+ ++ ++ +Y N+
Sbjct: 66 IDDHNSE-NRTYKVGLNMFADLTNEEYRALYLGTRSP-PARRVMKAKTASRRYAVNNLDR 123
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P S+DWR +GAV +KNQG CGSCWAFS +AAVEGI QI G+LI LSEQ+LV C
Sbjct: 124 LPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKY 183
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
N+GC+GGLMD AF++II+N GL TE DYPY+ G CD ++ A +I YED+P DE
Sbjct: 184 NSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPANDE 243
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
+L +AV QPVSV +EASG A + Y+ GV +CG DHGV VG+G +E+G YW
Sbjct: 244 ESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYG---KENGVDYW 300
Query: 310 LIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
L++NSWG +WGE GY ++ R+ EG CGIA +ASYPV
Sbjct: 301 LVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPV 340
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 164/314 (52%), Positives = 216/314 (68%), Gaps = 22/314 (7%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+ E+HEQWMAQ+GR YKD+ EK R IFK+N+ I+ N + ++Y LG N+F+DL+NE
Sbjct: 1 MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNE 60
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+AS NR + + P F+Y+NV+ VP ++DWR+KGAVT +K+QG C
Sbjct: 61 EFKASR---NRFKGHMCSPQAGP--FRYENVSAVPATMDWRKKGAVTPVKDQGQC----- 110
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEA 215
VAA+EGI Q+T GKLI LSEQ++VDC T ++ GC+GGLMD AF++I +NKGL TEA
Sbjct: 111 ---VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEA 167
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
+YPY GTC+ QKE + AA I ++D+P E AL++AV KQPVSV ++A G F+FY
Sbjct: 168 NYPYTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFY 227
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----E 331
G+ CG DHGV VG+G + DG KYWL+KNSWG WGE GYIR+ +D E
Sbjct: 228 SSGIFTGSCGTELDHGVTAVGYGGS---DGTKYWLVKNSWGAQWGEEGYIRMQKDISAKE 284
Query: 332 GLCGIATEASYPVA 345
GLCGIA +ASYP A
Sbjct: 285 GLCGIAMQASYPTA 298
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 173/349 (49%), Positives = 223/349 (63%), Gaps = 17/349 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M L +K + F +L +TC + S R++ E SI +HE+WMA H R Y D EK
Sbjct: 1 MALTLDKKSVGTFF---MLFLTCICRA-SSRTLSESSIATQHEEWMAMHDRVYADSAEKD 56
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG--YNRPVPSVSRQSS 118
R IFK+NLE+IEK N EG + Y L N F+DLTNEEF AS+TG Y P S + +
Sbjct: 57 RRQQIFKENLEFIEKHNNEGKKRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKIN 116
Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
F +V D+ S+DWR++GAV IKNQG CGSCWAFSAVAAVEGI QI G+L+ L
Sbjct: 117 HSLGFHKMSVGDIEASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSL 176
Query: 179 SEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
SEQ LVDC++ N+GC G ++KAF+Y I + GLA E +YPY + GTC A I
Sbjct: 177 SEQNLVDCAS-NDGCHGQYVEKAFDY-IRDYGLANEEEYPYVETVGTCSGNSN--PAIQI 232
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
Y+ + +E LL AV QPVSV +EA GQ F+FY GV + ECG +H V +VG+G
Sbjct: 233 RGYQSVTPQNEEQLLTAVASQPVSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYG 292
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
EE KYWLI+NSWG++WGE GY++++RD +GLCGI +ASYP
Sbjct: 293 ---EEAEGKYWLIRNSWGKSWGEGGYMKLMRDTGNPQGLCGINMQASYP 338
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 162/342 (47%), Positives = 231/342 (67%), Gaps = 19/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
+ I+ + C S V++ R +++ S+V +HE WM Q+GR YKD EKA + +FK N E
Sbjct: 8 LLAILGCLCLCGS-VLAARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAE 66
Query: 72 YIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT-- 129
+I N GN + LG N+F+D+TNEEF+A+ T N+ +S + P+ F Y+N++
Sbjct: 67 FINSFNA-GNHKFWLGINQFADITNEEFKATKT--NKGF--ISNKVRVPTGFMYENMSFD 121
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P +IDWR KGAVT IK+QG CG CWAFSAVAA+EGI +++ GKL+ LSEQ+LVDC
Sbjct: 122 ALPATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCDVH 181
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
++ GC GGLMD AF++II+N GL E++YPY G C + ++AATI YED+P
Sbjct: 182 GEDQGCEGGLMDDAFKFIIKNGGLTQESNYPYDAADGKC--KSGSSSAATIKSYEDVPAN 239
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
+E AL++AV QPVSV V+ F+FY GV+ CG + DHG+A +G+GT DG K
Sbjct: 240 NEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGTT--SDGTK 297
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+W++KNSWG +WGE+G++R+ +D +G+CG+A E SYP A
Sbjct: 298 FWIMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 168/314 (53%), Positives = 217/314 (69%), Gaps = 14/314 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I+E E+W+A+H + Y EK R +FK NL++I+K N+E +Y LG NEF+DLT+E
Sbjct: 146 IIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVT-SYWLGLNEFADLTHE 204
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSC 155
EF+A+Y G P P+ + SR S FKY++V+ D+P S+DWR KGAVT +KNQG CGSC
Sbjct: 205 EFKATYLGLAPPAPA---RESRGS-FKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGSC 260
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATE 214
WAFS VAAVEGI I G L LSEQ+L+DCS D NNGC+GGLMD AF YI + GL TE
Sbjct: 261 WAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMDYAFSYIASSGGLHTE 320
Query: 215 ADYPYQQEQGTC-DKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
YPY E+G+C D +K ++ A TI YED+P +E AL++A+ QPVSV +EASG+ F+
Sbjct: 321 EAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQPVSVAIEASGRHFQ 380
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR---- 329
FY GV + CG DHGVA VG+G+ ++ G Y +++NSWG WGE GYIR+ R
Sbjct: 381 FYSGGVFDGPCGTQLDHGVAAVGYGS-DKGKGHDYIIVRNSWGAKWGEKGYIRMKRGTGK 439
Query: 330 DEGLCGIATEASYP 343
EGLCGI ASYP
Sbjct: 440 GEGLCGINKMASYP 453
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 330 bits (847), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 168/346 (48%), Positives = 220/346 (63%), Gaps = 16/346 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRLT 64
+ F+++ L + + G H E S+ E +E+W + H E EKA R
Sbjct: 1 MKRFIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS-TF 123
+FK N+++I + NK+ +++YKL N+F D+T+EEFR +Y G N + + + + +F
Sbjct: 60 VFKHNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSF 118
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
Y NV +PTS+DWR+ GAVT +KNQG CGSCWAFS V AVEGI QI KL LSEQ+L
Sbjct: 119 MYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQEL 178
Query: 184 VDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
VDC T+ N GC+GGLMD AFE+I E GL +E YPY+ TCD KE A +I +E
Sbjct: 179 VDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHE 238
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
D+PK E L++AV QPVSV ++A G F+FY GV CG +HGVAVVG+GT
Sbjct: 239 DVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTT-- 296
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
DG KYW++KNSWGE WGE GYIR+ R EGLCGIA EASYP+
Sbjct: 297 IDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 330 bits (847), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 166/346 (47%), Positives = 221/346 (63%), Gaps = 18/346 (5%)
Query: 10 IIPMFVIIILVI--TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+ P+ +++ L T + + E S+ +E+W + H + +D +K R +FK
Sbjct: 4 LFPVLLVLALAFGSTLSIPIKEKDLESEDSLWSLYERWRSHHAVS-RDLDQKQKRFNVFK 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG----YNRPVPSVSRQSSRPSTF 123
+N+++I + NK + T+KL N+F D+TN+EFRA Y G ++R + S + F
Sbjct: 63 ENVKFIHEFNKNKDVTFKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKF 122
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
Y+N P SIDWRE+GAV +KNQG CGSCWAFSA+AAVEGI QI +L+ LSEQ+L
Sbjct: 123 MYENAV-APPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKELVPLSEQEL 181
Query: 184 VDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
+DC TD N GCSGGLMD AFE+I N G+ TE YPYQ E TC K+ + A I YE
Sbjct: 182 IDCDTDQNQGCSGGLMDYAFEFIKNNGGITTEDVYPYQAEDATC---KKNSPAVVIDGYE 238
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
D+P DE AL++AV QPV+V +EASG F+FY GV CG DHGVAVVG+GT
Sbjct: 239 DVPTNDEDALMKAVANQPVAVAIEASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTT-- 296
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
+DG KYW ++NSWG WGESGY+R+ R GLCGIA +ASYP+
Sbjct: 297 QDGTKYWTVRNSWGADWGESGYVRMQRGIKATHGLCGIAMQASYPI 342
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 165/319 (51%), Positives = 213/319 (66%), Gaps = 12/319 (3%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEF 91
S + ++ +E W+ +HG++Y EK R IFK NL +I++ N E +RTYK+G N F
Sbjct: 36 SRTDDEVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAE-SRTYKVGLNRF 94
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQ 149
+DLTN+E+R+ Y G S R S++ + +Y V +P S+DWREKGAV +K+Q
Sbjct: 95 ADLTNDEYRSMYLGAR--TGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVGVKDQ 152
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIEN 208
G CGSCWAFS +AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II+N
Sbjct: 153 GSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKN 212
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
G+ TE DYPY G CD+ ++ A TI YED+P +E AL +AV QPVSV +EAS
Sbjct: 213 GGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEAS 272
Query: 269 GQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
G AF+FY+ GV CG DHGV VG+GT E+ YW++KNSWG +WGESGYIR+
Sbjct: 273 GMAFQFYESGVFTGNCGTALDHGVTAVGYGT---ENSVDYWIVKNSWGSSWGESGYIRME 329
Query: 329 RDEGL---CGIATEASYPV 344
R+ G CGIA E SYP+
Sbjct: 330 RNTGATGKCGIAVEPSYPI 348
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 330 bits (846), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 166/341 (48%), Positives = 225/341 (65%), Gaps = 19/341 (5%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCA-SQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDEL 57
M + +F++ + ++ CA S ++ R + + ++V +HE+WMA++ R Y D
Sbjct: 1 MATHYSSAFVL----LSVVAWACALSGSLAARDLADQDQAMVARHEEWMAKYDRVYSDAA 56
Query: 58 EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRP--VPSVSR 115
EKA R +FK N+ IE N GN + L N F+DLT++EFRA++TGY RP + S+
Sbjct: 57 EKARRFEVFKANMALIESVNA-GNHKFWLEANRFADLTDDEFRATWTGY-RPKTAAASSK 114
Query: 116 QSSRPST--FKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQIT 171
SR +T FKY NV+ DVP S+DWR KGAVT IKNQG CG CWAFSAVA++EG+ +++
Sbjct: 115 GRSRTATTGFKYANVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKLS 174
Query: 172 GGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQ 229
GKL+ LSEQ+LVDC + + GC GG MD AF++I+ N GL TE+ YPY GTC+
Sbjct: 175 TGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGNGGLTTESRYPYTASDGTCNSN 234
Query: 230 KEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCD 289
+ AA+I YED+P DE +L +AV QPVSV V+ FRFYK GVL+ CG D
Sbjct: 235 EASGDAASIKGYEDVPANDEASLRKAVANQPVSVAVDGGDSHFRFYKGGVLSGACGTELD 294
Query: 290 HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
HG+A VG+G A DG KYW++KNSWG +WGE+GYIR+ RD
Sbjct: 295 HGIAAVGYGVA--SDGTKYWVMKNSWGTSWGEAGYIRMERD 333
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 330 bits (845), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 164/343 (47%), Positives = 231/343 (67%), Gaps = 15/343 (4%)
Query: 13 MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQ 68
+FV + L I AS+ S R +HE S+ E+HEQWMA++ R YKD+ E+ R +FK
Sbjct: 3 LFVCMTLHIYYLEHRASEATS-RPLHEASMYERHEQWMARYSRNYKDDAEEERRFXMFKD 61
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
N+++I+ + GN KLG N +D+T+EEFRAS + P P++ +S ++F++QNV
Sbjct: 62 NVDFIQTFDTAGNMPNKLGVNALADMTHEEFRASGNTFKIP-PNLGLRSE-TTSFRHQNV 119
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
T +P+++DWR+K VTHIKNQ CG CWAFSAVAA+EGI ++ K I LSEQ+LVDC
Sbjct: 120 TRIPSTMDWRKKRTVTHIKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDI 179
Query: 189 --DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC GG MD AF++II+N+GL +EA Y Y+ +G C+K+KE + AA I YE++P+
Sbjct: 180 FGSNIGCEGGCMDDAFKFIIQNRGLNSEARYLYKGVEGHCNKKKESSRAARINDYENMPE 239
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
E ALL+ V QP+SV ++A G AF+FY+ G++ E G++ D+GV G+G + DG
Sbjct: 240 FSEKALLKVVAHQPISVAIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRS--ADGK 297
Query: 307 KYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
K+WL+KNSWG WGE+GY R+ R GLCG +ASYP A
Sbjct: 298 KHWLVKNSWGTDWGENGYTRMERGVKATTGLCGFTMQASYPTA 340
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 330 bits (845), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 162/312 (51%), Positives = 212/312 (67%), Gaps = 13/312 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+++ E W+++ GR Y+ EK R IFK NL +I+ NK+ R Y LG NEF+DL++E
Sbjct: 43 LIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKK-VRNYWLGLNEFADLSHE 101
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G P +S+++ P F Y++V +P S+DWR+KGAVT +KNQG CGSCWA
Sbjct: 102 EFKNKYLGLK---PDLSKRAQCPEEFTYKDVA-IPKSVDWRKKGAVTPVKNQGSCGSCWA 157
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T NNGC+GGLMD AF YI+ N GL E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGLHKEED 217
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTCD +KE++ A TI Y D+P+ E +LL+A+ QP+S+ +EASG+ F+FY
Sbjct: 218 YPYIMEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYS 277
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV + CG DHGVA VG+GT++ G Y ++KNSWG WGE GYIR+ R EG
Sbjct: 278 GGVFDGHCGTELDHGVAAVGYGTSK---GLDYIIVKNSWGPKWGEKGYIRMKRKTSKPEG 334
Query: 333 LCGIATEASYPV 344
+CGI ASYP
Sbjct: 335 ICGIYKMASYPT 346
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 330 bits (845), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 156/312 (50%), Positives = 211/312 (67%), Gaps = 12/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+ + E WM++HG++Y+ EK R +F+ NL++I++ NK+ + +Y LG NEF+DL++E
Sbjct: 44 LTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFADLSHE 102
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G +P ++ P F Y++V D+P S+DWR+KGAV H+KNQG CGSCWA
Sbjct: 103 EFKRKYLGLKIELP---KRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWA 159
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC NNGC+GGLMD AF +II N GL E D
Sbjct: 160 FSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEED 219
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTC ++KE+ TI Y D+P+ +E + L+A+ QP+SV +EAS + F+FY
Sbjct: 220 YPYVMEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYS 279
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
G+ N CG DHGVA VG+GT++ G Y +KNSWG WGE GYIR+ R+ EG
Sbjct: 280 GGIFNGHCGTELDHGVAAVGYGTSK---GVDYITVKNSWGSKWGEKGYIRMKRNVGKPEG 336
Query: 333 LCGIATEASYPV 344
+CGI ASYP
Sbjct: 337 ICGIYKMASYPT 348
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 329 bits (844), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 171/349 (48%), Positives = 224/349 (64%), Gaps = 17/349 (4%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSM-HEPSI---VEKHEQWMAQHGRTYKDELEKA 60
F K+ +I + I T + G S H S+ +E E WM++H + Y+ EK
Sbjct: 6 FSKATLILSATLFITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYRSIEEKL 65
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IF NL++I++ NK+ + +Y LG NEF+DL++EEF++ Y G P ++SSR
Sbjct: 66 HRFEIFLDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPR--KRSSR- 121
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
F Y +V D+P S+DWR KGAVT +KNQG CGSCWAFS VAAVEGI QI G L LSE
Sbjct: 122 -GFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSE 180
Query: 181 QQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
Q+L+DC NNGC GGLMD AF+YI+ N GL E DYPY E+G C ++KE+ TI
Sbjct: 181 QELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVTIS 240
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
YED+P DE +LL+A++ QPVSV +EAS + F+FYK G+ CG DHGV VG+G+
Sbjct: 241 GYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGYGS 300
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+E G Y ++KNSWG WGE+GYIR+ R+ EGLCGI ASYP
Sbjct: 301 SE---GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYPT 346
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 162/345 (46%), Positives = 229/345 (66%), Gaps = 23/345 (6%)
Query: 14 FVIIILVIT-CA----SQVVSGRSMHE-PSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
F++++ ++T CA S V++ R + + ++ E+HE+WMA +GR YKD EKA R +FK
Sbjct: 7 FLLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARRFEVFK 66
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL ++E N + + LG N+F+DLT EEF+A N+ +S + + FKY+N
Sbjct: 67 DNLAFVESFNADKKNKFWLGVNQFADLTTEEFKA-----NKGFKPISAEEVPTTGFKYEN 121
Query: 128 --VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
V+ +PT++DWR KGAVT IKNQG CG CWAFSAVAA+EGI +++ L+ LSEQ+LVD
Sbjct: 122 LSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQELVD 181
Query: 186 CSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
C T + GC GG MD AFE++I+N GLATE+ YPY+ G C + +AATI +ED
Sbjct: 182 CDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSK--SAATIKGHED 239
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P +E AL++AV QPVSV V+AS + F Y GV+ CG DHG+A +G+G E
Sbjct: 240 VPPNNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGYGV--ES 297
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
DG KYW++KNSWG TWGE ++R+ +D +G+CG+A + SYP
Sbjct: 298 DGTKYWILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYPT 342
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 163/317 (51%), Positives = 211/317 (66%), Gaps = 13/317 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E + ++E W+A+HGR Y EK R IFK NL +IE N GNRTYK+G N+F+DL
Sbjct: 43 EDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADL 102
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKNQGHC 152
TNEE+R Y G +S PS +Y + + +P S+DWR++GAV IKNQG C
Sbjct: 103 TNEEYRTMYLGTKSDARRRFVKSKNPSQ-RYASRPNELMPHSVDWRKRGAVAPIKNQGSC 161
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGL 211
GSCWAFS VAAVEGI QI G++I LSEQ+LVDC N+GC+GGLMD AFE+II N G+
Sbjct: 162 GSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGM 221
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
TE YPY+ +G CD ++ +I YED+P+ +E AL +AV QPV V +EASG+A
Sbjct: 222 DTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEASGRA 280
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE 331
F+ Y GV ECG+ DHGV VVG+G+ EDG YW+++NSWG WGE+GY+++ R+
Sbjct: 281 FQLYSSGVFTGECGEEVDHGVVVVGYGS---EDGVDYWIVRNSWGTKWGENGYVKMERNV 337
Query: 332 -----GLCGIATEASYP 343
G CGI TEASYP
Sbjct: 338 KKSHLGKCGIMTEASYP 354
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 162/317 (51%), Positives = 207/317 (65%), Gaps = 10/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E +++ +E W+ +HG+ Y EK R IFK NL ++++ N RTYKLG +F+DL
Sbjct: 45 EAHMMKMYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADL 104
Query: 95 TNEEFRASYTGYNRPVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
TNEE+RA Y G R + S+ K N D+P+ +DWREKGAVT +K+QG CG
Sbjct: 105 TNEEYRAMYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCG 164
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLA 212
SCWAFS V +VEGI QI G LI LSEQ+LVDC N GC+GGLMD AFE+II+N G+
Sbjct: 165 SCWAFSTVGSVEGINQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGID 224
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
+EADYPY+ CD ++ A TI YED+P+ DE +L +AV QPVSV +EA G+ F
Sbjct: 225 SEADYPYRASDNMCDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREF 284
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR--- 329
+ Y+ GV CG N DHGV VG+GT E+G YW+++NSWG WGESGYIR+ R
Sbjct: 285 QLYQSGVFTGRCGTNLDHGVVAVGYGT---ENGIDYWIVRNSWGPKWGESGYIRMERNVA 341
Query: 330 --DEGLCGIATEASYPV 344
D G CGIA EASYP
Sbjct: 342 STDTGKCGIAMEASYPT 358
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 159/323 (49%), Positives = 211/323 (65%), Gaps = 13/323 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ II + C+S V+S R + + ++VEKHEQWMA+ R YKD EKA R FK N+ +
Sbjct: 8 LLAIIGSICLCSSTVLSARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSR-PSTFKYQNVTD- 130
IE N GN + LG N+F+DLTN+EFRA+ T + R +R P+ FKY NV+
Sbjct: 68 IESFN-TGNHKFWLGVNQFTDLTNDEFRATKTN-----KGLKRNGARAPTRFKYNNVSTD 121
Query: 131 -VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P ++DWR KG VT IK+QG CG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC
Sbjct: 122 ALPAAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVH 181
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
+ GC GG MD AF++II+N GL TEA+YPY + G C + ATI YED+P
Sbjct: 182 GVDQGCEGGEMDNAFKFIIKNGGLTTEANYPYTAQDGQCKTSTTSNSVATIKGYEDVPAN 241
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
DE +L++AV QPVSV V+ F+ Y GV+ CG + DHG+ +G+G DG K
Sbjct: 242 DESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIVAIGYGMT--SDGTK 299
Query: 308 YWLIKNSWGETWGESGYIRILRD 330
+WL+KNSWG TWGESGY+R+ +D
Sbjct: 300 FWLLKNSWGTTWGESGYLRMEKD 322
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 161/324 (49%), Positives = 212/324 (65%), Gaps = 8/324 (2%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLG 87
+ R + E E+HE WMAQ+G+ YKD EK R IFK N+ +IE N G++ + L
Sbjct: 24 IMSRRLFEACTSERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLS 83
Query: 88 TNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHI 146
N+F+DL +EEF+A T N+ V SV ++ T FKY VT + ++DWR++GAVT I
Sbjct: 84 INQFADLHDEEFKALLTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGAVTPI 143
Query: 147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC-STDNNGCSGGLMDKAFEYI 205
K+Q CGSCWAFSAVAA+EGI QIT KL+ LSEQ+LVDC ++ GC+GG M+ AFE++
Sbjct: 144 KDQRRCGSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFEFV 203
Query: 206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
+ G+A+E+ YPY+ + +C +KE + I YE +P E AL +AV QPVSV V
Sbjct: 204 AKKGGIASESYYPYKGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSVYV 263
Query: 266 EASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
EA G AF+FY G+ +CG N DH + VVG+G + G KYWL+KNSWG WGE GYI
Sbjct: 264 EAGGNAFQFYSSGIFTGKCGTNTDHAITVVGYG--KSRGGTKYWLVKNSWGAGWGEKGYI 321
Query: 326 RILRD----EGLCGIATEASYPVA 345
R+ RD EGLCGIA A YP A
Sbjct: 322 RMKRDIRAKEGLCGIAMNAFYPTA 345
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 163/341 (47%), Positives = 226/341 (66%), Gaps = 19/341 (5%)
Query: 15 VIIILVITC-ASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
++ IL C S V++ R +++ S+V +HE WM Q+GR YKD EKA + +FK N +
Sbjct: 8 LLAILGCLCFCSSVLAARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFKANAGF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
I+ N GN + LG N+F+D+TN+EF+A+ T N+ +S + P+ F Y+NV+
Sbjct: 68 IDSFNA-GNHKFWLGINQFADITNKEFKATKT--NKGF--ISNKVRAPTGFSYENVSFDA 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+P SIDWR KGAVT +K+QG CG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC
Sbjct: 123 LPASIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHG 182
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
++ GC GGLMD AF++II N GL E+ YPY E G C + +A TI YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIISNGGLTQESSYPYDAEDGKC--KSGSKSAGTIKSYEDVPANN 240
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+ F+FY GV+ CG + DHG+A +G+G DG KY
Sbjct: 241 EGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVT--SDGTKY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG +WGE+G++R+ +D +G+CG+A E SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 160/314 (50%), Positives = 208/314 (66%), Gaps = 17/314 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E W+ +HG+ Y EK R IFK NL +IE+ N G+++YKLG N+F+DLTNEE+RA
Sbjct: 48 YEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEYRA 107
Query: 102 SYTGYNRPVPS-----VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
+ G P V++++ R + Y+ ++P +DWREKGAVT IK+QG CGSCW
Sbjct: 108 MFLGTRTRGPKNKAAVVAKKTDR---YAYRAGEELPAMVDWREKGAVTPIKDQGQCGSCW 164
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEA 215
AFS V AVEGI QI G L LSEQ+LVDC N GC+GGLMD AFE+I++N G+ TE
Sbjct: 165 AFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQNGGIDTEE 224
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
DYPY + TCD ++ A TI YED+P DE +L++AV QPVSV +EA G F+ Y
Sbjct: 225 DYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAGGMEFQLY 284
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----- 330
+ GV CG N DHGV VG+GT E+G YWL++NSWG WGE+GYI++ R+
Sbjct: 285 QSGVFTGRCGTNLDHGVVAVGYGT---ENGTDYWLVRNSWGSAWGENGYIKLERNVQNTE 341
Query: 331 EGLCGIATEASYPV 344
G CGIA EASYP+
Sbjct: 342 TGKCGIAIEASYPI 355
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 172/347 (49%), Positives = 224/347 (64%), Gaps = 15/347 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLTI 65
K FI+ +++++ T S + + E S+ E +E+W + H E EKA R +
Sbjct: 2 KRFIVLALCMLMVLETTKSLDFHEKDVESEDSLWELYERWKSHHTIARSLE-EKAKRFNV 60
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN---RPVPSVSRQSSRPST 122
FK N+++I + NK+ N +YKL N+F D+T+EEFR +Y G N + RQ+++ +
Sbjct: 61 FKHNVKHIHETNKKEN-SYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGERQTTK--S 117
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
F Y NV +PTS+DWR+ GAVT +KNQG CGSCWAFS V AVEGI QI KL LSEQ+
Sbjct: 118 FMYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177
Query: 183 LVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
LVDC T+ N GC+GGLMD AFE+I E GL +E YPY+ TCD KE A +I +
Sbjct: 178 LVDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGH 237
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
ED+PK E L++AV QPVSV ++A G F+FY GV CG +HGVAVVG+GT
Sbjct: 238 EDVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTT- 296
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
DG KYW++KNSWGE WGE GYIR+ R EGLCGIA EASYP+
Sbjct: 297 -IDGTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 161/311 (51%), Positives = 213/311 (68%), Gaps = 13/311 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R IFK NL++I++ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G SR+ P F Y++V ++P S+DWR+KGAV +KNQG CGSCWA
Sbjct: 102 EFKNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T NNGC+GGLMD AF +I+EN GL E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTC+ KE+ TI Y D+P+ +E +LL+A+ QP+SV +EASG+ F+FY
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 277
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV + CG + DHGVA VG+GTA+ G Y ++KNSWG WGE GYIR+ R+ EG
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEG 334
Query: 333 LCGIATEASYP 343
+CGI ASYP
Sbjct: 335 ICGIYKMASYP 345
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 167/359 (46%), Positives = 227/359 (63%), Gaps = 23/359 (6%)
Query: 2 VLKFEKSFIIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTY 53
+ + S + +F+++ L ++ H + ++ +E W+A+HG++Y
Sbjct: 3 LCRSSSSMAVFLFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSY 62
Query: 54 KDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSV 113
EK R IFK NL +I++ N E NRTYK+G N F+DLTNEE+R+ Y G +
Sbjct: 63 NALGEKERRFQIFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTR---TAA 118
Query: 114 SRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQIT 171
R+SS + +Y V D +P S+DWR+KGAV +K+QG CGSCWAFS +AAVEGI +I
Sbjct: 119 KRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIV 178
Query: 172 GGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQK 230
G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ +E DYPY+ G CD+ +
Sbjct: 179 TGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYR 238
Query: 231 EKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDH 290
+ A TI YED+P+ DE +L +AV QPVSV +EA G+ F+ Y+ G+ CG DH
Sbjct: 239 KNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDH 298
Query: 291 GVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
GV VG+GT E+G YW++KNSWG +WGE GYIR+ RD G CGIA EASYP+
Sbjct: 299 GVTAVGYGT---ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPI 354
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 161/328 (49%), Positives = 212/328 (64%), Gaps = 20/328 (6%)
Query: 35 EPSIVEKHEQWMAQH--------GRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKL 86
E S+ +E+W +++ G D+ E R +F +N YI +AN+ G R ++L
Sbjct: 35 EESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFRL 94
Query: 87 GTNEFSDLTNEEFRASYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
N+F+D+T +EFR +Y G ++R + + + ++P ++DWRE+GA
Sbjct: 95 ALNKFADMTTDEFRRTYAGSRARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWRERGA 154
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKA 201
VT IK+QG CGSCWAFSAVAAVEG+ +I G+L+ LSEQ+LVDC T DN GC GGLMD A
Sbjct: 155 VTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDYA 214
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F++I N G+ TE++YPY+ EQG C+K K + TI YED+P DE AL +AV QPV
Sbjct: 215 FQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQPV 274
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
+V VEASGQ F+FY GV ECG + DHGVA VG+G DG KYW++KNSWGE WGE
Sbjct: 275 AVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGIT--RDGTKYWIVKNSWGEDWGE 332
Query: 322 SGYIRILR-----DEGLCGIATEASYPV 344
GYIR+ R GLCGIA EASYPV
Sbjct: 333 RGYIRMQRGVSSDSNGLCGIAMEASYPV 360
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 165/325 (50%), Positives = 212/325 (65%), Gaps = 16/325 (4%)
Query: 29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGT 88
S RS +E ++ + W+A+H +TY E+ R IFK NL +I++ N NRTYK+G
Sbjct: 37 SWRSDNE--VISMYNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGL 94
Query: 89 NEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS---TFKYQNVTDVPTSIDWREKGAVTH 145
F+DLTNEE+RA + G +S PS FK +V +P SIDWR+ GAV+
Sbjct: 95 TRFADLTNEEYRAKFLGTKSDPKRRLMKSKNPSQRYAFKAGDV--LPESIDWRQSGAVSA 152
Query: 146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEY 204
IK+QG CGSCWAFS +AAVEG+ +I G+LI LSEQ+LVDC N GC+GGLMD AF++
Sbjct: 153 IKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLSEQELVDCDRSYNAGCNGGLMDNAFQF 212
Query: 205 IIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVC 264
II N G+ T+ DYPYQ G CD K K A TI +ED+ DE AL +AV QPVSV
Sbjct: 213 IINNGGIDTDKDYPYQAVDGKCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVSVA 272
Query: 265 VEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
+EASG A +FY+ GV ECG DHGV +VG+GT EDG YWL++NSWG WGE+GY
Sbjct: 273 IEASGMALQFYQSGVFTGECGSALDHGVVIVGYGT---EDGIDYWLVRNSWGRDWGENGY 329
Query: 325 IRILRD-----EGLCGIATEASYPV 344
I++ R+ G CGIA E+SYP+
Sbjct: 330 IKMQRNVVDTFTGKCGIAMESSYPI 354
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 327 bits (839), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 165/320 (51%), Positives = 216/320 (67%), Gaps = 16/320 (5%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLT 95
++ ++HE+WMA+HGR Y D+ EKA RL +F+ N+ +IE N ++ + L N+F+DLT
Sbjct: 35 AMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLT 94
Query: 96 NEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCG 153
N EFRA+ TG PS SR + P++F+Y NV+ D+P S+DWR KGAV +K+QG CG
Sbjct: 95 NAEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCG 151
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGL 211
CWAFSAVAA+EG ++ GKL+ LSEQQLV C ++ GC GGLMD AF++II+N GL
Sbjct: 152 CCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGL 211
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
A E+DYPY C AAAATI YED+P DE ALL+AV QPVSV ++ +
Sbjct: 212 AAESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRH 271
Query: 272 FRFYKRGVLN--AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
F+FYK GVL+ A C DH + VG+G A DG KYWL+KNSWG +WGE GY+R+ R
Sbjct: 272 FQFYKGGVLSGAAGCATELDHAITAVGYGVAS--DGTKYWLMKNSWGTSWGEDGYVRMER 329
Query: 330 ----DEGLCGIATEASYPVA 345
EG+CG+A ASYP A
Sbjct: 330 GVADKEGVCGLAMMASYPTA 349
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 160/311 (51%), Positives = 212/311 (68%), Gaps = 13/311 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK R IFK NL++I++ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G SR+ P F Y++ ++P S+DWR+KGAVT +KNQG CGSCWA
Sbjct: 102 EFKNKYLGLK---VDYSRRRESPEEFTYKDF-ELPKSVDWRKKGAVTQVKNQGSCGSCWA 157
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T NNGC+GGLMD AF +I+EN GL E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTC+ KE+ TI Y D+P+ +E +LL+A+ QP+SV +EASG+ F+FY
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYS 277
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV + CG + DHGVA VG+GT++ G Y ++KNSWG WGE GYIR+ R+ EG
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTSK---GVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEG 334
Query: 333 LCGIATEASYP 343
+CGI ASYP
Sbjct: 335 ICGIYKMASYP 345
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 168/354 (47%), Positives = 227/354 (64%), Gaps = 27/354 (7%)
Query: 11 IPMFVIIILVITCAS----QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELE 58
+ +F+ ++L + AS ++ H + ++ +E W+A+HG++Y E
Sbjct: 10 MAVFLFLLLGLASASAXDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSYNALGE 69
Query: 59 KAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSS 118
K R IFK NL +I++ N E NRTYK+G N F+DLTNEE+R+ Y G + R+SS
Sbjct: 70 KERRFQIFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTR---TAAKRRSS 125
Query: 119 RPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
+ +Y V D +P S+DWR+KGAV +K+QG CGSCWAFS +AAVEGI +I G LI
Sbjct: 126 NKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIVTGGLI 185
Query: 177 ELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAA 235
LSEQ+LVDC T N GC+GGLMD AFE+II N G+ +E DYPY+ G CD+ ++ A
Sbjct: 186 SLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYRKNAXV 245
Query: 236 ATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVV 295
TI YED+P+ DE +L +AV QPVSV +EA G+ F+ Y+ G+ CG DHGV V
Sbjct: 246 VTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDHGVTAV 305
Query: 296 GFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
G+GT E+G YW++KNSWG +WGE GYIR+ RD G CGIA EASYP+
Sbjct: 306 GYGT---ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPI 356
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 161/311 (51%), Positives = 212/311 (68%), Gaps = 13/311 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E W+++HG+ Y+ EK R IFK NL++I++ NK + Y LG NEF+DL+++
Sbjct: 44 LIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 102
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G SR+ P F Y++V ++P S+DWR+KGAVT +KNQG CGSCWA
Sbjct: 103 EFKNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVTQVKNQGSCGSCWA 158
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T NNGC+GGLMD AF +I+EN GL E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEED 218
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTC+ KE+ TI Y D+P+ +E +LL+A+ QP+SV +EASG+ F+FY
Sbjct: 219 YPYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV + CG + DHGVA VG+GTA+ G Y +KNSWG WGE GYIR+ R+ EG
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335
Query: 333 LCGIATEASYP 343
+CGI ASYP
Sbjct: 336 ICGIYKMASYP 346
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 165/323 (51%), Positives = 213/323 (65%), Gaps = 25/323 (7%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E + ++E W+A+HGR Y EK R IFK NL +IE+ N GNRTYK+G N+F+DL
Sbjct: 43 EDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFADL 102
Query: 95 TNEEFRASYTG-----YNRPVPSVS---RQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
TNEE+R Y G R V S + R +SRP+ +P S+DWR++GAV I
Sbjct: 103 TNEEYRTMYLGTKSDARRRFVKSKNPSQRYASRPNEL-------MPHSVDWRKRGAVAPI 155
Query: 147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYI 205
KNQG CGSCWAFS VAAV GI QI G++I LSEQ+LVDC N+GC+GGLMD AFE+I
Sbjct: 156 KNQGSCGSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFI 215
Query: 206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
I N G+ TE YPY+ +G CD ++ +I YED+P+ +E AL +AV QPV V +
Sbjct: 216 ISNGGMDTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAI 274
Query: 266 EASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
EASG+AF+ Y GV ECG+ DHGV VVG+G+ EDG YW+++NSWG WGE+GY+
Sbjct: 275 EASGRAFQLYSSGVFTGECGEEVDHGVVVVGYGS---EDGVDYWIVRNSWGTKWGENGYV 331
Query: 326 RILRDE-----GLCGIATEASYP 343
++ R+ G CGI TEASYP
Sbjct: 332 KMERNVKKSHLGKCGIMTEASYP 354
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica
Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa
Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 160/328 (48%), Positives = 211/328 (64%), Gaps = 20/328 (6%)
Query: 35 EPSIVEKHEQWMAQH--------GRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKL 86
E S+ +E+W +++ G D+ E R +F +N YI +AN+ G R ++L
Sbjct: 35 EESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFRL 94
Query: 87 GTNEFSDLTNEEFRASYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
N+F+D+T +EFR +Y G ++R + + + ++P ++DWRE+GA
Sbjct: 95 ALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWRERGA 154
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKA 201
VT IK+QG CGSCWAFS VAAVEG+ +I G+L+ LSEQ+LVDC T DN GC GGLMD A
Sbjct: 155 VTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDYA 214
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F++I N G+ TE++YPY+ EQG C+K K + TI YED+P DE AL +AV QPV
Sbjct: 215 FQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQPV 274
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
+V VEASGQ F+FY GV ECG + DHGVA VG+G DG KYW++KNSWGE WGE
Sbjct: 275 AVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGIT--RDGTKYWIVKNSWGEDWGE 332
Query: 322 SGYIRILR-----DEGLCGIATEASYPV 344
GYIR+ R GLCGIA EASYPV
Sbjct: 333 RGYIRMQRGVSSDSNGLCGIAMEASYPV 360
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 327 bits (837), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 166/349 (47%), Positives = 226/349 (64%), Gaps = 12/349 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAM 61
++ +K + + + ++L IT + E S+ + +E+W + H T L EK
Sbjct: 1 MEMKKFLFVALSLALVLGITESLDFHEKDLESEESLWDLYERWRSHH--TVSTSLDEKHK 58
Query: 62 RLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS 121
R +FK+N+ ++ K NK G + YKL N+F+D+TN EFR+ Y G + R ++R +
Sbjct: 59 RFNVFKENVMHVHKTNKMG-KPYKLKLNKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGN 117
Query: 122 -TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+F Y V VPTS+DWR+KGAVT +K+QG CGSCWAFS + AVEGI I +L+ LSE
Sbjct: 118 GSFMYGKVEKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVSLSE 177
Query: 181 QQLVDC-STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
Q+LVDC +T+N GC+GGLM+ AFE+I + +G+ TE+ YPY+ E G CD KE A +I
Sbjct: 178 QELVDCDTTENQGCNGGLMEYAFEFIKKKRGITTESTYPYKAEDGHCDAAKENNPAVSID 237
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
YE +P+ DE ALL+A QPVSV ++A G F+FY GV ECG DHGVAVVG+GT
Sbjct: 238 GYEKVPENDEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECGTELDHGVAVVGYGT 297
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
DG KYW+++NSWG WGE GYIR+ R EGLCGIA EASYP+
Sbjct: 298 T--LDGTKYWIVRNSWGPEWGEKGYIRMQRGISDKEGLCGIAMEASYPI 344
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 326 bits (836), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 162/309 (52%), Positives = 207/309 (66%), Gaps = 10/309 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+W + H + EK R +FK N ++ ANK ++ YKL N+F+D+TN EFR
Sbjct: 38 YERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFRN 95
Query: 102 SYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
+Y+G + R R + TF Y+ V VP S+DWR+KGAVT +K+QG CGSCWAFS
Sbjct: 96 TYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFST 155
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPY 219
+ AVEGI QI KL+ LSEQ+LVDC TD N GC+GGLMD AFE+I + G+ TEA+YPY
Sbjct: 156 IVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
+ GTCD KE A A +I +E++P+ DE+ALL+AV QPVSV ++A G F+FY GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCG 335
CG DHGVA+VG+GT DG KYW +KNSWG WGE GYIR+ R EGLCG
Sbjct: 276 FTGSCGTELDHGVAIVGYGTT--IDGTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCG 333
Query: 336 IATEASYPV 344
IA EASYP+
Sbjct: 334 IAMEASYPI 342
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 161/311 (51%), Positives = 211/311 (67%), Gaps = 13/311 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y++ EK +R IFK NL++I++ NK + Y LG NEF+DL++
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHR 102
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF Y G SR+ P F Y++V ++P S+DWR+KGAV +KNQG CGSCWA
Sbjct: 103 EFNNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T NNGC+GGLMD AF +I+EN GL E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTC+ KE+ TI Y D+P+ +E +LL+A+ QP+SV +EASG+ F+FY
Sbjct: 219 YPYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV + CG + DHGVA VG+GTA+ G Y +KNSWG WGE GYIR+ R+ EG
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335
Query: 333 LCGIATEASYP 343
+CGI ASYP
Sbjct: 336 ICGIYKMASYP 346
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 165/319 (51%), Positives = 215/319 (67%), Gaps = 16/319 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTN 96
+ ++HE+WMA+HGR Y D+ EKA RL +F+ N+ +IE N ++ + L N+F+DLTN
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 97 EEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGS 154
EFRA+ TG PS SR + P++F+Y NV+ D+P S+DWR KGAV +K+QG CG
Sbjct: 61 AEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGC 117
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAA+EG ++ GKL+ LSEQQLV C ++ GC GGLMD AF++II+N GLA
Sbjct: 118 CWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLA 177
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
E+DYPY C AAAATI YED+P DE ALL+AV QPVSV ++ + F
Sbjct: 178 AESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHF 237
Query: 273 RFYKRGVLN--AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR- 329
+FYK GVL+ A C DH + VG+G A DG KYWL+KNSWG +WGE GY+R+ R
Sbjct: 238 QFYKGGVLSGAAGCATELDHAITAVGYGVAS--DGTKYWLMKNSWGTSWGEDGYVRMERG 295
Query: 330 ---DEGLCGIATEASYPVA 345
EG+CG+A ASYP A
Sbjct: 296 VADKEGVCGLAMMASYPTA 314
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 163/312 (52%), Positives = 211/312 (67%), Gaps = 12/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I++ E W+++HG+ Y+ EK +R IFK NL +I++ NK+ Y LG NEFSDL++E
Sbjct: 29 IIDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKK-VVNYWLGLNEFSDLSHE 87
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G + S R+ S+ F Y++V +P S+DWR+KGAVT +KNQG CGSCWA
Sbjct: 88 EFKNKYLGLKVDM-SERRECSQE--FNYKDVMSIPKSVDWRKKGAVTDVKNQGSCGSCWA 144
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+LVDC T NN GC+GGLMD AF YII N GL E D
Sbjct: 145 FSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAFSYIISNGGLHKEVD 204
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTC+ +KE++ TI Y D+P+ E +LL+A+ QP+SV +EASG+ F+FY
Sbjct: 205 YPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRDFQFYS 264
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----G 332
GV + CG DHGVA VG+G+ +G Y ++KNSWG WGE GYIR+ R+ G
Sbjct: 265 GGVFDGHCGTQLDHGVAAVGYGST---NGLDYIIVKNSWGSKWGEKGYIRMKRNTGKPAG 321
Query: 333 LCGIATEASYPV 344
LCGI ASYP
Sbjct: 322 LCGINKMASYPT 333
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 155/312 (49%), Positives = 207/312 (66%), Gaps = 9/312 (2%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I+ +E W+ +HG++Y EK R IFK N YI++ N +R++KLG N F+DLTNE
Sbjct: 40 IMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNE 99
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
E+R+ YTG R S + S + + +P S+DWRE GAV +K+QG CGSCWA
Sbjct: 100 EYRSKYTGI-RTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQCGSCWA 158
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS ++AVEGI QI GKLI LSEQ+LVDC N GC+GGLMD AF++II N G+ ++AD
Sbjct: 159 FSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDSDAD 218
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY G CD+ ++ A TI YED+P+ DE AL +A QP+SV +EASG+ F+FY
Sbjct: 219 YPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYD 278
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEG 332
G+ +CG + DHGV VVG+GT E+G YW+++NSWG WGE GY+R+ R G
Sbjct: 279 SGIFTGKCGTDLDHGVVVVGYGT---ENGKDYWIVRNSWGADWGEKGYLRMERGISSKAG 335
Query: 333 LCGIATEASYPV 344
+CGI +E SYPV
Sbjct: 336 ICGITSEPSYPV 347
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 162/354 (45%), Positives = 230/354 (64%), Gaps = 24/354 (6%)
Query: 8 SFIIPMFVIIILVITCASQVVS--------GRSM--HEPSIVEKHEQWMAQHGRTYKDEL 57
+ ++ + VI I C + V + GR+ E ++ ++++WMAQ+ R YKD+
Sbjct: 15 TLMLLLCVIAIADCICQAAVAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKDDA 74
Query: 58 EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRP--VPSVSR 115
EKA R +FK N E+I+++N G + Y LGTN+F+DLT++EF A YTG +P VPS ++
Sbjct: 75 EKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAK 134
Query: 116 QSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGG 173
Q P+ FKYQN T D +DWR++GAVT +KNQG CG CWAFSAV A+EG+ IT G
Sbjct: 135 QI--PAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTG 192
Query: 174 KLIELSEQQLVDC--STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKE 231
L+ LSEQQ++DC S N GC+GG MD AF+Y++ N G+ TE YPY QGTC +
Sbjct: 193 NLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQGTCQNVQP 252
Query: 232 KAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDH 290
AATI ++DLP GDE+AL AV QPVSV V+ F+FY+ G+ + + CG + +H
Sbjct: 253 ---AATISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNH 309
Query: 291 GVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPV 344
V +G+G ++ G +YW++KNSWG WGE+G++++ G CGI+T ASYP
Sbjct: 310 AVTAIGYGA--DDQGTQYWILKNSWGTGWGENGFMQLQMGVGACGISTMASYPT 361
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 167/354 (47%), Positives = 226/354 (63%), Gaps = 23/354 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSM-------------HEPSIVEKHEQWMAQHGRTYKDEL 57
+ FV+ +LV+ + R++ ++V +HE+WMA+HGRTY DE
Sbjct: 3 VSRFVLTVLVVASVCTAAAPRALAVRELAGEEESAAVAAAMVSRHEKWMAEHGRTYTDEA 62
Query: 58 EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQS 117
EKA RL IF+ N E+I+ N G +++L TN F+DLT+EEFRA+ TG+ +
Sbjct: 63 EKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDEEFRAARTGFRPRPAPAAAAG 122
Query: 118 SRPSTFKYQN--VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKL 175
S F+Y+N + D S+DWR GAVT +K+QG CG CWAFSAVAAVEG+ +I G+L
Sbjct: 123 S-GGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCWAFSAVAAVEGLNKIRTGRL 181
Query: 176 IELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKA 233
+ LSEQ+LVDC + + GC GGLMD AF++I GLA+E+ YPYQ + G+C A
Sbjct: 182 VSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASESGYPYQGDDGSCRSSAAAA 241
Query: 234 AAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVA 293
AA+I +ED+P+ +E AL AV QPVSV + AFRFY GVL ECG + +H +
Sbjct: 242 RAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRFYDSGVLGGECGTDLNHAIT 301
Query: 294 VVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI---LRDEGLCGIATEASYPV 344
VG+GTA DG+KYWL+KNSWG +WGE GY+RI +R EG+CG+A SYPV
Sbjct: 302 AVGYGTA--ADGSKYWLMKNSWGTSWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 353
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 155/314 (49%), Positives = 216/314 (68%), Gaps = 8/314 (2%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ +E+W + H + + EK R +FK+NL++I K N++ +R YKL N+F+D+
Sbjct: 33 EESLWNLYERWRSHHTVS-RSLTEKNQRFNVFKENLKHIHKVNQK-DRPYKLRLNKFADM 90
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
TN EF Y G + S R + F ++N +++P+SIDWR++GAVT +K+QG CGS
Sbjct: 91 TNHEFLQHYGGSKVSHYRMFHGSRRQTGFAHENTSNLPSSIDWRKQGAVTGVKDQGKCGS 150
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATE 214
CWAFS+VAAVEGI +I G+LI LSEQ+LVDC++ N+GC GGLM++AF +I + GL TE
Sbjct: 151 CWAFSSVAAVEGINKIKTGELISLSEQELVDCNSVNHGCDGGLMEQAFSFIEKTGGLTTE 210
Query: 215 ADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
+YPY+ + G CD K TI YE +P+ DEHAL+QAV QPVS+ ++A GQ F+F
Sbjct: 211 NNYPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQDFQF 270
Query: 275 YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----D 330
Y GV +CG +HGVA+VG+G +DG KYW++KNSWG WGE+G+IR+ R +
Sbjct: 271 YSEGVYTGDCGTELNHGVALVGYGAT--QDGTKYWIVKNSWGSEWGENGFIRMQRENDVE 328
Query: 331 EGLCGIATEASYPV 344
EGLCGI EASYP+
Sbjct: 329 EGLCGITLEASYPI 342
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 164/351 (46%), Positives = 223/351 (63%), Gaps = 26/351 (7%)
Query: 13 MFVIIILVITCAS----QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
MFV++ L T +S ++S H + ++ +E+W+ + G+ Y E+
Sbjct: 11 MFVLLFLSFTLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGERE 70
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R +FK NL +I++ N E NRTYKLG N F+DLTNEE+R++Y G + R R
Sbjct: 71 KRFQVFKDNLRFIDEHNSE-NRTYKLGLNGFADLTNEEYRSTYLG---ARGGMKRNRLRK 126
Query: 121 STFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
++ +Y +P S+DWR++GAV +K+QG CGSCWAFS +AAVEGI +I G LI L
Sbjct: 127 TSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISL 186
Query: 179 SEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
SEQ+LVDC T N GC+GGLMD AFE+II N G+ TE DYPY G CD ++ A T
Sbjct: 187 SEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTYRKNAKVVT 246
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I YED+P E AL +AV QPVSV +EA G+ F+FY G+ + CG DHGVA VG+
Sbjct: 247 IDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLDHGVAAVGY 306
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
GT E+G YW+++NSWG++WGE+GY+R+ R G+CGIA EASYP+
Sbjct: 307 GT---ENGKDYWIVRNSWGKSWGENGYLRMARSINSPTGICGIAMEASYPI 354
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 165/320 (51%), Positives = 214/320 (66%), Gaps = 16/320 (5%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
+H +++ E+W+A++ + Y EK R +FK NL +I++ANK+ TY LG N F+
Sbjct: 57 VHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVT-TYWLGLNAFA 115
Query: 93 DLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKNQG 150
DLT++EF+A+Y G +P + + S F+Y V D VP S+DWR+KGAVT +KNQG
Sbjct: 116 DLTHDEFKATYLGLRQP----ETKKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQG 171
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS VAAVEGI QI G L LSEQ+LVDCSTD NNGC+GG+MD AF YI +
Sbjct: 172 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAFSYIASSG 231
Query: 210 GLATEADYPYQQEQGTC-DKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
GL TE YPY E+G C DK ++ TI YED+P DE AL++A+ QP+SV +EAS
Sbjct: 232 GLRTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEAS 291
Query: 269 GQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
G+ F+FY GV N CG DHGVA VG+G+++ +D Y ++KNSWG WGE GYIR+
Sbjct: 292 GRHFQFYSGGVFNGPCGSELDHGVAAVGYGSSKGQD---YIIVKNSWGSHWGEKGYIRMK 348
Query: 329 R----DEGLCGIATEASYPV 344
R EGLCGI ASYP
Sbjct: 349 RGTGKPEGLCGINKMASYPT 368
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 161/314 (51%), Positives = 208/314 (66%), Gaps = 16/314 (5%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
E +E+W + H + + EK R +FK N+ Y+ NK+ ++ YKL N+F+D+TN EF
Sbjct: 36 ELYERWRSHHTVSRSLD-EKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEF 93
Query: 100 RASYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSC 155
R Y G ++R SR + TF Y NV DVP S+DWR+KGAVT +K+QG CGSC
Sbjct: 94 RHHYAGSKIKHHRSFLGASRANG---TFMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSC 150
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATE 214
WAFS V AVEGI QI +L+ LSEQ+LVDC T N GC+GGLMD AFE+I + G+ TE
Sbjct: 151 WAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTE 210
Query: 215 ADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
+YPY E G CD QK + +I YED+P DE +LL+AV QPVSV ++ASG F+F
Sbjct: 211 ENYPYMAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQF 270
Query: 275 YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----D 330
Y GV +CG DHGVA+VG+GT DG KYW+++NSWG WGE GYIR+ R +
Sbjct: 271 YSEGVFTGDCGTELDHGVAIVGYGTT--LDGTKYWIVRNSWGPEWGEKGYIRMQREIDAE 328
Query: 331 EGLCGIATEASYPV 344
EGLCGIA + SYP+
Sbjct: 329 EGLCGIAMQPSYPI 342
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 164/319 (51%), Positives = 214/319 (67%), Gaps = 16/319 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTN 96
+ ++HE+WMA+HGR Y D+ EK RL +F+ N+ +IE N ++ + L N+F+DLTN
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 97 EEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGS 154
EFRA+ TG PS SR + P++F+Y NV+ D+P S+DWR KGAV +K+QG CG
Sbjct: 61 AEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGC 117
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAA+EG ++ GKL+ LSEQQLV C ++ GC GGLMD AF++II+N GLA
Sbjct: 118 CWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLA 177
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
E+DYPY C AAAATI YED+P DE ALL+AV QPVSV ++ + F
Sbjct: 178 AESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHF 237
Query: 273 RFYKRGVLN--AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR- 329
+FYK GVL+ A C DH + VG+G A DG KYWL+KNSWG +WGE GY+R+ R
Sbjct: 238 QFYKGGVLSGAAGCATELDHAITAVGYGVAS--DGTKYWLMKNSWGTSWGEDGYVRMERG 295
Query: 330 ---DEGLCGIATEASYPVA 345
EG+CG+A ASYP A
Sbjct: 296 VADKEGVCGLAMMASYPTA 314
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 157/313 (50%), Positives = 217/313 (69%), Gaps = 12/313 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E W++ + Y+ EK +R +FK NL++I++ NK+G ++Y LG NEF+DL++E
Sbjct: 47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLSHE 105
Query: 98 EFRASYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
EF+ Y G + V R R + F Y++V VP S+DWR+KGAV +KNQG CGSCW
Sbjct: 106 EFKKMYLGLKTDI--VRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEA 215
AFS VAAVEGI +I G L LSEQ+L+DC T NNGC+GGLMD AFEYI++N GL E
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEE 223
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
DYPY E+GTC+ QK+++ TI ++D+P DE +LL+A+ QP+SV ++ASG+ F+FY
Sbjct: 224 DYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFY 283
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----E 331
GV + CG + DHGVA VG+G+++ G+ Y ++KNSWG WGE GYIR+ R+ E
Sbjct: 284 SGGVFDGRCGVDLDHGVAAVGYGSSK---GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPE 340
Query: 332 GLCGIATEASYPV 344
GLCGI AS+P
Sbjct: 341 GLCGINKMASFPT 353
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 162/326 (49%), Positives = 217/326 (66%), Gaps = 18/326 (5%)
Query: 35 EPSIVEKHEQW----MAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNE 90
E S+ +EQW M +++ +KA +FK+N+ YI +ANK+G R+++L N+
Sbjct: 35 EESLRALYEQWRSHYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKG-RSFRLALNK 93
Query: 91 FSDLTNEEFRASY-----TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
F+D+T +EFR +Y T ++R + S R+ S F Y ++P ++DWR++GAVT
Sbjct: 94 FADMTTDEFRRAYAAGSRTRHHRALSSGIRRHGDGS-FMYAQAGNLPLAVDWRQRGAVTG 152
Query: 146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEY 204
IK+QG CGSCWAFS +AAVEGI +I GKL+ LSEQ+LVDC DN GC+GGLMD AF+Y
Sbjct: 153 IKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAFQY 212
Query: 205 IIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVC 264
I N G+ TE++YPY EQ +C+K KE++ TI YED+P +E AL +AV QPVS+
Sbjct: 213 IKRNGGITTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVANQPVSIA 272
Query: 265 VEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
+EASGQ F+FY GV CG DHGVA VG+G DG KYW++KNSWGE WGE GY
Sbjct: 273 IEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGIT--RDGTKYWIVKNSWGEDWGERGY 330
Query: 325 IRILR----DEGLCGIATEASYPVAM 346
IR+ R +GLCGIA E SYP +
Sbjct: 331 IRMQRGISDSQGLCGIAMEPSYPTKI 356
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 156/313 (49%), Positives = 206/313 (65%), Gaps = 10/313 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ +E W+ +HG++Y E+ R IFK NL +IE+ N NRTYK+G N F+DLTNE
Sbjct: 50 VMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVGLNRFADLTNE 108
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
E+R+ Y G R S + ++ D+P S+DWREKGAV +K+QG+CGSCWA
Sbjct: 109 EYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWA 168
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS +AAVEGI QI G LI LSEQ+LVDC N GC+GGLMD AFE+II N G+ +E D
Sbjct: 169 FSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEED 228
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY+ TCD ++ A +I YED+P+ DE +L +AV QPVSV +EA G+AF+ Y+
Sbjct: 229 YPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQ 288
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----E 331
GV +CG DHGV VG+GT E+ YW+++NSWG WGESGYI++ R+
Sbjct: 289 SGVFTGQCGTQLDHGVVAVGYGT---ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTET 345
Query: 332 GLCGIATEASYPV 344
G CGIA E SYP+
Sbjct: 346 GKCGIAIEPSYPI 358
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 165/315 (52%), Positives = 217/315 (68%), Gaps = 14/315 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+VE E+W+A+H + Y EK R +FK NL++I+K N+E +Y LG NEF+DLT++
Sbjct: 45 LVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINREVT-SYWLGLNEFADLTHD 103
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSC 155
EF+A+Y G + R SSR +F+Y++V+ D+P S+DWR+KGAVT +KNQG CGSC
Sbjct: 104 EFKAAYLGLD--AAPARRGSSR--SFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSC 159
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATE 214
WAFS VAAVEGI I G L LSEQ+L+DCS D N+GC+GGLMD AF YI + GL TE
Sbjct: 160 WAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTE 219
Query: 215 ADYPYQQEQGTC-DKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
YPY E+G+C D +K ++ A TI YED+P DE AL++A+ QPVSV +EASG+ F+
Sbjct: 220 EAYPYLMEEGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQ 279
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR---- 329
FY GV + CG DHGVA VG+G+ ++ G Y +++NSWG WGE GYIR+ R
Sbjct: 280 FYSGGVFDGPCGAQLDHGVAAVGYGS-DKGKGHDYIIVRNSWGAQWGEKGYIRMKRGTSN 338
Query: 330 DEGLCGIATEASYPV 344
EGLCGI ASYP
Sbjct: 339 GEGLCGINKMASYPT 353
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 158/335 (47%), Positives = 222/335 (66%), Gaps = 17/335 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR YKD+ EKA R +FK N +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR + T +PS +R P+ F+Y+NV
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRLTKTNKGF-IPSTTRV---PTGFRYENVNIDA 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+P ++DWR KG VT IK+QG CG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE++YPY C + + A+I YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKC--KSVSNSVASIKGYEDVPANN 240
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+ F+FYK GV+ CG + DHG+ +G+G A DG KY
Sbjct: 241 EAALMKAVANQPVSVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKA--SDGTKY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATE 339
WL+KNSWG TWGE+G++R+ +D G+CG+A E
Sbjct: 299 WLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAME 333
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 323 bits (829), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 170/355 (47%), Positives = 228/355 (64%), Gaps = 30/355 (8%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKAM 61
+I + + ++ + + R + E ++ +H+QWMA+HGRTY+DE EKA
Sbjct: 11 VITFTAVALTILAVTTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAH 70
Query: 62 RLTIFKQNLEYIEKANKEGN--RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSR 119
R +FK N ++++ +N G+ ++Y+L NEF+D+TN+EF A YTG RPVP+ ++ +
Sbjct: 71 RFQVFKANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGL-RPVPAGAK---K 126
Query: 120 PSTFKYQNVT-----DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGK 174
+ FKY NVT D ++DWR+KGAVT IKNQG CG CWAF+AVAAVEGI QIT G
Sbjct: 127 MAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGN 186
Query: 175 LIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKA 233
L+ LSEQQ++DC TD NNGC+GG +D AF+YI+ N GL TE YPY Q C + A
Sbjct: 187 LVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIVGNGGLGTEDAYPYTAAQAMCQSVQPVA 246
Query: 234 AAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN-AECGD--NCDH 290
A I Y+D+P GDE AL AV QPVSV ++A F+ Y GV+ A C N +H
Sbjct: 247 A---ISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAASCSTPPNLNH 301
Query: 291 GVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPVA 345
V VG+GTA EDG YWL+KN WG+ WGE GY+R+ R CG+A +ASYPVA
Sbjct: 302 AVTAVGYGTA--EDGTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 162/317 (51%), Positives = 207/317 (65%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYK-DELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ ++ W QH + D E A R IFK+N++YI+ NK+ + YKLG N+F+D
Sbjct: 39 EKSLRSLYDNWALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNKK-DSPYKLGLNKFAD 97
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
L+NEEF+A Y G + + + +F YQN +P SIDWR+KGAV +KNQGHCG
Sbjct: 98 LSNEEFKAIYMGTKMDLRG--DREVQSGSFMYQNSEPLPASIDWRQKGAVAAVKNQGHCG 155
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLAT 213
SCWAFS VA+VEGI IT G L+ LSEQQLVDCST+N+GC+GGLMD AF+YII N G+ T
Sbjct: 156 SCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTENSGCNGGLMDTAFQYIINNGGIVT 215
Query: 214 EADYPYQQEQGTCDKQK--EKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
E +YPY E C K + I +ED+P +E AL +AV QPVSV +EASGQ
Sbjct: 216 EDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEASGQD 275
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
F+FY GV +CG DHGV VG+GT+ E G YW+++NSWG WGE GYIR+ +
Sbjct: 276 FQFYSTGVFTGKCGTALDHGVVAVGYGTSPE--GINYWIVRNSWGPKWGEEGYIRMQQGI 333
Query: 331 ---EGLCGIATEASYPV 344
EG CGIA +ASYP
Sbjct: 334 EAAEGKCGIAMQASYPT 350
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 159/321 (49%), Positives = 208/321 (64%), Gaps = 9/321 (2%)
Query: 30 GRSMHEPSIVEKHEQWMAQHGRTYKD-ELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGT 88
G S + ++ +E W+ +HG++Y EK R IFK NL YI++ N G+R+YKLG
Sbjct: 37 GLSRSDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGL 96
Query: 89 NEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKN 148
N F+DLTNEE+R++Y G ++ + + +P SIDWREKGAV +K+
Sbjct: 97 NRFADLTNEEYRSTYLGAKTDARRRIAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKD 156
Query: 149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIE 207
QG CGSCWAFS +AAVEGI QI G+LI LSEQ+LVDC T N GC+GGLMD AFE+II+
Sbjct: 157 QGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIK 216
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
N G+ TEADYPY G CD+ ++ A +I YED+ DE AL +AV QPVSV +EA
Sbjct: 217 NGGIDTEADYPYTGRYGRCDQTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEA 276
Query: 268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
G+ F+ Y G+ CG + DHGV VG+GT E+G YW++KNSW +WGE GY+R+
Sbjct: 277 GGRDFQLYSSGIFTGSCGTDLDHGVTAVGYGT---ENGVDYWIVKNSWAASWGEKGYLRM 333
Query: 328 LRD----EGLCGIATEASYPV 344
R+ GLCGIA E SYP
Sbjct: 334 QRNVKDKNGLCGIAIEPSYPT 354
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 159/295 (53%), Positives = 200/295 (67%), Gaps = 12/295 (4%)
Query: 58 EKAMRLTIFKQNLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQ 116
E+ RL IF +N+ YIE +N N+ YKL N+F+DLTNEEF AS N+ +
Sbjct: 3 EREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIASR---NKFKGHMCSS 59
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
R +TFKY+N + +P+++DWR+KGAVT +KNQG CGSCWAFSAVAA EGI Q++ GKL+
Sbjct: 60 IIRTTTFKYENASAIPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLV 119
Query: 177 ELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAA 234
LSEQ+L+DC T + GC GGLMD AF++II+N GL+TE YPY+ GTC+ K
Sbjct: 120 SLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKASIH 179
Query: 235 AATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAV 294
A TI YED+P +E AL +AV QP+SV ++ASG F+FY GV CG DHGV
Sbjct: 180 AVTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTA 239
Query: 295 VGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
VG+G DG KYWL+KNSWG WGE GYIR+ R EGLCGIA +ASYP A
Sbjct: 240 VGYGVG--NDGTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYPTA 292
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 323 bits (827), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 155/343 (45%), Positives = 213/343 (62%), Gaps = 9/343 (2%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+ +I + + ++CA + + + ++ +E+W+ +H + Y EK R +FK
Sbjct: 6 TLMISTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFK 65
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS-VSRQSSRPSTFKYQ 126
NL +I++ N N TYKLG N+F+D+TNEE+R Y G + + S + Y
Sbjct: 66 DNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYS 125
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+P +DWR KGAV IK+QG CGSCWAFS VA VE I +I GK + LSEQ+LVDC
Sbjct: 126 AGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC 185
Query: 187 STD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
N GC+GGLMD AFE+II+N G+ T+ DYPY+ G CD K+ A A I YED+P
Sbjct: 186 DRAYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKAVNIDGYEDVP 245
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
DE+AL +AV +QPVS+ +EASG+A + Y+ GV ECG + DHGV VVG+G+ E+G
Sbjct: 246 PYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVGYGS---ENG 302
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YWL++NSWG WGE GY ++ R+ G CGI EASYPV
Sbjct: 303 VDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 323 bits (827), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 215/336 (63%), Gaps = 9/336 (2%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+++ LV+T + V R + E KHE+WMAQ+G+ YKD EK R IFK N+ +IE
Sbjct: 11 LVVFLVLTVWTSQVMSRRLSEAYSSVKHEKWMAQYGKVYKDAAEKEKRFQIFKNNVHFIE 70
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
+ G++ + L N+F+DL +F+A + +V ++ ++FKY +VT +P+S
Sbjct: 71 SFHAAGDKPFNLSINQFADL--HKFKALLINGQKKEHNVRTATATEASFKYDSVTRIPSS 128
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC-STDNNGC 193
+DWR++GAVT IK+QG C SCWAFS VA +EG+ QIT G+L+ LSEQ+LVDC D+ GC
Sbjct: 129 LDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQELVDCVKGDSEGC 188
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALL 253
GG ++ AFE+I + G+A+E YPY+ TC +KE I YE +P E ALL
Sbjct: 189 YGGYVEDAFEFIAKKGGVASETHYPYKGVNKTCKVKKETHGVVQIKGYEQVPSNSEKALL 248
Query: 254 QAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKN 313
+AV QPVS VEA G AF+FY G+ +CG + DH V VVG+G A G KYWL+KN
Sbjct: 249 KAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSVTVVGYGKA--RGGNKYWLVKN 306
Query: 314 SWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
SWG WGE GYIR+ RD EGLCGIAT A YP A
Sbjct: 307 SWGTEWGEKGYIRMKRDIRAKEGLCGIATGALYPTA 342
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 323 bits (827), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 164/349 (46%), Positives = 220/349 (63%), Gaps = 16/349 (4%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLT 64
+K F++ + ++L + + E E +E+W + H + + EK R
Sbjct: 1 MKKLFLVLFTLALVLRLGESFDFHEKELETEEKFWELYERWRSHHTVSRSLD-EKHKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG----YNRPVPSVSRQSSRP 120
+FK N+ Y+ NK+ ++ YKL N+F+D+TN EFR Y G ++R + SR +
Sbjct: 60 VFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEFRQHYAGSKIKHHRTLLGASRANG-- 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
TF Y N +VP SIDWR+KGAVT +K+QG CGSCWAFS V AVEGI QI KL+ LSE
Sbjct: 117 -TFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKLVSLSE 175
Query: 181 QQLVDC-STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
Q+LVDC +T+N GC+GGLMD AF++I + G+ TE YPY+ E CD QK +I
Sbjct: 176 QELVDCDTTENQGCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDKCDIQKRNTPVVSID 235
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
+ED+P DE ALL+AV QP+SV ++ASG F+FY GV ECG DHGVA+VG+GT
Sbjct: 236 GHEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHGVAIVGYGT 295
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
DG KYW++KNSWG WGE GYIR+ R +EGLCGIA + SYP+
Sbjct: 296 T--VDGTKYWIVKNSWGAGWGEKGYIRMQRKVDAEEGLCGIAMQPSYPI 342
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 166/351 (47%), Positives = 223/351 (63%), Gaps = 21/351 (5%)
Query: 11 IPMFVIIILVITCA--SQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
+ +F+++I + A +VS H + ++ +E W+ +HG+ Y EK
Sbjct: 8 LSLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKE 67
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK NL +I++ N + N TY+LG N F+DLTNEE+R+ Y G V+R+ SR
Sbjct: 68 KRFGIFKDNLRFIDEHNSQ-NLTYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVSRK 126
Query: 121 STFKYQNVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELS 179
S V D +P IDWR++GAV +K+QG CGSCWAFS +AAVEGI QI G LI LS
Sbjct: 127 SDRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLS 186
Query: 180 EQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
EQ+LVDC T N GC+GGLMD AFE+II N G+ +E DYPY+ CD+ ++ A +I
Sbjct: 187 EQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVVSI 246
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YED+P+ DE AL +AV KQPVSV +EA G+AF+ Y+ GV +CG + DHGVA VG+G
Sbjct: 247 DGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVGYG 306
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
T E+G YW++ NSWG+ WGE GYIR+ R+ G CGIA SYP+
Sbjct: 307 T---ENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPI 354
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 170/347 (48%), Positives = 232/347 (66%), Gaps = 21/347 (6%)
Query: 14 FVIIILVITCA----SQVVSGRSMHEPS-IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQ 68
FV ++L I S+ S ++++PS IV+ H+QWM Q R Y DE EK +RL + +
Sbjct: 6 FVCVVLTIFFMDLKISEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTE 65
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGY---NRPVPSVSRQSSRPSTFKY 125
NL++IE N GN++YKLG NEF+D T EEF A+YTG N P ++P+
Sbjct: 66 NLKFIESFNNMGNQSYKLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAW--N 123
Query: 126 QNVTDV-PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
V+DV T+ DWR +GAVT +K+QG CG CWAFSA+AAVEG+T+I G LI LSEQQL+
Sbjct: 124 WTVSDVLGTNKDWRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLL 183
Query: 185 DCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
DC+ + NNGC GG AF YII+++G+++E +YPYQ ++G C + A I +E+
Sbjct: 184 DCTREQNNGCKGGTFVNAFNYIIKHRGISSENEYPYQVKEGPC--RSNARPAILIRGFEN 241
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEE 302
+P +E ALL+AV++QPV+V ++AS F Y GV NA CG + +H V +VG+GT+ E
Sbjct: 242 VPSNNERALLEAVSRQPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPE 301
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
G KYWL KNSWG+TWGE+GYIRI RD +G+CG+A ASYPVA
Sbjct: 302 --GMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPVA 346
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 156/311 (50%), Positives = 215/311 (69%), Gaps = 11/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R +FK NL++I++ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G + S R+SS F Y++V D+P S+DWR+KGAVT +KNQG CGSCWA
Sbjct: 102 EFKNKYLGLKVNL-SQRRESSNEEEFTYRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWA 159
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T NNGC+GGLMD AF +I++N GL E D
Sbjct: 160 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKEDD 219
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+ TC+ +KE+ TI Y D+P+ +E +LL+A+ QP+SV +EAS + F+FY
Sbjct: 220 YPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYS 279
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV + CG + DHGV+ VG+GT++ D Y ++KNSWG WGE G+IR+ R+ EG
Sbjct: 280 GGVFDGHCGSDLDHGVSAVGYGTSKNLD---YIIVKNSWGAKWGEKGFIRMKRNIGKPEG 336
Query: 333 LCGIATEASYP 343
+CG+ ASYP
Sbjct: 337 ICGLYKMASYP 347
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 212/336 (63%), Gaps = 14/336 (4%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
+ L +VS E + + +WMA+HG TY E+ R F+ NL YI++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 77 N---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N G +++LG N F+DLTNEE+R++Y G R P R+ S + ++ + ++P
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGA-RTKPDRERKLS--ARYQAADNDELPE 134
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNG 192
S+DWR+KGAV +K+QG CGSCWAFSA+AAVEGI QI G +I LSEQ+LVDC T N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C+GGLMD AFE+II N G+ +E DYPY++ CD K+ A TI YED+P E +L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QP+SV +EA G+AF+ YK G+ CG DHGVA VG+GT E+G YWL++
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVR 311
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
NSWG WGE GYIR+ R+ G CGIA E SYP
Sbjct: 312 NSWGSVWGEDGYIRMERNIKASSGKCGIAVEPSYPT 347
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 160/357 (44%), Positives = 219/357 (61%), Gaps = 26/357 (7%)
Query: 10 IIPM-----FVIIILVITCASQVVSGRSMH------------EPSIVEKHEQWMAQHGRT 52
+IPM F +I ++ +++ + H + + +E W+ +HG+T
Sbjct: 3 LIPMATLSFFALISIISAMDMSIINYDATHMSSSSSSAPLRTDDEVNALYESWLVKHGKT 62
Query: 53 YKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS 112
Y EK R IFK NL +I++ N G+ TYKLG N+F+DLTNEE+R +YTG
Sbjct: 63 YNALGEKDRRFQIFKDNLRFIDEHN-SGDHTYKLGLNKFADLTNEEYRMTYTGIKTIDDK 121
Query: 113 VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITG 172
+ + Y++ +P +DWRE+GAVT +K+QG CGSCWAFS +VEG+ +I
Sbjct: 122 KKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTTGSVEGVNKIVT 181
Query: 173 GKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKE 231
G LI +SEQ+LV+C T N GC+GGLMD AFE+II+N G+ TE DYPY + G CDK K+
Sbjct: 182 GDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYTGKDGKCDKNKK 241
Query: 232 KAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHG 291
A TI YED+P DE +L +AV+ QPV+V +EA G+ F+FY G+ CG DHG
Sbjct: 242 NAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIFTGSCGTALDHG 301
Query: 292 VAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
V G+GT EDG YWL+KNSWG WGE GY+++ R+ G CGIA EASYP+
Sbjct: 302 VLAAGYGT---EDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGIAMEASYPI 355
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 322 bits (825), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 165/342 (48%), Positives = 220/342 (64%), Gaps = 17/342 (4%)
Query: 14 FVIIILVI----TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
F++ +LV+ C + + ++ +HE+WMA+HGR YKDE EKA RL +F+ N
Sbjct: 6 FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN-- 127
E I+ N G +++L TN F+DLT EEFRA+ TG RP P+ S + R F+Y+N
Sbjct: 66 AELIDSFNAAGTHSHRLATNRFADLTVEEFRAARTGL-RPRPAPSAGAGR---FRYENFS 121
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+ D S+DWR GAVT +K+QG CG CWAFSAVAAVEG+ +I G+L+ LSEQ+LVDC
Sbjct: 122 LADAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCD 181
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ GC GGLMD AF+++ GLA+E+ YPYQ G C A AA+I +ED+P
Sbjct: 182 VSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQGRDGPCRSSAAAARAASIRGHEDVP 241
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+ +E AL AV QPVSV + AFRFY GVL CG + +H + VG+GTA DG
Sbjct: 242 RNNEAALAAAVANQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTA--NDG 299
Query: 306 AKYWLIKNSWGETWGESGYIRI---LRDEGLCGIATEASYPV 344
+YWL+KNSWG +WGE GY+RI +R EG+CG+A SYPV
Sbjct: 300 TRYWLMKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 341
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 163/335 (48%), Positives = 222/335 (66%), Gaps = 13/335 (3%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
II T + S R+ E ++ + +W+A+HG+ Y E+ R IFK NL+++++
Sbjct: 24 IIDYNTNPNHKSSSRTDEE--VMGIYAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEH 81
Query: 77 NKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSI 135
N E NR+YK+G N F+DLTNEE+R+ + G +S S + Q+ +P S+
Sbjct: 82 NSE-NRSYKVGLNRFADLTNEEYRSMFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESV 140
Query: 136 DWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCS 194
DWRE GAV IK+QG CGSCWAFS VAAVEG+ QI G++I+LSEQ+LVDC T + GC+
Sbjct: 141 DWRESGAVAPIKDQGSCGSCWAFSTVAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCN 200
Query: 195 GGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQ 254
GGLMD AFE+II N G+ TE DYPY+ GTCD +++ +I YED+P DE AL +
Sbjct: 201 GGLMDYAFEFIINNGGIDTEEDYPYRGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKK 260
Query: 255 AVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNS 314
AV QPVSV +EASG+AF+ Y GV ECG DHGV VVG+GT ++GA +W+++NS
Sbjct: 261 AVAHQPVSVAIEASGRAFQLYLSGVFTGECGRALDHGVVVVGYGT---DNGADHWIVRNS 317
Query: 315 WGETWGESGYIRILRD-----EGLCGIATEASYPV 344
WG +WGE+GYIR+ R+ G CGIA +ASYP+
Sbjct: 318 WGTSWGENGYIRMERNVVDNFGGKCGIAMQASYPI 352
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 159/311 (51%), Positives = 210/311 (67%), Gaps = 13/311 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y++ EK +R IFK NL++I++ NK + Y LG +EF+DL++
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLSEFADLSHR 102
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF Y G SR+ P F Y++V ++P S+DWR+KGAV +KNQG CGSCWA
Sbjct: 103 EFNNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T NNGC+GGLMD AF +I+EN GL E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+G C+ KE+ TI Y D+P+ +E +LL+A+ QP+SV +EASG+ F+FY
Sbjct: 219 YPYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV + CG + DHGVA VG+GTA+ G Y +KNSWG WGE GYIR+ R+ EG
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335
Query: 333 LCGIATEASYP 343
+CGI ASYP
Sbjct: 336 ICGIYKMASYP 346
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 169/355 (47%), Positives = 224/355 (63%), Gaps = 35/355 (9%)
Query: 13 MFVIIILVITCAS----QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
M +++ LV +S ++S H + ++ +E+W+ +HG+ Y EK
Sbjct: 1 MLMLLFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKE 60
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASY----TGYNRPVPSVS-R 115
R IFK NL +I++ N E NRTY +G N F+DLTNEEFR+ Y TG+ + +P S R
Sbjct: 61 KRFEIFKDNLMFIDQHNSE-NRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDR 119
Query: 116 QSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGK 174
+ R V D +P S+DWR++GAV +K+QG CGSCWAFS +AAVEGI +I G
Sbjct: 120 YAPR--------VGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGD 171
Query: 175 LIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKA 233
LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE DYPY G CD ++ A
Sbjct: 172 LIALSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNA 231
Query: 234 AAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVA 293
+I YED+P+ DE AL +AV QPVSV +E G+ F+ Y GV ECG + DHGVA
Sbjct: 232 KVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVA 291
Query: 294 VVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
VG+GT E G YW+++NSWG++WGESGYIR+ R+ G CGIA E SYP+
Sbjct: 292 AVGYGT---EKGKDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPI 343
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 154/304 (50%), Positives = 207/304 (68%), Gaps = 12/304 (3%)
Query: 46 MAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG 105
M++HG++Y+ EK R +F+ NL++I++ NK+ + +Y LG NEF+DL++EEF+ Y G
Sbjct: 1 MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKRKYLG 59
Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVE 165
+P ++ P F Y++V D+P S+DWR+KGAV H+KNQG CGSCWAFS VAAVE
Sbjct: 60 LKIELP---KRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116
Query: 166 GITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQG 224
GI QI G L LSEQ+L+DC NNGC+GGLMD AF +II N GL E DYPY E+G
Sbjct: 117 GINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEG 176
Query: 225 TCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAEC 284
TC ++KE+ TI Y D+P+ +E + L+A+ QP+SV +EAS + F+FY G+ N C
Sbjct: 177 TCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHC 236
Query: 285 GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEA 340
G DHGVA VG+GT++ G Y +KNSWG WGE GYIR+ R+ EG+CGI A
Sbjct: 237 GTELDHGVAAVGYGTSK---GVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMA 293
Query: 341 SYPV 344
SYP
Sbjct: 294 SYPT 297
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 156/326 (47%), Positives = 214/326 (65%), Gaps = 12/326 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLG 87
V+ + + + +E W+A+HG+TY EK R IF NL++I++ N GNR+YK+G
Sbjct: 22 VTSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVG 81
Query: 88 TNEFSDLTNEEFRASYTGYN-RPVPSVSRQSSRPSTFKY--QNVTDVPTSIDWREKGAVT 144
N+F+DLTNEE+R+ Y G P +++ + +Y Q P +DWRE+GAV+
Sbjct: 82 LNQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAVS 141
Query: 145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFE 203
+KNQG CGSCWAFS VA+VEGI +I G LI LSEQ+LVDC N+GC+GG MD AF+
Sbjct: 142 PVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYAFQ 201
Query: 204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSV 263
+I+ N G+ +E+DYPY+ CD + KA +I YED+P +E AL++AV QPVSV
Sbjct: 202 FIVSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPVSV 261
Query: 264 CVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
+EASG+AF+ Y GVL CG N DHGV VVG+G+ E+G YW+++NSWG WGE G
Sbjct: 262 GIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYGS---ENGKDYWIVRNSWGPEWGEDG 318
Query: 324 YIRILRDE-----GLCGIATEASYPV 344
YIR+ R+ G+CGI ASYP+
Sbjct: 319 YIRMERNMVDTPVGMCGITLMASYPI 344
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 159/344 (46%), Positives = 221/344 (64%), Gaps = 12/344 (3%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
S +IP +++ + A+ +S + E +++ +E+W+ +H + Y EK R +FK
Sbjct: 3 SMLIPTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFK 61
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS-VSRQSSRPSTFKYQ 126
NL +I+ N + N TY LG N+F+D+TNEE+RA Y G V + + + Y
Sbjct: 62 DNLGFIQDHNAQ-NNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYN 120
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+ +P +DWR KGAV IK+QG+CGSCWAFS VAAVEGI I G+ + LSEQ+LVDC
Sbjct: 121 SGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDC 180
Query: 187 STD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ + GC+GGLMD AF++II+N G+ TE DYPYQ GTCD+ K+K I YED+P
Sbjct: 181 DREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYEDVP 240
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+E+AL +AV+ QPVSV +EASG+A + Y+ GV +CG DHGV VVG+GT E+G
Sbjct: 241 SNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT---ENG 297
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
YWL++NSWG WGE GY ++ R+ EG CGIA + SYPV
Sbjct: 298 VDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 158/345 (45%), Positives = 221/345 (64%), Gaps = 10/345 (2%)
Query: 6 EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTI 65
+K +I + + ++LV++ + + S+ + +E+W + H + ++ EK R +
Sbjct: 4 KKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS-TFK 124
FK N+ ++ NK ++ YKL N+F+D+TN EF+ +Y G + R + R S TF
Sbjct: 63 FKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFM 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
Y+N T P S+DWR+KGAVT +K+QG CGSCWAFS V AVEGI QI +L+ LSEQ+L+
Sbjct: 122 YENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELI 181
Query: 185 DCST-DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
DC +N GC+GGLM+ AFEYI + G+ TE+ YPY G+CD KE A +I +E
Sbjct: 182 DCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATKENVPAVSIDGHET 241
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P DE ALL+AV QPVSV ++A G F+FY GV +CG +HGVA+VG+GT
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT--V 299
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
DG YW+++NSWG WGE GYIR+ R+ EGLCGIA EASYPV
Sbjct: 300 DGTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 217/330 (65%), Gaps = 14/330 (4%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYK 85
++ S + ++ +E W+ QH + Y EK R IFK NLE+I++ N + ++T+K
Sbjct: 37 NLLPSSSRSDDEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFK 96
Query: 86 LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSS-----RPSTFKYQNVTDVPTSIDWREK 140
+G N+F+DLTNEEFR+ Y G + S SS + + ++ ++P ++DWR+
Sbjct: 97 VGLNKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKN 156
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMD 199
GAV +K+QG CGSCWAFS +AAVEGI QI G+L+ LSEQ+LVDC T N+GC GGLMD
Sbjct: 157 GAVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNSGCDGGLMD 216
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ 259
A+E+II N G+ T+ADYPY + G CD+ ++ A TI +ED+P+ DE AL +AV Q
Sbjct: 217 YAYEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQ 276
Query: 260 PVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
PVSV +EA G F+FY+ GV +CG + DHGV VG+G+ +DG YW+++NSWG W
Sbjct: 277 PVSVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYGS---DDGKDYWIVRNSWGADW 333
Query: 320 GESGYIRILRD-----EGLCGIATEASYPV 344
GESGYIR+ R+ G CGIA E SYP+
Sbjct: 334 GESGYIRMERNLETVKTGKCGIAIEPSYPI 363
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 162/319 (50%), Positives = 211/319 (66%), Gaps = 17/319 (5%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
HE ++E+ W +HG+ Y D + R ++K NL YI + E NRTY LG +F+D
Sbjct: 46 HENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHS--ETNRTYSLGLTKFAD 103
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
LTNEEFR YTG SR++ R + F+Y + ++ P S+DWR+ GAVT +K+QG CG
Sbjct: 104 LTNEEFRRMYTGTR---IDRSRRAKRRTGFRYAD-SEAPESVDWRKNGAVTSVKDQGSCG 159
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLA 212
SCWAFSAV +VEGI I G+ + LSEQ+LVDC + N GC+GGLMD AF++II+N G+
Sbjct: 160 SCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGID 219
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYPY+ G CD K+ A TI YED+P+ DE AL +AV QPVSV +EA G+ F
Sbjct: 220 TEKDYPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 279
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+ Y +GV + ECG + DHGV VG+GT EDG YW++KNSWGE WGESGY+R+ R+
Sbjct: 280 QLYAQGVFSGECGTDLDHGVLAVGYGT---EDGVDYWIVKNSWGEYWGESGYLRMKRNMK 336
Query: 331 -----EGLCGIATEASYPV 344
GLCGI E SY V
Sbjct: 337 DSNDGPGLCGINIEPSYAV 355
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 169/355 (47%), Positives = 228/355 (64%), Gaps = 30/355 (8%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKAM 61
+I + + ++ + + R + E ++ +H+QWMA+HGRTY+DE EKA
Sbjct: 11 VIAFTAVALTILAVKTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAH 70
Query: 62 RLTIFKQNLEYIEKANKEGN--RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSR 119
R +FK N ++++ +N G+ ++Y++ NEF+D+TN+EF A YTG RPVP+ ++ +
Sbjct: 71 RFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGL-RPVPAGAK---K 126
Query: 120 PSTFKYQNVT-----DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGK 174
+ FKY NVT D ++DWR+KGAVT IKNQG CG CWAF+AVAAVEGI QIT G
Sbjct: 127 MAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGN 186
Query: 175 LIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKA 233
L+ LSEQQ++DC T+ NNGC+GG +D AF+YI N GLATE YPY Q C + A
Sbjct: 187 LVSLSEQQVLDCDTEGNNGCNGGYIDNAFQYIAGNGGLATEDAYPYTAAQAMCQSVQPVA 246
Query: 234 AAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN-AECGD--NCDH 290
A I Y+D+P GDE AL AV QPVSV ++A F+ Y GV+ A C N +H
Sbjct: 247 A---ISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAASCSTPPNLNH 301
Query: 291 GVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPVA 345
V VG+GTA EDG YWL+KN WG+ WGE GY+R+ R CG+A +ASYPVA
Sbjct: 302 AVTAVGYGTA--EDGTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 154/316 (48%), Positives = 206/316 (65%), Gaps = 9/316 (2%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E + +E W+ ++G+ Y EK R IFK NL+++++ N GN +YKLG N+F+DL
Sbjct: 42 EAETLRLYEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADL 101
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
+NEE+RA+Y G + + + +++ D+P S+DWREKGAV +K+QG CGS
Sbjct: 102 SNEEYRAAYLGTRMDGKRRLLGGPKSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGS 161
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLAT 213
CWAFS V AVEGI QI G L LSEQ+LVDC N GC+GGLMD AFE+I++N G+ T
Sbjct: 162 CWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDT 221
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
E DYPY+ CD ++ A TI YED+P+ DE +L +AV QPVSV +EA G+AF+
Sbjct: 222 EEDYPYKAVDSMCDPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQ 281
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--- 330
Y+ GV CG DHGV VG+GT E+G YW+++NSWG WGE+GYIR+ R+
Sbjct: 282 LYQSGVFTGSCGTQLDHGVVAVGYGT---ENGVDYWVVRNSWGPAWGENGYIRMERNVAS 338
Query: 331 --EGLCGIATEASYPV 344
G CGIA EASYP
Sbjct: 339 TETGKCGIAMEASYPT 354
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 161/351 (45%), Positives = 226/351 (64%), Gaps = 13/351 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+K K+F+ + + +ILV + ++ E S+ + +E+W + H +D EK R
Sbjct: 1 MKMGKAFLFAVVLAVILVAAMSMEITERDLASEESLWDLYERWRSHH-TVSRDLSEKRKR 59
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
+FK N+ +I K N++ ++ YKL N F+D+TN EFR Y+ + + SR +T
Sbjct: 60 FNVFKANVHHIHKVNQK-DKPYKLKLNSFADMTNHEFREFYSSKVKHYRML--HGSRANT 116
Query: 123 -FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
F + +P S+DWR++GAVT +KNQG CGSCWAFS V VEGI +I G+L+ LSEQ
Sbjct: 117 GFMHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQ 176
Query: 182 QLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
+LVDC TDN GC+GGLM+ A+E+I ++ G+ TE YPY+ G+CD K A A TI +
Sbjct: 177 ELVDCETDNEGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDSSKMNAPAVTIDGH 236
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTA 300
E +P DE+AL++AV QPVSV ++ASG +FY GV + CG+ DHGVAVVG+GTA
Sbjct: 237 EMVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGTA 296
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILR-----DEGLCGIATEASYPVAM 346
DG KYW++KNSWG WGE GYIR+ R + G+CGIA EASYP+ +
Sbjct: 297 --LDGTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPLKL 345
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 321 bits (822), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 157/312 (50%), Positives = 213/312 (68%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R +FK NL++I+ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G + S R+SS F Y++V D+P S+DWR+KGAVT +KNQG CGSCWA
Sbjct: 102 EFKNKYLGLKVDL-SQRRESSNEEEFTYRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWA 159
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T NNGC+GGLMD AF +I +N GL E D
Sbjct: 160 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEED 219
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+ TC+ +KE+ TI Y D+P+ +E +LL+A+ QP+SV +EAS + F+FY
Sbjct: 220 YPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYS 279
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV + CG + DHGV+ VG+GT++ D Y ++KNSWG WGE G+IR+ RD EG
Sbjct: 280 GGVFDGHCGSDLDHGVSAVGYGTSKNLD---YIIVKNSWGAKWGEKGFIRMKRDIGKPEG 336
Query: 333 LCGIATEASYPV 344
+CG+ ASYP
Sbjct: 337 ICGLYKMASYPT 348
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 158/312 (50%), Positives = 203/312 (65%), Gaps = 16/312 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
+ +WMA HGRTY E+ R +F+ NL YI+ N G +++LG N F+DLTN+E
Sbjct: 41 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 100
Query: 99 FRASYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
+RA+Y G RP R+ + + + D+P S+DWR KGAV +K+QG CGSCWA
Sbjct: 101 YRATYLGARTRP----QRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWA 156
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS +AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE D
Sbjct: 157 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 216
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY+ G CD ++ A TI YED+P DE +L +AV QPVSV +EA+G AF+ Y
Sbjct: 217 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYS 276
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
G+ CG DHGV VG+GT E+G YW++KNSWG +WGESGY+R+ R+ G
Sbjct: 277 SGIFTGSCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 333
Query: 333 LCGIATEASYPV 344
CGIA E SYP+
Sbjct: 334 KCGIAVEPSYPL 345
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 156/344 (45%), Positives = 225/344 (65%), Gaps = 18/344 (5%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK-HEQWMAQHGRTYKDELEKAMRLTIF 66
+ + + ++ + +I A + ++ P++++K +E W+ ++GR Y+D E +R I+
Sbjct: 4 TITLSIVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIY 63
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
+ N++YIE N + N +YKL N F+D+TNEEF+++Y GY +P Q+ F+Y
Sbjct: 64 QSNVQYIEFYNSQ-NYSYKLIDNRFADITNEEFKSTYLGY---LPRFRVQTE----FRYH 115
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
++P SIDWR+KGAVTH+K+QG CGSCWAFSAVAAVEGI +I L+ LSEQQL+DC
Sbjct: 116 KHGELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDC 175
Query: 187 S--TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
+ N GC GG M AF YI ++ G+AT +YPY+ G C+K K K A TI YE +
Sbjct: 176 DIKSGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGYESV 235
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
P +E L AV QPVS+ +A G AF+FY +G+ + CG N +HG+ +VG+G EE+
Sbjct: 236 PARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYG---EEN 292
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
G KYW++KNSW WGESGY+R+ RD +G CGIA +A+YPV
Sbjct: 293 GDKYWIVKNSWANDWGESGYVRMKRDTKDKDGTCGIAMDATYPV 336
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 156/311 (50%), Positives = 212/311 (68%), Gaps = 12/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R +FK NL++I+ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G V R+ S F Y++V D+P S+DWR+KGAVT +KNQG CGSCWA
Sbjct: 102 EFKNKYLGL--KVDLSQRRESSEEEFTYRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWA 158
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T NNGC+GGLMD AF +I++N GL E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEED 218
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+ TC+ +KE + TI Y D+P+ +E +LL+A+ QP+SV +EASG+ F+FY
Sbjct: 219 YPYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV + CG DHGV+ VG+GT++ G Y ++KNSWG WGE G+IR+ R+ EG
Sbjct: 279 GGVFDGHCGSELDHGVSAVGYGTSK---GLDYIIVKNSWGAKWGEKGFIRMKRNIGKSEG 335
Query: 333 LCGIATEASYP 343
+CG+ ASYP
Sbjct: 336 ICGLYKMASYP 346
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 158/314 (50%), Positives = 207/314 (65%), Gaps = 12/314 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ + W+ +HG++Y EK R IFK NL YI+ N + +R+Y+LG N F+DLTNE
Sbjct: 45 VMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADLTNE 104
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSC 155
E+RA Y G + S + S PS +Y V ++P SIDWREKGAV +K+QG CGSC
Sbjct: 105 EYRAKYLG-TKSRESRPKLSKGPSD-RYAPVEGEELPDSIDWREKGAVAAVKDQGSCGSC 162
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATE 214
WAFSA+ AVEGI QIT G+LI LSEQ+LVDC N GC GGLMD AF +II+N G+ ++
Sbjct: 163 WAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMDYAFNFIIKNGGIDSD 222
Query: 215 ADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
DYPY GTC++ KE A TI YED+P DE AL +A QP+SV +EA G F+
Sbjct: 223 LDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQL 282
Query: 275 YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---- 330
Y G+ +CG DHGV VVG+G+ E+G YW+++NSWG WGE+GY+++ R+
Sbjct: 283 YVSGIFTGKCGTAVDHGVVVVGYGS---EEGMDYWIVRNSWGAAWGEAGYLKMQRNVGKS 339
Query: 331 EGLCGIATEASYPV 344
GLCGI E SYPV
Sbjct: 340 SGLCGITIEPSYPV 353
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 158/312 (50%), Positives = 203/312 (65%), Gaps = 16/312 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
+ +WMA HGRTY E+ R +F+ NL YI+ N G +++LG N F+DLTN+E
Sbjct: 46 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 105
Query: 99 FRASYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
+RA+Y G RP R+ + + + D+P S+DWR KGAV +K+QG CGSCWA
Sbjct: 106 YRATYLGARTRP----QRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWA 161
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS +AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE D
Sbjct: 162 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 221
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY+ G CD ++ A TI YED+P DE +L +AV QPVSV +EA+G AF+ Y
Sbjct: 222 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYS 281
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
G+ CG DHGV VG+GT E+G YW++KNSWG +WGESGY+R+ R+ G
Sbjct: 282 SGIFTGSCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 338
Query: 333 LCGIATEASYPV 344
CGIA E SYP+
Sbjct: 339 KCGIAVEPSYPL 350
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 320 bits (820), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 158/344 (45%), Positives = 221/344 (64%), Gaps = 12/344 (3%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
S +IP +++ + A+ +S + E +++ +E+W+ +H + Y EK R +FK
Sbjct: 3 SMLIPTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFK 61
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS-VSRQSSRPSTFKYQ 126
NL +I+ N + N TY LG N+F+D+TN+E+RA Y G V + + + Y
Sbjct: 62 DNLGFIQDHNAQ-NNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYN 120
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+ +P +DWR KGAV IK+QG+CGSCWAFS VAAVEGI I G+ + LSEQ+LVDC
Sbjct: 121 SGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDC 180
Query: 187 STD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ + GC+GGLMD AF++II+N G+ TE DYPYQ GTCD+ K+K I YED+P
Sbjct: 181 DREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYEDVP 240
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+E+AL +AV+ QPVSV +EASG+A + Y+ GV +CG DHGV VVG+GT E+G
Sbjct: 241 SNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT---ENG 297
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
YWL++NSWG WGE GY ++ R+ EG CGIA + SYPV
Sbjct: 298 VDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 320 bits (820), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 155/309 (50%), Positives = 200/309 (64%), Gaps = 9/309 (2%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E W+ +HGR Y EK R IFK NL++I++ N GN +YKLG N+F+DL+N+E+R+
Sbjct: 25 YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYRS 84
Query: 102 SYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
Y G + + ++ D+P ++DWREKGAV +K+QG CGSCWAFS V
Sbjct: 85 VYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGSCWAFSTV 144
Query: 162 AAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
AVEGI QI G L LSEQ+LVDC T N GC+GGLMD AF++IIEN G+ TE DYPY+
Sbjct: 145 GAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGIDTEEDYPYK 204
Query: 221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL 280
CD ++ A TI YED+P+ DE +L +AV QPVSV +EA G+ F+ Y+ GV
Sbjct: 205 AIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQLYQSGVF 264
Query: 281 NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCG 335
CG DHGV VG+GT E G YW+++NSWG WGE+GYIR+ RD G CG
Sbjct: 265 TGSCGTQLDHGVVTVGYGT---EHGVDYWIVRNSWGPAWGENGYIRMERDVASTETGKCG 321
Query: 336 IATEASYPV 344
IA EASYP
Sbjct: 322 IAMEASYPT 330
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 320 bits (819), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 163/318 (51%), Positives = 211/318 (66%), Gaps = 23/318 (7%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ +E+W+ +HG+ Y EK R IFK NL +I++ N E NRTY +G N F+DLTNE
Sbjct: 47 VMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSE-NRTYTVGLNRFADLTNE 105
Query: 98 EFRASY----TGYNRPVPSVS-RQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKNQGH 151
EFR+ Y TG+ + +P S R + R V D +P S+DWR++GAV +K+QG
Sbjct: 106 EFRSMYLGTRTGHKKRLPKTSDRYAPR--------VGDSLPDSVDWRKEGAVAEVKDQGG 157
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKG 210
CGSCWAFS +AAVEGI +I G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G
Sbjct: 158 CGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGG 217
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
+ TE DYPY G CD ++ A +I YED+P+ DE AL +AV QPVSV +E G+
Sbjct: 218 IDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGR 277
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
F+ Y GV ECG + DHGVA VG+GT E G YW+++NSWG++WGESGYIR+ R+
Sbjct: 278 NFQLYNSGVFTGECGTSLDHGVAAVGYGT---EKGKDYWIVRNSWGKSWGESGYIRMERN 334
Query: 331 ----EGLCGIATEASYPV 344
G CGIA E SYP+
Sbjct: 335 IASPTGKCGIAIEPSYPI 352
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 320 bits (819), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 163/317 (51%), Positives = 207/317 (65%), Gaps = 11/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ ++ ++ W+ +HG+ Y EKA R IFK NL +I++ N + NRTYK+G +F+DL
Sbjct: 21 DDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQ-NRTYKVGLTKFADL 79
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
TN+E+RA + G +S PS + Y+ +P S+DWR KGAV IK+QG CG
Sbjct: 80 TNQEYRAMFLGTRSDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAVNPIKDQGSCG 139
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENKGLA 212
SCWAFS VAAVEGI QI G+LI LSEQ+LVDC N GC+GGLMD AF++II N GL
Sbjct: 140 SCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGCNGGLMDYAFQFIINNGGLD 199
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYPY TCD+ K K A +I +ED+ DE AL +AV QPVSV +EASG A
Sbjct: 200 TEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQPVSVAIEASGMAL 259
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+FY+ GV ECG DHGV VVG+GT E G YWL++NSWG WGE GYI++ R+
Sbjct: 260 QFYQSGVFTGECGTALDHGVVVVGYGT---EKGLDYWLVRNSWGTEWGEHGYIKMQRNVR 316
Query: 331 ---EGLCGIATEASYPV 344
G CGIA E+SYPV
Sbjct: 317 DTYTGRCGIAMESSYPV 333
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 320 bits (819), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 162/351 (46%), Positives = 222/351 (63%), Gaps = 26/351 (7%)
Query: 13 MFVIIILVITCAS----QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
MF+++ T +S ++S H + ++ +E W+ +HG+ Y EK
Sbjct: 1 MFMLLFFASTLSSASDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKE 60
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R +FK NL +I++ N E NRTY++G N F+DLTNEE+R+ Y G + + R R
Sbjct: 61 RRFEVFKDNLRFIDEHNSE-NRTYRVGLNRFADLTNEEYRSMYLG---ALSGIRRNKLRK 116
Query: 121 STFKYQ-NVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
+ +Y V D +P S+DWR++GAV +K+QG CGSCWAFSAVAAVEGI +I G LI L
Sbjct: 117 ISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISL 176
Query: 179 SEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
SEQ+LVDC N GC+GGLMD FE+II N G+ +E DYPY G CD ++ A +
Sbjct: 177 SEQELVDCDNSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVS 236
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I YED+P +E AL +AV QPVSV +EA G+ F+ Y GV + CG DHGV VG+
Sbjct: 237 IDSYEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGY 296
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
GT E+G YW+++NSWG++WGESGY+R+ R+ G+CGIA EASYP+
Sbjct: 297 GT---ENGQDYWIVRNSWGKSWGESGYLRMARNIRKPTGICGIAMEASYPI 344
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 320 bits (819), Expect = 8e-85, Method: Compositional matrix adjust.
Identities = 157/341 (46%), Positives = 221/341 (64%), Gaps = 21/341 (6%)
Query: 17 IILVITCASQVVSGRSMHEP----SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
++ ++ C SG + E S+V +HE WM+Q+GR+YKD EK + +FK N +
Sbjct: 8 LLAILGCLCFFASGLAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFKANAAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
I+ N + N + LG N+F+D+TNEEF+ + T N+ +S + + F Y+NV+
Sbjct: 68 IDSFNAK-NHKFWLGINQFADITNEEFKVTKT--NKGF--ISNKVRASTGFSYENVSIDA 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+P +IDWR KGAVT +K+QG CG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC
Sbjct: 123 LPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHG 182
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
++ GC GGLMD AF++II N GL E+ YPY E G C + +A TI YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKC--KSGSKSAGTIKSYEDVPANN 240
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++AV QPVSV V+ F+FY GV+ CG + DHG+A +G+G DG KY
Sbjct: 241 EGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVT--SDGTKY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG +WGE+G++R+ +D +G+CG+A E SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis
sativus]
Length = 317
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 160/314 (50%), Positives = 212/314 (67%), Gaps = 20/314 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I +++++WM ++GR YK E R TI++ N++YI+ N N ++ L N F+DLTNE
Sbjct: 15 IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNFADLTNE 73
Query: 98 EFRASYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
EF+A+Y GY + S P T F+Y N+ ++PT++DWR++GAVT IKNQG CGSCW
Sbjct: 74 EFKATYLGY--------KTVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDC--STDNNGCSGGLMDKAFEYIIENKGLATE 214
AFSAVAAVEGI +I GKLI LSEQ+LVDC ++ N GC+GG M KAFE+ I+ GL TE
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF-IKRTGLTTE 184
Query: 215 ADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
+YPYQ + C++QKEK +I YE +P DE +L AV QPVSV ++A G F+F
Sbjct: 185 IEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQF 244
Query: 275 YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---- 330
Y G+ + CG+ +HGVA+VG+G E YWL+KNSWG WGESGYIR+ RD
Sbjct: 245 YSGGIFSGNCGNQLNHGVAIVGYG---ETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDR 301
Query: 331 EGLCGIATEASYPV 344
+G CGIA ASYP
Sbjct: 302 QGTCGIAMMASYPT 315
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 163/351 (46%), Positives = 214/351 (60%), Gaps = 21/351 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH---------EPSIVEKHEQWMAQHGRTYKDELEKA 60
I+ +F + + ++S S H E ++ +EQW+ +HG+ Y EK
Sbjct: 18 IVLLFTVFAVSSALDMSIISYDSAHADKAATLRTEEELMSMYEQWLVKHGKVYNALGEKE 77
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK NL +I+ N +RTYKLG N F+DLTNEE+RA Y G + R P
Sbjct: 78 KRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYLG--TKIDPNRRLGKTP 135
Query: 121 STFKYQNVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELS 179
S V D +P S+DWR++GAV +K+QG CGSCWAFSA+ AVEGI +I G+LI LS
Sbjct: 136 SNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLS 195
Query: 180 EQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
EQ+LVDC T N GC+GGLMD AFE+II N G+ ++ DYPY+ G CD ++ A +I
Sbjct: 196 EQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDEDYPYRGVDGRCDTYRKNAKVVSI 255
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YED+P DE AL +AV QPVSV +E G+ F+ Y GV CG DHGV VG+G
Sbjct: 256 DDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYG 315
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
TA+ D YW+++NSWG +WGE GYIR+ R+ G CGIA E SYP+
Sbjct: 316 TAKGHD---YWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 363
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial
[Cucumis sativus]
Length = 314
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 160/313 (51%), Positives = 212/313 (67%), Gaps = 20/313 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I +++++WM ++GR YK E R TI++ N++YI+ N N ++ L N F+DLTNE
Sbjct: 15 IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNFADLTNE 73
Query: 98 EFRASYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
EF+A+Y GY + S P T F+Y N+ ++PT++DWR++GAVT IKNQG CGSCW
Sbjct: 74 EFKATYLGY--------KTVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDC--STDNNGCSGGLMDKAFEYIIENKGLATE 214
AFSAVAAVEGI +I GKLI LSEQ+LVDC ++ N GC+GG M KAFE+ I+ GL TE
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEF-IKRTGLTTE 184
Query: 215 ADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
+YPYQ + C++QKEK +I YE +P DE +L AV QPVSV ++A G F+F
Sbjct: 185 IEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQF 244
Query: 275 YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---- 330
Y G+ + CG+ +HGVA+VG+G E YWL+KNSWG WGESGYIR+ RD
Sbjct: 245 YSGGIFSGNCGNQLNHGVAIVGYG---ETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDK 301
Query: 331 EGLCGIATEASYP 343
+G CGIA ASYP
Sbjct: 302 QGTCGIAMMASYP 314
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 162/348 (46%), Positives = 215/348 (61%), Gaps = 18/348 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPS------IVEKHEQWMAQHGRTYKDELEKAMRL 63
I+ +F + + ++S + H + ++ +EQW+ +HG+ Y EK R
Sbjct: 41 ILLLFTVFAVSSALDMSIISYDNAHAATSRSDEELMSMYEQWLVKHGKVYNALGEKEKRF 100
Query: 64 TIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF 123
IFK NL +I+ N + +RTYKLG N F+DLTNEE+RA Y G + R PS
Sbjct: 101 QIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEYRAKYLGTK--IDPNRRLGKTPSNR 158
Query: 124 KYQNVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
V D +P S+DWR++GAV +K+QG CGSCWAFSA+ AVEGI +I G+LI LSEQ+
Sbjct: 159 YAPRVGDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQE 218
Query: 183 LVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
LVDC T N GC+GGLMD AFE+II N G+ +E DYPY+ G CD ++ A +I Y
Sbjct: 219 LVDCDTGYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRGVDGRCDTYRKNAKVVSIDDY 278
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
ED+P DE AL +AV QPVSV +E G+ F+ Y GV CG DHGV VG+GTA
Sbjct: 279 EDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGTA- 337
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
+G YW+++NSWG +WGE GYIR+ R+ G CGIA E SYP+
Sbjct: 338 --NGHDYWIVRNSWGPSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 383
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 158/309 (51%), Positives = 205/309 (66%), Gaps = 13/309 (4%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
E W+ HG++Y E+ R IFK NL YI++ N +R +KLG N+F+DLTNEE+R+
Sbjct: 46 ESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEYRSK 105
Query: 103 YTGYNRP--VPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
YTG VS +S R +T +++ P S+DWRE GAV +K+QG CGSCWAFS
Sbjct: 106 YTGIKSKDLRKKVSAKSGRYATLSGESL---PESVDWRESGAVATVKDQGSCGSCWAFST 162
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
++AVEGI QI GKLI LSEQ+LVDC N GC+GGLMD AFE+II N G+ T+ DYPY
Sbjct: 163 ISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINNGGIDTDVDYPY 222
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
G CD+ ++ A TI YED+P DE AL +A QP+SV +EASG+ F+FY G+
Sbjct: 223 TGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYDSGI 282
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCG 335
+CG DHGV VVG+GT E+G YW+++NSWG WGE+GY+R+ R G+CG
Sbjct: 283 FTGKCGIALDHGVVVVGYGT---ENGKDYWIVRNSWGADWGENGYLRMERGISSKTGICG 339
Query: 336 IATEASYPV 344
IA E SYPV
Sbjct: 340 IAIEPSYPV 348
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 153/313 (48%), Positives = 213/313 (68%), Gaps = 11/313 (3%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
E+HE+WMAQ+G+ YKD EK R +FK N+++IE N G++ + L N+F+DL +EEF
Sbjct: 33 ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF 92
Query: 100 RASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH-CGSCWAF 158
+A + V +++ ++F+Y+NVT +P+++DWR++GAVT IK+QG+ CGSCWAF
Sbjct: 93 KALLNNVQKKASRV--ETATETSFRYENVTKIPSTMDWRKRGAVTPIKDQGYTCGSCWAF 150
Query: 159 SAVAAVEGITQITGGKLIELSEQQLVDC-STDNNGCSGGLMDKAFEYIIENKGLATEADY 217
+ VA VE + QIT G+L+ LSEQ+LVDC D+ GC GG ++ AFE+I G+ +EA Y
Sbjct: 151 ATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYY 210
Query: 218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKR 277
PY+ + +C +KE A I YE +P E ALL+AV QPVSV ++A AF+FY
Sbjct: 211 PYKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSS 270
Query: 278 GVLNAE-CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
G+ A CG + DH VAVVG+G + DG KYWL+KNSW WGE GY+RI RD +G
Sbjct: 271 GIFEARNCGTHLDHAVAVVGYG--KLRDGTKYWLVKNSWSTAWGEKGYMRIKRDIRAKKG 328
Query: 333 LCGIATEASYPVA 345
LCGIA+ ASYP+A
Sbjct: 329 LCGIASNASYPIA 341
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 171/341 (50%), Positives = 210/341 (61%), Gaps = 58/341 (17%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ M ++ IL ASQ S RS+HE S+ E+HE WMA++GR YKD EK R IFK N+
Sbjct: 10 VSMALLFILA-AWASQATS-RSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV 67
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
++ +TFKY+NVT
Sbjct: 68 -----------------------------------------------AQATTFKYENVTA 80
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
VP++IDWR+KGAVT IK+Q CGSCWAFSAVAA EGITQIT GKLI LSEQ+LVDC T
Sbjct: 81 VPSTIDWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGG 140
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
+N GCSGGL D AF +I + GLA+EA YPY+ + GTC+ +KE AA I YED+P +
Sbjct: 141 ENQGCSGGLXDDAFRFIXIH-GLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANN 199
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL +AV QPV+V ++A G F+FY GV +CG DHGVA VG+G +DG Y
Sbjct: 200 EKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIG--DDGMXY 257
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
WL+KNSWG WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 258 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 298
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 161/322 (50%), Positives = 204/322 (63%), Gaps = 14/322 (4%)
Query: 31 RSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLG 87
RS E I+ +E W+A+HGR Y EK R IFK N+ +I+ N G+R+++LG
Sbjct: 41 RSEEEMRIL--YEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLG 98
Query: 88 TNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N F+D+TNEE+RA Y G RP R ++Y D+P S+DWR KGAV +K
Sbjct: 99 LNRFADMTNEEYRAVYLG-TRPAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVK 157
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYII 206
+QG CGSCWAFS VAAVEGI +I G LI LSEQ+LVDC N GC+GGLMD FE+II
Sbjct: 158 DQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQGCNGGLMDYGFEFII 217
Query: 207 ENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVE 266
N G+ TE DYPY G CD+ ++ A +I YED+P DE AL +AV QPVSV +E
Sbjct: 218 NNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIE 277
Query: 267 ASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
A G+ F+ Y G+ CG + DHGV VG+GT E+G YW+++NSWG WGESGYIR
Sbjct: 278 AGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGT---ENGKDYWIVRNSWGGDWGESGYIR 334
Query: 327 ILRD----EGLCGIATEASYPV 344
+ R+ G CGIA E SYP
Sbjct: 335 MERNVNTSTGKCGIAIEPSYPT 356
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 161/345 (46%), Positives = 221/345 (64%), Gaps = 12/345 (3%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTI 65
KS ++ + V + V + + + + E S+ +E+W + H +D EK R +
Sbjct: 4 KSMLLALVVALAFVGVARTIPFNEKDLASEESLWGLYERWRSHH-TVSRDLSEKNKRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS-TFK 124
FK+N ++I + NK+ + YKLG N+F+D+TN+EFR++Y G R + R + +F
Sbjct: 63 FKENAKFIHEFNKK-DAPYKLGLNKFADMTNQEFRSTYAGSKIHHHRTQRGTPRATGSFM 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
Y+NV +P S+DWR +GAV +K+QG CGSCWAFS +A+VEGI +I +L+ LS QQLV
Sbjct: 122 YENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKTNQLVPLSGQQLV 181
Query: 185 DCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
DC TD N GC+GGLMD AFE+I N G+ +E+ YPY EQG+C + A TI YED
Sbjct: 182 DCDTDQNEGCNGGLMDYAFEFIKSNGGITSESAYPYTAEQGSCASE-SSAPVVTIDGYED 240
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P +E AL++AV Q VSV +EASG AF+FY GV CG+ DHGVAVVG+G
Sbjct: 241 VPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNELDHGVAVVGYGAT--R 298
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
DG KYW+++NSWG WGE GYIR+ R GLCGIA E SYP+
Sbjct: 299 DGTKYWIVRNSWGAEWGEKGYIRMQRGIRARHGLCGIAMEPSYPL 343
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 168/345 (48%), Positives = 226/345 (65%), Gaps = 24/345 (6%)
Query: 14 FVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+ ++ + C V+ R + E ++ +HE+WM +HGRTYKDE EKA R +FK
Sbjct: 18 LLTVLAIANCIGCAVAARDLSSSTGYGEEAMTARHEKWMVEHGRTYKDEAEKARRFQVFK 77
Query: 68 QNLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
N +++ +N G + Y L N F+D+T++EF A YTG+ +P+P+ + + FKY
Sbjct: 78 ANAAFVDTSNAAAGGKKYHLAINRFADMTHDEFMARYTGF-KPLPATGK---KMPGFKYA 133
Query: 127 NVT---DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
NVT + ++DWR+KGAVT +KNQ CG CWAFSAVAA+EG+ QI G+L+ LSEQQL
Sbjct: 134 NVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQINTGELVSLSEQQL 193
Query: 184 VDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
VDCST +NNGC GG M+ AF+Y+I N G+ATEA YPY QG C + A + Y
Sbjct: 194 VDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQGMCQNVQP---AVAVRSY 250
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTA 300
+ +P+ DE AL AV QPVSV V+A+ F+FYK GV+ A+ CG N +H V VG+GTA
Sbjct: 251 QQVPRDDEDALAAAVAGQPVSVAVDANN--FQFYKGGVMTADSCGTNLNHAVTAVGYGTA 308
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPVA 345
EDG YWL+KN WG TWGE GY+R+ R G CG+A +ASYPVA
Sbjct: 309 --EDGTPYWLLKNQWGSTWGEEGYLRLQRGVGACGVAKDASYPVA 351
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 158/325 (48%), Positives = 206/325 (63%), Gaps = 12/325 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTY 84
V G E + +E W+A+HGR EK R IFK N+ +I+ N G+R++
Sbjct: 36 VQGLERSEEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSF 95
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
+LG N F+D+TNEE+R Y G RP R ++Y ++P S+DWR+KGAVT
Sbjct: 96 RLGLNRFADMTNEEYRTVYLG-TRPASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVT 154
Query: 145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFE 203
+K+QG CGSCWAFS +AAVEGI +I G LI LSEQ+LVDC N GC+GGLMD AFE
Sbjct: 155 TVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQGCNGGLMDYAFE 214
Query: 204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSV 263
+II N G+ TE DYPY+ G CD+ ++ A +I YED+P DE AL +AV QPVSV
Sbjct: 215 FIINNGGIDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSV 274
Query: 264 CVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
+EA G+ F+ Y G+ CG + DHGV VG+GT E+G YW+++NSWG WGESG
Sbjct: 275 AIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGT---ENGKDYWIVRNSWGGDWGESG 331
Query: 324 YIRILRD----EGLCGIATEASYPV 344
YIR+ R+ G CGIA E+SYP
Sbjct: 332 YIRMERNVNASTGKCGIAMESSYPT 356
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 165/321 (51%), Positives = 210/321 (65%), Gaps = 20/321 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E++MA++ + Y EK R +FK NL +I++ NK+ Y LG NEF+DLT++
Sbjct: 48 LMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKK-ITGYWLGLNEFADLTHD 106
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNV--TDVPTSIDWREKGAVTHIKNQGHCGSC 155
EF+A+Y G + +R++S F+Y+ V +P +DWR+KGAVT +KNQG CGSC
Sbjct: 107 EFKAAYLGL---TLTPARRNSNDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQCGSC 163
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATE 214
WAFS VAAVEGI I G L LSEQ+L+DC TD NNGCSGGLMD AF YI N GL TE
Sbjct: 164 WAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYAFSYIAANGGLHTE 223
Query: 215 ADYPYQQEQGTC-------DKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
YPY E+GTC D E AAA TI YED+P+ +E ALL+A+ QPVSV +EA
Sbjct: 224 ESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALAHQPVSVAIEA 283
Query: 268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
SG+ F+FY GV + CG DHGV VG+GTA + G Y ++KNSWG WGE GYIR+
Sbjct: 284 SGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASK--GHDYIIVKNSWGSHWGEKGYIRM 341
Query: 328 LR----DEGLCGIATEASYPV 344
R +GLCGI ASYP
Sbjct: 342 RRGTGKHDGLCGINKMASYPT 362
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 156/289 (53%), Positives = 201/289 (69%), Gaps = 10/289 (3%)
Query: 62 RLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN-RPVPSVSRQSSRP 120
R +FK+N YI + NK+ +R ++L N+F+D+T +EFR +Y G R S+S
Sbjct: 62 RFNVFKENARYIHEGNKK-DRPFRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRGD 120
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+F+Y + ++P ++DWR+KGAVT IK+QG CGSCWAFS + AVEGI +I GKL+ LSE
Sbjct: 121 GSFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSE 180
Query: 181 QQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
Q+L+DC NN GC GGLMD AF++I +N G+ TE++YPYQ EQG+CD KEKA A TI
Sbjct: 181 QELMDCDNVNNQGCDGGLMDYAFQFIHKN-GITTESNYPYQGEQGSCDLAKEKAHAVTID 239
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
YED+P DE AL +AV QPVSV ++ASG F+FY GV EC + DHGVA VG+GT
Sbjct: 240 GYEDVPANDESALQKAVAGQPVSVAIDASGNDFQFYSEGVFTGECSTDLDHGVAAVGYGT 299
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
DG KYW++KNSWGE WGE GYIR+ R EG CGIA +ASYP
Sbjct: 300 T--RDGTKYWIVKNSWGEDWGEKGYIRMQRGVSQAEGQCGIAMQASYPT 346
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 164/318 (51%), Positives = 216/318 (67%), Gaps = 13/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ ++V +HE+WMA+HGRTY +E EKA RL +F+ N + I+ N + T++L TN F+DL
Sbjct: 37 DSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADL 96
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN--VTDVPTSIDWREKGAVTHIKNQGHC 152
T+EEFRA+ TG RP + + S F+Y+N + D S+DWR GAVT +K+QG C
Sbjct: 97 TDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSC 156
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKG 210
G CWAFSAVAAVEG+T+I G+L+ LSEQQLVDC D+ GC+GGLMD AFEY+I G
Sbjct: 157 GCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGG 216
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
L TE+ YPY+ G+C + A+AA+I YED+P +E AL+ AV QPVSV +
Sbjct: 217 LTTESSYPYRGTDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDS 273
Query: 271 AFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI-- 327
FRFY GVL CG +H + VG+GTA DG KYW++KNSWG +WGE GY+RI
Sbjct: 274 VFRFYDSGVLGGSGCGTELNHAITAVGYGTA--SDGTKYWIMKNSWGGSWGEGGYVRIRR 331
Query: 328 -LRDEGLCGIATEASYPV 344
+R EG+CG+A ASYPV
Sbjct: 332 GVRGEGVCGLAQLASYPV 349
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 162/329 (49%), Positives = 225/329 (68%), Gaps = 12/329 (3%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
S+ S ++HEP+I H++WM R Y DE EK MRL +F +NL++IE N G+++Y
Sbjct: 21 SEATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSY 80
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ-NVTDV-PTSIDWREKGA 142
KLG N+F+D T EEF A++TG + + + +T + V+DV T+ DWR +GA
Sbjct: 81 KLGVNKFTDWTKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGA 140
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKA 201
VT +K QG CG CWAFSA+AAVEG+T+I G LI LSEQQL+DC+ + NNGC GG M +A
Sbjct: 141 VTPVKYQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEA 200
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F YI++N G+++E YPYQ ++G C + A I +E++P +E ALL+AV++QPV
Sbjct: 201 FNYIVKNGGVSSENAYPYQVKEGPC--RSNDIPAIVIRGFENVPSNNERALLEAVSRQPV 258
Query: 262 SVCVEASGQAFRFYKRGVLNA-ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
+V ++AS F Y GV NA +CG + +H V +VG+GT++E G KYWL KNSWG+TWG
Sbjct: 259 AVDIDASETGFIHYSGGVYNARDCGTSVNHAVTLVGYGTSQE--GIKYWLAKNSWGKTWG 316
Query: 321 ESGYIRILRD----EGLCGIATEASYPVA 345
E+GYIRI RD +G+CG+A ASYPVA
Sbjct: 317 ENGYIRIRRDVEWPQGMCGVAQYASYPVA 345
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 337
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 162/339 (47%), Positives = 216/339 (63%), Gaps = 20/339 (5%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ +F+++ + I SQV+S R +HE S+ E+HE W+A++G+ YK EK IFK+N+
Sbjct: 11 LALFLLLSIEI---SQVMS-RKLHETSLREEHENWIARYGQVYKVAAEKET-FQIFKENV 65
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+IE N N+ YKLG N F+DLT EEF+ G + + FKY+NVTD
Sbjct: 66 EFIESFNAAANKPYKLGVNLFADLTLEEFKDFRFGLKKT------HEFSITPFKYENVTD 119
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P ++DWREKGAVT IK+QG CGSCWAFS VAA EGI QIT G L+ L EQ+LV C T
Sbjct: 120 IPEALDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKG 179
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
+ GC GG M+ FE+II+N G+ T+A+YPY+ GTC+ + A I YE +P
Sbjct: 180 VDQGCEGGYMEDGFEFIIKNGGITTKANYPYKGVNGTCNTTIAASTVAQIKGYETVPSYS 239
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL +AV QPVSV ++A+ F FY G+ ECG + DHGV VG+GT E D Y
Sbjct: 240 EEALQKAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNETD---Y 296
Query: 309 WLIKNSWGETWGESGYIRILR----DEGLCGIATEASYP 343
W++KNSWG W E G+IR+ R GLCG+A ++SYP
Sbjct: 297 WIVKNSWGTGWDEKGFIRMQRGITVKHGLCGVALDSSYP 335
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 157/341 (46%), Positives = 216/341 (63%), Gaps = 30/341 (8%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ I+ L C + + + + ++V +HEQWM Q+ R YKD EKA R +FK N+++
Sbjct: 8 ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN-RPVPSVSRQSSRPSTFKYQNVT-- 129
IE N GNR + LG N+F+DLTN+EFRA+ T +P P P+ F+Y+NV+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSP-----VKVPTGFRYENVSVD 122
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P +IDWR KGAVT IK+QG C EGI +I+ GKLI LSEQ+LVDC
Sbjct: 123 ALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVH 170
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
++ GC GGLMD AF++II+N GL TE+ YPY G C + +AAT+ +ED+P
Sbjct: 171 GEDQGCEGGLMDDAFQFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATVKGFEDVPAN 228
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
DE AL++AV QPVSV V+ F+FY GV+ CG + DHG+A +G+G + DG K
Sbjct: 229 DEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGTK 286
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YWL+KNSWG TWGE+GY+R+ +D G+CG+A E SYP+
Sbjct: 287 YWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPI 327
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 161/322 (50%), Positives = 217/322 (67%), Gaps = 18/322 (5%)
Query: 35 EPSIVEKHEQWMAQHG---RTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEF 91
E S+ +E W + H R E E A R +FK+N+ YI +ANK+ +R ++L N+F
Sbjct: 33 EESLRGLYETWRSHHTVSRRGLGAEAE-ARRFNVFKENVRYIHEANKK-DRPFRLALNKF 90
Query: 92 SDLTNEEFRASYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
+D+T +EFR +Y G ++R + RQ +F Y + ++P ++DWR+KGAVT IK
Sbjct: 91 ADMTTDEFRRTYAGSRVRHHRSLSGGRRQGG--GSFMYADAENLPAAVDWRQKGAVTPIK 148
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYII 206
+QG CGSCWAFS + AVEGI +I G+L+ LSEQ+L+DC+ +N+GC+GGLMD AF++I
Sbjct: 149 DQGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQFIQ 208
Query: 207 ENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVE 266
+N G+ TEA YPYQ EQ +CD+ KE + +I YED+P DE AL +AV QPVSV ++
Sbjct: 209 QNGGITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAID 268
Query: 267 ASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
ASG F+FY GV + G + DHGVA VG+GT DG KYW++KNSWGE WGE GYIR
Sbjct: 269 ASGNDFQFYSEGVFTTDGGTDLDHGVAAVGYGTT--RDGTKYWIVKNSWGEDWGEKGYIR 326
Query: 327 ILRD----EGLCGIATEASYPV 344
+ R EGLCGIA EASYP
Sbjct: 327 MQRGVKQAEGLCGIAMEASYPT 348
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 156/345 (45%), Positives = 219/345 (63%), Gaps = 16/345 (4%)
Query: 10 IIPMFVIIILV---ITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIF 66
I+P F+ L+ + Q+ +GRS E ++ +E+W+ +H + Y EK R IF
Sbjct: 6 ILPFFLFFSLITFSLALDIQLPTGRSNDE--VMTMYEEWLVKHQKVYNGLREKDQRFQIF 63
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS-VSRQSSRPSTFKY 125
K NL +I++ N + N TY +G N+F+D+TNEE+R Y G + + + + Y
Sbjct: 64 KDNLNFIDEHNAQ-NYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRYAY 122
Query: 126 QNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
+ +P +DWR KGA+THIK+QG CGSCWAFS +A VE I +I GKL+ LSEQ+LVD
Sbjct: 123 NSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVD 182
Query: 186 CSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
C N GC+GGLMD AFE+II N G+ T+ YPY+ +G CD ++KA +I YED+
Sbjct: 183 CDRAFNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDGYEDV 242
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
P +E+AL +AV QPVSV +EASG+A + Y+ GV +CG + DH V +VG+G+ E+
Sbjct: 243 PSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYGS---EN 299
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
G YWL++NSWG WGE GY ++ R+ G CGIA EASYPV
Sbjct: 300 GLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPV 344
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 154/317 (48%), Positives = 214/317 (67%), Gaps = 13/317 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ ++++WMAQ+ R YKD+ EKA R +FK N E+I+++N G + Y LGTN+F+DL
Sbjct: 52 EAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADL 111
Query: 95 TNEEFRASYTGYNRP--VPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQG 150
T++EF A YTG +P VPS ++Q + KYQN T D +DWR++GAVT +KNQG
Sbjct: 112 TSKEFAAMYTGLRKPAAVPSGAKQIPAAGS-KYQNFTRLDDDVQVDWRQQGAVTPVKNQG 170
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC--STDNNGCSGGLMDKAFEYIIEN 208
CG CWAFSAV A+EG+ IT G L+ LSEQQ++DC S N GC+GG MD AF+Y+I N
Sbjct: 171 QCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVINN 230
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
G+ TE YPY QGTC + AATI ++DLP GDE+AL AV QPVSV V+
Sbjct: 231 GGVTTEDAYPYSAVQGTCQNVQP---AATISGFQDLPSGDENALANAVANQPVSVGVDGG 287
Query: 269 GQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
F+FY+ G+ + + CG + +H V +G+G ++ G +YW++KNSWG WGE+G++++
Sbjct: 288 SSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGA--DDQGTQYWILKNSWGTGWGENGFMQL 345
Query: 328 LRDEGLCGIATEASYPV 344
G CGI+T ASYP
Sbjct: 346 QMGVGACGISTMASYPT 362
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 157/312 (50%), Positives = 208/312 (66%), Gaps = 33/312 (10%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ + E W+++HG+ YK EK R +F++NL +I++ NKE + +Y LG NEF+DL++E
Sbjct: 45 LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHE 103
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF++ ++V D+P S+DWR+KGAVTH+KNQG CGSCWA
Sbjct: 104 EFKS------------------------KDVADLPESVDWRKKGAVTHVKNQGACGSCWA 139
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF +I N GL E D
Sbjct: 140 FSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDD 199
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTC++QKE TI YED+P+ DE +LL+A+ QP+SV +EASG+ F+FY
Sbjct: 200 YPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYS 259
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV N CG DHGVA VG+G+++ G Y ++KNSWG WGE GYIR+ R+ EG
Sbjct: 260 GGVFNGPCGTELDHGVAAVGYGSSK---GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEG 316
Query: 333 LCGIATEASYPV 344
LCGI ASYP
Sbjct: 317 LCGINKMASYPT 328
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 163/322 (50%), Positives = 205/322 (63%), Gaps = 15/322 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ + +E+W H R + EK R FK N+ +I NK G+R Y+L N F D+
Sbjct: 39 EEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGDM 97
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPST--FKYQ--NVTDVPTSIDWREKGAVTHIKNQG 150
+ EFRA++ G ++ PS F Y NV+D+P S+DWR+KGAVT +KNQG
Sbjct: 98 SQAEFRATFAGSRVSDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQG 157
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENK 209
CGSCWAFS V +VEGI I GKL+ LSEQ+L+DC T DN+GC GGLMD AFEYI +N
Sbjct: 158 KCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIKKNG 217
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAAT---IGKYEDLPKGDEHALLQAVTKQPVSVCVE 266
GL TEA YPY+ GTC K ++ I ++D+P E AL +AV QPVSV ++
Sbjct: 218 GLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGID 277
Query: 267 ASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
ASG+AF FY GV ECG DHGVAVVG+G A EDG YW +KNSWG +WGE GYIR
Sbjct: 278 ASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVA--EDGKAYWTVKNSWGPSWGEKGYIR 335
Query: 327 ILRDE----GLCGIATEASYPV 344
+ +D GLCGIA EASY V
Sbjct: 336 VEKDSGAEGGLCGIAMEASYAV 357
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 157/350 (44%), Positives = 221/350 (63%), Gaps = 13/350 (3%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLT 64
+K +I +F ++IL C E + +++W + H + E+ R
Sbjct: 1 MKKLLLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHS-VPRSLNEREKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN---RPVPSVSRQSSRPS 121
+F+ N+ ++ NK+ NR+YKL N+F+DLT EF+ +YTG N + ++ S+
Sbjct: 60 VFRHNVMHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQF 118
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
+ ++N++ +P+S+DWR+KGAVT IKNQG CGSCWAFS VAAVEGI +I KL+ LSEQ
Sbjct: 119 MYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQ 178
Query: 182 QLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
+LVDC T N GC+GGLM+ AFE+I +N G+ TE YPY+ G CD K+ TI
Sbjct: 179 ELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDG 238
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
+ED+P+ DE+ALL+AV QPVSV ++A F+FY GV CG +HGVA VG+G+
Sbjct: 239 HEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGS- 297
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVAM 346
E G KYW+++NSWG WGE GYI+I R+ EG CGIA EASYP+ +
Sbjct: 298 --ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIKL 345
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 160/336 (47%), Positives = 212/336 (63%), Gaps = 14/336 (4%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
+ L +VS E + + +WMA+HG TY E+ R F+ NL YI++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 77 N---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N G +++LG N F+DLTNEE+R++Y G R P R+ S + ++ + ++P
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLG-ARTKPDRERKLS--ARYQAADNDELPE 134
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNG 192
S+DWR+KGAV +K+QG CGSCWAFSA+AAVEGI QI G +I LSEQ+LVDC T N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C+GGLMD AFE+II N G+ +E DYPY++ CD K+ A TI YED+P E +L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+AV QP+SV +EA G+AF+ YK G+ CG DHGVA VG+GT E+G YWL++
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVR 311
Query: 313 NSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
NSWG WGE GYIR+ R+ G CGIA E SYP
Sbjct: 312 NSWGSVWGEDGYIRMERNIKASSGKCGIAVEPSYPT 347
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 155/312 (49%), Positives = 206/312 (66%), Gaps = 12/312 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
++ W AQH R+Y E RL IF+ NL +I++ N G +++LG F+DLTNEE
Sbjct: 47 YQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRFADLTNEE 106
Query: 99 FRASYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
+R++Y G R S+ S +++++ D+P SIDWR+KGAV +K+QG CGSCWA
Sbjct: 107 YRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQGSCGSCWA 166
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS +AAVEGI I G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ T+ D
Sbjct: 167 FSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISNGGIDTDED 226
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY G+CD+ ++ A TI YED+P DE +L +AV QPVSV +EA G+AF+ Y+
Sbjct: 227 YPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAGGRAFQLYE 286
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
G+ CG DHGV +G+G+ E+G YW++KNSWG WGESGYIR+ R+ G
Sbjct: 287 SGIFTGYCGTELDHGVTAIGYGS---ENGKYYWIVKNSWGSDWGESGYIRMERNINSATG 343
Query: 333 LCGIATEASYPV 344
CGIA EASYP+
Sbjct: 344 KCGIAMEASYPI 355
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 161/320 (50%), Positives = 214/320 (66%), Gaps = 16/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYK----DELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNE 90
E S+ +E+W + + + + D E+ R +FK+N Y+ + NK +R ++L N+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEER--RFNVFKENARYVHEGNKR-DRPFRLALNK 90
Query: 91 FSDLTNEEFRASYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQ 149
F+D+T +EFR +Y G R S+S F+Y + ++P ++DWR+KGAVT IK+Q
Sbjct: 91 FADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQ 150
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIEN 208
G CGSCWAFS + AVEGI +I GKL+ LSEQ+L+DC NN GC GGLMD AF++I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCEGGLMDYAFQFIQKN 210
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
G+ TE++YPYQ EQG+CD+ KE A A TI YED+P DE AL +AV QPVSV ++AS
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269
Query: 269 GQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
GQ F+FY GV EC + DHGVA VG+G DG KYW++KNSWGE WGE GYIR+
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGAT--RDGTKYWIVKNSWGEDWGEKGYIRMQ 327
Query: 329 R----DEGLCGIATEASYPV 344
R EGLCGIA +ASYP
Sbjct: 328 RGVSQTEGLCGIAMQASYPT 347
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 317 bits (813), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 155/313 (49%), Positives = 203/313 (64%), Gaps = 20/313 (6%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+++W+ +HG+ Y E R IFK+N+ YI N N ++ LG N+F+DLTN EFR
Sbjct: 38 YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEFRG 97
Query: 102 SYTGYNRPVPSVSRQSSRPSTFK----YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
Y G + RP+ F V D TS+DWR+KG VT IK+QG CGSCWA
Sbjct: 98 LYVG----------RLQRPAPFHEVGDIALVADTATSVDWRKKGGVTEIKDQGDCGSCWA 147
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FSAVAAVEG+T ++ G L+ LSEQ+LVDC T N GC GG+MD AF+Y+I N G+ ++++
Sbjct: 148 FSAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRNGGITSQSN 207
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY+ +G CDK K K AATI ++ +P E LL+AV QPVSV +EA GQ F+ Y
Sbjct: 208 YPYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYS 267
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGL 333
GV ECG N DHGVA+VG+GT + G +YWL+KNSWG WGESGY+R+ R G+
Sbjct: 268 SGVFTGECGSNLDHGVAIVGYGT--DAGGRQYWLVKNSWGSGWGESGYVRMERQGPGAGV 325
Query: 334 CGIATEASYPVAM 346
CGI +ASYP +
Sbjct: 326 CGINLDASYPTKI 338
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 158/326 (48%), Positives = 211/326 (64%), Gaps = 14/326 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
+VS E + + +WM++H RTY E+ R +F+ NL YI++ N G +
Sbjct: 26 IVSYGERSEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHS 85
Query: 84 YKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV 143
++LG N F+DLTNEE+R++Y G R P R+ S + ++ + ++P ++DWR+KGAV
Sbjct: 86 FRLGLNRFADLTNEEYRSTYLG-ARTKPDRERKLS--ARYQADDNEELPETVDWRKKGAV 142
Query: 144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAF 202
IK+QG CGSCWAFSA+AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD AF
Sbjct: 143 AAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMDYAF 202
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
E+II N G+ +E DYPY++ CD K+ A TI YED+P E +L +AV QP+S
Sbjct: 203 EFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPIS 262
Query: 263 VCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
V +EA G+AF+ YK G+ CG DHGVA VG+GT E+G YWL++NSWG WGE
Sbjct: 263 VAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVRNSWGTVWGED 319
Query: 323 GYIRILRD----EGLCGIATEASYPV 344
GYIR+ R+ G CGIA E SYP
Sbjct: 320 GYIRMERNIKASSGKCGIAVEPSYPT 345
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 156/324 (48%), Positives = 212/324 (65%), Gaps = 16/324 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTY----KDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNE 90
E S+ +E+W + + R D+ ++A R +FK+N Y+ +AN++ R ++L N+
Sbjct: 34 EESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALNK 93
Query: 91 FSDLTNEEFRASYTG----YNRPVPSVSRQSSRPSTFKY-QNVTDVPTSIDWREKGAVTH 145
F+D+T +EFR +Y G ++R +R + + T++P ++DWR +GAVT
Sbjct: 94 FADMTTDEFRRTYAGSRTRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAVTG 153
Query: 146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEY 204
+K+QG CGSCWAFSA+AAVEG+ +I GKL+ LSEQ+LVDC DN GC GGLMD AF+Y
Sbjct: 154 VKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQY 213
Query: 205 IIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVC 264
I N G+ TE++YPY EQ +C+K KE++ TI YED+P +E AL +AV QPV+V
Sbjct: 214 IQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVA 273
Query: 265 VEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
+EASGQ F+FY GV CG + DHGVA VG+GT DG KYW +KNSWGE WGE GY
Sbjct: 274 IEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTT--GDGTKYWTVKNSWGEDWGERGY 331
Query: 325 IRILR----DEGLCGIATEASYPV 344
IR+ R GLCGIA E SYP
Sbjct: 332 IRMQRGVPDSRGLCGIAMEPSYPT 355
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 155/311 (49%), Positives = 212/311 (68%), Gaps = 12/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E+W++ HG+ Y+ EK R +FK NL++I++ NK+ +Y LG NEF+DLT++
Sbjct: 41 LIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFADLTHQ 99
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G + S +RQS P F Y++V D+P S+DWR+KGAVT +KNQG CGSCWA
Sbjct: 100 EFKNMYLGL-KVESSRTRQS--PEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWA 156
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI +I GG L LSEQ+L+DC NNGC GGLMD AF +I+ + GL E D
Sbjct: 157 FSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEED 216
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY + + TCD +K + TI Y+D+P+ +E +L++A+ QP+SV +EASG+ F+FY
Sbjct: 217 YPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYS 276
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----G 332
GV + CG DHGV VG+G+++ G Y ++KNSWG WGE GYIR+ R+ G
Sbjct: 277 GGVFDGPCGTQLDHGVTAVGYGSSK---GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAG 333
Query: 333 LCGIATEASYP 343
LCGI ASYP
Sbjct: 334 LCGINKMASYP 344
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 161/343 (46%), Positives = 219/343 (63%), Gaps = 15/343 (4%)
Query: 13 MFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
M ++LV+ ++ +M ++ +H++WMA+HGRTYKD EKA R +FK N+
Sbjct: 11 MAASLLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANV 70
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ I+++N GN+ Y+L TN F+DLT+ EF A YTGYN P+ + ++ +T + + D
Sbjct: 71 DLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYN---PANTMYAAANATTRLSSEDD 127
Query: 131 -VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
P +DWR++GAVT +KNQ CG CWAFS VAAVEGI QIT G+L+ LSEQQL+DC+ D
Sbjct: 128 QQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCA-D 186
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD---KQKEKAAAATIGKYEDLPK 246
N GC+GG +D AF+Y+ + G+ TEA Y YQ QG C AATI Y+ +
Sbjct: 187 NGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 246
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGT-AEEED 304
DE +L AV QPVSV +E SG FR Y GV A+ CG DH VAVVG+G A+
Sbjct: 247 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 306
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
G YW+IKNSWG TWG+ GY+++ +D +G CG+A SYPV
Sbjct: 307 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 349
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 153/310 (49%), Positives = 209/310 (67%), Gaps = 15/310 (4%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
++++W+ Q+GR Y + E +R I+ N+++IE N + N ++KL N+F+DLTN+EF
Sbjct: 45 RYDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQ-NLSFKLTDNKFADLTNDEFN 103
Query: 101 ASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
+ Y GY + R + ++N TD+P ++DWRE GAVT IK+QG CGSCWAFSA
Sbjct: 104 SIYLGY-----QIRSYKRRNLSHMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSA 158
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYP 218
VAAVEGI +I G L+ LSEQ+LVDC DN GC+GG M+KAF +I GL TE DYP
Sbjct: 159 VAAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYP 218
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRG 278
Y+ G+C+K K A IG YE +P +E++L AV+KQPVSV ++ASG F+ Y G
Sbjct: 219 YKGTDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEG 278
Query: 279 VLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLC 334
V + CG +HGV +VG+G + +G KYWL+KNSWG+ WGESGYIR+ RD +G+C
Sbjct: 279 VFSGYCGIQLNHGVTIVGYG---DNNGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMC 335
Query: 335 GIATEASYPV 344
GIA E SYP+
Sbjct: 336 GIAMEPSYPI 345
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 158/328 (48%), Positives = 206/328 (62%), Gaps = 16/328 (4%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNR 82
+VS E + +WMA HGRTY E+ R +F+ NL Y++ N G
Sbjct: 30 SIVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVH 89
Query: 83 TYKLGTNEFSDLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
+++LG N F+DLTN+E+RA+Y G +RP R+ + + D+P S+DWR KG
Sbjct: 90 SFRLGLNRFADLTNDEYRATYLGVRSRP----QRERRLGDRYLAGDNEDLPESVDWRAKG 145
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDK 200
AV IK+QG CGSCWAFS +AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD
Sbjct: 146 AVAEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDY 205
Query: 201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQP 260
AFE+II N G+ TE DYPY+ G CD ++ A TI YED+P E +L +AV QP
Sbjct: 206 AFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQP 265
Query: 261 VSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
+SV +EA G+AF+ Y G+ CG DHGV VG+GT E+G YW++KNSWG +WG
Sbjct: 266 ISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWG 322
Query: 321 ESGYIRILRD----EGLCGIATEASYPV 344
ESGY+R+ R+ G CGIA E SYP+
Sbjct: 323 ESGYVRMERNIKASSGKCGIAVEPSYPL 350
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 164/321 (51%), Positives = 203/321 (63%), Gaps = 17/321 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ + +E+W + H R + EK R FK N +I NK G+ Y+L N F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 95 TNEEFRASYTG-YNRPVPSVSRQSSRPSTFKYQ--NVTDVPTSIDWREKGAVTHIKNQGH 151
EFRA++ G R PS + S P F Y NV+D+P S+DWR+KGAVT +K+QG
Sbjct: 98 DQAEFRATFVGDLRRDTPS--KPPSVPG-FMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENKG 210
CGSCWAFS V +VEGI I G L+ LSEQ+L+DC T DN+GC GGLMD AFEYI N G
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 211 LATEADYPYQQEQGTCD---KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
L TEA YPY+ +GTC+ + I ++D+P E L +AV QPVSV VEA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
SG+AF FY GV ECG DHGVAVVG+G A EDG YW +KNSWG +WGE GYIR+
Sbjct: 275 SGKAFMFYSEGVFTGECGTELDHGVAVVGYGVA--EDGKAYWTVKNSWGPSWGEQGYIRV 332
Query: 328 LRDE----GLCGIATEASYPV 344
+D GLCGIA EASYPV
Sbjct: 333 EKDSGASGGLCGIAMEASYPV 353
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 317 bits (811), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 154/293 (52%), Positives = 202/293 (68%), Gaps = 9/293 (3%)
Query: 58 EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN-RPVPSVSRQ 116
+ A R +FK+N++YI +ANK+ +R ++L N+F+D+T +E R SY G R ++S
Sbjct: 64 DPARRFNVFKENVKYIHEANKK-DRPFRLALNKFADMTTDELRHSYAGSRVRHHRALSGG 122
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
F Y + ++P ++DWREKGAVT IK+QG CGSCWAFS +AAVE I +I GKL+
Sbjct: 123 RRAQGNFTYSDAENLPPAVDWREKGAVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLV 182
Query: 177 ELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAA 235
LSEQ+L+DC N+ GC GGLMD AF++I +N G+ +EA+YPYQ +Q TCD+ KE
Sbjct: 183 SLSEQELMDCDNVNDQGCDGGLMDYAFQFIQKNGGVTSEANYPYQGQQNTCDQAKENTHD 242
Query: 236 ATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVV 295
I YED+P DE AL +AV QPVSV +EASGQ F+FY GV +C + DHGVA V
Sbjct: 243 VAIDGYEDVPANDESALQKAVAYQPVSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAV 302
Query: 296 GFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
G+GTA DG KYW++KNSWG WGE GYIR+ R EGLCGIA +ASYP+
Sbjct: 303 GYGTA--RDGTKYWIVKNSWGLDWGEKGYIRMQRGVSQAEGLCGIAMQASYPI 353
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 317 bits (811), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 155/311 (49%), Positives = 212/311 (68%), Gaps = 12/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E+W++ HG+ Y+ EK R +FK NL++I++ NK+ +Y LG NEF+DLT++
Sbjct: 44 LIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFADLTHQ 102
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G + S +RQS P F Y++V D+P S+DWR+KGAVT +KNQG CGSCWA
Sbjct: 103 EFKNMYLGL-KVESSRTRQS--PEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWA 159
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI +I GG L LSEQ+L+DC NNGC GGLMD AF +I+ + GL E D
Sbjct: 160 FSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEED 219
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY + + TCD +K + TI Y+D+P+ +E +L++A+ QP+SV +EASG+ F+FY
Sbjct: 220 YPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYS 279
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----G 332
GV + CG DHGV VG+G+++ G Y ++KNSWG WGE GYIR+ R+ G
Sbjct: 280 GGVFDGPCGTQLDHGVTAVGYGSSK---GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAG 336
Query: 333 LCGIATEASYP 343
LCGI ASYP
Sbjct: 337 LCGINKMASYP 347
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 155/320 (48%), Positives = 208/320 (65%), Gaps = 19/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ + +E W+ +HG+ Y EK R IFK NL +I++ N +R+YK+G N F+DL
Sbjct: 44 DSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSV-DRSYKVGLNRFADL 102
Query: 95 TNEEFRASYTGYNRPVPSVSRQS----SRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQG 150
TNEE++A + G + R++ +R + +++ D+P ++DWREKGAV +K+QG
Sbjct: 103 TNEEYKAMFLG-----TKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQG 157
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS V AVEGI QI G+LI LSEQ+LVDC N GC+GGLMD AFE+II N
Sbjct: 158 QCGSCWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNG 217
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
G+ TE DYPY+ CD ++ A TI YED+P+ DE++L +AV QPVSV +EA G
Sbjct: 218 GIDTEEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGG 277
Query: 270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
+AF+ YK GV CG DHGV VG+GT E+G YW+++NSWG WGESGYIR+ R
Sbjct: 278 RAFQLYKSGVFTGRCGTELDHGVVAVGYGT---ENGVNYWIVRNSWGSAWGESGYIRMER 334
Query: 330 D-----EGLCGIATEASYPV 344
+ G CGIA + SYP
Sbjct: 335 NVANTKTGKCGIAIQPSYPT 354
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 163/318 (51%), Positives = 215/318 (67%), Gaps = 13/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ ++V +HE+WMA+HGRTY +E EKA RL +F+ N + I+ N + T++L TN F+DL
Sbjct: 37 DAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADL 96
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN--VTDVPTSIDWREKGAVTHIKNQGHC 152
T+EEFRA+ TG RP + + S F+Y+N + D S+DWR GAVT +K+QG C
Sbjct: 97 TDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSC 156
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKG 210
G CWAFSAVAAVEG+T+I G+L+ LSEQQLVDC D+ GC+GGLMD AFEY+I G
Sbjct: 157 GCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGG 216
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
L TE+ YPY+ G+C + A+AA+I YED+P +E AL+ AV QPVSV +
Sbjct: 217 LTTESSYPYRGTDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDS 273
Query: 271 AFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI-- 327
FRFY GVL CG +H + G+GTA DG KYW++KNSWG +WGE GY+RI
Sbjct: 274 VFRFYDSGVLGGSGCGTELNHAITAAGYGTA--SDGTKYWIMKNSWGGSWGEGGYVRIRR 331
Query: 328 -LRDEGLCGIATEASYPV 344
+R EG+CG+A ASYPV
Sbjct: 332 GVRGEGVCGLAQLASYPV 349
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 161/343 (46%), Positives = 219/343 (63%), Gaps = 15/343 (4%)
Query: 13 MFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
M ++LV+ ++ +M ++ +H++WMA+HGRTYKD EKA R +FK N+
Sbjct: 1 MAASLLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANV 60
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ I+++N GN+ Y+L TN F+DLT+ EF A YTGYN P+ + ++ +T + + D
Sbjct: 61 DLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYN---PANTMYAAANATTRLSSEDD 117
Query: 131 -VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
P +DWR++GAVT +KNQ CG CWAFS VAAVEGI QIT G+L+ LSEQQL+DC+ D
Sbjct: 118 QQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCA-D 176
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD---KQKEKAAAATIGKYEDLPK 246
N GC+GG +D AF+Y+ + G+ TEA Y YQ QG C AATI Y+ +
Sbjct: 177 NGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 236
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGT-AEEED 304
DE +L AV QPVSV +E SG FR Y GV A+ CG DH VAVVG+G A+
Sbjct: 237 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 296
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
G YW+IKNSWG TWG+ GY+++ +D +G CG+A SYPV
Sbjct: 297 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 339
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 316 bits (810), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 163/319 (51%), Positives = 209/319 (65%), Gaps = 26/319 (8%)
Query: 43 EQWMAQHGRTYKDEL--------EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ WM QHG++Y D EKA R IFK NL +I N E N+ Y LG N F+DL
Sbjct: 58 DSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGYFLGLNAFADL 116
Query: 95 TNEEFRASYTG--YNRPVPSVSRQSSRPSTFKYQNV--TDVPTSIDWREKGAVTHIKNQG 150
TNEEFRA G ++R SR+ + F+Y +V D+P SIDWREKGAV +K+QG
Sbjct: 117 TNEEFRAQRHGGRFDR-----SRERTSHEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQG 171
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENK 209
CGSCWAFSAVAA+EG+ ++ G+L+ LSEQ+LVDC ++ GC+GGLMD AF ++I+N
Sbjct: 172 SCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKNG 231
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
GL TEADYPY+ CD+ K A TI YED+P DE ALL+AV QPVSV ++A G
Sbjct: 232 GLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDAGG 291
Query: 270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
+ +FY+ G+ CG + DHGV VG+G +EDG YW+IKNSWG WGE GY+++ R
Sbjct: 292 SSMQFYRSGIFTGRCGTDLDHGVTNVGYG---KEDGKAYWIIKNSWGSNWGEKGYVKMAR 348
Query: 330 D----EGLCGIATEASYPV 344
+ GLCGI EASYP
Sbjct: 349 NTGLAAGLCGINMEASYPT 367
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 316 bits (810), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 159/318 (50%), Positives = 210/318 (66%), Gaps = 13/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ ++ WMA+HG+ Y EK R IFK NL++I++ N + NRTYK+G N F+DL
Sbjct: 39 EEEVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQ-NRTYKVGLNRFADL 97
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKNQGHC 152
TNEE+RA Y G R P + ++ +Y + +P S+DWRE GAV +K+Q C
Sbjct: 98 TNEEYRAIYLG-TRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSC 156
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGL 211
GSCWAFS VAAVEGI QI G+LI LSEQ+LVDC T+ + GC+GGLMD AF++II+N GL
Sbjct: 157 GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGL 216
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
TE DYPY G C+ + + +I YED+P DE AL +AV QPVSV VEA G+A
Sbjct: 217 DTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRA 276
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
+ Y G+ ECG DHG+ VG+GT E+G YW+++NSWG +WGE+GYIR+ R+
Sbjct: 277 LQLYVSGIFTGECGTALDHGIVAVGYGT---ENGTDYWIVRNSWGSSWGENGYIRMERNM 333
Query: 331 ----EGLCGIATEASYPV 344
G CGIA EASYP+
Sbjct: 334 ADAFSGKCGIAMEASYPI 351
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 158/323 (48%), Positives = 208/323 (64%), Gaps = 14/323 (4%)
Query: 30 GRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTN 89
G E + E E W+ +HG++Y EK R IF+ NL+YI++ N NR+YKLG N
Sbjct: 38 GLVRSEDEVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLN 97
Query: 90 EFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIK 147
F+D+TNEE+R Y G R SR + + +Y V +P SIDWREKGAVT +K
Sbjct: 98 RFADITNEEYRTGYLGAKR---DASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVK 154
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYII 206
+QG CGSCWAFS +AAVEG+ Q+ G LI LSEQ+LVDC N GC+GG M AF++II
Sbjct: 155 DQGSCGSCWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFII 214
Query: 207 ENKGLATEADYPYQQEQGTCDKQKE-KAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
+N G+ +E DYPY + G CD ++ A A+I YE++P +E +L +AV QPVSV +
Sbjct: 215 KNGGIDSEEDYPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAI 274
Query: 266 EASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
EA G F+ Y G+ CG + DHGVA VG+GT E+G YW++KNSWG+ WGE GY+
Sbjct: 275 EAGGYDFQLYSSGIFTGSCGTDLDHGVAAVGYGT---ENGVDYWIVKNSWGDYWGEKGYV 331
Query: 326 RILRD----EGLCGIATEASYPV 344
R+ R+ GLCGIA EASYP
Sbjct: 332 RMQRNVKAKTGLCGIAMEASYPT 354
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 157/328 (47%), Positives = 206/328 (62%), Gaps = 16/328 (4%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNR 82
+VS E + +WMA HGRTY E+ R +F+ NL Y++ N G
Sbjct: 30 SIVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVH 89
Query: 83 TYKLGTNEFSDLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
+++LG N F+DLTN+E+RA+Y G +RP R+ + + D+P S+DWR KG
Sbjct: 90 SFRLGLNRFADLTNDEYRATYLGVRSRP----QRERRLGDRYLAGDNEDLPESVDWRAKG 145
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDK 200
AV +K+QG CGSCWAFS +AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD
Sbjct: 146 AVAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDY 205
Query: 201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQP 260
AFE+II N G+ TE DYPY+ G CD ++ A TI YED+P E +L +AV QP
Sbjct: 206 AFEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQP 265
Query: 261 VSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
+SV +EA G+AF+ Y G+ CG DHGV VG+GT E+G YW++KNSWG +WG
Sbjct: 266 ISVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWG 322
Query: 321 ESGYIRILRD----EGLCGIATEASYPV 344
ESGY+R+ R+ G CGIA E SYP+
Sbjct: 323 ESGYVRMERNIKASSGKCGIAVEPSYPL 350
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 153/321 (47%), Positives = 215/321 (66%), Gaps = 17/321 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTY----KDELEKAMRLTIFKQNLEYIEKAN-KEGNRTYKLGTN 89
EP + ++ W+A+HGR Y + E E+ R +F NL +++ N + G R ++LG N
Sbjct: 50 EPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGFRLGMN 109
Query: 90 EFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKN 148
+F+DLTN+EFRA+Y G VP+ R + +++ + +P S+DWREKGAV +KN
Sbjct: 110 QFADLTNDEFRAAYLGA--MVPAARRGAVVGERYRHDGAAEELPESVDWREKGAVAPVKN 167
Query: 149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYII 206
QG CGSCWAFSAV++VE + QI G+++ LSEQ+LV+CSTD N+GC+GGLMD AF++II
Sbjct: 168 QGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFII 227
Query: 207 ENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVE 266
+N G+ TE DYPY+ G CD ++ A +I +ED+P+ DE +L +AV QPVSV +E
Sbjct: 228 KNGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIE 287
Query: 267 ASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
A G+ F+ YK GV + C N DHGV VG+G E+G YW+++NSWG WGE+GYIR
Sbjct: 288 AGGREFQLYKSGVFSGSCTTNLDHGVVAVGYGA---ENGKDYWIVRNSWGPKWGEAGYIR 344
Query: 327 ILRD----EGLCGIATEASYP 343
+ R+ G CGIA ASYP
Sbjct: 345 MERNVNASTGKCGIAMMASYP 365
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 160/347 (46%), Positives = 221/347 (63%), Gaps = 12/347 (3%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRL 63
+K + +++ ++L T + E S+ + +E+W + H T L EK R
Sbjct: 1 MKKLLFVALYLALVLGFTESFDFHEKDLESEESLWDLYEKWRSHH--TVSTSLDEKRKRF 58
Query: 64 TIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS-T 122
+F+ N+ ++ NK ++ YKL N+F+D+TN EFR +Y ++ R + + +
Sbjct: 59 NVFRANVLHVHNTNKM-DKPYKLKLNKFADMTNHEFRTAYASSKVKHHTMFRGAPLGNGS 117
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
F Y N+ VP SIDWR+KGAVT +K+QG CGSCWAFS + AVEGI I KLI LSEQ+
Sbjct: 118 FMYGNIDKVPASIDWRKKGAVTPVKDQGKCGSCWAFSTIVAVEGINFIKTNKLISLSEQE 177
Query: 183 LVDCST-DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
LVDC+T +N+GC+GGLMD AFE+I + KG+ TEA+YPY+ + G CD K A +I +
Sbjct: 178 LVDCNTGENHGCNGGLMDYAFEFITKQKGITTEANYPYRAQDGHCDANKANQPAVSIDGH 237
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
ED+ +E+ALL+AV QPVSV ++A G F+FY GV ECG DHGVA+VG+GT
Sbjct: 238 EDVLHNNENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGECGKELDHGVAIVGYGTT- 296
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
DG KYW+++NSWG WGE GYIR+ R GLCGIA EASYP+
Sbjct: 297 -VDGTKYWIVRNSWGPEWGERGYIRMQRGISDRRGLCGIAMEASYPI 342
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 224/361 (62%), Gaps = 36/361 (9%)
Query: 9 FIIPMFVIIILVITCASQVVS----------------GRSMHEPSIVEKHEQWMAQHGRT 52
F+ P I+ L + S V GRS E ++ +E W+ +HG+
Sbjct: 3 FLKPTMAILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRS--EAEVMSIYEAWLVKHGKA 60
Query: 53 YKDE--LEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPV 110
+EK R IFK NL ++++ N E N +Y+LG F+DLTN+E+R+ Y G
Sbjct: 61 QSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLG----- 114
Query: 111 PSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGIT 168
+ ++ R ++ +Y+ V D +P SIDWR+KGAV +K+QG CGSCWAFS + AVEGI
Sbjct: 115 AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGIN 174
Query: 169 QITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD 227
QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II+N G+ T+ DYPY+ GTCD
Sbjct: 175 QIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCD 234
Query: 228 KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDN 287
+ ++ A TI YED+P E +L +AV QP+S+ +EA G+AF+ Y G+ + CG
Sbjct: 235 QIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQ 294
Query: 288 CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
DHGV VG+GT E+G YW+++NSWG++WGESGY+R+ R+ G CGIA E SYP
Sbjct: 295 LDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYP 351
Query: 344 V 344
+
Sbjct: 352 I 352
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 224/361 (62%), Gaps = 36/361 (9%)
Query: 9 FIIPMFVIIILVITCASQVVS----------------GRSMHEPSIVEKHEQWMAQHGRT 52
F+ P I+ L + S V GRS E ++ +E W+ +HG+
Sbjct: 3 FLKPTMAILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRS--EAEVMSIYEAWLVKHGKA 60
Query: 53 YKDE--LEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPV 110
+EK R IFK NL ++++ N E N +Y+LG F+DLTN+E+R+ Y G
Sbjct: 61 QSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLG----- 114
Query: 111 PSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGIT 168
+ ++ R ++ +Y+ V D +P SIDWR+KGAV +K+QG CGSCWAFS + AVEGI
Sbjct: 115 AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGIN 174
Query: 169 QITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD 227
QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II+N G+ T+ DYPY+ GTCD
Sbjct: 175 QIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCD 234
Query: 228 KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDN 287
+ ++ A TI YED+P E +L +AV QP+S+ +EA G+AF+ Y G+ + CG
Sbjct: 235 QIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQ 294
Query: 288 CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
DHGV VG+GT E+G YW+++NSWG++WGESGY+R+ R+ G CGIA E SYP
Sbjct: 295 LDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYP 351
Query: 344 V 344
+
Sbjct: 352 I 352
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 155/314 (49%), Positives = 215/314 (68%), Gaps = 13/314 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E W++ + Y+ EK +R +FK NL++I++ NK+ ++Y LG NEF+DL++E
Sbjct: 47 LIELFENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKK-VKSYWLGLNEFADLSHE 105
Query: 98 EFRASYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
EF+ Y G + V R R + F Y++V VP S+DWR+KGAV +KNQG CGSCW
Sbjct: 106 EFKKMYLGLKTDI--VRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEA 215
AFS VAAVEGI +I G L LSEQ+L+DC T NNGC+GGLMD AFEYI++N GL E
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEE 223
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
DYPY E+GTC+ QK+++ TI ++D+P DE +LL+A+ QP+SV ++ASG+ F+FY
Sbjct: 224 DYPYSMEEGTCEMQKDESETVTIDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFY 283
Query: 276 K-RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---- 330
V + CG + DHGVA VG+G+++ G+ Y ++KNSWG WGE GYIR+ R+
Sbjct: 284 SGVSVFDGRCGVDLDHGVAAVGYGSSK---GSDYIIVKNSWGPKWGEKGYIRLKRNTGKP 340
Query: 331 EGLCGIATEASYPV 344
EGLCGI AS+P
Sbjct: 341 EGLCGINKMASFPT 354
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 156/345 (45%), Positives = 219/345 (63%), Gaps = 10/345 (2%)
Query: 6 EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTI 65
+K +I + + ++LV++ + + S+ + +E+W + H + ++ EK R +
Sbjct: 4 KKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS-TFK 124
FK N+ ++ NK ++ YKL N+F+D+TN EF+ +Y G + R + R S TF
Sbjct: 63 FKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGTKVNHHRMFRGTPRVSGTFM 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
Y+N T P S+DWR+KGAVT +K+QG CGSCWAFS V AVEGI QI +L+ LSEQ+L+
Sbjct: 122 YENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELI 181
Query: 185 DCST-DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
DC +N GC+GGLM+ AFEYI + G+ TE+ YPY G+CD KE +I +E
Sbjct: 182 DCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDGHET 241
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P DE ALL+AV QPVSV ++A G F+FY GV +CG +HGVA+VG+GT
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT--V 299
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
DG YW+++NSWG WGE G IR+ R+ EGLCGIA EASYPV
Sbjct: 300 DGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 161/350 (46%), Positives = 223/350 (63%), Gaps = 26/350 (7%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPS----------IVEKHEQWMAQHGRTYKDELEKA 60
I I IL++ C + V++ S P+ + ++ + W+ +HGR YK E+
Sbjct: 5 ILTTTIFILLMLCNTCVIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDERE 64
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
+R I++ N++YI+ N + N +Y L N+F+DLTNEEF+++Y G + +R S
Sbjct: 65 VRFGIYQANVQYIQCKNAQKN-SYNLTDNKFADLTNEEFQSTYMGLS------TRLRSHN 117
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+ F+Y D+P S DWR++GAVT I +QG CG CWAF+AVAAVEGI +I GKLI LSE
Sbjct: 118 TGFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISLSE 177
Query: 181 QQLVDCS--TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q+L+DC + N GC GGLM+ A+ +IIEN GL TE DYPY+ GTC +K AA+I
Sbjct: 178 QELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYAASI 237
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YE++P +E L A QPVSV ++A G +F+FY GV + CG +HGV VVG+G
Sbjct: 238 SGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVGYG 297
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+E KYW++KNSWG WGESGYIR+ RD EG+CGIA +ASYP+
Sbjct: 298 ---KETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 344
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 159/317 (50%), Positives = 207/317 (65%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ +E+W + H T L EK R +FK+N+ ++ + NK+ + YKL N+F+D
Sbjct: 31 EESLWNLYERWRSHH--TVSRSLDEKHKRFNVFKENVNFVHEFNKK-DEPYKLKLNKFAD 87
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
+TN EFR++Y G + R S + +F Y+ V VP S+DWR+KGAVT IK+QG C
Sbjct: 88 MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 147
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENKGL 211
GSCWAFS V AVEGI I KL+ LSEQ+LVDC T +N GC+GGLM AFE+I E G+
Sbjct: 148 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGI 207
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
TE YPY E GTCD K + +I +E +P +E ALL+A QP+SV ++A G A
Sbjct: 208 TTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSA 267
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR-- 329
F+FY GV CG + DHGVA+VG+GT DG KYW++KNSWG WGE+GYIR+ R
Sbjct: 268 FQFYSEGVFAGRCGTDLDHGVAIVGYGTT--LDGTKYWIVKNSWGTDWGENGYIRMKRGI 325
Query: 330 --DEGLCGIATEASYPV 344
EGLCGIA EASYP+
Sbjct: 326 SAKEGLCGIAVEASYPI 342
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 167/354 (47%), Positives = 223/354 (62%), Gaps = 26/354 (7%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIV----------EKHEQWMAQHGRTYKDELEKA 60
+ FV+ +LV++ A+ + GR + +HE+WMA+HG+TYKDE EKA
Sbjct: 3 LSTFVLAVLVMSGAAAL--GRELAGDGAAAAAAADVAMASRHEKWMAKHGKTYKDEEEKA 60
Query: 61 MRLTIFKQNLEYIEKAN----KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQ 116
RL +F+ N + I+ N K+G ++L TN F+DLT++EFRA+ TGY RP P+
Sbjct: 61 RRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDEFRAARTGYQRP-PAAVAG 119
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
+ ++ ++ P S+DWR GAVT +K+QG CG CWAFSAVAAVEG+ +I G+L+
Sbjct: 120 AGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLAKIRTGQLV 179
Query: 177 ELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAA 234
LSEQ+LVDC ++ GC GGLMD AF+YI GLA E+ YPY+ + A
Sbjct: 180 SLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAESSYPYRGVD-GACRAAAGRA 238
Query: 235 AATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVA 293
AA+I ++D+P DE AL+ AV +QPVSV + +G FRFY RGVL A CG +H V
Sbjct: 239 AASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDRGVLGGAGCGTELNHAVT 298
Query: 294 VVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
VG+GTA DG YWL+KNSWG +WGE GY+RI R EG CGIA ASYPV
Sbjct: 299 AVGYGTA--SDGTGYWLMKNSWGASWGEGGYVRIRRGVGREGACGIAQMASYPV 350
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 315 bits (808), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 157/312 (50%), Positives = 208/312 (66%), Gaps = 12/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I++ E W+++H + Y+ EK R IFK NL +I++ NK+ Y LG NEF+DL++E
Sbjct: 29 IIDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKK-VVNYWLGLNEFADLSHE 87
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G N +S + F Y++V+ +P S+DWR+KGAVT +KNQG CGSCWA
Sbjct: 88 EFKNKYLGLN---VDLSNRRECSEEFTYKDVSSIPKSVDWRKKGAVTDVKNQGSCGSCWA 144
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+LVDC T NNGC+GGLMD AF YII N GL E D
Sbjct: 145 FSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGGLMDYAFAYIISNGGLHKEED 204
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTC+ +K ++ TI Y D+P+ E +LL+A+ QP+SV ++ASG+ F+FY
Sbjct: 205 YPYIMEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGRDFQFYS 264
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
GV + CG DHGVA VG+G+A+ G + ++KNSWG WGE G+IR+ R+ G
Sbjct: 265 GGVFDGHCGTELDHGVAAVGYGSAK---GLDFIVVKNSWGSKWGEKGFIRMKRNTGKPAG 321
Query: 333 LCGIATEASYPV 344
LCGI ASYP
Sbjct: 322 LCGINKMASYPT 333
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 156/345 (45%), Positives = 219/345 (63%), Gaps = 10/345 (2%)
Query: 6 EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTI 65
+K +I + + ++LV++ + + S+ + +E+W + H + ++ EK R +
Sbjct: 4 KKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS-TFK 124
FK N+ ++ NK ++ YKL N+F+D+TN EF+ +Y G + R + R S TF
Sbjct: 63 FKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFM 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
Y+N T P S+DWR+KGAVT +K+QG CGSCWAFS V AVEGI QI +L+ LSEQ+L+
Sbjct: 122 YENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELI 181
Query: 185 DCST-DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
DC +N GC+GGLM+ AFEYI + G+ TE+ YPY G+CD KE +I +E
Sbjct: 182 DCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDGHET 241
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P DE ALL+AV QPVSV ++A G F+FY GV +CG +HGVA+VG+GT
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTT--V 299
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
DG YW+++NSWG WGE G IR+ R+ EGLCGIA EASYPV
Sbjct: 300 DGTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 155/311 (49%), Positives = 201/311 (64%), Gaps = 14/311 (4%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
+ +WMA HGRTY + R +F+ NL YI+ N G +++LG N F+DLTN+E
Sbjct: 44 YAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103
Query: 99 FRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAF 158
+ A+Y G R P R+ + + + D+P S+DWR KGAV +K+QG CG+CWAF
Sbjct: 104 YPATYLG-ARTRPQRDRKLG--ARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGTCWAF 160
Query: 159 SAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADY 217
S +AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE DY
Sbjct: 161 STIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKDY 220
Query: 218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKR 277
PY+ G CD ++ A TI YED+P DE +L +AV QPVSV +EA+G AF+ Y
Sbjct: 221 PYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 280
Query: 278 GVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGL 333
G+ CG DHGV VG+GT E+G YW++KNSWG +WGESGY+R+ R+ G
Sbjct: 281 GIFTGSCGTRLDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGK 337
Query: 334 CGIATEASYPV 344
CGIA E SYP+
Sbjct: 338 CGIAVEPSYPL 348
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 159/349 (45%), Positives = 218/349 (62%), Gaps = 23/349 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEK------HEQWMAQHGRTYKDELEKAMRLTI 65
P F+ + LV + E + + +E+W H +D EK R +
Sbjct: 4 PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHH-TVARDLDEKNRRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG----YNRPVPSVSRQSSRPS 121
FK+N+++I + N++ + YKL N+F D+TN+EFR+ Y G ++R + + +
Sbjct: 63 FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTG--- 119
Query: 122 TFKYQNVTDVPT-SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+F Y+NV +P SIDWR KGAVT +K+QG CGSCWAFS +A+VEGI QI G+L+ LSE
Sbjct: 120 SFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSE 179
Query: 181 QQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
Q+LVDC T N GC+GGLMD AFE+I +N G+ TE YPY ++ GTC + +I
Sbjct: 180 QELVDCDTSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTCASNLLNSPVVSID 238
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
++D+P +E+AL+QAV QP+SV +EASG F+FY GV CG DHGVA+VG+G
Sbjct: 239 GHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGA 298
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
DG KYW++KNSWGE WGESGYIR+ R G CGIA EASYP+
Sbjct: 299 T--RDGTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 161/361 (44%), Positives = 224/361 (62%), Gaps = 36/361 (9%)
Query: 9 FIIPMFVIIILVITCASQVVS----------------GRSMHEPSIVEKHEQWMAQHGRT 52
F+ P I+ L + S V GRS E ++ +E W+ +HG+
Sbjct: 3 FLKPTMAILFLAMVTVSSAVDMSIISYDEKHGVSTTGGRS--EAEVMSIYEAWLVKHGKA 60
Query: 53 YKDE--LEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPV 110
+EK R IFK NL ++++ N E N +Y+LG F+DLTN+E+R+ Y G
Sbjct: 61 QSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLG----- 114
Query: 111 PSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGIT 168
+ ++ R ++ +Y+ V D +P SIDWR+KGAV +K+QG CGSCWAFS + AVEGI
Sbjct: 115 AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGIN 174
Query: 169 QITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD 227
QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II+N G+ T+ DYPY+ GTCD
Sbjct: 175 QIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCD 234
Query: 228 KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDN 287
+ ++ A TI YED+P E +L +AV QP+S+ +EA G+AF+ Y G+ + CG
Sbjct: 235 QIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQ 294
Query: 288 CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
DHGV VG+GT E+G YW+++NSWG++WGESGY+R+ R+ G CGIA E SYP
Sbjct: 295 LDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYP 351
Query: 344 V 344
+
Sbjct: 352 I 352
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 156/347 (44%), Positives = 223/347 (64%), Gaps = 15/347 (4%)
Query: 9 FIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQ 68
+I +F ++IL C E + + +++W + H + E+ R +F+
Sbjct: 5 LLIFLFSLVILETACGFDYEDKEIESEEGLSKLYDRWRSHHS-VPRSLHEREKRFNVFRH 63
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG----YNRPVPSVSRQSSRPSTFK 124
N+ ++ +NK+ NR+YKL N+F+DLT EF+ +YTG ++R + R S+ +
Sbjct: 64 NVMHVHNSNKK-NRSYKLKLNKFADLTIHEFKNAYTGSKIKHHRMLQGPKR-GSKQFMYD 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
++NV+ +P+S+DWR+KGAVT IKNQG CGSCWAFS VAAVEGI +I KL+ LSEQ+LV
Sbjct: 122 HENVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELV 181
Query: 185 DCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
DC T+ N GC+GGLM+ AFE+I +N G+ TE YPY+ G CD K+ TI +E+
Sbjct: 182 DCDTNQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEN 241
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P+ DE+ALL+AV QPVSV ++A F+FY GV +CG +HGVA VG+G+ +
Sbjct: 242 VPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGYGS---Q 298
Query: 304 DGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVAM 346
G KYW+++NSWG WGE GYI+I R EG CGIA EASYP+ +
Sbjct: 299 GGKKYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPIKL 345
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 154/320 (48%), Positives = 209/320 (65%), Gaps = 16/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN-KEGNRTYKLGTNEFSD 93
EP +E W+A+HGR Y E+ R +F NL +++ N + ++LG N+F+D
Sbjct: 102 EPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFAD 161
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKGAVTHIKNQG 150
LTN+EFRA+Y G P SR+ +Y++ ++P S+DWREKGAV +KNQG
Sbjct: 162 LTNDEFRAAYLGARIPA---SRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQG 218
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIEN 208
CGSCWAFSAV++VE + QI G+++ LSEQ+LV+CSTD N+GC+GGLMD AF++II+N
Sbjct: 219 QCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKN 278
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
G+ TE DYPY+ G CD +E A +I +ED+P+ DE +L +AV QPVSV +EA
Sbjct: 279 GGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAG 338
Query: 269 GQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
G+ F+ YK GV C N DHGV VG+GT E+G YW+++NSWG WGE GYIR+
Sbjct: 339 GREFQLYKAGVFTGTCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGAKWGEDGYIRME 395
Query: 329 RD----EGLCGIATEASYPV 344
R+ G CGIA ASYP
Sbjct: 396 RNVNATTGKCGIAMMASYPT 415
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 162/349 (46%), Positives = 223/349 (63%), Gaps = 13/349 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAM 61
++ +K F + + ++L + + + E + + +E+W + H T L EK
Sbjct: 1 MEVKKVFFVALSFALVLRVAESFEFNEKDLESEEGLWDLYERWRSHH--TVSRSLDEKHN 58
Query: 62 RLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS 121
R +FK N+ ++ +NK ++ YKL N F+D+TN EFR+ Y G + R + R +
Sbjct: 59 RFNVFKGNVMHVHSSNKM-DKPYKLKLNRFADMTNHEFRSIYAGSKVNHHRMFRGTPRGN 117
Query: 122 -TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
TF YQNV VP+S+DWR+KGAVT +K+QG CGSCWAFS + AVEGI QI KL+ LSE
Sbjct: 118 GTFMYQNVDRVPSSVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTHKLVPLSE 177
Query: 181 QQLVDC-STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
Q+LVDC +T N GC+GGLM+ AFE+ I+ G+ T ++YPY+ + GTCD K A +I
Sbjct: 178 QELVDCDTTQNQGCNGGLMESAFEF-IKQYGITTASNYPYEAKDGTCDASKVNEPAVSID 236
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
+E++P +E ALL+AV QPVSV +EA G F+FY GV CG DHGVA+VG+GT
Sbjct: 237 GHENVPVNNEAALLKAVAHQPVSVAIEAGGIDFQFYSEGVFTGNCGTALDHGVAIVGYGT 296
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+DG KYW +KNSWG WGE GYIR+ R +GLCGIA EASYP+
Sbjct: 297 T--QDGTKYWTVKNSWGSEWGEKGYIRMKRSISVKKGLCGIAMEASYPI 343
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 160/314 (50%), Positives = 207/314 (65%), Gaps = 12/314 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E+W+A++ + Y EK R +FK NL +I+ NK+ +Y LG NEF+DLT++
Sbjct: 47 LIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEFADLTHD 105
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSC 155
EF+A+Y G P + + F+Y +++ VP +DWR+K AVT +KNQG CGSC
Sbjct: 106 EFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSC 165
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATE 214
WAFS VAAVEGI I G L LSEQ+L+DCSTD NNGC+GGLMD AF YI GL TE
Sbjct: 166 WAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTE 225
Query: 215 ADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
YPY E+G CD+ K AA TI YED+P DE AL++A+ QPVSV +EASG+ F+F
Sbjct: 226 EAYPYAMEEGDCDEGK-GAAVVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQF 284
Query: 275 YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----D 330
Y GV + CG+ DHGV VG+GT++ +D Y ++KNSWG WGE GYIR+ R
Sbjct: 285 YSGGVFDGPCGEQLDHGVTAVGYGTSKGQD---YIIVKNSWGPHWGEKGYIRMKRGTGKG 341
Query: 331 EGLCGIATEASYPV 344
EGLCGI ASYP
Sbjct: 342 EGLCGINKMASYPT 355
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 155/317 (48%), Positives = 212/317 (66%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ + +E+W + H T L EK R +FK N+ ++ NK ++ YKL N+F+D
Sbjct: 33 EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFAD 89
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
+TN EFR++Y G + R S S TF Y+ V VP S+DWR+KGAVT +K+QG C
Sbjct: 90 MTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQC 149
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGL 211
GSCWAFS + AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
TE++YPY+ ++GTCD+ K A +I +E++P DE+ALL+AV QPVSV ++A G
Sbjct: 210 TTESNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
F+FY GV +C + +HGVA+VG+GT DG YW+++NSWG WGE GYIR+ R+
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTT--VDGTNYWIVRNSWGPEWGEQGYIRMQRNI 327
Query: 331 ---EGLCGIATEASYPV 344
EGLCGIA ASYP+
Sbjct: 328 SKKEGLCGIAMMASYPI 344
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 207/315 (65%), Gaps = 17/315 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+ E+HE+WMA++ R YKD EKA R +FK N ++E N + + LG N+F+DLT E
Sbjct: 1 MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQN--VTDVPTSIDWREKGAVTHIKNQGHCGSC 155
EF+A N+ +S + + FKY+N V+ +PT++DWR KGAVT IKNQG CG C
Sbjct: 61 EFKA-----NKGFKPISAEEVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCC 115
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN--NGCSGGLMDKAFEYIIENKGLAT 213
WAFSA+AA+EGI +++ G L+ LSEQ+ VDC T N GC GG MD AFE++I+N GLAT
Sbjct: 116 WAFSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLAT 175
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
E+ YPY+ G C + +AATI +ED+P +E AL++ V QPVSV V+AS + F
Sbjct: 176 ESSYPYKVVDGKC--KGGSKSAATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFM 233
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--- 330
Y GV+ CG DHG+A +G+G E D KYW++KNSWG TWGE G++R+ +D
Sbjct: 234 LYSGGVMTGSCGTQLDHGIAAIGYGV--ESDDTKYWILKNSWGTTWGEKGFLRMEKDISD 291
Query: 331 -EGLCGIATEASYPV 344
G+C +A + SYP
Sbjct: 292 KRGMCDLAMKPSYPT 306
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 158/345 (45%), Positives = 218/345 (63%), Gaps = 20/345 (5%)
Query: 14 FVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQH--GRTYKDELEKAMRLTI 65
F+ ++L ++ V + H E S+ + +E+W + H R+ D K R +
Sbjct: 6 FLWVVLSLSLVLGVANSFDFHDKDLESEESLWDLYERWRSHHTVSRSLGD---KHKRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS-TFK 124
FK N+ ++ NK ++ YKL N+F+D+TN EFR++Y G + R R + TF
Sbjct: 63 FKANMMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRDMPRGNGTFM 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
Y+ V VP S+DWR+KGAVT +K+QGHCGSCWAFS V AVEGI QI KL+ LSEQ+LV
Sbjct: 122 YEKVGSVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELV 181
Query: 185 DCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
DC T+ N GC+GGLM+ AF++I + G+ TE+ YPY + GTCD K A +I +E+
Sbjct: 182 DCDTEENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKANDLAVSIDGHEN 241
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P DE+ALL+AV QPVSV ++A G F+FY GV +C +HGVA+VG+G
Sbjct: 242 VPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGAT--V 299
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
DG YW+++NSWG WGE GYIR+ R+ EGLCGIA ASYP+
Sbjct: 300 DGTSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASYPI 344
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 158/312 (50%), Positives = 205/312 (65%), Gaps = 12/312 (3%)
Query: 42 HEQWMAQHGRTYK-DELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+++W QH T D E A R IFK+N+++I+ NK+ + YKLG N+F+DL+NEEF+
Sbjct: 45 YDKWALQHRSTRSLDSDEHARRFEIFKENVKHIDSVNKK-DGPYKLGLNKFADLSNEEFK 103
Query: 101 ASY--TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAF 158
A + T + + +F YQN +P SIDWR+KGAVT +KNQG CGSCWAF
Sbjct: 104 AMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQGQCGSCWAF 163
Query: 159 SAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYP 218
S +A+VEGI I GKL+ LSEQQLVDCS +N GC+GGLMD AF+YII+N G+ TE +YP
Sbjct: 164 STIASVEGINYIKTGKLVSLSEQQLVDCSKENAGCNGGLMDNAFQYIIDNGGIVTEDEYP 223
Query: 219 YQQEQGTCDKQK--EKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
Y E G C K K+ A I +ED+P +E AL +AV QPVS+ +EASG F+FY
Sbjct: 224 YTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAIEASGHDFQFYS 283
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEG 332
GV +CG DHGV VVG+G + E G YW+++NSWG WGE GYIR+ R EG
Sbjct: 284 TGVFTGKCGTELDHGVVVVGYGKSPE--GINYWIVRNSWGPEWGEQGYIRMQRGIEATEG 341
Query: 333 LCGIATEASYPV 344
CGI+ +ASYP
Sbjct: 342 KCGISMQASYPT 353
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 163/319 (51%), Positives = 209/319 (65%), Gaps = 26/319 (8%)
Query: 43 EQWMAQHGRTYKDEL--------EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ WM QHG++Y + EKA R IFK NL +I N E N+ Y LG N F+DL
Sbjct: 58 DSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGYFLGLNAFADL 116
Query: 95 TNEEFRASYTG--YNRPVPSVSRQSSRPSTFKYQNV--TDVPTSIDWREKGAVTHIKNQG 150
TNEEFRA G ++R SR+ + F+Y +V D+P SIDWREKGAV +K+QG
Sbjct: 117 TNEEFRAQRHGGRFDR-----SRERTSYEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQG 171
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENK 209
CGSCWAFSAVAA+EG+ ++ G+L+ LSEQ+LVDC ++ GC+GGLMD AF ++I+N
Sbjct: 172 SCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKNG 231
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
GL TEADYPY+ CD+ K A TI YED+P DE ALL+AV QPVSV ++A G
Sbjct: 232 GLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDAGG 291
Query: 270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
+ +FY+ G+ CG + DHGV VG+G +EDG YW+IKNSWG WGE GYI++ R
Sbjct: 292 SSMQFYRSGIFTGRCGTDLDHGVTNVGYG---KEDGKAYWIIKNSWGSNWGEKGYIKMAR 348
Query: 330 D----EGLCGIATEASYPV 344
+ GLCGI EASYP
Sbjct: 349 NTGLAAGLCGINMEASYPT 367
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 153/316 (48%), Positives = 211/316 (66%), Gaps = 10/316 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ + +E+W + H + EK R +FK+N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHH-TVSRSLTEKHKRFNVFKENVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
TN EFR++Y G + R + + TF Y+ V VP S+DWR+KGAVT +K+QG CG
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCG 150
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLA 212
SCWAFS V AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 151 SCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGIT 210
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE++YPY ++GTCD K A +I +E++P DE+ALL+AV QPVSV ++A G F
Sbjct: 211 TESNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDF 270
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+FY GVL +C + +HGVA+VG+GT DG YW+++NSWG WGE GYIR+ R+
Sbjct: 271 QFYSEGVLTGDCNTDLNHGVAIVGYGTT--VDGTNYWIVRNSWGPEWGEQGYIRMQRNIS 328
Query: 331 --EGLCGIATEASYPV 344
EGLCGIA ASYP+
Sbjct: 329 KKEGLCGIAMMASYPI 344
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 162/328 (49%), Positives = 217/328 (66%), Gaps = 21/328 (6%)
Query: 27 VVSGRSMHEPSIVEK-HEQWMAQHGRTYKDE----LEKAMRLTIFKQNLEYIEKANKEGN 81
VS RS E VE+ +E WM +HG+ ++ EK R IFK NL YI++ N + N
Sbjct: 37 TVSSRSDAE---VERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTK-N 92
Query: 82 RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
+YKLG F+DLTN+E+R+ Y G +PV V + S R ++ + +P S+DWR++G
Sbjct: 93 LSYKLGLTRFADLTNDEYRSMYLG-AKPVKRVLKTSDR---YEARVGDALPDSVDWRKEG 148
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDK 200
AV +K+QG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLMD
Sbjct: 149 AVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDY 208
Query: 201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQP 260
AFE+II+N G+ TEADYPY+ G CD+ ++ A TI YED+P+ E +L +A+ QP
Sbjct: 209 AFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQP 268
Query: 261 VSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
+SV +EA G+AF+ Y GV + CG DHGV VG+GT E+G YW+++NSWG WG
Sbjct: 269 ISVAIEAGGRAFQLYSSGVFDGICGTELDHGVVAVGYGT---ENGKDYWIVRNSWGNRWG 325
Query: 321 ESGYIRILRD----EGLCGIATEASYPV 344
ESGYI++ R+ G CGIA EASYP+
Sbjct: 326 ESGYIKMARNIAEPTGKCGIAMEASYPI 353
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 155/329 (47%), Positives = 210/329 (63%), Gaps = 16/329 (4%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN-KEGNRTY 84
G EP +E W+A+HGR Y E+ R +F NL +++ N + +
Sbjct: 36 HAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGF 95
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKG 141
+LG N+F+DLTN+EFRA+Y G P SR+ +Y++ ++P S+DWREKG
Sbjct: 96 RLGMNQFADLTNDEFRAAYLGARIPA---SRRRGTAVGERYRHGGGAEELPESVDWREKG 152
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMD 199
AV +KNQG CGSCWAFSAV++VE + QI G+++ LSEQ+LV+CSTD N+GC+GGLMD
Sbjct: 153 AVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMD 212
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ 259
AF++II+N G+ TE DYPY+ G CD +E A +I +ED+P+ DE +L +AV Q
Sbjct: 213 AAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQ 272
Query: 260 PVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
PVSV +EA G+ F+ YK GV C N DHGV VG+GT E+G YW+++NSWG W
Sbjct: 273 PVSVAIEAGGREFQLYKAGVFTGTCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGAKW 329
Query: 320 GESGYIRILRD----EGLCGIATEASYPV 344
GE GYIR+ R+ G CGIA ASYP
Sbjct: 330 GEDGYIRMERNVNATTGKCGIAMMASYPT 358
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 162/350 (46%), Positives = 225/350 (64%), Gaps = 21/350 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRLT 64
+ +F +I++ AS + + E S+ +E+W + H + +D EK R
Sbjct: 1 MKLFSLILVASFLASVAATAIDIADKDLETEDSLWNLYERWRSHHTVS-RDLDEKQKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG----YNRPVPSVSRQSSRP 120
+FK+N YI NK + YKL N+F+DLTN EFR++Y G ++R + SR+
Sbjct: 60 VFKENPRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRG-SRRGGAT 118
Query: 121 STFKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
++F YQ++ +P SIDWR+KGAVT +K+QG CGSCWAFS VAAVEGI QI KL+ L
Sbjct: 119 NSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLLSL 178
Query: 179 SEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
SEQ+L+DC TD NNGC+GGLMD AF++I +N G+++EA+YPY E C +K K+ +
Sbjct: 179 SEQELIDCDTDENNGCNGGLMDYAFDFIKKNGGISSEAEYPYAAEDSYCATEK-KSHVVS 237
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I +ED+P DE +LL+AV QPVS+ +EASG F+FY GV G DHGVA+VG+
Sbjct: 238 IDGHEDVPANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGVFTGRSGTELDHGVAIVGY 297
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRI---LRDEGLCGIATEASYPV 344
G ++ G KYW+++NSWG WGE GYIRI + LCG+A EASYP+
Sbjct: 298 GKTQQ--GTKYWIVRNSWGAEWGEKGYIRISAASDSKRLCGLAMEASYPI 345
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 163/321 (50%), Positives = 215/321 (66%), Gaps = 33/321 (10%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
+ E S +EKHEQWM++ R Y D+ EK R IFK+NL+++E N N TYKL N+FS
Sbjct: 9 LFEASAIEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFS 68
Query: 93 DLTNEEFRASYTGYNRPVP-SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
DLT+EEF+A Y G VP ++ S + +F+Y+NV++ S+DWR +GAVT +K+QG
Sbjct: 69 DLTDEEFQARYMGL---VPEGMTGDSQKTVSFRYENVSETGESMDWRLEGAVTPVKDQGQ 125
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN--GCSGGLMDKAFEYIIENK 209
CG CWAF+AVAAVEG+T+I G+L+ LSEQQLVDCST NN GC GGL A++YI EN+
Sbjct: 126 CGCCWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQ 185
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
G+ +E +YPYQ Q TC + AAATI YE +PK DE ALL+AV++
Sbjct: 186 GITSEENYPYQAVQQTC--KSTDPAAATISGYEAVPKDDEEALLKAVSQH---------- 233
Query: 270 QAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
G+ E CG + H V +VG+GT+EE G KYWL+KNSWGE+WGE+GY+RI
Sbjct: 234 --------GIFEDEYCGTDSHHAVTIVGYGTSEE--GIKYWLLKNSWGESWGENGYMRIK 283
Query: 329 RD----EGLCGIATEASYPVA 345
RD +G+CG+A A YPVA
Sbjct: 284 RDVDEPQGMCGLAHRAYYPVA 304
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 162/321 (50%), Positives = 203/321 (63%), Gaps = 17/321 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ + +E+W + H R + EK R FK N +I NK G+ Y+L N F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 95 TNEEFRASYTG-YNRPVPSVSRQSSRPSTFKYQ--NVTDVPTSIDWREKGAVTHIKNQGH 151
EFRA++ G R P+ + S P F Y NV+D+P S+DWR+KGAVT +K+QG
Sbjct: 98 DQAEFRATFVGDLRRDTPA--KPPSVPG-FMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENKG 210
CGSCWAFS V +VEGI I G L+ LSEQ+L+DC T DN+GC GGLMD AFEYI N G
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 211 LATEADYPYQQEQGTCD---KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
L TEA YPY+ +GTC+ + I ++D+P E L +AV QPVSV VEA
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
SG+AF FY GV +CG DHGVAVVG+G A EDG YW +KNSWG +WGE GYIR+
Sbjct: 275 SGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVA--EDGKAYWTVKNSWGPSWGEQGYIRV 332
Query: 328 LRDE----GLCGIATEASYPV 344
+D GLCGIA EASYPV
Sbjct: 333 EKDSGASGGLCGIAMEASYPV 353
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 155/350 (44%), Positives = 211/350 (60%), Gaps = 23/350 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIV------EKHEQWMAQHGRTYKDELEKAMRLTIF 66
+ V ++ V + A ++ E + + +E+W H R ++ EK R F
Sbjct: 9 LLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTF 67
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST---- 122
K+N+ +I NK G+R Y+L N F D+ EEFR+++ + + + RQ S +
Sbjct: 68 KENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFA--DSRINDLRRQDSPAARAGAV 125
Query: 123 --FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
F Y + D P S+DWR++GAVT +K+QGHCGSCWAFS V AVEGI I G L LSE
Sbjct: 126 PGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLASLSE 185
Query: 181 QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK---AAAAT 237
Q+L+DC TD NGC GGLM+ AFE+I G+ TEA YPY+ GTCD + +
Sbjct: 186 QELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGVVVV 245
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I ++ +P G E AL +AV QPVSV V+A GQAF+FY GV +CG + DHGVA VG+
Sbjct: 246 IDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 305
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILR---DEGLCGIATEASYPV 344
G +DG YW++KNSWG +WGE GYIR+ R + GLCGIA EAS+P+
Sbjct: 306 GVG--DDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 158/348 (45%), Positives = 221/348 (63%), Gaps = 10/348 (2%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+K EK ++ + ++++ + + E S+ + +E+W + H +D EK R
Sbjct: 1 MKMEKVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRSYH-TVSRDLEEKNKR 59
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
+FK+N +++ K N+ ++ YKL N+F+D+TN EFR+SY G + R R +
Sbjct: 60 FNVFKENTKHVHKVNQM-DKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTG 118
Query: 123 -FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
F ++ T +P S+DWR+KGAVT IK+QG CGSCWAFS V VEGI QI +L+ LSEQ
Sbjct: 119 GFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQ 178
Query: 182 QLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
QL+DC +D++GC+GGLM+ AFE+I +N G+ TE +YPY+ + CD K A TI
Sbjct: 179 QLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDG 238
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
+E +P DE AL++AV QPVSV ++A G +FY GV + ECG DHGVA+VG+GT
Sbjct: 239 HESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTT 298
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
DG KYW++KNSWG WGE GYIR+ R EG CGIA EASYPV
Sbjct: 299 --LDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPV 344
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 314 bits (805), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 155/350 (44%), Positives = 211/350 (60%), Gaps = 23/350 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIV------EKHEQWMAQHGRTYKDELEKAMRLTIF 66
+ V ++ V + A ++ E + + +E+W H R ++ EK R F
Sbjct: 53 LLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTF 111
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST---- 122
K+N+ +I NK G+R Y+L N F D+ EEFR+++ + + + RQ S +
Sbjct: 112 KENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFA--DSRINDLRRQDSPAARAGAV 169
Query: 123 --FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
F Y + D P S+DWR++GAVT +K+QGHCGSCWAFS V AVEGI I G L LSE
Sbjct: 170 PGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLASLSE 229
Query: 181 QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK---AAAAT 237
Q+L+DC TD NGC GGLM+ AFE+I G+ TEA YPY+ GTCD + +
Sbjct: 230 QELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGVVVV 289
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I ++ +P G E AL +AV QPVSV V+A GQAF+FY GV +CG + DHGVA VG+
Sbjct: 290 IDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 349
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILR---DEGLCGIATEASYPV 344
G +DG YW++KNSWG +WGE GYIR+ R + GLCGIA EAS+P+
Sbjct: 350 GVG--DDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 397
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 155/327 (47%), Positives = 210/327 (64%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
+VS E + + +WMA++GRTY E+ R +F+ NL Y+++ N G +
Sbjct: 27 IVSYGERSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHS 86
Query: 84 YKLGTNEFSDLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R +Y G +PV R+ ++ + ++P S+DWREKGA
Sbjct: 87 FRLGLNRFADLTNEEYRDTYLGVRTKPV----RERRLSGRYQAADNEELPESVDWREKGA 142
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKA 201
V +K+QG CGSCWAFSA+AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD A
Sbjct: 143 VAKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLMDYA 202
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
FE+II N G+ +E DYPY++ CD K+ A TI YED+P E +L +AV QP+
Sbjct: 203 FEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPI 262
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
SV +EA G+AF+ YK G+ CG DHGV VG+G+ E+G YW++KNSWG WGE
Sbjct: 263 SVAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYGS---ENGKDYWIVKNSWGTVWGE 319
Query: 322 SGYIRILRD----EGLCGIATEASYPV 344
GY+R+ R+ G CGIA E SYP+
Sbjct: 320 DGYVRLERNIKATSGKCGIAIEPSYPL 346
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 152/309 (49%), Positives = 210/309 (67%), Gaps = 9/309 (2%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
++ W+ QHG+ Y E+ R IFK NL +I++ N N TYKLG N+F+DLTN+E+RA
Sbjct: 46 YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYRA 105
Query: 102 SYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
+ G +S PS+ + ++ ++P S++WR+ GAV+ +K+QG CGSCWAFSA
Sbjct: 106 KFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSCGSCWAFSA 165
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
+AAVEGI +I G+LI LSEQ+LVDC + GC+GGLMD AF++II+N G+ TE DYPY
Sbjct: 166 IAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDNGGIDTEKDYPY 225
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
CD K+ A +I YED+P +E+AL +AV QPVS+ +EA G+AF+ Y+ GV
Sbjct: 226 LGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALKKAVAHQPVSIAIEAGGRAFQLYESGV 284
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCG 335
N ECG DHGV VG+G+ +++G YW+++NSWG WGE+GYIR+ R + G CG
Sbjct: 285 FNGECGLALDHGVVAVGYGS--DDNGQDYWIVRNSWGGNWGENGYIRMERNINANTGKCG 342
Query: 336 IATEASYPV 344
IA EASYPV
Sbjct: 343 IAMEASYPV 351
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 149/315 (47%), Positives = 204/315 (64%), Gaps = 12/315 (3%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
++ + +E+W H R ++ EK R FK+N +I NK G+R Y+L N F D+
Sbjct: 37 ALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDMGR 95
Query: 97 EEFRASYTGYNRPVPSVSRQ-SSRPST--FKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
EEFR+ + + + + R+ ++ P+ F Y + TD+P S+DWR+KGAVT +KNQG CG
Sbjct: 96 EEFRSGFA--DSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTAVKNQGRCG 153
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLAT 213
SCWAFS V AVEGI I G L+ LSEQ+L+DC TD NGC GGLM+ AFE+I + G+ T
Sbjct: 154 SCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTDENGCQGGLMENAFEFIKSHGGITT 213
Query: 214 EADYPYQQEQGTCDKQK-EKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
E+ YPY GTCD + + I ++ +P G E AL +AV QPVSV ++A GQA
Sbjct: 214 ESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQAL 273
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR--- 329
+FY GV +CG + DHGVA VG+G + +DG YW++KNSWG +WGE GYIR+ R
Sbjct: 274 QFYSEGVFTGDCGTDLDHGVAAVGYGVS--DDGTPYWIVKNSWGPSWGEGGYIRMQRGTG 331
Query: 330 DEGLCGIATEASYPV 344
+ GLCGIA EAS+P+
Sbjct: 332 NGGLCGIAMEASFPI 346
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 155/317 (48%), Positives = 211/317 (66%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ + +E+W + H T L EK R +FK N+ ++ NK ++ YKL N+F+D
Sbjct: 33 EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFAD 89
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
+TN EFR++Y G + R S S TF Y+ V VP S+DWR+KGAVT +K+QG C
Sbjct: 90 MTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQC 149
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGL 211
GSCWAFS + AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
TE++YPY ++GTCD+ K A +I +E++P DE+ALL+AV QPVSV ++A G
Sbjct: 210 TTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
F+FY GV +C + +HGVA+VG+GT DG YW+++NSWG WGE GYIR+ R+
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTT--VDGTNYWIVRNSWGPEWGEQGYIRMQRNI 327
Query: 331 ---EGLCGIATEASYPV 344
EGLCGIA ASYP+
Sbjct: 328 SKKEGLCGIAMMASYPI 344
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 154/329 (46%), Positives = 211/329 (64%), Gaps = 16/329 (4%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN-KEGNRTY 84
G EP +E W+A+HGR Y E+ R +F NL +++ N + +
Sbjct: 33 HAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGF 92
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKG 141
+LG N+F+DLTN+EFRA+Y G P +R+ +Y++ ++P S+DWREKG
Sbjct: 93 RLGMNQFADLTNDEFRAAYLGARIPA---ARRRGTAVGERYRHGGGAEELPESVDWREKG 149
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMD 199
AV +KNQG CGSCWAFSAV++VE + QI G+++ LSEQ+LV+CSTD N+GC+GGLMD
Sbjct: 150 AVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMD 209
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ 259
AF++II+N G+ TE DYPY+ G CD +E A +I +ED+P+ DE +L +AV Q
Sbjct: 210 AAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQ 269
Query: 260 PVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
PVSV +EA G+ F+ YK GV + C N DHGV VG+GT E+G YW+++NSWG W
Sbjct: 270 PVSVAIEAGGREFQLYKAGVFSGTCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGAKW 326
Query: 320 GESGYIRILRD----EGLCGIATEASYPV 344
GE GYIR+ R+ G CGIA ASYP
Sbjct: 327 GEDGYIRMERNVNATTGKCGIAMMASYPT 355
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 156/312 (50%), Positives = 201/312 (64%), Gaps = 16/312 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
+ +WMA HGRTY E+ R +F+ NL YI+ N G +++LG N F+DLTN+E
Sbjct: 44 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103
Query: 99 FRASYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
+RA+Y G RP R+ + + + D+P S+DWR KGAV +K+QG GSCWA
Sbjct: 104 YRATYLGARTRP----QRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSYGSCWA 159
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS +AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE D
Sbjct: 160 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 219
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY+ G CD ++ A TI YED+P DE +L +AV QPVSV +EA+G F+ Y
Sbjct: 220 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTQFQLYS 279
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
G+ CG DHGV VG+GT E+G YW++KNSWG +WGESGY+R+ R+ G
Sbjct: 280 SGIFTGSCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 336
Query: 333 LCGIATEASYPV 344
CGIA E SYP+
Sbjct: 337 KCGIAVEPSYPL 348
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 150/343 (43%), Positives = 209/343 (60%), Gaps = 9/343 (2%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+ + + + ++CA + + + ++ +E+W+ +H + Y EK R +FK
Sbjct: 6 TLVTSTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFK 65
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS-VSRQSSRPSTFKYQ 126
NL +I++ N N TYKLG N+F+D+TNEE+R Y G + + S + Y
Sbjct: 66 DNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYS 125
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+P +DWR KGAV IK+QG CGSCWAFS VA VE I +I GK + LSEQ+LVDC
Sbjct: 126 AGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC 185
Query: 187 STD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
N GC+GGLMD AFE+II+N G+ T+ DYPY+ G CD K+ A I +ED+P
Sbjct: 186 DRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGFEDVP 245
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
DE+AL +AV QPVS+ +EASG+ + Y+ GV +CG + DHGV VVG+G+ E+G
Sbjct: 246 PYDENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYGS---ENG 302
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YWL++NSWG WGE GY ++ R+ G CGI EASYPV
Sbjct: 303 VDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 157/349 (44%), Positives = 211/349 (60%), Gaps = 21/349 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKH-------------EQWMAQHGRTYKDELEKA 60
I IL++ S + S M S E H E W+ +HG++Y EK
Sbjct: 8 LTISILLMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNALGEKD 67
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK NL YI++ N N++YKLG +F+DLTNEE+R+ Y G ++
Sbjct: 68 KRFQIFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLSKNKS 127
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+ + +P SIDWREKG + +K+QG CGSCWAFSAVAA+E I I G LI LSE
Sbjct: 128 DRYLPKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSE 187
Query: 181 QQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
Q+LVDC N GC GGLMD AFE++I+N G+ TE DYPY++ G CD+ ++ A I
Sbjct: 188 QELVDCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKID 247
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
YED+P +E AL +AV QPVS+ +EA G+ F+ YK G+ +CG DHGV + G+GT
Sbjct: 248 SYEDVPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT 307
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
E+G YW+++NSWG WGE+GY+R+ R+ GLCG+A E SYPV
Sbjct: 308 ---ENGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGLAIEPSYPV 353
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 154/315 (48%), Positives = 210/315 (66%), Gaps = 13/315 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDEL---EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
++ +E+W+ ++G+ + + EK R +FK NL +I++ N E NR+YK+G N F+DL
Sbjct: 47 VMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDEHNSE-NRSYKVGLNRFADL 105
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
TNEE+R+ Y G R +R S + + + +P S+DWR++GAV +K+QG CGS
Sbjct: 106 TNEEYRSMYLG-ARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGS 164
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLAT 213
CWAFS +AAVEGI +I G LI LSEQ+LVDC N GC+GGLMD AF++II N G+ +
Sbjct: 165 CWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCNGGLMDYAFQFIINNGGIDS 224
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
E DYPY GTCD ++ A TI YED+P DE AL +AV QPVSV +EA G+ F+
Sbjct: 225 EEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQ 284
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--- 330
FY+ G+ CG DHGVA VG+GT E+G YW+++NSWG++WGESGYIR+ R+
Sbjct: 285 FYQSGIFTGRCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGESGYIRMERNIAT 341
Query: 331 -EGLCGIATEASYPV 344
G CGIA E SYP+
Sbjct: 342 ATGKCGIAIEPSYPI 356
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 154/317 (48%), Positives = 209/317 (65%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ + +E+W + H T L EK R +FK NL ++ NK ++ YKL N+F+D
Sbjct: 33 EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANLMHVHNTNKM-DKPYKLKLNKFAD 89
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
+TN EFR++Y G P + R + + F Y+ V VP S+DWR+KGAVT +K+QG C
Sbjct: 90 MTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQC 149
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGL 211
GSCWAFS V AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
TE++YPY+ ++GTCD K A +I +E++P DE ALL+AV QPVSV ++A G
Sbjct: 210 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 269
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
F+FY GV +C + +HGVA+VG+GT DG YW+++NSWG WGE GYIR+ R+
Sbjct: 270 FQFYSEGVFTGDCSTDLNHGVAIVGYGTT--VDGTNYWIVRNSWGPEWGEHGYIRMQRNI 327
Query: 331 ---EGLCGIATEASYPV 344
EGLCGIA SYP+
Sbjct: 328 SKKEGLCGIAMLPSYPI 344
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 156/341 (45%), Positives = 214/341 (62%), Gaps = 30/341 (8%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ I+ L C + + + + ++V +HEQWM Q+ R YKD EKA R +FK N+++
Sbjct: 8 ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN-RPVPSVSRQSSRPSTFKYQNVT-- 129
IE N GNR + LG N+F+DLTN+EFRA+ T +P P + F+Y+NV+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSP-----VKVSTGFRYENVSVD 122
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P +IDWR KGAVT IK+QG C EGI +I+ GKLI LSEQ+LVDC
Sbjct: 123 ALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVH 170
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
++ GC GGLMD AF++II+N GL TE+ YPY G C + +AAT+ +ED+P
Sbjct: 171 GEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC--KSGSNSAATVKGFEDVPAN 228
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
DE AL++AV QPVSV V+ F+FY GV+ CG + DHG+A +G+G + DG K
Sbjct: 229 DEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGTK 286
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YWL+KNSWG TWGE+GY+R+ +D G+CG+A E SYP
Sbjct: 287 YWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 327
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 155/311 (49%), Positives = 208/311 (66%), Gaps = 15/311 (4%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E W+ HG+ Y EK R IFK NL +I++ N+E +RTYK+G F+DLTNEE+RA
Sbjct: 62 YESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRE-SRTYKVGLTRFADLTNEEYRA 120
Query: 102 SYTG--YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
+ G ++R P +S +++ + D+P +DWR+KGAV +K+QG CGSCWAFS
Sbjct: 121 RFLGGRFSRK-PRLS--AAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGSCWAFS 177
Query: 160 AVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYP 218
+VAAVEGI QI G+LI LSEQ+LVDC N GC+GGLMD AF++II N G+ TE DYP
Sbjct: 178 SVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGIDTEEDYP 237
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRG 278
Y+ CD ++ A TI YED+P+ DE +L +AV QPVSV +EA G+AF+ Y+ G
Sbjct: 238 YKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQLYQSG 297
Query: 279 VLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGL 333
V CG + DHGV VG+GT ++G YW+++NSWG+ WGESGYIR+ R+ G
Sbjct: 298 VFTGRCGTDLDHGVVAVGYGT---DNGTDYWIVRNSWGKDWGESGYIRLERNVANITTGK 354
Query: 334 CGIATEASYPV 344
CGIA + SYP
Sbjct: 355 CGIAVQPSYPT 365
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 164/342 (47%), Positives = 221/342 (64%), Gaps = 18/342 (5%)
Query: 14 FVIIILVI----TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
F++ +LV+ C + + ++ +HE+WMA+HGR YKDE EKA RL +F+ N
Sbjct: 6 FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN-- 127
E I+ N G +++L TN F+DLT +EFRA+ TG RP P+ S + R F+Y+N
Sbjct: 66 AELIDSFNAAGTHSHRLATNRFADLTVQEFRAARTGL-RPRPAPSAGAGR---FRYENFS 121
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+ D S+DWR GAVT +K+QG G CWAFSAVAAVEG+ +I G+L+ LSEQ+LVDC
Sbjct: 122 LADAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCD 181
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ GC GGLMD AF+++ GLA+E+ YPYQ G C + AAAA+I +ED+P
Sbjct: 182 VSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQCRDGPC-RSSAAAAAASIRGHEDVP 240
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+ +E AL AV QPVSV + AFRFY GVL CG + +H + VG+GTA DG
Sbjct: 241 RNNEAALAAAVAHQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTA--ADG 298
Query: 306 AKYWLIKNSWGETWGESGYIRI---LRDEGLCGIATEASYPV 344
+YWL+KNSWG +WGE GY+RI +R EG+CG+A SYPV
Sbjct: 299 TRYWLMKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 340
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 313 bits (803), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 149/314 (47%), Positives = 207/314 (65%), Gaps = 10/314 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ +E W+ +H + Y EK R IFK N+ ++++ N N++YKLG N+F+DLTN+
Sbjct: 56 LLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTND 115
Query: 98 EFRASY-TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
E+R+ Y +G + R F +++ +P S+DWR++GAV +K+QG CGSCW
Sbjct: 116 EYRSLYLSGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCW 175
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEA 215
AFS V AVEGI +I G+LI LSEQ+LVDC N GC+GGLMD AFE+I++N G+ TE
Sbjct: 176 AFSTVGAVEGINKIVTGELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDTED 235
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
DYPY+ G CD+ ++ A TI YED+P DE +L +AV QPVSV +EA G+AF+ Y
Sbjct: 236 DYPYKGVDGLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLY 295
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----- 330
+ GV +CG DHGV VG+G+ E+G YW+++NSWG WGESGYIR+ R+
Sbjct: 296 ESGVFTGQCGTELDHGVVAVGYGS---ENGKDYWIVRNSWGPDWGESGYIRLERNVASTS 352
Query: 331 EGLCGIATEASYPV 344
G CGIA +ASYP
Sbjct: 353 TGKCGIAMQASYPT 366
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 313 bits (803), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 156/318 (49%), Positives = 207/318 (65%), Gaps = 14/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEF 91
E + + +WMA+H TY E+ R F+ NL YI++ N G +++LG N F
Sbjct: 35 EEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRF 94
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+DLTNEE+R++Y G R P R+ S + ++ + ++P S+DWR+KGAV +K+QG
Sbjct: 95 ADLTNEEYRSTYLG-ARTKPDRERKLS--ARYQAADNDELPESVDWRKKGAVGAVKDQGG 151
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKG 210
CGSCWAFSA+AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD AFE+II N G
Sbjct: 152 CGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGG 211
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
+ +E DYPY++ CD K+ A TI YED+P E +L +AV QP+SV +EA G+
Sbjct: 212 IDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGR 271
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
AF+ YK G+ CG DHGVA VG+GT E+G YWL++NSWG WGE+GYIR+ R+
Sbjct: 272 AFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVRNSWGSVWGENGYIRMERN 328
Query: 331 ----EGLCGIATEASYPV 344
G CGIA E SYP
Sbjct: 329 IKASSGKCGIAVEPSYPT 346
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 313 bits (802), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 155/313 (49%), Positives = 204/313 (65%), Gaps = 12/313 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ +E W+ +HG++Y EK R IFK NL +I++ N E N +YK+G N F+DLTNE
Sbjct: 46 VMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNE 105
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
E+R++Y G + P +S+ S + + +P S+DWR KGAV IK+QG CGSCWA
Sbjct: 106 EYRSTYLG-AKSKPKLSKVKS--DRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWA 162
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS V AVEGI QI G+LI LSEQ+LVDC N GC GGLMD FE+II N G+ T+ D
Sbjct: 163 FSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMDYGFEFIINNGGIDTDKD 222
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY CD+ ++ A TI YED+P +E AL +AV QPVSV +E G+AF+FY
Sbjct: 223 YPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYD 282
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----E 331
G+ +CG DHGV VVG+GT E G YW+++NSWG +WGE+GYIR+ R+
Sbjct: 283 SGIFTGKCGTALDHGVNVVGYGT---EKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSV 339
Query: 332 GLCGIATEASYPV 344
G CGIA E SYP+
Sbjct: 340 GKCGIAMEPSYPL 352
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 154/342 (45%), Positives = 216/342 (63%), Gaps = 15/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEK-----HEQWMAQHGRTYKDELEKAMRLTIFK 67
+F+ ++S H P + +E+W+ HG+ Y EK R IFK
Sbjct: 13 LFLCFAFSSALDMSIISYDQTHPPQRTDAEAMAIYEKWLTTHGKAYNAIGEKERRFEIFK 72
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL ++++ N +Y++G N F+DLTNEE+R+ + G N + S S++ + ++
Sbjct: 73 DNLRFVDEHNAVAG-SYRVGLNRFADLTNEEYRSMFLGGNMEMKERS-ASTKSDRYAFRA 130
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P S+DWREKGAV+ +K+QG CGSCWAFS ++AVEGI QI G+LI LSEQ+LVDC
Sbjct: 131 GDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELVDCD 190
Query: 188 TD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD F++II N G+ TE DYPY+ GTCD+ ++ A +I YED+P+
Sbjct: 191 KSYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYEDVPE 250
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
DE++L +AV QPVSV +EA G+AF+ Y+ GV CG N DHGV VG+GT E+G
Sbjct: 251 DDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGHCGTNLDHGVVAVGYGT---ENGV 307
Query: 307 KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YW ++NSWG WGE+GYI++ R+ G CGIA+ ASYP
Sbjct: 308 DYWTVRNSWGPKWGENGYIKLERNINATSGKCGIASMASYPT 349
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 164/338 (48%), Positives = 210/338 (62%), Gaps = 29/338 (8%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEF 91
S HE S+ E E+W+++H R Y EK R +FK NL +I++ N++ + +Y LG NEF
Sbjct: 50 SSHE-SLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKVS-SYWLGLNEF 107
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV------TDVPTSIDWREKGAVTH 145
+DLT++EF+A+Y G V + + +P S+DWR KGAVT
Sbjct: 108 ADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSVDWRSKGAVTG 167
Query: 146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEY 204
+KNQG CGSCWAFS VAAVEGI QI G L LSEQ+L+DC TD NNGC+GGLMD AF Y
Sbjct: 168 VKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCNGGLMDYAFSY 227
Query: 205 IIENKGLATEADYPYQQEQGTCDKQ--------------KEKAAAATIGKYEDLPKGDEH 250
I N GL TE YPY E+GTC + + AA TI YED+P+ +E
Sbjct: 228 IAHNGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISGYEDVPRNNEQ 287
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
ALL+A+ +QPVSV +EASG+ F+FY GV + CG DHGVA VG+GTA + G Y +
Sbjct: 288 ALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTAAK--GHDYII 345
Query: 311 IKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+KNSWG +WGE GYIR+ R +GLCGI ASYP
Sbjct: 346 VKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMASYPT 383
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 313 bits (802), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 161/320 (50%), Positives = 213/320 (66%), Gaps = 16/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYK----DELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNE 90
E S+ +E+W + + + + D E+ R +FKQN Y+ + NK + ++L N+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAGER--RFNVFKQNARYVHEGNKR-DMPFRLALNK 90
Query: 91 FSDLTNEEFRASYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQ 149
F+D+T +EFR +Y G R S+S F+Y + ++P ++DWR+KGAVT IK+Q
Sbjct: 91 FADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIEN 208
G CGSCWAFS + AVEGI +I GKL+ LSEQ+L+DC NN GC GGLMD AF++I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
G+ TE++YPYQ EQG+CD+ KE A A TI YED+P DE AL +AV QPVSV ++AS
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269
Query: 269 GQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
GQ F+FY GV EC + DHGVA VG+G DG KYW++KNSWGE WGE GYIR+
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGAT--RDGTKYWIVKNSWGEDWGEKGYIRMQ 327
Query: 329 R----DEGLCGIATEASYPV 344
R EGLCGIA +ASYP
Sbjct: 328 RGVSQTEGLCGIAMQASYPT 347
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 313 bits (801), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 161/320 (50%), Positives = 213/320 (66%), Gaps = 16/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYK----DELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNE 90
E S+ +E+W + + + + D E+ R +FKQN Y+ + NK + ++L N+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEER--RFNVFKQNARYVHEGNKR-DMPFRLALNK 90
Query: 91 FSDLTNEEFRASYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQ 149
F+D+T +EFR +Y G R S+S F+Y + ++P ++DWR+KGAVT IK+Q
Sbjct: 91 FADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIEN 208
G CGSCWAFS + AVEGI +I GKL+ LSEQ+L+DC NN GC GGLMD AF++I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
G+ TE++YPYQ EQG+CD+ KE A A TI YED+P DE AL +AV QPVSV ++AS
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269
Query: 269 GQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
GQ F+FY GV EC + DHGVA VG+G DG KYW++KNSWGE WGE GYIR+
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGAT--RDGTKYWIVKNSWGEDWGEKGYIRMQ 327
Query: 329 R----DEGLCGIATEASYPV 344
R EGLCGIA +ASYP
Sbjct: 328 RGVSQTEGLCGIAMQASYPT 347
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 150/309 (48%), Positives = 208/309 (67%), Gaps = 9/309 (2%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
++ W+ QHG+ Y E+ R IFK NL +I++ N N TYKLG N+F+DLTN+E+RA
Sbjct: 45 YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYRA 104
Query: 102 SYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
+ G +S PS+ + ++ ++P S+DWR+ GAV+ +K+QG CGSCWAFS
Sbjct: 105 KFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSPVKDQGSCGSCWAFST 164
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
+A VEGI +I G+L+ LSEQ+LVDC + GC+GGLMD AF++I++N G+ TE DYPY
Sbjct: 165 IATVEGINKIVSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFIMDNGGIDTEKDYPY 224
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
CD K+ A +I YED+P +E+AL +AV QPVS+ +EA G+AF+ Y+ GV
Sbjct: 225 LGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALKKAVAHQPVSIAIEAGGRAFQLYESGV 283
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCG 335
N ECG DHGV VG+GT +++G YW+++NSWG WGE+GYIR+ R + G CG
Sbjct: 284 FNGECGLALDHGVVAVGYGT--DDNGQDYWIVRNSWGSNWGENGYIRMERNINANTGKCG 341
Query: 336 IATEASYPV 344
IA EASYPV
Sbjct: 342 IAMEASYPV 350
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 152/340 (44%), Positives = 213/340 (62%), Gaps = 11/340 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
++ ++ L T + + + ++ + ++ +E+W+ +H + Y + +K R +FK NL
Sbjct: 7 IYTLLFLSFTLSYAIKTSTIINYTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQVFKDNL 66
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS-VSRQSSRPSTFKYQNVT 129
+I++ N N TYKLG N+F+D+TNEE+RA Y G + + S + +
Sbjct: 67 GFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGHRYAFSARD 126
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P +DWR KGAV IK+QG CGSCWAFS VA VE I +I GK + LSEQ+LVDC
Sbjct: 127 RLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRA 186
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC+GGLMD AFE+II+N G+ T+ DYPY+ G CD K+ A I YED+P D
Sbjct: 187 YNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVPPYD 246
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E+AL +AV QPVSV +EASG+A + Y+ GV +CG + DHGV VVG+G+ E+G Y
Sbjct: 247 ENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVVGYGS---ENGVDY 303
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
WL++NSWG WGE GY ++ R+ G CGI EASYPV
Sbjct: 304 WLVRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPV 343
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 165/349 (47%), Positives = 227/349 (65%), Gaps = 22/349 (6%)
Query: 11 IPMFVIIILVITCASQ----VVSGRSMHEPS----IVEKHEQWMAQHGRTYKDELEKAMR 62
+ + V+++ V C ++ + G S + S +VE E+W+A+H + Y EK R
Sbjct: 5 LSVAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEEKLHR 64
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
+FK NL+ I++ N+E +Y LG NEF+DLT++EF+ +Y G + + S +
Sbjct: 65 FEVFKDNLKLIDEINRE-VTSYWLGLNEFADLTHDEFKTTYLG----LSPPPARRSSSRS 119
Query: 123 FKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
F+Y+NV D+P ++DWR+KGAVT +KNQG CGSCWAFS VAAVEGI I G L LSE
Sbjct: 120 FRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSE 179
Query: 181 QQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTC-DKQKEKAAAATI 238
Q+L+DCS D N+GC+GG+MD AF YI + GL TE YPY E+G+C D +K ++ A +I
Sbjct: 180 QELIDCSVDGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVSI 239
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YED+P DE AL++A+ QPVSV +EASG+ F+FY GV + CG DHGVA VG+G
Sbjct: 240 SGYEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAVGYG 299
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYP 343
+ ++ G Y ++KNSWG WGE GYIR+ R EGLCGI ASYP
Sbjct: 300 S-DKGKGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSEGLCGINKMASYP 347
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 155/350 (44%), Positives = 210/350 (60%), Gaps = 23/350 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIV------EKHEQWMAQHGRTYKDELEKAMRLTIF 66
+ V ++ V + A ++ E + + +E+W H R ++ EK R F
Sbjct: 9 LLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTF 67
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST---- 122
K+N+ +I NK G+R Y+L N F D+ EEFR+++ + + + RQ S +
Sbjct: 68 KENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFA--DSRINDLRRQDSPAARAGAV 125
Query: 123 --FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
F Y + D P S+DWR++GAVT +K QGHCGSCWAFS V AVEGI I G L LSE
Sbjct: 126 PGFMYDSAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGSLASLSE 185
Query: 181 QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK---AAAAT 237
Q+L+DC TD NGC GGLM+ AFE+I G+ TEA YPY+ GTCD + +
Sbjct: 186 QELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGVVVV 245
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I ++ +P G E AL +AV QPVSV V+A GQAF+FY GV +CG + DHGVA VG+
Sbjct: 246 IDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 305
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILR---DEGLCGIATEASYPV 344
G +DG YW++KNSWG +WGE GYIR+ R + GLCGIA EAS+P+
Sbjct: 306 GVG--DDGTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 162/354 (45%), Positives = 218/354 (61%), Gaps = 22/354 (6%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEK 59
+ +I +F ++ + ++S H + ++ +E+W+ +HG+ Y EK
Sbjct: 10 TILIVLFTVLAVSSALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHGKVYNAVEEK 69
Query: 60 AMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSR 119
R IFK NL +IE+ N NRTYK+G N FSDL+NEE+R+ Y G + PS R +R
Sbjct: 70 EKRFQIFKDNLNFIEEHNAV-NRTYKVGLNRFSDLSNEEYRSKYLG-TKIDPS--RMMAR 125
Query: 120 PSTFKYQNVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
PS V D +P S+DWR++GAV +KNQ C CWAFSA+AAVEGI +I G L L
Sbjct: 126 PSRRYSPRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTAL 185
Query: 179 SEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
SEQ+L+DC T N GCSGGL+D AFE+II N G+ TE DYP+Q G CD+ K A A T
Sbjct: 186 SEQELLDCDRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADGICDQYKINARAVT 245
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I YE +P DE AL +AV QPVSV +EA G+ F+ Y+ G+ CG + DHGV VG+
Sbjct: 246 IDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSIDHGVTAVGY 305
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPVAM 346
GT E+G YW++KNSWGE WGE+GY+ + R+ G CGIA YP+ +
Sbjct: 306 GT---ENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPIKI 356
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 161/328 (49%), Positives = 214/328 (65%), Gaps = 16/328 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK-EGNRTYK 85
+V+ R+ E ++ +E W+ +G+ Y EK R IF NL YI+ N+ E N +Y
Sbjct: 25 IVAERTEEEVRLL--YEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYT 82
Query: 86 LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSR-PSTFK--YQNVTDVPTSIDWREKGA 142
LG F+DLTNEE+R++Y G +P R+++R P + N D+P +DWREKGA
Sbjct: 83 LGLTRFADLTNEEYRSTYLGV-KPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGA 141
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKA 201
V IK+QG CGSCWAFS VAAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 142 VAPIKDQGGCGSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYA 201
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F++II N G+ TE DYPY++ G CD ++ A +I YED+ + DEHAL AV QPV
Sbjct: 202 FQFIISNGGIDTEEDYPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPV 261
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
SV +E G++F+ YK G+ + CG + DHGV VG+GT E G YW+++NSWG++WGE
Sbjct: 262 SVAIEGGGRSFQLYKSGIFDGRCGIDLDHGVVAVGYGT---ESGKDYWIVRNSWGKSWGE 318
Query: 322 SGYIRILRD-----EGLCGIATEASYPV 344
+GYIR+ R+ G CGIA E SYP+
Sbjct: 319 AGYIRMERNLPSSSSGKCGIAIEPSYPI 346
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 155/328 (47%), Positives = 213/328 (64%), Gaps = 17/328 (5%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDE----LEKAMRLTIFKQNLEYIEKANKEGN 81
+ + S + + +E WM +HG+ ++ EK R IFK NL +I++ N + N
Sbjct: 34 HITTETSRSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK-N 92
Query: 82 RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
+YKLG F+DLTNEE+R+ Y G +P V + S R ++ + +P S+DWR++G
Sbjct: 93 LSYKLGLTRFADLTNEEYRSMYLG-AKPTKRVLKTSDR---YQARVGDALPDSVDWRKEG 148
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDK 200
AV +K+QG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLMD
Sbjct: 149 AVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDY 208
Query: 201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQP 260
AFE+II+N G+ TEADYPY+ G CD+ ++ A TI YED+P+ E +L +A+ QP
Sbjct: 209 AFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQP 268
Query: 261 VSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
+SV +EA G+AF+ Y GV + CG DHGV VG+GT E+G YW+++NSWG WG
Sbjct: 269 ISVAIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGT---ENGKDYWIVRNSWGNRWG 325
Query: 321 ESGYIRILRD----EGLCGIATEASYPV 344
ESGYI++ R+ G CGIA EASYP+
Sbjct: 326 ESGYIKMARNIEAPTGKCGIAMEASYPI 353
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 312 bits (799), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 209/317 (65%), Gaps = 13/317 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKAN-KEGNRTYKLGTNEFS 92
E + +E W+ +HGR + L E R +F NL +++ N + G ++LG N+F+
Sbjct: 49 EAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFA 108
Query: 93 DLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
DLTN+EFRA+Y G +P+ ++ +++ ++P S+DWREKGAV +KNQG C
Sbjct: 109 DLTNDEFRAAYLGAR--IPAARSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQC 166
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKG 210
GSCWAFSAV++VE I QI G+++ LSEQ+LV+CSTD N+GC+GGLMD AF +II+N G
Sbjct: 167 GSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGG 226
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
+ TE DYPY+ G CD + A +I +ED+P+ DE +L +AV QPVSV +EA G+
Sbjct: 227 IDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEAGGR 286
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
F+ YK GV + C N DHGV VG+GT E+G YW+++NSWG WGE+GYIR+ R+
Sbjct: 287 QFQLYKSGVFSGSCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGPKWGEAGYIRMERN 343
Query: 331 ----EGLCGIATEASYP 343
G CGIA ASYP
Sbjct: 344 INATTGKCGIAMMASYP 360
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 160/313 (51%), Positives = 197/313 (62%), Gaps = 16/313 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+W +H +D +KA R +FK N+ I + N+ + YKL N F D+T +EFR
Sbjct: 156 YERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFRR 213
Query: 102 SYTGYNRPVPSVSR-----QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
Y G + R S+ S+F Y + DVP S+DWR+KGAVT +K+QG CGSCW
Sbjct: 214 HYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCW 273
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEA 215
AFS +AAVEGI I L LSEQQLVDC T N GC+GGLMD AF+YI ++ G+A E
Sbjct: 274 AFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAED 333
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
YPY+ Q +C +K A TI YED+P DE AL +AV QPVSV +EASG F+FY
Sbjct: 334 AYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFY 391
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----E 331
GV + CG DHGVA VG+G DG KYWL+KNSWG WGE GYIR+ RD E
Sbjct: 392 SEGVFSGRCGTELDHGVAAVGYGVT--ADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKE 449
Query: 332 GLCGIATEASYPV 344
G CGIA EASYPV
Sbjct: 450 GHCGIAMEASYPV 462
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 162/346 (46%), Positives = 221/346 (63%), Gaps = 26/346 (7%)
Query: 16 IIILVITCASQVVSGRS----------MH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLT 64
+++LVI Q +GR+ +H + +I++ QW+ H R Y+ EK R
Sbjct: 12 LVLLVIAIGQQADAGRANAIVDYEGNQLHSDDAILDVFHQWLETHSRVYRSLSEKHHRFQ 71
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFK 124
IFK+N YI NK+ ++Y LG N+FSDLT++EFRA Y G +PV +RQ + + F
Sbjct: 72 IFKENFLYIHAHNKQ-QKSYWLGLNKFSDLTHQEFRAQYLG-TKPV---NRQR-KEANFM 125
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
Y++V P +DWR KGAVT +K+QG CGSCWAFSAV +VEG+ I G+L+ LSEQ+LV
Sbjct: 126 YEDVEAEP-KVDWRLKGAVTDVKDQGACGSCWAFSAVGSVEGVNAIKTGELVSLSEQELV 184
Query: 185 DCS-TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
DC N GC+GGLMD AFE+II+N G+ TE DYPY+ G CD+ + + I Y+D
Sbjct: 185 DCDRKQNQGCNGGLMDYAFEFIIKNGGIDTEKDYPYKARDGRCDEGRRNSKVVVIDDYQD 244
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P E AL++A+TK PVSV +EA G+ F+ Y+ GV CG DHGV VG+GT ++
Sbjct: 245 VPTQSESALMKALTKNPVSVAIEAGGRDFQHYQGGVFTGPCGSELDHGVLAVGYGT--DD 302
Query: 304 DGAKYWLIKNSWGETWGESGYIRILR-----DEGLCGIATEASYPV 344
DG YW++KNSWG WGE GYIR+ R +G CGI EAS+P+
Sbjct: 303 DGVNYWIVKNSWGPGWGEKGYIRMERFGSDSTDGKCGINIEASFPI 348
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 157/341 (46%), Positives = 220/341 (64%), Gaps = 32/341 (9%)
Query: 16 IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
++ ++ CAS V++ R + + ++VE+HE WM ++GR YKD EKA R +FK N+ +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKDNVAF 66
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST-FKYQN--VT 129
+E N N + LG N+F+DLT EEF+A+ G+ V P+T FKY+N V+
Sbjct: 67 VESFNTNKNNKFWLGVNQFADLTTEEFKAN-KGFKPTAEKV------PTTGFKYENLSVS 119
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+PT++DWR KGAVT IKNQG C AA+EGI +++ G LI LSEQ+LVDC T
Sbjct: 120 ALPTAVDWRTKGAVTPIKNQGQC---------AAMEGIVKLSTGNLISLSEQELVDCDTH 170
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
+ GC GG MD AFE++I+N GLATE++YPY+ G C + +AATI +ED+P
Sbjct: 171 SMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSK--SAATIKGHEDVPVN 228
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
+E AL++AV QPVSV V+AS + F Y GV+ CG DHG+A +G+G E DG K
Sbjct: 229 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGM--ESDGTK 286
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YW++KNSWG TWGE G++R+ +D G+CG+A + SYP
Sbjct: 287 YWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 327
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 161/359 (44%), Positives = 227/359 (63%), Gaps = 29/359 (8%)
Query: 3 LKFEKSFIIPMFVIIILVITCAS----------QVVSGRSMHEPSIVEKHEQWMAQHGRT 52
+K S + +F+ +I+V + VS RS E S + +E+W+ +HG+
Sbjct: 1 MKLLNSATVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRL--YEEWLVKHGKA 58
Query: 53 YKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS 112
EK R IFK NL +I++ N + N +Y+LG +F+DLTN+E+R+ Y G S
Sbjct: 59 QNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLG------S 111
Query: 113 VSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQI 170
++ + S+ +Y+ V D +P S+DWR++GAV +K+QG CGSCWAFS + AVEGI +I
Sbjct: 112 RLKRKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKI 171
Query: 171 TGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQ 229
G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE DYPY+ G CD+
Sbjct: 172 VTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQT 231
Query: 230 KEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCD 289
++ A TI YED+P E +L +A++ QP+SV +E G+AF+ Y G+ + CG + D
Sbjct: 232 RKNAKVVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLD 291
Query: 290 HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
HGV VG+GT E+G YW++KNSWG +WGESGYIR+ R+ G CGIA E SYP+
Sbjct: 292 HGVVAVGYGT---ENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPI 347
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 161/331 (48%), Positives = 208/331 (62%), Gaps = 9/331 (2%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
IIL+ CA +S R++ E S+VE H+QWM ++ RTY + E R IFK+NLEYIE
Sbjct: 9 IILLWACAYPTMS-RTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENF 67
Query: 77 NKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
N GN++YKLG N +SDLT+EEF AS+TG+ + +S R + DVPT+ D
Sbjct: 68 NNVGNKSYKLGLNRYSDLTSEEFIASHTGF-KVSDQLSDSKMRSVAIPFNLNDDVPTNFD 126
Query: 137 WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGG 196
WREKG VT +KNQ CG CWAF+AVAAVEGI +I G LI LSEQQLVDC ++GC GG
Sbjct: 127 WREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDRQSSGCGGG 186
Query: 197 LMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV 256
AF+ II+++G+ E DYPY+ + + AA I Y +P DE LL+AV
Sbjct: 187 DFVLAFDSIIKSRGIVKEDDYPYKANDVQTCQLGQIPGAAQINGYFKVPANDEQQLLRAV 246
Query: 257 TKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWG 316
+QPVSV + S F Y GV CG +H V ++G+G +E G KYWLIKNSWG
Sbjct: 247 LQQPVSVAISTS-YDFHHYMGGVYEGSCGPKLNHAVTIIGYGVSEA--GKKYWLIKNSWG 303
Query: 317 ETWGESGYIRILRDE----GLCGIATEASYP 343
ETWGE GY+++LR+ G C IA A+YP
Sbjct: 304 ETWGEKGYMKVLRESSATGGQCSIAVHAAYP 334
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 156/340 (45%), Positives = 223/340 (65%), Gaps = 18/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR YKD+ EKA R +FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR++ T +PS +R P+ F+ +NV
Sbjct: 68 IESFN-AGNHKFWLGVNQFADLTNDEFRSTKTNKGF-IPSTTRV---PTGFRNENVNIDA 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
+P ++DWR KG VT IK+QG CG CWAFSAVAA+EGI +++ GKLI S + + + +
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSL-LTVMS 181
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKA-AAATIGKYEDLPKGDE 249
GC GGLMD AF++II+N GL TE++YPY DK K + + A+I YED+P +E
Sbjct: 182 MGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVD---DKFKSVSNSVASIKGYEDVPANNE 238
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
AL++AV QPVSV V+ F+FYK GV+ CG + DHG+ +G+G A DG KYW
Sbjct: 239 AALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKA--SDGTKYW 296
Query: 310 LIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
L+KNSWG TWGE+G++R+ +D G+CG+A E SYP A
Sbjct: 297 LLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 336
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 154/344 (44%), Positives = 218/344 (63%), Gaps = 20/344 (5%)
Query: 13 MFVIIILVITCASQV----VSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMRLT 64
+F++ + V+ C++ + G + + + + K E W+A+H + Y+ EK R
Sbjct: 12 LFLVFVSVLACSALANEFSILGYAPEDLTSIHKVIHLFESWLAKHSKIYESLDEKLHRFE 71
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFK 124
IF NL++I+ NK+ + Y LG NEF+DLT+EEF+ + G +P R+ F
Sbjct: 72 IFMDNLKHIDDTNKKVS-NYWLGLNEFADLTHEEFKNKFLGLKGELPE--RKDESIEEFS 128
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
Y++ D+P S+DWR+KGAV +KNQG CGSCWAFS VAAVEGI QI G L LSEQ+L+
Sbjct: 129 YRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQELI 188
Query: 185 DCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
DC T NNGC+GGLMD AF Y++ + GL E +YPY +GTCD++K+ + TI Y D
Sbjct: 189 DCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSETVTISGYHD 247
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P+ +E + L+A+ QP+SV +EASG+ F+FY GV + CG DHGVA VG+GT +
Sbjct: 248 VPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTTK-- 305
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
G Y +++NSWG WGE GYIR+ R G+CG+ ASYP
Sbjct: 306 -GLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGLYMMASYP 348
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 156/322 (48%), Positives = 211/322 (65%), Gaps = 16/322 (4%)
Query: 30 GRSMHEPSIVEKHEQWMAQHGRTYKDE--LEKAMRLTIFKQNLEYIEKANKEGNRTYKLG 87
GRS E ++ +E W+ +HG+ +EK R IFK NL +I+ NK+ N +Y+LG
Sbjct: 33 GRSDAE--VMSIYEAWLVKHGKAQNQNSLVEKDRRFEIFKDNLRFIDDHNKK-NLSYRLG 89
Query: 88 TNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
F+DLTN+E+R+ Y G R S R ++ + ++P SIDWR+KGAV +K
Sbjct: 90 LTRFADLTNDEYRSKYLGAKMEKKGERRTSQR---YEARVGDELPESIDWRKKGAVAEVK 146
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYII 206
+QG CGSCWAFS + AVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II
Sbjct: 147 DQGSCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFII 206
Query: 207 ENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVE 266
+N G+ T+ DYPY+ GTCD+ ++ A TI YED+P E +L +AV QPVSV +E
Sbjct: 207 KNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIE 266
Query: 267 ASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
A G+AF+ Y G+ + CG DHGV VG+GT E+G YW+++NSWG++WGESGY++
Sbjct: 267 AGGRAFQLYDSGIFDGTCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLK 323
Query: 327 ILRD----EGLCGIATEASYPV 344
+ R+ G CGIA E SYP+
Sbjct: 324 MARNIASSSGKCGIAIEPSYPI 345
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 157/346 (45%), Positives = 219/346 (63%), Gaps = 10/346 (2%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLT 64
EK ++ + ++++ + + E S+ + +E+W + H +D EK R
Sbjct: 1 MEKVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRSYH-TVSRDLEEKNKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST-F 123
+FK+N +++ K N+ ++ YKL N+F+D+TN EFR+SY G + R R + F
Sbjct: 60 VFKENTKHVHKVNQM-DKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTGGF 118
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
++ T +P S+DWR+KGAVT IK+QG CGSCWAFS V VEGI QI +L+ LSEQQL
Sbjct: 119 MHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQQL 178
Query: 184 VDCS-TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
+DC +D++GC+GGLM+ AFE+I +N G+ TE +YPY+ + CD K A TI +E
Sbjct: 179 IDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDGHE 238
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
+P DE AL++AV QPVSV ++A G +FY GV + ECG DHGVA+VG+GT
Sbjct: 239 SVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTT-- 296
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
DG KYW++KNSWG WGE GYIR+ R EG CGIA EASYPV
Sbjct: 297 LDGTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPV 342
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 160/319 (50%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ +E+W +H +D +KA R +FK N+ I + N+ + YKL N F D+
Sbjct: 42 EEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDM 99
Query: 95 TNEEFRASYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQG 150
T +EFR Y G ++R + SS ++F Y + DVP S+DWR+KGAVT +K+QG
Sbjct: 100 TADEFRRHYAGSRVAHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAVTDVKDQG 159
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS +AAVEGI I L LSEQQLVDC T N GC+GGLMD AF+YI ++
Sbjct: 160 QCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHG 219
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
G+A E YPY+ Q +C +K A TI YED+P DE AL +AV QPVSV +EASG
Sbjct: 220 GVAAEDAYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASG 277
Query: 270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
F+FY GV + CG DHGV VG+G DG KYWL+KNSWG WGE GYIR+ R
Sbjct: 278 SHFQFYSEGVFSGRCGTELDHGVTAVGYGVT--ADGTKYWLVKNSWGPEWGEKGYIRMAR 335
Query: 330 D----EGLCGIATEASYPV 344
D EG CGIA EASYPV
Sbjct: 336 DVAAKEGHCGIAMEASYPV 354
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 155/311 (49%), Positives = 207/311 (66%), Gaps = 12/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R IFK NL++I++ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G SR+ P F Y++V ++P S+DWR+KGAV +KNQG CGSCWA
Sbjct: 102 EFKNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T +NGC+GGLMD AF +I+EN GL E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTC+ KE+ TI Y D+P+ +E +LL+A+ Q +SV +EASG+ F+FY
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYS 277
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI---LRDEGL 333
GV + CG + DHGVA VG+GTA+ G Y ++KNSWG WGE GYIR+ L G
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYIIVKNSWGSKWGEKGYIRMRGTLETRGN 334
Query: 334 CGIATEASYPV 344
ASYP+
Sbjct: 335 LRYLQMASYPL 345
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 159/345 (46%), Positives = 212/345 (61%), Gaps = 14/345 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
S I + L+ + S RS E ++ +E+W+ +H + Y EK R IFK
Sbjct: 3 SITITSLLFFSLITLSLAMDTSMRSNEE--VMTMYEEWLVKHHKVYNGLGEKDQRFEIFK 60
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ- 126
NL +I++ N + N TYK+G N+F+D TNEE+R Y G + + +Y
Sbjct: 61 DNLGFIDEHNAQ-NYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTGHRYAF 119
Query: 127 NVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
N D +P +DWR KGAV HIK+QG CGSCWAFS +A VE I +I GKL+ LSEQ+LVD
Sbjct: 120 NSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVD 179
Query: 186 CSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
C N GC+GGLMD AFE+I+EN G+ TE DYPY+ +G CD ++ A +I YED+
Sbjct: 180 CDRAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKVVSIDGYEDV 239
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
P +E+AL +AV QPVSV +EA G+A + Y+ GV CG N DHGV VVG+G E+
Sbjct: 240 PAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYGF---EN 296
Query: 305 GAKYWLIKNSWGETWGESGYIRILR-----DEGLCGIATEASYPV 344
G YWL++NSWG WGE GY ++ R + G CGIA +ASYPV
Sbjct: 297 GVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPV 341
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 155/313 (49%), Positives = 202/313 (64%), Gaps = 17/313 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+EQW+ +HG+ Y EK R IFK NL +I+ N + NRTYKLG N F+DLTNEE+RA
Sbjct: 4 YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNAD-NRTYKLGLNRFADLTNEEYRA 62
Query: 102 SYTGY----NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
Y G NR QS+R + + ++P S+DWR + AV +K+QG+CGSCWA
Sbjct: 63 RYLGTRIDPNRRFVKTKTQSNR---YAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWA 119
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLMD A+E+II N G+ +E D
Sbjct: 120 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEED 179
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY+ GTCD+ ++ A TI YED+P DE AL +AV QPVSV +E G+ F+ Y
Sbjct: 180 YPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYV 239
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----E 331
GV CG DHGV VG+G+ + D YW+++NSWG +WGE GY+R+ R+
Sbjct: 240 SGVFTGRCGTALDHGVVAVGYGSVKGHD---YWIVRNSWGASWGEEGYVRLERNLAKSRS 296
Query: 332 GLCGIATEASYPV 344
G CGIA E SYP+
Sbjct: 297 GKCGIAIEPSYPI 309
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 152/320 (47%), Positives = 207/320 (64%), Gaps = 12/320 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR-TYKLGTNEFSD 93
+ ++ + +E+W H R ++ EK R FK+N+ +I NK G+R +Y+L N F D
Sbjct: 39 DEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGD 97
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPST----FKYQNVTDVPTSIDWREKGAVTHIKNQ 149
+ EEFR+++ R+SS +T F Y + TDVP S+DWR+ GAVT +KNQ
Sbjct: 98 MGPEEFRSTFADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHGAVTAVKNQ 157
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENK 209
G CGSCWAFS V AVEGI I G L+ LSEQ+LVDC T NGC GGLM+ AF++I
Sbjct: 158 GRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDTAENGCQGGLMENAFDFIKSYG 217
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAA--ATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
G+ TE+ YPY+ GTCD + + +I ++ +P G E AL +AV +QPVSV ++A
Sbjct: 218 GITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVSVAIDA 277
Query: 268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
GQAF+FY GV +CG + DHGVAVVG+G + + DG YW++KNSWG +WGE GYIR+
Sbjct: 278 GGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVS-DVDGTPYWIVKNSWGPSWGEGGYIRM 336
Query: 328 LR---DEGLCGIATEASYPV 344
R + GLCGIA EAS+P+
Sbjct: 337 QRGAGNGGLCGIAMEASFPI 356
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 155/325 (47%), Positives = 205/325 (63%), Gaps = 19/325 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT--------YKL 86
E ++ E + +W + H + EK R FK N+ +I N N T Y+L
Sbjct: 35 EEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRL 94
Query: 87 GTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
N F D+ EFR+++ G P+ +R + F Y V D+P ++DWR+KGAVT +
Sbjct: 95 RLNRFGDMDQAEFRSTFAG---PLHRHTRPAQSIPGFIYDTVKDIPQAVDWRQKGAVTGV 151
Query: 147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEY 204
K+QG CGSCWAFSAVA+VEG+ I G L+ LSEQ+L+DC T D+NGC GGLM+ AFE+
Sbjct: 152 KDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEF 211
Query: 205 IIENKG-LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSV 263
I + G LATEA YPY GTC+ + + + I ++ +P G+E AL +AV QPVSV
Sbjct: 212 IAHSAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQPVSV 271
Query: 264 CVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
++A GQAF+FY GV +CG DHGVAVVG+G A EEDG +YW++KNSWG WGE G
Sbjct: 272 AIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVA-EEDGKEYWIVKNSWGPGWGEHG 330
Query: 324 YIRILRDE----GLCGIATEASYPV 344
Y+R+ RD GLCGIA EASYPV
Sbjct: 331 YVRMQRDSGVDGGLCGIAMEASYPV 355
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 310 bits (795), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 158/315 (50%), Positives = 205/315 (65%), Gaps = 18/315 (5%)
Query: 40 EKHEQWMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
E +E+W + H T L EK R +FK N+ Y+ NK+ ++ YKL N+F+D+TN E
Sbjct: 36 ELYERWRSHH--TVSRSLDEKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHE 92
Query: 99 FRASYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
FR Y G ++R SR + TF Y + VP ++DWR+KGAVT +K+QG CGS
Sbjct: 93 FRHHYAGSKIKHHRTFLGASRANG---TFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGS 149
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLAT 213
CWAFS V AVEGI QI +L+ LSEQ+LVDC T N GC+GGLMD AFE+I + G+ T
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINT 209
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
E +YPY E G CD QK + +I +ED+P DE +LL+AV QPVSV ++ASG F+
Sbjct: 210 EENYPYMAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQ 269
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR---- 329
FY GV +CG DHGVA+VG+GT D KYW++KNSWG WGE GYIR+ R
Sbjct: 270 FYSEGVFTGDCGTELDHGVAIVGYGTT--LDRTKYWIVKNSWGPEWGEKGYIRMQREIDA 327
Query: 330 DEGLCGIATEASYPV 344
+EGLCGIA + SYP+
Sbjct: 328 EEGLCGIAMQPSYPI 342
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 156/326 (47%), Positives = 215/326 (65%), Gaps = 19/326 (5%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYK 85
VS RS E S + +E+W+ +HG+ EK R IFK NL +I++ N + N +Y+
Sbjct: 28 HTVSSRSDAEVSRL--YEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84
Query: 86 LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAV 143
LG +F+DLTN+E+R+ Y G S ++ + S+ +Y+ V D +P S+DWR++GAV
Sbjct: 85 LGLTKFADLTNDEYRSMYLG------SRLKRKATKSSLRYEVRVGDAIPESVDWRKEGAV 138
Query: 144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAF 202
+K+QG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLMD AF
Sbjct: 139 AEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAF 198
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
E+II N G+ TE DYPY+ G CD+ ++ A TI YED+P E +L +A++ QP+S
Sbjct: 199 EFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPIS 258
Query: 263 VCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
V +E G+AF+ Y G+ + CG + DHGV VG+GT E+G YW++KNSWG +WGES
Sbjct: 259 VAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT---ENGKDYWIVKNSWGTSWGES 315
Query: 323 GYIRILRD----EGLCGIATEASYPV 344
GYIR+ R+ G CGIA E SYP+
Sbjct: 316 GYIRMERNIASSAGKCGIAVEPSYPI 341
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 155/325 (47%), Positives = 214/325 (65%), Gaps = 17/325 (5%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYK 85
VS RS E S + +E+W+ +HG+ EK R IFK NL +I++ N + N +Y+
Sbjct: 28 HTVSSRSDVEVSRL--YEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84
Query: 86 LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVT 144
LG +F+DLTN+E+R+ Y G + R++++ S V D +P S+DWR++GAV
Sbjct: 85 LGLTKFADLTNDEYRSMYLG-----SRLKRKATKTSLRYEARVGDAIPESVDWRKEGAVA 139
Query: 145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFE 203
+K+QG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLMD AFE
Sbjct: 140 EVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFE 199
Query: 204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSV 263
+II+N G+ TE DYPY+ G CD+ ++ A TI YED+P E +L +A++ QP+SV
Sbjct: 200 FIIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPISV 259
Query: 264 CVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
+E G+AF+ Y G+ + CG + DHGV VG+GT E+G YW++KNSWG +WGESG
Sbjct: 260 AIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT---ENGKDYWIVKNSWGTSWGESG 316
Query: 324 YIRILRD----EGLCGIATEASYPV 344
YIR+ R+ G CGIA E SYP+
Sbjct: 317 YIRMERNIASSAGKCGIAVEPSYPI 341
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 152/350 (43%), Positives = 215/350 (61%), Gaps = 10/350 (2%)
Query: 2 VLKFEKSFIIPMFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ + K+ ++ V + V C + R + + ++ + +E+W H EK
Sbjct: 1 MAQLAKTLLLVALVAMSAVELCRAIEFDERDLASDEALWDLYERWQTHHHVHRHHG-EKG 59
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R FK+N+ +I NK G+R Y+L N F D+ EEFR+++ + + P
Sbjct: 60 RRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDMGREEFRSTFADSRINDLRRAESPAAP 119
Query: 121 ST--FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
+ F Y VTD+P S+DWR++GAVT +K+QGHCGSCWAFS V +VEGI I G L+ L
Sbjct: 120 AVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRTGSLVSL 179
Query: 179 SEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDK-QKEKAAAAT 237
SEQ+L+DC TD NGC GGLM+ AFE+I G+ TE+ YPY+ GTCD + + +
Sbjct: 180 SEQELIDCDTDENGCQGGLMENAFEFIKSYGGVTTESAYPYRASNGTCDSVRSRRGQIVS 239
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I ++ +P G E AL +AV QPVSV ++A GQAF+FY GV +CG + DHGVA VG+
Sbjct: 240 IDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 299
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILR---DEGLCGIATEASYPV 344
G + +DG YW++KNSWG +WGE GYIR+ R + GLCGIA EAS+P+
Sbjct: 300 GVS--DDGTAYWIVKNSWGPSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 347
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 153/317 (48%), Positives = 208/317 (65%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ + +E+W + H T L EK R +FK NL ++ NK ++ YKL N+F+D
Sbjct: 32 EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANLMHVHNTNKM-DKPYKLKLNKFAD 88
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
+TN EFR++Y G + R + + F Y+ V VP S+DWR+KGAVT +K+QG C
Sbjct: 89 MTNHEFRSTYAGSKVNHHRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQC 148
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGL 211
GSCWAFS V AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 149 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 208
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
TE++YPY+ ++GTCD K A +I +E++P DE ALL+AV QPVSV ++A G
Sbjct: 209 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 268
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
F+FY GV +C + +HGVA+VG+GT DG YW+++NSWG WGE GYIR+ R+
Sbjct: 269 FQFYSEGVFTGDCSTDLNHGVAIVGYGTT--VDGTNYWIVRNSWGPEWGEHGYIRMQRNI 326
Query: 331 ---EGLCGIATEASYPV 344
EGLCGIA SYP+
Sbjct: 327 SKKEGLCGIAMLPSYPI 343
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 149/271 (54%), Positives = 188/271 (69%), Gaps = 11/271 (4%)
Query: 81 NRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
N+ YKLG N+F+DLTNEEF+AS N+ + R +TFKY+N + +P+++DWR+K
Sbjct: 7 NKLYKLGINKFADLTNEEFKASR---NKFKGHMCSSIIRTTTFKYENASAIPSTVDWRKK 63
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLM 198
GAVT +KNQG CGSCWAFSAVAA EGI Q++ GKL+ LSEQ+L+DC T + GC GGLM
Sbjct: 64 GAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGGLM 123
Query: 199 DKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK 258
D AF++II+N GL+TE YPY+ GTC+ + A TI YED+P +E AL +AV
Sbjct: 124 DDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQKAVAN 183
Query: 259 QPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
QP+SV ++ASG F+FY GV CG DHGV VG+G DG KYWL+KNSWG
Sbjct: 184 QPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVG--NDGTKYWLVKNSWGAD 241
Query: 319 WGESGYIRILRD----EGLCGIATEASYPVA 345
WGE GYIR+ R EGLCGIA +ASYP A
Sbjct: 242 WGEEGYIRMQRGIDAAEGLCGIAMQASYPTA 272
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 160/317 (50%), Positives = 198/317 (62%), Gaps = 13/317 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ +E+W +H +D +KA R +FK+N+ I N+ + YKL N F D+
Sbjct: 40 EEALWALYERWRGRHA-VARDLGDKARRFNVFKENVRLIHDFNQR-DEPYKLRLNRFGDM 97
Query: 95 TNEEFRASYTGYNRPVPSVSR--QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
T +EFR Y G + R + S+F Y D+PTS+DWR+KGAVT +K+QG C
Sbjct: 98 TADEFRRHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKDQGQC 157
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGL 211
GSCWAFS +AAVEGI I L LSEQQLVDC T N GC GGLMD AF+YI ++ G+
Sbjct: 158 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAKHGGV 217
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
A E YPY+ Q +C +K A A TI YED+P DE AL +AV QPVSV +EASG
Sbjct: 218 AAEDAYPYKARQASC--KKSPAPAVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSH 275
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
F+FY GV CG DHGV VG+G A DG KYW++KNSWG WGE GYIR+ RD
Sbjct: 276 FQFYSEGVFAGRCGTELDHGVTAVGYGVA--ADGTKYWVVKNSWGPEWGEKGYIRMARDV 333
Query: 331 ---EGLCGIATEASYPV 344
EG CGIA EASYPV
Sbjct: 334 AAKEGHCGIAMEASYPV 350
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 152/305 (49%), Positives = 203/305 (66%), Gaps = 12/305 (3%)
Query: 46 MAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG 105
+ +H + Y K R IFK NL +I++ NK N+++KLG N+F+DL+NEE+++ + G
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVE 165
R V R+ FKY ++P S+DWREKGAV +K+QG CGSCWAFS VAAVE
Sbjct: 71 -GRMV--RDRKGFESDRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVE 127
Query: 166 GITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQG 224
GI QI G LI LSEQ+LVDC N GC+GG MD AFE+I++N G+ TE DYPY+ G
Sbjct: 128 GINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDG 187
Query: 225 TCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAEC 284
CD+ ++ A TI +ED+P+ DE +L +AV QPVSV +EA G+AF+ Y+ G+ N C
Sbjct: 188 QCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGLC 247
Query: 285 GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR-----DEGLCGIATE 339
G + DHGV VG+GT EDG YW+++NSWG WGE+GYIR+ R + G CGIA +
Sbjct: 248 GTDLDHGVVAVGYGT---EDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQ 304
Query: 340 ASYPV 344
SYP
Sbjct: 305 PSYPT 309
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 156/353 (44%), Positives = 223/353 (63%), Gaps = 17/353 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQV-VSGRSMHEPSIVEK----HEQWMAQHGRTYKD 55
+ +K+ ++ +FV I+ A + + G + + + + K E W+ +H + Y+
Sbjct: 3 FIFSSKKTSLLFLFVSILACSALAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFYES 62
Query: 56 ELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR 115
EK R IF NL++I++ NK+ + Y LG NEF+DLT+EEF+ + G+ +
Sbjct: 63 LDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLTHEEFKHKFLGFKGELAERKD 121
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKL 175
+SS+ F Y++ D+P S+DWR+KGAV +KNQG CGSCWAFS VAAVEGI QI G L
Sbjct: 122 ESSKE--FGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNL 179
Query: 176 IELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAA 234
LSEQ+L+DC T NNGC+GGLMD AF Y++ + GL E +YPY +GTCD++K+ +
Sbjct: 180 TMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSE 238
Query: 235 AATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAV 294
TI Y D+P+ DE + L+A+ QP+SV +EASG+ F+FY GV + CG DHGVA
Sbjct: 239 KVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAA 298
Query: 295 VGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYP 343
VG+GT + G Y +++NSWG WGE GYIR+ R G+CG+ ASYP
Sbjct: 299 VGYGTTK---GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYP 348
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 152/311 (48%), Positives = 203/311 (65%), Gaps = 14/311 (4%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK----ANKEGNRTYKLGTNEFSDLTNEE 98
+ W+ +H + Y EK R IF+ NLE+I++ N G ++LG N+F+DLTN+E
Sbjct: 6 QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65
Query: 99 FRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAF 158
FR Y G RP + S +S R + + ++P S+DWR+KGAV+H+K+QG CGSCWAF
Sbjct: 66 FRRIYFGVKRPEKAESVKSDR---YAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAF 122
Query: 159 SAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADY 217
SA+ AVEGI +I G LI LSEQ+LVDC T N+GC GGLMD AF +II N G+ T+ DY
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKDY 182
Query: 218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKR 277
PY+ G+CD ++ A TI ED+P +E AL +AV QPV + +EA G+ F+ YK
Sbjct: 183 PYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYKS 242
Query: 278 GVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGL 333
GV CG + DHGV VG+GT +DG YW+++NSWG+ WGE GYIR+ R+ G
Sbjct: 243 GVFTGSCGTSLDHGVVAVGYGTT--DDGKDYWIVRNSWGDDWGEDGYIRMERNTESKSGK 300
Query: 334 CGIATEASYPV 344
CGIA E SYPV
Sbjct: 301 CGIAIEPSYPV 311
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 167/334 (50%), Positives = 212/334 (63%), Gaps = 22/334 (6%)
Query: 27 VVSGRSMHEPSIVEKHE-------QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE 79
V +G + P+ V K + W +HG+ Y E+A R ++K NLEYI++ + E
Sbjct: 23 VANGDVIRMPTDVGKDQLLAGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQR-HSE 81
Query: 80 GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST--FKYQNVTDVPTSIDW 137
N +Y LG +F+DLTNEEFR YTG R S + R +T F+Y N ++ P SIDW
Sbjct: 82 KNLSYWLGLTKFADLTNEEFRRQYTG-TRIDRSRRLKKGRNATGSFRYAN-SEAPKSIDW 139
Query: 138 REKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGG 196
REKGAVT +K+QG CGSCWAFSAV +VEGI I G I LS Q+LVDC N GC+GG
Sbjct: 140 REKGAVTSVKDQGSCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGG 199
Query: 197 LMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV 256
LMD AF+++I+N G+ TE DYPYQ G CD K A TI YED+P+ DE AL +AV
Sbjct: 200 LMDYAFDFVIQNGGIDTEKDYPYQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAV 259
Query: 257 TKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWG 316
QPVSV +EA G+ F+ Y GV CG + DHGV VG+G+ E G YW++KNSWG
Sbjct: 260 AGQPVSVAIEAGGRDFQLYSGGVFTGRCGTDLDHGVLAVGYGS---EKGLDYWIVKNSWG 316
Query: 317 ETWGESGYIRI---LRDE---GLCGIATEASYPV 344
E WGESGY+R+ L+D+ GLCGI E SY V
Sbjct: 317 EYWGESGYLRMQRNLKDDNGYGLCGINIEPSYAV 350
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 156/348 (44%), Positives = 222/348 (63%), Gaps = 18/348 (5%)
Query: 11 IPMFVIIILVITCASQ-----VVSGRSMHEPSI----VEKHEQWMAQHGRTYKDELEKAM 61
+P+ V+ + C++ V G S + ++ V + W +H + Y EK
Sbjct: 5 LPVLVLFLAFAACSASHHRDPSVVGYSQEDLALPNRLVNLFKSWSVKHRKIYVSPKEKLK 64
Query: 62 RLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS 121
R IFKQNL +I + N++ N +Y LG N+F+D+T+EEF+A++ G + + + Q+ P+
Sbjct: 65 RYGIFKQNLMHIAETNRK-NGSYWLGLNQFADITHEEFKANHLGLKQGLSRMGAQTRTPT 123
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
TF+Y ++P S+DWR KGAVT +KNQG CGSCWAFS+VAAVEGI QI GKL+ LSEQ
Sbjct: 124 TFRYAAAANLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQ 183
Query: 182 QLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
+L+DC T ++GC GGLMD AF YI+ ++G+ E DYPY E+G C +++ A TI
Sbjct: 184 ELMDCDTMLDHGCEGGLMDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQPYANVVTITG 243
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
YED+P+ E +LL+A+ QPVSV + A + F+FYK GV + C D DH + VG+G++
Sbjct: 244 YEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYKGGVFDGSCSDELDHALTAVGYGSS 303
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRIL----RDEGLCGIATEASYPV 344
G Y +KNSWG+ WGE GY+RI + EG+CGI T ASYPV
Sbjct: 304 Y---GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPV 348
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 152/343 (44%), Positives = 214/343 (62%), Gaps = 12/343 (3%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+ + +F ++++ ++ S + + +E +EQW+ ++ + Y EK R IF
Sbjct: 9 TLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEIFT 68
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL+YIE+ N N+T+++G F+DLTN+EFRA Y R +R + + Y+
Sbjct: 69 DNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYL---RSKMERTRVPVKGERYLYKV 125
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P IDWR KGAV +K+QG+CGSCWAFSA+ AVEGI QI G+LI LSEQ+LVDC
Sbjct: 126 GDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185
Query: 188 TD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLP 245
T N GC GGLMD AF++IIEN G+ TE DYPY + C+ K+ + TI YED+P
Sbjct: 186 TSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVP 245
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+ DE +L +A+ QP+SV +EA G+AF+ YK GV CG + DHGV VG+G+ E G
Sbjct: 246 QNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYGS---EGG 302
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YW+++NSWG WGESGY ++ R+ G CG+A ASYP
Sbjct: 303 QDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 222/356 (62%), Gaps = 33/356 (9%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ--------------WMAQHGRTYKDE 56
+ M +++ + C++ S H+PS+V ++ W +H + Y
Sbjct: 14 LSMLFLLLGFVACSATA----SHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASP 69
Query: 57 LEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQ 116
EK R IFK+NL +I + N+ N +Y LG N F+D+ +EEF+ASY G P ++R+
Sbjct: 70 KEKVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK---PGLARR 125
Query: 117 SSRP---STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGG 173
++P +TF+Y N ++P ++DWR+KGAVT +KNQG CGSCWAFS VAAVEGI QI G
Sbjct: 126 DAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTG 185
Query: 174 KLIELSEQQLVDC-STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK 232
KL+ LSEQ+L+DC +T N+GC GGLMD AF YI+ N+G+ TE DYPY E+G C +++
Sbjct: 186 KLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPH 245
Query: 233 AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGV 292
+ TI YED+P E +LL+A+ QPVSV + A + F+FYK G+ + ECG DH +
Sbjct: 246 SKVITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHAL 305
Query: 293 AVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
VG+G+ +D Y ++KNSWG+ WGE GY RI R EG+C I ASYP
Sbjct: 306 TAVGYGSYYGQD---YIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYPT 358
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 152/342 (44%), Positives = 218/342 (63%), Gaps = 14/342 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+F+ +F+I +L + I + E W +HG+TY + +K R IF+
Sbjct: 2 NFLSALFLITLLFF----NLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFE 57
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
+N E+++K N +GN +Y L N F+DLT+ EF+AS G + S S + SR + +
Sbjct: 58 ENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLS--AFSTSGKLSRRNFPLHDF 115
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
V DVP SIDWR+KGAV+ +K+QG+CG+CW+FSA A+EGI +I G L+ LSEQ+LVDC
Sbjct: 116 VGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCD 175
Query: 188 TD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
NNGC GGLMD A++++IEN G+ TE DYPYQ + TC+K+K K TI Y D+P+
Sbjct: 176 RSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQ 235
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
+E LL+AV QPVSV + S +AF+ Y +G+ C + DH V +VG+G+ E+G
Sbjct: 236 NNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGS---ENGV 292
Query: 307 KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YW++KNSWG WG +GY+ +LR+ +GLCGI AS+PV
Sbjct: 293 DYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPV 334
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 157/319 (49%), Positives = 210/319 (65%), Gaps = 16/319 (5%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E + E+ W +HG+ Y E A R ++K NLEYI++ + E NR+Y LG +F+D
Sbjct: 38 NERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQR-HSEKNRSYWLGLTKFAD 96
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
+TN+EFR YTG S++S R + F+Y + ++ P S+DWR+KGAVT +K+QG CG
Sbjct: 97 ITNDEFRRQYTGTR---IDRSKRSKRKTGFRYAD-SEAPESVDWRKKGAVTTVKDQGSCG 152
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLA 212
SCWAFSA+ +VEGI I G+ + LSEQ+LVDC + N GC+GGLMD AF++I+EN G+
Sbjct: 153 SCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGID 212
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYPY+ G CD K+ A TI YED+P+ DE AL +AV QPVSV +EA G+ F
Sbjct: 213 TENDYPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 272
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+ Y GV ECG + DHGV VG+G+ E YW++KNSWGE WGESGY+R+ R+
Sbjct: 273 QLYSGGVFTGECGTDLDHGVLAVGYGS---EGSLDYWIVKNSWGEYWGESGYLRMQRNIK 329
Query: 331 -----EGLCGIATEASYPV 344
GLCGI E SY V
Sbjct: 330 DSNHQFGLCGINIEPSYAV 348
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 308 bits (789), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 157/356 (44%), Positives = 223/356 (62%), Gaps = 33/356 (9%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ--------------WMAQHGRTYKDE 56
+ M +++ + C++ S H+PS+V ++ W +H + Y
Sbjct: 5 LSMLFLLLGFVACSATA----SHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASP 60
Query: 57 LEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQ 116
EK R IFK+NL +I + N+ N +Y LG N F+D+ +EEF+ASY G P ++R+
Sbjct: 61 KEKVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK---PGLARR 116
Query: 117 SSRP---STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGG 173
++P +TF+Y N ++P ++DWR+KGAVT +KNQG CGSCWAFS VAAVEGI QI G
Sbjct: 117 DAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTG 176
Query: 174 KLIELSEQQLVDC-STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK 232
KL+ LSEQ+L+DC +T N+GC GGLMD AF YI+ N+G+ TE DYPY E+G C +++
Sbjct: 177 KLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPH 236
Query: 233 AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGV 292
+ TI YED+P+ E +LL+A+ QPVSV + A + F+FYK G+ + ECG DH +
Sbjct: 237 SKVITITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHAL 296
Query: 293 AVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
VG+G+ +D Y ++KNSWG+ WGE GY RI R EG+C I ASYP
Sbjct: 297 TAVGYGSYYGQD---YIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYPT 349
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 162/355 (45%), Positives = 221/355 (62%), Gaps = 17/355 (4%)
Query: 1 MVLKFEKSFIIPMFV---IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL 57
M++ SF + + + II T + S R+ E ++ +E+W+ +HG++Y
Sbjct: 13 MIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNGLG 70
Query: 58 EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN-RPVPSVSRQ 116
EK R IFK NL++I++ N N TY+LG F+DLTNEE+R+ + G P + +
Sbjct: 71 EKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKL 129
Query: 117 SSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKL 175
S V D +P S+DWR++GAV +K+Q CGSCWAFSA+AAVEGI +I G L
Sbjct: 130 GGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDL 189
Query: 176 IELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAA 234
I LSEQ+LVDC T N GC+GGLMD AFE+II N G+ +E DYPY+ G CD+ ++ A
Sbjct: 190 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAK 249
Query: 235 AATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAV 294
TI YED+P DE AL +AV QP++V VE G+ F+ Y+ GV CG DHGVA
Sbjct: 250 VVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAA 309
Query: 295 VGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
VG+GT E+G YW+++NSWG +WGE GYIR+ R+ G CGIA E SYP+
Sbjct: 310 VGYGT---ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 167/341 (48%), Positives = 224/341 (65%), Gaps = 21/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ +++++++T SQ + + E ++ EKHEQWMA+HGRTY+D+ EK R IFK+NL++
Sbjct: 9 LAIVLMILVTWVSQAMPRPLIDEDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNLKH 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRP--VPS--VSRQSSRPSTFKYQNV 128
IE N NRTYKLG N F+DLT+EEF A+YTGY P +P+ ++ ++++ S Y+
Sbjct: 69 IENFNNAFNRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPTANITTKTTQSSDVLYE-- 126
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+VP SIDWR +G VT +KNQG CG CWAFSA AAVEGI G + LS QQL+DC
Sbjct: 127 ANVPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGII----GNGVSLSAQQLLDCVP 182
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
D+NGC+GG MD AF YII+N+GLA+ YPYQ + C + AA I Y D+ D
Sbjct: 183 DSNGCNGGFMDNAFRYIIQNQGLASATYYPYQLMREMC---RPSNNAARISGYVDVTPAD 239
Query: 249 EHALLQAVTKQPVSVCVEASGQA-FRFYKRGVLNA-ECGDNCDHGVAVVGFGTAEEEDGA 306
E L AV +QPVS V+A+ + F++Y G+ +CG H + +VG+GT+ E G
Sbjct: 240 EETLKSAVARQPVSAAVDATSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTSAE--GT 297
Query: 307 KYWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
KYWLIKNSWGE WGE GY+R+ RD G CGIA ASYP
Sbjct: 298 KYWLIKNSWGEGWGEGGYMRLQRDVGSYGGACGIALRASYP 338
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 160/348 (45%), Positives = 223/348 (64%), Gaps = 16/348 (4%)
Query: 5 FEKSFIIPMFVIIILVITCAS--QVVSGRSMHEPSIVEKHEQWMAQHGRTYKD-ELEKAM 61
F+ S I+ + + + ++ AS ++ R+ E ++ ++QW A+HG+ + + E
Sbjct: 4 FQSSPIMALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPEN 61
Query: 62 RLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS 121
R IFK NL++I++ N + N Y+LG N F+DLTNEE+R+ Y G S SR++ +
Sbjct: 62 RFHIFKDNLKFIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLG--GKFASGSRRNRTSN 118
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
+ + D+P SIDWR KGAV +K+QG CGSCWAFS VA+VE I QI G LI LSEQ
Sbjct: 119 RYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQ 178
Query: 182 QLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
+LVDC N GC+GGLMD AFE+IIEN GL TE DYPY +C + K+ A I
Sbjct: 179 ELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVAIDS 238
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
YED+P +E AL +AV+KQ VSV +E G++F+ Y+ G+ CG + DHGV VVG+G+
Sbjct: 239 YEDVPVNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYGS- 297
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
E G YW+++NSWG +WGESGY+++ R+ GLCGIA E SYP
Sbjct: 298 --EGGVDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 343
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 308 bits (788), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 161/317 (50%), Positives = 206/317 (64%), Gaps = 20/317 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V E+W+A++ + Y EK R +FK NL +I++AN++ +Y LG N F+DLT++
Sbjct: 68 LVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLTHD 127
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV----PTSIDWREKGAVTHIKNQGHCG 153
EF+A+Y G S R F+Y V D P S+DWR+KGAVT +KNQG CG
Sbjct: 128 EFKATYLGLLPKRTSGGR-------FRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCG 180
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLA 212
SCWAFS VAAVEGI QI G L LSEQQLVDCSTD NNGCSGG+MD AF +I GL
Sbjct: 181 SCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGAGLR 240
Query: 213 TEADYPYQQEQGTC-DKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
+E YPY E+G C D+ ++ TI YED+P DE AL++A+ QPVSV +EASG+
Sbjct: 241 SEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRH 300
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR-- 329
F+FY GV + CG DHGVA VG+G+++ +D Y ++KNSWG WGE GYIR+ R
Sbjct: 301 FQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQD---YIIVKNSWGTHWGEKGYIRMKRGT 357
Query: 330 --DEGLCGIATEASYPV 344
EGLCGI ASYP
Sbjct: 358 GKPEGLCGINKMASYPT 374
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 308 bits (788), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 162/355 (45%), Positives = 221/355 (62%), Gaps = 17/355 (4%)
Query: 1 MVLKFEKSFIIPMFV---IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL 57
M++ SF + + + II T + S R+ E ++ +E+W+ +HG++Y
Sbjct: 13 MIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNGLG 70
Query: 58 EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN-RPVPSVSRQ 116
EK R IFK NL++I++ N N TY+LG F+DLTNEE+R+ + G P + +
Sbjct: 71 EKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKL 129
Query: 117 SSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKL 175
S V D +P S+DWR++GAV +K+Q CGSCWAFSA+AAVEGI +I G L
Sbjct: 130 GGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDL 189
Query: 176 IELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAA 234
I LSEQ+LVDC T N GC+GGLMD AFE+II N G+ +E DYPY+ G CD+ ++ A
Sbjct: 190 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAK 249
Query: 235 AATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAV 294
TI YED+P DE AL +AV QP++V VE G+ F+ Y+ GV CG DHGVA
Sbjct: 250 VVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAA 309
Query: 295 VGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
VG+GT E+G YW+++NSWG +WGE GYIR+ R+ G CGIA E SYP+
Sbjct: 310 VGYGT---ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 161/317 (50%), Positives = 206/317 (64%), Gaps = 20/317 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V E+W+A++ + Y EK R +FK NL +I++AN++ +Y LG N F+DLT++
Sbjct: 82 LVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLTHD 141
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV----PTSIDWREKGAVTHIKNQGHCG 153
EF+A+Y G S R F+Y V D P S+DWR+KGAVT +KNQG CG
Sbjct: 142 EFKATYLGLLPKRTSGGR-------FRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCG 194
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLA 212
SCWAFS VAAVEGI QI G L LSEQQLVDCSTD NNGCSGG+MD AF +I GL
Sbjct: 195 SCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGAGLR 254
Query: 213 TEADYPYQQEQGTC-DKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
+E YPY E+G C D+ ++ TI YED+P DE AL++A+ QPVSV +EASG+
Sbjct: 255 SEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRH 314
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR-- 329
F+FY GV + CG DHGVA VG+G+++ +D Y ++KNSWG WGE GYIR+ R
Sbjct: 315 FQFYSGGVFDGPCGSELDHGVAAVGYGSSKGQD---YIIVKNSWGTHWGEKGYIRMKRGT 371
Query: 330 --DEGLCGIATEASYPV 344
EGLCGI ASYP
Sbjct: 372 GKPEGLCGINKMASYPT 388
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 208/330 (63%), Gaps = 11/330 (3%)
Query: 23 CASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGN 81
CA+ R + + ++ + +E+W H + EK R FK N+ YI + NK G
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRGG 84
Query: 82 RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREK 140
R Y+L N F D+ EEFRA++ G + ++ P F Y+ V D+P ++DWR K
Sbjct: 85 RGYRLRLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRK 144
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMD 199
GAVT +K+QG CGSCWAFS V +VEGI I G+L+ LSEQ+L+DC T DN+GC GGLM+
Sbjct: 145 GAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLME 204
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDK-QKEKAAAATIGKYEDLPKGDEHALLQAVTK 258
AFEYI + G+ TE+ YPY+ GTCD + +A I ++++P E AL +AV
Sbjct: 205 NAFEYIKHSGGITTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVAN 264
Query: 259 QPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
QPVSV ++A Q+F+FY GV +CG + DHGVAVVG+G E DG +YW++KNSWG
Sbjct: 265 QPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYG--ETNDGTEYWIVKNSWGTA 322
Query: 319 WGESGYIRILRDE----GLCGIATEASYPV 344
WGE GYIR+ RD GLCGIA EASYPV
Sbjct: 323 WGEGGYIRMQRDSGYDGGLCGIAMEASYPV 352
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 154/317 (48%), Positives = 204/317 (64%), Gaps = 13/317 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ + +E+W +H + + EK R +FK N+ ++ + NK ++ YKL N+F+D+
Sbjct: 33 EDNLWDMYERW--RH-KVATNHGEKLRRFNVFKSNVLHVHETNKM-DKPYKLKLNKFADM 88
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRP--STFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
TN EFR+ Y G S Q R TF Y NV VPTS+DWR+KGAV +K+QG C
Sbjct: 89 TNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAPVKDQGQC 148
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENKGL 211
GSCWAFS VAAVEGI +I +L+ LSEQ+LVDC T +N GC+GGLMD AF++I + GL
Sbjct: 149 GSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDFIKKTGGL 208
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
E YPY E G CD K + +I +ED+PK DE +L++AV QPV+V ++A
Sbjct: 209 TREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDAGSSD 268
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR-- 329
F+FY GV +CG DHGVA VG+GT DG KYW+++NSWG WGE GYIR+ R
Sbjct: 269 FQFYSEGVFTGKCGTQLDHGVAAVGYGTT--LDGTKYWIVRNSWGSEWGEKGYIRMERGI 326
Query: 330 --DEGLCGIATEASYPV 344
GLCGIA EASYP+
Sbjct: 327 SDKRGLCGIAMEASYPI 343
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 150/350 (42%), Positives = 212/350 (60%), Gaps = 16/350 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK--------HEQWMAQHGRTYKDELEK 59
+ I + +++I ++ +S S E I + +E W+ +HG++Y EK
Sbjct: 7 TLTISLLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNALGEK 66
Query: 60 AMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSR 119
R IFK NL+YI++ N N++YKLG +F+DLTNEE+R+ Y G ++
Sbjct: 67 DKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLSKNK 126
Query: 120 PSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELS 179
+ + +P S+DWR+KG + +K+QG CGSCWAFSAVAA+E I I G LI LS
Sbjct: 127 SDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLS 186
Query: 180 EQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
EQ+LVDC N GC GGLMD AFE++I N G+ TE DYPY++ CD+ ++ A I
Sbjct: 187 EQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKI 246
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YED+P +E AL +AV QPVS+ +EA G+ + YK G+ +CG DHGV G+G
Sbjct: 247 DSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVVAAGYG 306
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+ E+G YW+++NSWG WGE GY+R+ R+ GLCG+ATE SYPV
Sbjct: 307 S---ENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLATEPSYPV 353
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 206/310 (66%), Gaps = 14/310 (4%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN-KEGNRTYKLGTNEFSDLTNEEFR 100
++ W+A++GR+Y E R +F NL + + N + + ++LG N F+DLTNEEFR
Sbjct: 54 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 113
Query: 101 ASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
A++ G V R + +++ V ++P S+DWREKGAV +KNQG CGSCWAFSA
Sbjct: 114 ATFLG----AKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 169
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYP 218
V+ VE I Q+ G++I LSEQ+LV+CST+ N+GC+GGLMD AF++II+N G+ TE DYP
Sbjct: 170 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 229
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRG 278
Y+ G CD +E A +I +ED+P+ DE +L +AV QPVSV +EA G+ F+ Y G
Sbjct: 230 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 289
Query: 279 VLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLC 334
V + CG + DHGV VG+GT ++G YW+++NSWG WGESGY+R+ R+ G C
Sbjct: 290 VFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKC 346
Query: 335 GIATEASYPV 344
GIA ASYP
Sbjct: 347 GIAMMASYPT 356
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 150/317 (47%), Positives = 209/317 (65%), Gaps = 14/317 (4%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEF 91
S+H+ ++ E W+ +H + Y+ EK R IF NL++I++ NK+ + Y LG NEF
Sbjct: 41 SIHK--VIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEF 97
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+DLT+EEF+ + G+ + +SS+ F Y++ D+P S+DWR+KGAV +KNQG
Sbjct: 98 ADLTHEEFKHKFLGFKGELAERKDESSK--EFGYRDFVDLPKSVDWRKKGAVAPVKNQGQ 155
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKG 210
CG+CWAFS VAAVEGI QI G L LSEQ+L+DC T NNGC+GGLMD AF Y++ + G
Sbjct: 156 CGNCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-G 214
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
L E +YPY +GTCD++K+ + TI Y D+P+ DE + L+A+ QP+SV +EASG+
Sbjct: 215 LHKEEEYPYIMSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGR 274
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR- 329
F+FY GV + CG DHGVA VG+GT + G Y +++NSWG WGE GYIR+ R
Sbjct: 275 DFQFYSGGVFDGHCGTELDHGVAAVGYGTTK---GLDYVIVRNSWGPKWGEKGYIRMKRG 331
Query: 330 ---DEGLCGIATEASYP 343
G+CG+ ASYP
Sbjct: 332 SGKPHGMCGLYMMASYP 348
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 155/328 (47%), Positives = 204/328 (62%), Gaps = 22/328 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E+ EQWM +HGR Y D EK RL ++++N+E +E N GN Y+L N+F+DLTNE
Sbjct: 29 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN-GYRLADNKFADLTNE 87
Query: 98 EFRASYTGYNRPVPSV-SRQSSRPST--------FKYQNVTDVPTSIDWREKGAVTHIKN 148
EFRA G+ RP + S+ PST Q +D+P S+DWREKGAV +K+
Sbjct: 88 EFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVKS 147
Query: 149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIEN 208
QG CGSCWAFSAVAA+EGI QI GKL+ LSEQ+LVDC T GC+GG M AFE++++N
Sbjct: 148 QGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVMKN 207
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
+GL TE +YPYQ G C K K +A +I Y ++ E LL+A QPVSV V+A
Sbjct: 208 RGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVDAG 267
Query: 269 GQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE--------DGAKYWLIKNSWGETWG 320
++ Y GV C +HGV VVG+G + + G KYW++KNSWG WG
Sbjct: 268 SFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPEWG 327
Query: 321 ESGYIRILRD----EGLCGIATEASYPV 344
++GYI + R+ GLCGIA SYPV
Sbjct: 328 DAGYILMQREASVASGLCGIAMLPSYPV 355
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 306 bits (785), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 155/328 (47%), Positives = 204/328 (62%), Gaps = 22/328 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E+ EQWM +HGR Y D EK RL ++++N+E +E N GN Y+L N+F+DLTNE
Sbjct: 50 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN-GYRLADNKFADLTNE 108
Query: 98 EFRASYTGYNRPVPSV-SRQSSRPST--------FKYQNVTDVPTSIDWREKGAVTHIKN 148
EFRA G+ RP + S+ PST Q +D+P S+DWREKGAV +K+
Sbjct: 109 EFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVKS 168
Query: 149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIEN 208
QG CGSCWAFSAVAA+EGI QI GKL+ LSEQ+LVDC T GC+GG M AFE++++N
Sbjct: 169 QGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVMKN 228
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
+GL TE +YPYQ G C K K +A +I Y ++ E LL+A QPVSV V+A
Sbjct: 229 RGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVDAG 288
Query: 269 GQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE--------DGAKYWLIKNSWGETWG 320
++ Y GV C +HGV VVG+G + + G KYW++KNSWG WG
Sbjct: 289 SFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPEWG 348
Query: 321 ESGYIRILRD----EGLCGIATEASYPV 344
++GYI + R+ GLCGIA SYPV
Sbjct: 349 DAGYILMQREASVASGLCGIAMLPSYPV 376
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 306 bits (785), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 145/311 (46%), Positives = 208/311 (66%), Gaps = 15/311 (4%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLTNEEF 99
++ W+A++GR+Y E+ R +F NL++++ N + ++LG N F+DLTN+EF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 100 RASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
R+++ G V R + +++ V ++P S+DWREKGAV +KNQG CGSCWAFS
Sbjct: 109 RSTFLG----AKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFS 164
Query: 160 AVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADY 217
AV+ VE I Q+ G++I LSEQ+LV+CST+ N+GC+GGLMD AF++II+N G+ TE DY
Sbjct: 165 AVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDY 224
Query: 218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKR 277
PY+ G CD +E A +I +ED+P+ DE +L +AV QPVSV +EA G+ F+ Y
Sbjct: 225 PYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHS 284
Query: 278 GVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGL 333
GV + CG + DHGV VG+GT ++G YW+++NSWG WGESGY+R+ R+ G
Sbjct: 285 GVFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINATTGK 341
Query: 334 CGIATEASYPV 344
CGIA ASYP
Sbjct: 342 CGIAMMASYPT 352
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 149/345 (43%), Positives = 219/345 (63%), Gaps = 9/345 (2%)
Query: 7 KSFIIPMF-VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTI 65
+ I+ +F V+++ + + E + + +E+W + H + + EK R +
Sbjct: 4 RKVILAVFSVVLVFRLADSFDYTEEDLASEERLRDLYERWRSHHTVS-RSLAEKQERFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKY 125
FK+NL++I K N + +R YKL N F+D+TN EF Y G V R + + +
Sbjct: 63 FKENLKHIHKVNHK-DRPYKLKLNSFADMTNHEFLQHYGGSKVSHYRVLRGQRQGTGSMH 121
Query: 126 QNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
++ + +P+S+DWR+ GAVT IK+QG CGSCWAFS VAAVEGI +I G+LI LSEQ+LVD
Sbjct: 122 EDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKIKTGELISLSEQELVD 181
Query: 186 CSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
C +DN+GC+GGLM+ AF +I + GL +E YPY+ ++ CD K + I YE +P
Sbjct: 182 CDSDNHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEEPCDSNKMNSPVVNIDGYEMVP 241
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+ DE+AL++AV QPV++ ++A G+ +FY + +CG +HGVA+VG+GT +DG
Sbjct: 242 ENDENALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELNHGVALVGYGTT--QDG 299
Query: 306 AKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVAM 346
KYW++KNSWG WGE GYIR+ R +EGLCGI EASYPV +
Sbjct: 300 TKYWIVKNSWGTDWGEKGYIRMQRGIDAEEGLCGITMEASYPVKL 344
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 153/288 (53%), Positives = 195/288 (67%), Gaps = 17/288 (5%)
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF---RASYTGYNRPVPSVSRQSSRPSTF 123
K+N+ YIE N N+ YKLG N+F+DLT+EEF R + G+ R ++R +TF
Sbjct: 5 KENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGHMR------FSNTRTTTF 58
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
KY+NVT +P SIDWR+KGAVT IKNQG CG CWAFSA+AA EGI +I+ GKL+ LSEQ++
Sbjct: 59 KYENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEV 118
Query: 184 VDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
VDC T ++GC GG MD AF++II+N G+ TEA YPY+ G C+ ++E A TI Y
Sbjct: 119 VDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGY 178
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
ED+P +E AL +AV QPVSV ++A G F+FYK G+ CG DHGV VG+G E
Sbjct: 179 EDVPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYG--E 236
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
+G KYWL+KNSWG WGE GY + R EG+CGIA ASYP A
Sbjct: 237 NNEGTKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYPTA 284
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 306 bits (784), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 153/348 (43%), Positives = 214/348 (61%), Gaps = 20/348 (5%)
Query: 9 FIIPMFVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMR 62
F ++ ++++ T + HEP + +++E+W+ QHGR YK+ E
Sbjct: 6 FCRNVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRH 65
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
I++ N+ +I N + N ++ L N+F+D+TNEE++A Y G S QSS
Sbjct: 66 FGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSS---- 120
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
FK + +P S+DWR+ GAVT ++NQG CGSCWAFS VAAVEGI +I GKL+ LSEQ+
Sbjct: 121 FKRERSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQE 180
Query: 183 LVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
L+DC D N GC+GG M AF++I +N G+ T +YPY EQG C+K K I
Sbjct: 181 LLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISG 240
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
YE +P +E L AV KQPVSV ++A G F+ Y +G+ N CG +H V V+G+G
Sbjct: 241 YETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG-- 298
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
E++G KYWL+KNSWG WGE+GY R++R DEG+CGIA EASYP+
Sbjct: 299 -EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 345
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 154/321 (47%), Positives = 205/321 (63%), Gaps = 20/321 (6%)
Query: 35 EPSIVEKHEQWMAQH--GRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
E S + +E+W + H R+ D K R +FK N+ ++ NK ++ YKL N+F+
Sbjct: 33 EESFWDLYERWRSHHTVSRSLGD---KHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFA 88
Query: 93 DLTNEEFRASYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKN 148
D+TN EFR++Y G ++R R + TF Y+ V VP S+DWR+ GAVT +K+
Sbjct: 89 DMTNHEFRSTYAGSKVNHHRMFQGTPRGNG---TFMYEKVGSVPPSVDWRKNGAVTGVKD 145
Query: 149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIE 207
QG CGSCWAFS V AVEGI QI KL+ LSEQ+LVDC T N GC+GGLM+ AFE+I +
Sbjct: 146 QGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQ 205
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
G+ TE++YPY + GTCD K A +I +E++P DE+ALL+AV QPVSV ++A
Sbjct: 206 KGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDA 265
Query: 268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
G F+FY GV +C +HGVA+VG+GT DG YW ++NSWG WGE GYIR+
Sbjct: 266 GGSDFQFYSEGVFTGDCSTELNHGVAIVGYGTT--VDGTNYWTVRNSWGPEWGEQGYIRM 323
Query: 328 LRD----EGLCGIATEASYPV 344
R EGLCGIA ASYP+
Sbjct: 324 QRSISKKEGLCGIAMMASYPI 344
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 153/348 (43%), Positives = 214/348 (61%), Gaps = 20/348 (5%)
Query: 9 FIIPMFVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMR 62
F ++ ++++ T + HEP + +++E+W+ QHGR YK+ E
Sbjct: 2 FCRNVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRH 61
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
I++ N+ +I N + N ++ L N+F+D+TNEE++A Y G S QSS
Sbjct: 62 FGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSS---- 116
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
FK + +P S+DWR+ GAVT ++NQG CGSCWAFS VAAVEGI +I GKL+ LSEQ+
Sbjct: 117 FKRERSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQE 176
Query: 183 LVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
L+DC D N GC+GG M AF++I +N G+ T +YPY EQG C+K K I
Sbjct: 177 LLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISG 236
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
YE +P +E L AV KQPVSV ++A G F+ Y +G+ N CG +H V V+G+G
Sbjct: 237 YETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG-- 294
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
E++G KYWL+KNSWG WGE+GY R++R DEG+CGIA EASYP+
Sbjct: 295 -EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 341
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 306 bits (784), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 152/347 (43%), Positives = 214/347 (61%), Gaps = 14/347 (4%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+ KSF+ + ++ + + + R+ E + +E W+ +HG++Y E+ R
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLALDAKRTNDE--VKAMYESWLIKHGKSYNSLGERERR 58
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
IFK+ L +I++ N + +R+YK+G N+F+DLTNEEFR++Y G+ R S ++ +
Sbjct: 59 FEIFKETLRFIDEHNADTSRSYKVGLNQFADLTNEEFRSTYLGFTRG----SNKTKVSNR 114
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
++ + +P +DWR +GAV IKNQG CGSCWAFSA+AAVEGI +I G LI LSEQ+
Sbjct: 115 YEPRVGQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLISLSEQE 174
Query: 183 LVDC--STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
LVDC + GC GG M FE+II N G+ TE +YPY ++G CD + TI
Sbjct: 175 LVDCGRTQSTKGCDGGYMTDGFEFIINNGGINTEENYPYTAQEGQCDLNLQNEKYVTIDN 234
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
YE++P +E AL AV QPVSV +E++G AF+ Y G+ CG DH V +VG+GT
Sbjct: 235 YENVPYYNEWALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGTATDHAVTIVGYGT- 293
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
E G YW++KNSW TWGE GY+RILR+ G CGIAT SYPV
Sbjct: 294 --EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 338
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 150/343 (43%), Positives = 214/343 (62%), Gaps = 12/343 (3%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+ + +F ++++ ++ S + + +E +E+W+ ++ + Y EK R IFK
Sbjct: 9 TLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFK 68
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL+++E+ + NRTY++G F+DLTN+EFRA Y R +R + + Y+
Sbjct: 69 DNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYL---RSKMERTRVPVKGEKYLYKV 125
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P +IDWR KGAV +K+QG CGSCWAFSA+ AVEGI QI G+LI LSEQ+LVDC
Sbjct: 126 GDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185
Query: 188 TD-NNGCSGGLMDKAFEYIIENKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLP 245
T N+GC GGLMD AF++IIEN G+ TE DYPY + C+ K+ TI YED+P
Sbjct: 186 TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVP 245
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+ DE +L +A+ QP+SV +EA G+AF+ Y GV CG + DHGV VG+G+ E G
Sbjct: 246 QNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGS---EGG 302
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YW+++NSWG WGESGY ++ R+ G CG+A ASYP
Sbjct: 303 QDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/326 (46%), Positives = 204/326 (62%), Gaps = 11/326 (3%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKAN-KEGNRTY 84
V G + E + +EQWMA+HG+ + L E R F NL +++ N + G R Y
Sbjct: 37 VGGGMARTEAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGY 96
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
+LG N F+DLTN EFRA+Y + + + ++ +++ V +P +DWR+KGAV
Sbjct: 97 RLGINRFADLTNAEFRAAY--LSAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVA 154
Query: 145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAF 202
+KNQG CGSCWAFSAV AVEGI QI G+L+ LSEQ+LVDCS + N GC GG+MD AF
Sbjct: 155 PVKNQGQCGSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAF 214
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
+I+ N G+ T+ DYPY G CD K +I +E +P+ DE +L +AV QPV+
Sbjct: 215 AFIVGNGGIDTDKDYPYTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVA 274
Query: 263 VCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
V +EA G+ F+ Y+ GV CG + DHGV VG+GT E + G YWL++NSWG WGE
Sbjct: 275 VAIEAGGREFQLYQSGVFTGRCGTSLDHGVVAVGYGT-EADGGRDYWLVRNSWGADWGEG 333
Query: 323 GYIRILRD----EGLCGIATEASYPV 344
GYIR+ R+ G CGIA EASYPV
Sbjct: 334 GYIRMERNVGARAGKCGIAMEASYPV 359
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 153/348 (43%), Positives = 223/348 (64%), Gaps = 27/348 (7%)
Query: 13 MFVIIILVITCASQVVS--------GRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRL 63
+F++I+ V++ S + RS E + + WM++HG+TY + L EK R
Sbjct: 12 LFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFI--FQMWMSKHGKTYTNALGEKERRF 69
Query: 64 TIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF 123
FK NL +I++ N + N +Y+LG F+DLT +E+R + G +P +Q + ++
Sbjct: 70 QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPGSPKP-----KQRNLKTSR 123
Query: 124 KYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
+Y + +P S+DWR++GAV+ IK+QG C SCWAFS VAAVEG+ +I G+LI LSEQ
Sbjct: 124 RYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQ 183
Query: 182 QLVDCSTDNNGCSG-GLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
+LVDC+ NNGC G GLMD AF+++I N GL +E DYPYQ QG+C++++ TI
Sbjct: 184 ELVDCNLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQVHLLVITIDS 243
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
YED+P DE +L +AV QPVSV V+ Q F Y+ + N CG N DH + +VG+G+
Sbjct: 244 YEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGS- 302
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
E+G YW+++NSWG TWG++GYI+I R+ +GLCGIA ASYP+
Sbjct: 303 --ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 348
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 154/321 (47%), Positives = 211/321 (65%), Gaps = 21/321 (6%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDE----LEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNE 90
+ + +E WM +HG+ + EK R IFK NL +I++ N + N +YKLG
Sbjct: 42 DAEVARIYEAWMEKHGKKAQSNGLVGEEKDQRFEIFKDNLRFIDEHNNK-NLSYKLGLTR 100
Query: 91 FSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKN 148
F+DLTNEE+R+ Y G + S++ ++ +YQ V D +P S+DWR++GAV +K+
Sbjct: 101 FADLTNEEYRSIYLG------AKSKKRVLKTSDRYQPRVGDAIPDSVDWRKEGAVAAVKD 154
Query: 149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIE 207
QG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLMD AFE+II+
Sbjct: 155 QGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIK 214
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
N G+ TE DYPY+ G CD+ ++ A TI YED+P+ +E AL + + QP+SV +EA
Sbjct: 215 NGGIDTEEDYPYKAADGRCDQTRKNAKVVTIDAYEDVPENNEAALKKTLANQPISVAIEA 274
Query: 268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
G+AF+ Y GV + CG DHGV VG+GT E+G YW+++NSWG +WGESGYI++
Sbjct: 275 GGRAFQLYSSGVFDGICGTELDHGVVAVGYGT---ENGKDYWIVRNSWGGSWGESGYIKM 331
Query: 328 LRD----EGLCGIATEASYPV 344
R+ G CGIA EASYP+
Sbjct: 332 ARNIAEPTGKCGIAMEASYPI 352
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 305 bits (781), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 159/347 (45%), Positives = 211/347 (60%), Gaps = 21/347 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEK--------HEQWMAQHGRTYKDELEKAMRLT 64
+F + L ++S + H+ + +E+W+ +HG+ Y EK R
Sbjct: 3 LFALFALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQ 62
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFK 124
IFK NL +I++ N E NRTYKLG N F+DLTNEE+RA Y G + R PS
Sbjct: 63 IFKDNLRFIDQQNAE-NRTYKLGLNRFADLTNEEYRARYLG--TKIDPNRRLGRTPSNRY 119
Query: 125 YQNVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
V + +P S+DWR++GAV +K+Q CGSCWAFSA+ AVEGI +I G LI LSEQ+L
Sbjct: 120 APRVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQEL 179
Query: 184 VDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
VDC T N GC+GGLMD AFE+II+N G+ +E DYPY+ G CD+ ++ A +I YE
Sbjct: 180 VDCDTGYNMGCNGGLMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYE 239
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
D+ DE AL +AV QPVSV VE G+ F+ Y GV CG DHGV VG+GT
Sbjct: 240 DVNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYGT--- 296
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
++G +W+++NSWG WGE GYIR+ R+ G CGIA E SYP+
Sbjct: 297 DNGHDFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGIAIEPSYPI 343
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 155/327 (47%), Positives = 207/327 (63%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG++Y E+ R F+ NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 84 YKLGTNEFSDLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R +Y G N+P R+ + + +P S+DWR KGA
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKA 201
V IK+QG CGSCWAFSA+AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F++II N G+ TE DYPY+ + CD ++ A TI YED+ E +L +AV QPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 260
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
SV +EA G+AF+ Y G+ +CG DHGVA VG+GT E+G YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317
Query: 322 SGYIRILRD----EGLCGIATEASYPV 344
SGY+R+ R+ G CGIA E SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 155/327 (47%), Positives = 207/327 (63%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG++Y E+ R F+ NL YI++ N G +
Sbjct: 26 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 85
Query: 84 YKLGTNEFSDLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R +Y G N+P R+ + + +P S+DWR KGA
Sbjct: 86 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 141
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKA 201
V IK+QG CGSCWAFSA+AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 142 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 201
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F++II N G+ TE DYPY+ + CD ++ A TI YED+ E +L +AV QPV
Sbjct: 202 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 261
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
SV +EA G+AF+ Y G+ +CG DHGVA VG+GT E+G YW+++NSWG++WGE
Sbjct: 262 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 318
Query: 322 SGYIRILRD----EGLCGIATEASYPV 344
SGY+R+ R+ G CGIA E SYP+
Sbjct: 319 SGYVRMERNIKASSGKCGIAVEPSYPL 345
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 162/323 (50%), Positives = 206/323 (63%), Gaps = 20/323 (6%)
Query: 29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGT 88
+GR+ E SIV + + Y EK R +FK NL +I+ NK+ +Y LG
Sbjct: 24 AGRNGGEFSIV--------GYRKAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGL 74
Query: 89 NEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHI 146
NEF+DLT++EF+A+Y G P + + F+Y +++ VP +DWR+K AVT +
Sbjct: 75 NEFADLTHDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEV 134
Query: 147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYI 205
KNQG CGSCWAFS VAAVEGI I G L LSEQ+L+DCSTD NNGC+GGLMD AF YI
Sbjct: 135 KNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYI 194
Query: 206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCV 265
GL TE YPY E+G CD+ K AA TI YED+P DE AL++A+ QPVSV +
Sbjct: 195 ASTGGLRTEEAYPYAMEEGDCDEGK-GAAVVTISGYEDVPANDEQALVKALAHQPVSVAI 253
Query: 266 EASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
EASG+ F+FY GV + CG+ DHGV VG+GT++ +D Y ++KNSWG WGE GYI
Sbjct: 254 EASGRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSKGQD---YIIVKNSWGPHWGEKGYI 310
Query: 326 RILR----DEGLCGIATEASYPV 344
R+ R EGLCGI ASYP
Sbjct: 311 RMKRGTGKGEGLCGINKMASYPT 333
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 304 bits (779), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 150/316 (47%), Positives = 203/316 (64%), Gaps = 30/316 (9%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V +HEQWM Q+ R YKD EKA R +FK N+++IE N GNR + LG N+F+DLTN+
Sbjct: 1 MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTND 60
Query: 98 EFRASYTGYN-RPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGS 154
EFRA+ T +P P P+ F+Y+N++ +P +IDWR KGAVT IK+QG C
Sbjct: 61 EFRATKTNKGFKPSP-----VKVPTGFRYENISVDALPATIDWRTKGAVTPIKDQGQC-- 113
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLA 212
EGI +I+ GKLI LSEQ+LVDC ++ GC GGLMD AF++II+ GL
Sbjct: 114 ----------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLT 163
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE+ YPY G C + + AT+ +ED+P DE +L++AV QPVSV V+ F
Sbjct: 164 TESSYPYTAADGKC--KSGSNSVATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTF 221
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+FY GV+ CG + DHG+A +G+G + DG KYWL+KNSWG TWGE+GY+R+ +D
Sbjct: 222 QFYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGTKYWLLKNSWGTTWGENGYLRMEKDIS 279
Query: 331 --EGLCGIATEASYPV 344
G+CG+A E SYP
Sbjct: 280 DKRGMCGLAMEPSYPT 295
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 155/339 (45%), Positives = 204/339 (60%), Gaps = 17/339 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F +L+++ A + + ++ +E W+ +HG++Y EK MR IFK+NL
Sbjct: 13 LFFSTLLILSSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFKENLRI 72
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNR-PVPSVSRQSSRPSTFKYQNVTD- 130
I+ N + NR+Y LG N F+DLT+EE+R++Y G R P VS Q V D
Sbjct: 73 IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKRGPKTDVSNQ-------YMPKVGDA 125
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P +DWR GAV +KNQG C SCWAFSAVAAVEGI +I G LI LSEQ+LVDC
Sbjct: 126 LPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQ 185
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
GC+ GLM AF++II N G+ TE +YPY + G C+ + TI Y+++P +
Sbjct: 186 ITKGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCNLSLKNQKYVTIDSYKNVPSNN 245
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL +AV QPVSV VE+ G F+ Y G+ CG DHGV +VG+GT E G Y
Sbjct: 246 EMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYGT---ERGMDY 302
Query: 309 WLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
W++KNSWG WGESGYIRI R+ G CGIA SYPV
Sbjct: 303 WIVKNSWGTNWGESGYIRIQRNIGGAGKCGIAKMPSYPV 341
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 160/316 (50%), Positives = 212/316 (67%), Gaps = 11/316 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ + +E+W + H T + EK R +FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EKSLWDLYERWRSHHTVT-RSLDEKHNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFADM 90
Query: 95 TNEEFRASYTGYNRPVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
TN EFR Y + R S+ TF Y+NV +VP+SIDWR+KGAVT +K+QG CG
Sbjct: 91 TNYEFRRIYADSKVSHHRMFRGMSNENGTFMYENVKNVPSSIDWRKKGAVTDVKDQGQCG 150
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLA 212
SCWAFS + AVEGI QI KL+ LSEQ+LVDC T N GC+GGLM+ AFE+I +N G+
Sbjct: 151 SCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLMEYAFEFIKQN-GIT 209
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE++YPY + GTCD +KE A +I YE++P +E ALL+A KQPVSV ++A G F
Sbjct: 210 TESNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAIDAGGYNF 269
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR--- 329
+FY GV + CG + +HGVAVVG+G +D KYW++KNSWG WGE GYIR+ R
Sbjct: 270 QFYSEGVFSGHCGTDLNHGVAVVGYGVT--QDRTKYWIVKNSWGSEWGEQGYIRMQRGIS 327
Query: 330 -DEGLCGIATEASYPV 344
EGLCGIA EASYP+
Sbjct: 328 HKEGLCGIAMEASYPI 343
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 304 bits (779), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 161/328 (49%), Positives = 203/328 (61%), Gaps = 25/328 (7%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR-------TYKLGTN 89
++ +HE WMA+HGRTY D EKA RL IF+ N E I+ N + + +++L TN
Sbjct: 38 AMASRHESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATN 97
Query: 90 EFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT---DVPTSIDWREKGAVTHI 146
F+DLT+EEFRA+ TG RP F+Y+N + D S+DWR GAVT +
Sbjct: 98 RFADLTDEEFRAARTGLRRPAAVAGAVGG---GFRYENFSLQADAAGSMDWRAMGAVTGV 154
Query: 147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEY 204
K+QG CG CWAFSAVAA+EG+T+I G+L+ LSEQQLVDC D+ GC GGLMD AF+Y
Sbjct: 155 KDQGSCGCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQY 214
Query: 205 IIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVC 264
I GLA+E+ YPY E G + AA+I +ED+P +E AL+ AV QPVSV
Sbjct: 215 ISRQGGLASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVA 274
Query: 265 VEASGQAFRFYKRGVLNAECGDNC-----DHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
+ FRFY RGVL A C DH + VG+G A DG YWL+KNSWG W
Sbjct: 275 INGGDYVFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMA--GDGTGYWLMKNSWGSGW 332
Query: 320 GESGYIRIL---RDEGLCGIATEASYPV 344
GESGY+RI R EG+CG+A ASYPV
Sbjct: 333 GESGYVRIRRGSRGEGVCGLAKLASYPV 360
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 155/327 (47%), Positives = 206/327 (62%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG+ Y E+ R F+ NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 84 YKLGTNEFSDLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R +Y G N+P R+ + + +P S+DWR KGA
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKA 201
V IK+QG CGSCWAFSA+AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F++II N G+ TE DYPY+ + CD ++ A TI YED+ E +L +AV QPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 260
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
SV +EA G+AF+ Y G+ +CG DHGVA VG+GT E+G YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317
Query: 322 SGYIRILRD----EGLCGIATEASYPV 344
SGY+R+ R+ G CGIA E SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 304 bits (778), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 161/310 (51%), Positives = 205/310 (66%), Gaps = 11/310 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
++ W+A+HG+ Y E+A R IFK NL +I++ N + N TYK+G +F+DLTNEE+RA
Sbjct: 4 YKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQ-NHTYKVGLTKFADLTNEEYRA 62
Query: 102 SYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
+ G +S PS + ++ +P S+DWR KGAV IK+QG CGSCWAFS
Sbjct: 63 MFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAFST 122
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEADYPY 219
VAAVEGI QI G+LI LSEQ+LVDC T N GC+GGLMD AF++II N GL TE DYPY
Sbjct: 123 VAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTEKDYPY 182
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
+ CDK K K A +I +ED+ DE AL +AV QPVSV +EASG A +FY+ GV
Sbjct: 183 VGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQSGV 242
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLC 334
ECG DHGV VVG+ + E+G YWL++NSWG WGE GYI++ R+ G C
Sbjct: 243 FTGECGTALDHGVVVVGYAS---ENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGRC 299
Query: 335 GIATEASYPV 344
GIA E+SYPV
Sbjct: 300 GIAMESSYPV 309
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 155/349 (44%), Positives = 222/349 (63%), Gaps = 28/349 (8%)
Query: 13 MFVIIILVITCASQVVS--------GRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRL 63
+F++I+ V++ S + RS E + + WM++HG+TY + L EK R
Sbjct: 12 LFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFI--FQMWMSKHGKTYTNALGEKERRF 69
Query: 64 TIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF 123
FK NL +I++ N + N +Y+LG F+DLT +E+R + G +P +Q + ++
Sbjct: 70 QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPGSPKP-----KQRNLKTSR 123
Query: 124 KYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
+Y + +P S+DWR++GAV+ IK+QG C SCWAFS VAAVEG+ +I G+LI LSEQ
Sbjct: 124 RYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQ 183
Query: 182 QLVDCSTDNNGCSG-GLMDKAFEYIIENKGLATEADYPYQQEQGTCD-KQKEKAAAATIG 239
+LVDC+ NNGC G GLMD AF+++I N GL +E DYPYQ QG+C+ KQ TI
Sbjct: 184 ELVDCNLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITID 243
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
YED+P DE +L +AV QPVSV V+ Q F Y+ + N CG N DH + +VG+G+
Sbjct: 244 SYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGS 303
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
E+G YW+++NSWG TWG++GYI+I R+ +GLCGIA ASYP+
Sbjct: 304 ---ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 349
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 303 bits (777), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 160/363 (44%), Positives = 222/363 (61%), Gaps = 30/363 (8%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRT 52
M+ K FI F + + + C ++S H ++ +E+W+ +HG+
Sbjct: 1 MLSKLTILFITLTFTLSLALDMC---IISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKN 57
Query: 53 YKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGY----NR 108
Y EK R IFK NL +I++ N + N +++LG N F+DLTNEE+R + G NR
Sbjct: 58 YNALGEKEKRFEIFKDNLGFIDEHNSK-NLSFRLGLNRFADLTNEEYRTRFLGTRINPNR 116
Query: 109 PVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGI 167
V+ Q++R +T V D +P S+DWR++GAV +K+QG CGSCWAFSA+AAVEG+
Sbjct: 117 RNRKVNSQTNRYAT----RVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGV 172
Query: 168 TQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTC 226
++ G LI LSEQ+LVDC T N GC+GGLMD AFE+II L E DYPY+ G C
Sbjct: 173 NKLATGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRC 232
Query: 227 DKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGD 286
D+ ++ A +I +YED+P DE AL +AV Q ++V VE G+ F+ Y GV CG
Sbjct: 233 DQNRKNAKVVSIDQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGT 292
Query: 287 NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEAS 341
DHGVA VG+GT E+G YW+++NSWG +WGE+GYIR+ R+ G CGIA E S
Sbjct: 293 ALDHGVAAVGYGT---ENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPS 349
Query: 342 YPV 344
YP+
Sbjct: 350 YPI 352
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 157/317 (49%), Positives = 197/317 (62%), Gaps = 11/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ E +E+W QH R +D EKA R +FK N+ I + N+ + YKL N F D+
Sbjct: 41 EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98
Query: 95 TNEEFRASYTGYNRPVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
T +EFR +Y + R + R S F Y D+P ++DWREKGAV +K+QG CG
Sbjct: 99 TADEFRRAYASSRVSHHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAVKDQGQCG 158
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGL 211
SCWAFS +AAVEGI I L LSEQQLVDC T N GC GGLMD AF+YI ++ G+
Sbjct: 159 SCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGV 218
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
A + YPY+ Q +C + A TI YED+P E AL +AV QPVSV +EA G
Sbjct: 219 AASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSH 278
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
F+FY GV +CG DHGVA VG+GT DG KYW+++NSWG WGE GYIR+ RD
Sbjct: 279 FQFYSEGVFAGKCGTELDHGVAAVGYGTT--VDGTKYWIVRNSWGADWGEKGYIRMKRDV 336
Query: 331 ---EGLCGIATEASYPV 344
EGLCGIA EASYP+
Sbjct: 337 SAKEGLCGIAMEASYPI 353
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 303 bits (776), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 167/340 (49%), Positives = 214/340 (62%), Gaps = 31/340 (9%)
Query: 32 SMHEPSIVEKHEQWMAQHGR-TYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNE 90
S HE S+ E E+W+++H + Y EK R +FK NL +I++ N++ + +Y LG NE
Sbjct: 39 SSHE-SLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRKVS-SYWLGLNE 96
Query: 91 FSDLTNEEFRASYTGYNRPVPSVS-----------------RQSSRPSTFKYQNV--TDV 131
F+DLT++EF+A+Y G + SS F+Y+ V +
Sbjct: 97 FADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAARL 156
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-N 190
P S+DWR KGAVT +KNQG CGSCWAFS VAAVEGI QI G L LSEQ+LVDC TD N
Sbjct: 157 PKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDGN 216
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
NGC+GGLMD AF YI N GL TE YPY E+GTC + AA TI YED+P+ +E
Sbjct: 217 NGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSR-GSSAAVVTISGYEDVPRNNEQ 275
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG---AK 307
ALL+A+ QPVSV +EASG+ +FY GV + CG DHGVA VG+GTA +++G A
Sbjct: 276 ALLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAAVGYGTAGKDNGHVVAD 335
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
Y ++KNSWG +WGE GYIR+ R +GLCGI SYP
Sbjct: 336 YIIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYP 375
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 303 bits (775), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 205/312 (65%), Gaps = 12/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V+ W +H + Y EK R +FKQNL++I + N+ N +Y LG N+F+D+ +E
Sbjct: 44 LVDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRR-NGSYWLGLNQFADVAHE 102
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+++Y G + +R P+ F+Y+N ++P S+DWR+KGAVT +KNQG CGSCWA
Sbjct: 103 EFKSTYLGLKTGMDGPARA---PTAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCWA 159
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI GKL LSEQ+L+DC T ++GC GG MD AF YI+ N G+ T+ D
Sbjct: 160 FSTVAAVEGINQIATGKLESLSEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGIHTDDD 219
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+G C +++ ++ TI YED+P+ E +LL+A+ QP+SV + A + F+FYK
Sbjct: 220 YPYLMEEGYCKEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQFYK 279
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEG 332
RGV CG DH + VG+G++ DG Y ++KNSWG++WGE GY RI R EG
Sbjct: 280 RGVFEGSCGTELDHALTAVGYGSS---DGQDYIIMKNSWGKSWGEQGYFRIKRGTGKPEG 336
Query: 333 LCGIATEASYPV 344
+C I + ASYP
Sbjct: 337 VCSIYSMASYPT 348
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 155/357 (43%), Positives = 221/357 (61%), Gaps = 28/357 (7%)
Query: 12 PMFVIIILVITCAS------QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYK--D 55
PM VI+I+ + ++S H + + +E+W +HG+ D
Sbjct: 9 PMLVILIVFTLFTATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNID 68
Query: 56 ELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYN-RPVPSV- 113
EK R IFK NL++I++ N E NRTYK+G N F+DL+NEE+R+ Y G P+ +
Sbjct: 69 GSEKDKRFEIFKDNLKFIDEHNAE-NRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMM 127
Query: 114 SRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGG 173
+R +R + + +P S+DWR +GAV +K+QG CGSCWAFS +AAVEGI +I G
Sbjct: 128 ARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVTG 187
Query: 174 KLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK 232
+L+ LSEQ+LVDC T N GC GGLM+ AFE+II N G+ ++ DYPY+ G CD+ K+
Sbjct: 188 ELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVDGKCDQYKKN 247
Query: 233 AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGV 292
A +I YE +P DE AL +AV QP+SV +EA G+ F+ Y G+ +CG DHGV
Sbjct: 248 ARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGKCGTALDHGV 307
Query: 293 AVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
VG+GT E+G YW+++NSWG++WGESGY+R+ R+ G CGI ++SYP+
Sbjct: 308 TAVGYGT---ENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGIVMQSSYPI 361
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 164/360 (45%), Positives = 224/360 (62%), Gaps = 31/360 (8%)
Query: 8 SFIIPMFVIIILVITCASQVV--------SGRSMHEPSIVEKHEQWMAQHGRTYKDELEK 59
SF + ++II++ C + +V + + ++ E++E+W A HGRTYKD LEK
Sbjct: 7 SFSLAAILLIIIMYCCPTGLVEAARKGPAAAGGGDDSAMRERYEKWAADHGRTYKDSLEK 66
Query: 60 AMRLTIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSS 118
A R +F+ N +I+ N G + + +L TN+F+DLTNEEF A Y G P +
Sbjct: 67 ARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEF-AEYYGRPFSTPVIG---- 121
Query: 119 RPSTFKYQNV--TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
S F Y NV +DVP +I+WR++GAVT +KNQ C SCWAFSAVAAVEGI QI L+
Sbjct: 122 -GSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSHNLV 180
Query: 177 ELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQ-GTCDKQKEKA 233
LS QQL+DCST +N+GC+ G MD+AF YI N G+A E+DYPY+ GTC + K
Sbjct: 181 ALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDYPYEDRALGTC-RASGKP 239
Query: 234 AAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL----NAECGDNCD 289
AA+I ++ +P +E ALL AV QPVSV ++ G+ +F+ GV N C + +
Sbjct: 240 VAASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVSQFFSSGVFGAMQNETCTTDLN 299
Query: 290 HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
H + VG+GT +E G KYWL+KNSWG WGE GY++I RD GLCG+A + SYPVA
Sbjct: 300 HAMTAVGYGT--DEHGTKYWLMKNSWGTDWGEGGYMKIARDVASNTGLCGLAMQPSYPVA 357
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 150/341 (43%), Positives = 218/341 (63%), Gaps = 21/341 (6%)
Query: 15 VIIILVITCASQVV--------SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIF 66
+I+LV+ A+ GR++ I E W A+HG++Y + EKA RL IF
Sbjct: 5 TLILLVVVGATPFAIARPAALEDGRALE---IKNMFEDWAAKHGKSYSSDWEKARRLMIF 61
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG-YNRPVPSVSRQSSRPSTFKY 125
L YIEK N + N T+ LG N+FSDLTN EFRA + G + RP Q P+ +
Sbjct: 62 SDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRP----RYQDRLPAEDED 117
Query: 126 QNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
+V+ +PTS+DWR+KGAVT IK+QG CGSCWAFSA+A++E + +L+ LSEQQL+D
Sbjct: 118 VDVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMD 177
Query: 186 CSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
C T + GC GGLM+ AF+++++N G+ TEA YPY G+C+ K K A I ++ +
Sbjct: 178 CDTVDAGCDGGLMETAFKFVVKNGGVTTEAAYPYTGSVGSCNANKAKNKVAEITGFKVVT 237
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+ AL++AV+K PV+V + S + F+ YK G+L+ +C D+ DHGV ++G+GT E G
Sbjct: 238 EDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGKCDDSLDHGVLLIGYGT---EGG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRD--EGLCGIATEASYPV 344
YW+IKNSWG +WGE G+++I R +G+CG+ ++SYP
Sbjct: 295 MPYWIIKNSWGTSWGEDGFMKIERKDGDGMCGMNGDSSYPT 335
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 302 bits (773), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 157/315 (49%), Positives = 206/315 (65%), Gaps = 21/315 (6%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
S+ E+ E W ++G YKD E+ IFK N+ YI+ N GN+ YKL N F D
Sbjct: 37 SLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPI 96
Query: 97 EEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
E+ S G+ R + ++ +TFKY+NVTD+P ++DWR++GAVT IKNQG CGSCW
Sbjct: 97 ED---SDDGFER-----TTTTTPTTTFKYENVTDIPATVDWRKRGAVTPIKNQGKCGSCW 148
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDC--STDNNGCSGGLMDKAFEYIIENKGLATE 214
AFSAVAA+EGI +IT G L+ LSEQQLVDC S GC G M AF++I+EN G+ATE
Sbjct: 149 AFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIATE 208
Query: 215 ADYPYQQ-EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
A+YPY++ +GTC K K I YE++P E +LL+AV QPVSV ++ G F+
Sbjct: 209 ANYPYKRVVKGTCKKVSHK---VQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRGM-FK 264
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--- 330
FY G+ ECG +H + +VG+GT+ +DG KYWL+KNSW + WGE GYIRI RD
Sbjct: 265 FYSSGIFTGECGTKPNHALTIVGYGTS--KDGIKYWLVKNSWSKRWGEKGYIRIKRDIDA 322
Query: 331 -EGLCGIATEASYPV 344
EGLCGIA + SYP+
Sbjct: 323 KEGLCGIAMKPSYPI 337
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 158/346 (45%), Positives = 210/346 (60%), Gaps = 17/346 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
I+L C Q G E ++ + +E+W H T + E R +F+
Sbjct: 3 LFFIVLSFLCLLQASKGFDFDEKELETEENVWKLYERWRDHHSVT-RASHEALKRFNVFR 61
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST-FKYQ 126
N+ ++ + NK+ N+ YKL N F+D+T+ EFR+SY G N + R R S F Y+
Sbjct: 62 HNVLHVHRTNKK-NKPYKLKVNRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGFMYE 120
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
NVT VP+S+DWREKGAVT +KNQ CGSCWAFS VAAVEGI +I KL+ LSEQ+LVDC
Sbjct: 121 NVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDC 180
Query: 187 ST-DNNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDL 244
T +N GC+GGLM+ AFE+I N G+ TE YPY + C + TI +E +
Sbjct: 181 DTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSNDVQFCRAKSIDGETVTIDGHEHV 240
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
P+ DE ALL+AV QPVSV ++A F+ Y GV ECG +HGV +VG+G E ++
Sbjct: 241 PENDEEALLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYG--ETKN 298
Query: 305 GAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVAM 346
G KYW+++NSWG WGE GY+RI R +EG CGIA EASYP +
Sbjct: 299 GTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKV 344
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 211/340 (62%), Gaps = 21/340 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F+ + L + AS + R ++++ E+WMA++GR YKD EK R IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG-YNRPV-----PSVSRQSSRPSTFKYQ 126
IE N +Y LG N+F+D+TN EF A YTG +RP+ P VS F
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPLNIEKEPVVS--------FDDV 119
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
N++ V SIDWR+ GAVT +K+Q CGSCWAFSA+A VEGI +I G L+ LSEQ+++DC
Sbjct: 120 NISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDC 179
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
+ +NGC GG +D A+++II N G+A+EADYPYQ QG C +A G Y +
Sbjct: 180 AV-SNGCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSAYITG-YSYVRS 237
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
DE ++ AV QP++ ++ASG F++Y GV + CG + +H + ++G+G ++ G
Sbjct: 238 NDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGT 295
Query: 307 KYWLIKNSWGETWGESGYIRILR---DEGLCGIATEASYP 343
+YW++KNSWG +WGE GYIR+ R GLCGIA + YP
Sbjct: 296 QYWIVKNSWGSSWGERGYIRMARGVSSSGLCGIAMDPLYP 335
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 147/337 (43%), Positives = 219/337 (64%), Gaps = 16/337 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+F+ + L + AS S S EPS ++++ E+WMA++GR YKD EK +R IFK N+
Sbjct: 8 VFLFLFLCVMWASP--SAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNV 65
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+IE N +Y LG N+F+D+TN EF A YTG + P+ ++ R+ +F +++
Sbjct: 66 NHIETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPL-NIKREPV--VSFDDVDISS 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
VP SIDWR+ GAVT +KNQG CGSCWAF+++A VE I +I G L+ LSEQQ++DC+ +
Sbjct: 123 VPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAV-S 181
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GG ++KA+ +II NKG+A+ A YPY+ +GTC K +A I +Y + + +E
Sbjct: 182 YGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTC-KTNGVPNSAYITRYTYVQRNNER 240
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
++ AV+ QP++ ++ASG F+ YKRGV CG +H + ++G+G ++ G K+W+
Sbjct: 241 NMMYAVSNQPIAAALDASGN-FQHYKRGVFTGPCGTRLNHAIVIIGYG--QDSSGKKFWI 297
Query: 311 IKNSWGETWGESGYIRILRDE----GLCGIATEASYP 343
++NSWG WGE GYIR+ RD GLCGIA + YP
Sbjct: 298 VRNSWGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/320 (48%), Positives = 204/320 (63%), Gaps = 18/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S + +E+W + RT L +K R +FK N+ ++ NK ++ YKL N+F+D
Sbjct: 33 EESFWDLYERWRSY--RTVSRSLGDKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFAD 89
Query: 94 LTNEEFRASYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQ 149
+TN EFR++Y G ++R R + TF Y+ V VP S DWR+ GAVT +K+Q
Sbjct: 90 MTNHEFRSTYAGSKVNHHRMFQGTPRGNG---TFMYEKVGSVPPSADWRKNGAVTGVKDQ 146
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIEN 208
G CGSCWAFS V AVEGI QI KL+ LSEQ+LVDC T N GC+GGLM+ AFE+I +
Sbjct: 147 GQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQK 206
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
G+ TE++YPY + GTCD K A +I +E++P DE+ALL+AV QPVSV ++A
Sbjct: 207 GGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAG 266
Query: 269 GQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR-- 326
G F+FY GV +C +HGVA+VG+GT DG YW ++NSWG WGE GYIR
Sbjct: 267 GFDFQFYFEGVFTGDCSTELNHGVAIVGYGTT--VDGTNYWTVRNSWGPEWGEQGYIRMQ 324
Query: 327 --ILRDEGLCGIATEASYPV 344
I + EGLCGIA ASYP+
Sbjct: 325 RSIFKKEGLCGIAMMASYPI 344
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 219/341 (64%), Gaps = 23/341 (6%)
Query: 16 IIILVITCASQVV--------SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+I+LV+ A+ GR++ I E W A+HG++Y +LEKA RL IF
Sbjct: 10 LILLVVVGATPFAIARPAALEDGRALE---IKNMFEDWAAKHGKSYSSDLEKARRLMIFS 66
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG-YNRPVPSVSRQSSRPSTFKYQ 126
L YIEK N + N T+ LG N+FSDLTN EFRA + G + RP Q P+ +
Sbjct: 67 DTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRP----RYQDRLPAEDEDV 122
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+V+ +PTS+DWR+KGAVT IK+QG CGSCWAFSA+A++E + +L+ LSEQQL+DC
Sbjct: 123 DVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMDC 182
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKA--AAATIGKYEDL 244
T + GC GGLM+ AF+++++N G+ TEA YPY G+C+ K A I ++ +
Sbjct: 183 DTVDAGCDGGLMETAFKFVVKNGGVTTEASYPYTGSVGSCNANKVAIINKVAEITGFKVV 242
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
+ AL++AV+K PV+V + S + F+ YK G+L+ +CGD+ DHGV ++G+GT E
Sbjct: 243 TEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSLDHGVLLIGYGT---EG 299
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD--EGLCGIATEASYP 343
G YW+IKNSWG +WGE G+++I R +G+CG+ ++SYP
Sbjct: 300 GMPYWIIKNSWGTSWGEDGFMKIERKDGDGICGMNGDSSYP 340
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 153/291 (52%), Positives = 187/291 (64%), Gaps = 14/291 (4%)
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG----YNRPVPSVSRQSS 118
+FK N+ I + N+ + YKL N F D+T +EFR Y G ++R + SS
Sbjct: 70 FNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSS 128
Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
++F Y + DVP S+DWR+KGAVT +K+QG CGSCWAFS +AAVEGI I L L
Sbjct: 129 ASASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSL 188
Query: 179 SEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
SEQQLVDC T N GC+GGLMD AF+YI ++ G+A E YPY+ Q +C +K A T
Sbjct: 189 SEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASC--KKSPAPVVT 246
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I YED+P DE AL +AV QPVSV +EASG F+FY GV + CG DHGVA VG+
Sbjct: 247 IDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGY 306
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
G DG KYWL+KNSWG WGE GYIR+ RD EG CGIA EASYPV
Sbjct: 307 GVT--ADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPV 355
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/327 (47%), Positives = 206/327 (62%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG++Y E+ R F+ NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 84 YKLGTNEFSDLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R +Y G N+P R+ + + +P S+DWR KGA
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKA 201
V IK+QG CGSCWAFSA+AAVE I QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F++II N G+ TE DYPY+ + CD ++ A TI YED+ E +L +AV QPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPV 260
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
SV +EA G+AF+ Y G+ +CG DHGVA VG+GT E+G YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317
Query: 322 SGYIRILRD----EGLCGIATEASYPV 344
SGY+R+ R+ G CGIA E SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 300 bits (769), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 204/338 (60%), Gaps = 15/338 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F +L+++ A +V+ + + +E W+ + G++Y EK MR IFK NL
Sbjct: 13 LFFSTLLILSSALDIVNSAQRTNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFKDNLRI 72
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV- 131
I+ N + NR++ LG N F+DLT+EE+R++Y G+ S ++ S V DV
Sbjct: 73 IDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFK------SGPKAKVSNRYVPKVGDVL 126
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC--STD 189
P +DWR GAV +KNQG C SCWAFSAVAAVEGI +I G L+ LSEQ+LVDC +
Sbjct: 127 PNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSEQELVDCGRTQS 186
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
GC+ G M AF++II N G+ TE +YPY + G C++ + TI YE++P +E
Sbjct: 187 TRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCNRYLQNQKYVTIDDYENVPSNNE 246
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
AL AV QPVSV +E+ G F+ Y G+ CG DHGV +VG+GT E G YW
Sbjct: 247 WALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHGVTIVGYGT---ERGLDYW 303
Query: 310 LIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
++KNSWG WGE+GYIRI R+ G CGIA ASYPV
Sbjct: 304 IVKNSWGTNWGENGYIRIQRNIGGAGKCGIARMASYPV 341
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 146/348 (41%), Positives = 212/348 (60%), Gaps = 13/348 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
++ K F++ + + + + + + S+ + +E+W +QH + + EK R
Sbjct: 1 MECNKVFVLSISLALFIGVVNCIDFTEKDLATDKSLWDLYERWGSQHMVSRAPD-EKKKR 59
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVP--SVSRQSSRP 120
+FK N+ +I + N+ G + YKL NEF+D+TN EF+A G++ + + + R
Sbjct: 60 FNVFKYNVNHINRVNQLG-KPYKLKLNEFADMTNHEFKA---GFDSKILHFRMLKGKRRQ 115
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+ F + TD P SIDWR GAV IKNQG CGSCWAFS + VEGI +I +L+ LSE
Sbjct: 116 TPFTHAKTTDPPPSIDWRTNGAVNPIKNQGRCGSCWAFSTIVGVEGINKIKTNQLVSLSE 175
Query: 181 QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
Q+LVDC TD GC+GGLM+ +E+I E G+ TE YPY G CD K + I
Sbjct: 176 QELVDCETDCEGCNGGLMENGYEFIKETGGVTTEQIYPYFARNGRCDISKRNSPVVKIDG 235
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
+E++P DE A+L+AV QPVS+ ++A G F+FY +GV N CG +HGVA+VG+GT
Sbjct: 236 FENVPANDESAMLRAVANQPVSIAIDAGGLNFQFYSQGVFNGACGTELNHGVAIVGYGTT 295
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
+DG YW+++NSWG WGE GY+R+ R EGLCG+A +ASYP+
Sbjct: 296 --QDGTNYWIVRNSWGTGWGEQGYVRMQRGVNVPEGLCGLAMDASYPI 341
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 162/351 (46%), Positives = 222/351 (63%), Gaps = 18/351 (5%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
++ +K I + + +I + E S+ +E+W + H T ++ EK R
Sbjct: 1 MEMKKLLFISLSLALIFTVANTFDFNEHDLESEKSLWNLYERWRSHHTVT-RNLDEKHNR 59
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT----GYNRPVPSVSRQSS 118
+FK N+ ++ NK ++ YKL N+F D+TN EFR Y ++R +S ++
Sbjct: 60 FNVFKANVMHVHNTNKL-DKPYKLKLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHENG 118
Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
TF Y+N DVP+SIDWR KGAVT +K+QG CGSCWAFS +AAVEGI QI KL+ L
Sbjct: 119 ---TFMYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSL 175
Query: 179 SEQQLVDCST-DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
SEQQLVDC T +N GC+GGLM+ AFE+I +N G+ TE++YPY + GTCD +KE A +
Sbjct: 176 SEQQLVDCDTEENEGCNGGLMEYAFEFIKQN-GITTESNYPYAAKDGTCDVEKEDKAVSI 234
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
G +E++P +E ALL+A KQPVSV ++A G F+FY GV C + +HGVA+VG+
Sbjct: 235 DG-HENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAIVGY 293
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
G +D KYW++KNSWG WGE GYIR+ R EGLCGIA EASYP+
Sbjct: 294 GVT--QDRTKYWIMKNSWGSEWGEQGYIRMQRGISSREGLCGIAMEASYPI 342
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 143/339 (42%), Positives = 209/339 (61%), Gaps = 20/339 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F+ + L + AS + R ++++ E+WMA++GR YKD EK R IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPV-----PSVSRQSSRPSTFKYQN 127
IE N +Y LG N+F+D+TN EF YTG + P+ P VS F N
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGVSLPLNFKREPVVS--------FDDVN 119
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
++ V SIDWR+ GAVT +K+Q CGSCWAFSA+A VEGI +I G L+ LSEQ+++DC+
Sbjct: 120 ISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCA 179
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
+NGC GG +D A+++II N G+A+EADYPYQ +G C +A G Y +
Sbjct: 180 V-SNGCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDCTANSWPNSAYITG-YSYVRSN 237
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
DE ++ AV QP++ ++ASG F++Y GV + CG + +H + ++G+G ++ G +
Sbjct: 238 DESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTQ 295
Query: 308 YWLIKNSWGETWGESGYIRILR---DEGLCGIATEASYP 343
YW++KNSWG +WGE GY+R+ R GLCGIA + YP
Sbjct: 296 YWIVKNSWGSSWGERGYVRMARGVSSSGLCGIAMDPLYP 334
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 153/317 (48%), Positives = 202/317 (63%), Gaps = 40/317 (12%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEF 91
SMH+ + E E WM++HG+TY+ EK RL +FK NL +I++ N++ TY L NEF
Sbjct: 39 SMHK--LTELFESWMSKHGKTYESIEEKLHRLEVFKDNLMHIDRRNRDVT-TYWLALNEF 95
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+DL++EEF++ R EKGAV +KNQG
Sbjct: 96 ADLSHEEFKSKLAQIRR-----------------------------LEKGAVAPVKNQGS 126
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKG 210
CGSCWAFS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF+YI+ N G
Sbjct: 127 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTSFNSGCNGGLMDYAFDYIVNNGG 186
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
L E DYPY E+GTCD+++E+ TI Y D+P+ +E +LL+A+ QP+S+ +EASG+
Sbjct: 187 LHKEEDYPYLMEEGTCDEKREEMEVVTISGYHDVPENNEESLLKALAHQPLSIAIEASGR 246
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
F+FY RGV N CG + DHGVA VG+G+++ G Y ++KNSWG WGE GYIR+ R+
Sbjct: 247 DFQFYGRGVFNGPCGTDLDHGVAAVGYGSSK---GLDYIIVKNSWGPKWGEKGYIRMKRN 303
Query: 331 ----EGLCGIATEASYP 343
EGLCGI ASYP
Sbjct: 304 TGKPEGLCGINKMASYP 320
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 141/307 (45%), Positives = 196/307 (63%), Gaps = 9/307 (2%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
E W+ +HG+ Y EK RLTIFK NL +I N E N Y+LG N F+DL+ E++
Sbjct: 65 ESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSE-NLGYRLGLNRFADLSLHEYKEI 123
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
G + P S +K +P S+DWR +GAVT +K+QGHC SCWAFS V
Sbjct: 124 CHGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVG 183
Query: 163 AVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
AVEG+ +I G+L+ LSEQ L++C+ +NNGC GG ++ A+E+I+ N GL T+ DYPY+
Sbjct: 184 AVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIVSNGGLGTDNDYPYKAV 243
Query: 223 QGTCD-KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN 281
G CD + KE I YE+LP DE AL++AV QPV+ +++S + F+ Y+ GV +
Sbjct: 244 NGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYESGVFD 303
Query: 282 AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIA 337
CG N +HGV VVG+GT E+G YW+++NSWG TWGE+GY+++ R+ GLCGIA
Sbjct: 304 GRCGTNLNHGVVVVGYGT---ENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRGLCGIA 360
Query: 338 TEASYPV 344
SYP+
Sbjct: 361 MRVSYPL 367
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 153/342 (44%), Positives = 216/342 (63%), Gaps = 35/342 (10%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR YKD+ EKA R +FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR++ T +PS +R P+ F+ +NV
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGF-IPSTTRV---PTGFRNENVNIDA 122
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+P ++DWR KG VT IK+QG CG CWAFSAVAA+E +LVDC
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAME----------------ELVDCDVHG 166
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKA-AAATIGKYEDLPKG 247
++ GC GGLMD AF++II+N GL TE++YPY DK K + + A+I YED+P
Sbjct: 167 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVD---DKFKSVSNSVASIKGYEDVPAN 223
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
+E AL++AV QPVSV V+ F+FYK GV+ CG + DHG+ +G+G A DG K
Sbjct: 224 NEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKA--SDGTK 281
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
YWL+KNSWG TWGE+G++R+ +D G+CG+A E SYP A
Sbjct: 282 YWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 323
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 150/315 (47%), Positives = 205/315 (65%), Gaps = 19/315 (6%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLTIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
+W +HG++ + ++ R IFK NL +I+ N+ N TYKLG F++LTN+E
Sbjct: 6 RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDE 65
Query: 99 FRASYTG-YNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKNQGHCGS 154
+R+ Y G PV +++ ++ KY NV +VP ++DWR+KGAV IK+QG CGS
Sbjct: 66 YRSLYLGARTEPVRRITK--AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGS 123
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLAT 213
CWAFS AAVEGI +I G+L+ LSEQ+LVDC N GC+GGLMD AF++I++N GL T
Sbjct: 124 CWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNT 183
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
E DYPY G C+ + + TI YED+P DE AL +AV+ QPVSV ++A G+AF+
Sbjct: 184 EKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQ 243
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--- 330
Y+ G+ +CG N DH V VG+G+ E+G YW+++NSWG WGE GYIR+ R+
Sbjct: 244 HYQSGIFTGKCGTNMDHAVVAVGYGS---ENGVDYWIVRNSWGTRWGEDGYIRMERNVAS 300
Query: 331 -EGLCGIATEASYPV 344
G CGIA EASYPV
Sbjct: 301 KSGKCGIAIEASYPV 315
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 145/313 (46%), Positives = 205/313 (65%), Gaps = 17/313 (5%)
Query: 42 HEQWMAQHGRTYKDEL--EKAMRLTIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNE 97
++ W+A++G + L E R +F NL++++ N + ++LG N F+DLTNE
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EFRA++ G R + +++ V ++P S+DWREKGAV +KNQG CGSCWA
Sbjct: 112 EFRATFLG----AKVAERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEA 215
FSAV+ VE I Q+ G++I LSEQ+LV+CST+ N+GC+GGLMD AF++II+N G+ TE
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
DYPY+ G CD +E A +I +ED+P+ DE +L +AV QPVSV +EA G+ F+ Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----E 331
GV + CG + DHGV VG+GT ++G YW+++NSWG WGESGY+R+ R+
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTT 344
Query: 332 GLCGIATEASYPV 344
G CGIA ASYP
Sbjct: 345 GKCGIAMMASYPT 357
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 155/349 (44%), Positives = 210/349 (60%), Gaps = 17/349 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMRLT 64
+ +F I+++ Q G E ++ + +E+W H + + E R
Sbjct: 1 MKLFFIVLISFLSLLQASKGFDFDEKELETEENVWKLYERWRGHHSVS-RASHEAIKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST-F 123
+F+ N+ ++ + NK+ N+ YKL N F+D+T+ EFR+SY G N + R R S F
Sbjct: 60 VFRHNVLHVHRTNKK-NKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGF 118
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
Y+NVT VP+S+DWREKGAVT +KNQ CGSCWAFS VAAVEGI +I KL+ LSEQ+L
Sbjct: 119 MYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQEL 178
Query: 184 VDCST-DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQ-GTCDKQKEKAAAATIGKY 241
VDC T +N GC+GGLM+ AFE+I N G+ TE YPY C TI +
Sbjct: 179 VDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGH 238
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
E +P+ DE LL+AV QPVSV ++A F+ Y GV ECG +HGV +VG+G E
Sbjct: 239 EHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYG--E 296
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVAM 346
++G KYW+++NSWG WGE GY+RI R +EG CGIA EASYP +
Sbjct: 297 TKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKL 345
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 143/311 (45%), Positives = 197/311 (63%), Gaps = 9/311 (2%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I E W QHG+TY + EK RL +F+ N +++ + N +GN +Y L N F+DLT+
Sbjct: 26 IAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHH 85
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+AS G + S S R + V DVP S+DWR+ GAVT +K+QG+CG+CW+
Sbjct: 86 EFKASRLGLS-SAASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWS 144
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FSA A+EGI +I G L+ LSEQ+LVDC NNGC GG+MD AF+++I+N G+ TE D
Sbjct: 145 FSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEED 204
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPYQ +C+K+K K TI Y D+P+ +E LL+AV QPVSV + S +AF+ Y
Sbjct: 205 YPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYS 264
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
+G+ C + DH V +VG+G+ E+G YW++KNSWG WG GY+ + R+ G
Sbjct: 265 KGIFTGPCSTSLDHAVLIVGYGS---ENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRG 321
Query: 333 LCGIATEASYP 343
LCGI ASYP
Sbjct: 322 LCGINMLASYP 332
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 153/319 (47%), Positives = 205/319 (64%), Gaps = 17/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ +++ QW+ +H R Y EK R IFK NL YI NK+ ++Y LG N+FSDL
Sbjct: 45 DDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQ-EKSYWLGLNKFSDL 103
Query: 95 TNEEFRASYTGYNRPVPSVS--RQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
T++EFRA Y G RP R R F Y++V +DWR+KGAV+ +K+QG C
Sbjct: 104 THDEFRALYLGI-RPAGRAHGLRNGDR---FIYEDVV-AEEMVDWRKKGAVSDVKDQGSC 158
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENKGL 211
GSCWAFSA+ +VEG+ I G+LI LSEQ+LVDC N GC+GGLMD AF++II+N G+
Sbjct: 159 GSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDYAFDFIIKNGGI 218
Query: 212 ATEADYPYQQEQGTCDK-QKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
TE DYPY+ G CD+ +KE + I Y+D+P E +LL+AV+K PVSV +EA G+
Sbjct: 219 DTEEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKNPVSVAIEAGGR 278
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR- 329
F+ Y+ GV CG + DHGV VG+GT ++DG YW++KNSWG +WGE GYIR+ R
Sbjct: 279 DFQHYQGGVFTGPCGTDLDHGVLAVGYGT--DDDGVNYWIVKNSWGPSWGEKGYIRMERM 336
Query: 330 ----DEGLCGIATEASYPV 344
G CGI E S+P+
Sbjct: 337 GSNSTSGKCGINIEPSFPI 355
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 155/316 (49%), Positives = 202/316 (63%), Gaps = 20/316 (6%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLTIFKQNLEYIEKAN-KEGNRTYKLGTNEFSDLTNEE 98
QW A HG+T + ++ R IFK NL +I+ N K N TYKLG +F+DLTNEE
Sbjct: 51 QWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLTKFTDLTNEE 110
Query: 99 FRASYTG-YNRPVPSVSRQSSRPSTFKYQNVTD---VPTSIDWREKGAVTHIKNQGHCGS 154
+R+ Y G PV +++ ++ KY D VP ++DWR KGAV IK+QG CGS
Sbjct: 111 YRSLYLGARTEPVRRIAK--AKNVNQKYSAAVDGKEVPETVDWRLKGAVNPIKDQGTCGS 168
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLAT 213
CWAFS AAVEGI +I G+LI LSEQ+LVDC N GC+GGLMD AF++I++N GL T
Sbjct: 169 CWAFSTAAAVEGINKIVTGELISLSEQELVDCDNSYNQGCNGGLMDYAFQFIMKNGGLKT 228
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
E DYPY+ G C+ + A +I YED+P DE AL +A++ QPVSV +EA G+ F+
Sbjct: 229 EKDYPYRGFGGKCNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVSVAIEAGGRIFQ 288
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--- 330
Y+ G+ CG N DH V VG+G+ E+G YW+++NSWG WGE GYIR+ R+
Sbjct: 289 HYQTGIFTGNCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAS 345
Query: 331 --EGLCGIATEASYPV 344
G CGIA EASYPV
Sbjct: 346 SKSGKCGIAVEASYPV 361
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 150/324 (46%), Positives = 206/324 (63%), Gaps = 14/324 (4%)
Query: 28 VSGRSMHE-PSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKL 86
V+ ++ H P V+ E+W+ ++ + Y EK R IF NL+++++ N N++Y+L
Sbjct: 22 VTAKADHRNPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYEL 81
Query: 87 GTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
G F+DLTNEEFRA Y R +R S + + + +P +DWR KGAV +
Sbjct: 82 GLTRFADLTNEEFRAIYL---RSKMERTRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPV 138
Query: 147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYI 205
K+QG CGSCWAFSA+ AVEGI QI G+L+ LSEQ+LVDC T NNGC GGLMD AF++I
Sbjct: 139 KDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFI 198
Query: 206 IENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVC 264
I N G+ TE DYPY + C+ K+ TI YED+P+ +E++L +A+ QP+SV
Sbjct: 199 ISNGGIDTEEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPE-NENSLKKALANQPISVA 257
Query: 265 VEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
+EA G+ F+ YK GV CG DHGV VG+GT+E +D YW+I+NSWG WGESGY
Sbjct: 258 IEAGGRGFQLYKSGVFTGTCGTALDHGVVAVGYGTSEGQD---YWIIRNSWGSNWGESGY 314
Query: 325 IRILRD----EGLCGIATEASYPV 344
I++ R+ G CG+A ASYP
Sbjct: 315 IKLQRNIKDSSGKCGVAMMASYPT 338
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 149/318 (46%), Positives = 204/318 (64%), Gaps = 14/318 (4%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E + +EQW+ ++ + Y EK R IFK NL+++++ N +RT+++G F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
LTNEEFRA Y R ++ S + + Y+ +P +DWR GAV +K+QG+CG
Sbjct: 96 LTNEEFRAIYL---RKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGL 211
SCWAFSAV AVEGI QIT G+LI LSEQ+LVDC N GC GG+M+ AFE+I++N G+
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 212 ATEADYPYQ-QEQGTCDKQK-EKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
T+ DYPY + G C+ K TI YED+P+ DE +L +AV QPVSV +EAS
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
QAF+ YK GV+ CG + DHGV VVG+G+ ED YW+I+NSWG WG+SGY+++ R
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGED---YWIIRNSWGLNWGDSGYVKLQR 329
Query: 330 D----EGLCGIATEASYP 343
+ G CGIA SYP
Sbjct: 330 NIDDPFGKCGIAMMPSYP 347
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 149/318 (46%), Positives = 204/318 (64%), Gaps = 14/318 (4%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E + +EQW+ ++ + Y EK R IFK NL+++++ N +RT+++G F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
LTNEEFRA Y R ++ S + + Y+ +P +DWR GAV +K+QG+CG
Sbjct: 96 LTNEEFRAIYL---RKKMERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGL 211
SCWAFSAV AVEGI QIT G+LI LSEQ+LVDC N GC GG+M+ AFE+I++N G+
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 212 ATEADYPYQ-QEQGTCDKQK-EKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
T+ DYPY + G C+ K TI YED+P+ DE +L +AV QPVSV +EAS
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
QAF+ YK GV+ CG + DHGV VVG+G+ ED YW+I+NSWG WG+SGY+++ R
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGSTSGED---YWIIRNSWGLNWGDSGYVKLQR 329
Query: 330 D----EGLCGIATEASYP 343
+ G CGIA SYP
Sbjct: 330 NIDDPFGKCGIAMMPSYP 347
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 151/338 (44%), Positives = 220/338 (65%), Gaps = 17/338 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+F+ + L + AS S S EPS ++++ E+WM ++GR YKD EK R IFK N+
Sbjct: 8 VFLFLFLCVMWASP--SAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT-GYNRPVPSVSRQSSRPSTFKYQNVT 129
+IE N +Y LG N+F+D+TN EF A YT G +RP+ ++ R+ +F +++
Sbjct: 66 NHIETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPL-NIEREPV--VSFDDVDIS 122
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
VP SIDWR+ GAVT +KNQ CG+CWAF+A+A VE I +I G L LSEQQ++DC+
Sbjct: 123 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCA-K 181
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
GC GG +AFE+II NKG+A+ A YPY+ +GTC K +A I Y +P+ +E
Sbjct: 182 GYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTC-KTNGVPNSAYITGYARVPRNNE 240
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
+++ AV+KQP++V V+A+ F++YK GV N CG + +H V +G+G ++ +G KYW
Sbjct: 241 SSMMYAVSKQPITVAVDANAN-FQYYKSGVFNGPCGTSLNHAVTAIGYG--QDSNGKKYW 297
Query: 310 LIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
++KNSWG WGE+GYIR+ RD G+CGIA ++ YP
Sbjct: 298 IVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 297 bits (761), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 153/327 (46%), Positives = 205/327 (62%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG++Y E+ R F+ NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 84 YKLGTNEFSDLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R +Y G N+P R+ + + +P S+DWR KGA
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKA 201
V IK+Q GSCWAFSA+AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 141 VAEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F++II N G+ TE DYPY+ + CD ++ A TI YED+ E +L +AV QPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 260
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
SV +EA G+AF+ Y G+ +CG DHGVA VG+GT E+G YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317
Query: 322 SGYIRILRD----EGLCGIATEASYPV 344
SGY+R+ R+ G CGIA E SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 297 bits (760), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 199/312 (63%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I E + W +HG+TY E E+ R+ IFK N +++ + N N TY L N F+DLT+
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+AS G + PSV S S VP S+DWR+KGAVT++K+QG CG+CW+
Sbjct: 88 EFKASRLGLSVSAPSVIMASKGQS---LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FSA A+EGI QI G LI LSEQ+L+DC N GC+GGLMD AFE++I+N G+ TE D
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 204
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPYQ+ GTC K K K TI Y + DE AL++AV QPVSV + S +AF+ Y
Sbjct: 205 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 264
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
RG+ + C + DH V +VG+G+ ++G YW++KNSWG++WG G++ + R+ +G
Sbjct: 265 RGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321
Query: 333 LCGIATEASYPV 344
+CGI ASYP+
Sbjct: 322 VCGINMLASYPI 333
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 296 bits (758), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 152/315 (48%), Positives = 203/315 (64%), Gaps = 18/315 (5%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLTIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
QW A+HG+T + ++ R IFK NL +I+ N++ N TYKLG +F+DLTN+E
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDE 110
Query: 99 FRASYTGYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKNQGHCGSC 155
+R Y G R P+ ++ KY N +VP ++DWR+KGAV IK+QG CGSC
Sbjct: 111 YRKLYLGA-RTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSC 169
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATE 214
WAFS AAVEGI +I G+LI LSEQ+LVDC N GC+GGLMD AF++I++N GL TE
Sbjct: 170 WAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTE 229
Query: 215 ADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
DYPY+ G C+ + + +I YED+P DE AL +A++ QPVSV +EA G+ F+
Sbjct: 230 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQH 289
Query: 275 YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---- 330
Y+ G+ CG N DH V VG+G+ E+G YW+++NSWG WGE GYIR+ R+
Sbjct: 290 YQSGIFTGSCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346
Query: 331 -EGLCGIATEASYPV 344
G CGIA EASYPV
Sbjct: 347 KSGKCGIAVEASYPV 361
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 296 bits (758), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 149/315 (47%), Positives = 204/315 (64%), Gaps = 19/315 (6%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLTIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
+W +HG++ + ++ R IFK NL +I+ N+ N TYKLG F++LTN+E
Sbjct: 6 RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDE 65
Query: 99 FRASYTG-YNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKNQGHCGS 154
+R+ Y G PV +++ ++ KY N +VP ++DWR+KGAV IK+QG CGS
Sbjct: 66 YRSLYLGARTEPVRRITK--AKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCGS 123
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLAT 213
CWAFS AAVEGI +I G+L+ LSEQ+LVDC N GC+GGLMD AF++I++N GL T
Sbjct: 124 CWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNT 183
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
E DYPY G C+ + + TI YED+P DE AL +AV+ QPVSV ++A G+AF+
Sbjct: 184 EKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQ 243
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--- 330
Y+ G+ +CG N DH V VG+G+ E+G YW+++NSWG WGE GYIR+ R+
Sbjct: 244 HYQSGIFTGKCGTNMDHAVVAVGYGS---ENGVDYWIVRNSWGTRWGEDGYIRMERNVAS 300
Query: 331 -EGLCGIATEASYPV 344
G CGIA EASYPV
Sbjct: 301 KSGKCGIAIEASYPV 315
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 148/337 (43%), Positives = 206/337 (61%), Gaps = 12/337 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F +LV++ A + + +E W+ ++G++Y E R IFK+ L +
Sbjct: 13 LFFSTLLVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
I++ N + NR+Y++G N+F+D TNEEF+++Y G+ S S + + ++ + +P
Sbjct: 73 IDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFT----SGSNKMKVSNRYEPRVGQVLP 128
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN- 191
+DWR GAV IK+QG CGSCWAFSA+A VEGI +I G LI LSEQ+LVDC N
Sbjct: 129 DYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRTQNT 188
Query: 192 -GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GG + F++II N G+ TEA+YPY E G C+ + A+I YE++P +E
Sbjct: 189 RGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCNLDLQNEKYASIDTYENVPYNNEW 248
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
AL AV QPVSV +EA+G AF+ Y G+ CG DH V +VG+GT E G YW+
Sbjct: 249 ALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYGT---EGGIDYWI 305
Query: 311 IKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
+KNSW TWGE GYIRILR+ G CGIAT+ SYPV
Sbjct: 306 VKNSWDTTWGEEGYIRILRNVGGAGTCGIATKPSYPV 342
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 151/329 (45%), Positives = 203/329 (61%), Gaps = 12/329 (3%)
Query: 23 CASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGN 81
CA+ R + + ++ + +E+W H + EK R FK N+ YI + NK
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84
Query: 82 RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREK 140
L N F D+ EEFRA++ G + ++ P F Y+ V D+P ++DWR K
Sbjct: 85 GYAPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRK 142
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMD 199
GAVT +K+QG CGSCWAFS V +VEGI I G+L+ LSEQ+L+DC T DN+GC GGLM+
Sbjct: 143 GAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLME 202
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ 259
AFEYI + G+ TE+ YPY+ GTCD + + I ++++P E AL +AV Q
Sbjct: 203 NAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQ 262
Query: 260 PVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
PVSV ++A Q+F+FY GV +CG + DHGVAVVG+G E DG +YW++KNSWG W
Sbjct: 263 PVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYG--ETNDGTEYWIVKNSWGTAW 320
Query: 320 GESGYIRILRDE----GLCGIATEASYPV 344
GE GYIR+ RD GLCGIA EASYPV
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 152/351 (43%), Positives = 220/351 (62%), Gaps = 27/351 (7%)
Query: 13 MFVIIILVITCAS----QVVS---GRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKA 60
+ ++ +++ +CA+ VVS +H E S++ E WM +HG+ Y EK
Sbjct: 10 ILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLI--FESWMVKHGKVYGSVAEKE 67
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
RLTIF+ NL +I N E N +Y+LG F+DL+ E++ G + P P
Sbjct: 68 RRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGAD-PRPP-RNHVFMT 124
Query: 121 STFKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
S+ +Y+ D +P S+DWR +GAVT +K+QGHC SCWAFS V AVEG+ +I G+L+ L
Sbjct: 125 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 184
Query: 179 SEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD-KQKEKAAAAT 237
SEQ L++C+ +NNGC GG ++ A+E+I++N GL T+ DYPY+ G CD + KE
Sbjct: 185 SEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVM 244
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I YE+LP DE AL++AV QPV+ +++S + F+ Y+ GV + CG N +HGV VVG+
Sbjct: 245 IDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGY 304
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
GT E+G YWL+KNS G TWGE+GY+++ R+ GLCGIA ASYP+
Sbjct: 305 GT---ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 295 bits (756), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 152/315 (48%), Positives = 202/315 (64%), Gaps = 18/315 (5%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLTIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
QW A+HG+T + ++ R IFK NL +I+ N+ N TYKLG +F+DLTN+E
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDE 110
Query: 99 FRASYTGYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKNQGHCGSC 155
+R Y G R P+ ++ KY N +VP ++DWR+KGAV IK+QG CGSC
Sbjct: 111 YRKLYLGA-RTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSC 169
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATE 214
WAFS AAVEGI +I G+LI LSEQ+LVDC N GC+GGLMD AF++I++N GL TE
Sbjct: 170 WAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTE 229
Query: 215 ADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
DYPY+ G C+ + + +I YED+P DE AL +A++ QPVSV +EA G+ F+
Sbjct: 230 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQH 289
Query: 275 YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---- 330
Y+ G+ CG N DH V VG+G+ E+G YW+++NSWG WGE GYIR+ R+
Sbjct: 290 YQSGIFTGSCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346
Query: 331 -EGLCGIATEASYPV 344
G CGIA EASYPV
Sbjct: 347 KSGKCGIAVEASYPV 361
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 295 bits (756), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 201/312 (64%), Gaps = 12/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ E W+ ++G++Y EK R IFK NL ++++ N + NR+YK+G N+FSDLT+
Sbjct: 44 VIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTDA 103
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
E+ + Y G + R ++ ++ + +P S+DWR+KGAV +KNQG+CGSCW
Sbjct: 104 EYSSIYLGTKFNI----RMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGNCGSCWT 159
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEA 215
F+++AAVEGI +I G LI LSEQ++VDC NNGC+GG + A+++II N G+ TEA
Sbjct: 160 FASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGGINTEA 219
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
+YPY G CD+ K+ TI +YE++P +E AL +AV QPVSV + ++ AF+ Y
Sbjct: 220 NYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNSTAFKSY 279
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EG 332
K G+ N CG DHGV +VG+GT E G YW+++NSWG WGESGY+R+ R+ G
Sbjct: 280 KSGIFNGPCGPRIDHGVTIVGYGT---EGGKDYWIVRNSWGPNWGESGYVRMQRNVGGSG 336
Query: 333 LCGIATEASYPV 344
C IA YPV
Sbjct: 337 KCFIARAPVYPV 348
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 152/351 (43%), Positives = 220/351 (62%), Gaps = 27/351 (7%)
Query: 13 MFVIIILVITCAS----QVVS---GRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKA 60
+ ++ +++ +CA+ VVS +H E S++ E WM +HG+ Y EK
Sbjct: 3 ILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLI--FESWMVKHGKVYGSVAEKE 60
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
RLTIF+ NL +I N E N +Y+LG F+DL+ E++ G + P P
Sbjct: 61 RRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGAD-PRPP-RNHVFMT 117
Query: 121 STFKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
S+ +Y+ D +P S+DWR +GAVT +K+QGHC SCWAFS V AVEG+ +I G+L+ L
Sbjct: 118 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 177
Query: 179 SEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD-KQKEKAAAAT 237
SEQ L++C+ +NNGC GG ++ A+E+I++N GL T+ DYPY+ G CD + KE
Sbjct: 178 SEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVM 237
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I YE+LP DE AL++AV QPV+ +++S + F+ Y+ GV + CG N +HGV VVG+
Sbjct: 238 IDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGY 297
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
GT E+G YWL+KNS G TWGE+GY+++ R+ GLCGIA ASYP+
Sbjct: 298 GT---ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 345
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 144/313 (46%), Positives = 204/313 (65%), Gaps = 17/313 (5%)
Query: 42 HEQWMAQHGRTYKDEL--EKAMRLTIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNE 97
++ W+A++G + L E R +F NL++++ N + ++LG N F+DLTNE
Sbjct: 51 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNE 110
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EFRA++ G R + +++ V ++P S+DWREKGAV +KNQG CGSCWA
Sbjct: 111 EFRATFLG----AKVAERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 166
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEA 215
FSAV+ VE I Q+ G++I LSEQ+LV+CST+ N+GC+GGLM AF++II+N G+ TE
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTED 226
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
DYPY+ G CD +E A +I +ED+P+ DE +L +AV QPVSV +EA G+ F+ Y
Sbjct: 227 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 286
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----E 331
GV + CG + DHGV VG+GT ++G YW+++NSWG WGESGY+R+ R+
Sbjct: 287 HSGVFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTT 343
Query: 332 GLCGIATEASYPV 344
G CGIA ASYP
Sbjct: 344 GKCGIAMMASYPT 356
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 151/329 (45%), Positives = 203/329 (61%), Gaps = 12/329 (3%)
Query: 23 CASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGN 81
CA+ R + + ++ + +E+W H + EK R FK N+ YI + NK
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84
Query: 82 RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREK 140
L N F D+ EEFRA++ G + ++ P F Y+ V D+P ++DWR K
Sbjct: 85 GYPPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRK 142
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMD 199
GAVT +K+QG CGSCWAFS V +VEGI I G+L+ LSEQ+L+DC T DN+GC GGLM+
Sbjct: 143 GAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLME 202
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ 259
AFEYI + G+ TE+ YPY+ GTCD + + I ++++P E AL +AV Q
Sbjct: 203 NAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQ 262
Query: 260 PVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
PVSV ++A Q+F+FY GV +CG + DHGVAVVG+G E DG +YW++KNSWG W
Sbjct: 263 PVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYG--ETNDGTEYWIVKNSWGTAW 320
Query: 320 GESGYIRILRDE----GLCGIATEASYPV 344
GE GYIR+ RD GLCGIA EASYPV
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 148/346 (42%), Positives = 202/346 (58%), Gaps = 31/346 (8%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F +L+++ A + + ++ +E W+ + G++Y EK MR IFK+NL
Sbjct: 15 LFFSTLLILSSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRI 74
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGY---------NRPVPSVSRQSSRPSTF 123
I+ N + NR+Y LG N F+DLT+EE+R++Y G+ NR VP V
Sbjct: 75 IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGPKAKVSNRYVPKVG--------- 125
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
+P +DWR GAV +K+QG C SCWAFSAVAAVEGI +I G LI LSEQ+L
Sbjct: 126 -----VVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQEL 180
Query: 184 VDC--STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
VDC + GC+ G M+ AF++II+N G+ TE +YPY + G CD ++ TI Y
Sbjct: 181 VDCGRTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRKNQRYVTIDNY 240
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
E LP +E L AV QP++V +E+ G F+ Y G+ CG DHGV +VG+GT
Sbjct: 241 EQLPANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYGT-- 298
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
E G YW++KNSWG WGE+GYIRI R+ G CGIA SYPV
Sbjct: 299 -ERGLDYWIVKNSWGTNWGENGYIRIQRNIGGAGKCGIAMVPSYPV 343
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 155/339 (45%), Positives = 206/339 (60%), Gaps = 28/339 (8%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG+ Y E+ R F+ NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 84 YKLGTNEFSDLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R +Y G N+P R+ + + +P S+DWR KGA
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKA 201
V IK+QG CGSCWAFSA+AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQK------------EKAAAATIGKYEDLPKGDE 249
F++II N G+ TE DYPY+ + CD + + A TI YED+ E
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSE 260
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
+L +AV QPVSV +EA G+AF+ Y G+ +CG DHGVA VG+GT E+G YW
Sbjct: 261 TSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYW 317
Query: 310 LIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+++NSWG++WGESGY+R+ R+ G CGIA E SYP+
Sbjct: 318 IVRNSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPL 356
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/349 (42%), Positives = 212/349 (60%), Gaps = 14/349 (4%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR++Y G+ S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+ ++ + +P+ +DWR GAV IK+QG CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRFGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDCSTDNN--GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q+L+DC N GC+GG + F++II N G+ TE +YPY + G C+ + TI
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTI 236
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YE++P +E AL AVT QPVSV ++A+G AF+ Y G+ CG DH V +VG+G
Sbjct: 237 DTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG 296
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
T E G YW++KNSW TWGE GY+RILR+ G CGIAT SYPV
Sbjct: 297 T---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 150/340 (44%), Positives = 214/340 (62%), Gaps = 21/340 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
+I + ++I ++ + +S+ ++ E+++ W ++ YKD+ E+ + IFK N
Sbjct: 10 LINILIVIWVMFPSNQNQENDQSL---TLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ YI+ N GN++YKL N F+DL E S G+ + + + S FKY+N+T
Sbjct: 67 VAYIDSFNAAGNKSYKLTINRFADLPTE---PSDDGFKKR----KLEPTTSSLFKYKNIT 119
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
D+P ++DWR++GAVT +KNQ CGSCWAFSAV A+EGI QIT G L+ LSEQ+LVD
Sbjct: 120 DIPAAVDWRKRGAVTPVKNQRECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVRS 179
Query: 190 N--NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N NGC+GG + AFE+++EN G+ATEA YPY+ +G + K+ + I YE +P+
Sbjct: 180 NWTNGCNGGYLIDAFEFVLENGGIATEASYPYRGVKG--NNSKKVSRQVQIKSYEQVPRN 237
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
E +LL+ V QPVSV ++ SG RFY G+ ECG +H V +VG+GT+ DG K
Sbjct: 238 SEDSLLKVVANQPVSVGIDISGM-IRFYSSGIFTGECGTKPNHAVIIVGYGTS--NDGTK 294
Query: 308 YWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
YWL+KNSWG WGE YIR+ RD EGLCGI +ASYP
Sbjct: 295 YWLVKNSWGIRWGEKRYIRMKRDIDAKEGLCGIPMDASYP 334
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 147/349 (42%), Positives = 212/349 (60%), Gaps = 14/349 (4%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR++Y G+ S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+ ++ + +P+ +DWR GAV IK+QG CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDCSTDNN--GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q+L+DC N GC+GG + F++II N G+ TE +YPY + G C+ + TI
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYVTI 236
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YE++P +E AL AVT QPVSV ++A+G AF+ Y G+ CG DH V +VG+G
Sbjct: 237 DTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYG 296
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
T E G YW++KNSW TWGE GY+RILR+ G CGIAT SYPV
Sbjct: 297 T---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 144/312 (46%), Positives = 198/312 (63%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I E + W +HG+TY E E+ R+ IFK N +++ + N N TY L N F+DLT+
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+AS G + PSV S S VP S+DWR+KGAVT++K+QG CG+CW+
Sbjct: 88 EFKASRLGLSVSAPSVIMASKGQS---LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FSA A+EGI QI G LI LSEQ+L+DC N GC+GGLMD AFE++I+N G+ TE D
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 204
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPYQ+ GTC K K K TI Y + DE AL++AV QPVSV + S +AF+ Y
Sbjct: 205 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 264
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
G+ + C + DH V +VG+G+ ++G YW++KNSWG++WG G++ + R+ +G
Sbjct: 265 SGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321
Query: 333 LCGIATEASYPV 344
+CGI ASYP+
Sbjct: 322 VCGINMLASYPI 333
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 295 bits (754), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 145/337 (43%), Positives = 215/337 (63%), Gaps = 14/337 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F+ + L + AS + R ++++ E+WMA++GR YKD EK R IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N +Y LG N+F+D+TN EF A YTG + P+ ++ R+ +F +++ VP
Sbjct: 68 IETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGVSLPL-NIEREPV--VSFDDVDISAVP 124
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNG 192
SIDWR GAVT +KN CGSCWAF+A+A VE I +I G LI LSEQQ++DC+ + G
Sbjct: 125 QSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDCAV-SYG 183
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQ--QEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
C GG ++KA+++II NKG+A+ A YPY+ Q QGTC + +A I Y + +E
Sbjct: 184 CDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTC-RINGVPNSAYITGYTRVQSNNER 242
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
+++ AV+ QP++ +EASG F+ YKRGV + CG + +H + ++G+G ++ G K+W+
Sbjct: 243 SMMYAVSNQPIAASIEASGD-FQHYKRGVFSGPCGTSLNHAITIIGYG--QDSSGKKFWI 299
Query: 311 IKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
++NSWG +WGE GYIR+ RD GLCGIA YP
Sbjct: 300 VRNSWGASWGERGYIRMARDVSSSSGLCGIAIRPLYP 336
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 147/349 (42%), Positives = 213/349 (61%), Gaps = 14/349 (4%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR++Y G+ S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+ ++ + +P+ +DWR GAV IK+QG CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDCSTDNN--GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q+L+DC N GC+GG + F++II N G+ TE +YPY + G C+ + + TI
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNEKYVTI 236
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YE++P +E AL AVT QPVSV ++A+G AF+ Y G+ CG DH V +VG+G
Sbjct: 237 DTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG 296
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
T E G YW++KNSW TWGE GY+RILR+ G CGIAT SYPV
Sbjct: 297 T---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 159/325 (48%), Positives = 202/325 (62%), Gaps = 20/325 (6%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ +E+W A+H +D EK+ R +F++N + + N + YKL N F+DL
Sbjct: 42 EESLWALYERWRARH-TVSRDLAEKSRRFNVFRENARLVHEFNLRRDAPYKLRLNRFADL 100
Query: 95 TNEEFRASYTGYN---------RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
T++EFR SY R + + S+F + +PTS+DWREKGAVT
Sbjct: 101 TSDEFRRSYASSRVSHHRMFKPRAANNNDDDDDKGSSFTHGGA--LPTSVDWREKGAVTG 158
Query: 146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEY 204
+K+QG CGSCWAFS +AAVEGI I L LSEQQLVDC T N GC GGLMD AF Y
Sbjct: 159 VKDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCDGGLMDDAFSY 218
Query: 205 IIENKGLATEADYPYQQEQ-GTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSV 263
I ++ G+A E YPY+ Q +C+ +K AA +I YED+P+ DE AL +AV QPV+V
Sbjct: 219 IAKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPVAV 278
Query: 264 CVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
+EA G F+FY GV +CG DHGVA VG+G DG KYW++KNSWGE WGE G
Sbjct: 279 AIEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVT--VDGTKYWIVKNSWGEEWGEKG 336
Query: 324 YIRILRD----EGLCGIATEASYPV 344
YIR+ RD EGLCGIA EASYPV
Sbjct: 337 YIRMKRDVADKEGLCGIAMEASYPV 361
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 147/349 (42%), Positives = 212/349 (60%), Gaps = 14/349 (4%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNTKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR++Y G+ S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+ ++ + +P+ +DWR GAV IK+QG CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDCSTDNN--GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q+L+DC N GC+GG + F++II N G+ TE +YPY + G C+ + TI
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTI 236
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YE++P +E AL AVT QPVSV ++A+G AF+ Y G+ CG DH V +VG+G
Sbjct: 237 DTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG 296
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
T E G YW++KNSW TWGE GY+RILR+ G CGIAT SYPV
Sbjct: 297 T---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 143/310 (46%), Positives = 197/310 (63%), Gaps = 13/310 (4%)
Query: 45 WMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
W A+HG + L E+ R F NL +++ N G ++LG N F+DLTN+EFR
Sbjct: 55 WRAEHGSGNSNSLGEEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFR 114
Query: 101 ASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
A+Y G S ++ +++ V ++P ++DWREKGAV +KNQG CGSCWAFSA
Sbjct: 115 AAYLGVKGAGQRRSARAGVGERYRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFSA 174
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYP 218
V+AVE I Q+ G+L+ LSEQ+LV+C + +NGC+GGLMD AF++II N G+ TE DYP
Sbjct: 175 VSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINNGGIDTEDDYP 234
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRG 278
Y+ G CD + A +I +ED+P+ DE +L +AV QPVSV +EA G+ F+ Y G
Sbjct: 235 YKALDGKCDINRRNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 294
Query: 279 VLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLC 334
V CG DHGV VG+GT E+G YW+++NSWG WGE+GY+R+ R+ G C
Sbjct: 295 VFTGRCGTELDHGVVAVGYGT---ENGKDYWIVRNSWGPKWGEAGYLRMERNINATTGKC 351
Query: 335 GIATEASYPV 344
GIA +SYP
Sbjct: 352 GIAMMSSYPT 361
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 206/340 (60%), Gaps = 42/340 (12%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ I+ C + + + + ++V +HEQWMAQ+ R YKD EKA R
Sbjct: 8 ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRF--------- 58
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
+F+DLTN EFR+ T N+ S + + + F+Y+NV+
Sbjct: 59 -----------------KFADLTNHEFRSVKT--NKGFKSSNMKI--LTGFRYENVSADA 97
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+PT+IDWR KG VT IK+QG CG C AFSAVAA EGI +I+ GKL+ L++Q+LVDC
Sbjct: 98 LPTTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELVDCDVHG 157
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE+ YPY G C+ +AATI YED+P D
Sbjct: 158 EDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCNSGSN--SAATIKGYEDVPAND 215
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E AL++A+ QPVSV V+ FRFY GV+ CG + DHG+A +G+G + DG KY
Sbjct: 216 EAALMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYG--KTSDGTKY 273
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
WL+KNSWG TWGE+GY+R+ +D G+CG+A E SYP
Sbjct: 274 WLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 313
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 142/302 (47%), Positives = 201/302 (66%), Gaps = 10/302 (3%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEF 91
S+H+ ++ E + +H + Y+ EK R IF NL++I++ NK+ + Y LG NEF
Sbjct: 41 SIHK--VIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEF 97
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+DLT+EEF+ + G+ + R+ F+Y++ D+P S+DWR+KGAV+ +KNQG
Sbjct: 98 ADLTHEEFKNKFLGFKGEL--AERKDESIEQFRYRDFVDLPKSVDWRKKGAVSPVKNQGQ 155
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKG 210
CGSCWAFS VAAVEGI QI G L LSEQ+L+DC T NNGC+GGLMD AF Y+ N G
Sbjct: 156 CGSCWAFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMDYAFAYVTRN-G 214
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
L E +YPY +GTCD++++ + TI Y D+P+ +E + L+A+ QP+SV +EASG+
Sbjct: 215 LHKEEEYPYIMSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGR 274
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
F+FY GV + CG DHGVA VG+GT++ G Y +++NSWG WGE GYIR+ R+
Sbjct: 275 DFQFYSGGVFDGHCGTELDHGVAAVGYGTSK---GLDYVIVRNSWGPKWGEKGYIRMKRN 331
Query: 331 EG 332
G
Sbjct: 332 TG 333
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 151/315 (47%), Positives = 201/315 (63%), Gaps = 18/315 (5%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLTIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
QW A+HG+T + ++ R IFK NL +I+ N+ N TYKLG +F+DLTN+E
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDE 110
Query: 99 FRASYTGYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKNQGHCGSC 155
+R Y G R P+ ++ KY N +VP ++DWR+KGAV IK+QG CGSC
Sbjct: 111 YRKLYLGA-RTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSC 169
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATE 214
WAFS AAVEGI +I G+LI LSEQ+LVDC N GC+GGLMD AF++I++N GL TE
Sbjct: 170 WAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTE 229
Query: 215 ADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRF 274
DYPY+ G C+ + + +I YED+P DE AL +A++ QPV V +EA G+ F+
Sbjct: 230 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVAIEAGGRIFQH 289
Query: 275 YKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---- 330
Y+ G+ CG N DH V VG+G+ E+G YW+++NSWG WGE GYIR+ R+
Sbjct: 290 YQSGIFTGSCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346
Query: 331 -EGLCGIATEASYPV 344
G CGIA EASYPV
Sbjct: 347 KSGKCGIAVEASYPV 361
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 139/313 (44%), Positives = 198/313 (63%), Gaps = 24/313 (7%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
E W +HG++Y + E++ RL +F+ N +++ K N +GN +Y L N F+DLT+ EF+ S
Sbjct: 30 ETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFKTS 89
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQN------VTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
G S+ P ++N V D+P SIDWR KG VT++K+QG CG+CW
Sbjct: 90 RLGL----------SAAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACW 139
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEA 215
+FSA A+EGI +I G L+ LSEQ+L++C N+GC GGLMD AF+++I N G+ TE
Sbjct: 140 SFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEE 199
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
DYPY+ GTC+K + K TI KY D+P+ +E LLQAV QPVSV + S +AF+ Y
Sbjct: 200 DYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMY 259
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----E 331
+G+ C + DH V +VG+G+ E+G YW++KNSWG WG GY+ + R+ +
Sbjct: 260 SKGIFTGPCSTSLDHAVLIVGYGS---ENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQ 316
Query: 332 GLCGIATEASYPV 344
G+CGI ASYPV
Sbjct: 317 GVCGINMLASYPV 329
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 294 bits (752), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 150/357 (42%), Positives = 217/357 (60%), Gaps = 31/357 (8%)
Query: 13 MFVIIILVIT-CAS----QVVSGRSMH---------------EPSIVEKHEQWMAQHGRT 52
+ +++ +VIT CA+ VVS + H E S++ + WM +HG+
Sbjct: 9 LILLVAMVITSCATAMDMSVVSSNNNHHLTTSPGRLHSGFDAEASLI--FDSWMVKHGKV 66
Query: 53 YKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS 112
Y EK RLTIF+ NL +I N E N +Y+LG +F+DL+ E+ G + P
Sbjct: 67 YGSVAEKERRLTIFEDNLRFISNRNAE-NLSYRLGLTQFADLSLHEYGEVCHGADPRPPR 125
Query: 113 VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITG 172
+ +K +P S+DWR +GAVT +K+QGHC SCWAFS V AVEG+ +I
Sbjct: 126 NHVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVT 185
Query: 173 GKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD-KQKE 231
G+L+ LSEQ L++C+ +NNGC GG ++ A+E+I++N GL T+ DYPY+ G CD + KE
Sbjct: 186 GELVTLSEQDLINCNKENNGCGGGKVETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKE 245
Query: 232 KAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHG 291
I +E+LP DE AL++AV QPV+ +++S + F+ Y+ GV + CG N +HG
Sbjct: 246 NNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHG 305
Query: 292 VAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
V VVG+GT E+G YWL+KNS G TWGE+GY+++ R+ GLCGIA ASYP+
Sbjct: 306 VVVVGYGT---ENGRDYWLVKNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 148/338 (43%), Positives = 202/338 (59%), Gaps = 15/338 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F +L+++ A + + ++ +E W+ + G++Y EK MR IFK+NL
Sbjct: 13 LFFSTLLILSLALDIENSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRI 72
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNR-PVPSVSRQSSRPSTFKYQNVTDV 131
I+ N + NR+Y LG N F+DLT+EE+R++Y G P VS + + + +
Sbjct: 73 IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPKTDVSNE------YMPKVGEAL 126
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC--STD 189
P +DWR GAV +KNQG C SCWAFSAV AVEGI +I G LI LSEQ+LVDC +
Sbjct: 127 PDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDCGRTQR 186
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
GC+ GLM AF++II N G+ TE +YPY + G C+ + TI Y+++P +E
Sbjct: 187 TKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLKNQKYVTIDNYKNVPSNNE 246
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
AL +AV QPVSV VE+ G F+ Y G+ CG DHGV +VG+GT E G YW
Sbjct: 247 MALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYGT---ERGMDYW 303
Query: 310 LIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
++KNSWG WGE+GYIRI R+ G CGIA SYPV
Sbjct: 304 IVKNSWGTNWGENGYIRIQRNIGGAGKCGIARMPSYPV 341
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 143/304 (47%), Positives = 200/304 (65%), Gaps = 10/304 (3%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
E W A+HG++Y + EKA RL IF L YIEK N N T+ LG N+FSDLTN EFRA+
Sbjct: 3 EGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
Y G +P Q RP+ +V+ +PTS+DWR++GAVT IK+QG CGSCWAFSA+A
Sbjct: 63 YVGKFKPP---RYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 163 AVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
++E + +L+ LSEQQL+DC T + GC GG + AF++++EN G+ TE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179
Query: 223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNA 282
G+C+ K K T Y+D+ K AL++AV+K PV+V + S Q F+ Y+ G+L+
Sbjct: 180 AGSCNANKNKVVEIT--GYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 283 ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--EGLCGIATEA 340
C ++ DH V V+G+GT E G YW+IKNSWG +WGE G++RI ++ EG+CG+ ++
Sbjct: 238 HCSNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGEDGFMRIKKEDGEGMCGMNGQS 294
Query: 341 SYPV 344
SYP
Sbjct: 295 SYPT 298
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 197/307 (64%), Gaps = 9/307 (2%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
E WM +HG+ Y+ EK RLTIF+ NL +I N E N +Y+LG N F+DL+ E+
Sbjct: 57 ESWMVKHGKVYESVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYAQI 115
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
G + P + + +K + +P S+DWR +GAVT +K+QG C SCWAFS V
Sbjct: 116 CHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVG 175
Query: 163 AVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
AVEG+ +I G+L+ LSEQ L++C+ +NNGC GG ++ A+E+I+ N GL T+ DYPY+
Sbjct: 176 AVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKAL 235
Query: 223 QGTC-DKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN 281
G C D+ KE I YE+LP DE AL++AV QPV+ V++S + F+ Y GV +
Sbjct: 236 NGVCNDRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGVFD 295
Query: 282 AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIA 337
CG N +HGV VVG+GT E+G YW+++NS G TWGE+GY+++ R+ GLCGIA
Sbjct: 296 GTCGTNLNHGVVVVGYGT---ENGRDYWIVRNSRGNTWGEAGYMKMARNIANPRGLCGIA 352
Query: 338 TEASYPV 344
ASYP+
Sbjct: 353 MRASYPL 359
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 149/338 (44%), Positives = 219/338 (64%), Gaps = 17/338 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+F+ + L + AS S S EPS ++++ E+WM ++GR YKD EK R IFK N+
Sbjct: 8 VFLFLFLCVMWASP--SAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT-GYNRPVPSVSRQSSRPSTFKYQNVT 129
+IE N +Y LG N+F+D+TN EF A YT G +RP+ ++ R+ +F +++
Sbjct: 66 NHIETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGISRPL-NIEREPV--VSFDDVDIS 122
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
VP SIDWR+ GAVT +KNQ CG+CWAF+A+A VE I +I G L LSEQQ++DC+
Sbjct: 123 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCA-K 181
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
GC GG +AFE+II NKG+A+ A YPY+ +GTC K +A I Y +P+ +E
Sbjct: 182 GYGCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTC-KTNGVPNSAYITGYARVPRNNE 240
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
+++ AV+KQP++V V+A+ + ++Y GV N CG + +H V +G+G ++ +G KYW
Sbjct: 241 SSMMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAVTAIGYG--QDSNGKKYW 297
Query: 310 LIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
++KNSWG WGE+GYIR+ RD G+CGIA ++ YP
Sbjct: 298 IVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 293 bits (750), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 142/304 (46%), Positives = 201/304 (66%), Gaps = 10/304 (3%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
E W A+HG++Y + EKA RL IF L YIEK N + N T+ LG N+FSDLTN EFRA+
Sbjct: 3 EDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
Y G S Q RP+ +V+ +PTS+DWR++GAVT IK+QG CGSCWAFSA+A
Sbjct: 63 YVG---KFKSPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 163 AVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
++E + +L+ LSEQQL+DC T + GC GG + AF++++EN G+ TE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179
Query: 223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNA 282
G+C+ K K T Y+D+ K AL++AV+K PV+V + S Q F+ Y+ G+L+
Sbjct: 180 AGSCNANKNKVVEIT--GYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 283 ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--EGLCGIATEA 340
+C ++ DH V V+G+GT E G YW+IKNSWG +WGE+G+++I + EG+CG+ ++
Sbjct: 238 QCSNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGENGFMKIKKKDGEGMCGMNGQS 294
Query: 341 SYPV 344
SYP
Sbjct: 295 SYPT 298
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 153/355 (43%), Positives = 214/355 (60%), Gaps = 26/355 (7%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ-----------WMAQHGRTYKDELEKA 60
P + + V+ A S +PS+V ++ W +HG+ Y EK
Sbjct: 3 PKLAVAVFVLFLAFAACSANHHRDPSVVGYSQEDLALPSSLFRSWSVKHGKLYASPTEKL 62
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSR- 119
R IFKQNL +I + N++ N +Y LG N+F+D+ +EEF+ASY G R +P +R
Sbjct: 63 ERYEIFKQNLMHIAETNRK-NGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGAPQTRT 121
Query: 120 PSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIE 177
P+ F+Y +P S+DWR KGAVT +KNQG CGSCWAFS+VAAVEGI QI GKL+
Sbjct: 122 PTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVS 181
Query: 178 LSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAA 236
LSEQ+LVDC T ++GC GG MD AF Y++ ++G+ E DYPY E+G C +++
Sbjct: 182 LSEQELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYLMEEGYCKEKQPCVLGI 241
Query: 237 T---IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVA 293
T + +ED+P+ E +LL+A+ QPVSV + A + F+FY+ GV + C DH +
Sbjct: 242 TEQDLTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRGGVFDGACSVELDHALT 301
Query: 294 VVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL----RDEGLCGIATEASYPV 344
VG+G++ G Y +KNSWG+ WGE GY+RI + EG+CGI T ASYPV
Sbjct: 302 AVGYGSSY---GQNYITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPV 353
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 149/364 (40%), Positives = 220/364 (60%), Gaps = 27/364 (7%)
Query: 3 LKFEKSFIIPMFVIIILVITCAS----QVVSGRSMH-------------EPSIVEKHEQW 45
+ + KS ++ +F++ +++ +CA+ VVS H + E W
Sbjct: 1 MGYAKSAML-IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESW 59
Query: 46 MAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG 105
M +HG+ Y EK RLTIF+ NL +I N E N +Y+LG N F+DL+ E+ G
Sbjct: 60 MVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHG 118
Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVE 165
+ P + + +K + +P S+DWR +GAVT +K+QG C SCWAFS V AVE
Sbjct: 119 ADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVE 178
Query: 166 GITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGT 225
G+ +I G+L+ LSEQ L++C+ +NNGC GG ++ A+E+I+ N GL T+ DYPY+ G
Sbjct: 179 GLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGV 238
Query: 226 CD-KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAEC 284
C+ + KE I YE+LP DE AL++AV QPV+ V++S + F+ Y+ GV + C
Sbjct: 239 CEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTC 298
Query: 285 GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEA 340
G N +HGV VVG+GT E+G YW++KNS G+TWGE+GY+++ R+ GLCGIA A
Sbjct: 299 GTNLNHGVVVVGYGT---ENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRA 355
Query: 341 SYPV 344
SYP+
Sbjct: 356 SYPL 359
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 143/304 (47%), Positives = 199/304 (65%), Gaps = 10/304 (3%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
E W A+HG++Y + EKA RL IF L YIEK N N T+ LG N+FSDLTN EFRA+
Sbjct: 3 EGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
Y G +P Q RP+ +V+ +PTS+DWR++GAVT IK+QG CGSCWAFSA+A
Sbjct: 63 YVGKFKPP---RYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 163 AVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
++E + +L+ LSEQQL+DC T + GC GG + AF++++EN G+ TE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179
Query: 223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNA 282
G+C+ K K T Y+D+ K AL++AV+K PV+V + S Q F+ Y+ G+L+
Sbjct: 180 AGSCNANKNKVVEIT--GYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 283 ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--EGLCGIATEA 340
C ++ DH V V+G+GT E G YW+IKNSWG +WGE G++RI + EG+CG+ ++
Sbjct: 238 HCSNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGEDGFMRIKKKDGEGMCGMNGQS 294
Query: 341 SYPV 344
SYP
Sbjct: 295 SYPT 298
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 146/345 (42%), Positives = 215/345 (62%), Gaps = 14/345 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
S + + + ++ ++ + + SGRS E ++ +E+W+ +H + Y EK R IFK
Sbjct: 3 SILYSLILFGLITLSLSLDMSSGRSNKE--VMTMYEKWLVKHQKVYYGLGEKNQRFQIFK 60
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL +I++ N N +Y++G NEFSD+TN+E+R +Y ++ +S +K +
Sbjct: 61 DNLIFIDEHNAP-NHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRYAYKAGH 119
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P S+DWR GA+T IKNQG CG+CWAFSAVAAVE I +I G L+ LSEQ+LVDC
Sbjct: 120 NNKLPVSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQELVDCD 177
Query: 188 -TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
T N GC+GG A+ +I+EN GL ++ DYPY Q TC++ K+ +I Y+++ +
Sbjct: 178 RTKNKGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKNTKVVSINGYKNVQR 237
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
E AL++AV QPVSV +EA G+ F+ Y+ GV CG + DH V VVG+G+ E+G
Sbjct: 238 NSESALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVVGYGS---ENGK 294
Query: 307 KYWLIKNSWGETWGESGYIRILR-----DEGLCGIATEASYPVAM 346
YWL+KNSWG WGE GY++I R + G CGIA +A+YP +
Sbjct: 295 DYWLVKNSWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYPTKL 339
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 141/304 (46%), Positives = 200/304 (65%), Gaps = 10/304 (3%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
E W A+H ++Y + EKA RL +F L YIEK N + N T+ LG N+FSDLTN EFRA+
Sbjct: 3 EDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
Y G +P Q RP+ +V+ +PTS+DWR++GAVT IK+QG CGSCWAFSA+A
Sbjct: 63 YVGKFKPP---RYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 163 AVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
++E + +L+ LSEQQL+DC T + GC GG D AF++++EN G+ TE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPDDAFKFVVENGGVTTEEAYPYTGF 179
Query: 223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNA 282
G+C+ K K T Y+D+ K AL++AV+K PV+V + S Q F+ Y+ G+L+
Sbjct: 180 AGSCNTNKNKVVEIT--GYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 283 ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--EGLCGIATEA 340
+C ++ DH V V+G+GT E G YW+IKNSWG +WGE G+++I + EG+CG+ ++
Sbjct: 238 QCCNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGEDGFMKIKKKDGEGMCGMNGQS 294
Query: 341 SYPV 344
SYP
Sbjct: 295 SYPT 298
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 208/343 (60%), Gaps = 47/343 (13%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLTNEEF 99
++ W+A++GR+Y E+ R +F NL++++ N + ++LG N F+DLTN+EF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 100 RASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC------- 152
RA++ G V R + +++ V ++P S+DWREKGAV +KNQG C
Sbjct: 109 RATFLG----AKFVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCVDRIIVW 164
Query: 153 -------------------------GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
GSCWAFSAV+ VE I Q+ G++I LSEQ+LV+CS
Sbjct: 165 NSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELVECS 224
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
T+ N+GC+GGLMD AF++II+N G+ TE DYPY+ G CD +E A +I +ED+P
Sbjct: 225 TNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVP 284
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+ DE +L +AV QPVSV +EA G+ F+ Y GV + CG + DHGV VG+GT ++G
Sbjct: 285 QNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGT---DNG 341
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
YW+++NSWG WGESGY+R+ R+ G CGIA ASYP
Sbjct: 342 KDYWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYPT 384
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 149/312 (47%), Positives = 191/312 (61%), Gaps = 11/312 (3%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+ EQWM +HGR Y + EK R ++K+NL IE+ N G Y L N+F+DLTNEEFR
Sbjct: 118 RFEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNS-GGHGYTLTDNKFADLTNEEFR 176
Query: 101 ASYTGYNRPVPSVSRQSSRPSTF----KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
A G P R++ S N TD+P +DWR+KGAV +KNQG CGSCW
Sbjct: 177 AKMLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCW 236
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEAD 216
AFSAVAA+EG+ QI GKL+ LSEQ+LVDC + GC+GG M AFE+++ N GL TEA
Sbjct: 237 AFSAVAAMEGLNQIKNGKLVSLSEQELVDCDAEAVGCAGGFMSWAFEFVMANHGLTTEAS 296
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY+ G C K ++ +I Y ++ E LL+ QPVSV V+A G F+ Y
Sbjct: 297 YPYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLFQLYA 356
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----G 332
GV + C +HGV VVG+G E + KYW++KNSWG WGE+GY+ + RD G
Sbjct: 357 GGVFSGPCTAQINHGVTVVGYG--ETDKAEKYWIVKNSWGPEWGEAGYMLMQRDAGVPTG 414
Query: 333 LCGIATEASYPV 344
LCGIA ASYPV
Sbjct: 415 LCGIAMLASYPV 426
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 195/324 (60%), Gaps = 19/324 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ EQWM +HGR Y D EK R ++++N+E +E N N YKL N+F+DLTNE
Sbjct: 28 MLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 86
Query: 98 EFRASYTGYNRP---VPSVSRQSSRPSTFKYQNVTDV-PTSIDWREKGAVTHIKNQGHCG 153
EFRA G+ RP +P +S S ++ D+ P S+DWR+KGAV +KNQG CG
Sbjct: 87 EFRAKMLGF-RPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCG 145
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLAT 213
SCWAFSAVAA+EGI QI G+L+ LSEQ+LVDC + GC GG M AFE+++ N GL T
Sbjct: 146 SCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLTT 205
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
EA YPY G C K +A I Y ++ E L +A QPVSV V+ F+
Sbjct: 206 EASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQ 265
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEED--------GAKYWLIKNSWGETWGESGYI 325
Y GV C + +HGV VVG+G +E + G KYW++KNSWG WG++GYI
Sbjct: 266 LYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYI 325
Query: 326 RILRD-----EGLCGIATEASYPV 344
+ RD GLCGIA SYPV
Sbjct: 326 LMQRDVAGLASGLCGIALLPSYPV 349
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 138/336 (41%), Positives = 213/336 (63%), Gaps = 13/336 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F+ + L + AS + R ++++ E+WMA++GR YKD EK R IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG-YNRPVPSVSRQSSRPSTFKYQNVTDV 131
IE N +Y LG N+F+D+T EF A YTG +RP+ ++ R+ +F N++ V
Sbjct: 68 IETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPL-NIEREPV--VSFDDVNISAV 124
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN 191
P SIDWR+ GAV +KNQ CGSCWAF+A+A VEGI +I G L+ LSEQ+++DC+ +
Sbjct: 125 PQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV-SY 183
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GG ++KA+++II N G+ TE +YPYQ QGTC+ +A G Y + + DE +
Sbjct: 184 GCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTCNANSFPNSAYITG-YSYVRRNDERS 242
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
++ AV+ QP++ ++AS + F++Y GV + CG + +H + ++G+G ++ G KYW++
Sbjct: 243 MMYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTKYWIV 299
Query: 312 KNSWGETWGESGYIRILR----DEGLCGIATEASYP 343
+NSWG +WGE GY+R+ R G CGIA +P
Sbjct: 300 RNSWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFP 335
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 149/324 (45%), Positives = 195/324 (60%), Gaps = 19/324 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ EQWM +HGR Y D EK R ++++N+E +E N N YKL N+F+DLTNE
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85
Query: 98 EFRASYTGYNRP---VPSVSRQSSRPSTFKYQNVTDV-PTSIDWREKGAVTHIKNQGHCG 153
EFRA G+ RP +P +S S ++ D+ P S+DWR+KGAV +KNQG CG
Sbjct: 86 EFRAKMLGF-RPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCG 144
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLAT 213
SCWAFSAVAA+EGI QI G+L+ LSEQ+LVDC + GC GG M AFE+++ N GL T
Sbjct: 145 SCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLTT 204
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
EA YPY G C K +A I Y ++ E L +A QPVSV V+ F+
Sbjct: 205 EASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQ 264
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEED--------GAKYWLIKNSWGETWGESGYI 325
Y GV C + +HGV VVG+G +E + G KYW++KNSWG WG++GYI
Sbjct: 265 LYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYI 324
Query: 326 RILRD-----EGLCGIATEASYPV 344
+ RD GLCGIA SYPV
Sbjct: 325 LMQRDVAGLASGLCGIALLPSYPV 348
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 146/349 (41%), Positives = 211/349 (60%), Gaps = 14/349 (4%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR++Y + S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+ ++ + +P+ +DWR GAV IK+QG CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDCSTDNN--GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q+L+DC N GC+GG + F++II N G+ TE +YPY + G C+ + TI
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTI 236
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YE++P +E AL AVT QPVSV ++A+G AF+ Y G+ CG DH V +VG+G
Sbjct: 237 DTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVGYG 296
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
T E G YW++KNSW TWGE GY+RILR+ G CGIAT SYPV
Sbjct: 297 T---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 146/349 (41%), Positives = 210/349 (60%), Gaps = 14/349 (4%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR++Y G+ S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+ ++ + +P+ +DWR GAV IK+QG CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDCSTDNN--GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q+L+DC N GC+G + F +II N G+ TE +YPY + G C+ + TI
Sbjct: 177 QELIDCGRTQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECNVDLQNEKYVTI 236
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YE++P +E AL AVT QPVSV ++A+G AF+ Y G+ CG DH V +VG+G
Sbjct: 237 DTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG 296
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
T E G YW++KNSW TWGE GY+RILR+ G CGIAT SYPV
Sbjct: 297 T---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 148/336 (44%), Positives = 202/336 (60%), Gaps = 18/336 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
++ + IL++ S V S + E W Q+G+TY E EKA RL +F++N +
Sbjct: 5 LWAVSILILAVHSSVSEASST-----ADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAF 59
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
+ + N N +Y L N F+DLT+ EF+AS G+ S R S S VP
Sbjct: 60 VTQHNSMANASYTLALNAFADLTHHEFKASRLGF-----SPGRAQSIRSVGTPVQELHVP 114
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NN 191
++DWR+ GAVT +K+QG+CG CW+FS A+EGI +I G L+ LSEQ+LVDC N+
Sbjct: 115 PAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNS 174
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLMD A++++I+N+G+ +EADYPY C+K+K K TI Y D+P DE
Sbjct: 175 GCEGGLMDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQ 234
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
LLQ V KQPVSV + S + F+ Y +GV C DH V +VG+GT EDG +W++
Sbjct: 235 LLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGT---EDGVDFWIV 291
Query: 312 KNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
KNSWGE WG GYI +LR+ EG+CGI ASYP
Sbjct: 292 KNSWGEHWGMRGYIHMLRNNGTAEGICGINMLASYP 327
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 202/321 (62%), Gaps = 28/321 (8%)
Query: 42 HEQWMAQHGRTYKDEL----EKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ W+A++G E+ R F NL +++ N G Y+LG N F+DL
Sbjct: 53 YDLWLAENGGGSSPNANSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLGMNRFADL 112
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPST-----FKYQNVTDVPTSIDWREKGAVTHIKNQ 149
TN+EFRA+Y G V Q +RP +++ ++P ++DWREKGAV +KNQ
Sbjct: 113 TNDEFRAAYLG-------VKAQRARPGRMVGERYRHDGAEELPEAVDWREKGAVAPVKNQ 165
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIE 207
G CGSCWAFSAV+ VE I QI G+++ LSEQ+LV+C T+ ++GC+GGLMD AFE+II+
Sbjct: 166 GQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIK 225
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
N G+ TE DYPY+ G CD ++ A +I +ED+P+ DE +L +AV QPVSV +EA
Sbjct: 226 NGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 285
Query: 268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
G+ F+ Y GV + CG DHGV VG+GT E+G YW+++NSWG WGESGY+R+
Sbjct: 286 GGREFQLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGESGYLRM 342
Query: 328 LRD----EGLCGIATEASYPV 344
R+ G CGIA +SYP
Sbjct: 343 ERNINVTSGKCGIAMMSSYPT 363
>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 350
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 196/338 (57%), Gaps = 11/338 (3%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
+++LV T V+ + ++ +HEQWMA+ GR Y D EKA R +F N Y++
Sbjct: 14 LLVLVATAVFHAVAAQGEAGLTVAARHEQWMAKFGRVYTDANEKARRQAVFGANARYVDA 73
Query: 76 ANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
N+ GNRTY LG NEFSDLT+ EF ++ GY P + S+ Y ++P S
Sbjct: 74 VNRAGNRTYTLGLNEFSDLTDNEFAKTHLGYREFRPETA-NISKGVDPGYGLAGNIPKSF 132
Query: 136 DWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSG 195
DWR KGAVT +K+QG CG CWAF+AVAA EG+ +I G LI +SEQQ++DC+T NN C G
Sbjct: 133 DWRTKGAVTEVKSQGGCGCCWAFAAVAATEGLVKIAKGTLISMSEQQVLDCTTGNNTCKG 192
Query: 196 GLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP-KGDEHALLQ 254
G M+ A Y+ + GL TE DY Y E+G C + A ++G E +P G+E L +
Sbjct: 193 GYMNDALSYVFASGGLQTEEDYEYNAEKGACRRDVTPNPATSVGHAEYMPLDGNEFLLQK 252
Query: 255 AVTKQPVSVCVEASGQAFRFYKRGVLNA--ECGDNCDHGVAVVGFGTAEEEDGAK--YWL 310
V +QPV V VEA G F+ Y GV CG N DH VVG+G A DG K YWL
Sbjct: 253 LVARQPVVVAVEAYGTDFKNYGGGVFTGSPSCGQNLDHFFTVVGYGFA---DGGKQMYWL 309
Query: 311 IKNSWGETWGESGYIRILRDEGL--CGIATEASYPVAM 346
+KN WG +WGESGY+RI R CG+ Y M
Sbjct: 310 VKNQWGTSWGESGYMRIARGSSARNCGMTNNYVYYATM 347
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 290 bits (742), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 142/310 (45%), Positives = 199/310 (64%), Gaps = 14/310 (4%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN-KEGNRTYKLGTNEFSDLTNEEFR 100
++ W+A++GR+Y E R +F NL + + N + + ++LG N F+DLTNEEFR
Sbjct: 53 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 112
Query: 101 ASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
A++ G V R + +++ V ++P S+DWREKGAV +KNQG CGSCWAFSA
Sbjct: 113 ATFLG----AKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 168
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGG--LMDKAFEYIIENKGLATEADYP 218
V+ VE I Q+ G++I LSEQ+LV+CST+ LMD AF++II+N G+ TE DYP
Sbjct: 169 VSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTEDDYP 228
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRG 278
Y+ G CD +E A +I +ED+P+ DE +L +AV QPVSV +EA G+ F+ Y G
Sbjct: 229 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 288
Query: 279 VLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLC 334
V + CG + DHGV VG+GT ++G YW+++NSWG WGESGY+R+ R+ G C
Sbjct: 289 VFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKC 345
Query: 335 GIATEASYPV 344
GIA ASYP
Sbjct: 346 GIAMMASYPT 355
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 290 bits (742), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 157/346 (45%), Positives = 213/346 (61%), Gaps = 18/346 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++P I+ L A +GRS E I+ +++W +H D+ RL +FK+N
Sbjct: 23 VVPPLDILTLSKQ-AWAAPAGRSDEEVRII--YQEWRVKHRPAENDQYVGDYRLEVFKEN 79
Query: 70 LEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
L ++++ N +R Y+LG N F+DLTNEE+RA + R + + R +S + +Y+
Sbjct: 80 LRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFL---RDLSRLGRSTSGEISNQYR 136
Query: 127 -NVTDV-PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
DV P SIDWREKGAV +KNQG CGSCWAF+A+AAVEGI QI G LI LSEQQLV
Sbjct: 137 LREGDVLPDSIDWREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQLV 196
Query: 185 DCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
DCST N GC GG +AF+YII N G+ +E YPY GTC+ KE A +I Y ++
Sbjct: 197 DCSTRNYGCEGGWPYRAFQYIINNGGVNSEEHYPYTGTNGTCNTTKENAHVVSIDSYRNV 256
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEED 304
P DE +L +A QP+SV ++ASG+ F+ Y G+ C + +HGV VVG+GT E+
Sbjct: 257 PSNDEKSLQKAAANQPISVGIDASGRNFQLYHSGIFTGSCNTSLNHGVTVVGYGT---EN 313
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVAM 346
G YW++KNSWGE WG SGYI + R+ G CGIA SYP+ +
Sbjct: 314 GNDYWIVKNSWGENWGNSGYILMERNIAESSGKCGIAISPSYPIKV 359
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 290 bits (741), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 152/358 (42%), Positives = 202/358 (56%), Gaps = 53/358 (14%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E+ EQWM +HGR Y D EK RL ++++N+ +E N N Y+L N+F+DLTNE
Sbjct: 28 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLTNE 87
Query: 98 EFRASYTGYNRPVP--SVSRQSSRPSTF---------KYQNVTDVPTSIDWREKGAVTHI 146
EFRA G+ RP P + ++ P T +Y + ++P S+DWREKGAV +
Sbjct: 88 EFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSD--ELPKSVDWREKGAVAPV 145
Query: 147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYII 206
KNQG CGSCWAFSAVAA+EGI QI GKL+ LSEQ+LVDC T GC+GG M AFE+++
Sbjct: 146 KNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVM 205
Query: 207 ENKGLATEADYPYQQE----------------------------QGTCDKQKEKAAAATI 238
N GL TE +YPYQ G C K K +A +I
Sbjct: 206 NNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESAVSI 265
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
Y ++ E LL+A QPVSV V+A ++ Y GV C + +HGV VVG+G
Sbjct: 266 SGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVVGYG 325
Query: 299 TAEEE--------DGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+ + G KYW++KNSWG WG++GYI + R+ GLCGIA SYPV
Sbjct: 326 ETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYPV 383
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 135/335 (40%), Positives = 213/335 (63%), Gaps = 12/335 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F+ + L AS + R ++++ E+WMA++GR YKD+ EK R IFK N+++
Sbjct: 8 VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N +Y LG N+F+D+T EF A YTG + P+ ++ R+ +F N++ VP
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPL-NIEREPV--VSFDDVNISAVP 124
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNG 192
SIDWR+ GAV +KNQ CGSCW+F+A+A VEGI +I G L+ LSEQ+++DC+ + G
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV-SYG 183
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C GG ++KA+++II N G+ TE +YPY QGTC+ +A G Y + + DE ++
Sbjct: 184 CKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTCNANSFPNSAYITG-YSYVRRNDERSM 242
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
+ AV+ QP++ ++AS + F++Y GV + CG + +H + ++G+G ++ G KYW+++
Sbjct: 243 MYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTKYWIVR 299
Query: 313 NSWGETWGESGYIRILR----DEGLCGIATEASYP 343
NSWG +WGE GY+R+ R G+CGIA +P
Sbjct: 300 NSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 144/319 (45%), Positives = 199/319 (62%), Gaps = 18/319 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I E + W +HG+TY E E+ R+ IFK N +++ + N N TY L N F+DLT+
Sbjct: 26 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 85
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+AS G + PSV S S VP S+DWR+KGAVT++K+QG CG+CW+
Sbjct: 86 EFKASRLGLSVSAPSVIMASKGQS---LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 142
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FSA A+EGI QI G LI LSEQ+L+DC N GC+GGLMD AFE++I+N G+ TE D
Sbjct: 143 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 202
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPYQ+ GTC K K K TI Y + DE AL++AV QPVSV + S +AF+ Y
Sbjct: 203 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 262
Query: 277 -------RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
+G+ + C + DH V +VG+G+ ++G YW++KNSWG++WG G++ + R
Sbjct: 263 SKFYLLMQGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQR 319
Query: 330 D----EGLCGIATEASYPV 344
+ +G+CGI ASYP+
Sbjct: 320 NTENSDGVCGINMLASYPI 338
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 141/260 (54%), Positives = 185/260 (71%), Gaps = 8/260 (3%)
Query: 90 EFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIK 147
+F+++TN+EFR+ YTGY S+ ++ ++F+YQNV+ +P ++DWR+KGAVT IK
Sbjct: 1 QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
NQG CG CWAFSAVAA+EG TQI GKLI LSEQQLVDC T++ GCSGGL+D AFE+I+
Sbjct: 61 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCSGGLIDTAFEHIMA 120
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
GL TE++YPY+ E TC + +AA+I YED+P DE+AL++AV QPVSV +E
Sbjct: 121 TGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGIEG 180
Query: 268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
G F+FY GV EC DH V VG+ ++ G+KYW+IKNSWG WGE GY+RI
Sbjct: 181 GGFDFQFYSSGVFTGECTTYLDHAVTAVGY--SQSSAGSKYWIIKNSWGTKWGEGGYMRI 238
Query: 328 LRD----EGLCGIATEASYP 343
+D EGLCG+A +ASYP
Sbjct: 239 KKDIKDKEGLCGLAMKASYP 258
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 146/350 (41%), Positives = 211/350 (60%), Gaps = 23/350 (6%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M L F +F+I F I +++ R+ E ++ +E W+ ++G++Y E+
Sbjct: 10 MSLLFFSTFLIFSFAI-------DAKISPLRTNDE--VMALYESWLVKYGKSYNSLGERE 60
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP 120
MR+ IFK+NL +I++ N + NR+Y +G N+F+DLT+EE+R++Y G+ + S P
Sbjct: 61 MRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSSLKSKVSNRYMP 120
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+ +P +DWR GAV +KNQG C SCWAF+ +A VE I QI G LI LSE
Sbjct: 121 QVGEV-----LPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSE 175
Query: 181 QQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q+LVDC+ N GC GG MD A+E+II N G+ TE +YPY + CD+ K+ TI
Sbjct: 176 QELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCDEPKKNQNYVTI 235
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN-AECGDNCDHGVAVVGF 297
YE +P DE A+ +AV QPVSV ++A FRFY+ G+ CG +H V ++G+
Sbjct: 236 DSYEQVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIGY 295
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
GT E+G YW++KNS+G WGESGY ++ R+ EG CGIA+ YPV
Sbjct: 296 GT---ENGIDYWIVKNSYGTQWGESGYGKVQRNVGGEGRCGIASYPFYPV 342
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 152/308 (49%), Positives = 190/308 (61%), Gaps = 27/308 (8%)
Query: 47 AQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASY 103
+ + ++Y+ E +A RL F+ NLE+I K N E G +Y +G NEF+DLT +EF A Y
Sbjct: 3 SDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALY 62
Query: 104 --TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
+ +NR +P P+T + S+DWR KGAVT IKNQG CGSCW+FS
Sbjct: 63 VPSKFNRTMPY--NTVYLPATSE--------DSVDWRTKGAVTPIKNQGQCGSCWSFSTT 112
Query: 162 AAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
+ EG I G L+ LSEQQLVDCS N GC+GGLMD AF+YII NKGL TE DYPY
Sbjct: 113 GSTEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPY 172
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
+ GTC+K+KE AATI Y D+PK +E L AV K PVSV +EA F+ YK GV
Sbjct: 173 TAQDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGV 232
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGI 336
+ CG N DHGV VVG+ YW++KNSWG TWG GYI + R G+CGI
Sbjct: 233 FDGNCGTNLDHGVLVVGYTD-------DYWIVKNSWGTTWGVEGYINMKRGVSASGICGI 285
Query: 337 ATEASYPV 344
A + SYP+
Sbjct: 286 AMQPSYPI 293
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 154/346 (44%), Positives = 207/346 (59%), Gaps = 33/346 (9%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKH-----EQWMAQHGRTYKDELEKAMRLTIFKQN 69
+++ LV+ CA + G +M EP + + + + + + Y+ E+A R ++F QN
Sbjct: 1 MMLKLVLVCA---LVGAAMAEPLSLTVNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQN 57
Query: 70 LEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
+++I + N E R T+ + N+F+DLTNEE+R Y RP P+ R +
Sbjct: 58 IDFINRHNAEAARGVHTHTVDVNQFADLTNEEYRQLYL---RPYPTELLGRERQEVW--- 111
Query: 127 NVTDVPT--SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
D P S+DWR+KGAVT IKNQG CGSCW+FS +VEG I G L+ LSEQQLV
Sbjct: 112 --LDGPNAGSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLV 169
Query: 185 DCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
DCS N GC+GGLMD AF+YII N GL TE DYPY G CDK KE A +I Y+
Sbjct: 170 DCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYK 229
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
D+P+ +E L AV K PVSV +EA Q+F+ Y GV + CG N DHGV VVG+ +
Sbjct: 230 DVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGYTS--- 286
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILR---DEGLCGIATEASYPVA 345
YW++KNSWG +WG+ GYI + R G+CGIA + SYP+A
Sbjct: 287 ----DYWIVKNSWGASWGDQGYIMMKRGVSSAGICGIAMQPSYPIA 328
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 288 bits (737), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 204/321 (63%), Gaps = 28/321 (8%)
Query: 42 HEQWMAQHGR-TYKDE---LEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ W+A+HG +Y + E+ R F NL +++ N G ++L N F+DL
Sbjct: 50 YDLWLAEHGGGSYPNANSIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 109
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPST-----FKYQNVTDVPTSIDWREKGAVTHIKNQ 149
TN+EFRA+Y G V Q +RP +++ ++P ++DWREKGAV +KNQ
Sbjct: 110 TNDEFRAAYLG-------VKGQRARPGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQ 162
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIE 207
G CGSCWAFSA++ VE I QI G+++ LSEQ+LV+C T+ ++GC+GGLMD AFE+II+
Sbjct: 163 GQCGSCWAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIK 222
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
N G+ TE DYPY+ G CD ++ A +I +ED+P+ DE +L +AV QPVSV +EA
Sbjct: 223 NGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 282
Query: 268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
G+ F+ Y GV + CG DHGV VG+GT E+G YW+++NSWG WGE+GY+R+
Sbjct: 283 GGREFQLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRM 339
Query: 328 LRD----EGLCGIATEASYPV 344
R+ G CGIA +SYP
Sbjct: 340 ERNINVTSGKCGIAMMSSYPT 360
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 141/293 (48%), Positives = 189/293 (64%), Gaps = 16/293 (5%)
Query: 62 RLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGY-----NRPVPSVSRQ 116
R FK+N YIE+ N+ G +Y+LG N+FSDLT+EEFR + G + PV + R
Sbjct: 34 RFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRD 93
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
S F QNV D+P S+DWR+ GAVT K+QG CG CWAF+ A+EGI QI G+L+
Sbjct: 94 SDIEEGF--QNV-DLPASVDWRKHGAVTAPKDQGSCGGCWAFATTGAIEGINQIVTGQLM 150
Query: 177 ELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAA 235
LSEQ+L+DC + GC GGLM+ A+++I+EN GL TE DYPY + C+ +K +
Sbjct: 151 SLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDYPYHASESHCNMKKLNSRV 210
Query: 236 ATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVV 295
I YE +P GDE ALL+AV KQPVSV +E + + F+ Y GV CG+ +HGV +V
Sbjct: 211 VAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGASKDFQHYASGVFTGHCGEEINHGVLIV 270
Query: 296 GFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYPV 344
G+GT EDG YW++KNSW TWG+ G++++ R+ GLC I T ASYPV
Sbjct: 271 GYGT---EDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCSINTLASYPV 320
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 196/315 (62%), Gaps = 20/315 (6%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
++ E E W +HG++Y EK RL +F N E++ N N +Y L N ++DLT+
Sbjct: 24 NVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTH 83
Query: 97 EEFRASYTGYNRPV----PSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
EF+ S G++ + P + ++ S P DVP S+DWR+KGAVT +K+QG C
Sbjct: 84 HEFKVSRLGFSPALRNFRPVLPQEPSLPR--------DVPDSLDWRKKGAVTAVKDQGSC 135
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGL 211
G+CW+FSA A+EGI QI G LI LSEQ+L+DC N+GC GGLMD A++++I N G+
Sbjct: 136 GACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGI 195
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
TE DYPYQ G+C K K + TI Y D+P DE LLQAV QPVSV + S +A
Sbjct: 196 DTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERA 255
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
F+ Y +G+ + C + DH V +VG+G+ E+G YW++KNSWG++WG GY+ + R+
Sbjct: 256 FQLYSKGIFSGPCSTSLDHAVLIVGYGS---ENGVDYWIVKNSWGKSWGMDGYMHMQRNS 312
Query: 331 ---EGLCGIATEASY 342
EG+CGI ASY
Sbjct: 313 GNSEGVCGINKLASY 327
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 287 bits (735), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 143/309 (46%), Positives = 202/309 (65%), Gaps = 14/309 (4%)
Query: 43 EQWMAQHGRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+ WM++HG+TY + L EK R FK NL +I++ N + N +Y+LG F+DLT +E+R
Sbjct: 49 QMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRD 107
Query: 102 SYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
+ G +P R S R + + +P S+DWR +GAV+ IK+QG C SCWAFS V
Sbjct: 108 LFPGSPKPKQRNLRISRR---YVPLDGDQLPESVDWRNEGAVSAIKDQGTCNSCWAFSTV 164
Query: 162 AAVEGITQITGGKLIELSEQQLVDCSTDNNGCSG-GLMDKAFEYIIENKGLATEADYPYQ 220
AAVEGI +I G+L+ LSEQ+LVDC+ NNGC G G MD AF+++I N GL ++ DYPYQ
Sbjct: 165 AAVEGINKIVTGELVSLSEQELVDCNLVNNGCYGSGTMDAAFQFLINNGGLDSDTDYPYQ 224
Query: 221 QEQGTCDKQKEKA-AAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
QG C++++ + TI YED+P DE +L +AV QPVSV V+ Q F Y+ G+
Sbjct: 225 GSQGYCNRKESTSNKIITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSGI 284
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCG 335
N CG + DH + +VG+G+ E+G YW+++NSWG TWG++GY ++ R+ G+CG
Sbjct: 285 YNGPCGTDLDHALVIVGYGS---ENGQDYWIVRNSWGTTWGDAGYAKMARNFEYPSGVCG 341
Query: 336 IATEASYPV 344
IA ASYPV
Sbjct: 342 IAMLASYPV 350
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 152/340 (44%), Positives = 211/340 (62%), Gaps = 18/340 (5%)
Query: 14 FVIIILVITCASQVVS-GRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
V++I + AS++ S S+++P ++ ++ E+W+ H + Y E +R I++ N+
Sbjct: 12 LVVLICFVLIASKLCSVNSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNV 71
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ I+ N + +KL N F+D+TN EF+A + G N + ++ RP NV
Sbjct: 72 QLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQ-RPVCDPAGNV-- 127
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC--ST 188
P ++DWR +GAVT I+NQG CG CWAFSAVAA+EGI +I G L+ LSEQQL+DC T
Sbjct: 128 -PDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGT 186
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GCSGGLM+ AFE+I N GL TE DYPY +GTCD++K K TI Y+ + + +
Sbjct: 187 YNKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIEGTCDQEKAKNKVVTIQGYQKVAQ-N 245
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E +L A +QPVSV ++A G F+ Y GV + CG N +HGV VVG+G E KY
Sbjct: 246 EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTVVGYGV---EGDQKY 302
Query: 309 WLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
W++KNSWG WGE GYIR+ R D G CGIA ASYP+
Sbjct: 303 WIVKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYPL 342
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 202/312 (64%), Gaps = 13/312 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ E W+ ++G++Y EK R IFK NL ++++ N + NR+YK+G N+FSDLT E
Sbjct: 44 VMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTLE 103
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
E+ + Y G + R ++ ++ + +P SIDWR+KGAV +KNQG+CGSCW
Sbjct: 104 EYSSIYLGTKFDM----RMTNVSDRYEPRVGDQLPNSIDWRKKGAVLGVKNQGNCGSCWT 159
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDC--STDNNGCSGGLMDKAFEYIIENKGLATEA 215
F+ +AAVE I QI G LI LSEQQ+VDC + NNGC GG A+++II+N G+ TEA
Sbjct: 160 FAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDNGGINTEA 219
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
+YPY+ + G CD+QK + TI +YE++P+ +E AL +AV+ Q VSV + ++ F+ Y
Sbjct: 220 NYPYKAQDGECDEQKNQ-KYVTIDRYENVPRKNEKALQKAVSNQLVSVGIASNSSEFKAY 278
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR---DEG 332
K G+ CG DH V +VG+GT E G YW+++NSWG WGE+GY+R+ R + G
Sbjct: 279 KSGIFTGPCGAKIDHAVTIVGYGT---EGGMDYWIVRNSWGSNWGENGYVRMQRNVGNAG 335
Query: 333 LCGIATEASYPV 344
C IAT +YPV
Sbjct: 336 TCFIATSPNYPV 347
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 287 bits (734), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 147/309 (47%), Positives = 191/309 (61%), Gaps = 43/309 (13%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E W+A+HG++Y EK R IFK NL +I++ N E NRTYK+
Sbjct: 4 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE-NRTYKI--------------- 47
Query: 102 SYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
S R + ++ +P S+DWR+KGAV +K+QG CGSCWAFS +
Sbjct: 48 ---------------SDR---YAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTI 89
Query: 162 AAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
AAVEGI +I G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ +E DYPY+
Sbjct: 90 AAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYK 149
Query: 221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL 280
G CD+ ++ A TI YED+P+ DE +L +AV QPVSV +EA G+ F+ Y+ G+
Sbjct: 150 ASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIF 209
Query: 281 NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCG 335
CG DHGV VG+GT E+G YW++KNSWG +WGE GYIR+ RD G CG
Sbjct: 210 TGRCGTALDHGVTAVGYGT---ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCG 266
Query: 336 IATEASYPV 344
IA EASYP+
Sbjct: 267 IAMEASYPI 275
>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 286 bits (733), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 158/341 (46%), Positives = 211/341 (61%), Gaps = 29/341 (8%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F +++ + A QV R++ + S+ E+HEQ M ++G+ YKD ++ FK+N+ YI
Sbjct: 12 FAMLLCMAFLAFQVTC-RTLQDASMXERHEQRMTRYGKVYKDPPKRX-----FKENVNYI 65
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N N+ YK G N+F+ R + G+ + R +TFK++NVT P+
Sbjct: 66 EACNNAANKPYKRGINQFAP------RNRFKGH------MCSSIIRITTFKFENVTATPS 113
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NN 191
++D R+KGAVT IK+QG CG CWAFSAVAA EGI ++ GKLI LSEQ+LVDC T +
Sbjct: 114 TVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDX 173
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYP-YQQEQGTCDKQKEKAAAAT-IGKYEDLPKGDE 249
GC GGLMD AF++II+N GL + P Y G C+ + AAT I YED+P +E
Sbjct: 174 GCEGGLMDDAFKFIIQNHGLKHXSQLPLYMGVDGKCNANEAAKNAATIITGYEDVPANNE 233
Query: 250 HALLQ-AVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
A LQ AV PVS ++ASG F+FYK GV CG DHGV VG+G + +DG +Y
Sbjct: 234 KAHLQKAVANNPVSEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVS--DDGTEY 291
Query: 309 WLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
WL+KNSWG WGE GYIR+ R +E LCGIA +ASYP A
Sbjct: 292 WLVKNSWGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 286 bits (732), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 151/342 (44%), Positives = 209/342 (61%), Gaps = 17/342 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQ 68
+ + V+I V+ + S+++P ++ ++ E+W+ H + Y E +R I++
Sbjct: 10 LTLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQS 69
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
N++ I+ N + +KL N F+D+TN EF+A + G N + ++ RP NV
Sbjct: 70 NVQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQ-RPVCDPAGNV 127
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC-- 186
P ++DWR +GAVT I+NQG CG CWAFSAVAA+EGI +I G L+ LSEQQL+DC
Sbjct: 128 ---PDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDV 184
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
T N GCSGGLM+ AFE+I N GLATE DYPY +GTCD++K K TI Y+ + +
Sbjct: 185 GTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQKVAQ 244
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
+E +L A +QPVSV ++A G F+ Y GV CG N +HGV VVG+G E
Sbjct: 245 -NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGV---EGDQ 300
Query: 307 KYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
KYW++KNSWG WGE GYIR+ R D G CGIA ASYP+
Sbjct: 301 KYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 286 bits (732), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 143/312 (45%), Positives = 196/312 (62%), Gaps = 13/312 (4%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
E + W +HG+TY E E+ R+ IFK N +++ + N N TY L N F+DLT+ EF
Sbjct: 30 ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89
Query: 100 RASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
+AS G + S+ S S VP S+DWR+KGAVT++K+QG CG+CW+FS
Sbjct: 90 KASRLGLSVSASSLIMASKGQS---LGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146
Query: 160 AVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYP 218
A A+EGI QI G LI LSEQ+L+DC N GC+GGLMD AFE++I+N G+ TE DYP
Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKR- 277
YQ+ GTC K K K TI Y + DE AL +AV QPVSV + S +AF+ Y R
Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266
Query: 278 -GVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EG 332
G+ + C + DH V +VG+G+ ++G YW++KNSWG++WG G++ + R+ EG
Sbjct: 267 SGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEG 323
Query: 333 LCGIATEASYPV 344
+CGI ASYP+
Sbjct: 324 ICGINMLASYPI 335
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 286 bits (732), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 149/330 (45%), Positives = 207/330 (62%), Gaps = 32/330 (9%)
Query: 42 HEQWMAQHGRTYKD----ELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
+E W ++HGR + E +RL +F+ NL YI+ N E G T++LG F+DL
Sbjct: 54 YEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPFADL 113
Query: 95 TNEEFRASYTGY---NRPVPSVSRQSSRPSTFKYQN----------VTDVPTSIDWREKG 141
T EE+R G+ +R PS +SR + ++ D+P +IDWR+ G
Sbjct: 114 TLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPDAIDWRQLG 173
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKA 201
AVT +KNQ CG CWAFSAVAA+EGI I G L+ LSEQ+++DC T ++GC+GG M+ A
Sbjct: 174 AVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQDSGCNGGQMENA 233
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQK---EKAAAATIGKYEDLPKGDEHALLQAVTK 258
F+++I+N G+ +EADYP+ GTCD K EK AA I + ++ +E AL +AV
Sbjct: 234 FQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAA--IDGFVEVASNNETALQEAVAI 291
Query: 259 QPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
QPVSV ++A G+AF+ Y G+ N CG N DHGV VVG+G+ E+G YW++KNSW ++
Sbjct: 292 QPVSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYGS---ENGKAYWIVKNSWSDS 348
Query: 319 WGESGYIRILRD----EGLCGIATEASYPV 344
WGE+GYIRI R+ G CGIA +ASYPV
Sbjct: 349 WGEAGYIRIRRNVFLPVGKCGIAMDASYPV 378
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 189/310 (60%), Gaps = 12/310 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEE 98
+E W ++HG + + +RL +F+ NL YI+ N E G T++LG F+DLT EE
Sbjct: 52 YEAWKSEHGHGHGSD--DRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPFADLTLEE 109
Query: 99 FRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAF 158
+R G+ SR S S D+P +IDWRE GAVT +KNQ CG CWAF
Sbjct: 110 YRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGVKNQEQCGGCWAF 169
Query: 159 SAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYP 218
SAVAA+EGI +I G L+ LSEQ+++DC T + GC+GG M AF+++I N G+ TEADYP
Sbjct: 170 SAVAAIEGINEIVTGNLVSLSEQEIIDCDTQDGGCNGGEMQNAFQFVINNGGIDTEADYP 229
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRG 278
Y CD + TI + + +E AL +AV QPVSV ++ASG+ F+ Y G
Sbjct: 230 YLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQPVSVAIDASGRKFQHYTSG 289
Query: 279 VLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLC 334
+ N CG DHGV VG+G+ E+G YW++KNSW +WGE+GYIRI R+ G C
Sbjct: 290 IFNGPCGTQLDHGVTAVGYGS---ENGKDYWIVKNSWSSSWGEAGYIRIRRNVAAATGKC 346
Query: 335 GIATEASYPV 344
GIA +ASYPV
Sbjct: 347 GIAMDASYPV 356
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 160/319 (50%), Positives = 203/319 (63%), Gaps = 15/319 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ +E+W QH +D EKA R +F++N+ I + N+ G+ YKL N F D+
Sbjct: 40 EDSLWALYERWREQH-TVARDLGEKARRFNVFRENVRLIHEFNR-GDAPYKLRLNRFGDM 97
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKY---QNVTDVPTSIDWREKGAVTHIKNQGH 151
T +EFR +Y + F + +V DVP S+DWR+KGAVT +K+QG
Sbjct: 98 TADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQGQ 157
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKG 210
CGSCWAFS +AAVEGI I L LSEQQLVDC T +N GC+GGLMD AF+YI ++ G
Sbjct: 158 CGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCNGGLMDYAFQYIAKHGG 217
Query: 211 LATEADYPYQQEQG-TCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
+A E YPY+ Q +C+K+ +A TI YED+P DE AL +AV QPV+V +EASG
Sbjct: 218 VAAEDAYPYKARQASSCNKK--PSAVVTIDGYEDVPANDETALKKAVAAQPVAVAIEASG 275
Query: 270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
F+FY GV +CG DHGVA VG+GT DG KYW++KNSWG WGE GYIR+ R
Sbjct: 276 SHFQFYSEGVFAGKCGTELDHGVAAVGYGTT--VDGTKYWIVKNSWGPEWGEKGYIRMKR 333
Query: 330 D----EGLCGIATEASYPV 344
D EGLCGIA EASYPV
Sbjct: 334 DVKDKEGLCGIAMEASYPV 352
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 286 bits (732), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 202/315 (64%), Gaps = 14/315 (4%)
Query: 35 EPS--IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
EPS ++E+ E+WMA++GR Y D EK R IFK N+ +IE N +Y LG N+F+
Sbjct: 1 EPSDPMMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFT 60
Query: 93 DLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
D+TN EF A YTG + P+ ++ R +F +++ VP SIDWR+ GAVT +KNQG C
Sbjct: 61 DMTNNEFLARYTGASLPL-NIERDPV--VSFDDVDISAVPQSIDWRDYGAVTSVKNQGSC 117
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLA 212
GSCWAFSA+A VEGI +I G LI LSEQ+++DC+ + GC GG ++KA+++II N G+
Sbjct: 118 GSCWAFSAIATVEGIYKIKAGNLISLSEQEVLDCAL-SYGCDGGWVNKAYDFIISNNGVT 176
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
+ A+ PY+ +G C+ + A I Y + +E +++ AV QP++ ++A G F
Sbjct: 177 SFANLPYKGYKGPCN-HNDLPNKAYITGYTYVQSNNERSMMIAVANQPIAALIDAGGD-F 234
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
++YK GV CG + +H + V+G+G + G KYW++KNSWG +WGE GYIR+ RD
Sbjct: 235 QYYKSGVFTGSCGTSLNHAITVIGYG--QTSSGTKYWIVKNSWGTSWGERGYIRMARDVS 292
Query: 331 --EGLCGIATEASYP 343
GLCGIA +P
Sbjct: 293 SPYGLCGIAMAPLFP 307
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 144/310 (46%), Positives = 196/310 (63%), Gaps = 14/310 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V E W ++ + YK+ EK R IFK NL YI++ NK+ N +Y LG NEF+DLT++
Sbjct: 18 LVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKK-NSSYWLGLNEFADLTHD 76
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+A Y G ++ QS F Y++V D P SIDWR+KGAVT +KNQ CGSCWA
Sbjct: 77 EFKAKYVGSLGEDSTIIEQSD-DEEFPYKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWA 135
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADY 217
FS VA VEGI +I GKLI LSEQ+L+DC ++GC GG + +Y+ +N G+ TE +Y
Sbjct: 136 FSTVATVEGINKIVTGKLISLSEQELLDCDRRSHGCKGGYQTTSLQYVADN-GVHTEKEY 194
Query: 218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKR 277
PY+++QG C + +K + I Y+ +P +E +L+QA+ QPVSV VE+ G+AF+FYK
Sbjct: 195 PYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPVSVVVESKGRAFQFYKG 254
Query: 278 GVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGL 333
G+ CG DH V VG+ G Y LIKNSWG WGE GYIRI R +G
Sbjct: 255 GIFEGPCGTKVDHAVTAVGY-------GKNYILIKNSWGPKWGEKGYIRIKRASGKSKGT 307
Query: 334 CGIATEASYP 343
CG+ + + +P
Sbjct: 308 CGVYSSSYFP 317
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 286 bits (731), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 154/293 (52%), Positives = 195/293 (66%), Gaps = 11/293 (3%)
Query: 56 ELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR 115
ELEK R IFK NLEYIE N GN++YKLG N++SDLT++EF AS+TG + +S
Sbjct: 78 ELEKRKR--IFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGL-KVSKQLSS 134
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKL 175
R + + DVPT+ DWR++GAVT +K+QG CG CWAFS VAAVEG +I G+L
Sbjct: 135 SKMRSAAVPFNLNDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGEL 194
Query: 176 IELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAA 235
I LSEQQLVDC N+GC GG MD AF+YII+ KG+ +EADYPYQ+ TC +
Sbjct: 195 ISLSEQQLVDCDERNSGCHGGNMDSAFKYIIQ-KGIVSEADYPYQEGSQTCQLNDQMKFE 253
Query: 236 ATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVV 295
A I + D+P DE LLQAV +QPVSV +E G F+ Y V + CG + +H V V
Sbjct: 254 AQITNFIDVPANDEQQLLQAVAQQPVSVGIEV-GDEFQHYMGDVYSGTCGQSMNHAVTAV 312
Query: 296 GFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYPV 344
G+G + EDG KYWLIKNSWG+ WGE GY+++LR+ G CGIA ASYP+
Sbjct: 313 GYGVS--EDGTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGIAAHASYPI 363
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 285 bits (730), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 140/293 (47%), Positives = 189/293 (64%), Gaps = 16/293 (5%)
Query: 62 RLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGY-----NRPVPSVSRQ 116
R FK+N YIE+ N+ G +Y+LG N+FSDLT+EEFR + G + PV + R
Sbjct: 34 RFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRD 93
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
S F QNV D+P S+DWR+ GAVT K+QG CG CWAF+ A+EGI QI G+L+
Sbjct: 94 SDIEEGF--QNV-DLPASVDWRQHGAVTAPKDQGSCGGCWAFATTGAIEGINQIVTGQLV 150
Query: 177 ELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAA 235
LSEQ+L+DC + GC GGLM+ A+++I+EN GL TE DYPY + C+ +K +
Sbjct: 151 SLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDYPYHASESHCNMKKLNSRV 210
Query: 236 ATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVV 295
I Y+ +P+GDE ALL AV KQPVSV +E + + F+ Y GV CG+ +HGV +V
Sbjct: 211 VAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGASKDFQHYASGVFTGHCGEEINHGVLIV 270
Query: 296 GFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYPV 344
G+GT EDG YW++KNSW TWG+ G++++ R+ GLC I T ASYPV
Sbjct: 271 GYGT---EDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCSINTLASYPV 320
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 147/309 (47%), Positives = 191/309 (61%), Gaps = 43/309 (13%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E W+ +HG++Y E+ R IFK NL +IE+ N NRTYK+G + +S FRA
Sbjct: 4 YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVG-DRYS------FRA 55
Query: 102 SYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
D+P S+DWREKGAV +K+QG+CGSCWAFS +
Sbjct: 56 G--------------------------EDLPESVDWREKGAVVPVKDQGNCGSCWAFSTI 89
Query: 162 AAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
AAVEGI QI G LI LSEQ+LVDC N GC+GGLMD AFE+II N G+ +E DYPY+
Sbjct: 90 AAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYR 149
Query: 221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL 280
TCD ++ A +I YED+P+ DE +L +AV QPVSV +EA G+AF+ Y+ GV
Sbjct: 150 AADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGVF 209
Query: 281 NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCG 335
+CG DHGV VG+GT E+ YW+++NSWG WGESGYI++ R+ G CG
Sbjct: 210 TGQCGTQLDHGVVAVGYGT---ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCG 266
Query: 336 IATEASYPV 344
IA E SYP+
Sbjct: 267 IAIEPSYPI 275
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 151/325 (46%), Positives = 206/325 (63%), Gaps = 17/325 (5%)
Query: 29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR---TYK 85
+GRS E I+ +++W A+H D+ RL +FK+NL ++++ N +R Y+
Sbjct: 32 AGRSDEEVRII--YQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYR 89
Query: 86 LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ-NVTDV-PTSIDWREKGAV 143
LG N F+DLTNEE+RA + R + + R +S + +Y+ DV P SIDWREKGAV
Sbjct: 90 LGMNRFADLTNEEYRARFL---RDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAV 146
Query: 144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFE 203
+K+QG CGSCWAF+A+A VEGI QI G LI LSEQQLVDCST N+GC GG +AF+
Sbjct: 147 VAVKSQGRCGSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTRNHGCEGGWPYRAFQ 206
Query: 204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSV 263
YII N G+ +E YPY GTC+ K A +I Y ++P DE +L +AV QP+SV
Sbjct: 207 YIINNGGVNSEEHYPYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISV 266
Query: 264 CVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
+ ASG+ F+ Y G+ C + +HGV VVG+GT +G YW++KNSWGE+WG+SG
Sbjct: 267 GINASGRNFQLYHSGIFTGSCNTSLNHGVTVVGYGTV---NGNDYWIVKNSWGESWGDSG 323
Query: 324 YIRILRD----EGLCGIATEASYPV 344
YI + R+ G CGIA SYP+
Sbjct: 324 YILMERNIAESSGKCGIAISPSYPI 348
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 194/307 (63%), Gaps = 21/307 (6%)
Query: 46 MAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG 105
MA++GR YKD EK R IFK N+ +IE N +Y LG N+F+D+TN EF A YTG
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 106 -YNRPV-----PSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
+RP+ P VS F N++ V SIDWR+ GAVT +K+Q CGSCWAFS
Sbjct: 61 GISRPLNIEKEPVVS--------FDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFS 112
Query: 160 AVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPY 219
A+A VEGI +I G L+ LSEQ+++DC+ +NGC GG +D A+++II N G+A+EADYPY
Sbjct: 113 AIATVEGIYKIVTGYLVSLSEQEVLDCAV-SNGCDGGFVDNAYDFIISNNGVASEADYPY 171
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
Q QG C +A G Y + DE ++ AV QP++ ++ASG F++Y GV
Sbjct: 172 QAYQGDCAANSWPNSAYITG-YSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGV 230
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR---DEGLCGI 336
+ CG + +H + ++G+G ++ G +YW++KNSWG +WGE GYIR+ R GLCGI
Sbjct: 231 FSGPCGTSLNHAITIIGYG--QDSSGTQYWIVKNSWGSSWGERGYIRMARGVSSSGLCGI 288
Query: 337 ATEASYP 343
A + YP
Sbjct: 289 AMDPLYP 295
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 284 bits (727), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 206/315 (65%), Gaps = 14/315 (4%)
Query: 35 EPS--IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
EP+ ++++ E+WMA++GR YKD EK R IFK N+++IE N +Y LG N+F+
Sbjct: 1 EPNDPMMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFT 60
Query: 93 DLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
D+T EF A YTG + P+ ++ R+ +F N++ VP SIDWR+ GAV +KNQ C
Sbjct: 61 DMTKSEFVAQYTGVSLPL-NIEREPV--VSFDDVNISAVPQSIDWRDYGAVNEVKNQNPC 117
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLA 212
GSCWAF+A+A VEGI +I G L+ LSEQ+++DC+ + GC GG ++KA+++II N G+
Sbjct: 118 GSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV-SYGCKGGWVNKAYDFIISNNGVT 176
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE +YPYQ QGTC+ +A G Y + + DE +++ AV+ QP++ ++AS + F
Sbjct: 177 TEENYPYQAYQGTCNANSFPNSAYITG-YSYVRRNDERSMMYAVSNQPIAALIDAS-ENF 234
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR--- 329
++Y GV + CG + +H + ++G+G ++ G KYW+++NSWG +WGE GY+R+ R
Sbjct: 235 QYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTKYWIVRNSWGSSWGEGGYVRMARGVS 292
Query: 330 -DEGLCGIATEASYP 343
G CGIA +P
Sbjct: 293 SSSGACGIAMSPLFP 307
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 142/306 (46%), Positives = 195/306 (63%), Gaps = 11/306 (3%)
Query: 45 WMAQHGRTYKDELEKAMR-LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASY 103
W+ + YKD +E+ R +++ NLE++ N E + T+KLG F+DLT++E+R
Sbjct: 51 WVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHN-EKDSTFKLGLTNFADLTHDEYRQHA 109
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAA 163
GY + + + + F+Y + + P SIDWR+KGAVT +KNQ CGSCWAFS +
Sbjct: 110 LGYRPELKGTGLGTGKSTGFQYADY-EAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTTGS 168
Query: 164 VEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
VEG I G+L+ LSEQ+LVDC T ++GC GGLMD AF +II N G+ TE DY Y+ +
Sbjct: 169 VEGANAIYSGELVSLSEQELVDCDVTQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYKAQ 228
Query: 223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNA 282
G C+ KEK TI YED+P DE AL +A QP+SV +EA + F+ Y GV +A
Sbjct: 229 DGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVFDA 288
Query: 283 ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIAT 338
CG DHGV VVG+G+ ++G YW++KNSWG+ WG+SGYIR+ R G CGIA
Sbjct: 289 PCGTALDHGVLVVGYGS---DNGTDYWIVKNSWGDFWGDSGYIRLARGISNSAGQCGIAM 345
Query: 339 EASYPV 344
+ASYP+
Sbjct: 346 QASYPI 351
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 151/304 (49%), Positives = 189/304 (62%), Gaps = 15/304 (4%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
+M Q+ + Y E + R FK N+E I N N +Y +G NEF+DL+ EEF+ Y
Sbjct: 45 FMKQYSKAY-SHAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAV 164
GY V R+ +R + +Q V PTSIDWR AVT IK+QG CGSCWAFSA ++
Sbjct: 104 GYKH----VEREFARSNNL-HQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSI 158
Query: 165 EGITQITGGK-LIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
EG + G L LSEQQLVDCST N GC+GGLMD AFEYII NKG+ E+ YPY+
Sbjct: 159 EGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYKG 218
Query: 222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL 280
G C QK TI Y+D+ GDE +LL AV T PVSV +EA F+FY GV
Sbjct: 219 VGGLC--QKSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVF 276
Query: 281 NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
+ CG N DHGV VG+GT +D YW++KNSWG +WGESGYIR++R++ CGIA +
Sbjct: 277 SGTCGHNLDHGVLAVGYGTTGSQD---YWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQP 333
Query: 341 SYPV 344
SYP
Sbjct: 334 SYPT 337
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 152/343 (44%), Positives = 209/343 (60%), Gaps = 18/343 (5%)
Query: 10 IIPMFVIIILVITCA-SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQ 68
I+ V I V C+ S+ S+ HE+WMAQHG+ YKD EK L IF+
Sbjct: 6 ILKFLVAFIEVDACSLSESCCSHSL-------SHEKWMAQHGKVYKDAAEKERCLQIFEN 58
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
N+E+IE + G++++ L TN+F+DL +EEF+A T ++ S+ ++ + F+Y NV
Sbjct: 59 NMEFIESFDVCGDKSFNLSTNQFADLHDEEFKALLTNGHKKEHSL--WTTTETLFRYDNV 116
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFS-AVAAVEGITQITGGKLIELSEQQLVD-C 186
T +P S+DWR++G VT IK+QG C SCWAFS VA +EG+ QI +L+ LSEQ+LVD
Sbjct: 117 TKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLCVATIEGLHQIITSELVPLSEQELVDFV 176
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
++ GC G ++ AF++I + + +E YPY+ TC +KE A I Y+ +P
Sbjct: 177 KGESEGCYGDYVEDAFKFITKKGRIESETHYPYKGVNNTCKVKKETHGVAQIKGYKKVPS 236
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
E+ALL+AV Q VSV VEA AF+FY G+ +CG + DH VA+ +G E DG
Sbjct: 237 KSENALLKAVANQLVSVSVEARDSAFQFYSSGIFTGKCGTDTDHRVALASYG--ESGDGT 294
Query: 307 KYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPVA 345
KYWL KNSWG WGE GYIRI D EGLCGIA YP+A
Sbjct: 295 KYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLCGIAKYPYYPIA 337
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 157/351 (44%), Positives = 219/351 (62%), Gaps = 23/351 (6%)
Query: 5 FEKSFIIPMFVIIILVITCAS--QVVSGRSMHEPSIVEKHEQWMAQHGRTYKD-ELEKAM 61
F+ S I+ + + + ++ AS ++ R+ E ++ ++QW A+HG+ + + E
Sbjct: 4 FQSSPIMALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPEN 61
Query: 62 RLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS 121
R IFK NL++I++ N + N Y+LG N F+DLTNEE+R+ Y G S SR++ +
Sbjct: 62 RFHIFKDNLKFIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLG--GKFASGSRRNRTSN 118
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
+ + D+P SIDWR KGAV +K+QG CGSCWAFS VA+VE I QI G LI LSEQ
Sbjct: 119 RYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQ 178
Query: 182 QLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
+LVDC N GC+GGLMD AFE+IIEN GL TE DYPY +C + K+ A I
Sbjct: 179 ELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNA----IDG 234
Query: 241 YEDLPKGDEHALLQA---VTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
YED+P +E AL +A VSV +E G++F+ Y+ G+ CG + DHGV VVG+
Sbjct: 235 YEDVPVNNEKALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGY 294
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
G+ E G YW+++NSWG +WGESGY+++ R+ GLCGIA E SYP
Sbjct: 295 GS---EGGVDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 342
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 150/313 (47%), Positives = 196/313 (62%), Gaps = 21/313 (6%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+ E+W+ Q+ R YKD+ E +R I++ NLEYIE N + +Y L N+F+DLTNEEF
Sbjct: 4 RFERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQ-EXSYNLTDNKFADLTNEEFV 62
Query: 101 ASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
+ Y G+ R +P F Y D+P S DWR++GAV+ IK+QG+CGSCWAFS
Sbjct: 63 SPYLGFGTRFLPHTG--------FMYHEHEDLPESKDWRKEGAVSDIKDQGNCGSCWAFS 114
Query: 160 AVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADY 217
AVAAVEGI +I GKL+ LSEQ+ DC + N GC GGLMD AF +I +N GL T DY
Sbjct: 115 AVAAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDY 174
Query: 218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL--LQAVTKQPVSVCVEASGQAFRFY 275
PY+ GTC+K+K AA I + +P DE L A Q SV ++A G AF+ Y
Sbjct: 175 PYEGVDGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLY 234
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----E 331
+GV + CG +HGV +VG+G + KYW++KNSWG WGESGYIR+ RD
Sbjct: 235 LKGVFSGICGKQLNHGVTIVGYGKGTSD---KYWIVKNSWGADWGESGYIRMKRDAFDKA 291
Query: 332 GLCGIATEASYPV 344
G CGIA +ASYP+
Sbjct: 292 GTCGIAMQASYPL 304
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 143/309 (46%), Positives = 188/309 (60%), Gaps = 11/309 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+W+ +H + Y EK R IFK NL +I++ N + N +YK+G N+F+D+ NEE+R
Sbjct: 4 YEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQ-NYSYKVGLNKFADINNEEYRD 62
Query: 102 SYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
Y G ++ N V +DWR KGAVTHIK+QG CGSCWAFS +
Sbjct: 63 MYLGTKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFSTI 122
Query: 162 AAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
A VE I +I GK + LSEQ+LVDC N GC+GGLMD AFE+II N G+ T+ DYPY
Sbjct: 123 ATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDYPYN 182
Query: 221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL 280
+ CD K+ A +I YED+P +AL +AV QPVSV + G+A + Y+ GV
Sbjct: 183 GFERKCDPTKKNAKVVSIDGYEDVPSY-MNALKKAVAHQPVSVAIAGLGRALQLYQSGVF 241
Query: 281 NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-----GLCG 335
+CG + DHGV VVG+G+ E+G YWL++NSWG WGE GY +I CG
Sbjct: 242 TGKCGTDLDHGVVVVGYGS---ENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKCG 298
Query: 336 IATEASYPV 344
IA EASYPV
Sbjct: 299 IAMEASYPV 307
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 283 bits (725), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 148/335 (44%), Positives = 211/335 (62%), Gaps = 16/335 (4%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN 77
+L ++ V RS E ++ + +W A++ K RL +FK+NL++++K N
Sbjct: 29 VLTLSKQGGAVPVRSDEEVRML--YLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHN 86
Query: 78 KEGNR---TYKLGTNEFSDLTNEEFRASYT-GYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
+R T++LG N F+DLTNEE+R + ++R S S + S S ++ + D+P
Sbjct: 87 AAADRGEHTFRLGMNRFADLTNEEYRTRFLRDFSRLRRSASGKIS--SRYRLREGDDLPD 144
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGC 193
SIDWREKGAV +KNQG CGSCWAFS VAAVEGI QI G LI LSEQQLVDC+T N+GC
Sbjct: 145 SIDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTANHGC 204
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALL 253
GG M+ AF++I+ N G+ +E YPY+ + G C+ A +I YE++P +E +L
Sbjct: 205 RGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTV-NAPVVSIDSYENVPSHNEQSLQ 263
Query: 254 QAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKN 313
+AV QPVSV ++A+G+ F+ Y+ G+ C + +H + VVG+GT ++D Y +KN
Sbjct: 264 KAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGTENDKD---YRTVKN 320
Query: 314 SWGETWGESGYIRILRD----EGLCGIATEASYPV 344
SWG+ WGESGYIR+ R+ G CGI ASYPV
Sbjct: 321 SWGKNWGESGYIRVERNIGNPNGKCGITRFASYPV 355
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 195/315 (61%), Gaps = 16/315 (5%)
Query: 42 HEQWMAQHG----RTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ W+A+HG ++ R + F NL +++ N G ++L N F+DL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
TN+EFRA+Y G +++ ++P ++DWREKGAV +KNQG CGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLA 212
CWAFSAV+ VE I QI G+++ LSEQ+LV+C + ++GC+GGLMD AFE+II+N G+
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYPY+ G CD ++ A +I +ED+P+ DE +L +AV PVSV +EA G+ F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+ Y GV + CG DHGV VG+GT E+G YW+++NSWG WGE+GY+R+ R+
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRMERNIN 348
Query: 331 --EGLCGIATEASYP 343
G CGIA +SYP
Sbjct: 349 VTSGKCGIAMMSSYP 363
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 148/325 (45%), Positives = 199/325 (61%), Gaps = 15/325 (4%)
Query: 29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYK 85
SG+ E + +W AQHG +E E R F+ NL YI++ N G +++
Sbjct: 30 SGQIRSEEETRRMYAEWTAQHGSPITNEEEG--RYEAFRDNLRYIDEHNAAADAGIHSFR 87
Query: 86 LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
LG N F+ LTNEE+RA+Y G +V + ++ + +P S+DWREKGAV
Sbjct: 88 LGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVDWREKGAVGK 147
Query: 146 IKNQGH-CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFE 203
+K+QG CGS WAFSA+AAVE I QI G+LI LSEQ+L+DC T N GC GGLMD AFE
Sbjct: 148 VKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCDGGLMDDAFE 207
Query: 204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSV 263
+II N G+ T+ DYPY+ +CD K A TI YEDL + +E +L +AV+ QPVSV
Sbjct: 208 FIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDL-RMNEKSLQKAVSNQPVSV 266
Query: 264 CVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
+EA G+ F+ YK G+ CG + DH +VG+G+ E+G YW++K S+G +WGESG
Sbjct: 267 AIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYGS---ENGTDYWIVKESYGTSWGESG 323
Query: 324 YIRILRD----EGLCGIATEASYPV 344
Y R+ R+ G CGIA SYPV
Sbjct: 324 YARMERNIKETSGKCGIAMLPSYPV 348
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 150/304 (49%), Positives = 189/304 (62%), Gaps = 15/304 (4%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
+M Q+ + Y E + R FK N+E I N N +Y +G NEF+DL+ EEF+ Y
Sbjct: 45 FMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAV 164
GY V R+ +R + +Q V PTSIDWR AVT IK+QG CGSCWAFSA ++
Sbjct: 104 GYKH----VEREFARSNNL-HQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSI 158
Query: 165 EGITQITGGK-LIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
EG + G L LSEQQLVDCST + GC+GGLMD AFEYII NKG+ E+ YPY+
Sbjct: 159 EGAWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAESAYPYKG 218
Query: 222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL 280
G C QK TI Y+D+ GDE +LL AV T PVSV +EA F+FY GV
Sbjct: 219 VGGLC--QKSCTKVVTISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGVF 276
Query: 281 NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGLCGIATEA 340
+ CG N DHGV VG+GT +D YW++KNSWG +WGESGYIR++R++ CGIA +
Sbjct: 277 SGTCGHNLDHGVLAVGYGTTGSQD---YWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQP 333
Query: 341 SYPV 344
SYP
Sbjct: 334 SYPT 337
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 136/316 (43%), Positives = 195/316 (61%), Gaps = 16/316 (5%)
Query: 42 HEQWMAQHG----RTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ W+A+HG ++ R + F NL +++ N G ++L N F+DL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
TN+EFRA+Y G +++ ++P ++DWREKGAV +KNQG CGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLA 212
CWAFSAV+ VE I QI G+++ LSEQ+LV+C + ++GC+GGLMD AFE+II+N G+
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYPY+ G CD ++ A +I +ED+P+ DE +L +AV PVSV +EA G+ F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+ Y GV + CG DHGV VG+GT E+G YW+++NSWG WGE+GY+R+ R+
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRMERNIN 348
Query: 331 --EGLCGIATEASYPV 344
G CGIA +SYP
Sbjct: 349 VTSGKCGIAMMSSYPT 364
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 136/316 (43%), Positives = 195/316 (61%), Gaps = 16/316 (5%)
Query: 42 HEQWMAQHG----RTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ W+A+HG ++ R + F NL +++ N G ++L N F+DL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
TN+EFRA+Y G +++ ++P ++DWREKGAV +KNQG CGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLA 212
CWAFSAV+ VE I QI G+++ LSEQ+LV+C + ++GC+GGLMD AFE+II+N G+
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYPY+ G CD ++ A +I +ED+P+ DE +L +AV PVSV +EA G+ F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+ Y GV + CG DHGV VG+GT E+G YW+++NSWG WGE+GY+R+ R+
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRMERNIN 348
Query: 331 --EGLCGIATEASYPV 344
G CGIA +SYP
Sbjct: 349 VTSGKCGIAMMSSYPT 364
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 134/269 (49%), Positives = 188/269 (69%), Gaps = 5/269 (1%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E W++ + Y+ EK +R +FK NL++I++ NK+G ++Y LG NEF+DL++E
Sbjct: 47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLSHE 105
Query: 98 EFRASYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
EF+ Y G + V R R + F Y++V VP S+DWR+KGAV +KNQG CGSCW
Sbjct: 106 EFKKMYLGLKTDI--VRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEA 215
AFS VAAVEGI +I G L LSEQ+L+DC T NNGC+GGLMD AFEYI++N GL E
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEE 223
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
DYPY E+GTC+ QK+++ TI ++D+P DE +LL+A+ QP+SV ++ASG+ F+FY
Sbjct: 224 DYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFY 283
Query: 276 KRGVLNAECGDNCDHGVAVVGFGTAEEED 304
GV + CG + DHGVA VG+G+++ D
Sbjct: 284 SGGVFDGRCGVDLDHGVAAVGYGSSKGSD 312
>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 334
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 145/329 (44%), Positives = 209/329 (63%), Gaps = 25/329 (7%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
SQ +++E SIV+ H+QWM Q R YKDE EK MRL +FK+NL++IE N GN++Y
Sbjct: 21 SQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSY 80
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT---SIDWREKG 141
LG NEF+D EEF A++TG V S+S ++ + N++D+ S DWR++G
Sbjct: 81 TLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEG 140
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDK 200
AVT +K QG C +T+I+G L+ LSEQQL+DC + N GC+GG ++
Sbjct: 141 AVTPVKYQGAC-------------RLTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEFEE 187
Query: 201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQP 260
AF+YII+N G++ E +YPYQ ++ +C +A I ++ +P +E ALL+AV +QP
Sbjct: 188 AFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQP 247
Query: 261 VSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
VSV ++A +F YK GV +CG + +H V +VG+GT G YW++KNSWGE+W
Sbjct: 248 VSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTM---SGLNYWVLKNSWGESW 304
Query: 320 GESGYIRILRD----EGLCGIATEASYPV 344
GE+GY+RI RD +G+CGIA A+YPV
Sbjct: 305 GENGYMRIRRDVEWPQGMCGIAQVAAYPV 333
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 281 bits (720), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 138/257 (53%), Positives = 172/257 (66%), Gaps = 8/257 (3%)
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
+TN EFR++Y G + R S + +F Y+ V VP S+DWR+KGAVT IK+QG C
Sbjct: 1 MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 60
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENKGL 211
GSCWAFS V AVEGI I KL+ LSEQ+LVDC T +N GC+GGLM AFE+I E G+
Sbjct: 61 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGI 120
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
TE YPY E GTCD K + +I +E +P +E ALL+A QP+SV ++A G A
Sbjct: 121 TTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSA 180
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR-- 329
F+FY GV CG + DHGVA+VG+GT DG KYW++KNSWG WGE+GYIR+ R
Sbjct: 181 FQFYSEGVFAGRCGTDLDHGVAIVGYGTT--LDGTKYWIVKNSWGTDWGENGYIRMKRGI 238
Query: 330 --DEGLCGIATEASYPV 344
EGLCGIA EASYP+
Sbjct: 239 SAKEGLCGIAVEASYPI 255
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 281 bits (719), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 152/342 (44%), Positives = 207/342 (60%), Gaps = 20/342 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEK--HEQWMAQHGRTYKDELEKAMRLTIFK 67
II + V+ L IT ++ S V + +E W+ ++G+ Y+++ E R I++
Sbjct: 10 IINLLVLCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEFRFEIYR 69
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+++IE N + N +YKL N+F DLTNEEFR Y Y +P +S + F YQ
Sbjct: 70 ANVQFIEVYNSQ-NYSYKLMDNKFVDLTNEEFRRMYLVY-QP------RSHLQTRFMYQK 121
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
D+P IDWR +GAVT IK+QGHCGSCW+FSAVA VE I +I GKL+ LSEQQL+DC
Sbjct: 122 HGDLPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQQLIDCD 181
Query: 188 TDNN--GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
N GC+GG M+ F +I + GL T+ +YPYQ G +K K + A I YE+LP
Sbjct: 182 NRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSDGDXNKAKVRNHAVAICGYENLP 240
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
+E+ L AV QP SV +A G AF+ Y +G + CG + +H + +VG+G EE+G
Sbjct: 241 AHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYG---EENG 297
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
KYWL+KNSW G SGYIR+ RD +G CG A EASYP
Sbjct: 298 EKYWLVKNSWANDXGVSGYIRMKRDPKDKDGTCGTAMEASYP 339
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 192/316 (60%), Gaps = 14/316 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ W H R+Y E R ++++N E+I+ N G+ TY+L NEF+DLT E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADLTEE 106
Query: 98 EFRASYTGY---NRPVPS---VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQ-G 150
EF A+YTGY + PV + ++F Y+ DVP S+DWR +GAV K+Q
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTS 164
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKG 210
C SCWAF A +E + I GKL+ LSEQQLVDC + + GC+ G +A+++++EN G
Sbjct: 165 TCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGG 224
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
L TEADYPY +G C++ K AA I + +P +E AL AV +QPV+V +E G
Sbjct: 225 LTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GS 283
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
+FYK GV CG H V VVG+GT + GAKYW IKNSWG++WGE GYIRILRD
Sbjct: 284 GMQFYKGGVYTGPCGTRLAHAVTVVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRD 342
Query: 331 ---EGLCGIATEASYP 343
GLCG+ + +YP
Sbjct: 343 VGGPGLCGVTLDIAYP 358
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 138/291 (47%), Positives = 193/291 (66%), Gaps = 14/291 (4%)
Query: 62 RLTIFKQNLEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRASYT-GYNRPVPSVSRQS 117
RL +FK+NL+++++ N +R T+ LG N F+DLTNEE+R + ++R S S +
Sbjct: 73 RLEVFKENLQFVDEHNAAADRGEHTFLLGMNRFADLTNEEYRTRFLRDFSRLRRSASGKI 132
Query: 118 SRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIE 177
S S ++ + D+P SIDWRE GAV +KNQG CGSCWAFS VAAVEGI QI G LI
Sbjct: 133 S--SRYRLREGDDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLIS 190
Query: 178 LSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
LSEQQLVDC+T N+GC GG M+ AF++I+ N G+ +E YPY+ + G C+ A +
Sbjct: 191 LSEQQLVDCTTANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQNGICNSTV-NAPVVS 249
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I YE++P +E +L +AV QPVSV ++A+G+ F+ Y+ G+ C + +H + VVG+
Sbjct: 250 IDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGY 309
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
GT ++D +W++KNSWG+ WGESGYIR R+ G CGI ASYPV
Sbjct: 310 GTENDKD---FWIVKNSWGKNWGESGYIRAERNIENPNGKCGITRFASYPV 357
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 146/350 (41%), Positives = 208/350 (59%), Gaps = 19/350 (5%)
Query: 2 VLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDEL 57
+ K + +I+ + ++ A + G S + + E+ E WM +H R Y +
Sbjct: 4 ICSISKLIFVATCLIVHVGLSSADFSIVGYSQDDLTSTERLIRLFESWMLKHDRVYNNIE 63
Query: 58 EKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQS 117
EK R IFK NL YI++ NK+ N +Y LG NEF DLT++EF+ Y G + V+ +
Sbjct: 64 EKIHRFEIFKDNLMYIDETNKK-NNSYWLGLNEFVDLTHDEFKEKYVG-SIGEDFVTIEQ 121
Query: 118 SRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIE 177
S F Y++V D P SIDWR+KGAVT +K CGSCWAFS VA VEGI +I GKLI
Sbjct: 122 SNDEEFPYKHVVDYPESIDWRDKGAVTPVK-PNPCGSCWAFSTVATVEGINKIVTGKLIS 180
Query: 178 LSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAAT 237
LSEQ+L+DC ++GC GG + +Y+++N G+ TE +YPY+++QG C +++K
Sbjct: 181 LSEQELLDCDRRSHGCKGGYQTTSLQYVVDN-GVHTEKEYPYEKKQGKCRAKEKKGTKVQ 239
Query: 238 IGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGF 297
I Y+ +P DE +L+QA+ QPVSV +E+ G+AF+ YK G+ N CG DH V +G+
Sbjct: 240 ITGYKRVPANDEISLIQAIANQPVSVLLESKGRAFQLYKGGIFNGPCGTKLDHAVTAIGY 299
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYP 343
G Y LIKNSWG WGE GY++I R EG CG+ + +P
Sbjct: 300 GKT-------YILIKNSWGPNWGEKGYLKIKRASGKSEGTCGVYKSSYFP 342
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 140/316 (44%), Positives = 192/316 (60%), Gaps = 14/316 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ W H R+Y E R ++++N E+I+ N G+ TY+L NEF+DLT E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 106
Query: 98 EFRASYTGY---NRPVPS---VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQ-G 150
EF A+YTGY + PV + ++F Y+ DVP S+DWR +GAV K+Q
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTS 164
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKG 210
C SCWAF A +E + I GKL+ LSEQQLVDC + + GC+ G +A+++++EN G
Sbjct: 165 TCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGG 224
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
L TEADYPY +G C++ K AA I + +P +E AL AV +QPV+V +E G
Sbjct: 225 LTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GS 283
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
+FYK GV CG H V VVG+GT + GAKYW IKNSWG++WGE GYIRILRD
Sbjct: 284 GMQFYKGGVYTGPCGTRLAHAVTVVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRD 342
Query: 331 ---EGLCGIATEASYP 343
GLCG+ + +YP
Sbjct: 343 VGGPGLCGVTLDIAYP 358
>gi|9502426|gb|AAF88125.1|AC021043_18 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 365
Score = 280 bits (716), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 148/347 (42%), Positives = 215/347 (61%), Gaps = 30/347 (8%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
SQ +++E SIV+ H+QWM Q R YKDE EK MRL +FK+NL++IE N GN++Y
Sbjct: 21 SQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSY 80
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT---SIDWREKG 141
LG NEF+D EEF A++TG V S+S ++ + N++D+ S DWR++G
Sbjct: 81 TLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEG 140
Query: 142 AVTHIKNQGHCGSCWA------------FSAVAAV------EGITQITGGKLIELSEQQL 183
AVT +K QG C ++ + V EG+T+I+G L+ LSEQQL
Sbjct: 141 AVTPVKYQGACPEFPTKQIRRNSLVGKQYTKLLGVLSDWGDEGLTKISGKNLLTLSEQQL 200
Query: 184 VDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
+DC + N GC+GG ++AF+YII+N G++ E +YPYQ ++ +C +A I ++
Sbjct: 201 IDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQ 260
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAE 301
+P +E ALL+AV +QPVSV ++A +F YK GV +CG + +H V +VG+GT
Sbjct: 261 MVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTM- 319
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
G YW++KNSWGE+WGE+GY+RI RD +G+CGIA A+YPV
Sbjct: 320 --SGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 364
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 280 bits (715), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 145/353 (41%), Positives = 207/353 (58%), Gaps = 25/353 (7%)
Query: 12 PMFVIIILVITC----ASQVVSGRS-------MHEPSIVEKHEQWMAQHGRTYKDELEKA 60
P + + L+ +C A+ ++ R+ + + ++++ W H R+Y E
Sbjct: 6 PPVLTLALLASCGALLATSMLPARATAGSCLDVGDMVMMDRFRAWQGAHNRSYPSAEEAL 65
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGY---NRPVPS---VS 114
R ++++N E+I+ N G+ TY+L NEF+DLT EEF A+YTGY + PV +
Sbjct: 66 QRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEEEFLATYTGYYAGDGPVDDSVITT 125
Query: 115 RQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQ-GHCGSCWAFSAVAAVEGITQITGG 173
++F Y+ DVP S+DWR +GAV K+Q C SCWAF A +E + I G
Sbjct: 126 GAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTSTCSSCWAFVTAATIESLNMIKTG 183
Query: 174 KLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKA 233
KL+ LSEQQLVDC + + GC+ G +A+++++EN GL TEADYPY +G C++ K
Sbjct: 184 KLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGGLTTEADYPYTARRGPCNRAKSAH 243
Query: 234 AAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVA 293
AA I + +P +E AL AV +QPV+V +E G +FYK GV CG H V
Sbjct: 244 HAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GSGMQFYKGGVYTGPCGTRLAHAVT 302
Query: 294 VVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYP 343
VVG+GT + GAKYW IKNSWG++WGE GYIRILRD GLCG+ + +YP
Sbjct: 303 VVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRDVGGPGLCGVTLDIAYP 354
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 280 bits (715), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 150/318 (47%), Positives = 203/318 (63%), Gaps = 20/318 (6%)
Query: 35 EPSIVEKHEQWMAQH--GRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
+ ++ + +E+W + + R++ EK R +FK+N++YI + NK ++ YKL N+F
Sbjct: 37 DETLWDLYERWRSVYTSARSFG---EKQNRFHVFKENVKYINEVNKM-DKPYKLRLNQFG 92
Query: 93 DLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
DLT EF +Y N + +R S F Y+NV +VP SIDWR KGAVT +KNQG C
Sbjct: 93 DLTPSEFARTYA--NSKIIEGTRNES--GGFMYENV-EVPRSIDWRVKGAVTPVKNQGRC 147
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLA 212
G CWAFSA AAVEGI QIT G+LI LSEQQL+DC T N+GC GG M +AFEYI + G+
Sbjct: 148 GGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDTQNSGCRGGTMGRAFEYIKQRGGIT 207
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA---SG 269
+EA+YPY+ + G C + +I Y ++ + E A+L+ + QPVSV V+A S
Sbjct: 208 SEANYPYKAQAGMCKNNLIQRPTVSIDGYYNIRR-SEDAVLKILAHQPVSVAVDATTWSS 266
Query: 270 QAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
+ FY +GV CG +HGV VG+GT DG YW+IKNSWGETWGE GY+R+LR
Sbjct: 267 LDWMFYFQGVFTGPCGTKLNHGVTAVGYGTT--NDGYDYWIIKNSWGETWGERGYMRMLR 324
Query: 330 ---DEGLCGIATEASYPV 344
GLCGIA +AS+P+
Sbjct: 325 GVSPYGLCGIAMQASFPI 342
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 279 bits (714), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 141/279 (50%), Positives = 186/279 (66%), Gaps = 9/279 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+ F + +I F + + SQ ++ R++ E S+ E+HEQWMA + R YKD EK MR
Sbjct: 1 MVFTEPYICITFALFFSIGAWTSQCMA-RTLQEASMYERHEQWMASYARVYKDANEKQMR 59
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
IFK+N++ I+ N E +++YKL N+F+DLTNEEF++ G+ + S ++
Sbjct: 60 YKIFKENVQRIDSFNSESDKSYKLAVNQFADLTNEEFKSLRNGFKGHMCS-----AQAGH 114
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
F+Y+NVT VP SIDWR+KGAVT IK QG CGSCWAFSAVAAVEGIT+I GKLI LSEQ+
Sbjct: 115 FRYENVTAVPASIDWRKKGAVTQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQE 174
Query: 183 LVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
LVDC T+ + GC GGLMD AF++ IE GLA+EA YPY TC ++E +A I
Sbjct: 175 LVDCDTNSEDQGCQGGLMDDAFKF-IEQHGLASEATYPYDAADSTCKTKEEAKPSAKITG 233
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
YED+P DE AL AV QPVSV ++A G F+FY G+
Sbjct: 234 YEDVPANDEAALKNAVANQPVSVAIDAGGFEFQFYSSGI 272
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 149/345 (43%), Positives = 211/345 (61%), Gaps = 30/345 (8%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWM---AQHGRTYKDELEKAMRLTIFKQ 68
P+ V + ++ S PS E+W A HG+TYK++ E+ R+ IF
Sbjct: 3 PLLVAVAII---------ALSYAHPSFDIYPEEWHVFKAMHGKTYKNQFEEMFRMKIFMD 53
Query: 69 NLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKY 125
N + IE N ++G +YK+ N F DL EF+A G+ +S + R +
Sbjct: 54 NKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHEFKALMNGF-----KMSPDTKRNGELYF 108
Query: 126 QNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
+ +++P ++DWR+KGAVT +K+QG CGSCW+FSA ++EG + GKL+ LSEQ LVD
Sbjct: 109 PSNSNLPKTVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVD 168
Query: 186 CSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
CST NNGC GGLMD+AF+Y+ +NKG+ TEA YPY+ + TC +K K G + D
Sbjct: 169 CSTSYGNNGCEGGLMDQAFQYVSDNKGIDTEASYPYEARENTCRFKKNKVGGTDKG-HVD 227
Query: 244 LPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECGD-NCDHGVAVVGFGTA 300
+P GDE AL A+ T P+SV ++A+ +F+FY +GV N C + DHGV VG+GT
Sbjct: 228 IPAGDEKALQNALATVGPISVAIDANHGSFQFYSKGVYNEPNCSSYDLDHGVLAVGYGT- 286
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
E+G YWL+KNSWG +WGE+GYI+I R+ CGIA+ ASYP+
Sbjct: 287 --ENGQDYWLVKNSWGPSWGENGYIKIARNHSNHCGIASMASYPL 329
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 206/342 (60%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITC-ASQVVSGRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIF 66
M I++LV T A Q ++ + + + ++ E+WMA+ G+TYK EK R IF
Sbjct: 1 MTSIVLLVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIF 60
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
+ N+ +I + +G N+F+DLTN+EF A+YTG P P +++ RP +
Sbjct: 61 RDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHP---KEAPRPVDPIW- 116
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
P IDWR +GAVT +K+QG CGSCWAF+AVAA+EG+T+I G+L LSEQ+LVDC
Sbjct: 117 ----TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDC 172
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK-AAAATIGKYEDLP 245
T++NGC GG D+AFE + G+ E+DY Y+ QG C AA+IG Y +P
Sbjct: 173 DTNSNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVP 232
Query: 246 KGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDG 305
DE L AV +QPV+V ++ASG AF+FYK GV CG + +H V +VG+ + G
Sbjct: 233 PNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY-CQDGASG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYP 343
KYWL KNSWG+TWG+ GYI + +D G CG+A YP
Sbjct: 292 KKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCGLAVSPFYP 333
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 139/296 (46%), Positives = 186/296 (62%), Gaps = 14/296 (4%)
Query: 58 EKAMRLTIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR 115
E R +F NL++++ N + ++LG N F+DLTN EFRA+Y G R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG----TTPAGR 139
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKNQGHCGSCWAFSAVAAVEGITQITGGK 174
+++ V +P S+DWR+KGAV +KNQG CGSCWAFSAVAAVEGI +I G+
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 175 LIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK 232
L+ LSEQ+LV+C+ + N+GC+GG+MD AF +I N GL TE DYPY G C+ K
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 233 AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGV 292
+I +ED+P+ DE +L +AV QPVSV ++A G+ F+ Y GV CG N DHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 293 AVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
VG+GT + GA YW ++NSWG WGE+GYIR+ R+ G CGIA ASYP+
Sbjct: 320 VAVGYGT-DAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 139/296 (46%), Positives = 186/296 (62%), Gaps = 14/296 (4%)
Query: 58 EKAMRLTIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR 115
E R +F NL++++ N + ++LG N F+DLTN EFRA+Y G R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG----TTPAGR 139
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKNQGHCGSCWAFSAVAAVEGITQITGGK 174
+++ V +P S+DWR+KGAV +KNQG CGSCWAFSAVAAVEGI +I G+
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 175 LIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK 232
L+ LSEQ+LV+C+ + N+GC+GG+MD AF +I N GL TE DYPY G C+ K
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 233 AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGV 292
+I +ED+P+ DE +L +AV QPVSV ++A G+ F+ Y GV CG N DHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 293 AVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
VG+GT + GA YW ++NSWG WGE+GYIR+ R+ G CGIA ASYP+
Sbjct: 320 VAVGYGT-DAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 142/261 (54%), Positives = 169/261 (64%), Gaps = 14/261 (5%)
Query: 94 LTNEEFRASYTGYNRPVPSVSR-----QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKN 148
+T +EFR Y G + R S+ S+F Y + DVP S+DWR+KGAVT +K+
Sbjct: 1 MTADEFRRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 60
Query: 149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIE 207
QG CGSCWAFS +AAVEGI I L LSEQQLVDC T N GC+GGLMD AF+YI +
Sbjct: 61 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAK 120
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
+ G+A E YPY+ Q +C +K A TI YED+P DE AL +AV QPVSV +EA
Sbjct: 121 HGGVAAEDAYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEA 178
Query: 268 SGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
SG F+FY GV + CG DHGVA VG+G DG KYWL+KNSWG WGE GYIR+
Sbjct: 179 SGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVT--ADGTKYWLVKNSWGPEWGEKGYIRM 236
Query: 328 LRD----EGLCGIATEASYPV 344
RD EG CGIA EASYPV
Sbjct: 237 ARDVAAKEGHCGIAMEASYPV 257
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 153/343 (44%), Positives = 213/343 (62%), Gaps = 19/343 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+I+++ A+ VS + + E+ + QH + Y E E+ +RL I+ QN I
Sbjct: 3 ILILLMAFVAAANAVSLYEL----VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSR---PSTFKYQN 127
K N+ G Y+L N+++DL +EEF + G+NR S + R P TF
Sbjct: 59 AKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPA 118
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+VPT++DWR+KGAVT +K+QGHCGSCW+FSA A+EG GKL+ LSEQ LVDCS
Sbjct: 119 NVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCS 178
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
NNGC+GG+MD AF+YI +N G+ TE YPY+ TC KA AT Y D+P
Sbjct: 179 GKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPYEAIDDTC-HFNPKAVGATDKGYVDIP 237
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAEC-GDNCDHGVAVVGFGTAEE 302
+GDE AL +A+ T PVS+ ++AS ++F+FY GV +C +N DHGV VG+GT+EE
Sbjct: 238 QGDEEALKKALATVGPVSIAIDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEE 297
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
G YWL+KNSWG TWG+ GY+++ R+ + CG+AT ASYP+
Sbjct: 298 --GEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGVATCASYPL 338
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 142/316 (44%), Positives = 199/316 (62%), Gaps = 18/316 (5%)
Query: 42 HEQWMAQH---GRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
++ W+A+H G ++ + E R +F NL++++ N + ++LG N F+DLT
Sbjct: 66 YDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGMNRFADLT 125
Query: 96 NEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV-THIKNQGHCGS 154
N+EFRA+Y G R +++ V +P S+DWR+KGAV + +KNQG CGS
Sbjct: 126 NDEFRAAYLG----TTPAGRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGS 181
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAAVEGI +I G+L+ LSEQ+LV+C+ + N+GC+GG+MD AF +I N GL
Sbjct: 182 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFITRNGGLD 241
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYPY G CD K+ +I +ED+P+ DE +L +AV QPVSV ++A G+ F
Sbjct: 242 TEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 301
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+ Y GV CG + DHGV VG+GT + G YW ++NSWG WGE+GYIR+ R+
Sbjct: 302 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 360
Query: 331 --EGLCGIATEASYPV 344
G CGIA ASYP+
Sbjct: 361 ARTGKCGIAMMASYPI 376
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 149/347 (42%), Positives = 203/347 (58%), Gaps = 19/347 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHE----------PSIVEKHEQWMAQHGRTYKDELEKAMR 62
M + +L++ C+ V+ E S E + W+ R Y E R
Sbjct: 1 MRLSCVLLVACSCLAVAAGFPFENHRLFIQQAVESPREAFDFWVQTLKRAYASAEEYERR 60
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
++ NL ++ + N G+ ++ L ++DL+ +E+R+ GYN + + R +
Sbjct: 61 FDVWLDNLRFVHEYNA-GHTSHWLSMGVYADLSQDEYRSKALGYNADLHE--ERPLRAAP 117
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
F Y+ T P +DW KGAVT +KNQ CGSCWAFS AVEG + I GKL LSEQ
Sbjct: 118 FLYEG-TVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLASLSEQM 176
Query: 183 LVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
LVDC + +NGC GGLMD AFE+I++N G+ TE DYPY E+G C K + TI Y
Sbjct: 177 LVDCDRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTAEEGMCQDNKMRRHVVTIDDY 236
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
+D+P DEHAL++AV QPVSV +EA +AF+ Y GV +AECG DHGV VVG+GTA
Sbjct: 237 QDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVGYGTAS 296
Query: 302 E-EDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
YWL+KNSWG WG+ GYIR+LR+ EG CG+A +AS+P+
Sbjct: 297 NGTHHLPYWLVKNSWGAEWGDKGYIRLLRNLGEEGQCGVAMQASFPI 343
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 158/355 (44%), Positives = 216/355 (60%), Gaps = 33/355 (9%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEK-HEQWMA---QHGRTYKDELEKAMRLTIF 66
+ +F++++ + A+ V SI E+W A QH + Y E E+ +R+ I+
Sbjct: 1 MKLFLLLVSFLAAANAV---------SIFNLVKEEWNAFKLQHRKKYDSESEERIRMKIY 51
Query: 67 KQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSR---- 119
QN I K N+ G ++L N+++DL +EEF + G+NR + S+ R
Sbjct: 52 VQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSAAAGSKLLGREQLM 111
Query: 120 ----PSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKL 175
P T+ DVPT+IDWREKGAVT +K+QGHCGSCW+FSA A+EG GKL
Sbjct: 112 TIEEPITWIEPANVDVPTTIDWREKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKL 171
Query: 176 IELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKA 233
+ LSEQ LVDCST NNGC+GGLMD AF+Y+ +NKG+ TE YPY+ C KA
Sbjct: 172 VSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYVKDNKGIDTEKAYPYEAIDDEC-HYNPKA 230
Query: 234 AAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDH 290
AT + D+P+GDE AL +A+ T PVSV ++AS ++F+FY GV +C + DH
Sbjct: 231 IGATDKGFVDIPQGDEKALKKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDH 290
Query: 291 GVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
GV VG+GT EDG YWL+KNSWG TWG+ GY+++ R+ E CGIAT ASYP+
Sbjct: 291 GVLAVGYGTT--EDGEDYWLVKNSWGTTWGDQGYVKMARNRENHCGIATTASYPL 343
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 198/316 (62%), Gaps = 18/316 (5%)
Query: 42 HEQWMAQH---GRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
++ W+A+H G ++ + E R +F NL++++ N + ++LG N F+DLT
Sbjct: 65 YDLWVARHRHGGGSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLT 124
Query: 96 NEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKNQGHCGS 154
N+EFRA+Y G R +++ V +P S+DWR+KGAV +KNQG CGS
Sbjct: 125 NDEFRAAYLG----TTPAGRGRHVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGS 180
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAAVEGI +I G+L+ LSEQ+LV+C+ + N+GC+GG+MD AF +I N GL
Sbjct: 181 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLD 240
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYPY G C+ K+ +I +ED+P+ DE +L +AV QPVSV ++A G+ F
Sbjct: 241 TEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 300
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+ Y GV CG + DHGV VG+GT + G YW ++NSWG WGE+GYIR+ R+
Sbjct: 301 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 359
Query: 331 --EGLCGIATEASYPV 344
G CGIA ASYP+
Sbjct: 360 ARTGKCGIAMMASYPI 375
>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 149/318 (46%), Positives = 190/318 (59%), Gaps = 18/318 (5%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
+HE+WMA++GR Y D EK R +F N +I+ N+ GNRTY LG N FSDLTNEEF
Sbjct: 39 HRHERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEF 98
Query: 100 RASYTGY-NRPVPSVSR-QSSRPSTFKYQNVTDV-----PTSIDWREKGAVTHIKNQGHC 152
++ GY ++P P R + S P+ NVTD P S+DWR +GAVT +K+QGHC
Sbjct: 99 AQTHLGYRHQPGPGGLRPEDSSPAAAV--NVTDAQLQSTPDSVDWRARGAVTPVKHQGHC 156
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLA 212
GSCWAF+AVAA EG+ QI G LI +SEQQ++DC+ + C G ++ A YI + GL
Sbjct: 157 GSCWAFAAVAATEGLVQIATGNLISMSEQQVLDCTGGTSSCKSGYVNAALTYITASGGLQ 216
Query: 213 TEADYPYQQEQGTCDKQKEKA-AAATIGKYED-LPKGDEHALLQAVTKQPVSVCVEASGQ 270
TEA Y Y EQG C +AA +G + + GDE AL V QPV+V VEA
Sbjct: 217 TEAAYAYSAEQGACRSGGASPNSAAAVGVHRSAMLNGDEGALQVLVAGQPVAVAVEAEPD 276
Query: 271 AFRFYKRGVL--NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
F YK GV + CG H V VVG+G + DG YW++KN WG WGE GY+R+
Sbjct: 277 -FHHYKSGVYVGSPSCGQKLHHAVTVVGYGA--DGDGQGYWVVKNQWGAGWGEVGYMRLT 333
Query: 329 RDEG--LCGIATEASYPV 344
R G CG+AT A YP
Sbjct: 334 RGNGGNNCGMATHAYYPT 351
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 145/341 (42%), Positives = 205/341 (60%), Gaps = 15/341 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+F++ + ++ L AS + S + ++ E+WMA+ G+TYK EK R IF+
Sbjct: 4 AFLLVVCTLMALQAMAASAYYNNGS-DDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFR 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+ +I + +G N+F+DLTN+EF A+YTG P P +++ RP +
Sbjct: 63 DNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHP---KEAPRPVDPIW-- 117
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
P IDWR +GAVT +K+QG CGSCWAF+AVAA+EG+T+I G+L LSEQ+LVDC
Sbjct: 118 ---TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCD 174
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK-AAAATIGKYEDLPK 246
T++NGC GG D+AFE + G+ E+DY Y+ QG C AA+IG Y +P
Sbjct: 175 TNSNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPP 234
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
DE L AV +QPV+V ++ASG AF+FYK GV CG + +H V +VG+ + G
Sbjct: 235 NDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY-CQDGASGK 293
Query: 307 KYWLIKNSWGETWGESGYI----RILRDEGLCGIATEASYP 343
KYW+ KNSWG+TWG+ GYI +L+ G CG+A YP
Sbjct: 294 KYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYP 334
>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 354
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 194/321 (60%), Gaps = 15/321 (4%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
++ +HE+WMA+ GR YKD EKA R +F N +++ N+ GNRTY LG N FSDLT+
Sbjct: 33 TVASRHERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTD 92
Query: 97 EEFRASYTGY--NRPVPS----VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQG 150
EF + GY ++P P Q +T DVP S+DWR +GAVT IKNQ
Sbjct: 93 HEFLQQHLGYRHHQPGPGGLLRPEDQDMSKATALADYGQDVPDSVDWRAQGAVTEIKNQR 152
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKG 210
CGSCWAF+AVAA EG+ +I G LI +SEQQ++DC+ N C GG ++ A Y+ + G
Sbjct: 153 SCGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGGGNTCDGGDINAALRYVAASGG 212
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIG--KYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
L EA Y Y ++G C +AA++G ++ L GDE AL QPV+V +EAS
Sbjct: 213 LQPEAAYAYAAQKGACRGASPANSAASVGGARFARL-GGDEGALRGLAAGQPVAVALEAS 271
Query: 269 GQAFRFYKRGVL--NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
FR YK GV +A CG +HGV VVG+G AE++ G +YW++KN WG WGE GY+R
Sbjct: 272 EPDFRHYKSGVYAGSASCGRRLNHGVTVVGYG-AEDDSGDEYWVVKNQWGTLWGEKGYMR 330
Query: 327 ILRDE---GLCGIATEASYPV 344
+ R + CGIA+ A YP
Sbjct: 331 VARGDVAGANCGIASYAYYPT 351
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 276 bits (707), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 192/311 (61%), Gaps = 14/311 (4%)
Query: 39 VEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
++ E+WMA+ G+TYK EK R IF+ N+ +I + +G N+F+DLTN+E
Sbjct: 17 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 76
Query: 99 FRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAF 158
F A+YTG P P +++ RP + P IDWR +GAVT +K+QG CGSCWAF
Sbjct: 77 FVATYTGAKPPHP---KEAPRPVDPIW-----TPCCIDWRFRGAVTGVKDQGACGSCWAF 128
Query: 159 SAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYP 218
+AVAA+EG+T+I G+L LSEQ+LVDC T++NGC GG D+AFE + G+ E+DY
Sbjct: 129 AAVAAIEGLTKIRTGQLTPLSEQELVDCDTNSNGCGGGHTDRAFELVASKGGITAESDYR 188
Query: 219 YQQEQGTCDKQKEK-AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKR 277
Y+ QG C AA+IG Y +P DE L AV +QPV+V ++ASG AF+FYK
Sbjct: 189 YEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKS 248
Query: 278 GVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGL 333
GV CG + +H V +VG+ + G KYWL KNSWG+TWG+ GYI + +D G
Sbjct: 249 GVFPGPCGASSNHAVTLVGY-CQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPHGT 307
Query: 334 CGIATEASYPV 344
CG+A YP
Sbjct: 308 CGLAVSPFYPT 318
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 199/315 (63%), Gaps = 18/315 (5%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGN-----RTYKLGTNEFSDL 94
E E+W +H +TY E EK RL +F+ N ++ + N+ N +Y L N F+DL
Sbjct: 31 ELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADL 90
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
T+ EF+ + G +P + RP + +++ +P+ IDWR+ GAVT +K+Q CG+
Sbjct: 91 THHEFKTTRLG----LPLTLLRFKRPQNQQSRDLLHIPSQIDWRQSGAVTPVKDQASCGA 146
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLAT 213
CWAFSA A+EGI +I G L+ LSEQ+L+DC T N+GC GGLMD A++++I+NKG+ T
Sbjct: 147 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGIDT 206
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
E DYPYQ Q +C K K K A TI Y D+P +E +L+AV QPVSV + S + F+
Sbjct: 207 EDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGSEREFQ 265
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--- 330
Y +G+ C DH V +VG+G+ E+G YW++KNSWG+ WG +GYI ++R+
Sbjct: 266 LYSKGIFTGPCSTFLDHAVLIVGYGS---ENGVDYWIVKNSWGKYWGMNGYIHMIRNSGN 322
Query: 331 -EGLCGIATEASYPV 344
+G+CGI T ASYPV
Sbjct: 323 SKGICGINTLASYPV 337
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 198/316 (62%), Gaps = 18/316 (5%)
Query: 42 HEQWMAQH---GRTYKDEL-EKAMRLTIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
++ W+A+H G ++ + E R +F NL++++ N + ++LG N F+DLT
Sbjct: 65 YDLWVARHRHGGDSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLT 124
Query: 96 NEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKNQGHCGS 154
N+EFRA+Y G R +++ V +P S+DWR+KGAV +KNQG CGS
Sbjct: 125 NDEFRAAYLG----TTPAGRGRHVGEAYRHDGVEVLPDSVDWRDKGAVVAPVKNQGQCGS 180
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAAVEGI +I G+L+ LSEQ+LV+C+ + N+GC+GG+MD AF +I N GL
Sbjct: 181 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLD 240
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYPY G C+ K+ +I +ED+P+ DE +L +AV QPVSV ++A G+ F
Sbjct: 241 TEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 300
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+ Y GV CG + DHGV VG+GT + G YW ++NSWG WGE+GYIR+ R+
Sbjct: 301 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 359
Query: 331 --EGLCGIATEASYPV 344
G CGIA ASYP+
Sbjct: 360 ARTGKCGIAMMASYPI 375
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 148/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+DE E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 95 TNEEFRASYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+ EFR G+N + R +S + TF +P S+DWR KGAVT +K+QGH
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST NNGC+GGLMD AF YI +N
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
G+ TE YPY+ +C K A G + D+P+GDE + +AV T PVSV ++AS
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNKGTVGATDRG-FTDIPQGDEKKMAEAVATVGPVSVAIDAS 293
Query: 269 GQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
++F+FY GV N +C N DHGV VVGFGT +E G YWL+KNSWG TWG+ G+I+
Sbjct: 294 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 351
Query: 327 ILRD-EGLCGIATEASYPV 344
+LR+ E CGIA+ +SYP+
Sbjct: 352 MLRNKENQCGIASASSYPL 370
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 144/329 (43%), Positives = 197/329 (59%), Gaps = 23/329 (6%)
Query: 17 IILVITCASQ---------VVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMRL 63
+I V+TC S + G S + + +E E WM +H + YK EK R
Sbjct: 10 LIFVVTCLSLHLGLSSADFSIVGYSQDDLTSIESSIRLFESWMLKHDKVYKTIDEKIYRF 69
Query: 64 TIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF 123
FK NL YI++ NK+ N +Y LG NEF+DLT++EF+ Y G + P S+ + S F
Sbjct: 70 ETFKDNLMYIDETNKK-NNSYWLGLNEFADLTHDEFKEKYVG-SIPEDSMIIEQSDDVEF 127
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
++V D P SIDWR+KGAVT +KNQ CGSCWAFS VA VEGI +I G LI LSEQ+L
Sbjct: 128 PNKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWAFSTVATVEGINKIVTGNLISLSEQEL 187
Query: 184 VDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
+DC ++GC GG + +Y+++N G+ TE +YPY+++QG C + +K I Y+
Sbjct: 188 LDCDRRSHGCKGGYQTTSLKYVVDN-GVHTEKEYPYEKKQGNCRAKNKKGLKVYINGYKR 246
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P DE +L++ ++ QPVSV VE+ G+ F+FYK GV CG DH V VG+
Sbjct: 247 VPSNDEISLIKTISIQPVSVLVESKGRPFQFYKGGVFGGPCGTKLDHAVTAVGY------ 300
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG 332
G Y LIKNSWG WG+ GYI+I R G
Sbjct: 301 -GKDYILIKNSWGPKWGDKGYIKIKRASG 328
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 148/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+DE E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 59 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 118
Query: 95 TNEEFRASYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+ EFR G+N + R +S + TF +P S+DWR KGAVT +K+QGH
Sbjct: 119 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 178
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST NNGC+GGLMD AF YI +N
Sbjct: 179 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 238
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
G+ TE YPY+ +C K A G + D+P+GDE + +AV T PVSV ++AS
Sbjct: 239 GIDTEKSYPYEAIDDSCHFNKGTVGATDRG-FTDIPQGDEKKMAEAVATVGPVSVAIDAS 297
Query: 269 GQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
++F+FY GV N +C N DHGV VVGFGT +E G YWL+KNSWG TWG+ G+I+
Sbjct: 298 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 355
Query: 327 ILRD-EGLCGIATEASYPV 344
+LR+ E CGIA+ +SYP+
Sbjct: 356 MLRNKENQCGIASASSYPL 374
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 140/271 (51%), Positives = 179/271 (66%), Gaps = 32/271 (11%)
Query: 81 NRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
+++YKL NEF+DLTNEEF S + + S + ++FKY+NVT VP++ DWR+K
Sbjct: 2 DKSYKLSINEFADLTNEEFGTSRNRFKAHICS-----TEATSFKYENVTAVPSTXDWRKK 56
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLM 198
GAVT IK+QG CGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ GC G
Sbjct: 57 GAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXG--- 113
Query: 199 DKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK 258
A+YPY GTC+++K AA I YED+P +E AL +AV
Sbjct: 114 ----------------ANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAH 157
Query: 259 QPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
QP++V ++A G F+FY GV +CG DHGV VG+GT+ +DG KYWL+KNSWG
Sbjct: 158 QPIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTS--DDGMKYWLVKNSWGTG 215
Query: 319 WGESGYIRILRD----EGLCGIATEASYPVA 345
WGE GYIR+ RD EGLCGIA +ASYP A
Sbjct: 216 WGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 246
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 140/309 (45%), Positives = 192/309 (62%), Gaps = 14/309 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+ E +E W+A+H + Y +E R IFK NL++I++ N E N TYK+G ++DLTNE
Sbjct: 41 VKEIYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNSE-NHTYKMGLTPYTDLTNE 99
Query: 98 EFRASYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
EF+A Y G + + + R + + Y+ ++P IDWR+KGAVT +KNQG CGSCW
Sbjct: 100 EFQAIYLGTRSDTIHRLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCW 159
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEAD 216
AFS V+ VE I QI G LI LSEQQLVDC+ N+GC GG A++YII+N G+ TEA+
Sbjct: 160 AFSTVSTVESINQIRTGNLISLSEQQLVDCNKKNHGCKGGAFVYAYQYIIDNGGIDTEAN 219
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY+ QG C K+ I Y+ +P +E+AL +AV QP V ++AS + F+ YK
Sbjct: 220 YPYKAVQGPCRAAKK---VVRIDGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYK 276
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR--DEGLC 334
G+ + CG +HGV +VG+ YW+++NSWG WGE GYIR+ R GLC
Sbjct: 277 SGIFSGPCGTKLNHGVVIVGY-------WKDYWIVRNSWGRYWGEQGYIRMKRVGGCGLC 329
Query: 335 GIATEASYP 343
GIA YP
Sbjct: 330 GIARLPYYP 338
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 148/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+DE E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 95 TNEEFRASYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+ EFR G+N + R +S + TF +P S+DWR KGAVT +K+QGH
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST NNGC+GGLMD AF YI +N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
G+ TE YPY+ +C K A G + D+P+GDE + +AV T PVSV ++AS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTVGATDRG-FTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 269 GQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
++F+FY GV N +C N DHGV VVGFGT +E G YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 321
Query: 327 ILRD-EGLCGIATEASYPV 344
+LR+ E CGIA+ +SYP+
Sbjct: 322 MLRNKENQCGIASASSYPL 340
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 146/344 (42%), Positives = 204/344 (59%), Gaps = 21/344 (6%)
Query: 12 PMFVIIILVITC--ASQVVSGRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKAMRLT 64
PM ++LV+ A Q + + + + ++ E+WMA+ G+TYK EK R
Sbjct: 6 PMASAVLLVVCTLMALQAMGADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFG 65
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFK 124
IF+ N+ +I + +G N+F+DLTN+EF A+YTG P P +++ RP
Sbjct: 66 IFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHP---KEAPRPVDPI 122
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
+ P IDWR +GAVT +K+QG CGSCWAF+AVAA+EG+T+I G+L LSEQ+LV
Sbjct: 123 W-----TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELV 177
Query: 185 DCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK-AAAATIGKYED 243
DC T++NGC GG D+AFE + G+ E+DY Y+ QG C AA IG Y
Sbjct: 178 DCDTNSNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAARIGGYRA 237
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P DE L AV +QPV+V ++ASG AF+FYK GV CG + +H V +VG+ +
Sbjct: 238 VPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY-CQDGA 296
Query: 304 DGAKYWLIKNSWGETWGESGYI----RILRDEGLCGIATEASYP 343
G KYW+ KNSWG+TWG+ GYI +L+ G CG+A YP
Sbjct: 297 SGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYP 340
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 205/338 (60%), Gaps = 15/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
++ ++LV C VVS SM E +W +HG+ Y + E+A R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFTDFDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
K N + G+ TY LG N+F+DL NEEF A TG+ V S+ + + NV +
Sbjct: 60 IKHNLKYDLGHFTYDLGINQFTDLQNEEFVAMMTGFR--VSGTSKAAKGSTFLPPNNVGE 117
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
+P ++DWR KG VT +K+QG CGSCWAFS +VEG GKL+ LSEQ LVDCS +
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDCSGRD 177
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GG MD+AF+YII+ G+ TEA YPY+ G C +K A G Y D+ G E
Sbjct: 178 AGCDGGFMDRAFQYIIDAGGIDTEASYPYKAVDGKCHFKKANVGATVTG-YTDVTSGSEK 236
Query: 251 ALLQAVTK-QPVSVCVEASGQAFRFYKRGVLNAECGDN--CDHGVAVVGFGTAEEEDGAK 307
AL +AV P+SV ++AS +F+ YK GV N D+ DHGV VG+GT+ DG
Sbjct: 237 ALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTS--SDGTD 294
Query: 308 YWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YW++KNSW ETWG +GY+ + R+ + CGIAT ASYP+
Sbjct: 295 YWIVKNSWAETWGMNGYVWMSRNKDNQCGIATNASYPL 332
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 151/348 (43%), Positives = 217/348 (62%), Gaps = 24/348 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ +F+++I+ I +Q +S + + ++ + +H + YK+++E+ R+ IF N
Sbjct: 1 MKLFLLLIVAILATAQAISFFEL----VNQEWTTFKMEHNKVYKNDIEERFRMKIFMDNK 56
Query: 71 EYIEKANKEGNR-----TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP---ST 122
I K N GN +YKL N++ D+ + EF + G+N+ + + R P S
Sbjct: 57 HKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIGASF 114
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
+ NV +P ++DWRE GAVT +K+QGHCGSCW+FSA A+EG G LI LSEQ
Sbjct: 115 IEPANVV-LPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQN 173
Query: 183 LVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
L+DCS NNGC+GGLMD+AF+YI +NKGL TE YPY+ E C + A +G
Sbjct: 174 LIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVG- 232
Query: 241 YEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGF 297
Y D+P+G+E L AV T PVSV ++AS Q+F+FY GV EC +N DHGV VG+
Sbjct: 233 YVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGY 292
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
GT +E+G YWL+KNSWGETWG++GYI++ R++ CGIA+ ASYP+
Sbjct: 293 GT--DENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPL 338
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 192/311 (61%), Gaps = 14/311 (4%)
Query: 39 VEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
++ E+WMA+ G+TYK EK R IF+ N+ +I + +G N+F+DLTN+E
Sbjct: 17 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 76
Query: 99 FRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAF 158
F A+YTG P P +++ RP + P IDWR +GAVT +K+QG CGSCWAF
Sbjct: 77 FVATYTGAKPPHP---KEAPRPVDPIW-----TPCCIDWRFRGAVTGVKDQGACGSCWAF 128
Query: 159 SAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYP 218
+AVAA+EG+T+I G+L LSEQ+LVDC T++NGC GG D+AFE + G+ E+DY
Sbjct: 129 AAVAAIEGLTKIRTGQLTPLSEQELVDCDTNSNGCGGGHTDRAFELVASKGGITAESDYR 188
Query: 219 YQQEQGTCDKQKEK-AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKR 277
Y+ QG C AA+IG Y +P DE L AV +QPV+V ++ASG AF+FYK
Sbjct: 189 YEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKS 248
Query: 278 GVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI----RILRDEGL 333
GV CG + +H V +VG+ + G KYW+ KNSWG+TWG+ GYI +L+ G
Sbjct: 249 GVFPGPCGASSNHAVTLVGY-CQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGT 307
Query: 334 CGIATEASYPV 344
CG+A YP
Sbjct: 308 CGLAVSPFYPT 318
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 152/345 (44%), Positives = 212/345 (61%), Gaps = 23/345 (6%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWM---AQHGRTYKDELEKAMRLTIFKQNLE 71
+++++VITCA+ V S E +++W+ +H + YK E E+ +R+ I+ +N
Sbjct: 4 ILLLIVITCAA--VQAISFFELV----NQEWINFKMEHKKCYKHEAEERLRMKIYMKNKL 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP--STFKYQ 126
I + N + TY+L N++ D+ N EF+ GYNR + R P + F
Sbjct: 58 QIAQHNCDYELKKVTYRLKINKYGDMLNHEFKNMLNGYNRTINHTLRNERLPVGAAFIEP 117
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
++P +DWR+ GAVT +K+QGHCGSCWAFSA ++EG G L+ LSEQ L+DC
Sbjct: 118 CNVELPKMVDWRKCGAVTEVKDQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDC 177
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S NNGC+GGLMD+AF YI +NKGL TE YPY+ E C K + A+ +G + D+
Sbjct: 178 SGSYGNNGCNGGLMDQAFSYIKDNKGLDTEKTYPYEGEDDKCRYDKRSSGASDVG-FVDI 236
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFGTAE 301
P GDE L AV T PVSV ++AS Q+F+FY G+ EC N DHGV VVG+GT E
Sbjct: 237 PVGDEQKLKAAVATVGPVSVAIDASHQSFQFYSDGIYFEPECSSTNLDHGVLVVGYGTDE 296
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
E G YW++KNSWGE+WGE GYI++ R+ + CGIA+ ASYP+
Sbjct: 297 E--GRDYWIVKNSWGESWGEKGYIKMARNIDNHCGIASSASYPIV 339
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 151/318 (47%), Positives = 203/318 (63%), Gaps = 18/318 (5%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTN 96
E+W A QH + Y E E+ +RL I+ QN I K N+ G Y+L N+++DL +
Sbjct: 25 EEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLH 84
Query: 97 EEFRASYTGYNRPVPSVSRQSSR---PSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
EEF + G+NR S + R P TF +VPT++DWR+KGAVT +K+QGHCG
Sbjct: 85 EEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCG 144
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGL 211
SCW+FSA A+EG GKL+ LSEQ LVDCS NNGC+GG+MD AF+YI +N G+
Sbjct: 145 SCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGI 204
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQ 270
TE YPY+ TC KA AT Y D+P+GDE AL +A+ T PVS+ ++AS +
Sbjct: 205 DTEKSYPYEAIDDTC-HFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHE 263
Query: 271 AFRFYKRGVL-NAEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
+F+FY GV +C +N DHGV VG+GT+EE G YWL+KNSWG TWG+ GY+++
Sbjct: 264 SFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEE--GEDYWLVKNSWGTTWGDQGYVKMA 321
Query: 329 RD-EGLCGIATEASYPVA 345
R+ + CG+AT ASYP+
Sbjct: 322 RNHDNHCGVATCASYPLV 339
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 210/340 (61%), Gaps = 21/340 (6%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN 77
+L + +Q VS + I E+ + +H +TY+DE E+ RL IF +N I K N
Sbjct: 7 LLALVAVAQAVSFADV----IKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHN 62
Query: 78 KE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS----TFKYQNVTD 130
+ G T+K+ N+++D+ + EFR + G+N + R +S PS TF
Sbjct: 63 QRYATGEVTFKMAVNKYADMLHHEFRETMNGFNYTLHKELR-ASDPSFTGITFISPAHVK 121
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P S+DWREKGAVT +K+QGHCGSCWAFS+ A+EG G L+ LSEQ LVDCS
Sbjct: 122 LPKSVDWREKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKY 181
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC+GGLMD AF YI +N G+ TE YPY+ +C K+ A G + D+P+G+
Sbjct: 182 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKDSVGATDRG-FADIPQGN 240
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECGD-NCDHGVAVVGFGTAEEEDG 305
E + +AV T PVSV ++AS ++F+FY G+ N EC N DHGV VVG+GT +E G
Sbjct: 241 EKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGT--DESG 298
Query: 306 AKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
YWL+KNSWG TWG+ G+I++ R+E CGIA+ +SYP+
Sbjct: 299 KDYWLVKNSWGTTWGDKGFIKMARNEDNQCGIASASSYPL 338
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 150/318 (47%), Positives = 205/318 (64%), Gaps = 19/318 (5%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTN 96
E+W A QH + Y E E+ +RL I+ QN I K N+ +G ++L N+++DL +
Sbjct: 25 EEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLH 84
Query: 97 EEFRASYTGYNR---PVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
EEF + G+NR P + + P T+ +VP ++DWREKGAVT +K+QGHC
Sbjct: 85 EEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHC 144
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKG 210
GSCW+FSA A+EG GKL+ LSEQ LVDCST NNGC+GG+MD AF+YI +N G
Sbjct: 145 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDNGG 204
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASG 269
+ TE YPY+ TC KA AT + D+P+GDE AL++A+ T PVSV ++AS
Sbjct: 205 IDTEKAYPYEAIDDTC-HYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDASH 263
Query: 270 QAFRFYKRGVL-NAEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
++F+FY GV +C +N DHGV VG+GT+EE G YWL+KNSWG TWG+ GY+++
Sbjct: 264 ESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEE--GEDYWLVKNSWGTTWGDQGYVKM 321
Query: 328 LRD-EGLCGIATEASYPV 344
R+ + CGIAT ASYP+
Sbjct: 322 ARNRDNHCGIATAASYPL 339
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 148/333 (44%), Positives = 197/333 (59%), Gaps = 27/333 (8%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR---TYKLGTNEFSDL 94
++E+ ++W A + ++Y E R ++ +N+ YIE N E TY+LG ++DL
Sbjct: 48 MIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTYELGETAYTDL 107
Query: 95 TNEEFRASYTGYNRP--VPSVSRQ--------SSRPSTFK-------YQNV-TDVPTSID 136
TN+EF A YT P +P+ + ++R Y N+ T P S+D
Sbjct: 108 TNQEFMAMYTAAPSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVYVNLSTAAPASVD 167
Query: 137 WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGG 196
WR GAVT +KNQG CGSCWAFS VA VEGI QI GKL+ LSEQ+LVDC T + GC GG
Sbjct: 168 WRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDAGCDGG 227
Query: 197 LMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV 256
+ +A +I N GL TE DYPY C++ K AA+I + E +L AV
Sbjct: 228 ISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLRRVATRSEASLANAV 287
Query: 257 TKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWG 316
QPV+V +EA G F+ YKRGV N CG + +HGV VVG+G EEEDG KYW+IKNSWG
Sbjct: 288 AGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQ-EEEDGDKYWIIKNSWG 346
Query: 317 ETWGESGYIRILRD-----EGLCGIATEASYPV 344
+WG+ GYI++ +D EGLCGIA S+P+
Sbjct: 347 ASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 142/345 (41%), Positives = 203/345 (58%), Gaps = 28/345 (8%)
Query: 22 TCASQVVSGRSMHEPSIVEKHEQWMAQHGR--------------TYKDELEKAMRLTIFK 67
T ++V + + + +E W ++HGR ++E ++ +RL +F+
Sbjct: 34 TTTTRVPAPAERADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFR 93
Query: 68 QNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFK 124
NL YI+ N E G T++LG F+DLT EE+R G+ + + +
Sbjct: 94 DNLRYIDAHNAEADAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVR 153
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
D+P +IDWR+ GAVT +K+Q CG CWAFSAVAA+EG+ I G L+ LSEQ+++
Sbjct: 154 G---GDLPDAIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEII 210
Query: 185 DCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK-AAAATIGKYED 243
DC ++GC GG M+ AF ++I N G+ TEADYP+ GTCD KEK ATI +
Sbjct: 211 DCDAQDSGCDGGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVE 270
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+ +E AL +AV QPVSV ++ASG+AF+ Y G+ N CG + DHGV VG+G+ E
Sbjct: 271 VASNNETALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGS---E 327
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
G YW++KNSW +WGE+GYIR+ R+ G CGIA +ASYPV
Sbjct: 328 SGKDYWIVKNSWSASWGEAGYIRMRRNVPRPTGKCGIAMDASYPV 372
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 147/318 (46%), Positives = 201/318 (63%), Gaps = 14/318 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
I E+ + QH + Y +E+E+ R+ IF +N I K N+ +G +YKLG N+++D+
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
+ EF+ + GYN + + R+ + +T+ VP S+DWRE GAVT +K+QGHC
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKG 210
GSCWAFS+ A+EG G L+ LSEQ LVDCST NNGC+GGLMD AF YI +N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASG 269
+ TE YPY+ +C K A G + D+P+GDE + +AV T PVSV ++AS
Sbjct: 204 IDTEKSYPYEGIDDSCHFNKATIGATDTG-FVDIPEGDEEKMKKAVATMGPVSVAIDASH 262
Query: 270 QAFRFYKRGVLN-AECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
++F+ Y GV N EC + N DHGV VVG+GT +E G YWL+KNSWG TWGE GYI++
Sbjct: 263 ESFQLYSEGVYNEPECDEQNLDHGVLVVGYGT--DESGMDYWLVKNSWGTTWGEQGYIKM 320
Query: 328 LRDE-GLCGIATEASYPV 344
R++ CGIAT +SYP
Sbjct: 321 ARNQNNQCGIATASSYPT 338
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 275 bits (703), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 131/219 (59%), Positives = 159/219 (72%), Gaps = 7/219 (3%)
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
VP S+DWR+KGAVT +K+QG CGSCWAFS + AVEGI QI KL+ LSEQ+LVDC TD
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61
Query: 191 N-GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
N GC+GGLMD AFE+I + G+ TEA+YPY+ GTCD KE A A +I +E++P+ DE
Sbjct: 62 NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDE 121
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
+ALL+AV QPVSV ++A G F+FY GV CG DHGVA+VG+GT DG KYW
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTT--IDGTKYW 179
Query: 310 LIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
+KNSWG WGE GYIR+ R EGLCGIA EASYP+
Sbjct: 180 TVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPI 218
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 137/327 (41%), Positives = 197/327 (60%), Gaps = 22/327 (6%)
Query: 36 PSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR------------- 82
P+I + + W A+HG+ Y E+A RL +F N ++ N
Sbjct: 30 PAIEAQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPP 89
Query: 83 TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
+Y L N F+DLT+EEFRA+ G P ++ R + P + VP ++DWR+ GA
Sbjct: 90 SYTLALNAFADLTHEEFRAARLGRIAPGAAL-RSRAAPVYWGLGGGAAVPDALDWRKSGA 148
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKA 201
VT +K+QG CG+CW+FSA A+EGI +I G L+ LSEQ+L+DC N+GC GGLMD A
Sbjct: 149 VTKVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 208
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
++++I+N G+ TE DYPY++ GTC+K K K TI Y D+P E LLQAV +QPV
Sbjct: 209 YKFVIKNGGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPV 268
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
SV + S +AF+ Y +G+ + C + DH V +VG+G+ E G YW++KNSWGE+WG
Sbjct: 269 SVGICGSARAFQLYYQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGESWGM 325
Query: 322 SGYIRILRD----EGLCGIATEASYPV 344
GY+ + R+ +G+CGI AS+P
Sbjct: 326 KGYMHMHRNTGDSKGVCGINMMASFPT 352
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 150/330 (45%), Positives = 201/330 (60%), Gaps = 27/330 (8%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE----GNRTYKLGTNE 90
+ ++ E++E+WMA+ GRTYKD EKA R +FK N +I+ N G KL TN+
Sbjct: 13 DKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTNK 72
Query: 91 FSDLTNEEFRASY-TGYN---RPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVT 144
F+DLT +EFR Y TG+ RP V+ + FK+ V+ DVP SIDWR +GAVT
Sbjct: 73 FADLTEDEFRNIYVTGHRVNYRPTSLVT-----DTVFKFGAVSLSDVPPSIDWRARGAVT 127
Query: 145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFE 203
+K+Q C CWAFS+ AAVEGI QIT G + LS QQLVDCS N C G +DKA+E
Sbjct: 128 SVKDQHLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYE 187
Query: 204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSV 263
YI + GL + DYPY+ GTC + K A A I ++ +P +E ALL AV QPVSV
Sbjct: 188 YIARSGGLVADQDYPYEGHSGTC-RVYGKQAVARISGFQYVPARNETALLLAVAHQPVSV 246
Query: 264 CVEASGQAFRFYKRGVLNA---ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
++ +A + G+ + C N +H + +VG+GT +E G +YWL+KNSWG WG
Sbjct: 247 ALDGLSRALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGT--DEHGTRYWLMKNSWGSDWG 304
Query: 321 ESGYIRILRD-----EGLCGIATEASYPVA 345
+ GY++ RD G+CG+A EASYPVA
Sbjct: 305 DKGYVKFARDVASEINGVCGLALEASYPVA 334
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 148/347 (42%), Positives = 214/347 (61%), Gaps = 22/347 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ +F+ +I+ + +Q +S + + ++ + +H + YK+++E+ R+ IF N
Sbjct: 1 MKLFLFLIVAVLATAQAISFFEL----VNQEWTTFKMEHNKVYKNDVEERFRMKIFMDNK 56
Query: 71 EYIEKANKEGNR-----TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKY 125
I K N GN +YKL N++ D+ + EF + G+N+ + + R P +
Sbjct: 57 HKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIAASF 114
Query: 126 QNVTDV--PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
+V P ++DWRE GAVT +K+QGHCGSCW+FSA A+EG G LI LSEQ L
Sbjct: 115 IEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNL 174
Query: 184 VDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
+DCS NNGC+GGLMD+AF+YI +NKGL TE YPY+ E C + A +G Y
Sbjct: 175 IDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVG-Y 233
Query: 242 EDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG 298
D+P+G+E L AV T PVSV ++AS Q+F+FY GV EC +N DHGV VG+G
Sbjct: 234 VDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYG 293
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
T +E+G YWL+KNSWGETWG++GYI++ R++ CGIA+ ASYP+
Sbjct: 294 T--DENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPL 338
>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 274 bits (701), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 145/340 (42%), Positives = 204/340 (60%), Gaps = 18/340 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+IP+ V++ + A SG + + ++++ QW A H R+Y E+ R +++
Sbjct: 11 VIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYR 70
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+EYI+ N+ G TY+LG N+F+DLT EEF A Y G +++ + + +
Sbjct: 71 TNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAG-GHTGSAITTAAEADGSLE--- 126
Query: 128 VTDVPTSIDWREKGAVTHIKNQG-HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
D P S+DWR KGAVT +KNQG C SCWAFSAVA +E + I GKL+ LSEQQLVDC
Sbjct: 127 -ADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQLVDC 185
Query: 187 STDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
+ GC+ G +AF++I+EN G+ T A YPY+ +G C K A TI + + K
Sbjct: 186 DKYDGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAAKP---AVTITGHLAVAK 242
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
+E AL AV +QP+ V +E + +FYK GV +A CG H V VG+G + G
Sbjct: 243 -NELALQSAVARQPIGVAIEVP-ISMQFYKSGVFSAACGIQMSHAVVTVGYGA--DASGL 298
Query: 307 KYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYP 343
KYWL+KNSWG+TWGE+GYIR+ RD GLCGIA + +YP
Sbjct: 299 KYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 338
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 198/320 (61%), Gaps = 13/320 (4%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSD 93
S+ + +W +HG+TY E EK +RL IF N E+++K N E G T+ +G N +D
Sbjct: 63 SLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLAD 122
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
LT +EF+ GYN + SR ST++Y +VT P IDW GAVT +KNQ CG
Sbjct: 123 LTKDEFK-KMLGYNAAL-RASRAPVDASTWEYADVTP-PEEIDWVASGAVTPVKNQKQCG 179
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLA 212
SCWAFS AVEG+ I GKLI LSE++L+ CST+ N GC+GGLMD FE+I+ N+G+
Sbjct: 180 SCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGID 239
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE + Y ++ C + A I ++D+P DE +L++AV++QPVSV +EA Q+F
Sbjct: 240 TEDGWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQSF 299
Query: 273 RFYKRGVLNA-ECGDNCDHGVAVVGFGTAEEEDGAK-YWLIKNSWGETWGESGYIRILRD 330
+ Y GV +A +CG DHGV +VG+G + K +W IKNSWG WGE GYIRI +
Sbjct: 300 QLYAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAKG 359
Query: 331 ----EGLCGIATEASYPVAM 346
EG CG+A + SYP +
Sbjct: 360 GSGVEGQCGVAMQPSYPTKL 379
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 147/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+D+ E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 95 TNEEFRASYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+ EFR G+N + R +S + TF +P S+DWR KGAVT +K+QGH
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST NNGC+GGLMD AF YI +N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
G+ TE YPY+ +C K A G + D+P+GDE + +AV T PVSV ++AS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRG-FTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 269 GQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
++F+FY GV N +C N DHGV VVGFGT +E G YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGDDYWLVKNSWGTTWGDKGFIK 321
Query: 327 ILRD-EGLCGIATEASYPV 344
+LR+ E CGIA+ +SYP+
Sbjct: 322 MLRNKENQCGIASASSYPL 340
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 151/343 (44%), Positives = 208/343 (60%), Gaps = 19/343 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
++ +L + +Q VS + I E+ + + +H + Y+DE E+ RL IF +N I
Sbjct: 4 YIFALLALVAVAQAVSFADV----IKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKI 59
Query: 74 EKANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP---STFKYQN 127
K N+ G ++K+G N+++D+ + EF + G+N + R S TF
Sbjct: 60 AKHNQLYAAGEVSFKMGLNKYADMLHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPE 119
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P S+DWR KGAVT +K+QGHCGSCWAFS+ A+EG G LI LSEQ LVDCS
Sbjct: 120 HVKLPQSVDWRNKGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCS 179
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
T NNGC+GGLMD AF YI +N G+ TE YPY+ +C K A G + D+P
Sbjct: 180 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKGTIGATDRG-FTDIP 238
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAECGD--NCDHGVAVVGFGTAEE 302
+GDE L QAV T PVSV ++AS ++F+FY GV + D N DHGV VVG+GT +
Sbjct: 239 QGDEKKLAQAVATIGPVSVAIDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGT--D 296
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILR-DEGLCGIATEASYPV 344
E+G YWL+KNSWG TWG+ G+I++ R D+ CGIAT +SYP+
Sbjct: 297 ENGKDYWLVKNSWGTTWGDKGFIKMARNDDNQCGIATASSYPL 339
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 199/311 (63%), Gaps = 10/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ +W A + R+Y E+ R ++++N+E+IE N+ GN TY LG N+F+DLT E
Sbjct: 53 MMDRFLRWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEE 112
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQG-HCGSCW 156
EF YT + +P V R + + + +V D PTS+DWR +GAVT IKNQG C SCW
Sbjct: 113 EFLDLYT--MKGMPPVRRDAGKKQQANFSSVVDAPTSVDWRSRGAVTPIKNQGPSCSSCW 170
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEAD 216
AF A +E ITQI GKL+ LSEQ+L+DC + GC+ G ++++I+N GL TEA+
Sbjct: 171 AFVTAATIESITQIRTGKLVSLSEQELIDCDPYDGGCNLGYFVNGYKWVIQNGGLTTEAN 230
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPYQ + C++ K AA I Y LP+G E L QAV +QPV+ +E G + +FY
Sbjct: 231 YPYQARRYQCNRSKAGQRAARISNYRQLPQG-EAQLQQAVAQQPVAAAIEMGG-SLQFYS 288
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI---LRDEGL 333
GV + +CG +H + VVG+G + G KYWL+KNSWG+TWGE GY+R+ +R GL
Sbjct: 289 GGVWSGQCGTRMNHAITVVGYGA--DSSGVKYWLVKNSWGQTWGERGYLRMRKDVRQGGL 346
Query: 334 CGIATEASYPV 344
CGIA + +YP+
Sbjct: 347 CGIALDLAYPI 357
>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 154/343 (44%), Positives = 207/343 (60%), Gaps = 32/343 (9%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F +++ + A QV R++ + S+ E+HEQ M ++ + YKD E F N+ YI
Sbjct: 12 FAMLLCMAFLAFQVTC-RTLQDASMYERHEQRMTRYSKVYKDPPES------FXGNVNYI 64
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N ++ YK G N+F R + G+ + R +TFK++NVT P+
Sbjct: 65 EACNNAADKPYKXGINQFPP------RNRFKGH------MCSSIIRITTFKFENVTATPS 112
Query: 134 SIDWREKGAVTH--IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELS-EQQLVDCSTD- 189
++D R+KGAVT +K+QG CG WA SAVAA EGI + GKLI LS E +LVDC T
Sbjct: 113 TVDCRQKGAVTPYTVKDQGQCGCFWALSAVAATEGIHALXAGKLILLSXEPELVDCDTKG 172
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD-KQKEKAAAATIGKYEDLPKG 247
+ GC GGL D AF++II+N GL TEA+YPY+ G C+ + +K AA I Y+D+P
Sbjct: 173 VDQGCEGGLTDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEADKNAATIITGYDDVPAN 232
Query: 248 DEHALLQ-AVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGA 306
+E A LQ AV PVSV ++ASG F+FYK GV CG DHGV VG+G + +DG
Sbjct: 233 NEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVS--DDGT 290
Query: 307 KYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
+YWL+KNS G WGE GYIR+ R +E LCGIA +ASYP A
Sbjct: 291 EYWLVKNSRGPEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 333
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 149/339 (43%), Positives = 208/339 (61%), Gaps = 17/339 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
V+ +L + Q +S + I E+ + + +H + + E+E+ R+ IF +N I
Sbjct: 4 VLALLALVAFVQAIS----YTDVIKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIA 59
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR-QSSRPSTFKYQNVTD 130
K N+ +G ++KLG N++SD+ EF+ + GYN + V R Q +
Sbjct: 60 KHNQLYAQGKVSFKLGLNKYSDMLYHEFKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQ 119
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P S+DWR+ GAVT +K+QGHCGSCWAFS+ AA+EG G L+ LSEQ LVDCST
Sbjct: 120 IPKSVDWRQHGAVTAVKDQGHCGSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKY 179
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC+GGLMD AF YI +N G+ TE YPY+ +C K A G + D+P+GD
Sbjct: 180 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFTKSGVGATDTG-FVDIPQGD 238
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDG 305
E AL++AV T PVSV ++AS ++F+ Y GV N EC N DHGV VVG+GT ++ G
Sbjct: 239 EEALMKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDAQNLDHGVLVVGYGT--DKTG 296
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYP 343
YWL+KNSWG TWG+ GYI++ R+ + CGIAT +SYP
Sbjct: 297 LDYWLVKNSWGTTWGDQGYIKMARNQDNQCGIATASSYP 335
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 155/344 (45%), Positives = 211/344 (61%), Gaps = 29/344 (8%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN 77
+LV++C + S + S ++ + H + Y +ELE++ R IF +N + IEK N
Sbjct: 4 LLVLSCLIALGQAVSFFDLS-ADEFTLFKKFHRKEYDNELEESYRKKIFLENKKRIEKHN 62
Query: 78 ---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
K+G ++KL N +D+ E+ Y G+N+ SS+ + K Q+ T +P +
Sbjct: 63 SRYKQGKVSFKLKLNHLADMLIHEYSDVYLGFNK--------SSKANNNKLQSYTFIPPA 114
Query: 135 -------IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+DWR KGAVT +KNQGHCGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 115 HVTLNKEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCS 174
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
NNGC GGLMD AF+YI EN G+ TE YPY+ E TC + ++ + AT + D+
Sbjct: 175 GSYGNNGCEGGLMDNAFQYIKENHGIDTEKSYPYEGEDETC-RFRKTSIGATDSGFVDIT 233
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEE 302
+GDE AL+QAV T P+SV ++AS Q+F+FY GV EC +N DHGV VVG+G
Sbjct: 234 QGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLVVGYGV--- 290
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
ED KYWL+KNSWG WG+ GYI++ RD + CGIAT+ASYP+
Sbjct: 291 EDNQKYWLVKNSWGTQWGDGGYIKMARDQDNNCGIATQASYPLV 334
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 153/350 (43%), Positives = 212/350 (60%), Gaps = 29/350 (8%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNL 70
F+I+IL A+ +S + + E+W A QH + Y E E+ +R+ I+ QN
Sbjct: 4 FLILILGFVAAANAISIFELVK-------EEWTAFKLQHRKKYDSETEERIRMKIYVQNK 56
Query: 71 EYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVS-------RQSSRP 120
I K N+ G ++L N+++DL +EEF + G+NR V + P
Sbjct: 57 HKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSVSGKGQLLRGELKPIEEP 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
T+ DVPT++DWR KGAVT +K+QGHCGSCW+FSA A+EG GKL+ LSE
Sbjct: 117 VTWIEPANVDVPTAMDWRTKGAVTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSE 176
Query: 181 QQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q LVDCS NNGC+GG+MD AF+YI +NKG+ TE YPY+ C KA AT
Sbjct: 177 QNLVDCSQKYGNNGCNGGMMDFAFQYIKDNKGIDTEKSYPYEAIDDEC-HYNPKAVGATD 235
Query: 239 GKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAEC-GDNCDHGVAVV 295
+ D+P+G+E AL++A+ T PVSV ++AS ++F+FY GV +C + DHGV V
Sbjct: 236 KGFVDIPQGNEKALMKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAV 295
Query: 296 GFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
G+GT E DG YWL+KNSWG TWG+ GY+++ R+ + CGIAT ASYP+
Sbjct: 296 GYGTTE--DGEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGIATTASYPL 343
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 209/343 (60%), Gaps = 19/343 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ +++ + +Q VS + + E+ + +H + Y D E+ R+ IF +N +I
Sbjct: 5 LITLLIALVAMTQAVSYSEL----VREEWNTFKLEHRKNYADSTEETFRMKIFNENKHHI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR---QSSRPSTFKYQN 127
K N+ G +YKL N+++D+ + EFR + G+N + R +S TF
Sbjct: 61 AKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFISPE 120
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+PT++DWR KGAVT +K+QGHCGSCWAFS+ A+EG G L+ LSEQ LVDCS
Sbjct: 121 HVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCS 180
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
T NNGC+GGLMD AF Y+ +N G+ TE Y Y+ +C K A G + D+P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDKNSIGATDRG-FADIP 239
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEE 302
+G+E L QAV T PVSV ++AS Q+F+FY GV + C +N DHGV VVG+GT E
Sbjct: 240 QGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAENLDHGVLVVGYGT--E 297
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
+DG+ YWL+KNSWG TWG+ G+I++ R+ E CGIA+ +SYP+
Sbjct: 298 KDGSDYWLVKNSWGTTWGDKGFIKMSRNKENQCGIASASSYPL 340
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 145/348 (41%), Positives = 200/348 (57%), Gaps = 33/348 (9%)
Query: 24 ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR- 82
A + S + S++E+ ++W A + ++Y E+ R ++ +N+ YIE N E
Sbjct: 32 AGDTMGSMSNDDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAA 91
Query: 83 --TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFK---------------- 124
TY+LG ++DLTN+EF A YT P++++ + S
Sbjct: 92 GLTYELGETAYTDLTNQEFMAMYT-----APALAQLPADESVITTRAGPVDAVGGAPGQL 146
Query: 125 --YQNVT-DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
Y N++ P S+DWR GAVT +KNQG CGSCWAFS VA VEGI QI GKL+ LSEQ
Sbjct: 147 PVYVNLSASAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQ 206
Query: 182 QLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
+LVDC T ++GC GG+ +A +I N G+ TEADYPY C++ K A +I
Sbjct: 207 ELVDCDTLDDGCDGGISYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGL 266
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
+ E +L AV QPV+V +EA G F+ YK+GV N CG N +HGV VVG+G E
Sbjct: 267 RRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQ-E 325
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
G +YW++KNSWG+ WG+ GYIR+ +D EGLCGIA SYP+
Sbjct: 326 AAAGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 146/319 (45%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+D+ E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 95 TNEEFRASYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+ EFR G+N + R +S + TF +P S+DWR KGAVT +K+QGH
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST NNGC+GGLMD AF YI +N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
G+ TE YPY+ +C K A G + D+P+GDE + +AV T PV+V ++AS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRG-FTDIPQGDEKKMAEAVATVGPVAVAIDAS 263
Query: 269 GQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
++F+FY GV N +C N DHGV VVGFGT +E G YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 321
Query: 327 ILRD-EGLCGIATEASYPV 344
+LR+ E CGIA+ +SYP+
Sbjct: 322 MLRNKENQCGIASASSYPL 340
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 273 bits (698), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 140/333 (42%), Positives = 196/333 (58%), Gaps = 29/333 (8%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
+ ++ +W A+H RTY E+ RL ++ +N+ YIE N + TY+LG ++DLT
Sbjct: 38 MAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTDLT 97
Query: 96 NEEFRASYTGYNRPVPSVSRQSSRPSTF------------------KYQNVT-DVPTSID 136
++EF A YT +R P P T Y N + P S+D
Sbjct: 98 SDEFTAMYT--SRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNESAGAPASVD 155
Query: 137 WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGG 196
WRE+GAVT +KNQG CGSCWAFS VA +EGI QI GKL LSEQ+LVDC ++GC+GG
Sbjct: 156 WRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCDKLDHGCNGG 215
Query: 197 LMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV 256
+ +A ++I N G+ ++ DYPY + TCD +K AA+I ++ + E +L AV
Sbjct: 216 VSYRALQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRVATRSELSLTNAV 275
Query: 257 TKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWG 316
QPV+V +EA G F+ Y+ GV N CG +HGV VVG+G +E G YW++KNSWG
Sbjct: 276 AMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYG-EDEVTGESYWIVKNSWG 334
Query: 317 ETWGESGYIR-----ILRDEGLCGIATEASYPV 344
E WG++GY+R I + EG+CGIA S+P+
Sbjct: 335 EKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 148/338 (43%), Positives = 198/338 (58%), Gaps = 14/338 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M I ILV+ A V S + + +WM + ++Y +E E R ++++N +
Sbjct: 1 MRAITILVLLAAICVASTLATTHDPLTGVFAEWMRDNSKSYSNE-EFVFRWNVWRENQQL 59
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE+ N+ N+T L N+F DLTN EF + G S +++ + K +
Sbjct: 60 IEEHNRS-NKTSFLAMNKFGDLTNAEFNKLFKGL---AFDYSFHANKAAAEKAVPAPGLS 115
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--N 190
DWR+KGAVTH+KNQG CGSCW+FS + EG + G+L LSEQ L+DCS N
Sbjct: 116 ADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGN 175
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
NGC+GGLMD AFEYII NKG+ TEA YPYQ Q TC + + ++ Y D+ GDE+
Sbjct: 176 NGCNGGLMDYAFEYIINNKGIDTEASYPYQTAQYTC-QYNPANSGGSLTSYTDVSSGDEN 234
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVAVVGFGTAEEEDGAKY 308
ALL AV +P SV ++AS +F+FY GV +A DHGV VG+GT EDG Y
Sbjct: 235 ALLNAVATEPTSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLAVGWGT---EDGQDY 291
Query: 309 WLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPVA 345
WL+KNSWG WG +GYI++ R+ CGIAT ASYP A
Sbjct: 292 WLVKNSWGADWGLAGYIKMARNRSNNCGIATSASYPTA 329
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 212/341 (62%), Gaps = 18/341 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ ++L I A+Q +S ++ + E+ + H + Y ++E++ R+ IF +N I
Sbjct: 5 IFLLLGILAAAQAISFFNL----VTEEWNTFKVTHRKAYDSKIEESFRMKIFMENWHKIA 60
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP--STFKYQNVT 129
N++ +YKLG N++ D+ + EF + G+N+ V + R RP S F
Sbjct: 61 LHNQKYELNEVSYKLGMNKYGDMLHHEFINTLNGFNKSVSAQLRAQRRPIGSRFIEPANV 120
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-- 187
++P+S+DWR GAVT IK+QGHCGSCW+FSA A+EG GKL+ LSEQ L+DCS
Sbjct: 121 EIPSSVDWRTHGAVTPIKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGR 180
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
NNGC+GGLMD+AF+YI +N GL TE YPY+ E C + + AT Y D+P+G
Sbjct: 181 YGNNGCNGGLMDQAFQYIKDNHGLDTEISYPYEAENDKC-RYNPRNNGATDSGYVDIPEG 239
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEEED 304
+E L AV T PVSV ++AS ++F+FY+ GV C +N DHGV VVG+GT +++
Sbjct: 240 NEKKLKAAVATIGPVSVAIDASAESFQFYREGVYYEPRCSSENLDHGVLVVGYGT--DDN 297
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YWL+KNSWG TWG+ GYI++ R+ + CGIA+ ASYP+
Sbjct: 298 DQDYWLVKNSWGVTWGDEGYIKMARNKDNHCGIASSASYPL 338
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 146/319 (45%), Positives = 200/319 (62%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+D+ E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 95 TNEEFRASYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+ EFR G+N + R S + TF +P S+DWR KGAVT +K+QGH
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGH 144
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST NNGC+GGLMD AF YI +N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
G+ TE YPY+ +C K A G + D+P+GDE + +AV T PVSV ++AS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNKGTIGATDRG-FTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 269 GQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
++F+FY GV N +C N DHGV VVGFGT +E G YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGDDYWLVKNSWGTTWGDKGFIK 321
Query: 327 ILRD-EGLCGIATEASYPV 344
+LR+ + CGIA+ +SYP+
Sbjct: 322 MLRNKDNQCGIASASSYPL 340
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 136/355 (38%), Positives = 219/355 (61%), Gaps = 20/355 (5%)
Query: 2 VLKFEKSFIIPMFVIIILVITCASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKA 60
V+KF I+P+ +I L C S + + E S+++ +++W + H R ++ E
Sbjct: 3 VMKF---LIVPLVLIAFLCNICESFELERKDFESEKSLMQLYKRWSSHH-RISRNANEMH 58
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG---YNRPVPS--VSR 115
R +FK N +++ K N G ++ KL N+F+D++++EFR Y+ Y + + + +
Sbjct: 59 NRFKVFKNNAKHVFKVNLMG-KSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAKKIEA 117
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKL 175
R F Y++ ++P+SIDWR+KGAV IKNQG CGSCWAF+AVAAVE I QI +L
Sbjct: 118 TGGRIGGFMYEHANNIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIHQIKTNEL 177
Query: 176 IELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAA 235
+ LSE++++DC + GC GG + AFE++++N G+ E +YPY + G C ++ +
Sbjct: 178 VSLSEEEVLDCDYRDGGCRGGFYNSAFEFMMDNDGVTIEDNYPYYEGNGYCRRRGGRNKR 237
Query: 236 ATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVA 293
I YE++P+ +E+AL++AV QPV+V + + G F+FY G+ N CG N DH V
Sbjct: 238 VRIDGYENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCGFNIDHTVV 297
Query: 294 VVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
VVG+GT E+ D YW+I+N +G WG +GY+++ R +G+CG+A + +YPV
Sbjct: 298 VVGYGTDEDGD---YWIIRNQYGHRWGMNGYMKMQRGAHSPQGVCGMAMQPAYPV 349
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 146/338 (43%), Positives = 195/338 (57%), Gaps = 17/338 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M +L + A V S ++ + WM +H ++Y +E E R ++++N Y
Sbjct: 1 MRTTTLLALCVALFVASTFAVSHDPLTGVFADWMQEHQKSYANE-EFVYRWNVWRENYLY 59
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N + N+++ L N+F DLTN EF + G + ++S +P
Sbjct: 60 IEAHNHQ-NKSFHLAMNKFGDLTNAEFNKLFKGLSITADQAKQESDIAP------APGLP 112
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--N 190
DWR+KGAVTH+KNQG CGSCW+FS + EG + G+L LSEQ LVDCST N
Sbjct: 113 ADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGN 172
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
+GC+GGLMD AFEYII NKG+ TE YPY QGTC K+ + + Y ++P G+E
Sbjct: 173 HGCNGGLMDYAFEYIIRNKGIDTEESYPYHASQGTCRYNKQHSGGELV-SYTNVPSGNEG 231
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLN--AECGDNCDHGVAVVGFGTAEEEDGAKY 308
ALL AV QP SV ++AS +F+FYK GV + A DHGV VG+G DG Y
Sbjct: 232 ALLNAVATQPTSVAIDASHSSFQFYKGGVYDEPACSSSRLDHGVLAVGWGV---RDGKDY 288
Query: 309 WLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPVA 345
WL+KNSWG WG SGYI + R++ CGIAT AS+P A
Sbjct: 289 WLVKNSWGADWGLSGYIEMSRNKHNQCGIATAASHPHA 326
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 146/338 (43%), Positives = 194/338 (57%), Gaps = 29/338 (8%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR---TYKLGT 88
S + S++E+ ++W A + ++Y E+ R + +N+ YIE N E TY+LG
Sbjct: 40 STDDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAAGLTYELGE 99
Query: 89 NEFSDLTNEEFRASYTGYNRPVPS---------------VSRQSSRPSTFK-YQNV-TDV 131
++DLTN+EF A YT P P+ V P Y N+ T
Sbjct: 100 TAYTDLTNQEFMAMYTA---PAPAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSTSA 156
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN 191
P S+DWR GAVT +KNQG CGSCWAFS VA VEGI QI GKL+ LSEQ+LVDC T ++
Sbjct: 157 PASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDD 216
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GG+ +A +I N G+ TE DYPY C++ K A +I + E +
Sbjct: 217 GCDGGISYRALRWIASNGGITTETDYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEAS 276
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
L AV QPV+V +EA G F+ YK+GV N CG N +HGV VVG+G E G +YW++
Sbjct: 277 LANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQ-EAAGGDRYWIV 335
Query: 312 KNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
KNSWG+ WG+ GYIR+ +D EGLCGIA SYP+
Sbjct: 336 KNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 371
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 143/325 (44%), Positives = 187/325 (57%), Gaps = 28/325 (8%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ +W A H RTY D E+ R +++ N+EYIE N+ G TY+LG N+F+DLT+E
Sbjct: 55 MLDRFVRWQAAHNRTYGDAEERLRRFQVYRANIEYIEATNRRGGLTYELGENQFADLTSE 114
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV---------------PTSIDWREKGA 142
EF + Y S R TDV P S DWR KGA
Sbjct: 115 EFLSMYA-------SSYDAGDRADDEAALITTDVAGDGAWSDGDLEALPPPSWDWRAKGA 167
Query: 143 VTHIKNQGH-CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKA 201
VT KNQG C SCWAF VA +EG+T I GKLI LSEQQLVDC + GC+ G +
Sbjct: 168 VTPPKNQGPTCSSCWAFVTVATIEGLTFIKTGKLISLSEQQLVDCDMYDGGCNTGSYSRG 227
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F +++EN GL TEA+YPY +G C++ K AA I +P +E + +AV QPV
Sbjct: 228 FRWVLENGGLTTEAEYPYTAARGPCNRAKSAHHAAKITGQGRIPPQNELVMQKAVAGQPV 287
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
V +E G +FYK GV + CG N H V VVG+G + GAKYW++KNSWG+ WGE
Sbjct: 288 GVAIEV-GSGMQFYKTGVYSGPCGTNLAHAVTVVGYGV-DPASGAKYWIVKNSWGQAWGE 345
Query: 322 SGYIRILRD---EGLCGIATEASYP 343
G+IR+ RD GLCGIA + +YP
Sbjct: 346 RGFIRMRRDVGGPGLCGIALDVAYP 370
>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 152/341 (44%), Positives = 207/341 (60%), Gaps = 29/341 (8%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F +++ + A QV R++ + S+ E H Q M ++ + KD + +FK+N+ YI
Sbjct: 12 FAMLLSMAFLAFQVTC-RTLQDASMYESHGQRMTRYSKVDKDPPD-----XVFKENVNYI 65
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N ++ YK N+F+ + + G+ + R +TFK++NVT P+
Sbjct: 66 EACNNAADKPYKRDINQFAP------KKRFKGH------MCSSIIRITTFKFENVTATPS 113
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL-SEQQLVDCSTD--N 190
++D R+K AVT IK+QG CG WA SAVAA EGI + GKLI L SEQ+LVDC T +
Sbjct: 114 TVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQELVDCDTKGVD 173
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDK-QKEKAAAATIGKYEDLPKGDE 249
C GGLMD AF++II+N GL TEA+YPY+ G C+ + +K AA I YED+P +E
Sbjct: 174 QDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNAYEADKNAATIITGYEDVPANNE 233
Query: 250 HALLQ-AVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
A LQ AV PVSV ++ASG F+FYK GV CG DHGV VG+G + +DG +Y
Sbjct: 234 KAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVS--DDGTEY 291
Query: 309 WLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPVA 345
WL+KNS G WGE GYIR+ R +E LCGIA +ASYP A
Sbjct: 292 WLVKNSRGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 150/337 (44%), Positives = 210/337 (62%), Gaps = 16/337 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ ++ VI AS + P++ + E + A+H + Y+ E+ MR IF++N ++I
Sbjct: 58 LLAVLAVIGLASALSP-----NPNLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFI 112
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N + + LG N F DLTN+E+R Y GY RP + S+ S S + + + DVP
Sbjct: 113 EDHNSKKEFDFYLGMNHFGDLTNKEYRERYLGYRRPENTPSKASYIFS--RAEKIEDVPD 170
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NN 191
IDWR++G VT +KNQG CGSCWAFSAV ++EG + GKL+ LSEQ LVDCST N+
Sbjct: 171 QIDWRDQGFVTPVKNQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNS 230
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC+GG MD+AFEY+ +N G+ TE YPY G+C K K+ AT+ + D+ +GDE A
Sbjct: 231 GCNGGWMDQAFEYVKDNHGIDTEDSYPYVGTDGSC-HFKNKSIGATLKGFMDVKEGDEEA 289
Query: 252 LLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAE-CGDN-CDHGVAVVGFGTAEEEDGAKY 308
L QAV PVSV ++AS F+FY+ GV N C + DHGV VVG+G ++ G +
Sbjct: 290 LRQAVGVAGPVSVAIDASSMLFQFYRGGVYNVPWCSTSELDHGVLVVGYG--KQFQGKDF 347
Query: 309 WLIKNSWGETWGESGYIRILRDEG-LCGIATEASYPV 344
W++KNSWG WG GYI + R++G CGIA++AS P
Sbjct: 348 WMVKNSWGVGWGIYGYIEMSRNKGNQCGIASKASIPT 384
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 139/326 (42%), Positives = 195/326 (59%), Gaps = 26/326 (7%)
Query: 42 HEQWMAQHGR-------------TYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYK 85
+E W ++HGR + E ++ +RL +F+ NL YI+K N E G T++
Sbjct: 84 YEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFR 143
Query: 86 LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAV 143
LG F+DLT +E+R G+ + ++ + +P +IDWR+ GAV
Sbjct: 144 LGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAIDWRQLGAV 203
Query: 144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFE 203
T +K+Q CG CWAFSAVAA+EGI I G L+ LSEQ+++DC ++GC GG M+ AF
Sbjct: 204 TEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQDSGCDGGQMENAFR 263
Query: 204 YIIENKGLATEADYPYQQEQGTCDKQKE-KAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
++I N G+ TEADYP+ GTCD KE ATI ++ +E AL +AV QPVS
Sbjct: 264 FVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNETALQEAVAIQPVS 323
Query: 263 VCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
V ++ASG+AF+ Y G+ N CG + DHGV VG+G+ E G YW++KNSW +WGE+
Sbjct: 324 VAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGS---ESGKDYWIVKNSWSASWGEA 380
Query: 323 GYIRILRD----EGLCGIATEASYPV 344
GYIR+ R+ G CGIA +ASYPV
Sbjct: 381 GYIRMRRNVPRPTGKCGIAMDASYPV 406
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 146/344 (42%), Positives = 203/344 (59%), Gaps = 17/344 (4%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
+IP+ V++ + A SG + + ++++ QW A H R+Y E+ R +++
Sbjct: 11 VIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYR 70
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG--YNRPVPSVSRQSSRPSTFKY 125
N+EYI+ N+ G TY+LG N+F+DLT EEF A Y G + + + S+
Sbjct: 71 TNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAGGHTGSAITTAAEADGLWSSGGS 130
Query: 126 QNV--TDVPTSIDWREKGAVTHIKNQG-HCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
D P S+DWR KGAVT +KNQG C SCWAFSAVA +E + I GKL+ LSEQQ
Sbjct: 131 DGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQ 190
Query: 183 LVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
LVDC + GC+ G +AF++I+EN G+ T A YPY+ +G C K A TI +
Sbjct: 191 LVDCDKYDGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAAKP---AVTITGHL 247
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
+ K +E AL AV +QP+ V +E + +FYK GV +A CG H V VG+G +
Sbjct: 248 AVAK-NELALQSAVARQPIGVAIEVP-ISMQFYKSGVFSAACGIQMSHAVVTVGYGA--D 303
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYP 343
G KYWL+KNSWG+TWGE+GYIR+ RD GLCGIA + +YP
Sbjct: 304 ASGLKYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 347
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 143/343 (41%), Positives = 209/343 (60%), Gaps = 25/343 (7%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ + + ++ ++CA++ + E+ E + HG+ YK++ E+ R IF N
Sbjct: 3 VLLVAVAVIAVSCANRFYNINP-------EEWETFKVVHGKNYKNQFEEMFRRKIFMNNK 55
Query: 71 EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ IE N ++G +YK+ N F DL + E +A G+ ++ + R + +
Sbjct: 56 KRIEAHNAKYEQGEVSYKMKMNHFGDLMSHEIKALMNGF-----KMTPNTKREGKIYFPS 110
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P S+DWR+KGAVT +K+QG CGSCW+FSA ++EG + GKL+ LSEQ L+DCS
Sbjct: 111 NDKLPKSVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMDCS 170
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ NNGC GGLMDKAF+Y+ +NKG+ TE+ YPY+ C +K+K G Y D+P
Sbjct: 171 KEYGNNGCEGGLMDKAFQYVSDNKGIDTESSYPYEARDYACRFKKDKVGGTDKG-YVDIP 229
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAE-CGD-NCDHGVAVVGFGTAEE 302
+GDE AL A+ T P+SV ++AS ++F FY GV N C + DHGV VG+GT
Sbjct: 230 EGDEKALQNALATVGPISVAIDASHESFHFYSEGVYNEPYCSSYDLDHGVLAVGYGT--- 286
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
E+G YWL+KNSWG +WGESGYI+I R+ CGIA+ ASYP+
Sbjct: 287 ENGQDYWLVKNSWGPSWGESGYIKIARNHSNHCGIASMASYPI 329
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 142/345 (41%), Positives = 197/345 (57%), Gaps = 14/345 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
K + +II + ++ A G S + + +E+ + WM +H + Y+ EK R
Sbjct: 9 KIIFLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+ Y G+ +
Sbjct: 69 FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVGF-VAEDFTGLEHFDNED 126
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
F Y++VT+ P SIDWR KGAVT +KNQG CGSCWAFS +A VEGI +I G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQE 186
Query: 183 LVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
LVDC + GC GG + +Y + N G+ T YPYQ +Q C + I Y+
Sbjct: 187 LVDCDKHSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYK 245
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
+P E + L A+ QP+SV VEA G+ F+ YK GV + CG DH V VG+GT+
Sbjct: 246 RVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-- 303
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYP 343
DG Y +IKNSWG WGE GY+R+ R +G CG+ + YP
Sbjct: 304 -DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma
variegatum]
Length = 337
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 213/344 (61%), Gaps = 20/344 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++L CA+ + + H+ + + + A HG+ Y+ E E+ RL I+ +N
Sbjct: 1 MRGFVVLCFLCAAMTAAAIT-HQELVGAEWSAFKALHGKEYQSETEEYYRLKIYMENRMM 59
Query: 73 IEKANKE--GNR-TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSS---RPSTFKYQ 126
I + N++ N+ +YKL NE+ D+ + EF ++ G+ R S RQ S P + +
Sbjct: 60 IARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNGFRRDYRSKPRQGSFYIEPEGIEDK 119
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
++ P ++DWR+KGAVT +KNQG CGSCWAFS ++EG G ++ LSEQ LVDC
Sbjct: 120 HL---PKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDC 176
Query: 187 ST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
ST NNGC GGLMD AF+YI N G+ TE YPY GTC +K A G + D+
Sbjct: 177 STAFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTDGTCHFKKSDVGATDTG-FVDI 235
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAE 301
P+G+EH L +AV T P+SV ++AS Q+F+FY +GV + EC +N DHGV VVG+GT +
Sbjct: 236 PEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDEPECSSENLDHGVLVVGYGTKD 295
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
++D YWL+KNSWG TWG+ GYI + R+ + CGIA+ ASYP+
Sbjct: 296 DQD---YWLVKNSWGTTWGDGGYIYMTRNKDNQCGIASSASYPL 336
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 140/345 (40%), Positives = 199/345 (57%), Gaps = 14/345 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
K + +II + ++ A G S + + +E+ + WM +H + Y+ EK R
Sbjct: 9 KIIFLATCLIIHMSLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+ Y G + +
Sbjct: 69 FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVG-SVAEDFTGLEHFDNED 126
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
F Y++VT+ P SIDWR KGAVT +KNQG CGSCWAFS +A VEG+ +I G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGSCGSCWAFSTIATVEGVNKIVTGNLLELSEQE 186
Query: 183 LVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
LVDC +++GC GG + +Y+ +N G+ T YPYQ + C + I Y+
Sbjct: 187 LVDCDKNSHGCKGGYQTTSLQYVADN-GVHTSKVYPYQAKAMQCRATDKPGPKVKITGYK 245
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
+P E + L A+ QP+SV VEA G+ F+ YK GV + CG DH V VG+GT+
Sbjct: 246 RVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-- 303
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYP 343
DG Y +IKNSWG WGE GY+R+ R +G CG+ + YP
Sbjct: 304 -DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 203/338 (60%), Gaps = 15/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
++ ++LV C VVS SM E QW +HG+ Y + E+A R I+++NL+ +
Sbjct: 3 YLSVLLVAVC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
K N + G+ TY LG N+F+DL NEEF A TG+ V S+ + + NV
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLQNEEFVAMMTGFR--VNGTSKAAKGSTFLPSNNVDK 117
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
+P ++DWR KG VT +K+QG CGSCWAFSA ++EG GKL+ LSEQ LVDCS N
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDCSYRN 177
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GG MD+AF+YII+ G+ TEA Y Y+ G C +K A G Y D+ G E
Sbjct: 178 YGCHGGFMDRAFQYIIDAGGIDTEATYSYRAVDGNCHFKKANVGATVTG-YTDVTSGSEK 236
Query: 251 ALLQAVTK-QPVSVCVEASGQAFRFYKRGVLNAE-CGDN-CDHGVAVVGFGTAEEEDGAK 307
AL +AV P+SV ++AS + F+FYK GV N C H V VVG+GT DG
Sbjct: 237 ALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTT--SDGTD 294
Query: 308 YWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YW++KNSW +TWG +GY+ + R+ + CGIA+EASYP+
Sbjct: 295 YWIVKNSWAKTWGMNGYLWMSRNKDNQCGIASEASYPM 332
>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 358
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 139/320 (43%), Positives = 188/320 (58%), Gaps = 14/320 (4%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
++ +HE+WMA+ GR+Y D EKA R +F N +++ N+ GNRTY LG N+FSDLT+
Sbjct: 37 TMASRHERWMARFGRSYTDAGEKARRQEVFGANARHVDAVNRAGNRTYTLGLNQFSDLTD 96
Query: 97 EEFRASYTGYNRPVPS----VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
EF + GY R + + P D+P S+DWR KGAVT IKNQ C
Sbjct: 97 HEFLQQHLGYGRHHGQRGLLLPEEEVMPKATALGYGQDMPYSVDWRAKGAVTEIKNQRSC 156
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLA 212
GSCWAF+AVAA EG+ +I G LI +SEQQ++DC+ D + C G + A Y++ + GL
Sbjct: 157 GSCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGDRSSCDSGYISDALRYVVTSGGLQ 216
Query: 213 TEADYPYQQEQGTCDKQ---KEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
EA Y Y ++G C + + +AA+ G + GDE AL +QPV+V VEAS
Sbjct: 217 REAAYAYTGQKGACGSRRFARPNSAASVGGVHMATLNGDEGALQGLAARQPVAVIVEASE 276
Query: 270 QAFRFYKRGVL--NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
FR Y GV +A CG +H + VVG+GT E +YWL+KN WG WGE+GY+R+
Sbjct: 277 PDFRHYSSGVYAGSASCGRELNHALTVVGYGT--ENGAGEYWLVKNQWGTWWGENGYMRV 334
Query: 328 LRDEGL---CGIATEASYPV 344
R G CGIA+ A YP
Sbjct: 335 ARRNGAGANCGIASVAFYPT 354
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 142/317 (44%), Positives = 193/317 (60%), Gaps = 36/317 (11%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKA--MRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFS 92
+ + ++ W ++HGR +D + A +RL +F+ NL YI+ N E G T++LG F+
Sbjct: 47 VRQLYKTWKSEHGRP-RDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLGLTPFT 105
Query: 93 DLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
DLT EEFRA G+ N +P V+ P D+P ++DWR++GAVT +KNQ
Sbjct: 106 DLTLEEFRAHALGFLNSTLPRVASDRYLPRAGD-----DLPDAVDWRQQGAVTGVKNQLD 160
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGL 211
CG CWAFSAVAA+EGI +I LI LSEQ+L+DC T++ GC GG M KAF+++I+N G+
Sbjct: 161 CGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTEDYGCQGGEMQKAFQFVIDNGGI 220
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
TEADYP+ GTCD +EK +I YE++P DE AL +AV QP
Sbjct: 221 DTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP----------- 269
Query: 272 FRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
G+ N CG DHGV VG+G+ ++G +W++KNSWG WGESGYIR+ R+
Sbjct: 270 ------GIFNGPCGFILDHGVTAVGYGS---DNGEDFWIVKNSWGAEWGESGYIRMKRNV 320
Query: 331 ---EGLCGIATEASYPV 344
G CGIA ASYPV
Sbjct: 321 LLPMGKCGIAMYASYPV 337
>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
Length = 360
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 135/302 (44%), Positives = 184/302 (60%), Gaps = 11/302 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ W H R+Y E R ++++N E+I+ N G+ TY+L NEF+DLT E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 106
Query: 98 EFRASYTGY---NRPVPS---VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQ-G 150
EF A+YTGY + PV + ++F Y+ DVP S+DWR +GAV K+Q
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTS 164
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKG 210
C SCWAF A +E + I GKL+ LSEQQLVDC + + GC+ G +A+++++EN G
Sbjct: 165 TCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGG 224
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
L TEADYPY +G C++ K AA I + +P +E AL AV +QPV+V +E G
Sbjct: 225 LTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GS 283
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
+FYK GV CG H V VVG+GT + GAKYW IKNSWG++WGE GYIRILRD
Sbjct: 284 GMQFYKGGVYTGPCGTRLAHAVTVVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRD 342
Query: 331 EG 332
G
Sbjct: 343 VG 344
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 145/325 (44%), Positives = 190/325 (58%), Gaps = 20/325 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ EQWM +HGR Y D EK R ++++N+E +E N N YKL N+F+DLTNE
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85
Query: 98 EFRASYTGYNRP---VPSVSRQSSRPSTFKYQNVTDV-PTSIDWREKGAVTH-IKNQGHC 152
EFRA G+ RP +P +S S ++ D+ P S+DWR KGAV + K
Sbjct: 86 EFRAKMLGF-RPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAVINRWKICVDA 144
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLA 212
GSCWAFSAVAA+EGI QI G+L+ LSEQ+LVDC + GC GG M AFE+++ N GL
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLT 204
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TEA YPY G C K +A I Y ++ E L +A QPVSV V+ F
Sbjct: 205 TEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMF 264
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEED--------GAKYWLIKNSWGETWGESGY 324
+ Y GV C + +HGV VVG+G +E + G KYW++KNSWG WG++GY
Sbjct: 265 QLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGY 324
Query: 325 IRILRD-----EGLCGIATEASYPV 344
I + RD GLCGIA SYPV
Sbjct: 325 ILMQRDVAGLASGLCGIALLPSYPV 349
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 131/245 (53%), Positives = 174/245 (71%), Gaps = 5/245 (2%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++H + YK EK R +F++NL +I++ N E N +Y LG NEF+DLT+E
Sbjct: 47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G +P S RQ S + F+Y+++TD+P S+DWR+KGAV +K+QG CGSCWA
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPS--ANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWA 163
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QIT G L LSEQ+L+DC T N+GC+GGLMD AF+YII GL E D
Sbjct: 164 FSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDD 223
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+G C +QKE TI YED+P+ D+ +L++A+ QPVSV +EASG+ F+FYK
Sbjct: 224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283
Query: 277 RGVLN 281
GV N
Sbjct: 284 -GVYN 287
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 143/341 (41%), Positives = 210/341 (61%), Gaps = 18/341 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
V+ +L + Q +S + I E+ + + +H + Y E+E+ R+ IF +N I
Sbjct: 4 VLALLALVAFVQAISITDV----IKEEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIA 59
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
K N+ +G ++KLG N+++D+ + EF+ + GYN + R + Y + +V
Sbjct: 60 KHNQLYAQGKVSFKLGLNKYADMLHHEFKETMNGYNHTMRKELRAQEGFNGITYISPANV 119
Query: 132 --PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
P ++DWR+ GAVT +K+QGHCGSCW+FS+ ++EG G L+ LSEQ LVDCST
Sbjct: 120 QVPKAVDWRQHGAVTSVKDQGHCGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTK 179
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
NNGC+GGLMD AF YI +N G+ TE YPY+ +C K A G + D+P+G
Sbjct: 180 YGNNGCNGGLMDNAFRYIKDNGGVDTEKSYPYEGIDDSCHFNKATVGATDTG-FVDIPQG 238
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEED 304
DE A+++AV T PV+V ++AS ++F+ Y GV N C DN DHGV VVG+GT ++D
Sbjct: 239 DEEAMMKAVATMGPVAVAIDASNESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGT--DKD 296
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
G YWL+KNSWG TWG+ GYI++ R+ + CGIAT +S+P
Sbjct: 297 GQDYWLVKNSWGTTWGDQGYIKMARNQDNQCGIATASSFPT 337
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 208/343 (60%), Gaps = 19/343 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
++ +L + +Q VS + I E+ + +H + Y+DE E+ RL IF +N I
Sbjct: 5 LILPLLALVAVAQAVS----YAEVIQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60
Query: 74 EKANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR---QSSRPSTFKYQN 127
K N+ G ++K+ N+++D+ + EF ++ G+N + R +S + TF
Sbjct: 61 AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P +DWR KGAVT +K+QGHCGSCWAFS+ A+EG G L+ LSEQ LVDCS
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
T NNGC+GGLMD AF YI +N G+ TE YPY+ +C K A G + D+P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGSIGATDRG-FVDIP 239
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAECGD--NCDHGVAVVGFGTAEE 302
+G+E + +AV T PV+V ++AS ++F+FY GV N D N DHGV VVGFGT +
Sbjct: 240 QGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGT--D 297
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
E G YWL+KNSWG TWG+ G+I++LR+ E CGIA+ +SYP+
Sbjct: 298 ESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340
>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 142/315 (45%), Positives = 204/315 (64%), Gaps = 10/315 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ + +E+W +H ++ EK R ++FK+N+ ++ N+ ++ YKL N+F+D+
Sbjct: 34 EESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
+N EF Y N + R + F Y+ TD+P+S+DWRE+GAV +K QG CG
Sbjct: 92 SNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDWRERGAVNAVKEQGRCG 151
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLAT 213
SCWAFS+VAAVEGI +I +L+ LSEQ+L+DC+ N GC+GG M+ AF++I N G+AT
Sbjct: 152 SCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYRNKGCNGGFMEIAFDFIKRNGGIAT 211
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
E YPY +G C + + I YE +P+ +E AL+QAV QPVSV ++A+G+ F+
Sbjct: 212 ENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDAAGRDFQ 270
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--- 330
FY +GV + CG +HGV +G+GT EDG YWL++NSWG WGE GY+R+ R
Sbjct: 271 FYSQGVFDGYCGTELNHGVVAIGYGTT--EDGTDYWLVRNSWGVGWGEDGYVRMKRGVEQ 328
Query: 331 -EGLCGIATEASYPV 344
EGLCGIA EASYP+
Sbjct: 329 AEGLCGIAMEASYPI 343
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 194/311 (62%), Gaps = 8/311 (2%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ W A + R+Y E+ R ++++N+E+IE N+ GN TY LG N+F+DLT E
Sbjct: 45 MMDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEE 104
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQG-HCGSCW 156
EF YT PV + + R + D PTS+DWR KGAVT IKNQG C SCW
Sbjct: 105 EFLDLYTMKGMPVRRDAGKK-RANVSSSAAAVDAPTSVDWRSKGAVTPIKNQGPSCSSCW 163
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEAD 216
AF A +E IT+IT GKL+ LSEQ+L+DC + GC+ G + ++I+N GL TEA+
Sbjct: 164 AFVTAATIESITKITTGKLVSLSEQELIDCDPYDGGCNLGYFVNGYRWVIQNGGLTTEAN 223
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPYQ + C + + AATI Y LP G E L QAV +QPV+ +E G + +FY
Sbjct: 224 YPYQARRYACSRSRAAQHAATISDYVQLPAG-EGQLQQAVAQQPVAAAIEMGG-SLQFYS 281
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGL 333
GV + +CG +H + VVG+G A+ G KYWL+KNSWG++WGE GY+R+ RD GL
Sbjct: 282 GGVFSGQCGTRMNHAITVVGYG-ADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRGGL 340
Query: 334 CGIATEASYPV 344
CGIA + +YPV
Sbjct: 341 CGIALDLAYPV 351
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+D+ E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADL 84
Query: 95 TNEEFRASYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+ EFR G+N + R S + TF +P S+DWR KGAVT +K+QGH
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST NNGC+GGLMD AF YI +N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
G+ TE YPY+ +C K A AT + D+P+GDE + +AV T PV+V ++AS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNK-GAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDAS 263
Query: 269 GQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
++F+FY GV N +C N DHGV VVG+GT +E G YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGT--DESGDDYWLVKNSWGTTWGDKGFIK 321
Query: 327 ILRD-EGLCGIATEASYPV 344
+LR+ + CGIA+ +SYP+
Sbjct: 322 MLRNKDNQCGIASASSYPL 340
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 270 bits (690), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 208/343 (60%), Gaps = 19/343 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
++ +L + +Q VS + I E+ + +H + Y+DE E+ RL IF +N I
Sbjct: 5 LILPLLALVAVAQAVS----YAEVIQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60
Query: 74 EKANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR---QSSRPSTFKYQN 127
K N+ G ++K+ N+++D+ + EF ++ G+N + R +S + TF
Sbjct: 61 AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P +DWR KGAVT +K+QGHCGSCWAFS+ A+EG G L+ LSEQ LVDCS
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
T NNGC+GGLMD AF YI +N G+ TE YPY+ +C K A G + D+P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNKGTIGATDRG-FVDIP 239
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAECGD--NCDHGVAVVGFGTAEE 302
+G+E + +AV T PV+V ++AS ++F+FY GV N D N DHGV VVGFGT +
Sbjct: 240 QGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGT--D 297
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
E G YWL+KNSWG TWG+ G+I++LR+ E CGIA+ +SYP+
Sbjct: 298 ESGQDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 193/316 (61%), Gaps = 16/316 (5%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK------EGNRTYKLGTNEFSDL 94
+ E W A+HG+ Y E+A RL F +N ++ N G +Y L N F+DL
Sbjct: 38 QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSIDWREKGAVTHIKNQGHCG 153
T++EFRA+ G P S PS ++ V VP ++DWR+ GAVT +K+QG CG
Sbjct: 98 THDEFRAARLGRLAVGPGPLGAPS-PSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCG 156
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLA 212
+CW+FSA A+EGI +IT G L+ LSEQ+L+DC N GC GGLM A++++I+N G+
Sbjct: 157 ACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGID 216
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYP+++ GTC+K K K TI Y+++P E LLQAV +QP+SV + S +AF
Sbjct: 217 TEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAF 276
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+ Y +G+ + C + DH V +VG+G+ E G YW++KNSWGE WG GY+ + R+
Sbjct: 277 QLYSQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGERWGMKGYMHMHRNTG 333
Query: 331 --EGLCGIATEASYPV 344
G+CGI AS+P
Sbjct: 334 SSSGICGINMMASFPT 349
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 151/339 (44%), Positives = 197/339 (58%), Gaps = 33/339 (9%)
Query: 35 EPSIVE----KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE--GNRTYKLGT 88
+P+I++ + ++W A+HGR Y E+ RL ++ +N+ YIE AN + TY+LG
Sbjct: 42 DPTILQTMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGE 101
Query: 89 NEFSDLTNEEFRASYTGYNRPVPSVSRQ----------SSRPSTFK------YQNVTDV- 131
++DLT +EF A YT P P +S ++R Y NV+
Sbjct: 102 TAYTDLTADEFTAMYTS---PSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAG 158
Query: 132 -PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
P S+DWR KGAVT +KNQG CGSCWAFS VA VEGI QI G LI LSEQ+LVDC T +
Sbjct: 159 APASVDWRAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDTLD 218
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GG+ A E+I N G+ATEADYPY + G C K AA I + + E
Sbjct: 219 YGCDGGVSYHALEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEP 278
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
+L AV QPV+V +EA G F+ Y +GV N CG +HGV VV EE DG KYW+
Sbjct: 279 SLANAVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVV-GYGEEEGDGEKYWI 337
Query: 311 IKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
+KNSWG+ WG+ GY R+ +D EGLCGIA S+P+
Sbjct: 338 VKNSWGKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 151/348 (43%), Positives = 209/348 (60%), Gaps = 24/348 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEK-HEQWMA---QHGRTYKDELEKAMRLTIFKQ 68
M + +IL IT + V H S E +++WM +H + YK ++E+ R+ IF
Sbjct: 1 MKLFLILFITIFATV------HAVSFFELVNQEWMTFKMEHKKAYKSDVEERFRMKIFMD 54
Query: 69 NLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRP--STF 123
N I K N +YKL N++ D+ + EF G+N+ + + R P ++F
Sbjct: 55 NKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASF 114
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
+P +DWR++GAVT +K+QGHCGSCW+FSA A+EG G L+ LSEQ L
Sbjct: 115 IEPANVALPKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNL 174
Query: 184 VDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
+DCS NNGC+GGLMD+AF+YI +NKGL TEA YPY+ E C + A +G Y
Sbjct: 175 IDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDVG-Y 233
Query: 242 EDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG 298
D+P G+E L AV T PVSV ++AS Q+F+FY GV EC + DHGV V+G+G
Sbjct: 234 IDIPTGNEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYG 293
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPVA 345
T E+G YWL+KNSWGETWG +GYI++ R++ CGIA+ ASYP+
Sbjct: 294 T--NENGEDYWLVKNSWGETWGNNGYIKMARNKLNHCGIASSASYPLV 339
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 141/345 (40%), Positives = 196/345 (56%), Gaps = 14/345 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
K + +II + ++ A G S + + +E+ + WM +H + Y+ EK R
Sbjct: 9 KIIFLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+ Y G+ +
Sbjct: 69 FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVGF-VAEDFTGLEHFDNED 126
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
F Y++VT+ P SIDWR KGAVT +KNQG CGSCWAFS +A VEGI +I G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQE 186
Query: 183 LVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
LVDC + GC GG + +Y + N G+ T YPYQ +Q C + I Y+
Sbjct: 187 LVDCDKHSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYK 245
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
+P E + L A+ QP+S VEA G+ F+ YK GV + CG DH V VG+GT+
Sbjct: 246 RVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-- 303
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYP 343
DG Y +IKNSWG WGE GY+R+ R +G CG+ + YP
Sbjct: 304 -DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 144/335 (42%), Positives = 202/335 (60%), Gaps = 18/335 (5%)
Query: 19 LVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI-EKAN 77
+V+ S++VS E SI+E +QW +H + Y+ E R FK+NL+YI EKA
Sbjct: 32 IVVNDFSELVS-----EESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAG 86
Query: 78 KE-GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
K+ + +G N+F+DL+NEEF+ Y + ++ R ++R + D P+S+D
Sbjct: 87 KKTAALGHSVGLNKFADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDAPSSLD 146
Query: 137 WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGG 196
WR+KG VT +K+QG CGSCW+FS A+EGI I G LI LSEQ+LVDC T N GC GG
Sbjct: 147 WRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTTNYGCEGG 206
Query: 197 LMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV 256
MD AFE++I N G+ TEA+YPY GTC+ KE+ +I Y D+ + D ALL A
Sbjct: 207 YMDYAFEWVINNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETDS-ALLCAT 265
Query: 257 TKQPVSVCVEASGQAFRFYKRGVLNAECGD---NCDHGVAVVGFGTAEEEDGAKYWLIKN 313
+QP+SV ++ S F+ Y G+ + +C D + DH V +VG+G+ E+G YW++KN
Sbjct: 266 VQQPISVGMDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYGS---ENGEDYWIVKN 322
Query: 314 SWGETWGESGYIRILRDE----GLCGIATEASYPV 344
SWG WG GY I R+ G+C I EASYP
Sbjct: 323 SWGTEWGMEGYFYIKRNTDLPYGVCAINAEASYPT 357
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 134/307 (43%), Positives = 191/307 (62%), Gaps = 9/307 (2%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+ E W A+HGR+Y E+A RL F N ++ A+ +Y L N F+DLT++EFR
Sbjct: 37 QFEAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFR 95
Query: 101 ASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
A+ G R P V VP ++DWR+ GAVT +K+QG CG+CW+FSA
Sbjct: 96 AARLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSA 155
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
A+EGI +I G LI LSEQ+L+DC N+GC GGLMD A++++++N G+ TEADYPY
Sbjct: 156 TGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPY 215
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
++ GTC+K K K TI Y+D+P +E LLQAV +QPVSV + S +AF+ Y +G+
Sbjct: 216 RETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGI 275
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCG 335
+ C + DH + +VG+G+ E G YW++KNSWGE+WG GY+ + R+ G+CG
Sbjct: 276 FDGPCPTSLDHAILIVGYGS---EGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCG 332
Query: 336 IATEASY 342
I S+
Sbjct: 333 INQMPSF 339
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 152/356 (42%), Positives = 209/356 (58%), Gaps = 19/356 (5%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+V+ F +FI+ + IL A Q + + + +H + Y DE E+
Sbjct: 68 VVMLFVNAFIL----VFILKKRKAYQNLKATEEQPRTSYAATSTHVLEHRKNYLDETEER 123
Query: 61 MRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR-- 115
RL IF +N I K N+ G +YKL N+++D+ + EFR G+N + R
Sbjct: 124 FRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMNGFNYTLHKELRAA 183
Query: 116 -QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGK 174
+S + TF +P S+DWR+KGAVT +K+QGHCGSCWAFS+ A+EG G
Sbjct: 184 DESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSSTGALEGQHYRKSGV 243
Query: 175 LIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK 232
L+ LSEQ LVDCST NNGC+GGLMD AF YI +N G+ TE YPY+ +C K
Sbjct: 244 LVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEALDDSCHFNKGT 303
Query: 233 AAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAECGD--NCD 289
A G + D+P+G+E L +AV T PVSV ++AS ++F+FY GV D N D
Sbjct: 304 IGATDRG-FVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEGVYVEPACDAQNLD 362
Query: 290 HGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
HGV VVGFGT +E G YWL+KNSWG TWG+ G+I++LR+ + CGIA+ +SYP+
Sbjct: 363 HGVLVVGFGT--DESGQDYWLVKNSWGTTWGDKGFIKMLRNKDNQCGIASASSYPL 416
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 148/343 (43%), Positives = 206/343 (60%), Gaps = 21/343 (6%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+ +L + +Q VS + I E+ + + +H + Y DE E+ RL IF +N I
Sbjct: 4 LFALLALVAVAQAVS----YADVIKEEWQTFKLEHRKNYVDETEERFRLKIFNENKHKIA 59
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS----TFKYQN 127
K N+ G ++K+ N+++D+ + EF + G+N + R +S PS TF
Sbjct: 60 KHNQRYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLR-ASDPSFVGVTFISPE 118
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P S+DWR KGAVT +K+QGHCGSCWAFS+ A+EG G LI LSEQ LVDCS
Sbjct: 119 HVKIPKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCS 178
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
T NNGC+GGLMD AF YI +N G+ TE YPY+ +C K A G D+P
Sbjct: 179 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKATIGATDRGSV-DIP 237
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAECGD--NCDHGVAVVGFGTAEE 302
+GDE + +AV T PVSV ++AS ++F+FY G+ N D N DHGV VVG+GT +
Sbjct: 238 QGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGT--D 295
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
E G YWL+KNSWG TWG+ G+I++ R+ + CGIA+ +SYP+
Sbjct: 296 ESGQDYWLVKNSWGTTWGDKGFIKMARNADNQCGIASASSYPL 338
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 151/347 (43%), Positives = 210/347 (60%), Gaps = 27/347 (7%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQW---MAQHGRTYKDELEKAMRLTIFKQNL 70
F ++ LV +Q VS + + EQW QH + YK + E+ R+ IF +N
Sbjct: 3 FFVLALVFIVGAQAVSFFDLVQ-------EQWGTFKLQHKKQYKSDTEEKFRMKIFMENS 55
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNR----PVPSVSRQSSRPSTF 123
+ K NK G +YKL N+++D+ + EF + G+NR P+ S + + +TF
Sbjct: 56 HKVAKXNKLYEMGLVSYKLKINKYADMLHHEFVHTVNGFNRTKNTPLLGTS-EDEQGATF 114
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
P ++DWRE GAVT +K+QGHCGSCW+FSA A+EG KL+ LSEQ L
Sbjct: 115 IAPANVKFPENVDWREHGAVTXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNL 174
Query: 184 VDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
VDCST N+GC+GGLMD AF+Y+ N G+ TEA YPY + C K + AT +
Sbjct: 175 VDCSTKFGNDGCNGGLMDNAFKYVKYNHGIDTEASYPYHADDEKC-HYNPKTSGATDRGF 233
Query: 242 EDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG 298
D+P GDE L+ AV T PVSV ++AS ++F+ Y GV + EC + DHGV VVG+G
Sbjct: 234 VDIPTGDEEKLMAAVATVGPVSVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYG 293
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
T +E+G YW++KNSWGE+WGE GYI++ R+ + CGIAT+ASYP+
Sbjct: 294 T--DENGQDYWIVKNSWGESWGEQGYIKMARNRDNNCGIATQASYPL 338
>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 153/342 (44%), Positives = 207/342 (60%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M +++IL C + S SM + S+ +W A+H + Y E+ R ++++N++
Sbjct: 1 MNLLLILAAFCVG-ITSATSMFDGSLNAHWYRWKAKHRKLYGMR-EEGWRRAVWEKNMKM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N+E G + + N F D+TNEEFR G+ +++ + F+ +
Sbjct: 59 IEVHNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFR------NQKHKKGKVFQEPSFL 112
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+VP S+DWREKG VT +KNQG CGSCWAFSA A+EG GKLI LSEQ LVDCS
Sbjct: 113 EVPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRP 172
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC GGLMD AF+YI EN GL +E YPY +C + E + A G + D+PK
Sbjct: 173 QGNEGCDGGLMDYAFQYIKENGGLDSEESYPYDAMDESCKYRPEYSVANDTG-FVDIPK- 230
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAE-EE 303
+E AL++AV T P+SV ++A ++F+FYK GV EC DN DHGV VVG+G E E
Sbjct: 231 EEKALMKAVATVGPISVAIDAGHESFQFYKEGVYFEPECSSDNVDHGVLVVGYGYEETES 290
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
D K+WL+KNSWGE WG GYI++ +D+ CGIAT ASYP
Sbjct: 291 DNNKFWLVKNSWGEEWGLGGYIKMTKDQKNHCGIATAASYPT 332
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 149/338 (44%), Positives = 205/338 (60%), Gaps = 15/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
++ ++LV C VVS SM E ++W +HG+ Y + E+A R I+++NL+ +
Sbjct: 3 YLSVLLVAVC---VVSSLSMSFTDFDEDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ N + G+ TY LG N+F+DL N+EF A TG+ V S+ + + NV
Sbjct: 60 IRHNLKYDLGHFTYDLGMNQFADLQNKEFVAMMTGFR--VNGTSKAAKGSTFLPPNNVGK 117
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDN 190
+P ++DWR KG VT +K+QG CGSCWAFSA ++EG GKL+ LSEQ LVDCS N
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSDKN 177
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC+GGLMD+AF+YII+ G+ TE YPY G C K AT+ Y D+ G E
Sbjct: 178 YGCNGGLMDRAFQYIIDAGGIDTEESYPYIAMDGNC-HFKTANVGATVTGYTDVTSGSEK 236
Query: 251 ALLQAVTK-QPVSVCVEASGQAFRFYKRGVLNAE-CGDN-CDHGVAVVGFGTAEEEDGAK 307
AL +AV P+SV ++AS +F+ Y+ GV N C DHGV VG+GT DG
Sbjct: 237 ALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTT--IDGTD 294
Query: 308 YWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YW++KNSW ETWG +GYI + R+ + CGIAT+ASYP+
Sbjct: 295 YWIVKNSWAETWGMNGYIWMSRNKDNQCGIATQASYPL 332
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 144/325 (44%), Positives = 194/325 (59%), Gaps = 29/325 (8%)
Query: 42 HEQWMAQH----------GRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGT 88
+E+W ++H G E + A RL +F+ NL YI+ N E G ++LG
Sbjct: 53 YEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNLRYIDAHNAEADAGLHGFRLGL 112
Query: 89 NEFSDLTNEEFRASYT--GYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVT 144
F+DLT EE+RA R +V SR +Y + +P ++DWRE+GAV
Sbjct: 113 TRFADLTLEEYRARLLLGSRGRNGTAVGVVGSR----RYLPLAGEQLPDAVDWRERGAVA 168
Query: 145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-DNNGCSGGLMDKAFE 203
+K+QG CG+CWAFSAVAAVEGI +I G LI LSEQ+L+DC + GC GGLMD AF
Sbjct: 169 EVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDCDKFQDQGCDGGLMDNAFV 228
Query: 204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSV 263
++I+N G+ TEADYP+ GTCD + + +I +E +P E AL +AV QPVS
Sbjct: 229 FMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFERVPINYERALQKAVAHQPVSA 288
Query: 264 CVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
+EAS +AF+ Y G+ + CG DHGV VVG+G+ E G YW++KNSWG WGE+G
Sbjct: 289 SIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGS---EGGKDYWIVKNSWGTQWGEAG 345
Query: 324 YIRILRD----EGLCGIATEASYPV 344
Y+R+ R+ G CGIA E YPV
Sbjct: 346 YVRMARNVRVRAGKCGIAMEPLYPV 370
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 150/342 (43%), Positives = 203/342 (59%), Gaps = 16/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M+ + L C S V + S+ + + + EQW HG+ Y E E+ R I+++NL
Sbjct: 1 MWTYLALFTLCLSGVFAAPSL-DKQLDDHWEQWKTWHGKNYH-EKEEGWRRMIWEKNLRK 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I+ N E G TY+LG N F D+ +EEFR GY + + + S F N
Sbjct: 59 IQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGYKHK----TERKFKGSLFMEPNFL 114
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+VP+ +DWREKG VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 115 EVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRP 174
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD+AF+YI +N GL +E YPY K AA + D+P G
Sbjct: 175 EGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDDQPCHYDPKYNAANDTGFVDIPSG 234
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
EHAL++AV PVSV ++A ++F+FY+ G+ EC + DHGV VVG+G E+
Sbjct: 235 KEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEGEDV 294
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E+WG+ GYI + +D + CGIAT ASYP+
Sbjct: 295 DGKKYWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASYPL 336
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 149/340 (43%), Positives = 204/340 (60%), Gaps = 17/340 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
++ ++LV C VVS SM E QW +HG+ Y + E+A R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
K N + G+ TY LG N+F+DL NEEF A TG+ V S+ + + N+ +
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLKNEEFVAMMTGFR--VNGTSKAAKGSTFLPSNNIGE 117
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P ++DWR KG VT +K+QG CGSCWAFS ++EG GKL+ LSEQ LVDCS
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKE 177
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC GGLMD+AF+YII+ G+ TE YPY+ G C +K A G Y D+
Sbjct: 178 GNEGCDGGLMDQAFQYIIKAGGIDTEESYPYKAVDGECHFKKANIGATVTG-YTDVTSDS 236
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAEEEDG 305
E AL +AV P+SV ++AS +F+ YK GV N +C DHGV VG+GT DG
Sbjct: 237 ETALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTT--SDG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YW++KNSW ETWG +GY+ + R+ + CGIAT+ASYP+
Sbjct: 295 TDYWIVKNSWAETWGMNGYLWMSRNKDNQCGIATQASYPL 334
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 156/225 (69%), Gaps = 10/225 (4%)
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
V+D+P S+DWR+KGAVT +K+QG CGSCWAFS V +VEGI I G L+ LSEQ+L+DC
Sbjct: 1 VSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD 60
Query: 188 T-DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD---KQKEKAAAATIGKYED 243
T DN+GC GGLMD AFEYI N GL TEA YPY+ +GTC+ + I ++D
Sbjct: 61 TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQD 120
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+P E L +AV QPVSV VEASG+AF FY GV ECG DHGVAVVG+G A E
Sbjct: 121 VPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVA--E 178
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDE----GLCGIATEASYPV 344
DG YW +KNSWG +WGE GYIR+ +D GLCGIA EASYPV
Sbjct: 179 DGKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPV 223
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 137/318 (43%), Positives = 189/318 (59%), Gaps = 23/318 (7%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR--------TYKLGTNEFSDL 94
E W A+HG+ Y E+A RL F N ++ N G +Y L N F+DL
Sbjct: 43 EAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFADL 102
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKGAVTHIKNQGH 151
T+ EFRA+ G +V + PS + V VP ++DWR+ GAVT +K+QG
Sbjct: 103 THAEFRAARLGRL----AVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGS 158
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKG 210
CG+CW+FSA A+EGI +I G LI LSEQ+L+DC N GC GGLMD A+ ++I+N G
Sbjct: 159 CGACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKNGG 218
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
+ TE DYPY++ GTC+K K K TI Y D+P E +LLQAV +QP+SV + S +
Sbjct: 219 IDTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSAR 278
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
AF+ Y +G+ + C + DH V +VG+G+ E G YW++KNSWGE WG GY+ + R+
Sbjct: 279 AFQLYSQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGERWGMKGYMHMHRN 335
Query: 331 ----EGLCGIATEASYPV 344
G+CGI AS+P
Sbjct: 336 TGSSSGICGINMMASFPT 353
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 147/317 (46%), Positives = 195/317 (61%), Gaps = 16/317 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ E W HG++Y+ +E+ +RL I +N I + N E G +Y + N + DL
Sbjct: 23 VLSDWESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDL 82
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
+ EF A GY V++ S S +NV +PT +DWRE GAVT +KNQG CGS
Sbjct: 83 LHHEFVAMVNGYEY----VNKTSLGGSFIPSKNVK-LPTHVDWREDGAVTPVKNQGQCGS 137
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLA 212
CWAFS+ ++EG T GKLI LSEQ LVDCS NNGC GGLMD AF YI +NKG+
Sbjct: 138 CWAFSSTGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGID 197
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQA 271
TE YPY+ G C K ++ IG + D+ KG E LL+AV PVSV ++AS +
Sbjct: 198 TEGSYPYEGVGGRCHYDPSKKGSSDIG-FVDVKKGSEEELLKAVASVGPVSVAIDASHMS 256
Query: 272 FRFYKRGV-LNAECG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
F+FY GV ++C +N DHGV VVG+GT +E G YWL+KNSW E WG+ GYI++ R
Sbjct: 257 FQFYSHGVYFESKCSPENLDHGVLVVGYGT-DENSGEDYWLVKNSWSENWGDQGYIKMAR 315
Query: 330 D-EGLCGIATEASYPVA 345
+ + +CGIA+ ASYPV
Sbjct: 316 NKKNMCGIASSASYPVV 332
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 148/341 (43%), Positives = 207/341 (60%), Gaps = 21/341 (6%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ + + CA VV+ + + + E + A H ++Y+ +E+ +R IF +N + +
Sbjct: 1 MLRISLLCAFVVVTTAASSHEILRTQWEAFKATHKKSYQSNMEELLRFKIFSENSLLVAR 60
Query: 76 ANKEGNR---TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF---KYQNVT 129
N++ R +YKLG N+F DL EF + GY +R + R STF N +
Sbjct: 61 HNEKYARGLVSYKLGMNQFGDLLPHEFARMFNGYRG-----ARTAGRGSTFLPPANVNYS 115
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P S+DWREKGAVT +KNQG CGSCWAFS ++EG + G L+ LSEQ LVDCS
Sbjct: 116 SLPQSMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSET 175
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N+GC GGLMD AF+YI N G+ TE YPY+ E G C +K+ A G + D+ +G
Sbjct: 176 FGNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEAEDGECRFKKQNVGATDTG-FVDIEQG 234
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEED 304
E L +AV T PVSV ++AS +F+ Y GV + EC + DHGV VVG+G ED
Sbjct: 235 SEDDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETECSSEQLDHGVLVVGYGV---ED 291
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
G KYWL+KNSW E+WG++GYI++ RD + CGIA+ ASYP+
Sbjct: 292 GKKYWLVKNSWAESWGDNGYIKMSRDKDNQCGIASAASYPL 332
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 134/315 (42%), Positives = 192/315 (60%), Gaps = 18/315 (5%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR--------TYKLGTNEFSDL 94
+ W A+HG+ Y E+A RL +F N ++ N N +Y L N F+DL
Sbjct: 42 DAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFADL 101
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSIDWREKGAVTHIKNQGHCG 153
T+EEFRA+ G + R + P + VP ++DWRE GAVT +K+QG CG
Sbjct: 102 THEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGSCG 161
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLA 212
+CW+FSA A+EGI +I G L+ LSEQ+L+DC N+GC GGLMD A++++++N G+
Sbjct: 162 ACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGID 221
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYPY++ GTC+K K K TI Y D+P E LLQAV +QPVSV + S +AF
Sbjct: 222 TEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSARAF 281
Query: 273 RFY-KRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
+ Y ++G+ + C + DH V +VG+G+ E G YW++KNSWGE+WG GY+ + R+
Sbjct: 282 QLYSQQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGESWGMKGYMHMHRNT 338
Query: 331 ---EGLCGIATEASY 342
+G+CGI AS+
Sbjct: 339 GDSKGVCGINMMASF 353
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 134/314 (42%), Positives = 192/314 (61%), Gaps = 16/314 (5%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK------EGNRTYKLGTNEFSDL 94
+ E W A+HG+ Y E+A RL F +N ++ N G +Y L N F+DL
Sbjct: 38 QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSIDWREKGAVTHIKNQGHCG 153
T++EFRA+ G P S PS ++ V VP ++DWR+ GAVT +K+QG CG
Sbjct: 98 THDEFRAARLGRLAVGPGPLGAPS-PSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCG 156
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLA 212
+CW+FSA A+EGI +IT G L+ LSEQ+L+DC N GC GGLM A++++I+N G+
Sbjct: 157 ACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGID 216
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYP+++ GTC+K K K TI Y+++P E LLQAV +QP+SV + S +AF
Sbjct: 217 TEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAF 276
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+ Y +G+ + C + DH V +VG+G+ E G YW++KNSWGE WG GY+ + R+
Sbjct: 277 QLYSQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGERWGMKGYMHMHRNTG 333
Query: 331 --EGLCGIATEASY 342
G+CGI AS+
Sbjct: 334 SSSGICGINMMASF 347
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 535
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 142/307 (46%), Positives = 190/307 (61%), Gaps = 13/307 (4%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRASY 103
WM H ++ D LE A RL + N YI + N E T KL NEFS ++ EEF+
Sbjct: 32 WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAA 163
TGY P + ++ + + +V VP S+DW++KG VT +KNQG CGSCWAFS A
Sbjct: 92 TGYVMPEGYLEQRLASRVDNLWSDVQ-VPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTGA 150
Query: 164 VEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
VEG ++ GKL+ LSEQ+LVDC + + GC+GGLMD AF +I +N G+ +E DY Y+ +
Sbjct: 151 VEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKAK 210
Query: 223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNA 282
C ++ I ++D+ DEHAL AV +QPVSV +EA +AF+FYK GV N
Sbjct: 211 AQVC---RDCEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267
Query: 283 ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----GLCGIAT 338
CG DHGV VG+G+ E+G K+W +KNSWG +WGE GYIR+ R+E G CGIA+
Sbjct: 268 TCGTRLDHGVLAVGYGS---ENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIAS 324
Query: 339 EASYPVA 345
SYP A
Sbjct: 325 VPSYPFA 331
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 142/307 (46%), Positives = 190/307 (61%), Gaps = 13/307 (4%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRASY 103
WM H ++ D LE A RL + N YI + N E T KL NEFS ++ EEF+
Sbjct: 32 WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAA 163
TGY P + ++ + + +V VP S+DW++KG VT +KNQG CGSCWAFS A
Sbjct: 92 TGYVMPEGYLEQRLASRVDNLWSDVQ-VPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTGA 150
Query: 164 VEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
VEG ++ GKL+ LSEQ+LVDC + + GC+GGLMD AF +I +N G+ +E DY Y+ +
Sbjct: 151 VEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKAK 210
Query: 223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNA 282
C ++ I ++D+ DEHAL AV +QPVSV +EA +AF+FYK GV N
Sbjct: 211 AQVC---RDCEKVVKISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267
Query: 283 ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----GLCGIAT 338
CG DHGV VG+G+ E+G K+W +KNSWG +WGE GYIR+ R+E G CGIA+
Sbjct: 268 TCGTRLDHGVLAVGYGS---ENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIAS 324
Query: 339 EASYPVA 345
SYP A
Sbjct: 325 VPSYPFA 331
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 209/343 (60%), Gaps = 15/343 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ +I L+ Q+ + S+ E H + A H + Y +LE+ +R+ I+ +N
Sbjct: 1 MKQITLIFLLAAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKLRMKIYLENK 59
Query: 71 EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ K N ++G ++Y++ N+F DL + EFR+ GY + SR S + + N
Sbjct: 60 HKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPAN 119
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
V +VP S+DWREKGA+T +K+QG CGSCWAFS+ A+EG T GKL+ LSEQ L+DCS
Sbjct: 120 V-EVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCS 178
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
N GC+GGLMD+AF+YI +NKG+ TE YPY+ E G C A G + D+P
Sbjct: 179 GKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDGVCRYNPRNRGAVDRG-FVDIP 237
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRG-VLNAEC-GDNCDHGVAVVGFGTAEE 302
G+E L AV T PVSV ++AS ++F+FY +G C D+ DHGV VVG+G+
Sbjct: 238 SGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYGS--- 294
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
++G YWL+KNSW E WG+ GYI+I R+ + CG+AT ASYP+
Sbjct: 295 DNGEDYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 193/307 (62%), Gaps = 16/307 (5%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
WM +H R Y E E R FK+N+++I K N + + T LG +F+DLTNEE++ Y
Sbjct: 36 WMRKHDRAYSHE-EFTDRYQAFKENMDFIHKWNSQESDTV-LGLTKFADLTNEEYKKHYL 93
Query: 105 GYNRPVP-SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAA 163
G V +++ FK+ P SIDWREKGAV+ +K+QG CGSCW+FS A
Sbjct: 94 GIKVNVKKNLNAAQKGLKFFKFTG----PDSIDWREKGAVSQVKDQGQCGSCWSFSTTGA 149
Query: 164 VEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
VEG QI G ++ LSEQ LVDCS N GC GGLM AFEYII+N G+ATE+ YPY
Sbjct: 150 VEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIATESSYPYTA 209
Query: 222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN 281
QG C K A IG Y+++P+G+E +L A+ KQPVSV ++AS +F+ Y GV +
Sbjct: 210 AQGRCKFTKSMNGANIIG-YKEIPQGEEDSLTAALAKQPVSVAIDASHMSFQLYSSGVYD 268
Query: 282 --AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIAT 338
A + DHGV VG+GT E +D Y++IKNSWG TWG+ GYI + R+ + CG+AT
Sbjct: 269 EPACSSEALDHGVLAVGYGTLEGKD---YYIIKNSWGPTWGQDGYIFMSRNAQNQCGVAT 325
Query: 339 EASYPVA 345
ASYP++
Sbjct: 326 MASYPIS 332
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)
Query: 42 HEQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLT 95
+++WM +H + YK ++E+ R+ IF N I K N +YKL N++ D+
Sbjct: 31 NQEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDML 90
Query: 96 NEEFRASYTGYNRPVPSVSRQSSRP---STFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
+ EF G+N+ + + R P S + NV +P +DWR++GAVT +K+QGHC
Sbjct: 91 HHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVV-LPKKVDWRKEGAVTPVKDQGHC 149
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKG 210
GSCW+FSA A+EG G L+ LSEQ L+DCS NNGC+GGLMD+AF+YI +NKG
Sbjct: 150 GSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKG 209
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASG 269
L TEA YPY+ E C + A +G Y D+P GDE L AV T PVSV ++AS
Sbjct: 210 LDTEASYPYEAENDKCRYNPANSGAIDVG-YIDIPTGDEKLLKAAVATIGPVSVAIDASH 268
Query: 270 QAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
Q+F+FY GV EC + DHGV V+G+GT E+G YWL+KNSWGETWG +GYI++
Sbjct: 269 QSFQFYSEGVYYEPECSSEELDHGVLVIGYGT--NENGQDYWLVKNSWGETWGNNGYIKM 326
Query: 328 LRDE-GLCGIATEASYPVA 345
R++ CGIA+ ASYP+
Sbjct: 327 ARNKLNHCGIASSASYPLV 345
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 147/307 (47%), Positives = 183/307 (59%), Gaps = 11/307 (3%)
Query: 44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASY 103
+W A H R Y E+A+R I+ NLE I + N G +Y LG NEF DL + EF A Y
Sbjct: 23 EWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAAKY 82
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAA 163
G V+ S S+ + +P S+DWR G VT +KNQG CGSCW+FS +
Sbjct: 83 LGVR--FNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGS 140
Query: 164 VEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
VEG G L+ LSEQ LVDCS+ N GC+GGLMD AFEYII+N G+ TEA YPY
Sbjct: 141 VEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYTA 200
Query: 222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL 280
GTC K AT+ Y+D+ G E L AV T PVSV ++AS F+FY GV
Sbjct: 201 TTGTC-KFNAANIGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVY 259
Query: 281 N-AECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIA 337
N +C DHGV VG+GT+ E G YWL+KNSWG TWG++GYI + R+ + CGIA
Sbjct: 260 NEKKCSTTQLDHGVLAVGYGTSTE--GKDYWLVKNSWGATWGKAGYIWMSRNADNQCGIA 317
Query: 338 TEASYPV 344
T ASYP+
Sbjct: 318 TSASYPL 324
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 145/351 (41%), Positives = 207/351 (58%), Gaps = 24/351 (6%)
Query: 14 FVIIILVITCASQVVSG-----RSMHEPSIVEK-------HEQWMAQHGRTYKDEL-EKA 60
F+I L++ + V + R HE +++ +QWM Q+ + Y +++ E
Sbjct: 5 FLIAALLVAASGGVGAAPELQLREQHEKLLLDAKANPMAAFQQWMMQYTKAYANDIKELE 64
Query: 61 MRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVS-RQSSR 119
R +++ +NL YI N ++ L N F+DLT +EFR + GY+ S R S
Sbjct: 65 TRFSVWLENLNYILAYNAR-TTSHWLHLNAFADLTTDEFR-NRLGYDFKARQASNRLQSS 122
Query: 120 PSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELS 179
P + + +PT IDWR+KGAVT +KNQG CGSCWAF+ +VEGI I G+L LS
Sbjct: 123 PFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLS 182
Query: 180 EQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
EQ+LVDC TD + GCSGGLMD A+++II+N GL TE DYPY E G C K+ TI
Sbjct: 183 EQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVTI 242
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGF 297
Y D+P+ DE AL +A QP++V +EA ++F+ Y GV + CG + +HGV VVG+
Sbjct: 243 DGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGVYDDPTCGTSLNHGVLVVGY 302
Query: 298 GTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
G ++ YW++KNSWG WG++GYIR+ +G+CGIA S+P
Sbjct: 303 G--KDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCGIAMAPSFPT 351
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 145/308 (47%), Positives = 191/308 (62%), Gaps = 13/308 (4%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
+ W A HG +Y E+ R I++ NL++IEK N EG +YKL N+F+DLT EF A
Sbjct: 23 DSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEG-HSYKLAVNKFADLTYPEFAAK 81
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
Y G R + + +S ST+ + V+ +P S+DWR G VT IK+QG CGSCW+FS
Sbjct: 82 YLGL-RFDATNATKSFAASTYLPRMVS-LPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTG 139
Query: 163 AVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
+VEG G+L+ LSEQ LVDCS+ N GC+GGLMD+AF+YII N G+ TE+ YPY
Sbjct: 140 SVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPYT 199
Query: 221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV 279
+ GTC AT+ Y+D+ G E L AV T P+SV ++AS +F+FY GV
Sbjct: 200 AQDGTCQFNSAN-VGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGV 258
Query: 280 LN--AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGI 336
N A DHGV VG+GT+ D YWL+KNSWG +WG+SGYI + R+ CGI
Sbjct: 259 YNEPACSSSQLDHGVLAVGYGTSGSSD---YWLVKNSWGTSWGQSGYIWMTRNSNNQCGI 315
Query: 337 ATEASYPV 344
AT ASYP+
Sbjct: 316 ATAASYPL 323
>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
Length = 347
Score = 266 bits (680), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 145/344 (42%), Positives = 208/344 (60%), Gaps = 21/344 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ M V+I + + CAS R H+P + E W +G+ Y+++ ++ R I+++N
Sbjct: 13 LLRMKVVIWMFLACASTTAYLR--HDPMLDNHWELWKKTYGKQYEEQNQEVTRRLIWEKN 70
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
L+++ N E G +Y L N SD+T+EE + + P Q SR +T++
Sbjct: 71 LKFVTLHNLEHSMGLHSYDLSMNHLSDMTSEEVASLMSSLRIP-----NQWSRNTTYRLN 125
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+ +P S+DWR+KG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDC
Sbjct: 126 SNQKLPDSVDWRDKGCVTEVKYQGTCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDC 185
Query: 187 STD----NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
ST+ N+GC+GG M +AF+YII+N G+ ++A YPY+ + G C + AAT +Y
Sbjct: 186 STNEKYENHGCNGGCMTEAFQYIIDNNGIDSDASYPYKAKDGKC-QYNPANRAATCSRYT 244
Query: 243 DLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTA 300
+LP G E AL +AV K PVSV ++AS +F YK GV + C N +HGV V G+G
Sbjct: 245 ELPYGSEDALKEAVANKGPVSVGIDASLPSFFLYKSGVYYDPSCTQNVNHGVLVTGYGNL 304
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
DG YWL+KNSWG ++G+ GYIRI R+ G CGIA SYP
Sbjct: 305 ---DGKDYWLVKNSWGLSFGDKGYIRIARNRGNHCGIANFPSYP 345
>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
Length = 372
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/332 (44%), Positives = 198/332 (59%), Gaps = 23/332 (6%)
Query: 23 CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR 82
C V +G S H H + YK +E+ R+ IF N I + N++
Sbjct: 55 CCGSVFAGSSCHR-----------THHKKVYKSPIEEGYRMKIFLDNKRKIVEHNRKYEM 103
Query: 83 ---TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWRE 139
YKLG N++ D+ + E + G+N+ V +VS + +TF ++P S+DWR+
Sbjct: 104 KEVNYKLGMNKYGDMLHHELINTLNGFNKSV-TVSEEQLIGATFIEPANVELPKSVDWRK 162
Query: 140 KGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGL 197
KGAVT IK+QG CGSCWAFS+ A+EG G L+ LSEQ L+DCS NNGC+GGL
Sbjct: 163 KGAVTAIKDQGQCGSCWAFSSTGALEGQHFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGL 222
Query: 198 MDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV- 256
MD AF YI ENKGL TE YPY+ E C + + A+ +G + D+P+GDE L AV
Sbjct: 223 MDYAFRYIKENKGLDTEKSYPYEAENDQCRYNPKNSGASDVG-FVDIPEGDEDKLKAAVA 281
Query: 257 TKQPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNS 314
T P+SV ++AS ++F FY GV EC N DHGV +VG+GT + G YWL+KNS
Sbjct: 282 TIGPISVAIDASHESFHFYSEGVYYEPECSPANLDHGVLIVGYGT-DSGTGEDYWLVKNS 340
Query: 315 WGETWGESGYIRILRD-EGLCGIATEASYPVA 345
WGETWGE GYI++ R+ E CGIA+ ASYP+
Sbjct: 341 WGETWGEKGYIKMARNKENHCGIASSASYPLV 372
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 206/340 (60%), Gaps = 21/340 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F+I+ +++ AS ++ + + + + + H + Y+ +A R IF QN I
Sbjct: 8 FLILAVLVGAASAALTLEQLFDA----EWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 63
Query: 74 EKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ N +G TYKL N+F D+ + EF ++ G R S ++ ST+
Sbjct: 64 ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR-----SNRTYFGSTWIEPESVS 118
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P S+DWREKGAVT +KNQGHCGSCW+FS A+EG G+L+ LSEQ L+DCST
Sbjct: 119 LPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSY 178
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLMD AF YI EN G+ TE YPY+ +QG C KE +A G + D+P G+
Sbjct: 179 GNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDSAGRDTG-FVDIPSGN 237
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAECGD--NCDHGVAVVGFGTAEEEDG 305
E AL +A+ T PVSV ++AS ++F+FY GV N D + DHGV VG+GT +DG
Sbjct: 238 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTT--DDG 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
Y++IKNSWGE WG+ GY+ + R+ + CG+AT+ASYP+
Sbjct: 296 QDYYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 335
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/323 (45%), Positives = 204/323 (63%), Gaps = 19/323 (5%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE---KANKEGNRTYKLGTNEFSD 93
+I + ++W+A HG+ Y E+A RL IF N E++ +A+ G +++ L N +D
Sbjct: 65 TIEARFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLAD 124
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRP----STFKYQNVTDVPTSIDWREKGAVTHIKNQ 149
LT EEF+ GY+ V +SS P + ++Y +VT P ++DW +GAVT +KNQ
Sbjct: 125 LTREEFK-HMLGYDASKKRV--ESSSPPVDAANWEYADVTP-PETMDWVSRGAVTPVKNQ 180
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIE 207
G CGSCWAFS V AVEG+ + G LI LSEQ+LV C+ NNGC GGLMD FE+I+E
Sbjct: 181 GQCGSCWAFSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIVE 240
Query: 208 NKGLATEADYPYQQEQGTCDK-QKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVE 266
N+G+ E D+ Y + C+ +K +A AA+I ++D+P+ DE AL +AV++QPV+V +E
Sbjct: 241 NRGVDDEEDWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAIE 300
Query: 267 ASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK-YWLIKNSWGETWGESGYI 325
A + F+ Y GV + ECG N DHGV VVG+G E G K YW +KNSWG WGE GYI
Sbjct: 301 ADHREFQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGYI 360
Query: 326 RILR----DEGLCGIATEASYPV 344
RI R G CG+A +ASYP
Sbjct: 361 RIARGGMGPAGQCGVAMQASYPT 383
>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 356
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 143/323 (44%), Positives = 190/323 (58%), Gaps = 26/323 (8%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+HE+WMA+ GR Y D EKA R +F N Y++ N+ GNRTY LG N+FSDLT++EF
Sbjct: 38 RHEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFV 97
Query: 101 ASYTGYN-------RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
++ GY RP S+ + Y D+P S+DWR +GAVT +KNQG CG
Sbjct: 98 QTHLGYRGHQQGGLRPE---EENVSKVAALGYGQA-DMPESVDWRAQGAVTGVKNQGSCG 153
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS------TDNNGCSGGLMDKAFEYIIE 207
CWAF+AVAA EG+ +I G LI +SEQQ++DC+ + N C GG +D A Y+
Sbjct: 154 CCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAA 213
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP-KGDEHALLQAVTKQPVSVCVE 266
++GL EA Y Y QG C +AA+ G+ + + +GDE L V QP++V VE
Sbjct: 214 SRGLQPEAAYAYTGLQGACQSGFTPNSAASFGEPQTVTLQGDEGRLQGLVAGQPIAVSVE 273
Query: 267 ASGQAFRFYKRGVLNA---ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
AS FR Y GV A CG +H V VVG+G+A + G +YWL+KN WG +WGE G
Sbjct: 274 AS-DDFRHYMSGVFTAGTSSCGQRLNHAVTVVGYGSA--DGGQEYWLVKNQWGTSWGEGG 330
Query: 324 YIRILRDEGL--CGIATEASYPV 344
Y+RI R G CGI+ A YP
Sbjct: 331 YMRIARGNGAPNCGISAYAYYPT 353
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 206/340 (60%), Gaps = 21/340 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F+I+ +++ AS ++ + + + + + H + Y+ +A R IF QN I
Sbjct: 3 FLILAVLVGAASAALTLEQLFDA----EWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 58
Query: 74 EKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ N +G TYKL N+F D+ + EF ++ G R S ++ ST+
Sbjct: 59 ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR-----SNRTYFGSTWIEPESVS 113
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P S+DWREKGAVT +KNQGHCGSCW+FS A+EG G+L+ LSEQ L+DCST
Sbjct: 114 LPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSY 173
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLMD AF YI EN G+ TE YPY+ +QG C KE +A G + D+P G+
Sbjct: 174 GNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDSAGRDTG-FVDIPSGN 232
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAECGD--NCDHGVAVVGFGTAEEEDG 305
E AL +A+ T PVSV ++AS ++F+FY GV N D + DHGV VG+GT +DG
Sbjct: 233 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTT--DDG 290
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
Y++IKNSWGE WG+ GY+ + R+ + CG+AT+ASYP+
Sbjct: 291 QDYYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 330
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 192/311 (61%), Gaps = 12/311 (3%)
Query: 43 EQWMAQHGRTY-KDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
++W H R+Y D E R ++ +NLEY+ N ++ L N +DL+ E+++
Sbjct: 14 KEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNAR-TTSHWLTLNHLADLSTPEYKS 72
Query: 102 SYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
G++ V+R + + F+Y++V +P +IDWR+K AV +KNQG CGSCWAF+
Sbjct: 73 KLLGFDNQA-RVARNKLK-TGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFA 130
Query: 160 AVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYP 218
+VEGI I G L+ LSEQ+LVDC T+ + GCSGGLMD A+ +II+NKG+ TE DYP
Sbjct: 131 TTGSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKGINTEEDYP 190
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRG 278
Y G CD K K TI YED+P+ DE AL +A QPV+V +EA ++F+ Y G
Sbjct: 191 YTAMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGG 250
Query: 279 VL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI----LRDEGL 333
V + CG + +HGV VVG+G G+ YW++KNSWG WG++GYIR+ EGL
Sbjct: 251 VYDDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAEGL 310
Query: 334 CGIATEASYPV 344
CGIA SYPV
Sbjct: 311 CGIAMAPSYPV 321
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 134/307 (43%), Positives = 191/307 (62%), Gaps = 10/307 (3%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+ E W A+HGR+Y E+A RL F N ++ A+ +Y L N F+DLT++EFR
Sbjct: 37 QFEAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFR 95
Query: 101 ASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
A+ G R P V VP ++DWR+ GAVT +K+QG CG+CW+FSA
Sbjct: 96 AARLGRLAAA-GPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSA 154
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
A+EGI +I G LI LSEQ+L+DC N+GC GGLMD A++++++N G+ TEADYPY
Sbjct: 155 TGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPY 214
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
++ GTC+K K K TI Y+D+P +E LLQAV +QPVSV + S +AF+ Y +G+
Sbjct: 215 RETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGI 274
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCG 335
+ C + DH + +VG+G+ E G YW++KNSWGE+WG GY+ + R+ G+CG
Sbjct: 275 FDGPCPTSLDHAILIVGYGS---EGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCG 331
Query: 336 IATEASY 342
I S+
Sbjct: 332 INQMPSF 338
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 140/345 (40%), Positives = 195/345 (56%), Gaps = 14/345 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
K + +II + ++ A G S + + +E+ + WM +H + Y+ EK R
Sbjct: 9 KIIFLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+ Y G+ +
Sbjct: 69 FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVGF-VAEDFTGLEHFDNED 126
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
F Y++VT+ P SIDWR KGAVT +KNQG CGSCWAFS +A VEGI +I G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQE 186
Query: 183 LVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
LVDC + GC GG + +Y + N G+ T YP Q +Q C + I Y+
Sbjct: 187 LVDCDKHSYGCKGGYQTTSLQY-VANNGVHTSKVYPCQAKQYKCRATDKPGPKVKITGYK 245
Query: 243 DLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEE 302
+P E + L A+ QP+S VEA G+ F+ YK GV + CG DH V VG+GT+
Sbjct: 246 RVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTS-- 303
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYP 343
DG Y +IKNSWG WGE GY+R+ R +G CG+ + YP
Sbjct: 304 -DGKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 266 bits (679), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 148/316 (46%), Positives = 184/316 (58%), Gaps = 25/316 (7%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ E +E+W QH R +D EKA R +FK N+ I + N+ + YKL N F D+
Sbjct: 41 EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGS 154
T +E +Y SSR S + R GAV +K+QG CGS
Sbjct: 99 TADESAGAYA------------SSRVSHHRMFRGRGEKAQ---RLHGAVGAVKDQGQCGS 143
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLA 212
CWAFS +AAVEGI I L LSEQQLVDC T N GC GGLMD AF+YI ++ G+A
Sbjct: 144 CWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVA 203
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
+ YPY+ Q +C + A TI YED+P E AL +AV QPVSV +EA G F
Sbjct: 204 ASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHF 263
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+FY GV +CG DHGVA VG+GT DG KYW+++NSWG WGE GYIR+ RD
Sbjct: 264 QFYSEGVFAGKCGTELDHGVAAVGYGTT--VDGTKYWIVRNSWGADWGEKGYIRMKRDVS 321
Query: 331 --EGLCGIATEASYPV 344
EGLCGIA EASYP+
Sbjct: 322 AKEGLCGIAMEASYPI 337
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/378 (38%), Positives = 209/378 (55%), Gaps = 45/378 (11%)
Query: 9 FIIPMFVII--ILVITCAS----QVVSGRSMH---EP---SIVEKHEQWMAQHGRTYKDE 56
F +P +I+ + I C+S +V S + + EP +++E ++W A++ R+Y
Sbjct: 7 FSMPCLLILLGVFFIGCSSGTARRVTSDTAANTDGEPAATTMMEMFQRWKAEYNRSYATP 66
Query: 57 LEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR- 115
E+ RL ++ +N+ YIE N Y+LG ++DLTN+EF A YT P+ S +
Sbjct: 67 EEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTNDEFMAMYTA--PPLRSAADD 124
Query: 116 ------------------QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
+ +P + + P S+DWR GAVT +K+QG CGSCWA
Sbjct: 125 DDDAATTTIITTRAGPVDEHQQPEVY-FNESAGAPASVDWRASGAVTEVKDQGRCGSCWA 183
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADY 217
FS VA VEGI +I GKL+ LSEQ+LVDC T ++GC GG+ +A E+I N G+ T DY
Sbjct: 184 FSTVAVVEGIQKIKKGKLVSLSEQELVDCDTLDSGCDGGVSYRALEWITANGGITTRDDY 243
Query: 218 PYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
PY CD+ K AATI + E +L A QPV+V +EA G F+ Y+
Sbjct: 244 PYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNAAAAQPVAVSIEAGGDNFQHYR 303
Query: 277 RGVLNAECGDNCDHGVAVVGFGTAE-----EEDGAKYWLIKNSWGETWGESGYIRILRD- 330
+GV + CG +HGV VVG+G E G KYW+IKNSWG+ WG+ GYI++ +D
Sbjct: 304 KGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIKNSWGKNWGDQGYIKMKKDV 363
Query: 331 ----EGLCGIATEASYPV 344
EGLCGIA S+P+
Sbjct: 364 AGKPEGLCGIAIRPSFPL 381
>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 202/315 (64%), Gaps = 10/315 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ + +E+W +H ++ EK R ++FK+N+ ++ N+ ++ YKL N+F+D+
Sbjct: 34 EESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91
Query: 95 TNEEFRASYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
+N EF Y N + + F Y+ TD+P+S+D RE+GAV +K QG CG
Sbjct: 92 SNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDGRERGAVNAVKEQGRCG 151
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLAT 213
SCWAFS+VAAVEGI +I +L+ LSEQ+L+DC+ N GC+GG M+ AF++I N G+AT
Sbjct: 152 SCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYRNKGCNGGFMEIAFDFIKRNGGIAT 211
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
E YPY +G C + + I YE +P+ +E AL+QAV QPVSV ++A+G+ F+
Sbjct: 212 ENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDAAGRDFQ 270
Query: 274 FYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD--- 330
FY +GV + CG +HGV +G+GT EDG YWL++NSWG WGE GY+R+ R
Sbjct: 271 FYSQGVFDGYCGTELNHGVVAIGYGTT--EDGTDYWLVRNSWGVGWGEDGYVRMKRGVEQ 328
Query: 331 -EGLCGIATEASYPV 344
EGLCGIA EASYP+
Sbjct: 329 AEGLCGIAMEASYPI 343
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/310 (44%), Positives = 192/310 (61%), Gaps = 26/310 (8%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+W+ ++ + Y EK R IFK+NL++I++ N N+T+++G F+DLTN+E
Sbjct: 2 YERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDE--- 58
Query: 102 SYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
P ++ R + Y+ +P IDWR KGAV +K+QG+CGSCWAFSAV
Sbjct: 59 ---------PKDFMKADR---YLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWAFSAV 106
Query: 162 AAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
AVEGI QI G+LI LS+Q+L+DC N GC GG+M+ AFE+II N G+ ++ DYPY
Sbjct: 107 GAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYPY 166
Query: 220 Q-QEQGTCD-KQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKR 277
+ G C+ +K I YE + + DE +L +AV QPV V +EAS QAF+ YK
Sbjct: 167 TATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYKS 226
Query: 278 GVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGL 333
GV CG DHGV VVG+GT+ ED YW+I+NSWG WGE+GY+++ R+ G
Sbjct: 227 GVFTGTCGIYLDHGVVVVGYGTSSGED---YWIIRNSWGLNWGENGYVKLQRNIDDSFGK 283
Query: 334 CGIATEASYP 343
CG+A SYP
Sbjct: 284 CGVAMMPSYP 293
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 147/343 (42%), Positives = 208/343 (60%), Gaps = 15/343 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ +I L+ Q+ + S+ E H + A H + Y +LE+ R+ I+ +N
Sbjct: 1 MKQITLIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENK 59
Query: 71 EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ K N ++G ++Y++ N+F DL + EFR+ GY + SR S + + N
Sbjct: 60 HKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPAN 119
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
V +VP S+DWREKGA+T +K+QG CGSCWAFS+ A+EG T GKLI LSEQ L+DCS
Sbjct: 120 V-EVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCS 178
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
N GC+GGLMD+AF+YI +NKG+ TE YPY+ E C A G + D+P
Sbjct: 179 GKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRG-FVDIP 237
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEE 302
G+E L AV T PVSV ++AS ++F+FY +GV C D+ DHGV VVG+G+
Sbjct: 238 SGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS--- 294
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
++G YWL+KNSW E WG+ GYI+I R+ + CG+AT ASYP+
Sbjct: 295 DNGKDYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 148/341 (43%), Positives = 200/341 (58%), Gaps = 19/341 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
++ +LVI + VS + ++ E W HG+TY +E+ +RL I+ +N I
Sbjct: 6 LLLSVLVIASTANAVSFFDV----VLSDWESWKLMHGKTYSSSIEEKLRLKIYMENSLKI 61
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ N E G Y + N + DL + EF A GY ++ +S T+
Sbjct: 62 SRHNSEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQY----ANKTASLGGTYIPNKNIQ 117
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+PT +DWRE+GAVT +KNQG CGSCW+FSA A+EG GKLI LSEQ LVDCS
Sbjct: 118 LPTHVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKF 177
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLMD AF YI +NKG+ TEA YPY+ G C + + IG + D+ KG
Sbjct: 178 GNNGCEGGLMDFAFTYIRDNKGIDTEASYPYEGIDGHCHYNPKNKGGSDIG-FVDIKKGS 236
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEEEDG 305
E L +AV P+SV ++AS +F+FY GV + ++C + DHGV VVGFGT + G
Sbjct: 237 EKDLKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEELDHGVLVVGFGT-DSVSG 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
YWL+KNSW E WG+ GYI++ R+ E +CGIA+ ASYPV
Sbjct: 296 EDYWLVKNSWSEKWGDQGYIKMARNKENMCGIASSASYPVV 336
>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 533
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 140/307 (45%), Positives = 193/307 (62%), Gaps = 13/307 (4%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRASY 103
WM+ HG T+ D LE A RL + N YI + N E T KLG N FS ++ +EF+
Sbjct: 31 WMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHNAFSHMSFDEFKFKM 90
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAA 163
TG P + ++ + + +V +VP+++DW +KG VT +KNQG CGSCWAFS A
Sbjct: 91 TGLVLPEGYLEQRLASRVDGLWSDV-EVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTGA 149
Query: 164 VEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
VEG T ++ GKL+ LSEQ+LVDC + + GC+GGLMD AF++I ++ G+ +E DY Y+ +
Sbjct: 150 VEGATFVSSGKLLSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKAK 209
Query: 223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNA 282
C K + + ++D+ DEHAL AV +QPVSV +EA +AF+FYK GV N
Sbjct: 210 AQVCRKCD---SVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266
Query: 283 ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----GLCGIAT 338
CG DHGV VG+G ++G K+W +KNSWG +WGE GYIR+ R+E G CGIA+
Sbjct: 267 TCGTRLDHGVLAVGYGN---DNGQKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIAS 323
Query: 339 EASYPVA 345
SYP A
Sbjct: 324 VPSYPFA 330
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 199/342 (58%), Gaps = 16/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M V + C S V + ++ + + EQW HG+ Y E E+ R ++++NL+
Sbjct: 1 MRVFLAAFALCLSAVFAAPTL-DKQLDNHWEQWKNWHGKKYH-EKEEGWRRMVWEKNLQK 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G TY+LG N F D+T+EEFR GY + R S F N
Sbjct: 59 IELHNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMNGYKHK----KERRFRGSLFMEPNFL 114
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+VP S+DWREKG VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 115 EVPNSLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSRP 174
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD+AF+YI + GL +E YPY K +AA + D+P G
Sbjct: 175 EGNEGCNGGLMDQAFQYIKDQNGLDSEESYPYVGTDDQPCHYDPKYSAANDTGFVDIPSG 234
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
EHAL++A+ PVSV ++A ++F+FY+ G+ EC + DHGV VG+G E+
Sbjct: 235 KEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG+ GY+ + +D CGIAT ASYP+
Sbjct: 295 DGKKYWIVKNSWSENWGDKGYVYMAKDRHNHCGIATAASYPL 336
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 126/233 (54%), Positives = 164/233 (70%), Gaps = 12/233 (5%)
Query: 120 PSTFKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIE 177
P+ F+Y+NV+ +PT+IDWR KGAVT IK+QG CG CWAFSAVAA EGI +I+ GKL+
Sbjct: 4 PTGFRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVS 63
Query: 178 LSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAA 235
L+EQ+LVDC ++ GC GGLMD AF++II+N GL TE+ YPY G C + +A
Sbjct: 64 LAEQELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKC--KSGSNSA 121
Query: 236 ATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVV 295
ATI YED+P DE AL++AV QPVSV V+ F+FY GV+ CG + DHG+A +
Sbjct: 122 ATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAI 181
Query: 296 GFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
G+G + DG KYWL+KNSWG TWGE+GY+R+ +D G+CG+A E SYP
Sbjct: 182 GYG--KTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 232
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 143/309 (46%), Positives = 195/309 (63%), Gaps = 17/309 (5%)
Query: 47 AQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASY 103
A+HG++Y E E+ RL I+ +N I K N++ G Y + NEF D+ + EF ++
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
G+ R R+ S + + +N+ D +P ++DWR KGAVT +KNQG CGSCWAFSA
Sbjct: 92 NGFKRNYKDQPREGS--TYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSAT 149
Query: 162 AAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
++EG G ++ LSEQ LVDCSTD NNGC GGLMD AF+YI NKG+ TE YPY
Sbjct: 150 GSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPY 209
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRG 278
GTC +K A G + D+ +G E L +AV T P+SV ++AS ++F+FY G
Sbjct: 210 NGTDGTCHFKKSTVGATDSG-FVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268
Query: 279 VLN-AEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCG 335
V + EC ++ DHGV VVG+GT +G YWL+KNSWG TWG+ GYIR+ R+ + CG
Sbjct: 269 VYDEPECDSESLDHGVLVVGYGTL---NGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCG 325
Query: 336 IATEASYPV 344
IA+ ASYP+
Sbjct: 326 IASSASYPL 334
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 143/316 (45%), Positives = 201/316 (63%), Gaps = 18/316 (5%)
Query: 42 HEQWMAQH---GRTYKDEL-EKAMRLTIFKQNLEYIE--KANKEGNRTYKLGTNEFSDLT 95
++ W+A+H G ++ + E R +F NL++++ A+ +G+ ++LG N F+DLT
Sbjct: 66 YDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGMNRFADLT 125
Query: 96 NEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV-THIKNQGHCGS 154
N+EFRA+Y G R +++ V +P S+DWR+KGAV + +KNQG CGS
Sbjct: 126 NDEFRAAYLG----TTPAGRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGS 181
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAAVEGI +I G+L+ LSEQ+LV+C+ N+GC+GG+MD AF +I N GL
Sbjct: 182 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGGNSGCNGGIMDDAFAFITRNGGLD 241
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TE DYPY G CD K+ +I +ED+P+ DE +L +AV QPVSV ++A G+ F
Sbjct: 242 TEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 301
Query: 273 RFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
+ Y GV CG + DHGV VG+GT + G YW ++NSWG WGE+GYIR+ R+
Sbjct: 302 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 360
Query: 331 --EGLCGIATEASYPV 344
G CGIA ASYP+
Sbjct: 361 ARTGKCGIAMMASYPI 376
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 146/329 (44%), Positives = 202/329 (61%), Gaps = 24/329 (7%)
Query: 32 SMHEPSIV-------EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GN 81
S H PS++ + EQ+ + GR Y + R +IF+ NL++I + N + G+
Sbjct: 16 SAHIPSMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGD 75
Query: 82 RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
T+ + N F+DL+NEEFRA++ GY R ++ S S +V +P ++DW KG
Sbjct: 76 STFSVSVNNFTDLSNEEFRATFNGYRR----LAAVSLADSVHADNDVEALPATVDWTTKG 131
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMD 199
VT IKNQ CGSCWAFSAVA++EG + GKL+ LSEQ LVDCS + GCSGG MD
Sbjct: 132 VVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMD 191
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK- 258
AF+Y+I+N+G+ TEA YPY+ +C+ K + ATI + D+ GDE AL AV
Sbjct: 192 YAFKYVIQNRGIDTEASYPYKAIDESCEF-KRNSVGATIHSFVDVKTGDESALQNAVASI 250
Query: 259 QPVSVCVEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWG 316
P+SV ++A+ +F+FY GV N +C DHGV VG+GT +GA YW +KNSWG
Sbjct: 251 GPISVAIDAAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL---NGAPYWKVKNSWG 307
Query: 317 ETWGESGYIRILRD-EGLCGIATEASYPV 344
+WG GYI + R+ + CGIAT+ASYPV
Sbjct: 308 TSWGRKGYIFMSRNKQNQCGIATKASYPV 336
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 188/311 (60%), Gaps = 12/311 (3%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
K WM + + LE R +F N + IE NK+ + ++ +G NE+S LT +EF+
Sbjct: 27 KFLSWMKKFAVKL-NPLEWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFK 85
Query: 101 ASYTGYNRPVPSVSRQSSRPSTFKYQ-NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
TG R PS + ++ + N+TDVP +DW E+G VT +KNQG CGSCWAFS
Sbjct: 86 KLRTGL-RVSPSYIQSRAKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFS 144
Query: 160 AVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYP 218
A+EG ++ +L+ +SEQ+LVDC + + GC+GGLMD AF+++ +KGL E DYP
Sbjct: 145 TTGAIEGAAFVSSKQLVSVSEQELVDCDHNGDMGCNGGLMDNAFKWVKTHKGLCKEEDYP 204
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRG 278
Y ++GTC +K K + + D+P DE AL AV KQPVSV +EA F+FYK G
Sbjct: 205 YHAKEGTCALKKCK-PVTKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSG 263
Query: 279 VLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLC 334
V + CG DHGV VVG+G EE G KYW +KNSWG WG+ GYI++ R + G C
Sbjct: 264 VFDKSCGTKLDHGVLVVGYG---EEGGKKYWKVKNSWGADWGDKGYIKLAREFGPETGQC 320
Query: 335 GIATEASYPVA 345
G+A SYP A
Sbjct: 321 GVAMVPSYPTA 331
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 146/329 (44%), Positives = 202/329 (61%), Gaps = 24/329 (7%)
Query: 32 SMHEPSIV-------EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GN 81
S H PS++ + EQ+ + GR Y + R +IF+ NL++I + N + G+
Sbjct: 16 SAHIPSMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGD 75
Query: 82 RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
T+ + N F+DL+NEEFRA++ GY R ++ S S +V +P ++DW KG
Sbjct: 76 STFSVSVNNFTDLSNEEFRATFNGYRR----LAAVSLADSVHADNDVEALPATVDWTTKG 131
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMD 199
VT IKNQ CGSCWAFSAVA++EG + GKL+ LSEQ LVDCS + GCSGG MD
Sbjct: 132 VVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMD 191
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK- 258
AF+Y+I+N+G+ TEA YPY+ +C+ K + ATI + D+ GDE AL AV
Sbjct: 192 YAFKYVIQNRGIDTEASYPYKAIDESCEF-KRNSIGATIHSFVDVKTGDESALQNAVASI 250
Query: 259 QPVSVCVEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWG 316
P+SV ++AS +F+FY GV N +C DHGV VG+GT +G YW +KNSWG
Sbjct: 251 GPISVAIDASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL---NGVPYWKVKNSWG 307
Query: 317 ETWGESGYIRILRD-EGLCGIATEASYPV 344
+WG+ GYI + R+ + CGIAT+ASYPV
Sbjct: 308 TSWGQKGYIFMSRNKQNQCGIATKASYPV 336
>gi|125526835|gb|EAY74949.1| hypothetical protein OsI_02845 [Oryza sativa Indica Group]
Length = 360
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 189/321 (58%), Gaps = 14/321 (4%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLT 95
S+ +HE+WMA+ GR Y D EKA R+ +F N E ++ AN+ G +RTY LG N+FSDLT
Sbjct: 38 SMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDLT 97
Query: 96 NEEFRASYTGYN-RPVPSVSRQSSRP---STFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
++EF ++ GY+ P P R R + + TDVP S+DWR +GAVT +KNQ
Sbjct: 98 DDEFARTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQRS 157
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGL 211
CGSCWAF+AVAA EG+ Q+ G L+ LSEQQ++DC+ N CSGG + A YI + GL
Sbjct: 158 CGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDCTGGANTCSGGDVSAALRYIAASGGL 217
Query: 212 ATEADYPYQQEQGTCDK---QKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
TEA Y Y +QG C +AAA G GDE AL QPV V VEAS
Sbjct: 218 QTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARLYGDEGALQALAAGQPVVVVVEAS 277
Query: 269 GQAFRFYKRGVL--NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
FR Y+ GV +A CG +H V VV A + G +YWL+KN WG WGE GY+R
Sbjct: 278 EPDFRHYRSGVYAGSAACGRRLNHAVTVV-GYGAAADGGGEYWLVKNQWGTWWGEGGYMR 336
Query: 327 ILRD---EGLCGIATEASYPV 344
+ R G CGIAT A YP
Sbjct: 337 VARGGAAGGNCGIATYAFYPT 357
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 153/344 (44%), Positives = 204/344 (59%), Gaps = 23/344 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNL 70
F+I + + SQ VS + + EQW A H + Y+ + E+ R+ IF +N
Sbjct: 3 FLIFLAICVAGSQAVSFFDLVQ-------EQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSV-SRQSSRPSTFKYQ 126
+ K NK +G ++KLG N+++D+ + EF G+NR + S +S TF
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+P IDWR+KGAVT +K+QG CGSCW+FSA ++EG GKL+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S NNGC+GGLMD AF YI N G+ TE YPY+ E C K K AT Y D+
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC-HYKPKNKGATDRGYVDI 234
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECG-DNCDHGVAVVGFGTAE 301
G+E L AV T PVSV ++AS Q+F+ Y GV EC DHGV VVG+GT
Sbjct: 235 ESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGT-- 292
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
E+DG YWL+KNSWG++WG+ GYI++ R+ + CGIATEASYP+
Sbjct: 293 EDDGTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336
>gi|115438530|ref|NP_001043562.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|11034572|dbj|BAB17096.1| cysteine proteinase-like [Oryza sativa Japonica Group]
gi|113533093|dbj|BAF05476.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|215697766|dbj|BAG91959.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 360
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 149/321 (46%), Positives = 189/321 (58%), Gaps = 14/321 (4%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLT 95
S+ +HE+WMA+ GR Y D EKA R+ +F N E ++ AN+ G +RTY LG N+FSDLT
Sbjct: 38 SMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDLT 97
Query: 96 NEEFRASYTGYN-RPVPSVSRQSSRP---STFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
++EF ++ GY+ P P R R + + TDVP S+DWR +GAVT +KNQ
Sbjct: 98 DDEFAQTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQRS 157
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGL 211
CGSCWAF+AVAA EG+ Q+ G L+ LSEQQ++DC+ N CSGG + A YI + GL
Sbjct: 158 CGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDCTGGANTCSGGDVSAALRYIAASGGL 217
Query: 212 ATEADYPYQQEQGTCDK---QKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
TEA Y Y +QG C +AAA G GDE AL QPV V VEAS
Sbjct: 218 QTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARLYGDEGALQALAAGQPVVVVVEAS 277
Query: 269 GQAFRFYKRGVL--NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
FR Y+ GV +A CG +H V VV A + G +YWL+KN WG WGE GY+R
Sbjct: 278 EPDFRHYRSGVYAGSAACGRRLNHAVTVV-GYGAAADGGGEYWLVKNQWGTWWGEGGYMR 336
Query: 327 ILRD---EGLCGIATEASYPV 344
+ R G CGIAT A YP
Sbjct: 337 VARGGAAGGNCGIATYAFYPT 357
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 149/343 (43%), Positives = 206/343 (60%), Gaps = 19/343 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
MF +++L + C + +S S+ +P + E W H + Y E E+ R ++++NL+
Sbjct: 1 MFPVVVLAL-CVTAALSAPSL-DPQLDEHWNLWKDWHSKKYH-EKEEGWRRMVWEKNLKK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G TY LG N F D+T+EEFR GY S++ R S F N
Sbjct: 58 IELHNLEHSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLK----SQRKLRGSLFMEPNFL 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+ P S+DWR+KG VT +K+QG CGSCWAFS A+EG G L+ LSEQ LVDCS
Sbjct: 114 EAPRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRP 173
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD+AF+YI +N GL +E YPY ++G C +A G + D+P
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPCHYDPSYNSANDTG-FVDVPS 232
Query: 247 GDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEE 302
G E AL++AV PVSV ++A ++F+FY G+ + EC + DHGV VVG+G ++
Sbjct: 233 GSERALMKAVASVGPVSVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEGKD 292
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG+ GYI + +D + CGIAT ASYP+
Sbjct: 293 VDGKKYWIVKNSWSENWGDKGYIYMAKDKKNHCGIATAASYPL 335
>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
Length = 401
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 190/322 (59%), Gaps = 20/322 (6%)
Query: 36 PSIVEKHEQ-----WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE--GNRTYKLGT 88
P VE EQ WM H ++Y + R I+K N +I NK+ ++ +
Sbjct: 84 PRDVELEEQRAFTEWMRTHRKSYHHD-HFLPRFEIWKTNNRWITHWNKKHANASSFTVAI 142
Query: 89 NEFSDLTNEEFRASYTGYNR-PVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F DLT++EF Y G + P S + RP ++ N +P S DWR+KG V+ +K
Sbjct: 143 NQFGDLTSDEFNRLYNGLHVFSAPKASEKVERPR--QWANTAGIPESGDWRQKGVVSRVK 200
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST---DNNGCSGGLMDKAFEY 204
+QG CGSCWAFS + EGI IT +L+ LSEQ LVDC+T DN GC+GG MD AF Y
Sbjct: 201 DQGMCGSCWAFSTTGSTEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMDNAFRY 260
Query: 205 IIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVC 264
II+NKG+ +EA YPY G C + G + LPKGDE ALL A +QP+SV
Sbjct: 261 IIDNKGIDSEASYPYVAADGQCRFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQPISVG 320
Query: 265 VEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
++A +F+FY +GV N EC +HGV +VG+G E G YWL+KNSWG+TWG
Sbjct: 321 IDAGRPSFQFYSKGVYNEPECSSTELNHGVLIVGWGV---ERGQAYWLVKNSWGQTWGMD 377
Query: 323 GYIRILRDE-GLCGIATEASYP 343
GYI++ RD+ CGIAT ASYP
Sbjct: 378 GYIKMSRDKNNQCGIATLASYP 399
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 153/344 (44%), Positives = 203/344 (59%), Gaps = 23/344 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNL 70
F+I + + SQ VS + + EQW A H + Y+ E E+ R+ IF +N
Sbjct: 3 FLIFLAICVAGSQAVSFFDLVQ-------EQWGAFKMTHNKQYQSETEERFRMKIFMENS 55
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSV-SRQSSRPSTFKYQ 126
+ K NK +G ++KLG N+++D+ + EF G+NR + S +S TF
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+P IDWR+KGAVT +K+QG CGSCW+FSA ++EG GKL+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC 175
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S NNGC+GGLMD AF YI N G+ TE YPY+ E C K K AT Y D+
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC-HYKPKNKGATDRGYVDI 234
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAE 301
G+E L AV T PVSV ++AS Q+F+ Y GV +C DHGV VVG+GT
Sbjct: 235 ESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGT-- 292
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
E+DG YWL+KNSWG++WG+ GYI++ R+ CGIATEASYP+
Sbjct: 293 EDDGTDYWLVKNSWGKSWGDQGYIKMARNRNNNCGIATEASYPL 336
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 152/344 (44%), Positives = 204/344 (59%), Gaps = 23/344 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNL 70
F+I + + SQ VS + + EQW A H + Y+ + E+ R+ IF +N
Sbjct: 3 FLIFLAICVAGSQAVSFFDLVQ-------EQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSV-SRQSSRPSTFKYQ 126
+ K NK +G ++KLG N+++D+ + EF G+NR + S +S TF
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+P IDWR+KGAVT +K+QG CGSCW+FSA ++EG GKL+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S NNGC+GGLMD AF YI N G+ TE YPY+ E C K K AT Y D+
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC-HYKPKNKGATDRGYVDI 234
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAE 301
G+E L AV T PVSV ++AS Q+F+ Y GV +C DHGV VVG+GT
Sbjct: 235 ESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGT-- 292
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
E+DG YWL+KNSWG++WG+ GYI++ R+ + CGIATEASYP+
Sbjct: 293 EDDGTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 149/338 (44%), Positives = 201/338 (59%), Gaps = 20/338 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + + + C + VVS + +PS E W + HG+ Y ++ E R +F QN++
Sbjct: 1 MKTLSVFLAICLA-VVSAIPLKDPSW----EAWKSFHGKKYHNQGEDDFRHYVFLQNIKT 55
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
I N + T+K+ NEFSDLT +EF +Y GY S+ + +++PSTF T++P
Sbjct: 56 IAAHNAKS--TFKMAINEFSDLTRKEFVKTYNGYRL---SMKKSTNKPSTFMAPLNTNMP 110
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DN 190
T +DWR++G VT IKNQG CGSCWAFS ++EG GKL+ LSEQ L+DCS N
Sbjct: 111 TEVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLIDCSAAEGN 170
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
+GC GG MD AFEYI N G+ TEA YPY+ C +K A G Y D+ + E
Sbjct: 171 DGCGGGFMDDAFEYIKLNNGIDTEASYPYEGRDDICRYKKTNKGAIDTG-YMDIKQYSED 229
Query: 251 ALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECGDNC-DHGVAVVGFGTAEEEDGAK 307
L AV T P+SV ++AS ++F Y GV + EC DHGV VVG+GT E+G
Sbjct: 230 DLKAAVATVGPISVAIDASHKSFHMYHTGVYHEPECSQTVLDHGVLVVGYGT---ENGED 286
Query: 308 YWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
YWL+KNSWG WG +GYI++ R+ CGIAT ASYP+
Sbjct: 287 YWLVKNSWGTDWGMNGYIKMSRNRSNNCGIATNASYPL 324
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 149/335 (44%), Positives = 202/335 (60%), Gaps = 24/335 (7%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
+++L +T A ++ P E QW H + Y + E+ +R TI+K N I +
Sbjct: 7 LLLLGVTLA------YTIERPVKDESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIRE 60
Query: 76 ANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
N +G + L N+F D+TN EF+A + GY +S + STF N P ++
Sbjct: 61 HNLKGG-DFLLKMNQFGDMTNSEFKA-FNGY------LSHKHVNGSTFLTPNNFVAPDTV 112
Query: 136 DWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGC 193
DWR +G VT +K+QG CGSCWAFS ++EG GKL+ LSEQ LVDCST NNGC
Sbjct: 113 DWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGC 172
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALL 253
+GGLMD AF YI ENKG+ +EA YPY E G C +K AA G + DLP+G+E+ L
Sbjct: 173 NGGLMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKKPSVAATDTG-FVDLPEGNENKLK 231
Query: 254 QAVTK-QPVSVCVEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAEEEDGAKYWL 310
+AV P+SV ++AS ++F+FY GV N C DHGV VVG+GT E G YWL
Sbjct: 232 EAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYGT---ESGKDYWL 288
Query: 311 IKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
+KNSW +WG+ GYI++ R+ + CGIAT+ASYP+
Sbjct: 289 VKNSWNTSWGDKGYIKMRRNAKNQCGIATKASYPL 323
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 130/249 (52%), Positives = 174/249 (69%), Gaps = 6/249 (2%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R IFK NL++I++ NK + Y LG NEF+DL++
Sbjct: 4 LIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVS-NYWLGLNEFADLSHH 62
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF+ Y G S R+SS F Y++V D+P S+DWR+KGAVT+IKNQG CGSCWA
Sbjct: 63 EFKKQYLGLKVDF-STRRESSEE--FTYRDV-DLPKSVDWRKKGAVTNIKNQGSCGSCWA 118
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF +I+EN GL E D
Sbjct: 119 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGLHKEDD 178
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYK 276
YPY E+GTC+ KE++ TI Y D+P+ +E +LL+A+ QP+SV +EASG+ F+FY
Sbjct: 179 YPYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 238
Query: 277 RGVLNAECG 285
GV + CG
Sbjct: 239 GGVFDGHCG 247
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 206/339 (60%), Gaps = 15/339 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+I L+ Q+ + S+ E H + A H + Y +LE+ R+ I+ +N +
Sbjct: 1 TLIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVA 59
Query: 75 KAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
K N ++G ++Y + N+F DL + EFR+ GY + SR S + + NVT V
Sbjct: 60 KHNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVT-V 118
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-- 189
P S+DWREKGA+T +K+QG CGSCWAFS+ A+EG T GKL+ LSEQ L+DCS
Sbjct: 119 PESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYG 178
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
N GC+GGLMD+AF+YI +NKG+ TE YPY+ E C A G + D+P G+E
Sbjct: 179 NEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRG-FVDIPSGEE 237
Query: 250 HALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEEEDGA 306
L AV T PVSV ++AS ++F+FY +GV C D+ DHGV VVG+G+ ++G
Sbjct: 238 DKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS---DNGK 294
Query: 307 KYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YWL+KNSW E WG+ GYI++ R+ + CG+A+ ASYP+
Sbjct: 295 DYWLVKNSWSEHWGDEGYIKMARNRKNHCGVASAASYPL 333
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 151/344 (43%), Positives = 201/344 (58%), Gaps = 27/344 (7%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIF 66
K+F+ + V + L+ C S++ R H W HG+TY E E+ +R I+
Sbjct: 2 KAFLACLLVAV-LIAQCFSELSQDRQWHA---------WKDFHGKTYTGE-EEDLRRAIW 50
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
NLE ++K N E N +YKL N F+DLT EF+ + GY + S+ STF
Sbjct: 51 NDNLEIVKKHNAE-NHSYKLDMNHFADLTVTEFKQRFMGYR-----AASNSTGGSTFLPL 104
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+ +P +DWR+KG VT +KNQG CGSCWAFS+ ++EG GKL+ LSEQ LVDC
Sbjct: 105 SNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDC 164
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S NNGC GGLMD AF+YI N G+ TE YPY G C K + AT+ Y D+
Sbjct: 165 SKKYGNNGCEGGLMDYAFKYIKNNDGIDTEQSYPYTARDGQC-HFKPGSVGATVTGYTDV 223
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAE 301
+G E L AV T P+SV ++A +F+ YK GV + +C DHGV VG+G
Sbjct: 224 QRGSEGDLQSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQLDHGVLAVGYGA-- 281
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
EDG YWL+KNSWGE WG +GYI++ R+ + CGIAT+ASYP+
Sbjct: 282 -EDGKDYWLVKNSWGEGWGMNGYIKMSRNKDNQCGIATQASYPL 324
>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
Length = 330
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 146/325 (44%), Positives = 197/325 (60%), Gaps = 19/325 (5%)
Query: 29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYK 85
+G + P++ + W H + YKD+ E+ +R I+++NL++I N E G TY+
Sbjct: 13 NGATAERPTLDHHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQ 72
Query: 86 LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
+G N+ D+TNEE P RQS + TF+ + +P ++DWREKG VT
Sbjct: 73 VGMNDMGDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTE 127
Query: 146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD----NNGCSGGLMDKA 201
+K QG CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +A
Sbjct: 128 VKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEA 187
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQP 260
F+YII+N G+ +A YPY+ C K AAT +Y LP GDE AL +AV TK P
Sbjct: 188 FQYIIDNGGIEADASYPYKAMDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGP 246
Query: 261 VSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
VSV ++AS +F FYK GV + C N +HGV VVG+GT DG YWL+KNSWG +
Sbjct: 247 VSVGIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL---DGKDYWLVKNSWGLNF 303
Query: 320 GESGYIRILR-DEGLCGIATEASYP 343
G+ GYIR+ R ++ CGIA+ SYP
Sbjct: 304 GDQGYIRMARNNKNHCGIASYCSYP 328
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 204/342 (59%), Gaps = 17/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + +L + C S +S S+ +P + E + W + H + Y E E+ R ++++NL+
Sbjct: 1 MLPVAVLAV-CLSAALSAPSL-DPQLDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G TY+LG N F D+T+EEFR GY R S + + S F N
Sbjct: 58 IELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYKRK----SERKFKGSLFMEPNFL 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+ P S+DWR+ G VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 114 EAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRP 173
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD+AF+YI +N+GL +E YPY K +A + D+P G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSG 233
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
E AL++AV PVSV ++A ++F+FY+ G+ EC + DHGV VVG+G E+
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDV 293
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG+ GYI + +D + CGIAT ASYP+
Sbjct: 294 DGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
Length = 340
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 196/319 (61%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 29 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+TNEE P RQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 89 GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD----NNGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVE 266
N G+ +A YPY+ C K AAT +Y LP GDE AL +AV TK PVSV ++
Sbjct: 204 NGGIEADASYPYKAMDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262
Query: 267 ASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
AS +F FYK GV + C N +HGV VVG+GT DG YWL+KNSWG +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL---DGKDYWLVKNSWGLNFGDQGYI 319
Query: 326 RILR-DEGLCGIATEASYP 343
R+ R ++ CGIA++ SYP
Sbjct: 320 RMARNNKNHCGIASDCSYP 338
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 143/319 (44%), Positives = 188/319 (58%), Gaps = 18/319 (5%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYK----LGTNEFSDLT 95
E E+WM +H + Y EKA R F NL ++ K N EG R +G N F+DL+
Sbjct: 49 ELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADLS 108
Query: 96 NEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHCG 153
NEEFR Y+ + + +R + + V D P S+DWR++GAVT +KNQG CG
Sbjct: 109 NEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVKNQGDCG 168
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLAT 213
SCWAFS+ A+EGI IT G+LI LSEQ+LVDC T N GC GG MD AFE++I N G+ +
Sbjct: 169 SCWAFSSTGAMEGINAITTGELISLSEQELVDCDTTNEGCDGGYMDYAFEWVINNGGIDS 228
Query: 214 EADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
EA+YPY Q C+ KE+ +I YED+ E ALL A +QPVSV ++ S F
Sbjct: 229 EANYPYTGQADSVCNTTKEEIKVVSIDGYEDVAT-SESALLCAAVQQPVSVGIDGSSLDF 287
Query: 273 RFYKRGVLNAECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
+ Y G+ + +C D+ DH V VVG+G ++ G YW++KNSWG WG GYI I R
Sbjct: 288 QLYAGGIYDGDCSGNPDDIDHAVLVVGYG---QQGGTDYWIVKNSWGTDWGMQGYIYIRR 344
Query: 330 DEGL----CGIATEASYPV 344
+ GL C I ASYP
Sbjct: 345 NTGLPYGVCAIDAMASYPT 363
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 139/340 (40%), Positives = 206/340 (60%), Gaps = 15/340 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR + GY + ++S+ + F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKQD----PNRTSKGALFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC+GG+MD+AF+Y+ ENKGL +E YPY + + A I + D+PKG+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGN 234
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFG-TAEEEDG 305
E AL+ AV PVSV ++AS Q+ +FY+ G+ C DH V VVG+G + G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
+YW++KNSW + WG+ GYI + +D+ CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
Length = 340
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 196/319 (61%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 29 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+TNEE +SRQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 89 GDMTNEEISCRMGALR-----ISRQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD----NNGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVE 266
N G+ +A YPY+ C K AAT +Y LP GDE AL +AV TK PVSV ++
Sbjct: 204 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262
Query: 267 ASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
AS +F FYK GV + C N +HGV VVG+GT DG YWL+KNSWG +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL---DGKDYWLVKNSWGLNFGDQGYI 319
Query: 326 RILR-DEGLCGIATEASYP 343
R+ R ++ CGIA+ SYP
Sbjct: 320 RMARNNKNHCGIASYCSYP 338
>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
Length = 340
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 196/319 (61%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 29 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+TNEE +SRQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 89 GDMTNEEISCRMGALR-----ISRQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD----NNGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVE 266
N G+ +A YPY+ C K AAT +Y LP GDE AL +AV TK PVSV ++
Sbjct: 204 NGGIEADASYPYKAMDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262
Query: 267 ASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
AS +F FYK GV + C N +HGV VVG+GT DG YWL+KNSWG +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL---DGKDYWLVKNSWGLNFGDQGYI 319
Query: 326 RILR-DEGLCGIATEASYP 343
R+ R ++ CGIA+ SYP
Sbjct: 320 RMARNNKNHCGIASYCSYP 338
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 119/227 (52%), Positives = 162/227 (71%), Gaps = 8/227 (3%)
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQ 182
++Y+ +P S+DWREKGAV IK+QG CGSCWAFS +A+VEGI +I G LI LSEQ+
Sbjct: 33 YRYRAGDALPDSVDWREKGAVVPIKDQGGCGSCWAFSTIASVEGINKIVTGDLISLSEQE 92
Query: 183 LVDCS-TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
LVDC T N+GC+GGLMD AF++II+N G+ TE DYPY ++ G CD ++ A +I Y
Sbjct: 93 LVDCDKTYNDGCNGGLMDYAFQFIIDNGGIDTEKDYPYTEQDGRCDSYRKNAKVVSINSY 152
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAE 301
ED+P DE AL +A QP++V ++ G++F+ Y G+ +CG + DHGV VVG+G+
Sbjct: 153 EDVPVNDEQALKKAAASQPIAVAIDGGGRSFQLYNSGIFTGKCGTSLDHGVTVVGYGS-- 210
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
E G YW+++NSWGE+WGE GYIR+ R+ G+CGIA EASYP+
Sbjct: 211 -ESGKDYWIVRNSWGESWGEKGYIRMARNIDSPSGICGIAMEASYPI 256
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 204/342 (59%), Gaps = 17/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + +L + C S +S S+ +P + E + W + H + Y E E+ R ++++NL+
Sbjct: 1 MLPVAVLAV-CLSAALSAPSL-DPQLDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G TY+LG N F D+T+EEFR GY R S + + S F N
Sbjct: 58 IELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYKRK----SERKFKGSLFMEPNFL 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+ P S+DWR+ G VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 114 EAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRP 173
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD+AF+YI +N+GL +E YPY K +A + D+P G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSG 233
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
E AL++AV PVSV ++A ++F+FY+ G+ EC + DHGV VVG+G E+
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDV 293
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG+ GYI + +D + CGIAT ASYP+
Sbjct: 294 DGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 148/335 (44%), Positives = 201/335 (60%), Gaps = 24/335 (7%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
+++L +T A ++ P E QW H + Y + E+ +R TI+K N I +
Sbjct: 7 LLLLGVTLA------YTIERPVKDESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIRE 60
Query: 76 ANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
N +G + L N+F D+TN EF+A + GY +S + STF N P ++
Sbjct: 61 HNLKGGD-FILKMNQFGDMTNSEFKA-FNGY------LSHKHVNGSTFLTPNNFVAPDTV 112
Query: 136 DWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGC 193
DWR +G VT +K+QG CGSCWAFS ++EG GKL+ LSEQ LVDCST NNGC
Sbjct: 113 DWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGC 172
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALL 253
GGLMD AF YI ENKG+ +EA YPY E G C +K AA G + D+P+G+E+ L
Sbjct: 173 DGGLMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKKSSVAATDTG-FVDIPEGNENKLK 231
Query: 254 QAVTK-QPVSVCVEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAEEEDGAKYWL 310
+AV P+SV ++AS ++F+FY GV N C DHGV VVG+GT E G YWL
Sbjct: 232 EAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYGT---ESGKDYWL 288
Query: 311 IKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
+KNSW +WG+ GYI++ R+ + CGIAT+ASYP+
Sbjct: 289 VKNSWNTSWGDKGYIKMRRNAKNQCGIATKASYPL 323
>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
Length = 332
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 203/337 (60%), Gaps = 19/337 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
++L C + S H+ S+ +W A H + Y E+ R I+++N++ IE+
Sbjct: 5 LLLAAFCLG-IASAAPRHDHSLDADWYKWKATHRKLYGLN-EEGRRRAIWEKNMKMIERH 62
Query: 77 N---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N ++G ++ + N F D+TNEEFR + G+ +++ + F P
Sbjct: 63 NWEHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQ------NQKHKKGKVFLDAGSALTPH 116
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNN 191
S+DWREKG VT +KNQGHCGSCWAFSA A+EG KLI LSEQ LVDCS N
Sbjct: 117 SVDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNE 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC+GGLMD AF+YI +N GL +E YPY + G+C K K +++AA Y D+PK E A
Sbjct: 177 GCNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGSC-KYKPQSSAANDTGYVDIPK-QEKA 234
Query: 252 LLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEEEDGAKY 308
L++AV T P+SV ++AS ++F+FY G+ +C ++ DHGV VVG+G KY
Sbjct: 235 LMKAVATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKY 294
Query: 309 WLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
WL+KNSWG TWG GYI++ +D+ CGIAT ASYPV
Sbjct: 295 WLVKNSWGNTWGMDGYIKMTKDQNNHCGIATMASYPV 331
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 203/340 (59%), Gaps = 16/340 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+++++ I A Q VS + + E+ + QH + Y+ E E+ R+ IF N +
Sbjct: 4 LVLLVTIAVACQAVSFSEL----VQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVA 59
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS-TFKYQNVTD 130
K NK +G YKL N++ DL + EF G+NR + R + S TF D
Sbjct: 60 KHNKLFEQGLYPYKLAMNKYGDLLHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVD 119
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
+P ++DWR++GAVT +K+QGHCGSCW+FSA A+EG KL+ LSEQ LVDCS+
Sbjct: 120 IPDTVDWRQEGAVTPVKDQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRF 179
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC+GGLMD AF YI N G+ TEA YPY E + K AT + D+P GD
Sbjct: 180 GNNGCNGGLMDNAFRYIKNNGGIDTEAAYPYMGEDEKF-RYSAKNRGATDKGFVDIPSGD 238
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTAEEEDG 305
E L AV T P+S+ ++AS ++F+ Y GV + C DHGV VVG+GT +E+ G
Sbjct: 239 EDKLKAAVATVGPISIAIDASHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGT-DEKTG 297
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YWL+KNSWG+TWG GYI++ R+ + CG+AT+ASYP+
Sbjct: 298 MDYWLVKNSWGDTWGLDGYIKMARNQDNQCGVATQASYPL 337
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 207/342 (60%), Gaps = 20/342 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ + +++ C S V + S +E H W H + Y E E+ R ++++NL+ I
Sbjct: 4 LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKHYH-ESEEGWRRMVWEKNLKKI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +Y+LG N F D+TNEEFR + GY + + + + S F N
Sbjct: 61 EIHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P ++DWREKG VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPE 176
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD+AF+YI +N GL TE YPY ++ C + E +AA G + D+P G
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANETG-FVDIPSG 235
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
EHA+++AV PVSV ++A ++F+FY+ G+ EC + DHGV VVG+G E+
Sbjct: 236 KEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDV 295
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG+ GYI + +D + CGIAT +SYP+
Sbjct: 296 DGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
Length = 258
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 143/266 (53%), Positives = 178/266 (66%), Gaps = 20/266 (7%)
Query: 89 NEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT-----DVPTSIDWREKGAV 143
NEF+D+TN+EF A YTG RPVP+ ++ + + FKY NVT D ++DWR+KGAV
Sbjct: 4 NEFADMTNDEFMAMYTGL-RPVPAGAK---KMAGFKYGNVTLSDADDDQQTVDWRQKGAV 59
Query: 144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKAF 202
T IK+Q CG CWAF+AVAAVEGI QIT G L+ LSEQQ++DC TD NNGC+GG +D AF
Sbjct: 60 TGIKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAF 119
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
+YI+ N GLATE YPY Q C + AA I Y+D+P GDE AL AV QPVS
Sbjct: 120 QYIVGNGGLATEDAYPYTAAQAMCQSVQPVAA---ISGYQDVPSGDEAALAAAVANQPVS 176
Query: 263 VCVEASGQAFRFYKRGVLN-AECG--DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
V ++A F+ Y GV+ A C N +H V VG+GTA EDG YWL+KN WG+ W
Sbjct: 177 VAIDA--HNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTA--EDGTPYWLLKNQWGQNW 232
Query: 320 GESGYIRILRDEGLCGIATEASYPVA 345
GE GY+R+ R CG+A +ASYPVA
Sbjct: 233 GEGGYLRLERGANACGVAQQASYPVA 258
>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
Length = 335
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 204/340 (60%), Gaps = 15/340 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y ++LE R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDLEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR + GY +R S P F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGP-LFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC+GG+MD+AF+Y+ ENKGL +E YPY + + A I + D+P+G+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFG-TAEEEDG 305
E AL+ AV PVSV ++AS Q+ +FY+ G+ C DH V VVG+G + G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
+YW++KNSW + WG+ GYI + +D+ CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
Length = 326
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 195/319 (61%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 15 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 74
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+TNEE P RQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 75 GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 129
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD----NNGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 130 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 189
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVE 266
N G+ +A YPY+ C K AAT +Y LP GDE AL +AV TK PVSV ++
Sbjct: 190 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 248
Query: 267 ASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
AS +F FYK GV + C N +HGV VVG+GT DG YWL+KNSWG +G+ GYI
Sbjct: 249 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL---DGKDYWLVKNSWGLNFGDQGYI 305
Query: 326 RILR-DEGLCGIATEASYP 343
R+ R ++ CGIA+ SYP
Sbjct: 306 RMARNNKNHCGIASYCSYP 324
>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
Length = 533
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 139/307 (45%), Positives = 191/307 (62%), Gaps = 13/307 (4%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRASY 103
WM HG T+ D LE A RL + N YI + N E T LG N FS ++ +EF+
Sbjct: 31 WMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHNAFSHMSFDEFKFKM 90
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAA 163
TG P + ++ + + +V +VP+++DW +KG VT +KNQG CGSCWAFS A
Sbjct: 91 TGLVLPEGYLEQRLASRVDGLWSDV-EVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTGA 149
Query: 164 VEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
VEG T ++ GKL LSEQ+LVDC + + GC+GGLMD AF++I ++ G+ +E DY Y+ +
Sbjct: 150 VEGATFVSSGKLPSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKAK 209
Query: 223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNA 282
C +E + + ++D+ DEHAL AV +QPVSV +EA +AF+FYK GV N
Sbjct: 210 AQVC---RECDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266
Query: 283 ECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE----GLCGIAT 338
CG DHGV VG+G ++G K+W +KNSWG +WGE GYIR+ R+E G CGIA+
Sbjct: 267 TCGTRLDHGVLAVGYGN---DNGHKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIAS 323
Query: 339 EASYPVA 345
SYP A
Sbjct: 324 VPSYPFA 330
>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
Length = 336
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 204/341 (59%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR + GY Q+S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHD----PNQTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPH 174
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I + D+PKG+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGN 234
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVAVVGFG-TAEEED 304
E AL+ AV PVSV ++AS Q+ +FY+ G+ A DH V VVG+G +
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
G +YW++KNSW + WG+ GYI + +D+ CGIAT ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 207/342 (60%), Gaps = 20/342 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ + +++ C S V + S +E H W H ++Y E E+ R ++++NL+ I
Sbjct: 4 LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKSYH-ESEEGWRRMVWEKNLKKI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +Y+LG N F D+TNEEFR + GY + + + + S F N
Sbjct: 61 EMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P ++DWREKG VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPE 176
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD+AF+YI +N GL TE YPY ++ C + E + A G + D+P G
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETG-FVDIPSG 235
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
EHA+++AV PVSV ++A ++F+FY+ G+ EC + DHGV VVG+G E+
Sbjct: 236 KEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDV 295
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG+ GYI + +D + CGIAT +SYP+
Sbjct: 296 DGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 134/309 (43%), Positives = 191/309 (61%), Gaps = 12/309 (3%)
Query: 44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASY 103
Q+ H + Y E E+ R IFK NL YI N +G +Y L N+F DLT EEFR Y
Sbjct: 91 QFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHNMQG-YSYVLKMNKFGDLTLEEFRQRY 149
Query: 104 TGYNRP-VPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
GY +P + + R+ +T + D+PT +DWR++G VT +K+QG CGSCWAFSA
Sbjct: 150 LGYKKPDLRTPPREVD--TTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATG 207
Query: 163 AVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQ 220
A+EG+ GKL+ LS+QQLVDCS N GC GG M++AFEY++EN G+ + +YPY
Sbjct: 208 AMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYM 267
Query: 221 QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVT-KQPVSVCVEASGQAFRFYKRGV 279
++ G C K + + ATI Y +P+ E ++ A+ + PVSV ++A+ AF+FY G+
Sbjct: 268 RKDGVC-KSSQCTSVATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDGI 326
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE---GLCGI 336
+A CG N DHGV +VG+ +AE YW++KNSWG WG+ GY+ + + G CG+
Sbjct: 327 FDAPCGTNLDHGVLLVGY-SAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHKGPAGQCGV 385
Query: 337 ATEASYPVA 345
+ S+PVA
Sbjct: 386 LLDGSFPVA 394
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 135/321 (42%), Positives = 193/321 (60%), Gaps = 15/321 (4%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI--EKANKEGNR-TYKLGTN 89
+ E ++E +QW +H + Y+ E R FK NL+YI A ++ N+ + +G N
Sbjct: 40 LSEERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLN 99
Query: 90 EFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQ 149
+F+D++NEEFR +Y + + SR K Q+ D P+S+DWR G VT +K+Q
Sbjct: 100 KFADMSNEEFRKAYLSKVKKPINKGITLSRNMRRKVQSC-DAPSSLDWRNYGVVTAVKDQ 158
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENK 209
G CGSCWAFS+ A+EGI + G LI LSEQ+LV+C T N GC GG MD AFE++I N
Sbjct: 159 GSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTSNYGCEGGYMDYAFEWVINNG 218
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASG 269
G+ +E+DYPY GTC+ KE+ +I Y+D+ + D ALL AV +QPVSV ++ S
Sbjct: 219 GIDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSDS-ALLCAVAQQPVSVGIDGSA 277
Query: 270 QAFRFYKRGVLNAECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
F+ Y G+ + C D+ DH V +VG+G+ ED +YW++KNSWG +WG GY
Sbjct: 278 IDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYGS---EDSEEYWIVKNSWGTSWGIDGYFY 334
Query: 327 ILRDE----GLCGIATEASYP 343
+ RD G+C + ASYP
Sbjct: 335 LKRDTDLPYGVCAVNAMASYP 355
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 142/319 (44%), Positives = 202/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ E + +H + Y E+E++ R+ IF +N I NK +G+ TYKL N++ D+
Sbjct: 25 VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84
Query: 95 TNEEFRASYTGY--NRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+ EF ++ G+ N + ++ +TF + + +P ++DWR KGAVT IK+QG
Sbjct: 85 LHHEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQGQ 144
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFSA A+EG T G+L+ LSEQ LVDCS NNGC+GGLMD AFEY+ EN
Sbjct: 145 CGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKENG 204
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
G+ TE YPY E C A A G + D+ +G EHAL +AV T PVSV ++AS
Sbjct: 205 GIDTEESYPYDAEDEKCHYNPRAAGAEDKG-FVDVREGSEHALKKAVATVGPVSVAIDAS 263
Query: 269 GQAFRFYKRGV-LNAECG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
++F+FY GV + EC + DHGV VVG+G ++DG YWL+KNSWG TWG+ GY++
Sbjct: 264 HESFQFYSHGVYIEPECSPEMLDHGVLVVGYGI--DDDGTDYWLVKNSWGTTWGDQGYVK 321
Query: 327 ILRD-EGLCGIATEASYPV 344
+ R+ + CGIA+ AS+P+
Sbjct: 322 MARNRDNQCGIASSASFPL 340
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 143/319 (44%), Positives = 195/319 (61%), Gaps = 20/319 (6%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTN 96
E+W+A QH + Y E+E R+ I+ +N I K N+ +G +YKLG N+++D+ +
Sbjct: 26 EEWVAFKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDMLH 85
Query: 97 EEFRASYTGYNRPVPSVS-----RQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
EF + GYNR + R +TF P +DW +KGAVT +K+QG
Sbjct: 86 HEFIQAMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQGK 145
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS A+EG G L+ LSEQ L+DCS+ NNGC+GGLMD AF+YI +N
Sbjct: 146 CGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYIKDNG 205
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
G+ TE YPY+ C + + A +G + D+P GDE L+QAV T PVSV ++AS
Sbjct: 206 GIDTEKTYPYEGVDDKCRYNPKNSGAEDVG-FVDIPSGDEEKLMQAVATVGPVSVAIDAS 264
Query: 269 GQAFRFYKRGVL-NAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
+F+FY GV + EC + DHGV VVG+GT +E G YWL+KNSW TWGE GYI+
Sbjct: 265 QNSFQFYSGGVYYDTECSSTDLDHGVLVVGYGT--DEAGGDYWLVKNSWSRTWGELGYIK 322
Query: 327 ILRD-EGLCGIATEASYPV 344
+ R+ + CGIAT+ASYP+
Sbjct: 323 MARNRDNHCGIATDASYPL 341
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 205/342 (59%), Gaps = 19/342 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+++LV+ + + + R + E + W + H + Y+ E E+ R ++++NL+ I
Sbjct: 3 LYLVVLVLCTGAALAAPR--FDAQFDEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKKI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +Y LG N F D+TNEEFR GY + ++ + S F N +
Sbjct: 61 EMHNLEHSLGKHSYSLGMNHFGDMTNEEFRQVMNGY-----KLQQRKFKGSLFLEPNNME 115
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P +DWRE+G VT +K+QG CGSCWAFS A+EG KL+ LSEQ LVDCS
Sbjct: 116 APKQVDWREEGYVTPVKDQGQCGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRPE 175
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD+AF+YI +N GL +E YPY + C+ + E +AA G + D+P G
Sbjct: 176 GNEGCNGGLMDQAFQYIQDNSGLDSEEAYPYLGTDDQPCNYKAEFSAANDTG-FMDIPSG 234
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
EHAL++A+ PVSV ++A ++F+FY+ G+ EC + DHGV VG+G E+
Sbjct: 235 KEHALMKAIASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG+ GYI + +D + CGIAT ASYP+
Sbjct: 295 DGKKYWIVKNSWSEKWGDKGYILMAKDRKNHCGIATAASYPL 336
>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
Length = 342
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 195/319 (61%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 31 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 90
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+TNEE P RQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 91 GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 145
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD----NNGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 146 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 205
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVE 266
N G+ +A YPY+ C K AAT +Y LP GDE AL +AV TK PVSV ++
Sbjct: 206 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 264
Query: 267 ASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
AS +F FYK GV + C N +HGV VVG+GT DG YWL+KNSWG +G+ GYI
Sbjct: 265 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL---DGKDYWLVKNSWGLNFGDQGYI 321
Query: 326 RILR-DEGLCGIATEASYP 343
R+ R ++ CGIA+ SYP
Sbjct: 322 RMARNNKNHCGIASYCSYP 340
>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
Length = 343
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 195/319 (61%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 32 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 91
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+TNEE P RQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 92 GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 146
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD----NNGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 147 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 206
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVE 266
N G+ +A YPY+ C K AAT +Y LP GDE AL +AV TK PVSV ++
Sbjct: 207 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 265
Query: 267 ASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
AS +F FYK GV + C N +HGV VVG+GT DG YWL+KNSWG +G+ GYI
Sbjct: 266 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL---DGKDYWLVKNSWGLNFGDQGYI 322
Query: 326 RILR-DEGLCGIATEASYP 343
R+ R ++ CGIA+ SYP
Sbjct: 323 RMARNNKNHCGIASYCSYP 341
>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 360
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 141/350 (40%), Positives = 199/350 (56%), Gaps = 20/350 (5%)
Query: 12 PMFVIIILVITCASQVVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
P+ +++ A+ SGR + + ++++ W A H ++Y+ E+ R +++ N
Sbjct: 10 PVITASTILLAWAAAAASGRGVDVGDMLMMDRFLMWQATHNQSYRSAEERLRRFQVYRDN 69
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKY---- 125
+EYIE N+ G+ TY+LG N+F+DLT EEF A +T YN S +T
Sbjct: 70 VEYIETTNRRGDLTYQLGENQFADLTREEFIARFTSYNGDDDRTGDDDSVITTAAVGGGD 129
Query: 126 --------QNVTDVPTSIDWREKGAVTHIKNQGHCGSC-WAFSAVAAVEGITQITGGKLI 176
+V+ P S+DWR KGAV K+Q S WAF AVA +E + I GKL+
Sbjct: 130 PDLWSSGGDDVSLDPPSVDWRAKGAVVPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLV 189
Query: 177 ELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAA 236
LSEQQLVDC + GC+ G +AF ++I+N GL TEA+YPY QGTC+ K A
Sbjct: 190 ALSEQQLVDCDQYDGGCNRGTFRRAFHWVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHVA 249
Query: 237 TIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVG 296
I + +P +E A+ AV QPV+ +E G +FYK GV + CG +H V VVG
Sbjct: 250 AISGHASVPGSNELAMKHAVATQPVAAAIEL-GSDMQFYKSGVYSGPCGARLEHAVTVVG 308
Query: 297 FGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYP 343
+G A+E G KYW++KNSWG+TWGE GYIR+ R GLCGI + +YP
Sbjct: 309 YG-ADESTGDKYWIVKNSWGQTWGERGYIRMQRKILGPGLCGIMLDVAYP 357
>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
Length = 340
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 195/319 (61%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 29 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+TNEE P RQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 89 GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD----NNGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVE 266
N G+ +A YPY+ C K AAT +Y LP GDE AL +AV TK PVSV ++
Sbjct: 204 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262
Query: 267 ASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
AS +F FYK GV + C N +HGV VVG+GT DG YWL+KNSWG +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTL---DGKDYWLVKNSWGLNFGDQGYI 319
Query: 326 RILR-DEGLCGIATEASYP 343
R+ R ++ CGIA+ SYP
Sbjct: 320 RMARNNKNHCGIASYCSYP 338
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 261 bits (666), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 137/340 (40%), Positives = 206/340 (60%), Gaps = 15/340 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ +L+ C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLITLCISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR + GY + ++S+ + F +
Sbjct: 59 EQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKQD----PNRTSKGALFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC+GG+MD+AF+Y+ ENKGL +E YPY + + A I + D+P+G+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFG-TAEEEDG 305
E AL+ AV PVSV ++AS Q+ +FY+ G+ C DH V VVG+G + G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
+YW++KNSW + WG+ GYI + +D+ CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 207/342 (60%), Gaps = 20/342 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ + +++ C S V + S +E H W H ++Y E E+ R ++++NL+ I
Sbjct: 4 LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKSYH-ESEEGWRRMVWEKNLKKI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +Y+LG N F D+TNEEFR + GY + + + + S F N
Sbjct: 61 EMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P ++DWREKG VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPE 176
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD+AF+YI +N GL TE YPY ++ C + E + A G + D+P G
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETG-FVDIPSG 235
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
EHA+++AV PVSV ++A ++F+FY+ G+ EC + DHGV VVG+G E+
Sbjct: 236 KEHAMMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKECSSEELDHGVLVVGYGFEGEDV 295
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG+ GYI + +D + CGIAT +SYP+
Sbjct: 296 DGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 139/352 (39%), Positives = 201/352 (57%), Gaps = 16/352 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ----WMAQHGRTYKDE 56
M+ K + + + + + ++ + G S + + E+ Q WM H + Y++
Sbjct: 3 MIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENV 62
Query: 57 LEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQ 116
EK R IFK NL YI++ NK+ N +Y+LG NEF+DL+N+EF Y G + + +
Sbjct: 63 DEKLYRFEIFKDNLNYIDETNKK-NNSYRLGLNEFADLSNDEFNEKYVG---SLIDATIE 118
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
S F +++ ++P ++DWR+KGAVT +++QG CGSCWAFSAVA VEGI +I GKL+
Sbjct: 119 QSYDEEFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLV 178
Query: 177 ELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAA 236
ELSEQ+LVDC ++GC GG A EY+ +N G+ + YPY+ +QGTC ++
Sbjct: 179 ELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIV 237
Query: 237 TIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVG 296
+ +E LL A+ KQPVSV VE+ G+ F+ YK G+ CG DH V V
Sbjct: 238 KTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAV- 296
Query: 297 FGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
+ G Y LIKNSWG WGE GYIRI R G+CG+ + YP+
Sbjct: 297 --GYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPI 346
>gi|2804262|dbj|BAA24442.1| cysteine proteinase [Sitophilus zeamais]
Length = 338
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 140/340 (41%), Positives = 212/340 (62%), Gaps = 19/340 (5%)
Query: 19 LVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEK 75
L + A+ V+S +++ +V+ EQW + QH + Y E E+ R+ IF +N + K
Sbjct: 3 LFLILAAVVISCQAVSFYDLVQ--EQWSSFKMQHSKNYDSETEERFRMKIFMENAHKVAK 60
Query: 76 ANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV- 131
NK +G +KLG N+++D+ + EF ++ G+N+ ++ + S ++ + +V
Sbjct: 61 HNKLFSQGFVKFKLGLNKYADMLHHEFVSTLNGFNKTKNNILKGSDLNDAVRFISPANVK 120
Query: 132 -PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--T 188
P ++DWR+KGAVT +K+QGHCGSCW+FSA ++EG GKL+ LSEQ LVDCS
Sbjct: 121 LPDTVDWRDKGAVTEVKDQGHCGSCWSFSATGSLEGQHFRKTGKLVSLSEQNLVDCSGRY 180
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC+GGLMD AF YI +N G+ TE YPY E C + + + A G + D+ + +
Sbjct: 181 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYLAEDEKCHYKAQNSGATDKG-FVDIEEAN 239
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTAEEEDG 305
E L AV T PVS+ ++AS + F+ Y GV + EC DHGV VVG+GT+ +DG
Sbjct: 240 EDDLKAAVATVGPVSIAIDASHETFQLYSDGVYSDPECSSQELDHGVLVVGYGTS--DDG 297
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YWL+KNSWG +WG +GYI++ R+ + +CG+A++ASYP+
Sbjct: 298 QDYWLVKNSWGPSWGLNGYIKMARNQDNMCGVASQASYPL 337
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 147/338 (43%), Positives = 195/338 (57%), Gaps = 14/338 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M +I V+ C S ++ M EP + W + HG+ Y ++ E+ MR I++ NL+
Sbjct: 1 MEAVIFAVLLCISSALAMPPM-EPLQDPNWKAWKSFHGKEYPNKNEETMRNFIWQNNLKK 59
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
I N EG ++KL N D+T+ E + G + + + +TF V
Sbjct: 60 IVTHN-EGKHSFKLAMNHLGDMTSLEISQTLLGLK--LKKHAESQPKGATFLPPANVKVV 116
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--N 190
SIDWR KG VT +KNQG CGSCWAFS A+EG GKL+ LSEQ LVDCS N
Sbjct: 117 DSIDWRSKGYVTPVKNQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSGKYGN 176
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
NGC GGLMD AF+YI EN G+ TE YPY + G C K A G + D+P GDE+
Sbjct: 177 NGCEGGLMDNAFQYIKENGGIDTEKSYPYLAKDGVCHYNKSAIGAKDTG-FVDIPTGDEN 235
Query: 251 ALLQAVTK-QPVSVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTAEEEDGAK 307
AL QA+ P+S+ ++AS F FY +GV + +C DHGV VG+GT +DG
Sbjct: 236 ALQQALASVGPISIAIDASQSTFHFYHQGVYDDPDCSSTRLDHGVLAVGYGT---DDGKD 292
Query: 308 YWLIKNSWGETWGESGYIRILR-DEGLCGIATEASYPV 344
YWL+KNSWG +WGE GYI+I R D CG+A++ASYP+
Sbjct: 293 YWLVKNSWGPSWGEEGYIKIARNDHDKCGVASKASYPL 330
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 137/324 (42%), Positives = 188/324 (58%), Gaps = 21/324 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V ++W+ +HG+ Y EKA RL IF+ NL+YI NK N +++LG N+F+DLTNE
Sbjct: 39 LVRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNE 98
Query: 98 EFRASYTGYNRPVPSVSRQSS------RP----STFKYQNVTDVPTSIDWREKGAVTHIK 147
EF+ Y G N R++ RP + + + +S+DWR+KGAVT +K
Sbjct: 99 EFKTRYFGKNSKQWRDRRRTELEGAELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVK 158
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIE 207
+Q CGSCWAFS A+EG+ I+ GKL+ LSEQ+LV C N GC GG MD AF ++I+
Sbjct: 159 DQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACDATNYGCEGGDMDYAFTWVIQ 218
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEA 267
N G+ TE DY Y TC+ KE +I Y D+ D+ ALL A QPVSV ++
Sbjct: 219 NGGIDTEKDYSYTGVDSTCNTNKEAKKIVSIDGYTDVSP-DDSALLCAAGSQPVSVGIDG 277
Query: 268 SGQAFRFYKRGVLNAECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
S F+ Y G+ + +C D+ DH V VVG+ ++G YW++KNSWG WG GY
Sbjct: 278 SAIDFQLYTGGIYDGDCSGNPDDIDHAVLVVGY---SAKNGKDYWIVKNSWGTDWGLEGY 334
Query: 325 IRILRDE----GLCGIATEASYPV 344
ILR+ G+C I ASYP
Sbjct: 335 FYILRNTELPYGVCAINAMASYPT 358
>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 145/340 (42%), Positives = 198/340 (58%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++LV C V +M EP + + W HG+ Y+ E+E R ++++NL
Sbjct: 9 MLGSLMLVSLC----VGAAAMFEPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLML 64
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I N E G TY+L N DLT EE S+ + P + R +S F
Sbjct: 65 ITMHNLEASMGLHTYELSMNHMGDLTQEEIMQSFATLSPPT-DIQRAAS---PFAGTTGA 120
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
DVP ++DWREKG VT +K QG CGSCWAFSA A+EG T GKL++LS Q LVDCST
Sbjct: 121 DVPDTMDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTK 180
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N+GC+GGLM AF+Y+I+N+G+ ++A YPY G C + K AA +Y LP+G
Sbjct: 181 YGNHGCNGGLMHHAFQYVIDNQGIDSDASYPYTGRNGEC-RYNSKFRAANCSQYSFLPEG 239
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGVLN-AECGDNCDHGVAVVGFGTAEEEDG 305
+E AL +A+ P+SV ++A+ F FY+ GV N C +HGV VG+GT DG
Sbjct: 240 NEGALKEALANIGPISVAIDATRPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTL---DG 296
Query: 306 AKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYPV 344
YWL+KNSWG+T+G+ GYIR+ R++ CGIA YP+
Sbjct: 297 QDYWLVKNSWGKTFGDQGYIRMSRNKNDQCGIALYGCYPI 336
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 147/355 (41%), Positives = 211/355 (59%), Gaps = 17/355 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ L S + M V + ++ C + + +P + + W + H + Y E E++
Sbjct: 4 LFLARRLSRFVNMNVCLTILSLCLGLAFAAPRV-DPDLDSHWQLWKSWHSKDYH-EREES 61
Query: 61 MRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQS 117
R ++++NL+ IE N + G +YKLG N+F D+T EEFR GY S +
Sbjct: 62 WRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKH---KKSERK 118
Query: 118 SRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIE 177
R S F + + P S+DWREKG VT +K+QG CGSCWAFS A+EG GKL+
Sbjct: 119 YRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVS 178
Query: 178 LSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAA 234
LSEQ LVDCS N GC+GGLMD+AF+Y+ +N G+ +E YPY ++ C + E A
Sbjct: 179 LSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNA 238
Query: 235 AATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHG 291
A G + D+P+G E AL++AV PVSV ++A +F+FY+ G+ +C ++ DHG
Sbjct: 239 ANDTG-FVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHG 297
Query: 292 VAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
V VVG+G E+ DG KYW++KNSWGE WG+ GYI + +D + CGIAT ASYP+
Sbjct: 298 VLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 352
>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
Length = 336
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 204/341 (59%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR + GY Q+S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHD----PNQTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I + D+P+G+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVAVVGFG-TAEEED 304
E AL+ AV PVSV ++AS Q+ +FY+ G+ A DH V VVG+G +
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
G +YW++KNSW + WG+ GYI + +D+ CGIAT ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 138/340 (40%), Positives = 204/340 (60%), Gaps = 15/340 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR + GY ++S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD----PNRTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC+GG+MD+AF+Y+ ENKGL +E YPY + + A I + D+P+G+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFG-TAEEEDG 305
E AL+ AV PVSV ++AS Q+ +FY+ G+ C DH V VVG+G + G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
+YW++KNSW + WG+ GYI + +D+ CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 260 bits (665), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 144/339 (42%), Positives = 205/339 (60%), Gaps = 15/339 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
+I L+ Q+ + S+ E H + A H + Y +LE+ R+ I+ +N +
Sbjct: 1 TLIFLLGAVFVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVA 59
Query: 75 KAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
K N ++G ++Y++ N+F DL + EFR+ GY + SR S + + NV +V
Sbjct: 60 KHNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANV-EV 118
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-- 189
P S+DWREKGA+T +K+QG CG CWAFS+ A+EG T GKL+ L EQ L+DCS
Sbjct: 119 PESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYG 178
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
N GC+GGLMD+AF+YI +NKG+ TE YPY+ E C A G + D+P G+E
Sbjct: 179 NEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVCRYNPRNRGAVDRG-FVDIPSGEE 237
Query: 250 HALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEEEDGA 306
L AV T PVSV ++AS ++F+FY +GV C D+ DHGV VVG+G+ ++G
Sbjct: 238 DKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS---DNGK 294
Query: 307 KYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YWL+KNSW E WG+ GYI+I R+ + CG+AT ASYP+
Sbjct: 295 DYWLVKNSWSEHWGDQGYIKIARNRKNHCGVATAASYPL 333
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 260 bits (664), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 206/343 (60%), Gaps = 15/343 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ +I L+ Q+ + S+ E H + A H + Y +LE+ R+ I+ +N
Sbjct: 1 MKQITLIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENK 59
Query: 71 EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ K N ++G ++Y++ N+F DL + EFR+ GY + SR S + + N
Sbjct: 60 HKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPAN 119
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
V +VP S+DWR KGA+T +K+QG CGSCWAFS+ A+EG T GKLI LSEQ L+DCS
Sbjct: 120 V-EVPESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCS 178
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
N GC+GGLMD+AF+YI +NKG+ TE YPY+ E C A G + +P
Sbjct: 179 GKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDNVCRYNPRNRGAIDRG-FVHIP 237
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEE 302
G+E L AV T PVSV ++AS ++F+FY +GV C D+ DHGV VVG+G+
Sbjct: 238 SGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS--- 294
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
++G YWL+KNSW E WG+ GYI+I R+ + CGIAT ASYP+
Sbjct: 295 DNGKDYWLVKNSWSEHWGDEGYIKIARNRKNHCGIATAASYPL 337
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 147/328 (44%), Positives = 196/328 (59%), Gaps = 30/328 (9%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL---EKAMRLTIFKQNLEYIEKANKEGN 81
SQ + R++H +++ + HG Y +L E A R + NL IE A+ GN
Sbjct: 11 SQFLPRRNLH--LVLKGPTAFRRIHGVFYSSQLGLCEPAFRCHL--ANLRVIE-AHNAGN 65
Query: 82 RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS-IDWREK 140
++ +G +F+DLT EF A Y + P +RP + +T+ P +DWR+K
Sbjct: 66 SSFTMGITQFADLTAAEFSA----YVKRFP---MNVTRPRNEVW--ITEAPLQEVDWRQK 116
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLM 198
AVT IKNQG CGSCW+FS +VEG I GKL+ LSEQQL+DCST N+GC+GGLM
Sbjct: 117 NAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCSTRYGNHGCNGGLM 176
Query: 199 DKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK 258
D AFEY+I N GL TE DYPY E G C+ +KEK AA I + ++PK E L AV+
Sbjct: 177 DYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEIHGFRNVPKEHEDQLAAAVSI 236
Query: 259 QPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
PVSV +EA F+ Y GV + +CG + DHGV VVG+ YW++KNSWG++
Sbjct: 237 GPVSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSD-------DYWIVKNSWGKS 289
Query: 319 WGESGYIRILR---DEGLCGIATEASYP 343
WGE GYIR+ R +G+CGI +ASYP
Sbjct: 290 WGEEGYIRLKRGVDKKGMCGITMQASYP 317
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 146/346 (42%), Positives = 207/346 (59%), Gaps = 22/346 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++P+ V+ + C S +S S+ +P + + E W + H + Y E E+ R ++++N
Sbjct: 1 MLPLAVVAL----CLSAALSAPSL-DPQLDDHWELWKSWHSKKYH-EKEEGWRRMVWEKN 54
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
L+ IE N E G +Y+LG N F D+T+EEFR GY R + +R S F
Sbjct: 55 LKKIELHNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYKRK----AETKARGSLFLEP 110
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
N + P S+DWR+ G VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDC
Sbjct: 111 NFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDC 170
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYED 243
S N GC+GGLMD+AF+Y+ +N+GL +E YPY + C + G + D
Sbjct: 171 SRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPTYNSVNDTG-FVD 229
Query: 244 LPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-T 299
+P G E AL++AV PVSV ++A ++F+FY+ G+ EC + DHGV VVG+G
Sbjct: 230 IPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQ 289
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
E+ DG KYW++KNSW E WG+ GYI + +D + CGIAT ASYP+
Sbjct: 290 GEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 259 bits (663), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 146/343 (42%), Positives = 206/343 (60%), Gaps = 24/343 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
FV++ + A+ + H+ + + + A HG+ Y E E+ RL I+ +N I
Sbjct: 27 FVVLGCLFVTAAAIT-----HQELVGAEWSAFKALHGKEYHSETEEYYRLKIYMENRLKI 81
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSS---RPSTFKYQN 127
+ N++ +YKL NEF DL + EF ++ G+ R S R+ S P + ++
Sbjct: 82 ARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNGFKRNYRSTPREGSFYIEPEGIEDKH 141
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+ P ++DWR+KGAVT +KNQG CGSCWAFS ++EG G+++ LSEQ LVDCS
Sbjct: 142 L---PKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCS 198
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
NNGC GGLMD AF+YI N G+ TE YPY G C +K A G + D+P
Sbjct: 199 GKFGNNGCEGGLMDNAFKYIKANGGIDTELSYPYNGTDGICHFEKSDVGATDTG-FVDIP 257
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEE 302
+G+E L +AV T PVSV ++AS ++F+FY +GV + EC ++ DHGV VVG+GT
Sbjct: 258 EGNEQLLKKAVATVGPVSVAIDASHESFQFYSQGVYDEPECSSESLDHGVLVVGYGT--- 314
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
+DG YWL+KNSWG TWG+ GYI + R+ E CGIA+ ASYP+
Sbjct: 315 KDGQDYWLVKNSWGTTWGDDGYIYMTRNKENQCGIASSASYPL 357
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 259 bits (663), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 142/331 (42%), Positives = 186/331 (56%), Gaps = 28/331 (8%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ +E+W A H +D EK R +FK+N I + N +GN TY LG N FSD+
Sbjct: 41 EESLWALYERWCA-HYNMARDHGEKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSDM 99
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD----------------VPTSIDWR 138
T+EEF S G P +S + D P ++DWR
Sbjct: 100 TDEEFNRSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWR 159
Query: 139 EKGAVTHIKNQGH-CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGL 197
+ AVT +K+QG CGSCWAFSA+AAVEGI I L+ LSEQQLVDC N+GC+GGL
Sbjct: 160 GR-AVTRVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCDKLNHGCNGGL 218
Query: 198 MDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVT 257
M AF +++ N+G+ E YPY +G C + A TI Y+ +P+ D +AL+ AV
Sbjct: 219 MTTAFSFVVRNRGVVPEGAYPYMGREGRC--KHVMAPPVTIYGYQRVPRFDANALMNAVA 276
Query: 258 KQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGE 317
QPVSV +EAS FR Y+ GV N CG H VG+G + G +W++KNSWG
Sbjct: 277 AQPVSVAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYGA---DAGGPFWIVKNSWGP 333
Query: 318 TWGESGYIRILRD----EGLCGIATEASYPV 344
WGE GY+RI R+ +G+CGI TE SYPV
Sbjct: 334 GWGEGGYVRISRNTPVRQGVCGILTENSYPV 364
>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 198/340 (58%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++LV C V +M EP + + W HG+ Y+ E+E R ++++NL
Sbjct: 9 MLGSLMLVSLC----VGAAAMFEPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLML 64
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I N E G TY+L N DLT EE S+ + P + R +S F
Sbjct: 65 ITMHNLEASMGLHTYELSMNHMGDLTQEEIMQSFATLSPPT-DIQRAAS---PFAGTTGA 120
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
DVP ++DWREKG VT +K QG CGSCWAFSA A+EG T GKL++LS Q LVDCST
Sbjct: 121 DVPDTMDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTK 180
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N+GC+GG M +AF+Y+I+N+G+ ++A YPY G C + K AA +Y LP+G
Sbjct: 181 YGNHGCNGGFMHQAFQYVIDNQGIDSDASYPYTGRNGEC-RYNSKFRAANCSQYSFLPEG 239
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGVLN-AECGDNCDHGVAVVGFGTAEEEDG 305
+E AL +A+ P+SV ++A+ F FY+ GV N C +HGV VG+GT DG
Sbjct: 240 NEGALKEALANIGPISVAIDATRPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTL---DG 296
Query: 306 AKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYPV 344
YWL+KNSWG+T+G+ GYIR+ R++ CGIA YP+
Sbjct: 297 QDYWLVKNSWGKTFGDQGYIRMSRNKNDQCGIALYGCYPI 336
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 145/344 (42%), Positives = 201/344 (58%), Gaps = 28/344 (8%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++ + L+I C S + R + + WM +H ++Y ++ E R ++F+ N
Sbjct: 3 LVLALIFCFLIINCCS---AARIFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYSVFQDN 58
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN-- 127
++ + K N++G+ T LG N +DLTNEEF+ Y G V T+K +
Sbjct: 59 MDIVAKWNQKGSNTI-LGLNVMADLTNEEFKKLYLGTKANV-----------TYKKKTLV 106
Query: 128 -VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
V+ +P S+DWR GAVT +KNQG CG C+AFS +VEGI +IT +L+ LSEQQ++DC
Sbjct: 107 GVSGLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDC 166
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S NNGC GGLM +FEYII GL TEA YPY E G C K +K ATI Y+++
Sbjct: 167 SGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYTGEVGKC-KFNKKNIGATITGYKNV 225
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV-LNAECGDN-CDHGVAVVGFGTAEE 302
G E L AV QPVSV ++AS +F+ Y GV EC DHGV VG+G+
Sbjct: 226 ESGSESDLQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYGS--- 282
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
+ G YW++KNSWG WGE+G+I + R+ + CGIAT AS+P A
Sbjct: 283 QSGQDYWIVKNSWGADWGENGFILMARNKDNNCGIATMASFPTA 326
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 148/354 (41%), Positives = 213/354 (60%), Gaps = 22/354 (6%)
Query: 2 VLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAM 61
+ K +++ +IP + +++ + R +P + + W + H + Y E E+
Sbjct: 100 LRKLQRNQVIP------VTKENSTETLHCRWQVDPELDGHWQLWKSWHRKDYH-EREEGW 152
Query: 62 RLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSS 118
R ++++NL+ IE N + G +YKLG N+F D+T EEFR GY V S +
Sbjct: 153 RRVVWEKNLKMIEIHNLDHALGKHSYKLGMNQFGDMTTEEFRQLMNGY---VHKKSERKY 209
Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
R S F N + P S+DWREKG VT +K+QG CGSCWAFS A+EG GKL+ L
Sbjct: 210 RGSQFLEPNFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSL 269
Query: 179 SEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAA 235
SEQ LVDCS N GC+GGLMD+AF+Y+ +N G+ +E YPY ++ C + E AA
Sbjct: 270 SEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAA 329
Query: 236 ATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGV 292
G + D+P+G E AL++AV PVSV ++A +F+FY+ G+ +C ++ DHGV
Sbjct: 330 NDTG-FVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGV 388
Query: 293 AVVGFG-TAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
VVG+G E+ DG KYW++KNSWGE WG+ GYI + +D + CGIAT ASYP+
Sbjct: 389 LVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 442
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 190/313 (60%), Gaps = 21/313 (6%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
E W G++Y D +E+ R +++ N ++ N G +Y LG N F+DLT+EEF+
Sbjct: 31 EAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKRF 90
Query: 103 YTG----YNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
Y G NRP +S+ STF NV +P S+DWR G VT +K+QG CGSCW+
Sbjct: 91 YLGTKVDLNRP------RSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWS 144
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEA 215
FS +VEG G+L+ LSEQ LVDCS N GC+GGLMD AF+YII NKG+ TEA
Sbjct: 145 FSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEA 204
Query: 216 DYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRF 274
YPY + GTC K AT+ ++D+ +G E L AV T PVSV ++AS +F+
Sbjct: 205 SYPYTAKDGTC-KFNAANVGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQL 263
Query: 275 YKRGVLN-AECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-E 331
Y GV N +C + DHGV G+GT+ +G YWL+KNSWG +WG++GYI + R+
Sbjct: 264 YTSGVYNEKKCSSTSLDHGVLAAGYGTS---NGTPYWLVKNSWGSSWGQAGYIWMSRNAN 320
Query: 332 GLCGIATEASYPV 344
CGIAT ASYP+
Sbjct: 321 NQCGIATSASYPI 333
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 149/307 (48%), Positives = 194/307 (63%), Gaps = 18/307 (5%)
Query: 48 QHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYT 104
QHGR Y+ E+ R IFKQNL+YIE+ NK+ G ++Y LG N+F+D+ NEEFR Y
Sbjct: 48 QHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFRM-YN 106
Query: 105 GYNRPVP-SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAA 163
G R S Q S T +Y P +DWR+KG VT +KNQG CGSCW+FS +
Sbjct: 107 GLRRDYNYSREVQCSNHLTPEY---LVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGS 163
Query: 164 VEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
+EG GKL+ LSEQQLVDCS N GC+GGLMD+AFEYII N G+ TE +YPY
Sbjct: 164 LEGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPYDA 223
Query: 222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGVL 280
Q C +K + AA G D+ GDE L +V + PVS+ ++AS Q+F+ Y GV
Sbjct: 224 RQERCHFKKSEVAATASGCV-DVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVY 282
Query: 281 N-AECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIA 337
+ +C DHGV VVG+GT +DG YWL+KNSWG TWG GY+++ R+ + CG+A
Sbjct: 283 DEPKCSSTELDHGVLVVGYGT---DDGQDYWLVKNSWGTTWGLEGYVKMSRNQDNQCGVA 339
Query: 338 TEASYPV 344
T+ASYP+
Sbjct: 340 TQASYPL 346
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 143/322 (44%), Positives = 197/322 (61%), Gaps = 26/322 (8%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTN 96
E+W A +H + Y E+E R+ I+ +N I K N+ +G +YKL N+++D+ +
Sbjct: 25 EEWSAFKLEHSKRYDSEVEDKFRMKIYLENKHRIAKHNQRFEQGAVSYKLRPNKYADMLS 84
Query: 97 EEFRASYTGYNRPV--PSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
EF G+N+ + P + SRP+TF P +DWR+KGAVT +K+QG
Sbjct: 85 HEFVHVMNGFNKTLKHPKAVHGKGRESRPATFIAPAHVTYPDHVDWRKKGAVTEVKDQGK 144
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENK 209
CGSCWAFS A+EG G L+ LSEQ L+DCS NNGC+GGLMD AF+YI +N
Sbjct: 145 CGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNG 204
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEAS 268
G+ TE YPY+ C + + A +G + D+P+GDE L+QAV T PVSV ++AS
Sbjct: 205 GIDTEKAYPYEGVDDKCRYNAKNSGADDVG-FVDIPQGDEEKLMQAVATVGPVSVAIDAS 263
Query: 269 GQAFRFYKRGVLNAECGDNC-----DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
++F+FY GV E NC DHGV VVG+GT +E G YWL+KNSWG TWG+ G
Sbjct: 264 QESFQFYSDGVYYDE---NCSSTDLDHGVMVVGYGT--DEQGGDYWLVKNSWGRTWGDLG 318
Query: 324 YIRILRDE-GLCGIATEASYPV 344
YI++ R++ CGIA+ ASYP+
Sbjct: 319 YIKMARNKNNHCGIASSASYPL 340
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 136/296 (45%), Positives = 181/296 (61%), Gaps = 21/296 (7%)
Query: 62 RLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNR-----PVPSV 113
RL +F+ NL YI+ N E G ++LG F+DLT EE+RA +R V V
Sbjct: 92 RLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVV 151
Query: 114 SRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGG 173
R+ P + +P ++DWRE+GAV +K+QG CG CWAFSAVAAVEGI +I G
Sbjct: 152 GRRRYLPLAGE-----QLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTG 206
Query: 174 KLIELSEQQLVDCST-DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEK 232
LI LSEQ+L+DC + GC GGLMD AF ++I+N G+ TEADYP+ GTCD + +
Sbjct: 207 SLISLSEQELIDCDKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKN 266
Query: 233 AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGV 292
+I +E +P E AL +AV QPVS +EAS +AF+ Y G+ + CG DHGV
Sbjct: 267 TRVVSIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGV 326
Query: 293 AVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEGL----CGIATEASYPV 344
VVG+G+ E G YW++KNSWG WGE+GY+R+ R+ + GIA E YPV
Sbjct: 327 TVVGYGS---EGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPV 379
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 200/341 (58%), Gaps = 21/341 (6%)
Query: 11 IPMFVIIILVI----TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIF 66
+ +F+I+ LVI CA+ + ++ S + WM +H + Y E + F
Sbjct: 3 LAVFLIVSLVILSINVCAATNLFSAQTYQTSFL----GWMKKHNKAYHHH-EFNDKYQTF 57
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
K N+++I N + + T LG N F+DLTNEE++ +Y G + V + Q + ++
Sbjct: 58 KDNMDFIHNWNSKESDTV-LGLNRFADLTNEEYKKTYLGMSINVNLRANQVPM-NGLNFE 115
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
T P+SIDWR+ GAV ++K+QGHCGSCWAF+ AVEG QI G ++ SEQ LVDC
Sbjct: 116 RFTG-PSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDC 174
Query: 187 S--TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S NNGC GGLM AF+YII+N G+ATE YPY Q C A I Y+D+
Sbjct: 175 SGRYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTATQNRCVYNTTMLGTA-ISGYKDV 233
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN-AECGD-NCDHGVAVVGFGTAEE 302
P+G E AL A++KQPV+V ++AS F+ YK GV A C +HGV VG+GT E
Sbjct: 234 PRGSESALTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHGVLAVGYGTLEG 293
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASY 342
+D Y+++KNSW ETWG GYI + R+ CGIAT ASY
Sbjct: 294 KD---YYIVKNSWAETWGNQGYILMARNANNHCGIATMASY 331
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 149/342 (43%), Positives = 209/342 (61%), Gaps = 15/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M +++ LV C VS + + + + W H ++Y E E+ R T++++NL+
Sbjct: 1 MNLLVCLVSLCWGLAVSA-PLGDSELDRHWKLWKNWHQKSYH-EAEEGWRRTVWEENLKA 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I+ N E G TY+LG N+F DLTNEEF+ TG R +R + S F N
Sbjct: 59 IQLHNLEQSLGLHTYRLGMNQFGDLTNEEFQEILTG-ERHFSKGNRING--SAFLEANFV 115
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-- 187
VPTS+DWR+ G VT +KNQGHCGSCWAFS A+EG G+LI LSEQ LVDCS
Sbjct: 116 QVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSWQ 175
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC GG++D AF+YI++N+G+ +E YPY + K + A A + + D+P
Sbjct: 176 QGNQGCHGGIVDLAFQYILQNQGIDSEDCYPYTAKDTAQCTFKPECATAPVTGFVDIPPH 235
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAEC-GDNCDHGVAVVGFG-TAEEE 303
E AL++AV T PVSV ++AS +FRFY+ G+ + +C ++ DH V VVG+G E+E
Sbjct: 236 SEEALMKAVATVGPVSVGIDASSTSFRFYQSGIFYDPKCSSESLDHAVLVVGYGYEREDE 295
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYPV 344
G KYW++KNSWG+ WG+ GY+ + +D G CGIAT ASYP+
Sbjct: 296 AGKKYWIVKNSWGKHWGDRGYVYMSKDRGNHCGIATVASYPL 337
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 124/230 (53%), Positives = 161/230 (70%), Gaps = 12/230 (5%)
Query: 123 FKYQNVT--DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
F+Y+NV+ +P +IDWR GAVT IK+QG CG CWAFSAVAA EGI +I+ GKLI LSE
Sbjct: 6 FRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSE 65
Query: 181 QQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
Q+LVDC ++ GC GGLMD AF++II+N GL TE++YPY G C + +AA I
Sbjct: 66 QELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKC--KSGSNSAANI 123
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
YED+P DE AL++AV QPVSV V+ F+FY GV+ CG + DHG+A +G+G
Sbjct: 124 KGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG 183
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+ DG KYWL+KNSWG TWGE+GY+R+ +D +G+CG+A E SYP
Sbjct: 184 --KTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYPT 231
>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
Length = 333
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 146/338 (43%), Positives = 204/338 (60%), Gaps = 20/338 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
++L + C + S + S+ + E W A H + Y D E+ R ++K+N++ IE
Sbjct: 5 LLLTVLCLG-IASAAPKFDHSLNTQWELWKAVHRKPY-DLNEEGWRKAVWKKNMKMIELH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N+E G ++ + N F DLT+EEFR G+ R +++ + F +P
Sbjct: 63 NQEYSQGKHSFSMAMNAFGDLTSEEFRQMMNGFQR------QENKKGKVFHETIFASIPP 116
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NN 191
S+DWREKG VT +KNQG CGSCWAFS A+EG GKL+ LSEQ LVDCS N
Sbjct: 117 SVDWREKGYVTPVKNQGKCGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSQPEGNR 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLMD AF+Y+++ GL +E YPY GTC+ + +AA G + DLPK E+A
Sbjct: 177 GCHGGLMDNAFQYVLDVGGLDSEESYPYTGLVGTCNYNPKNSAANETG-FVDLPK-QENA 234
Query: 252 LLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEEDGAK 307
L++AV T P+SV V+AS +F+FYK G+ +C ++ DHGV VVG+G + D K
Sbjct: 235 LMKAVATLGPISVAVDASNPSFQFYKSGIYYEPKCKSESVDHGVLVVGYGFEGADSDDNK 294
Query: 308 YWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
YWL+KNSWG+ WG +GYI++ +D+ CGIAT ASYP
Sbjct: 295 YWLVKNSWGKHWGINGYIKMAKDQNNHCGIATMASYPT 332
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 198/337 (58%), Gaps = 22/337 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ + L+ C ++ + + E S W H + Y E E+ +R I+K N+ I
Sbjct: 4 LIFVSLITLCFGYIIE-KPIRESSWY----VWKMAHNKAYSHESEENVRYAIWKDNMNRI 58
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
+ N + ++ L N F D+TN EFRA G + + STF + T P
Sbjct: 59 TEYNSK-SKNVILRMNHFGDMTNTEFRAKMNGL------LLHKHQNGSTFLVPSHTAAPD 111
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NN 191
++DWR +G VT +KNQG CGSCWAFS+ A+EG G+L+ LSEQ LVDCSTD NN
Sbjct: 112 AVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQHFKKTGRLVSLSEQNLVDCSTDYGNN 171
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC+GGLMD AF YI N G+ TE YPY+ + GTC K A G + D+P+GDE A
Sbjct: 172 GCNGGLMDNAFSYIKANGGIDTETGYPYEGQDGTCRYSKSSIGADDTG-FVDIPEGDEDA 230
Query: 252 LLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECGDNC-DHGVAVVGFGTAEEEDGAKY 308
L QAV T PVSV ++AS +F+FY GV + +C + DHGV VVG+GT ++G Y
Sbjct: 231 LKQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQCSPSALDHGVLVVGYGT---DNGKDY 287
Query: 309 WLIKNSWGETWGESGYIRILR-DEGLCGIATEASYPV 344
WL+KNSWG WG GYI + R ++ CGIA++ASYP+
Sbjct: 288 WLVKNSWGTGWGTEGYIYMSRNNQNQCGIASKASYPL 324
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 144/337 (42%), Positives = 199/337 (59%), Gaps = 19/337 (5%)
Query: 20 VITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE 79
++ C V + H+ + + + A HG+ Y + E+ RL I+ +N I + N++
Sbjct: 5 IVLCCLFVTAAAITHQELVGAEWSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHNEK 64
Query: 80 GNRT---YKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSS---RPSTFKYQNVTDVPT 133
++ YKL NEF DL + EF ++ G+ R R+ S P F+ +P
Sbjct: 65 YAKSQVSYKLAMNEFGDLLHHEFVSTRNGFKRNYRDSPREGSFFVEPEGFE---DLQLPK 121
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NN 191
++DWR+KGAVT +KNQG CGSCWAFS ++EG KL+ LSEQ LVDCS NN
Sbjct: 122 TVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNN 181
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLMD AF+YI NKG+ TE YPY G C + A G + D+P+GDE+
Sbjct: 182 GCEGGLMDNAFKYIKSNKGIDTEWSYPYNATDGVCHFNRSDVGATDTG-FVDIPEGDENK 240
Query: 252 LLQAVTK-QPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDGAKY 308
L +AV PVSV ++AS ++F+FY GV + EC + DHGV VVG+GT +DG Y
Sbjct: 241 LKKAVAAVGPVSVAIDASHESFQFYSEGVYDEPECSSEQLDHGVLVVGYGT---KDGQDY 297
Query: 309 WLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
WL+KNSWG TWG+ GYI + R+ + CGIA+ ASYP+
Sbjct: 298 WLVKNSWGTTWGDEGYIYMTRNKDNQCGIASSASYPL 334
>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
Length = 331
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 203/340 (59%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++ + + C+S + R +P++ + W + + YK++ E+ R I+++NL++
Sbjct: 1 MKWLLWVALVCSSAMA--RLHKDPTLDNHWDLWKKTYSKQYKEKNEEVARRLIWEKNLKF 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y L N D+T+EE + + VPS Q R TFK
Sbjct: 59 VMLHNLEHSMGMHSYDLSMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNVTFKSNPNQ 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCS +
Sbjct: 114 KLPDSLDWREKGCVTDVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGE 173
Query: 190 ---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GG M +AF+YII+N G+ +EA YPY+ G C + K AAT KY +LP
Sbjct: 174 KYSNKGCNGGFMTRAFQYIIDNNGIDSEASYPYKATDGKC-QYDPKNRAATCSKYTELPY 232
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEED 304
G E AL +AV K PVSV ++AS +F YK GV + C DN +HGV VVG+G +
Sbjct: 233 GSEDALKEAVANKGPVSVGIDASRPSFFLYKSGVYYDPSCTDNVNHGVLVVGYGNL---N 289
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
G YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 290 GKDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIASFPSYP 329
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 147/346 (42%), Positives = 200/346 (57%), Gaps = 16/346 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ F + V+ S V+ S ++ I E+ E + Q + Y E+E+ R+ +F N
Sbjct: 1 MKAFAFLCCVLIYHSNSVTAVSFNDL-IAEEWELFKTQFSKAYNTEIEEKFRMKVFMDNK 59
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
I + NK G +Y+L N F DL + EF + GY + V+ TF
Sbjct: 60 HKIARHNKLFQNGEVSYELEMNHFGDLLHHEFVKTVNGYRHSLRRVTGDEIDSVTFIPAY 119
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
VP S+DWR +GAVT +KNQG CGSCWAFS ++EG +L LSEQ L+DCS
Sbjct: 120 NVTVPDSVDWRTEGAVTEVKNQGQCGSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCS 179
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
NNGCSGGLMD AF YI NKG+ TE YPY+ C + K + + AT + D+P
Sbjct: 180 GKYGNNGCSGGLMDNAFAYIKSNKGIDTEQSYPYEGIDDKC-RYKPQESGATDKGFVDIP 238
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGD---NCDHGVAVVGFGTA 300
+GDE L AV T P+SV ++AS Q+F+FYK+GV + CG+ + DHGV VG+GT
Sbjct: 239 QGDEEKLKLAVATVGPISVAIDASHQSFQFYKKGVYYDKGCGNGEEDLDHGVLAVGYGT- 297
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPVA 345
E+G YWL+KNSWG+ WG GYI++ R++ CGIAT ASYP+
Sbjct: 298 --ENGKDYWLVKNSWGKRWGLDGYIKMARNKHNHCGIATSASYPLV 341
>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
Length = 336
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 138/341 (40%), Positives = 204/341 (59%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR + GY ++S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD----PNRTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGFVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I + D+P+G+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVAVVGFG-TAEEED 304
E AL+ AV PVSV ++AS Q+ +FY+ G+ A DH V VVG+G +
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
G +YW++KNSW + WG+ GYI + +D+ CG+AT ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATSASYPL 335
>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
Length = 336
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 148/343 (43%), Positives = 203/343 (59%), Gaps = 19/343 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + + ++ C S V + ++ + + +QW H + Y E E+ R ++++NL+
Sbjct: 1 MRLCLAVLAVCLSTVSAAPTV-DRELDGHWQQWKEWHNKDYH-EKEEGWRRMVWEKNLKK 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +Y+L N F D+ +EEFR GY V + R S F N
Sbjct: 59 IELHNLEHSLGKHSYRLAMNHFGDMPHEEFRQVMNGYKHKVRKI-----RGSLFMEPNFL 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+ P+ +DWREKG VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 114 EAPSKLDWREKGYVTPVKDQGQCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRP 173
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD+AF+YI +N GL TE YPY + C +AA G + D+P
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDTEKFYPYLGTDDQPCHYDPSYSAANDTG-FVDIPS 232
Query: 247 GDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEE 302
G EHAL++AVT PVSV ++A ++F+FY+ G+ A+C ++ DHGV VVG+G E
Sbjct: 233 GKEHALMKAVTAVGPVSVAIDAGHESFQFYQSGIYYEADCSSEDLDHGVLVVGYGYEGEN 292
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG GYI + +D CGIAT ASYP+
Sbjct: 293 VDGKKYWIVKNSWSEQWGNKGYIYMAKDRHNHCGIATAASYPL 335
>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
Length = 331
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/341 (42%), Positives = 206/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ +++ C+S + +H +++H + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MKCLVWVLLLCSSAMAQ---LHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
+ N E G +Y LG N D+T+EE + + VPS Q R T+K
Sbjct: 58 TVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVR--VPS---QWPRNVTYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CGSCWAFSAV A+E ++ G+L+ LS Q LVDCST
Sbjct: 113 QKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ N GC+GG M +AF+YII+N G+ +EA YPY+ G C K K AAT +Y +LP
Sbjct: 173 EKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKC-KYDSKNRAATCSRYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
DE+AL +AV K PVSV ++A +F FY+ GV + C N +HGV VVG+G
Sbjct: 232 FADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNL--- 288
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYP 343
+G YWL+KNSWG +G+ GYIR+ R+ E CGIA SYP
Sbjct: 289 NGKDYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPSYP 329
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 143/348 (41%), Positives = 215/348 (61%), Gaps = 22/348 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++ + +T S ++ S ++ ++E+ + + A+H + Y +++E+ R+ IF N +
Sbjct: 1 MKILFFIALTVLS--INAVSFYD-LVMEEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQK 57
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPV-PSVSRQSS-----RPSTF 123
I K N + G YKLG N++SD+ + EF ++ G+N+ + P R ++ + S F
Sbjct: 58 ITKHNTKYQRGEVGYKLGLNKYSDMLHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFF 117
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
+P +DW + GAVT +K+QGHCGSCWAFSA A+EG+ L+ LSEQ L
Sbjct: 118 IPPANVKLPKHVDWVKLGAVTPVKDQGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNL 177
Query: 184 VDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKY 241
+DCST+ NNGC+GGLMD+AF+Y+ N G+ TE YPY+ C + E + A G Y
Sbjct: 178 IDCSTEEGNNGCNGGLMDQAFQYVRINGGIDTERSYPYEGNNDVCRYEPENSGAIDTG-Y 236
Query: 242 EDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGD---NCDHGVAVVG 296
D+P GDE AL AV T PVSV ++AS ++F+ Y GV C + + DHGV VVG
Sbjct: 237 TDVPLGDEDALKSAVATVGPVSVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVG 296
Query: 297 FGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYP 343
+GT +EE YWL+KNSWG++WGE+GYI++ R+ + CGIAT+ S+P
Sbjct: 297 YGT-DEETQQDYWLVKNSWGDSWGENGYIKMARNADNQCGIATQPSFP 343
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 201/341 (58%), Gaps = 20/341 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
++ ++LV C VVS SM E QW +HG+ Y + E+A R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFIDFDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF-KYQNVT 129
K N + G+ TY LG N+F+DL NEEF + G+ S +++R STF NV
Sbjct: 60 IKHNLKYDLGHFTYDLGMNQFADLKNEEFVSLMNGFRGN----SSKATRGSTFLPPSNVF 115
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
D+PT +DWR KG VT +KNQ CGSCWAFSA ++EG GKL+ LSEQ LVDCS
Sbjct: 116 DMPTMVDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGK 175
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC GGLMD+AF+YI++ G+ TE YPY G C K A G Y D+ G
Sbjct: 176 EGNMGCEGGLMDQAFQYILDVGGIDTEMSYPYTAMDGQCHFNKANIGATDTG-YTDVTTG 234
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAEEED 304
E AL AV P+SV ++AS Q+F+ YK GV N C DHGV VG+GT+ D
Sbjct: 235 SESALQMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTLLDHGVLAVGYGTS--SD 292
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
G Y+ +SWG WG +GY+ + R+ + CGIAT+ASYP+
Sbjct: 293 GTDYFFFFHSWGAAWGMNGYLWMSRNKDNQCGIATKASYPL 333
>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
Length = 330
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 139/317 (43%), Positives = 187/317 (58%), Gaps = 14/317 (4%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
H+P + WM H ++Y +E E R ++++N +I++ N++ N +Y L N+F D
Sbjct: 23 HDP-LTGVFADWMRTHTKSYSNE-EFVFRWNVWRENYNFIQEENRK-NNSYYLTMNKFGD 79
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
LTN EF Y G + + +P + DWR+KGAVTH+KNQG CG
Sbjct: 80 LTNAEFNKVYKG--LAFDYSAHILKAKAATPAAPAPGLPANFDWRQKGAVTHVKNQGQCG 137
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGL 211
SCW+FS + EG + G L+ LSEQ L+DCS NNGC+GGLMD AFEYII NKG+
Sbjct: 138 SCWSFSTTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGI 197
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQA 271
TEA YPY+ Q C + + ++ Y D+ GDE+ALL AV +P SV ++AS +
Sbjct: 198 DTEASYPYETAQYNC-RYNPANSGGSLTSYTDVSSGDENALLNAVAIEPTSVAIDASHNS 256
Query: 272 FRFYKRGV-LNAECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
F+FY GV + C DHGV VG+GT E+G YWL+KNSWG WG GYI++ R
Sbjct: 257 FQFYSGGVYYESSCSSTQLDHGVLAVGWGT---ENGQDYWLVKNSWGADWGLQGYIKMAR 313
Query: 330 D-EGLCGIATEASYPVA 345
+ CGIAT ASYP A
Sbjct: 314 NRHNNCGIATAASYPTA 330
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 135/316 (42%), Positives = 190/316 (60%), Gaps = 20/316 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I + E++ A+ G +Y E E+A R +F QN++ I + N +G+ TY LG N+F+DLT E
Sbjct: 15 IDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGH-TYTLGVNQFADLTVE 73
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD---VPTSIDWREKGAVTHIKNQGHCGS 154
EF +Y G+ +P Q + + ++V + +PTS+DW +GAVT +KNQG CGS
Sbjct: 74 EFSKTYMGFKKPA-----QKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGS 128
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLA 212
CW+FS ++EG +I+ GKL+ LSEQQ VDC+ N GC+GGLMD AF+Y E L
Sbjct: 129 CWSFSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKY-AEANALC 187
Query: 213 TEADYPYQQEQGTCDKQ--KEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQ 270
TE YPY+ G+C A ++ Y+D+ E ++ AV +QPVS+ +EA
Sbjct: 188 TEQSYPYKGTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKS 247
Query: 271 AFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
F+ Y GVL CG + DHGV VG+GT G YW +KNSWG TWG SGY+ + R
Sbjct: 248 VFQLYSGGVLTGACGASLDHGVLAVGYGTL---SGTDYWKVKNSWGSTWGMSGYVLLQRG 304
Query: 331 E---GLCGIATEASYP 343
+ G CG+ +E SYP
Sbjct: 305 KGGSGECGLLSEPSYP 320
>gi|357627452|gb|EHJ77132.1| cathepsin L-like protease [Danaus plexippus]
Length = 341
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 207/344 (60%), Gaps = 23/344 (6%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIE 74
IL++ CA V +G ++ +V E+W +H + Y E E+ R+ I+ +N +
Sbjct: 3 ILLVLCAV-VAAGTAVSFFDLVR--EEWNTFKLEHKKQYDSETEEKFRMKIYAENKHKVA 59
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS-----VSRQSSRPSTFKYQ 126
K N+ +G +Y+L TN++SD+ + EF + G+N+ V R +TF
Sbjct: 60 KHNQRYQKGLVSYRLKTNKYSDMLHHEFVNTMNGFNKTVKHNKGLYAKGNDIRGATFVSP 119
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
P ++DWR+ GAVT +K+QG CGSCW+FS A+EG G L+ LSEQ L+DC
Sbjct: 120 ANVAAPPTVDWRQHGAVTPVKDQGKCGSCWSFSTTGALEGQHFRKSGFLVSLSEQNLIDC 179
Query: 187 ST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S+ NNGC+GGLMD AF+YI +N G+ TE YPY+ C + + A +G + D+
Sbjct: 180 SSAYGNNGCNGGLMDNAFKYIKDNDGIDTEKTYPYEAVDDKCRYNPKNSGAEDVG-FVDI 238
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAE--CGDNCDHGVAVVGFGTAE 301
P GDEH L+ A+ T PVSV ++AS ++F+ Y GV E +N DHGV VVG+GT
Sbjct: 239 PAGDEHKLMLALATVGPVSVAIDASQESFQLYSDGVYYDENCSSENLDHGVLVVGYGT-- 296
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
+EDG YWL+KNSWG +WG+ GYI++ R+ + CGIA+ ASYP+
Sbjct: 297 DEDGGDYWLVKNSWGPSWGDEGYIKMARNRDNHCGIASSASYPL 340
>gi|351705687|gb|EHB08606.1| Cathepsin S [Heterocephalus glaber]
Length = 331
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 143/335 (42%), Positives = 201/335 (60%), Gaps = 19/335 (5%)
Query: 19 LVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN 77
L C + ++G + + +++ H W +G+ Y+++ E+ +R I+++NL+++ N
Sbjct: 4 LAWVCVTCSLAGAQLQQDPMLDYHWHLWKKTYGKHYQEKNEEQVRRLIWEKNLKFVMLHN 63
Query: 78 KE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
E G +Y LG N D+T+EE R+ + P RQ R T+K +P S
Sbjct: 64 LEHSMGMHSYDLGMNHLGDMTSEEVRSLMSSLRVP-----RQWLRNVTYKSDPNQKLPDS 118
Query: 135 IDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---NN 191
+DWREKG VT +K QG CGSCWAFSAV A+EG ++ GKL+ LS Q LVDCST+ N
Sbjct: 119 VDWREKGCVTEVKYQGACGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEKYRNK 178
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GCSGG M +AF+Y+I+N G+ +E YPY+ C K AAT +Y +LP G E A
Sbjct: 179 GCSGGFMTEAFQYVIDNNGIDSETSYPYKATDEKC-HYDSKNRAATCSRYTELPYGSEEA 237
Query: 252 LLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
L +AV K PVSV V+AS +F YK GV + C N HGV VG+G +G YW
Sbjct: 238 LKEAVANKGPVSVAVDASRPSFFLYKNGVYDDPSCTQNVTHGVLAVGYGNL---NGKDYW 294
Query: 310 LIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
L+KNSWG +G+ GYIR+ R++G CGIA+ +SYP
Sbjct: 295 LVKNSWGLYFGDQGYIRMARNKGNHCGIASYSSYP 329
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 135/319 (42%), Positives = 188/319 (58%), Gaps = 31/319 (9%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT--YKLGTNEFS 92
E +VE +QW +H + Y E A+RL FK+NL+YI + N N + LG N F+
Sbjct: 44 EEQVVELFQQWKKEHQKFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNRFA 103
Query: 93 DLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
D++NEEF+ + K ++ D P S+DWR+KG VT +K+QG+C
Sbjct: 104 DMSNEEFKNKFIS------------------KVESCDDAPYSLDWRKKGVVTGVKDQGNC 145
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLA 212
GSCW+FS+ A+EG+ I G LI LSEQ+LVDC T N+GC GG MD AFE++I N G+
Sbjct: 146 GSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTTNDGCEGGYMDYAFEWVINNGGID 205
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
TEADYPY GTC+ KE+ TI Y D+ + D AL A KQP+SV ++ S F
Sbjct: 206 TEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQSDS-ALFCATVKQPISVGIDGSTLDF 264
Query: 273 RFYKRGVLNAECG---DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
+ Y G+ + +C D+ DH V +VG+G+ +D YW++KNSWG +WG G+I I R
Sbjct: 265 QLYTGGIYDGDCSSNPDDIDHAVLIVGYGSDGNQD---YWIVKNSWGTSWGIEGFIYIRR 321
Query: 330 DE----GLCGIATEASYPV 344
+ G+C I AS+P
Sbjct: 322 NTNLKYGVCAINYMASFPT 340
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 202/322 (62%), Gaps = 20/322 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ E + +H + Y+ + E+ R+ IF +N + I NK G++TYKLG N++ D+
Sbjct: 25 VMEEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDM 84
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD------VPTSIDWREKGAVTHIKN 148
+ EF G+ +++R F+ + + +P S+DWREKGAVT +K+
Sbjct: 85 LHHEFVNMMNGFRANTSGAGYKANR--GFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKD 142
Query: 149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYII 206
QG CGSCWAFSA A+EG G L+ LSEQ LVDCS+ NNGC+GGLMD AF+YI
Sbjct: 143 QGSCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIK 202
Query: 207 ENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCV 265
N G+ TE YPY+ E C A A G + D+ +G+E+AL +A+ T PVSV +
Sbjct: 203 VNGGIDTEKSYPYEAEDEPCRYNPANAGADDRG-FVDVREGNENALKKAIATIGPVSVAI 261
Query: 266 EASGQAFRFYKRGVL-NAEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
+AS +F+FY+ GV + +C +N DHGV VG+GT EDG YWL+KNSW ++WG+ G
Sbjct: 262 DASQDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTT--EDGQDYWLVKNSWSKSWGDQG 319
Query: 324 YIRILRDE-GLCGIATEASYPV 344
YI+I R++ +CGIA+ ASYP+
Sbjct: 320 YIKIARNQNNMCGIASAASYPL 341
>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
Length = 333
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 139/310 (44%), Positives = 188/310 (60%), Gaps = 16/310 (5%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEF 99
+ W QHG+ YK E+E+ R ++++NL+ I N E G TY LG N D+T EE
Sbjct: 31 QMWKKQHGKNYKTEVEELGRREVWERNLQLISLHNLEASMGMHTYDLGMNHMGDMTEEEI 90
Query: 100 RASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
S+ P + R+ PS F + T VP ++DWR+KG VT +KNQG CGSCWAFS
Sbjct: 91 LQSFASLKVPA-DLKRE---PSAFVASSGTPVPDTVDWRQKGYVTQVKNQGSCGSCWAFS 146
Query: 160 AVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADY 217
+V A+EG T GKL++LS Q LVDCS+ N GC+GG M +AF+Y+I+NKG+ ++ Y
Sbjct: 147 SVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKGIDSDTSY 206
Query: 218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYK 276
PYQ QGTC +A +Y LP+GDE L QAV P+SV ++A+ +F ++
Sbjct: 207 PYQGVQGTC-HYNPSYRSANCTRYSFLPEGDETTLKQAVAMIGPISVAIDATRPSFILWR 265
Query: 277 RGVLN-AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLC 334
GV N C +H V VVG+GT DG YWL+KNSWG +GE+GYIR+ R+ C
Sbjct: 266 SGVYNDLTCTQKINHAVLVVGYGTL---DGQDYWLVKNSWGTRFGENGYIRMSRNRNNQC 322
Query: 335 GIATEASYPV 344
GIA YP+
Sbjct: 323 GIALYGCYPI 332
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 144/317 (45%), Positives = 192/317 (60%), Gaps = 17/317 (5%)
Query: 36 PSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT--YKLGTNEFSD 93
PS + + H ++Y+D E+ +R IF+ NL IE+ N+ + LG NEF+D
Sbjct: 22 PSAEPHWNAFKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFAD 81
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
+TN EF G + + S F+ +V D+P +DW +KG VT +KNQG CG
Sbjct: 82 MTNTEFSNMLLGLGG-----RNKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCG 136
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGL 211
SCWAFS ++EG GKL+ LSEQ LVDCST N GC+GGLMD+AF YI +N G+
Sbjct: 137 SCWAFSTTGSLEGQVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGI 196
Query: 212 ATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQ 270
TEA YPY GTC + E AT+ + D+ GDE+AL +AV T P+SV ++AS
Sbjct: 197 DTEAAYPYTGSDGTC-RFLENKVGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSI 255
Query: 271 AFRFYKRGVLNA-ECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRIL 328
F+FY+ GV N C DHGV VVG+GT E G YWL+KNSWG +WG GYI+++
Sbjct: 256 FFQFYRGGVYNPWFCSSTELDHGVLVVGYGT---EGGKDYWLVKNSWGSSWGLKGYIKMV 312
Query: 329 RD-EGLCGIATEASYPV 344
R+ + CGIAT+ASYP
Sbjct: 313 RNKKNRCGIATQASYPT 329
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 210/343 (61%), Gaps = 22/343 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQN 69
I M ++++++ C+S + +H+ +++H + W +G+ YK++ E+ +R I+++N
Sbjct: 10 IIMKWLVLVLLGCSSAMAQ---LHKDPTLDRHWDLWKKTYGKQYKEKNEEGVRRLIWEKN 66
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
L+++ N E G +Y LG N D+T+EE A + VPS Q R T+K
Sbjct: 67 LKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTALMSSLR--VPS---QWQRNVTYKSN 121
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+P S+DWR+KG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDC
Sbjct: 122 PNQKLPDSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDC 181
Query: 187 ST---DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
S N GC+GG M +AF+YII+N G+ +EA YPY+ G C + K AAT +Y +
Sbjct: 182 SVGKYSNRGCNGGFMTEAFQYIIDNNGIESEASYPYKAMDGKC-QYDSKYRAATCSRYTE 240
Query: 244 LPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAE 301
LP+ E AL +AV K PVSV ++AS +F Y+ GV + C + +HGV VVG+G
Sbjct: 241 LPEDSEDALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDPACTLHVNHGVLVVGYGNL- 299
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G YWL+KNSWG +G+ GYIR+ R+ G CGIA+ ASYP
Sbjct: 300 --NGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYASYP 340
>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 338
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 146/342 (42%), Positives = 206/342 (60%), Gaps = 20/342 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ + +++ C S V + S +E H W H + Y E+ R ++++NL+ I
Sbjct: 4 LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKNYHAS-EEGWRRMVWEKNLKKI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +++LG N F D+TNEEFR + GY + + + + S F N
Sbjct: 61 EIHNLEHTMGKHSHRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P ++DWREKG VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQPFRKTGKLVSLSEQNLVDCSRPE 176
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD+AF+YI +N GL TE YPY ++ C + E +AA G + D+P G
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANETG-FVDIPSG 235
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
EHA+++AV PVSV ++A ++F+FY+ G+ EC + DHGV VVG+G E+
Sbjct: 236 KEHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDV 295
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG+ GYI + +D + CGIAT +SYP+
Sbjct: 296 DGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 148/344 (43%), Positives = 202/344 (58%), Gaps = 23/344 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNL 70
F++ + + SQ VS + + EQW A H + Y+ E E+ R+ IF +N
Sbjct: 3 FLVFVALCVVGSQAVSFFDLVQ-------EQWGAFKVTHKKQYESETEERFRMKIFMENA 55
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRP-VPSVSRQSSRPSTFKYQ 126
+ K NK +G ++KLG N++SD+ N EF + GYNR P S + TF
Sbjct: 56 HKVAKHNKLYAQGLVSFKLGVNKYSDMLNHEFVHTLNGYNRSKTPLRSGELDESITFIPP 115
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
++P IDWR+ GAVT +K+QG CGSCW+FS ++EG KL+ LSEQ L+DC
Sbjct: 116 ANVELPKQIDWRKLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDC 175
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S NNGC+GGLMD AF YI +N G+ TE YPY+ E C K + AT + D+
Sbjct: 176 SEKYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYKAEDEKC-HYKPRNKGATDRGFVDI 234
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAE 301
GDE L AV T P+SV ++AS F+ Y GV EC + DHGV VVG+GT
Sbjct: 235 ESGDEEKLKAAVATVGPISVAIDASHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGT-- 292
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
+EDG YWL+KNSWG++WG+ GYI++ R+ + CGIAT+ASYP+
Sbjct: 293 DEDGNDYWLVKNSWGDSWGDQGYIKMARNRDNNCGIATQASYPL 336
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 201/340 (59%), Gaps = 23/340 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ +F ++L+ + ++ P+ + +W H + Y + E+ +R TI+K N
Sbjct: 1 MKVFCALLLLGVTLAYII-----ERPTEDDSWIRWKMAHNKAYSHDGEETVRYTIWKDNE 55
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
I + N +G + L N+F D+TN EF+ + GY +S + STF N
Sbjct: 56 RRIREHNLQGG-DFLLEMNQFGDMTNNEFK-DFNGY------LSHKHVSGSTFLTPNSFV 107
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST-- 188
P S+DWR +G VT +K+QG CGSCWAFS ++EG GKL+ LSEQ LVDCST
Sbjct: 108 APDSVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAY 167
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC+GGLMD AF YI EN G+ +EA YPY + G C K AA G + D+P GD
Sbjct: 168 GNNGCNGGLMDNAFTYIKENNGIDSEASYPYTAKDGKCAFTKPNVAATDTG-FVDIPSGD 226
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAEEEDG 305
E+ L +AV P+SV ++AS +F+FY++GV N +C DHGV VVG+GT E G
Sbjct: 227 ENKLKEAVASVGPISVAIDASHFSFQFYRKGVYNERKCSSTELDHGVLVVGYGT---ESG 283
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YWL+KNSW +WG+ GYI++ R+ + CGIAT ASYP+
Sbjct: 284 KDYWLVKNSWNTSWGDKGYIKMSRNAKNQCGIATNASYPL 323
>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
Length = 337
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 196/340 (57%), Gaps = 21/340 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++LV C V +M + + E W HG+TY +E+E R ++++NL
Sbjct: 10 MLASLLLVSLC----VEAAAMLDVRLDVHWELWKKSHGKTYPNEVEDVRRRELWERNLML 65
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I K N E G +TY L N DLT EE SY P + R P+ F +
Sbjct: 66 ITKHNLEASMGLQTYDLSMNHMGDLTTEEIMQSYATLTPPA-DIQRA---PAPF-VGSGA 120
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
DVP S+DWR +G VT +K QG CGSCWAFSA A+EG T GKL++LS Q LVDCS
Sbjct: 121 DVPVSVDWRLQGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSLK 180
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GG MD+AF+Y+I+NKG+ +EA YPY+ + C AA +Y LP+G
Sbjct: 181 YGNKGCNGGFMDRAFQYVIDNKGIDSEASYPYRGQLQQC-SYNPSYRAANCSRYSFLPEG 239
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECGDNCDHGVAVVGFGTAEEEDG 305
DE AL A+ T P+SV ++A+ F FY+ GV N C +HGV VG+GT E G
Sbjct: 240 DEGALKNALATIGPISVAIDATRPTFAFYRSGVYNDPTCTQRVNHGVLAVGYGT---ESG 296
Query: 306 AKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYPV 344
YWL+KNSWG ++G+ GYIR+ R++ CGIA SYP+
Sbjct: 297 QDYWLVKNSWGTSFGDKGYIRMSRNKNDQCGIALYCSYPI 336
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 197/342 (57%), Gaps = 22/342 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
II V L++ C S + R + + WM +H ++Y ++ E R TIF+ N
Sbjct: 3 IILALVFCFLIVNCIS---AARVFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYTIFQDN 58
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS-TFKYQNV 128
++++ K N++G+ T LG N +DLTN+E++ Y G V +P+ +V
Sbjct: 59 MDFVTKWNQKGSDTI-LGLNSMADLTNQEYQRIYLGTKTTVK-------KPNLIIGVTDV 110
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+ P S+DWR GAVT +KNQG CG C++FS +VEGI +IT +L+ LSEQQ++DCS
Sbjct: 111 SKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSG 170
Query: 189 D--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
NNGC GGLM +FEYII GL TEA YPY+ G C K ATI Y+++
Sbjct: 171 SEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKAN-IGATITGYKNVKS 229
Query: 247 GDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVAVVGFGTAEEED 304
G E L AV QPVSV ++AS +F+ Y GV A DHGV VG+G+ +
Sbjct: 230 GSESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYGS---QS 286
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPVA 345
G YW++KNSWG WGE G+I + R++ CGIAT ASYP A
Sbjct: 287 GQDYWIVKNSWGADWGEKGFILMARNKHNNCGIATMASYPTA 328
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 148/348 (42%), Positives = 209/348 (60%), Gaps = 22/348 (6%)
Query: 6 EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLT 64
E+ + M ++++++ C+S + +H+ ++ H + W +G+ Y +E E+ R
Sbjct: 3 EQQTVQRMKWLLLVLLGCSSAMAQ---LHKDPTLDHHWDLWKKTYGKQYTEENEEVTRRF 59
Query: 65 IFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS 121
I+++NL+Y+ N E G +Y LG N +D+T+EE + VPS Q R
Sbjct: 60 IWEKNLKYVMLHNLEHSMGMHSYDLGMNHLADMTSEEVMLLMSSLR--VPS---QWQRNV 114
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
TFK +P S+DWR+KG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q
Sbjct: 115 TFKSNPNQKLPDSMDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQ 174
Query: 182 QLVDCST---DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
LVDCST N GC+GG M +AF+YII+N G+ +EA YPY+ G C + K AAT
Sbjct: 175 NLVDCSTGKYSNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKC-QYDVKNRAATC 233
Query: 239 GKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVG 296
KY +LP G+E AL +AV K PVSV ++AS +F Y+ GV + C N +HGV VG
Sbjct: 234 SKYVELPFGNEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDKACTLNVNHGVLAVG 293
Query: 297 FGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +G YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 294 YGNY---NGKDYWLVKNSWGLHFGEQGYIRMARNSGNHCGIASYPSYP 338
>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
Length = 336
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 137/340 (40%), Positives = 203/340 (59%), Gaps = 15/340 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
++ L++T + V S + + + W +QHG++Y +++E R+ I+++NL IE
Sbjct: 1 MMFALLVTLSISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
+ N E GN T+K+G N+F D+TNEEFR + GY Q+S+ F +
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD----PNQTSQGPLFMEPSFFAA 115
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-- 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I + D+P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235
Query: 250 HALLQAVTK-QPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVAVVGFG-TAEEEDG 305
AL+ AV PVSV ++AS Q+ +FY+ G+ A DH V VVG+G + G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
+YW++KNSW + WG+ GYI + +D+ CG+AT+ASYP+
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|323451555|gb|EGB07432.1| hypothetical protein AURANDRAFT_2413 [Aureococcus anophagefferens]
Length = 263
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 132/265 (49%), Positives = 170/265 (64%), Gaps = 8/265 (3%)
Query: 81 NRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS-VSRQSSRPSTFKYQNVTDVPTSIDWRE 139
N TYKLG NEFS + +EF A Y G + + R+ + T Q V V + +DW
Sbjct: 5 NSTYKLGHNEFSGMFWDEFVAQYVGDATGAKAYMERERNYDYTLAKQ-VDAVASDVDWVA 63
Query: 140 KGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMD 199
GAVT +KNQG CGSCW+FS A+EG +I G L LSEQ LVDC T ++GC+GGLMD
Sbjct: 64 SGAVTGVKNQGQCGSCWSFSTTGALEGAFEIAGNTLTSLSEQNLVDCDTTDSGCNGGLMD 123
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ 259
AF++I N G+ +EADY Y +GTC +K AT+ + D+P GDE AL AV
Sbjct: 124 NAFKWIQSNGGICSEADYAYTAAKGTCKTTCDK--VATLSGHTDVPSGDEDALKTAVAIG 181
Query: 260 PVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
PVS+ +EA F+ Y G+L++ CG N DHGV VVG+GT +DG++YW +KNSWG T
Sbjct: 182 PVSIAIEADKSVFQSYSSGILDSSACGTNLDHGVLVVGYGT---DDGSEYWKVKNSWGTT 238
Query: 319 WGESGYIRILRDEGLCGIATEASYP 343
WGESGY+RI R +CGIA+E SYP
Sbjct: 239 WGESGYVRIARGSNICGIASEPSYP 263
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 139/330 (42%), Positives = 188/330 (56%), Gaps = 14/330 (4%)
Query: 21 ITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEG 80
I A V +G + P + + ++G+ Y E A+R IFK N++ I N
Sbjct: 6 IAAAVLVAAGHEVPPPDYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNAR- 64
Query: 81 NRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
N T+ LG NEF+DLT EE ASYTG +P S+ R ST +Y N + +S+DW +
Sbjct: 65 NLTFALGVNEFTDLTQEELAASYTGL-KPA-SLWSGLPRLSTHEY-NGAPLASSVDWTTQ 121
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDK 200
G VT +KNQG CGSCW+FS A+EG ++ G L+ LSEQQ VDC T ++GC+GG MD
Sbjct: 122 GVVTPVKNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSEQQFVDCDTTDSGCNGGWMDN 181
Query: 201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG--KYEDLPKGDEHALLQAVTK 258
AF + +N + TE YPY GTC+ + G Y D+ E A++ AV +
Sbjct: 182 AFSFAKKNS-ICTEGSYPYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQ 240
Query: 259 QPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
QPVS+ +EA +F+ Y GVL A CG DHGV VG+G+ E G YW +KNSWG +
Sbjct: 241 QPVSIAIEADQYSFQLYSSGVLTASCGTRLDHGVLAVGYGS---EAGTDYWKVKNSWGSS 297
Query: 319 WGESGYIRILRDEGLCG----IATEASYPV 344
WGE GY+R+ R +G G +A SYPV
Sbjct: 298 WGEQGYVRLQRGKGGAGECGLLAGPPSYPV 327
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 200/340 (58%), Gaps = 20/340 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
++++ VI + VS + ++ E W H + Y +E+ +RL IF +N I
Sbjct: 6 ILLLSVIISTASAVSFFDV----VLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRIS 61
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
+ N E G TY + N + DL + EF A GY + +++ TF ++
Sbjct: 62 RHNAEAIQGRHTYFMKMNHYGDLLHHEFVAMVNGY-----IYNNKTTLGGTFIPSKNINL 116
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-- 189
P +DWRE+GAVT +KNQG CGSCW+FSA ++EG GKLI LSEQ LVDCS
Sbjct: 117 PEHVDWREEGAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYG 176
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
NNGC GGLMD AF+YI +N G+ TEA YPY+ G C + + IG + D+ KG E
Sbjct: 177 NNGCEGGLMDYAFKYIQDNNGIDTEASYPYEGIDGHCHYDPKNKGGSDIG-FVDIKKGSE 235
Query: 250 HALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECG-DNCDHGVAVVGFGTAEEEDGA 306
L +A+ T P+SV ++AS +F+FY GV + +C +N DHGV VG+GT +E G
Sbjct: 236 KDLQKALATVGPISVAIDASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGT-DEVTGE 294
Query: 307 KYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
YWL+KNSW E WGE GYI++ R+ + +CGIA+ ASYPV
Sbjct: 295 DYWLVKNSWSEKWGEDGYIKMARNKDNMCGIASSASYPVV 334
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 127/280 (45%), Positives = 176/280 (62%), Gaps = 12/280 (4%)
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
L +I++ N + NR+YK+G N+F+DLT EEFR++Y G+ S ++ + ++ +
Sbjct: 1 LRFIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFT----GGSNKTKVSNRYEPRVSQ 56
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P+ +DWR GAV IK+QG CG CWAFSA+A VEGI +I G LI LSEQ+L+ C
Sbjct: 57 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGT 116
Query: 190 NN--GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GG + F++II N G+ T +YPY + G C+ + TI Y ++P
Sbjct: 117 QNTRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYVTIDTYGNVPYN 176
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAK 307
+E AL AVT QPVSV ++A+G AF+ Y G+ CG DH V +VG+GT E G
Sbjct: 177 NEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT---EGGID 233
Query: 308 YWLIKNSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
YW+++NSW TWGE GY+RILR+ G CGIAT SYPV
Sbjct: 234 YWIVENSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 273
>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
Length = 335
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 202/340 (59%), Gaps = 15/340 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ +LV S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLYISAVFAAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR + GY +R S P F
Sbjct: 59 EQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGP-LFMEPKFFA 114
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPH 174
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I + D+PKG+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGN 234
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFG-TAEEEDG 305
E AL+ AV PVSV ++AS Q+ +FY+ G+ C DH V VVG+G + G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
+YW++KNSW + WG+ GYI + +D+ CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 146/312 (46%), Positives = 190/312 (60%), Gaps = 25/312 (8%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRASY 103
W A+HG++Y++ E+ +R ++ N +YI++ N+ G Y L N+F DL N EF++ Y
Sbjct: 25 WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFKSLY 84
Query: 104 TGYNRPVPSVSRQSSRPSTFK----YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
GY R S+ P K V D+P S+DW +KG VT +KNQG CGSCW+FS
Sbjct: 85 NGY--------RMSNAPRKGKPFVPAARVQDLPASVDWSKKGWVTPVKNQGQCGSCWSFS 136
Query: 160 AVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADY 217
A ++EG G L+ LSEQ LVDCS N+GC+GGLMD AFEY+I+N G+ TEA Y
Sbjct: 137 ATGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASY 196
Query: 218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYK 276
PY+ TC K ATI Y D+ K E L AV T PVSV ++AS +F+FY
Sbjct: 197 PYRAVDSTC-KFNTADVGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYS 255
Query: 277 RGVLNAE--CGDNCDHGVAVVGFGTAEEEDGAK-YWLIKNSWGETWGESGYIRILRDE-G 332
GV + N DHGV VG+GT DG+K YWL+KNSWG +WG SGYI ++R+
Sbjct: 256 SGVYDPLICSSTNLDHGVLAVGYGT----DGSKDYWLVKNSWGASWGMSGYIEMVRNHNN 311
Query: 333 LCGIATEASYPV 344
CGIAT ASYPV
Sbjct: 312 KCGIATSASYPV 323
>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
Length = 331
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 206/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M +I +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKQLICVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE + + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNAN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ N GC+GG M +AF+YII+N G+ ++A YPY+ C + K AAT KY +LP
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKC-QYDSKYRAATCSKYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
G E L + V K PVSV V+AS +F Y+ GV C N +HGV VVG+G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL--- 288
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
Length = 331
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 207/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE + + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ C + K AAT KY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC-QYDSKYRAATCSKYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
G E L +AV K PVSV V+A +F Y+ GV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|322799749|gb|EFZ20954.1| hypothetical protein SINV_06041 [Solenopsis invicta]
Length = 337
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 140/344 (40%), Positives = 205/344 (59%), Gaps = 23/344 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWM---AQHGRTYKDELEKAMRLTIFKQN 69
+ ++ L + Q VS + + +W H + YK +E+ R+ I+ N
Sbjct: 4 VVALLFLAVLAMGQTVSFNKILDA-------EWFIFKLHHNKVYKSPVEEGYRMKIYMDN 56
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
I + N++ TYKLG N++ D+ + EF + G+N+ V + + TF
Sbjct: 57 KRKIAEHNRKYELNEVTYKLGMNKYGDMLHHEFVNTLNGFNKSV--TAGIETEGVTFISP 114
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+P +DW ++GAVT +K+QGHCGSCWAFS+ A+EG + G L+ LSEQ L+DC
Sbjct: 115 ANVKLPDEVDWTKQGAVTAVKDQGHCGSCWAFSSTGALEGQHFRSTGYLVSLSEQNLIDC 174
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S NNGC+GGLMD AF+YI +NKGL TE YPY+ E C + + + AT Y D+
Sbjct: 175 SGKYGNNGCNGGLMDYAFQYIKDNKGLDTEKTYPYEAENDRC-RYNPRNSGATDKGYVDI 233
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAE 301
P+GDE L AV T P+SV ++AS ++F+ Y GV + +C +N DHGV +VG+GT +
Sbjct: 234 PQGDEEKLKAAVATIGPISVAIDASHESFQLYSEGVYYDPDCSAENLDHGVLIVGYGT-D 292
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
E G YWL+KNSWG+TWG+ GYI++ R++ CGIA+ ASYP+
Sbjct: 293 ETSGHDYWLVKNSWGKTWGQKGYIKMARNKNNHCGIASSASYPL 336
>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
Length = 331
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 145/341 (42%), Positives = 206/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE + + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 QMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ N GC+GG M +AF+YII+N G+ ++A YPY+ C + K AAT KY +LP
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKC-QYDSKYRAATCSKYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
G E L + V K PVSV V+AS +F Y+ GV C N +HGV VVG+G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL--- 288
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
Length = 332
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 207/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE + + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ C + K AAT KY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC-QYDSKYRAATCSKYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
G E L +AV K PVSV V+A +F Y+ GV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
Length = 214
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 120/215 (55%), Positives = 156/215 (72%), Gaps = 6/215 (2%)
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNG 192
S+DWR+KG VT IK+QG CG+CWAFSA+AAVEG+T ++ G L+ LSEQ+LVDC T N G
Sbjct: 1 SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQG 60
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHAL 252
C GG+MD AF+Y+I N G+ ++++YPY+ ++G CDK K K AATI ++ +P E L
Sbjct: 61 CDGGMMDYAFQYMIRNGGITSQSNYPYRAQRGACDKDKVKYHAATINGFQAIPPQSEELL 120
Query: 253 LQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIK 312
L+AV QPVSV +EA GQ F+ Y GV ECG N DHGVA+VG+GT + G +YWL+K
Sbjct: 121 LRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGT--DAGGRQYWLVK 178
Query: 313 NSWGETWGESGYIRILRD---EGLCGIATEASYPV 344
NSWG WGESGY+R+ R G+CGI +ASYP
Sbjct: 179 NSWGSGWGESGYVRMERQGPGAGVCGINLDASYPT 213
>gi|75067394|sp|Q9GKL8.1|CATL1_CERAE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
chain; Contains: RecName: Full=Cathepsin L1 light chain;
Flags: Precursor
gi|11493685|gb|AAG35605.1|AF201700_1 cysteine protease [Chlorocebus aethiops]
Length = 333
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 143/343 (41%), Positives = 204/343 (59%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
P F++ L + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAALCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C E + A G + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTG-FVDIPK 230
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEE 302
E AL++AV T P+SV ++A ++F FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D +KYWL+KNSWGE WG GYI++ +D CGIA+ ASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 144/341 (42%), Positives = 204/341 (59%), Gaps = 21/341 (6%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ + CA + + + + + E + + H +TYK +E+ +R IF +N +I K
Sbjct: 1 MLRFALLCAIVAAATAATSQEILRTEWEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAK 60
Query: 76 AN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF---KYQNVT 129
N +G +YKLG N+F+DL EF GY R + R ST+ N +
Sbjct: 61 HNVKYAKGLVSYKLGINQFADLLPHEFVKMMNGYQG-----KRLAGRGSTYLPPANLNDS 115
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P ++DWR+KGAVT +K+QG CGSCWAFS+ ++EG + GKL+ LSEQ LVDCS+
Sbjct: 116 SLPKTVDWRKKGAVTPVKDQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCSSA 175
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD +F YI N G+ TE YPY+ E G C +KE A G + D+ +G
Sbjct: 176 YGNQGCNGGLMDNSFNYIKANGGIDTEDSYPYEAEDGDCRYKKEDVGATDTG-FVDIKEG 234
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEED 304
E L +AV T PVSV ++AS Q+F+ Y GV + C ++ DHGV VG+G ++
Sbjct: 235 SEKDLQKAVATVGPVSVAIDASQQSFQLYSEGVYDEPNCSSESLDHGVLAVGYGV---KN 291
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
G KYWL+KNSW ETWG+ GYI + RD+ CGIA+ ASYP+
Sbjct: 292 GKKYWLVKNSWAETWGQDGYILMSRDKNNQCGIASSASYPL 332
>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
Length = 331
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 145/341 (42%), Positives = 206/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKQLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE + + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNAN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ N GC+GG M +AF+YII+N G+ ++A YPY+ C + K AAT KY +LP
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKC-QYDSKYRAATCSKYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
G E L + V K PVSV V+AS +F Y+ GV C N +HGV VVG+G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL--- 288
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
Length = 331
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 207/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE + + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ C + K AAT KY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDLKC-QYDSKYRAATCSKYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
G E L +AV K PVSV V+A +F Y+ GV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 143/344 (41%), Positives = 197/344 (57%), Gaps = 21/344 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
+ P FV+ L + +VS + ++ + +QW A HGR Y E+ R ++++N
Sbjct: 1 MTPSFVLAALCLG----IVSALPKLDQTLDAQWDQWKAAHGRLYGLN-EEGWRRAVWEKN 55
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
L IE N E G ++ LG N F D+TNEEFR G+ + P +
Sbjct: 56 LRMIELHNGEYSQGRHSFTLGMNHFGDMTNEEFRQVMNGFQHQKHKTGKMYQEPLLLQ-- 113
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+P S+DWREKG VT +KNQG CGSCWAFSA ++EG G L+ LSEQ LVDC
Sbjct: 114 ----LPKSVDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDC 169
Query: 187 S--TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S N GC+GGLMD AF+Y+ +NKGL E YPY + G C + E +AA G + D+
Sbjct: 170 SRPQGNQGCNGGLMDFAFQYVKDNKGLEAEKSYPYVGKDGECKYKPELSAANDTG-FVDV 228
Query: 245 PKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFGTAEE 302
P+ ++ T P+SV ++A Q+F+FYK G+ + C + +HGV +VG+GT
Sbjct: 229 PQREKVVQKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGCSSRDLNHGVLLVGYGTDAS 288
Query: 303 EDG-AKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
E G YWLIKNSWG TWG GY++I R+ CG+AT ASYP+
Sbjct: 289 ETGKGDYWLIKNSWGTTWGADGYVKIARNRNNHCGVATAASYPL 332
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 140/309 (45%), Positives = 192/309 (62%), Gaps = 17/309 (5%)
Query: 47 AQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASY 103
A+HG++Y E E+ RL I+ +N I K N++ G Y + NEF D+ + EF ++
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
G+ R R+ S + + +N+ D +P ++DWR KGAVT +KNQG CGSCWAFSA
Sbjct: 92 NGFKRNYKDQPREGS--TYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSAT 149
Query: 162 AAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPY 219
++EG G ++ LSEQ LV CSTD NNGC GGLMD AF+YI NKG+ TE YPY
Sbjct: 150 GSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPY 209
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRG 278
GTC +K A G + D+ +G E L +AV T P+SV ++AS ++F+FY G
Sbjct: 210 NGTDGTCHFKKSTVGATDSG-FVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268
Query: 279 VLN-AEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCG 335
V + EC ++ DHGV VVG+GT +G YW +KNSWG TWG+ GYIR+ R+ + CG
Sbjct: 269 VYDEPECDSESLDHGVLVVGYGTL---NGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCG 325
Query: 336 IATEASYPV 344
IA+ AS P+
Sbjct: 326 IASSASIPL 334
>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
Length = 331
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 149/341 (43%), Positives = 208/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE S T R VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEV-MSLTSSLR-VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ C + K AAT KY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC-QYDSKYRAATCSKYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
G E L +AV K PVSV V+A +F Y+ GV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 131/291 (45%), Positives = 176/291 (60%), Gaps = 12/291 (4%)
Query: 62 RLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS 121
R I+ NL + + N + ++ L ++DL+ +E+R+ GYN + ++ R +
Sbjct: 71 RFNIWLDNLRFAHEYNAR-HTSHWLSMGVYADLSQDEYRSKALGYNAHLHK--KRPLRAA 127
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
F Y+ T P +DW GAVT +K+Q CGSCWAFS AVEG I GKL+ LSEQ
Sbjct: 128 PFLYKG-TVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIATGKLVSLSEQ 186
Query: 182 QLVDCSTD-NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGK 240
LVDC + + GC GG MD AF++I+ N G+ TE DYPY+ E G C + + TI
Sbjct: 187 MLVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRAEDGICQDNRTRRHVVTIDG 246
Query: 241 YEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTA 300
Y+D+P DE+AL++AV QPVSV +EA AF+ Y GV +AECG DH V VVG+GTA
Sbjct: 247 YQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTALDHAVLVVGYGTA 306
Query: 301 EE-EDGAKYWLIKNSWGETWGESGYIRILRD------EGLCGIATEASYPV 344
YWL+KNSWG WGE GYIR+LR+ EG CG+A AS+P+
Sbjct: 307 SNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYASFPI 357
>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
Length = 322
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 200/329 (60%), Gaps = 37/329 (11%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
SQ +++E SIV+ H+QWM Q R Y+DE EK MRL +FK+NL++IE N GN++Y
Sbjct: 21 SQARPHVTLNEQSIVDYHQQWMTQFSRVYQDESEKEMRLQVFKKNLKFIENFNNMGNQSY 80
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT---SIDWREKG 141
+G NEF+D T EEF A++TG V ++S + + N++D+ S DWR++G
Sbjct: 81 TVGVNEFTDWTIEEFLATHTGLRVNVTTLSELFNETMPSRNWNISDIDIDDESKDWRDEG 140
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDK 200
AV +K QG C G+T+I+G L+ LSEQQL+DC T+ N GC GG +++
Sbjct: 141 AVIPVKVQGAC-------------GLTKISGKNLLTLSEQQLIDCDTEKNTGCDGGGIEE 187
Query: 201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQP 260
AF+YII+N G++ E +YPYQ ++G+C A I +E +P +E ALL+AV +QP
Sbjct: 188 AFKYIIKNGGVSLETEYPYQVKKGSCRANARSATQTQIRGFEMVPSHNERALLEAVRRQP 247
Query: 261 VSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
VSV ++A +F+ YK GV +CG + +H V VG+GT ++W
Sbjct: 248 VSVLIDARADSFKTYKGGVYAGLDCGTDVNHAVTFVGYGTMI---------------QSW 292
Query: 320 GESGYIRILRD----EGLCGIATEASYPV 344
GE+GY+RI RD +G+CGIA A+YP+
Sbjct: 293 GENGYMRIRRDVEWPQGMCGIAQVAAYPI 321
>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
Length = 341
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 204/343 (59%), Gaps = 22/343 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
I ++ + ++ C + + +P++ + W HG+ YK++ E+ R I+++NL
Sbjct: 9 ITRWLFWVPMVCC---LAGDQLQRDPTLDHHWDLWKKFHGKQYKEKNEEEARRLIWEKNL 65
Query: 71 EYIEKANKEGN---RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ + N E + +Y LG N D+T+EE G RP+ V Q R ST+K
Sbjct: 66 KLVMLHNLEYSLEMHSYSLGMNHMGDMTSEEV----LGQMRPL-RVPSQRHRNSTYKSNP 120
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P S+DWREKG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCS
Sbjct: 121 NQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCS 180
Query: 188 TD----NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
T+ N GC GG M +AF+YII+N G+ ++A YPY+ C K+ AAT +Y +
Sbjct: 181 TEEKYGNKGCDGGFMTRAFQYIIDNGGIDSDASYPYKAVAEKC-HYDSKSRAATCSRYME 239
Query: 244 LPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECGDNCDHGVAVVGFGTAE 301
LP GDE AL +AV K PVSV ++AS +F YK GV + C +N +HGV VVG+G
Sbjct: 240 LPSGDEEALKEAVANKGPVSVGIDASHPSFFLYKSGVYDEPSCTENVNHGVLVVGYGNL- 298
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILR-DEGLCGIATEASYP 343
DG YWL+KNSWG +G+ GYIR+ R ++ CGIA+ SYP
Sbjct: 299 --DGKDYWLVKNSWGLHFGDQGYIRMARNNKNQCGIASYGSYP 339
>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
Length = 331
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 144/338 (42%), Positives = 201/338 (59%), Gaps = 21/338 (6%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
++ L++ C++ R +P++ W +G+ Y ++ E+ R I+++NL+++
Sbjct: 4 LVWTLLVCCSAMAQLHR---DPALDHHWHLWKKTYGKQYTEKNEETERRLIWEKNLKFVM 60
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
N E G +Y LG N D+T+EE + T P RQS R T+K +
Sbjct: 61 LHNLEHSMGMHSYDLGMNHLGDMTSEEVVSLMTCLKVP-----RQSQRNVTYKSSPNQKL 115
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-- 189
P S+DWREKG VT +K QG CGSCWAFSAV A+E ++T GKL+ LS Q LVDCST+
Sbjct: 116 PDSLDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEKY 175
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC GG M +AF+YII+N G+ +EA YPY+ C + K AAT KY +LP G
Sbjct: 176 RNEGCHGGFMTEAFQYIIDNNGIDSEASYPYKAMDEKC-QYDSKNRAATCSKYTELPFGS 234
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEEDGA 306
E AL +AV +K PVSV ++AS +F Y+ GV C +HGV VVG+G +G
Sbjct: 235 EEALKEAVASKGPVSVAIDASHSSFFLYRSGVYYEPACTQVVNHGVLVVGYGNL---NGN 291
Query: 307 KYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYP 343
YWL+KNSWG +G+ GYIR+ R+ E CGIA+ +SYP
Sbjct: 292 DYWLVKNSWGLYFGDKGYIRMARNRENHCGIASYSSYP 329
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 145/335 (43%), Positives = 199/335 (59%), Gaps = 21/335 (6%)
Query: 19 LVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANK 78
++ C + ++ + + ++ E + H +TY E E MR I++++L I + N
Sbjct: 1 MLACCIAATLASPLVFDEALDEMWTLFKTTHSKTYATEAED-MRRFIWERHLNMINQHNI 59
Query: 79 E---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
E G T+ LG NE+ DLT E+ A+ +GY SV P + VP ++
Sbjct: 60 EADLGKHTFSLGMNEYGDLTQHEY-AAMSGYKMAKSSVGSSFLEPENLQ------VPKTV 112
Query: 136 DWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGC 193
DWREKG VT +KNQG CGSCWAFS+ ++EG G+L +SEQ LVDCS D N GC
Sbjct: 113 DWREKGYVTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGC 172
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALL 253
SGGLMD AF YI +N G+ +E YPY+ G C + K+ + T + D+P GDE AL
Sbjct: 173 SGGLMDNAFTYIKKNMGIDSEKSYPYEAVDGEC-RYKKSDSVTTDSGFVDIPHGDETALR 231
Query: 254 QAVTK-QPVSVCVEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAEEEDGAKYWL 310
AV PVSV ++AS +F+FYK GV A C DHGV VVG+G E+G YWL
Sbjct: 232 TAVASVGPVSVAIDASHTSFQFYKTGVYTEANCSSTQLDHGVLVVGYGV---ENGQDYWL 288
Query: 311 IKNSWGETWGESGYIRILRDEG-LCGIATEASYPV 344
+KNSWG +WGE+GYI++ R+ G CGIA++ASYP+
Sbjct: 289 VKNSWGASWGEAGYIKLARNHGNQCGIASQASYPL 323
>gi|114559418|ref|XP_001171268.1| PREDICTED: cathepsin S isoform 3 [Pan troglodytes]
gi|397492866|ref|XP_003817341.1| PREDICTED: cathepsin S isoform 1 [Pan paniscus]
gi|410225070|gb|JAA09754.1| cathepsin S [Pan troglodytes]
gi|410251608|gb|JAA13771.1| cathepsin S [Pan troglodytes]
gi|410328325|gb|JAA33109.1| cathepsin S [Pan troglodytes]
gi|410328327|gb|JAA33110.1| cathepsin S [Pan troglodytes]
Length = 331
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 207/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE + + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ C + K AAT KY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKATDQKC-QYDSKYRAATCSKYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
G E L +AV K PVSV V+A +F Y+ GV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDALHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
Length = 337
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/342 (40%), Positives = 204/342 (59%), Gaps = 16/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + +L + +S V+S S+ +P + + W + H + Y + E+ R ++++NL+
Sbjct: 1 MLPVAVLTLCLSSAVLSAPSL-DPQLDQHWNLWKSWHSKNYH-QREEGWRRLVWEKNLKK 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +Y+LG N F D+T+EEF+ GY + + + S F N
Sbjct: 59 IELHNLEHSMGKHSYRLGMNHFGDMTHEEFKQIMNGYKHK----AERKFKGSLFLEPNFL 114
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+ P S+DWREKG VT +K+QG CGSCWAFS A+EG GKL+ LS Q LV+CS
Sbjct: 115 EAPRSVDWREKGYVTPVKDQGECGSCWAFSTTGALEGQEFTRTGKLVSLSGQNLVECSRP 174
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD+AF+Y+ +N+GL +E YPY K +AA + D+P G
Sbjct: 175 EGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKFSAANDTGFVDIPSG 234
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
+E AL++AV PVSV ++A ++F+FY+ G+ EC + DHGV VG+G E+
Sbjct: 235 NERALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFQGEDV 294
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG K+W++KNSW E WG+ GYI + +D + CGIAT ASYP+
Sbjct: 295 DGKKFWIVKNSWSENWGDKGYIYMAKDRKNHCGIATAASYPL 336
>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
Length = 331
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 207/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKWLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE + + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ C + K AAT KY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC-QYDSKYRAATCSKYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
E L +AV K PVSV V+AS +F Y+ GV C N +HGV VVG+G +
Sbjct: 232 YSREDVLKEAVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
Length = 342
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 203/342 (59%), Gaps = 20/342 (5%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
I M ++ ++ C+S + + +P++ + W +G+ YK++ E+ R I+++NL
Sbjct: 10 ITMNWLVWALLLCSSAMA--QVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNL 67
Query: 71 EYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ + N E G +Y+LG N D+T+EE + + P Q R T+K
Sbjct: 68 KTVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVP-----SQWPRNVTYKSDP 122
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+P S+DWREKG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCS
Sbjct: 123 NQKLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCS 182
Query: 188 T---DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
T N GC+GG M +AF+YII+N G+ +EA YPY+ G C + K AAT +Y +L
Sbjct: 183 TAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKC-QYDVKNRAATCSRYIEL 241
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEE 302
P G E AL +AV K PVSV ++AS +F YK GV + C N +HGV VVG+G
Sbjct: 242 PFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL-- 299
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
DG YWL+KNSWG +G+ GYIR+ R+ G CGIA+ SYP
Sbjct: 300 -DGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYPSYP 340
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 139/334 (41%), Positives = 199/334 (59%), Gaps = 19/334 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
FV ++L+I S V+ E+ W ++G+TY+ E MR I+ QN +Y+
Sbjct: 9 FVAVLLLIGLVSAAVND--------AEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYV 60
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
+ N + +++L NEF+DLT EEF + Y GY + +R++ +T +P
Sbjct: 61 NEHNSM-DSSFQLEVNEFADLTAEEFSSIYNGYGK---GRNRENHENTTIYRYTGGAIPD 116
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGC 193
S+DWR KG VT +KNQ CGSCWAFS ++EG GKL+ LSEQ LVDC ++GC
Sbjct: 117 SVDWRTKGLVTPVKNQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCDKKDHGC 176
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALL 253
GGLM AF+YI ENKG+ TE YPY+ + G C+ +K+ AT+ ++ + D AL
Sbjct: 177 QGGLMTTAFKYIEENKGIDTEESYPYKAKNGRCEFKKDD-IGATVERHVSILTTDCEALK 235
Query: 254 QAVTK-QPVSVCVEASGQAFRFYKRGVLNAE-CGD-NCDHGVAVVGFGTAEEEDGAKYWL 310
+AV + P+SV ++AS +F+ YK G+ + + C DHGV VVG+G +EDG +YWL
Sbjct: 236 KAVAEIGPISVAMDASHSSFQLYKSGIYDPKICSSRKLDHGVLVVGYG---KEDGEEYWL 292
Query: 311 IKNSWGETWGESGYIRILRDEGLCGIATEASYPV 344
+KNSWG+ WG GY +I + LCGI T A YPV
Sbjct: 293 VKNSWGKNWGMEGYFKIASKKNLCGICTSACYPV 326
>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
Length = 330
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 144/339 (42%), Positives = 206/339 (60%), Gaps = 19/339 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++ ++ C+S V + + +P++ W +G+ YK++ E+A+R I+++NL++
Sbjct: 1 MKQLVCVLFVCSSAV--AQLLKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLKF 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y LG N D+T+EE + + VPS Q R T+K
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPNQ 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCS
Sbjct: 114 MLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEK 173
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GG M +AF+YII+NKG+ +EA YPY+ C + K AAT KY +LP G
Sbjct: 174 YGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKAMDQKC-QYDSKYRAATCSKYTELPYG 232
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEEDG 305
E L +AV K PV V V+AS +F Y+ GV + C N +HGV V+G+G + +G
Sbjct: 233 REDVLKEAVANKGPVCVGVDASHSSFFLYRSGVYYDPACTQNVNHGVLVIGYG---DLNG 289
Query: 306 AKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 290 EEYWLVKNSWGSNFGERGYIRMARNKGNHCGIASYPSYP 328
>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
Length = 331
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 207/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE + + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 WILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ C + K AAT KY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKC-QYDSKYRAATCSKYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
G E L +AV K PVSV V+A +F Y+ GV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 134/276 (48%), Positives = 167/276 (60%), Gaps = 22/276 (7%)
Query: 51 RTYKDELEKAMRLTIFKQNLEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRASYTGYN 107
+ Y+ E+A R IF NL +I + N E R T+ +G N+F+DLTNEE+R Y
Sbjct: 29 KQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEEYRQLYL--- 85
Query: 108 RPVPSVSRQSSRPSTFKYQNVTDVPT--SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVE 165
RP P+ R + D P S+DWR+KGAVT IKNQG CGSCW+FS +VE
Sbjct: 86 RPYPTELLGRERQEVW-----LDGPNAGSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVE 140
Query: 166 GITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQ 223
G I G L+ LSEQQLVDCS N GC+GGLMD AF+YII N GL TE DYPY
Sbjct: 141 GAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARD 200
Query: 224 GTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAE 283
G CDK KE A +I Y+D+P+ +E L AV K PVSV +EA Q+F+ Y GV +
Sbjct: 201 GVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGP 260
Query: 284 CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
CG N DHGV VVG+ + YW++KNSWG +W
Sbjct: 261 CGTNLDHGVLVVGYTS-------DYWIVKNSWGASW 289
>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
Length = 289
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 120/219 (54%), Positives = 157/219 (71%), Gaps = 8/219 (3%)
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P S+DWR++GAV +K+QG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T
Sbjct: 3 IPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSY 62
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
N GC+GGLMD AFE+II+N G+ TE DYPY+ G CD+ ++ A TI YED+P+ +E
Sbjct: 63 NQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQNRKNAKVVTIDAYEDVPENNE 122
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
AL +A+ QP+SV +EA G+AF+ Y GV + CG DHGV VG+GT E+G YW
Sbjct: 123 AALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGT---ENGKDYW 179
Query: 310 LIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+++NSWG +WGESGYI++ R+ G CGIA EASYP+
Sbjct: 180 IVRNSWGGSWGESGYIKMARNIAEATGKCGIAMEASYPI 218
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 137/324 (42%), Positives = 186/324 (57%), Gaps = 14/324 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKL 86
V +G + P + + ++G+ Y E A+R IFK N++ I N N T+ L
Sbjct: 12 VAAGHEVPPPDYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNAR-NLTFAL 70
Query: 87 GTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
G NEF+DLT EEF ASYTG +P S+ R ST +Y N + +S+DW +G VT +
Sbjct: 71 GVNEFTDLTQEEFAASYTGL-KPA-SLWSGLPRLSTHEY-NGAPLASSVDWTTQGVVTPV 127
Query: 147 KNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYII 206
KNQG CGSCW+FS A+EG ++ G L+ LSEQQ DC T ++GC+GG MD AF +
Sbjct: 128 KNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSEQQFEDCDTTDSGCNGGWMDNAFSFAK 187
Query: 207 ENKGLATEADYPYQQEQGTCDKQKEKAAAATIG--KYEDLPKGDEHALLQAVTKQPVSVC 264
+N + TE YPY GTC+ + G Y D+ E A++ AV +QPVS+
Sbjct: 188 KNS-ICTEGSYPYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIA 246
Query: 265 VEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGY 324
+EA +F+ Y GVL A CG DHGV VG+G+ E G YW +KNSWG +WGE GY
Sbjct: 247 IEADQYSFQLYSSGVLTASCGTRLDHGVLAVGYGS---EAGTDYWKVKNSWGSSWGEQGY 303
Query: 325 IRILRDEGLCG----IATEASYPV 344
+R+ R +G G +A SYPV
Sbjct: 304 VRLQRGKGGAGECGLLAGPPSYPV 327
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 139/351 (39%), Positives = 198/351 (56%), Gaps = 16/351 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ----WMAQHGRTYKDE 56
M+ K + + + + + ++ + G S + + E+ Q WM H + Y++
Sbjct: 3 MIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENV 62
Query: 57 LEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQ 116
EK R IFK NL YI++ NK+ N +Y LG NEF+DL+N+EF Y G + + +
Sbjct: 63 DEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFADLSNDEFNEKYVG---SLIDATIE 118
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLI 176
S F ++ ++P ++DWR+KGAVT +++QG CGSCWAFSAVA VEGI +I GKL+
Sbjct: 119 QSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLV 178
Query: 177 ELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAA 236
ELSEQ+LVDC ++GC GG A EY+ +N G+ + YPY+ +QGTC ++
Sbjct: 179 ELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIV 237
Query: 237 TIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVG 296
+ +E LL A+ KQPVSV VE+ G+ F+ YK G+ CG DH V V
Sbjct: 238 KTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAV- 296
Query: 297 FGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYP 343
+ G Y LIKNSWG WGE GYIRI R G+CG+ + YP
Sbjct: 297 --GYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYP 345
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 142/321 (44%), Positives = 199/321 (61%), Gaps = 16/321 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P + + W + H + Y E E++ R ++++NL+ IE N + G +YKLG N+F
Sbjct: 3 DPELDGHWQLWKSWHNKDYH-EREESWRRVVWEKNLKMIELHNLDHTLGKHSYKLGMNQF 61
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+T EEFR GY S + R S F + + P S+DWREKG VT +K+QG
Sbjct: 62 GDMTTEEFRQLMNGY---AHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQ 118
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS A+EG GKL+ LSEQ LVDCS N GC+GGLMD+AF+Y+ +N
Sbjct: 119 CGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNG 178
Query: 210 GLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEA 267
G+ +E YPY ++ C + E AA G + D+P+G E AL++AV PVSV ++A
Sbjct: 179 GIDSEESYPYTAKDDEDCRYKAEYNAANDTG-FVDIPQGHERALMKAVAAVGPVSVAIDA 237
Query: 268 SGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGY 324
+F+FY+ G+ +C ++ DHGV VVG+G E+ DG KYW++KNSWGE WG+ GY
Sbjct: 238 GHSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGY 297
Query: 325 IRILRD-EGLCGIATEASYPV 344
I + +D + CGIAT ASYP+
Sbjct: 298 IYMAKDRKNHCGIATAASYPL 318
>gi|306992173|gb|ADN19567.1| cathepsin L-like proteinase [Spodoptera frugiperda]
Length = 344
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 140/325 (43%), Positives = 194/325 (59%), Gaps = 29/325 (8%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR---TYKLGTNEFSDLTN 96
E+W A +H + Y E+E R+ I+ +N I K N+ + +YKL N+++D+ +
Sbjct: 25 EEWNAFKMEHSKQYDSEVEDKFRMKIYVENKHRIAKHNQRFEQRLVSYKLKPNKYADMLH 84
Query: 97 EEFRASYTGYNRPVPSVSRQSS--------RPSTFKYQNVTDVPTSIDWREKGAVTHIKN 148
EF + G+N+ R + R +TF P +DWR+KGAVT +K+
Sbjct: 85 HEFVHTMNGFNKTAKHGGRNKAVHSKGRDGRAATFIAPAHVSYPDHVDWRKKGAVTDVKD 144
Query: 149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYII 206
QG CGSCWAFS A+EG G L+ LSEQ LVDCS NNGC+GGLMD AF+YI
Sbjct: 145 QGKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLVDCSAAYGNNGCNGGLMDNAFKYIK 204
Query: 207 ENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCV 265
+N G+ TE YPY+ C + + A +G + D+P+GDE L+QAV T P+SV +
Sbjct: 205 DNGGIDTEKSYPYEAVDDKCRYNPKNSGADDVG-FVDIPQGDEEKLMQAVATVGPISVAI 263
Query: 266 EASGQAFRFYKRGVLNAECGDNC-----DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWG 320
+AS + F+FY +GV E NC DHGV VVG+GT EE+G YWL+KNSWG +WG
Sbjct: 264 DASQETFQFYSKGVYYDE---NCSSTDLDHGVMVVGYGT--EEEGGDYWLVKNSWGRSWG 318
Query: 321 ESGYIRILRDE-GLCGIATEASYPV 344
E GYI++ ++ CGIA+ ASYP+
Sbjct: 319 ELGYIKMAHNKNNHCGIASSASYPL 343
>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 201/343 (58%), Gaps = 17/343 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
VI++ ++ A VS +++E I E+ + AQ + Y+D E+A R ++ N I
Sbjct: 4 VIVLGLVVFAISSVSSINLNE-VIEEEWSLFKAQFKKIYEDVKEEAFRKKVYLDNKLKIA 62
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST---FKYQNV 128
+ NK G TY L N F DL E++ G+ + + + K +NV
Sbjct: 63 RHNKLYETGEETYALEMNHFGDLMQHEYKKMMNGFKPSLAGGDKNFTDDDAVTFLKSENV 122
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
VP +IDWR+KG VT +KNQG CGSCW+FSA ++EG G L+ LSEQ L+DCS
Sbjct: 123 V-VPKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181
Query: 189 D--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
NNGC GGLMD AF+YI NKGL TE YPY+ E C E + A G + D+P+
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKG-FVDIPE 240
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTAEEE 303
GDE AL+ A+ T PVS+ ++AS + F+FYK+GV N C DHGV VG+GT +
Sbjct: 241 GDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGT--DH 298
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
G YW++KNSWG+TWG+ GYI + R+ + CG+A+ ASYP+
Sbjct: 299 KGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPLV 341
>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
Length = 333
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 201/342 (58%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M +IL C + S + S+ + +W A H R Y E+ R ++++N++
Sbjct: 1 MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE+ N +EG ++ + N F D+T+EEFR G+ P + P +
Sbjct: 59 IEQHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFY------ 112
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGP 172
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C K K + A + D+PK
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIPK- 230
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
E AL++AV T P+SV V+A Q+F+FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 QEKALMKAVATVGPISVAVDAGHQSFQFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D KYWL+KNSWGE WG GYI++ +D CGIA+ ASYP
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/332 (43%), Positives = 198/332 (59%), Gaps = 35/332 (10%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ + A + RTY E+ R ++++N++YIE N+ G+ TY+LG N+F+DLT +
Sbjct: 36 MMDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQ 95
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDV---------------------PTSID 136
EFRA YT +P+ R SRP ++ + + PTS+D
Sbjct: 96 EFRAMYT-----MPA--RVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVD 148
Query: 137 WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGG 196
WR KGAVT +K+QG CG CWAF+ VA +EG+ +I G+L+ LSEQ+LVDC ++GC GG
Sbjct: 149 WRSKGAVTPVKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDADDGCGGG 208
Query: 197 LMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV 256
L + A E++ N GL TEA+YPY + G CD+ K AA I + + E L +AV
Sbjct: 209 LPEIAMEWVAHNGGLTTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERAV 268
Query: 257 TKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWG 316
+QPV+V + A + FYK GV + C DH V VVG+G + G KYW+IKNSW
Sbjct: 269 ARQPVAVAINAP-DSLMFYKSGVYSGPCTAEFDHAVTVVGYGA--DNKGHKYWIIKNSWA 325
Query: 317 ETWGESGYIRILR----DEGLCGIATEASYPV 344
ETWGE GY R+ R EGLCGIAT ASYPV
Sbjct: 326 ETWGEKGYGRMQRGVAAKEGLCGIATHASYPV 357
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 137/306 (44%), Positives = 185/306 (60%), Gaps = 17/306 (5%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
WM +H R+Y E + FK N+++I N N LG +F+DLTNEE+R Y
Sbjct: 36 WMKKHDRSYHHH-EFNNKYQAFKDNMDFIHNWNTNKNSKTVLGLTQFADLTNEEYRKIYL 94
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAV 164
G V + F + T P SIDWR KGAV+H+K+QG CGSCW+FS +V
Sbjct: 95 GTKVNV------APEKHNFNMIHFTG-PDSIDWRTKGAVSHVKDQGQCGSCWSFSTTGSV 147
Query: 165 EGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
EG QI G ++ LSEQ LVDCS NNGC GGLM AF++I+ G+ATE YPY
Sbjct: 148 EGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFIMSQGGVATEDSYPYNAV 207
Query: 223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN- 281
QG C K + A I Y+++ +G E L A+TKQPVS+ ++AS Q+F+ YK GV +
Sbjct: 208 QGKC-KFTKSMVGANISGYKEITQGSELELQAALTKQPVSIAIDASQQSFQLYKSGVYDE 266
Query: 282 AECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATE 339
EC DHGV VG+GT E+G Y+++KNSW ++WG+ GYI + R+ + CG+AT
Sbjct: 267 PECSSYQLDHGVLAVGYGT---ENGKDYYIVKNSWADSWGQDGYIFMSRNAKNQCGVATM 323
Query: 340 ASYPVA 345
ASYP++
Sbjct: 324 ASYPIS 329
>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
Length = 337
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/346 (41%), Positives = 208/346 (60%), Gaps = 21/346 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++P+ V+ + C S +S S+ +P + + + W + H + Y E E+ R ++++N
Sbjct: 1 MLPLAVLAV----CLSAALSAPSL-DPQLDDHWDLWKSWHSKKYH-EKEEGWRRMVWEKN 54
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
L+ IE N E G Y+LG N F D+T+EEFR GY + + + + S F
Sbjct: 55 LKKIELHNLEHSMGKHPYRLGMNHFGDMTHEEFRQIMNGYKQ---RKTERKFKGSLFMEP 111
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
N + P ++DWR+KG VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDC
Sbjct: 112 NFLEAPRALDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDC 171
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYED 243
S N GC+GGLMD+AF+Y+ +N+GL +E YPY + C +A G + D
Sbjct: 172 SRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPNYNSANDTG-FVD 230
Query: 244 LPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-T 299
+P G E AL++AV PVSV ++A ++F+FY+ G+ +C + DHGV VVG+G
Sbjct: 231 VPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDHGVLVVGYGYE 290
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
E+ DG KYW++KNSW E WG+ GYI + +D + CGIAT ASYP+
Sbjct: 291 GEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 336
>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
Length = 336
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 138/341 (40%), Positives = 203/341 (59%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ +LV S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLYISAVFAAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR + GY Q+S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHD----PNQTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I + D+P G+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGN 234
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVAVVGFG-TAEEED 304
E AL+ AV PVSV ++AS Q+ +FY+ G+ A DH V VVG+G +
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
G +YW++KNSW + WG+ GYI + +D+ CG+AT+ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 207/343 (60%), Gaps = 20/343 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + ++ C + V + + +P++ + W H ++Y + E+ R ++++NL
Sbjct: 1 MALYLVAAALCLTTVFAAPTT-DPALDDHWHLWKNWHKKSYLPK-EEGWRRVLWEKNLRT 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N + G +Y+LG N+F D+TNEEFR GY +++ + STF N
Sbjct: 59 IEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQLMNGYK------NQKMIKGSTFLAPNNF 112
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-- 187
+ P ++DWREKG VT +K+QG CGSCWAFS A+EG GKLI LSEQ LVDCS
Sbjct: 113 EAPKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLISLSEQNLVDCSRA 172
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD+AF+Y+ +N G+ +E YPY ++ C +A G + D+P
Sbjct: 173 QGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTG-FVDVPS 231
Query: 247 GDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEE 302
G E L++AV PVSV V+A ++F+FY+ G+ + EC ++ DHGV VVG+G E+
Sbjct: 232 GSEKDLMKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPECSSEDLDHGVLVVGYGFEGED 291
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG +YW++KNSW E WG +GYI+I +D CGIAT ASYP+
Sbjct: 292 VDGKRYWIVKNSWSEKWGNNGYIKIAKDRHNHCGIATAASYPL 334
>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 326
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 151/343 (44%), Positives = 204/343 (59%), Gaps = 27/343 (7%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
+ MF+ + LV A+ S+ + E W +G+ Y + E+A+R I+ NL
Sbjct: 1 MKMFISLALVAMAAA----------TSVNTEWESWKRTYGKEYTQK-EEALRHMIWNVNL 49
Query: 71 EYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ I+ N++ G TY N+F DLTNEE+R GY + +V S+PSTF +
Sbjct: 50 KMIQMHNEKYMSGKSTYTQNMNQFGDLTNEEYRELMCGYKKSNKTVI---SKPSTFLLPS 106
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
P SIDWR +G VT +K+QG CGSCWAFS+ ++EG T GKL+ LSEQQLVDCS
Sbjct: 107 NYRAPASIDWRTQGYVTDVKDQGACGSCWAFSSTGSLEGQTFKKTGKLVPLSEQQLVDCS 166
Query: 188 TD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
D N GC GG MD+AF Y I++KG +E YPY TC K A G Y D+P
Sbjct: 167 GDYGNMGCGGGWMDQAFSY-IKDKGEESEDGYPYTGTDDTCVYDASKVVATDTG-YTDIP 224
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECGD-NCDHGVAVVGFGTAEE 302
+ DE+AL QAV T P+SV ++A+ +F+FY+ GV + EC N DH V VG+GT+EE
Sbjct: 225 EMDENALQQAVATVGPISVAIDATHSSFQFYESGVYDEPECSQTNLDHAVLAVGYGTSEE 284
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
G YW++KNSW WG GYI + R+ + CGIA++ASYPV
Sbjct: 285 --GLDYWIVKNSWSTGWGMQGYIEMSRNKDNQCGIASKASYPV 325
>gi|109112057|ref|XP_001086247.1| PREDICTED: cathepsin L1-like isoform 5 [Macaca mulatta]
gi|402897797|ref|XP_003911929.1| PREDICTED: cathepsin L1 [Papio anubis]
Length = 333
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 203/343 (59%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
P F++ + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C E + A G + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTG-FVDIPK 230
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEE 302
E AL++AV T P+SV ++A ++F FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D +KYWL+KNSWGE WG GYI++ +D CGIA+ ASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|380790141|gb|AFE66946.1| cathepsin L1 preproprotein [Macaca mulatta]
gi|384939708|gb|AFI33459.1| cathepsin L1 preproprotein [Macaca mulatta]
Length = 333
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 203/343 (59%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
P F++ + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQLMNGFQ------NRKPRKGKVFQEPLF 111
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C E + A G + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTG-FVDIPK 230
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEE 302
E AL++AV T P+SV ++A ++F FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D +KYWL+KNSWGE WG GYI++ +D CGIA+ ASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 138/312 (44%), Positives = 193/312 (61%), Gaps = 20/312 (6%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEF 99
+Q+ A++G+ Y+ E + R ++++QN E+I N++ G ++ L N+F D+T EE
Sbjct: 23 QQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTTEEI 82
Query: 100 RASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKNQGHCGSCWAF 158
A+ G+ +S P YQ + D +P ++DWR+KGAVT +K+Q CGSCWAF
Sbjct: 83 NAAMNGF------LSAGKKVPRGTMYQPLVDELPDTVDWRDKGAVTPVKDQKACGSCWAF 136
Query: 159 SAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEAD 216
SA ++EG ++ GKL+ LSEQ LVDCS N GC GGLMD AF YI +N G+ TE
Sbjct: 137 SATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDTEES 196
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVT-KQPVSVCVEASGQAFRFY 275
YPY+ + G C + AT+ Y D+ G E L +AV K PVSV ++AS F FY
Sbjct: 197 YPYEAKNGPC-RFNSDNVGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTFHFY 255
Query: 276 KRGVLNAE-CGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-G 332
RG+ E C + DHGV VG+GT +D + YWL+KNSW ETWG+SGYI++ R+
Sbjct: 256 SRGIYYDEKCSSSFLDHGVLAVGYGT---DDSSDYWLVKNSWNETWGDSGYIKMSRNRNN 312
Query: 333 LCGIATEASYPV 344
CGIA++ASYPV
Sbjct: 313 NCGIASQASYPV 324
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 137/325 (42%), Positives = 193/325 (59%), Gaps = 20/325 (6%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN--KEGNRTYKLGTNE 90
+ E I E + W +H + YK E R+ FK+NL+YI + N ++ +K+G N+
Sbjct: 41 LTEEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNK 100
Query: 91 FSDLTNEEFRASY-TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQ 149
F+DL+NEEFR Y + +P+ ++ R + D P+S+DWR KG VT +K+Q
Sbjct: 101 FADLSNEEFREMYLSKVKKPITIEEKRKHR-----HLQTCDAPSSLDWRNKGVVTAVKDQ 155
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN-GCSGGLMDKAFEYIIEN 208
G CGSCW+FS A+E I I G LI LSEQ+LVDC T NN GC GG MD AF+++I N
Sbjct: 156 GDCGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGN 215
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
G+ TEADYPY GTC+ KE+ +I Y D+ D ALL A +QP+SV ++ S
Sbjct: 216 GGIDTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSDS-ALLCATVQQPISVGMDGS 274
Query: 269 GQAFRFYKRGVLNAEC-GD--NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
F+ Y G+ + +C GD + DH + +VG+G+ +ED YW++KNSWG WG GY
Sbjct: 275 ALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYGSENDED---YWIVKNSWGTEWGMEGYF 331
Query: 326 RILRDE----GLCGIATEASYPVAM 346
I R+ G+C I +ASYP +
Sbjct: 332 YIRRNTSKPYGVCAINADASYPTKV 356
>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
Length = 333
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 208/341 (60%), Gaps = 26/341 (7%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
+ L I C + S H+ S+ E+ QW A+HG+ Y E+++R ++++NL+ IE+
Sbjct: 5 LFLTILCLG-IASAAPTHDQSLDEQWNQWTAEHGKVYSTG-EESLRRAVWEKNLKMIEQH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N E G T+ +G N F D+TNE+FR TG+ +++ ++ F+ +VP
Sbjct: 63 NLEYSQGKHTFTMGMNAFGDMTNEDFRQMMTGFQ------NQKYNKGEVFQPPQPLEVPE 116
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNN 191
S+DWREKG VT +KNQ CGSCWAFSA A+EG GKL+ LSEQ LVDCS N+
Sbjct: 117 SVDWREKGYVTPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQPQHNS 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGL+ KAF+Y+ +N GL +E YPY++ + TC + +AAT+ ++ +P +E A
Sbjct: 177 GCKGGLVIKAFQYVKDNGGLDSEESYPYEEMESTC-RYSPGNSAATVTGFKHIP-AEEKA 234
Query: 252 LLQAVTK-QPVSVCVEASGQAFRFYKRGVLNAECGDNC-----DHGVAVVGFGTAEE-ED 304
L +AV P+SV ++A +F+FY G+L+ NC +H V VVG+G +E +
Sbjct: 235 LEKAVASVGPISVAIDAHHHSFQFYTGGILHE---PNCSPKWLNHAVLVVGYGVMQEGSN 291
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
YWL+KNSWGE WG GYI + +D+ CGIA++A YP+
Sbjct: 292 NNTYWLVKNSWGERWGVGGYIMMAKDKNNHCGIASDALYPI 332
>gi|355753449|gb|EHH57495.1| Cathepsin L1 [Macaca fascicularis]
Length = 333
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 203/343 (59%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
P F++ + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQELLF 111
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSW 171
Query: 188 -TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C E + A G + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTG-FVDIPK 230
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEE 302
E AL++AV T P+SV ++A ++F FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D +KYWL+KNSWGE WG GYI++ +D CGIA+ ASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 144/345 (41%), Positives = 203/345 (58%), Gaps = 20/345 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
++P+ ++ + V S V+S S+ + + + E W H + Y E E+ R I+++N
Sbjct: 1 MLPLALLALGV----SAVLSAPSL-DARLSDHWELWKNWHSKKYH-EKEEGWRRMIWEKN 54
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
L IE N E G +Y+LG N F D+T+EEFR GY R + + + S F
Sbjct: 55 LNKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYQRK----TERKAIGSLFMEP 110
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
N P+++DWREKG VT +K+QG CGSCWAFS A+ZG GKL+ LSEQ LVDC
Sbjct: 111 NFMVAPSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDC 170
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S N GC GGLMD+AF+Y+ +N+GL +E YPY K + + D+
Sbjct: 171 SRPEGNEGCGGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSVNDTGFVDI 230
Query: 245 PKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TA 300
P G EHAL++AV PVSV ++A ++F+FY+ G+ EC + DHGV VG+G
Sbjct: 231 PSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEG 290
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
E+ DG KYW++KNSW E WG+ GYI + +D + CGIAT ASYP+
Sbjct: 291 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
Length = 430
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 140/337 (41%), Positives = 196/337 (58%), Gaps = 31/337 (9%)
Query: 37 SIVEKHEQWMAQHG--RTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
++ E+W ++HG R +D E A RL F +N Y+ + N G ++ +G N
Sbjct: 93 ALARHFERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGEVSHWVGLNSL 152
Query: 92 SDLTNEEFRASYTGYNRPV-----------PSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
+ T EE+RA GY + S + ++++Y +V D P +IDW E
Sbjct: 153 AATTREEYRA-LLGYKPELRSSGDAEMLEATSTDKVEQYKASWEYASV-DPPEAIDWVEL 210
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDK 200
GAVT KNQG CGSCWAFS AVEGIT+I G+L+ LSEQ++V CS N GC+GGLMD
Sbjct: 211 GAVTPPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQNMGCNGGLMDY 270
Query: 201 AFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQP 260
AF +I++N G+ +E YPY E C++ K + ATI ++D+P GDE L +AV++QP
Sbjct: 271 AFRWIVKNGGIDSEFQYPYSAEALACNRWKLQLHVATIDGFKDVPPGDEKELEKAVSQQP 330
Query: 261 VSVCVEASGQAFRFYKRGVLNA-ECGDNCDHGVAVVGFG--------TAEEEDGAKYWLI 311
VS+ +EA ++F+ Y GV ++ ECG DHGV VVG+G T + +W +
Sbjct: 331 VSIAIEADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHHNATKHHKRHRHFWKV 390
Query: 312 KNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
KNSWG TWGE G+IR+ R + G CGI T SYP
Sbjct: 391 KNSWGGTWGEGGFIRMARRISDETGQCGITTAPSYPT 427
>gi|355567871|gb|EHH24212.1| Cathepsin L1 [Macaca mulatta]
Length = 333
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 203/343 (59%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
P F++ + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C E + A G + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEEAYPYEATEESCKYNPEYSVANDTG-FVDIPK 230
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEE 302
E AL++AV T P+SV ++A ++F FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D +KYWL+KNSWGE WG GYI++ +D CGIA+ ASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 144/342 (42%), Positives = 207/342 (60%), Gaps = 19/342 (5%)
Query: 15 VIIILVIT-CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
++ +LV+T C S V+S + + + E + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPLLVLTACLSSVLSAPVL-DAQLNEHWDLWKSWHSKKYH-EKEEGWRRMVWEKNLQKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +++LG N F D+T+EEFR GY +++ S F N
Sbjct: 59 ELHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLK----TQRKFTGSLFMEPNFMT 114
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P+++DWREKG VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 115 APSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPE 174
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC GGLMD+AF+Y+ +N+GL +E YPY + C +A G + D+P G
Sbjct: 175 GNEGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPCHYDPLYNSANDTG-FVDVPSG 233
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
EHAL++AV PVSV ++A ++F+FY+ G+ EC + DHGV VG+G E++
Sbjct: 234 KEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDK 293
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
G K+W++KNSWGE WG+ GYI + +D + CGIAT ASYP+
Sbjct: 294 MGKKFWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 147/346 (42%), Positives = 204/346 (58%), Gaps = 37/346 (10%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F+I++L +T A+ ++ + E + HG+ YK E+ +R IF+ N + I
Sbjct: 3 FLILVLSVTMAT-----------AMDVEWEAFKLTHGKQYKSPDEENVRRAIFRDNNQMI 51
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTG-----YNRPVPSVSRQSSRPSTFKY 125
++ N+E G R+Y +G N+F DL + E+ G N PS + S P
Sbjct: 52 KEHNQEAAMGRRSYFMGMNQFGDLAHSEYLELVVGPGLLPLNLSTPSENVFESTPGL--- 108
Query: 126 QNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
V ++DWR+KGAVT IK+QGHCGSCWAFS ++EG + GKL+ LSEQ L+D
Sbjct: 109 ----QVDDTVDWRQKGAVTPIKDQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLD 164
Query: 186 CST--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYE 242
CS N GC GGLMD+AF YI N G+ TE YPY +++ CD K + AT+ Y
Sbjct: 165 CSRRFGNKGCEGGLMDQAFRYIKSNGGIDTEECYPYMAKDEKVCD-YKTSCSGATLSSYT 223
Query: 243 DLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECG-DNCDHGVAVVGFGT 299
D+ DE AL+QAV T PVSV ++AS ++ RFYK G+ + EC DHGV VG+G+
Sbjct: 224 DIKAMDEMALMQAVGTVGPVSVAIDASHKSLRFYKSGIYDEPECSRTKLDHGVLAVGYGS 283
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
DG YWL+KNSWG WG+ GY+++ R++ CGIAT+ASYPV
Sbjct: 284 M---DGMDYWLVKNSWGSAWGDMGYVKMTRNKNNQCGIATKASYPV 326
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 145/335 (43%), Positives = 196/335 (58%), Gaps = 20/335 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
+++ V+ +S+ S R + V W + HG++Y D E+ R+ I++QNLE I++
Sbjct: 5 LVLCVLVASSRGWSVRFGQDSEWVA----WKSYHGKSYSDVHEERTRMAIWQQNLEKIKR 60
Query: 76 ANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
N E + +YK+ N DLT +EFR Y G S R +T+ + +P+S+
Sbjct: 61 HNAE-DHSYKMAMNHLGDLTEDEFRYFYLGVRAHHNSTKRG---WATYMPPSNVKIPSSV 116
Query: 136 DWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGC 193
DW +KG VT +KNQG CGSCWAFS +VEG G L+ LSEQ L+DCS NNGC
Sbjct: 117 DWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSYGNNGC 176
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALL 253
GGLMD AF YI N G+ TE+ YPY +QG+C A G Y+D+P+G E AL
Sbjct: 177 QGGLMDNAFRYIESNGGIDTESSYPYLGQQGSCHFSSSHVGARVTG-YQDIPQGSEQALQ 235
Query: 254 QAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTAEEEDGAKYWL 310
AV T PVSV V+AS ++FY GV N C DHGV V+G+G +D YWL
Sbjct: 236 SAVATVGPVSVAVDAS--QWQFYSSGVYDNPYCSSTQLDHGVLVIGYGNYNGQD---YWL 290
Query: 311 IKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
+KNSWG +WG GYI + R++ CGIA+ ASYP+
Sbjct: 291 VKNSWGYSWGVEGYIMMSRNKNNQCGIASSASYPL 325
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 148/343 (43%), Positives = 197/343 (57%), Gaps = 23/343 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQ---HGRTYKDELEKAMRLTIFK 67
+ + V L+ AS V E +QW A H + Y E+ R I++
Sbjct: 1 MKLLVAACLLFAVASGFV-------VKFDEDEQQWQAWKLFHTKKYTTVTEEGARKAIWR 53
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL+ I+K N EG+ ++ L N DLT +EFR YTG + +++ + S F +
Sbjct: 54 DNLKKIQKHNAEGH-SFTLAMNHLGDLTQDEFRYFYTGMRSHYSNYTKK--QGSAFLAPS 110
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
VP ++DWR++G VT +KNQG CGSCWAFS ++EG GKL+ LSEQ LVDCS
Sbjct: 111 HVQVPDTVDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCS 170
Query: 188 T--DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
T NNGC GGLMD AF+YI EN G+ TE YPY+ C QK A G + D+
Sbjct: 171 TAYGNNGCQGGLMDYAFKYIKENGGIDTEESYPYEARNDRCRFQKSNIGAVDTG-FVDVT 229
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGD-NCDHGVAVVGFGTAEE 302
GDE AL A T P+SV ++A +F+FY GV NA C + DHGV VVG+GT +
Sbjct: 230 HGDEEALKTAAGTVGPISVAIDAGHMSFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQ- 288
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
G+ YWL+KNSWGE WG GYI + R++ CG+AT+ASYP+
Sbjct: 289 --GSDYWLVKNSWGERWGMEGYIMMSRNKNNQCGVATQASYPL 329
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 201/340 (59%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++ L+ C+ V + +P++ W + + YK+E E+ R I+++NL++
Sbjct: 1 MKWLVGLLPLCSYAVA--QVHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKF 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y LG N D+T EE S G R VPS Q R T++ +
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTGEEV-ISLMGSLR-VPS---QWQRNVTYRSNSNQ 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST+
Sbjct: 114 KLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 190 ---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GG M AF+YII+N G+ +EA YPY+ G C + K AAT KY +LP
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKC-RYDSKKRAATCSKYTELPF 232
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEED 304
G E AL +AV K PVSV ++AS +F Y+ GV C N +HGV VVG+G +
Sbjct: 233 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL---N 289
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
G YWL+KNSWG +G+ GYIR+ R+ G CGIA+ SYP
Sbjct: 290 GKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 329
>gi|293342577|ref|XP_001065834.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354413|ref|XP_573976.3| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039745|gb|EDL93861.1| rCG24317, isoform CRA_a [Rattus norvegicus]
Length = 330
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 199/338 (58%), Gaps = 20/338 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
I L+ T ++S H+PS E+W +HG+TY E+ + +++ N++ I
Sbjct: 4 IFLLATLCLGMISAAPTHDPSFDTVWEEWKTKHGKTYNTN-EEGQKRAVWENNMKMINLH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N++ G + L N F DLTN EFR TG+ +++ F + DVP
Sbjct: 63 NEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQ------GQKTKMMKVFPEPFLGDVPK 116
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NN 191
++DWR+ G VT +KNQG CGSCWAFSAV ++EG GKL+ LSEQ LVDCS N
Sbjct: 117 TVDWRKHGYVTPVKNQGPCGSCWAFSAVGSLEGQVFRKTGKLVPLSEQNLVDCSWSHGNK 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGL D AF+Y+ +N GL T YPY+ GTC + +AA +G + +P E+A
Sbjct: 177 GCDGGLPDFAFQYVKDNGGLDTSVSYPYEALNGTCRYNPKYSAAKVVG-FMSIPP-SENA 234
Query: 252 LLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFGTAEEEDGAKY 308
L++AV T P+SV ++ ++F+FYK G+ +C N +H V VVG+G EE DG KY
Sbjct: 235 LMKAVATVGPISVGIDIKHKSFQFYKGGMYYEPDCSSTNLNHAVLVVGYG--EESDGRKY 292
Query: 309 WLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
WL+KNSWG WG GYI++ +D CGIA++ASYP+
Sbjct: 293 WLVKNSWGRDWGMDGYIKMAKDWNNNCGIASDASYPIV 330
>gi|403302730|ref|XP_003942006.1| PREDICTED: cathepsin S isoform 1 [Saimiri boliviensis boliviensis]
Length = 339
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 204/342 (59%), Gaps = 21/342 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQN 69
I M ++ ++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++N
Sbjct: 8 ITMKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKN 64
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
L+++ N E G +Y LG N D+T+EE + + P Q R T+K
Sbjct: 65 LKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP-----NQWQRNITYKSN 119
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDC
Sbjct: 120 PNQMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDC 179
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S N GC+GG M +AF+YII+NKG+ +EA YPY+ C + K AAT KY +L
Sbjct: 180 SEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKC-QYDSKYRAATCSKYTEL 238
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEE 302
P G E L +AV K PV V V+AS +F Y+ GV + C +HGV V+G+G +
Sbjct: 239 PYGREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYG---D 295
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 296 LNGKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 337
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 142/333 (42%), Positives = 204/333 (61%), Gaps = 22/333 (6%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GN 81
S V+S + +P + E W + H + Y E E+ R ++++NL+ IE N + G
Sbjct: 14 SSVLSAPHL-DPQLDEHWNLWKSWHTKKYH-EKEEGWRRMVWEKNLKKIELHNLDHSMGK 71
Query: 82 RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
TY+LG N F D+TNEEFR GY + + + S F N + P S+DWR+KG
Sbjct: 72 HTYRLGMNHFGDMTNEEFRQLMNGYKHK----AERKVKGSLFLEPNFLEAPRSLDWRDKG 127
Query: 142 AVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMD 199
VT +K+QG CGSCWAFSA A+EG GK+++LSEQ LV+CS N GC+GGLMD
Sbjct: 128 YVTPVKDQGQCGSCWAFSATGALEGQQFRKTGKMVQLSEQNLVECSRPEGNEGCNGGLMD 187
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDKQ---KEKAAAATIGKYEDLPKGDEHALLQAV 256
+AF+Y+ +N+GL +E YPY GT D++ + A + D+ G EHAL++AV
Sbjct: 188 QAFQYVKDNQGLDSEESYPY---LGTDDQKCHYDPRYNAVNDTGFVDIKSGSEHALMKAV 244
Query: 257 TK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEEDGAKYWLIK 312
T P+SV ++A ++F+FY+ G+ EC + DHGV +VG+G E+ DG KYW++K
Sbjct: 245 TAVGPISVAIDAGHESFQFYQSGIYYEPECSSEELDHGVLLVGYGFEGEDVDGKKYWIVK 304
Query: 313 NSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
NSW E WG+ GY+ + +D + CGIAT ASYP+
Sbjct: 305 NSWSEKWGDKGYVYMAKDRQNHCGIATAASYPL 337
>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 336
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 139/341 (40%), Positives = 205/341 (60%), Gaps = 19/341 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F++ I ++ CA+ + P + + +W H ++Y +++ + R ++++N++ I
Sbjct: 6 FLVAIGLVACATAAFVKPT--NPDLDSRWLEWKIAHTKSYTNDMHELERRLVWEENVKMI 63
Query: 74 EKANKEGN---RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
N + + + ++LG NE+ D+ E R++ GY S + + STF +
Sbjct: 64 NMHNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNGYK----SSNVTKVQGSTFLTPSNIQ 119
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
VP ++DWR KG VT +KNQG CGSCWAFS ++EG T KL+ LSEQ LVDCS
Sbjct: 120 VPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTE 179
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC GGLMD+ F+Y+I+N G+ +E YPY E TC K +A + + D+ GD
Sbjct: 180 GNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETC-HYKASCDSAEVTGFTDVTSGD 238
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAEEEDG 305
E AL++AV PVSV ++AS Q+F+ Y+ GV + EC + DHGV VVG+GT + G
Sbjct: 239 EQALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSELDHGVLVVGYGT---DGG 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPVA 345
YWL+KNSWGETWG SGYI++ R++ CGIAT ASYP+
Sbjct: 296 KDYWLVKNSWGETWGLSGYIKMSRNKSNQCGIATSASYPLV 336
>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 198/343 (57%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
P F++ + AS + + ++ + QW A H R Y E+ R ++++N+
Sbjct: 3 PSFLLAAVCWGIASAIPK----FDQNLDTQWYQWKATHKRLYGLN-EEGWRRAVWEKNMR 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N E G + +G N + D+TNEEFR G+ + P +Y
Sbjct: 58 MIELHNGEYSQGKHGFTMGMNAYGDMTNEEFRQVMNGFQNQKHKKGKMFRDPLLLQY--- 114
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS- 187
P S+DWREKG VT +KNQG CGSCWAFSA A+EG GKLI LSEQ LVDCS
Sbjct: 115 ---PKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNLVDCSH 171
Query: 188 -TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY+ GTC + E + A G + D+P
Sbjct: 172 PQGNQGCNGGLMDYAFQYVKDNSGLDSEESYPYEGMDGTCKYKPECSVANDTG-FVDIP- 229
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFG-TAEE 302
G E ALL+AV T P+S ++A +F+FYK G+ + +C + DHG+ VVG+G
Sbjct: 230 GHEKALLRAVATVGPISAAIDAGHMSFQFYKSGIYYDPDCSSKDLDHGILVVGYGFEGTN 289
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
+ KYWL+KNSWG TWG+ GY++I+RD + CGIAT ASYP
Sbjct: 290 SNATKYWLVKNSWGTTWGDEGYVKIIRDKDNHCGIATAASYPT 332
>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
Length = 331
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 205/340 (60%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + +++ C++ V + +P++ + W + + Y++++E+ R I+++NL++
Sbjct: 1 MKWLACVLLGCSAAV--AQLQRDPTLDRHWDLWKKTYSKHYREKIEEVARRLIWEKNLKF 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y LG N D+T+EE S G + VPS Q R T+K
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTSEEV-ISLMG-SLTVPS---QWQRNVTYKSNPNQ 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P S+DWR+KG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCST+
Sbjct: 114 KLPDSLDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 190 ---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GG M AF+YII+N G+ +EA YPY+ + G C + K AAT KY +LP
Sbjct: 174 KYSNKGCNGGFMTSAFQYIIDNNGIDSEASYPYKAQDGKC-QYDSKFRAATCSKYTELPF 232
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEED 304
G E AL +AV K PVSV ++AS +F Y+ GV + C +HGV VVG+G D
Sbjct: 233 GSEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDQSCTLKVNHGVLVVGYGNL---D 289
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
G YWL+KNSWG +G+ GYIR+ R+ G CGIA+ SYP
Sbjct: 290 GKDYWLVKNSWGLNFGDKGYIRMARNSGNHCGIASYPSYP 329
>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
Length = 336
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 141/342 (41%), Positives = 204/342 (59%), Gaps = 18/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
MF +II + C S V + S+ + + + W +QHG++Y +++E R+ I+++NL
Sbjct: 2 MFALIITL--CISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE+ N E GN T+K+G N+F D+TNEEFR + GY +R S P F +
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGP-LFMEPSFF 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD AF+Y+ ENKGL +E YPY + + A + D+P G
Sbjct: 174 QGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKSTGFVDIPSG 233
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVAVVGFG-TAEEE 303
+E AL+ AV PVSV ++AS Q+ +FY+ G+ A DH V VVG+G +
Sbjct: 234 NEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADV 293
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
G +YW++KNSW + WG+ GYI + +D+ CG+AT+ASYP+
Sbjct: 294 AGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|432910512|ref|XP_004078392.1| PREDICTED: cathepsin K-like [Oryzias latipes]
Length = 331
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 203/340 (59%), Gaps = 26/340 (7%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
V+++L + SQ M E ++ E+W H + Y E+ +R I+++NL IE
Sbjct: 7 VLLLLSASVMSQ------MDETTLDAHWEEWKMTHTKEYITVEEEGIRRAIWEKNLRMIE 60
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPV---PSVSRQSSRPSTFKYQNV 128
N+E G TY LG N+F D+T EE TG P+ P V ++ ++
Sbjct: 61 AHNQEAALGMHTYTLGMNQFGDMTQEEVVERMTGLQMPLNPEPRVPMETD-------GSL 113
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+D+R+KG VT +KNQG CGSCWAFS+V A+EG G L++LS Q LVDC T
Sbjct: 114 IKLPKSVDYRKKGMVTSVKNQGSCGSCWAFSSVGALEGQLAKKTGNLVDLSPQNLVDCVT 173
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
+N+GC GG M AF+Y+ EN G+ +EA YPY E C + AA I Y+++P+GD
Sbjct: 174 ENDGCGGGYMTNAFKYVQENGGIDSEAAYPYMGEDQPC-RYNVSGLAAQIKGYKEVPEGD 232
Query: 249 EHALLQAVTKQ-PVSVCVEASGQAFRFYKRGV-LNAECG-DNCDHGVAVVGFGTAEEEDG 305
EHAL A+ K PVSV ++AS +F +Y++G+ + C ++ +H V VG+G + G
Sbjct: 233 EHALAVALFKAGPVSVGIDASQNSFLYYQKGIYFDRNCNKEDINHAVLAVGYGVNAK--G 290
Query: 306 AKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYPV 344
K+W++KNSWGETWG GY+ + R+ G +CGIA ASYPV
Sbjct: 291 KKFWIVKNSWGETWGNKGYVLMARNRGNVCGIANLASYPV 330
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 203/340 (59%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF + G++ +R++ + NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSTFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P +DWR+KGAVT +K+QG CGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLM+ AF+YI EN G+ TE YPY+ G C +KE A G Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAVDGECRFKKEDVGATDTG-YVEIKAGS 234
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDG 305
E L +AV T P+SV ++AS +F+ Y GV + EC ++ DHGV VVG+G + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA++ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 201/340 (59%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++ L+ C+ V + +P++ W + + YK+E E+ R I+++NL++
Sbjct: 9 MKWLVGLLPLCSYAVA--QVHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKF 66
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y LG N D+T EE S G R VPS Q R T++ +
Sbjct: 67 VMLHNLEHSMGMHSYDLGMNHLGDMTGEEV-ISLMGSLR-VPS---QWQRNVTYRSNSNQ 121
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST+
Sbjct: 122 KLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 181
Query: 190 ---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GG M AF+YII+N G+ +EA YPY+ G C + K AAT KY +LP
Sbjct: 182 KYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKC-RYDSKKRAATCSKYTELPF 240
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEED 304
G E AL +AV K PVSV ++AS +F Y+ GV C N +HGV VVG+G +
Sbjct: 241 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL---N 297
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
G YWL+KNSWG +G+ GYIR+ R+ G CGIA+ SYP
Sbjct: 298 GKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 337
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 142/300 (47%), Positives = 183/300 (61%), Gaps = 16/300 (5%)
Query: 58 EKAMRLTIFKQNLEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRASYTGYNRPVPSVS 114
E +F++NL+ I K N+E N+ +Y++G N F+ LT EEF A Y GY
Sbjct: 47 ESTRAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGYG-GAEVEQ 105
Query: 115 RQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGK 174
++ R + ++ +++P S+DWREKGAV +KNQG CGSCWAFSAVAA+EG + G+
Sbjct: 106 PKTRRAGKHERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGE 165
Query: 175 LIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLA--TEADYPYQQEQGTCDKQK 230
LI LSEQQLVDCS N+GC+GG MD AFEY + N G +E DYPY+ G C K
Sbjct: 166 LISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKC-KFS 224
Query: 231 EKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGVLN---AECGD 286
ATI Y D+ +G+E LL AV PVSV + A G A +FY RGV N C
Sbjct: 225 ADGVRATISGYNDVKQGNETDLLDAVANVGPVSVAIHA-GAALQFYLRGVFNGVAGTCFG 283
Query: 287 NCDHGVAVVGFGTAEEEDGAK--YWLIKNSWGETWGESGYIRILRDEGLCGIATEASYPV 344
+HGV VG+GTA G K YW+IKNSWG WGE G++R R + LCG+A ASYP+
Sbjct: 284 PLNHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARGKNLCGVANGASYPL 343
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 192/318 (60%), Gaps = 12/318 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI-EKANKEGNRTYKLGTNEFSD 93
+ SI+E +QW +H + YK E R FK+NL+YI EK KE +++G N+F+D
Sbjct: 36 DESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFAD 95
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG 153
L+NEEF+ Y + + +R + + + D P+S+DWR+KG VT +K+QG CG
Sbjct: 96 LSNEEFKQLYLSKVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDCG 155
Query: 154 SCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLAT 213
SCW+FS A+EGI I LI LSEQ+LVDC T N GC GG MD AFE++I N G+ T
Sbjct: 156 SCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVINNGGIDT 215
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFR 273
EA+YPY GTC+ KE+ +I Y+D+ + D ALL A +QP+SV ++ S F+
Sbjct: 216 EANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETDS-ALLCAAAQQPISVGIDGSAIDFQ 274
Query: 274 FYKRGVL---NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
Y G+ ++ D+ DH V +VG+G+ E+G YW++KNSWG +WG GY I R+
Sbjct: 275 LYTGGIYDGDCSDDPDDIDHAVLIVGYGS---ENGEDYWIVKNSWGTSWGIEGYFYIKRN 331
Query: 331 E----GLCGIATEASYPV 344
G+C I ASYP
Sbjct: 332 TDLPYGVCAINAMASYPT 349
>gi|334332716|ref|XP_001367365.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 335
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 138/309 (44%), Positives = 201/309 (65%), Gaps = 15/309 (4%)
Query: 44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
QW AQHG++Y+ E ++R I+++NL+ IE+ N+E G ++++LG N+F D+T EEF+
Sbjct: 31 QWKAQHGKSYEAN-EDSLRRAIWEKNLKMIERHNQEYRAGKQSFQLGMNKFGDMTTEEFQ 89
Query: 101 ASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
+ YN S S++ ++ + + +P S+DWRE+G VT +KNQG C SCWAFSA
Sbjct: 90 EAINFYNS---SASQRRTKRYLHREPLLAQLPESVDWREEGYVTPVKNQGQCLSCWAFSA 146
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTDN--NGCSGGLMDKAFEYIIENKGLATEADYP 218
V A+EG G+L+ LS Q LVDC+T + + C GG MD+AF+Y+ +N G+ TE YP
Sbjct: 147 VGAIEGQWFRKTGELVSLSIQNLVDCTTSDSISSCHGGFMDRAFQYVQDNGGIDTEECYP 206
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKR 277
Y E C Q E + A +G + D+P DE AL++AV T P+SV ++ +F+FY+
Sbjct: 207 YVGEVNECKYQPECSGANVVG-FVDIPSMDERALMEAVATVGPISVAIDGGNPSFKFYES 265
Query: 278 GV-LNAECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLC 334
GV + +C + +H VVG+G+ E DG KYW++KNSWGE WG +GYI + +DE C
Sbjct: 266 GVYYDPQCSSSQLNHAGLVVGYGS-EGIDGRKYWIVKNSWGELWGNNGYILMAKDEDNHC 324
Query: 335 GIATEASYP 343
GIATEASYP
Sbjct: 325 GIATEASYP 333
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 199/343 (58%), Gaps = 18/343 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M V + C S V + ++ + + + +QW H + Y E+ R I+++NL+
Sbjct: 1 MRVFLAAFTLCLSAVFAAPTL-DQQLNDHWDQWKKWHSKKYH-ATEEGWRRVIWEKNLKK 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G TY+LG N F D+T+EEFR G+ + R S F N
Sbjct: 59 IEMHNLEHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFKHK----KDRRFRGSLFMEPNFI 114
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+VP +DWREKG VT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 115 EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRP 174
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD+AF+Y+ + GL +E YPY + C + +AA G + D+P
Sbjct: 175 EGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTG-FVDIPS 233
Query: 247 GDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEE 302
G E AL++A+ PVSV ++A ++F+FY+ G+ EC + DHGV VG+G E+
Sbjct: 234 GKERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGED 293
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG+ GYI + +D CGIAT ASYP+
Sbjct: 294 VDGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPL 336
>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
Length = 329
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 196/340 (57%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++ + C + V ++ +PS+ W H +TY ELE+ R I+++NL
Sbjct: 1 MLRSLLFTVICGAVV----ALQDPSLDMHWLMWKKNHSKTYTSELEELGRREIWERNLRL 56
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I N E G TY LG N D+T EE + G R P+++R R S F
Sbjct: 57 ITVHNLEASLGMHTYDLGMNHMGDMTREEILQMFAG-TRVRPNLTR---RSSPFVASAGI 112
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
VP S+DWREKG VT +KNQG CGSCWAFSA A+EG + T G++ LS Q LVDCS+
Sbjct: 113 SVPDSVDWREKGYVTEVKNQGSCGSCWAFSAAGALEGQLKRTTGQVKSLSPQNLVDCSSK 172
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GG M +AF+Y+I++ G+ ++ YPY G C + + AA Y + +G
Sbjct: 173 YGNKGCNGGFMTQAFQYVIDDGGIDSDEAYPYTAMDGQC-RYDQSQRAANCSSYNYVSEG 231
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDG 305
DE AL QAV T P+SV ++A+ F Y GV + C N +HGV VVG+G+ ED
Sbjct: 232 DEEALKQAVATIGPISVAIDATRPMFILYHSGVYSDPTCTQNVNHGVLVVGYGSLNGED- 290
Query: 306 AKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYPV 344
YWL+KNSWG +G+ GYIRI R++G +CGIA A YP+
Sbjct: 291 --YWLVKNSWGTRFGDGGYIRIARNKGNMCGIANYACYPL 328
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 202/340 (59%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF + GY+ SR+S + NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYHG-----SRKSGGSTFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P ++DWR+KGAVT +K+QG CGSCWAFS ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLM+ AF+YI N G+ TE YPY+ G C +KE A G Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTG-YVEIKAGC 234
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDG 305
E L +AV T P+SV ++AS +F+ Y GV + EC ++ DHGV VVG+G + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA++ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 204/340 (60%), Gaps = 18/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + L C +V+ + ++ + QW AQH RTY E R +++NL+
Sbjct: 1 MNFYLCLASLCLG-LVAATPEFDQTLDSQWHQWKAQHRRTYAAN-EDGWRRATWEKNLKM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +++LG N+F D+T EEF+ GYN + S++ ++ S ++ +
Sbjct: 59 IEMHNLEYSAGKHSFQLGMNKFGDMTTEEFKQVMNGYNS---NGSQKRTKGSLYREPLLA 115
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P S+DWREKG VT +KNQG CGSCWAFSA ++EG KL+ LSEQ LVDCST
Sbjct: 116 QLPKSVDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTS 175
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
NNGCSGGLMD AFEY+ N G+ TE YPY + C K + + + A + + D+P
Sbjct: 176 EGNNGCSGGLMDNAFEYVKNNGGIDTEQAYPYLGQDNEC-KYRAECSGANVTGFVDIPSM 234
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGDN-CDHGVAVVGFGTAEEED 304
+E AL++AV P+SV ++A +F+FY+ GV +C + DHGV VVG+G+ +++
Sbjct: 235 NERALMKAVANVGPISVAIDAGNPSFQFYESGVYYEPQCSSSQLDHGVLVVGYGSIGKDE 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILR-DEGLCGIATEASYP 343
YW++KNSWGE WG+ GY+ + + CGIAT ASYP
Sbjct: 295 ---YWIVKNSWGEEWGKKGYVLMAKFRNNHCGIATAASYP 331
>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
Length = 336
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 138/341 (40%), Positives = 202/341 (59%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
+ +LV S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLYISAVFAAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR + GY Q+S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHD----PNQTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +K+Q CGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC+GGLMD AF+Y+ ENKGL +E YPY + + A I + D+P G+
Sbjct: 175 GNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGN 234
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVAVVGFG-TAEEED 304
E AL+ AV PVSV ++AS Q+ +FY+ G+ A DH V VVG+G +
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
G +YW++KNSW + WG+ GYI + +D+ CG+AT+ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|328872971|gb|EGG21338.1| cysteine proteinase 5 precursor [Dictyostelium fasciculatum]
Length = 358
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 145/361 (40%), Positives = 201/361 (55%), Gaps = 36/361 (9%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYI 73
F +I L++ + S S E + WM +H R+Y E R +++K+N++Y+
Sbjct: 3 FAVIFLIVLMLA-FASASSYSEQQYRDSFTNWMQKHSRSYASH-EFNTRYSVYKKNMDYV 60
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF-KYQNVTDVP 132
+ N +G+ T LG N +D+TN+E++A Y G + +S ++F K Q +P
Sbjct: 61 NEWNSKGSETV-LGLNSLADMTNQEYQAIYLGTKTDATARLAAASASASFGKVQGA--LP 117
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--N 190
SIDW +GAVT +KNQG CGSCW+FSA + EG QI+ L+ LSEQ L+DCS+ N
Sbjct: 118 ASIDWVAQGAVTQVKNQGQCGSCWSFSATGSTEGAHQISTSNLVALSEQNLIDCSSSYGN 177
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
+GC+GGLMD AF+YII N G+ TEA YPY + C K + AT+ Y D+ G E
Sbjct: 178 DGCNGGLMDNAFKYIIANGGIDTEASYPYVAKVQKC-KYNPANSGATLSSYVDVTSGSES 236
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVAVVGFGTAEEE----- 303
AL K PVSV ++AS Q+F+ Y GV A N DHGV VVG+GTA
Sbjct: 237 ALQSQTVKGPVSVAIDASHQSFQLYDSGVYYEPACSSTNLDHGVLVVGYGTASANGSSDS 296
Query: 304 -------------------DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYP 343
GA++W +KNSWG WG SGYI++ R+ + CGIAT AS P
Sbjct: 297 DSSAASQSSSSESSDDQATQGAQFWKVKNSWGPEWGLSGYIQMARNRDNNCGIATTASQP 356
Query: 344 V 344
+
Sbjct: 357 I 357
>gi|223673161|gb|ACN12762.1| Cathepsin S precursor [Salmo salar]
Length = 330
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 139/340 (40%), Positives = 199/340 (58%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++L + C + V ++ +P + + + W HG+ Y+ E+E+ R ++++NL+
Sbjct: 2 MLWSLLLAVLCGTAV----ALFDPMLEQHWQMWKKTHGKNYQTEVEELGRREVWERNLQL 57
Query: 73 IEKANKEGN---RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I N E + TY LG N D+T EE S+ P + R+ PS F +
Sbjct: 58 ISLHNLEASMDMHTYDLGMNHMGDMTQEEIAQSFASLLVPA-DLKRE---PSAFAGSSGA 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P + DWREKG VT +K QG CGSCWAFS+V A+EG T GKLI+LS Q LVDCS+
Sbjct: 114 PIPDTFDWREKGYVTGVKMQGSCGSCWAFSSVGALEGQLMKTTGKLIDLSPQNLVDCSSK 173
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC GG M KAF+Y+I+N+G+A++ YPY+ Q C + AA +Y LP+G
Sbjct: 174 YGNKGCHGGFMTKAFQYVIDNQGIASDQSYPYKGVQQQCIYNPAQ-RAANCSRYSFLPEG 232
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECGDNCDHGVAVVGFGTAEEEDG 305
DE L +A+ T P+SV ++A+ +F FY+ GV N C +H V VG+GT +D
Sbjct: 233 DEGVLKEALATIGPISVGIDATRPSFAFYRSGVYNDPTCTKKTNHAVLAVGYGTLGGQD- 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YWL+KNSWG +WG+ GYIR+ R+ + CGIA YPV
Sbjct: 292 --YWLVKNSWGLSWGDQGYIRMSRNKDNQCGIALYGCYPV 329
>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
Length = 319
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 139/286 (48%), Positives = 174/286 (60%), Gaps = 15/286 (5%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYT 104
+M Q+ + Y E + R FK ++E I N N +Y +G NEF+DL+ EEF+ Y
Sbjct: 45 FMKQYSKAYS-HAEFSSRFNQFKASVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAV 164
G V R+ +R + +Q V PTSIDWR AVT IK+QG CGSCWAFSA ++
Sbjct: 104 G----CKHVEREFARSNNL-HQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSI 158
Query: 165 EGITQITGGK-LIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQ 221
EG + G L LSEQQLVDCST N GC+GGLMD AFEYII NKG+ E+ YPY+
Sbjct: 159 EGAWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYKG 218
Query: 222 EQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL 280
G C QK TI ++D+ GDE + L AV T PVSV +EA F+FY GV
Sbjct: 219 VGGLC--QKSCTKVVTISGHKDVASGDEASSLNAVGTVGPVSVAIEADQAGFQFYSSGVF 276
Query: 281 NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
+ CG N DHGV VG+GT +D YW++KNSWG +WGESGYIR
Sbjct: 277 SGTCGHNLDHGVLAVGYGTTGSQD---YWIVKNSWGTSWGESGYIR 319
>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
Length = 331
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 137/321 (42%), Positives = 192/321 (59%), Gaps = 19/321 (5%)
Query: 33 MHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGT 88
+H +++ H + W HG+ YK + E+ R I+++NL+Y+ N E G +Y L
Sbjct: 18 LHRDPMLDGHWDLWKKTHGKQYKGQNEEIARRLIWEKNLKYVTLHNLEHSMGLHSYDLSM 77
Query: 89 NEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKN 148
N D+T+EE + + P Q +R +T++ + +P S+DWREKG VT +K
Sbjct: 78 NHLGDMTSEEVISLMSSLRIP-----NQWNRNTTYRLSSNQKLPDSVDWREKGCVTEVKY 132
Query: 149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST---DNNGCSGGLMDKAFEYI 205
QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCST DN+GC+GG M AF+Y+
Sbjct: 133 QGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYV 192
Query: 206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVC 264
I+N G+ ++ YPY+ G C + + AAT KY +LP G E AL +AV K PVSV
Sbjct: 193 IDNNGIDSDVSYPYKATDGKC-QYNPASRAATCSKYTELPYGSEEALKEAVANKGPVSVG 251
Query: 265 VEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESG 323
++A +F YK GV + C +HGV V+G+G DG YWL+KNSWG +G+ G
Sbjct: 252 IDAKTPSFFLYKSGVYYDPSCTQKVNHGVLVIGYGNL---DGQDYWLVKNSWGLHFGDKG 308
Query: 324 YIRILRDEG-LCGIATEASYP 343
Y+RI R+ G CGIA SYP
Sbjct: 309 YVRIARNRGNHCGIANFPSYP 329
>gi|209693435|ref|NP_001129410.1| cathepsin L precursor [Acyrthosiphon pisum]
gi|251823771|ref|NP_001156569.1| cathepsin L precursor [Acyrthosiphon pisum]
Length = 341
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 201/343 (58%), Gaps = 17/343 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
VI++ ++ A VS +++E I E+ + Q + Y+D E+ R ++ N I
Sbjct: 4 VIVLGLVAFAISTVSSINLNE-VIEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKLKIA 62
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQ--SSRPSTF-KYQNV 128
+ NK G TY L N F DL E+ G+ + R + TF K +NV
Sbjct: 63 RHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSENV 122
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWR+KG VT +KNQG CGSCW+FSA ++EG G L+ LSEQ L+DCS
Sbjct: 123 V-IPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181
Query: 189 D--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
NNGC GGLMD AF+YI NKGL TE YPY+ E C E + A G + D+P+
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKG-FVDIPE 240
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTAEEE 303
GDE AL+ A+ T PVS+ ++AS + F+FYK+GV N C DHGV VGFG+ ++
Sbjct: 241 GDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGS--DK 298
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
G YW++KNSWG+TWG+ GYI + R+ + CG+A+ ASYP+
Sbjct: 299 KGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 139/321 (43%), Positives = 194/321 (60%), Gaps = 17/321 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P + + + W H + Y E E+ R ++++NL IE N E G +Y+LG N F
Sbjct: 21 DPQLDQHWQLWKGWHSKNYH-EKEEGWRRLVWEKNLRKIELHNLEHSMGKHSYRLGMNHF 79
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+T+EEFR GY R ++ S F N + P ++DWR+KG VT +K+QG
Sbjct: 80 GDMTHEEFRQIMNGYKR----REQRKYSGSLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQ 135
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFS A+EG GKL+ LSEQ LVDCS N GC+GGLMD+AF+Y+ +N+
Sbjct: 136 CGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQ 195
Query: 210 GLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEA 267
GL +E YPY+ + C + +A G + D+P G E AL++AV PVSV ++A
Sbjct: 196 GLDSEDFYPYKGTDDQPCQYNAQYSAVNDTG-FVDIPSGKERALMKAVASVGPVSVAIDA 254
Query: 268 SGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEEDGAKYWLIKNSWGETWGESGY 324
++F+FY+ G+ EC D DHGV VVG+G E+ DG KYW++KNSW E WG+ G+
Sbjct: 255 GHESFQFYQSGIYFEKECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGF 314
Query: 325 IRILRD-EGLCGIATEASYPV 344
I + +D CGIAT ASYP+
Sbjct: 315 IYMAKDRHNHCGIATAASYPL 335
>gi|427797099|gb|JAA64001.1| Putative cathepsin l cathepsin l, partial [Rhipicephalus pulchellus]
Length = 331
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 140/320 (43%), Positives = 194/320 (60%), Gaps = 13/320 (4%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT---YKLGTNE 90
HE + + + A HG+ Y+ + E+ RL I+ +N I + N++ ++ YKL NE
Sbjct: 15 HEELVGAEWSAFKALHGKEYESDTEEYYRLKIYMENRLKIARHNEKYAKSQVSYKLAMNE 74
Query: 91 FSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQG 150
F D+ + EF ++ G+ R R+ S + +P ++DWR+KGAVT +KNQG
Sbjct: 75 FGDMLHHEFVSTRNGFKRNYRDTPREGSFFVEPEGLEDFHLPKTVDWRKKGAVTPVKNQG 134
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIEN 208
CGSCW+FS ++EG KL+ LSEQ L+DCS NNGC GGLMD AF+YI N
Sbjct: 135 QCGSCWSFSTTGSLEGQHFRKLHKLVSLSEQNLIDCSRSFGNNGCEGGLMDYAFKYIKAN 194
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEA 267
KG+ TE YPY G C K A G + D+P+GDE+ L +AV T PVSV ++A
Sbjct: 195 KGIDTEQSYPYNATDGVCHFNKSAVGATDTG-FVDIPEGDENKLKKAVATVGPVSVAIDA 253
Query: 268 SGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
S ++F+FY GV + EC + DHGV VVG+GT +DG YWL+KNSWG TWG+ GYI
Sbjct: 254 SHESFQFYSEGVYDEPECDSEQLDHGVLVVGYGT---KDGQDYWLVKNSWGTTWGDGGYI 310
Query: 326 RILRD-EGLCGIATEASYPV 344
+ R+ + CGIA+ ASYP+
Sbjct: 311 YMSRNKDNQCGIASAASYPL 330
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 142/345 (41%), Positives = 204/345 (59%), Gaps = 24/345 (6%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIE 74
++++ CA VS + +V+ E+W A QH Y+ E+E R+ I+ ++ I
Sbjct: 4 LVLLLCAVAAVSAVQFFD--LVK--EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIA 59
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS-----VSRQSSRPSTFKYQ 126
K N++ G +YKLG N++ D+ + EF + G+N+ + S R + F
Sbjct: 60 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISP 119
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+P +DWR+ GAVT IK+QG CGSCW+FS A+EG G L+ LSEQ L+DC
Sbjct: 120 ANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC 179
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S NNGC+GGLMD AF+YI +N G+ TE YPY+ C + A +G + D+
Sbjct: 180 SEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPKNTGAEDVG-FVDI 238
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLNAE--CGDNCDHGVAVVGFGTAE 301
P+GDE L++AV T PVSV ++AS +F+ Y GV N E + DHGV VVG+GT
Sbjct: 239 PEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGT-- 296
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPVA 345
+E G YWL+KNSWG +WGE GYI+++R++ CGIA+ ASYP+
Sbjct: 297 DEQGVDYWLVKNSWGRSWGELGYIKMIRNKNNRCGIASSASYPLV 341
>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
Length = 331
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 200/340 (58%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++ ++ C+S + +P++ + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MNWLVWALLLCSSAMA--HVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y+LG N D+T+EE + + P Q R T+K
Sbjct: 59 VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVP-----SQWPRNVTYKSDPNQ 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+P S+DWREKG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 114 KLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTA 173
Query: 189 --DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GG M +AF+YII+N G+ +EA YPY+ G C + K AAT +Y +LP
Sbjct: 174 KYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKC-QYDVKNRAATCSRYIELPF 232
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEED 304
G E AL +AV K PVSV ++AS +F YK GV + C N +HGV VVG+G D
Sbjct: 233 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNL---D 289
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
G YWL+KNSWG +G+ GYIR+ R+ G CGIA SYP
Sbjct: 290 GKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 329
>gi|118424553|gb|ABK90824.1| cathepsin L-like cysteine proteinase [Spodoptera exigua]
Length = 344
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 139/324 (42%), Positives = 195/324 (60%), Gaps = 29/324 (8%)
Query: 44 QWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR---TYKLGTNEFSDLTNE 97
+W A +H + Y E+E R+ I+ +N I K N+ + +YKL N+++D+ +
Sbjct: 26 EWNAFKMEHSKQYDSEVEDKFRMKIYVENKHRITKHNQRFEQRLVSYKLKPNKYADMLHH 85
Query: 98 EFRASYTGYNRPVPSVSRQSS--------RPSTFKYQNVTDVPTSIDWREKGAVTHIKNQ 149
EF + G+N+ R + R +TF P +DWR+KGAVT +K+Q
Sbjct: 86 EFVHTMNGFNKTAKHGGRNKNVHGKGHDGRAATFIAPAHVSYPDHVDWRKKGAVTDVKDQ 145
Query: 150 GHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIE 207
G CGSCWAFS A+EG G L+ LSEQ L+DCS NNGC+GGLMD AF+YI +
Sbjct: 146 GKCGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKD 205
Query: 208 NKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVE 266
N G+ TE YPY+ C +++ A +G + D+P+GDE L+QAV T P+SV ++
Sbjct: 206 NGGIDTEKSYPYEAVDDKCRYNPKESGADDVG-FVDIPQGDEEKLMQAVATVGPISVAID 264
Query: 267 ASGQAFRFYKRGVLNAECGDNC-----DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
AS + F+FY +GV E NC DHGV VVG+GT EEDG+ WL+KNSWG +WGE
Sbjct: 265 ASQETFQFYSKGVYYDE---NCSSTDLDHGVMVVGYGT--EEDGSDDWLVKNSWGRSWGE 319
Query: 322 SGYIRILRDE-GLCGIATEASYPV 344
GYI++ R++ CGIA+ ASYP+
Sbjct: 320 LGYIKMARNKNNHCGIASSASYPL 343
>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
Length = 492
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 141/338 (41%), Positives = 188/338 (55%), Gaps = 40/338 (11%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
+I L + A G++ E W+ H T+ D E A RL + N YI
Sbjct: 9 LIALSLLFAQNRADGKTFKEYE--SDFVSWLKTHHLTFSDAFEYAKRLETYIANDIYILT 66
Query: 76 ANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVP 132
N + ++KLG N FS LTNEEFR + G+ +++ QS+ S+ +Q + D+P
Sbjct: 67 HNLQ-ESSFKLGHNAFSHLTNEEFRQRFNGFKASDDYLTKRLAQSNVASSTNFQYI-DLP 124
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NN 191
S+DW EKGAVT +KNQG CGSCWAFS A+EG T I+ GKL+ LSEQ+LVDC + ++
Sbjct: 125 ESVDWVEKGAVTGVKNQGMCGSCWAFSTTGAIEGATFISSGKLVSLSEQELVDCDHNGDH 184
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC+GGLMD AF +I E+ G+ +E DY Y Q C K +
Sbjct: 185 GCNGGLMDHAFSWISEHDGICSEEDYAYIHSQSLCRSCKPVVS----------------- 227
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
PV+V ++A ++F+FY+ GV N CG DHGV VG+G EDG KYW +
Sbjct: 228 --------PVAVAIDAGDRSFQFYQSGVYNKTCGTQLDHGVLTVGYGV---EDGQKYWKV 276
Query: 312 KNSWGETWGESGYIRILRDE----GLCGIATEASYPVA 345
KNSWG +WGE GYIR+ RD+ G CGIA SYP A
Sbjct: 277 KNSWGNSWGEKGYIRLSRDQNGRSGQCGIAMVPSYPTA 314
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 132/318 (41%), Positives = 180/318 (56%), Gaps = 23/318 (7%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
+ E+WMA+ G+ Y EK R +F+ N+ +I L N+F+DLTN+EF
Sbjct: 39 QMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEF 98
Query: 100 RASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
+++TG P P + + P +P IDWR KGAVT +K+QG CGSCWAF+
Sbjct: 99 VSTHTGAKPPCPKDAPRGVDP--------IWLPCCIDWRYKGAVTDVKDQGACGSCWAFA 150
Query: 160 AVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPY 219
AVAA+EG+TQI GKL LSEQ+LVDC T ++GC+GG D+AFE + G+ E+ Y Y
Sbjct: 151 AVAAIEGLTQIRTGKLTPLSEQELVDCDTGSSGCAGGHTDRAFELVAAKGGITAESGYRY 210
Query: 220 QQEQGTCDKQKEK-AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRG 278
+ +G C AA IG + +P GDE L AV +QPV+ ++ASG AF+FY G
Sbjct: 211 EGYRGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGSG 270
Query: 279 VLNAEC---------GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
V C +H V +VG+ + G KYW+ KNSWG+TWGE GYI + +
Sbjct: 271 VFPGPCGSGSGAAAAAPTTNHAVTLVGY-CQDGASGKKYWVAKNSWGKTWGEKGYILLEK 329
Query: 330 D----EGLCGIATEASYP 343
D G CG+A YP
Sbjct: 330 DVASPHGTCGVAVSPFYP 347
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 144/329 (43%), Positives = 193/329 (58%), Gaps = 23/329 (6%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR---T 83
VV+ ++ S+ + + +H + YKD E+A R +F + +EYI++ N E +R +
Sbjct: 7 VVALLALASCSLDREWGMFKVRHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHS 66
Query: 84 YKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREK 140
+++G NE++D+ NEEF GY Q RP Y NV D+P ++DWR K
Sbjct: 67 FRVGINEYADMPNEEFVRVMNGY-------KMQEQRPKAPTYMPPSNVGDLPATVDWRTK 119
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLM 198
G VT +KNQG CGSCWAFS+ ++EG T KLI LSEQ LVDCST+ N GC GGLM
Sbjct: 120 GYVTEVKNQGQCGSCWAFSSTGSLEGQTFKKYNKLISLSEQNLVDCSTEQGNMGCGGGLM 179
Query: 199 DKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-T 257
D+AF YI N G+ TE YPY+ G C K A G Y D+ E L AV T
Sbjct: 180 DQAFTYIKVNDGIDTETSYPYEAASGKCRFNKANVGANDTG-YTDIKSKSESDLQSAVAT 238
Query: 258 KQPVSVCVEASGQAFRFYKRGVLN-AECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSW 315
P++V ++AS +F+ YK GV + C DHGV VG+GT + G YWL+KNSW
Sbjct: 239 VGPIAVAIDASHMSFQLYKSGVYHYIFCSQTRLDHGVLAVGYGT---DSGKDYWLVKNSW 295
Query: 316 GETWGESGYIRILRD-EGLCGIATEASYP 343
G TWG+ GYI + R+ + CGIAT+ASYP
Sbjct: 296 GATWGQQGYIMMSRNRDNNCGIATQASYP 324
>gi|225707912|gb|ACO09802.1| Cathepsin K precursor [Osmerus mordax]
Length = 331
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 140/320 (43%), Positives = 193/320 (60%), Gaps = 16/320 (5%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTN 89
M E S+ + E W H + Y E+ +R I+++N+ IE N+E G +Y+LG N
Sbjct: 19 MDEVSLDTEWENWKTTHNKEYNGLDEEGIRRAIWEKNMRMIEAHNQEAALGMHSYELGMN 78
Query: 90 EFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSIDWREKGAVTHIKN 148
D+T+EE G P+ R +TF N V +P SID+R KG VT +KN
Sbjct: 79 NLGDMTSEEVAEKMMGLQVPL-----NRDRGNTFVPDNTVERLPKSIDYRRKGMVTPVKN 133
Query: 149 QGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIEN 208
QG CGSCWAFS+V A+EG T GKL++LS Q LVDC T+NNGC GG M AF Y+ +N
Sbjct: 134 QGSCGSCWAFSSVGALEGQLMKTTGKLVDLSPQNLVDCVTENNGCGGGYMTNAFNYVRDN 193
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEA 267
+G+ +EA YPY + TC A+ G Y+++P+G+E AL AV K PVSV ++A
Sbjct: 194 QGIDSEAAYPYIGQDETCAYNVSGMTASCRG-YKEIPEGNERALTVAVAKVGPVSVGIDA 252
Query: 268 SGQAFRFYKRGV-LNAECG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
+ F+FY++GV + C D+ +H V VG+G + G KYW++KNSW E+WG GYI
Sbjct: 253 TLSTFQFYQKGVYYDRNCNKDDINHAVLAVGYGVTPK--GKKYWIVKNSWSESWGNKGYI 310
Query: 326 RILRDEG-LCGIATEASYPV 344
+ R+ G LCGIA ASYP+
Sbjct: 311 LMARNRGNLCGIANLASYPI 330
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 203/340 (59%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF + G++ +R++ + NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSTFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P ++DWR+KGAVT +K+QG CGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLM+ AF+YI N G+ TE YPY+ G C +KE A G Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTG-YVEIKAGS 234
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDG 305
E L +AV T P+SV ++AS +F+ Y GV + EC ++ DHGV VVG+G + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA++ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 400
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 135/311 (43%), Positives = 196/311 (63%), Gaps = 8/311 (2%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN 77
++ +TC Q S +S E E+HE+WMAQ+G+ Y+D E R IFK N+++IE N
Sbjct: 92 LVGVTCGRQCRS-KSRLEACTSERHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFN 150
Query: 78 KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSID 136
G++ + + N+F DL +EEF+A R V V ++ ++F+Y + VT++P ++D
Sbjct: 151 VAGDKPFNIRINQFPDLHDEEFKALLINGQRKVSGV-ETATEETSFRYGSVVTNIPATMD 209
Query: 137 WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD-CSTDNNGCSG 195
R+KG VT IK+QG GSCWA SAVAA+EGI QIT KL+ LS+Q+LVD ++ GC G
Sbjct: 210 GRKKGVVTPIKDQGIIGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVDSVKGESEGCIG 269
Query: 196 GLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQA 255
G ++ AFE+I++ G+ +E YPY+ C +KE + A I YE +P ++ ALL+
Sbjct: 270 GYVEDAFEFIVKKGGILSETHYPYKGVN-XCKVEKETHSVAHIKGYEKVPSNNKKALLKV 328
Query: 256 VTKQPVSVCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGAKYWLIKNS 314
V QPVSV ++ AF++Y + NA CG + +H VAVVG+G A DGAKYW +KNS
Sbjct: 329 VANQPVSVYIDVGAHAFKYYSSEIFNARNCGSDPNHVVAVVGYGKA--LDGAKYWPVKNS 386
Query: 315 WGETWGESGYI 325
WG WG Y+
Sbjct: 387 WGTEWGGKWYM 397
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 193/318 (60%), Gaps = 18/318 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ W +G+ YK++ E+ R I+++NL+++ N E G +Y LG N
Sbjct: 18 DPALDHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHL 77
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+T+EE + + VPS Q R T+K + +P S+DWREKG VT +K QG
Sbjct: 78 GDMTSEEVISLMSSLR--VPS---QWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGA 132
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---NNGCSGGLMDKAFEYIIEN 208
CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST+ N GC+GG M +AF+YII+N
Sbjct: 133 CGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDN 192
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEA 267
G+ +EA YPY+ G C + K AAT KY +LP G E L +AV K PVSV ++A
Sbjct: 193 NGIDSEASYPYKATDGKC-RYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDA 251
Query: 268 SGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
+F Y+ GV + C N +HGV VVG+G +G YWL+KNSWG +G+ GYIR
Sbjct: 252 RHSSFFLYRSGVYYDPSCTQNVNHGVLVVGYGNL---NGKDYWLVKNSWGLNFGDQGYIR 308
Query: 327 ILRDEG-LCGIATEASYP 343
+ R+ G CGIA+ SYP
Sbjct: 309 MARNSGNHCGIASYPSYP 326
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 143/310 (46%), Positives = 191/310 (61%), Gaps = 20/310 (6%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
++ HG+ Y E E+A R I++ NL+YIEK N G+ ++ LG NE+ D+TNEEFR+
Sbjct: 30 YLKAHGKQYGAE-EEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEEFRS 88
Query: 102 SYTGYNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
+ GY + +SR S + N+ D+P ++DWR KG VT IKNQG CGSCW+FSA
Sbjct: 89 TMNGYK-----MRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSA 143
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYP 218
++EG T GKL LSEQ LVDCS N+GC GGLMD AF+YI +N G+ TE+ YP
Sbjct: 144 TGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGIDTESSYP 203
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKR 277
Y+ + G C A G + D+ E L AV T P+SV ++AS +F+ Y+
Sbjct: 204 YEAKNGKCRFNAANVGATDSG-FTDIKSKSESDLQSAVATVGPISVAIDASHMSFQLYRS 262
Query: 278 GVLNA-ECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLC 334
GV + C + DHGV VG+GT E G YWL+KNSWGE+WG+ GYI + R++ C
Sbjct: 263 GVYHEFFCSETRLDHGVLAVGYGT---ESGKDYWLVKNSWGESWGQKGYIMMSRNKRNNC 319
Query: 335 GIATEASYPV 344
GIAT ASYP
Sbjct: 320 GIATSASYPT 329
>gi|52630917|gb|AAU84922.1| putative cathepsin L [Toxoptera citricida]
Length = 341
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 200/343 (58%), Gaps = 17/343 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
VI++ ++ A VS +++E I E+ + + Q + Y+D E+A R ++ N I
Sbjct: 4 VIVLGLVVFAISSVSSINLNEI-IEEEWDLFKVQFKKIYEDVKEEAFRKKVYLDNKLKIA 62
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST---FKYQNV 128
+ NK G TY L N F DL E+ G+ + + + K +NV
Sbjct: 63 RHNKLYETGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDKNFTDDDAVTFLKSENV 122
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P SIDWR+KG VT +KNQG CGSCW+FSA ++EG G L+ LSEQ L+DCS
Sbjct: 123 V-IPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181
Query: 189 D--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
NNGC GGLMD AF+YI NKGL TE YPY+ E C E + A G + D+P+
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKG-FVDIPE 240
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTAEEE 303
GDE AL+ A+ T PVS+ ++AS + F+FYK+GV N C DHGV VG+GT +
Sbjct: 241 GDEDALVHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGT--DH 298
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
G YW++KNSWG+TWG+ GYI + R+ + CG+A+ ASYP+
Sbjct: 299 KGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPLV 341
>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
Length = 330
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 139/336 (41%), Positives = 194/336 (57%), Gaps = 18/336 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
I L+ T ++S H+PS E+W +HG+TY E+ + +++ N++ I
Sbjct: 4 IFLLATLCLGMISAAPTHDPSFDTVWEEWKTKHGKTYNTN-EEGQKRAVWENNMKMINLH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N++ G + L N F DLTN EFR TG+ P + F+ + D+P
Sbjct: 63 NEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMGPK------ETTIFREPFLGDIPK 116
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NN 191
S+DWRE G VT +KNQG CGSCWAFSAV ++EG GKL+ LSEQ LVDCS N
Sbjct: 117 SLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNL 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC+GGLM+ AF+Y+ EN+GL T Y Y+ + G C + K +AA + + +P ++
Sbjct: 177 GCNGGLMEFAFQYVKENRGLDTGESYAYEAQDGLC-RYNPKYSAANVTGFVKVPLSEDDL 235
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGV-LNAECGDN-CDHGVAVVGFGTAEEEDGAKYW 309
+ + PVSV +++ Q+FRFY G+ +C DH V VVG+G EE DG KYW
Sbjct: 236 MSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYG--EESDGGKYW 293
Query: 310 LIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
L+KNSWGE WG GYI++ +D+ CGIAT A YP
Sbjct: 294 LVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYPT 329
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 143/310 (46%), Positives = 191/310 (61%), Gaps = 20/310 (6%)
Query: 45 WMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
++ HG+ Y E E+A R I++ NL+YIEK N G+ ++ LG NE+ D+TNEEFR+
Sbjct: 30 YLKAHGKQYGAE-EEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEEFRS 88
Query: 102 SYTGYNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
+ GY + +SR S + N+ D+P ++DWR KG VT IKNQG CGSCW+FSA
Sbjct: 89 TMNGY-----KMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSA 143
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYP 218
++EG T GKL LSEQ LVDCS N+GC GGLMD AF+YI +N G+ TE+ YP
Sbjct: 144 TGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGIDTESSYP 203
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKR 277
Y+ + G C A G + D+ E L AV T P++V ++AS +F+ YK
Sbjct: 204 YEAKNGKCRFNAANVGATDSG-FTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKS 262
Query: 278 GVLNA-ECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLC 334
GV + C + DHGV VG+GT E G YWL+KNSWGE+WG+ GYI + R++ C
Sbjct: 263 GVYHEFFCSETRLDHGVLAVGYGT---ESGKDYWLVKNSWGESWGQKGYIMMSRNKRNNC 319
Query: 335 GIATEASYPV 344
GIAT ASYP
Sbjct: 320 GIATSASYPT 329
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 193/318 (60%), Gaps = 18/318 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ W +G+ YK++ E+ R I+++NL+++ N E G +Y LG N
Sbjct: 30 DPALDHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHL 89
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+T+EE + + VPS Q R T+K + +P S+DWREKG VT +K QG
Sbjct: 90 GDMTSEEVISLMSSLR--VPS---QWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGA 144
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---NNGCSGGLMDKAFEYIIEN 208
CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST+ N GC+GG M +AF+YII+N
Sbjct: 145 CGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDN 204
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEA 267
G+ +EA YPY+ G C + K AAT KY +LP G E L +AV K PVSV ++A
Sbjct: 205 NGIDSEASYPYKATDGKC-RYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDA 263
Query: 268 SGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
+F Y+ GV + C N +HGV VVG+G +G YWL+KNSWG +G+ GYIR
Sbjct: 264 RHSSFFLYRSGVYYDPSCTQNVNHGVLVVGYGNL---NGKDYWLVKNSWGLNFGDQGYIR 320
Query: 327 ILRDEG-LCGIATEASYP 343
+ R+ G CGIA+ SYP
Sbjct: 321 MARNSGNHCGIASYPSYP 338
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 140/306 (45%), Positives = 187/306 (61%), Gaps = 18/306 (5%)
Query: 48 QHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYT 104
Q+ + Y++E E+A R +++ NL++I N G T+ +G NE+ D+TNEEF +
Sbjct: 33 QYNKLYQNE-EEARRRLVWESNLDFITLHNLAADRGEHTFWVGMNEYGDMTNEEFTKTMN 91
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAV 164
GY ++ S+ P N+ D+P ++DWR KG VT IKNQG CGSCW+FSA ++
Sbjct: 92 GYRMR----NKTSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSL 147
Query: 165 EGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
EG T GKL+ LSEQ LVDCS N+GC GGLMD AF YI N G+ TEA YPY+
Sbjct: 148 EGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYIKANNGIDTEASYPYKAR 207
Query: 223 QGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN 281
G C+ + A G + D+ DE AL QAV T P+SV ++AS +F+ Y+ GV +
Sbjct: 208 DGKCEFKSADVGATDTG-FVDIKTKDEEALKQAVATVGPISVAIDASHMSFQLYRTGVYH 266
Query: 282 AE-CGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIAT 338
C DHGV VG+GT ED YWL+KNSWGE+WG+ GYI++ R+ CGIAT
Sbjct: 267 DWFCSQTKLDHGVLAVGYGT---EDSKDYWLVKNSWGESWGQKGYIQMSRNRRNNCGIAT 323
Query: 339 EASYPV 344
ASYP
Sbjct: 324 SASYPT 329
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 138/323 (42%), Positives = 198/323 (61%), Gaps = 25/323 (7%)
Query: 36 PSIVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTN 89
PS ++W+A HG+ Y+++ E+ R+ +F N + I++ N + G +YK+ N
Sbjct: 4 PSFDIDPQEWLAFKAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMN 63
Query: 90 EFSDLTNEEFRASYTGYNRPVPSVSRQSS--RPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
DL EF+A G+ + P+ R PS ++P S+DWR++GAVT +K
Sbjct: 64 HLGDLMVHEFKALMNGFKK-TPNAERNGKIYVPSN------ENLPKSVDWRQRGAVTPVK 116
Query: 148 NQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYI 205
+QGHCGSCW+FSA ++EG + G+L+ LSEQ LVDCS N+GC GGLM++AF+Y+
Sbjct: 117 DQGHCGSCWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYV 176
Query: 206 IENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVC 264
+NKG+ TEA YPY+ + C + KE T Y D+ + E L AV T P+SV
Sbjct: 177 RDNKGIDTEASYPYEARENNC-RFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVR 235
Query: 265 VEASGQAFRFYKRGVLNAE-CG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGES 322
++AS ++F+FY GV + C DHGV VG+GT E+G YWL+KNSWG +WGES
Sbjct: 236 IDASHESFQFYSEGVYKEQYCSPSQLDHGVLTVGYGT---ENGQDYWLVKNSWGPSWGES 292
Query: 323 GYIRILRD-EGLCGIATEASYPV 344
GYI+I R+ + CGIA+ ASYPV
Sbjct: 293 GYIKIARNHKNHCGIASMASYPV 315
>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 117/218 (53%), Positives = 150/218 (68%), Gaps = 8/218 (3%)
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-N 190
P S+DWR+KG + +K+QG CGSCWAFSAVAA+E I I G LI LSEQ+LVDC N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC GGLMD AFE++I N G+ TE DYPY++ G CD+ ++ A TI YED+P +E
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNGVCDQYRKNAKVVTIDSYEDVPVNNEK 121
Query: 251 ALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWL 310
AL +AV QPVS+ +EA G+ F+ YK G+ +CG DHGV V G+GT E+G YW+
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVVAGYGT---ENGMDYWI 178
Query: 311 IKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
++NSWG WGE GY+R+ R+ GLCG+A E SYPV
Sbjct: 179 VRNSWGAKWGEKGYLRVQRNVASSSGLCGLAIEPSYPV 216
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/329 (43%), Positives = 195/329 (59%), Gaps = 18/329 (5%)
Query: 24 ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---G 80
AS + ++P++ W +GR Y+++ E+ R I+++NL+ + N E G
Sbjct: 18 ASSYAVAQVQNDPTLDHHWNLWKKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMG 77
Query: 81 NRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
+Y LG N +D+T+EE + + VPS Q T+K + +P S+DWREK
Sbjct: 78 MHSYDLGMNHLADMTSEEVSSLMSSLR--VPS---QWQANVTYKSNSNQKLPDSVDWREK 132
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---NNGCSGGL 197
G VT +K QG CG+CWAFSAV A+E ++ G L+ LS Q LVDCST+ N GC+GG
Sbjct: 133 GCVTEVKYQGACGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGF 192
Query: 198 MDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV- 256
M KAF+YII+N G+ +E YPY+ G C + K AAT KY +LP G E AL +AV
Sbjct: 193 MTKAFQYIIDNNGIDSEVSYPYKAMDGNC-RYDSKHRAATCSKYTELPFGSEDALKEAVA 251
Query: 257 TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSW 315
K PVSV ++A +F YK GV + C N +HGV VVG+G +G YWL+KNSW
Sbjct: 252 NKGPVSVAIDAKHSSFFLYKSGVYYDPSCTQNVNHGVLVVGYGNL---NGRDYWLVKNSW 308
Query: 316 GETWGESGYIRILRDEG-LCGIATEASYP 343
G +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 309 GLNFGEQGYIRMARNSGNHCGIASYPSYP 337
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/330 (43%), Positives = 196/330 (59%), Gaps = 19/330 (5%)
Query: 27 VVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTY 84
VV+ H E + +E+W+ +HG+ Y EK R IFK NL++IE+ N + NR+Y
Sbjct: 24 VVTATESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSY 83
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
G N+FSDLT +EF+ASY G S+S + R ++Y+ +P +DWRE+GAV
Sbjct: 84 DRGLNQFSDLTVDEFQASYLGGKIEKKSLSDVAER---YQYKEGDILPDEVDWRERGAVV 140
Query: 145 -HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKA 201
+K QG CGSCWAF+A AVEGI QIT G+L+ LSEQ+L+DC DN GC+GG A
Sbjct: 141 PRVKRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWA 200
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAA--AATIGKYEDLPKGDEHALLQAVTKQ 259
FE+I EN G+ T+ DY Y + K E TI +E +P DE +L +AV+ Q
Sbjct: 201 FEFIKENGGIVTDEDYGYTGDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVSYQ 260
Query: 260 PVSVCVEASGQAFRFYKRGVLNAECGDNC-DHGVAVVGFGTAEEEDGAKYWLIKNSWGET 318
P+SV + A+ + YK GV C + DH V +VG+GT+ +E YWLI+NSWG
Sbjct: 261 PISVMISAANMS--DYKSGVYKGPCSNLWGDHNVLIVGYGTSSDE--GDYWLIRNSWGPG 316
Query: 319 WGESGYIRILRD----EGLCGIATEASYPV 344
WGE GY+R+ R+ G C +A YP+
Sbjct: 317 WGEGGYLRLQRNFNEPTGKCAVAVAPVYPI 346
>gi|194719810|emb|CAR31335.1| pro-asclepain f [Gomphocarpus fruticosus subsp. fruticosus]
Length = 340
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 145/350 (41%), Positives = 209/350 (59%), Gaps = 23/350 (6%)
Query: 7 KSFIIPM-FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTI 65
KSF++ + F++ + ITC S + RS E ++ +E+W+ +H + Y EK R I
Sbjct: 2 KSFVLILSFLLFVSAITCIS--TNWRSDDE--VIALYEEWLVKHQKLYSSLGEKIKRFEI 57
Query: 66 FKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTG----YNRPVPSVSRQSS 118
FK NL YI++ N K + + LG N+F+DLT +EF + Y G Y + + S
Sbjct: 58 FKDNLRYIDQQNHYNKVNHMNFTLGLNQFADLTLDEFSSIYLGTSVDYEQIISSNPNHDD 117
Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIEL 178
++V ++P S+DWREKG V I+NQG CGSCW FSAVA++E + I G +I L
Sbjct: 118 VEEDILKEDVVELPDSVDWREKGVVFPIRNQGKCGSCWTFSAVASIETLNGIKKGHMIAL 177
Query: 179 SEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATI 238
SEQ+L+DC T + GC GG + AF Y+ +N G+ +E YPY QG C QKEK I
Sbjct: 178 SEQELLDCETISQGCKGGHYNNAFAYVAKN-GITSEEKYPYIFRQGQC-YQKEKVVK--I 233
Query: 239 GKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
Y+ +P+ + L AV +Q VSV V+ + F+FY RG+ + CG DH V +VG+G
Sbjct: 234 SGYKRVPRNNGGQLQSAVAQQVVSVAVKCESKDFQFYDRGIFSGACGPILDHAVNIVGYG 293
Query: 299 TAEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+ + GA YW+++NSWG WGE+GY+RI ++ EG CGIA + SYPV
Sbjct: 294 S---KGGANYWIMRNSWGTNWGENGYMRIQKNSKHYEGHCGIAMQPSYPV 340
>gi|62510453|sp|Q8HY82.1|CATS_SAIBB RecName: Full=Cathepsin S; Flags: Precursor
gi|27497536|gb|AAO13008.1| cathepsin S preproprotein [Saimiri boliviensis]
Length = 330
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 203/340 (59%), Gaps = 21/340 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ ++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE + + P Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP-----NQWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCS
Sbjct: 113 QMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSE 172
Query: 189 D--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GG M +AF+YII+NKG+ +EA YPY+ C + K AAT KY +LP
Sbjct: 173 KYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKC-QYDSKYRAATCSKYTELPY 231
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEED 304
G E L +AV K PV V V+AS +F Y+ GV + C +HGV V+G+G + +
Sbjct: 232 GREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYG---DLN 288
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
G +YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 289 GKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 328
>gi|383410403|gb|AFH28415.1| cathepsin L1 preproprotein [Macaca mulatta]
Length = 333
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 202/343 (58%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
P F++ + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C E + A G + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSVANDTG-FVDIPK 230
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEE 302
E AL++AV T P+SV ++A ++F FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D +KYWL KNSWGE WG GYI++ +D CGIA+ ASYP
Sbjct: 290 SDNSKYWLGKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
Length = 320
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 134/329 (40%), Positives = 194/329 (58%), Gaps = 31/329 (9%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPS----IVEKHEQWMAQHGRTYKDELEKAMRL 63
S +I + +I+++V+ A ++ + E I E W A+HG++Y + EKA R+
Sbjct: 3 SNMIALILILLVVVGAAPFAIARPAALEDDRALEIKNMFEDWAAKHGKSYSSDWEKARRM 62
Query: 64 TIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF 123
TIF L YIEK N N T+ LG N+FSDLTN EFRA+Y G +P Q RP+
Sbjct: 63 TIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRANYVGKFKPP---RYQDRRPAKD 119
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQL 183
+V+ +PTS+DWR++GAVT IK+QG CGSCWAFSA+A++E + +L+ LSEQQL
Sbjct: 120 VDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASIESAHFLATNQLVSLSEQQL 179
Query: 184 VDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
+DC T + GC E YPY G+C+ K K A T +
Sbjct: 180 IDCDTVDEGCQ-------------------EEAYPYTGLAGSCNANKNKVAEIT--GFNV 218
Query: 244 LPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEE 303
+ K AL++AV+K PV+V + S Q F+ Y+ G+L+ +C ++ DH V V+G+GT E
Sbjct: 219 VTKDKADALMKAVSKTPVTVGICGSDQNFQNYRSGILSGQCCNSRDHVVLVIGYGT---E 275
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRDEG 332
G YW+IKNSWG +WGE G+++I + +G
Sbjct: 276 GGMPYWIIKNSWGTSWGEDGFMKIEKKDG 304
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 202/340 (59%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF + G++ +R++ S NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSSFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P +DWR+KGAVT +K+QG CGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLM+ AF+YI N G+ TE YPY+ G C +KE A G Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYKAVDGECRFKKEDVGATDTG-YVEIKAGS 234
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDG 305
E L +AV T P+SV ++AS +F+ Y GV + EC ++ DHGV VVG+G + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA++ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 132/316 (41%), Positives = 179/316 (56%), Gaps = 23/316 (7%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAS 102
E+WMA+ G+ Y EK R +F+ N+ +I L N+F+DLTN+EF ++
Sbjct: 20 EEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDEFVST 79
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVA 162
+TG P P + + P +P IDWR KGAVT +K+QG CGSCWAF+AVA
Sbjct: 80 HTGAKPPCPKDAPRGVDPIW--------LPCCIDWRYKGAVTDVKDQGACGSCWAFAAVA 131
Query: 163 AVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQE 222
A+EG+TQI GKL LSEQ+LVDC T ++GC+GG D+AFE + G+ E+ Y Y+
Sbjct: 132 AIEGLTQIRTGKLTPLSEQELVDCDTGSSGCAGGHTDRAFELVAAKGGITAESGYRYEGY 191
Query: 223 QGTCDKQKEK-AAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLN 281
+G C AA IG + +P GDE L AV +QPV+ ++ASG AF+FY GV
Sbjct: 192 RGKCRADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFYGSGVFP 251
Query: 282 AEC---------GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-- 330
C +H V +VG+ + G KYW+ KNSWG+TWGE GYI + +D
Sbjct: 252 GPCGSGSGAAAAAPTTNHAVTLVGY-CQDGASGKKYWVAKNSWGKTWGEKGYILLEKDVA 310
Query: 331 --EGLCGIATEASYPV 344
G CG+A YP
Sbjct: 311 SPHGTCGVAVSPFYPT 326
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 202/340 (59%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF + G++ +R++ S NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSSFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P +DWR+KGAVT +K+QG CGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLM+ AF+YI N G+ TE YPY+ G C +KE A G Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTG-YVEIKAGS 234
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDG 305
E L +AV T P+SV ++AS +F+ Y GV + EC ++ DHGV VVG+G + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA++ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 190/340 (55%), Gaps = 48/340 (14%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+ ++W+ +G Y+D+ E +R I++ N+EYI K +Y L N+F+DLTNEEF
Sbjct: 4 RFDRWLKXNGXNYEDKEEWEIRFVIYQANVEYI-GCKKSQKNSYNLTDNKFADLTNEEFV 62
Query: 101 ASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCG------ 153
++Y G+ R +P + FKY ++P S DWR++GAVT IK+QG+CG
Sbjct: 63 STYLGFATRLIPH--------TRFKYHEHGNLPXSKDWRKEGAVTDIKDQGNCGKHSTWF 114
Query: 154 -----------------------SCWAFSAVAAVEGITQITGGKLIELSEQQLV--DCST 188
S WAFS VAAVE I +I GKL+ LSEQ+LV D +
Sbjct: 115 SPEISHNLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYDVAN 174
Query: 189 DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC GGLMD F +I +N GL T DYPY+ G+C+K+K A I YE P D
Sbjct: 175 KNQGCEGGLMDTTFAFIKKNGGLTTSKDYPYEGVDGSCNKEKALHHAVNISGYERAPSKD 234
Query: 249 EHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKY 308
E L A QP+SV ++A G AF+ Y +GV + CG +HGV +VG+ + KY
Sbjct: 235 EAMLKVAAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGTFD---KY 291
Query: 309 WLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
+KNS G WGESGYIR+ RD G CGIA +ASYP+
Sbjct: 292 RTVKNSXGADWGESGYIRMKRDAFDKAGTCGIAMKASYPL 331
>gi|324512246|gb|ADY45078.1| Cathepsin L [Ascaris suum]
Length = 388
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 146/320 (45%), Positives = 194/320 (60%), Gaps = 19/320 (5%)
Query: 38 IVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEF 91
I + +EQW QHG+ Y+DE + + F NLE I K N + G ++++GTN
Sbjct: 76 IKQGYEQWRLFKEQHGKNYEDEETENDHMLAFLSNLEEIRKHNARYQRGESSFEMGTNHI 135
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+DL EE+R GY P + F +VP DWR+ G VT +KNQG
Sbjct: 136 TDLPFEEYR-KLNGYK---PRYDDSHRNGTKFLVPFNINVPGHWDWRDHGYVTEVKNQGM 191
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENK 209
CGSCWAFSA A+EG + G L+ LSEQ LVDCS NNGC+GGLMD AFEYI +N
Sbjct: 192 CGSCWAFSATGALEGQHKRKIGSLVSLSEQNLVDCSRKYGNNGCNGGLMDYAFEYIKDNH 251
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEAS 268
G+ TEA YPY+ ++ C K+ A G Y DLP+GDE L AV Q P+SV ++A
Sbjct: 252 GVDTEASYPYKGKEMKCHFNKKTVGAEDEG-YVDLPEGDEEKLKIAVATQGPISVAIDAG 310
Query: 269 GQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
+F+ Y++GV +C ++ DHGV VVG+GT +E DG YW++KNSWG WGE GY+R
Sbjct: 311 HPSFQMYRKGVYYEPQCSSESLDHGVLVVGYGT-DEIDG-DYWIVKNSWGPGWGEKGYVR 368
Query: 327 ILRD-EGLCGIATEASYPVA 345
I R+ + CGIA++ASYP+
Sbjct: 369 IARNRDNHCGIASKASYPIV 388
>gi|296189340|ref|XP_002742739.1| PREDICTED: cathepsin L1 [Callithrix jacchus]
Length = 333
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/344 (42%), Positives = 200/344 (58%), Gaps = 25/344 (7%)
Query: 12 PMFVIIILVITCASQVVS-GRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNL 70
P ++ + AS ++ RS+ I +W A H R Y E+ R ++++N+
Sbjct: 3 PTLILTAFCLGLASSALTFDRSLEAQWI-----KWKAMHNRLYGMN-EEEWRRAVWEKNM 56
Query: 71 EYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ IE N E G ++ + N F D+TNEEFR G+ +R+ F+
Sbjct: 57 KMIELHNHEYNQGKHSFTMAMNAFGDMTNEEFRQVMNGFQ------NRKPRNGKVFQEPL 110
Query: 128 VTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 111 FHEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCS 170
Query: 188 --TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
N GC GGLMD AF+Y+ EN GL +E YPY+ + +C E + A G + D+P
Sbjct: 171 GPQGNQGCDGGLMDYAFQYVQENGGLDSEESYPYEATEESCKYNPEYSVANDTG-FVDIP 229
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEE 302
K E AL++AV T P+SV ++A ++F+FYK G+ EC ++ DHGV VVG+G
Sbjct: 230 K-LEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERT 288
Query: 303 -EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D +KYWL+KNSWGE WG GYI++ +D + CGIA+ ASYP
Sbjct: 289 GSDNSKYWLVKNSWGEKWGMDGYIKMAKDRKNHCGIASAASYPT 332
>gi|213512938|ref|NP_001133871.1| Cathepsin K precursor [Salmo salar]
gi|209155648|gb|ACI34056.1| Cathepsin K precursor [Salmo salar]
gi|223647252|gb|ACN10384.1| Cathepsin K precursor [Salmo salar]
gi|223673129|gb|ACN12746.1| Cathepsin K precursor [Salmo salar]
Length = 331
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 139/324 (42%), Positives = 193/324 (59%), Gaps = 14/324 (4%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTY 84
V ++E S+ + + W H R Y E+ +R TI+++N+ IE N+E G +Y
Sbjct: 14 VLAHPLNEMSLDAQWDSWKTTHLREYNGLGEEVIRRTIWEKNMRLIEAHNEEAALGIHSY 73
Query: 85 KLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
+LG N D+T+EE TG P+ + P NV +P SID+R+KG VT
Sbjct: 74 ELGMNHLGDMTSEEIAEKLTGLQVPMNRDRSNTWIPD----NNVVKIPRSIDYRKKGMVT 129
Query: 145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEY 204
+KNQ CGSCWAFS+ A+EG T GKLI+LS Q LVDC T+NNGC GG M AFEY
Sbjct: 130 PVKNQLSCGSCWAFSSAGALEGQLAKTTGKLIDLSPQNLVDCVTENNGCGGGYMTNAFEY 189
Query: 205 IIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSV 263
+ EN G+ TE YPY + G C A G ++++P+GDE AL +AV K PV+V
Sbjct: 190 VEENGGIDTEEAYPYLGQDGQCAYNASGMGAQCRG-FKEIPEGDEWALTKAVVKVGPVAV 248
Query: 264 CVEASGQAFRFYKRGV-LNAECG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
++A+ F+FY+RGV + C D+ +H V VG+G + G K+W++KNSW E+WG+
Sbjct: 249 GIDATLSTFQFYQRGVYYDPNCNKDDINHAVLAVGYG--QTAKGMKFWIVKNSWSESWGK 306
Query: 322 SGYIRILRDEG-LCGIATEASYPV 344
GYI + R+ G CGIA ASYP+
Sbjct: 307 QGYIMMARNRGNACGIANLASYPI 330
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 202/340 (59%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF + G++ +R++ S NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSSFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P +DWR+KGAVT +K+QG CGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLM+ AF+YI N G+ TE YPY+ G C +KE A G Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTG-YVEIKAGS 234
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDG 305
E L +AV T P+SV ++AS +F+ Y GV + EC ++ DHGV VVG+G + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA++ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 201/343 (58%), Gaps = 20/343 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + + + C + V + + +P++ W H ++Y + E+ R ++++NL
Sbjct: 1 MALYLGIAAICLTTVFAAPTT-DPALDNHWNLWKNWHKKSYAPK-EEGWRRVLWEKNLRM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G ++ LG N+F D+TNEEFR GY +++ R STF N
Sbjct: 59 IEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQLMNGYK------NQKKIRGSTFLAPNNF 112
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-- 187
+ P S+DWR+KG VT +K+QG CGSCWAFS A+EG GK+I LSEQ LVDCS
Sbjct: 113 ESPKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKMISLSEQNLVDCSRA 172
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQ-QEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD+AF+Y+ +N G+ +E YPY ++ C +A G + D+
Sbjct: 173 QGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTG-FVDVTS 231
Query: 247 GDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEE 302
G E L+ AV PVSV V+A Q+F+FYK G+ EC ++ DHGV VVG+G E+
Sbjct: 232 GSEKDLMNAVASVGPVSVAVDAGHQSFQFYKSGIYYEPECSSEDLDHGVLVVGYGFEGED 291
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
EDG KYW++KNSW E WG GYI I +D CGIAT ASYP+
Sbjct: 292 EDGKKYWIVKNSWSEKWGNDGYIYIAKDRHNHCGIATAASYPL 334
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/341 (42%), Positives = 202/341 (59%), Gaps = 22/341 (6%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ L + CA V+ + + + E + H ++Y+ +E+ +R IF +N I K
Sbjct: 1 MLRLSLLCAIVAVTVAANSHEILRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTF-KYQNVTD- 130
N + G +YKLG N+F DL EF + GY +++SR STF NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGYR------GQRTSRGSTFMPPANVNDS 114
Query: 131 -VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P+++DWR+KGAVT +K+QG CGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 115 SLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQS 174
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
NNGC GGLMD AF+YI N G+ E YPY+ C +KE A G + D+ G
Sbjct: 175 FGNNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMDDKCRFKKEDVGATDTG-FVDIEGG 233
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEED 304
E L +AV T P+SV ++A +F+ Y GV + EC + DHGV VG+G +D
Sbjct: 234 SEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECSSEELDHGVLAVGYGV---KD 290
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
G KYWL+KNSWG +WG++GYI + RD+ CGIA+ ASYP+
Sbjct: 291 GKKYWLVKNSWGGSWGDNGYILMSRDKNNQCGIASAASYPL 331
>gi|334332718|ref|XP_001367502.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 252 bits (644), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 133/309 (43%), Positives = 201/309 (65%), Gaps = 17/309 (5%)
Query: 44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
QW AQHG++Y+ E ++R +++NL+ IE+ N+E G +++L N+F D++ EEF+
Sbjct: 31 QWKAQHGKSYEAN-EDSLRRATWEKNLKMIERHNQEYSAGKHSFQLRMNKFGDMSTEEFK 89
Query: 101 ASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
GY + S++ ++ S ++ + +P S+DWREKG VT +K QG CG+CW+FSA
Sbjct: 90 QVMNGYKS---NGSQRRTKGSLYRESLLAQLPESVDWREKGYVTPVKEQGDCGACWSFSA 146
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYP 218
V A+EG GKL+ LS Q L+DC+ NNGC GG MD AF+Y+ +N G+ TE YP
Sbjct: 147 VGAIEGQWFRKTGKLVSLSIQNLIDCTIPEGNNGCDGGFMDNAFQYVQDNGGIDTEECYP 206
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKR 277
Y + C K K + + A I + D+P DE AL++AV T P+SV ++++ +F+FY+
Sbjct: 207 YVAQDTEC-KYKPECSGANITGFVDIPSMDERALMEAVATVGPISVGIDSANPSFKFYQS 265
Query: 278 GV-LNAECGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLC 334
GV +C + DHGV VVG+G+ +++ YW++KNSWGE WG++GYI + +D + C
Sbjct: 266 GVYYEPDCSSSQLDHGVLVVGYGSIGKDE---YWIVKNSWGEAWGDNGYILMAKDKDNHC 322
Query: 335 GIATEASYP 343
GIATEASYP
Sbjct: 323 GIATEASYP 331
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 206/343 (60%), Gaps = 17/343 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + +++ C + ++ S+ +P + EQW + HG++Y ++ E+ R +++++L
Sbjct: 1 MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEEHLRV 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +++LG N F D+ NEEFR GY Q S F N
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH---FLEPNFL 115
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+VP +DWR++G VT +K+QG CGSCWAFS A+EG G+L+ LSEQ LV+CS
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGT-CDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD+AF+Y+ +N G+ +E YPY T C + AA G + D+P
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTG-FVDIPS 234
Query: 247 GDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFGTAEEE 303
G E AL++A+ PVSV ++A +F+FY+ G+ AEC + DHGV VVG+G + +
Sbjct: 235 GKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRD 294
Query: 304 -DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG++GYI + +D + CGIAT ASYP+
Sbjct: 295 TDGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|21425246|emb|CAD33266.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 200/343 (58%), Gaps = 17/343 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
VI++ ++ A VS +++E I E+ + Q + Y+D E+ R ++ N I
Sbjct: 4 VIVLGLVAFAISTVSSINLNE-VIEEEWSLFKIQFKKLYEDIKEETFRKKVYLDNKLKIA 62
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQ--SSRPSTF-KYQNV 128
NK G TY L N F DL E+ G+ + R + TF K +NV
Sbjct: 63 GHNKLYESGEETYALEMNHFGDLMQHEYTKMMNGFKPSLAGGDRNFTNDEAVTFLKSENV 122
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWR+KG VT +KNQG CGSCW+FSA ++EG G L+ LSEQ L+DCS
Sbjct: 123 V-IPKSVDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181
Query: 189 D--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
NNGC GGLMD AF+YI NKGL TE YPY+ E C E + A G + D+P+
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENSGATDKG-FVDIPE 240
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTAEEE 303
GDE AL+ A+ T PVS+ ++AS + F+FYK+GV N C DHGV VGFG+ ++
Sbjct: 241 GDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFGS--DK 298
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
G YW++KNSWG+TWG+ GYI + R+ + CG+A+ ASYP+
Sbjct: 299 KGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
sativus]
Length = 235
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 119/220 (54%), Positives = 152/220 (69%), Gaps = 9/220 (4%)
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P ++DWR+KGAV IKNQG CGSCWAFS A VEGI +I G+LI LSEQ+LVDC
Sbjct: 4 LPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDCDKSY 63
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
N GC+GGLMD AF++I++N GL TE DYPY+ G C+ + + TI YED+P DE
Sbjct: 64 NQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSDGKCNSLLKNSKVVTIDGYEDVPTNDE 123
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
AL +AV+ QPVSV ++A G+ F+ Y+ G+ ECG DH V VG+G+ E+G YW
Sbjct: 124 TALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYGS---ENGVDYW 180
Query: 310 LIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
+++NSWG+ WGE GYIRI R+ G CGIA EASYPV
Sbjct: 181 IVRNSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPV 220
>gi|310975577|gb|ADP55137.1| cathepsin S [Miichthys miiuy]
Length = 338
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 143/336 (42%), Positives = 193/336 (57%), Gaps = 19/336 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
++L CA +M + + E W HG+TY++ +E R ++++NL I
Sbjct: 13 LLLFSLCAGAA----AMFDSKLDGHWELWKKMHGKTYRNYVEDESRRELWEKNLVLITMH 68
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N E G TYKL N DLT EE S+ P + R PS F + VP
Sbjct: 69 NLEASMGLHTYKLSMNHMGDLTPEEIMQSFATLTPPT-DIQRA---PSPFAGTSGAAVPD 124
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NN 191
++DWREKG VT +K QG CGSCWAFSA A+EG T GKL++LS Q LVDCST N+
Sbjct: 125 TMDWREKGCVTSVKMQGACGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNH 184
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC+GG M KAF+Y+I+N G+ ++A YPY Q K AA +Y LP+GDE A
Sbjct: 185 GCNGGFMHKAFQYVIDNHGIDSDAAYPYTGRQSQECHYSPKFRAANCSQYSFLPEGDEGA 244
Query: 252 LLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
L QA+ T P+SV ++A F FY GV + C + +HGV VG+GT +D YW
Sbjct: 245 LKQALATIGPISVAIDARRPRFAFYSSGVYDDPSCSQDVNHGVLAVGYGTLNGQD---YW 301
Query: 310 LIKNSWGETWGESGYIRILRDEG-LCGIATEASYPV 344
L+KNSWG+T+G++GYIR+ R++ CGIA YP+
Sbjct: 302 LVKNSWGQTFGDNGYIRMARNKNDQCGIARYGCYPI 337
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 206/343 (60%), Gaps = 17/343 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + +++ C + ++ S+ +P + EQW + HG++Y ++ E+ R +++++L
Sbjct: 1 MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +++LG N F D+ NEEFR GY Q S F N
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH---FLEPNFQ 115
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+VP +DWR++G VT +K+QG CGSCWAFS A+EG G+L+ LSEQ LV+CS
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGT-CDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD+AF+Y+ +N G+ +E YPY T C + AA G + D+P
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTG-FVDIPS 234
Query: 247 GDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFGTAEEE 303
G E AL++A+ PVSV ++A +F+FY+ G+ AEC + DHGV VVG+G + +
Sbjct: 235 GKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRD 294
Query: 304 -DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG++GYI + +D + CGIAT ASYP+
Sbjct: 295 TDGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 140/343 (40%), Positives = 206/343 (60%), Gaps = 17/343 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + +++ C + ++ S+ +P + EQW + HG++Y ++ E+ R +++++L
Sbjct: 1 MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +++LG N F D+ NEEFR GY Q S F N
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH---FLEPNFL 115
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+VP +DWR++G VT +K+QG CGSCWAFS A+EG G+L+ LSEQ LV+CS
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGT-CDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD+AF+Y+ +N G+ +E YPY T C + AA G + D+P
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTG-FVDIPS 234
Query: 247 GDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFGTAEEE 303
G E AL++A+ PVSV ++A +F+FY+ G+ AEC + DHGV VVG+G + +
Sbjct: 235 GKERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRD 294
Query: 304 -DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
DG KYW++KNSW E WG++GYI + +D + CGIAT ASYP+
Sbjct: 295 TDGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
Length = 344
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 140/352 (39%), Positives = 196/352 (55%), Gaps = 29/352 (8%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M V+ L + S + + E WM H ++Y E E R IFK N++Y
Sbjct: 1 MKVLSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSE-EFGARYNIFKANMDY 59
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPS-VSRQSSRPSTFKYQNVTDV 131
+++ N +G+ T LG N F+D+TNEE+R +Y G S + Q + T T
Sbjct: 60 VQQWNSKGSETV-LGLNNFADITNEEYRNTYLGTKFDASSLIGTQEEKVFT------TSS 112
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN 191
S DWR +GAVT +KNQG CG CW+FS + EG + G+L+ LSEQ L+DCST+N+
Sbjct: 113 AASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTENS 172
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLM AFEYII N G+ TE+ YPY+ E G C+ + E + AT+ Y+ + G E +
Sbjct: 173 GCDGGLMTYAFEYIINNNGIDTESSYPYKAENGKCEYKSEN-SGATLSSYKTVTAGSESS 231
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEEEDGA--- 306
L AV PVSV ++AS Q+F+ Y G+ EC +N DHGV VG+G+
Sbjct: 232 LESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSS 291
Query: 307 -------------KYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
+YW++KNSWG +WG GYI + R+ + CGIA+ AS+PV
Sbjct: 292 GQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFPV 343
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 252 bits (643), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 209/344 (60%), Gaps = 20/344 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+++ I+ + AS G +P++ + W + H + Y E E+ R I+++NL+
Sbjct: 2 IYLCILALSFGASFAAPGL---DPALNDHWLSWKSWHSKKYH-EKEEGWRRMIWEKNLKM 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N + G +Y+LG N F D+TNEEFR G+ + S S++ + S F N
Sbjct: 58 IELHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGFKQ---SRSQRKYKGSQFLEPNFL 114
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-- 187
P S+DWREKG VT +K+QG CGSCWAFSA A+EG GKL+ LSEQ L+DCS
Sbjct: 115 QAPKSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQNLIDCSGP 174
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD+AF+YI +N G+ +E YPY ++ C + E +A G + D+P+
Sbjct: 175 EGNQGCNGGLMDQAFQYIKDNNGIDSEESYPYIGKDDEDCLYKPEYNSANDTG-FVDIPE 233
Query: 247 GDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECG-DNCDHGVAVVGFGT--AE 301
G E AL++AV P+SV ++AS +F+FY+ GV +C + DHGV VVG+G +
Sbjct: 234 GRERALMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEELDHGVLVVGYGYEGTD 293
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
+++ +YW++KNSW E WG+ GYI + +D CGIA+ ASYP+
Sbjct: 294 DDNKKRYWIVKNSWSEKWGDQGYIHMAKDRSNNCGIASAASYPM 337
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 142/320 (44%), Positives = 192/320 (60%), Gaps = 21/320 (6%)
Query: 36 PSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFS 92
PS+ ++ + A+HGR Y E+ RL++F+QN ++I+ N + G T+ L N+F
Sbjct: 18 PSLRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFG 77
Query: 93 DLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKNQG 150
D+T+EEF A+ G+ N P S RP+ + + +P +DWR KGAVT +K+Q
Sbjct: 78 DMTSEEFTATMNGFLNVP-------SRRPTAILRADPDETLPKEVDWRTKGAVTPVKDQK 130
Query: 151 HCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIEN 208
CGSCWAFS ++EG + GKL+ LSEQ LVDCS N GC GGLMD+AF YI N
Sbjct: 131 QCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKAN 190
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEA 267
KG+ TE YPY+ + G C A G Y D+ G E AL +AV T P+SV ++A
Sbjct: 191 KGIDTEDSYPYEAQDGKCRFDASNVGATDTG-YVDVEHGSESALKKAVATIGPISVAIDA 249
Query: 268 SGQAFRFYKRGVLNAE-CGDN-CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYI 325
S +F+FY GV E C DHGV VG+G E E G YWL+KNSW +WG GYI
Sbjct: 250 SQPSFQFYHDGVYYEEGCSSTMLDHGVLAVGYG--ETEKGEAYWLVKNSWNTSWGNKGYI 307
Query: 326 RILRD-EGLCGIATEASYPV 344
++ RD + CGIA++ASYP+
Sbjct: 308 QMSRDKKNNCGIASQASYPL 327
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 251 bits (642), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 202/340 (59%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF + G+ +R++ + NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHRG-----TRKTGGSTFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P ++DWR+KGAVT +K+QG CGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLM+ AF+YI N G+ TE YPY+ G C +KE A G Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTG-YVEIKAGS 234
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDG 305
E L +AV T P+SV ++AS +F+ Y GV + EC ++ DHGV VVG+G + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA++ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|60827856|gb|AAX36816.1| cathepsin L [synthetic construct]
Length = 334
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 202/344 (58%), Gaps = 20/344 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M +IL C + S + S+ + +W A H R Y E+ R ++++N++
Sbjct: 1 MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N +EG ++ + N F D+T+EEFR G+ +R+ + F+
Sbjct: 59 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG G+LI LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C K K + A + D+PK
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIPK- 230
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
E AL++AV T P+SV ++A ++F FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVAM 346
D KYWL+KNSWGE WG GY+++ +D CGIA+ ASYP +
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTVL 334
>gi|403300975|ref|XP_003941187.1| PREDICTED: cathepsin L1-like isoform 1 [Saimiri boliviensis
boliviensis]
gi|403300977|ref|XP_003941188.1| PREDICTED: cathepsin L1-like isoform 2 [Saimiri boliviensis
boliviensis]
gi|403300979|ref|XP_003941189.1| PREDICTED: cathepsin L1-like isoform 3 [Saimiri boliviensis
boliviensis]
Length = 333
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 143/343 (41%), Positives = 200/343 (58%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
P ++ + AS ++ E + +W A H R Y E+ R ++++N++
Sbjct: 3 PTLILAAFCLGLASAALTFNHSLEAQWI----KWKAMHNRLYGKN-EEEWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N E G ++ + N F D+TNEEFR G+ +R+ F+ +
Sbjct: 58 TIELHNHEYNQGKHSFTMAMNTFGDMTNEEFRQVMNGFQ------NRKPRNGKVFQEPLL 111
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 HEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPK 246
N GC+GGLMD AF+Y+ EN GL +E YPY+ + +C K K + A + D+PK
Sbjct: 172 PQGNQGCNGGLMDYAFQYVQENGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIPK 230
Query: 247 GDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEE- 302
E AL++AV T P+SV ++A ++F+FYK G+ EC ++ DHGV VVG+G
Sbjct: 231 -LEKALMKAVATVGPISVAIDAGHESFQFYKEGIYFEPECSSEDMDHGVLVVGYGFERTG 289
Query: 303 EDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D +KYWL+KNSWGE WG GYI++ +D + CGIA+ ASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMDGYIKMAKDRKNHCGIASAASYPT 332
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 251 bits (642), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 140/329 (42%), Positives = 193/329 (58%), Gaps = 18/329 (5%)
Query: 30 GRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT--YKLG 87
G S+ E +VE ++W +HG+ YK E + F+ NL Y+ + N E + + +G
Sbjct: 39 GESIAEERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVG 98
Query: 88 TNEFSDLTNEEFRASYTGYNRPVPSVS-----RQSSRPSTFKYQNVTDVPTSIDWREKGA 142
N+F+D++NEEFR Y + S R+ + + K D PTS+DWR+ G
Sbjct: 99 LNKFADMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGI 158
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAF 202
VT +K+QG CGSCWAFS+ A+EGI + G LI LSEQ+LVDC + N+GC GG MD AF
Sbjct: 159 VTGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCDSTNDGCEGGYMDYAF 218
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
E+++ N G+ TE DYPY E GTC+ KE+ A +I YED+ + +E AL AV KQP+S
Sbjct: 219 EWVMSNGGIDTETDYPYTGEDGTCNTTKEETKAVSIDGYEDVAE-EESALFCAVLKQPIS 277
Query: 263 VCVEASGQAFRFYKRGVL---NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
V ++ F+ Y G+ ++ D+ DH V VVG+G E G +YW+IKNSWG W
Sbjct: 278 VGIDGGAIDFQLYTGGIYDGDCSDDPDDIDHAVLVVGYGA---ESGEEYWIIKNSWGTDW 334
Query: 320 GESGYIRILR----DEGLCGIATEASYPV 344
G GY I R D G+C I ASYP
Sbjct: 335 GMKGYAYIKRNTSKDYGVCAINAMASYPT 363
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 203/341 (59%), Gaps = 21/341 (6%)
Query: 20 VITCASQVVSGRSMHEPSI---VEKHEQ-WMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
V CA + PS+ ++ H Q W H + Y + E+ R I+++NL+ I+
Sbjct: 3 VYLCALALFLEACFAAPSLDSALDDHWQAWKTWHSKKYHQQ-EEGWRRMIWEKNLKMIQL 61
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
N + G +Y+LG N F D+TNEEFR GY S + + R S F N VP
Sbjct: 62 HNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGYKH---SKTEKKYRGSEFLEPNFLVVP 118
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--N 190
S+DWREKG VT +K+QG CGSCWAFS ++EG GKL+ LSEQ LVDCS N
Sbjct: 119 KSVDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSRPEGN 178
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
GC+GGLMD+AFEYI +N G+ +E YPY ++ C + E AA G + D+P+G E
Sbjct: 179 QGCNGGLMDQAFEYIADNGGIDSEESYPYIAKDDEDCLYKSEFNAANDTG-FVDVPEGHE 237
Query: 250 HALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG--TAEEED 304
AL++AV PVSV ++AS F+FY+ G+ + +C + DHGV VVG+G ++++
Sbjct: 238 RALMKAVAAVGPVSVAIDASHSTFQFYESGIYYDPDCSSEELDHGVLVVGYGFEGTDDDN 297
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
KYW++KNSW + WG+ GYI + +D CGIAT ASYP+
Sbjct: 298 KKKYWIVKNSWSDKWGDKGYILMAKDRNNHCGIATAASYPL 338
>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
Length = 396
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 138/333 (41%), Positives = 195/333 (58%), Gaps = 22/333 (6%)
Query: 31 RSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLG 87
R + E I + + W+ ++ + + E+ RL IF +N ++ + N + G ++ +
Sbjct: 61 RVLRESKIEDAFDAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVE 120
Query: 88 TNEFSDLTNEEFRASYTGYNRPV---PSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
N+F+ T EE+R G+ + + + S ++Y+ V + P SIDW ++G +T
Sbjct: 121 MNKFAAHTREEYR-KMLGFKKSLRRKKDSGEAAKDVSLWEYEGV-EAPESIDWVDEGVIT 178
Query: 145 HIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAF 202
KNQG CGSCWAFSA+ AVEGI I GKL+ LSEQ+LV C+ + N GC+GGLMD AF
Sbjct: 179 TPKNQGSCGSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAF 238
Query: 203 EYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVS 262
E+I+EN G+ +E Y Y+ C +K A+I + D+P DE AL +AV++QPVS
Sbjct: 239 EWIVENGGVDSEKQYQYKASFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVS 298
Query: 263 VCVEASGQAFRFYKRGVLNAE-CGDNCDHGVAVVGFGTAEEEDGA-------KYWLIKNS 314
V +EA ++F+ Y GV +AE CG DHGV VVG+G KYW IKNS
Sbjct: 299 VAIEADQRSFQLYGGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNS 358
Query: 315 WGETWGESGYIRILRD----EGLCGIATEASYP 343
W E WGE GYIRI RD G+CG+A ASYP
Sbjct: 359 WSEQWGEGGYIRIARDVESPSGMCGVAEMASYP 391
>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
Length = 333
Score = 251 bits (641), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 136/318 (42%), Positives = 201/318 (63%), Gaps = 18/318 (5%)
Query: 37 SIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFS 92
S+++ H E W ++ + Y+++ E+ +R I+++NL ++ N E G +Y+LG N
Sbjct: 23 SMLDGHWELWKKKYNKEYQNKEEEGVRRVIWEKNLRFVMLHNLEQSLGLHSYELGMNHLG 82
Query: 93 DLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHC 152
D+T+EE A TG PV QS + + + P ++DWREKG VT++KNQG C
Sbjct: 83 DMTSEEVTALMTGLKIPVS----QSRNSTLYWARQGASAPDTVDWREKGCVTNVKNQGSC 138
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKG 210
GSCWAFSAV A+E ++ G L+ LS Q LVDCS+ N+GC+GG + AF+Y+I N G
Sbjct: 139 GSCWAFSAVGALECQLKLKTGNLVSLSPQNLVDCSSAFGNHGCNGGYISAAFQYVIYNNG 198
Query: 211 LATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASG 269
+ +EA YPY + GTC + + AAT +Y DLP G+E AL AV PVSV ++AS
Sbjct: 199 IDSEASYPYTGQSGTC-RYNLQGRAATCSRYVDLPSGNEAALKDAVANFGPVSVAIDASR 257
Query: 270 QAFRFYKRGVL-NAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
+F +++GV + C + +HGV VVG+GT EDG YWL+KNSWG ++G+ GYI+I
Sbjct: 258 PSFFLFRKGVYDDPSCTSAHINHGVLVVGYGT---EDGIDYWLVKNSWGVSFGDQGYIKI 314
Query: 328 LRD-EGLCGIATEASYPV 344
R+ + CGIA++ +YP+
Sbjct: 315 ARNHDNRCGIASQCTYPL 332
>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
Length = 328
Score = 251 bits (641), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 143/339 (42%), Positives = 196/339 (57%), Gaps = 27/339 (7%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M V++ LV V G +P++ + + W HG+ Y+ + E+ R +++NL
Sbjct: 7 MAVLVTLV------AVMGHP--DPTLDQHWQLWKKAHGKEYRHQAEEGQRRATWEKNLRL 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y+LG N D+T+E+ A TG VP Q+S Y+
Sbjct: 59 VMLHNLEHSLGLHSYQLGMNHMGDMTSEDVAALLTGLR--VPYGHNQTS-----TYRRRG 111
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
P ++DWREKG VT +KNQG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCS
Sbjct: 112 GAPDAMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMM 171
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC GG M +AF+YII+N G+ +E YPY + GTC + AAT KY +LP
Sbjct: 172 YGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGTC-QYNVSTRAATCSKYVELPYA 230
Query: 248 DEHALLQAVTK-QPVSVCVEASGQAFRFYKRGVL-NAECGDNCDHGVAVVGFGTAEEEDG 305
DE AL AV PVSV ++A+ F Y+ GV + C +HGV VVG+GT E+D
Sbjct: 231 DEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQEVNHGVLVVGYGTLNEKD- 289
Query: 306 AKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
+WL+KNSWGE +G+ GYIR+ R+ CGIA+ ASYP
Sbjct: 290 --FWLVKNSWGERFGDGGYIRMSRNHANHCGIASYASYP 326
>gi|405966499|gb|EKC31777.1| Cathepsin L [Crassostrea gigas]
Length = 331
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 196/340 (57%), Gaps = 23/340 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+ V+ L +T + + + + ++ K H +TY + E+ MR I++ N+ Y
Sbjct: 4 LIVVASLCVTAFASPILNKDLDGDWVLYKQ-----THKKTYSQD-EEQMRRLIWEDNVNY 57
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I+K N G TY LG NE++D+T EFRA GY + ++ N+
Sbjct: 58 IQKHNLAADRGEHTYWLGQNEYADMTIFEFRAIMNGYKMS----ANRTKGDLYMSPSNIG 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
D+P S+DWR++G VT IKNQGHCGSCW+FSA ++EG KL+ LSEQ LVDCS
Sbjct: 114 DLPDSVDWRKEGYVTDIKNQGHCGSCWSFSATGSLEGQHFKASKKLVSLSEQNLVDCSKK 173
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N+GC GGLMD AF YI NKG+ TE YPY + G C + E A G Y D+P
Sbjct: 174 EGNHGCQGGLMDNAFRYIESNKGIDTEESYPYTAKNGFCHFKAENVGATDTG-YVDIPHM 232
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN--AECGDNCDHGVAVVGFGTAEEED 304
E L +AV T P+SV ++A ++F+ Y+ GV + A DHGV VG+GT E
Sbjct: 233 QEDKLQEAVATVGPISVGIDAGHKSFQLYREGVYSEPACSSSKLDHGVLAVGYGT---ES 289
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYP 343
G YWL+KNSWG +WG GY+ + R++ +CGIAT+ASYP
Sbjct: 290 GDDYWLVKNSWGTSWGMQGYVMMARNKHNMCGIATQASYP 329
>gi|33242886|gb|AAQ01147.1| cathepsin [Haplochromis chilotes]
Length = 334
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 142/324 (43%), Positives = 190/324 (58%), Gaps = 22/324 (6%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGT 88
+M E ++ E W HG++YK+++E A R ++ NL+ I N E G TY+LG
Sbjct: 21 AMFESTLDAHWELWKKTHGKSYKNDVENAHRRELWGNNLKMITVHNLEASMGLHTYELGM 80
Query: 89 NEFSDLTNEE---FRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
N DLT EE F AS T P + R PS F + + +P ++DWREKG VT
Sbjct: 81 NHMGDLTEEEIMQFFASLT----PPTDIQRA---PSPFAGASGSGIPDTMDWREKGCVTK 133
Query: 146 IKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFE 203
+K QG CGSCWAFSA A+EG + GKL++LS Q LVDCS N+GC+GG M +AF+
Sbjct: 134 VKMQGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQ 193
Query: 204 YIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVS 262
Y+I+N G+ ++A YPY C AA Y+ LP+GDE+AL Q + T P+S
Sbjct: 194 YVIDNHGIDSDASYPYIGRDDQC-HYNPATRAANCSSYQFLPEGDENALKQGLATVGPIS 252
Query: 263 VCVEASGQAFRFYKRGVLN-AECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGE 321
V ++A F FY+ GV N C +HGV VG+GT +D YWL+KNSWG T+G+
Sbjct: 253 VAIDARRPRFSFYRSGVYNDPSCTQKVNHGVLAVGYGTLNGQD---YWLVKNSWGTTFGD 309
Query: 322 SGYIRILRDEG-LCGIATEASYPV 344
GYIR+ R+ G CGIA YPV
Sbjct: 310 QGYIRMARNTGNQCGIALYPCYPV 333
>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 326
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 140/313 (44%), Positives = 184/313 (58%), Gaps = 24/313 (7%)
Query: 44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
+W A HG+ Y E+++R IF++N I + N+E G TY LG N F DL + EF
Sbjct: 25 KWKATHGKVYNSADEESLRFKIFQENSLMITQHNEEYRQGFHTYILGMNHFGDLLHSEFL 84
Query: 101 ASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
G+ V S F + VP+ +W KGAVT +K+QG CGSCWAFSA
Sbjct: 85 ERSNGFQGGV-------SGGDVFTFDTNAPVPSYANWTAKGAVTPVKDQGKCGSCWAFSA 137
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYP 218
+VEG + KL+ LSEQQLVDCS D N GC GGLMD AF+Y I NKG+A E YP
Sbjct: 138 TGSVEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYFIANKGIANEKSYP 197
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKR 277
Y + C K K+ + ATI ++D+ DE L AV PVSV ++AS F+FY+
Sbjct: 198 YTAKDNDC-KYKKSMSVATISSFKDVKHKDEDQLKMAVANVGPVSVAIDASSSKFQFYES 256
Query: 278 GVLNAECGDNC-----DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-E 331
GV E NC DHGV VG+GT +++ G +WL+KNSW +WG +GYI++ R+ +
Sbjct: 257 GVYYDE---NCSSEVLDHGVLAVGYGT-DKKSGMDFWLVKNSWAASWGLNGYIKMARNKD 312
Query: 332 GLCGIATEASYPV 344
CGIAT ASYP+
Sbjct: 313 NNCGIATMASYPI 325
>gi|356545079|ref|XP_003540973.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 330
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 132/298 (44%), Positives = 181/298 (60%), Gaps = 14/298 (4%)
Query: 24 ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRT 83
ASQV R++ + S+ E+HE+WM+++G+ YKD E+ R IFK+N+ YIE +N +
Sbjct: 5 ASQVTC-RTLQDASMYERHEEWMSRYGKVYKDPREREKRFRIFKENMNYIETSNNVAIKP 63
Query: 84 YKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV 143
KL N+F+DL NEEF A + + + R SR TF + P +KGAV
Sbjct: 64 XKLVINQFADLNNEEFIAPRNIFKGMI--LCRFLSRKHTFPF------PYVFLGHKKGAV 115
Query: 144 THIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKA 201
T +K+QGHCG CWAF VA+ EGI +T GKLI LSEQ+LVDC T + GC GLMD A
Sbjct: 116 TPVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCECGLMDDA 175
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
F++II+N G+ +A+YPY+ G C+ +E AATI ED+P +E AL + V QPV
Sbjct: 176 FKFIIQNHGV-XDANYPYKGVDGKCNANEEANPAATITGXEDVPANNEKALQKVVANQPV 234
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETW 319
V ++A F+FYK GV C +HGV +G+G + DG +YWL+KNS W
Sbjct: 235 FVAIDACDSDFQFYKSGVFTGSCETELNHGVTTMGYGVS--HDGTQYWLVKNSXETEW 290
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 251 bits (640), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 136/353 (38%), Positives = 196/353 (55%), Gaps = 29/353 (8%)
Query: 5 FEKSFIIPMFVIIILVITCASQVV-----SGRSMHEPSIVEKHE------QWMAQH---- 49
F+ I + + ++ AS ++ R + PS VE H+ +W +H
Sbjct: 59 FKTRAWIALVAAAVSLLVFASFLIQWQGDDDRGVFPPSPVEDHKTPVNIWEWKEEHFQNA 118
Query: 50 --------GRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
G++Y E E R IFK NL YI N++G +Y L N F DL+ EEFR
Sbjct: 119 FGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQG-YSYSLKMNHFGDLSREEFRR 177
Query: 102 SYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAV 161
Y GYN+ S + + +DVP+++DWREKG VT +K+Q CGSCWAFSA
Sbjct: 178 KYLGYNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSAT 237
Query: 162 AAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEADYPY 219
A+EG G+L+ LSEQ+LVDCS N GCSGG M+ AF+Y++++ GL +E YPY
Sbjct: 238 GALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPY 297
Query: 220 QQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGV 279
G C + +K TI ++D+P+ E A+ A+ PVS+ +EA F+FY GV
Sbjct: 298 LARDGECKRACKK--VVTISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGV 355
Query: 280 LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDEG 332
+A CG + DHGV +VG+GT ++E +W++KNSWG WG GY+ + +G
Sbjct: 356 FDASCGTDLDHGVLLVGYGT-DKETKKDFWIMKNSWGSGWGRDGYMYMAMHKG 407
>gi|149755237|ref|XP_001495795.1| PREDICTED: cathepsin L1-like [Equus caballus]
Length = 339
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 141/335 (42%), Positives = 193/335 (57%), Gaps = 17/335 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
+ L C + S +PS+ + QW A H R Y E A R ++++N+ IE
Sbjct: 5 LFLAALCLG-IASAAPKLDPSLDAQWYQWKATHRRLYGVNKE-AWRRAVWEKNMRMIELH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N+E G + + N F D+TNEEFR G + R P + ++P
Sbjct: 63 NQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGLHNQTHKKGRVFREPLS------AELPK 116
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNN 191
S+DWR+KG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS N
Sbjct: 117 SVDWRKKGYVTPVKNQGLCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSWAQGNE 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GCSGGLMD AF+Y+ +N GL +E YPY E G C + E +AA G + D+ + ++
Sbjct: 177 GCSGGLMDYAFQYVKDNGGLDSEKSYPYLAEDGFCKYKPEYSAANDTG-FLDIQQQEKFL 235
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGV-LNAECGDN-CDHGVAVVGFGTAEEEDGAKYW 309
+ T P+S ++AS ++F+FYK G+ + +C DHGV VVG+G ++ KYW
Sbjct: 236 MEAVATVGPISAGIDASLESFQFYKEGIYYDPDCSSKYLDHGVLVVGYGFEGKDSRNKYW 295
Query: 310 LIKNSWGETWGESGYIRILRD-EGLCGIATEASYP 343
L+KNSWGE WG +GYI++ +D E CGIAT ASYP
Sbjct: 296 LVKNSWGEDWGMNGYIKMAKDRENHCGIATMASYP 330
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 140/305 (45%), Positives = 189/305 (61%), Gaps = 14/305 (4%)
Query: 50 GRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGY 106
G++Y+ E E + F +N+ +IE+ NKE G +T+++G NE +DL ++R GY
Sbjct: 56 GKSYEPEEENDY-MEAFVKNVIHIEEHNKEHRLGRKTFEMGLNEIADLPFSQYR-KLNGY 113
Query: 107 NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEG 166
S + F +P S+DWRE+G VT +KNQG CGSCWAFS+ A+EG
Sbjct: 114 RMRRQFGDSMQSNGTKFLVPFNVQIPESVDWREEGLVTPVKNQGMCGSCWAFSSTGALEG 173
Query: 167 ITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQG 224
GKL+ LSEQ LVDCST N+GC+GGLMD AFEYI EN G+ TE YPY +
Sbjct: 174 QHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENHGVDTEDSYPYVGRET 233
Query: 225 TCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGV-LNA 282
C ++ A G + DLP+GDE AL +AV Q P+S+ ++A ++F+ YK+GV +
Sbjct: 234 KCHFKRNTVGADDKG-FVDLPEGDEEALKKAVATQGPISIAIDAGHRSFQLYKKGVYFDE 292
Query: 283 EC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEA 340
EC + DHGV +VG+GT E YWL+KNSWG TWGE GYIRI R+ CG+AT+A
Sbjct: 293 ECSSEELDHGVLLVGYGTDPE--AGDYWLVKNSWGPTWGEKGYIRIARNRNNHCGVATKA 350
Query: 341 SYPVA 345
SYP+
Sbjct: 351 SYPLV 355
>gi|354502595|ref|XP_003513369.1| PREDICTED: cathepsin L1-like [Cricetulus griseus]
Length = 330
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 145/345 (42%), Positives = 201/345 (58%), Gaps = 24/345 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQN 69
+IP+F + L + VV H+PS+ ++ ++W +HG+TY + E+ + +++ N
Sbjct: 1 MIPIFFLATLCLG----VVPAAPTHDPSLDDEWQEWKTRHGKTYSMD-EEGQKRAVWENN 55
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQ 126
+ IE N++ G + L N F DLTN EFR TG+ S + + F+
Sbjct: 56 RKMIELHNEDYTKGKHGFHLEMNAFGDLTNIEFRQLMTGFQ------SMGTKEMNVFQEP 109
Query: 127 NVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDC 186
+ DVP S+DWR VT +K+QG C SCWAFSAV ++EG G+LI LSEQ LVDC
Sbjct: 110 LLGDVPKSVDWRNLSYVTPVKDQGQCSSCWAFSAVGSLEGQIFRKTGQLISLSEQNLVDC 169
Query: 187 STD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDL 244
S N GC GGLM+ AF Y+ EN+GL T YPY+ G C + K +AA + + +
Sbjct: 170 SWSYGNIGCFGGLMEYAFRYVKENRGLDTRVSYPYEARNGPC-RYDPKNSAANVTDFVKI 228
Query: 245 PKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAE 301
P E AL++AV T P+SV V++ +FRFYK G+ C N DH V VVG+G E
Sbjct: 229 PI-SEDALMKAVATVGPISVGVDSHHHSFRFYKGGMYYEPHCSSSNLDHAVLVVGYG--E 285
Query: 302 EEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPVA 345
E DG KYW++KNSWG+ WG +GYI++ RD CGIAT A YP
Sbjct: 286 ESDGNKYWMVKNSWGQGWGMNGYIKMARDRNNNCGIATYAIYPTV 330
>gi|2098464|pdb|1PCI|A Chain A, Procaricain
gi|2098465|pdb|1PCI|B Chain B, Procaricain
gi|2098466|pdb|1PCI|C Chain C, Procaricain
Length = 322
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 132/310 (42%), Positives = 185/310 (59%), Gaps = 12/310 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+++ WM H + Y++ EK R IFK NL YI++ NK+ N +Y LG NEF+DL+N+
Sbjct: 18 LIQLFNSWMLNHNKFYENVDEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFADLSND 76
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWA 157
EF Y G + + + S F +++ ++P ++DWR+KGAVT +++QG CGSCWA
Sbjct: 77 EFNEKYVG---SLIDATIEQSYDEEFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWA 133
Query: 158 FSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADY 217
FSAVA VEGI +I GKL+ELSEQ+LVDC ++GC GG A EY+ +N G+ + Y
Sbjct: 134 FSAVATVEGINKIRTGKLVELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKY 192
Query: 218 PYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKR 277
PY+ +QGTC ++ + +E LL A+ KQPVSV VE+ G+ F+ YK
Sbjct: 193 PYKAKQGTCRAKQVGGPIVKTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKG 252
Query: 278 GVLNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGL 333
G+ CG D V VG+G + + LIKNSWG WGE GYIRI R G+
Sbjct: 253 GIFEGPCGTKVDGAVTAVGYGKSGGKGYI---LIKNSWGTAWGEKGYIRIKRAPGNSPGV 309
Query: 334 CGIATEASYP 343
CG+ + YP
Sbjct: 310 CGLYKSSYYP 319
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 143/319 (44%), Positives = 192/319 (60%), Gaps = 31/319 (9%)
Query: 44 QWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNE 97
QW A H ++Y+ ++E+ +R IF +N I K N + G +YKLG N+F DL
Sbjct: 6 QWEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPH 65
Query: 98 EFRASYTGYNRPVPSVSRQSSRPSTF-KYQNVTD--VPTSIDWREKGAVTHIKNQGHCGS 154
EF + GY+ + R STF NV D +P ++DWR+KGAVT +K+QG CGS
Sbjct: 66 EFAKMFNGYH------GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGS 119
Query: 155 CWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLA 212
CWAFSA ++EG + GKL+ LSEQ L+DCS N GC GGLMD AF+YI N G+
Sbjct: 120 CWAFSATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGID 179
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEASGQA 271
TE YPY+ G C +KE A G + D+ +G E L +AV T P+SV ++AS +
Sbjct: 180 TEESYPYEAMDGDCRFKKEDVGATDTG-FVDIQQGSEDDLQKAVATVGPISVAIDASHSS 238
Query: 272 FRFYKRGVLNAECGDNC-----DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
F+ Y GV + NC DHGV VG+G ++G KYWL+KNSW ETWG++GYI
Sbjct: 239 FQLYSEGVYDEP---NCSSEELDHGVLAVGYGV---KNGKKYWLVKNSWAETWGDNGYIL 292
Query: 327 ILRD-EGLCGIATEASYPV 344
+ RD + CGIA+ ASYP+
Sbjct: 293 MSRDKDNQCGIASSASYPL 311
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 192/321 (59%), Gaps = 17/321 (5%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E ++ +EQW+ ++G+ Y EK R IFK NL+ IE+ N + NR+Y+ G N+FSD
Sbjct: 33 NEGGVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSD 92
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT-HIKNQGHC 152
LT +EF+ASY G S+S + R ++Y+ +P +DWRE+GAV +K QG C
Sbjct: 93 LTADEFQASYLGGKMEKKSLSDVAER---YQYKEGDVLPDEVDWRERGAVVPRVKRQGEC 149
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKAFEYIIENKG 210
GSCWAF+A AVEGI QIT G+L+ LSEQ+L+DC DN GC+GG AFE+I EN G
Sbjct: 150 GSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGG 209
Query: 211 LATEADYPYQQEQGTCDKQKEKAA--AATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
+ ++ Y Y E K E TI +E +P DE +L +AV QP+SV + A+
Sbjct: 210 IVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA 269
Query: 269 GQAFRFYKRGVLNAECGDNC-DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
+ YK GV C + DH V +VG+GT+ +E YWLI+NSWG WGE GY+R+
Sbjct: 270 NMS--DYKSGVYKGACSNLWGDHNVLIVGYGTSSDE--GDYWLIRNSWGPEWGEGGYLRL 325
Query: 328 LRD----EGLCGIATEASYPV 344
R+ G C +A YP+
Sbjct: 326 QRNFHEPTGKCAVAVAPVYPI 346
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 140/340 (41%), Positives = 203/340 (59%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF ++ I +
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIAR 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF + G++ +R++ + NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSTFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P ++DWR+KGAVT +K+QG CGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLM+ AF+YI N G+ TE YPY+ G C +KE A G Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTG-YVEIKAGS 234
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDG 305
E L +AV T P+SV ++AS +F+ Y GV + EC ++ DHGV VVG+G + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA++ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 251 bits (640), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 137/319 (42%), Positives = 188/319 (58%), Gaps = 14/319 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E + E W +H R YK E A R IFK+NL+Y+ + N +G+R + LG N+F+D+
Sbjct: 39 EERVRELFHLWKERHKRVYKHAEETAKRFEIFKENLKYVIERNSKGHR-HTLGMNKFADM 97
Query: 95 TNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKNQGHC 152
+NEEF+ Y + + R S + + + P+S+DWR+KG VT IK+QG C
Sbjct: 98 SNEEFKEKYLSKIKKPINKKNNYLRRSMQQKKGTASCEAPSSLDWRKKGVVTGIKDQGDC 157
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLA 212
GSCWAFS+ A+EGI I G LI LSEQ+LVDC T N GC GG MD AFE++I N G+
Sbjct: 158 GSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVISNGGID 217
Query: 213 TEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAF 272
+E+DYPY GTC+ KE +I Y+D+ + D ALL A QP+SV ++ S F
Sbjct: 218 SESDYPYTGTDGTCNTTKEDTKVVSIDGYKDVDESDS-ALLCAAVNQPISVGMDGSALDF 276
Query: 273 RFYKRGVL---NAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR 329
+ Y G+ ++ D+ DH V +VG+G+ + ED YW+ KNSWG +WG GY I R
Sbjct: 277 QLYTSGIYAGDCSDDPDDIDHAVLIVGYGSEDSED---YWICKNSWGTSWGMEGYFYIKR 333
Query: 330 DEGL----CGIATEASYPV 344
+ L C I ASYP
Sbjct: 334 NTDLPYGECAINAMASYPT 352
>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
Length = 328
Score = 250 bits (639), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 121/220 (55%), Positives = 154/220 (70%), Gaps = 9/220 (4%)
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P S+DWR++GAV +K+Q CGSCWAFSA+AAVEGI +I G LI LSEQ+LVDC T
Sbjct: 24 LPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSY 83
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
N GC+GGLMD AFE+II N G+ +E DYPY+ G CD+ ++ A TI YED+P DE
Sbjct: 84 NEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDE 143
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
AL +AV QP++V VE G+ F+ Y+ GVL CG DHGVA VG+GT E+G YW
Sbjct: 144 LALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGTALDHGVAAVGYGT---ENGKDYW 200
Query: 310 LIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
+++NSWG +WGE GYIR+ R+ G CGIA E SYP+
Sbjct: 201 IVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 240
>gi|324983200|gb|ADY68475.1| stem bromelain [Ananas comosus]
Length = 291
Score = 250 bits (639), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 120/287 (41%), Positives = 179/287 (62%), Gaps = 6/287 (2%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
+F+ + L + AS + R ++++ E+WMA++GR YKD EK R IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRASYTG-YNRPVPSVSRQSSRPSTFKYQNVTDV 131
IE N +Y LG N+F+D+TN EF A YTG +RP+ + + +F N++ V
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPL---NIEKEPVVSFDDVNISAV 124
Query: 132 PTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTDNN 191
SIDWR+ GAVT +K+Q CGSCWAFSA+A VEGI +I G L+ LSEQ+++DC+ +N
Sbjct: 125 GQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAV-SN 183
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GG +D A+++II N G+A+EADYPYQ QG C +A G Y + DE +
Sbjct: 184 GCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPNSAYITG-YSYVRSNDESS 242
Query: 252 LLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFG 298
+ AV QP++ ++ASG F++Y GV + CG + +H + ++G+G
Sbjct: 243 MKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYG 289
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 250 bits (639), Expect = 5e-64, Method: Compositional matrix adjust.
Identities = 144/375 (38%), Positives = 206/375 (54%), Gaps = 38/375 (10%)
Query: 5 FEKSF---------IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKD 55
FE SF I+ ++ I+L+I + + E + E W+ + + Y D
Sbjct: 135 FESSFRCFSIIFLKIMNRYINILLLIFGLIAISNALLFSEEQYKNEFENWIDRFEKKY-D 193
Query: 56 ELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSR 115
E R +IFK N++++ N + ++T LG N +DLTN E+R Y G ++ +V
Sbjct: 194 VSEFKKRFSIFKSNMDFVHSWNSKNSQTV-LGLNHLADLTNLEYRQFYLGTHKK--AVLG 250
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKL 175
Q+V ++DWR+KGAV+ IK+QG CGSCW+FS +VEG QI G +
Sbjct: 251 TPGNHEVSNLQSVFGDSATVDWRQKGAVSPIKDQGQCGSCWSFSTTGSVEGAHQIKSGNM 310
Query: 176 IELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKA 233
+ELSEQ LVDCST N GC+GGLMD AFEYII N G+ TE+ YPY GT K +
Sbjct: 311 VELSEQNLVDCSTSEGNMGCNGGLMDYAFEYIITNNGIDTESSYPYTASSGTTCKYNKAN 370
Query: 234 AAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGV-LNAECGD-NCDH 290
+ ATI Y+++ G E L AV PVSV ++AS +F+ Y G+ +A C N DH
Sbjct: 371 SGATISSYKNITAGSESDLADAVKNAGPVSVAIDASHNSFQLYSHGIYYDASCSSVNLDH 430
Query: 291 GVAVVGFGTA-------------------EEEDGAKYWLIKNSWGETWGESGYIRILRD- 330
GV VVG+G+ + +D YW++KNSWG +WG+ G+I + +D
Sbjct: 431 GVLVVGYGSGTPDSDSRVHKGSQVRVKVPKTDDTKNYWIVKNSWGTSWGDKGFIYMSKDR 490
Query: 331 EGLCGIATEASYPVA 345
+ CGIA+ ASYP+
Sbjct: 491 DNNCGIASCASYPIV 505
>gi|253796148|gb|ACT35690.1| cathepsin L-like cysteine proteinase [Ditylenchus destructor]
Length = 376
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 141/319 (44%), Positives = 197/319 (61%), Gaps = 18/319 (5%)
Query: 38 IVEKHEQWMAQ---HGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
I + ++ W A +G+++ DE + R+ F + ++I+K N++ G ++KL N
Sbjct: 63 IQQGYQDWEAYKGLNGKSFYDEDTENERMLAFLSSQQHIKKHNEQYEQGKVSFKLDANSI 122
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
+DL E++ GY R R++S S F + +VP S+DWR+ G VT +KNQG
Sbjct: 123 ADLPFSEYQ-KLNGYRRIYGDPLRRNS--SRFLAPHNVEVPESMDWRDHGYVTEVKNQGM 179
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENK 209
CGSCWAFSA ++EG + + G L+ LSEQ LVDCS NNGC+GGLMD AF+YI EN
Sbjct: 180 CGSCWAFSATGSLEGQHKRSKGTLVSLSEQNLVDCSAAYGNNGCNGGLMDFAFQYIKENH 239
Query: 210 GLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEAS 268
G+ TE YPY+ Q C Q+ A G + DLP+GDE L AV Q P+SV ++A
Sbjct: 240 GIDTETSYPYKARQKKCHFQRSSVGADDTG-FMDLPEGDEDQLKIAVATQGPISVAIDAG 298
Query: 269 GQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
++F+ YK GV EC + DHGV VVG+GT + D YW++KNSWG TWGE GY+R
Sbjct: 299 HRSFQLYKTGVYYEKECSSEQLDHGVLVVGYGT--DPDHGDYWIVKNSWGTTWGEQGYVR 356
Query: 327 ILRDE-GLCGIATEASYPV 344
+ R++ CGIAT+ASYP+
Sbjct: 357 MARNKNNHCGIATKASYPL 375
>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
Length = 245
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 121/220 (55%), Positives = 153/220 (69%), Gaps = 9/220 (4%)
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P S+DWRE GAV +K+Q CGSCWAFS VAAVEGI QI G+LI LSEQ+LVDC T+
Sbjct: 6 LPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEY 65
Query: 190 NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDE 249
+ GC+GGLMD AF++II+N GL TE DYPY G C+ + + +I YED+P DE
Sbjct: 66 DMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDE 125
Query: 250 HALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGTAEEEDGAKYW 309
AL +AV QPVSV VEA G+A + Y G+ ECG DHG+ VG+GT E+G YW
Sbjct: 126 KALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGT---ENGTDYW 182
Query: 310 LIKNSWGETWGESGYIRILRD-----EGLCGIATEASYPV 344
+++NSWG +WGE+GYIR+ R+ G CGIA EASYP+
Sbjct: 183 IVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPI 222
>gi|149751225|ref|XP_001490531.1| PREDICTED: cathepsin S-like [Equus caballus]
Length = 332
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 191/318 (60%), Gaps = 18/318 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W +G+ YK++ E+ R I+++NL+++ N E G +Y LG N
Sbjct: 22 DPTLDNHWDLWKKTYGKQYKEKNEEVARRLIWERNLKFVMLHNLEHSMGMHSYDLGMNHL 81
Query: 92 SDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGH 151
D+T+EE + + VPS Q R T+K +P S+DWREKG VT +K QG
Sbjct: 82 GDMTSEEVTSLMSSLR--VPS---QWQRNVTYKSNPNEKLPDSLDWREKGCVTEVKYQGS 136
Query: 152 CGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD---NNGCSGGLMDKAFEYIIEN 208
CG+CWAFSAV A+E ++ G L+ LS Q LVDCST+ N GC+GG M AF+YII+N
Sbjct: 137 CGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYIIDN 196
Query: 209 KGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAV-TKQPVSVCVEA 267
G+ ++A YPY+ G C + K AAT KY +LP G E L +AV K PVSV ++A
Sbjct: 197 NGIDSDASYPYKAMDGKC-RYDSKNRAATCSKYTELPFGSEDDLKEAVANKGPVSVAIDA 255
Query: 268 SGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIR 326
S +F YK GV + C N +HGV VVG+G +G YWL+KNSWG +G+ GYIR
Sbjct: 256 SHPSFFLYKSGVYYDPSCTQNVNHGVLVVGYGNL---NGKDYWLVKNSWGINFGDKGYIR 312
Query: 327 ILRDEG-LCGIATEASYP 343
+ R+ G CGIA SYP
Sbjct: 313 MARNSGNHCGIANYCSYP 330
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 200/340 (58%), Gaps = 25/340 (7%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLTIFKQNLEYI 73
+++++ C + + E QW A +H + Y ++ E A RL IF+ NL+ I
Sbjct: 3 LLVLLACVAMATAASLSFES-------QWEAFKIKHDKVYSEKEEYARRL-IFQDNLKTI 54
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N+E G +Y LG N+F+D+T+ E+ G ++++ SR +T++Y
Sbjct: 55 ESHNQEADTGKHSYWLGVNQFADMTHAEYLNQVIGGCLITSNLTKTGSR-ATYRYMPNMQ 113
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
V ++DWR+KG VT IK+QG CGSCWAFS ++EG G L+ LSEQ LVDCS
Sbjct: 114 VNDTVDWRDKGLVTDIKDQGQCGSCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQE 173
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
N GC GG MD+ F+YII+NKG+ TE YPY+ + C K AT+ + D+ GD
Sbjct: 174 GNKGCEGGDMDQGFQYIIQNKGIDTEQCYPYKAKNHRC-KFDNSCIGATMSSFTDVTSGD 232
Query: 249 EHALLQAVTK-QPVSVCVEASGQAFRFYKRGVLNA-ECGDN-CDHGVAVVGFGTAEEEDG 305
E AL QA P+SV ++AS Q+F+FY GV N EC DHGV VVG+GT +D
Sbjct: 233 EDALKQACANIGPISVGIDASHQSFQFYSSGVYNEFECSSTKLDHGVLVVGYGTYGSKD- 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YWL+KNSWG WG GYI + R+ + CG+AT+AS+PV
Sbjct: 292 --YWLVKNSWGTVWGNEGYIMMSRNKDNQCGVATDASFPV 329
>gi|395740610|ref|XP_002819972.2| PREDICTED: cathepsin L1 [Pongo abelii]
Length = 333
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 200/342 (58%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + L C + S + S+ + +W A H R Y E+ R ++++N++
Sbjct: 1 MNPTLFLAAFCLG-IASATLTFDHSLEARWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N +EG ++ + N F D+T+EEFR G+ +R+ + F+
Sbjct: 59 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG GKLI LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSGP 172
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C K K + A + D+PK
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIPK- 230
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
E AL++AV T P+SV ++A ++F FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D KYWL+KNSWGE WG GY+++ +D CGIA+ ASYP
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT 332
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 192/321 (59%), Gaps = 17/321 (5%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E ++ +EQW+ ++G+ Y EK R IFK NL+ IE+ N + NR+Y+ G N+FSD
Sbjct: 33 NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSD 92
Query: 94 LTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT-HIKNQGHC 152
LT +EF+ASY G S+S + R ++Y+ +P +DWRE+GAV +K QG C
Sbjct: 93 LTADEFQASYLGGKMEKKSLSDVAER---YQYKEGDVLPDEVDWRERGAVVPRVKRQGEC 149
Query: 153 GSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNNGCSGGLMDKAFEYIIENKG 210
GSCWAF+A AVEGI QIT G+L+ LSEQ+L+DC DN GC+GG AFE+I EN G
Sbjct: 150 GSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGG 209
Query: 211 LATEADYPYQQEQGTCDKQKEKAA--AATIGKYEDLPKGDEHALLQAVTKQPVSVCVEAS 268
+ ++ Y Y E K E TI +E +P DE +L +AV QP+SV + A+
Sbjct: 210 IVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA 269
Query: 269 GQAFRFYKRGVLNAECGDNC-DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRI 327
+ YK GV C + DH V +VG+GT+ +E YWLI+NSWG WGE GY+R+
Sbjct: 270 NMS--DYKSGVYKGACSNLWGDHNVLIVGYGTSSDE--GDYWLIRNSWGPEWGEGGYLRL 325
Query: 328 LRD----EGLCGIATEASYPV 344
R+ G C +A YP+
Sbjct: 326 QRNFHEPTGKCAVAVAPVYPI 346
>gi|4503155|ref|NP_001903.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|22202619|ref|NP_666023.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|384081592|ref|NP_001244900.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|384081594|ref|NP_001244901.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|332832229|ref|XP_003312197.1| PREDICTED: cathepsin L1 isoform 2 [Pan troglodytes]
gi|332832233|ref|XP_001137800.2| PREDICTED: cathepsin L1 isoform 1 [Pan troglodytes]
gi|397470218|ref|XP_003806728.1| PREDICTED: cathepsin L1 isoform 1 [Pan paniscus]
gi|397470220|ref|XP_003806729.1| PREDICTED: cathepsin L1 isoform 2 [Pan paniscus]
gi|397470222|ref|XP_003806730.1| PREDICTED: cathepsin L1 isoform 3 [Pan paniscus]
gi|410042824|ref|XP_003951515.1| PREDICTED: cathepsin L1 [Pan troglodytes]
gi|115741|sp|P07711.2|CATL1_HUMAN RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
chain; Contains: RecName: Full=Cathepsin L1 light chain;
Flags: Precursor
gi|29715|emb|CAA30981.1| pro-(cathepsin L) [Homo sapiens]
gi|190418|gb|AAA66974.1| preprocathepsin L precursor [Homo sapiens]
gi|31873292|emb|CAD97637.1| hypothetical protein [Homo sapiens]
gi|48146223|emb|CAG33334.1| CTSL [Homo sapiens]
gi|119583135|gb|EAW62731.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583136|gb|EAW62732.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583137|gb|EAW62733.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583138|gb|EAW62734.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583140|gb|EAW62736.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|208965934|dbj|BAG72981.1| cathepsin L1 [synthetic construct]
gi|410303006|gb|JAA30103.1| cathepsin L1 [Pan troglodytes]
gi|410303008|gb|JAA30104.1| cathepsin L1 [Pan troglodytes]
gi|410303010|gb|JAA30105.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 201/342 (58%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M +IL C + S + S+ + +W A H R Y E+ R ++++N++
Sbjct: 1 MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N +EG ++ + N F D+T+EEFR G+ +R+ + F+
Sbjct: 59 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG G+LI LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C K K + A + D+PK
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIPK- 230
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
E AL++AV T P+SV ++A ++F FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D KYWL+KNSWGE WG GY+++ +D CGIA+ ASYP
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT 332
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 250 bits (639), Expect = 6e-64, Method: Compositional matrix adjust.
Identities = 143/333 (42%), Positives = 201/333 (60%), Gaps = 20/333 (6%)
Query: 20 VITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE 79
++ C+S + + +P++ + W +G+ Y+++ E+ R I+++NL+ + N E
Sbjct: 8 LLLCSSAMA--QVHRDPTLDHHWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNLE 65
Query: 80 ---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
G +Y+LG N D+T+EE +S + VPS Q R T+K +P S+D
Sbjct: 66 HSMGMHSYELGMNHLGDMTSEEVISSMSSLR--VPS---QWPRNVTYKSSPNQKLPDSLD 120
Query: 137 WREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST---DNNGC 193
WREKG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCST N GC
Sbjct: 121 WREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGC 180
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALL 253
+GG M +AF+YII+N G+ +EA YPY+ G C + K AAT +Y +LP G E AL
Sbjct: 181 NGGFMTEAFQYIIDNNGIDSEASYPYKAMDGRC-QYDVKNRAATCSRYIELPFGSEEALK 239
Query: 254 QAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEEDGAKYWLI 311
+AV K PVSV ++A +F YK GV + C N +HGV VVG+G+ +G YWL+
Sbjct: 240 EAVANKGPVSVGIDAKQTSFFLYKTGVYYDPSCTQNVNHGVLVVGYGSL---NGKDYWLV 296
Query: 312 KNSWGETWGESGYIRILRDEG-LCGIATEASYP 343
KNSWG +G+ GYIR+ R+ G CGIA SYP
Sbjct: 297 KNSWGLNFGDQGYIRMARNSGNHCGIANFPSYP 329
>gi|345320664|ref|XP_001521690.2| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 388
Score = 250 bits (639), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 145/339 (42%), Positives = 204/339 (60%), Gaps = 18/339 (5%)
Query: 20 VITCASQVVSGRSMHEP---SIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ C + G ++ P S ++KH E W H ++Y + E+ R ++++NL+ IE
Sbjct: 53 LLVCLLSLCWGLAVSAPLGDSELDKHWELWKNWHQKSYH-KAEEGWRRMVWEENLKVIEL 111
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
N E G TY+LG N+F DLTNEEF+ R +R + S F N VP
Sbjct: 112 HNLEQSLGLHTYQLGMNQFGDLTNEEFQQMLIS-ERHFSEGNRING--SAFLEVNYVQVP 168
Query: 133 TSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--N 190
TS+DWR+ G VT +KNQGHCGSCWAFS A+EG G+L+ LSEQ LVDCS N
Sbjct: 169 TSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLVSLSEQNLVDCSWQQGN 228
Query: 191 NGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC+GG++D AF+YI+EN+G+ +E YPY + K + A A + + D+P E
Sbjct: 229 QGCNGGIVDFAFQYILENRGIDSEDCYPYTAKDTAQCAFKPECATARVTGFVDIPPHSEE 288
Query: 251 ALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAEC-GDNCDHGVAVVGFG-TAEEEDGA 306
AL++AV T PVSV ++A +FRFY+ G+ +C + +H V VVG+G E+E G
Sbjct: 289 ALMKAVATVGPVSVAIDAHPTSFRFYQSGIFYEPKCSSERLNHAVLVVGYGYEGEDEAGK 348
Query: 307 KYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYPV 344
KYW++KNSWG+ WG+ GY + +D G CGIAT ASYP+
Sbjct: 349 KYWIVKNSWGKQWGDHGYFYLSKDRGNHCGIATTASYPL 387
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 250 bits (638), Expect = 7e-64, Method: Compositional matrix adjust.
Identities = 138/314 (43%), Positives = 196/314 (62%), Gaps = 22/314 (7%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTN 96
E+W+A Q G++YK+ E+ R+ ++K+N I++ NK G +YKL N F DL
Sbjct: 24 EEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFGDLMQ 83
Query: 97 EEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCW 156
EF+A N+ S +Q+S F+ +P +DWR+KGAVT +K+ G CGSCW
Sbjct: 84 HEFKA----LNKLKRSAKQQNS-GEVFRATG-GKLPAKVDWRQKGAVTPVKDPGQCGSCW 137
Query: 157 AFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATE 214
AFS+ ++ G + KL+ LSEQQLVDCS + N+GC GG+M +AF+YI N G+ TE
Sbjct: 138 AFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYIKGNGGIDTE 197
Query: 215 ADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFR 273
YPY+ E C + K K+ A T Y D+ +GDE+AL +AV + P+SV ++A +F+
Sbjct: 198 GSYPYEAEDDKC-RYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLSFQ 256
Query: 274 FYKRGVLNAECGDN--CDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE 331
FY G+ + N DHGV VVG+GT E+G YWL+KNSWG +WGE+GYI+I R+
Sbjct: 257 FYSEGIYDEPFCSNTELDHGVLVVGYGT---ENGQDYWLVKNSWGPSWGENGYIKIARNH 313
Query: 332 -GLCGIATEASYPV 344
CGIA+ ASYP+
Sbjct: 314 NNHCGIASMASYPI 327
>gi|21953244|emb|CAD42716.1| putative cathepsin L [Myzus persicae]
Length = 341
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 145/346 (41%), Positives = 201/346 (58%), Gaps = 23/346 (6%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
VI++ ++ A VS +++E I E+ + Q + Y+D E+ R ++ N I
Sbjct: 4 VIVLGLVAFAISSVSSINLNE-VIEEEWSLFKMQFKKLYEDIKEETFRKKVYLDNKLKIA 62
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST------FKY 125
+ NK G TY L N F DL E+ G+ PS++ S + K
Sbjct: 63 RHNKLYESGEETYALEMNHFGDLMQHEYSKMMNGFK---PSLAGGDSNFTNDEGVTFLKS 119
Query: 126 QNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
+NV +P SIDWR+KG VT +KNQG CGSCW+FSA ++EG G L+ LSEQ L+D
Sbjct: 120 ENVV-IPKSIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLID 178
Query: 186 CSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYED 243
CS NNGC GGLMD AF+YI NKGL TE YPY+ E C + + A G + D
Sbjct: 179 CSRKYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPDNSGATDNG-FVD 237
Query: 244 LPKGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVL-NAECGDN-CDHGVAVVGFGTA 300
+P+GDE AL+ A+ T PVS+ ++AS + F+FYK+GV N C DHGV VGF T
Sbjct: 238 IPEGDEEALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGFRT- 296
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
++ G YW++KNSWG+TWG+ GYI + R+ + CG+A+ ASYP+
Sbjct: 297 -DKKGGDYWIVKNSWGKTWGDEGYIMMARNKKNNCGVASSASYPLV 341
>gi|7381610|gb|AAF61565.1|AF227957_1 cathepsin L-like proteinase precursor [Rhipicephalus microplus]
Length = 332
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 202/340 (59%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H ++Y+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKSYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF + G++ +R++ + NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSTFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD- 189
+P +DWR+KGAVT +K+QG CGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGD 248
NNGC GGLM+ AF+YI N G+ TE YPY+ G C +KE A G Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKEDVGATDTG-YVEIKAGS 234
Query: 249 EHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AEC-GDNCDHGVAVVGFGTAEEEDG 305
E L +AV T P+SV ++AS +F+ Y GV + EC ++ DHGV VVG+G + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA++ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|82796372|gb|ABB91778.1| cathepsin L [Hymeniacidon perlevis]
Length = 323
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 144/315 (45%), Positives = 186/315 (59%), Gaps = 16/315 (5%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLT 95
S + E W +H + Y D+LE+ R I++ N + IE N ++ + LG N+F DL
Sbjct: 17 SFSQDWEDWKNEHNKKYSDDLEELTRYKIWQGNQKIIEVHNANSDKFGFTLGMNKFGDLE 76
Query: 96 NEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSC 155
+ EF + GY +R +S N PT +DWR KGAVT +KNQG CGSC
Sbjct: 77 SHEFAEMFNGYMMQ----ARSNSTKVFVADPNYKADPT-VDWRTKGAVTGVKNQGQCGSC 131
Query: 156 WAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLAT 213
WAFS ++EG + GKL+ LSEQ LVDCS N GC+GGLMD+AFEYI +N G+ T
Sbjct: 132 WAFSTTGSLEGQHFLKTGKLVSLSEQNLVDCSGKEGNEGCNGGLMDQAFEYIKKNGGIDT 191
Query: 214 EADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAF 272
EA YPYQ C + K AT Y D+ + DE+AL+QAV K PVSV ++AS +F
Sbjct: 192 EASYPYQAHDERC-RFKASDVGATCTGYVDIKREDENALMQAVEKIGPVSVAIDASHSSF 250
Query: 273 RFYKRGV-LNAECGDNC-DHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD 330
+ Y+ GV EC DHGV +G+GT E G+ YWL+KNSWG WG GYI + R+
Sbjct: 251 QLYRSGVYYERECSQTALDHGVLAIGYGT---EGGSDYWLVKNSWGTDWGMEGYIMMSRN 307
Query: 331 E-GLCGIATEASYPV 344
CGIATEASYP
Sbjct: 308 RNNNCGIATEASYPT 322
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 135/312 (43%), Positives = 188/312 (60%), Gaps = 14/312 (4%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTNEE 98
E+ E W +HG+ Y + E+ R I++ N +Y+++ N + + +G N+F+DL + E
Sbjct: 20 EEWESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSE 79
Query: 99 FRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAF 158
F Y GYN PS+ + S+ + K V D+PTS+DWR KG VT IKNQG CGSCWAF
Sbjct: 80 FGRLYNGYNNK-PSMKKAQSKVFSTK---VGDLPTSVDWRTKGFVTAIKNQGQCGSCWAF 135
Query: 159 SAVAAVEGITQITGGKLIELSEQQLVDCST--DNNGCSGGLMDKAFEYIIENKGLATEAD 216
SAVA +EG G L+ LSEQ LVDCST N GC+GGLMD AF+Y+I+N G+ TEA
Sbjct: 136 SAVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTEAS 195
Query: 217 YPYQQEQGTCDKQKEKAAAATIGKYEDLP-KGDEHALLQAVTKQPVSVCVEASGQAFRFY 275
YPY+ C + G + LP K + + P+SV ++AS +F+ Y
Sbjct: 196 YPYKAVDQKCKFNAANVGSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQLY 255
Query: 276 KRGVLN-AECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-G 332
K GV + + C + DHGV VG+ + G YW++KNSWG TWG++GYI + R++
Sbjct: 256 KSGVYSESACSQTSLDHGVTAVGY---DSSSGVAYWIVKNSWGTTWGQAGYIWMSRNKNN 312
Query: 333 LCGIATEASYPV 344
CGIAT ASYP+
Sbjct: 313 QCGIATAASYPI 324
>gi|15214962|gb|AAH12612.1| Cathepsin L1 [Homo sapiens]
gi|61363426|gb|AAX42388.1| cathepsin L [synthetic construct]
gi|123988681|gb|ABM83856.1| cathepsin L [synthetic construct]
gi|123999196|gb|ABM87178.1| cathepsin L [synthetic construct]
Length = 333
Score = 250 bits (638), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 201/342 (58%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M +IL C + S + S+ + +W A H R Y E+ R ++++N++
Sbjct: 1 MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNVKM 58
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N +EG ++ + N F D+T+EEFR G+ +R+ + F+
Sbjct: 59 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG G+LI LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C K K + A + D+PK
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIPK- 230
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
E AL++AV T P+SV ++A ++F FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D KYWL+KNSWGE WG GY+++ +D CGIA+ ASYP
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT 332
>gi|148745204|gb|AAI42984.1| Cathepsin L1 [Homo sapiens]
Length = 333
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 201/342 (58%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M +IL C + S + S+ + +W A H R Y E+ R ++++N++
Sbjct: 1 MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N +EG ++ + N F D+T+EEFR G+ +R+ + F+
Sbjct: 59 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS-- 187
+ P S+DWREKG VT +KNQG CGSCWAFSA A+EG G+LI LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172
Query: 188 TDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD AF+Y+ +N GL +E YPY+ + +C K K + A + D+PK
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESC-KYNPKYSVANDTGFVDIPK- 230
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEE 303
E AL++AV T P+SV ++A ++F FYK G+ +C ++ DHGV VVG+G + E
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
D KYWL+KNSWGE WG GY+++ +D CGIA+ ASYP
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT 332
>gi|161408095|dbj|BAF94151.1| cathepsin L-like cysteine protease 1 [Plautia stali]
Length = 344
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 124/310 (40%), Positives = 196/310 (63%), Gaps = 14/310 (4%)
Query: 44 QWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
++ +Q+ + Y + + R ++KQN +++ + N+ G TYK+ N +D+ EF
Sbjct: 25 RFKSQYRKDYPSDSVERYRKKVYKQNEKFVREHNERYERGEVTYKMALNHLADMHPREFM 84
Query: 101 ASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSA 160
A++ G+NR + + ++ F++ + +DWR+KGA++ +K+QGHCGSCWAFS+
Sbjct: 85 ATFLGFNRSLRATNK-VPEGIPFRHNKDAVIQKEVDWRQKGAISPVKDQGHCGSCWAFSS 143
Query: 161 VAAVEGITQITGGKLIELSEQQLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYP 218
A+E T + G+ + LSEQ L+DCS + NNGC GGLM++AF+Y+ +N G+ TE YP
Sbjct: 144 TGALEAHTFLKKGRRVSLSEQNLIDCSLNYGNNGCEGGLMEQAFQYVRDNDGIDTEEAYP 203
Query: 219 YQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKR 277
Y+ E C +K A G + +P GDE AL++AV Q P+S+ ++AS +F+FY
Sbjct: 204 YEGEDSECRFKKNNVGATDAG-FVTIPSGDEQALMEAVATQGPLSIAIDASNPSFQFYSE 262
Query: 278 GV-LNAECGD-NCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLC 334
GV EC DHGV +VG+G +++ KYWL+KNSW E WGE+GYI++ R+ + C
Sbjct: 263 GVYYEPECSSAQLDHGVLLVGYGVEKDQ---KYWLVKNSWSEQWGENGYIKMARNKDNNC 319
Query: 335 GIATEASYPV 344
GIAT+AS+P+
Sbjct: 320 GIATQASFPI 329
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 123/266 (46%), Positives = 165/266 (62%), Gaps = 9/266 (3%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRT 83
+VS E + +WMA HGRTY E+ R +F+ NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 84 YKLGTNEFSDLTNEEFRASYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTN+E+RA+Y G +RP R+ + + D+P S+DWR KGA
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRP----QRERRLGDRYLAGDNEDLPESVDWRAKGA 146
Query: 143 VTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMDKA 201
V +K+QG CGSCWAFS +AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD A
Sbjct: 147 VAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYA 206
Query: 202 FEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQPV 261
FE+II N G+ TE DYPY+ G CD ++ A TI YED+P E +L +AV QP+
Sbjct: 207 FEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPI 266
Query: 262 SVCVEASGQAFRFYKRGVLNAECGDN 287
SV +EA G+AF+ Y G+ CG++
Sbjct: 267 SVAIEAGGRAFQLYNSGIFTGTCGNS 292
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 153/350 (43%), Positives = 197/350 (56%), Gaps = 45/350 (12%)
Query: 24 ASQVVSGRSMHEPSIV---------EKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIE 74
AS + S SMH ++ + +M + R Y D E R IF N I
Sbjct: 39 ASPLTSLDSMHMQDVIGVDWNFTLSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRIS 98
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEE------FRASYTGYNRPVPSVSRQSSRPSTFKY 125
K N +G +Y +G NEFSD T+EE FR S + SR S+ T
Sbjct: 99 KHNVRFIQGQVSYTMGINEFSDKTDEELKRLRCFRGSL--------NASRDGSKYITI-- 148
Query: 126 QNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVD 185
P+ IDWR KGAVT +KNQG+CGSCWAFSA A+EG + G L+ LSEQQLVD
Sbjct: 149 --AAPPPSEIDWRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVD 206
Query: 186 CSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPY-QQEQG----TCDKQKEKAAAATI 238
CS++ NN C+GGLMD AF+Y+ ++ G+ TEA YPY E G TC + K A +
Sbjct: 207 CSSEYGNNACNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANPTC-RFNLKEAVVRV 265
Query: 239 GKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFYKRGVLNAE--CGDNCDHGVAVV 295
Y DLP+G L QAV P+SV + A +F YK GV + + D+ DHGV +V
Sbjct: 266 TGYIDLPRGQVSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLV 325
Query: 296 GFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
G+G EE+G YWLIKNSWG WGE+GY++ILRD LCG+A+ ASYP+
Sbjct: 326 GYG---EENGIPYWLIKNSWGPHWGENGYVKILRDHNNLCGVASMASYPL 372
>gi|23306947|dbj|BAC16538.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 147/345 (42%), Positives = 201/345 (58%), Gaps = 23/345 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M+V + + C S V++ S + + + W H + Y E E+ R ++++NL
Sbjct: 1 MYVAAVFTL-CLSAVLAAPSF-DRELDDHWNHWKNFHTKKYH-EKEEGWRRVVWEKNLRK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +Y+LG N F D+T+EEFR GY + + + S F N
Sbjct: 58 IEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYKHK----AERRVKGSLFMEPNFI 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+ P ID+R+ G T +K+QG CGSCWAFS A+EG GGKL+ LSEQ LVDCS
Sbjct: 114 EAPKKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRP 173
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQ---KEKAAAATIGKYEDL 244
N GC+GGLMD+AF+YI +N GL TE YPY GT D+ K +AA + D+
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPY---LGTDDQDCHYDPKYSAANDTGFVDI 230
Query: 245 PKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGDN-CDHGVAVVGFG-TA 300
P+G E AL++AV PVSV ++A ++F+FY G+ EC DHGV VVG+G
Sbjct: 231 PEGKERALMKAVAAVGPVSVAIDAGHESFQFYHSGIYFEKECSSTELDHGVLVVGYGFEG 290
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
E+ DG KYW++KNSW E WG+ GYI + +D + CGIAT ASYP+
Sbjct: 291 EDVDGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPL 335
>gi|116242316|gb|ABJ89815.1| putative cathepsin L preprotein [Clonorchis sinensis]
Length = 371
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 186/312 (59%), Gaps = 19/312 (6%)
Query: 43 EQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEF 99
+ ++ ++ R Y +LE+ RL IF +N I + N ++G +Y +G N FSD TN E
Sbjct: 68 QAFLEKYKRVYDSKLEEERRLGIFTENFIRISEHNLLFEKGEVSYSMGINAFSDKTNSEL 127
Query: 100 RASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFS 159
G+ R SR S+ F P +DWR KGAVT +KNQG CGSCWAFS
Sbjct: 128 DV-LRGF-RHSSKASRSGSQYIPFD----AAPPAEVDWRTKGAVTPVKNQGDCGSCWAFS 181
Query: 160 AVAAVEGITQITGGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPY 219
A +EG + GKL+ LSEQQLVDCS+ N+GC GGLMD AFEY+ E+KG+ TE YPY
Sbjct: 182 ATGGIEGQHYLATGKLVSLSEQQLVDCSSSNDGCDGGLMDLAFEYVKEHKGIDTEVHYPY 241
Query: 220 QQEQGTCDKQ---KEKAAAATIGKYEDLPKGDEHALLQAVTKQ-PVSVCVEASGQAFRFY 275
+Q K AA + Y D+P+G E L QAV P+SV + A +F Y
Sbjct: 242 VSGNTGYARQCSFDPKYAAVNVTGYVDIPEGQELLLQQAVGFHGPISVGINAGLPSFMAY 301
Query: 276 KRGVL-NAECG-DNCDHGVAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILRDE-G 332
+ G+ + C + DHGV VVG+G ++G YWLIKNSWGE WGE+GY+RILR+
Sbjct: 302 ESGIYSDHRCNPHDLDHGVLVVGYGV---DNGVPYWLIKNSWGEDWGENGYVRILRNHNN 358
Query: 333 LCGIATEASYPV 344
LCG+AT ASYP+
Sbjct: 359 LCGVATMASYPL 370
>gi|34850847|dbj|BAC87861.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 147/345 (42%), Positives = 201/345 (58%), Gaps = 23/345 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M+V + + C S V++ S + + + W + H + Y E E+ R ++++NL
Sbjct: 1 MYVAAVFTL-CLSAVLAAPSF-DRELDDHWNHWKSFHTKKYH-EKEEGWRRVVWEKNLRK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +Y+LG N F D+T+EEFR GY + + + S F N
Sbjct: 58 IEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYKHK----AERRVKGSLFMEPNFI 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+ P ID+R+ G T +K+QG CGSCWAFS A+EG GGKL+ LSEQ LVDCS
Sbjct: 114 EAPKKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRP 173
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQ---KEKAAAATIGKYEDL 244
N GC+GGLMD+AF+YI +N GL TE YPY GT D+ K +AA + D+
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPY---LGTDDQDCHYDPKYSAANDTGFVDI 230
Query: 245 PKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGDN-CDHGVAVVGFG-TA 300
P+G E AL++AV PVSV ++A + F+FY G+ EC DHGV VVG+G
Sbjct: 231 PEGKERALMKAVAAVGPVSVAIDAGHECFQFYHSGIYFEKECSSTELDHGVLVVGYGFEG 290
Query: 301 EEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
E+ DG KYW++KNSW E WG+ GYI + +D + CGIAT ASYP+
Sbjct: 291 EDVDGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPL 335
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 249 bits (637), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 139/349 (39%), Positives = 206/349 (59%), Gaps = 52/349 (14%)
Query: 10 IIPMFVIIILVITCAS----QVVSG--RSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMR 62
+I + ++II ++ +S V SG RS E + + WM++HG+TY + L +K R
Sbjct: 9 MITLSLLIIFLLPPSSAMDLSVTSGGLRSNEEVGFI--FQTWMSKHGKTYTNALGDKEQR 66
Query: 63 LTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPST 122
FK NL +I++ N + N +Y+LG +F+DLT +E++ ++G RP+ +Q + T
Sbjct: 67 FQNFKDNLRFIDQHNAK-NLSYRLGLTQFADLTVQEYQDLFSG--RPI---QKQKALRVT 120
Query: 123 FKYQNVTD--VPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSE 180
+Y + + +P S+DWR+KGAV+ IK+QG C VE I +I G+LI LSE
Sbjct: 121 HRYVPLAEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSE 170
Query: 181 QQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCD-KQKEKAAAATIG 239
Q+LVDCS DN+GC+GGLMD AF+++I N GL ++DYPYQ QG C+ Q I
Sbjct: 171 QELVDCSIDNHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKID 230
Query: 240 KYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHGVAVVGFGT 299
YED+P +E++L +AV QP G+ CG + DH V +VG+GT
Sbjct: 231 GYEDVPANNENSLQKAVAHQP-----------------GIYTGPCGTDLDHAVVIVGYGT 273
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILRD----EGLCGIATEASYPV 344
E+G YW+++NSWG WGE+GY +I R+ G+CGIA ASYP+
Sbjct: 274 ---ENGQDYWIVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYPI 319
>gi|148709355|gb|EDL41301.1| cDNA sequence BC051665 [Mus musculus]
Length = 349
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 147/349 (42%), Positives = 199/349 (57%), Gaps = 23/349 (6%)
Query: 8 SFIIPMF---VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLT 64
+ IP F + L+ T VVS H+PS+ E+W +H +TY E+A +
Sbjct: 11 AMYIPGFGSMTPVFLLATLCLGVVSAAPAHDPSLDAVWEEWKTKHRKTYNMN-EEAQKRA 69
Query: 65 IFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPS 121
+++ N++ I N++ G + L N F DLTN EFR TG+ S +
Sbjct: 70 VWENNMKMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ------SMGHKEMT 123
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQ 181
F+ + DVP S+DWR+ G VT +K+QGHCGSCWAFSAV ++EG GKL+ LSEQ
Sbjct: 124 IFQEPLLGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQ 183
Query: 182 QLVDCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIG 239
L+DCS N GC+GGLM+ AF+Y+ EN+GL T Y Y+ G C + K +A I
Sbjct: 184 NLMDCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPC-RYDPKYSAVNIT 242
Query: 240 KYEDLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVG 296
+ +P E AL+ AV PVSV ++ +FRFY+ G +C N DH V VVG
Sbjct: 243 GFVKVPL-SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVG 301
Query: 297 FGTAEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
+G EE DG KYWL+KNSWGE WG GYI++ +D + CGIAT A YP
Sbjct: 302 YG--EESDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYPT 348
>gi|18308182|gb|AAL67857.1|AF462309_1 cysteine proteinase [Acanthamoeba healyi]
Length = 330
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 132/287 (45%), Positives = 178/287 (62%), Gaps = 14/287 (4%)
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFK 124
I++ N+ E+ N++ N++Y L N+F DLTN EF + G S+ + +
Sbjct: 52 IYRWNVWRDEEHNRQ-NKSYFLAMNQFGDLTNAEFNRLFKGL---AFDYSKHAKIHTAAP 107
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
T +P+ DWR+KGAVTH+KNQG CGSCW+FS + EG + G+L+ LSEQ L+
Sbjct: 108 EAPATGIPSEFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLVSLSEQNLI 167
Query: 185 DCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQG-TCDKQKEKAAAATIGKY 241
DCS NNGC+GGLMD AFEYII N+G+ TEA YPYQ TC + G Y
Sbjct: 168 DCSVSYGNNGCNGGLMDYAFEYIINNRGIDTEASYPYQTAGPLTCQYNAANKGGSLTG-Y 226
Query: 242 EDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVL--NAECGDNCDHGVAVVGFGT 299
D+ GDE+ALL A K+PVSV ++AS +F+FY GV +A DHGV VVG+G+
Sbjct: 227 TDVTSGDENALLNAAVKEPVSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLVVGWGS 286
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPVA 345
E+G +W +KNSWG +WG +GYI++ R++ CGIAT ASYP A
Sbjct: 287 ---ENGQDFWWVKNSWGASWGLNGYIKMSRNQNNNCGIATAASYPTA 330
>gi|291398027|ref|XP_002715626.1| PREDICTED: cathepsin S [Oryctolagus cuniculus]
Length = 331
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 140/341 (41%), Positives = 198/341 (58%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLTIFKQNLE 71
M ++ ++ C+S V +H ++ H W +G+ YK++ E+A R I+++NL+
Sbjct: 1 MKWLVWALLVCSSTVAQ---LHRDPTLDHHWHLWKKAYGKQYKEKNEEAARRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y +G N +D+T+EE + + P Q R T+K
Sbjct: 58 FVTLHNLEHSMGMHSYDVGMNHLADMTSEEVVSLMSSLRIP-----HQWPRNVTYKLNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST 188
+P S+DWRE+G VT +K QG CG+CWAFSAV A+E ++ G L+ LS Q LVDCST
Sbjct: 113 QKLPDSVDWRERGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCST 172
Query: 189 D---NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLP 245
N GC+GG M +AF+YII+N G+ +EA YPY+ C K AAT KY +LP
Sbjct: 173 TKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDQKC-HYDSKHRAATCSKYTELP 231
Query: 246 KGDEHALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGDNCDHGVAVVGFGTAEEE 303
G E AL +AV K PVSV ++AS +F Y+ GV C N +HGV VG+G + +
Sbjct: 232 FGSEEALKEAVANKGPVSVAIDASHSSFFLYRSGVYYEPSCTQNVNHGVLAVGYGNLKGK 291
Query: 304 DGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYP 343
D YWL+KNSWG +GE GYIR+ R+ + CGIA SYP
Sbjct: 292 D---YWLVKNSWGIHFGEQGYIRMARNSKNHCGIANYPSYP 329
>gi|139947602|ref|NP_001077155.1| cathepsin L1 precursor [Bos taurus]
gi|134025180|gb|AAI34742.1| CTSL1 protein [Bos taurus]
gi|296484500|tpg|DAA26615.1| TPA: cathepsin L1 [Bos taurus]
Length = 333
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 142/338 (42%), Positives = 196/338 (57%), Gaps = 20/338 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
++L C + S + S+ + + W A H + Y D E+ R ++K+N++ IE
Sbjct: 5 LLLTALCLG-IASAAPKFDHSLDTQWKLWKAAHRKPY-DLNEEGWRKAVWKKNMKMIELH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N+E G ++ + N F D+TNEEFR + G+ R +++ + F +P
Sbjct: 63 NQEYSQGKHSFSMAMNAFGDMTNEEFRHTMNGFQR------QKNKKGKEFHETIFASIPP 116
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD--NN 191
S+DWREKG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS N
Sbjct: 117 SVDWREKGYVTPVKNQGKCGSCWAFSATGALEGQMFQKTGKLVSLSEQNLVDCSQPEGNR 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GG +D AF+Y+++ GL +E YPY GTC +AA G + DLPK E A
Sbjct: 177 GCHGGFIDNAFQYVLDVGGLDSEESYPYTGLVGTCLYNPNNSAANETG-FVDLPK-QEKA 234
Query: 252 LLQAVTKQ-PVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFG-TAEEEDGAK 307
L++AV P+SV V+A +F+FYK G+ C ++ DH V VVG+G + D K
Sbjct: 235 LMKAVANLGPISVAVDAHNPSFQFYKSGIYYEPNCSSESVDHAVLVVGYGFEGADSDDNK 294
Query: 308 YWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
YWL+KNSWGE WG +GYI++ +D CGIAT ASYP
Sbjct: 295 YWLVKNSWGEHWGMNGYIKMAKDRNNHCGIATMASYPT 332
>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
Length = 333
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 140/338 (41%), Positives = 201/338 (59%), Gaps = 20/338 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
+ L C + S + S+ + QW A H R Y E+ R ++++N++ IE
Sbjct: 5 LFLTALCLG-IASAAPKFDQSLNAQWYQWKATHRRLYGMN-EEGWRRAVWEKNMKMIELH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N+E G + + N F D+TNEEFR G+ +++ + F+ ++P
Sbjct: 63 NREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGKMFQEPLFAEIPK 116
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCS--TDNN 191
S+DWREKG VT +KNQG CGSCWAFSA A+EG GKL+ LSEQ LVDCS N
Sbjct: 117 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNE 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPY-QQEQGTCDKQKEKAAAATIGKYEDLPKGDEH 250
GC+GGLMD AF Y+ +N GL +E YPY ++ TC+ + E +AA G + DLP+ E
Sbjct: 177 GCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTG-FVDLPQ-REK 234
Query: 251 ALLQAV-TKQPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFGTAEEEDGAK 307
AL++AV T P+SV ++A Q+F+FYK G+ + +C + DHGV VVG+G + K
Sbjct: 235 ALMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNK 294
Query: 308 YWLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
+W++KNSWG WG +GY+++ +D+ CGIAT ASYP
Sbjct: 295 FWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 332
>gi|384247445|gb|EIE20932.1| hypothetical protein COCSUDRAFT_18161 [Coccomyxa subellipsoidea
C-169]
Length = 387
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 147/332 (44%), Positives = 193/332 (58%), Gaps = 46/332 (13%)
Query: 51 RTYKDELEKAMRLTIFKQNLEYIEKANKEGNRTYK------------------------- 85
+ Y +E E A+RL IFK N++YI N ++Y+
Sbjct: 9 KKYSNEEEAALRLNIFKTNVDYITSVNS-AQQSYQASKHFSENTQQTALSSLFLSQLAHT 67
Query: 86 -----LGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
LG NEF+D T EEF +++ G N S +SS + F++ +VT SI+W E
Sbjct: 68 DLLPQLGLNEFADQTWEEFSSTHLGLNAGEDG-SFRSSANTGFRHADVTPA-NSINWVEA 125
Query: 141 GAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD-NNGCSGGLMD 199
GAVT +KNQ CGSCWAFS +VEG + G L+ LSEQQLVDC T + GC GGLMD
Sbjct: 126 GAVTPVKNQAFCGSCWAFSTTGSVEGANFLATGDLVSLSEQQLVDCDTKKDQGCGGGLMD 185
Query: 200 KAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHALLQAVTKQ 259
AF+YII+N GL TE DY Y G C+K +E+ +I YED+P DE AL +AV+KQ
Sbjct: 186 YAFDYIIKNGGLDTEEDYSYWSVGGFCNKLREERTVVSIDGYEDVPVNDEVALAKAVSKQ 245
Query: 260 PVSVCVEASGQAFRFYKRGVLNAECGDNC---DHGVAVVGFGTAEEEDGAKYWLIKNSWG 316
PVSV + AS +A +FY GV+ A+ +C +HGV G+ +E G YWL+KNSWG
Sbjct: 246 PVSVAICAS-EAMQFYSSGVIAAK--GSCIGLNHGVLAAGYDV--DESGKPYWLVKNSWG 300
Query: 317 ETWGESGYIRILRD----EGLCGIATEASYPV 344
TWG GY+++ +D EG CGIA ASYPV
Sbjct: 301 GTWGMQGYMKLEKDSSVKEGACGIAMAASYPV 332
>gi|129614|sp|P00784.1|PAPA1_CARPA RecName: Full=Papain; AltName: Full=Papaya proteinase I; Short=PPI;
AltName: Allergen=Car p 1; Flags: Precursor
gi|167391|gb|AAB02650.1| papain precursor [Carica papaya]
gi|387885|gb|AAA72774.1| papain [synthetic construct]
gi|225437|prf||1303270A papain
Length = 345
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 135/357 (37%), Positives = 203/357 (56%), Gaps = 29/357 (8%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPS----IVEKHEQWMAQHGRTYKDE 56
M+ K + + + + + ++ + G S ++ + +++ E WM +H + YK+
Sbjct: 3 MIPSISKLLFVAICLFVYMGLSFGDFSIVGYSQNDLTSTERLIQLFESWMLKHNKIYKNI 62
Query: 57 LEKAMRLTIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQ 116
EK R IFK NL+YI++ NK+ N +Y LG N F+D++N+EF+ YTG S++
Sbjct: 63 DEKIYRFEIFKDNLKYIDETNKK-NNSYWLGLNVFADMSNDEFKEKYTG------SIAGN 115
Query: 117 SSRPSTFKYQNV-----TDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQIT 171
+ + Y+ V ++P +DWR+KGAVT +KNQG CGSCWAFSAV +EGI +I
Sbjct: 116 YT-TTELSYEEVLNDGDVNIPEYVDWRQKGAVTPVKNQGSCGSCWAFSAVVTIEGIIKIR 174
Query: 172 GGKLIELSEQQLVDCSTDNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKE 231
G L E SEQ+L+DC + GC+GG A + ++ G+ YPY+ Q C +++
Sbjct: 175 TGNLNEYSEQELLDCDRRSYGCNGGYPWSALQ-LVAQYGIHYRNTYPYEGVQRYCRSREK 233
Query: 232 KAAAATIGKYEDLPKGDEHALLQAVTKQPVSVCVEASGQAFRFYKRGVLNAECGDNCDHG 291
AA + +E ALL ++ QPVSV +EA+G+ F+ Y+ G+ CG+ DH
Sbjct: 234 GPYAAKTDGVRQVQPYNEGALLYSIANQPVSVVLEAAGKDFQLYRGGIFVGPCGNKVDHA 293
Query: 292 VAVVGFGTAEEEDGAKYWLIKNSWGETWGESGYIRILR----DEGLCGIATEASYPV 344
VA VG+ G Y LIKNSWG WGE+GYIRI R G+CG+ T + YPV
Sbjct: 294 VAAVGY-------GPNYILIKNSWGTGWGENGYIRIKRGTGNSYGVCGLYTSSFYPV 343
>gi|281346354|gb|EFB21938.1| hypothetical protein PANDA_009085 [Ailuropoda melanoleuca]
Length = 333
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 136/341 (39%), Positives = 197/341 (57%), Gaps = 18/341 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M + L C + S ++ + +W A +G+ Y + E+ R ++++N++
Sbjct: 1 MHPSLFLAALCLG-IASAAPRFNENLDARWTRWKAANGKLYNKD-EEVWRRAVWEKNMKM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I++ N+E G ++ L N F DLTNEEF+ G P + F+
Sbjct: 59 IDQHNEEYSQGKHSFILAMNAFGDLTNEEFKQVMNGLKIQNPR------EGNMFQLLPFA 112
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST- 188
+ P+S+DWREKG VT +K+QG CGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 113 ETPSSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRA 172
Query: 189 -DNNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GGLMD AF Y+ +N GL +E YPY + G C + E++AA G + D+ +
Sbjct: 173 EGNAGCNGGLMDNAFRYVKDNGGLDSEESYPYLAQDGRCKYKPEQSAANDTG-FADIHQD 231
Query: 248 DEHALLQAVTKQPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEEE-D 304
+E +L T P+SV ++AS FRFY +G+ + C ++ DHGV VVG+G+ E E +
Sbjct: 232 EESLMLSVATVGPISVAIDASLDTFRFYYKGIYYDPNCSSEDLDHGVLVVGYGSDEREAE 291
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDEG-LCGIATEASYPV 344
YW++KNSWG WG GYI + +D G CGIAT AS+P+
Sbjct: 292 NKNYWIVKNSWGTQWGMQGYILMAKDRGNHCGIATSASFPI 332
>gi|74211558|dbj|BAE26509.1| unnamed protein product [Mus musculus]
Length = 338
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 147/347 (42%), Positives = 198/347 (57%), Gaps = 23/347 (6%)
Query: 11 IPMF---VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFK 67
IP F + L+ T VVS H+PS+ E+W +H +TY E+A + +++
Sbjct: 3 IPGFGSMTPVFLLATLCLGVVSAAPAHDPSLDAVWEEWKTKHRKTYNMN-EEAQKRAVWE 61
Query: 68 QNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFK 124
N++ I N++ G + L N F DLTN EFR TG+ S + F+
Sbjct: 62 NNMKMIGLHNEDYLKGKHGFNLEMNAFGDLTNTEFRELMTGFQ------SMGHKEMTIFQ 115
Query: 125 YQNVTDVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLV 184
+ DVP S+DWR+ G VT +K+QGHCGSCWAFSAV ++EG GKL+ LSEQ L+
Sbjct: 116 EPLLGDVPKSVDWRDHGYVTPVKDQGHCGSCWAFSAVGSLEGQIFRKTGKLVPLSEQNLM 175
Query: 185 DCSTD--NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYE 242
DCS N GC+GGLM+ AF+Y+ EN+GL T Y Y+ G C + K +A I +
Sbjct: 176 DCSWSYGNVGCNGGLMELAFQYVKENRGLDTRESYAYEAWDGPC-RYDPKYSAVNITGFV 234
Query: 243 DLPKGDEHALLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAECGD-NCDHGVAVVGFGT 299
+P E AL+ AV PVSV ++ +FRFY+ G +C N DH V VVG+G
Sbjct: 235 KVPL-SEDALMNAVASVGPVSVGIDTHHHSFRFYRGGTYYEPDCSSTNLDHAVLVVGYG- 292
Query: 300 AEEEDGAKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPVA 345
EE DG KYWL+KNSWGE WG GYI++ +D + CGIAT A YP
Sbjct: 293 -EESDGRKYWLVKNSWGEDWGMDGYIKMAKDRDNNCGIATYAIYPTV 338
>gi|213514640|ref|NP_001134963.1| Cathepsin S precursor [Salmo salar]
gi|209155506|gb|ACI33985.1| Cathepsin S precursor [Salmo salar]
gi|209737594|gb|ACI69666.1| Cathepsin S precursor [Salmo salar]
gi|223647278|gb|ACN10397.1| Cathepsin S precursor [Salmo salar]
gi|223673157|gb|ACN12760.1| Cathepsin S precursor [Salmo salar]
Length = 330
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 137/340 (40%), Positives = 195/340 (57%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEY 72
M ++L C V ++ +P + + + W HG+ Y+ E+E+ R ++++NL+
Sbjct: 2 MLWSLLLAALCGIAV----ALFDPMLEQHWQMWKKTHGKNYQTEVEELGRREVWERNLQL 57
Query: 73 IEKANKEGN---RTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I N E + TY LG N D+T EE S+ P + R+ PS F +
Sbjct: 58 INLHNLEASMDMHTYDLGMNHMGDMTQEEIAQSFASLRVPA-DLKRE---PSAFVGSSGA 113
Query: 130 DVPTSIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCSTD 189
+P + DWREKG VT +K QG CGSCWAFSAV A+EG T GKLI++S Q LVDCS+
Sbjct: 114 PIPDTFDWREKGYVTEVKMQGSCGSCWAFSAVGALEGQLMKTTGKLIDISSQNLVDCSSK 173
Query: 190 --NNGCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKG 247
N GC+GG M +AF+Y+I+N+G+ ++ YPY+ Q C + AA KY LP+G
Sbjct: 174 YGNKGCNGGFMSQAFQYVIDNQGIDSDQSYPYKGVQQQCSYNPAQ-RAANCSKYSFLPEG 232
Query: 248 DEHALLQAV-TKQPVSVCVEASGQAFRFYKRGVLN-AECGDNCDHGVAVVGFGTAEEEDG 305
DE L +A+ T P+SV ++A+ F FY+ GV N C +H V VG+GT +D
Sbjct: 233 DEGVLKEALATIGPISVAIDATRPLFTFYRSGVYNDPTCTKKINHAVLAVGYGTLGGQD- 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-EGLCGIATEASYPV 344
YWL+KNSW +WG+ GYIR+ R+ + CGIA YPV
Sbjct: 292 --YWLVKNSWSLSWGDQGYIRMSRNKDNQCGIALYGCYPV 329
>gi|444514070|gb|ELV10520.1| Cathepsin L1 [Tupaia chinensis]
Length = 450
Score = 249 bits (636), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 141/337 (41%), Positives = 188/337 (55%), Gaps = 25/337 (7%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLTIFKQNLEYIEKA 76
+ L C + S + ++ W + H R Y E+ R ++++N++ IE
Sbjct: 129 LFLAALCLG-IASATPNSDQNLDTSWHHWKSTHRRLYGKN-EEGWRRAVWEKNMKMIEMH 186
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRASYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N E G + +G N F D+TNEEFR G+ +++ F + P
Sbjct: 187 NHEYSNGKHGFTMGMNAFGDMTNEEFRQVMNGFR------NQKQKSGKVFHAPLLLQAPK 240
Query: 134 SIDWREKGAVTHIKNQGHCGSCWAFSAVAAVEGITQITGGKLIELSEQQLVDCST--DNN 191
S+DWREKG VT +KNQG CGSCWAFSA A+EG GKLI LSEQ LVDCS N
Sbjct: 241 SVDWREKGFVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRRQGNL 300
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYQQEQGTCDKQKEKAAAATIGKYEDLPKGDEHA 251
GC GGLMD AF+YI +N GL +E YPY+ GTC + E A A G E A
Sbjct: 301 GCQGGLMDNAFQYIKDNGGLDSEESYPYKGMDGTCQYKAEWAVANDTGF--------EKA 352
Query: 252 LLQAVTK-QPVSVCVEASGQAFRFYKRGV-LNAEC-GDNCDHGVAVVGFGTAEEEDGAKY 308
L++AV P+SV ++A +F+FYK G+ +C +N DHGV VVG+G + KY
Sbjct: 353 LMKAVASVGPISVAIDAGHASFQFYKDGIYYEPDCSSENLDHGVLVVGYGVEKRNSNDKY 412
Query: 309 WLIKNSWGETWGESGYIRILRDE-GLCGIATEASYPV 344
WLIKNSWGE WG +GY++I +D CG+A+ ASYPV
Sbjct: 413 WLIKNSWGEQWGANGYVKIAKDRNNHCGVASAASYPV 449
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.132 0.396
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,559,803,433
Number of Sequences: 23463169
Number of extensions: 236044936
Number of successful extensions: 621177
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6675
Number of HSP's successfully gapped in prelim test: 902
Number of HSP's that attempted gapping in prelim test: 589016
Number of HSP's gapped (non-prelim): 9930
length of query: 346
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 203
effective length of database: 9,003,962,200
effective search space: 1827804326600
effective search space used: 1827804326600
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)