BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 019063
(346 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255564910|ref|XP_002523448.1| cysteine protease, putative [Ricinus communis]
gi|223537276|gb|EEF38907.1| cysteine protease, putative [Ricinus communis]
Length = 341
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 197/339 (58%), Positives = 243/339 (71%), Gaps = 13/339 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
MFV +++V SQ S RS+H+ ++ E+HE WM ++GR YKD EK R IF+ N+E+
Sbjct: 10 MFVALLVVGLWVSQAWS-RSLHDAAMNERHEMWMVKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE NK GNR YKL NEF+DLTNEEF+A GY R S + S S+F+Y NVT VP
Sbjct: 69 IESFNKPGNRPYKLDINEFADLTNEEFKASRNGYKR---SSNVGLSEKSSFRYGNVTAVP 125
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DN 190
TS+DWR+KGAVT IKDQGQCG CWAFSAVAA+EGIT+++ GKLI LSEQ+LVDC T ++
Sbjct: 126 TSMDWRQKGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC GGLMD AFE+I +N GL TEA+YPY+ +GTC+ K AA I+ YED+P E
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
ALL+AV++QPVSV +DASG AF FY GV DCG DHGV VG+GT++ G KYWL
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSD---GTKYWL 302
Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
+KNSWG +WGE GYIR+ RD GLCGIA +SYP A
Sbjct: 303 VKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQSSYPTA 341
>gi|255564908|ref|XP_002523447.1| cysteine protease, putative [Ricinus communis]
gi|223537275|gb|EEF38906.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 399 bits (1026), Expect = e-109, Method: Compositional matrix adjust.
Identities = 196/339 (57%), Positives = 242/339 (71%), Gaps = 12/339 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
MFV +++V ASQ S RS+H+ ++ E+HE WMA++GR YKD EK R IF+ N+E+
Sbjct: 10 MFVALLVVGLWASQAWS-RSLHDAAMNERHEMWMAKYGRVYKDNSEKERRFEIFRNNVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE NK GNR YKL NEF+DLTNEEF+ GY R S + S+F+Y NVT VP
Sbjct: 69 IESFNKLGNRPYKLDINEFADLTNEEFKVSKNGYKR---SSGVGLTEKSSFRYANVTAVP 125
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DN 190
TS+DWR+ GAVT IKDQGQCG CWAFSAVAA+EGIT+++ GKLI LSEQ+LVDC T ++
Sbjct: 126 TSMDWRQNGAVTPIKDQGQCGCCWAFSAVAAMEGITKLSTGKLISLSEQELVDCDTSGED 185
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC GGLMD AFE+I +N GL TEA+YPY+ +GTC+ K AA I+ YED+P E
Sbjct: 186 QGCEGGLMDDAFEFIKQNGGLTTEANYPYQGTDGTCNTNKAGNDAAKITGYEDVPANSED 245
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
ALL+AV++QPVSV +DASG AF FY GV DCG DHGV VG+GT+++ G KYWL
Sbjct: 246 ALLKAVASQPVSVAIDASGSAFQFYSGGVFTGDCGTELDHGVTAVGYGTSDD--GTKYWL 303
Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
+KNSWG +WGE GYIR+ RD GLCGIA SYP A
Sbjct: 304 VKNSWGTSWGEDGYIRMERDIEAKEGLCGIAMQPSYPTA 342
>gi|255563110|ref|XP_002522559.1| cysteine protease, putative [Ricinus communis]
gi|223538250|gb|EEF39859.1| cysteine protease, putative [Ricinus communis]
Length = 344
Score = 396 bits (1018), Expect = e-108, Method: Compositional matrix adjust.
Identities = 193/339 (56%), Positives = 242/339 (71%), Gaps = 10/339 (2%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ + ++LV ASQ S RS+HE S+ +H+ WM Q+GR YK +EK R IFK+N+E+
Sbjct: 10 VLMAMLLVTLWASQSWS-RSLHEASMELRHKTWMTQYGRVYKGNVEKEKRFKIFKENVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N GN+ YKLG N F+DLTNEEFRA + GY + S + S R +F+Y+NVT VP
Sbjct: 69 IESFNNNGNKPYKLGINAFTDLTNEEFRASHNGYTMSMSS-HQSSYRTKSFRYENVTAVP 127
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--N 190
S+DWR KGAVTHIKDQGQCG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC T +
Sbjct: 128 PSLDWRTKGAVTHIKDQGQCGCCWAFSAVAAMEGITKLSTGTLISLSEQELVDCDTSGMD 187
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC GGLMD AFE+IIEN GL TEA+YPY +G+C+ +K AA I+ YE++P DE+
Sbjct: 188 QGCEGGLMDDAFEFIIENNGLTTEANYPYEGVDGSCNTRKAANHAAKITGYENVPAYDEE 247
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
AL +AV+NQPVSV +DA AF Y SG+ DCG DHGV VVG+GT+++ G KYWL
Sbjct: 248 ALRKAVANQPVSVAIDAGESAFQHYSSGIFTGDCGTELDHGVTVVGYGTSDD--GTKYWL 305
Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
+KNSWG +WGE GYIR+ RD GLCGIA SYP A
Sbjct: 306 VKNSWGTSWGEDGYIRMERDIDAKEGLCGIAMEPSYPTA 344
>gi|144905116|dbj|BAF56430.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 390 bits (1001), Expect = e-106, Method: Compositional matrix adjust.
Identities = 187/337 (55%), Positives = 237/337 (70%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++LV S + R++ + S+ E+HEQWMAQ+G+ YKD EK +R IFK+N++ IE
Sbjct: 12 LTLLLVFGFLSFEANARTLEDASMHERHEQWMAQYGKVYKDSYEKELRSKIFKENVQRIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
N GN++YKLG N+F+DLTNEEF+A NR + S+R TFKY++VT VP S
Sbjct: 72 AFNNAGNKSYKLGINQFADLTNEEFKA----RNRFKGHMCSNSTRTPTFKYEHVTSVPAS 127
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHG 192
+DWR+KGAVT IKDQGQCG CWAFSAVAA EGIT+++ GKLI LSEQ+LVDC T + G
Sbjct: 128 LDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQG 187
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GGLMD AF++I++NKGL TEA YPY+ + TC+ E AA+I +ED+P E AL
Sbjct: 188 CEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESAL 247
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
L+AV+NQP+SV +DASG F FY SGV CG DHGV VG+G+ + G KYWL+K
Sbjct: 248 LKAVANQPISVAIDASGSEFQFYSSGVFTGSCGTELDHGVTAVGYGS---DGGTKYWLVK 304
Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
NSWGE WGE GYIR+ RD GLCG A ASYP A
Sbjct: 305 NSWGEQWGEQGYIRMQRDVAAEEGLCGFAMQASYPTA 341
>gi|50355621|dbj|BAD29959.1| cysteine protease [Daucus carota]
Length = 361
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 186/339 (54%), Positives = 245/339 (72%), Gaps = 12/339 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M +IL+ A Q S R++ E S+ E+HEQWM Q+GR YKDE EK++R IF N+++
Sbjct: 29 MIAALILLGAWACQATS-RTLPEASMFERHEQWMIQYGRVYKDEAEKSVRFQIFMDNVKF 87
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE+ NK+G ++YKL NEF+D TNEEF+A GY +VS + S+ + F+Y+NVT VP
Sbjct: 88 IEEFNKDGRQSYKLAVNEFADQTNEEFQASRNGYKM---AVSSRPSQTTLFRYENVTAVP 144
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STDN 190
+S+DWR+KGAVT +KDQGQCGSCWAFS +AA EGIT++ GKLI LSEQ+LVDC + ++
Sbjct: 145 SSMDWRKKGAVTPVKDQGQCGSCWAFSTIAATEGITKLKTGKLISLSEQELVDCDKTGED 204
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC GG M+ FE+I++NKG+A EA YPY +GTC++++E + AA IS YE +P E
Sbjct: 205 QGCEGGYMEDGFEFIVKNKGIALEASYPYTAADGTCNSKEEASRAAKISGYEKVPANSET 264
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
ALL+AV+NQPVSV +DASG AF FY SGV +CG + DHGV VG+G + +G KYWL
Sbjct: 265 ALLKAVANQPVSVSIDASGVAFQFYSSGVFTGECGTDLDHGVTAVGYG--KTSDGTKYWL 322
Query: 311 IKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
+KNSWG +WG+SGYI + R GLCGIA ASYP A
Sbjct: 323 VKNSWGASWGDSGYIMMQRGVAAKGGLCGIAMDASYPTA 361
>gi|224114698|ref|XP_002316833.1| predicted protein [Populus trichocarpa]
gi|222859898|gb|EEE97445.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 187/313 (59%), Positives = 235/313 (75%), Gaps = 14/313 (4%)
Query: 39 VEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
+E+HE WMAQ+GR YK +EK RLNIFK N+E+IE NK G + YKL NEF+DLTNEE
Sbjct: 1 MERHETWMAQYGRAYKGHVEKERRLNIFKNNVEFIESFNKVGKKPYKLSVNEFADLTNEE 60
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
F+A GY + +S S++P F+Y+NV+ VP+++DWR+KGAVT IKDQGQCG CWAF
Sbjct: 61 FQASRNGY-KMSAHLSSSSTKP--FRYENVSAVPSTMDWRKKGAVTPIKDQGQCGCCWAF 117
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEAD 216
SAVAA EGITQ++ GKLI LSEQ+LVDC T ++ GC+GGLMD AF++II+NKGL TEA+
Sbjct: 118 SAVAATEGITQLSTGKLISLSEQELVDCDTSGEDQGCNGGLMDDAFDFIIQNKGLTTEAN 177
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ +G C++ K AA I+ YED+P E ALL+AV+NQPVSV +DA G AF FY
Sbjct: 178 YPYQGADGACNSGK---AAAKITGYEDVPANSEAALLKAVANQPVSVAIDAGGSAFQFYS 234
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
SGV DCG + DHGV VG+G +++ G KYWL+KNSWG +WGE+GYIR+ RD G
Sbjct: 235 SGVFTGDCGTDLDHGVTAVGYGMSDD--GTKYWLVKNSWGTSWGENGYIRMERDIDAQEG 292
Query: 333 LCGIATAASYPVA 345
LCGIA ASYP A
Sbjct: 293 LCGIAMEASYPTA 305
>gi|144905108|dbj|BAF56428.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 184/337 (54%), Positives = 237/337 (70%), Gaps = 12/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++LV + + R++ + S+ E+HEQWM Q+G+ Y D EK +R NIFK+N++ IE
Sbjct: 12 LALLLVFGFLAFEANARTLEDVSLKERHEQWMTQYGKVYTDSYEKELRSNIFKENVQRIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
N GN+ YKLG N+F+DLTNEEF+A NR + S+R TFKY++V+ VP S
Sbjct: 72 AFNNAGNKPYKLGINQFADLTNEEFKA----RNRFKGHMCSNSTRTPTFKYEDVSSVPAS 127
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHG 192
+DWR+KGAVT IKDQGQCG CWAFSAVAA EGIT+++ GKLI LSEQ+LVDC T + G
Sbjct: 128 LDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQELVDCDTKGVDQG 187
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GGLMD AF++I++NKGL TEA YPY+ + TC+ E AA+I +ED+P E AL
Sbjct: 188 CEGGLMDDAFKFIMQNKGLNTEAKYPYQGVDATCNANAEAKDAASIKGFEDVPANSESAL 247
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
L+AV+NQP+SV +DASG F FY SG+ CG DHGV VG+G +++ G KYWL+K
Sbjct: 248 LKAVANQPISVAIDASGSEFQFYSSGLFTGSCGTELDHGVTAVGYGVSDD--GTKYWLVK 305
Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
NSWGE WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 306 NSWGEQWGEEGYIRMQRDVAAEEGLCGIAMQASYPTA 342
>gi|40806498|gb|AAR92154.1| putative cysteine protease 1 [Iris x hollandica]
Length = 340
Score = 386 bits (992), Expect = e-105, Method: Compositional matrix adjust.
Identities = 192/348 (55%), Positives = 247/348 (70%), Gaps = 18/348 (5%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSMHE-PSIVEKHEQWMAQHGRTYKDELEKAMRL 63
F+ ++P ++I+ I ASQ +GRS+ E S++E+HEQWMAQHGR YK+ EKA R
Sbjct: 4 FKTVKLLPALALLIVAI-WASQGEAGRSLGENKSMLERHEQWMAQHGRVYKNAAEKAHRF 62
Query: 64 NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
IF+ N+E IE N E N +KLG N+F+DLTNEEF+ R S+ +S S F
Sbjct: 63 EIFRANVERIESFNAE-NHKFKLGVNQFADLTNEEFK------TRNTLKPSKMASTKS-F 114
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
KY+NVT VP ++DWR KGAVT IKDQGQCGSCWAFSAVAA EGIT+++ GKLI LSEQ++
Sbjct: 115 KYENVTAVPATMDWRTKGAVTPIKDQGQCGSCWAFSAVAATEGITKLSTGKLISLSEQEV 174
Query: 184 VDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
VDC ++D+ GC+GG MD AFEYII+NKG+ TEA+YPY+ +GTC+ +K + AA+I+ Y
Sbjct: 175 VDCDVTSDDQGCNGGEMDDAFEYIIKNKGITTEANYPYKAADGTCNTKKAASHAASITGY 234
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
ED+ E ALL+A +NQP++V +DA AF Y SGV DCG + DHGV +VG+G
Sbjct: 235 EDVTVNSEAALLKAAANQPIAVAIDAGDFAFQMYSSGVFTGDCGTDLDHGVTLVGYGATS 294
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
+ G KYWL+KNSWG +WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 295 D--GTKYWLVKNSWGTSWGEDGYIRMERDVDAKEGLCGIAMDASYPTA 340
>gi|225446581|ref|XP_002280246.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 187/337 (55%), Positives = 241/337 (71%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++ V+ + + RS+HE S+ E+HE WM Q+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEFRA NR + S+ ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
+DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
CSGGLMD AF++I +N GL TEA+YPY +GTC+ +K AA I+ YED+P +E+AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV++QP++V +DASG F FY SGV CG DHGVA VG+GT+++ G KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLVK 304
Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
NSW WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 305 NSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|47524507|gb|AAT34987.1| putative cysteine protease [Gossypium hirsutum]
Length = 344
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 189/349 (54%), Positives = 253/349 (72%), Gaps = 20/349 (5%)
Query: 10 IIPMFVIIILVIT-CASQVVSGRS---MHEPSIVEKHEQWMAQHGRTYKDELE--KAMRL 63
++ +F+ + LV++ C S ++G S + E S+ +HE+WM+QHGR Y DE E K R
Sbjct: 3 LLQIFLFVALVLSFCFSIQLAGLSRPLLDEDSM--RHEEWMSQHGRVYADEQEDHKNKRF 60
Query: 64 NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
N+FK+N+E IE+ N +T+KL N+F+DLTNEEFRA Y G+ P+ +S Q ++P+ F
Sbjct: 61 NVFKENVERIEEFND--GKTFKLAINQFADLTNEEFRASYNGFKGPM-VLSSQITKPTPF 117
Query: 124 KYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
+Y+NV+ +P S+DWR+KGAVT +K+QGQCG CWAFSAVAA+EGITQI+ GKLI LSEQ+
Sbjct: 118 RYENVSSALPVSVDWRKKGAVTPVKNQGQCGCCWAFSAVAAIEGITQISTGKLISLSEQE 177
Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
LVDC T +HGC GGLMD AFE+II N GL TE++YPY+ E+GTC+ K +A +I+
Sbjct: 178 LVDCDTKGIDHGCEGGLMDTAFEFIINNGGLTTESNYPYKGEDGTCNFNKTNPIAVSITG 237
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
YED+P DEQAL++AV++QPVSV ++A G F FY SGV +CG DH V VG+G
Sbjct: 238 YEDVPANDEQALMKAVAHQPVSVAIEAGGSDFQFYSSGVFTGECGTELDHAVTAVGYG-- 295
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
E E+G+KYW++KNSWG WGESGYI + +D GLCGIA ASYP A
Sbjct: 296 ESEDGSKYWIVKNSWGTKWGESGYIEMQKDIKVKQGLCGIAMQASYPTA 344
>gi|224135841|ref|XP_002327317.1| predicted protein [Populus trichocarpa]
gi|222835687|gb|EEE74122.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 181/338 (53%), Positives = 233/338 (68%), Gaps = 11/338 (3%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
F IL++ + V+ R + EPS+ +HEQWM G+ Y D EK R IFK N+EYI
Sbjct: 10 FFAFILILGMWAYEVASRELQEPSMSARHEQWMETFGKVYADAAEKERRFEIFKDNVEYI 69
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N GN+ YKL N+F+DLTNEE + GY RP+ + + + ++FKY+NVT VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADLTNEELKVARNGYRRPLQT---RPMKVTSFKYENVTAVPA 126
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
++DWR+KGAVT IKDQGQCGSCWAFS VAA EGI Q+T GKL+ LSEQ+LVDC T ++
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDTQGEDQ 186
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLM+ FE+II+N G+ TEA+YPY+ +GTC+++KE + A I+ YE +P E A
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKEASRIAKITGYESVPANSEAA 246
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
LL+AV++QP+SV +DA G F FY SGV CG DHGV VG+G E +G KYWL+
Sbjct: 247 LLKAVASQPISVSIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYG--ETSDGTKYWLV 304
Query: 312 KNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
KNSWG +WGE GYIR+ RD GLCGIA +SYP A
Sbjct: 305 KNSWGTSWGEEGYIRMQRDTEAEEGLCGIAMDSSYPTA 342
>gi|356515048|ref|XP_003526213.1| PREDICTED: vignain-like [Glycine max]
Length = 350
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 184/337 (54%), Positives = 235/337 (69%), Gaps = 12/337 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ +++L+ C SQV+S R++HE S+ E+HEQWM ++G+ YKD EK RL IFK N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N GN+ YKL N +D TNEEF A + GY + S + FKY NVTD+P
Sbjct: 69 IESFNAAGNKPYKLSINHLADQTNEEFVASHNGYKY------KGSHSQTPFKYGNVTDIP 122
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
T++DWR+ GAVT +KDQGQCGSCWAFS VAA EGI QI+ G L+ LSEQ+LVDC + +HG
Sbjct: 123 TAVDWRQNGAVTAVKDQGQCGSCWAFSTVAATEGIYQISTGMLMSLSEQELVDCDSVDHG 182
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GGLM+ FE+II+N G+++EA+YPY +GTCD KE + AA I YE +P E+AL
Sbjct: 183 CDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIKGYETVPANSEEAL 242
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
QAV+NQPVSV +DA G F FY SGV CG DHGV VVG+GT ++ +YW++K
Sbjct: 243 QQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGTTDDGT-HEYWIVK 301
Query: 313 NSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
NSWG WGE GYIR+ R DA GLCGIA ASYP+
Sbjct: 302 NSWGTQWGEEGYIRMQRGIDAQEGLCGIAMDASYPMG 338
>gi|147839728|emb|CAN70559.1| hypothetical protein VITISV_032465 [Vitis vinifera]
Length = 341
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 186/337 (55%), Positives = 240/337 (71%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++ V+ + + R +HE S+ E+HE WM Q+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARXLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEFRA NR + S+ ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
+DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
CSGGLMD AF++I +N GL TEA+YPY +GTC+ +K AA I+ YED+P +E+AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV++QP++V +DASG F FY SGV CG DHGVA VG+GT+++ G KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLVK 304
Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
NSW WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 305 NSWSTGWGEEGYIRMQRDVTVKEGLCGIAMQASYPTA 341
>gi|225446583|ref|XP_002280204.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 185/337 (54%), Positives = 241/337 (71%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++ V+ + + R++HE S+ E+HE WM Q+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEFRA NR + S+ ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
+DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
CSGGLMD AF++I +N GL TEA+YPY +GTC+ +K AA I+ YED+P +E+AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV++QP++V +DA G F FY SGV CG DHGV+ VG+GT+++ G KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDD--GMKYWLVK 304
Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
NSWG WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 305 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|356543116|ref|XP_003540009.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine max]
Length = 337
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/337 (54%), Positives = 232/337 (68%), Gaps = 13/337 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ +++L+ C SQV+S R +HE S+ E+HEQWM ++G+ YKD EK RL IFK N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RYLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N GN+ YKLG N +D TNEEF A + GY + S + FKY+NVT VP
Sbjct: 69 IESFNAAGNKPYKLGINHLADQTNEEFVASHNGYKH------KASHSQTPFKYENVTGVP 122
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
++DWRE GAVT +KDQGQCGSCWAFS VAA EGI QIT L+ LSEQ+LVDC + +HG
Sbjct: 123 NAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHG 182
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GG M+ FE+II+N G+++EA+YPY +GTCD KE + AA I YE +P E AL
Sbjct: 183 CDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDAL 242
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV+NQPVSV +DA G AF FY SGV CG DHGV VG+G+ ++ G +YW++K
Sbjct: 243 QKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDD--GTQYWIVK 300
Query: 313 NSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
NSWG WGE GYIR+ R DA GLCGIA ASYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337
>gi|356543124|ref|XP_003540013.1| PREDICTED: vignain-like [Glycine max]
gi|356543126|ref|XP_003540014.1| PREDICTED: vignain-like [Glycine max]
Length = 337
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 184/337 (54%), Positives = 232/337 (68%), Gaps = 13/337 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ +++L+ C SQV+S R++HE S+ E+HEQWM ++G+ YKD EK RL IFK N+E+
Sbjct: 10 ILALVLLLSICTSQVMS-RNLHEASMSERHEQWMKKYGKVYKDAAEKQKRLLIFKDNVEF 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N GNR YKL N +D TNEEF A + GY + S + FKY+NVT VP
Sbjct: 69 IESFNAAGNRPYKLSINHLADQTNEEFVASHNGYKH------KGSHSQTPFKYENVTGVP 122
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
++DWRE GAVT +KDQGQCGSCWAFS VAA EGI QIT L+ LSEQ+LVDC + +HG
Sbjct: 123 NAVDWRENGAVTAVKDQGQCGSCWAFSTVAATEGIYQITTSMLMSLSEQELVDCDSVDHG 182
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GG M+ FE+II+N G+++EA+YPY +GTCD KE + AA I YE +P E AL
Sbjct: 183 CDGGYMEGGFEFIIKNGGISSEANYPYTAVDGTCDANKEASPAAQIKGYETVPANSEDAL 242
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV+NQPVSV +DA G AF FY SGV CG DHGV VG+G+ ++ G +YW++K
Sbjct: 243 QKAVANQPVSVTIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDD--GTQYWIVK 300
Query: 313 NSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
NSWG WGE GYIR+ R DA GLCGIA ASYP A
Sbjct: 301 NSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 337
>gi|359485281|ref|XP_002280230.2| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase CEP1 [Vitis vinifera]
Length = 341
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 186/337 (55%), Positives = 240/337 (71%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++ V+ + + R++HE S+ E+HE WMAQ+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEF T NR + S+ ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFG---TSRNRFKAHIC--STEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
IDWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 IDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C+GGLMD AF++I +N GL TEA+YPY +GTC+ +K AA I+ YED+P +E+AL
Sbjct: 187 CNGGLMDDAFKFIKQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV +QP++V +DA G F FY SGV CG DHGVA VG+GT+++ G KYWL+K
Sbjct: 247 QKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLVK 304
Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
NSWG WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 305 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|147788834|emb|CAN64655.1| hypothetical protein VITISV_005140 [Vitis vinifera]
Length = 341
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 185/337 (54%), Positives = 240/337 (71%), Gaps = 13/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++ V+ + R++HE S+ E+HE WMAQ+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASHAKARNLHEASMYERHEDWMAQYGRVYKDAGEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK N++YKL NEF+DLTNEEFRA NR + S+ ++FKY++V VP++
Sbjct: 72 SFNKAMNKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYEHVXAVPST 126
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
+DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
CSGGLMD AF++I +N GL TEA+YPY +GTC+ +K AA I+ YED+P +E+AL
Sbjct: 187 CSGGLMDDAFKFIEQNHGLTTEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 246
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV++QP++V +DA G F FY SGV CG DHGV+ VG+GT+++ G KYWL+K
Sbjct: 247 QKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDD--GMKYWLVK 304
Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
NSWG WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 305 NSWGTGWGEEGYIRMQRDVTEKEGLCGIAMQASYPTA 341
>gi|124484401|dbj|BAF46311.1| cysteine proteinase precursor [Ipomoea nil]
Length = 339
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 183/341 (53%), Positives = 238/341 (69%), Gaps = 13/341 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ + + + LV ++ + + R++ + + +HEQWMAQ+GR YK+E+EK R NIFK+N+
Sbjct: 6 LKLLIALALVFATSAYLATSRTLLDSLMAVRHEQWMAQYGRVYKNEVEKTKRYNIFKENV 65
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
EYIE NK G + YKLG N F+DLTN+EF A GY P + S + F+Y+NV+
Sbjct: 66 EYIESFNKAGTKPYKLGINAFADLTNKEFIASRNGYILP-----HECSSNTPFRYENVSA 120
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
VPT++DWR+KGAVT +KDQGQCG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC
Sbjct: 121 VPTTVDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKG 180
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
+ GC GGLMD AF +II NKGL TE++YPY+ +G+C K AA IS YED+P
Sbjct: 181 IDQGCEGGLMDDAFTFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANS 240
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL +AV+NQPVSV +DA G F FY SGV +CG DHGV VG+G AE+ G+KY
Sbjct: 241 ESALEKAVANQPVSVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAED--GSKY 298
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
WL+KNSWG +WGE GYIR+ +D GLCGIA +SYP A
Sbjct: 299 WLVKNSWGTSWGEKGYIRMQKDIEAKEGLCGIAMQSSYPSA 339
>gi|255568297|ref|XP_002525123.1| cysteine protease, putative [Ricinus communis]
gi|223535582|gb|EEF37250.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 379 bits (974), Expect = e-103, Method: Compositional matrix adjust.
Identities = 181/345 (52%), Positives = 240/345 (69%), Gaps = 14/345 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
K I+P+ + +L + CA Q S R +HE + +HE+WMA+HG+ YKD+ EK R IF
Sbjct: 6 KGKILPIALFFVLAM-CADQAAS-RELHELEMTGRHEKWMAKHGKVYKDDKEKLRRFQIF 63
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
K N+ +IE N GN++Y LG N+F+DLTNEEFRA + GY RP+ + S + + FKY+
Sbjct: 64 KSNVVFIESFNTAGNKSYMLGINKFADLTNEEFRAFWNGYKRPLGA----SRKITPFKYE 119
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
NVT +P+SIDWR KGAVT IKDQG CGSCWAFSAVAA EGI ++ GKL+ LSEQ+LVDC
Sbjct: 120 NVTALPSSIDWRSKGAVTPIKDQGVCGSCWAFSAVAATEGIHKLRTGKLVSLSEQELVDC 179
Query: 187 ST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
+ GC GGLM AF++I + G+ +EA+YPY+ +G CD +KE + A I+ Y+ +
Sbjct: 180 DVKGQDKGCQGGLMVDAFKFIKRHGGMTSEANYPYQGRDGKCDTKKEASRAVKITGYQAV 239
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
PK E ALL+AV+NQPVSV +DA +F FY+SG+ CG + +HGVA VG+G + +
Sbjct: 240 PKNSEAALLKAVANQPVSVAIDAGSLSFQFYRSGIFTGICGKDINHGVAAVGYGRS--NS 297
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
G+KYW++KNSWG WGE GYIR+ RD GLCGIA SYP A
Sbjct: 298 GSKYWIVKNSWGTEWGEKGYIRMKRDVRSKEGLCGIAMECSYPTA 342
>gi|225446589|ref|XP_002280263.1| PREDICTED: vignain [Vitis vinifera]
Length = 339
Score = 379 bits (974), Expect = e-103, Method: Compositional matrix adjust.
Identities = 194/341 (56%), Positives = 237/341 (69%), Gaps = 17/341 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ M ++ IL ASQ S RS+HE S+ E+HE WMA++GR YKD EK R IFK N+
Sbjct: 10 VSMALLFILA-AWASQATS-RSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV 67
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
IE NK ++TYKL NEF+DLTNEEFR+L + + S +TFKY+NVT
Sbjct: 68 ARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNRFKAHI------CSEATTFKYENVTA 121
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
VP++IDWR+KGAVT IKDQ QCG CWAFSAVAA EGITQIT GKLI LSEQ+LVDC T
Sbjct: 122 VPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGG 181
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
+N GCSGGLMD AF + I+ GLA+EA YPY ++GTC+++KE AA I YED+P +
Sbjct: 182 ENQGCSGGLMDDAFRF-IKIHGLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANN 240
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E+AL +AV++QPV+V +DA G F FY SGV CG DHGVA VG+G ++G Y
Sbjct: 241 EKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIG--DDGMMY 298
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
WL+KNSWG WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 299 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 339
>gi|225443827|ref|XP_002274223.1| PREDICTED: vignain-like [Vitis vinifera]
Length = 340
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 189/337 (56%), Positives = 236/337 (70%), Gaps = 12/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++++ ASQ +S R++HE S+ E+HE WM +GRTYKD EK R IFK+N+EYIE
Sbjct: 10 ITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIE 68
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
N GNR YKL NEF+D TNEEF+A GYN S +SS ++F+Y+NV VP+S
Sbjct: 69 SVNSAGNRRYKLSINEFADQTNEEFKASRNGYNM---SSRPRSSEITSFRYENVAAVPSS 125
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
+DWR+KGAVT IKDQGQCG CWAFSAVAA+EG+TQ+ G+LI LSEQ+LVDC T ++ G
Sbjct: 126 MDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQG 185
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GGLMD AFE+II N GL TEA+YPY+ + TC+ +K + AA I YED+P E AL
Sbjct: 186 CGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSEAAL 245
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
L+AV+ PVSV +DA G F FY SGV CG DHGV VG+G + ++G KYWL+K
Sbjct: 246 LKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYG--KTDDGTKYWLVK 303
Query: 313 NSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
NSWG WGE GYI + R D GLCGIA ASYP A
Sbjct: 304 NSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 340
>gi|24285904|gb|AAL14199.1| cysteine proteinase precursor [Ipomoea batatas]
gi|56961686|gb|AAK15148.2| cysteine proteinase-like protein [Ipomoea batatas]
Length = 341
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 181/328 (55%), Positives = 232/328 (70%), Gaps = 13/328 (3%)
Query: 24 ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT 83
++ + + R++ + +V +HEQWMAQ+GR Y++E+EK R NIFK+N+EYIE NK G +
Sbjct: 21 SAYLATSRTLSDSLMVVRHEQWMAQYGRVYENEVEKTKRFNIFKENVEYIESFNKAGTKP 80
Query: 84 YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV 143
YKLG N F+DLTN+EF+A GY P S + F+Y+NV+ VPT++DWR KGAV
Sbjct: 81 YKLGINAFADLTNQEFKASRNGYKLP-----HDCSSNTPFRYENVSSVPTTVDWRTKGAV 135
Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKA 201
T +KDQGQCG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC + GC GGLMD A
Sbjct: 136 TPVKDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGIDQGCEGGLMDDA 195
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F +II NKGL TE++YPY+ +G+C K AA IS YED+P E AL +AV+NQPV
Sbjct: 196 FSFIINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPV 255
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV +DA G F FY SGV +CG DHGV VG+G AE+ G+KYWL+KNSWG +WGE
Sbjct: 256 SVAIDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAED--GSKYWLVKNSWGTSWGE 313
Query: 322 SGYIRILRD----AGLCGIATAASYPVA 345
GYIR+ +D GLCGIA +SYP A
Sbjct: 314 KGYIRMQKDIEAKEGLCGIAMQSSYPSA 341
>gi|13491750|gb|AAK27968.1|AF242372_1 cysteine protease [Ipomoea batatas]
Length = 339
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 182/325 (56%), Positives = 228/325 (70%), Gaps = 13/325 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
+ + R++ + +V +HEQWMAQ+GR YK E EK R NIFK+N+EYIE NK G + YKL
Sbjct: 22 LATSRTLSDSLMVVRHEQWMAQYGRVYKTEAEKTKRFNIFKENVEYIESFNKAGTKPYKL 81
Query: 87 GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
G N F+DLTN+EF+A GY P S + F+Y+NV+ VPT++DWR KGAVT +
Sbjct: 82 GINAFADLTNQEFKASRNGYKLP-----HDCSSNTPFRYENVSSVPTTVDWRTKGAVTPV 136
Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEY 204
KDQGQCG CWAFSAVAA+EGIT+++ G LI LSEQ+LVDC + GC GGLMD AF +
Sbjct: 137 KDQGQCGCCWAFSAVAAMEGITKLSTGNLISLSEQELVDCDVKGTDQGCEGGLMDDAFSF 196
Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
II NKGL TE++YPY+ +G+C K AA IS YED+P E AL +AV+NQPVSV
Sbjct: 197 IINNKGLTTESNYPYQGTDGSCKKSKSSNSAAKISGYEDVPANSESALEKAVANQPVSVA 256
Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
+DA G F FY SGV +CG DHGV VG+G AE+ G+KYWL+KNSWG +WGE GY
Sbjct: 257 IDAGGSDFQFYSSGVFTGECGTELDHGVTAVGYGIAED--GSKYWLVKNSWGTSWGEKGY 314
Query: 325 IRILRD----AGLCGIATAASYPVA 345
IR+ +D GLCGIA +SYP A
Sbjct: 315 IRMQKDIEAKEGLCGIAMQSSYPSA 339
>gi|10336513|dbj|BAB13759.1| cysteine proteinase [Astragalus sinicus]
Length = 343
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 188/343 (54%), Positives = 240/343 (69%), Gaps = 16/343 (4%)
Query: 13 MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
++ I + ++ C A QV S R++ + S+ E+H+QWM Q+ + Y D E R IFK+
Sbjct: 7 LYYISLALLMCLGLWAVQVTS-RTLQDASMYERHQQWMGQYAKIYNDHQEWEKRFQIFKE 65
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
N+ YIE +NKEG R YKLG N+F DLTNEEF A NR + R +T+KY+NV
Sbjct: 66 NVNYIETSNKEGGRFYKLGVNQFVDLTNEEFIAPR---NRFKGHMCSSIIRTNTYKYENV 122
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
T VP+++DWR+KGAVT +KDQGQCG CWAFSAVAA EGI Q++ GKLI LSEQ+LVDC T
Sbjct: 123 TTVPSNVDWRQKGAVTPVKDQGQCGCCWAFSAVAATEGIHQLSTGKLISLSEQELVDCDT 182
Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
+ GC GGLMD AF++II+N GL TEA YPY+ +GTC+ + AATI+ YED+P
Sbjct: 183 KGVDQGCEGGLMDDAFKFIIQNHGLDTEAKYPYQGVDGTCNANEASINAATITSYEDVPT 242
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
+EQAL +AV+NQP+SV +DASG F FY SGV CG DHGV VG+G +++ G
Sbjct: 243 NNEQALQKAVANQPISVAIDASGSDFQFYTSGVFTGSCGTELDHGVTAVGYGVSDD--GT 300
Query: 307 KYWLIKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
KYWL+KNSWG +WGE GYIR+ R DA GLCGIA ASYP+A
Sbjct: 301 KYWLVKNSWGTSWGEEGYIRMQRGVDAVEGLCGIAMQASYPIA 343
>gi|318136892|gb|ADV41672.1| cysteine protease [Nicotiana tabacum]
Length = 349
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 182/340 (53%), Positives = 234/340 (68%), Gaps = 9/340 (2%)
Query: 13 MFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
+ + I + SQV S R + +E S+ +H+QW+A H + YKD EK MR IFK+N+E
Sbjct: 12 LALFFIFLGVWRSQVASSRPINYEASMRARHDQWIAHHDKVYKDLNEKEMRFKIFKENVE 71
Query: 72 YIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
IE N ++ YKLG N+FSDLTNE+FR L+TGY R P V S + F+Y NVTD+
Sbjct: 72 RIEAFNAGEDKGYKLGVNKFSDLTNEKFRVLHTGYKRSHPKVMSSSKPKTHFRYANVTDI 131
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--D 189
P ++DWR+KGAVT IKDQ +CG CWAFSAVAA EG+ Q+ GKLI LSEQ+LVDC +
Sbjct: 132 PPTMDWRKKGAVTPIKDQKECGCCWAFSAVAATEGLHQLKTGKLIPLSEQELVDCDVEGE 191
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
+ GCSGGL+D AF++I++NKGL TEA+YPY+ E+G C+ +K AA I+ YED+P E
Sbjct: 192 DEGCSGGLLDTAFDFILKNKGLTTEANYPYKGEDGVCNKKKSALSAAKIAGYEDVPANSE 251
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
+ALLQAV+NQPVSV +D S F FY SGV + C +H V VG+G + G KYW
Sbjct: 252 KALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTD--GTKYW 309
Query: 310 LIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
+IKNSWG WG+SGY+RI RD GLCG+A ASYP A
Sbjct: 310 IIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|225446585|ref|XP_002280215.1| PREDICTED: vignain [Vitis vinifera]
Length = 341
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 186/338 (55%), Positives = 241/338 (71%), Gaps = 14/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
++ + ASQ + R++ E S+ E+HE WMAQ+GR YKD EK+ R IFK N+ I
Sbjct: 12 LALLFFLAAWASQATA-RNLLEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARI 70
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E NK +++YKL NEF+DLTNEEFRA NR + S+ ++FKY++V VP+
Sbjct: 71 ESFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYEHVAAVPS 125
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
++DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++
Sbjct: 126 TVDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQ 185
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC+GGLMD AF++I +N GLATEA+YPY +GTC+ +K AA I+ YED+P +E+A
Sbjct: 186 GCNGGLMDDAFKFIEQNHGLATEANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKA 245
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
L +AV++QP++V +DA G F FY SGV CG DHGVA VG+GT+++ G KYWL+
Sbjct: 246 LQKAVAHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLV 303
Query: 312 KNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
KNSWG WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 304 KNSWGTGWGEVGYIRMQRDVTAKEGLCGIAMQASYPTA 341
>gi|356542633|ref|XP_003539771.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 341
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 186/349 (53%), Positives = 237/349 (67%), Gaps = 14/349 (4%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+ F+K F + + +I CA + + R++ + + E+HEQWMA HG+ YK EK +
Sbjct: 1 MAFKKLFHCTLALFLIFAF-CAFEA-NARTLEDAPMRERHEQWMATHGKVYKHSYEKEQK 58
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
IF +N++ IE N G + YKLG N F+DLTNEEF+A+ NR V + +R +T
Sbjct: 59 YQIFMENVQRIEAFNNAGXKPYKLGINHFADLTNEEFKAI----NRFKGHVCSKRTRTTT 114
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F+Y+NVT VP S+DWR+KGAVT IKDQGQCG CWAFSAVAA EGIT++ GKLI LSEQ+
Sbjct: 115 FRYENVTAVPASLDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGITKLRTGKLISLSEQE 174
Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
LVDC T + GC GGLMD AF++I++NKGLATEA YPY +GTC+ + + A +I
Sbjct: 175 LVDCDTKGVDQGCEGGLMDDAFKFILQNKGLATEAIYPYEGFDGTCNAKADGNHAGSIKG 234
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
YED+P E ALL+AV+NQPVSV ++ASG F FY GV CG N DHGV VG+G
Sbjct: 235 YEDVPANSESALLKAVANQPVSVAIEASGFKFQFYSGGVFTGSCGTNLDHGVTSVGYGVG 294
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
++ G KYWL+KNSWG WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 295 DD--GTKYWLVKNSWGVKWGEKGYIRMQRDVAAKEGLCGIAMLASYPSA 341
>gi|224076970|ref|XP_002305073.1| predicted protein [Populus trichocarpa]
gi|222848037|gb|EEE85584.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 180/340 (52%), Positives = 237/340 (69%), Gaps = 16/340 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS-IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
+F+ +L++ + ++ R + E ++++HE+WMAQHGR Y D EK R IFK+N+E
Sbjct: 10 IFLPFLLILAAWATKIACRPLDEQEYMLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIE 69
Query: 72 YIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR--PSTFKYQNVT 129
IE N +R YKLG N+F+DLTNEEFRA+Y GY RQSS+ S+F+Y+N++
Sbjct: 70 RIEAFNNGSDRGYKLGVNKFADLTNEEFRAMYHGY-------KRQSSKLMSSSFRYENLS 122
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
D+PTS+DWR GAVT +KDQG CG CWAFS VAA+EGI ++ G LI LSEQQLVDC+
Sbjct: 123 DIPTSMDWRNDGAVTPVKDQGTCGCCWAFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAG 182
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N GC GGLMD AF+YII N GL +E +YPY+ +GTC ++K + A I+ YED+P+ +E
Sbjct: 183 NKGCQGGLMDTAFQYIIRNGGLTSEDNYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNE 242
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
ALLQAV+ QPVSV VD G F FYKSGV N DCG +H V +G+GT + +G YW
Sbjct: 243 NALLQAVAKQPVSVGVDGGGNDFQFYKSGVFNGDCGTQQNHAVTAIGYGT--DIDGTDYW 300
Query: 310 LIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
L+KNSWG +WGE+GY+R+ R GLCG+A ASYP A
Sbjct: 301 LVKNSWGTSWGENGYMRMRRGIGSSEGLCGVAMDASYPTA 340
>gi|144905104|dbj|BAF56427.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 377 bits (967), Expect = e-102, Method: Compositional matrix adjust.
Identities = 183/338 (54%), Positives = 242/338 (71%), Gaps = 13/338 (3%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
F +++ + A QV S R++ + S+ E+HEQWMA++G+ YKD EK R NIF++N++YI
Sbjct: 12 FALVLCLGLWAFQV-SSRTLQDASMHERHEQWMARYGKVYKDLQEKEKRFNIFQENVKYI 70
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E +N GN+ YKLG N+F+DLTN+EF A N+ +S +R +TFKY+NVT P+
Sbjct: 71 EASNNAGNKPYKLGVNQFTDLTNKEFIATR---NKFKGHMSSSITRTTTFKYENVT-APS 126
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
++DWR++GAVT +K+QG CG CWAFSAVAA EGI +++ G L+ LSEQ+LVDC T +
Sbjct: 127 TVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQELVDCDTSGADQ 186
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLMD AF++II+N GL TEA YPY+ +GTC+ +E ATI+ YED+P +EQA
Sbjct: 187 GCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEVTHVATITGYEDVPSNNEQA 246
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
L QAV+NQP+SV +DASG F Y+SGV CG DHGVAVVG+G +++ G KYWL+
Sbjct: 247 LQQAVANQPISVAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVSDD--GTKYWLV 304
Query: 312 KNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
KNSWGE WGE GYIR+ RD GLCGIA SYP A
Sbjct: 305 KNSWGEDWGEEGYIRMQRDVEAPEGLCGIAMQPSYPTA 342
>gi|535454|gb|AAA50755.1| cysteine proteinase [Alnus glutinosa]
Length = 340
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 181/345 (52%), Positives = 239/345 (69%), Gaps = 15/345 (4%)
Query: 9 FIIPMFVIIILVI--TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
F+ F ++++V ASQ+ + RS+ + S+ E+HE+WMA +GR YKD EK R IF
Sbjct: 3 FVSQCFCLVVMVTLGALASQLAAARSLQDASMRERHEEWMASYGRVYKDINEKQKRYKIF 62
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
++N+ IE +NK+ N+ YKL N+F+DLTNEEF+A + + S ++ ++FKY
Sbjct: 63 EENVALIESSNKDANKPYKLSVNQFADLTNEEFKASRNRFKGHICS-----TKSTSFKYG 117
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
NV+ VP+++DWR KGAVT +KDQGQCG CWAFSAVAA EGIT++T G+LI LSEQ+LVDC
Sbjct: 118 NVSAVPSAMDWRMKGAVTPVKDQGQCGCCWAFSAVAATEGITKLTTGELISLSEQELVDC 177
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
T + GC GGLMD AF +I N GLA+EA+YPY+ +GTC+ K+ AA I+ +ED+
Sbjct: 178 DTSGVDQGCEGGLMDNAFTFIQHNHGLASEANYPYKGVDGTCNTNKQAIHAAEINGFEDV 237
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
P E+ALL AV++QPVSV +DA G F FY GV CG DHGV VG+GT+++
Sbjct: 238 PANSEEALLNAVAHQPVSVAIDAGGSGFQFYSKGVFIGACGTQLDHGVTAVGYGTSDD-- 295
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
G KYWL+KNSWG WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 296 GTKYWLVKNSWGTQWGEEGYIRMQRDVDAKEGLCGIAMKASYPTA 340
>gi|255568299|ref|XP_002525124.1| cysteine protease, putative [Ricinus communis]
gi|223535583|gb|EEF37251.1| cysteine protease, putative [Ricinus communis]
Length = 342
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 175/339 (51%), Positives = 238/339 (70%), Gaps = 12/339 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ + + V+ + S R +HE ++VE+HE+WMA+HG+ YKD+ EK R IFK N+E+
Sbjct: 10 LLIALFFVLAMWADQASTRELHESTMVERHEKWMAKHGKVYKDDEEKLRRFQIFKNNVEF 69
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE +N GN +Y LG N F+DLTNEEFRA + GY RP+ + S + FKY+NVT +P
Sbjct: 70 IESSNAAGNNSYMLGINRFADLTNEEFRASWNGYKRPLDA----SRIVTPFKYENVTALP 125
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DN 190
S+DWR KGAVT IKDQ +CGSCWAFSAVAA EG+ ++ GKL+ LSEQ+LVDC ++
Sbjct: 126 YSMDWRRKGAVTSIKDQRECGSCWAFSAVAATEGVHKLRTGKLVSLSEQELVDCDVKGED 185
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC GGLM+ AF++I N G+ TEA+Y YR +G CD +KE + A I+ Y+ +P+ E
Sbjct: 186 KGCQGGLMEDAFKFIKRNGGITTEANYAYRGRDGKCDTKKEASHVAKITGYQVVPENSEA 245
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
ALL+AV++QPVSV +DA +F FY+SG+ CG++ +HGVA VG+GT+ +G+KYW+
Sbjct: 246 ALLKAVAHQPVSVSIDAGSMSFQFYQSGIYAGSCGSDLNHGVAAVGYGTS--SSGSKYWI 303
Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
+KNSWG WGE GY+R+ RD GLCGIA SYP A
Sbjct: 304 VKNSWGPEWGERGYVRMKRDITSRKGLCGIAMDCSYPTA 342
>gi|224076968|ref|XP_002305072.1| predicted protein [Populus trichocarpa]
gi|222848036|gb|EEE85583.1| predicted protein [Populus trichocarpa]
Length = 305
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 177/314 (56%), Positives = 224/314 (71%), Gaps = 15/314 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++HE+WMAQHGR Y D EK R IFK+N+E IE N +R YKLG N+F+DLTNE
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 98 EFRALYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
EFRA+Y GY RQSS+ S+F+Y+N++D+PTS+DWR GAVT +KDQG CG C
Sbjct: 61 EFRAMYHGY-------KRQSSKLMSSSFRYENLSDIPTSMDWRNDGAVTPVKDQGTCGCC 113
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEA 215
WAFS VAA+EGI ++ G LI LSEQQLVDC+ N GC GGLMD AF+YII N GL +E
Sbjct: 114 WAFSTVAAIEGIIKLQTGNLISLSEQQLVDCTAGNKGCQGGLMDTAFQYIIRNGGLTSED 173
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
+YPY+ +GTC ++K + A I+ YED+P+ +E ALLQAV+ QPVSV VD G F FY
Sbjct: 174 NYPYQGVDGTCSSEKAASTEAQITGYEDVPQNNENALLQAVAKQPVSVAVDGGGNDFRFY 233
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DA 331
KSGV DCG N +HGV +G+GT + +G YWL+KNSWG +WGESGY R+ R
Sbjct: 234 KSGVFEGDCGTNLNHGVTAIGYGT--DSDGTDYWLVKNSWGTSWGESGYTRMQRGIGASE 291
Query: 332 GLCGIATAASYPVA 345
GLCG+A ASYP +
Sbjct: 292 GLCGVAMDASYPTS 305
>gi|50355613|dbj|BAD29955.1| cysteine protease [Daucus carota]
Length = 365
Score = 375 bits (963), Expect = e-101, Method: Compositional matrix adjust.
Identities = 184/352 (52%), Positives = 242/352 (68%), Gaps = 18/352 (5%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ +K + + + +F I +L + + + RS++E S+ E H+QWMA++GR YK EK
Sbjct: 3 LTIKHQCTPLALLFTIGVL-----ASLAAARSLNEASMTETHDQWMARYGRVYKTANEKN 57
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IF++NL+YI+ NK N+ YKLG NEF+DLTNEEF + V + +
Sbjct: 58 RRSTIFQENLKYIQTFNKANNKPYKLGVNEFADLTNEEFTTSRNKFKSHVCA-----TVT 112
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ F+Y+NVT VP ++DWR+KGAVT IK+QGQCG CWAFSAVAA+EGITQ+ GKLI LSE
Sbjct: 113 NVFRYENVTAVPATMDWRKKGAVTPIKNQGQCGCCWAFSAVAAMEGITQLKTGKLISLSE 172
Query: 181 QQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
Q+LVDC T+ + GC GGLMD AF++I +N GL+TE +YPY +GTC+ KE AATI
Sbjct: 173 QELVDCDTNGEDQGCEGGLMDYAFDFIQQNHGLSTETNYPYSGTDGTCNANKEANHAATI 232
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
+ +ED+P E ALL+AV+NQP+SV +DASG F FY SGV +CG DHGV VG+G
Sbjct: 233 TGHEDVPANSESALLKAVANQPISVAIDASGSDFQFYSSGVFTGECGTELDHGVTAVGYG 292
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVAI 346
TA + G KYWL+KNSWG +WGE GYI++ R GLCGIA ASYP A
Sbjct: 293 TAAD--GTKYWLVKNSWGTSWGEEGYIQMQRGVAAAEGLCGIAMQASYPTAF 342
>gi|255580657|ref|XP_002531151.1| cysteine protease, putative [Ricinus communis]
gi|223529264|gb|EEF31236.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 183/338 (54%), Positives = 237/338 (70%), Gaps = 15/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+I L+ SQ ++ R++ + S+ EKHE+WM++ GR Y D EK +R IFK+N++ I
Sbjct: 12 LALIFLLGALVSQAMA-RTLQDASMHEKHEEWMSRFGRVYNDGNEKEIRYKIFKENVQRI 70
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E NK ++YKLG N+F+DLTNEEF+ T NR + SS+ F+Y+N+T P+
Sbjct: 71 ESFNKASGKSYKLGINQFADLTNEEFK---TSRNRFKGHMC--SSQAGPFRYENLTAAPS 125
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
S+DWR+KGAVT IKDQGQCGSCWAFSAVAAVEGITQ+ KLI LSEQ+LVDC T ++
Sbjct: 126 SMDWRKKGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQ 185
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLMD AF++I +N+GL TEA+YPY +GTC+ ++E AA I+ +ED+P +E A
Sbjct: 186 GCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGA 245
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
L++AV+ QPVSV +DA G F FY SG+ DCG DHGVA VG+G E NG YWL+
Sbjct: 246 LMKAVAKQPVSVAIDAGGFGFQFYSSGIFTGDCGTELDHGVAAVGYG---ESNGMNYWLV 302
Query: 312 KNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
KNSWG WGE GYIR+ +D GLCGIA ASYP A
Sbjct: 303 KNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340
>gi|409190991|gb|AFV30165.1| cysteine proteinase [Lotus japonicus]
Length = 342
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 182/349 (52%), Positives = 243/349 (69%), Gaps = 13/349 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+ + F F +++ + A QV S R++ + S+ E+HEQWMA++GR YKD EK R
Sbjct: 1 MATKNQFYQVSFALVLCLGLWAFQV-SSRTLQDASMQERHEQWMARYGRVYKDLQEKEKR 59
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
+IFK+N+ YIE +N G++ YKLG N+F+DLTNEEF A N+ +S +R +T
Sbjct: 60 FSIFKENVNYIEASNNAGDKPYKLGVNQFADLTNEEFIATR---NKFKGHMSSSITRTTT 116
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
FKY+NVT P+++DWR++GAVT +K+QG CG CWAFSAVAA EGI +++ G L+ LSEQ+
Sbjct: 117 FKYENVT-APSTVDWRQEGAVTPVKNQGTCGCCWAFSAVAATEGIHKLSTGNLVSLSEQE 175
Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
LVDC T + GC GGLMD AF++II+N GL TEA YPY+ +GTC+ +E ATI+
Sbjct: 176 LVDCDTSGADQGCQGGLMDDAFKFIIQNGGLNTEAQYPYQGVDGTCNTNEEATHVATITG 235
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
YED+P +EQAL QAV+NQP+S+ +DASG F Y+SGV CG DHGVAVVG+G +
Sbjct: 236 YEDVPSNNEQALQQAVANQPISIAIDASGSDFQNYQSGVFTGSCGTQLDHGVAVVGYGVS 295
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
++ G KYWL+KNSWG WGE GYIR+ RD GLCG+A SYP A
Sbjct: 296 DD--GTKYWLVKNSWGADWGEEGYIRMQRDVDAPEGLCGLAMQPSYPTA 342
>gi|255580659|ref|XP_002531152.1| cysteine protease, putative [Ricinus communis]
gi|223529265|gb|EEF31237.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 373 bits (958), Expect = e-101, Method: Compositional matrix adjust.
Identities = 183/338 (54%), Positives = 236/338 (69%), Gaps = 15/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+I + ASQ ++ R++ + SI EKHE+WM + R Y D EK +R IFK+N++ I
Sbjct: 12 LALIFFLGALASQAIA-RTLQDASIHEKHEEWMTRFKRVYSDAKEKEIRYKIFKENVQRI 70
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E NK ++YKLG N+F+DLTNEEF+ T NR + SS+ F+Y+N+T VP+
Sbjct: 71 ESFNKASEKSYKLGINQFADLTNEEFK---TSRNRFKGHMC--SSQAGPFRYENITAVPS 125
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
S+DWR++GAVT IKDQGQCGSCWAFSAVAAVEGITQ+ KLI LSEQ+LVDC T ++
Sbjct: 126 SMDWRKEGAVTAIKDQGQCGSCWAFSAVAAVEGITQLATSKLISLSEQELVDCDTKGEDQ 185
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLMD AF++I +N+GL TEA+YPY +GTC+ ++E AA I+ +ED+P +E A
Sbjct: 186 GCQGGLMDDAFKFIEQNQGLTTEANYPYEGSDGTCNTKQEANHAAKINGFEDVPANNEGA 245
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
L++AV+ QPVSV +DA G F FY SG+ DCG DHGVA VG+G E NG YWL+
Sbjct: 246 LMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGDCGTELDHGVAAVGYG---ESNGMNYWLV 302
Query: 312 KNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
KNSWG WGE GYIR+ +D GLCGIA ASYP A
Sbjct: 303 KNSWGTQWGEEGYIRMQKDIDAKEGLCGIAMQASYPTA 340
>gi|224121800|ref|XP_002330656.1| predicted protein [Populus trichocarpa]
gi|222872260|gb|EEF09391.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 373 bits (957), Expect = e-101, Method: Compositional matrix adjust.
Identities = 176/338 (52%), Positives = 229/338 (67%), Gaps = 11/338 (3%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
F IL++ + V+ R + E + +HEQWMA +G+ Y D EK R IFK N+EYI
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N GN+ YKL N+F+D TNE+F+ GY RP + + + ++FKY+NVT VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT---RPMKVTSFKYENVTAVPA 126
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
++DWR+KGAVT IKDQGQCGSCWAFS VAA EGI Q+T GKL+ LSEQ+LVDC ++
Sbjct: 127 TMDWRKKGAVTLIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDIQGEDQ 186
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLM+ FE+II+N G+ TEA+YPY+ +GTC+++K+ + A I+ YE +P E
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
LL+ V+NQP+SV +DA G F FY SGV CG DHGV VG+G E +G KYWL+
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYG--ETSDGTKYWLV 304
Query: 312 KNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
KNSWG +WGE GYIR+ RD GLCGIA +SYP A
Sbjct: 305 KNSWGTSWGEEGYIRMQRDIDTEEGLCGIAMDSSYPTA 342
>gi|356515050|ref|XP_003526214.1| PREDICTED: vignain-like [Glycine max]
Length = 344
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 185/350 (52%), Positives = 237/350 (67%), Gaps = 11/350 (3%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K ++ +F+ + + I SQV+ R +H+ ++ E+HE WMA++G+ YKD EK
Sbjct: 1 MAFTGQKQHMLALFLFLAVGI---SQVMP-RKLHQTALRERHENWMAEYGKIYKDAAEKE 56
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK N+E+IE N GN+ YKLG N +DLT EEF+ G R S + +
Sbjct: 57 KRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTY-EFSTTTFKL 115
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELS 179
+ FKY+NVTD+P +IDWR KGAVT IKDQG QCGSCWAFS VAA EGI QI+ G L+ LS
Sbjct: 116 NGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTVAATEGIYQISTGMLMSLS 175
Query: 180 EQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
EQ+LVDC + +HGC GGLM+ FE+II+N G+++EA+YPY +GTCD KE + AA I
Sbjct: 176 EQELVDCDSVDHGCDGGLMEDGFEFIIKNGGISSEANYPYTAVDGTCDASKEASPAAQIK 235
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
YE +P E+AL QAV+NQPVSV +DA G F FY SGV CG DHGV VVG+GT
Sbjct: 236 GYETVPANSEEALQQAVANQPVSVSIDAGGSGFQFYSSGVFTGQCGTQLDHGVTVVGYGT 295
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
++ +YW++KNSWG WGE GYIR+ R DA GLCGIA ASYP A
Sbjct: 296 TDDGT-HEYWIVKNSWGTQWGEEGYIRMQRGIDALEGLCGIAMDASYPTA 344
>gi|356542631|ref|XP_003539770.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/322 (56%), Positives = 236/322 (73%), Gaps = 13/322 (4%)
Query: 31 RSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
R++ + S+ E+HEQWMAQHG+ YKD EK +R IF+QN++ IE N GN+++KLG N+
Sbjct: 28 RTLEDASMHERHEQWMAQHGKVYKDHHEKELRYKIFQQNVKGIEGFNNAGNKSHKLGVNQ 87
Query: 91 FSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG 150
F+DLT EEF+A+ N+ + + SR STFKY++VT VP ++DWR+KGAVT IK QG
Sbjct: 88 FADLTEEEFKAI----NKLKGYMWSKISRTSTFKYEHVTKVPATLDWRQKGAVTPIKSQG 143
Query: 151 -QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIE 207
+CGSCWAF+AVAA EGIT++T G+LI LSEQ+L+DC T DN GC G++ +AF++I++
Sbjct: 144 LKCGSCWAFAAVAATEGITKLTTGELISLSEQELIDCDTNGDNGGCKWGIIQEAFKFIVQ 203
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
NKGLATEA YPY+ +GTC+ + E A+I YED+P +E ALL AV+NQPVSV VD+
Sbjct: 204 NKGLATEASYPYQAVDGTCNAKVESKHVASIKGYEDVPANNETALLNAVANQPVSVLVDS 263
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
S F FY SGVL+ CG DH V VVG+G +++ G KYWLIKNSWG WGE GYIRI
Sbjct: 264 SDYDFRFYSSGVLSGSCGTTFDHAVTVVGYGVSDD--GTKYWLIKNSWGVYWGEQGYIRI 321
Query: 328 LRDA----GLCGIATAASYPVA 345
RD G+CGIA ASYP+A
Sbjct: 322 KRDVAAKEGMCGIAMQASYPIA 343
>gi|224081320|ref|XP_002306369.1| predicted protein [Populus trichocarpa]
gi|222855818|gb|EEE93365.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 181/351 (51%), Positives = 240/351 (68%), Gaps = 17/351 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M L + FI + ++ V+ + R++ + S+ E+HEQWMAQ+GR YKD+ EK
Sbjct: 1 MRLTKQSQFIC---LALLFVLGAWPSKSAARTLQDVSMYERHEQWMAQYGRVYKDDAEKE 57
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R NIFK+N+ I+ N + ++YKLG N+F+DL+NEEF+A NR + + P
Sbjct: 58 TRYNIFKENVARIDAFNSQTGKSYKLGVNQFADLSNEEFKA---SRNRFKGHMCSPQAGP 114
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
F+Y+NV+ VP ++DWR+KGAVT +KDQGQCG CWAFSAVAA+EGI Q+T GKLI LSE
Sbjct: 115 --FRYENVSAVPATMDWRKKGAVTPVKDQGQCGCCWAFSAVAAMEGINQLTTGKLISLSE 172
Query: 181 QQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
Q++VDC T ++ GC+GGLMD AF++I +NKGL TEA+YPY +GTC+ QKE AA I
Sbjct: 173 QEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYTGTDGTCNTQKEATHAAKI 232
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
+ +ED+P E AL++AV+ QPVSV +DA G F FY SG+ CG DHGV VG+G
Sbjct: 233 TGFEDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFYSSGIFTGSCGTQLDHGVTAVGYG 292
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
++ G KYWL+KNSWG WGE GYIR+ +D GLCGIA ASYP A
Sbjct: 293 ISD---GTKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPSA 340
>gi|297819568|ref|XP_002877667.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
gi|297323505|gb|EFH53926.1| hypothetical protein ARALYDRAFT_348033 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 178/343 (51%), Positives = 242/343 (70%), Gaps = 13/343 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
I F++ I++ + S S + E S +EKHEQWM++ R Y D+ EK R IFK+NL
Sbjct: 4 IIFFLLAIILSSRTSGATSRGGLFEASAIEKHEQWMSRFHRVYSDDSEKTSRFEIFKKNL 63
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS----TFKYQ 126
+++E N N+TY L NEFSDLT+EEF+A YTG P ++R S+ S +F+Y+
Sbjct: 64 KFVESFNMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVP-EGMTRMSTTDSHETVSFRYE 122
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
NV + S+DWRE+GAVT +K Q QCG CWAFSAVAAVEG+T+I +G+L+ LSEQQL+DC
Sbjct: 123 NVGETGESMDWREEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIAKGELVSLSEQQLLDC 182
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
ST+N GC GG+M KAF+YI+EN+G+ E +YPY+ + TC++ AATIS YE +P+
Sbjct: 183 STENDGCDGGIMWKAFDYIVENQGITAEDNYPYQGAQQTCESN--HVAAATISGYETVPQ 240
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
DE+ALL+AVS QPVSV ++ SG F Y G+ N +CG + +H V +VG+G +EE G
Sbjct: 241 NDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTHLNHAVTIVGYGVSEE--GI 298
Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
KYWL+KNSWGE+WGE GY+RI+RD G+CG+A+ A YPVA
Sbjct: 299 KYWLLKNSWGESWGEDGYMRIMRDVDAPQGMCGLASLAYYPVA 341
>gi|356577811|ref|XP_003557016.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 182/345 (52%), Positives = 236/345 (68%), Gaps = 11/345 (3%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
K+ + + ++L + + V+ RS+ + S+ E+HEQWM ++G+ YKD E+ R IF
Sbjct: 4 KNHFCHISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIF 63
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
K+N+ YIE N N+ YKL N+F+DLTNEEF A NR + R +TFKY+
Sbjct: 64 KENVNYIEAFNNAANKRYKLAINQFADLTNEEFIA---PRNRFKGHMCSSIIRTTTFKYE 120
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
NVT VP+++DWR+KGAVT IKDQGQCG CWAFSAVAA EGI +T GKLI LSEQ+LVDC
Sbjct: 121 NVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDC 180
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
T + GC GGLMD AF+++I+N GL TEA+YPY+ +G C+ + AATI+ YED+
Sbjct: 181 DTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNVNEAANDAATITGYEDV 240
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
P +E+AL +AV+NQPVSV +DASG F FYKSGV CG DHGV VG+G + +
Sbjct: 241 PANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND-- 298
Query: 305 GAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
G +YWL+KNSWG WGE GYIR+ R + GLCGIA ASYP A
Sbjct: 299 GTEYWLVKNSWGTEWGEEGYIRMQRGVNSEEGLCGIAMQASYPTA 343
>gi|224099295|ref|XP_002334495.1| predicted protein [Populus trichocarpa]
gi|222872550|gb|EEF09681.1| predicted protein [Populus trichocarpa]
Length = 342
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 175/338 (51%), Positives = 228/338 (67%), Gaps = 11/338 (3%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
F IL++ + V+ R + E + +HEQWMA +G+ Y D EK R IFK N+EYI
Sbjct: 10 FFAFILILGMWAFEVASRELQESYMSARHEQWMATYGKVYVDAAEKERRFKIFKNNVEYI 69
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N GN+ YKL N+F+D TNE+F+ GY RP + + + ++FKY+NVT VP
Sbjct: 70 ESFNTAGNKPYKLSVNKFADQTNEKFKGARNGYRRPFQT---RPMKVTSFKYENVTAVPA 126
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
++DWR+KGAVT IKDQGQCGSCWAFS VAA EGI Q+T GKL+ LSEQ+LVDC ++
Sbjct: 127 TMDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGINQLTTGKLVSLSEQELVDCDNQGEDQ 186
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLM+ FE+II+N G+ TEA+YPY+ +GTC+++K+ + A I+ YE +P E
Sbjct: 187 GCEGGLMEDGFEFIIKNHGITTEANYPYQAADGTCNSKKQASHIAKITGYESVPANSEAE 246
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
LL+ V+NQP+SV +DA G F FY SGV CG DHGV VG+G E +G KYWL+
Sbjct: 247 LLKVVANQPISVSIDAGGSDFQFYSSGVFTGKCGTELDHGVTAVGYG--ETSDGTKYWLV 304
Query: 312 KNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
KNSW +WGE GYIR+ RD GLCGIA +SYP A
Sbjct: 305 KNSWXTSWGEEGYIRMQRDIDAEEGLCGIAMDSSYPTA 342
>gi|357477459|ref|XP_003609015.1| Cysteine proteinase [Medicago truncatula]
gi|355510070|gb|AES91212.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 370 bits (950), Expect = e-100, Method: Compositional matrix adjust.
Identities = 188/352 (53%), Positives = 243/352 (69%), Gaps = 14/352 (3%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M K + + I + I L + CA QV S RS+ S+ E+HEQWM+Q+ + YKD E+
Sbjct: 1 MASKNQLYYSIALTFIFCLGL-CAIQVTS-RSLQVDSMYERHEQWMSQYSKVYKDPQERE 58
Query: 61 MRLNIFKQNLEYIEKANKEGN-RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR 119
R IF N+ YIE N + N + YKLG N+F+DLTNEEF A N+ + ++
Sbjct: 59 ERHKIFTANVNYIEVFNNDANNKLYKLGINQFADLTNEEFIA---SRNKFKGHMCSSIAK 115
Query: 120 PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
+TFKY+NV+ +P+++DWR+KGAVT +K+QGQCG CWAFSAVAA EGIT+++ GKL+ LS
Sbjct: 116 TTTFKYENVSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGITKLSTGKLVSLS 175
Query: 180 EQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
EQ+LVDC T + GC GGLMD AF++II+N GL+TEA YPY+ +GTC+ K AAT
Sbjct: 176 EQELVDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEAAYPYQGVDGTCNANKASIHAAT 235
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I+ YED+P +EQAL +AV+NQP+SV +DASG F FYKSGV + CG DHGV VG+
Sbjct: 236 ITGYEDVPANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFSGSCGTELDHGVTAVGY 295
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
G + G KYWL+KNSWG WGE GYIR+ R DA GLCGIA ASYP A
Sbjct: 296 GVGND--GTKYWLVKNSWGTDWGEEGYIRMQRGVDAAEGLCGIAMQASYPTA 345
>gi|356577813|ref|XP_003557017.1| PREDICTED: uncharacterized protein LOC100801364 [Glycine max]
Length = 890
Score = 369 bits (947), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 179/335 (53%), Positives = 230/335 (68%), Gaps = 11/335 (3%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
++L + + V+ RS+ + S+ E+HEQWM ++G+ YKD E+ R IFK+N+ YIE
Sbjct: 561 MLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIFKENVNYIEAF 620
Query: 77 NKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
N N+ YKL N+F+DLTNEEF A NR + R +TFKY+NVT VP+++D
Sbjct: 621 NNAANKRYKLAINQFADLTNEEFIA---PRNRFKGHMCSSIIRTTTFKYENVTAVPSTVD 677
Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCS 194
WR+KGAVT IKDQGQCG CWAFSAVAA EGI +T GKLI LSEQ+LVDC T + GC
Sbjct: 678 WRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDCDTKGVDQGCE 737
Query: 195 GGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQ 254
GGLMD AF+++I+N GL TEA+YPY+ +G C+ + TI+ YED+P +E+AL +
Sbjct: 738 GGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDVPANNEKALQK 797
Query: 255 AVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNS 314
AV+NQPVSV +DASG F FYKSGV CG DHGV VG+G + + G +YWL+KNS
Sbjct: 798 AVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND--GTEYWLVKNS 855
Query: 315 WGETWGESGYIRILR----DAGLCGIATAASYPVA 345
WG WGE GYIR+ R + GLCGIA ASYP A
Sbjct: 856 WGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 890
>gi|37780047|gb|AAP32196.1| cysteine protease 8 [Trifolium repens]
Length = 343
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 173/324 (53%), Positives = 227/324 (70%), Gaps = 11/324 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
V+ R++ + S+ E+H QWM+Q+G+ YKD E+ R IF +N+ Y+E +N + ++YKLG
Sbjct: 25 VTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFTENVNYVEASNADDTKSYKLG 84
Query: 88 TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F+DLTNEEF A N+ + +R +TFKY+NV+ +P+++DWR+KGAVT +K
Sbjct: 85 INQFADLTNEEFVA---SRNKFKGHMCSSITRTTTFKYENVSAIPSTVDWRKKGAVTPVK 141
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYI 205
+QGQCG CWAFSAVAA EGI +++ GKLI LSEQ+LVDC T + GC GGLMD AF++I
Sbjct: 142 NQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKFI 201
Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
I+N GL+TEA YPY +GTC+ K A TI+ YED+P EQAL +AV+NQP+SV +
Sbjct: 202 IQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVAI 261
Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
DASG F FYKSGV CG DHGV VG+G + + G KYWL+KNSWG WGE GYI
Sbjct: 262 DASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND--GTKYWLVKNSWGTDWGEEGYI 319
Query: 326 RILRDA----GLCGIATAASYPVA 345
+ R GLCGIA ASYP A
Sbjct: 320 MMQRGVEAAEGLCGIAMQASYPTA 343
>gi|356539398|ref|XP_003538185.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 368 bits (945), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 180/349 (51%), Positives = 234/349 (67%), Gaps = 12/349 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+ F+K + + LV + + R++ + + E+HEQWMA HG+ Y EK +
Sbjct: 1 MAFKKVLFQYFTLALCLVFAFCAFEGNARTLEDAPMRERHEQWMAIHGKVYTHSYEKEQK 60
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
FK+N++ IE N GN+ YKLG N F+DLTNEEF+A+ NR V + +R T
Sbjct: 61 YQTFKENVQRIEAFNHAGNKPYKLGINHFADLTNEEFKAI----NRFKGHVCSKITRTPT 116
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F+Y+N+T VP ++DWR++GAVT IKDQGQCG CWAFSAVAA EGIT+++ GKLI LSEQ+
Sbjct: 117 FRYENMTAVPATLDWRQEGAVTPIKDQGQCGCCWAFSAVAATEGITKLSTGKLISLSEQE 176
Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
LVDC T + GC GGLMD AF++I++NKGLA EA YPY +GTC+ + E A +I
Sbjct: 177 LVDCDTKGVDQGCEGGLMDDAFKFILQNKGLAAEAIYPYEGVDGTCNAKAEGNHATSIKG 236
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
YED+P E ALL+AV+NQPVSV ++ASG F FY GV CG N DHGV VG+G +
Sbjct: 237 YEDVPANSESALLKAVANQPVSVAIEASGFEFQFYSGGVFTGSCGTNLDHGVTAVGYGVS 296
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
++ G KYWL+KNSWG WG+ GYIR+ RD GLCGIA ASYP A
Sbjct: 297 DD--GTKYWLVKNSWGVKWGDKGYIRMQRDVAAKEGLCGIAMLASYPNA 343
>gi|84181681|gb|AAW78661.2| senescence-specific cysteine protease [Nicotiana tabacum]
Length = 349
Score = 368 bits (944), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 178/346 (51%), Positives = 235/346 (67%), Gaps = 13/346 (3%)
Query: 11 IPMFVIIILVITC----ASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
+ ++ + L C +SQV R + +E ++ +H+QW+ H + YKD EK +R I
Sbjct: 6 LSQYLCLALFFICLGLWSSQVALSRPINYEATMRARHDQWIVHHEKVYKDLNEKEVRFQI 65
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
FK+N+E IE N ++ YKLG N+FSDLTNEEFR L+TGY R P V S + F+Y
Sbjct: 66 FKENVERIEAFNAGEDKGYKLGFNKFSDLTNEEFRVLHTGYKRSHPKVMTSSKGKTHFRY 125
Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
NVTD+P ++DWR+KGAVT IKDQ +CG CWAFSAVAA+EG+ Q+ G+LI LSEQ+LVD
Sbjct: 126 TNVTDIPPTMDWRKKGAVTPIKDQKECGCCWAFSAVAAMEGLHQLKTGELIPLSEQELVD 185
Query: 186 CST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
C ++ GCSGGL+D AF++I++NKGL TE +YPY+ E+G C+ +K AA I+ YED
Sbjct: 186 CDVEGEDEGCSGGLLDTAFDFILKNKGLTTEVNYPYKGEDGVCNKKKSALSAAKITGYED 245
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P E+ALLQAV+NQPVSV +D S F FY SGV + C +H V VG+G +
Sbjct: 246 VPANSEKALLQAVANQPVSVAIDGSSFDFQFYSSGVFSGSCSTWLNHAVTAVGYGATTD- 304
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
G KYW+IKNSWG WG+SGY+RI RD GLCG+A ASYP A
Sbjct: 305 -GTKYWIIKNSWGSKWGDSGYMRIKRDVHEKEGLCGLAMDASYPTA 349
>gi|356545063|ref|XP_003540965.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 361
Score = 367 bits (943), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 180/345 (52%), Positives = 234/345 (67%), Gaps = 11/345 (3%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
K+ + + ++L + + V+ RS+ + S+ E+HEQWM ++G+ YKD E+ R IF
Sbjct: 22 KNHFCHISLAMLLCMAFLAFQVTCRSLQDASMYERHEQWMTRYGKVYKDPQEREKRFRIF 81
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
K+N+ YIE N N+ YKL N+F+DLTNEEF A NR + R +TFKY+
Sbjct: 82 KENVNYIEAFNNAANKRYKLAINQFADLTNEEFIA---PRNRFKGHMCSSIIRTTTFKYE 138
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
NVT VP+++DWR+KGAVT IKDQGQCG CWAFSAVAA EGI +T GKLI LSEQ+LVDC
Sbjct: 139 NVTAVPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALTSGKLISLSEQELVDC 198
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
T + GC GGLMD AF+++I+N GL TEA+YPY+ +G C+ + TI+ YED+
Sbjct: 199 DTKGVDQGCEGGLMDDAFKFVIQNHGLNTEANYPYKGVDGKCNANEAANDVVTITGYEDV 258
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
P +E+AL +AV+NQPVSV +DASG F FYKSGV CG DHGV VG+G + +
Sbjct: 259 PANNEKALQKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND-- 316
Query: 305 GAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
G +YWL+KNSWG WGE GYIR+ R + GLCGIA ASYP A
Sbjct: 317 GTEYWLVKNSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 361
>gi|356515086|ref|XP_003526232.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 367 bits (941), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 178/337 (52%), Positives = 236/337 (70%), Gaps = 11/337 (3%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++L +T + V+ R++ + S+ E+HEQWM ++G+ YKD E+ R +FK+N+ YIE
Sbjct: 12 LAMLLCMTFLAFQVTCRTLQDASMYERHEQWMTRYGKVYKDPQEREKRFRVFKENVNYIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
N N++YKLG N+F+DLTN+EF A G+ + S R +TFK++NVT P++
Sbjct: 72 AFNNAANKSYKLGINQFADLTNKEFIAPRNGFKGHMCS---SIIRTTTFKFENVTATPST 128
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHG 192
+DWR+KGAVT IKDQGQCG CWAFSAVAA EGI ++ GKLI LSEQ+LVDC T + G
Sbjct: 129 VDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDQG 188
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GGLMD AF++II+N GL TEA+YPY+ +G C+ + AATI+ YED+P +E AL
Sbjct: 189 CEGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEAAKNAATITGYEDVPANNEMAL 248
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV+NQPVSV +DASG F FYKSGV CG DHGV VG+G +++ G +YWL+K
Sbjct: 249 QKAVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD--GTEYWLVK 306
Query: 313 NSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
NSWG WGE GYIR+ R + GLCGIA ASYP A
Sbjct: 307 NSWGTEWGEEGYIRMQRGVDSEEGLCGIAMQASYPTA 343
>gi|357167190|ref|XP_003581045.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 415
Score = 367 bits (941), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 179/341 (52%), Positives = 232/341 (68%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
F+I IL TCA ++ R + + S+V +HEQWMA++GR Y D EKA RL +FK N+ +
Sbjct: 82 FLIAILACTCAVSALAARDLTDDLSMVARHEQWMAKYGRVYNDVAEKAQRLEVFKANVAF 141
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + L N+F+D+T +EFRA +TGY +PVP+ R + FKY NV+
Sbjct: 142 IELVNA-GNDKFSLEANQFADMTVDEFRAAHTGY-KPVPA---NKGRTTQFKYANVSLDA 196
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P S+DWR KGAVT IKDQGQCG CWAFS VA+VEGI +++ GKLI LSEQ+LVDC D
Sbjct: 197 LPASMDWRAKGAVTPIKDQGQCGCCWAFSTVASVEGIVKLSTGKLISLSEQELVDCDVDG 256
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
+ GC GGLMD AFE+II+N GL TE +YPY + +C++ KE A+I YED+P D
Sbjct: 257 MDQGCEGGLMDNAFEFIIDNGGLTTEGNYPYTGTDDSCNSNKESNDVASIKGYEDVPSND 316
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E +LL+AV+ QPVS+ VD F FYK GVL+ CG DHG+A VG+G + G K+
Sbjct: 317 ETSLLKAVAAQPVSIAVDGGDNLFRFYKGGVLSGACGTELDHGIAAVGYGITSD--GTKF 374
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
WL+KNSWG +WGE G+IR+ RD GLCG+A SYP A
Sbjct: 375 WLMKNSWGTSWGEKGFIRMERDIADEEGLCGLAMQPSYPTA 415
>gi|312281697|dbj|BAJ33714.1| unnamed protein product [Thellungiella halophila]
Length = 347
Score = 366 bits (940), Expect = 8e-99, Method: Compositional matrix adjust.
Identities = 172/343 (50%), Positives = 239/343 (69%), Gaps = 11/343 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIV--EKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
I +F+I+ L+ + + R + + ++ ++H++WMA+HGR Y D EK R +FK+
Sbjct: 6 IQIFLIVSLISSFCLSITLSRPLDDNELIMQKRHDEWMAKHGRVYADMKEKNNRYVVFKR 65
Query: 69 NLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+E IE+ N RT+KL N+F+DLTN+EFR++YTGY S+ ++ S+F+YQN
Sbjct: 66 NVERIERLNNVPAGRTFKLAVNQFADLTNDEFRSMYTGYKGGSVLSSQSGTKTSSFRYQN 125
Query: 128 VTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
V+ +P S+DWR+KGAVT IK+QG CG CWAFSAVAA+EG T+I +GKLI LSEQQLVD
Sbjct: 126 VSSGALPVSVDWRKKGAVTPIKNQGTCGCCWAFSAVAAIEGATKIKKGKLISLSEQQLVD 185
Query: 186 CSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
C T++ GCSGGLMD AFE+I+ GL TE++YPY+ ++ TC + K A +I+ YED+P
Sbjct: 186 CDTNDFGCSGGLMDTAFEHIMATGGLTTESNYPYKGKDATCKIKNTKPTATSITGYEDVP 245
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
DE+AL++AV++QPVS+ ++ G F FY SGV +C DH V VG+G + NG
Sbjct: 246 VNDEKALMKAVAHQPVSIGIEGGGFDFQFYGSGVFTGECTTYLDHAVTAVGYG--QSSNG 303
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+KYW+IKNSWG WGESGY+RI +D GLCG+A ASYP
Sbjct: 304 SKYWIIKNSWGTKWGESGYMRIKKDVKDKKGLCGLAMKASYPT 346
>gi|18408828|ref|NP_566920.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|12324451|gb|AAG52191.1|AC012329_18 putative cysteine proteinase; 15366-14136 [Arabidopsis thaliana]
gi|6723404|emb|CAB66413.1| cysteine protease-like protein [Arabidopsis thaliana]
gi|332645009|gb|AEE78530.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 341
Score = 365 bits (938), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 179/343 (52%), Positives = 239/343 (69%), Gaps = 13/343 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
I F++ IL+ + S V S + E S VEKHEQWM++ R Y D+ EK R IF NL
Sbjct: 4 IVFFLLAILLSSRTSGVTSRGGLFEASAVEKHEQWMSRFNRVYSDDSEKTSRFEIFTNNL 63
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS----TFKYQ 126
+++E N N+TY L NEFSDLT+EEF+A YTG P ++R S+ S +F+Y+
Sbjct: 64 KFVESINMNTNKTYTLDVNEFSDLTDEEFKARYTGLVVP-EGMTRISTTDSHETVSFRYE 122
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
NV + S+DW ++GAVT +K Q QCG CWAFSAVAAVEG+T+I G+L+ LSEQQL+DC
Sbjct: 123 NVGETGESMDWIQEGAVTSVKHQQQCGCCWAFSAVAAVEGMTKIANGELVSLSEQQLLDC 182
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
ST+N+GC GG+M KAF+YI EN+G+ TE +YPY+ + TC++ AATIS YE +P+
Sbjct: 183 STENNGCGGGIMWKAFDYIKENQGITTEDNYPYQGAQQTCESN--HLAAATISGYETVPQ 240
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
DE+ALL+AVS QPVSV ++ SG F Y G+ N +CG H V +VG+G +EE G
Sbjct: 241 NDEEALLKAVSQQPVSVAIEGSGYEFIHYSGGIFNGECGTQLTHAVTIVGYGVSEE--GI 298
Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
KYWL+KNSWGE+WGE+GY+RI+RD G+CG+A+ A YPVA
Sbjct: 299 KYWLLKNSWGESWGENGYMRIMRDVDSPQGMCGLASLAYYPVA 341
>gi|37780051|gb|AAP32198.1| cysteine protease 12 [Trifolium repens]
Length = 343
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 182/340 (53%), Positives = 236/340 (69%), Gaps = 18/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
F + +L I V+ R++ + SI E+HEQWM +G+ YK+ E+ RL IF +NL+Y
Sbjct: 15 FFCLGLLAIQ-----VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKY 69
Query: 73 IEKANKEGNRT-YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
IE +N GN+ YKLG N+F+DLTNEEF A N+ + R +TFKY+N T V
Sbjct: 70 IEASNNAGNKKPYKLGINQFADLTNEEFIA---SRNKFKGHMCSSIIRTTTFKYEN-TSV 125
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
P+++DWR+KGAVT +K+QGQCG CWAFSA+AA EGI +I+ GKL+ LSEQ+LVDC T+
Sbjct: 126 PSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGV 185
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
+ GC GGLMD AF++II+N G++TEA YPY+ +GTC + AATI+ YED+P +E
Sbjct: 186 DQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNE 245
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
AL +AV+NQP+SV +DASG F FYKSGV CG DHGV VG+G + + G KYW
Sbjct: 246 NALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISND--GTKYW 303
Query: 310 LIKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
L+KNSWG WGE GYIR+ R DA GLCGIA ASYP A
Sbjct: 304 LVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343
>gi|37780045|gb|AAP32195.1| cysteine protease 5 [Trifolium repens]
Length = 343
Score = 365 bits (937), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 182/340 (53%), Positives = 236/340 (69%), Gaps = 18/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
F + +L I V+ R++ + SI E+HEQWM +G+ YK+ E+ RL IF +NL+Y
Sbjct: 15 FFCLGLLAIQ-----VTSRTLQDDSIFERHEQWMTHYGKVYKNPQEREKRLRIFTENLKY 69
Query: 73 IEKANKEGN-RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
IE +N GN + YKLG N+F+DLTNEEF A N+ + R +TFKY+N T V
Sbjct: 70 IEASNNAGNNKPYKLGINQFADLTNEEFIA---SRNKFKGHMCSSIIRTTTFKYEN-TSV 125
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
P+++DWR+KGAVT +K+QGQCG CWAFSA+AA EGI +I+ GKL+ LSEQ+LVDC T+
Sbjct: 126 PSTVDWRKKGAVTPVKNQGQCGCCWAFSAIAATEGIHKISTGKLVSLSEQELVDCDTNGV 185
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
+ GC GGLMD AF++II+N G++TEA YPY+ +GTC + AATI+ YED+P +E
Sbjct: 186 DQGCEGGLMDDAFKFIIQNNGISTEAGYPYQGVDGTCKANEASTSAATITGYEDVPANNE 245
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
AL +AV+NQP+SV +DASG F FYKSGV CG DHGV VG+G + + G KYW
Sbjct: 246 NALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGISND--GTKYW 303
Query: 310 LIKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
L+KNSWG WGE GYIR+ R DA GLCGIA ASYP A
Sbjct: 304 LVKNSWGTDWGEEGYIRMQRSIDAAEGLCGIAMQASYPTA 343
>gi|356543122|ref|XP_003540012.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 342
Score = 364 bits (935), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 178/327 (54%), Positives = 225/327 (68%), Gaps = 8/327 (2%)
Query: 23 CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR 82
C SQV S R +H+ S+ E+HEQWM ++G+ YKD E R IF+ N+E+IE N GN+
Sbjct: 20 CTSQVKS-RKLHDASMYERHEQWMEKYGKVYKDSAEXEKRFLIFENNVEFIESFNAAGNK 78
Query: 83 TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
YKL N +D TNEEF A + GY R +++ + FKY+NVTD+P ++DWR+KG
Sbjct: 79 PYKLSINHLADQTNEEFMASHKGYKGSHWQGLRITTQ-TPFKYENVTDIPWAVDWRQKGD 137
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAF 202
T IKDQGQCG CWAFSAVAA EGI QIT G L+ LSEQ+LVDC + +HGC GGLM+ F
Sbjct: 138 ATSIKDQGQCGICWAFSAVAATEGIYQITTGNLVSLSEQELVDCDSVDHGCDGGLMEHGF 197
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
E+II+N G+++EA+YPY GTCD KE + A I YE +P E+ L +AV+NQPVS
Sbjct: 198 EFIIKNGGISSEANYPYTAVNGTCDTNKEASPGAQIKGYETVPVNCEEELQKAVANQPVS 257
Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
V +DA G AF FY SGV CG DHGV VG+G+ ++ G +YW++KNSWG WGE
Sbjct: 258 VSIDAGGSAFQFYSSGVFTGQCGTQLDHGVTAVGYGSTDD--GIQYWIVKNSWGTQWGEE 315
Query: 323 GYIRILR--DA--GLCGIATAASYPVA 345
GYIR+LR DA GLCGIA ASYP A
Sbjct: 316 GYIRMLRGIDAQEGLCGIAMDASYPTA 342
>gi|18401420|ref|NP_565649.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|4314384|gb|AAD15594.1| cysteine proteinase [Arabidopsis thaliana]
gi|17381154|gb|AAL36389.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|20465849|gb|AAM20029.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|330252901|gb|AEC07995.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 348
Score = 363 bits (931), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 184/345 (53%), Positives = 240/345 (69%), Gaps = 14/345 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F++ I + S S S+ E S +EKHEQWMA+ R Y DE EK R NIFK+NLE+
Sbjct: 6 IFILTIFLSYRTSLATSRGSLFEASAIEKHEQWMARFNRVYSDETEKRNRFNIFKKNLEF 65
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRP--VPSVSRQSSRPST--FKYQNV 128
++ N TYK+ NEFSDLT+EEFRA +TG P + +S SS +T F+Y NV
Sbjct: 66 VQNFNMNNKITYKVDINEFSDLTDEEFRATHTGLVVPEAITRISTLSSGKNTVPFRYGNV 125
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+D S+DWR++GAVT +K QG+CG CWAFSAVAAVEGIT+IT+G+L+ LSEQQL+DC
Sbjct: 126 SDNGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDR 185
Query: 189 D-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV---AATISKYEDL 244
D N GC GG+M KAFEYII+N+G+ TE +YPY+ + TC + + AATIS YE +
Sbjct: 186 DYNQGCRGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETV 245
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
P +E+ALLQAVS QPVSV ++ +G AF Y GV N +CG + H V +VG+G +EE
Sbjct: 246 PMNNEEALLQAVSQQPVSVGIEGTGAAFRHYSGGVFNGECGTDLHHAVTIVGYGMSEE-- 303
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
G KYW++KNSWGETWGE+GY+RI RD G+CG+A A YP+A
Sbjct: 304 GTKYWVVKNSWGETWGENGYMRIKRDVDAPQGMCGLAILAFYPLA 348
>gi|297826061|ref|XP_002880913.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
gi|297326752|gb|EFH57172.1| hypothetical protein ARALYDRAFT_481640 [Arabidopsis lyrata subsp.
lyrata]
Length = 347
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 182/344 (52%), Positives = 238/344 (69%), Gaps = 13/344 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F++ I + S S + E S +EKHEQWMA+ R Y DE EK R NIFK+NLE+
Sbjct: 6 IFILTIFLSYRTSLATSRGGLFEASPIEKHEQWMARFNRVYSDESEKRNRFNIFKKNLEF 65
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRP--VPSVSRQSSRPST-FKYQNVT 129
++ N N TYKL NEFSDLT+EEFRA +TG P + +S SS + F+Y NV+
Sbjct: 66 VQSFNMNKNITYKLDVNEFSDLTDEEFRATHTGLVVPEEITGISTLSSDKTVPFRYGNVS 125
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
D S+DWR++GAVT +K QG+CG CWAFSAVAAVEGIT+IT+G+L+ LSEQQL+DC TD
Sbjct: 126 DTGESMDWRQEGAVTPVKYQGRCGGCWAFSAVAAVEGITKITKGELVSLSEQQLLDCDTD 185
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV---AATISKYEDLP 245
N GC GG+M KAFEYII+N+G+ TE +YPY+ + TC + + AATIS YE +P
Sbjct: 186 YNQGCHGGIMSKAFEYIIKNQGITTEDNYPYQESQQTCSSSTTLSSSFRAATISGYETVP 245
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+E+ALLQAVS QPVSV ++ +G F Y G+ N +CG + H V +VG+G +EE G
Sbjct: 246 MNNEEALLQAVSQQPVSVGIEGTGAGFRHYSGGIFNGECGTDLHHAVTIVGYGMSEE--G 303
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
KYW++KNSWGETWGE G++RI RD G+CG+A A YP+A
Sbjct: 304 TKYWVVKNSWGETWGEDGFMRIKRDVDAPQGMCGLAMLAFYPLA 347
>gi|357474573|ref|XP_003607571.1| Cysteine proteinase EP-B [Medicago truncatula]
gi|34329348|gb|AAQ63885.1| putative cysteine proteinase [Medicago truncatula]
gi|355508626|gb|AES89768.1| Cysteine proteinase EP-B [Medicago truncatula]
Length = 345
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 181/326 (55%), Positives = 230/326 (70%), Gaps = 14/326 (4%)
Query: 28 VSGRSMHEPSIV-EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN-RTYK 85
V+ R++ + SI+ EKHEQWM +G+ YKD E+ RL IFK+N+ YIE +N GN + YK
Sbjct: 26 VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85
Query: 86 LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
LG N+F+DLTNEEF A N+ + ++ STFKY+N + VP+++DWR+KGAVT
Sbjct: 86 LGINQFADLTNEEFIA---SRNKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGAVTP 141
Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFE 203
+K+QGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC T + GC GGLMD AF+
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
+II+N GL TEA YPY+ +GTC K A TI+ YED+P +EQAL +AV+NQP+SV
Sbjct: 202 FIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQPISV 261
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
+DASG F FYKSGV CG DHGV VG+G + G KYWL+KNSWG WGE G
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGND--GTKYWLVKNSWGTDWGEEG 319
Query: 324 YIRILR--DA--GLCGIATAASYPVA 345
YI++ R DA GLCGIA ASYP A
Sbjct: 320 YIKMQRGVDAAEGLCGIAMEASYPTA 345
>gi|357474579|ref|XP_003607574.1| Cysteine protease [Medicago truncatula]
gi|355508629|gb|AES89771.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 362 bits (930), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 178/326 (54%), Positives = 229/326 (70%), Gaps = 14/326 (4%)
Query: 28 VSGRSMHEPSIV-EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN-RTYK 85
V+ R++ + SI+ EKHEQWM +G+ YKD E+ RL IFK+N+ YIE +N GN + YK
Sbjct: 26 VTSRTLQDDSIIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNKLYK 85
Query: 86 LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
LG N+F+D+TNEEF A N+ + ++ STFKY+N + VP+++DWR+KGAVT
Sbjct: 86 LGINQFADITNEEFIA---SRNKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGAVTP 141
Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFE 203
+K+QGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC T + GC GGLMD AF+
Sbjct: 142 VKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDDAFK 201
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
+II+N GL TEA YPY+ +GTC + AATI+ YED+P +E AL +AV+NQP+SV
Sbjct: 202 FIIQNHGLHTEAQYPYQGVDGTCSANETSTPAATIAGYEDVPANNENALQKAVANQPISV 261
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
+DASG F FYKSGV CG DHGV VG+G + + G KYWL+KNSWG WGE G
Sbjct: 262 AIDASGSDFQFYKSGVFTGSCGTQLDHGVTAVGYGISND--GTKYWLVKNSWGNDWGEEG 319
Query: 324 YIRILRDA----GLCGIATAASYPVA 345
YIR+ R GLCGIA ASYP A
Sbjct: 320 YIRMQRSVDAAQGLCGIAMMASYPTA 345
>gi|224093956|ref|XP_002310053.1| predicted protein [Populus trichocarpa]
gi|224147016|ref|XP_002336386.1| predicted protein [Populus trichocarpa]
gi|222834869|gb|EEE73318.1| predicted protein [Populus trichocarpa]
gi|222852956|gb|EEE90503.1| predicted protein [Populus trichocarpa]
Length = 340
Score = 362 bits (929), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 173/349 (49%), Positives = 240/349 (68%), Gaps = 15/349 (4%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
++F K F ++ ++ S+ + R++ + + E+HEQWM Q+GR YKD+ E+A R
Sbjct: 1 MRFTKQFQFVCLALLFILGAWPSKSTA-RTLLDAPMYERHEQWMTQYGRVYKDDNERATR 59
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
+IFK+N+ I+ N + ++YKLG N+F+DLTNEEF+A NR + + P
Sbjct: 60 YSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNEEFKA---SRNRFKGHMCSPQAGP-- 114
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F+Y+NV+ VP+++DWR++GAVT +KDQGQCG CWAFSAVAA+EGI ++T GKLI LSEQ+
Sbjct: 115 FRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWAFSAVAAMEGINKLTTGKLISLSEQE 174
Query: 183 LVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
+VDC T ++ GC+GGLMD AF++I +NKGL TEA+YPY+ +GTC+ K AA I+
Sbjct: 175 VVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEANYPYKGTDGTCNTNKAAIHAAKITG 234
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
+ED+P E AL++AV+ QPVSV +DA G F FY SG+ C DHGV VG+G +
Sbjct: 235 FEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFYSSGIFTGSCDTQLDHGVTAVGYGVS 294
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
+ G+KYWL+KNSWG WGE GYIR+ +D GLCGIA ASYP A
Sbjct: 295 D---GSKYWLVKNSWGAQWGEEGYIRMQKDISAKEGLCGIAMQASYPTA 340
>gi|356545118|ref|XP_003540992.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 362 bits (928), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 183/352 (51%), Positives = 232/352 (65%), Gaps = 22/352 (6%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K + I +F+++ L I Q++S R +HE S+ E+HEQWMA++G+ YKD EK
Sbjct: 1 MAFTSQKQYTIALFLLLALGI---PQMMS-RKLHETSMRERHEQWMAEYGKVYKDAAEKE 56
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK N+E+IE N N+ YKLG N +DLT EEF+A G RP S+ P
Sbjct: 57 KRFLIFKHNVEFIESFNAAANKPYKLGVNHLADLTVEEFKASRNGLKRPY----ELSTTP 112
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQC-GSCWAFSAVAAVEGITQITRGKLIELS 179
FKY+NVT +P +IDWR KGAVT IKDQGQC GSCWAFS VAA EGI QIT GKL+ LS
Sbjct: 113 --FKYENVTAIPAAIDWRTKGAVTSIKDQGQCAGSCWAFSTVAATEGIHQITTGKLVSLS 170
Query: 180 EQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
EQ+LVDC T + GC GG M+ FE+II+N G+ +EA+YPY+ +G C+ K + A
Sbjct: 171 EQELVDCDTKGVDQGCEGGYMEDGFEFIIKNGGITSEANYPYKAVDGKCN--KATSPVAQ 228
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I YE +P E+ L +AV+NQPVSV +DA+G F FY SG+ N +CG DHGV VG+
Sbjct: 229 IKGYEKVPPNSEKTLQKAVANQPVSVSIDANGEGFMFYSSGIYNGECGTELDHGVTAVGY 288
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
G A NG YWL+KNSWG WGE GY+R+ R GLCGIA +SYP A
Sbjct: 289 GIA---NGTDYWLVKNSWGTQWGEKGYVRMQRGVAAKHGLCGIALDSSYPTA 337
>gi|297794671|ref|XP_002865220.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
gi|297311055|gb|EFH41479.1| senescence-associated gene 12 [Arabidopsis lyrata subsp. lyrata]
Length = 346
Score = 361 bits (926), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 174/343 (50%), Positives = 238/343 (69%), Gaps = 12/343 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEK-HEQWMAQHGRTYKDELEKAMRLNIFKQN 69
+ +F+ + + + + R + I++K H +WM +HGR Y D EK+ R +FK N
Sbjct: 6 MQIFLFVAIFSSFYFSISLSRPLDNELIMQKRHIEWMTKHGRVYADVKEKSNRYVVFKSN 65
Query: 70 LEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS-SRPSTFKYQN 127
+E IE N RT+KL N+F+DLTN+EFR++YTG+ + V S+S QS ++ ++F+YQN
Sbjct: 66 VERIEHLNNIPAGRTFKLAVNQFADLTNDEFRSMYTGF-KGVSSLSSQSQTKTTSFRYQN 124
Query: 128 VTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
V+ +P S+DWR KGAVT IK+QG CG CWAFSAVAA+EG TQI +GKLI LSEQQLVD
Sbjct: 125 VSSGALPISVDWRTKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVD 184
Query: 186 CSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
C T++ GC GGLMD AFE+I+ GL TE++YPY+ E+ TC+++K A +I+ YED+P
Sbjct: 185 CDTNDFGCEGGLMDTAFEHIMATGGLTTESNYPYKGEDATCNSKKTNPKATSITGYEDVP 244
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
DEQAL++AV++QPVSV ++ G F FY SGV +C DH V +G+G + NG
Sbjct: 245 VNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIGYG--QSTNG 302
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
+KYW+IKNSWG WGESGY+RI +D GLCG+A ASYP
Sbjct: 303 SKYWIIKNSWGTKWGESGYMRIQKDIKDKQGLCGLAMKASYPT 345
>gi|118627554|emb|CAL64936.1| putative cysteine protease 8 [Trifolium pratense]
Length = 344
Score = 361 bits (926), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 173/325 (53%), Positives = 226/325 (69%), Gaps = 12/325 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK-EGNRTYKL 86
V+ R++ + S+ E+H QWM+Q+G+ YKD E+ R IFK+N+ YIE N + ++YKL
Sbjct: 25 VTSRTLQDDSMYERHGQWMSQYGKIYKDHQERETRFKIFKENVNYIETFNNADDTKSYKL 84
Query: 87 GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
G N+F+DLTNEEF A N+ + R ++FKY+NV+ +P+++DWR+KGAVT +
Sbjct: 85 GINQFADLTNEEFIA---SRNKFKGHMCSSIMRTTSFKYENVSGIPSTVDWRKKGAVTPV 141
Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEY 204
K+QGQCG CWAFSAVAA EGI +++ GKLI LSEQ+LVDC T + GC GGLMD AF++
Sbjct: 142 KNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKF 201
Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
II+N GL+TEA YPY +GTC+ K A TI+ YED+P EQAL +AV+NQP+SV
Sbjct: 202 IIQNHGLSTEAQYPYEGVDGTCNANKASVQAVTITGYEDVPANSEQALQKAVANQPISVA 261
Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
+DASG F FYKSGV CG DHGV VG+G + + G KYWL+KNSWG WGE GY
Sbjct: 262 IDASGSDFQFYKSGVFTGACGTELDHGVTAVGYGVSND--GTKYWLVKNSWGTDWGEEGY 319
Query: 325 IRILRDA----GLCGIATAASYPVA 345
I + R G+CGIA ASYP A
Sbjct: 320 IMMQRGIEAAEGICGIAMQASYPTA 344
>gi|357474527|ref|XP_003607548.1| Cysteine protease [Medicago truncatula]
gi|358347211|ref|XP_003637653.1| Cysteine protease [Medicago truncatula]
gi|355503588|gb|AES84791.1| Cysteine protease [Medicago truncatula]
gi|355508603|gb|AES89745.1| Cysteine protease [Medicago truncatula]
Length = 345
Score = 360 bits (925), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 182/329 (55%), Positives = 228/329 (69%), Gaps = 13/329 (3%)
Query: 24 ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN-R 82
A QV S + +I EKHEQWM +G+ YKD E+ RL IFK+N+ YIE +N GN +
Sbjct: 23 AIQVTSRTLQDDSNIYEKHEQWMVHYGKVYKDLQERENRLKIFKENVNYIEASNNAGNNK 82
Query: 83 TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
YKLG N+F+DLTNEEF A N+ + ++ STFKY+N + VP+++DWR+KGA
Sbjct: 83 LYKLGINQFADLTNEEFIA---SRNKFKGHMCSSITKTSTFKYENAS-VPSTVDWRKKGA 138
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDK 200
VT +K+QGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC T + GC GGLMD
Sbjct: 139 VTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCDTKGVDQGCEGGLMDD 198
Query: 201 AFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
AF++II+N GL TEA YPY+ +GTC K A TI+ YED+P +EQAL +AV+NQP
Sbjct: 199 AFKFIIQNHGLNTEAQYPYQGVDGTCSANKASIHAVTITGYEDVPANNEQALQKAVANQP 258
Query: 261 VSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWG 320
+SV +DASG F FYKSGV CG DHGV VG+G + G KYWL+KNSWG WG
Sbjct: 259 ISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVGND--GTKYWLVKNSWGTDWG 316
Query: 321 ESGYIRILR--DA--GLCGIATAASYPVA 345
E GYI++ R DA GLCGIA ASYP A
Sbjct: 317 EEGYIKMQRGVDAAEGLCGIAMEASYPTA 345
>gi|5823018|gb|AAD53011.1|AF089848_1 senescence-specific cysteine protease [Brassica napus]
Length = 346
Score = 360 bits (925), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 176/342 (51%), Positives = 235/342 (68%), Gaps = 10/342 (2%)
Query: 11 IPMFVIIILVIT-CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
I +F+I+ LV + C S +S E + +KH++WMA+HGRTY D EK R +FK+N
Sbjct: 6 IKIFLIVSLVSSFCFSTTLSRLLDDELIMQKKHDEWMAEHGRTYADMNEKNNRYVVFKRN 65
Query: 70 LEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
+E IE+ N RT+KL N+F+DLTN+EFR +YTGY S+ ++ ++F+YQNV
Sbjct: 66 VERIERLNNVPAGRTFKLAVNQFADLTNDEFRFMYTGYKGDFVLFSQSQTKSTSFRYQNV 125
Query: 129 --TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+P ++DWR+KGAVT IK+QG CG CWAFSAVAA+EG TQI +GKLI LSEQQLVDC
Sbjct: 126 FFGALPIAVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDC 185
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
T++ GCSGGLMD AFE+I+ GL TE++YPY+ E+ C + K AA+I+ YED+P
Sbjct: 186 DTNDFGCSGGLMDTAFEHIMATGGLTTESNYPYKGEDANCKIKSTKPSAASITGYEDVPV 245
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
DE AL++AV++QPVSV ++ G F FY SGV +C DH V VG+ ++ G+
Sbjct: 246 NDENALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAVGY--SQSSAGS 303
Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
KYW+IKNSWG WGE GY+RI +D GLCG+A ASYP
Sbjct: 304 KYWIIKNSWGTKWGEGGYMRIKKDIKDKEGLCGLAMKASYPT 345
>gi|224076972|ref|XP_002305074.1| predicted protein [Populus trichocarpa]
gi|224106329|ref|XP_002333698.1| predicted protein [Populus trichocarpa]
gi|222837984|gb|EEE76349.1| predicted protein [Populus trichocarpa]
gi|222848038|gb|EEE85585.1| predicted protein [Populus trichocarpa]
Length = 307
Score = 360 bits (925), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 173/316 (54%), Positives = 223/316 (70%), Gaps = 17/316 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++HE+WMAQHGR Y D EK R IFK+N+E IE N +R YKLG N+F+DLTNE
Sbjct: 1 MLKRHEEWMAQHGRVYGDMKEKEKRYLIFKENIERIEAFNNGSDRGYKLGVNKFADLTNE 60
Query: 98 EFRALYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
EFRA++ GY R QSS+ S+F+++N++ +PTS+DWR+ GAVT +KDQG CG C
Sbjct: 61 EFRAMHHGYKR-------QSSKLMSSSFRHENLSAIPTSMDWRKAGAVTPVKDQGTCGCC 113
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLAT 213
WAFSAVAA+EGI ++ GKLI LSEQQLVDC + GC GGLMD AF++I+ N GL +
Sbjct: 114 WAFSAVAAIEGIIKLKTGKLISLSEQQLVDCDVKGVDQGCGGGLMDNAFQFILRNGGLTS 173
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
EA YPY+ +GTC ++K ++ A I+ YED+P +E ALLQAV+ QPVSV V+ G F
Sbjct: 174 EATYPYQGVDGTCKSKKTASIEAKITGYEDVPVNNENALLQAVAKQPVSVAVEGGGYDFQ 233
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
FYKSGV DCG DH V +G+GT +G YWL+KNSWG +WGESGY+R+ R
Sbjct: 234 FYKSGVFKGDCGTYLDHAVTAIGYGT--NSDGTNYWLVKNSWGTSWGESGYMRMQRGIGA 291
Query: 331 -AGLCGIATAASYPVA 345
GLCG+A ASYP A
Sbjct: 292 REGLCGVAMDASYPTA 307
>gi|357483847|ref|XP_003612210.1| Cysteine proteinase [Medicago truncatula]
gi|355513545|gb|AES95168.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 360 bits (923), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 178/344 (51%), Positives = 235/344 (68%), Gaps = 17/344 (4%)
Query: 13 MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
++ I + ++ C A QV S R++ + S+ E+HE+WM +G+ YKD E+ R IF +
Sbjct: 7 LYHISLALVFCLGLWAIQVTS-RTLQDGSMHERHERWMNHYGKVYKDHQEREKRFKIFTE 65
Query: 69 NLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
N++YIE N + N +YKLG N+F+DLTNEEF A N+ + R +TFKY+N
Sbjct: 66 NMKYIEAFNNGDNNESYKLGINQFADLTNEEFVA---SRNKFKGHMCSSIIRTTTFKYEN 122
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
V+ +P+++DWR+KGAVT +K+QGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC
Sbjct: 123 VSAIPSTVDWRKKGAVTPVKNQGQCGCCWAFSAVAATEGIHKLSTGKLVSLSEQELVDCD 182
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
T + GC GGLMD AF++II+N GL TEA YPY+ +GTC+ K A TI+ YED+P
Sbjct: 183 TKGVDQGCEGGLMDDAFKFIIQNHGLNTEAQYPYQGVDGTCNANKASIQATTITGYEDVP 242
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+EQAL +AV+NQP+SV +DASG F FYKSGV CG DHGV VG+G + + G
Sbjct: 243 ANNEQALQKAVANQPISVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND--G 300
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
KYWL+KNSWG WGE GYI + R GLCGIA ASYP A
Sbjct: 301 TKYWLVKNSWGTDWGEEGYIMMQRGVEAAEGLCGIAMQASYPTA 344
>gi|356543038|ref|XP_003539970.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 360 bits (923), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 174/324 (53%), Positives = 227/324 (70%), Gaps = 11/324 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
V+ R++ + S+ E+HE+WMA++ + YKD E+ R IFK+N+ YIE N N+ YKLG
Sbjct: 25 VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAANKPYKLG 84
Query: 88 TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F+DLTNEEF A NR + +R +TFKY+NVT +P+++DWR+KGAVT IK
Sbjct: 85 INQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTPIK 141
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYI 205
DQGQCG CWAFSAVAA EGI + GKLI LSEQ++VDC T ++ GC+GG MD AF++I
Sbjct: 142 DQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFI 201
Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
I+N GL TEA+YPY+ +G C+ + AATI+ YED+P +E+AL +AV+NQPVSV +
Sbjct: 202 IQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAI 261
Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
DASG F FYK+GV CG DHGV VG+G + + G +YWL+KNSWG WGE GYI
Sbjct: 262 DASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSAD--GTQYWLVKNSWGTEWGEEGYI 319
Query: 326 RILRDA----GLCGIATAASYPVA 345
+ R GLCGIA ASYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343
>gi|356515036|ref|XP_003526207.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 336
Score = 359 bits (921), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 175/327 (53%), Positives = 220/327 (67%), Gaps = 18/327 (5%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
SQV+ R +HE S+ E+HEQWM ++G+ YKD EK R IFK N+E+IE N +GN+ Y
Sbjct: 22 SQVMC-RKLHETSMRERHEQWMTEYGKVYKDAAEKDKRFQIFKDNVEFIESFNADGNKPY 80
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
KLG N +DLT EEF+A G+ RP +TFKY+NVT +P +IDWR KGAVT
Sbjct: 81 KLGVNHLADLTVEEFKASRNGFKRP------HEFSTTTFKYENVTAIPAAIDWRTKGAVT 134
Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAF 202
IKDQGQCGSCWAFS +AA EGI QIT GKL+ LSEQ+LVDC T + GC GG M+ F
Sbjct: 135 PIKDQGQCGSCWAFSTIAATEGIHQITTGKLVSLSEQELVDCDTKGVDQGCEGGYMEDGF 194
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
E+II+N G+ +E +YPY+ +G C+ K + A I YE +P E AL +AV+NQPVS
Sbjct: 195 EFIIKNGGITSETNYPYKAVDGKCN--KATSPVAQIKGYEKVPPNSETALQKAVANQPVS 252
Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
V +DA G F FY SG+ N +CG DHGV VG+GTA NG YW++KNSWG WGE
Sbjct: 253 VSIDADGAGFMFYSSGIYNGECGTELDHGVTAVGYGTA---NGTDYWIVKNSWGTQWGEK 309
Query: 323 GYIRILR----DAGLCGIATAASYPVA 345
GY+R+ R GLCGIA +SYP +
Sbjct: 310 GYVRMQRGIAAKHGLCGIALDSSYPTS 336
>gi|18422605|ref|NP_568651.1| senescence-associated protein 12 [Arabidopsis thaliana]
gi|13877737|gb|AAK43946.1|AF370131_1 putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|9758936|dbj|BAB09317.1| senescence-specific cysteine protease [Arabidopsis thaliana]
gi|14532898|gb|AAK64131.1| putative senescence-specific cysteine protease SAG12 [Arabidopsis
thaliana]
gi|332007929|gb|AED95312.1| senescence-associated protein 12 [Arabidopsis thaliana]
Length = 346
Score = 358 bits (919), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 179/352 (50%), Positives = 239/352 (67%), Gaps = 15/352 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M LK + F+ + I C S +S +E + ++H +WM +HGR Y D E+
Sbjct: 1 MALKHMQIFLF----VAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEEN 56
Query: 61 MRLNIFKQNLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS-S 118
R +FK N+E IE N RT+KL N+F+DLTN+EFR++YTG+ + V ++S QS +
Sbjct: 57 NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFRSMYTGF-KGVSALSSQSQT 115
Query: 119 RPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
+ S F+YQNV+ +P S+DWR+KGAVT IK+QG CG CWAFSAVAA+EG TQI +GKLI
Sbjct: 116 KMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLI 175
Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
LSEQQLVDC T++ GC GGLMD AFE+I GL TE++YPY+ E+ TC+++K A
Sbjct: 176 SLSEQQLVDCDTNDFGCEGGLMDTAFEHIKATGGLTTESNYPYKGEDATCNSKKTNPKAT 235
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
+I+ YED+P DEQAL++AV++QPVSV ++ G F FY SGV +C DH V +G
Sbjct: 236 SITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIG 295
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+G E NG+KYW+IKNSWG WGESGY+RI +D GLCG+A ASYP
Sbjct: 296 YG--ESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345
>gi|357167196|ref|XP_003581047.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 338
Score = 358 bits (918), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 175/342 (51%), Positives = 227/342 (66%), Gaps = 16/342 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
F+ ++V T A + R + + I +HEQWMA++GR Y D EKA RL +FK N+
Sbjct: 3 FLFALVVCTFALGALGARDLADDDWLIAARHEQWMARYGRVYSDVAEKARRLEVFKANVG 62
Query: 72 YIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT-- 129
+IE N GN + L N+F+D+T +EFRA++ GY V +R + F+Y NV+
Sbjct: 63 FIESVNA-GNHKFWLEANQFADITKDEFRAMHKGYKMQVIG---SKARATGFRYANVSID 118
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
D+P S+DWR GAVT +KDQGQCG CWAFS VA++EGI +++ GKLI LSEQ+LVDC
Sbjct: 119 DLPASVDWRANGAVTPVKDQGQCGCCWAFSTVASMEGIVKVSTGKLISLSEQELVDCDVG 178
Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC GGLMD AFE+I+ N GL TEADYPY +GTC++ KE +AA+I YED+P
Sbjct: 179 MQNKGCGGGLMDNAFEFIVNNGGLDTEADYPYTGADGTCNSNKESNIAASIKGYEDVPAN 238
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
DE +L +AV+ QPVS+ VD F FYK GVL CG DHGVA VG+G A + G K
Sbjct: 239 DEASLQKAVAAQPVSIAVDGGDDLFRFYKGGVLTGACGTELDHGVAAVGYGVAGD--GTK 296
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
YWL+KNSWG +WGE G+IR+ RD AG+CG+A SYP A
Sbjct: 297 YWLVKNSWGTSWGEDGFIRLERDVADEAGMCGLAMKPSYPTA 338
>gi|357471211|ref|XP_003605890.1| Cysteine proteinase [Medicago truncatula]
gi|355506945|gb|AES88087.1| Cysteine proteinase [Medicago truncatula]
Length = 343
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 175/325 (53%), Positives = 228/325 (70%), Gaps = 13/325 (4%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK-EGNRTYKL 86
V+ R++ + + E+H QWM+Q+G+ YKD E+ R IF +N+ YIE NK + N+ Y L
Sbjct: 25 VTSRTLQD-DMYERHRQWMSQYGKVYKDSQEREKRFKIFTENVNYIEAFNKGDNNKLYTL 83
Query: 87 GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
G N+F+DLTN+EF + N+ + +R STFKY+N + +P+S+DWR+KGAVT +
Sbjct: 84 GVNQFADLTNDEFT---SSRNKFKGHMCSSITRTSTFKYENASAIPSSVDWRKKGAVTPV 140
Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEY 204
K+QGQCG CWAFSAVAA EGI +++ GKLI LSEQ+LVDC T + GC GGLMD AF++
Sbjct: 141 KNQGQCGCCWAFSAVAATEGIHKLSTGKLISLSEQELVDCDTKGVDQGCEGGLMDDAFKF 200
Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
II+N GL TEA+YPY+ +GTC+ K A TI+ YED+P +EQAL +AV+NQP+SV
Sbjct: 201 IIQNHGLNTEANYPYQGVDGTCNANKGSINAVTITGYEDVPTNNEQALQKAVANQPISVA 260
Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
+DASG F FYKSGV CG DHGV VG+G + + G KYWL+KNSWG WGE GY
Sbjct: 261 IDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSND--GTKYWLVKNSWGTEWGEEGY 318
Query: 325 IRILR--DA--GLCGIATAASYPVA 345
I + R DA GLCGIA ASYP A
Sbjct: 319 IMMQRGVDAAEGLCGIAMQASYPTA 343
>gi|319826926|gb|ADV74756.1| cysteine protease [Lactuca sativa]
Length = 363
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 169/323 (52%), Positives = 222/323 (68%), Gaps = 10/323 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
+ R++++P+++ +HEQWMA HGR Y DE EK +R IFK N+ YI+ N +++Y L
Sbjct: 41 ATSRTLNDPTMIARHEQWMAHHGRIYTDENEKQLRFQIFKNNVAYIDAHNARSDQSYTLE 100
Query: 88 TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F+DLTN+EFRA GY + S S S F+Y NV+ VP +DWR++GAVT +K
Sbjct: 101 VNKFADLTNDEFRASRNGYKKQPDSDSHVVS--GLFRYANVSAVPDEVDWRKEGAVTPVK 158
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYI 205
DQG CG CWAFSAVAA+EGI ++ GKL+ LSEQ+LVDC D + GC GGLM+ AF++I
Sbjct: 159 DQGDCGCCWAFSAVAAMEGINKLENGKLVSLSEQELVDCDIDGIDQGCEGGLMENAFQFI 218
Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
+ KGLA E+ YPY E+G C+ +K AA IS +E +P +E+ALLQAV+NQPVS+ +
Sbjct: 219 EKRKGLAAESVYPYTGEDGICNTKKAAIPAAKISGHEKVPANNEKALLQAVANQPVSIAI 278
Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
DASG F FY GV CG DH + VG+G + G KYWL+KNSWG +WGE+GYI
Sbjct: 279 DASGYEFQFYSGGVFTGSCGTELDHAITAVGYGATMD--GTKYWLMKNSWGASWGENGYI 336
Query: 326 RILRDA----GLCGIATAASYPV 344
RI RD+ GLCGIA SYPV
Sbjct: 337 RIKRDSLAKEGLCGIAMDPSYPV 359
>gi|144905112|dbj|BAF56429.1| cysteine proteinase [Lotus japonicus]
Length = 341
Score = 358 bits (918), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 179/348 (51%), Positives = 234/348 (67%), Gaps = 21/348 (6%)
Query: 6 EKSFIIPMFVIIILVITCASQVVSGRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRL 63
+K +I+ +F+++ + I S+V+S R +HE S++E+HEQWMA++ + YKD EK R
Sbjct: 7 QKQYILALFLLLAVGI---SRVIS-RELHETETSLIERHEQWMAKYDKVYKDAAEKEKRF 62
Query: 64 NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
IFK N+E+IE N GN+ YKLG N +DLT EEF+A G R ++F
Sbjct: 63 LIFKDNVEFIESFNAAGNKPYKLGVNHLADLTIEEFKASRNGLKRSYD----YEVGTTSF 118
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
KY+NVT +P S+DWR+KGAVT IKDQGQCGSCWAFS VAA EGI +I+ GKL+ LSEQ+L
Sbjct: 119 KYENVTAIPASVDWRKKGAVTPIKDQGQCGSCWAFSTVAATEGIHKISTGKLVSLSEQEL 178
Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
VDC + GC GG M+ FE+II+N G+ TEA+YPY+ +G+C N A AA I Y
Sbjct: 179 VDCDRKGTDQGCEGGYMEDGFEFIIKNGGITTEANYPYKAVDGSCKNA--TAPAAQIKGY 236
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
E +P E+ALL+AV+NQPVSV +DA+ +F FY SG+ +CG DHGV VG+G A
Sbjct: 237 EKVPVNSEKALLKAVANQPVSVSIDAADGSFMFYSSGIFTGECGTELDHGVTAVGYGRA- 295
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
NG YW++KNSWG WGE GYIR+ R GLCGIA +SYP A
Sbjct: 296 --NGTDYWIVKNSWGTVWGEQGYIRMQRGIAAKEGLCGIAMDSSYPTA 341
>gi|357160300|ref|XP_003578721.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 349
Score = 357 bits (917), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 174/347 (50%), Positives = 232/347 (66%), Gaps = 16/347 (4%)
Query: 11 IPMFVIIILVITC---ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
IP ++ +V+ C S V+S R + + ++VE+HEQWMAQHGR YKD EKA R F+
Sbjct: 3 IPKVFLLAVVLGCICLCSTVLSARELGDAAMVERHEQWMAQHGRVYKDGAEKARRFEAFR 62
Query: 68 QNLEYIEKANKEGNR-TYKLGTNEFSDLTNEEFRALYT--GYNRPVPSVSRQSSRPSTFK 124
N+ +IE N GNR + LG N+F+DLTN+EFRA T G+ + + ++S TF+
Sbjct: 63 NNVVFIESFNAAGNRRKFWLGVNQFTDLTNDEFRATKTNKGFIKRNAAAVNKASPTGTFR 122
Query: 125 YQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
Y NV+ +P ++DWR KGAVT IK+QGQCG CWAFSAVAA EGI Q++ GKL+ LSEQ+
Sbjct: 123 YSNVSADALPAAVDWRAKGAVTPIKNQGQCGCCWAFSAVAATEGIVQLSTGKLVPLSEQE 182
Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
LVDC + +HGC GG MD AFE+II+N GL +E +YPY ++G C + ATI
Sbjct: 183 LVDCDANGADHGCEGGEMDDAFEFIIKNGGLTSETNYPYTAQDGQCKAKNTINSVATIKG 242
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
YED+P DE +L++AV+ QPVSV VD F Y GVL+ CG + DHG+ VG+G A
Sbjct: 243 YEDVPANDEASLMKAVAAQPVSVAVDGGDMVFQHYAGGVLSGSCGTSLDHGIVAVGYGAA 302
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
++ G K+WL+KNSWG TWGE GYIR+ +D G+CG+A SYP
Sbjct: 303 DD--GTKFWLMKNSWGTTWGEDGYIRMEKDVADAGGMCGLAMQPSYP 347
>gi|356517426|ref|XP_003527388.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 357 bits (916), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 174/324 (53%), Positives = 226/324 (69%), Gaps = 11/324 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
V+ R++ + S+ E+H QWMA++ + YKD E+ R IFK+N+ YIE N N++YKL
Sbjct: 25 VTCRTLQDASMYERHAQWMARYAKVYKDPQEREKRFRIFKENVNYIETFNSADNKSYKLD 84
Query: 88 TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F+DLTNEEF A NR + +R +TFKY+NVT +P+++DWR+KGAVT IK
Sbjct: 85 INQFADLTNEEFIA---PRNRFKGHMCSSITRTTTFKYENVTVIPSTVDWRQKGAVTPIK 141
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYI 205
DQGQCG CWAFSAVAA EGI + GKLI LSEQ++VDC T + GC+GG MD AF++I
Sbjct: 142 DQGQCGCCWAFSAVAATEGIHALNAGKLISLSEQEVVDCDTKGQDQGCAGGFMDGAFKFI 201
Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
I+N GL TE +YPY+ +G C+ + AATI+ YED+P +E+AL +AV+NQPVSV +
Sbjct: 202 IQNHGLNTEPNYPYKAADGKCNAKAAANHAATITGYEDVPVNNEKALQKAVANQPVSVAI 261
Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
DASG F FYKSGV CG DHGV VG+G + + G +YWL+KNSWG WGE GYI
Sbjct: 262 DASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSAD--GTEYWLVKNSWGTEWGEEGYI 319
Query: 326 RILR----DAGLCGIATAASYPVA 345
R+ R + GLCGIA ASYP A
Sbjct: 320 RMQRGVKAEEGLCGIAMMASYPTA 343
>gi|356577763|ref|XP_003556992.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 357 bits (916), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 172/342 (50%), Positives = 233/342 (68%), Gaps = 14/342 (4%)
Query: 13 MFVIIILVITCASQV---VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
+ I + ++ C+ + V+ R++ + S+ E+HE+WM ++ + YKD E+ R IFK+N
Sbjct: 7 FYQISLALLFCSGFLTFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ YIE N N+ Y LG N+F+DLTNEEF A NR + +R +TFKY+NVT
Sbjct: 67 VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVT 123
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
+P+++DWR+KGAVT IKDQGQCG CWAFSAVAA EGI ++ GKLI LSEQ++VDC T
Sbjct: 124 AIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTK 183
Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
++ GC+GG MD AF++II+N GL E +YPY+ +G C+ + ATI+ YED+P
Sbjct: 184 GEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVN 243
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
+E+AL +AV+NQPVSV +DASG F FY+SGV CG DHGV VG+G + + G +
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSAD--GTE 301
Query: 308 YWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
YWL+KNSWG WGE GYIR+ R + GLCGIA ASYP A
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343
>gi|356517348|ref|XP_003527349.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 357 bits (916), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 172/342 (50%), Positives = 233/342 (68%), Gaps = 14/342 (4%)
Query: 13 MFVIIILVITCASQV---VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
+ I + ++ C+ + V+ R++ + S+ E+HE+WM ++ + YKD E+ R IFK+N
Sbjct: 7 FYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ YIE N N+ Y LG N+F+DLTNEEF A NR + +R +TFKY+NVT
Sbjct: 67 VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVT 123
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
+P+++DWR+KGAVT IKDQGQCG CWAFSAVAA EGI ++ GKLI LSEQ++VDC T
Sbjct: 124 AIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTK 183
Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
++ GC+GG MD AF++II+N GL E +YPY+ +G C+ + ATI+ YED+P
Sbjct: 184 GEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVN 243
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
+E+AL +AV+NQPVSV +DASG F FY+SGV CG DHGV VG+G + + G +
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSAD--GTE 301
Query: 308 YWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
YWL+KNSWG WGE GYIR+ R + GLCGIA ASYP A
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVKAEEGLCGIAMMASYPTA 343
>gi|356543076|ref|XP_003539989.1| PREDICTED: vignain-like [Glycine max]
Length = 343
Score = 357 bits (915), Expect = 6e-96, Method: Compositional matrix adjust.
Identities = 172/324 (53%), Positives = 227/324 (70%), Gaps = 11/324 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
V+ R++ + S+ E+HE+WMA++ + YKD E+ R IFK+N+ YIE N ++ YKLG
Sbjct: 25 VTSRTLQDASMYERHEEWMARYAKVYKDPEEREKRFKIFKENVNYIEAFNNAADKPYKLG 84
Query: 88 TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F+DLTNEEF A N+ + +R +TFKY+NVT +P+++DWR+KGAVT IK
Sbjct: 85 INQFADLTNEEFIAPR---NKFKGHMCSSITRTTTFKYENVTALPSTVDWRQKGAVTPIK 141
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYI 205
DQGQCG CWAFSAVAA EGI + GKLI LSEQ++VDC T ++ GC+GG MD AF++I
Sbjct: 142 DQGQCGCCWAFSAVAATEGIHALNSGKLISLSEQEVVDCDTKGEDQGCAGGFMDGAFKFI 201
Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
I+N GL TEA+YPY+ +G C+ + AATI+ YED+P +E+AL +AV+NQPVSV +
Sbjct: 202 IQNHGLNTEANYPYKAVDGKCNANEAANHAATITGYEDVPVNNEKALQKAVANQPVSVAI 261
Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
DASG F FYK+GV CG DHGV VG+G + + G +YWL+KNSWG WGE GYI
Sbjct: 262 DASGSDFQFYKTGVFTGSCGTQLDHGVTAVGYGVSAD--GTQYWLVKNSWGTEWGEEGYI 319
Query: 326 RILRDA----GLCGIATAASYPVA 345
+ R GLCGIA ASYP A
Sbjct: 320 MMQRGVKAQEGLCGIAMMASYPTA 343
>gi|357452075|ref|XP_003596314.1| Cysteine proteinase [Medicago truncatula]
gi|355485362|gb|AES66565.1| Cysteine proteinase [Medicago truncatula]
Length = 341
Score = 356 bits (914), Expect = 7e-96, Method: Compositional matrix adjust.
Identities = 184/335 (54%), Positives = 234/335 (69%), Gaps = 13/335 (3%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
+ L + S + R++ + E HEQWM QHG+ YK EK R IFK+N+ YIE
Sbjct: 14 LFLCLGLLSFQATSRTLQNDPMYEMHEQWMVQHGKVYKAAHEKQKRFGIFKENVNYIEAF 73
Query: 77 NKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
N GN++YKLG N F+DLTN EF A +N + S +TFKY+NV+DVP+++D
Sbjct: 74 NNVGNKSYKLGLNHFADLTNHEFIAARNKFNGYL-----HGSIITTFKYKNVSDVPSAVD 128
Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCS 194
WR++GAVT +K+QGQCG CWAFSAVA+ EGI ++T G L+ LSEQ+LVDC T+ + GC
Sbjct: 129 WRQEGAVTPVKNQGQCGCCWAFSAVASTEGIHKLTTGNLVSLSEQELVDCDTNGEDQGCE 188
Query: 195 GGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQ 254
GGLMD AFE+II+N GL+TEA+YPY+ +GTC+ + + AATIS YE++P DEQAL +
Sbjct: 189 GGLMDDAFEFIIQNNGLSTEAEYPYQGVDGTCNKTEVGSSAATISGYENVPVNDEQALQK 248
Query: 255 AVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNS 314
AV+NQPVSV +DASG F FYKSGV CG DHGVAVVG+G E+E +YWL+KNS
Sbjct: 249 AVANQPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVAVVGYGVGEDE--TEYWLVKNS 306
Query: 315 WGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
WG WGE GYIR+ R DA GLCGIA SYP A
Sbjct: 307 WGTQWGEEGYIRMQRGVDASEGLCGIAMQPSYPTA 341
>gi|356517350|ref|XP_003527350.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
gi|356577765|ref|XP_003556993.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 343
Score = 356 bits (914), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 175/324 (54%), Positives = 226/324 (69%), Gaps = 11/324 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
V+ R++ + S+ E+HEQWMA++G+ YKD EK R +FK+N+ YIE N N+ YKLG
Sbjct: 25 VASRTLQDASMYERHEQWMARYGKVYKDPEEKEKRFRVFKENVNYIEAFNNAANKPYKLG 84
Query: 88 TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F+DLT+EEF NR ++R +TFKY+NVT +P SIDWR+KGAVT IK
Sbjct: 85 INQFADLTSEEF---IVPRNRFNGHTRSSNTRTTTFKYENVTVLPDSIDWRQKGAVTPIK 141
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYI 205
+QG CG CWAFSA+AA EGI +I+ GKL+ LSEQ++VDC T +HGC GG MD AF++I
Sbjct: 142 NQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEVVDCDTKGTDHGCEGGYMDGAFKFI 201
Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
I+N G+ TEA YPY+ +G C+ ++E AATI+ YED+P +E+AL +AV+NQPVSV +
Sbjct: 202 IQNHGINTEASYPYKGVDGKCNIKEEAVHAATITGYEDVPINNEKALQKAVANQPVSVAI 261
Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
DASG F FYKSG+ CG DHGV VG+G E G KYWL+KNSWG WGE GYI
Sbjct: 262 DASGADFQFYKSGIFTGSCGTELDHGVTAVGYG--ENNEGTKYWLVKNSWGTEWGEEGYI 319
Query: 326 RILRDA----GLCGIATAASYPVA 345
+ R G+CGIA ASYP A
Sbjct: 320 MMQRGVKAVEGICGIAMMASYPTA 343
>gi|1046373|gb|AAC49135.1| SAG12 protein [Arabidopsis thaliana]
Length = 346
Score = 356 bits (913), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 179/352 (50%), Positives = 238/352 (67%), Gaps = 15/352 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M LK + F+ + I C S +S +E + ++H +WM +HGR Y D E+
Sbjct: 1 MALKHMQIFLF----VAIFSSFCFSITLSRPLDNELIMQKRHIEWMTKHGRVYADVKEEN 56
Query: 61 MRLNIFKQNLEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS-S 118
R +FK N+E IE N RT+KL N+F+DLTN+EF ++YTG+ + V ++S QS +
Sbjct: 57 NRYVVFKNNVERIEHLNSIPAGRTFKLAVNQFADLTNDEFCSMYTGF-KGVSALSSQSQT 115
Query: 119 RPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
+ S F+YQNV+ +P S+DWR+KGAVT IK+QG CG CWAFSAVAA+EG TQI +GKLI
Sbjct: 116 KMSPFRYQNVSSGALPVSVDWRKKGAVTPIKNQGSCGCCWAFSAVAAIEGATQIKKGKLI 175
Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
LSEQQLVDC T++ GC GGLMD AFE+I GL TE+DYPY+ E+ TC+++K A
Sbjct: 176 SLSEQQLVDCDTNDFGCEGGLMDTAFEHIKATGGLTTESDYPYKGEDATCNSKKTNPKAT 235
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
+I+ YED+P DEQAL++AV++QPVSV ++ G F FY SGV +C DH V +G
Sbjct: 236 SITGYEDVPVNDEQALMKAVAHQPVSVGIEGGGFDFQFYSSGVFTGECTTYLDHAVTAIG 295
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+G E NG+KYW+IKNSWG WGESGY+RI +D GLCG+A ASYP
Sbjct: 296 YG--ESTNGSKYWIIKNSWGTKWGESGYMRIQKDVKDKQGLCGLAMKASYPT 345
>gi|21666724|gb|AAM73806.1|AF448505_1 cysteine proteinase [Brassica napus]
gi|21666726|gb|AAM73807.1|AF448506_1 cysteine proteinase [Brassica napus]
Length = 343
Score = 355 bits (912), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 169/341 (49%), Positives = 237/341 (69%), Gaps = 11/341 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
I +F+I+ LV + + R + E ++ ++H WM +HGR Y D EK R +FK+N+
Sbjct: 6 IQIFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNV 65
Query: 71 EYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
E IE+ N+ + T+KL N+F+DLTNEEFR++YTGY SV ++P++F+YQ+V+
Sbjct: 66 ESIERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGN--SVLSSRTKPTSFRYQHVS 123
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P S+DWR+KGAVT IKDQG CGSCWAFSAVAA+EG+ QI +GKLI LSEQ+LVDC
Sbjct: 124 SDALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCD 183
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T++ GC GG M+ AF Y + GL +E++YPY+ +GTC+ K K +A +I +ED+P
Sbjct: 184 TNDDGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPAN 243
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
DE+AL++AV++ PVS+ + G F FY SGV + +C + DHGVAVVG+G + NG+K
Sbjct: 244 DEKALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYG--KSSNGSK 301
Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
YW++KNSWG WGE GY+RI +D G CG+A ASYP
Sbjct: 302 YWILKNSWGPKWGERGYMRIKKDTKAKHGQCGLAMNASYPT 342
>gi|224162986|ref|XP_002338508.1| predicted protein [Populus trichocarpa]
gi|222872535|gb|EEF09666.1| predicted protein [Populus trichocarpa]
Length = 306
Score = 355 bits (911), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 168/314 (53%), Positives = 225/314 (71%), Gaps = 14/314 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+ E+HEQWM Q+GR YKD+ E+A R +IFK+N+ I+ N + ++YKLG N+F+DLTNE
Sbjct: 1 MYERHEQWMTQYGRVYKDDNERATRYSIFKENVARIDAFNSQTGKSYKLGVNQFADLTNE 60
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+A NR + + P F+Y+NV+ VP+++DWR++GAVT +KDQGQCG CWA
Sbjct: 61 EFKA---SRNRFKGHMCSPQAGP--FRYENVSAVPSTVDWRKEGAVTPVKDQGQCGCCWA 115
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEA 215
FSAVAA+EGI ++T GKLI LSEQ++VDC T ++ GC+GGLMD AF++I +NKGL TEA
Sbjct: 116 FSAVAAMEGINKLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEA 175
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
+YPY+ +GTC+ +K AA I+ +ED+P E AL++AV+ QPVSV +DA G F FY
Sbjct: 176 NYPYKGTDGTCNTKKSAIHAAKITGFEDVPANSEAALMKAVAKQPVSVAIDAGGSDFQFY 235
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----A 331
SG+ C DHGV VG+G ++ G+KYWL+KNSWG WGE GYIR+ +D
Sbjct: 236 SSGIFTGSCDTQLDHGVTAVGYGVSD---GSKYWLVKNSWGAQWGEEGYIRMQKDISAKE 292
Query: 332 GLCGIATAASYPVA 345
GLCGIA ASYP A
Sbjct: 293 GLCGIAMQASYPTA 306
>gi|356554921|ref|XP_003545789.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine
max]
Length = 439
Score = 354 bits (909), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 171/320 (53%), Positives = 221/320 (69%), Gaps = 11/320 (3%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
++ + S+ E+HEQWM +HG+ YKD E+ R IF +N+ Y+E N N+ YKLG N+F
Sbjct: 125 TLQDASMYERHEQWMTRHGKVYKDPREREKRFRIFNENVNYVEAFNNAANKPYKLGINQF 184
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
DLTN+EF A NR + R +TFKY+NVT VP+++DWR+ GAVT +KDQGQ
Sbjct: 185 XDLTNQEFIA---PRNRFKGHMCSSIIRTTTFKYENVTTVPSTVDWRQNGAVTPVKDQGQ 241
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CG CWAFSAVAA EGI ++ GKLI LSEQ+LVDC T + GC GGLMD A+++II+N
Sbjct: 242 CGCCWAFSAVAATEGIHALSGGKLISLSEQELVDCDTKGVDQGCEGGLMDDAYKFIIQNH 301
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
GL TEA+YPY+ +G C+ + AATI+ YED+P +E+AL +AV+NQPVSV +DAS
Sbjct: 302 GLNTEANYPYKGVDGKCNANEAANHAATITGYEDVPANNEKALQKAVANQPVSVAIDASS 361
Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
F FYKSG CG DHGV VG+G + ++G KYWL+KNSWG WGE GYIR+ R
Sbjct: 362 SDFQFYKSGAFTGSCGTELDHGVTAVGYGVS--DHGTKYWLVKNSWGTEWGEEGYIRMQR 419
Query: 330 ----DAGLCGIATAASYPVA 345
+ G+CGIA ASYP A
Sbjct: 420 GVDSEEGVCGIAMQASYPTA 439
>gi|357474725|ref|XP_003607647.1| Cysteine proteinase [Medicago truncatula]
gi|355508702|gb|AES89844.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 354 bits (908), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 171/327 (52%), Positives = 224/327 (68%), Gaps = 15/327 (4%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
+ V+S + PS+ E+HEQWM+++G+ YKD +EK R IFK N+E+IE N N+ Y
Sbjct: 23 TNVMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPY 82
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
KL N +DLT +EF+A GY + + R+ + S FKY+NVT +P ++DWR KGAVT
Sbjct: 83 KLSVNHLADLTLDEFKASRNGYKK----IDREFATTS-FKYENVTAIPEAVDWRVKGAVT 137
Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAF 202
IKDQGQCGSCWAFS VAA+EGI QIT GKLI LSEQ+LVDC T ++ GC GGLM+ F
Sbjct: 138 PIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGF 197
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
E+II+N G+ +E +YPY+ +G+C N A A I+ YE +P E +LL+AV+NQP+S
Sbjct: 198 EFIIKNGGITSETNYPYKAADGSC-NTATTAPVAKITGYEKVPVNSEISLLKAVANQPIS 256
Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
V +DAS +F FY SG+ +CG DHGV VG+G+A NG YW++KNSWG WGE
Sbjct: 257 VSIDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA---NGTDYWIVKNSWGTVWGEK 313
Query: 323 GYIRILR----DAGLCGIATAASYPVA 345
GYIR+ R GLCGIA +SYP A
Sbjct: 314 GYIRMQRGIADKEGLCGIAMDSSYPTA 340
>gi|102140014|gb|ABF70145.1| cysteine protease, putative [Musa acuminata]
Length = 373
Score = 354 bits (908), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 174/320 (54%), Positives = 221/320 (69%), Gaps = 13/320 (4%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
+ + S+ E+H +WMA+HGRTYKD EK RL IFK N+EYIE N G R Y+L N+F+
Sbjct: 26 LGDASMAERHVEWMARHGRTYKDAAEKEQRLGIFKSNVEYIESFNA-GKRKYQLAANQFA 84
Query: 93 DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
DLT+EEF+A++TG+ PS + + F++ +++ VP S+DWR KGAVT +KDQG C
Sbjct: 85 DLTHEEFKAMHTGFK---PSGTGAKKAGNGFRHGSLSSVPDSVDWRSKGAVTPVKDQGLC 141
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKG 210
GSCWAF+ VAAVEGIT+I GKLI LSEQQLVDC + GC GG MD AFE+I+ N G
Sbjct: 142 GSCWAFTVVAAVEGITKIVTGKLISLSEQQLVDCDVHGKDQGCQGGDMDAAFEFIVNNGG 201
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA-SG 269
+ +EA+YPY + C+ V ATI +ED+P DE+AL +AV+NQPVSV +DA S
Sbjct: 202 ITSEANYPYEEVQRLCNAHNASFVVATIESHEDVPTNDEKALRKAVANQPVSVGIDAGSS 261
Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
F Y GV + +CG + DH V VVG+GT + G KYWL KNSWGETWGE+GYIR+ R
Sbjct: 262 LDFQLYSGGVFSGECGTDLDHAVTVVGYGTTSD--GTKYWLAKNSWGETWGENGYIRMER 319
Query: 330 DA----GLCGIATAASYPVA 345
D GLCGIA ASYP A
Sbjct: 320 DVAAKEGLCGIAMQASYPTA 339
>gi|356543114|ref|XP_003540008.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 343
Score = 354 bits (908), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 175/328 (53%), Positives = 225/328 (68%), Gaps = 9/328 (2%)
Query: 23 CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR 82
C SQV S R +H+ S+ E+HEQWM ++G+ YKD E R IF+ N+E+IE N GN+
Sbjct: 20 CTSQVKS-RKLHDASMYERHEQWMEKYGKVYKDSAEMQKRFLIFENNVEFIESFNAAGNK 78
Query: 83 TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
YKL N +D TNEEF A + GY R +++ + FKY+NVTD+P ++DWR+KG
Sbjct: 79 PYKLSINHLADQTNEEFMASHKGYKGSHWQGLRITTQ-TPFKYENVTDIPWAVDWRQKGD 137
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAF 202
VT IKDQ QCG+CWAFSAVAA EGI QIT G L+ LSE++LVDC + +HGC GGLM+ F
Sbjct: 138 VTSIKDQAQCGNCWAFSAVAATEGIYQITTGNLVSLSEKELVDCDSVDHGCDGGLMEHGF 197
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PV 261
E+II+N G+++EA+YPY GTCD KE + A I+ YE +P E+ L +AV+NQ +
Sbjct: 198 EFIIKNGGISSEANYPYTAVNGTCDTNKEASPVAQITGYETVPVNCEEELQKAVANQLTM 257
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV +DA G AF FY SGV CG DHGV VG+G+ + G +YW++KNSWG WGE
Sbjct: 258 SVSIDAGGSAFQFYPSGVFTGQCGTQLDHGVTAVGYGSTDY--GTQYWIVKNSWGTQWGE 315
Query: 322 SGYIRILR--DA--GLCGIATAASYPVA 345
GYIR+LR DA GLCGIA ASYP A
Sbjct: 316 EGYIRMLRGIDAQEGLCGIAMDASYPTA 343
>gi|356543118|ref|XP_003540010.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 339
Score = 353 bits (907), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 174/339 (51%), Positives = 225/339 (66%), Gaps = 15/339 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ +++L+ C SQV+S R++HE S + E+HEQW ++G+ YKD EK RL IFK N+
Sbjct: 10 ILALVLLLPICISQVMS-RNLHEASXCMSERHEQWTKKYGKVYKDAAEKQKRLLIFKDNV 68
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+IE N GN+ YKL N +D TNEEF A + GY + S + FKY+N+T
Sbjct: 69 EFIESFNAAGNKPYKLSINHLTDQTNEEFVASHNGYKH------KGSHSQTPFKYENITG 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
VP ++DWRE GAV +KDQGQCG+CWAFS VA EGI QIT L+ LSEQ+LVDC + +
Sbjct: 123 VPNAVDWRENGAVXAMKDQGQCGNCWAFSTVATTEGIYQITTSMLMSLSEQELVDCDSVD 182
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
HGC GG M+ FE+I +N G+++EA+YPY +GT D KE + AA I YE +P E
Sbjct: 183 HGCDGGYMEGGFEFIXKNGGISSEANYPYTAVDGTYDANKEASPAAQIKGYETVPANSED 242
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
AL +AV+NQPVSV +D G AF F SGV CG DHGV VG+G+ ++ G +YW+
Sbjct: 243 ALQKAVANQPVSVTIDVGGSAFQFNSSGVFTGQCGTQLDHGVTAVGYGSTDD--GTQYWI 300
Query: 311 IKNSWGETWGESGYIRILR--DA--GLCGIATAASYPVA 345
+KNSWG WGE GYIR+ R DA GLCGIA ASYP A
Sbjct: 301 VKNSWGTQWGEEGYIRMQRGTDAQEGLCGIAMDASYPTA 339
>gi|125551397|gb|EAY97106.1| hypothetical protein OsI_19029 [Oryza sativa Indica Group]
Length = 350
Score = 353 bits (907), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 179/353 (50%), Positives = 234/353 (66%), Gaps = 28/353 (7%)
Query: 12 PMFVIIILVITC------ASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
P+ + I+ I C + V + R + + ++ +HE+WMAQHGR YKD EKA RL
Sbjct: 7 PLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLE 66
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT---GYNRPVPSVSRQSSRPS 121
+FK N+ +IE N G Y LG N+F+DLT+EEF+A T G++ P V R S
Sbjct: 67 VFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGV-----RVS 121
Query: 122 T-FKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
T FKY+NV+ +P S+DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI L
Sbjct: 122 TGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISL 181
Query: 179 SEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
SEQ+LVDC D + GC GG +D AF++I+ N GL EA+YPY E+G C VAA
Sbjct: 182 SEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAA 241
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
+I YED+P DE +L++AV+ QPVSV VDAS F FY GV+ +CG + DHGV V+G
Sbjct: 242 SIRGYEDVPANDEPSLMKAVAGQPVSVAVDAS--KFQFYGGGVMAGECGTSLDHGVTVIG 299
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
+G A + G KYWL+KNSWG TWGE+GY+R+ +D G+CG+A SYP A
Sbjct: 300 YGAASD--GTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPTA 350
>gi|356517358|ref|XP_003527354.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
gi|356577767|ref|XP_003556994.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 343
Score = 353 bits (906), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 171/342 (50%), Positives = 232/342 (67%), Gaps = 14/342 (4%)
Query: 13 MFVIIILVITCASQV---VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
+ I + ++ C+ + V+ R++ + S+ E+HE+WM ++ + YKD E+ R IFK+N
Sbjct: 7 FYQISLALLFCSGFLAFQVTCRTLQDASMYERHEEWMGRYAKVYKDPQERERRFKIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ YIE N N+ Y LG N+F+DLTNEEF A NR + +R +TFKY+NVT
Sbjct: 67 VNYIEAFNNAANKPYTLGINQFADLTNEEFIAPR---NRFKGHMCSSITRTTTFKYENVT 123
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
+P+++DWR+KGAVT IKDQGQCG CWAFSAVAA EGI ++ GKLI LSEQ++VDC T
Sbjct: 124 AIPSTVDWRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQEVVDCDTK 183
Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
++ GC+GG MD AF++II+N GL E +YPY+ +G C+ + ATI+ YED+P
Sbjct: 184 GEDQGCAGGFMDGAFKFIIQNHGLNNEPNYPYKAVDGKCNAKAAANHVATITGYEDVPVN 243
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
+E+AL +AV+NQPVSV +DASG F FY+SGV CG DHGV VG+G + + G +
Sbjct: 244 NEKALQKAVANQPVSVAIDASGSDFQFYQSGVFTGSCGTELDHGVTAVGYGVSAD--GTE 301
Query: 308 YWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
YWL+KNSWG WGE GYIR+ R + GL GIA ASYP A
Sbjct: 302 YWLVKNSWGTEWGEEGYIRMQRGVKAEEGLXGIAMMASYPTA 343
>gi|297826875|ref|XP_002881320.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
gi|297327159|gb|EFH57579.1| hypothetical protein ARALYDRAFT_321132 [Arabidopsis lyrata subsp.
lyrata]
Length = 341
Score = 353 bits (905), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 174/342 (50%), Positives = 233/342 (68%), Gaps = 13/342 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ +F I+ + + + HEPS +EKHEQWMA+ R Y+DELEK MR ++FK+N
Sbjct: 7 LVTIFTILFTTFSISQATSRTVTFHEPSSLEKHEQWMARFSRVYRDELEKQMRRDVFKKN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
L++IE NK+GN++YKLG NEF+D TNEEF A++TG V ++ ++ N++
Sbjct: 67 LKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGLKGLSSKVVDETISSRSW---NIS 123
Query: 130 D-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
D V S DWR +GAVT +K QGQCG CWAFSAVAAVEG+T+I G L+ LSEQQL+DC
Sbjct: 124 DMVGVSKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVTKIAGGNLVSLSEQQLLDCDR 183
Query: 189 D-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
+ + GC GG+M AF YII+N+G+A+E DY Y+ +G C + AA IS ++ +P
Sbjct: 184 EYDRGCDGGIMSDAFNYIIQNRGIASENDYSYQGSDGRCRSSARP--AARISGFQTVPSN 241
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
+EQALL+AVS QPVSV +DA+G F Y GV + CG + +H V VG+GT+++ G K
Sbjct: 242 NEQALLEAVSRQPVSVSMDANGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQD--GTK 299
Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
YWL KNSWGETWGE GYIRI RD G+CG+A A YPVA
Sbjct: 300 YWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 341
>gi|388512155|gb|AFK44139.1| unknown [Medicago truncatula]
Length = 340
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 170/327 (51%), Positives = 224/327 (68%), Gaps = 15/327 (4%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
+ V+S + PS+ E+HEQWM+++G+ YKD +EK R IFK N+E+IE N N+ Y
Sbjct: 23 TNVMSRKLYESPSLQERHEQWMSEYGKLYKDAIEKEKRFMIFKDNVEFIESFNAADNKPY 82
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
KL N +DLT +EF+A GY + + R+ + S FKY+NVT +P ++DWR KGAVT
Sbjct: 83 KLSVNHLADLTLDEFKASRNGYKK----IDREFATTS-FKYENVTAIPEAVDWRVKGAVT 137
Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAF 202
IKDQGQCGSCWAFS VAA+EGI QIT GKLI LSEQ+LVDC T ++ GC GGLM+ F
Sbjct: 138 PIKDQGQCGSCWAFSTVAAIEGINQITTGKLISLSEQELVDCDTKGEDQGCEGGLMEDGF 197
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
E+II+N G+ +E +YPY+ +G+C + A A I+ YE +P E +LL+AV+NQP+S
Sbjct: 198 EFIIKNGGITSETNYPYKAADGSC-SAATTAPVAKITGYEKVPVNSEISLLKAVANQPIS 256
Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
V +DAS +F FY SG+ +CG DHGV VG+G+A NG YW++KNSWG WGE
Sbjct: 257 VSIDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA---NGTDYWIVKNSWGTVWGEK 313
Query: 323 GYIRILR----DAGLCGIATAASYPVA 345
GYIR+ R GLCGIA +SYP A
Sbjct: 314 GYIRMQRGIADKEGLCGIAMDSSYPTA 340
>gi|413953667|gb|AFW86316.1| hypothetical protein ZEAMMB73_635707 [Zea mays]
Length = 340
Score = 352 bits (902), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 169/340 (49%), Positives = 228/340 (67%), Gaps = 16/340 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ I+ C + + + + ++V +HEQWMAQ+ R YKD EKA R +FK N+++
Sbjct: 8 ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRFEVFKANVKF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR++ T N+ S + + P+ F+Y+NV+
Sbjct: 68 IESFNAGGNNKFWLGVNQFADLTNDEFRSIKT--NKGFKSSNMK--IPTGFRYENVSVDA 123
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+PT+IDWR KGAVT IKDQGQCG CWAFSAVAA EGI +I+ GKL+ L+EQ+LVDC
Sbjct: 124 LPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHG 183
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
++ GC GGLMD AF++II N GL TE+ YPY +G C + AATI YED+P D
Sbjct: 184 EDQGCEGGLMDDAFKFIINNGGLTTESSYPYTAADGKCKSGSNS--AATIKGYEDVPAND 241
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VD F FY SGV+ CG + DHG+A +G+G + +G KY
Sbjct: 242 EAALMKAVANQPVSVAVDGGDMTFQFYSSGVMTGSCGTDLDHGIAAIGYG--KTSDGTKY 299
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
WL+KNSWG TWGE+GY+R+ +D G+CG+A SYP
Sbjct: 300 WLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 339
>gi|356515040|ref|XP_003526209.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 351 bits (901), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 172/350 (49%), Positives = 230/350 (65%), Gaps = 13/350 (3%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K ++ +F+ + + I SQV+ R +H+ ++ E+HE WMA++G+ YKD EK
Sbjct: 1 MAFTGQKQHMLALFLFLAVGI---SQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKE 56
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK N+E+IE N GN+ YKLG N +DLT EEF+ G R S + +
Sbjct: 57 KRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTY-EFSTTTFKL 115
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELS 179
+ FKY+NVTD+P +IDWR KGAVT IKDQG QCGSCWAFS +AA EGI QI+ G L+ LS
Sbjct: 116 NGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGSCWAFSTIAATEGIHQISTGNLVSLS 175
Query: 180 EQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
EQ+LVDC + + GC GG M+ FE+II+N G+ +E +YPY+ +GTC+ + A I
Sbjct: 176 EQELVDCDSVDDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIK 235
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
YE +P E+AL +AV+NQPVSV + A+ F FY SG+ N +CG + DHGV VG+GT
Sbjct: 236 GYEIVPSYSEEALQKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT 295
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
ENG YW++KNSWG WGE GYIR+ R G+CGIA +SYP A
Sbjct: 296 ---ENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342
>gi|310656789|gb|ADP02218.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 341
Score = 351 bits (900), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 170/339 (50%), Positives = 226/339 (66%), Gaps = 15/339 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ I+ + C+S V+S R + + ++VE+HEQWMA+ R YKD EKA R +FK N+ +
Sbjct: 8 LLAIVGCICLCSSAVLSARELGDTAMVERHEQWMAKFNRVYKDGTEKAQRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N E NR + LG N+F+DLTN+EFRA T N+ + ++ P+ FKY NV+
Sbjct: 68 IESFNAE-NRKFWLGVNQFTDLTNDEFRATKT--NKGLKMSGGRA--PTGFKYSNVSIDA 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+PT++DWR KG VT IKDQGQCG CWAFSAV A EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPTAVDWRTKGVVTPIKDQGQCGCCWAFSAVVATEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
+ GC GG MD AF++II+N GL TEA+YPY ++G C ATI YED+P D
Sbjct: 183 VDQGCEGGEMDDAFKFIIKNGGLTTEANYPYTAQDGQCKTSIASNSVATIKGYEDVPAND 242
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E +L++AV+NQPVSV VD F Y GV+ CG + DHG+A +G+G + G KY
Sbjct: 243 ESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIAAIGYGMTSD--GTKY 300
Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
WL+KNSWG TWGESGY+R+ +D +G+CG+A SYP
Sbjct: 301 WLLKNSWGTTWGESGYLRMEKDISDKSGMCGLAMQPSYP 339
>gi|302143416|emb|CBI21977.3| unnamed protein product [Vitis vinifera]
Length = 297
Score = 350 bits (899), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 178/306 (58%), Positives = 215/306 (70%), Gaps = 15/306 (4%)
Query: 46 MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG 105
MA++GR YKD EK R IFK N+ IE NK ++TYKL NEF+DLTNEEFR+L
Sbjct: 1 MARYGRMYKDANEKEKRFKIFKDNVARIESFNKAMDKTYKLSINEFADLTNEEFRSLRNR 60
Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVE 165
+ + S +TFKY+NVT VP++IDWR+KGAVT IKDQ QCG CWAFSAVAA E
Sbjct: 61 FKAHI------CSEATTFKYENVTAVPSTIDWRKKGAVTPIKDQQQCGCCWAFSAVAATE 114
Query: 166 GITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEE 223
GITQIT GKLI LSEQ+LVDC T +N GCSGGLMD AF + I+ GLA+EA YPY ++
Sbjct: 115 GITQITTGKLISLSEQELVDCDTGGENQGCSGGLMDDAFRF-IKIHGLASEATYPYEGDD 173
Query: 224 GTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD 283
GTC+++KE AA I YED+P +E+AL +AV++QPV+V +DA G F FY SGV
Sbjct: 174 GTCNSKKEAHPAAKIKGYEDVPANNEKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQ 233
Query: 284 CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATA 339
CG DHGVA VG+G ++G YWL+KNSWG WGE GYIR+ RD GLCGIA
Sbjct: 234 CGTELDHGVAAVGYGIG--DDGMMYWLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQ 291
Query: 340 ASYPVA 345
ASYP A
Sbjct: 292 ASYPTA 297
>gi|116786779|gb|ABK24233.1| unknown [Picea sitchensis]
Length = 463
Score = 350 bits (899), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 172/351 (49%), Positives = 236/351 (67%), Gaps = 21/351 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSM---------HEPSIVEKHEQWMAQHGRTYKDELEKA 60
I+ +F ++ L S + S+ + +I+E +E W+AQH + Y EK
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIIGYDSKDLREDDAIMELYELWLAQHKKAYNGLGEKQ 62
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R ++FK N YI + N +GN +YKLG N+F+DL++EEF+A Y G + + R S+ P
Sbjct: 63 NRFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKATYLG--AKLDTKKRLSNSP 120
Query: 121 ST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
S ++Y + D+P SIDWREKGAVT +KDQG CGSCWAFS VAAVEGI QI G L LS
Sbjct: 121 SPRYQYSDGEDLPESIDWREKGAVTAVKDQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180
Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
EQ+LVDC T N GC+GGLMD AF++II N GL +E DYPY+ +G+CD ++ A TI
Sbjct: 181 EQELVDCDTSYNQGCNGGLMDYAFQFIINNGGLDSEDDYPYKANDGSCDAYRKNAHVVTI 240
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
YED+P+ DE++L +A +NQP+SV ++ASGRAF FY+SGV + CG DHGV +VG+G
Sbjct: 241 DDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSTCGTQLDHGVTLVGYG 300
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
+ E+G YW++KNSWG++WGE G+IR+ R+ G+CGIA ASYP+
Sbjct: 301 S---ESGTDYWIVKNSWGKSWGEKGFIRLQRNIEGVSTGMCGIAMEASYPL 348
>gi|357458911|ref|XP_003599736.1| Cysteine proteinase [Medicago truncatula]
gi|357474719|ref|XP_003607644.1| Cysteine proteinase [Medicago truncatula]
gi|355488784|gb|AES69987.1| Cysteine proteinase [Medicago truncatula]
gi|355508699|gb|AES89841.1| Cysteine proteinase [Medicago truncatula]
Length = 340
Score = 350 bits (898), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 169/325 (52%), Positives = 222/325 (68%), Gaps = 16/325 (4%)
Query: 28 VSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
V R ++E S+ E+HEQWM +HG+ Y+D +EK R IFK N+E+IE N N+ YKL
Sbjct: 25 VMSRKLYESLSLQERHEQWMTEHGKVYEDAIEKEKRFMIFKDNVEFIESFNAADNQPYKL 84
Query: 87 GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
N +DLT +EF+A GY + + R+ + S FKY+NVT +P ++DWR KGAVT I
Sbjct: 85 SVNHLADLTLDEFKASRNGYKK----IDREFTTTS-FKYENVTAIPAAVDWRVKGAVTPI 139
Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEY 204
KDQGQCGSCWAFS VAA EGI QIT GKL+ LSEQ+LVDC T ++ GC GGLM+ FE+
Sbjct: 140 KDQGQCGSCWAFSTVAATEGINQITTGKLVSLSEQELVDCDTKGEDQGCEGGLMEDGFEF 199
Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
II+N G+ +E +YPY+ +G+C+ VA I+ YE +P E++LL+AV+NQP+SV
Sbjct: 200 IIKNGGITSETNYPYKAADGSCNTATTTPVAK-ITGYEKVPVNSEKSLLKAVANQPISVS 258
Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
+DAS +F FY SG+ +CG DHGV VG+G+A NG YW++KNSWG WGE GY
Sbjct: 259 IDASDSSFMFYSSGIYTGECGTELDHGVTAVGYGSA---NGTDYWIVKNSWGTVWGEKGY 315
Query: 325 IRILR----DAGLCGIATAASYPVA 345
IR+ R GLCGIA +SYP A
Sbjct: 316 IRMQRGIAAKEGLCGIAMDSSYPTA 340
>gi|77554625|gb|ABA97421.1| Vignain precursor, putative [Oryza sativa Japonica Group]
gi|222630746|gb|EEE62878.1| hypothetical protein OsJ_17681 [Oryza sativa Japonica Group]
Length = 350
Score = 350 bits (898), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 177/352 (50%), Positives = 232/352 (65%), Gaps = 28/352 (7%)
Query: 12 PMFVIIILVITC------ASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
P+ + I+ I C + V + R + + ++ +HE+WMAQHGR YKD EKA RL
Sbjct: 7 PLLLAILCCIVCLYSSSGGAIVAAARELGGDAAMAARHERWMAQHGRVYKDAAEKARRLE 66
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT---GYNRPVPSVSRQSSRPS 121
+FK N+ +IE N G Y LG N+F+DLT+EEF+A T G++ P V R S
Sbjct: 67 VFKANVAFIESFNAGGKNRYWLGVNQFADLTSEEFKATMTNSKGFSTPNNGV-----RVS 121
Query: 122 T-FKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
T FKY+NV+ +P S+DWR KGAVT IKDQGQCG CWAFSAVAA+EG +++ GKLI L
Sbjct: 122 TGFKYENVSADALPASVDWRTKGAVTRIKDQGQCGCCWAFSAVAAMEGFVKLSTGKLISL 181
Query: 179 SEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
SEQ+LVDC D + GC GG +D AF++I+ N GL EA+YPY E+G C VAA
Sbjct: 182 SEQELVDCDVDGNDQGCEGGEIDGAFQFILSNGGLTAEANYPYTAEDGRCKTTAAADVAA 241
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
+I YED+P DE +L++AV+ QPVSV VDAS F FY GV+ +CG + DHGV V+G
Sbjct: 242 SIRGYEDVPANDEPSLMKAVAGQPVSVAVDAS--KFQFYGGGVMAGECGTSLDHGVTVIG 299
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+G A + G KYWL+KNSWG TWGE+GY+R+ +D G+CG+A SYP
Sbjct: 300 YGAASD--GTKYWLVKNSWGTTWGEAGYLRMEKDIDDKRGMCGLAMQPSYPT 349
>gi|414588010|tpg|DAA38581.1| TPA: hypothetical protein ZEAMMB73_156486 [Zea mays]
Length = 347
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 175/355 (49%), Positives = 238/355 (67%), Gaps = 30/355 (8%)
Query: 11 IPMFVIIIL----VITCASQVVSGRSM---HEPSIVEKHEQWMAQHGRTYKDELEKAMRL 63
IP +++ + V C++ V++ R + E ++V +HEQWM QHGR YKDE +KA R
Sbjct: 3 IPKALLLAILGCGVCLCSAAVLAARELGGDDELAMVARHEQWMVQHGRVYKDETDKAHRF 62
Query: 64 NIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYT--GYNRPVPSVSRQSS 118
+FK N+++IE N GNR + LG N+F+DLTN+EFRA T G+N V V
Sbjct: 63 LVFKANVKFIESFNAAAAAGNRKFWLGVNQFADLTNDEFRATKTNKGFNPNVVKV----- 117
Query: 119 RPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
P+ F+YQN++ +P ++DWR KGAVT IKDQGQCG CWAFSAVAA EGI +I+ GKL
Sbjct: 118 -PTGFRYQNLSIDALPQTVDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLT 176
Query: 177 ELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
LSEQ+LVDC ++ GC+GG MD AF++II+N GL TE++YPY ++G C +
Sbjct: 177 SLSEQELVDCDVHGEDQGCNGGEMDDAFKFIIKNGGLTTESNYPYTAQDGQCKSGSNG-- 234
Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
AATI YED+P DE AL++AV++QPVSV VD F FY GV+ CG + DHG+A
Sbjct: 235 AATIKGYEDVPANDEAALMKAVASQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAA 294
Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
+G+G + +G KYWL+KNSWG TWGE+G++R+ +D G+CG+A SYP A
Sbjct: 295 IGYG--KTSDGTKYWLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMQPSYPTA 347
>gi|116787404|gb|ABK24495.1| unknown [Picea sitchensis]
gi|224286306|gb|ACN40861.1| unknown [Picea sitchensis]
Length = 452
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 172/327 (52%), Positives = 228/327 (69%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYK 85
++S + + E +I+E +E W+A+H R Y EK R ++FK N YI + N +GNR+YK
Sbjct: 26 IISSKDLREDDAIMELYELWLAEHKRAYNGLDEKQKRFSVFKDNFLYIHEHN-QGNRSYK 84
Query: 86 LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ--NVTDVPTSIDWREKGAV 143
LG N+F+DL++EEF+A Y G ++ SRP + +YQ + D+P SIDWREKGAV
Sbjct: 85 LGLNQFADLSHEEFKATYLGAKL---DTKKRLSRPPSRRYQYSDGEDLPESIDWREKGAV 141
Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAF 202
T +KDQG CGSCWAFS VAAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AF
Sbjct: 142 TSVKDQGSCGSCWAFSTVAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAF 201
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
E+II N GL +E DYPY +G+CD+ ++ A TI YED+P+ DE++L +A +NQP+S
Sbjct: 202 EFIINNGGLDSEEDYPYTAYDGSCDSYRKNAHVVTIDDYEDVPENDEKSLKKAAANQPIS 261
Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
V ++ASGR F FY SGV + CG DHGV +VG+G+ E+G YW +KNSWG++WGE
Sbjct: 262 VAIEASGREFQFYDSGVFTSTCGTQLDHGVTLVGYGS---ESGTDYWTVKNSWGKSWGEE 318
Query: 323 GYIRILRD-----AGLCGIATAASYPV 344
G+IR+ R+ G+CGIA ASYPV
Sbjct: 319 GFIRLQRNIEVASTGMCGIAMEASYPV 345
>gi|356517308|ref|XP_003527330.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 173/344 (50%), Positives = 231/344 (67%), Gaps = 10/344 (2%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
SF ++I+ LV+ + V R + E E+HE+WMAQ+GR YKD EK R +FK
Sbjct: 3 SFSQNHYLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFK 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+ +IE N G++ + L N+F+DL +EEF+AL + V ++S ++F+Y++
Sbjct: 63 NNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWV--ETSTETSFRYES 120
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC- 186
VT +P +IDWR++GAVT IKDQG+CGSCWAFSAVAA EGI QIT GKL+ LSEQ+LVDC
Sbjct: 121 VTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCV 180
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
++ GC GG +D AFE+I + G+A+E YPY+ TC +KE A I YE +P
Sbjct: 181 KGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPS 240
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENG 305
+E+ALL+AV+NQPVSV +DA AF +Y SG+ NA +CG + +H VAVVG+G A + G
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALD--G 298
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
+KYWL+KNSWG WGE GYIRI RD GLCGIA YP A
Sbjct: 299 SKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342
>gi|413953668|gb|AFW86317.1| hypothetical protein ZEAMMB73_339067 [Zea mays]
Length = 433
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 169/338 (50%), Positives = 226/338 (66%), Gaps = 16/338 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
II C + + + + +V +HEQWMAQ+ R YKD EKA R +FK N+++IE
Sbjct: 103 AIIGFAFFCGAAMAARDLSDDSVMVARHEQWMAQYSRVYKDASEKARRFEVFKANVQFIE 162
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VP 132
N GN + LG N+F+DLTN+EFR+ T N+ + S + + P+ F+Y+NV+ +P
Sbjct: 163 SFNAGGNNKFWLGVNQFADLTNDEFRSTKT--NKGLKSSNMKI--PTGFRYENVSADALP 218
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DN 190
T+IDWR KGAVT IKDQGQCG CWAFSAVAA EGI +I+ GKL+ L+EQ+LVDC ++
Sbjct: 219 TTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVSLAEQELVDCDVHGED 278
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC GGLMD AF++II+N GL TE+ YPY +G C + AATI YED+P DE
Sbjct: 279 QGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNS--AATIKGYEDVPANDEA 336
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
AL++AV+NQPVSV VD F FY GV+ CG + DHG+A +G+G + +G KYWL
Sbjct: 337 ALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--KTSDGTKYWL 394
Query: 311 IKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+KNSWG TWGE+GY+R+ +D G+CG+A SYP
Sbjct: 395 MKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 432
>gi|297802418|ref|XP_002869093.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
gi|297314929|gb|EFH45352.1| hypothetical protein ARALYDRAFT_491113 [Arabidopsis lyrata subsp.
lyrata]
Length = 355
Score = 349 bits (895), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 168/312 (53%), Positives = 220/312 (70%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++H + YK EK R +F++NL +I++ N E N +Y LG NEF+DLT+E
Sbjct: 47 LLELFESWMSEHSKVYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G +P S RQ S + F+Y+++TD+P S+DWR+KGAV +KDQGQCGSCWA
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPS--ANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWA 163
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QIT G L LSEQ+L+DC T N GC+GGLMD AF+YII GL E D
Sbjct: 164 FSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDD 223
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEG C QKE TIS YED+P+ D+++L++A+++QPVSV ++ASGR F FYK
Sbjct: 224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV N CG + DHGVA VG+G+++ G+ Y ++KNSWG WGE G+IR+ R+ G
Sbjct: 284 GGVFNGQCGTDLDHGVAAVGYGSSK---GSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEG 340
Query: 333 LCGIATAASYPV 344
LCGI ASYP
Sbjct: 341 LCGINKMASYPT 352
>gi|5823020|gb|AAD53012.1|AF089849_1 senescence-specific cysteine protease [Brassica napus]
Length = 344
Score = 348 bits (894), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 169/342 (49%), Positives = 234/342 (68%), Gaps = 12/342 (3%)
Query: 11 IPMFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
I +F+I+ LV + + + R + E ++ ++H +WM +HGR Y D EK R +FK+N
Sbjct: 6 IQIFLIVSLVSSFSLSITLSRPLLDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRN 65
Query: 70 LEYIEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
+E IE+ N + T+KL N+F+DLTNEEFR++YTG+ SV ++P++F+YQNV
Sbjct: 66 VERIERLNDVQSGLTFKLAVNQFADLTNEEFRSMYTGFKGN--SVLSSRTKPTSFRYQNV 123
Query: 129 TD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+ +P S+DWR+KGAVT IKDQG CGSCWAFSAVAA+EG+ QI +GKLI LSEQ+LVDC
Sbjct: 124 SSDALPVSVDWRKKGAVTPIKDQGLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDC 183
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
T++ GC GGLMD AF Y I GL +E++YPY+ GTC+ K K +A +I +ED+P
Sbjct: 184 DTNDGGCMGGLMDTAFNYTITIGGLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPA 243
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
DE+AL++AV++ PVS+ + F FY SGV + +C + DHGV VG+G +NG
Sbjct: 244 NDEKALMKAVAHHPVSIGIAGGDIGFQFYSSGVFSGECTTHLDHGVTAVGYG--RSKNGL 301
Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
KYW++KNSWG WGE GY+RI +D G CG+A ASYP
Sbjct: 302 KYWILKNSWGPKWGERGYMRIKKDIKPKHGQCGLAMNASYPT 343
>gi|242072572|ref|XP_002446222.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
gi|241937405|gb|EES10550.1| hypothetical protein SORBIDRAFT_06g005410 [Sorghum bicolor]
Length = 340
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 169/341 (49%), Positives = 224/341 (65%), Gaps = 18/341 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ I+ L + C + + + + ++V +HEQWMAQ+ R YKD EKA R +FK N+++
Sbjct: 8 ILAILGLALFCGAALAARDLNDDSAMVARHEQWMAQYNRVYKDATEKAQRFEVFKANVKF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVT-- 129
IE N GNR + LG N+F+DLTN+EFRA T +P P P+ F+Y+NV+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPV-----KVPTGFRYENVSVD 122
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
+P SIDWR KGAVT IKDQGQCG CWAFSAVAA EGI +I+ KLI LSEQ+LVDC
Sbjct: 123 ALPASIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTDKLISLSEQELVDCDVH 182
Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
++ GC GGLMD AF++II+N GL TE+ YPY +G C + AA I +ED+P
Sbjct: 183 GEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTATDGKCKSGTNS--AANIKGFEDVPAN 240
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
DE AL++AV+NQPVSV VD F Y GV+ CG + DHG+A +G+G + +G K
Sbjct: 241 DEAALMKAVANQPVSVAVDGGDMTFQLYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGTK 298
Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
YWL+KNSWG TWGE+GY+R+ +D G+CG+A SYP
Sbjct: 299 YWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 339
>gi|18418684|ref|NP_567983.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|71153408|sp|O65493.1|XCP1_ARATH RecName: Full=Xylem cysteine proteinase 1; Short=AtXCP1; Flags:
Precursor
gi|6708181|gb|AAF25831.1|AF191027_1 papain-type cysteine endopeptidase XCP1 [Arabidopsis thaliana]
gi|3080415|emb|CAA18734.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7270487|emb|CAB80252.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26449881|dbj|BAC42063.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|28827736|gb|AAO50712.1| unknown protein [Arabidopsis thaliana]
gi|332661101|gb|AEE86501.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 355
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 168/312 (53%), Positives = 220/312 (70%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++H + YK EK R +F++NL +I++ N E N +Y LG NEF+DLT+E
Sbjct: 47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G +P S RQ S + F+Y+++TD+P S+DWR+KGAV +KDQGQCGSCWA
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPS--ANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWA 163
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QIT G L LSEQ+L+DC T N GC+GGLMD AF+YII GL E D
Sbjct: 164 FSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDD 223
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEG C QKE TIS YED+P+ D+++L++A+++QPVSV ++ASGR F FYK
Sbjct: 224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV N CG + DHGVA VG+G+++ G+ Y ++KNSWG WGE G+IR+ R+ G
Sbjct: 284 GGVFNGKCGTDLDHGVAAVGYGSSK---GSDYVIVKNSWGPRWGEKGFIRMKRNTGKPEG 340
Query: 333 LCGIATAASYPV 344
LCGI ASYP
Sbjct: 341 LCGINKMASYPT 352
>gi|356515056|ref|XP_003526217.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 342
Score = 348 bits (893), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 172/344 (50%), Positives = 231/344 (67%), Gaps = 10/344 (2%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
SF ++I+ LV++ + V R + E E+HE+WMAQ+GR YKD EK R +FK
Sbjct: 3 SFSQNHYLILFLVLSVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFK 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+ +IE N G++ + L N+F+DL +EEF+AL + V ++S ++F+Y++
Sbjct: 63 NNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWV--ETSTQTSFRYES 120
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC- 186
VT +P +IDWR++GAVT IKDQG+CGSCWAFSAVAA EGI QIT GKL+ LSEQ+LVDC
Sbjct: 121 VTKIPATIDWRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCV 180
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
++ GC GG +D AFE+I + G+A+E YPY+ TC +KE A I YE +P
Sbjct: 181 KGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPS 240
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENG 305
+E+ALL+AV+NQPVSV +DA AF +Y SG+ N +CG + +H VAVVG+G A + G
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNVRNCGTDPNHAVAVVGYGKALD--G 298
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
+KYWL+KNSWG WGE GYIRI RD GLCGIA YP A
Sbjct: 299 SKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPTA 342
>gi|242068363|ref|XP_002449458.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
gi|241935301|gb|EES08446.1| hypothetical protein SORBIDRAFT_05g013840 [Sorghum bicolor]
Length = 350
Score = 347 bits (891), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 178/349 (51%), Positives = 240/349 (68%), Gaps = 22/349 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH-------EPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+I ++++ + VV R + E ++ +H+QWMA+HGRTYKDE EKA R
Sbjct: 10 MITFTAAALMILAVMTMVVEARDLSTSTGGYGEEAMKVRHQQWMAEHGRTYKDEAEKARR 69
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
+FK N ++++++N G ++Y+L NEF+D+TN+EF A+YTG +PVP+ + + +
Sbjct: 70 FQVFKANADFVDRSNAAGGKSYELAINEFADMTNDEFVAMYTGL-KPVPAGPK---KMAG 125
Query: 123 FKYQNVT--DVP-TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
FKY+N+T DV ++DWR+KGAVT IK+QGQCG CWAF+AVAAVE I QIT G L+ LS
Sbjct: 126 FKYENLTLSDVDQQAVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVESIHQITTGNLVSLS 185
Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
EQQ++DC TD N+GC+GG +D AF+YII N GLATE YPY +GTC + + AV TI
Sbjct: 186 EQQVLDCDTDGNNGCNGGYIDNAFQYIISNGGLATEDAYPYAAAQGTCQSSVQPAV--TI 243
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGN-NCDHGVAVVG 296
S Y+D+P GDE AL AV+NQPV+V +DA F FY SGVL AD CG + +H V VG
Sbjct: 244 SSYQDVPSGDEAALAAAVANQPVAVAIDAHNN-FQFYSSGVLTADTCGTPSLNHAVTAVG 302
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPVA 345
+ TAE+ G YWL+KN WG+ WGE GY+R+ R CG+A ASYPVA
Sbjct: 303 YSTAED--GTPYWLLKNQWGQNWGEGGYLRVERGTNACGVAQQASYPVA 349
>gi|413944253|gb|AFW76902.1| hypothetical protein ZEAMMB73_056195 [Zea mays]
Length = 340
Score = 347 bits (890), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 171/343 (49%), Positives = 229/343 (66%), Gaps = 22/343 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS-IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
+ ++ C + ++ R ++E S +V +HEQWMAQ+ R YKD EKA R +FK N++
Sbjct: 8 ILAVLSFAFFCGA-ALAARDLNEDSAMVARHEQWMAQYSRVYKDAAEKARRFEVFKANVK 66
Query: 72 YIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT--GYNRPVPSVSRQSSRPSTFKYQNVT 129
+IE N GNR + LG N+F+DLTN+EFR T G+ PS+ + S+ F+Y+NV+
Sbjct: 67 FIESFNTGGNRKFWLGINQFADLTNDEFRTTKTNKGFK---PSLDKVST---GFRYENVS 120
Query: 130 --DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P +IDWR GAVT IKDQGQCG CWAFSAVAA EGI +I+ GKLI LSEQ+LVDC
Sbjct: 121 VDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSEQELVDCD 180
Query: 188 T--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
++ GC GGLMD AF++II+N GL TE++YPY +G C + AA I YED+P
Sbjct: 181 VHGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNS--AANIKGYEDVP 238
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
DE AL++AV+NQPVSV VD F FY GV+ CG + DHG+A +G+G + +G
Sbjct: 239 TNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--KTSDG 296
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
KYWL+KNSWG TWGE+GY+R+ +D G+CG+A SYP
Sbjct: 297 TKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAMEPSYPT 339
>gi|400180449|gb|AFP73361.1| cysteine protease [Solanum chilense]
Length = 344
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 163/340 (47%), Positives = 236/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKKN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QGQCG CWAFSAV ++EG +I GKL+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAEGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|171702829|dbj|BAG16370.1| cysteine protease [Brassica oleracea var. italica]
Length = 332
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 165/335 (49%), Positives = 233/335 (69%), Gaps = 11/335 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F+I+ LV + + R + E ++ ++H WM +HGR Y D EK R +FK+N+E
Sbjct: 2 IFLIVSLVSSFSLSTTLSRPLDEVTMQKRHAAWMTEHGRVYADANEKNNRYVVFKRNVES 61
Query: 73 IEKANK-EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD- 130
IE+ N+ + T+KL N+F+DLTNEEFR++YTGY SV ++P++F+YQ+V+
Sbjct: 62 IERLNEVQYGLTFKLAVNQFADLTNEEFRSMYTGYKGN--SVLSSRTKPTSFRYQHVSSD 119
Query: 131 -VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P S+DWR+KGAVT IKDQG CGSCWAFSAVAA+EG+ QI +GKLI LSEQ+LVDC T+
Sbjct: 120 ALPISVDWRKKGAVTPIKDQGSCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTN 179
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
+ GC GG M+ AF Y + GL +E++YPY+ +GTC+ K K +A +I +ED+P DE
Sbjct: 180 DDGCMGGYMNSAFNYTMTTGGLTSESNYPYKSTDGTCNINKTKQIATSIKGFEDVPANDE 239
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
+AL++AV++ PVS+ + G F FY SGV + +C + DHGVAVVG+G + NG+KYW
Sbjct: 240 KALMKAVAHHPVSIGIAGGGTGFQFYSSGVFSGECSTHLDHGVAVVGYG--KSSNGSKYW 297
Query: 310 LIKNSWGETWGESGYIRILRDA----GLCGIATAA 340
++KNSWG WGE GY+RI +D G CG+A A
Sbjct: 298 ILKNSWGPKWGERGYMRIKKDTKAKHGQCGLAMNA 332
>gi|356515046|ref|XP_003526212.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 342
Score = 346 bits (887), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 170/350 (48%), Positives = 228/350 (65%), Gaps = 13/350 (3%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K ++ +F+ + + I SQV+ R +H+ ++ E+HE WMA++G+ YKD EK
Sbjct: 1 MAFTGQKQHMLALFLFLAVGI---SQVMP-RKLHQTALRERHENWMAEYGKMYKDAAEKE 56
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK N+E+IE N GN+ YKLG N +DLT EEF+ G R S + +
Sbjct: 57 KRFQIFKDNVEFIESFNAAGNKPYKLGVNHLADLTLEEFKDSRNGLKRTY-EFSTTTFKL 115
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELS 179
+ FKY+NVTD+P +IDWR KGAVT IKDQG QCG WAFS +AA EGI QI+ G L+ LS
Sbjct: 116 NGFKYENVTDIPEAIDWRVKGAVTPIKDQGDQCGRFWAFSTIAATEGIHQISTGNLVSLS 175
Query: 180 EQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
EQ+LVDC + + GC GG M+ FE+II+N G+ +E +YPY+ +GTC+ + A I
Sbjct: 176 EQELVDCDSVDDGCEGGFMEDGFEFIIKNGGITSETNYPYKGVDGTCNTTIAASPVAQIK 235
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
YE +P E+AL +AV+NQPVSV + A+ F FY SG+ N +CG + DHGV VG+GT
Sbjct: 236 GYEIVPSYSEEALKKAVANQPVSVSIHATNATFMFYSSGIYNGECGTDLDHGVTAVGYGT 295
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
ENG YW++KNSWG WGE GYIR+ R G+CGIA +SYP A
Sbjct: 296 ---ENGTDYWIVKNSWGTQWGEKGYIRMHRGIAAKHGICGIALDSSYPTA 342
>gi|1208549|gb|AAC49455.1| Pseudotzain [Pseudotsuga menziesii]
Length = 454
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 172/351 (49%), Positives = 235/351 (66%), Gaps = 21/351 (5%)
Query: 10 IIPMFVIIILVITCAS------QVVSGRS---MHEPSIVEKHEQWMAQHGRTYKDELEKA 60
I+ +F ++ L S ++S S + + +I+E +E W+AQH + Y EK
Sbjct: 3 ILLLFAVLALSAMAGSASRADFSIISYDSQDLIGDDAIMELYELWLAQHKKAYNGLDEKQ 62
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
+ ++FK N YI + N +GN +YKLG N+F+DL++EEF+A Y G + + R S P
Sbjct: 63 KKFSVFKDNFLYIHQHNNQGNPSYKLGLNQFADLSHEEFKAAYLG--TKLDAKKRLSRSP 120
Query: 121 ST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
S ++Y D+P SIDWREKGAVT +K+QG CGSCWAFS VAAVEGI QI G L LS
Sbjct: 121 SPRYQYSVGEDLPESIDWREKGAVTAVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLS 180
Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
EQ+LVDC T N GC+GGLMD AF++II N GL +E DYPY+ G+CD ++ A TI
Sbjct: 181 EQELVDCDTSYNQGCNGGLMDYAFQFIISNGGLDSEDDYPYKANNGSCDAYRKNAHVVTI 240
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
YED+P+ DE++L +A +NQP+SV ++ASGRAF FY+SGV ++CG DHGV +VG+G
Sbjct: 241 DDYEDVPENDEKSLKKAAANQPISVAIEASGRAFQFYESGVFTSNCGTQLDHGVTLVGYG 300
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
+ E+G YWL+KNSWG +WGE G+I++ R+ G+CGIA ASYPV
Sbjct: 301 S---ESGIDYWLVKNSWGNSWGEKGFIKLQRNLEGASTGMCGIAMEASYPV 348
>gi|400180441|gb|AFP73357.1| cysteine protease [Solanum habrochaites]
Length = 344
Score = 345 bits (886), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 162/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T+EEF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLSPSPMSSTEFKINDIS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIRENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C N +H V +G+GT +ENG K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT--DENGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGEKGFMKIIRDYGNPSGLCDIAKLSSYP 341
>gi|400180377|gb|AFP73327.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 162/340 (47%), Positives = 236/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS E S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y+ E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYQGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|242072394|ref|XP_002446133.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
gi|241937316|gb|EES10461.1| hypothetical protein SORBIDRAFT_06g002160 [Sorghum bicolor]
Length = 338
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 165/340 (48%), Positives = 226/340 (66%), Gaps = 20/340 (5%)
Query: 16 IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
++ ++ CAS V++ R + + ++VE+HE WM ++GR YKD EKA R FK N+ +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAF 66
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTD 130
+E N + LG N+F+DLT EEF+A N+ +S + + FKY+N V+
Sbjct: 67 VESFNTNKKNKFWLGVNQFADLTTEEFKA-----NKGFKPISAEMVPTTGFKYENLSVSA 121
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+PT++DWR KGAVT IK+QGQCG CWAFSAVAA+EGI +++ G LI LSEQ+LVDC T
Sbjct: 122 LPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHS 181
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
+ GC GG MD AFE++I+N GLATE+ YPY+ +G C + AATI +ED+P D
Sbjct: 182 MDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSKS--AATIKGHEDVPVND 239
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VDAS R F Y GV+ CG DHG+A +G+G E +G KY
Sbjct: 240 EAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGV--ESDGTKY 297
Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
W++KNSWG TWGE G++R+ +D G+CG+A SYP
Sbjct: 298 WILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYPT 337
>gi|350535639|ref|NP_001233949.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
gi|108937128|gb|ABG23376.1| phytophthora-inhibited protease 1 [Solanum lycopersicum]
Length = 345
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 233/347 (67%), Gaps = 14/347 (4%)
Query: 8 SFIIPMFVIIILVITCASQ---VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
+F + +++L+ + S +V+ R++ E S++E+HE WM HGR YKD++EK R
Sbjct: 4 NFFLKNITVVLLLFSILSLYPFIVTSRNLKELSMLERHENWMVHHGRVYKDDIEKEHRFK 63
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
FK+N+E+IE NK G + YKL N+++DLT EEF + G + + S ++ ++FK
Sbjct: 64 TFKENVEFIESFNKNGTQRYKLAVNKYADLTTEEFTTSFMGLDTSLLSQQESTATTTSFK 123
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
Y +VT+VP S+DWR++G+VT +KDQG CG CWAFSA AA+EG QI +LI LSEQQL+
Sbjct: 124 YDSVTEVPNSMDWRKRGSVTGVKDQGVCGCCWAFSAAAAIEGAYQIANNELISLSEQQLL 183
Query: 185 DCSTDNHGCSGGLMDKAFEYIIENK--GLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
DCST N GC GGLM A++++++N G+ TE +YPY + C + E+ A TI+ YE
Sbjct: 184 DCSTQNKGCEGGLMTVAYDFLLQNNGGGITTETNYPYEEAQNVC--KTEQPAAVTINGYE 241
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
+P DE +LL+AV NQP+SV + A+ FH Y SG+ + C + +H V V+G+GT+EE
Sbjct: 242 VVPS-DESSLLKAVVNQPISVGI-AANDEFHMYGSGIYDGSCNSRLNHAVTVIGYGTSEE 299
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDAGL----CGIATAASYPVA 345
+ G KYW++KNSWG WGE GY+RI RD G+ CGIA AS+P A
Sbjct: 300 D-GTKYWIVKNSWGSDWGEEGYMRIARDVGVDGGHCGIAKVASFPTA 345
>gi|38346003|emb|CAD40112.2| OSJNBa0035O13.5 [Oryza sativa Japonica Group]
gi|125589427|gb|EAZ29777.1| hypothetical protein OsJ_13835 [Oryza sativa Japonica Group]
Length = 339
Score = 345 bits (885), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 168/341 (49%), Positives = 228/341 (66%), Gaps = 17/341 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F I+ + C++ + + ++V +HE+WM Q+GR YKD EKA R IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN EFRA T +PS R P+TF+Y+NV+
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGF-IPSTVRV---PTTFRYENVSIDT 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+P ++DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE+ YPY +G C+ AATI YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNS--AATIKGYEDVPANN 240
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VD F FY GV+ CG + DHG+ +G+G ++ +G +Y
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQY 298
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
WL+KNSWG TWGE+G++R+ +D G+CG+A SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|242072398|ref|XP_002446135.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
gi|241937318|gb|EES10463.1| hypothetical protein SORBIDRAFT_06g002170 [Sorghum bicolor]
Length = 338
Score = 345 bits (884), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 165/339 (48%), Positives = 227/339 (66%), Gaps = 20/339 (5%)
Query: 16 IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
++ ++ CAS V++ R + + ++VE+HE WM ++GR YKD EKA R +FK N+ +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEVFKDNVAF 66
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTD 130
+E N N + LG N+F+DLT EEF+A N+ +S + + FKY+N V+
Sbjct: 67 VESFNTNKNNKFWLGINQFADLTIEEFKA-----NKGFKPISAEKVPTTGFKYENLSVSA 121
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+PT++DWR KGAVT IK+QGQCG CWAFSAVAA+EGI +++ G LI LSEQ+LVDC T
Sbjct: 122 LPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTHS 181
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
+ GC GG MD AFE++I+N GLAT + YPY+ +G C + AATI +ED+P D
Sbjct: 182 MDEGCEGGWMDSAFEFVIKNGGLATVSSYPYKAVDGKCKGGSKS--AATIKGHEDVPVND 239
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VDAS R F Y GV+ CG DHG+A +G+G E +G KY
Sbjct: 240 EAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGV--ESDGTKY 297
Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
W++KNSWG TWGE G++R+ +D G+CG+A SYP
Sbjct: 298 WILKNSWGTTWGEKGFLRMEKDISDKQGMCGLAMKPSYP 336
>gi|388497270|gb|AFK36701.1| unknown [Lotus japonicus]
Length = 343
Score = 345 bits (884), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 184/343 (53%), Positives = 240/343 (69%), Gaps = 16/343 (4%)
Query: 12 PMFVIIILVITCASQVVSGRSMHE--PSIVEK-HEQWMAQHGRTYKDELEKAMRLNIFKQ 68
P+ + ++ CA +S R++++ S+V K H+QWM Q+GR+Y ++ E R IF +
Sbjct: 6 PIIALCTMLWACAYTAMS-RTLYDETSSVVAKTHQQWMLQYGRSYTNDAEMEKRFKIFME 64
Query: 69 NLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
NLEYIEK N GN++YKL N+FSDLTNEEF A +TG S S R S +
Sbjct: 65 NLEYIEKFNNAPGNKSYKLDLNQFSDLTNEEFIASHTGLMIDPSKPSSSSKRASPASL-D 123
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
++D PTS+DWRE+GAVT +K+QG CGSCWAFSAVAAVEGI +I G LI LSEQQLVDC+
Sbjct: 124 LSDTPTSLDWREQGAVTDVKNQGNCGSCWAFSAVAAVEGIVKIKNGNLISLSEQQLVDCA 183
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
++ N GC GG MD AF YI EN G+A+E DY YR GTC N + AA IS YED+P
Sbjct: 184 SNEQNQGCGGGFMDNAFSYITEN-GIASENDYQYRGGAGTCQNNEMITPAARISGYEDVP 242
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
G++Q LL AVS QPVSV + A G++FH YK G+ + CG++ +HGV +VG+GT+EE+ G
Sbjct: 243 AGEDQLLL-AVSQQPVSVAI-AVGQSFHLYKEGIYSGPCGSSLNHGVTLVGYGTSEED-G 299
Query: 306 AKYWLIKNSWGETWGESGYIRILRDAGL----CGIATAASYPV 344
KYWLIKNSWGE+WGE+GY+R+LR++G CGIA AS+P
Sbjct: 300 TKYWLIKNSWGESWGENGYMRLLRESGQSEGHCGIAVKASHPT 342
>gi|20334373|gb|AAM19207.1|AF493232_1 cysteine protease [Solanum pimpinellifolium]
gi|400180424|gb|AFP73350.1| cysteine protease [Solanum pimpinellifolium]
gi|400180433|gb|AFP73354.1| cysteine protease [Solanum lycopersicum]
Length = 344
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 162/340 (47%), Positives = 236/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GGLM AF++IIEN G++ E+DY Y E+ TC +EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGLMTNAFDFIIENGGISRESDYEYLGEQYTC-RSREKTAAVQISSYKVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + +C + +H V +G+GT EE G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADQINHAVTAIGYGTDEE--GQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|302143415|emb|CBI21976.3| unnamed protein product [Vitis vinifera]
Length = 322
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 174/337 (51%), Positives = 225/337 (66%), Gaps = 32/337 (9%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++ V+ + + R++HE S+ E+HE WMAQ+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARNLHEASMYERHEDWMAQYGRVYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEF T NR + S+ ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFG---TSRNRFKAHIC--STEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
IDWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 IDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C+G A+YPY +GTC+ +K AA I+ YED+P +E+AL
Sbjct: 187 CNG-------------------ANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 227
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV +QP++V +DA G F FY SGV CG DHGVA VG+GT+++ G KYWL+K
Sbjct: 228 QKAVVHQPIAVAIDAGGFEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLVK 285
Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
NSWG WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 286 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 322
>gi|400180345|gb|AFP73311.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 344 bits (883), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 162/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC GG M AF++IIEN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +ENG K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DENGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180447|gb|AFP73360.1| cysteine protease [Solanum chilense]
Length = 345
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 164/341 (48%), Positives = 234/341 (68%), Gaps = 12/341 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK N
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDL 126
Query: 130 ---DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
D+P+++DWRE GAVT +K QGQCG CWAFSAV ++EG +I GKL+E SEQ+L+DC
Sbjct: 127 SDDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGKLMEFSEQELLDC 186
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
+T+N+GC+GG M AF++IIEN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+
Sbjct: 187 TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPE 245
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
G E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G
Sbjct: 246 G-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQ 301
Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
KYWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 KYWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 342
>gi|400180422|gb|AFP73349.1| cysteine protease [Solanum chmielewskii]
Length = 344
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 162/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISIFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKTNDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QGQCG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT EE G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYSGGTYDGSCADRINHAVTAIGYGTDEE--GQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180426|gb|AFP73351.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 161/340 (47%), Positives = 236/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +ENG K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DENGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180435|gb|AFP73355.1| cysteine protease [Solanum pennellii]
Length = 344
Score = 344 bits (882), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 162/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P VS + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYVSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C N +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 341
>gi|46401612|dbj|BAD16614.1| cysteine proteinase [Dianthus caryophyllus]
Length = 459
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 170/351 (48%), Positives = 232/351 (66%), Gaps = 13/351 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQV-VSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEK 59
+ KS I +F II +V + A + + R+ + P I +E W+ +HG+ Y EK
Sbjct: 1 MSTSKSTIFLLFSIIFIVSSSALDLSIIDRAFNRPDDEIASLYETWLVKHGKNYNGLGEK 60
Query: 60 AMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS-S 118
+R NIFK NL ++++ N E N ++KLG N F+DLTNEE+R++Y G +V+R S
Sbjct: 61 QLRFNIFKDNLRFVDERNSE-NLSFKLGLNRFADLTNEEYRSVYLGTRPRSVAVARSGRS 119
Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
+ + ++ +P S+DWR+KGAV IKDQG CGSCWAFSA+AAVEG+ QI G LI L
Sbjct: 120 KSDRYAFRAGDTLPESVDWRKKGAVAGIKDQGSCGSCWAFSAIAAVEGVNQIVTGDLISL 179
Query: 179 SEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
SEQ+LV+C T N GC GGLMD AFE+II+N+G+ ++ DYPY +G CD ++ A T
Sbjct: 180 SEQELVECDTSYNDGCDGGLMDYAFEFIIKNEGIDSDEDYPYTGRDGRCDTNRKNAKVVT 239
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I YED P DE++L +AV+NQPVSV ++ GR F Y SGV CG DHGVAVVG+
Sbjct: 240 IDDYEDSPVYDEKSLQKAVANQPVSVAIEGGGRDFQLYDSGVFTGKCGTALDHGVAVVGY 299
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
GT E+G YW+++NSWG+TWGE GYIR+ R+ +G+CGIA SYP+
Sbjct: 300 GT---EDGLDYWIVRNSWGDTWGEGGYIRMQRNTKLPSGICGIAIEPSYPI 347
>gi|400180367|gb|AFP73322.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (881), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 162/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|225428328|ref|XP_002279940.1| PREDICTED: cysteine proteinase-like [Vitis vinifera]
Length = 707
Score = 343 bits (881), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 163/311 (52%), Positives = 215/311 (69%), Gaps = 12/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ + E W+++HG+ YK EK R +F++NL +I++ NKE + +Y LG NEF+DL++E
Sbjct: 400 LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHE 458
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF++ Y G P R F+Y++V D+P S+DWR+KGAVTH+K+QG CGSCWA
Sbjct: 459 EFKSKYLGLRAEFP---RSRDYSGEFRYRDVADLPESVDWRKKGAVTHVKNQGACGSCWA 515
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N GC+GGLMD AF +I N GL E D
Sbjct: 516 FSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDD 575
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTC+ QKE TIS YED+P+ DE++LL+A+++QP+SV ++ASGR F FY
Sbjct: 576 YPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYS 635
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV N CG DHGVA VG+G+++ G Y ++KNSWG WGE GYIR+ R+ G
Sbjct: 636 GGVFNGPCGTELDHGVAAVGYGSSK---GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEG 692
Query: 333 LCGIATAASYP 343
LCGI ASYP
Sbjct: 693 LCGINKMASYP 703
>gi|400180355|gb|AFP73316.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (881), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 161/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|125547256|gb|EAY93078.1| hypothetical protein OsI_14879 [Oryza sativa Indica Group]
Length = 339
Score = 343 bits (880), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 167/341 (48%), Positives = 228/341 (66%), Gaps = 17/341 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F I+ + C++ + + ++V +HE+WM Q+GR YKD EKA R IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN EFRA T +PS R P+TF+Y+NV+
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNYEFRATKTNKGF-IPSTVRV---PTTFRYENVSIDT 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+P ++DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE+ YPY +G C+ AATI YE++P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNS--AATIKGYEEVPANN 240
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VD F FY GV+ CG + DHG+ +G+G ++ +G +Y
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQY 298
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
WL+KNSWG TWGE+G++R+ +D G+CG+A SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|356517310|ref|XP_003527331.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 343 bits (880), Expect = 6e-92, Method: Compositional matrix adjust.
Identities = 171/344 (49%), Positives = 231/344 (67%), Gaps = 10/344 (2%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
SF ++I+ LV+ + V R + E E+HE+WMAQ+GR YKD EK R +FK
Sbjct: 3 SFSQNHYLILFLVLAVWTSHVMSRRLSEACTSERHEKWMAQYGRVYKDAAEKEKRFQVFK 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+ +IE N G++ + L N+F+DL +EEF+AL + V ++S ++F+Y++
Sbjct: 63 NNVHFIESFNAAGDKPFNLSINQFADLNDEEFKALLINVQKKASWV--ETSTETSFRYES 120
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC- 186
VT +P +ID R++GAVT IKDQG+CGSCWAFSAVAA EGI QIT GKL+ LSEQ+LVDC
Sbjct: 121 VTKIPATIDRRKRGAVTPIKDQGRCGSCWAFSAVAATEGIHQITTGKLVPLSEQELVDCV 180
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
++ GC GG +D AFE+I + G+A+E YPY+ TC +KE A I YE +P
Sbjct: 181 KGESEGCIGGYVDDAFEFIAKKGGIASETHYPYKGVNKTCKVKKETHGVAEIKGYEKVPS 240
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENG 305
+E+ALL+AV+NQPVSV +DA AF +Y SG+ NA +CG + +H VAVVG+G A ++
Sbjct: 241 NNEKALLKAVANQPVSVYIDAGTHAFKYYSSGIFNARNCGTDPNHAVAVVGYGKALDD-- 298
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
+KYWL+KNSWG WGE GYIRI RD GLCGIA YP+A
Sbjct: 299 SKYWLVKNSWGTEWGERGYIRIKRDIRAKEGLCGIAKYPYYPIA 342
>gi|400180443|gb|AFP73358.1| cysteine protease, partial [Solanum habrochaites]
Length = 345
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 166/341 (48%), Positives = 236/341 (69%), Gaps = 13/341 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-FKYQNV 128
+++IE NK GN +YKLG NEF+D+T+EEF A +TG N P +S S PST FK ++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSEEFLAKFTGLNIPNSYLS-PSPMPSTEFKINDL 125
Query: 129 TD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+D +P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG +I G L+E SEQ+L+DC
Sbjct: 126 SDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDC 185
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
+T+N+GC+GG M AF++IIEN G++ E+DY Y ++ TC +Q K A IS Y+ +P+
Sbjct: 186 TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQG-KTAAVQISNYQVVPE 244
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
G E +LLQAV+ QPVS+ + AS FY G + C N +H V +G+GT +E G
Sbjct: 245 G-ETSLLQAVTKQPVSIGIAAS-HDLQFYAGGTYDGSCANRINHAVTAIGYGT--DEKGQ 300
Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
KYWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 301 KYWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|302143411|emb|CBI21972.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 343 bits (880), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 172/337 (51%), Positives = 224/337 (66%), Gaps = 34/337 (10%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++ V+ + + RS+HE S+ E+HE WM Q+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARSLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEFRA NR + S+ ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
+DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C+ +YPY +GTC+ +K AA I+ YED+P +E+AL
Sbjct: 187 CT---------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 225
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV++QP++V +DASG F FY SGV CG DHGVA VG+GT+++ G KYWL+K
Sbjct: 226 QKAVAHQPIAVAIDASGSEFQFYSSGVFTGQCGTELDHGVAAVGYGTSDD--GMKYWLVK 283
Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
NSW WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 284 NSWSTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320
>gi|374713649|gb|AEZ65082.1| cysteine protease [Carica papaya]
Length = 471
Score = 343 bits (880), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 167/317 (52%), Positives = 213/317 (67%), Gaps = 10/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E +++ +E W+ +HG+ Y EK R IFK NL ++++ N RTYKLG +F+DL
Sbjct: 45 EAHMMKMYEHWLVKHGKNYNAIGEKERRFEIFKDNLRFVDEQNSVPGRTYKLGLTKFADL 104
Query: 95 TNEEFRALYTGYNRPVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
TNEE+RA+Y G R + S+ K N D+P+ +DWREKGAVT +KDQGQCG
Sbjct: 105 TNEEYRAMYLGAKMEKKEKLRTERSQRYLHKAGNDDDLPSHVDWREKGAVTEVKDQGQCG 164
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
SCWAFS V +VEGI QI G LI LSEQ+LVDC N GC+GGLMD AFE+II+N G+
Sbjct: 165 SCWAFSTVGSVEGINQIVTGDLISLSEQELVDCDKAYNQGCNGGLMDYAFEFIIKNGGID 224
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
+EADYPYR + CD+ ++ A TI YED+P+ DE++L +AV+NQPVSV ++A GR F
Sbjct: 225 SEADYPYRASDNMCDSNRKNAHVVTIDGYEDVPENDEESLKKAVANQPVSVAIEAGGREF 284
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR--- 329
Y+SGV CG N DHGV VG+GT ENG YW+++NSWG WGESGYIR+ R
Sbjct: 285 QLYQSGVFTGRCGTNLDHGVVAVGYGT---ENGIDYWIVRNSWGPKWGESGYIRMERNVA 341
Query: 330 --DAGLCGIATAASYPV 344
D G CGIA ASYP
Sbjct: 342 STDTGKCGIAMEASYPT 358
>gi|297851334|ref|XP_002893548.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339390|gb|EFH69807.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 346
Score = 343 bits (879), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 180/348 (51%), Positives = 236/348 (67%), Gaps = 17/348 (4%)
Query: 10 IIPMFV-IIILVITC-ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
I+ MFV + IL ++ SQ S + HEP + E H+QWM + R Y DELEK MR ++FK
Sbjct: 4 ILFMFVSLTILSMSLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFK 63
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN--RPVPSVSRQSSRPSTFKY 125
+NL++IEK NK+G+RTYKLG NEF+D T EEF A +TG +PS ++ +
Sbjct: 64 KNLKFIEKFNKKGDRTYKLGVNEFADWTKEEFIATHTGLKGFNGIPSSEFVDEMIPSWNW 123
Query: 126 QNVTDV--PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
NV+DV P DWR +GAVT +K QGQCG CWAFS+VAAVEG+T+I G L+ LSEQQL
Sbjct: 124 -NVSDVAGPEIKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGGNLVSLSEQQL 182
Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
+DC + ++GC+GG+M AF YII+N+G+A+EA YPY+ EGTC + +A I ++
Sbjct: 183 LDCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQETEGTCRYNAKP--SAWIRGFQ 240
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAE 301
+P +E+ALL+AVS QPVSV +DA G F Y GV + CG + +H V VG+GT+
Sbjct: 241 TVPSNNERALLEAVSRQPVSVSIDADGPGFMHYSGGVYDEPYCGTDVNHAVTFVGYGTSP 300
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
E G KYWL KNSWGETWGE+GYIRI RD G+CG+A A YPVA
Sbjct: 301 E--GIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 346
>gi|116309130|emb|CAH66233.1| H0825G02.10 [Oryza sativa Indica Group]
Length = 339
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 167/341 (48%), Positives = 227/341 (66%), Gaps = 17/341 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F I+ + C++ + + ++V +HE+WM Q+GR YKD EKA R IFK N+ +
Sbjct: 8 LFAILSCLCLCSAVLAAREQSDHAAMVARHERWMEQYGRVYKDATEKARRFEIFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + L N+F+DLTN EFRA T +PS R P+TF+Y+NV+
Sbjct: 68 IESFNA-GNHKFWLSVNQFADLTNYEFRATKTNKGF-IPSTVRV---PTTFRYENVSIDT 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+P ++DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE+ YPY +G C+ AATI YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESKYPYTAADGKCNGGSNS--AATIKGYEDVPANN 240
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VD F FY GV+ CG + DHG+ +G+G ++ +G +Y
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIVAIGYG--KDGDGTQY 298
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
WL+KNSWG TWGE+G++R+ +D G+CG+A SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|400180389|gb|AFP73333.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 343 bits (879), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 161/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|18403438|ref|NP_565780.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|2342728|gb|AAB67626.1| cysteine proteinase [Arabidopsis thaliana]
gi|330253821|gb|AEC08915.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 345
Score = 342 bits (877), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 176/348 (50%), Positives = 236/348 (67%), Gaps = 15/348 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
S ++ + V+IIL + R++ E S+V+KHEQWMA+ R Y+DELEK MR ++
Sbjct: 3 SIMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
FK+NL++IE NK+GN++YKLG NEF+D TNEEF A++TG + + VS T
Sbjct: 63 FKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGL-KGLTEVSPSKVVAKTISS 121
Query: 126 Q--NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
Q NV+D V S DWR +GAVT +K QGQCG CWAFSAVAAVEG+ +I G L+ LSEQQ
Sbjct: 122 QTWNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQ 181
Query: 183 LVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
L+DC + + GC GG+M AF Y+++N+G+A+E DY Y+ +G C + AA IS +
Sbjct: 182 LLDCDREYDRGCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNARP--AARISGF 239
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
+ +P +E+ALL+AVS QPVSV +DA+G F Y GV + CG + +H V VG+GT++
Sbjct: 240 QTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQ 299
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
+ G KYWL KNSWGETWGE GYIRI RD G+CG+A A YPVA
Sbjct: 300 D--GTKYWLAKNSWGETWGEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|38345008|emb|CAD40026.2| OSJNBa0052O21.11 [Oryza sativa Japonica Group]
gi|125589414|gb|EAZ29764.1| hypothetical protein OsJ_13822 [Oryza sativa Japonica Group]
Length = 339
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 164/341 (48%), Positives = 229/341 (67%), Gaps = 17/341 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR Y+D+ EKA R +FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR + T +PS +R P+ F+Y+NV
Sbjct: 68 IESFNA-GNHNFWLGVNQFADLTNDEFRWMKTNKGF-IPSTTRV---PTGFRYENVNIDA 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+P ++DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE++YPY + C + A+I YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANN 240
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VD F FYK GV+ CG + DHG+ +G+G A + G KY
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASD--GTKY 298
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
WL+KNSWG TWGE+G++R+ +D G+CG+A SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|357113934|ref|XP_003558756.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like
[Brachypodium distachyon]
Length = 346
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 168/341 (49%), Positives = 227/341 (66%), Gaps = 16/341 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ I+ + C++ V++ R + + ++ +HEQWMAQ GR YKD EKA RL +FK N+
Sbjct: 10 LVAIVGCLCLCSTAVLAARELGDADNAMAARHEQWMAQFGRVYKDPAEKAHRLEVFKANV 69
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT- 129
+IE N E N + LG N+F+DLTN+EFRA T N+ + + + P+ FKY +V+
Sbjct: 70 AFIESFNAE-NHEFWLGANQFADLTNDEFRASKT--NKGIKQGGVRDA-PTGFKYSDVSI 125
Query: 130 -DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWR KGAVT IK+QGQCGSCWAFSAVAA EG+ +++ GKL+ LSEQ+LVDC
Sbjct: 126 DALPASVDWRTKGAVTPIKNQGQCGSCWAFSAVAATEGVVKLSTGKLVSLSEQELVDCDV 185
Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
+ GC GG MD AF++II+N GL TEA+YPY E+ C + + VAATI YED+P
Sbjct: 186 HGVDQGCMGGWMDDAFKFIIKNGGLTTEANYPYTGEDDKCKSNETVNVAATIKGYEDVPA 245
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
DE AL++AV++QPVSV VD F Y GV+ CG DHG+A +G+G NG
Sbjct: 246 NDESALMKAVAHQPVSVVVDGGDMTFQLYAGGVMTGSCGVEMDHGIAAIGYGAT--SNGT 303
Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
KYWL+KNSWG TWGE G++R+ +D G+CG+A SYP
Sbjct: 304 KYWLMKNSWGTTWGEKGFLRMAKDIPDKRGMCGLAMKPSYP 344
>gi|125547236|gb|EAY93058.1| hypothetical protein OsI_14861 [Oryza sativa Indica Group]
Length = 339
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 165/342 (48%), Positives = 231/342 (67%), Gaps = 19/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR Y+D+ EKA R +FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRVYRDDAEKARRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRP-VPSVSRQSSRPSTFKYQNVT-- 129
IE N GN + LG N+F+DLTN+EFR +T N+ +PS +R P+ F+Y+NV
Sbjct: 68 IESFNA-GNHNFWLGVNQFADLTNDEFR--WTKTNKGFIPSTTRV---PTGFRYENVNID 121
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
+P ++DWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 122 ALPATVDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVH 181
Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
++ GC GGLMD AF++II+N GL TE++YPY + C + A+I YED+P
Sbjct: 182 GEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPAN 239
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
+E AL++AV+NQPVSV VD F FYK GV+ CG + DHG+ +G+G A + G K
Sbjct: 240 NEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASD--GTK 297
Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
YWL+KNSWG TWGE+G++R+ +D G+CG+A SYP A
Sbjct: 298 YWLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|400180347|gb|AFP73312.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 342 bits (876), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC GG M AF++IIEN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180353|gb|AFP73315.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 161/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180365|gb|AFP73321.1| cysteine protease [Solanum peruvianum]
gi|400180395|gb|AFP73336.1| cysteine protease [Solanum peruvianum]
gi|400180405|gb|AFP73341.1| cysteine protease [Solanum peruvianum]
gi|400180409|gb|AFP73343.1| cysteine protease [Solanum peruvianum]
gi|400180411|gb|AFP73344.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (875), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180403|gb|AFP73340.1| cysteine protease [Solanum peruvianum]
gi|400180413|gb|AFP73345.1| cysteine protease [Solanum peruvianum]
gi|400180415|gb|AFP73346.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 161/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|20334375|gb|AAM19208.1|AF493233_1 cysteine protease [Solanum pennellii]
Length = 337
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 164/347 (47%), Positives = 235/347 (67%), Gaps = 17/347 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K + ++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK
Sbjct: 1 MAMKID---LMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKG 57
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK+N+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S P
Sbjct: 58 ERFMIFKENMKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS-----P 112
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
S + D+P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG +I G L+E SE
Sbjct: 113 SPINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSE 172
Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
Q+L+DC+T+N+GC+GG M AF++I EN G++ E+DY Y ++ TC +Q EK A IS
Sbjct: 173 QELLDCTTNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISS 231
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
Y+ +P+G E +LLQAV+ QPVS+ + AS + FY G + C N +H V +G+GT
Sbjct: 232 YQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT- 288
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
+E G KYWL+KNSWG +WGE G+++I+RD AGLC IA +SYP
Sbjct: 289 -DEKGQKYWLLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|400180373|gb|AFP73325.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 161/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HG YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGHVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QGQCG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC GG M AF++I EN G+++E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180451|gb|AFP73362.1| cysteine protease [Solanum chilense]
Length = 344
Score = 341 bits (875), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS E S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC GG M AF++I EN G+++E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180437|gb|AFP73356.1| cysteine protease [Solanum pennellii]
Length = 337
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 162/338 (47%), Positives = 231/338 (68%), Gaps = 14/338 (4%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S PS +
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLS-----PSPINDLSDD 121
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
D+P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG +I G L+E SEQ+L+DC+T+
Sbjct: 122 DMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCTTN 181
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N+GC+GG M AF++I EN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G E
Sbjct: 182 NYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG-E 239
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
+LLQAV+ QPVS+ + AS + FY G + C N +H V +G+GT +E G KYW
Sbjct: 240 TSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCANRINHAVTAIGYGT--DEKGQKYW 296
Query: 310 LIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
L+KNSWG +WGE G+++I+RD AGLC IA +SYP
Sbjct: 297 LLKNSWGTSWGEDGFMKIIRDSGNPAGLCDIAKVSSYP 334
>gi|116309178|emb|CAH66275.1| OSIGBa0147O06.5 [Oryza sativa Indica Group]
Length = 339
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 164/341 (48%), Positives = 228/341 (66%), Gaps = 17/341 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR YKD+ EKA R +FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR+ T +PS +R P+ F+Y+NV
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGF-IPSTTRV---PTGFRYENVNIDA 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+P ++DWR KG VT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE++YPY + C + A+I YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANN 240
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VD F FYK GV+ CG + DHG+ +G+G A + G KY
Sbjct: 241 EAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASD--GTKY 298
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
WL+KNSWG TWGE+G++R+ +D G+CG+A SYP A
Sbjct: 299 WLLKNSWGTTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 339
>gi|302143412|emb|CBI21973.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 170/337 (50%), Positives = 224/337 (66%), Gaps = 34/337 (10%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++ V+ + + R++HE S+ E+HE WM Q+GR YKD EK+ R IFK N+ IE
Sbjct: 12 LALLFVLAAWASQATARNLHEASMYERHEDWMVQYGREYKDADEKSKRYKIFKDNVARIE 71
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK +++YKL NEF+DLTNEEFRA NR + S+ ++FKY+NVT VP++
Sbjct: 72 SFNKAMDKSYKLSINEFADLTNEEFRA---SRNRFKAHIC--STEATSFKYENVTAVPST 126
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
+DWR+KGAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ G
Sbjct: 127 VDWRKKGAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQG 186
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C+ +YPY +GTC+ +K AA I+ YED+P +E+AL
Sbjct: 187 CT---------------------NYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKAL 225
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV++QP++V +DA G F FY SGV CG DHGV+ VG+GT+++ G KYWL+K
Sbjct: 226 QKAVAHQPIAVAIDAGGSEFQFYSSGVFTGQCGTELDHGVSAVGYGTSDD--GMKYWLVK 283
Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
NSWG WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 284 NSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 320
>gi|400180351|gb|AFP73314.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 161/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180428|gb|AFP73352.1| cysteine protease [Solanum corneliomuelleri]
Length = 344
Score = 341 bits (874), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180379|gb|AFP73328.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 341 bits (874), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 161/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VIT + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVITMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180445|gb|AFP73359.1| cysteine protease, partial [Solanum chilense]
Length = 345
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS E S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPELSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC GG M AF++I EN G+++E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|171702841|dbj|BAG16376.1| cysteine protease [Brassica rapa var. perviridis]
Length = 333
Score = 340 bits (873), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 160/315 (50%), Positives = 219/315 (69%), Gaps = 11/315 (3%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK-EGNRTYKLGTNEF 91
+ E ++ ++H +WM +HGR Y D EK R +FK+N+E IE+ N + T+KL N+F
Sbjct: 23 LDEVAMQKRHAEWMTEHGRVYADANEKNNRYAVFKRNVERIERLNDVQSGLTFKLAVNQF 82
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQ 149
+DLTNEEFR++YTG+ SV ++P++F+YQNV+ +P S+DWR+KGAVT IKDQ
Sbjct: 83 ADLTNEEFRSMYTGFKGN--SVLSSRTKPTSFRYQNVSSDALPVSVDWRKKGAVTPIKDQ 140
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENK 209
G CGSCWAFSAVAA+EG+ QI +GKLI LSEQ+LVDC T++ GC GGLMD AF Y I
Sbjct: 141 GLCGSCWAFSAVAAIEGVAQIKKGKLISLSEQELVDCDTNDGGCMGGLMDTAFNYTITIG 200
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
GL +E++YPY+ GTC+ K K +A +I +ED+P DE+AL++AV++ PVS+ +
Sbjct: 201 GLTSESNYPYKSTNGTCNFNKTKQIATSIKGFEDVPANDEKALMKAVAHHPVSIGIAGGD 260
Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
F FY SGV + +C + DHGV VG+G +NG KYW++KNSWG WGE GY+RI +
Sbjct: 261 IGFQFYSSGVFSGECTTHLDHGVTAVGYG--RSKNGLKYWILKNSWGPKWGERGYMRIKK 318
Query: 330 DA----GLCGIATAA 340
D G CG+A A
Sbjct: 319 DIKPKHGQCGLAMNA 333
>gi|400180453|gb|AFP73363.1| cysteine protease [Solanum chilense]
Length = 344
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPVSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180393|gb|AFP73335.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + K +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180407|gb|AFP73342.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180375|gb|AFP73326.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (872), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 IKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC GG M AF++I EN G+++E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCDGGFMTNAFDFIKENGGISSESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|297740489|emb|CBI30671.3| unnamed protein product [Vitis vinifera]
Length = 320
Score = 340 bits (872), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 176/337 (52%), Positives = 222/337 (65%), Gaps = 32/337 (9%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++++ ASQ +S R++HE S+ E+HE WM +GRTYKD EK R IFK+N+EYIE
Sbjct: 10 ITLLIMGVWASQALS-RTLHEVSMSERHEDWMGLYGRTYKDIAEKERRFKIFKENVEYIE 68
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
NK F+A GYN S +SS ++F+Y+NV VP+S
Sbjct: 69 SVNK--------------------FKASRNGYNM---SSRPRSSEITSFRYENVAAVPSS 105
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHG 192
+DWR+KGAVT IKDQGQCG CWAFSAVAA+EG+TQ+ G+LI LSEQ+LVDC T ++ G
Sbjct: 106 MDWRKKGAVTPIKDQGQCGCCWAFSAVAAMEGVTQLKTGELISLSEQELVDCDTSGEDQG 165
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GGLMD AFE+II N GL TEA+YPY+ + TC+ +K + AA I YED+P E AL
Sbjct: 166 CGGGLMDSAFEFIIGNGGLTTEANYPYKGVDATCNKKKAASSAAKIKNYEDVPANSEAAL 225
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
L+AV+ PVSV +DA G F FY SGV CG DHGV VG+G + ++G KYWL+K
Sbjct: 226 LKAVAQHPVSVAIDAGGSDFQFYSSGVFTGQCGTELDHGVTAVGYG--KTDDGTKYWLVK 283
Query: 313 NSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
NSWG WGE GYI + R D GLCGIA ASYP A
Sbjct: 284 NSWGTGWGEDGYIWMERDIGADEGLCGIAMEASYPTA 320
>gi|255557851|ref|XP_002519955.1| cysteine protease, putative [Ricinus communis]
gi|223541001|gb|EEF42559.1| cysteine protease, putative [Ricinus communis]
Length = 321
Score = 340 bits (872), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 180/350 (51%), Positives = 226/350 (64%), Gaps = 37/350 (10%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M L EK I + V+ T ASQ ++ + ++E ++VEKHEQWMA+HGRTY+D EK
Sbjct: 1 MALSLEKKLAIALLVVFS---TWASQAMARQLINEDALVEKHEQWMARHGRTYQDSEEKE 57
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK NLEYI+ NK N+TY+LG N F+DL++EE+ A YT PV
Sbjct: 58 RRFQIFKSNLEYIDNFNKASNQTYQLGLNNFADLSHEEYVATYTARKMPV---------- 107
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+VP SIDWR+ GAVT IK+Q QCG CWAFSA AAVEGI + G + LS
Sbjct: 108 ---------EVPESIDWRDHGAVTPIKNQYQCGCCWAFSAAAAVEGI--VANG--VSLSA 154
Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
QQL+DC +DN GC GG M+ AF YII+N+G+A E DYPY+ + C + + AA IS
Sbjct: 155 QQLLDCVSDNQGCKGGWMNNAFNYIIQNQGIALETDYPYQQMQQMCSS---RMAAAQISG 211
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDA-SGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFG 298
+ED+ DE+AL++AV+ QPVSV +DA S F YK GV A CGN H V +VG+G
Sbjct: 212 FEDVTPKDEEALMRAVAKQPVSVTIDATSNPNFKLYKEGVFTAAGCGNGHSHAVTLVGYG 271
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDAGL----CGIATAASYPV 344
T+E+ G KYWL KNSWGETWGESGY+R+ RD GL CGIA ASYP
Sbjct: 272 TSED--GTKYWLAKNSWGETWGESGYMRLQRDIGLEGGPCGIALYASYPT 319
>gi|20334377|gb|AAM19209.1|AF493234_1 cysteine protease [Solanum lycopersicum]
gi|400180431|gb|AFP73353.1| cysteine protease [Solanum lycopersicum]
Length = 345
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 163/341 (47%), Positives = 236/341 (69%), Gaps = 12/341 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-V 128
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK N +
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKKINDL 126
Query: 129 TD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC
Sbjct: 127 SDDYMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDC 186
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
+T+N+GC+GG M AF++IIEN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+
Sbjct: 187 TTNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYQVVPE 245
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
G E +LLQAV+ QPVS+ + AS + FY G + +C + +H V +G+GT EE G
Sbjct: 246 G-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGNCADRINHAVTAIGYGTDEE--GQ 301
Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
KYWL+KNSWG +WGE+GY++I+RD +GLC IA +SYP
Sbjct: 302 KYWLLKNSWGTSWGENGYMKIIRDSGDPSGLCDIAKMSSYP 342
>gi|400180357|gb|AFP73317.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (872), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 159/340 (46%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPSGLCDIAKMSSYP 341
>gi|400180455|gb|AFP73364.1| cysteine protease [Solanum peruvianum]
gi|400180459|gb|AFP73366.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|400180359|gb|AFP73318.1| cysteine protease [Solanum peruvianum]
gi|400180477|gb|AFP73375.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (871), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180457|gb|AFP73365.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (871), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180381|gb|AFP73329.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (871), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 159/340 (46%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC I +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|400180417|gb|AFP73347.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 340 bits (871), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 235/340 (69%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITVFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPLSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|224131910|ref|XP_002328138.1| predicted protein [Populus trichocarpa]
gi|222837653|gb|EEE76018.1| predicted protein [Populus trichocarpa]
Length = 349
Score = 340 bits (871), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 173/354 (48%), Positives = 231/354 (65%), Gaps = 18/354 (5%)
Query: 1 MVLKFEKSFIIPMFVIIIL--VITCASQVVSGRSMHEPSI---VEKHEQWMAQHGRTYKD 55
M L K+ + F + + V+ +V H S+ VE E W++ HG+ Y
Sbjct: 1 MALSVLKTSFLTFFASLFVCSVLAHDFSIVGYSPEHLTSVDKLVELFESWISGHGKAYNS 60
Query: 56 ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
EK R +FK+NL++I++ NKE +Y LG NEF+DL++EEF++ + G P R
Sbjct: 61 LEEKLHRFEVFKENLKHIDQRNKEVT-SYWLGLNEFADLSHEEFKSKFLGL---YPEFPR 116
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
+ S F Y++V D+P SIDWR+KGAVT +K+QG CGSCWAFS VAAVEGI QI G L
Sbjct: 117 KKSS-EDFSYRDVVDLPKSIDWRKKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVAGNL 175
Query: 176 IELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
LSEQQL+DC T N+GC+GGLMD AFE+I+ N GL E DYPY EEGTCD ++E+
Sbjct: 176 TSLSEQQLIDCDTSFNNGCNGGLMDYAFEFIVNNGGLHKEEDYPYLMEEGTCDEKREEME 235
Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
TIS Y D+P+ DEQ+LL+A+++QP+SV +DASGR F FY GV + CG + DHGVA
Sbjct: 236 VVTISGYHDVPRNDEQSLLKALAHQPLSVAIDASGRDFQFYSGGVFSGPCGTDLDHGVAA 295
Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
VG+G++ +G Y ++KNSWG WGE GY+R+ R+ GLCGI ASYP
Sbjct: 296 VGYGSS---SGIDYIIVKNSWGPKWGERGYLRMKRNTGKPEGLCGINKMASYPT 346
>gi|146216004|gb|ABQ10204.1| cysteine protease Cp6 [Actinidia deliciosa]
Length = 461
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 168/319 (52%), Positives = 217/319 (68%), Gaps = 12/319 (3%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
S + ++ +E W+ +HG++Y EK R IFK NL +I++ N E +RTYK+G N F
Sbjct: 36 SRTDDEVMAMYESWLVKHGKSYNAIGEKEKRFQIFKDNLRFIDEHNAE-SRTYKVGLNRF 94
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQ 149
+DLTN+E+R++Y G S R S++ + +Y V +P S+DWREKGAV +KDQ
Sbjct: 95 ADLTNDEYRSMYLGAR--TGSRRRLSTQKRSDRYVPVAGESLPDSVDWREKGAVVGVKDQ 152
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIEN 208
G CGSCWAFS +AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II+N
Sbjct: 153 GSCGSCWAFSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKN 212
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
G+ TE DYPY +G CD ++ A TI YED+P +EQAL +AV+NQPVSV ++AS
Sbjct: 213 GGIDTEEDYPYNARDGRCDQYRKNAKVVTIDDYEDVPVNNEQALQKAVANQPVSVAIEAS 272
Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
G AF FY+SGV +CG DHGV VG+GT EN YW++KNSWG +WGESGYIR+
Sbjct: 273 GMAFQFYESGVFTGNCGTALDHGVTAVGYGT---ENSVDYWIVKNSWGSSWGESGYIRME 329
Query: 329 RDAGL---CGIATAASYPV 344
R+ G CGIA SYP+
Sbjct: 330 RNTGATGKCGIAVEPSYPI 348
>gi|242072392|ref|XP_002446132.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
gi|241937315|gb|EES10460.1| hypothetical protein SORBIDRAFT_06g002150 [Sorghum bicolor]
Length = 337
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 166/341 (48%), Positives = 227/341 (66%), Gaps = 23/341 (6%)
Query: 16 IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
++ ++ CAS V++ R + + ++VE+HE WM ++GR YKD EKA R FK N+ +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFEAFKHNVAF 66
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-FKYQN--VT 129
+E N + LG N+F+DLT EEF+A G+ V P+T FKY+N V+
Sbjct: 67 VESFNTNKKNKFWLGVNQFADLTTEEFKA-NKGFKPTAEKV------PTTGFKYENLSVS 119
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+PT++DWR KGAVT IK+QGQCG CWAFSAVAA+EGI +++ G LI LSEQ+LVDC T
Sbjct: 120 ALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTGNLISLSEQELVDCDTH 179
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
+ GC GG MD AFE++I+N GLATE++YPY+ +G C + AATI +ED+P
Sbjct: 180 SMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSKS--AATIKGHEDVPVN 237
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
+E AL++AV+NQPVSV VDAS R F Y GV+ CG DHG+A +G+G E +G K
Sbjct: 238 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGM--ESDGTK 295
Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
YW++KNSWG TWGE G++R+ +D G+CG+A SYP
Sbjct: 296 YWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 336
>gi|357458909|ref|XP_003599735.1| Cysteine proteinase [Medicago truncatula]
gi|357474677|ref|XP_003607623.1| Cysteine proteinase [Medicago truncatula]
gi|355488783|gb|AES69986.1| Cysteine proteinase [Medicago truncatula]
gi|355508678|gb|AES89820.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 167/343 (48%), Positives = 223/343 (65%), Gaps = 11/343 (3%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
+FIIPMF +I V+S R + EP + KHE+WM Q G++YKD EK R IFK
Sbjct: 6 NFIIPMF--LIFTTWMLPYVMSSRVL-EPYLSNKHEKWMTQFGKSYKDAAEKEKRFQIFK 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+E+IE N GN+ + L N F+DLTNEEF+A G N+ + + ++F+Y N
Sbjct: 63 NNVEFIELFNAVGNKPFNLSINHFADLTNEEFKASLNG-NKKLHDKFDILNETTSFRYHN 121
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
VT VP S+DWR++GAVT IK+QG CGSCWAFS VA++EGI QIT G+L+ LSEQ+L+DC
Sbjct: 122 VTSVPASMDWRKRGAVTPIKNQGSCGSCWAFSTVASIEGIHQITTGELVSLSEQELIDCV 181
Query: 188 TDNH-GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GCSGG ++ AF++I + G+A+E +YPY+ + C +KE A I YE +P
Sbjct: 182 RGNSSGCSGGYLEDAFKFIAKKGGMASETNYPYKETDEKCKFKKESKHVAEIKGYEKVPS 241
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
E LL+AV+NQPVSV VDA F FY G+ CG + DH V +VG+G + +
Sbjct: 242 NSENDLLKAVANQPVSVYVDAGDYVFQFYSGGIFTGKCGTDTDHVVTIVGYGVSLDY--T 299
Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
+YWL+KNSWG WGE GY+++ R+ GLCGIAT SYPVA
Sbjct: 300 EYWLVKNSWGTGWGEKGYMKLKRNVDSKKGLCGIATNPSYPVA 342
>gi|400180467|gb|AFP73370.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180363|gb|AFP73320.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 339 bits (869), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + K +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180349|gb|AFP73313.1| cysteine protease [Solanum peruvianum]
gi|400180469|gb|AFP73371.1| cysteine protease [Solanum peruvianum]
gi|400180471|gb|AFP73372.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + K +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180383|gb|AFP73330.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 159/340 (46%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++IIEN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIIENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|400180369|gb|AFP73323.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 159/340 (46%), Positives = 234/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y ++ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGQQYTCRSQ-EKTAAVQISSYKVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|195628596|gb|ACG36128.1| vignain precursor [Zea mays]
Length = 362
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 166/354 (46%), Positives = 234/354 (66%), Gaps = 24/354 (6%)
Query: 8 SFIIPMFVIIILVITCASQVVS--------GRSM--HEPSIVEKHEQWMAQHGRTYKDEL 57
+ ++ + VI I C + V + GR+ E ++ ++++WMAQ+ R YKD+
Sbjct: 15 TLMLLLCVIAIADCICQAAVAARVEPSTTVGRTTGGDEAMMMARYKKWMAQYRRKYKDDA 74
Query: 58 EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRP--VPSVSR 115
EKA R +FK N E+I+++N G + Y LGTN+F+DLT++EF A+YTG +P VPS ++
Sbjct: 75 EKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADLTSKEFAAMYTGLRKPAAVPSGAK 134
Query: 116 QSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRG 173
Q P+ FKYQN T D +DWR++GAVT +K+QGQCG CWAFSAV A+EG+ IT G
Sbjct: 135 QI--PAGFKYQNFTRLDDDVQVDWRQQGAVTPVKNQGQCGCCWAFSAVGAMEGLIMITTG 192
Query: 174 KLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKE 231
L+ LSEQQ++DC S N GC+GG MD AF+Y++ N G+ TE YPY +GTC N +
Sbjct: 193 NLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVVNNGGVTTEDAYPYSAVQGTCQNVQP 252
Query: 232 KAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDH 290
AATIS ++DLP GDE AL AV+NQPVSV VD F FY+ G+ + D CG + +H
Sbjct: 253 ---AATISGFQDLPSGDENALANAVANQPVSVGVDGGSSPFQFYQGGIYDGDGCGTDMNH 309
Query: 291 GVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
V +G+G ++ G +YW++KNSWG WGE+G++++ G CGI+T ASYP
Sbjct: 310 AVTAIGYGA--DDQGTQYWILKNSWGTGWGENGFMQLQMGVGACGISTMASYPT 361
>gi|302816909|ref|XP_002990132.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
gi|300142145|gb|EFJ08849.1| hypothetical protein SELMODRAFT_428615 [Selaginella moellendorffii]
Length = 358
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 160/310 (51%), Positives = 210/310 (67%), Gaps = 12/310 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+WM HGR Y EK R IF+ N EYIE+ N++ N+TY LG N F+D+T++EF+A
Sbjct: 34 YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKA 93
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
LY G P+ + + S F+Y++ T++P DWR KGAV +K+QG CGSCWAFS V
Sbjct: 94 LYFGTKVPLSNTIK-----SGFRYEDATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTV 148
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
AAVEG+ QI G+L+ LSEQ+LVDC N GC+GGLMD AFE+II+N GL +EADYPY+
Sbjct: 149 AAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYK 208
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
G+CD + + TI +ED+P E LL+AV+NQPVSV ++ASGR F Y GV
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVY 268
Query: 281 NADCGNNCDHGVAVVGFGTAEEENG--AKYWLIKNSWGETWGESGYIRILRDA----GLC 334
CG DHGV VG+GT++ +G YW+++NSWG+ WGESGYIR+ R+ G C
Sbjct: 269 TGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASSRGKC 328
Query: 335 GIATAASYPV 344
GIA ASYPV
Sbjct: 329 GIAMMASYPV 338
>gi|400180385|gb|AFP73331.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 159/340 (46%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC I +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|400180463|gb|AFP73368.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 159/340 (46%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|30690594|ref|NP_564321.2| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|28393492|gb|AAO42167.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332192920|gb|AEE31041.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 338 bits (867), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 176/347 (50%), Positives = 232/347 (66%), Gaps = 19/347 (5%)
Query: 13 MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
+F+++ L I SQ S + HEP + E H+QWM + R Y DELEK MR ++FK+
Sbjct: 14 LFMLVSLTILSMNLKVSQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKK 73
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN--RPVPSVSRQSSRPSTFKYQ 126
NL++IEK NK+G+RTYKLG NEF+D T EEF A +TG +PS ++ +
Sbjct: 74 NLKFIEKFNKKGDRTYKLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW- 132
Query: 127 NVTDVP--TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
NV+DV + DWR +GAVT +K QGQCG CWAFS+VAAVEG+T+I L+ LSEQQL+
Sbjct: 133 NVSDVAGRETKDWRYEGAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLL 192
Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC + ++GC+GG+M AF YII+N+G+A+EA YPY+ EGTC + +A I ++
Sbjct: 193 DCDRERDNGCNGGIMSDAFSYIIKNRGIASEASYPYQAAEGTCRYNGKP--SAWIRGFQT 250
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEE 302
+P +E+ALL+AVS QPVSV +DA G F Y GV + CG N +H V VG+GT+ E
Sbjct: 251 VPSNNERALLEAVSKQPVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPE 310
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
G KYWL KNSWGETWGE+GYIRI RD G+CG+A A YPVA
Sbjct: 311 --GIKYWLAKNSWGETWGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 355
>gi|400180465|gb|AFP73369.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++E +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEVAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPAGLCDIAKMSSYP 341
>gi|400180461|gb|AFP73367.1| cysteine protease [Solanum peruvianum]
gi|400180473|gb|AFP73373.1| cysteine protease [Solanum peruvianum]
gi|400180475|gb|AFP73374.1| cysteine protease [Solanum peruvianum]
gi|400180479|gb|AFP73376.1| cysteine protease [Solanum peruvianum]
gi|400180481|gb|AFP73377.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 160/340 (47%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|302816222|ref|XP_002989790.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
gi|300142356|gb|EFJ09057.1| hypothetical protein SELMODRAFT_184826 [Selaginella moellendorffii]
Length = 358
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 160/310 (51%), Positives = 210/310 (67%), Gaps = 12/310 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+WM HGR Y EK R IF+ N EYIE+ N++ N+TY LG N F+D+T++EF+A
Sbjct: 34 YEKWMVDHGRVYNGIGEKERRFQIFRDNAEYIEEHNRQVNQTYWLGLNNFADMTHDEFKA 93
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
LY G P+ + + S F+Y++ T++P DWR KGAV +K+QG CGSCWAFS V
Sbjct: 94 LYFGTKVPLSNTIK-----SGFRYKDATNLPLDTDWRSKGAVATVKNQGACGSCWAFSTV 148
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
AAVEG+ QI G+L+ LSEQ+LVDC N GC+GGLMD AFE+II+N GL +EADYPY+
Sbjct: 149 AAVEGVNQIVTGELVSLSEQELVDCDKQKNQGCNGGLMDSAFEFIIQNGGLDSEADYPYK 208
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
G+CD + + TI +ED+P E LL+AV+NQPVSV ++ASGR F Y GV
Sbjct: 209 AVSGSCDESRRNSHVVTIDGFEDVPAESEADLLKAVANQPVSVAIEASGRNFQLYSGGVY 268
Query: 281 NADCGNNCDHGVAVVGFGTAEEENG--AKYWLIKNSWGETWGESGYIRILRDA----GLC 334
CG DHGV VG+GT++ +G YW+++NSWG+ WGESGYIR+ R+ G C
Sbjct: 269 TGHCGYELDHGVVAVGYGTSKTPDGVATDYWIVRNSWGDAWGESGYIRLQRNVASPRGKC 328
Query: 335 GIATAASYPV 344
GIA ASYPV
Sbjct: 329 GIAMMASYPV 338
>gi|356533293|ref|XP_003535200.1| PREDICTED: LOW QUALITY PROTEIN: cysteine proteinase RD21a-like
[Glycine max]
Length = 466
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 163/314 (51%), Positives = 213/314 (67%), Gaps = 17/314 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E W+ +HG+ Y EK R IFK NL +IE+ N G+++YKLG N+F+DLTNEE+RA
Sbjct: 48 YEAWLVKHGKAYNALGEKERRFKIFKDNLRFIEEHNGAGDKSYKLGLNKFADLTNEEYRA 107
Query: 102 LYTGYNRPVPS-----VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
++ G P V++++ R + Y+ ++P +DWREKGAVT IKDQGQCGSCW
Sbjct: 108 MFLGTRTRGPKNKAAVVAKKTDR---YAYRAGEELPAMVDWREKGAVTPIKDQGQCGSCW 164
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
AFS V AVEGI QI G L LSEQ+LVDC N GC+GGLMD AFE+I++N G+ TE
Sbjct: 165 AFSTVGAVEGINQIVTGNLTSLSEQELVDCDRGYNMGCNGGLMDYAFEFIVQNGGIDTEE 224
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
DYPY ++ TCD ++ A TI YED+P DE++L++AV+NQPVSV ++A G F Y
Sbjct: 225 DYPYHAKDNTCDPNRKNARVVTIDGYEDVPTNDEKSLMKAVANQPVSVAIEAGGMEFQLY 284
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----- 330
+SGV CG N DHGV VG+GT ENG YWL++NSWG WGE+GYI++ R+
Sbjct: 285 QSGVFTGRCGTNLDHGVVAVGYGT---ENGTDYWLVRNSWGSAWGENGYIKLERNVQNTE 341
Query: 331 AGLCGIATAASYPV 344
G CGIA ASYP+
Sbjct: 342 TGKCGIAIEASYPI 355
>gi|225458701|ref|XP_002284973.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 467
Score = 338 bits (866), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 170/359 (47%), Positives = 230/359 (64%), Gaps = 23/359 (6%)
Query: 2 VLKFEKSFIIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTY 53
+ + S + +F+++ L ++ H + ++ +E W+A+HG++Y
Sbjct: 3 LCRSSSSMAVFLFLLLGLASALDMSIIGYDETHGDKSSWRTDEDVMAVYEAWLAKHGKSY 62
Query: 54 KDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSV 113
EK R IFK NL +I++ N E NRTYK+G N F+DLTNEE+R++Y G +
Sbjct: 63 NALGEKERRFQIFKDNLRFIDEHNAE-NRTYKVGLNRFADLTNEEYRSMYLGTR---TAA 118
Query: 114 SRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQIT 171
R+SS + +Y V D +P S+DWR+KGAV +KDQG CGSCWAFS +AAVEGI +I
Sbjct: 119 KRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTIAAVEGINKIV 178
Query: 172 RGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQK 230
G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ +E DYPY+ +G CD +
Sbjct: 179 TGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYKASDGRCDQYR 238
Query: 231 EKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDH 290
+ A TI YED+P+ DE++L +AV+NQPVSV ++A GR F Y+SG+ CG DH
Sbjct: 239 KNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIFTGRCGTALDH 298
Query: 291 GVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
GV VG+GT ENG YW++KNSWG +WGE GYIR+ RD G CGIA ASYP+
Sbjct: 299 GVTAVGYGT---ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCGIAMEASYPI 354
>gi|400180419|gb|AFP73348.1| cysteine protease [Solanum lycopersicoides]
Length = 343
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 241/350 (68%), Gaps = 17/350 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M +K + ++ + + + VI+ + + RS + S+ E+HE WM++HGR YKDE+EK
Sbjct: 1 MAMKID---LMSILITLFFVISMFNSQTTARSQPKLSVSERHELWMSRHGRVYKDEVEKG 57
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK+N+++IE NK GN +YKLG NEF+D+T+EEF +TG N +PS S
Sbjct: 58 ERFMIFKENMKFIESVNKAGNLSYKLGINEFADITSEEFLTKFTGIN--IPSYLSPSPMS 115
Query: 121 ST-FKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIE 177
ST FK +++D +P+++DWRE GAVT +K+QGQCG CWAFSAV ++EG +I G L+E
Sbjct: 116 STEFKINDLSDDDMPSNLDWRESGAVTQVKNQGQCGCCWAFSAVGSLEGAYKIATGNLME 175
Query: 178 LSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
SEQ+L+DC+T+N+GC+GG M AF++I EN G+++E+DY Y+ ++ TC +Q EK A
Sbjct: 176 FSEQELLDCTTNNYGCNGGFMTNAFDFIKENGGISSESDYEYQGQQYTCRSQ-EKTAAVQ 234
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
IS Y+ +P+G E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+
Sbjct: 235 ISSYQVVPEG-ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGY 292
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
GT +E G KYWL+KNSWG +WGE+G+++I+RD+ G C IA +SYP
Sbjct: 293 GT--DEKGQKYWLLKNSWGTSWGENGFMKIIRDSGNPGGHCDIAKMSSYP 340
>gi|356515052|ref|XP_003526215.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 164/343 (47%), Positives = 226/343 (65%), Gaps = 11/343 (3%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
+SF ++I+ L++T + V R + E E+HE+WMAQ+G+ Y D EK R IF
Sbjct: 2 RSFSQNHYLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIF 61
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
K N+++IE N G++ + L N+F+DL NEEF+A + V +++ ++F+Y+
Sbjct: 62 KNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGV--ETATETSFRYE 119
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
++T +P ++DWR++GAVT IKDQG CGSCWAFS VAA+EGI QIT GKL+ LSEQ+LVDC
Sbjct: 120 SITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSTVAAIEGIHQITTGKLVSLSEQELVDC 179
Query: 187 -STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ GC+ G ++AFE++ +N GLA+E YPY+ TC +KE A I YE++P
Sbjct: 180 VKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVP 239
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
E+ALL+AV+NQPVSV +DA A FY SG+ CG +H V V+G+G A G
Sbjct: 240 SNSEKALLKAVANQPVSVYIDAG--ALQFYSSGIFTGKCGTAPNHAVTVIGYGKA--RGG 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
AKYWL+KNSWG WGE GYI++ RD GLCGIAT ASYP
Sbjct: 296 AKYWLVKNSWGTKWGEKGYIKMKRDIRAKEGLCGIATNASYPT 338
>gi|310656790|gb|ADP02219.1| Peptidase_C1 domain-containing protein [Triticum aestivum]
Length = 419
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 162/323 (50%), Positives = 212/323 (65%), Gaps = 13/323 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ II + C+S V+S R + + ++VEKHEQWMA+ R YKD EKA R FK N+ +
Sbjct: 8 LLAIIGSICLCSSTVLSARELGDAAMVEKHEQWMAKFNRVYKDSTEKAQRFKAFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR-PSTFKYQNVTD- 130
IE N GN + LG N+F+DLTN+EFRA T + R +R P+ FKY NV+
Sbjct: 68 IESFN-TGNHKFWLGVNQFTDLTNDEFRATKTN-----KGLKRNGARAPTRFKYNNVSTD 121
Query: 131 -VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P ++DWR KG VT IKDQGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC
Sbjct: 122 ALPAAVDWRTKGVVTPIKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVH 181
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
+ GC GG MD AF++II+N GL TEA+YPY ++G C ATI YED+P
Sbjct: 182 GVDQGCEGGEMDNAFKFIIKNGGLTTEANYPYTAQDGQCKTSTTSNSVATIKGYEDVPAN 241
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
DE +L++AV+NQPVSV VD F Y GV+ CG + DHG+ +G+G + G K
Sbjct: 242 DESSLMKAVANQPVSVAVDGGDVIFQHYSGGVMTGSCGTDLDHGIVAIGYGMTSD--GTK 299
Query: 308 YWLIKNSWGETWGESGYIRILRD 330
+WL+KNSWG TWGESGY+R+ +D
Sbjct: 300 FWLLKNSWGTTWGESGYLRMEKD 322
>gi|400180371|gb|AFP73324.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 159/340 (46%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGNPSGLCDIAKMSSYP 341
>gi|147790682|emb|CAN61026.1| hypothetical protein VITISV_001146 [Vitis vinifera]
Length = 469
Score = 337 bits (864), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 166/318 (52%), Positives = 217/318 (68%), Gaps = 15/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ ++ +E W+A+HG++Y EK R IFK NL +I++ N E NRTYK+G N F+DL
Sbjct: 46 DEDVMAVYEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE-NRTYKVGLNRFADL 104
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQC 152
TNEE+R++Y G + R+SS + +Y V D +P S+DWR+KGAV +KDQG C
Sbjct: 105 TNEEYRSMYLGTR---TAAKRRSSNKISDRYAFRVGDSLPESVDWRKKGAVVEVKDQGSC 161
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS +AAVEGI +I G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+
Sbjct: 162 GSCWAFSTIAAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGI 221
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
+E DYPY+ +G CD ++ A TI YED+P+ DE++L +AV+NQPVSV ++A GR
Sbjct: 222 DSEEDYPYKASDGRCDQYRKNAXVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGRE 281
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
F Y+SG+ CG DHGV VG+GT ENG YW++KNSWG +WGE GYIR+ RD
Sbjct: 282 FQLYQSGIFTGRCGTALDHGVTAVGYGT---ENGVDYWIVKNSWGASWGEEGYIRMERDL 338
Query: 331 ----AGLCGIATAASYPV 344
G CGIA ASYP+
Sbjct: 339 ATSATGKCGIAMEASYPI 356
>gi|400180399|gb|AFP73338.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 159/340 (46%), Positives = 233/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDIAKMSSYP 341
>gi|224083868|ref|XP_002307151.1| predicted protein [Populus trichocarpa]
gi|222856600|gb|EEE94147.1| predicted protein [Populus trichocarpa]
Length = 298
Score = 337 bits (864), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 165/314 (52%), Positives = 217/314 (69%), Gaps = 22/314 (7%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+ E+HEQWMAQ+GR YKD+ EK R NIFK+N+ I+ N + ++Y LG N+F+DL+NE
Sbjct: 1 MYERHEQWMAQYGRVYKDDAEKETRYNIFKENVARIDAFNSQTGKSYNLGVNQFADLSNE 60
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+A NR + + P F+Y+NV+ VP ++DWR+KGAVT +KDQGQC
Sbjct: 61 EFKA---SRNRFKGHMCSPQAGP--FRYENVSAVPATMDWRKKGAVTPVKDQGQC----- 110
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEA 215
VAA+EGI Q+T GKLI LSEQ++VDC T ++ GC+GGLMD AF++I +NKGL TEA
Sbjct: 111 ---VAAMEGINQLTTGKLISLSEQEVVDCDTKGEDQGCNGGLMDDAFKFIEQNKGLTTEA 167
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
+YPY +GTC+ QKE + AA I+ ++D+P E AL++AV+ QPVSV +DA G F FY
Sbjct: 168 NYPYTGTDGTCNTQKEVSHAAKITGFQDVPANSEAALMKAVAKQPVSVAIDAGGFEFQFY 227
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----A 331
SG+ CG DHGV VG+G ++ G KYWL+KNSWG WGE GYIR+ +D
Sbjct: 228 SSGIFTGSCGTELDHGVTAVGYGGSD---GTKYWLVKNSWGAQWGEEGYIRMQKDISAKE 284
Query: 332 GLCGIATAASYPVA 345
GLCGIA ASYP A
Sbjct: 285 GLCGIAMQASYPTA 298
>gi|357160599|ref|XP_003578815.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 167/341 (48%), Positives = 227/341 (66%), Gaps = 19/341 (5%)
Query: 15 VIIILVITC-ASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
++ IL C S V++ R +++ S+ +HE WMAQ+GR YKD EKA + +FK N +
Sbjct: 8 ILAILGCLCFCSSVLAARELNDDLSMAARHETWMAQYGRVYKDAAEKAQKFEVFKANARF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTD 130
I+ N E N + LG N+F+DLTNEEF+A T N+ +S ++ + FKY+N +
Sbjct: 68 IDSFNAE-NHKFWLGINQFADLTNEEFKATKT--NKGF--ISNKARVSTGFKYENLKIEA 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+PTSIDWR KGAVT +KDQGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC
Sbjct: 123 LPTSIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHG 182
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
++ GC GGLMD AF++II N GL E+ YPY E+G C + + A TI YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSKS--AGTIKSYEDVPANN 240
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VD F FY GV+ CG + DHG+A +G+G + G K+
Sbjct: 241 EGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSD--GTKF 298
Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
WL+KNSWG TWGE+G++R+ +D G+CG+A SYP A
Sbjct: 299 WLMKNSWGTTWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|356515038|ref|XP_003526208.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 339
Score = 337 bits (863), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 164/343 (47%), Positives = 225/343 (65%), Gaps = 11/343 (3%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
+SF ++I+ L++T + V R + E E+HE+WMAQ+G+ Y D EK R IF
Sbjct: 2 RSFSQNHYLILFLILTVWTFHVMSRRLSEVCTSERHEKWMAQYGKLYTDAAEKEKRFQIF 61
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
K N+++IE N G++ + L N+F+DL NEEF+A + V +++ ++F+Y+
Sbjct: 62 KNNVQFIESFNAAGDKPFNLSINQFADLHNEEFKASLINVQKKESGV--ETATETSFRYE 119
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
++T +P ++DWR++GAVT IKDQG CGSCWAFS VAA+EGI QIT GKL+ LSEQ+LVDC
Sbjct: 120 SITKIPVTMDWRKRGAVTPIKDQGNCGSCWAFSIVAAIEGIHQITTGKLVSLSEQELVDC 179
Query: 187 -STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ GC+ G ++AFE++ +N GLA+E YPY+ TC +KE A I YE++P
Sbjct: 180 VKGKSEGCNFGYKEEAFEFVAKNGGLASEISYPYKANNKTCMVKKETQGVAQIKGYENVP 239
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
E+ALL+AV+NQPVSV +DA A FY SG+ CG +H V+G+G A G
Sbjct: 240 SNSEKALLKAVANQPVSVYIDAG--ALQFYSSGIFTGKCGTAPNHAATVIGYGKA--RGG 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
AKYWL+KNSWG WGE GYIR+ RD GLCGIAT ASYP
Sbjct: 296 AKYWLVKNSWGTKWGEKGYIRMKRDIRAKEGLCGIATNASYPT 338
>gi|50355623|dbj|BAD29960.1| cysteine protease [Daucus carota]
Length = 460
Score = 336 bits (862), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 160/312 (51%), Positives = 210/312 (67%), Gaps = 9/312 (2%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I+ +E W+ +HG++Y EK R IFK N YI++ N +R++KLG N F+DLTNE
Sbjct: 40 IMAAYESWLVKHGKSYNALGEKEQRFQIFKDNFLYIDEQNAAKDRSFKLGLNRFADLTNE 99
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
E+R+ YTG R S + S + + +P S+DWRE GAV +KDQGQCGSCWA
Sbjct: 100 EYRSKYTGI-RTKDSRKKVSGKSQRYASLAGESLPESVDWREHGAVASVKDQGQCGSCWA 158
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS ++AVEGI QI GKLI LSEQ+LVDC N GC+GGLMD AF++II N G+ ++AD
Sbjct: 159 FSTISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDDAFQFIINNGGIDSDAD 218
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY +G CD ++ A TI YED+P+ DE+AL +A +NQP+SV ++ASGR F FY
Sbjct: 219 YPYTGRDGQCDQYRKNAKVVTIDSYEDVPEYDEKALQKAAANQPISVAIEASGRDFQFYD 278
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAG 332
SG+ CG + DHGV VVG+GT ENG YW+++NSWG WGE GY+R+ R AG
Sbjct: 279 SGIFTGKCGTDLDHGVVVVGYGT---ENGKDYWIVRNSWGADWGEKGYLRMERGISSKAG 335
Query: 333 LCGIATAASYPV 344
+CGI + SYPV
Sbjct: 336 ICGITSEPSYPV 347
>gi|110737404|dbj|BAF00646.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 345
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 174/348 (50%), Positives = 234/348 (67%), Gaps = 15/348 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
S ++ + V+IIL + R++ E S+V+KHEQWMA+ R Y+DELEK MR ++
Sbjct: 3 SIMVLVTVLIILFTGFRISQATSRTVIFREQSMVDKHEQWMARFSREYRDELEKNMRRDV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
FK+NL++IE NK+GN++YKLG NEF+D TNEEF A++TG + + VS T
Sbjct: 63 FKKNLKFIENFNKKGNKSYKLGVNEFADWTNEEFLAIHTGL-KGLTEVSPSKVVAKTISS 121
Query: 126 Q--NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
Q NV+D V S DWR +GAVT +K QGQCG CWAFSAVAAVEG+ +I G L+ LSEQQ
Sbjct: 122 QTWNVSDMVVESKDWRAEGAVTPVKYQGQCGCCWAFSAVAAVEGVAKIAGGNLVSLSEQQ 181
Query: 183 LVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
L+DC + + C GG+M AF Y+++N+G+A+E DY Y+ +G C + AA IS +
Sbjct: 182 LLDCDREYDRDCDGGIMSDAFNYVVQNRGIASENDYSYQGSDGGCRSNARP--AARISGF 239
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
+ +P +E+ALL+AVS QPVSV +DA+G F Y GV + CG + +H V VG+GT++
Sbjct: 240 QTVPSNNERALLEAVSRQPVSVSMDATGDGFMHYSGGVYDGPCGTSSNHAVTFVGYGTSQ 299
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
+ G KYWL KNSWGETW E GYIRI RD G+CG+A A YPVA
Sbjct: 300 D--GTKYWLAKNSWGETWEEKGYIRIRRDVAWPQGMCGVAQYAFYPVA 345
>gi|400180361|gb|AFP73319.1| cysteine protease [Solanum peruvianum]
gi|400180397|gb|AFP73337.1| cysteine protease [Solanum peruvianum]
gi|400180401|gb|AFP73339.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 158/340 (46%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + F +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFIINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD +GLC I +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDSGDPSGLCDITKMSSYP 341
>gi|255563134|ref|XP_002522571.1| cysteine protease, putative [Ricinus communis]
gi|223538262|gb|EEF39871.1| cysteine protease, putative [Ricinus communis]
Length = 343
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 180/347 (51%), Positives = 223/347 (64%), Gaps = 19/347 (5%)
Query: 8 SFIIPMFVIIILVI--TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
S I VI +L+I T SQ + ++ +I EKHEQWMA+HGRTY D EK R I
Sbjct: 4 SLQITKLVITLLMILGTWVSQAMPRPLLNAEAIAEKHEQWMARHGRTYHDNAEKERRFQI 63
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF-- 123
FK NL+YIE NK N+TYKLG N+FSDL+ EEF Y GY P + ++ TF
Sbjct: 64 FKNNLDYIENFNKAFNKTYKLGLNKFSDLSEEEFVTTYNGYEMPTTLPTANTTVKPTFFS 123
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
Y N +VP SIDWRE G VT +K+QG+CG CWAFSAVAAVEGI G LS QQL
Sbjct: 124 NYYNQDEVPESIDWRENGVVTSVKNQGECGCCWAFSAVAAVEGIA----GNGASLSAQQL 179
Query: 184 VDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
+DC DN GC GG M KAFEYI++N+G+ ++ DYPY + C + VAA I+ YE
Sbjct: 180 LDCVGDNSGCGGGTMIKAFEYIVQNQGIVSDTDYPYEQTQEMC--RSGSNVAARITGYES 237
Query: 244 LPKGDEQALLQAVSNQPVSVCVDA-SGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAE 301
+ + E+AL +AV+ QP+SV +DA SG F Y SGV +A DCG + H V +VG+GT E
Sbjct: 238 VIQ-SEEALKRAVAKQPISVAIDASSGPNFKSYISGVFSAEDCGTHLTHAVTLVGYGTTE 296
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDAGL----CGIATAASYPV 344
+ G KYWL+KNSWGE WGESGY+R+ RD G CGIA ASYP
Sbjct: 297 D--GTKYWLVKNSWGEEWGESGYMRLQRDVGAMEGPCGIAMQASYPT 341
>gi|400180391|gb|AFP73334.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 335 bits (860), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 159/340 (46%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + RS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMSILITLFFVISMFNSQTRARSQPKLSVSERHELWMSRHGRVYKDEVEKGERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + FK +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTEFKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DWRE GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWRESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + F G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFCAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|400180387|gb|AFP73332.1| cysteine protease [Solanum peruvianum]
Length = 344
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 159/340 (46%), Positives = 232/340 (68%), Gaps = 11/340 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + + + VI+ + GRS + S+ E+HE WM++HGR YKDE+EK R IFK+N
Sbjct: 7 LMNILITLFFVISMFNTQTRGRSQPKLSVSERHELWMSRHGRVYKDEVEKVERFMIFKEN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+++IE NK GN +YKLG NEF+D+T++EF A +TG N P +S + K +++
Sbjct: 67 MKFIESVNKAGNLSYKLGMNEFADITSQEFLAKFTGLNIPNSYLSPSPMSSTELKINDLS 126
Query: 130 D--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D +P+++DW E GAVT +K QG+CG CWAFSAV ++EG +I G L+E SEQ+L+DC+
Sbjct: 127 DDDMPSNLDWIESGAVTQVKHQGRCGCCWAFSAVGSLEGAYKIATGNLMEFSEQELLDCT 186
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
T+N+GC+GG M AF++I EN G++ E+DY Y E+ TC +Q EK A IS Y+ +P+G
Sbjct: 187 TNNYGCNGGFMTNAFDFIKENGGISRESDYEYLGEQYTCRSQ-EKTAAVQISSYQVVPEG 245
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LLQAV+ QPVS+ + AS + FY G + C + +H V +G+GT +E G K
Sbjct: 246 -ETSLLQAVTKQPVSIGIAAS-QDLQFYAGGTYDGSCADRINHAVTAIGYGT--DEKGQK 301
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YWL+KNSWG +WGE+G+++I+RD AGLC IA +SYP
Sbjct: 302 YWLLKNSWGTSWGENGFMKIIRDYGNPAGLCDIAKMSSYP 341
>gi|357160572|ref|XP_003578808.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 335 bits (859), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 164/342 (47%), Positives = 230/342 (67%), Gaps = 19/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
+ I+ + C S V++ R +++ S+V +HE WM Q+GR YKD EKA + +FK N E
Sbjct: 8 LLAILGCLCLCGS-VLAARELNDDLSMVARHENWMLQYGRVYKDAAEKAQKFEVFKANAE 66
Query: 72 YIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT-- 129
+I N GN + LG N+F+D+TNEEF+A T N+ +S + P+ F Y+N++
Sbjct: 67 FINSFNA-GNHKFWLGINQFADITNEEFKATKT--NKGF--ISNKVRVPTGFMYENMSFD 121
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
+P +IDWR KGAVT IKDQGQCG CWAFSAVAA+EGI +++ GKL+ LSEQ+LVDC
Sbjct: 122 ALPATIDWRTKGAVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLVSLSEQELVDCDVH 181
Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
++ GC GGLMD AF++II+N GL E++YPY +G C + + AATI YED+P
Sbjct: 182 GEDQGCEGGLMDDAFKFIIKNGGLTQESNYPYDAADGKC--KSGSSSAATIKSYEDVPAN 239
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
+E AL++AV+NQPVSV VD F FY GV+ CG + DHG+A +G+GT + G K
Sbjct: 240 NEGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGTTSD--GTK 297
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
+W++KNSWG +WGE+G++R+ +D G+CG+A SYP A
Sbjct: 298 FWIMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|9502421|gb|AAF88120.1|AC021043_13 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 331
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 173/331 (52%), Positives = 225/331 (67%), Gaps = 15/331 (4%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
SQ S + HEP + E H+QWM + R Y DELEK MR ++FK+NL++IEK NK+G+RTY
Sbjct: 6 SQATSRVTFHEPIVAEHHQQWMTRFSRVYSDELEKQMRFDVFKKNLKFIEKFNKKGDRTY 65
Query: 85 KLGTNEFSDLTNEEFRALYTGYN--RPVPSVSRQSSRPSTFKYQNVTDVP--TSIDWREK 140
KLG NEF+D T EEF A +TG +PS ++ + NV+DV + DWR +
Sbjct: 66 KLGVNEFADWTREEFIATHTGLKGVNGIPSSEFVDEMIPSWNW-NVSDVAGRETKDWRYE 124
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMD 199
GAVT +K QGQCG CWAFS+VAAVEG+T+I L+ LSEQQL+DC + ++GC+GG+M
Sbjct: 125 GAVTPVKYQGQCGCCWAFSSVAAVEGLTKIVGNNLVSLSEQQLLDCDRERDNGCNGGIMS 184
Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ 259
AF YII+N+G+A+EA YPY+ EGTC + +A I ++ +P +E+ALL+AVS Q
Sbjct: 185 DAFSYIIKNRGIASEASYPYQAAEGTC--RYNGKPSAWIRGFQTVPSNNERALLEAVSKQ 242
Query: 260 PVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
PVSV +DA G F Y GV + CG N +H V VG+GT+ E G KYWL KNSWGET
Sbjct: 243 PVSVSIDADGPGFMHYSGGVYDEPYCGTNVNHAVTFVGYGTSPE--GIKYWLAKNSWGET 300
Query: 319 WGESGYIRILRDA----GLCGIATAASYPVA 345
WGE+GYIRI RD G+CG+A A YPVA
Sbjct: 301 WGENGYIRIRRDVAWPQGMCGVAQYAFYPVA 331
>gi|60100207|gb|AAX13273.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 335 bits (858), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 166/320 (51%), Positives = 215/320 (67%), Gaps = 16/320 (5%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLT 95
++ ++HE+WMA+HGR Y D+ EKA RL +F+ N+ +IE N ++ + L N+F+DLT
Sbjct: 35 AMAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLT 94
Query: 96 NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCG 153
N EFRA TG PS SR + P++F+Y NV+ D+P S+DWR KGAV +KDQG CG
Sbjct: 95 NAEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCG 151
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGL 211
CWAFSAVAA+EG ++ GKL+ LSEQQLV C ++ GC GGLMD AF++II+N GL
Sbjct: 152 CCWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGL 211
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
A E+DYPY + C A AATI YED+P DE ALL+AV+NQPVSV +D R
Sbjct: 212 AAESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRH 271
Query: 272 FHFYKSGVLN--ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
F FYK GVL+ A C DH + VG+G A + G KYWL+KNSWG +WGE GY+R+ R
Sbjct: 272 FQFYKGGVLSGAAGCATELDHAITAVGYGVASD--GTKYWLMKNSWGTSWGEDGYVRMER 329
Query: 330 DA----GLCGIATAASYPVA 345
G+CG+A ASYP A
Sbjct: 330 GVADKEGVCGLAMMASYPTA 349
>gi|356543112|ref|XP_003540007.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Glycine
max]
Length = 345
Score = 335 bits (858), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 162/324 (50%), Positives = 216/324 (66%), Gaps = 8/324 (2%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
+ R + E E+HE WMAQ+G+ YKD EK R IFK N+ +IE N G++ + L
Sbjct: 24 IMSRRLFEACTSERHENWMAQYGKVYKDAAEKKKRFQIFKNNVHFIESFNTAGDKPFNLS 83
Query: 88 TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHI 146
N+F+DL +EEF+AL T N+ V SV ++ T FKY VT + ++DWR++GAVT I
Sbjct: 84 INQFADLHDEEFKALLTNGNKKVRSVVGTATETETSFKYNRVTKLLATMDWRKRGAVTPI 143
Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYI 205
KDQ +CGSCWAFSAVAA+EGI QIT KL+ LSEQ+LVDC ++ GC+GG M+ AFE++
Sbjct: 144 KDQRRCGSCWAFSAVAAIEGIHQITTSKLVSLSEQELVDCVKGESEGCNGGYMEDAFEFV 203
Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
+ G+A+E+ YPY+ ++ +C +KE + I YE +P E+AL +AV++QPVSV V
Sbjct: 204 AKKGGIASESYYPYKGKDKSCKVKKETHGVSQIKGYEKVPSNSEKALQKAVAHQPVSVYV 263
Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
+A G AF FY SG+ CG N DH + VVG+G + G KYWL+KNSWG WGE GYI
Sbjct: 264 EAGGNAFQFYSSGIFTGKCGTNTDHAITVVGYG--KSRGGTKYWLVKNSWGAGWGEKGYI 321
Query: 326 RILRD----AGLCGIATAASYPVA 345
R+ RD GLCGIA A YP A
Sbjct: 322 RMKRDIRAKEGLCGIAMNAFYPTA 345
>gi|537437|gb|AAC35211.1| cysteine proteinase [Hemerocallis hybrid cultivar]
Length = 359
Score = 334 bits (857), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 161/314 (51%), Positives = 213/314 (67%), Gaps = 8/314 (2%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ +E+W A H + +D + R N+FK+N+++I + N++ + TYKL N+F D+
Sbjct: 34 EESLWSLYEKWRAHHAVS-RDLDDTDKRFNVFKENVKFIHEFNQKKDATYKLALNKFGDM 92
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
TN+EFR+ Y G R F Y+ D+PTS+DWREKGAVT +KDQGQCGS
Sbjct: 93 TNQEFRSTYAGSKIDHHMTLRGVKDAGEFSYEKFHDLPTSVDWREKGAVTGVKDQGQCGS 152
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATE 214
CWAFS V AVEGI QI +L+ LSEQQLVDC T N GC+GGLMD AF++I N GL++E
Sbjct: 153 CWAFSTVVAVEGINQIKTNELVSLSEQQLVDCDTKNSGCNGGLMDYAFDFIKNNGGLSSE 212
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
YPY E+ +C ++ AV TI Y+D+P+ +E AL++AV+NQPVSV ++ASG AF F
Sbjct: 213 DSYPYLAEQKSCGSEANSAV-VTIDGYQDVPRNNEAALMKAVANQPVSVAIEASGYAFQF 271
Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----D 330
Y GV + CG DHGVA VG+G +++G KYW++KNSWGE WGESGYIR+ R
Sbjct: 272 YSQGVFSGHCGTELDHGVAAVGYGV--DDDGKKYWIVKNSWGEGWGESGYIRMERGIKDK 329
Query: 331 AGLCGIATAASYPV 344
G CGIA ASYP+
Sbjct: 330 RGKCGIAMEASYPI 343
>gi|414587996|tpg|DAA38567.1| TPA: hypothetical protein ZEAMMB73_390779 [Zea mays]
Length = 343
Score = 334 bits (857), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 163/345 (47%), Positives = 230/345 (66%), Gaps = 23/345 (6%)
Query: 14 FVIIILVIT-CA----SQVVSGRSMHE-PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
F++++ ++T CA S V++ R + + ++ E+HE+WMA +GR YKD EKA R +FK
Sbjct: 7 FLLLLAILTGCACSFPSPVLAARELSDDAAMAERHERWMAVYGRVYKDAAEKARRFEVFK 66
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL ++E N + + LG N+F+DLT EEF+A N+ +S + + FKY+N
Sbjct: 67 DNLAFVESFNADKKNKFWLGVNQFADLTTEEFKA-----NKGFKPISAEEVPTTGFKYEN 121
Query: 128 --VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
V+ +PT++DWR KGAVT IK+QGQCG CWAFSAVAA+EGI +++ L+ LSEQ+LVD
Sbjct: 122 LSVSALPTAVDWRTKGAVTPIKNQGQCGCCWAFSAVAAMEGIVKLSTDNLVSLSEQELVD 181
Query: 186 CSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
C T + GC GG MD AFE++I+N GLATE+ YPY+ +G C + AATI +ED
Sbjct: 182 CDTHSMDEGCEGGWMDSAFEFVIKNGGLATESSYPYKAVDGKCKGGSKS--AATIKGHED 239
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P +E AL++AV++QPVSV VDAS R F Y GV+ CG DHG+A +G+G E
Sbjct: 240 VPPNNEAALMKAVASQPVSVAVDASDRTFMLYSGGVMTGSCGTQLDHGIAAIGYGV--ES 297
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
+G KYW++KNSWG TWGE ++R+ +D G+CG+A SYP
Sbjct: 298 DGTKYWILKNSWGTTWGEKRFLRMEKDISDKQGMCGLAMKPSYPT 342
>gi|357130141|ref|XP_003566711.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 457
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 169/315 (53%), Positives = 219/315 (69%), Gaps = 14/315 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I+E E+W+A+H + Y EK R +FK NL++I+K N+E +Y LG NEF+DLT+E
Sbjct: 146 IIELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKVNREVT-SYWLGLNEFADLTHE 204
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSC 155
EF+A Y G P P+ + SR S FKY++V+ D+P S+DWR KGAVT +K+QGQCGSC
Sbjct: 205 EFKATYLGLAPPAPA---RESRGS-FKYEDVSADDLPKSVDWRTKGAVTEVKNQGQCGSC 260
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
WAFS VAAVEGI I G L LSEQ+L+DCS D N+GC+GGLMD AF YI + GL TE
Sbjct: 261 WAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNNGCNGGLMDYAFSYIASSGGLHTE 320
Query: 215 ADYPYRHEEGTC-DNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
YPY EEG+C D +K ++ A TIS YED+P +EQAL++A+++QPVSV ++ASGR F
Sbjct: 321 EAYPYLMEEGSCGDGKKSESEAVTISGYEDVPAHNEQALIKALAHQPVSVAIEASGRHFQ 380
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
FY GV + CG DHGVA VG+G+ ++ G Y +++NSWG WGE GYIR+ R
Sbjct: 381 FYSGGVFDGPCGTQLDHGVAAVGYGS-DKGKGHDYIIVRNSWGAKWGEKGYIRMKRGTGK 439
Query: 332 --GLCGIATAASYPV 344
GLCGI ASYP
Sbjct: 440 GEGLCGINKMASYPT 454
>gi|357452869|ref|XP_003596711.1| Cysteine proteinase [Medicago truncatula]
gi|355485759|gb|AES66962.1| Cysteine proteinase [Medicago truncatula]
Length = 344
Score = 333 bits (855), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 165/344 (47%), Positives = 227/344 (65%), Gaps = 19/344 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
I+ +F I+ L S V+S R ++EKHEQWM +HG+ YKD EK R IFK+N
Sbjct: 12 ILTLFFILTL---WTSLVISSR------LLEKHEQWMEEHGKFYKDAAEKEQRFQIFKEN 62
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALY-TGYNRPVPSVSRQS-SRPSTFKYQN 127
LE+IE N G+ + L N+F D TN+EF+A Y G +P+ V + S F+Y+N
Sbjct: 63 LEFIESFNAAGDNGFNLSINQFGDQTNDEFKANYLNGKKKPLIGVGIAAIEEESVFRYEN 122
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
VT+VP ++DWRE+GAVT IK Q CGSCWAF+ VAA+EGI QIT G+L+ LSEQ+LVDC
Sbjct: 123 VTEVPATMDWRERGAVTPIKHQHLCGSCWAFATVAAIEGIHQITTGRLVSLSEQELVDCV 182
Query: 188 TDN--HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N GC+GG ++ A ++I++ G+ +E +YPY +G C+ +K A I YE +P
Sbjct: 183 KTNTTDGCNGGYVEDACDFIVKKGGITSETNYPYTRVDGKCNVRKGTYNVAKIKGYEHVP 242
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+E+ALL+AV+NQP++V + A+ RAF FY SG+L CG + DH V +VG+GT+++ G
Sbjct: 243 ANNEKALLKAVANQPIAVYIAATKRAFQFYSSGILKGKCGIDLDHTVTIVGYGTSDD--G 300
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
KYWL+KNSWG WGE GYI+I RD G CGIA +YP+
Sbjct: 301 VKYWLVKNSWGTKWGEKGYIKIKRDVHAKEGSCGIAMVPTYPIV 344
>gi|357160569|ref|XP_003578807.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 333 bits (855), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 165/341 (48%), Positives = 226/341 (66%), Gaps = 19/341 (5%)
Query: 15 VIIILVITC-ASQVVSGRSMHEP-SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
++ IL C S V++ R +++ S+V +HE WM Q+GR YKD EKA + +FK N +
Sbjct: 8 LLAILGCLCFCSSVLAARELNDDLSMVARHESWMLQYGRVYKDAAEKASKFEVFKANAGF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
I+ N GN + LG N+F+D+TN+EF+A T N+ +S + P+ F Y+NV+
Sbjct: 68 IDSFNA-GNHKFWLGINQFADITNKEFKATKT--NKGF--ISNKVRAPTGFSYENVSFDA 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+P SIDWR KGAVT +KDQGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC
Sbjct: 123 LPASIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHG 182
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
++ GC GGLMD AF++II N GL E+ YPY E+G C + + A TI YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIISNGGLTQESSYPYDAEDGKCKSGSKS--AGTIKSYEDVPANN 240
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VD F FY GV+ CG + DHG+A +G+G + G KY
Sbjct: 241 EGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSD--GTKY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
WL+KNSWG +WGE+G++R+ +D G+CG+A SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|224065647|ref|XP_002301901.1| predicted protein [Populus trichocarpa]
gi|222843627|gb|EEE81174.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 333 bits (854), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 166/312 (53%), Positives = 217/312 (69%), Gaps = 12/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I++ E W+++HG+ Y+ EK +R IFK NL +I++ NK+ Y LG NEFSDL++E
Sbjct: 29 IIDLFESWISKHGKIYESIEEKWLRFEIFKDNLFHIDETNKK-VVNYWLGLNEFSDLSHE 87
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G + S R+ S+ F Y++V +P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 88 EFKNKYLGLKVDM-SERRECSQE--FNYKDVMSIPKSVDWRKKGAVTDVKNQGSCGSCWA 144
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+LVDC +T+N+GC+GGLMD AF YII N GL E D
Sbjct: 145 FSTVAAVEGINQIVTGNLTSLSEQELVDCDTTNNYGCNGGLMDYAFSYIISNGGLHKEVD 204
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTC+ +KE++ TIS Y D+P+ E++LL+A++NQP+SV ++ASGR F FY
Sbjct: 205 YPYIMEEGTCEMRKEESEVVTISGYHDVPQNSEESLLKALANQPLSVAIEASGRDFQFYS 264
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
GV + CG DHGVA VG+G+ NG Y ++KNSWG WGE GYIR+ R+ AG
Sbjct: 265 GGVFDGHCGTQLDHGVAAVGYGST---NGLDYIIVKNSWGSKWGEKGYIRMKRNTGKPAG 321
Query: 333 LCGIATAASYPV 344
LCGI ASYP
Sbjct: 322 LCGINKMASYPT 333
>gi|38346007|emb|CAD40110.2| OSJNBa0035O13.9 [Oryza sativa Japonica Group]
gi|125589429|gb|EAZ29779.1| hypothetical protein OsJ_13837 [Oryza sativa Japonica Group]
Length = 314
Score = 333 bits (853), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 166/319 (52%), Positives = 214/319 (67%), Gaps = 16/319 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTN 96
+ ++HE+WMA+HGR Y D+ EKA RL +F+ N+ +IE N ++ + L N+F+DLTN
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKARRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 97 EEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGS 154
EFRA TG PS SR + P++F+Y NV+ D+P S+DWR KGAV +KDQG CG
Sbjct: 61 AEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGC 117
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAA+EG ++ GKL+ LSEQQLV C ++ GC GGLMD AF++II+N GLA
Sbjct: 118 CWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLA 177
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
E+DYPY + C A AATI YED+P DE ALL+AV+NQPVSV +D R F
Sbjct: 178 AESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHF 237
Query: 273 HFYKSGVLN--ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
FYK GVL+ A C DH + VG+G A + G KYWL+KNSWG +WGE GY+R+ R
Sbjct: 238 QFYKGGVLSGAAGCATELDHAITAVGYGVASD--GTKYWLMKNSWGTSWGEDGYVRMERG 295
Query: 331 A----GLCGIATAASYPVA 345
G+CG+A ASYP A
Sbjct: 296 VADKEGVCGLAMMASYPTA 314
>gi|374713651|gb|AEZ65083.1| cysteine protease [Carica papaya]
Length = 467
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 169/351 (48%), Positives = 224/351 (63%), Gaps = 21/351 (5%)
Query: 11 IPMFVIIILVITCA--SQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
+ +F+++I + A +VS H + ++ +E W+ +HG+ Y EK
Sbjct: 8 LSLFLLMIFTASSAVDMSIVSYDQRHADKSSWRTDDEVMAMYEAWLVKHGKAYNALGEKE 67
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK NL +I++ N + N TY+LG N F+DLTNEE+R++Y G V+R+ SR
Sbjct: 68 KRFGIFKDNLRFIDEHNSQ-NLTYRLGLNRFADLTNEEYRSMYLGVKPGATRVTRKVSRK 126
Query: 121 STFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
S V D +P IDWR++GAV +KDQG CGSCWAFS +AAVEGI QI G LI LS
Sbjct: 127 SDRFAARVGDALPDFIDWRKEGAVVGVKDQGSCGSCWAFSTIAAVEGINQIVTGDLISLS 186
Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
EQ+LVDC T N GC+GGLMD AFE+II N G+ +E DYPYR + CD ++ A +I
Sbjct: 187 EQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRAADQKCDQYRKNANVVSI 246
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
YED+P+ DE AL +AV+ QPVSV ++A GRAF Y+SGV CG + DHGVA VG+G
Sbjct: 247 DGYEDVPENDEAALKKAVAKQPVSVAIEAGGRAFQLYQSGVFTGKCGTSLDHGVAAVGYG 306
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
T ENG YW++ NSWG+ WGE GYIR+ R+ +G CGIA SYP+
Sbjct: 307 T---ENGQDYWIVGNSWGKNWGEDGYIRMERNLAGSSSGKCGIAIGPSYPI 354
>gi|146215996|gb|ABQ10200.1| cysteine protease Cp2 [Actinidia deliciosa]
Length = 376
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 160/310 (51%), Positives = 216/310 (69%), Gaps = 11/310 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+ +W+A+HG+ Y E+ R IFK NL+++++ N E NR+YK+G N F+DLTNEE+R+
Sbjct: 47 YAEWLAKHGKAYNGIGERERRFEIFKDNLKFVDEHNSE-NRSYKVGLNRFADLTNEEYRS 105
Query: 102 LYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
++ G +S S + Q+ +P S+DWRE GAV IKDQG CGSCWAFS
Sbjct: 106 MFLGTKTDSKRRFMKSKSASRRYAVQDSDMLPESVDWRESGAVAPIKDQGSCGSCWAFST 165
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPY 219
VAAVEG+ QI G++I+LSEQ+LVDC T + GC+GGLMD AFE+II N G+ TE DYPY
Sbjct: 166 VAAVEGVNQIATGEMIQLSEQELVDCDRTYDAGCNGGLMDYAFEFIINNGGIDTEEDYPY 225
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
R +GTCD +++ +I+ YED+P DE AL +AV++QPVSV ++ASGRAF Y SGV
Sbjct: 226 RGVDGTCDPERKNTKVVSINDYEDVPPYDEMALKKAVAHQPVSVAIEASGRAFQLYLSGV 285
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLC 334
+CG DHGV VVG+GT +NGA +W+++NSWG +WGE+GYIR+ R+ G C
Sbjct: 286 FTGECGRALDHGVVVVGYGT---DNGADHWIVRNSWGTSWGENGYIRMERNVVDNFGGKC 342
Query: 335 GIATAASYPV 344
GIA ASYP+
Sbjct: 343 GIAMQASYPI 352
>gi|449448298|ref|XP_004141903.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
gi|449531757|ref|XP_004172852.1| PREDICTED: germination-specific cysteine protease 1-like [Cucumis
sativus]
Length = 365
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 163/340 (47%), Positives = 226/340 (66%), Gaps = 15/340 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ ++ ++ ++ +S RS E + E ++ W+A+HG+ Y E+ R IFK+NL++
Sbjct: 8 LALLSFFFLSISASALSRRSDGE--VREIYDLWLAKHGKAYNGIDEREKRFQIFKENLKF 65
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY--QNVTD 130
I+ N E NRTYK+G N F+DLTNEE+RALY G P P+ ++ ++ +Y N+
Sbjct: 66 IDDHNSE-NRTYKVGLNMFADLTNEEYRALYLGTRSP-PARRVMKAKTASRRYAVNNLDR 123
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P S+DWR +GAV +K+QG CGSCWAFS +AAVEGI QI G+LI LSEQ+LV C
Sbjct: 124 LPESMDWRTRGAVAPVKNQGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVSCDKKY 183
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N GC+GGLMD AF++II+N GL TE DYPY +G CD ++ A +I YED+P DE
Sbjct: 184 NSGCNGGLMDYAFQFIIDNGGLDTEEDYPYEAFDGQCDPTRKNAKVVSIDAYEDVPANDE 243
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
++L +AV++QPVSV ++ASG A Y+SGV CG+ DHGV VG+G +ENG YW
Sbjct: 244 ESLKKAVAHQPVSVAIEASGLALQLYQSGVFTGKCGSALDHGVVAVGYG---KENGVDYW 300
Query: 310 LIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
L++NSWG +WGE GY ++ R+ G CGIA ASYPV
Sbjct: 301 LVRNSWGTSWGEDGYFKLERNVKHITEGKCGIAMQASYPV 340
>gi|224133760|ref|XP_002321654.1| predicted protein [Populus trichocarpa]
gi|222868650|gb|EEF05781.1| predicted protein [Populus trichocarpa]
Length = 362
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 167/349 (47%), Positives = 229/349 (65%), Gaps = 12/349 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAM 61
++ +K + + + ++L IT + E S+ + +E+W + H T L EK
Sbjct: 1 MEMKKFLFVALSLALVLGITESLDFHEKDLESEESLWDLYERWRSHH--TVSTSLDEKHK 58
Query: 62 RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
R N+FK+N+ ++ K NK G + YKL N+F+D+TN EFR++Y G + R ++R +
Sbjct: 59 RFNVFKENVMHVHKTNKMG-KPYKLKLNKFADMTNHEFRSVYAGSKVKHHRMFRGTTRGN 117
Query: 122 -TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+F Y V VPTS+DWR+KGAVT +KDQGQCGSCWAFS + AVEGI I +L+ LSE
Sbjct: 118 GSFMYGKVEKVPTSVDWRKKGAVTAVKDQGQCGSCWAFSTIVAVEGINYIKTNELVSLSE 177
Query: 181 QQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
Q+LVDC +T+N GC+GGLM+ AFE+I + +G+ TE+ YPY+ E+G CD KE A +I
Sbjct: 178 QELVDCDTTENQGCNGGLMEYAFEFIKKKRGITTESTYPYKAEDGHCDAAKENNPAVSID 237
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
YE +P+ DE ALL+A +NQPVSV +DA G F FY GV +CG DHGVAVVG+GT
Sbjct: 238 GYEKVPENDEDALLKAAANQPVSVAIDAGGSDFQFYSEGVFIGECGTELDHGVAVVGYGT 297
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
+ G KYW+++NSWG WGE GYIR+ R GLCGIA ASYP+
Sbjct: 298 TLD--GTKYWIVRNSWGPEWGEKGYIRMQRGISDKEGLCGIAMEASYPI 344
>gi|225428879|ref|XP_002285299.1| PREDICTED: cysteine proteinase RD21a-like [Vitis vinifera]
Length = 469
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 160/313 (51%), Positives = 207/313 (66%), Gaps = 10/313 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ +E W+ +HG++Y E+ R IFK NL +IE+ N NRTYK+G N F+DLTNE
Sbjct: 50 VMAVYEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVGLNRFADLTNE 108
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
E+R+ Y G R S + ++ D+P S+DWREKGAV +KDQG CGSCWA
Sbjct: 109 EYRSRYLGRRDETRRGLRASRVSDRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWA 168
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS +AAVEGI QI G LI LSEQ+LVDC N GC+GGLMD AFE+II N G+ +E D
Sbjct: 169 FSTIAAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEED 228
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPYR + TCD ++ A +I YED+P+ DE++L +AV+NQPVSV ++A GRAF Y+
Sbjct: 229 YPYRAADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQ 288
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-----DA 331
SGV CG DHGV VG+GT EN YW+++NSWG WGESGYI++ R +
Sbjct: 289 SGVFTGQCGTQLDHGVVAVGYGT---ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTET 345
Query: 332 GLCGIATAASYPV 344
G CGIA SYP+
Sbjct: 346 GKCGIAIEPSYPI 358
>gi|62526575|gb|AAX84673.1| cysteine protease CP1 [Manihot esculenta]
Length = 467
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 167/351 (47%), Positives = 224/351 (63%), Gaps = 26/351 (7%)
Query: 13 MFVIIILVITCAS----QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
MFV++ L T +S ++S H + ++ +E+W+ + G+ Y E+
Sbjct: 11 MFVLLFLSFTLSSASDMSIISYDQTHATKSSWRTDDEVMAIYEEWLVKQGKVYNALGERE 70
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R +FK NL +I++ N E NRTYKLG N F+DLTNEE+R+ Y G + R R
Sbjct: 71 KRFQVFKDNLRFIDEHNSE-NRTYKLGLNGFADLTNEEYRSTYLG---ARGGMKRNRLRK 126
Query: 121 STFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
++ +Y +P S+DWR++GAV +KDQG CGSCWAFS +AAVEGI +I G LI L
Sbjct: 127 TSDRYAPRVGESLPDSVDWRKEGAVAEVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISL 186
Query: 179 SEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
SEQ+LVDC T N GC+GGLMD AFE+II N G+ TE DYPY +G CD ++ A T
Sbjct: 187 SEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYLARDGRCDTYRKNAKVVT 246
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I YED+P E AL +AV+NQPVSV ++A GR F FY SG+ + CG DHGVA VG+
Sbjct: 247 IDDYEDVPVNSETALQKAVANQPVSVAIEAGGRDFQFYASGIFSGRCGTQLDHGVAAVGY 306
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
GT ENG YW+++NSWG++WGE+GY+R+ R G+CGIA ASYP+
Sbjct: 307 GT---ENGKDYWIVRNSWGKSWGENGYLRMARSINSPTGICGIAMEASYPI 354
>gi|359491865|ref|XP_002273243.2| PREDICTED: xylem cysteine proteinase 1-like [Vitis vinifera]
Length = 351
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 157/312 (50%), Positives = 212/312 (67%), Gaps = 12/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+ + E WM++HG++Y+ EK R +F+ NL++I++ NK+ + +Y LG NEF+DL++E
Sbjct: 44 LTDLFESWMSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFADLSHE 102
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G +P ++ P F Y++V D+P S+DWR+KGAV H+K+QG CGSCWA
Sbjct: 103 EFKRKYLGLKIELP---KRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWA 159
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC N+GC+GGLMD AF +II N GL E D
Sbjct: 160 FSTVAAVEGINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEED 219
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTC +KE+ TIS Y D+P+ +EQ+ L+A++NQP+SV ++AS R F FY
Sbjct: 220 YPYVMEEGTCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYS 279
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
G+ N CG DHGVA VG+GT++ G Y +KNSWG WGE GYIR+ R+ G
Sbjct: 280 GGIFNGHCGTELDHGVAAVGYGTSK---GVDYITVKNSWGSKWGEKGYIRMKRNVGKPEG 336
Query: 333 LCGIATAASYPV 344
+CGI ASYP
Sbjct: 337 ICGIYKMASYPT 348
>gi|224103643|ref|XP_002313136.1| predicted protein [Populus trichocarpa]
gi|222849544|gb|EEE87091.1| predicted protein [Populus trichocarpa]
Length = 477
Score = 332 bits (850), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 158/316 (50%), Positives = 209/316 (66%), Gaps = 9/316 (2%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E + +E W+ ++G+ Y EK R IFK NL+++++ N GN +YKLG N+F+DL
Sbjct: 42 EAETLRLYEMWLVKYGKAYNALGEKERRFEIFKDNLKFVDQHNSVGNPSYKLGLNKFADL 101
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
+NEE+RA Y G + + + +++ D+P S+DWREKGAV +KDQGQCGS
Sbjct: 102 SNEEYRAAYLGTRMDGKRRLLGGPKSARYLFKDGDDLPESVDWREKGAVAPVKDQGQCGS 161
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLAT 213
CWAFS V AVEGI QI G L LSEQ+LVDC N GC+GGLMD AFE+I++N G+ T
Sbjct: 162 CWAFSTVGAVEGINQIVTGNLTSLSEQELVDCDKVYNQGCNGGLMDYAFEFIMKNGGIDT 221
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
E DYPY+ + CD ++ A TI YED+P+ DE++L +AV+NQPVSV ++A GRAF
Sbjct: 222 EEDYPYKAVDSMCDPNRKNARVVTIDGYEDVPQNDEKSLRKAVANQPVSVAIEAGGRAFQ 281
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
Y+SGV CG DHGV VG+GT ENG YW+++NSWG WGE+GYIR+ R+
Sbjct: 282 LYQSGVFTGSCGTQLDHGVVAVGYGT---ENGVDYWVVRNSWGPAWGENGYIRMERNVAS 338
Query: 331 --AGLCGIATAASYPV 344
G CGIA ASYP
Sbjct: 339 TETGKCGIAMEASYPT 354
>gi|226508570|ref|NP_001141984.1| uncharacterized protein LOC100274134 precursor [Zea mays]
gi|194706676|gb|ACF87422.1| unknown [Zea mays]
gi|413920745|gb|AFW60677.1| vignain [Zea mays]
Length = 363
Score = 331 bits (849), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 158/317 (49%), Positives = 218/317 (68%), Gaps = 13/317 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ ++++WMAQ+ R YKD+ EKA R +FK N E+I+++N G + Y LGTN+F+DL
Sbjct: 52 EAMMMARYKKWMAQYRRKYKDDAEKAHRFQVFKANAEFIDRSNAGGKKKYVLGTNQFADL 111
Query: 95 TNEEFRALYTGYNRP--VPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQG 150
T++EF A+YTG +P VPS ++Q + KYQN T D +DWR++GAVT +K+QG
Sbjct: 112 TSKEFAAMYTGLRKPAAVPSGAKQIPAAGS-KYQNFTRLDDDVQVDWRQQGAVTPVKNQG 170
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIEN 208
QCG CWAFSAV A+EG+ IT G L+ LSEQQ++DC S N GC+GG MD AF+Y+I N
Sbjct: 171 QCGCCWAFSAVGAMEGLIMITTGNLVSLSEQQILDCDESDGNQGCNGGYMDNAFQYVINN 230
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
G+ TE YPY +GTC N + AATIS ++DLP GDE AL AV+NQPVSV VD
Sbjct: 231 GGVTTEDAYPYSAVQGTCQNVQP---AATISGFQDLPSGDENALANAVANQPVSVGVDGG 287
Query: 269 GRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
F FY+ G+ + D CG + +H V +G+G ++ G +YW++KNSWG WGE+G++++
Sbjct: 288 SSPFQFYQGGIYDGDGCGTDMNHAVTAIGYGA--DDQGTQYWILKNSWGTGWGENGFMQL 345
Query: 328 LRDAGLCGIATAASYPV 344
G CGI+T ASYP
Sbjct: 346 QMGVGACGISTMASYPT 362
>gi|449500145|ref|XP_004161017.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 170/345 (49%), Positives = 222/345 (64%), Gaps = 16/345 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSI---VEKHEQWMAQHGRTYKDELEKAMRLN 64
+ I+ + I I +V H S+ +E E WM++H +TY+ EK R
Sbjct: 10 TLILSATLFITYAIAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKTYRSIEEKLHRFE 69
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
IF NL++I++ NK+ + +Y LG NEF+DL++EEF++ Y G P ++SSR F
Sbjct: 70 IFLDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPR--KRSSR--GFS 124
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
Y +V D+P S+DWR KGAVT +K+QG CGSCWAFS VAAVEGI QI G L LSEQ+L+
Sbjct: 125 YGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELI 184
Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC N+GC GGLMD AF+YI+ N GL E DYPY EEG C +KE+ TIS YED
Sbjct: 185 DCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVTISGYED 244
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P DEQ+LL+A+S+QPVSV ++AS R F FYK G+ CG DHGV VG+G++E
Sbjct: 245 VPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGYGSSE-- 302
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G Y ++KNSWG WGE+GYIR+ R+ GLCGI ASYP
Sbjct: 303 -GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYPT 346
>gi|125547258|gb|EAY93080.1| hypothetical protein OsI_14881 [Oryza sativa Indica Group]
Length = 314
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 165/319 (51%), Positives = 213/319 (66%), Gaps = 16/319 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTN 96
+ ++HE+WMA+HGR Y D+ EK RL +F+ N+ +IE N ++ + L N+F+DLTN
Sbjct: 1 MAQRHERWMAKHGRAYADDAEKVRRLEVFRDNVAFIESVNAAASQHKFWLEENQFADLTN 60
Query: 97 EEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGS 154
EFRA TG PS SR + P++F+Y NV+ D+P S+DWR KGAV +KDQG CG
Sbjct: 61 AEFRATRTGLR---PSSSRGNRAPTSFRYANVSTGDLPASVDWRGKGAVNPVKDQGDCGC 117
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAA+EG ++ GKL+ LSEQQLV C ++ GC GGLMD AF++II+N GLA
Sbjct: 118 CWAFSAVAAMEGAVKLATGKLVSLSEQQLVSCDVKGEDQGCEGGLMDDAFDFIIKNGGLA 177
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
E+DYPY + C A AATI YED+P DE ALL+AV+NQPVSV +D R F
Sbjct: 178 AESDYPYTASDDKCATAGAGAAAATIKGYEDVPANDEAALLKAVANQPVSVAIDGGDRHF 237
Query: 273 HFYKSGVLN--ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
FYK GVL+ A C DH + VG+G A + G KYWL+KNSWG +WGE GY+R+ R
Sbjct: 238 QFYKGGVLSGAAGCATELDHAITAVGYGVASD--GTKYWLMKNSWGTSWGEDGYVRMERG 295
Query: 331 A----GLCGIATAASYPVA 345
G+CG+A ASYP A
Sbjct: 296 VADKEGVCGLAMMASYPTA 314
>gi|224056176|ref|XP_002298740.1| predicted protein [Populus trichocarpa]
gi|222845998|gb|EEE83545.1| predicted protein [Populus trichocarpa]
Length = 455
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 158/309 (51%), Positives = 206/309 (66%), Gaps = 9/309 (2%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E W+ +HGR Y EK R IFK NL++I++ N GN +YKLG N+F+DL+N+E+R+
Sbjct: 25 YEMWLVKHGRAYNALGEKERRFEIFKDNLKFIDEHNSVGNPSYKLGLNKFADLSNDEYRS 84
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
+Y G + + ++ D+P ++DWREKGAV +KDQGQCGSCWAFS V
Sbjct: 85 VYLGTRMDGKGRLLGGPKSERYLFKEGDDLPETVDWREKGAVAPVKDQGQCGSCWAFSTV 144
Query: 162 AAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
AVEGI QI G L LSEQ+LVDC T N GC+GGLMD AF++IIEN G+ TE DYPY+
Sbjct: 145 GAVEGINQIVTGNLTSLSEQELVDCDKTYNLGCNGGLMDYAFDFIIENGGIDTEEDYPYK 204
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
+ CD ++ A TI YED+P+ DE++L +AV+NQPVSV ++A GR F Y+SGV
Sbjct: 205 AIDSMCDPNRKNARVVTIDGYEDVPQNDEKSLKKAVANQPVSVAIEAGGRGFQLYQSGVF 264
Query: 281 NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCG 335
CG DHGV VG+GT E+G YW+++NSWG WGE+GYIR+ RD G CG
Sbjct: 265 TGSCGTQLDHGVVTVGYGT---EHGVDYWIVRNSWGPAWGENGYIRMERDVASTETGKCG 321
Query: 336 IATAASYPV 344
IA ASYP
Sbjct: 322 IAMEASYPT 330
>gi|413917937|gb|AFW57869.1| hypothetical protein ZEAMMB73_830006 [Zea mays]
Length = 443
Score = 331 bits (848), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 163/341 (47%), Positives = 225/341 (65%), Gaps = 19/341 (5%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCA-SQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDEL 57
M + +F++ + ++ CA S ++ R + + ++V +HE+WMA++ R Y D
Sbjct: 1 MATHYSSAFVL----LSVVAWACALSGSLAARDLADQDQAMVARHEEWMAKYDRVYSDAA 56
Query: 58 EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS 117
EKA R +FK N+ IE N GN + L N F+DLT++EFRA +TGY RP + +
Sbjct: 57 EKARRFEVFKANMALIESVNA-GNHKFWLEANRFADLTDDEFRATWTGY-RPKTAAASSK 114
Query: 118 SRPST----FKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQIT 171
R T FKY NV+ DVP S+DWR KGAVT IK+QG+CG CWAFSAVA++EG+ +++
Sbjct: 115 GRSRTATTGFKYANVSLDDVPASVDWRTKGAVTPIKNQGECGCCWAFSAVASMEGVVKLS 174
Query: 172 RGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ 229
GKL+ LSEQ+LVDC + + GC GG MD AF++I+ N GL TE+ YPY +GTC++
Sbjct: 175 TGKLVSLSEQELVDCDVNGMDQGCEGGEMDDAFDFIVGNGGLTTESRYPYTASDGTCNSN 234
Query: 230 KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCD 289
+ AA+I YED+P DE +L +AV+NQPVSV VD F FYK GVL+ CG D
Sbjct: 235 EASGDAASIKGYEDVPANDEASLRKAVANQPVSVAVDGGDSHFRFYKGGVLSGACGTELD 294
Query: 290 HGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
HG+A VG+G A + G KYW++KNSWG +WGE+GYIR+ RD
Sbjct: 295 HGIAAVGYGVASD--GTKYWVMKNSWGTSWGEAGYIRMERD 333
>gi|18423124|ref|NP_568722.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75309064|sp|Q9FGR9.1|CEP1_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP1; AltName:
Full=Cysteine proteinase CP56; Short=AtCP56; Flags:
Precursor
gi|9759028|dbj|BAB09397.1| cysteine endopeptidase [Arabidopsis thaliana]
gi|20258850|gb|AAM13907.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|308097832|gb|ADO14465.1| papain [Arabidopsis thaliana]
gi|332008536|gb|AED95919.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 331 bits (848), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 168/346 (48%), Positives = 221/346 (63%), Gaps = 16/346 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
+ F+++ L + + G H E S+ E +E+W + H E EKA R N
Sbjct: 1 MKRFIVLALCMLMVLETTKGLDFHNKDVESENSLWELYERWRSHHTVARSLE-EKAKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TF 123
+FK N+++I + NK+ +++YKL N+F D+T+EEFR Y G N + + + + +F
Sbjct: 60 VFKHNVKHIHETNKK-DKSYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGEKKATKSF 118
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
Y NV +PTS+DWR+ GAVT +K+QGQCGSCWAFS V AVEGI QI KL LSEQ+L
Sbjct: 119 MYANVNTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQEL 178
Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
VDC T+ N GC+GGLMD AFE+I E GL +E YPY+ + TCD KE A +I +E
Sbjct: 179 VDCDTNQNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGHE 238
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
D+PK E L++AV+NQPVSV +DA G F FY GV CG +HGVAVVG+GT +
Sbjct: 239 DVPKNSEDDLMKAVANQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTID 298
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
G KYW++KNSWGE WGE GYIR+ R GLCGIA ASYP+
Sbjct: 299 --GTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|356545116|ref|XP_003540991.1| PREDICTED: vignain-like [Glycine max]
Length = 342
Score = 330 bits (847), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 161/336 (47%), Positives = 218/336 (64%), Gaps = 9/336 (2%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+++ LV+T + V R + E KHE+WMAQ+G+ YKD EK R IFK N+ +IE
Sbjct: 11 LVVFLVLTVWTSQVMSRRLSEAYSSVKHEKWMAQYGKVYKDAAEKEKRFQIFKNNVHFIE 70
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
+ G++ + L N+F+DL +F+AL + +V ++ ++FKY +VT +P+S
Sbjct: 71 SFHAAGDKPFNLSINQFADL--HKFKALLINGQKKEHNVRTATATEASFKYDSVTRIPSS 128
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGC 193
+DWR++GAVT IKDQG C SCWAFS VA +EG+ QIT+G+L+ LSEQ+LVDC D+ GC
Sbjct: 129 LDWRKRGAVTPIKDQGTCRSCWAFSTVATIEGLHQITKGELVSLSEQELVDCVKGDSEGC 188
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
GG ++ AFE+I + G+A+E YPY+ TC +KE I YE +P E+ALL
Sbjct: 189 YGGYVEDAFEFIAKKGGVASETHYPYKGVNKTCKVKKETHGVVQIKGYEQVPSNSEKALL 248
Query: 254 QAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKN 313
+AV++QPVS V+A G AF FY SG+ CG + DH V VVG+G A G KYWL+KN
Sbjct: 249 KAVAHQPVSAYVEAGGYAFQFYSSGIFTGKCGTDIDHSVTVVGYGKA--RGGNKYWLVKN 306
Query: 314 SWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
SWG WGE GYIR+ RD GLCGIAT A YP A
Sbjct: 307 SWGTEWGEKGYIRMKRDIRAKEGLCGIATGALYPTA 342
>gi|255539310|ref|XP_002510720.1| cysteine protease, putative [Ricinus communis]
gi|223551421|gb|EEF52907.1| cysteine protease, putative [Ricinus communis]
Length = 349
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 162/312 (51%), Positives = 214/312 (68%), Gaps = 13/312 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+++ E W+++ GR Y+ EK R IFK NL +I+ NK+ R Y LG NEF+DL++E
Sbjct: 43 LIDLFESWISRFGRVYESAEEKLERFEIFKDNLFHIDDTNKK-VRNYWLGLNEFADLSHE 101
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G P +S+++ P F Y++V +P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 102 EFKNKYLGLK---PDLSKRAQCPEEFTYKDVA-IPKSVDWRKKGAVTPVKNQGSCGSCWA 157
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF YI+ N GL E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFAYIVANGGLHKEED 217
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTCD +KE++ A TIS Y D+P+ E++LL+A++NQP+S+ ++ASGR F FY
Sbjct: 218 YPYIMEEGTCDMRKEESDAVTISGYHDVPQNSEESLLKALANQPLSIAIEASGRDFQFYS 277
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV + CG DHGVA VG+GT++ G Y ++KNSWG WGE GYIR+ R G
Sbjct: 278 GGVFDGHCGTELDHGVAAVGYGTSK---GLDYIIVKNSWGPKWGEKGYIRMKRKTSKPEG 334
Query: 333 LCGIATAASYPV 344
+CGI ASYP
Sbjct: 335 ICGIYKMASYPT 346
>gi|255547982|ref|XP_002515048.1| cysteine protease, putative [Ricinus communis]
gi|223546099|gb|EEF47602.1| cysteine protease, putative [Ricinus communis]
Length = 359
Score = 330 bits (847), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 157/314 (50%), Positives = 217/314 (69%), Gaps = 8/314 (2%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ +E+W + H + EK R N+FK+NL++I K N++ +R YKL N+F+D+
Sbjct: 33 EESLWNLYERWRSHH-TVSRSLTEKNQRFNVFKENLKHIHKVNQK-DRPYKLRLNKFADM 90
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
TN EF Y G + S R + F ++N +++P+SIDWR++GAVT +KDQG+CGS
Sbjct: 91 TNHEFLQHYGGSKVSHYRMFHGSRRQTGFAHENTSNLPSSIDWRKQGAVTGVKDQGKCGS 150
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATE 214
CWAFS+VAAVEGI +I G+LI LSEQ+LVDC++ NHGC GGLM++AF +I + GL TE
Sbjct: 151 CWAFSSVAAVEGINKIKTGELISLSEQELVDCNSVNHGCDGGLMEQAFSFIEKTGGLTTE 210
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
+YPYR ++G CD+ K TI YE +P+ DE AL+QAV+NQPVS+ +DA G+ F F
Sbjct: 211 NNYPYRAKDGYCDSAKMNTPMVTIDGYEMVPENDEHALMQAVANQPVSIAIDAGGQDFQF 270
Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----D 330
Y GV DCG +HGVA+VG+G ++ G KYW++KNSWG WGE+G+IR+ R +
Sbjct: 271 YSEGVYTGDCGTELNHGVALVGYGATQD--GTKYWIVKNSWGSEWGENGFIRMQRENDVE 328
Query: 331 AGLCGIATAASYPV 344
GLCGI ASYP+
Sbjct: 329 EGLCGITLEASYPI 342
>gi|356508490|ref|XP_003522989.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 162/312 (51%), Positives = 216/312 (69%), Gaps = 13/312 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R IFK NL++I++ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G SR+ P F Y++V ++P S+DWR+KGAV +K+QG CGSCWA
Sbjct: 102 EFKNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF +I+EN GL E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTC+ KE+ TIS Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 277
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV + CG++ DHGVA VG+GTA+ G Y ++KNSWG WGE GYIR+ R+ G
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYIIVKNSWGSKWGEKGYIRMRRNIGKPEG 334
Query: 333 LCGIATAASYPV 344
+CGI ASYP
Sbjct: 335 ICGIYKMASYPT 346
>gi|2414570|emb|CAB16317.1| cysteine proteinase precursor [Nicotiana tabacum]
Length = 374
Score = 330 bits (845), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 162/318 (50%), Positives = 212/318 (66%), Gaps = 13/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E + ++E W+A+HGR Y EK R IFK NL +IE N GNRTYK+G N+F+DL
Sbjct: 43 EDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEGHNNSGNRTYKVGLNQFADL 102
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQC 152
TNEE+R +Y G +S PS +Y + + +P S+DWR++GAV IK+QG C
Sbjct: 103 TNEEYRTMYLGTKSDARRRFVKSKNPSQ-RYASRPNELMPHSVDWRKRGAVAPIKNQGSC 161
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS VAAVEGI QI G++I LSEQ+LVDC N GC+GGLMD AFE+II N G+
Sbjct: 162 GSCWAFSTVAAVEGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGM 221
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE YPYR EG CD ++ +I YED+P+ +E+AL +AV++QPV V ++ASGRA
Sbjct: 222 DTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEASGRA 280
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
F Y SGV +CG DHGV VVG+G+ E+G YW+++NSWG WGE+GY+++ R+
Sbjct: 281 FQLYSSGVFTGECGEEVDHGVVVVGYGS---EDGVDYWIVRNSWGTKWGENGYVKMERNV 337
Query: 332 -----GLCGIATAASYPV 344
G CGI T ASYP
Sbjct: 338 KKSHLGKCGIMTEASYPT 355
>gi|326493368|dbj|BAJ85145.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 436
Score = 330 bits (845), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 163/336 (48%), Positives = 214/336 (63%), Gaps = 14/336 (4%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
+ L +VS E + + +WMA+HG TY E+ R F+ NL YI++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 77 N---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N G +++LG N F+DLTNEE+R+ Y G R P R+ S + ++ + ++P
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLGA-RTKPDRERKLS--ARYQAADNDELPE 134
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHG 192
S+DWR+KGAV +KDQG CGSCWAFSA+AAVEGI QI G +I LSEQ+LVDC T N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C+GGLMD AFE+II N G+ +E DYPY+ + CD K+ A TI YED+P E++L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV+NQP+SV ++A GRAF YKSG+ CG DHGVA VG+GT ENG YWL++
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVR 311
Query: 313 NSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
NSWG WGE GYIR+ R+ +G CGIA SYP
Sbjct: 312 NSWGSVWGEDGYIRMERNIKASSGKCGIAVEPSYPT 347
>gi|359483514|ref|XP_003632971.1| PREDICTED: LOW QUALITY PROTEIN: oryzain beta chain-like [Vitis
vinifera]
Length = 340
Score = 330 bits (845), Expect = 8e-88, Method: Compositional matrix adjust.
Identities = 163/343 (47%), Positives = 230/343 (67%), Gaps = 15/343 (4%)
Query: 13 MFVIIILVITC----ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
+FV + L I AS+ S R +HE S+ E+HEQWMA++ R YKD+ E+ R +FK
Sbjct: 3 LFVCMTLHIYYLEHRASEATS-RPLHEASMYERHEQWMARYSRNYKDDAEEERRFXMFKD 61
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
N+++I+ + GN KLG N +D+T+EEFRA +G +P S ++F++QNV
Sbjct: 62 NVDFIQTFDTAGNMPNKLGVNALADMTHEEFRA--SGNTFKIPPNLGLRSETTSFRHQNV 119
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
T +P+++DWR+K VTHIK+Q QCG CWAFSAVAA+EGI ++ K I LSEQ+LVDC
Sbjct: 120 TRIPSTMDWRKKRTVTHIKNQLQCGGCWAFSAVAAMEGIAKLQTSKSISLSEQELVDCDI 179
Query: 189 --DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC GG MD AF++II+N+GL +EA Y Y+ EG C+ +KE + AA I+ YE++P+
Sbjct: 180 FGSNIGCEGGCMDDAFKFIIQNRGLNSEARYLYKGVEGHCNKKKESSRAARINDYENMPE 239
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
E+ALL+ V++QP+SV +DA G AF FY+ G++ + GN+ D+GV G+G + + G
Sbjct: 240 FSEKALLKVVAHQPISVAIDAGGSAFQFYEIGIITXESGNDLDYGVTTDGYGRSAD--GK 297
Query: 307 KYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
K+WL+KNSWG WGE+GY R+ R GLCG ASYP A
Sbjct: 298 KHWLVKNSWGTDWGENGYTRMERGVKATTGLCGFTMQASYPTA 340
>gi|224136808|ref|XP_002326950.1| predicted protein [Populus trichocarpa]
gi|222835265|gb|EEE73700.1| predicted protein [Populus trichocarpa]
Length = 456
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 170/355 (47%), Positives = 227/355 (63%), Gaps = 35/355 (9%)
Query: 13 MFVIIILVITCAS----QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
M +++ LV +S ++S H + ++ +E+W+ +HG+ Y EK
Sbjct: 1 MLMLLFLVFALSSAFDMSIISYHQTHATKSSWRTDDEVMAMYEEWLVKHGKNYNALGEKE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALY----TGYNRPVPSVS-R 115
R IFK NL +I++ N E NRTY +G N F+DLTNEEFR++Y TG+ + +P S R
Sbjct: 61 KRFEIFKDNLMFIDQHNSE-NRTYTVGLNRFADLTNEEFRSMYLGTRTGHKKRLPKTSDR 119
Query: 116 QSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGK 174
+ R V D +P S+DWR++GAV +KDQG CGSCWAFS +AAVEGI +I G
Sbjct: 120 YAPR--------VGDSLPDSVDWRKEGAVAEVKDQGGCGSCWAFSTIAAVEGINKIVTGD 171
Query: 175 LIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA 233
LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE DYPY +G CD ++ A
Sbjct: 172 LIALSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEDDYPYLGRDGRCDTYRKNA 231
Query: 234 VAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVA 293
+I YED+P+ DE AL +AV+NQPVSV ++ GR F Y SGV +CG + DHGVA
Sbjct: 232 KVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGRNFQLYNSGVFTGECGTSLDHGVA 291
Query: 294 VVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
VG+GT E G YW+++NSWG++WGESGYIR+ R+ G CGIA SYP+
Sbjct: 292 AVGYGT---EKGKDYWIVRNSWGKSWGESGYIRMERNIASPTGKCGIAIEPSYPI 343
>gi|449454309|ref|XP_004144898.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
gi|449471311|ref|XP_004153272.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 349
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 172/349 (49%), Positives = 224/349 (64%), Gaps = 17/349 (4%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSM-HEPSI---VEKHEQWMAQHGRTYKDELEKA 60
F K+ +I + I T + G S H S+ +E E WM++H + Y+ EK
Sbjct: 6 FSKATLILSATLFITYATAHDFSIVGYSPEHLASMDKTIELFESWMSKHSKAYRSIEEKL 65
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IF NL++I++ NK+ + +Y LG NEF+DL++EEF++ Y G P ++SSR
Sbjct: 66 HRFEIFLDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKSKYLGLRVEFPR--KRSSR- 121
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
F Y +V D+P S+DWR KGAVT +K+QG CGSCWAFS VAAVEGI QI G L LSE
Sbjct: 122 -GFSYGDVEDLPESVDWRTKGAVTPVKNQGSCGSCWAFSTVAAVEGINQIVTGNLTSLSE 180
Query: 181 QQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
Q+L+DC N+GC GGLMD AF+YI+ N GL E DYPY EEG C +KE+ TIS
Sbjct: 181 QELIDCDRSFNNGCYGGLMDYAFQYIMSNSGLRKEEDYPYLMEEGRCIREKEQFEVVTIS 240
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
YED+P DEQ+LL+A+S+QPVSV ++AS R F FYK G+ CG DHGV VG+G+
Sbjct: 241 GYEDVPANDEQSLLKALSHQPVSVAIEASSRNFQFYKGGIFTGRCGTQMDHGVTAVGYGS 300
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+E G Y ++KNSWG WGE+GYIR+ R+ GLCGI ASYP
Sbjct: 301 SE---GTDYIIVKNSWGPKWGENGYIRMKRNTGKPEGLCGINQMASYPT 346
>gi|4731374|gb|AAD28477.1|AF133839_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 357
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 165/346 (47%), Positives = 224/346 (64%), Gaps = 18/346 (5%)
Query: 10 IIPMFVIIILVI--TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
+ P+ +++ L T + + E S+ +E+W + H + +D +K R N+FK
Sbjct: 4 LFPVLLVLALAFGSTLSIPIKEKDLESEDSLWSLYERWRSHHAVS-RDLDQKQKRFNVFK 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTF 123
+N+++I + NK + T+KL N+F D+TN+EFRA Y G ++R + S + F
Sbjct: 63 ENVKFIHEFNKNKDVTFKLALNKFGDMTNQEFRAKYAGSKVHHHRTMKGSRHGSGSGAKF 122
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
Y+N P SIDWRE+GAV +K+QGQCGSCWAFSA+AAVEGI QI +L+ LSEQ+L
Sbjct: 123 MYENAV-APPSIDWRERGAVAAVKNQGQCGSCWAFSAIAAVEGINQIVTKELVPLSEQEL 181
Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
+DC TD N GCSGGLMD AFE+I N G+ TE YPY+ E+ TC K+ + A I YE
Sbjct: 182 IDCDTDQNQGCSGGLMDYAFEFIKNNGGITTEDVYPYQAEDATC---KKNSPAVVIDGYE 238
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
D+P DE AL++AV+NQPV+V ++ASG F FY GV CG DHGVAVVG+GT ++
Sbjct: 239 DVPTNDEDALMKAVANQPVAVAIEASGYVFQFYSEGVFTGRCGTELDHGVAVVGYGTTQD 298
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G KYW ++NSWG WGESGY+R+ R GLCGIA ASYP+
Sbjct: 299 --GTKYWTVRNSWGADWGESGYVRMQRGIKATHGLCGIAMQASYPI 342
>gi|255540425|ref|XP_002511277.1| cysteine protease, putative [Ricinus communis]
gi|46395620|sp|O65039.1|CYSEP_RICCO RecName: Full=Vignain; AltName: Full=Cysteine endopeptidase; Flags:
Precursor
gi|2944446|gb|AAC62396.1| cysteine endopeptidase precursor [Ricinus communis]
gi|223550392|gb|EEF51879.1| cysteine protease, putative [Ricinus communis]
Length = 360
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 164/309 (53%), Positives = 206/309 (66%), Gaps = 10/309 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+W + H + EK R N+FK N ++ ANK ++ YKL N+F+D+TN EFR
Sbjct: 38 YERWRSHH-TVSRSLHEKQKRFNVFKHNAMHVHNANKM-DKPYKLKLNKFADMTNHEFRN 95
Query: 102 LYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
Y+G + R R + TF Y+ V VP S+DWR+KGAVT +KDQGQCGSCWAFS
Sbjct: 96 TYSGSKVKHHRMFRGGPRGNGTFMYEKVDTVPASVDWRKKGAVTSVKDQGQCGSCWAFST 155
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
+ AVEGI QI KL+ LSEQ+LVDC TD N GC+GGLMD AFE+I + G+ TEA+YPY
Sbjct: 156 IVAVEGINQIKTNKLVSLSEQELVDCDTDQNQGCNGGLMDYAFEFIKQRGGITTEANYPY 215
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
+GTCD KE A A +I +E++P+ DE ALL+AV+NQPVSV +DA G F FY GV
Sbjct: 216 EAYDGTCDVSKENAPAVSIDGHENVPENDENALLKAVANQPVSVAIDAGGSDFQFYSEGV 275
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCG 335
CG DHGVA+VG+GT + G KYW +KNSWG WGE GYIR+ R GLCG
Sbjct: 276 FTGSCGTELDHGVAIVGYGTTID--GTKYWTVKNSWGPEWGEKGYIRMERGISDKEGLCG 333
Query: 336 IATAASYPV 344
IA ASYP+
Sbjct: 334 IAMEASYPI 342
>gi|356563584|ref|XP_003550041.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 366
Score = 329 bits (843), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 157/343 (45%), Positives = 213/343 (62%), Gaps = 9/343 (2%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
+ +I + + ++CA + + + ++ +E+W+ +H + Y EK R +FK
Sbjct: 6 TLMISTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLGEKDKRFQVFK 65
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQ 126
NL +I++ N N TYKLG N+F+D+TNEE+R +Y G + + S + Y
Sbjct: 66 DNLGFIQEHNNNQNNTYKLGLNKFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYS 125
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+P +DWR KGAV IKDQG CGSCWAFS VA VE I +I GK + LSEQ+LVDC
Sbjct: 126 AGDQLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC 185
Query: 187 STD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N GC+GGLMD AFE+II+N G+ T+ DYPYR +G CD K+ A A I YED+P
Sbjct: 186 DRAYNQGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKAVNIDGYEDVP 245
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
DE AL +AV+ QPVS+ ++ASGRA Y+SGV +CG + DHGV VVG+G+ ENG
Sbjct: 246 PYDENALKKAVARQPVSIAIEASGRALQLYQSGVFTGECGTSLDHGVVVVGYGS---ENG 302
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
YWL++NSWG WGE GY ++ R+ G CGI ASYPV
Sbjct: 303 VDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|218202087|gb|EEC84514.1| hypothetical protein OsI_31214 [Oryza sativa Indica Group]
Length = 348
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 160/333 (48%), Positives = 222/333 (66%), Gaps = 17/333 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR YKD+ EKA R +FK N +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANAAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR T +PS +R P+ F+Y+NV
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRLTKTNKGF-IPSTTRV---PTGFRYENVNIDA 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+P ++DWR KG VT IKDQGQCG CWAFSAVAA+EGI +++ GKLI LSEQ+LVDC
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISLSEQELVDCDVHG 182
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE++YPY + C + A+I YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAADDKCKSVSNS--VASIKGYEDVPANN 240
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VD F FYK GV+ CG + DHG+ +G+G A + G KY
Sbjct: 241 EAALMKAVANQPVSVAVDGDDMTFQFYKGGVMIGSCGTDLDHGIVAIGYGKASD--GTKY 298
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIA 337
WL+KNSWG TWGE+G++R+ +D G+CG+A
Sbjct: 299 WLLKNSWGMTWGENGFLRMEKDISDKRGMCGLA 331
>gi|351629615|gb|AEQ54771.1| KDDL-tailed cysteine proteinase CP4 [Coffea canephora]
Length = 359
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 163/351 (46%), Positives = 229/351 (65%), Gaps = 13/351 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+K K+F+ + + +ILV + ++ E S+ + +E+W + H +D EK R
Sbjct: 1 MKMGKAFLFAVVLAVILVAAMSMEITERDLASEESLWDLYERWRSHH-TVSRDLSEKRKR 59
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
N+FK N+ +I K N++ ++ YKL N F+D+TN EFR Y+ + + SR +T
Sbjct: 60 FNVFKANVHHIHKVNQK-DKPYKLKLNSFADMTNHEFREFYSSKVKHYRML--HGSRANT 116
Query: 123 -FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
F + +P S+DWR++GAVT +K+QG+CGSCWAFS V VEGI +I G+L+ LSEQ
Sbjct: 117 GFMHGKTESLPASVDWRKQGAVTGVKNQGKCGSCWAFSTVVGVEGINKIKTGQLVSLSEQ 176
Query: 182 QLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
+LVDC TDN GC+GGLM+ A+E+I ++ G+ TE YPY+ +G+CD+ K A A TI +
Sbjct: 177 ELVDCETDNEGCNGGLMENAYEFIKKSGGITTERLYPYKARDGSCDSSKMNAPAVTIDGH 236
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTA 300
E +P DE AL++AV+NQPVSV +DASG FY GV D CGN DHGVAVVG+GTA
Sbjct: 237 EMVPANDENALMKAVANQPVSVAIDASGSDMQFYSEGVYAGDSCGNELDHGVAVVGYGTA 296
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILR-----DAGLCGIATAASYPVAI 346
+ G KYW++KNSWG WGE GYIR+ R + G+CGIA ASYP+ +
Sbjct: 297 LD--GTKYWIVKNSWGTGWGEQGYIRMQRGVDAAEGGVCGIAMEASYPLKL 345
>gi|212275830|ref|NP_001130503.1| cysteine protease 1 [Zea mays]
gi|194689328|gb|ACF78748.1| unknown [Zea mays]
gi|219886279|gb|ACL53514.1| unknown [Zea mays]
gi|238010470|gb|ACR36270.1| unknown [Zea mays]
gi|413920875|gb|AFW60807.1| cysteine protease 1 [Zea mays]
Length = 354
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 173/355 (48%), Positives = 234/355 (65%), Gaps = 30/355 (8%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKAM 61
+I + + ++ + + R + E ++ +H+QWMA+HGRTY+DE EKA
Sbjct: 11 VITFTAVALTILAVTTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAH 70
Query: 62 RLNIFKQNLEYIEKANKEGN--RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR 119
R +FK N ++++ +N G+ ++Y+L NEF+D+TN+EF A+YTG RPVP+ ++ +
Sbjct: 71 RFQVFKANADFVDASNAAGDDKKSYRLELNEFADMTNDEFMAMYTGL-RPVPAGAK---K 126
Query: 120 PSTFKYQNVT-----DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGK 174
+ FKY NVT D ++DWR+KGAVT IK+QGQCG CWAF+AVAAVEGI QIT G
Sbjct: 127 MAGFKYGNVTLSDADDDQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGN 186
Query: 175 LIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA 233
L+ LSEQQ++DC TD N+GC+GG +D AF+YI+ N GL TE YPY + C Q +
Sbjct: 187 LVSLSEQQVLDCDTDGNNGCNGGYIDNAFQYIVGNGGLGTEDAYPYTAAQAMC--QSVQP 244
Query: 234 VAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN-ADCGN--NCDH 290
VAA IS Y+D+P GDE AL AV+NQPVSV +DA F Y GV+ A C N +H
Sbjct: 245 VAA-ISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAASCSTPPNLNH 301
Query: 291 GVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPVA 345
V VG+GTAE+ G YWL+KN WG+ WGE GY+R+ R A CG+A ASYPVA
Sbjct: 302 AVTAVGYGTAED--GTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354
>gi|356517184|ref|XP_003527269.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 328 bits (842), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 162/312 (51%), Positives = 215/312 (68%), Gaps = 13/312 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E W+++HG+ Y+ EK R IFK NL++I++ NK + Y LG NEF+DL+++
Sbjct: 44 LIELFESWISRHGKIYQSIEEKLHRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 102
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G SR+ P F Y++V ++P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 103 EFKNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVTQVKNQGSCGSCWA 158
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF +I+EN GL E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENDGLHKEED 218
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTC+ KE+ TIS Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY
Sbjct: 219 YPYIMEEGTCEMAKEETEVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
GV + CG++ DHGVA VG+GTA+ G Y +KNSWG WGE GYIR+ R+ G
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335
Query: 333 LCGIATAASYPV 344
+CGI ASYP
Sbjct: 336 ICGIYKMASYPT 347
>gi|356508487|ref|XP_003522988.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 349
Score = 328 bits (841), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 161/312 (51%), Positives = 215/312 (68%), Gaps = 13/312 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK R +IFK NL++I++ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSRHGKIYQSIEEKLHRFDIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G SR+ P F Y++ ++P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 102 EFKNKYLGLK---VDYSRRRESPEEFTYKDF-ELPKSVDWRKKGAVTQVKNQGSCGSCWA 157
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF +I+EN GL E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTC+ KE+ TIS Y D+P+ +EQ+LL+A+ NQP+SV ++ASGR F FY
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALVNQPLSVAIEASGRDFQFYS 277
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV + CG++ DHGVA VG+GT++ G Y ++KNSWG WGE GYIR+ R+ G
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTSK---GVNYIIVKNSWGSKWGEKGYIRMRRNIGKPEG 334
Query: 333 LCGIATAASYPV 344
+CGI ASYP
Sbjct: 335 ICGIYKMASYPT 346
>gi|297792329|ref|XP_002864049.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
gi|297309884|gb|EFH40308.1| hypothetical protein ARALYDRAFT_495086 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 171/347 (49%), Positives = 225/347 (64%), Gaps = 15/347 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
K FI+ +++++ T S + + E S+ E +E+W + H E EKA R N+
Sbjct: 2 KRFIVLALCMLMVLETTKSLDFHEKDVESEDSLWELYERWKSHHTIARSLE-EKAKRFNV 60
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN---RPVPSVSRQSSRPST 122
FK N+++I + NK+ N +YKL N+F D+T+EEFR Y G N + RQ+++ +
Sbjct: 61 FKHNVKHIHETNKKEN-SYKLKLNKFGDMTSEEFRRTYAGSNIKHHRMFQGERQTTK--S 117
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F Y NV +PTS+DWR+ GAVT +K+QGQCGSCWAFS V AVEGI QI KL LSEQ+
Sbjct: 118 FMYANVDTLPTSVDWRKNGAVTPVKNQGQCGSCWAFSTVVAVEGINQIRTKKLTSLSEQE 177
Query: 183 LVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
LVDC T+ N GC+GGLMD AFE+I E GL +E YPY+ + TCD KE A +I +
Sbjct: 178 LVDCDTNKNQGCNGGLMDLAFEFIKEKGGLTSELVYPYKASDETCDTNKENAPVVSIDGH 237
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
ED+PK E L++AV++QPVSV +DA G F FY GV CG +HGVAVVG+GT
Sbjct: 238 EDVPKNSEVDLMKAVAHQPVSVAIDAGGSDFQFYSEGVFTGRCGTELNHGVAVVGYGTTI 297
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
+ G KYW++KNSWGE WGE GYIR+ R GLCGIA ASYP+
Sbjct: 298 D--GTKYWIVKNSWGEEWGEKGYIRMQRGIRHKEGLCGIAMEASYPL 342
>gi|146216000|gb|ABQ10202.1| cysteine protease Cp4 [Actinidia deliciosa]
Length = 463
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 160/342 (46%), Positives = 221/342 (64%), Gaps = 15/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEK-----HEQWMAQHGRTYKDELEKAMRLNIFK 67
+F+ ++S H P + +E+W+ HG+ Y EK R IFK
Sbjct: 13 LFLCFAFSSALDMSIISYDQTHPPQRTDAEAMAIYEKWLTTHGKAYNAIGEKERRFEIFK 72
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL ++++ N +Y++G N F+DLTNEE+R+++ G N + S S++ + ++
Sbjct: 73 DNLRFVDEHNAVAG-SYRVGLNRFADLTNEEYRSMFLGGNMEMKERS-ASTKSDRYAFRA 130
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P S+DWREKGAV+ +KDQGQCGSCWAFS ++AVEGI QI G+LI LSEQ+LVDC
Sbjct: 131 GDKLPGSVDWREKGAVSPVKDQGQCGSCWAFSTISAVEGINQIVTGELISLSEQELVDCD 190
Query: 188 TD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD F++II N G+ TE DYPYR +GTCD ++ A +I+ YED+P+
Sbjct: 191 KSYNMGCNGGLMDYGFQFIINNGGIDTEEDYPYRAVDGTCDQFRKNARVVSINGYEDVPE 250
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
DE +L +AV+NQPVSV ++A GRAF Y+SGV CG N DHGV VG+GT ENG
Sbjct: 251 DDENSLKKAVANQPVSVAIEAGGRAFQLYESGVFTGHCGTNLDHGVVAVGYGT---ENGV 307
Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
YW ++NSWG WGE+GYI++ R+ +G CGIA+ ASYP
Sbjct: 308 DYWTVRNSWGPKWGENGYIKLERNINATSGKCGIASMASYPT 349
>gi|194352750|emb|CAQ00103.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326514262|dbj|BAJ92281.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326519402|dbj|BAJ96700.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326524351|dbj|BAK00559.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326531998|dbj|BAK01375.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 356
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 167/315 (53%), Positives = 218/315 (69%), Gaps = 14/315 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+VE E+W+A+H + Y EK R +FK NL++I+K N+E +Y LG NEF+DLT++
Sbjct: 45 LVELFEKWLAKHQKAYASFEEKLHRFEVFKDNLKHIDKINRE-VTSYWLGLNEFADLTHD 103
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSC 155
EF+A Y G + R SSR +F+Y++V+ D+P S+DWR+KGAVT +K+QGQCGSC
Sbjct: 104 EFKAAYLGLD--AAPARRGSSR--SFRYEDVSASDLPKSVDWRKKGAVTEVKNQGQCGSC 159
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
WAFS VAAVEGI I G L LSEQ+L+DCS D N GC+GGLMD AF YI + GL TE
Sbjct: 160 WAFSTVAAVEGINAIVTGNLTALSEQELIDCSVDGNSGCNGGLMDYAFSYIASSGGLHTE 219
Query: 215 ADYPYRHEEGTC-DNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
YPY EEG+C D +K ++ A TIS YED+P DEQAL++A+++QPVSV ++ASGR F
Sbjct: 220 EAYPYLMEEGSCGDGKKAESEAVTISGYEDVPANDEQALIKALAHQPVSVAIEASGRHFQ 279
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
FY GV + CG DHGVA VG+G+ ++ G Y +++NSWG WGE GYIR+ R
Sbjct: 280 FYSGGVFDGPCGAQLDHGVAAVGYGS-DKGKGHDYIIVRNSWGAQWGEKGYIRMKRGTSN 338
Query: 332 --GLCGIATAASYPV 344
GLCGI ASYP
Sbjct: 339 GEGLCGINKMASYPT 353
>gi|30141027|dbj|BAC75927.1| cysteine protease-5 [Helianthus annuus]
Length = 365
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 157/326 (48%), Positives = 217/326 (66%), Gaps = 12/326 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
V+ + + + +E W+A+HG+TY EK R IF NL++I++ N GNR+YK+G
Sbjct: 22 VTSNTRTDEEVRNTYELWLARHGKTYNALGEKESRFRIFADNLKFIDEHNLSGNRSYKVG 81
Query: 88 TNEFSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKY--QNVTDVPTSIDWREKGAVT 144
N+F+DLTNEE+R++Y G P +++ + +Y Q P +DWRE+GAV+
Sbjct: 82 LNQFADLTNEEYRSMYLGTKVDPYRRIAKMQRGEISRRYAVQENEMFPAKVDWRERGAVS 141
Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFE 203
+K+QG CGSCWAFS VA+VEGI +I G LI LSEQ+LVDC N GC+GG MD AF+
Sbjct: 142 PVKNQGGCGSCWAFSTVASVEGINKIVTGDLISLSEQELVDCDNKYNSGCNGGSMDYAFQ 201
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
+I+ N G+ +E+DYPY+ CD + KA +I YED+P +E+AL++AV++QPVSV
Sbjct: 202 FIVSNGGIDSESDYPYKGVGAVCDPVRNKAKIVSIDGYEDVPPMNEKALMKAVAHQPVSV 261
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
++ASGRAF Y SGVL CG N DHGV VVG+G+ ENG YW+++NSWG WGE G
Sbjct: 262 GIEASGRAFQLYTSGVLTGSCGTNLDHGVVVVGYGS---ENGKDYWIVRNSWGPEWGEDG 318
Query: 324 YIRILRD-----AGLCGIATAASYPV 344
YIR+ R+ G+CGI ASYP+
Sbjct: 319 YIRMERNMVDTPVGMCGITLMASYPI 344
>gi|13897890|gb|AAK48495.1|AF259983_1 putative cysteine protease [Ipomoea batatas]
Length = 462
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 161/321 (50%), Positives = 206/321 (64%), Gaps = 9/321 (2%)
Query: 30 GRSMHEPSIVEKHEQWMAQHGRTYKD-ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGT 88
G S + ++ +E W+ +HG++Y EK R IFK NL YI++ N G+R+YKLG
Sbjct: 37 GLSRSDEEVMALYESWLVEHGKSYNGLGGEKDKRFEIFKDNLRYIDEQNSRGDRSYKLGL 96
Query: 89 NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
N F+DLTNEE+R+ Y G ++ + + +P SIDWREKGAV +KD
Sbjct: 97 NRFADLTNEEYRSTYLGAKTDARRRIAKTKSDRRYAPKAGGSLPDSIDWREKGAVAEVKD 156
Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIE 207
QG CGSCWAFS +AAVEGI QI G+LI LSEQ+LVDC T N GC+GGLMD AFE+II+
Sbjct: 157 QGSCGSCWAFSTIAAVEGINQIVTGELISLSEQELVDCDTSYNEGCNGGLMDYAFEFIIK 216
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
N G+ TEADYPY G CD ++ A +I YED+ DE AL +AV+ QPVSV ++A
Sbjct: 217 NGGIDTEADYPYTGRYGRCDQTRKNAKVVSIDGYEDVTPYDEAALKEAVAGQPVSVAIEA 276
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
GR F Y SG+ CG + DHGV VG+GT ENG YW++KNSW +WGE GY+R+
Sbjct: 277 GGRDFQLYSSGIFTGSCGTDLDHGVTAVGYGT---ENGVDYWIVKNSWAASWGEKGYLRM 333
Query: 328 LRDA----GLCGIATAASYPV 344
R+ GLCGIA SYP
Sbjct: 334 QRNVKDKNGLCGIAIEPSYPT 354
>gi|356553978|ref|XP_003545327.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 496
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 166/348 (47%), Positives = 218/348 (62%), Gaps = 18/348 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRL 63
I+ +F + + ++S + H + ++ +EQW+ +HG+ Y EK R
Sbjct: 41 ILLLFTVFAVSSALDMSIISYDNAHAATSRSDEELMSMYEQWLVKHGKVYNALGEKEKRF 100
Query: 64 NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
IFK NL +I+ N + +RTYKLG N F+DLTNEE+RA Y G + R PS
Sbjct: 101 QIFKDNLRFIDDHNSQEDRTYKLGLNRFADLTNEEYRAKYLGTK--IDPNRRLGKTPSNR 158
Query: 124 KYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
V D +P S+DWR++GAV +KDQG CGSCWAFSA+ AVEGI +I G+LI LSEQ+
Sbjct: 159 YAPRVGDKLPESVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLSEQE 218
Query: 183 LVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
LVDC T N GC+GGLMD AFE+II N G+ +E DYPYR +G CD ++ A +I Y
Sbjct: 219 LVDCDTGYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYRGVDGRCDTYRKNAKVVSIDDY 278
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
ED+P DE AL +AV+NQPVSV ++ GR F Y SGV CG DHGV VG+GTA
Sbjct: 279 EDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYGTA- 337
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
NG YW+++NSWG +WGE GYIR+ R+ +G CGIA SYP+
Sbjct: 338 --NGHDYWIVRNSWGPSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 383
>gi|34223513|gb|AAQ62999.1| oil palm polygalacturonase allergen PEST472 [Elaeis guineensis]
Length = 525
Score = 328 bits (840), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 161/325 (49%), Positives = 209/325 (64%), Gaps = 12/325 (3%)
Query: 28 VSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTY 84
V G E + +E W+A+HGR EK R IFK N+ +I+ N G+R++
Sbjct: 36 VQGLERSEEEMRLLYEGWLAKHGRADNALGEKERRFEIFKDNVRFIDAHNAAADSGHRSF 95
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
+LG N F+D+TNEE+R +Y G RP R ++Y ++P S+DWR+KGAVT
Sbjct: 96 RLGLNRFADMTNEEYRTVYLG-TRPASHRRRARLGSDRYRYNAGEELPESVDWRDKGAVT 154
Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFE 203
+KDQG CGSCWAFS +AAVEGI +I G LI LSEQ+LVDC N GC+GGLMD AFE
Sbjct: 155 TVKDQGSCGSCWAFSTIAAVEGINKIVTGDLISLSEQELVDCDNGQNQGCNGGLMDYAFE 214
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
+II N G+ TE DYPY+ +G CD ++ A +I YED+P DE+AL +AV+NQPVSV
Sbjct: 215 FIINNGGIDTEEDYPYKARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSV 274
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
++A GR F Y SG+ CG + DHGV VG+GT ENG YW+++NSWG WGESG
Sbjct: 275 AIEAGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGT---ENGKDYWIVRNSWGGDWGESG 331
Query: 324 YIRILRD----AGLCGIATAASYPV 344
YIR+ R+ G CGIA +SYP
Sbjct: 332 YIRMERNVNASTGKCGIAMESSYPT 356
>gi|356517188|ref|XP_003527271.1| PREDICTED: xylem cysteine proteinase 2-like [Glycine max]
Length = 350
Score = 327 bits (839), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 162/312 (51%), Positives = 214/312 (68%), Gaps = 13/312 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y++ EK +R IFK NL++I++ NK + Y LG NEF+DL++
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHR 102
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF Y G SR+ P F Y++V ++P S+DWR+KGAV +K+QG CGSCWA
Sbjct: 103 EFNNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF +I+EN GL E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTC+ KE+ TIS Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY
Sbjct: 219 YPYIMEEGTCEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV + CG++ DHGVA VG+GTA+ G Y +KNSWG WGE GYIR+ R+ G
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335
Query: 333 LCGIATAASYPV 344
+CGI ASYP
Sbjct: 336 ICGIYKMASYPT 347
>gi|18396939|ref|NP_564320.1| Papain family cysteine protease [Arabidopsis thaliana]
gi|9502427|gb|AAF88126.1|AC021043_19 Putative cysteine proteinase [Arabidopsis thaliana]
gi|67633400|gb|AAY78625.1| peptidase C1A papain family protein [Arabidopsis thaliana]
gi|332192919|gb|AEE31040.1| Papain family cysteine protease [Arabidopsis thaliana]
Length = 346
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 172/347 (49%), Positives = 233/347 (67%), Gaps = 21/347 (6%)
Query: 14 FVIIILVITCA----SQVVSGRSMHEPS-IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
FV ++L I S+ S ++++PS IV+ H+QWM Q R Y DE EK +RL + +
Sbjct: 6 FVCVVLTIFFMDLKISEATSRVALYKPSSIVDYHQQWMIQFSRVYDDEFEKQLRLQVLTE 65
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGY---NRPVPSVSRQSSRPSTFKY 125
NL++IE N GN++YKLG NEF+D T EEF A YTG N P ++P+
Sbjct: 66 NLKFIESFNNMGNQSYKLGVNEFTDWTKEEFLATYTGLRGVNVTSPFEVVNETKPAW--N 123
Query: 126 QNVTDV-PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
V+DV T+ DWR +GAVT +K QG+CG CWAFSA+AAVEG+T+I RG LI LSEQQL+
Sbjct: 124 WTVSDVLGTNKDWRNEGAVTPVKSQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLL 183
Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC+ + N+GC GG AF YII+++G+++E +YPY+ +EG C + A+ I +E+
Sbjct: 184 DCTREQNNGCKGGTFVNAFNYIIKHRGISSENEYPYQVKEGPCRSNARPAI--LIRGFEN 241
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEE 302
+P +E+ALL+AVS QPV+V +DAS F Y GV NA +CG + +H V +VG+GT+ E
Sbjct: 242 VPSNNERALLEAVSRQPVAVAIDASEAGFVHYSGGVYNARNCGTSVNHAVTLVGYGTSPE 301
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
G KYWL KNSWG+TWGE+GYIRI RD G+CG+A ASYPVA
Sbjct: 302 --GMKYWLAKNSWGKTWGENGYIRIRRDVEWPQGMCGVAQYASYPVA 346
>gi|358248896|ref|NP_001239703.1| uncharacterized protein LOC100799247 precursor [Glycine max]
gi|255636729|gb|ACU18700.1| unknown [Glycine max]
Length = 341
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 157/313 (50%), Positives = 216/313 (69%), Gaps = 11/313 (3%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
E+HE+WMAQ+G+ YKD EK R +FK N+++IE N G++ + L N+F+DL +EEF
Sbjct: 33 ERHEKWMAQYGKVYKDAAEKEKRFQVFKNNVQFIESFNAAGDKPFNLSINQFADLHDEEF 92
Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCWAF 158
+AL + V +++ ++F+Y+NVT +P+++DWR++GAVT IKDQG CGSCWAF
Sbjct: 93 KALLNNVQKKASRV--ETATETSFRYENVTKIPSTMDWRKRGAVTPIKDQGYTCGSCWAF 150
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADY 217
+ VA VE + QIT G+L+ LSEQ+LVDC D+ GC GG ++ AFE+I G+ +EA Y
Sbjct: 151 ATVATVESLHQITTGELVSLSEQELVDCVRGDSEGCRGGYVENAFEFIANKGGITSEAYY 210
Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
PY+ ++ +C +KE A I YE +P E+ALL+AV+NQPVSV +DA AF FY S
Sbjct: 211 PYKGKDRSCKVKKETHGVARIIGYESVPSNSEKALLKAVANQPVSVYIDAGAIAFKFYSS 270
Query: 278 GVLNA-DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
G+ A +CG + DH VAVVG+G + G KYWL+KNSW WGE GY+RI RD G
Sbjct: 271 GIFEARNCGTHLDHAVAVVGYGKLRD--GTKYWLVKNSWSTAWGEKGYMRIKRDIRAKKG 328
Query: 333 LCGIATAASYPVA 345
LCGIA+ ASYP+A
Sbjct: 329 LCGIASNASYPIA 341
>gi|109390302|gb|ABG33750.1| cysteine protease [Hevea brasiliensis]
Length = 457
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 165/351 (47%), Positives = 225/351 (64%), Gaps = 26/351 (7%)
Query: 13 MFVIIILVITCAS----QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKA 60
MF+++ T +S ++S H + ++ +E W+ +HG+ Y EK
Sbjct: 1 MFMLLFFASTLSSASDLSIISYDQSHGTKSSWRTDDEVMAIYEDWLVKHGKAYNSLGEKE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R +FK NL +I++ N E NRTY++G N F+DLTNEE+R++Y G + + R R
Sbjct: 61 RRFEVFKDNLRFIDEHNSE-NRTYRVGLNRFADLTNEEYRSMYLG---ALSGIRRNKLRK 116
Query: 121 STFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
+ +Y V D +P S+DWR++GAV +KDQG CGSCWAFSAVAAVEGI +I G LI L
Sbjct: 117 ISDRYTPRVGDSLPDSVDWRKEGAVVGVKDQGSCGSCWAFSAVAAVEGINKIVTGDLISL 176
Query: 179 SEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
SEQ+LVDC N GC+GGLMD FE+II N G+ +E DYPY +G CD ++ A +
Sbjct: 177 SEQELVDCDNSYNEGCNGGLMDYGFEFIINNGGIDSEEDYPYLARDGRCDTYRKNARVVS 236
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I YED+P +E AL +AV+NQPVSV ++A GR F Y SGV + CG DHGV VG+
Sbjct: 237 IDSYEDVPVNNEAALQKAVANQPVSVAIEAGGRDFQLYSSGVFSGRCGTALDHGVVAVGY 296
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
GT ENG YW+++NSWG++WGESGY+R+ R+ G+CGIA ASYP+
Sbjct: 297 GT---ENGQDYWIVRNSWGKSWGESGYLRMARNIRKPTGICGIAMEASYPI 344
>gi|28192373|gb|AAK07730.1| CPR1-like cysteine proteinase [Nicotiana tabacum]
Length = 374
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 161/318 (50%), Positives = 212/318 (66%), Gaps = 13/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E + ++E W+A+HGR Y EK R IFK NL +IE+ N GNRTYK+G N+F+DL
Sbjct: 43 EDQVKNRYEMWLAEHGRAYNALGEKEKRFEIFKDNLRFIEEHNNSGNRTYKVGLNQFADL 102
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQC 152
TNEE+R +Y G +S PS +Y + + +P S+DWR++GAV IK+QG C
Sbjct: 103 TNEEYRTMYLGTKSDARRRFVKSKNPSQ-RYASRPNELMPHSVDWRKRGAVAPIKNQGSC 161
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS VAAV GI QI G++I LSEQ+LVDC N GC+GGLMD AFE+II N G+
Sbjct: 162 GSCWAFSTVAAVGGINQIVTGEMITLSEQELVDCDRVQNSGCNGGLMDYAFEFIISNGGM 221
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE YPYR EG CD ++ +I YED+P+ +E+AL +AV++QPV V ++ASGRA
Sbjct: 222 DTEKHYPYRGVEGRCDPVRKNYKVVSIDGYEDVPR-NERALQKAVAHQPVCVAIEASGRA 280
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
F Y SGV +CG DHGV VVG+G+ E+G YW+++NSWG WGE+GY+++ R+
Sbjct: 281 FQLYSSGVFTGECGEEVDHGVVVVGYGS---EDGVDYWIVRNSWGTKWGENGYVKMERNV 337
Query: 332 -----GLCGIATAASYPV 344
G CGI T ASYP
Sbjct: 338 KKSHLGKCGIMTEASYPT 355
>gi|148927382|gb|ABR19827.1| cysteine proteinase [Elaeis guineensis]
Length = 470
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 164/322 (50%), Positives = 207/322 (64%), Gaps = 14/322 (4%)
Query: 31 RSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLG 87
RS E I+ +E W+A+HGR Y EK R IFK N+ +I+ N G+R+++LG
Sbjct: 41 RSEEEMRIL--YEGWLAKHGRAYNALGEKERRFEIFKDNVLFIDAHNAAADAGHRSFRLG 98
Query: 88 TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N F+D+TNEE+RA+Y G RP R ++Y D+P S+DWR KGAV +K
Sbjct: 99 LNRFADMTNEEYRAVYLG-TRPAGHRRRARVGSDRYRYNAGEDLPESVDWRAKGAVAAVK 157
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYII 206
DQG CGSCWAFS VAAVEGI +I G LI LSEQ+LVDC N GC+GGLMD FE+II
Sbjct: 158 DQGSCGSCWAFSTVAAVEGINKIVTGDLISLSEQELVDCDNGYNQGCNGGLMDYGFEFII 217
Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVD 266
N G+ TE DYPY +G CD ++ A +I YED+P DE+AL +AV+NQPVSV ++
Sbjct: 218 NNGGIDTEEDYPYTARDGKCDQYRKNAKVVSIDGYEDVPVNDEKALQKAVANQPVSVAIE 277
Query: 267 ASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
A GR F Y SG+ CG + DHGV VG+GT ENG YW+++NSWG WGESGYIR
Sbjct: 278 AGGREFQLYHSGIFTGRCGTDLDHGVVAVGYGT---ENGKDYWIVRNSWGGDWGESGYIR 334
Query: 327 ILRD----AGLCGIATAASYPV 344
+ R+ G CGIA SYP
Sbjct: 335 MERNVNTSTGKCGIAIEPSYPT 356
>gi|157093728|gb|ABV22590.1| KDEL-tailed cysteine endopeptidase [Solanum lycopersicum]
Length = 360
Score = 327 bits (839), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 166/349 (47%), Positives = 222/349 (63%), Gaps = 16/349 (4%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
+K F++ + ++L + + E E +E+W + H + + EK R N
Sbjct: 1 MKKLFLVLFTLALVLRLGESFDFHEKELETEEKFWELYERWRSHHTVSRSLD-EKHKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRP 120
+FK N+ Y+ NK+ ++ YKL N+F+D+TN EFR Y G ++R + SR +
Sbjct: 60 VFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEFRQHYAGSKIKHHRTLLGASRANG-- 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
TF Y N +VP SIDWR+KGAVT +KDQGQCGSCWAFS V AVEGI QI KL+ LSE
Sbjct: 117 -TFMYANEDNVPPSIDWRKKGAVTPVKDQGQCGSCWAFSTVVAVEGINQIKTKKLVSLSE 175
Query: 181 QQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
Q+LVDC +T+N GC+GGLMD AF++I + G+ TE YPY+ E+ CD QK +I
Sbjct: 176 QELVDCDTTENQGCNGGLMDPAFDFIKKRGGITTEERYPYKAEDDKCDIQKRNTPVVSID 235
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
+ED+P DE ALL+AV+NQP+SV +DASG F FY GV +CG DHGVA+VG+GT
Sbjct: 236 GHEDVPPNDEDALLKAVANQPISVAIDASGSQFQFYSEGVFTGECGTELDHGVAIVGYGT 295
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
+ G KYW++KNSWG WGE GYIR+ R + GLCGIA SYP+
Sbjct: 296 TVD--GTKYWIVKNSWGAGWGEKGYIRMQRKVDAEEGLCGIAMQPSYPI 342
>gi|18394919|ref|NP_564126.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
gi|71153409|sp|Q9LM66.2|XCP2_ARATH RecName: Full=Xylem cysteine proteinase 2; Short=AtXCP2; Flags:
Precursor
gi|4836904|gb|AAD30607.1|AC007369_17 Putative cysteine proteinase [Arabidopsis thaliana]
gi|6708183|gb|AAF25832.1|AF191028_1 papain-type cysteine endopeptidase XCP2 [Arabidopsis thaliana]
gi|28466959|gb|AAO44088.1| At1g20850 [Arabidopsis thaliana]
gi|110743795|dbj|BAE99733.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332191910|gb|AEE30031.1| Xylem cysteine proteinase 2 [Arabidopsis thaliana]
Length = 356
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 157/313 (50%), Positives = 220/313 (70%), Gaps = 12/313 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E W++ + Y+ EK +R +FK NL++I++ NK+G ++Y LG NEF+DL++E
Sbjct: 47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLSHE 105
Query: 98 EFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
EF+ +Y G + V R R + F Y++V VP S+DWR+KGAV +K+QG CGSCW
Sbjct: 106 EFKKMYLGLKTDI--VRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
AFS VAAVEGI +I G L LSEQ+L+DC T N+GC+GGLMD AFEYI++N GL E
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEE 223
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
DYPY EEGTC+ QK+++ TI+ ++D+P DE++LL+A+++QP+SV +DASGR F FY
Sbjct: 224 DYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFY 283
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---- 331
GV + CG + DHGVA VG+G+++ G+ Y ++KNSWG WGE GYIR+ R+
Sbjct: 284 SGGVFDGRCGVDLDHGVAAVGYGSSK---GSDYIIVKNSWGPKWGEKGYIRLKRNTGKPE 340
Query: 332 GLCGIATAASYPV 344
GLCGI AS+P
Sbjct: 341 GLCGINKMASFPT 353
>gi|118486542|gb|ABK95110.1| unknown [Populus trichocarpa]
Length = 465
Score = 327 bits (838), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 164/318 (51%), Positives = 214/318 (67%), Gaps = 23/318 (7%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ +E+W+ +HG+ Y EK R IFK NL +I++ N E NRTY +G N F+DLTNE
Sbjct: 47 VMAMYEEWLVKHGKNYNALGEKEKRFEIFKDNLMFIDQHNSE-NRTYTVGLNRFADLTNE 105
Query: 98 EFRALY----TGYNRPVPSVS-RQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQ 151
EFR++Y TG+ + +P S R + R V D +P S+DWR++GAV +KDQG
Sbjct: 106 EFRSMYLGTRTGHKKRLPKTSDRYAPR--------VGDSLPDSVDWRKEGAVAEVKDQGG 157
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
CGSCWAFS +AAVEGI +I G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G
Sbjct: 158 CGSCWAFSTIAAVEGINKIVTGDLIALSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGG 217
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
+ TE DYPY +G CD ++ A +I YED+P+ DE AL +AV+NQPVSV ++ GR
Sbjct: 218 IDTEDDYPYLGRDGRCDTYRKNAKVVSIDSYEDVPENDETALKKAVANQPVSVAIEGGGR 277
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
F Y SGV +CG + DHGVA VG+GT E G YW+++NSWG++WGESGYIR+ R+
Sbjct: 278 NFQLYNSGVFTGECGTSLDHGVAAVGYGT---EKGKDYWIVRNSWGKSWGESGYIRMERN 334
Query: 331 ----AGLCGIATAASYPV 344
G CGIA SYP+
Sbjct: 335 IASPTGKCGIAIEPSYPI 352
>gi|356515044|ref|XP_003526211.1| PREDICTED: LOW QUALITY PROTEIN: thiol protease SEN102-like [Glycine max]
Length = 337
Score = 327 bits (838), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 166/340 (48%), Positives = 220/340 (64%), Gaps = 20/340 (5%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ +F+++ + I SQV+S R +HE S+ E+HE W+A++G+ YK EK IFK+N+
Sbjct: 11 LALFLLLSIEI---SQVMS-RKLHETSLREEHENWIARYGQVYKVAAEKET-FQIFKENV 65
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+IE N N+ YKLG N F+DLT EEF+ G + S P FKY+NVTD
Sbjct: 66 EFIESFNAAANKPYKLGVNLFADLTLEEFKDFRFGLKK----THEFSITP--FKYENVTD 119
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P ++DWREKGAVT IKDQGQCGSCWAFS VAA EGI QIT G L+ L EQ+LV C T
Sbjct: 120 IPEALDWREKGAVTPIKDQGQCGSCWAFSTVAATEGIHQITTGNLVSLXEQELVSCDTKG 179
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
+ GC GG M+ FE+II+N G+ T+A+YPY+ GTC+ + A I YE +P
Sbjct: 180 VDQGCEGGYMEDGFEFIIKNGGITTKANYPYKGVNGTCNTTIAASTVAQIKGYETVPSYS 239
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E+AL +AV+NQPVSV +DA+ F FY G+ +CG + DHGV VG+GT E + Y
Sbjct: 240 EEALQKAVANQPVSVSIDANNGHFMFYAGGIYTGECGTDLDHGVTAVGYGTTNETD---Y 296
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
W++KNSWG W E G+IR+ R GLCG+A +SYP
Sbjct: 297 WIVKNSWGTGWDEKGFIRMQRGITVKHGLCGVALDSSYPT 336
>gi|226495425|ref|NP_001148706.1| cysteine protease 1 precursor [Zea mays]
gi|195621544|gb|ACG32602.1| cysteine protease 1 precursor [Zea mays]
Length = 463
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 160/312 (51%), Positives = 206/312 (66%), Gaps = 16/312 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
+ +WMA HGRTY E+ R +F+ NL YI+ N G +++LG N F+DLTN+E
Sbjct: 41 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 100
Query: 99 FRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
+RA Y G RP R+ + + + D+P S+DWR KGAV +KDQG CGSCWA
Sbjct: 101 YRATYLGARTRP----QRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWA 156
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS +AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE D
Sbjct: 157 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 216
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ +G CD ++ A TI YED+P DE++L +AV+NQPVSV ++A+G AF Y
Sbjct: 217 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYS 276
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
SG+ CG DHGV VG+GT ENG YW++KNSWG +WGESGY+R+ R+ +G
Sbjct: 277 SGIFTGSCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 333
Query: 333 LCGIATAASYPV 344
CGIA SYP+
Sbjct: 334 KCGIAVEPSYPL 345
>gi|358348957|ref|XP_003638507.1| Cysteine proteinase [Medicago truncatula]
gi|355504442|gb|AES85645.1| Cysteine proteinase [Medicago truncatula]
Length = 362
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 161/345 (46%), Positives = 222/345 (64%), Gaps = 10/345 (2%)
Query: 6 EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
+K +I + + ++LV++ + + S+ + +E+W + H + ++ EK R N+
Sbjct: 4 KKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFK 124
FK N+ ++ NK ++ YKL N+F+D+TN EF+ Y G + R + R S TF
Sbjct: 63 FKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFM 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
Y+N T P S+DWR+KGAVT +KDQGQCGSCWAFS V AVEGI QI +L+ LSEQ+L+
Sbjct: 122 YENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELI 181
Query: 185 DCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC +N GC+GGLM+ AFEYI + G+ TE+ YPY +G+CD KE A +I +E
Sbjct: 182 DCDNQENQGCNGGLMEYAFEYIKQKGGITTESYYPYTANDGSCDATKENVPAVSIDGHET 241
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P DE ALL+AV+NQPVSV +DA G F FY GV DCG +HGVA+VG+GT +
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTTVD- 300
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G YW+++NSWG WGE GYIR+ R+ GLCGIA ASYPV
Sbjct: 301 -GTNYWIVRNSWGAEWGEQGYIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|414585111|tpg|DAA35682.1| TPA: cysteine proteinase Mir3 [Zea mays]
Length = 468
Score = 327 bits (837), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 160/312 (51%), Positives = 206/312 (66%), Gaps = 16/312 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
+ +WMA HGRTY E+ R +F+ NL YI+ N G +++LG N F+DLTN+E
Sbjct: 46 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 105
Query: 99 FRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
+RA Y G RP R+ + + + D+P S+DWR KGAV +KDQG CGSCWA
Sbjct: 106 YRATYLGARTRP----QRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGSCWA 161
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS +AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE D
Sbjct: 162 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 221
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ +G CD ++ A TI YED+P DE++L +AV+NQPVSV ++A+G AF Y
Sbjct: 222 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYS 281
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
SG+ CG DHGV VG+GT ENG YW++KNSWG +WGESGY+R+ R+ +G
Sbjct: 282 SGIFTGSCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 338
Query: 333 LCGIATAASYPV 344
CGIA SYP+
Sbjct: 339 KCGIAVEPSYPL 350
>gi|356564154|ref|XP_003550321.1| PREDICTED: cysteine proteinase RD21a-like [Glycine max]
Length = 476
Score = 326 bits (836), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 166/351 (47%), Positives = 217/351 (61%), Gaps = 21/351 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH---------EPSIVEKHEQWMAQHGRTYKDELEKA 60
I+ +F + + ++S S H E ++ +EQW+ +HG+ Y EK
Sbjct: 18 IVLLFTVFAVSSALDMSIISYDSAHADKAATLRTEEELMSMYEQWLVKHGKVYNALGEKE 77
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK NL +I+ N +RTYKLG N F+DLTNEE+RA Y G + R P
Sbjct: 78 KRFQIFKDNLRFIDDHNSAEDRTYKLGLNRFADLTNEEYRAKYLGTK--IDPNRRLGKTP 135
Query: 121 STFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
S V D +P S+DWR++GAV +KDQG CGSCWAFSA+ AVEGI +I G+LI LS
Sbjct: 136 SNRYAPRVGDKLPDSVDWRKEGAVPPVKDQGGCGSCWAFSAIGAVEGINKIVTGELISLS 195
Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
EQ+LVDC T N GC+GGLMD AFE+II N G+ ++ DYPYR +G CD ++ A +I
Sbjct: 196 EQELVDCDTGYNQGCNGGLMDYAFEFIINNGGIDSDEDYPYRGVDGRCDTYRKNAKVVSI 255
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
YED+P DE AL +AV+NQPVSV ++ GR F Y SGV CG DHGV VG+G
Sbjct: 256 DDYEDVPAYDELALKKAVANQPVSVAIEGGGREFQLYVSGVFTGRCGTALDHGVVAVGYG 315
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
TA+ G YW+++NSWG +WGE GYIR+ R+ +G CGIA SYP+
Sbjct: 316 TAK---GHDYWIVRNSWGSSWGEDGYIRLERNLANSRSGKCGIAIEPSYPL 363
>gi|255555337|ref|XP_002518705.1| cysteine protease, putative [Ricinus communis]
gi|223542086|gb|EEF43630.1| cysteine protease, putative [Ricinus communis]
Length = 471
Score = 326 bits (836), Expect = 8e-87, Method: Compositional matrix adjust.
Identities = 159/320 (49%), Positives = 211/320 (65%), Gaps = 19/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ + +E W+ +HG+ Y EK R IFK NL +I++ N +R+YK+G N F+DL
Sbjct: 44 DSQVRRMYEMWLVEHGKAYNALGEKEKRFEIFKDNLRFIDEHNSV-DRSYKVGLNRFADL 102
Query: 95 TNEEFRALYTGYNRPVPSVSRQS----SRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG 150
TNEE++A++ G + R++ +R + +++ D+P ++DWREKGAV +KDQG
Sbjct: 103 TNEEYKAMFLG-----TKMERKNRFLGTRSQRYLFKDGDDLPENVDWREKGAVVPVKDQG 157
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENK 209
QCGSCWAFS V AVEGI QI G+LI LSEQ+LVDC N GC+GGLMD AFE+II N
Sbjct: 158 QCGSCWAFSTVGAVEGINQIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNG 217
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
G+ TE DYPY+ + CD ++ A TI YED+P+ DE +L +AV++QPVSV ++A G
Sbjct: 218 GIDTEEDYPYKASDNICDPNRKNAKVVTIDGYEDVPENDENSLKKAVAHQPVSVAIEAGG 277
Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
RAF YKSGV CG DHGV VG+GT ENG YW+++NSWG WGESGYIR+ R
Sbjct: 278 RAFQLYKSGVFTGRCGTELDHGVVAVGYGT---ENGVNYWIVRNSWGSAWGESGYIRMER 334
Query: 330 D-----AGLCGIATAASYPV 344
+ G CGIA SYP
Sbjct: 335 NVANTKTGKCGIAIQPSYPT 354
>gi|58531896|gb|AAW78660.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 163/314 (51%), Positives = 208/314 (66%), Gaps = 16/314 (5%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
E +E+W + H + + EK R N+FK N+ Y+ NK+ ++ YKL N+F+D+TN EF
Sbjct: 36 ELYERWRSHHTVSRSLD-EKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHEF 93
Query: 100 RALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
R Y G ++R SR + TF Y NV DVP S+DWR+KGAVT +KDQG+CGSC
Sbjct: 94 RHHYAGSKIKHHRSFLGASRANG---TFMYANVEDVPPSVDWRKKGAVTPVKDQGKCGSC 150
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
WAFS V AVEGI QI +L+ LSEQ+LVDC T N GC+GGLMD AFE+I + G+ TE
Sbjct: 151 WAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINTE 210
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
+YPY E G CD QK + +I YED+P DE +LL+AV+NQPVSV + ASG F F
Sbjct: 211 ENYPYMAEGGECDIQKRNSPVVSIDGYEDVPPNDEDSLLKAVANQPVSVAIQASGSDFQF 270
Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----D 330
Y GV DCG DHGVA+VG+GT + G KYW+++NSWG WGE GYIR+ R +
Sbjct: 271 YSEGVFTGDCGTELDHGVAIVGYGTTLD--GTKYWIVRNSWGPEWGEKGYIRMQREIDAE 328
Query: 331 AGLCGIATAASYPV 344
GLCGIA SYP+
Sbjct: 329 EGLCGIAMQPSYPI 342
>gi|125533982|gb|EAY80530.1| hypothetical protein OsI_35710 [Oryza sativa Indica Group]
Length = 378
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 159/328 (48%), Positives = 212/328 (64%), Gaps = 20/328 (6%)
Query: 35 EPSIVEKHEQWMAQH--------GRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
E S+ +E+W +++ G D+ E R N+F +N YI +AN+ G R ++L
Sbjct: 35 EESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFRL 94
Query: 87 GTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
N+F+D+T +EFR Y G ++R + + + ++P ++DWRE+GA
Sbjct: 95 ALNKFADMTTDEFRRTYAGSRARHHRSLRGGRGGEGGSFRYGGDDEDNLPPAVDWRERGA 154
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKA 201
VT IKDQGQCGSCWAFSAVAAVEG+ +I G+L+ LSEQ+LVDC T DN GC GGLMD A
Sbjct: 155 VTGIKDQGQCGSCWAFSAVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDYA 214
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F++I N G+ TE++YPYR E+G C+ K + TI YED+P DE AL +AV+NQPV
Sbjct: 215 FQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQPV 274
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
+V V+ASG+ F FY GV +CG + DHGVA VG+G + G KYW++KNSWGE WGE
Sbjct: 275 AVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRD--GTKYWIVKNSWGEDWGE 332
Query: 322 SGYIRILRDA-----GLCGIATAASYPV 344
GYIR+ R GLCGIA ASYPV
Sbjct: 333 RGYIRMQRGVSSDSNGLCGIAMEASYPV 360
>gi|242086591|ref|XP_002439128.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
gi|241944413|gb|EES17558.1| hypothetical protein SORBIDRAFT_09g000960 [Sorghum bicolor]
Length = 371
Score = 326 bits (836), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 165/320 (51%), Positives = 217/320 (67%), Gaps = 16/320 (5%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
+H +++ E+W+A++ + Y EK R +FK NL +I++ANK+ TY LG N F+
Sbjct: 57 VHHDRLIKLFEEWVAKYRKAYASFEEKLHRFEVFKDNLHHIDEANKKVT-TYWLGLNAFA 115
Query: 93 DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQG 150
DLT++EF+A Y G +P + + S F+Y V D VP S+DWR+KGAVT +K+QG
Sbjct: 116 DLTHDEFKATYLGLRQP----ETKKTTDSRFRYGGVADDDVPASVDWRKKGAVTDVKNQG 171
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENK 209
QCGSCWAFS VAAVEGI QI G L LSEQ+LVDCSTD N+GC+GG+MD AF YI +
Sbjct: 172 QCGSCWAFSTVAAVEGINQIVTGNLTSLSEQELVDCSTDGNNGCNGGVMDNAFSYIASSG 231
Query: 210 GLATEADYPYRHEEGTCDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
GL TE YPY EEG CD++ ++ TIS YED+P DEQAL++A+++QP+SV ++AS
Sbjct: 232 GLRTEEAYPYLMEEGDCDDKARDGEQVVTISGYEDVPANDEQALVKALAHQPLSVAIEAS 291
Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
GR F FY GV N CG+ DHGVA VG+G+++ G Y ++KNSWG WGE GYIR+
Sbjct: 292 GRHFQFYSGGVFNGPCGSELDHGVAAVGYGSSK---GQDYIIVKNSWGSHWGEKGYIRMK 348
Query: 329 RDA----GLCGIATAASYPV 344
R GLCGI ASYP
Sbjct: 349 RGTGKPEGLCGINKMASYPT 368
>gi|226502454|ref|NP_001140922.1| hypothetical protein [Zea mays]
gi|223948637|gb|ACN28402.1| unknown [Zea mays]
gi|413920877|gb|AFW60809.1| hypothetical protein ZEAMMB73_830238 [Zea mays]
Length = 354
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 172/355 (48%), Positives = 234/355 (65%), Gaps = 30/355 (8%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKAM 61
+I + + ++ + + R + E ++ +H+QWMA+HGRTY+DE EKA
Sbjct: 11 VIAFTAVALTILAVKTMMAEARDLSSTSTGGYGEEAMKVRHQQWMAEHGRTYRDEAEKAH 70
Query: 62 RLNIFKQNLEYIEKANKEGN--RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR 119
R +FK N ++++ +N G+ ++Y++ NEF+D+TN+EF A+YTG RPVP+ ++ +
Sbjct: 71 RFQVFKANADFVDASNAAGDDKKSYRMELNEFADMTNDEFMAMYTGL-RPVPAGAK---K 126
Query: 120 PSTFKYQNVT-----DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGK 174
+ FKY NVT D ++DWR+KGAVT IK+QGQCG CWAF+AVAAVEGI QIT G
Sbjct: 127 MAGFKYGNVTLSDADDNQQTVDWRQKGAVTGIKNQGQCGCCWAFAAVAAVEGIHQITTGN 186
Query: 175 LIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA 233
L+ LSEQQ++DC T+ N+GC+GG +D AF+YI N GLATE YPY + C Q +
Sbjct: 187 LVSLSEQQVLDCDTEGNNGCNGGYIDNAFQYIAGNGGLATEDAYPYTAAQAMC--QSVQP 244
Query: 234 VAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN-ADCGN--NCDH 290
VAA IS Y+D+P GDE AL AV+NQPVSV +DA F Y GV+ A C N +H
Sbjct: 245 VAA-ISGYQDVPSGDEAALAAAVANQPVSVAIDA--HNFQLYGGGVMTAASCSTPPNLNH 301
Query: 291 GVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPVA 345
V VG+GTAE+ G YWL+KN WG+ WGE GY+R+ R A CG+A ASYPVA
Sbjct: 302 AVTAVGYGTAED--GTPYWLLKNQWGQNWGEGGYLRLERGANACGVAQQASYPVA 354
>gi|50355619|dbj|BAD29958.1| cysteine protease [Daucus carota]
Length = 496
Score = 326 bits (835), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 161/309 (52%), Positives = 205/309 (66%), Gaps = 13/309 (4%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
E W+ HG++Y E+ R IFK NL YI++ N +R +KLG N+F+DLTNEE+R+
Sbjct: 46 ESWLVTHGKSYNALGEEEKRFQIFKNNLRYIDEQNLVEDRGFKLGLNKFADLTNEEYRSK 105
Query: 103 YTGYNRP--VPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
YTG VS +S R +T +++ P S+DWRE GAV +KDQG CGSCWAFS
Sbjct: 106 YTGIKSKDLRKKVSAKSGRYATLSGESL---PESVDWRESGAVATVKDQGSCGSCWAFST 162
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
++AVEGI QI GKLI LSEQ+LVDC N GC+GGLMD AFE+II N G+ T+ DYPY
Sbjct: 163 ISAVEGINQIATGKLITLSEQELVDCDRSYNEGCNGGLMDYAFEFIINNGGIDTDVDYPY 222
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
+G CD ++ A TI YED+P DE AL +A +NQP+SV ++ASGR F FY SG+
Sbjct: 223 TGRDGKCDQYRKNAKVVTIDSYEDVPAYDELALKKAAANQPISVAIEASGRDFQFYDSGI 282
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCG 335
CG DHGV VVG+GT ENG YW+++NSWG WGE+GY+R+ R G+CG
Sbjct: 283 FTGKCGIALDHGVVVVGYGT---ENGKDYWIVRNSWGADWGENGYLRMERGISSKTGICG 339
Query: 336 IATAASYPV 344
IA SYPV
Sbjct: 340 IAIEPSYPV 348
>gi|2511689|emb|CAB17074.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 364
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 161/344 (46%), Positives = 218/344 (63%), Gaps = 12/344 (3%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
S +IP +++ + A+ +S + E +++ +E+W+ +H + Y EK R +FK
Sbjct: 3 SMLIPTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFK 61
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQ 126
NL +I+ N + N TY LG N+F+D+TNEE+RA+Y G V + + + Y
Sbjct: 62 DNLGFIQDHNAQ-NNTYTLGLNKFADITNEEYRAMYLGTRTDAKRRVMKTQNTGHRYAYN 120
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+ +P +DWR KGAV IKDQG CGSCWAFS VAAVEGI I G+ + LSEQ+LVDC
Sbjct: 121 SGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDC 180
Query: 187 STD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ + GC+GGLMD AF++II+N G+ TE DYPY+ +GTCD K+K I YED+P
Sbjct: 181 DREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDQTKKKTKVVQIDGYEDVP 240
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+E AL +AVS+QPVSV ++ASGRA Y+SGV CG DHGV VVG+GT ENG
Sbjct: 241 SNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT---ENG 297
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
YWL++NSWG WGE GY ++ R+ G CGIA SYPV
Sbjct: 298 VDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|30141019|dbj|BAC75923.1| cysteine protease-1 [Helianthus annuus]
Length = 461
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 155/308 (50%), Positives = 203/308 (65%), Gaps = 9/308 (2%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E W+ +HG+TY EK R IFK NL +I++ N G+ TYKLG N+F+DLTNEE+R
Sbjct: 52 YESWLVKHGKTYNALGEKDRRFQIFKDNLRFIDEHN-SGDHTYKLGLNKFADLTNEEYRM 110
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
YTG + + Y++ +P +DWRE+GAVT +KDQG CGSCWAFS
Sbjct: 111 TYTGIKTIDDKKKLSKMKSDRYAYRSGDSLPEYVDWREQGAVTDVKDQGSCGSCWAFSTT 170
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
+VEG+ +I G LI +SEQ+LV+C T N GC+GGLMD AFE+II+N G+ TE DYPY
Sbjct: 171 GSVEGVNKIVTGDLISVSEQELVNCDTSYNQGCNGGLMDYAFEFIIKNGGIDTEEDYPYT 230
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
++G CD K+ A TI YED+P DE +L +AVSNQPV+V ++A GR F FY SG+
Sbjct: 231 GKDGKCDKNKKNAKVVTIDSYEDVPVNDESSLKKAVSNQPVAVAIEAGGRDFQFYTSGIF 290
Query: 281 NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGI 336
CG DHGV G+GT E+G YWL+KNSWG WGE GY+++ R+ +G CGI
Sbjct: 291 TGSCGTALDHGVLAAGYGT---EDGKDYWLVKNSWGAEWGEGGYLKMERNIADKSGKCGI 347
Query: 337 ATAASYPV 344
A ASYP+
Sbjct: 348 AMEASYPI 355
>gi|449524070|ref|XP_004169046.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like, partial [Cucumis sativus]
Length = 314
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 163/313 (52%), Positives = 215/313 (68%), Gaps = 20/313 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I +++++WM ++GR YK E R I++ N++YI+ N N ++ L N F+DLTNE
Sbjct: 15 IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNFADLTNE 73
Query: 98 EFRALYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
EF+A Y GY + S P T F+Y N+ ++PT++DWR++GAVT IK+QGQCGSCW
Sbjct: 74 EFKATYLGY--------KTVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATE 214
AFSAVAAVEGI +I GKLI LSEQ+LVDC ++ N GC+GG M KAFE+I + GL TE
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI-KRTGLTTE 184
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
+YPY+ E C+ QKEK +IS YE +P DE++L AV+NQPVSV +DA G F F
Sbjct: 185 IEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQF 244
Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA--- 331
Y G+ + +CGN +HGVA+VG+G E + YWL+KNSWG WGESGYIR+ RD+
Sbjct: 245 YSGGIFSGNCGNQLNHGVAIVGYG---ETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDK 301
Query: 332 -GLCGIATAASYP 343
G CGIA ASYP
Sbjct: 302 QGTCGIAMMASYP 314
>gi|194352754|emb|CAQ00105.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326513690|dbj|BAJ87864.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326514532|dbj|BAJ96253.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 463
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 163/336 (48%), Positives = 214/336 (63%), Gaps = 14/336 (4%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
+ L +VS E + + +WMA+HG TY E+ R F+ NL YI++
Sbjct: 18 VSLAAAADMSIVSYGERSEEEVRRMYAEWMAEHGSTYNAIGEEERRFEAFRDNLRYIDQH 77
Query: 77 N---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N G +++LG N F+DLTNEE+R+ Y G R P R+ S + ++ + ++P
Sbjct: 78 NAAADAGVHSFRLGLNRFADLTNEEYRSTYLG-ARTKPDRERKLS--ARYQAADNDELPE 134
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHG 192
S+DWR+KGAV +KDQG CGSCWAFSA+AAVEGI QI G +I LSEQ+LVDC T N G
Sbjct: 135 SVDWRKKGAVGAVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQG 194
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C+GGLMD AFE+II N G+ +E DYPY+ + CD K+ A TI YED+P E++L
Sbjct: 195 CNGGLMDYAFEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSL 254
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV+NQP+SV ++A GRAF YKSG+ CG DHGVA VG+GT ENG YWL++
Sbjct: 255 QKAVANQPISVAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVR 311
Query: 313 NSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
NSWG WGE GYIR+ R+ +G CGIA SYP
Sbjct: 312 NSWGSVWGEDGYIRMERNIKASSGKCGIAVEPSYPT 347
>gi|449460678|ref|XP_004148072.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP1-like [Cucumis sativus]
Length = 317
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 163/314 (51%), Positives = 215/314 (68%), Gaps = 20/314 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I +++++WM ++GR YK E R I++ N++YI+ N N ++ L N F+DLTNE
Sbjct: 15 IQDRYQKWMDKYGRQYKSREEWERRFTIYQANVQYIDNFNSM-NHSHTLAENNFADLTNE 73
Query: 98 EFRALYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
EF+A Y GY + S P T F+Y N+ ++PT++DWR++GAVT IK+QGQCGSCW
Sbjct: 74 EFKATYLGY--------KTVSIPDTCFRYGNMVNLPTNVDWRQEGAVTPIKNQGQCGSCW 125
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATE 214
AFSAVAAVEGI +I GKLI LSEQ+LVDC ++ N GC+GG M KAFE+I + GL TE
Sbjct: 126 AFSAVAAVEGINKIKAGKLISLSEQELVDCDVTSGNQGCNGGYMYKAFEFI-KRTGLTTE 184
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
+YPY+ E C+ QKEK +IS YE +P DE++L AV+NQPVSV +DA G F F
Sbjct: 185 IEYPYQGAESACNEQKEKYQFVSISGYEKVPVNDEKSLKAAVANQPVSVAIDAEGNNFQF 244
Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA--- 331
Y G+ + +CGN +HGVA+VG+G E + YWL+KNSWG WGESGYIR+ RD+
Sbjct: 245 YSGGIFSGNCGNQLNHGVAIVGYG---ETSNQAYWLVKNSWGTDWGESGYIRMKRDSTDR 301
Query: 332 -GLCGIATAASYPV 344
G CGIA ASYP
Sbjct: 302 QGTCGIAMMASYPT 315
>gi|358343350|ref|XP_003635767.1| Cysteine proteinase [Medicago truncatula]
gi|355501702|gb|AES82905.1| Cysteine proteinase [Medicago truncatula]
Length = 338
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 159/344 (46%), Positives = 226/344 (65%), Gaps = 18/344 (5%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK-HEQWMAQHGRTYKDELEKAMRLNIF 66
+ + + ++ + +I A + ++ P++++K +E W+ ++GR Y+D E +R +I+
Sbjct: 4 TITLSIVILNLWIIASACPEIHTKNSTNPAVMKKRYETWLKRYGRHYRDREEWEVRFDIY 63
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
+ N++YIE N + N +YKL N F+D+TNEEF++ Y GY +P Q+ F+Y
Sbjct: 64 QSNVQYIEFYNSQ-NYSYKLIDNRFADITNEEFKSTYLGY---LPRFRVQTE----FRYH 115
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
++P SIDWR+KGAVTH+KDQG+CGSCWAFSAVAAVEGI +I L+ LSEQQL+DC
Sbjct: 116 KHGELPKSIDWRKKGAVTHVKDQGRCGSCWAFSAVAAVEGINKIKTENLVSLSEQQLIDC 175
Query: 187 S--TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
+ N GC GG M AF YI ++ G+AT +YPY+ +G C+ K K A TIS YE +
Sbjct: 176 DIKSGNEGCEGGDMYIAFNYIKKHGGIATAKEYPYKGRDGNCNKSKAKNNAVTISGYESV 235
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
P +E+ L AV++QPVS+ DA G AF FY G+ + CG N +HG+ +VG+G EEN
Sbjct: 236 PARNEKMLKAAVAHQPVSIATDAGGYAFQFYSKGIFSGSCGKNLNHGMTIVGYG---EEN 292
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G KYW++KNSW WGESGY+R+ RD G CGIA A+YPV
Sbjct: 293 GDKYWIVKNSWANDWGESGYVRMKRDTKDKDGTCGIAMDATYPV 336
>gi|302764466|ref|XP_002965654.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
gi|300166468|gb|EFJ33074.1| hypothetical protein SELMODRAFT_230713 [Selaginella moellendorffii]
Length = 345
Score = 325 bits (833), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 158/313 (50%), Positives = 207/313 (66%), Gaps = 20/313 (6%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+++W+ +HG+ Y E R IFK+N+ YI N N ++ LG N+F+DLTN EFR
Sbjct: 38 YQKWIQEHGKAYNSAHEYKKRFQIFKENVNYINSHNARRNNSHSLGLNKFADLTNSEFRG 97
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQN----VTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
LY G + RP+ F V D TS+DWR+KG VT IKDQG CGSCWA
Sbjct: 98 LYVG----------RLQRPAPFHEVGDIALVADTATSVDWRKKGGVTEIKDQGDCGSCWA 147
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FSAVAAVEG+T ++ G L+ LSEQ+LVDC T N GC GG+MD AF+Y+I N G+ ++++
Sbjct: 148 FSAVAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQGCDGGIMDYAFQYMIRNGGITSQSN 207
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPYR G CD K K AATI+ ++ +P E+ LL+AV+NQPVSV ++A G+ F Y
Sbjct: 208 YPYRALRGACDKDKVKYHAATINGFQAIPPQSEELLLRAVANQPVSVAIEAGGQDFQLYS 267
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGL 333
SGV +CG+N DHGVA+VG+GT + G +YWL+KNSWG WGESGY+R+ R AG+
Sbjct: 268 SGVFTGECGSNLDHGVAIVGYGT--DAGGRQYWLVKNSWGSGWGESGYVRMERQGPGAGV 325
Query: 334 CGIATAASYPVAI 346
CGI ASYP I
Sbjct: 326 CGINLDASYPTKI 338
>gi|115484973|ref|NP_001067630.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|530335|emb|CAA56844.1| cysteine protease [Oryza sativa Japonica Group]
gi|5761322|dbj|BAA83472.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|62732672|gb|AAX94791.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica Group]
gi|62732673|gb|AAX94792.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica Group]
gi|62732674|gb|AAX94793.1| cysteine proteinase (EC 3.4.22.-) - rice [Oryza sativa Japonica Group]
gi|77549615|gb|ABA92412.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa Japonica Group]
gi|77549616|gb|ABA92413.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa Japonica Group]
gi|77549617|gb|ABA92414.1| Thiol protease SEN102 precursor, putative, expressed [Oryza sativa Japonica Group]
gi|113644852|dbj|BAF27993.1| Os11g0255300 [Oryza sativa Japonica Group]
gi|125576789|gb|EAZ18011.1| hypothetical protein OsJ_33558 [Oryza sativa Japonica Group]
gi|215701098|dbj|BAG92522.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 378
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 158/328 (48%), Positives = 211/328 (64%), Gaps = 20/328 (6%)
Query: 35 EPSIVEKHEQWMAQH--------GRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
E S+ +E+W +++ G D+ E R N+F +N YI +AN+ G R ++L
Sbjct: 35 EESLRALYERWRSRYTVSRPAASGGVGNDDGEARRRFNVFVENARYIHEANRRGGRPFRL 94
Query: 87 GTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
N+F+D+T +EFR Y G ++R + + + ++P ++DWRE+GA
Sbjct: 95 ALNKFADMTTDEFRRTYAGSRARHHRSLSGGRGGEGGSFRYGGDDEDNLPPAVDWRERGA 154
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKA 201
VT IKDQGQCGSCWAFS VAAVEG+ +I G+L+ LSEQ+LVDC T DN GC GGLMD A
Sbjct: 155 VTGIKDQGQCGSCWAFSTVAAVEGVNKIKTGRLVTLSEQELVDCDTGDNQGCDGGLMDYA 214
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F++I N G+ TE++YPYR E+G C+ K + TI YED+P DE AL +AV+NQPV
Sbjct: 215 FQFIKRNGGITTESNYPYRAEQGRCNKAKASSHDVTIDGYEDVPANDESALQKAVANQPV 274
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
+V V+ASG+ F FY GV +CG + DHGVA VG+G + G KYW++KNSWGE WGE
Sbjct: 275 AVAVEASGQDFQFYSEGVFTGECGTDLDHGVAAVGYGITRD--GTKYWIVKNSWGEDWGE 332
Query: 322 SGYIRILRDA-----GLCGIATAASYPV 344
GYIR+ R GLCGIA ASYPV
Sbjct: 333 RGYIRMQRGVSSDSNGLCGIAMEASYPV 360
>gi|50355611|dbj|BAD29954.1| cysteine protease [Daucus carota]
Length = 474
Score = 325 bits (832), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 154/314 (49%), Positives = 211/314 (67%), Gaps = 10/314 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ +E W+ +H + Y EK R IFK N+ ++++ N N++YKLG N+F+DLTN+
Sbjct: 56 LLSLYESWLVKHHKNYNALGEKETRFGIFKDNVGFVDRHNSMRNQSYKLGLNKFADLTND 115
Query: 98 EFRALY-TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
E+R+LY +G + R F +++ +P S+DWR++GAV +KDQGQCGSCW
Sbjct: 116 EYRSLYLSGKMMKRERKNEDGFRSDRFVFEDGDHLPESVDWRDRGAVAPVKDQGQCGSCW 175
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
AFS V AVEGI +I G+LI LSEQ+LVDC N GC+GGLMD AFE+I++N G+ TE
Sbjct: 176 AFSTVGAVEGINKIVTGELISLSEQELVDCDNGYNQGCNGGLMDYAFEFIVKNGGIDTED 235
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
DYPY+ +G CD ++ A TI+ YED+P DE++L +AV++QPVSV ++A GRAF Y
Sbjct: 236 DYPYKGVDGLCDQNRKNAKVVTINGYEDVPHNDEKSLKKAVAHQPVSVAIEAGGRAFQLY 295
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----- 330
+SGV CG DHGV VG+G+ ENG YW+++NSWG WGESGYIR+ R+
Sbjct: 296 ESGVFTGQCGTELDHGVVAVGYGS---ENGKDYWIVRNSWGPDWGESGYIRLERNVASTS 352
Query: 331 AGLCGIATAASYPV 344
G CGIA ASYP
Sbjct: 353 TGKCGIAMQASYPT 366
>gi|226496089|ref|NP_001149658.1| cysteine protease 1 precursor [Zea mays]
gi|195629242|gb|ACG36262.1| cysteine protease 1 precursor [Zea mays]
Length = 469
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 161/327 (49%), Positives = 209/327 (63%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + +WMA HGRTY E+ R +F+ NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 84 YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTN+E+RA Y G +RP R+ + + D+P S+DWR KGA
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRP----QRERRLGDRYLAGDNEDLPESVDWRAKGA 146
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V IKDQG CGSCWAFS +AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD A
Sbjct: 147 VAEIKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYA 206
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
FE+II N G+ TE DYPY+ +G CD ++ A TI YED+P E++L +AV+NQP+
Sbjct: 207 FEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPI 266
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV ++A GRAF Y SG+ CG DHGV VG+GT ENG YW++KNSWG +WGE
Sbjct: 267 SVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGE 323
Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
SGY+R+ R+ +G CGIA SYP+
Sbjct: 324 SGYVRMERNIKASSGKCGIAVEPSYPL 350
>gi|535473|emb|CAA53377.1| cysteine protease [Vicia sativa]
Length = 368
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 159/345 (46%), Positives = 219/345 (63%), Gaps = 16/345 (4%)
Query: 10 IIPMFVIIILV---ITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
I+P F+ L+ + Q+ +GRS E ++ +E+W+ +H + Y EK R IF
Sbjct: 6 ILPFFLFFSLITFSLALDIQLPTGRSNDE--VMTMYEEWLVKHQKVYNGLREKDQRFQIF 63
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKY 125
K NL +I++ N + N TY +G N+F+D+TNEE+R +Y G + + + + Y
Sbjct: 64 KDNLNFIDEHNAQ-NYTYIVGLNKFADMTNEEYRDMYLGTRSDIKRRIMKNKITGHRYAY 122
Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
+ +P +DWR KGA+THIKDQG CGSCWAFS +A VE I +I GKL+ LSEQ+LVD
Sbjct: 123 NSGDRLPVHVDWRLKGAITHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVD 182
Query: 186 CSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
C N GC+GGLMD AFE+II N G+ T+ YPY+ EG CD ++KA +I YED+
Sbjct: 183 CDRAFNEGCNGGLMDYAFEFIIGNGGIDTDQHYPYKGFEGRCDPTRKKAKIVSIDGYEDV 242
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
P +E AL +AV++QPVSV ++ASGRA Y+SGV CG + DH V +VG+G+ EN
Sbjct: 243 PSNNENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHAVVIVGYGS---EN 299
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
G YWL++NSWG WGE GY ++ R+ G CGIA ASYPV
Sbjct: 300 GLDYWLVRNSWGTNWGEDGYFKMERNVKGTHTGKCGIAVEASYPV 344
>gi|357160591|ref|XP_003578813.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 339
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 159/341 (46%), Positives = 221/341 (64%), Gaps = 21/341 (6%)
Query: 17 IILVITCASQVVSGRSMHEP----SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
++ ++ C SG + E S+V +HE WM+Q+GR+YKD EK + +FK N +
Sbjct: 8 LLAILGCLCFFASGLAARELNDDLSMVARHESWMSQYGRSYKDAAEKDRKFEVFKANAAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
I+ N + N + LG N+F+D+TNEEF+ T N+ +S + + F Y+NV+
Sbjct: 68 IDSFNAK-NHKFWLGINQFADITNEEFKVTKT--NKGF--ISNKVRASTGFSYENVSIDA 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+P +IDWR KGAVT +KDQGQCG CWAFSAVAA EGI +++ GKL+ LSEQ+LVDC
Sbjct: 123 LPATIDWRTKGAVTPVKDQGQCGCCWAFSAVAATEGIVKLSTGKLVSLSEQELVDCDVHG 182
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
++ GC GGLMD AF++II N GL E+ YPY E+G C + + A TI YED+P +
Sbjct: 183 EDQGCEGGLMDDAFKFIITNGGLTQESSYPYDAEDGKCKSGSKS--AGTIKSYEDVPANN 240
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++AV+NQPVSV VD F FY GV+ CG + DHG+A +G+G + G KY
Sbjct: 241 EGALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYGVTSD--GTKY 298
Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
WL+KNSWG +WGE+G++R+ +D G+CG+A SYP A
Sbjct: 299 WLMKNSWGTSWGENGFLRMEKDIADKKGMCGLAMEPSYPTA 339
>gi|356521444|ref|XP_003529366.1| PREDICTED: thiol protease SEN102-like [Glycine max]
Length = 340
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 168/349 (48%), Positives = 222/349 (63%), Gaps = 17/349 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M L +K + F +L +TC + S R++ E SI +HE+WMA H R Y D EK
Sbjct: 1 MALTLDKKSVGTFF---MLFLTCICRA-SSRTLSESSIATQHEEWMAMHDRVYADSAEKD 56
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG--YNRPVPSVSRQSS 118
R IFK+NLE+IEK N EG + Y L N F+DLTNEEF A +TG Y P S + +
Sbjct: 57 RRQQIFKENLEFIEKHNNEGKKRYNLSLNSFADLTNEEFVASHTGALYKPPTQLGSFKIN 116
Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
F +V D+ S+DWR++GAV IK+QG+CGSCWAFSAVAAVEGI QI G+L+ L
Sbjct: 117 HSLGFHKMSVGDIEASLDWRKRGAVNDIKNQGRCGSCWAFSAVAAVEGINQIKNGQLVSL 176
Query: 179 SEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
SEQ LVDC++ N GC G ++KAF+Y I + GLA E +YPY GTC A+ I
Sbjct: 177 SEQNLVDCAS-NDGCHGQYVEKAFDY-IRDYGLANEEEYPYVETVGTCSGNSNPAI--QI 232
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
Y+ + +E+ LL AV++QPVSV ++A G+ F FY GV + +CG +H V +VG+G
Sbjct: 233 RGYQSVTPQNEEQLLTAVASQPVSVLLEAKGQGFQFYSGGVFSGECGTELNHAVTIVGYG 292
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
EE KYWLI+NSWG++WGE GY++++RD GLCGI ASYP
Sbjct: 293 ---EEAEGKYWLIRNSWGKSWGEGGYMKLMRDTGNPQGLCGINMQASYP 338
>gi|242038089|ref|XP_002466439.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
gi|241920293|gb|EER93437.1| hypothetical protein SORBIDRAFT_01g007820 [Sorghum bicolor]
Length = 353
Score = 325 bits (832), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 165/354 (46%), Positives = 230/354 (64%), Gaps = 23/354 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSM-------------HEPSIVEKHEQWMAQHGRTYKDEL 57
+ FV+ +LV+ + R++ ++V +HE+WMA+HGRTY DE
Sbjct: 3 VSRFVLTVLVVASVCTAAAPRALAVRELAGEEESAAVAAAMVSRHEKWMAEHGRTYTDEA 62
Query: 58 EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS 117
EKA RL IF+ N E+I+ N G +++L TN F+DLT+EEFRA TG+ +
Sbjct: 63 EKARRLEIFRANAEFIDSFNDAGKHSHRLATNRFADLTDEEFRAARTGFRPRPAPAAAAG 122
Query: 118 SRPSTFKYQN--VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
S F+Y+N + D S+DWR GAVT +KDQG+CG CWAFSAVAAVEG+ +I G+L
Sbjct: 123 S-GGRFRYENFSLADAAQSVDWRAMGAVTGVKDQGECGCCWAFSAVAAVEGLNKIRTGRL 181
Query: 176 IELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA 233
+ LSEQ+LVDC + + GC GGLMD AF++I GLA+E+ YPY+ ++G+C + A
Sbjct: 182 VSLSEQELVDCDVNGEDQGCEGGLMDDAFQFIERRGGLASESGYPYQGDDGSCRSSAAAA 241
Query: 234 VAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVA 293
AA+I +ED+P+ +E AL AV+NQPVSV ++ AF FY SGVL +CG + +H +
Sbjct: 242 RAASIRGHEDVPRNNEAALAAAVANQPVSVAINGEDYAFRFYDSGVLGGECGTDLNHAIT 301
Query: 294 VVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI---LRDAGLCGIATAASYPV 344
VG+GTA + G+KYWL+KNSWG +WGE GY+RI +R G+CG+A SYPV
Sbjct: 302 AVGYGTAAD--GSKYWLMKNSWGTSWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 353
>gi|147772785|emb|CAN62838.1| hypothetical protein VITISV_003391 [Vitis vinifera]
Length = 298
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 173/341 (50%), Positives = 212/341 (62%), Gaps = 58/341 (17%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ M ++ IL ASQ S RS+HE S+ E+HE WMA++GR YKD EK R IFK N+
Sbjct: 10 VSMALLFILA-AWASQATS-RSLHEASMYERHEDWMARYGRMYKDANEKEKRFKIFKDNV 67
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
++ +TFKY+NVT
Sbjct: 68 -----------------------------------------------AQATTFKYENVTA 80
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
VP++IDWR+KGAVT IKDQ QCGSCWAFSAVAA EGITQIT GKLI LSEQ+LVDC T
Sbjct: 81 VPSTIDWRKKGAVTPIKDQQQCGSCWAFSAVAATEGITQITTGKLISLSEQELVDCDTGG 140
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
+N GCSGGL D AF +I + GLA+EA YPY ++GTC+++KE AA I YED+P +
Sbjct: 141 ENQGCSGGLXDDAFRFIXIH-GLASEATYPYEGDDGTCNSKKEAHPAAKIKGYEDVPANN 199
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E+AL +AV++QPV+V +DA G F FY SGV CG DHGVA VG+G ++G Y
Sbjct: 200 EKALQKAVAHQPVAVAIDAGGFEFQFYTSGVFTGQCGTELDHGVAAVGYGIG--DDGMXY 257
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
WL+KNSWG WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 258 WLVKNSWGTGWGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 298
>gi|111073715|dbj|BAF02546.1| triticain alpha [Triticum aestivum]
gi|388890585|gb|AFK80346.1| cysteine endopeptidase EP alpha [Secale cereale x Triticum durum]
Length = 461
Score = 324 bits (831), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 161/326 (49%), Positives = 213/326 (65%), Gaps = 14/326 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + + +WM++H RTY E+ R +F+ NL YI++ N G +
Sbjct: 26 IVSYGERSEEEVRRMYAEWMSEHRRTYNAIGEEERRFEVFRDNLRYIDQHNAAADAGLHS 85
Query: 84 YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV 143
++LG N F+DLTNEE+R+ Y G R P R+ S + ++ + ++P ++DWR+KGAV
Sbjct: 86 FRLGLNRFADLTNEEYRSTYLG-ARTKPDRERKLS--ARYQADDNEELPETVDWRKKGAV 142
Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAF 202
IKDQG CGSCWAFSA+AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD AF
Sbjct: 143 AAIKDQGGCGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNEGCNGGLMDYAF 202
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
E+II N G+ +E DYPY+ + CD K+ A TI YED+P E++L +AV+NQP+S
Sbjct: 203 EFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPIS 262
Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
V ++A GRAF YKSG+ CG DHGVA VG+GT ENG YWL++NSWG WGE
Sbjct: 263 VAIEAGGRAFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVRNSWGTVWGED 319
Query: 323 GYIRILRD----AGLCGIATAASYPV 344
GYIR+ R+ +G CGIA SYP
Sbjct: 320 GYIRMERNIKASSGKCGIAVEPSYPT 345
>gi|37780043|gb|AAP32194.1| cysteine protease 1 [Trifolium repens]
Length = 292
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 159/295 (53%), Positives = 201/295 (68%), Gaps = 12/295 (4%)
Query: 58 EKAMRLNIFKQNLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
E+ RL IF +N+ YIE +N N+ YKL N+F+DLTNEEF A N+ +
Sbjct: 3 EREKRLRIFNKNVNYIEASNSAVNNKLYKLSINKFADLTNEEFIA---SRNKFKGHMCSS 59
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
R +TFKY+N + +P+++DWR+KGAVT +K+QGQCGSCWAFSAVAA EGI Q++ GKL+
Sbjct: 60 IIRTTTFKYENASAIPSTVDWRKKGAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLV 119
Query: 177 ELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
LSEQ+L+DC T + GC GGLMD AF++II+N GL+TE YPY +GTC+ K
Sbjct: 120 SLSEQELIDCDTKGVDQGCEGGLMDDAFKFIIQNHGLSTEVQYPYEGVDGTCNANKASIH 179
Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
A TI+ YED+P +E AL +AV+NQP+SV +DASG F FY SGV CG DHGV
Sbjct: 180 AVTITGYEDVPANNELALQKAVANQPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTA 239
Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
VG+G + G KYWL+KNSWG WGE GYIR+ R GLCGIA ASYP A
Sbjct: 240 VGYGVGND--GTKYWLVKNSWGADWGEEGYIRMQRGIAAAEGLCGIAMQASYPTA 292
>gi|445927|prf||1910332A Cys endopeptidase
Length = 362
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 160/317 (50%), Positives = 212/317 (66%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ + +E+W + H T L EK R N+FK N+ ++ NK ++ YKL N+F+D
Sbjct: 33 EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFAD 89
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
+TN EFR+ Y G + R S S TF Y+ V VP S+DWR+KGAVT +KDQGQC
Sbjct: 90 MTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQC 149
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS + AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE++YPY+ +EGTCD K +A +I +E++P DE ALL+AV+NQPVSV +DA G
Sbjct: 210 TTESNYPYKAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
F FY GV DC + +HGVA+VG+GT + G YW+++NSWG WGE GYIR+ R+
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEQGYIRMQRNI 327
Query: 331 ---AGLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 328 SKKEGLCGIAMMASYPI 344
>gi|148927394|gb|ABR19828.1| cysteine proteinase [Elaeis guineensis]
Length = 469
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 158/312 (50%), Positives = 207/312 (66%), Gaps = 12/312 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
++ W AQH R+Y E RL IF+ NL +I++ N G +++LG F+DLTNEE
Sbjct: 47 YQAWKAQHARSYNALDEDEQRLEIFRDNLRFIDQHNAAANAGKYSFRLGLTRFADLTNEE 106
Query: 99 FRALYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
+R+ Y G R S+ S +++++ D+P SIDWR+KGAV +KDQG CGSCWA
Sbjct: 107 YRSTYLGVRTAGSRRRRNSTVGSNRYRFRSSDDLPDSIDWRDKGAVVDVKDQGSCGSCWA 166
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS +AAVEGI I G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ T+ D
Sbjct: 167 FSTIAAVEGINHIVTGDLISLSEQELVDCDTYYNQGCNGGLMDYAFEFIISNGGIDTDED 226
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY +G+CD ++ A TI YED+P DE++L +AV+NQPVSV ++A GRAF Y+
Sbjct: 227 YPYTGRDGSCDQYRKNAHVVTIDSYEDVPINDEKSLQKAVANQPVSVAIEAGGRAFQLYE 286
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
SG+ CG DHGV +G+G+ ENG YW++KNSWG WGESGYIR+ R+ G
Sbjct: 287 SGIFTGYCGTELDHGVTAIGYGS---ENGKYYWIVKNSWGSDWGESGYIRMERNINSATG 343
Query: 333 LCGIATAASYPV 344
CGIA ASYP+
Sbjct: 344 KCGIAMEASYPI 355
>gi|168063167|ref|XP_001783545.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162664932|gb|EDQ51634.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 461
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 161/319 (50%), Positives = 213/319 (66%), Gaps = 17/319 (5%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
HE ++E+ W +HG+ Y D + R ++K NL YI + E NRTY LG +F+D
Sbjct: 46 HENLLLEQFAAWAHKHGKAYHDAEQCLHRFAVWKDNLAYIRHS--ETNRTYSLGLTKFAD 103
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
LTNEEFR +YTG SR++ R + F+Y + ++ P S+DWR+ GAVT +KDQG CG
Sbjct: 104 LTNEEFRRMYTGTR---IDRSRRAKRRTGFRYAD-SEAPESVDWRKNGAVTSVKDQGSCG 159
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
SCWAFSAV +VEGI I G+ + LSEQ+LVDC + N GC+GGLMD AF++II+N G+
Sbjct: 160 SCWAFSAVGSVEGINAIRNGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFIIQNGGID 219
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYPY+ +G CDN K+ A TI YED+P+ DE+AL +AV+ QPVSV ++A GR F
Sbjct: 220 TEKDYPYKGFDGRCDNSKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 279
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
Y GV + +CG + DHGV VG+GT E+G YW++KNSWGE WGESGY+R+ R+
Sbjct: 280 QLYAQGVFSGECGTDLDHGVLAVGYGT---EDGVDYWIVKNSWGEYWGESGYLRMKRNMK 336
Query: 331 -----AGLCGIATAASYPV 344
GLCGI SY V
Sbjct: 337 DSNDGPGLCGINIEPSYAV 355
>gi|1223922|gb|AAA92063.1| cysteinyl endopeptidase [Vigna radiata]
Length = 362
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 158/316 (50%), Positives = 212/316 (67%), Gaps = 10/316 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ + +E+W + H + EK R N+FK+N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EESLWDLYERWRSHH-TVSRSLTEKHKRFNVFKENVMHVHNTNKM-DKPYKLKLNKFADM 90
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
TN EFR+ Y G + R + + TF Y+ V VP S+DWR+KGAVT +KDQGQCG
Sbjct: 91 TNHEFRSTYAGSKVNHHKMFRGTQHGNGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQCG 150
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
SCWAFS V AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 151 SCWAFSTVVAVEGINQIKTDKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGIT 210
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE++YPY +EGTCD K +A +I +E++P DE ALL+AV+NQPVSV +DA G F
Sbjct: 211 TESNYPYTAQEGTCDASKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSDF 270
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
FY GVL DC + +HGVA+VG+GT + G YW+++NSWG WGE GYIR+ R+
Sbjct: 271 QFYSEGVLTGDCNTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEQGYIRMQRNIS 328
Query: 331 --AGLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 329 KKEGLCGIAMMASYPI 344
>gi|297745594|emb|CBI40759.3| unnamed protein product [Vitis vinifera]
Length = 300
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 155/304 (50%), Positives = 208/304 (68%), Gaps = 12/304 (3%)
Query: 46 MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG 105
M++HG++Y+ EK R +F+ NL++I++ NK+ + +Y LG NEF+DL++EEF+ Y G
Sbjct: 1 MSKHGKSYRSFEEKLHRFEVFQDNLKHIDETNKKVS-SYWLGLNEFADLSHEEFKRKYLG 59
Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVE 165
+P ++ P F Y++V D+P S+DWR+KGAV H+K+QG CGSCWAFS VAAVE
Sbjct: 60 LKIELP---KRRDSPEEFSYKDVADLPKSVDWRKKGAVAHVKNQGACGSCWAFSTVAAVE 116
Query: 166 GITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEG 224
GI QI G L LSEQ+L+DC N+GC+GGLMD AF +II N GL E DYPY EEG
Sbjct: 117 GINQIVTGNLTALSEQELIDCDKPFNNGCNGGLMDYAFAFIISNGGLRKEEDYPYVMEEG 176
Query: 225 TCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADC 284
TC +KE+ TIS Y D+P+ +EQ+ L+A++NQP+SV ++AS R F FY G+ N C
Sbjct: 177 TCGEKKEELEVVTISGYHDVPEDNEQSFLKALANQPLSVAIEASSRGFQFYSGGIFNGHC 236
Query: 285 GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAA 340
G DHGVA VG+GT++ G Y +KNSWG WGE GYIR+ R+ G+CGI A
Sbjct: 237 GTELDHGVAAVGYGTSK---GVDYITVKNSWGSKWGEKGYIRMKRNVGKPEGICGIYKMA 293
Query: 341 SYPV 344
SYP
Sbjct: 294 SYPT 297
>gi|255567869|ref|XP_002524912.1| cysteine protease, putative [Ricinus communis]
gi|223535747|gb|EEF37409.1| cysteine protease, putative [Ricinus communis]
Length = 366
Score = 324 bits (830), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 162/325 (49%), Positives = 215/325 (66%), Gaps = 16/325 (4%)
Query: 29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGT 88
S RS +E ++ + W+A+H +TY E+ R IFK NL +I++ N NRTYK+G
Sbjct: 37 SWRSDNE--VISMYNWWLAKHSKTYNKLGEREKRFEIFKNNLRFIDEHNNSKNRTYKVGL 94
Query: 89 NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS---TFKYQNVTDVPTSIDWREKGAVTH 145
F+DLTNEE+RA + G +S PS FK +V +P SIDWR+ GAV+
Sbjct: 95 TRFADLTNEEYRAKFLGTKSDPKRRLMKSKNPSQRYAFKAGDV--LPESIDWRQSGAVSA 152
Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEY 204
IKDQG CGSCWAFS +AAVEG+ +I G+LI LSEQ+LVDC N GC+GGLMD AF++
Sbjct: 153 IKDQGSCGSCWAFSTIAAVEGVNKIVTGELISLSEQELVDCDRSYNAGCNGGLMDNAFQF 212
Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
II N G+ T+ DYPY+ +G CD K K A TI +ED+ DE AL +AV++QPVSV
Sbjct: 213 IINNGGIDTDKDYPYQAVDGKCDTTKVKNKAVTIDGFEDVMAFDEMALQKAVAHQPVSVA 272
Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
++ASG A FY+SGV +CG+ DHGV +VG+GT E+G YWL++NSWG WGE+GY
Sbjct: 273 IEASGMALQFYQSGVFTGECGSALDHGVVIVGYGT---EDGIDYWLVRNSWGRDWGENGY 329
Query: 325 IRILRDA-----GLCGIATAASYPV 344
I++ R+ G CGIA +SYP+
Sbjct: 330 IKMQRNVVDTFTGKCGIAMESSYPI 354
>gi|413919736|gb|AFW59668.1| cysteine protease 1 [Zea mays]
Length = 469
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 160/327 (48%), Positives = 209/327 (63%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + +WMA HGRTY E+ R +F+ NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 84 YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTN+E+RA Y G +RP R+ + + D+P S+DWR KGA
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRP----QRERRLGDRYLAGDNEDLPESVDWRAKGA 146
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V +KDQG CGSCWAFS +AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD A
Sbjct: 147 VAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYA 206
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
FE+II N G+ TE DYPY+ +G CD ++ A TI YED+P E++L +AV+NQP+
Sbjct: 207 FEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPI 266
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV ++A GRAF Y SG+ CG DHGV VG+GT ENG YW++KNSWG +WGE
Sbjct: 267 SVAIEAGGRAFQLYNSGIFTGTCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGE 323
Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
SGY+R+ R+ +G CGIA SYP+
Sbjct: 324 SGYVRMERNIKASSGKCGIAVEPSYPL 350
>gi|225438807|ref|XP_002283263.1| PREDICTED: germination-specific cysteine protease 1-like isoform 1
[Vitis vinifera]
Length = 374
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 160/318 (50%), Positives = 215/318 (67%), Gaps = 13/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ ++ WMA+HG+ Y EK R IFK NL++I++ N + NRTYK+G N F+DL
Sbjct: 39 EEEVMGMYQWWMAKHGKAYNGLGEKEKRFEIFKDNLKFIDEHNAQ-NRTYKVGLNRFADL 97
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQC 152
TNEE+RA+Y G R P + ++ +Y + +P S+DWRE GAV +KDQ C
Sbjct: 98 TNEEYRAIYLG-TRSDPKRRFAKLKNASPRYAVMPGEVLPESVDWRETGAVNPVKDQRSC 156
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS VAAVEGI QI G+LI LSEQ+LVDC T+ + GC+GGLMD AF++II+N GL
Sbjct: 157 GSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEYDMGCNGGLMDYAFDFIIKNGGL 216
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE DYPY +G C+ + + +I YED+P DE+AL +AV++QPVSV V+A GRA
Sbjct: 217 DTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDEKALQKAVAHQPVSVAVEAGGRA 276
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
Y SG+ +CG DHG+ VG+GT ENG YW+++NSWG +WGE+GYIR+ R+
Sbjct: 277 LQLYVSGIFTGECGTALDHGIVAVGYGT---ENGTDYWIVRNSWGSSWGENGYIRMERNM 333
Query: 331 ----AGLCGIATAASYPV 344
+G CGIA ASYP+
Sbjct: 334 ADAFSGKCGIAMEASYPI 351
>gi|449438381|ref|XP_004136967.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 479
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 160/311 (51%), Positives = 211/311 (67%), Gaps = 15/311 (4%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E W+ HG+ Y EK R IFK NL +I++ N+E +RTYK+G F+DLTNEE+RA
Sbjct: 62 YESWLVHHGKAYNAIGEKERRFEIFKDNLRFIDEHNRE-SRTYKVGLTRFADLTNEEYRA 120
Query: 102 LYTG--YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
+ G ++R P +S +++ + D+P +DWR+KGAV +KDQGQCGSCWAFS
Sbjct: 121 RFLGGRFSRK-PRLS--AAKSGRYAAALGDDLPDDVDWRKKGAVATVKDQGQCGSCWAFS 177
Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYP 218
+VAAVEGI QI G+LI LSEQ+LVDC N GC+GGLMD AF++II N G+ TE DYP
Sbjct: 178 SVAAVEGINQIVTGELIPLSEQELVDCDKSFNMGCNGGLMDYAFQFIIGNGGIDTEEDYP 237
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
Y+ + CD ++ A TI YED+P+ DE +L +AV+NQPVSV ++A GRAF Y+SG
Sbjct: 238 YKGRDAACDPNRKNAKVVTIDGYEDVPENDESSLKKAVANQPVSVAIEAGGRAFQLYQSG 297
Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGL 333
V CG + DHGV VG+GT +NG YW+++NSWG+ WGESGYIR+ R+ G
Sbjct: 298 VFTGRCGTDLDHGVVAVGYGT---DNGTDYWIVRNSWGKDWGESGYIRLERNVANITTGK 354
Query: 334 CGIATAASYPV 344
CGIA SYP
Sbjct: 355 CGIAVQPSYPT 365
>gi|544129|sp|P25803.2|CYSEP_PHAVU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase EP-C1; Flags: Precursor
gi|20994|emb|CAA44816.1| endopeptidase [Phaseolus vulgaris]
Length = 362
Score = 324 bits (830), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 159/317 (50%), Positives = 211/317 (66%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ + +E+W + H T L EK R N+FK NL ++ NK ++ YKL N+F+D
Sbjct: 33 EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANLMHVHNTNKM-DKPYKLKLNKFAD 89
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
+TN EFR+ Y G P + R + + F Y+ V VP S+DWR+KGAVT +KDQGQC
Sbjct: 90 MTNHEFRSTYAGSKVNHPRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQC 149
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS V AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 150 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE++YPY+ +EGTCD K +A +I +E++P DE ALL+AV+NQPVSV +DA G
Sbjct: 210 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 269
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
F FY GV DC + +HGVA+VG+GT + G YW+++NSWG WGE GYIR+ R+
Sbjct: 270 FQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEHGYIRMQRNI 327
Query: 331 ---AGLCGIATAASYPV 344
GLCGIA SYP+
Sbjct: 328 SKKEGLCGIAMLPSYPI 344
>gi|255646767|gb|ACU23856.1| unknown [Glycine max]
Length = 350
Score = 323 bits (829), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 160/312 (51%), Positives = 213/312 (68%), Gaps = 13/312 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y++ EK +R IFK NL++I++ NK + Y LG +EF+DL++
Sbjct: 44 LIELFESWMSRHGKIYENIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLSEFADLSHR 102
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF Y G SR+ P F Y++V ++P S+DWR+KGAV +K+QG CGSCWA
Sbjct: 103 EFNNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 158
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF +I+EN GL E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNNGCNGGLMDYAFSFIVENGGLHKEED 218
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEG C+ KE+ TIS Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY
Sbjct: 219 YPYIMEEGACEMTKEETQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV + CG++ DHGVA VG+GTA+ G Y +KNSWG WGE GYIR+ R+ G
Sbjct: 279 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYITVKNSWGSKWGEKGYIRMRRNIGKPEG 335
Query: 333 LCGIATAASYPV 344
+CGI ASYP
Sbjct: 336 ICGIYKMASYPT 347
>gi|1256830|gb|AAB68374.1| cysteine endopeptidase 1 [Phaseolus vulgaris]
gi|2959418|emb|CAA12118.1| cysteine protease [Phaseolus vulgaris]
Length = 364
Score = 323 bits (829), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 160/344 (46%), Positives = 218/344 (63%), Gaps = 12/344 (3%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
S +IP +++ + A+ +S + E +++ +E+W+ +H + Y EK R +FK
Sbjct: 3 SMLIPTLLLLSFTFSHAT-AMSIINYSENEVMDMYEEWLVKHRKVYNGLDEKEKRFQVFK 61
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQ 126
NL +I+ N + N TY LG N+F+D+TN+E+RA+Y G V + + + Y
Sbjct: 62 DNLGFIQDHNAQ-NNTYTLGLNKFADITNKEYRAMYLGTRTDAKRRVMKTQNTGHRYAYN 120
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+ +P +DWR KGAV IKDQG CGSCWAFS VAAVEGI I G+ + LSEQ+LVDC
Sbjct: 121 SGDQLPVHVDWRLKGAVGPIKDQGNCGSCWAFSTVAAVEGINNIVTGEFVSLSEQELVDC 180
Query: 187 STD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ + GC+GGLMD AF++II+N G+ TE DYPY+ +GTCD K+K I YED+P
Sbjct: 181 DREYDEGCNGGLMDYAFQFIIQNGGIDTEEDYPYQGIDGTCDETKKKTKVVQIDGYEDVP 240
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+E AL +AVS+QPVSV ++ASGRA Y+SGV CG DHGV VVG+GT ENG
Sbjct: 241 SNNENALKKAVSHQPVSVAIEASGRALQLYQSGVFTGKCGTALDHGVVVVGYGT---ENG 297
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
YWL++NSWG WGE GY ++ R+ G CGIA SYPV
Sbjct: 298 VDYWLVRNSWGTGWGEDGYFKMERNVRSTSEGKCGIAMDCSYPV 341
>gi|242092700|ref|XP_002436840.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
gi|241915063|gb|EER88207.1| hypothetical protein SORBIDRAFT_10g009830 [Sorghum bicolor]
Length = 328
Score = 323 bits (828), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 159/341 (46%), Positives = 216/341 (63%), Gaps = 30/341 (8%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ I+ L C + + + + ++V +HEQWM Q+ R YKD EKA R +FK N+++
Sbjct: 8 ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVT-- 129
IE N GNR + LG N+F+DLTN+EFRA T +P P P+ F+Y+NV+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSP-----VKVPTGFRYENVSVD 122
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
+P +IDWR KGAVT IKDQGQC EGI +I+ GKLI LSEQ+LVDC
Sbjct: 123 ALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDVH 170
Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
++ GC GGLMD AF++II+N GL TE+ YPY +G C + AAT+ +ED+P
Sbjct: 171 GEDQGCEGGLMDDAFQFIIKNGGLTTESSYPYTAADGKCKSGSNS--AATVKGFEDVPAN 228
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
DE AL++AV+NQPVSV VD F FY GV+ CG + DHG+A +G+G + +G K
Sbjct: 229 DEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGTK 286
Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
YWL+KNSWG TWGE+GY+R+ +D G+CG+A SYP+
Sbjct: 287 YWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPI 327
>gi|118158|sp|P12412.1|CYSEP_VIGMU RecName: Full=Vignain; AltName: Full=Bean endopeptidase; AltName:
Full=Cysteine proteinase; AltName:
Full=Sulfhydryl-endopeptidase; Short=SH-EP; Contains:
RecName: Full=Vignain-1; Contains: RecName:
Full=Vignain-2; Flags: Precursor
gi|22062|emb|CAA33753.1| sulfhydryl-pre-endopeptidase (AA -20 to 342) [Vigna mungo]
gi|22066|emb|CAA36181.1| sulfhydryl-endopeptidase [Vigna mungo]
Length = 362
Score = 323 bits (827), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 160/317 (50%), Positives = 211/317 (66%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ + +E+W + H T L EK R N+FK N+ ++ NK ++ YKL N+F+D
Sbjct: 33 EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFAD 89
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
+TN EFR+ Y G + R S S TF Y+ V VP S+DWR+KGAVT +KDQGQC
Sbjct: 90 MTNHEFRSTYAGSKVNHHKMFRGSQHGSGTFMYEKVGSVPASVDWRKKGAVTDVKDQGQC 149
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS + AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 150 GSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 209
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE++YPY +EGTCD K +A +I +E++P DE ALL+AV+NQPVSV +DA G
Sbjct: 210 TTESNYPYTAQEGTCDESKVNDLAVSIDGHENVPVNDENALLKAVANQPVSVAIDAGGSD 269
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
F FY GV DC + +HGVA+VG+GT + G YW+++NSWG WGE GYIR+ R+
Sbjct: 270 FQFYSEGVFTGDCNTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEQGYIRMQRNI 327
Query: 331 ---AGLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 328 SKKEGLCGIAMMASYPI 344
>gi|50355617|dbj|BAD29957.1| cysteine protease [Daucus carota]
Length = 437
Score = 323 bits (827), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 159/314 (50%), Positives = 208/314 (66%), Gaps = 12/314 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ + W+ +HG++Y EK R IFK NL YI+ N + +R+Y+LG N F+DLTNE
Sbjct: 45 VMTMYNSWLVKHGKSYNALGEKETRFQIFKDNLRYIDNHNADPDRSYELGLNRFADLTNE 104
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSC 155
E+RA Y G + S + S PS +Y V ++P SIDWREKGAV +KDQG CGSC
Sbjct: 105 EYRAKYLG-TKSRESRPKLSKGPSD-RYAPVEGEELPDSIDWREKGAVAAVKDQGSCGSC 162
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
WAFSA+ AVEGI QIT G+LI LSEQ+LVDC N GC GGLMD AF +II+N G+ ++
Sbjct: 163 WAFSAIGAVEGINQITTGELITLSEQELVDCDRSYNEGCEGGLMDYAFNFIIKNGGIDSD 222
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
DYPY +GTC+ KE A TI YED+P DE+AL +A +NQP+SV ++A G F
Sbjct: 223 LDYPYTGRDGTCNQNKENAKVVTIDSYEDVPVYDEKALQKAAANQPISVAIEAGGMDFQL 282
Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---- 330
Y SG+ CG DHGV VVG+G+ E G YW+++NSWG WGE+GY+++ R+
Sbjct: 283 YVSGIFTGKCGTAVDHGVVVVGYGS---EEGMDYWIVRNSWGAAWGEAGYLKMQRNVGKS 339
Query: 331 AGLCGIATAASYPV 344
+GLCGI SYPV
Sbjct: 340 SGLCGITIEPSYPV 353
>gi|242071345|ref|XP_002450949.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
gi|241936792|gb|EES09937.1| hypothetical protein SORBIDRAFT_05g021550 [Sorghum bicolor]
Length = 371
Score = 323 bits (827), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 161/326 (49%), Positives = 216/326 (66%), Gaps = 18/326 (5%)
Query: 35 EPSIVEKHEQW----MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
E S+ +EQW M +++ +KA N+FK+N+ YI +ANK+G R+++L N+
Sbjct: 35 EESLRALYEQWRSHYMVSRPAGLQEQDDKARWFNVFKENVRYIHEANKKG-RSFRLALNK 93
Query: 91 FSDLTNEEFRALY-----TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
F+D+T +EFR Y T ++R + S R+ S F Y ++P ++DWR++GAVT
Sbjct: 94 FADMTTDEFRRAYAAGSRTRHHRALSSGIRRHGDGS-FMYAQAGNLPLAVDWRQRGAVTG 152
Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEY 204
IKDQGQCGSCWAFS +AAVEGI +I GKL+ LSEQ+LVDC DN GC+GGLMD AF+Y
Sbjct: 153 IKDQGQCGSCWAFSTIAAVEGINKIRTGKLVSLSEQELVDCDDVDNQGCNGGLMDYAFQY 212
Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
I N G+ TE++YPY E+ +C+ KE++ TI YED+P +E AL +AV+NQPVS+
Sbjct: 213 IKRNGGITTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVANQPVSIA 272
Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
++ASG+ F FY GV CG DHGVA VG+G + G KYW++KNSWGE WGE GY
Sbjct: 273 IEASGQDFQFYSEGVFTGSCGTELDHGVAAVGYGITRD--GTKYWIVKNSWGEDWGERGY 330
Query: 325 IRILR----DAGLCGIATAASYPVAI 346
IR+ R GLCGIA SYP I
Sbjct: 331 IRMQRGISDSQGLCGIAMEPSYPTKI 356
>gi|297851332|ref|XP_002893547.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
gi|297339389|gb|EFH69806.1| peptidase C1A papain family protein [Arabidopsis lyrata subsp.
lyrata]
Length = 345
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 165/329 (50%), Positives = 226/329 (68%), Gaps = 12/329 (3%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
S+ S ++HEP+I H++WM R Y DE EK MRL +F +NL++IE N G+++Y
Sbjct: 21 SEATSRVALHEPTIFYYHQKWMINFSRVYDDEFEKQMRLEVFTENLKFIENFNNMGSQSY 80
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTDV-PTSIDWREKGA 142
KLG N+F+D T EEF A +TG + + + +T + V+DV T+ DWR +GA
Sbjct: 81 KLGVNKFTDWTKEEFLATHTGLSGINVTSPFEVVNETTPAWNWTVSDVLGTTKDWRNEGA 140
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
VT +K QG+CG CWAFSA+AAVEG+T+I RG LI LSEQQL+DC+ + N+GC GG M +A
Sbjct: 141 VTPVKYQGECGGCWAFSAIAAVEGLTKIARGNLISLSEQQLLDCAREQNNGCKGGTMIEA 200
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F YI++N G+++E YPY+ +EG C + A+ I +E++P +E+ALL+AVS QPV
Sbjct: 201 FNYIVKNGGVSSENAYPYQVKEGPCRSNDIPAIV--IRGFENVPSNNERALLEAVSRQPV 258
Query: 262 SVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWG 320
+V +DAS F Y GV NA DCG + +H V +VG+GT++E G KYWL KNSWG+TWG
Sbjct: 259 AVDIDASETGFIHYSGGVYNARDCGTSVNHAVTLVGYGTSQE--GIKYWLAKNSWGKTWG 316
Query: 321 ESGYIRILRDA----GLCGIATAASYPVA 345
E+GYIRI RD G+CG+A ASYPVA
Sbjct: 317 ENGYIRIRRDVEWPQGMCGVAQYASYPVA 345
>gi|224083362|ref|XP_002306996.1| predicted protein [Populus trichocarpa]
gi|222856445|gb|EEE93992.1| predicted protein [Populus trichocarpa]
Length = 336
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 161/312 (51%), Positives = 212/312 (67%), Gaps = 12/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I++ E W+++H + Y+ EK R IFK NL +I++ NK+ Y LG NEF+DL++E
Sbjct: 29 IIDLFESWISKHQKIYESIEEKWHRFEIFKDNLFHIDETNKK-VVNYWLGLNEFADLSHE 87
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G N +S + F Y++V+ +P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 88 EFKNKYLGLN---VDLSNRRECSEEFTYKDVSSIPKSVDWRKKGAVTDVKNQGSCGSCWA 144
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+LVDC T N+GC+GGLMD AF YII N GL E D
Sbjct: 145 FSTVAAVEGINQIVTGNLTSLSEQELVDCDTTYNNGCNGGLMDYAFAYIISNGGLHKEED 204
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTC+ +K ++ TIS Y D+P+ E++LL+A++NQP+SV +DASGR F FY
Sbjct: 205 YPYIMEEGTCEMRKAESEVVTISGYHDVPQNSEESLLKALANQPLSVAIDASGRDFQFYS 264
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
GV + CG DHGVA VG+G+A+ G + ++KNSWG WGE G+IR+ R+ AG
Sbjct: 265 GGVFDGHCGTELDHGVAAVGYGSAK---GLDFIVVKNSWGSKWGEKGFIRMKRNTGKPAG 321
Query: 333 LCGIATAASYPV 344
LCGI ASYP
Sbjct: 322 LCGINKMASYPT 333
>gi|226529105|ref|NP_001150196.1| cysteine protease 1 precursor [Zea mays]
gi|194701798|gb|ACF84983.1| unknown [Zea mays]
gi|194704800|gb|ACF86484.1| unknown [Zea mays]
gi|195637480|gb|ACG38208.1| cysteine protease 1 precursor [Zea mays]
gi|413919895|gb|AFW59827.1| cysteine protease 1 [Zea mays]
Length = 470
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 157/327 (48%), Positives = 219/327 (66%), Gaps = 17/327 (5%)
Query: 30 GRSMHEPSIVEKHEQWMAQHGRTY----KDELEKAMRLNIFKQNLEYIEKAN-KEGNRTY 84
G EP + ++ W+A+HGR Y + E E+ R +F NL +++ N + G R +
Sbjct: 45 GLERTEPEVRAMYDLWLAEHGRAYNALGEGEGERDRRFLVFWDNLRFVDAHNERAGARGF 104
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAV 143
+LG N+F+DLTN+EFRA Y G VP+ R + +++ + +P S+DWREKGAV
Sbjct: 105 RLGMNQFADLTNDEFRAAYLGA--MVPAARRGAVVGERYRHDGAAEELPESVDWREKGAV 162
Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKA 201
+K+QGQCGSCWAFSAV++VE + QI G+++ LSEQ+LV+CSTD N GC+GGLMD A
Sbjct: 163 APVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAA 222
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F++II+N G+ TE DYPYR +G CD ++ A +I +ED+P+ DE++L +AV++QPV
Sbjct: 223 FDFIIKNGGIDTEDDYPYRAVDGKCDMNRKNARVVSIDGFEDVPENDEKSLQKAVAHQPV 282
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV ++A GR F YKSGV + C N DHGV VG+G ENG YW+++NSWG WGE
Sbjct: 283 SVAIEAGGREFQLYKSGVFSGSCTTNLDHGVVAVGYGA---ENGKDYWIVRNSWGPKWGE 339
Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
+GYIR+ R+ G CGIA ASYP
Sbjct: 340 AGYIRMERNVNASTGKCGIAMMASYPT 366
>gi|255538210|ref|XP_002510170.1| cysteine protease, putative [Ricinus communis]
gi|223550871|gb|EEF52357.1| cysteine protease, putative [Ricinus communis]
Length = 469
Score = 323 bits (827), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 157/315 (49%), Positives = 214/315 (67%), Gaps = 13/315 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDEL---EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
++ +E+W+ ++G+ + + EK R +FK NL +I++ N E NR+YK+G N F+DL
Sbjct: 47 VMAIYEEWLVKNGKAHSNNNALGEKERRFQVFKDNLRFIDEHNSE-NRSYKVGLNRFADL 105
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
TNEE+R++Y G R +R S + + + +P S+DWR++GAV +KDQG CGS
Sbjct: 106 TNEEYRSMYLG-ARSGAKRNRLSRSSNRYLPRVGDSLPDSVDWRKEGAVAEVKDQGSCGS 164
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
CWAFS +AAVEGI +I G LI LSEQ+LVDC N GC+GGLMD AF++II N G+ +
Sbjct: 165 CWAFSTIAAVEGINKIVTGDLISLSEQELVDCDRSYNEGCNGGLMDYAFQFIINNGGIDS 224
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
E DYPY +GTCD ++ A TI YED+P DE+AL +AV+NQPVSV ++A GR F
Sbjct: 225 EEDYPYLARDGTCDTYRKNAKVVTIDNYEDVPVNDEKALQKAVANQPVSVAIEAGGREFQ 284
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
FY+SG+ CG DHGVA VG+GT ENG YW+++NSWG++WGESGYIR+ R+
Sbjct: 285 FYQSGIFTGRCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGESGYIRMERNIAT 341
Query: 331 -AGLCGIATAASYPV 344
G CGIA SYP+
Sbjct: 342 ATGKCGIAIEPSYPI 356
>gi|297791625|ref|XP_002863697.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
gi|297309532|gb|EFH39956.1| hypothetical protein ARALYDRAFT_917391 [Arabidopsis lyrata subsp.
lyrata]
Length = 463
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 164/328 (50%), Positives = 219/328 (66%), Gaps = 21/328 (6%)
Query: 27 VVSGRSMHEPSIVEK-HEQWMAQHGRTYKDE----LEKAMRLNIFKQNLEYIEKANKEGN 81
VS RS E VE+ +E WM +HG+ ++ EK R IFK NL YI++ N + N
Sbjct: 37 TVSSRSDAE---VERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRYIDEHNTK-N 92
Query: 82 RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
+YKLG F+DLTN+E+R++Y G +PV V + S R ++ + +P S+DWR++G
Sbjct: 93 LSYKLGLTRFADLTNDEYRSMYLG-AKPVKRVLKTSDR---YEARVGDALPDSVDWRKEG 148
Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDK 200
AV +KDQG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLMD
Sbjct: 149 AVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDY 208
Query: 201 AFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
AFE+II+N G+ TEADYPY+ +G CD ++ A TI YED+P+ E +L +A+++QP
Sbjct: 209 AFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAHQP 268
Query: 261 VSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWG 320
+SV ++A GRAF Y SGV + CG DHGV VG+GT ENG YW+++NSWG WG
Sbjct: 269 ISVAIEAGGRAFQLYSSGVFDGICGTELDHGVVAVGYGT---ENGKDYWIVRNSWGNRWG 325
Query: 321 ESGYIRILRD----AGLCGIATAASYPV 344
ESGYI++ R+ G CGIA ASYP+
Sbjct: 326 ESGYIKMARNIAEPTGKCGIAMEASYPI 353
>gi|18401614|ref|NP_564497.1| cysteine proteinase RD21a [Arabidopsis thaliana]
gi|1172873|sp|P43297.1|RD21A_ARATH RecName: Full=Cysteine proteinase RD21a; Short=RD21; Flags:
Precursor
gi|12321010|gb|AAG50628.1|AC083835_13 cysteine protease, putative [Arabidopsis thaliana]
gi|435619|dbj|BAA02374.1| thiol protease [Arabidopsis thaliana]
gi|18175926|gb|AAL59952.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|22136972|gb|AAM91715.1| putative cysteine proteinase RD21A [Arabidopsis thaliana]
gi|332194014|gb|AEE32135.1| cysteine proteinase RD21a [Arabidopsis thaliana]
Length = 462
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 163/361 (45%), Positives = 227/361 (62%), Gaps = 36/361 (9%)
Query: 9 FIIPMFVIIILVITCASQVVS----------------GRSMHEPSIVEKHEQWMAQHGRT 52
F+ P I+ L + S V GRS E ++ +E W+ +HG+
Sbjct: 3 FLKPTMAILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRS--EAEVMSIYEAWLVKHGKA 60
Query: 53 YKDE--LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPV 110
+EK R IFK NL ++++ N E N +Y+LG F+DLTN+E+R+ Y G
Sbjct: 61 QSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLG----- 114
Query: 111 PSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGIT 168
+ ++ R ++ +Y+ V D +P SIDWR+KGAV +KDQG CGSCWAFS + AVEGI
Sbjct: 115 AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGIN 174
Query: 169 QITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD 227
QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II+N G+ T+ DYPY+ +GTCD
Sbjct: 175 QIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCD 234
Query: 228 NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNN 287
++ A TI YED+P E++L +AV++QP+S+ ++A GRAF Y SG+ + CG
Sbjct: 235 QIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQ 294
Query: 288 CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
DHGV VG+GT ENG YW+++NSWG++WGESGY+R+ R+ +G CGIA SYP
Sbjct: 295 LDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYP 351
Query: 344 V 344
+
Sbjct: 352 I 352
>gi|224096714|ref|XP_002310708.1| predicted protein [Populus trichocarpa]
gi|222853611|gb|EEE91158.1| predicted protein [Populus trichocarpa]
Length = 356
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 162/317 (51%), Positives = 210/317 (66%), Gaps = 11/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ ++ ++ W+ +HG+ Y EKA R IFK NL +I++ N + NRTYK+G +F+DL
Sbjct: 21 DDEVMSIYKWWLQKHGKAYNRLGEKAKRFEIFKNNLRFIDEHNSQ-NRTYKVGLTKFADL 79
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
TN+E+RA++ G +S PS + Y+ +P S+DWR KGAV IKDQG CG
Sbjct: 80 TNQEYRAMFLGTRSDPKRRLMKSKNPSERYAYKAGDKLPESVDWRGKGAVNPIKDQGSCG 139
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGLA 212
SCWAFS VAAVEGI QI G+LI LSEQ+LVDC N GC+GGLMD AF++II N GL
Sbjct: 140 SCWAFSTVAAVEGINQIVTGELISLSEQELVDCDRFYNAGCNGGLMDYAFQFIINNGGLD 199
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYPY + TCD K K A +I +ED+ DE+AL +AV++QPVSV ++ASG A
Sbjct: 200 TEKDYPYLGNDDTCDRDKMKTKAVSIDGFEDVLPFDEKALQKAVAHQPVSVAIEASGMAL 259
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
FY+SGV +CG DHGV VVG+GT E G YWL++NSWG WGE GYI++ R+
Sbjct: 260 QFYQSGVFTGECGTALDHGVVVVGYGT---EKGLDYWLVRNSWGTEWGEHGYIKMQRNVR 316
Query: 331 ---AGLCGIATAASYPV 344
G CGIA +SYPV
Sbjct: 317 DTYTGRCGIAMESSYPV 333
>gi|124484387|dbj|BAF46304.1| cysteine proteinase precursor [Ipomoea nil]
Length = 474
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 160/323 (49%), Positives = 212/323 (65%), Gaps = 14/323 (4%)
Query: 30 GRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTN 89
G E + E E W+ +HG++Y EK R IF+ NL+YI++ N NR+YKLG N
Sbjct: 38 GLVRSEDEVKEMFESWLVKHGKSYNAVDEKDKRFKIFRDNLKYIDEKNSLENRSYKLGLN 97
Query: 90 EFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIK 147
F+D+TNEE+R Y G R SR + + +Y V +P SIDWREKGAVT +K
Sbjct: 98 RFADITNEEYRTGYLGAKR---DASRNMVKSKSDRYAPVAGDSLPDSIDWREKGAVTGVK 154
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYII 206
DQG CGSCWAFS +AAVEG+ Q+ G LI LSEQ+LVDC N GC+GG M AF++II
Sbjct: 155 DQGSCGSCWAFSTIAAVEGVNQLATGNLISLSEQELVDCDRKINQGCNGGDMGYAFQFII 214
Query: 207 ENKGLATEADYPYRHEEGTCDNQKE-KAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
+N G+ +E DYPY ++G CD+ ++ A A+I YE++P +E++L +AV+NQPVSV +
Sbjct: 215 KNGGIDSEEDYPYTGKDGKCDSYRQNNAKVASIDGYEEVPVNNEKSLQKAVANQPVSVAI 274
Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
+A G F Y SG+ CG + DHGVA VG+GT ENG YW++KNSWG+ WGE GY+
Sbjct: 275 EAGGYDFQLYSSGIFTGSCGTDLDHGVAAVGYGT---ENGVDYWIVKNSWGDYWGEKGYV 331
Query: 326 RILRD----AGLCGIATAASYPV 344
R+ R+ GLCGIA ASYP
Sbjct: 332 RMQRNVKAKTGLCGIAMEASYPT 354
>gi|62320725|dbj|BAD95392.1| cysteine proteinase RD21A [Arabidopsis thaliana]
Length = 433
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 163/361 (45%), Positives = 227/361 (62%), Gaps = 36/361 (9%)
Query: 9 FIIPMFVIIILVITCASQVVS----------------GRSMHEPSIVEKHEQWMAQHGRT 52
F+ P I+ L + S V GRS E ++ +E W+ +HG+
Sbjct: 3 FLKPTMAILFLAMVAVSSAVDMSIISYDEKHGVSTTGGRS--EAEVMSIYEAWLVKHGKA 60
Query: 53 YKDE--LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPV 110
+EK R IFK NL ++++ N E N +Y+LG F+DLTN+E+R+ Y G
Sbjct: 61 QSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLG----- 114
Query: 111 PSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGIT 168
+ ++ R ++ +Y+ V D +P SIDWR+KGAV +KDQG CGSCWAFS + AVEGI
Sbjct: 115 AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGIN 174
Query: 169 QITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD 227
QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II+N G+ T+ DYPY+ +GTCD
Sbjct: 175 QIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCD 234
Query: 228 NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNN 287
++ A TI YED+P E++L +AV++QP+S+ ++A GRAF Y SG+ + CG
Sbjct: 235 QIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQ 294
Query: 288 CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
DHGV VG+GT ENG YW+++NSWG++WGESGY+R+ R+ +G CGIA SYP
Sbjct: 295 LDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYP 351
Query: 344 V 344
+
Sbjct: 352 I 352
>gi|18141285|gb|AAL60580.1|AF454958_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 485
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 165/359 (45%), Positives = 230/359 (64%), Gaps = 29/359 (8%)
Query: 3 LKFEKSFIIPMFVIIILVITCAS----------QVVSGRSMHEPSIVEKHEQWMAQHGRT 52
+K S + +F+ +I+V + VS RS E S + +E+W+ +HG+
Sbjct: 1 MKLLNSATVILFLTMIVVSSAMDMSIISYDKNHHTVSSRSDAEVSRL--YEEWLVKHGKA 58
Query: 53 YKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS 112
EK R IFK NL +I++ N + N +Y+LG +F+DLTN+E+R++Y G S
Sbjct: 59 QNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYRLGLTKFADLTNDEYRSMYLG------S 111
Query: 113 VSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQI 170
++ + S+ +Y+ V D +P S+DWR++GAV +KDQG CGSCWAFS + AVEGI +I
Sbjct: 112 RLKRKATKSSLRYEVRVGDAIPESVDWRKEGAVAEVKDQGSCGSCWAFSTIGAVEGINKI 171
Query: 171 TRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ 229
G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE DYPY+ +G CD
Sbjct: 172 VTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDTEEDYPYKGVDGRCDQT 231
Query: 230 KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCD 289
++ A TI YED+P E++L +A+S+QP+SV ++ GRAF Y SG+ + CG + D
Sbjct: 232 RKNAKVVTIDLYEDVPANSEESLKKALSHQPISVAIEGGGRAFQLYDSGIFDGICGTDLD 291
Query: 290 HGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
HGV VG+GT ENG YW++KNSWG +WGESGYIR+ R+ AG CGIA SYP+
Sbjct: 292 HGVVAVGYGT---ENGKDYWIVKNSWGTSWGESGYIRMERNIASSAGKCGIAVEPSYPI 347
>gi|357467173|ref|XP_003603871.1| Cysteine proteinase [Medicago truncatula]
gi|355492919|gb|AES74122.1| Cysteine proteinase [Medicago truncatula]
gi|388499154|gb|AFK37643.1| unknown [Medicago truncatula]
Length = 350
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 157/311 (50%), Positives = 216/311 (69%), Gaps = 12/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R +FK NL++I+ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKVVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G V R+ S F Y++V D+P S+DWR+KGAVT +K+QGQCGSCWA
Sbjct: 102 EFKNKYLGL--KVDLSQRRESSEEEFTYRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWA 158
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF +I++N GL E D
Sbjct: 159 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVKNGGLHKEED 218
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EE TC+ +KE + TI+ Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY
Sbjct: 219 YPYIMEESTCEMKKEVSEVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 278
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV + CG+ DHGV+ VG+GT++ G Y ++KNSWG WGE G+IR+ R+ G
Sbjct: 279 GGVFDGHCGSELDHGVSAVGYGTSK---GLDYIIVKNSWGAKWGEKGFIRMKRNIGKSEG 335
Query: 333 LCGIATAASYP 343
+CG+ ASYP
Sbjct: 336 ICGLYKMASYP 346
>gi|356559055|ref|XP_003547817.1| PREDICTED: cysteine proteinase RD21a [Glycine max]
Length = 366
Score = 322 bits (826), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 153/343 (44%), Positives = 210/343 (61%), Gaps = 9/343 (2%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
+ + + + ++CA + + + ++ +E+W+ +H + Y EK R +FK
Sbjct: 6 TLVTSTLLFLSFTLSCAIDTSTITNYTDNEVMTMYEEWLVKHQKVYNGLREKDKRFQVFK 65
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQ 126
NL +I++ N N TYKLG N+F+D+TNEE+R +Y G + + S + Y
Sbjct: 66 DNLGFIQEHNNNQNNTYKLGLNQFADMTNEEYRVMYFGTKSDAKRRLMKTKSTGHRYAYS 125
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+P +DWR KGAV IKDQG CGSCWAFS VA VE I +I GK + LSEQ+LVDC
Sbjct: 126 AGDRLPVHVDWRVKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDC 185
Query: 187 STD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N GC+GGLMD AFE+II+N G+ T+ DYPYR +G CD K+ A I +ED+P
Sbjct: 186 DRAYNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGFEDVP 245
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
DE AL +AV++QPVS+ ++ASGR Y+SGV CG + DHGV VVG+G+ ENG
Sbjct: 246 PYDENALKKAVAHQPVSIAIEASGRDLQLYQSGVFTGKCGTSLDHGVVVVGYGS---ENG 302
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
YWL++NSWG WGE GY ++ R+ G CGI ASYPV
Sbjct: 303 VDYWLVRNSWGTGWGEDGYFKMQRNVRTPTGKCGITMEASYPV 345
>gi|242070333|ref|XP_002450443.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
gi|241936286|gb|EES09431.1| hypothetical protein SORBIDRAFT_05g005530 [Sorghum bicolor]
Length = 351
Score = 322 bits (825), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 167/345 (48%), Positives = 227/345 (65%), Gaps = 24/345 (6%)
Query: 14 FVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
+ ++ + C V+ R + E ++ +HE+WM +HGRTYKDE EKA R +FK
Sbjct: 18 LLTVLAIANCIGCAVAARDLSSSTGYGEEAMTARHEKWMVEHGRTYKDEAEKARRFQVFK 77
Query: 68 QNLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
N +++ +N G + Y L N F+D+T++EF A YTG+ +P+P+ + + FKY
Sbjct: 78 ANAAFVDTSNAAAGGKKYHLAINRFADMTHDEFMARYTGF-KPLPATGK---KMPGFKYA 133
Query: 127 NVT---DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
NVT + ++DWR+KGAVT +K+Q +CG CWAFSAVAA+EG+ QI G+L+ LSEQQL
Sbjct: 134 NVTLSSEDQQAVDWRKKGAVTDVKNQQKCGCCWAFSAVAAIEGMHQINTGELVSLSEQQL 193
Query: 184 VDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
VDCST +N+GC GG M+ AF+Y+I N G+ATEA YPY +G C N + A + Y
Sbjct: 194 VDCSTNGNNNGCGGGTMEDAFQYVIGNNGIATEAAYPYTAMQGMCQNVQP---AVAVRSY 250
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTA 300
+ +P+ DE AL AV+ QPVSV VDA+ F FYK GV+ AD CG N +H V VG+GTA
Sbjct: 251 QQVPRDDEDALAAAVAGQPVSVAVDANN--FQFYKGGVMTADSCGTNLNHAVTAVGYGTA 308
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPVA 345
E+ G YWL+KN WG TWGE GY+R+ R G CG+A ASYPVA
Sbjct: 309 ED--GTPYWLLKNQWGSTWGEEGYLRLQRGVGACGVAKDASYPVA 351
>gi|222425026|dbj|BAH20463.1| cysteine protease [Spinacia oleracea]
Length = 473
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 155/330 (46%), Positives = 219/330 (66%), Gaps = 14/330 (4%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYK 85
++ S + ++ +E W+ QH + Y EK R IFK NLE+I++ N + ++T+K
Sbjct: 37 NLLPSSSRSDDEVMRIYESWLVQHRKNYNALGEKEKRFAIFKDNLEFIDQHNSDDSQTFK 96
Query: 86 LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS-----RPSTFKYQNVTDVPTSIDWREK 140
+G N+F+DLTNEEFR++Y G + S SS + + ++ ++P ++DWR+
Sbjct: 97 VGLNKFADLTNEEFRSVYLGRKKSSSSSPLLSSAKSKVKSDRYLFKEGDELPEAVDWRKN 156
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMD 199
GAV +KDQGQCGSCWAFS +AAVEGI QI G+L+ LSEQ+LVDC T N GC GGLMD
Sbjct: 157 GAVAKVKDQGQCGSCWAFSTIAAVEGINQIVTGELLSLSEQELVDCDTSYNSGCDGGLMD 216
Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ 259
A+E+II N G+ T+ADYPY ++G CD ++ A TI +ED+P+ DE+AL +AV++Q
Sbjct: 217 YAYEFIINNGGIDTDADYPYTAKDGKCDQYRKNAKVVTIDDFEDVPENDEKALQKAVAHQ 276
Query: 260 PVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
PVSV ++A G F FY+SGV CG + DHGV VG+G+ ++G YW+++NSWG W
Sbjct: 277 PVSVAIEAGGSTFQFYQSGVFTGKCGADLDHGVVAVGYGS---DDGKDYWIVRNSWGADW 333
Query: 320 GESGYIRILRD-----AGLCGIATAASYPV 344
GESGYIR+ R+ G CGIA SYP+
Sbjct: 334 GESGYIRMERNLETVKTGKCGIAIEPSYPI 363
>gi|224133764|ref|XP_002321655.1| predicted protein [Populus trichocarpa]
gi|222868651|gb|EEF05782.1| predicted protein [Populus trichocarpa]
Length = 360
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 163/347 (46%), Positives = 222/347 (63%), Gaps = 12/347 (3%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRL 63
+K + +++ ++L T + E S+ + +E+W + H T L EK R
Sbjct: 1 MKKLLFVALYLALVLGFTESFDFHEKDLESEESLWDLYEKWRSHH--TVSTSLDEKRKRF 58
Query: 64 NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-T 122
N+F+ N+ ++ NK ++ YKL N+F+D+TN EFR Y ++ R + + +
Sbjct: 59 NVFRANVLHVHNTNKM-DKPYKLKLNKFADMTNHEFRTAYASSKVKHHTMFRGAPLGNGS 117
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F Y N+ VP SIDWR+KGAVT +KDQG+CGSCWAFS + AVEGI I KLI LSEQ+
Sbjct: 118 FMYGNIDKVPASIDWRKKGAVTPVKDQGKCGSCWAFSTIVAVEGINFIKTNKLISLSEQE 177
Query: 183 LVDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
LVDC+T +NHGC+GGLMD AFE+I + KG+ TEA+YPYR ++G CD K A +I +
Sbjct: 178 LVDCNTGENHGCNGGLMDYAFEFITKQKGITTEANYPYRAQDGHCDANKANQPAVSIDGH 237
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
ED+ +E ALL+AV+NQPVSV +DA G F FY GV +CG DHGVA+VG+GT
Sbjct: 238 EDVLHNNENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGECGKELDHGVAIVGYGTTV 297
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+ G KYW+++NSWG WGE GYIR+ R GLCGIA ASYP+
Sbjct: 298 D--GTKYWIVRNSWGPEWGERGYIRMQRGISDRRGLCGIAMEASYPI 342
>gi|14517542|gb|AAK62661.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
gi|19548039|gb|AAL87383.1| F2G19.31/F2G19.31 [Arabidopsis thaliana]
Length = 462
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 163/361 (45%), Positives = 227/361 (62%), Gaps = 36/361 (9%)
Query: 9 FIIPMFVIIILVITCASQVVS----------------GRSMHEPSIVEKHEQWMAQHGRT 52
F+ P I+ L + S V GRS E ++ +E W+ +HG+
Sbjct: 3 FLKPTMAILFLAMVTVSSAVDMSIISYDEKHGVSTTGGRS--EAEVMSIYEAWLVKHGKA 60
Query: 53 YKDE--LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPV 110
+EK R IFK NL ++++ N E N +Y+LG F+DLTN+E+R+ Y G
Sbjct: 61 QSQNSLVEKDRRFEIFKDNLRFVDEHN-EKNLSYRLGLTRFADLTNDEYRSKYLG----- 114
Query: 111 PSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGIT 168
+ ++ R ++ +Y+ V D +P SIDWR+KGAV +KDQG CGSCWAFS + AVEGI
Sbjct: 115 AKMEKKGERRTSLRYEARVGDELPESIDWRKKGAVAEVKDQGGCGSCWAFSTIGAVEGIN 174
Query: 169 QITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD 227
QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II+N G+ T+ DYPY+ +GTCD
Sbjct: 175 QIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFIIKNGGIDTDKDYPYKGVDGTCD 234
Query: 228 NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNN 287
++ A TI YED+P E++L +AV++QP+S+ ++A GRAF Y SG+ + CG
Sbjct: 235 QIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPISIAIEAGGRAFQLYDSGIFDGSCGTQ 294
Query: 288 CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
DHGV VG+GT ENG YW+++NSWG++WGESGY+R+ R+ +G CGIA SYP
Sbjct: 295 LDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLRMARNIASSSGKCGIAIEPSYP 351
Query: 344 V 344
+
Sbjct: 352 I 352
>gi|37780041|gb|AAP32193.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 322 bits (825), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 156/312 (50%), Positives = 219/312 (70%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R +FK NL++I++ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDERNKIVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G + S R+SS F Y++V D+P S+DWR+KGAVT +K+QGQCGSCWA
Sbjct: 102 EFKNKYLGLKVNL-SQRRESSNEEEFTYRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWA 159
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF +I++N GL E D
Sbjct: 160 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIVQNGGLHKEDD 219
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EE TC+ +KE+ TI+ Y D+P+ +EQ+LL+A++NQP+SV ++AS R F FY
Sbjct: 220 YPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYS 279
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV + CG++ DHGV+ VG+GT++ + Y ++KNSWG WGE G+IR+ R+ G
Sbjct: 280 GGVFDGHCGSDLDHGVSAVGYGTSKNLD---YIIVKNSWGAKWGEKGFIRMKRNIGKPEG 336
Query: 333 LCGIATAASYPV 344
+CG+ ASYP
Sbjct: 337 ICGLYKMASYPT 348
>gi|357474523|ref|XP_003607546.1| Cysteine proteinase [Medicago truncatula]
gi|358347207|ref|XP_003637651.1| Cysteine proteinase [Medicago truncatula]
gi|355503586|gb|AES84789.1| Cysteine proteinase [Medicago truncatula]
gi|355508601|gb|AES89743.1| Cysteine proteinase [Medicago truncatula]
Length = 345
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 164/350 (46%), Positives = 223/350 (63%), Gaps = 26/350 (7%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPS----------IVEKHEQWMAQHGRTYKDELEKA 60
I I IL++ C + V++ S P+ + ++ + W+ +HGR YK E+
Sbjct: 5 ILTTTIFILLMLCNTCVIASESECPPTHKQKSSDVEAMKKRFDGWVKRHGRKYKHNDERE 64
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
+R I++ N++YI+ N + N +Y L N+F+DLTNEEF++ Y G + +R S
Sbjct: 65 VRFGIYQANVQYIQCKNAQKN-SYNLTDNKFADLTNEEFQSTYMGLS------TRLRSHN 117
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ F+Y D+P S DWR++GAVT I DQGQCG CWAF+AVAAVEGI +I GKLI LSE
Sbjct: 118 TGFRYDEHGDLPESKDWRKEGAVTEIMDQGQCGGCWAFAAVAAVEGINKIKSGKLISLSE 177
Query: 181 QQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
Q+L+DC + N GC GGLM+ A+ +IIEN GL TE DYPY +GTC +K AA+I
Sbjct: 178 QELIDCDVKSGNQGCQGGLMETAYTFIIENGGLTTEQDYPYEGVDGTCKMEKAAHYAASI 237
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
S YE++P +E L A ++QPVSV +DA G +F FY GV + CG +HGV VVG+G
Sbjct: 238 SGYEEVPADNEAKLKAAAAHQPVSVAIDAGGYSFQFYSEGVFSGICGKQLNHGVTVVGYG 297
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+E KYW++KNSWG WGESGYIR+ RD G+CGIA ASYP+
Sbjct: 298 ---KETINKYWIVKNSWGADWGESGYIRMKRDTLSKEGMCGIAMQASYPL 344
>gi|413933049|gb|AFW67600.1| cysteine protease 1 [Zea mays]
Length = 341
Score = 322 bits (824), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 165/342 (48%), Positives = 224/342 (65%), Gaps = 17/342 (4%)
Query: 14 FVIIILVI----TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
F++ +LV+ C + + ++ +HE+WMA+HGR YKDE EKA RL +F+ N
Sbjct: 6 FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-- 127
E I+ N G +++L TN F+DLT EEFRA TG RP P+ S + R F+Y+N
Sbjct: 66 AELIDSFNAAGTHSHRLATNRFADLTVEEFRAARTGL-RPRPAPSAGAGR---FRYENFS 121
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC- 186
+ D S+DWR GAVT +KDQG CG CWAFSAVAAVEG+ +I G+L+ LSEQ+LVDC
Sbjct: 122 LADAAQSVDWRAMGAVTGVKDQGACGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCD 181
Query: 187 -STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
S + GC GGLMD AF+++ GLA+E+ YPY+ +G C + A AA+I +ED+P
Sbjct: 182 VSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQGRDGPCRSSAAAARAASIRGHEDVP 241
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+ +E AL AV+NQPVSV ++ AF FY SGVL CG + +H + VG+GTA + G
Sbjct: 242 RNNEAALAAAVANQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTAND--G 299
Query: 306 AKYWLIKNSWGETWGESGYIRI---LRDAGLCGIATAASYPV 344
+YWL+KNSWG +WGE GY+RI +R G+CG+A SYPV
Sbjct: 300 TRYWLMKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 341
>gi|162463464|ref|NP_001104879.1| cysteine proteinase Mir3 precursor [Zea mays]
gi|2425066|gb|AAB88263.1| cysteine proteinase Mir3 [Zea mays]
Length = 480
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 157/311 (50%), Positives = 204/311 (65%), Gaps = 14/311 (4%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
+ +WMA HGRTY + R +F+ NL YI+ N G +++LG N F+DLTN+E
Sbjct: 44 YAEWMAAHGRTYNAVGAEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
+ A Y G R P R+ + + + D+P S+DWR KGAV +KDQG CG+CWAF
Sbjct: 104 YPATYLG-ARTRPQRDRKLG--ARYHAADNEDLPESVDWRAKGAVAEVKDQGSCGTCWAF 160
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADY 217
S +AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE DY
Sbjct: 161 STIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKDY 220
Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
PY+ +G CD ++ A TI YED+P DE++L +AV+NQPVSV ++A+G AF Y S
Sbjct: 221 PYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTAFQLYSS 280
Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGL 333
G+ CG DHGV VG+GT ENG YW++KNSWG +WGESGY+R+ R+ +G
Sbjct: 281 GIFTGSCGTRLDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSGK 337
Query: 334 CGIATAASYPV 344
CGIA SYP+
Sbjct: 338 CGIAVEPSYPL 348
>gi|297598407|ref|NP_001045533.2| Os01g0971400 [Oryza sativa Japonica Group]
gi|15289977|dbj|BAB63672.1| putative cysteine protease CP1 [Oryza sativa Japonica Group]
gi|125529282|gb|EAY77396.1| hypothetical protein OsI_05384 [Oryza sativa Indica Group]
gi|125573472|gb|EAZ14987.1| hypothetical protein OsJ_04922 [Oryza sativa Japonica Group]
gi|215740756|dbj|BAG97412.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215741010|dbj|BAG97505.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765325|dbj|BAG87022.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215767338|dbj|BAG99566.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255674119|dbj|BAF07447.2| Os01g0971400 [Oryza sativa Japonica Group]
Length = 365
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 166/321 (51%), Positives = 212/321 (66%), Gaps = 20/321 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E++MA++ + Y EK R +FK NL +I++ NK+ Y LG NEF+DLT++
Sbjct: 48 LMELFEKFMAKYRKAYSSLEEKLRRFEVFKDNLNHIDEENKK-ITGYWLGLNEFADLTHD 106
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNV--TDVPTSIDWREKGAVTHIKDQGQCGSC 155
EF+A Y G + +R++S F+Y+ V +P +DWR+KGAVT +K+QGQCGSC
Sbjct: 107 EFKAAYLGL---TLTPARRNSNDQLFRYEEVEAASLPKEVDWRKKGAVTEVKNQGQCGSC 163
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
WAFS VAAVEGI I G L LSEQ+L+DC TD N+GCSGGLMD AF YI N GL TE
Sbjct: 164 WAFSTVAAVEGINAIVTGNLTRLSEQELIDCDTDGNNGCSGGLMDYAFSYIAANGGLHTE 223
Query: 215 ADYPYRHEEGTC-------DNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
YPY EEGTC D+ E A A TIS YED+P+ +EQALL+A+++QPVSV ++A
Sbjct: 224 ESYPYLMEEGTCRRGSTEGDDDGEAAAAVTISGYEDVPRNNEQALLKALAHQPVSVAIEA 283
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
SGR F FY GV + CG DHGV VG+GTA + G Y ++KNSWG WGE GYIR+
Sbjct: 284 SGRNFQFYSGGVFDGPCGTRLDHGVTAVGYGTASK--GHDYIIVKNSWGSHWGEKGYIRM 341
Query: 328 LRDA----GLCGIATAASYPV 344
R GLCGI ASYP
Sbjct: 342 RRGTGKHDGLCGINKMASYPT 362
>gi|171702831|dbj|BAG16371.1| cysteine protease [Brassica oleracea var. italica]
Length = 441
Score = 321 bits (823), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 160/326 (49%), Positives = 218/326 (66%), Gaps = 19/326 (5%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYK 85
VS RS E S + +E+W+ +HG+ EK R IFK NL +I++ N + N +Y+
Sbjct: 28 HTVSSRSDAEVSRL--YEEWLVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84
Query: 86 LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAV 143
LG +F+DLTN+E+R++Y G S ++ + S+ +Y+ V D +P S+DWR++GAV
Sbjct: 85 LGLTKFADLTNDEYRSMYLG------SRLKRKATKSSLRYEVRVGDAIPESVDWRKEGAV 138
Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAF 202
+KDQG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLMD AF
Sbjct: 139 AEVKDQGSCGSCWAFSTIGAVEGINKIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAF 198
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
E+II N G+ TE DYPY+ +G CD ++ A TI YED+P E++L +A+S+QP+S
Sbjct: 199 EFIINNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDLYEDVPANSEESLKKALSHQPIS 258
Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
V ++ GRAF Y SG+ + CG + DHGV VG+GT ENG YW++KNSWG +WGES
Sbjct: 259 VAIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT---ENGKDYWIVKNSWGTSWGES 315
Query: 323 GYIRILRD----AGLCGIATAASYPV 344
GYIR+ R+ AG CGIA SYP+
Sbjct: 316 GYIRMERNIASSAGKCGIAVEPSYPI 341
>gi|171702843|dbj|BAG16377.1| cysteine protease [Brassica rapa var. perviridis]
Length = 431
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 159/325 (48%), Positives = 217/325 (66%), Gaps = 17/325 (5%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYK 85
VS RS E S + +E+W+ +HG+ EK R IFK NL +I++ N + N +Y+
Sbjct: 28 HTVSSRSDVEVSRL--YEEWVVKHGKAQNSLTEKDRRFEIFKDNLRFIDEHNGK-NLSYR 84
Query: 86 LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVT 144
LG +F+DLTN+E+R++Y G + R++++ S V D +P S+DWR++GAV
Sbjct: 85 LGLTKFADLTNDEYRSMYLG-----SRLKRKATKTSLRYEARVGDAIPESVDWRKEGAVA 139
Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFE 203
+KDQG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLMD AFE
Sbjct: 140 EVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYAFE 199
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
+II+N G+ TE DYPY+ +G CD ++ A TI YED+P E++L +A+S+QP+SV
Sbjct: 200 FIIKNGGIDTEEDYPYKGVDGRCDQTRKNAKVVTIDSYEDVPANSEESLKKALSHQPISV 259
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
++ GRAF Y SG+ + CG + DHGV VG+GT ENG YW++KNSWG +WGESG
Sbjct: 260 AIEGGGRAFQLYDSGIFDGICGTDLDHGVVAVGYGT---ENGKDYWIVKNSWGTSWGESG 316
Query: 324 YIRILRD----AGLCGIATAASYPV 344
YIR+ R+ AG CGIA SYP+
Sbjct: 317 YIRMERNIASSAGKCGIAVEPSYPI 341
>gi|449522968|ref|XP_004168497.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 348
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 156/311 (50%), Positives = 215/311 (69%), Gaps = 12/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E+W++ HG+ Y+ EK R +FK NL++I++ NK+ +Y LG NEF+DLT++
Sbjct: 41 LIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFADLTHQ 99
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ +Y G + S +RQS P F Y++V D+P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 100 EFKNMYLGL-KVESSRTRQS--PEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWA 156
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI +I G L LSEQ+L+DC N+GC GGLMD AF +I+ + GL E D
Sbjct: 157 FSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEED 216
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY E TCDN+K + TIS Y+D+P+ +E +L++A+++QP+SV ++ASGR F FY
Sbjct: 217 YPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYS 276
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
GV + CG DHGV VG+G+++ G Y ++KNSWG WGE GYIR+ R+ AG
Sbjct: 277 GGVFDGPCGTQLDHGVTAVGYGSSK---GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAG 333
Query: 333 LCGIATAASYP 343
LCGI ASYP
Sbjct: 334 LCGINKMASYP 344
>gi|37780039|gb|AAP32192.1| cysteine protease 14 [Trifolium repens]
Length = 351
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 157/312 (50%), Positives = 217/312 (69%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R +FK NL++I+ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSRHGKIYETIEEKLLRFEVFKDNLKHIDDRNKIVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G + S R+SS F Y++V D+P S+DWR+KGAVT +K+QGQCGSCWA
Sbjct: 102 EFKNKYLGLKVDL-SQRRESSNEEEFTYRDV-DLPKSVDWRKKGAVTPVKNQGQCGSCWA 159
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF +I +N GL E D
Sbjct: 160 FSTVAAVEGINQIVTGNLTSLSEQELIDCDTTYNNGCNGGLMDYAFSFIGQNGGLHKEED 219
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EE TC+ +KE+ TI+ Y D+P+ +EQ+LL+A++NQP+SV ++AS R F FY
Sbjct: 220 YPYIMEESTCEMKKEETQVVTINGYHDVPQNNEQSLLKALANQPLSVAIEASSRDFQFYS 279
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
GV + CG++ DHGV+ VG+GT++ + Y ++KNSWG WGE G+IR+ RD G
Sbjct: 280 GGVFDGHCGSDLDHGVSAVGYGTSKNLD---YIIVKNSWGAKWGEKGFIRMKRDIGKPEG 336
Query: 333 LCGIATAASYPV 344
+CG+ ASYP
Sbjct: 337 ICGLYKMASYPT 348
>gi|363807062|ref|NP_001242584.1| uncharacterized protein LOC100804015 precursor [Glycine max]
gi|255640677|gb|ACU20623.1| unknown [Glycine max]
Length = 366
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 155/340 (45%), Positives = 214/340 (62%), Gaps = 11/340 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
++ ++ L T + + + ++ + ++ +E+W+ +H + Y + +K R +FK NL
Sbjct: 7 IYTLLFLSFTLSYAIKTSTIINYTDNEVMAMYEEWLVRHQKGYNELGKKDKRFQVFKDNL 66
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQNVT 129
+I++ N N TYKLG N+F+D+TNEE+RA+Y G + + S + +
Sbjct: 67 GFIQEHNNNLNNTYKLGLNKFADMTNEEYRAMYLGTKSNAKRRLMKTKSTGHRYAFSARD 126
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P +DWR KGAV IKDQG CGSCWAFS VA VE I +I GK + LSEQ+LVDC
Sbjct: 127 RLPVHVDWRMKGAVAPIKDQGSCGSCWAFSTVATVEAINKIVTGKFVSLSEQELVDCDRA 186
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD AFE+II+N G+ T+ DYPYR +G CD K+ A I YED+P D
Sbjct: 187 YNEGCNGGLMDYAFEFIIQNGGIDTDKDYPYRGFDGICDPTKKNAKVVNIDGYEDVPPYD 246
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL +AV++QPVSV ++ASGRA Y+SGV CG + DHGV VVG+G+ ENG Y
Sbjct: 247 ENALKKAVAHQPVSVAIEASGRALQLYQSGVFTGKCGTSLDHGVVVVGYGS---ENGVDY 303
Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
WL++NSWG WGE GY ++ R+ G CGI ASYPV
Sbjct: 304 WLVRNSWGTGWGEDGYFKMQRNVRTSTGKCGITMEASYPV 343
>gi|449455625|ref|XP_004145553.1| PREDICTED: xylem cysteine proteinase 1-like [Cucumis sativus]
Length = 351
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 156/312 (50%), Positives = 215/312 (68%), Gaps = 12/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E+W++ HG+ Y+ EK R +FK NL++I++ NK+ +Y LG NEF+DLT++
Sbjct: 44 LIELFEEWISNHGKIYETIEEKWHRFEVFKDNLKHIDETNKKVT-SYWLGVNEFADLTHQ 102
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ +Y G + S +RQS P F Y++V D+P S+DWR+KGAVT +K+QG CGSCWA
Sbjct: 103 EFKNMYLGL-KVESSRTRQS--PEEFTYKDVVDLPKSVDWRKKGAVTRVKNQGSCGSCWA 159
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI +I G L LSEQ+L+DC N+GC GGLMD AF +I+ + GL E D
Sbjct: 160 FSTVAAVEGINKIVGGNLTSLSEQELIDCDRPYNNGCHGGLMDYAFSFIVSSGGLHKEED 219
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY E TCDN+K + TIS Y+D+P+ +E +L++A+++QP+SV ++ASGR F FY
Sbjct: 220 YPYLEVESTCDNKKGELEVVTISGYKDVPENNEASLIKALAHQPLSVAIEASGRDFQFYS 279
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
GV + CG DHGV VG+G+++ G Y ++KNSWG WGE GYIR+ R+ AG
Sbjct: 280 GGVFDGPCGTQLDHGVTAVGYGSSK---GVDYIIVKNSWGPKWGEKGYIRMKRNTGKPAG 336
Query: 333 LCGIATAASYPV 344
LCGI ASYP
Sbjct: 337 LCGINKMASYPT 348
>gi|388517427|gb|AFK46775.1| unknown [Medicago truncatula]
Length = 362
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 159/345 (46%), Positives = 220/345 (63%), Gaps = 10/345 (2%)
Query: 6 EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
+K +I + + ++LV++ + + S+ + +E+W + H + ++ EK R N+
Sbjct: 4 KKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFK 124
FK N+ ++ NK ++ YKL N+F+D+TN EF+ Y G + R + R S TF
Sbjct: 63 FKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGTKVNHHRMFRGTPRVSGTFM 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
Y+N T P S+DWR+KGAVT +KDQGQCGSCWAFS V AVEGI QI +L+ LSEQ+L+
Sbjct: 122 YENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELI 181
Query: 185 DCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC +N GC+GGLM+ AFEYI + G+ TE+ YPY +G+CD KE +I +E
Sbjct: 182 DCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDGHET 241
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P DE ALL+AV+NQPVSV +DA G F FY GV DCG +HGVA+VG+GT +
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTTVD- 300
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G YW+++NSWG WGE G IR+ R+ GLCGIA ASYPV
Sbjct: 301 -GTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|217073894|gb|ACJ85307.1| unknown [Medicago truncatula]
gi|388507498|gb|AFK41815.1| unknown [Medicago truncatula]
Length = 362
Score = 320 bits (821), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 159/345 (46%), Positives = 220/345 (63%), Gaps = 10/345 (2%)
Query: 6 EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
+K +I + + ++LV++ + + S+ + +E+W + H + ++ EK R N+
Sbjct: 4 KKLLLIVLSIALVLVVSESFDFHDKDVSSDESLWDLYERWRSHHTVS-RNLNEKQKRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFK 124
FK N+ ++ NK ++ YKL N+F+D+TN EF+ Y G + R + R S TF
Sbjct: 63 FKSNVMHVHNTNKM-DKPYKLKLNKFADMTNHEFKTTYAGSKVNHHRMFRGTPRVSGTFM 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
Y+N T P S+DWR+KGAVT +KDQGQCGSCWAFS V AVEGI QI +L+ LSEQ+L+
Sbjct: 122 YENFTKAPASVDWRKKGAVTDVKDQGQCGSCWAFSTVVAVEGINQIKTNRLVPLSEQELI 181
Query: 185 DCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC +N GC+GGLM+ AFEYI + G+ TE+ YPY +G+CD KE +I +E
Sbjct: 182 DCDNQENQGCNGGLMEYAFEYIKQKGGVTTESYYPYTANDGSCDATKENVPTVSIDGHET 241
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P DE ALL+AV+NQPVSV +DA G F FY GV DCG +HGVA+VG+GT +
Sbjct: 242 VPANDEDALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCGKELNHGVAIVGYGTTVD- 300
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G YW+++NSWG WGE G IR+ R+ GLCGIA ASYPV
Sbjct: 301 -GTNYWIVRNSWGAEWGEQGCIRMKRNVSNKEGLCGIAMEASYPV 344
>gi|40806500|gb|AAR92155.1| putative cysteine protease 2 [Iris x hollandica]
Length = 359
Score = 320 bits (821), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 162/345 (46%), Positives = 223/345 (64%), Gaps = 12/345 (3%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
KS ++ + V + V + + + + E S+ +E+W + H +D EK R N+
Sbjct: 4 KSMLLALVVALAFVGVARTIPFNEKDLASEESLWGLYERWRSHH-TVSRDLSEKNKRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFK 124
FK+N ++I + NK+ + YKLG N+F+D+TN+EFR+ Y G R + R + +F
Sbjct: 63 FKENAKFIHEFNKK-DAPYKLGLNKFADMTNQEFRSTYAGSKIHHHRTQRGTPRATGSFM 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
Y+NV +P S+DWR +GAV +KDQGQCGSCWAFS +A+VEGI +I +L+ LS QQLV
Sbjct: 122 YENVHSIPASVDWRTQGAVAPVKDQGQCGSCWAFSTIASVEGINKIKTNQLVPLSGQQLV 181
Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC TD N GC+GGLMD AFE+I N G+ +E+ YPY E+G+C ++ V TI YED
Sbjct: 182 DCDTDQNEGCNGGLMDYAFEFIKSNGGITSESAYPYTAEQGSCASESSAPV-VTIDGYED 240
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P +E AL++AV+NQ VSV ++ASG AF FY GV CGN DHGVAVVG+G +
Sbjct: 241 VPANNEAALMKAVANQVVSVAIEASGMAFQFYSEGVFTGSCGNELDHGVAVVGYGATRD- 299
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G KYW+++NSWG WGE GYIR+ R GLCGIA SYP+
Sbjct: 300 -GTKYWIVRNSWGAEWGEKGYIRMQRGIRARHGLCGIAMEPSYPL 343
>gi|220983358|dbj|BAH11164.1| cysteine protease [Hordeum vulgare]
Length = 462
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 159/318 (50%), Positives = 209/318 (65%), Gaps = 14/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEF 91
E + + +WMA+H TY E+ R F+ NL YI++ N G +++LG N F
Sbjct: 35 EEEVRRMYAEWMAEHHSTYNPIGEEERRFEAFRNNLRYIDQHNAAADAGVHSFRLGLNRF 94
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+DLTNEE+R+ Y G R P R+ S + ++ + ++P S+DWR+KGAV +KDQG
Sbjct: 95 ADLTNEEYRSTYLG-ARTKPDRERKLS--ARYQAADNDELPESVDWRKKGAVGAVKDQGG 151
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
CGSCWAFSA+AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD AFE+II N G
Sbjct: 152 CGSCWAFSAIAAVEGINQIVTGDMIPLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGG 211
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
+ +E DYPY+ + CD K+ A TI YED+P E++L +AV+NQP+SV ++A GR
Sbjct: 212 IDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSEKSLQKAVANQPISVAIEAGGR 271
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
AF YKSG+ CG DHGVA VG+GT ENG YWL++NSWG WGE+GYIR+ R+
Sbjct: 272 AFQLYKSGIFTGTCGTALDHGVAAVGYGT---ENGKDYWLVRNSWGSVWGENGYIRMERN 328
Query: 331 ----AGLCGIATAASYPV 344
+G CGIA SYP
Sbjct: 329 IKASSGKCGIAVEPSYPT 346
>gi|414584879|tpg|DAA35450.1| TPA: cysteine protease 1 [Zea mays]
Length = 522
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 155/320 (48%), Positives = 212/320 (66%), Gaps = 16/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGTNEFSD 93
EP +E W+A+HGR Y E+ R +F NL +++ N + ++LG N+F+D
Sbjct: 102 EPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGMNQFAD 161
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKGAVTHIKDQG 150
LTN+EFRA Y G P SR+ +Y++ ++P S+DWREKGAV +K+QG
Sbjct: 162 LTNDEFRAAYLGARIPA---SRRRGTAVGERYRHGGGAEELPESVDWREKGAVAPVKNQG 218
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIEN 208
QCGSCWAFSAV++VE + QI G+++ LSEQ+LV+CSTD N GC+GGLMD AF++II+N
Sbjct: 219 QCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFDFIIKN 278
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
G+ TE DYPY+ +G CD +E A +I +ED+P+ DE++L +AV++QPVSV ++A
Sbjct: 279 GGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAG 338
Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
GR F YK+GV C N DHGV VG+GT ENG YW+++NSWG WGE GYIR+
Sbjct: 339 GREFQLYKAGVFTGTCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGAKWGEDGYIRME 395
Query: 329 RD----AGLCGIATAASYPV 344
R+ G CGIA ASYP
Sbjct: 396 RNVNATTGKCGIAMMASYPT 415
>gi|162459393|ref|NP_001105993.1| cysteine protease component of protease-inhibitor complex precursor
[Zea mays]
gi|6682829|dbj|BAA88898.1| cysteine protease component of protease-inhibitor complex [Zea
mays]
Length = 465
Score = 320 bits (820), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 158/312 (50%), Positives = 204/312 (65%), Gaps = 16/312 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEE 98
+ +WMA HGRTY E+ R +F+ NL YI+ N G +++LG N F+DLTN+E
Sbjct: 44 YAEWMAAHGRTYNAVGEEERRYQVFRDNLRYIDAHNAAADAGVHSFRLGLNRFADLTNDE 103
Query: 99 FRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
+RA Y G RP R+ + + + D+P S+DWR KGAV +KDQG GSCWA
Sbjct: 104 YRATYLGARTRP----QRERKLGARYHAADNEDLPESVDWRAKGAVAEVKDQGSYGSCWA 159
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS +AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ TE D
Sbjct: 160 FSTIAAVEGINQIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIINNGGIDTEKD 219
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ +G CD ++ A TI YED+P DE++L +AV+NQPVSV ++A+G F Y
Sbjct: 220 YPYKGTDGRCDVNRKNAKVVTIDSYEDVPANDEKSLQKAVANQPVSVAIEAAGTQFQLYS 279
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AG 332
SG+ CG DHGV VG+GT ENG YW++KNSWG +WGESGY+R+ R+ +G
Sbjct: 280 SGIFTGSCGTALDHGVTAVGYGT---ENGKDYWIVKNSWGSSWGESGYVRMERNIKASSG 336
Query: 333 LCGIATAASYPV 344
CGIA SYP+
Sbjct: 337 KCGIAVEPSYPL 348
>gi|18422289|ref|NP_568620.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|9757832|dbj|BAB08269.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|17065064|gb|AAL32686.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|21387153|gb|AAM47980.1| cysteine protease component of protease-inhibitor complex
[Arabidopsis thaliana]
gi|332007522|gb|AED94905.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 463
Score = 320 bits (820), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 161/330 (48%), Positives = 216/330 (65%), Gaps = 21/330 (6%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDE----LEKAMRLNIFKQNLEYIEKANKEGN 81
+ + S + + +E WM +HG+ ++ EK R IFK NL +I++ N + N
Sbjct: 34 HITTETSRSDSEVERIYEAWMVEHGKKKMNQNGLGAEKDQRFEIFKDNLRFIDEHNTK-N 92
Query: 82 RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTD-VPTSIDWRE 139
+YKLG F+DLTNEE+R++Y G +P V + S R YQ V D +P S+DWR+
Sbjct: 93 LSYKLGLTRFADLTNEEYRSMYLG-AKPTKRVLKTSDR-----YQARVGDALPDSVDWRK 146
Query: 140 KGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLM 198
+GAV +KDQG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLM
Sbjct: 147 EGAVADVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLM 206
Query: 199 DKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
D AFE+II+N G+ TEADYPY+ +G CD ++ A TI YED+P+ E +L +A+++
Sbjct: 207 DYAFEFIIKNGGIDTEADYPYKAADGRCDQNRKNAKVVTIDSYEDVPENSEASLKKALAH 266
Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
QP+SV ++A GRAF Y SGV + CG DHGV VG+GT ENG YW+++NSWG
Sbjct: 267 QPISVAIEAGGRAFQLYSSGVFDGLCGTELDHGVVAVGYGT---ENGKDYWIVRNSWGNR 323
Query: 319 WGESGYIRILRD----AGLCGIATAASYPV 344
WGESGYI++ R+ G CGIA ASYP+
Sbjct: 324 WGESGYIKMARNIEAPTGKCGIAMEASYPI 353
>gi|357166359|ref|XP_003580684.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 456
Score = 320 bits (820), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 158/327 (48%), Positives = 211/327 (64%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + + +WMA++GRTY E+ R +F+ NL Y+++ N G +
Sbjct: 27 IVSYGERSEEEVRRMYVEWMAENGRTYNAIGEEERRFEVFRDNLRYVDQHNAAADAGLHS 86
Query: 84 YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R Y G +PV R+ ++ + ++P S+DWREKGA
Sbjct: 87 FRLGLNRFADLTNEEYRDTYLGVRTKPV----RERRLSGRYQAADNEELPESVDWREKGA 142
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V +KDQG CGSCWAFSA+AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD A
Sbjct: 143 VAKVKDQGGCGSCWAFSAIAAVEGINQIVTGDMIALSEQELVDCDTSYNQGCNGGLMDYA 202
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
FE+II N G+ +E DYPY+ + CD K+ A TI YED+P E +L +AV+NQP+
Sbjct: 203 FEFIINNGGIDSEEDYPYKERDNRCDANKKNAKVVTIDGYEDVPVNSELSLKKAVANQPI 262
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV ++A GRAF YKSG+ CG DHGV VG+G+ ENG YW++KNSWG WGE
Sbjct: 263 SVAIEAGGRAFQLYKSGIFTGRCGTALDHGVTAVGYGS---ENGKDYWIVKNSWGTVWGE 319
Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
GY+R+ R+ +G CGIA SYP+
Sbjct: 320 DGYVRLERNIKATSGKCGIAIEPSYPL 346
>gi|238006338|gb|ACR34204.1| unknown [Zea mays]
Length = 465
Score = 320 bits (820), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 156/325 (48%), Positives = 213/325 (65%), Gaps = 16/325 (4%)
Query: 30 GRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGT 88
G EP +E W+A+HGR Y E+ R +F NL +++ N + ++LG
Sbjct: 40 GLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGFRLGM 99
Query: 89 NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKGAVTH 145
N+F+DLTN+EFRA Y G P SR+ +Y++ ++P S+DWREKGAV
Sbjct: 100 NQFADLTNDEFRAAYLGARIPA---SRRRGTAVGERYRHGGGAEELPESVDWREKGAVAP 156
Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFE 203
+K+QGQCGSCWAFSAV++VE + QI G+++ LSEQ+LV+CSTD N GC+GGLMD AF+
Sbjct: 157 VKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFD 216
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
+II+N G+ TE DYPY+ +G CD +E A +I +ED+P+ DE++L +AV++QPVSV
Sbjct: 217 FIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQPVSV 276
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
++A GR F YK+GV C N DHGV VG+GT ENG YW+++NSWG WGE G
Sbjct: 277 AIEAGGREFQLYKAGVFTGTCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGAKWGEDG 333
Query: 324 YIRILRD----AGLCGIATAASYPV 344
YIR+ R+ G CGIA ASYP
Sbjct: 334 YIRMERNVNATTGKCGIAMMASYPT 358
>gi|350538043|ref|NP_001234324.1| cysteine protease TDI-65 precursor [Solanum lycopersicum]
gi|5726641|gb|AAD48496.1|AF172856_1 cysteine protease TDI-65 [Solanum lycopersicum]
gi|2828252|emb|CAA05894.1| CYP1 [Solanum lycopersicum]
Length = 466
Score = 320 bits (819), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 155/345 (44%), Positives = 212/345 (61%), Gaps = 13/345 (3%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
I+ M + L ++S H + + +E W+ +HG++Y EK R
Sbjct: 12 ILLMLIFSTLSSASDMSIISYDETHIHRRTDDEVSALYESWLIEHGKSYNALGEKDKRFQ 71
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
IFK NL YI++ N N++YKLG +F+DLTNEE+R++Y G ++ +
Sbjct: 72 IFKDNLRYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRKKLSKNKSDRYL 131
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
+ +P SIDWREKG + +KDQG CGSCWAFSAVAA+E I I G LI LSEQ+LV
Sbjct: 132 PKVGDSLPESIDWREKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELV 191
Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC N GC GGLMD AFE++I+N G+ TE DYPY+ G CD ++ A I YED
Sbjct: 192 DCDRSYNEGCDGGLMDYAFEFVIKNGGIDTEEDYPYKERNGVCDQYRKNAKVVKIDSYED 251
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P +E+AL +AV++QPVS+ ++A GR F YKSG+ CG DHGV + G+GT E
Sbjct: 252 VPVNNEKALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVIAGYGT---E 308
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
NG YW+++NSWG WGE+GY+R+ R+ +GLCG+A SYPV
Sbjct: 309 NGMDYWIVRNSWGANWGENGYLRVQRNVASSSGLCGLAIEPSYPV 353
>gi|1345573|emb|CAA40073.1| endopeptidase (EP-C1) [Phaseolus vulgaris]
Length = 361
Score = 320 bits (819), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 158/317 (49%), Positives = 210/317 (66%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ + +E+W + H T L EK R N+FK NL ++ NK ++ YKL N+F+D
Sbjct: 32 EESLWDLYERWRSHH--TVSRSLGEKHKRFNVFKANLMHVHNTNKM-DKPYKLKLNKFAD 88
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
+TN EFR+ Y G + R + + F Y+ V VP S+DWR+KGAVT +KDQGQC
Sbjct: 89 MTNHEFRSTYAGSKVNHHRMFRGTPHENGAFMYEKVVSVPPSVDWRKKGAVTDVKDQGQC 148
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS V AVEGI QI KL+ LSEQ+LVDC + N GC+GGLM+ AFE+I + G+
Sbjct: 149 GSCWAFSTVVAVEGINQIKTNKLVALSEQELVDCDKEENQGCNGGLMESAFEFIKQKGGI 208
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE++YPY+ +EGTCD K +A +I +E++P DE ALL+AV+NQPVSV +DA G
Sbjct: 209 TTESNYPYKAQEGTCDASKVNDLAVSIDGHENVPANDEDALLKAVANQPVSVAIDAGGSD 268
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
F FY GV DC + +HGVA+VG+GT + G YW+++NSWG WGE GYIR+ R+
Sbjct: 269 FQFYSEGVFTGDCSTDLNHGVAIVGYGTTVD--GTNYWIVRNSWGPEWGEHGYIRMQRNI 326
Query: 331 ---AGLCGIATAASYPV 344
GLCGIA SYP+
Sbjct: 327 SKKEGLCGIAMLPSYPI 343
>gi|334185815|ref|NP_680113.3| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313879|sp|Q9STL4.1|CEP2_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP2; Flags:
Precursor
gi|4678354|emb|CAB41164.1| cysteine endopeptidase-like protein [Arabidopsis thaliana]
gi|332644882|gb|AEE78403.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 361
Score = 320 bits (819), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 158/350 (45%), Positives = 221/350 (63%), Gaps = 13/350 (3%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
+K +I +F ++IL C E + +++W + H + E+ R N
Sbjct: 1 MKKLLLIFLFSLVILQTACGFDYDDKEIESEEGLSTLYDRWRSHHS-VPRSLNEREKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN---RPVPSVSRQSSRPS 121
+F+ N+ ++ NK+ NR+YKL N+F+DLT EF+ YTG N + ++ S+
Sbjct: 60 VFRHNVMHVHNTNKK-NRSYKLKLNKFADLTINEFKNAYTGSNIKHHRMLQGPKRGSKQF 118
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
+ ++N++ +P+S+DWR+KGAVT IK+QG+CGSCWAFS VAAVEGI +I KL+ LSEQ
Sbjct: 119 MYDHENLSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQ 178
Query: 182 QLVDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
+LVDC T N GC+GGLM+ AFE+I +N G+ TE YPY +G CD K+ V TI
Sbjct: 179 ELVDCDTKQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDG 238
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
+ED+P+ DE ALL+AV+NQPVSV +DA F FY GV CG +HGVA VG+G+
Sbjct: 239 HEDVPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGSCGTELNHGVAAVGYGS- 297
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVAI 346
E G KYW+++NSWG WGE GYI+I R+ G CGIA ASYP+ +
Sbjct: 298 --ERGKKYWIVRNSWGAEWGEGGYIKIEREIDEPEGRCGIAMEASYPIKL 345
>gi|414588007|tpg|DAA38578.1| TPA: hypothetical protein ZEAMMB73_159244 [Zea mays]
Length = 307
Score = 320 bits (819), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 152/315 (48%), Positives = 209/315 (66%), Gaps = 17/315 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+ E+HE+WMA++ R YKD EKA R +FK N ++E N + + LG N+F+DLT E
Sbjct: 1 MAERHERWMAEYDRVYKDAAEKARRFEVFKDNFAFVESFNADKKNKFWLGVNQFADLTTE 60
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTDVPTSIDWREKGAVTHIKDQGQCGSC 155
EF+A N+ +S + + FKY+N V+ +PT++DWR KGAVT IK+QGQCG C
Sbjct: 61 EFKA-----NKGFKPISAEEVPTTGFKYENLSVSALPTAVDWRTKGAVTPIKNQGQCGCC 115
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN--HGCSGGLMDKAFEYIIENKGLAT 213
WAFSA+AA+EGI +++ G L+ LSEQ+ VDC T N GC GG MD AFE++I+N GLAT
Sbjct: 116 WAFSAIAAMEGIVKLSTGNLVSLSEQEPVDCDTHNMDEGCEGGWMDNAFEFVIKNGGLAT 175
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
E+ YPY+ +G C + AATI +ED+P +E AL++ V++QPVSV VDAS R F
Sbjct: 176 ESSYPYKVVDGKCKGGSKS--AATIKGHEDVPPNNEAALMKVVASQPVSVAVDASDRTFM 233
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
Y GV+ CG DHG+A +G+G E + KYW++KNSWG TWGE G++R+ +D
Sbjct: 234 LYSGGVMTGSCGTQLDHGIAAIGYGV--ESDDTKYWILKNSWGTTWGEKGFLRMEKDISD 291
Query: 332 --GLCGIATAASYPV 344
G+C +A SYP
Sbjct: 292 KRGMCDLAMKPSYPT 306
>gi|297830592|ref|XP_002883178.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
gi|297329018|gb|EFH59437.1| hypothetical protein ARALYDRAFT_479457 [Arabidopsis lyrata subsp.
lyrata]
Length = 452
Score = 319 bits (818), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 155/343 (45%), Positives = 220/343 (64%), Gaps = 12/343 (3%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
+ + +F ++++ ++ S + + +E +EQW+ ++ + Y EK R IF
Sbjct: 9 TLALLIFSMLLISLSLGSVTAADTTRNEAEARRMYEQWLVENRKNYNGLGEKETRFEIFT 68
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL+YIE+ N N+T+++G F+DLTN+EFRA+Y R +R + + Y+
Sbjct: 69 DNLKYIEEHNSVPNQTFEVGLTRFADLTNDEFRAIYL---RSKMERTRVPVKGERYLYKV 125
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P IDWR KGAV +KDQG CGSCWAFSA+ AVEGI QI G+LI LSEQ+LVDC
Sbjct: 126 GDTLPDQIDWRAKGAVNPVKDQGNCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185
Query: 188 TD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR-HEEGTCDNQKEKAVAATISKYEDLP 245
T N GC GGLMD AF++IIEN G+ TE DYPY ++ C++ K+ + TI YED+P
Sbjct: 186 TSYNGGCGGGLMDYAFKFIIENGGIDTEEDYPYTATDDNICNSDKKNSRVVTIDGYEDVP 245
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+ DE++L +A++NQP+SV ++A GRAF YKSGV CG + DHGV VG+G+ E G
Sbjct: 246 QNDEKSLKKALANQPISVAIEAGGRAFQLYKSGVFTGTCGTSLDHGVVAVGYGS---EGG 302
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
YW+++NSWG WGESGY ++ R+ +G CG+A ASYP
Sbjct: 303 QDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345
>gi|302796898|ref|XP_002980210.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
gi|300151826|gb|EFJ18470.1| hypothetical protein SELMODRAFT_153766 [Selaginella moellendorffii]
Length = 479
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 165/319 (51%), Positives = 210/319 (65%), Gaps = 26/319 (8%)
Query: 43 EQWMAQHGRTYKDEL--------EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ WM QHG++Y D EKA R IFK NL +I N E N+ Y LG N F+DL
Sbjct: 58 DSWMLQHGKSYADNALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGYFLGLNAFADL 116
Query: 95 TNEEFRALYTG--YNRPVPSVSRQSSRPSTFKYQNV--TDVPTSIDWREKGAVTHIKDQG 150
TNEEFRA G ++R SR+ + F+Y +V D+P SIDWREKGAV +KDQG
Sbjct: 117 TNEEFRAQRHGGRFDR-----SRERTSHEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQG 171
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENK 209
CGSCWAFSAVAA+EG+ ++ G+L+ LSEQ+LVDC ++ GC+GGLMD AF ++I+N
Sbjct: 172 SCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKNG 231
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
GL TEADYPY+ CD K A TI YED+P DE ALL+AV++QPVSV +DA G
Sbjct: 232 GLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDAGG 291
Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
+ FY+SG+ CG + DHGV VG+G +E+G YW+IKNSWG WGE GY+++ R
Sbjct: 292 SSMQFYRSGIFTGRCGTDLDHGVTNVGYG---KEDGKAYWIIKNSWGSNWGEKGYVKMAR 348
Query: 330 D----AGLCGIATAASYPV 344
+ AGLCGI ASYP
Sbjct: 349 NTGLAAGLCGINMEASYPT 367
>gi|262360187|gb|ACY38051.2| cysteine proteinase C1A [Dactylis glomerata]
Length = 365
Score = 319 bits (818), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 162/322 (50%), Positives = 215/322 (66%), Gaps = 18/322 (5%)
Query: 35 EPSIVEKHEQWMAQHG---RTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
E S+ +E W + H R E E A R N+FK+N+ YI +ANK+ +R ++L N+F
Sbjct: 33 EESLRGLYETWRSHHTVSRRGLGAEAE-ARRFNVFKENVRYIHEANKK-DRPFRLALNKF 90
Query: 92 SDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
+D+T +EFR Y G ++R + RQ +F Y + ++P ++DWR+KGAVT IK
Sbjct: 91 ADMTTDEFRRTYAGSRVRHHRSLSGGRRQGG--GSFMYADAENLPAAVDWRQKGAVTPIK 148
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYII 206
DQGQCGSCWAFS + AVEGI +I G+L+ LSEQ+L+DC+ +N GC+GGLMD AF++I
Sbjct: 149 DQGQCGSCWAFSTIVAVEGINKIRTGRLVSLSEQELMDCNIGENDGCNGGLMDVAFQFIQ 208
Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVD 266
+N G+ TEA YPY+ E+ +CD KE + +I YED+P DE AL +AV+NQPVSV +D
Sbjct: 209 QNGGITTEASYPYQGEQNSCDQSKENSHDVSIDGYEDVPANDESALQKAVANQPVSVAID 268
Query: 267 ASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
ASG F FY GV D G + DHGVA VG+GT + G KYW++KNSWGE WGE GYIR
Sbjct: 269 ASGNDFQFYSEGVFTTDGGTDLDHGVAAVGYGTTRD--GTKYWIVKNSWGEDWGEKGYIR 326
Query: 327 ILRDA----GLCGIATAASYPV 344
+ R GLCGIA ASYP
Sbjct: 327 MQRGVKQAEGLCGIAMEASYPT 348
>gi|297744465|emb|CBI37727.3| unnamed protein product [Vitis vinifera]
Length = 331
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 157/312 (50%), Positives = 208/312 (66%), Gaps = 33/312 (10%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ + E W+++HG+ YK EK R +F++NL +I++ NKE + +Y LG NEF+DL++E
Sbjct: 45 LIARFESWVSKHGKVYKSMEEKLHRFEVFRENLNHIDERNKEVS-SYWLGLNEFADLSHE 103
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF++ ++V D+P S+DWR+KGAVTH+K+QG CGSCWA
Sbjct: 104 EFKS------------------------KDVADLPESVDWRKKGAVTHVKNQGACGSCWA 139
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N GC+GGLMD AF +I N GL E D
Sbjct: 140 FSTVAAVEGINQIVTGNLTTLSEQELIDCDTTFNSGCNGGLMDYAFAFIASNGGLHKEDD 199
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTC+ QKE TIS YED+P+ DE++LL+A+++QP+SV ++ASGR F FY
Sbjct: 200 YPYLMEEGTCEEQKEDVDIVTISGYEDVPEKDEESLLKALAHQPLSVAIEASGRDFQFYS 259
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV N CG DHGVA VG+G+++ G Y ++KNSWG WGE GYIR+ R+ G
Sbjct: 260 GGVFNGPCGTELDHGVAAVGYGSSK---GLDYIIVKNSWGPKWGEKGYIRMKRNTGKTEG 316
Query: 333 LCGIATAASYPV 344
LCGI ASYP
Sbjct: 317 LCGINKMASYPT 328
>gi|226501480|ref|NP_001150266.1| cysteine protease 1 precursor [Zea mays]
gi|195637948|gb|ACG38442.1| cysteine protease 1 precursor [Zea mays]
Length = 462
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 155/329 (47%), Positives = 214/329 (65%), Gaps = 16/329 (4%)
Query: 26 QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN-KEGNRTY 84
G EP +E W+A+HGR Y E+ R +F NL +++ N + +
Sbjct: 33 HAARGLERTEPEARTLYELWLAEHGRAYNALGERDRRFRVFWDNLRFVDAHNERAAEHGF 92
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKG 141
+LG N+F+DLTN+EFRA Y G P +R+ +Y++ ++P S+DWREKG
Sbjct: 93 RLGMNQFADLTNDEFRAAYLGARIPA---ARRRGTAVGERYRHGGGAEELPESVDWREKG 149
Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMD 199
AV +K+QGQCGSCWAFSAV++VE + QI G+++ LSEQ+LV+CSTD N GC+GGLMD
Sbjct: 150 AVAPVKNQGQCGSCWAFSAVSSVESVNQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMD 209
Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ 259
AF++II+N G+ TE DYPY+ +G CD +E A +I +ED+P+ DE++L +AV++Q
Sbjct: 210 AAFDFIIKNGGIDTEGDYPYKAVDGKCDINRENAKVVSIDGFEDVPENDEKSLQKAVAHQ 269
Query: 260 PVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
PVSV ++A GR F YK+GV + C N DHGV VG+GT ENG YW+++NSWG W
Sbjct: 270 PVSVAIEAGGREFQLYKAGVFSGTCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGAKW 326
Query: 320 GESGYIRILRD----AGLCGIATAASYPV 344
GE GYIR+ R+ G CGIA ASYP
Sbjct: 327 GEDGYIRMERNVNATTGKCGIAMMASYPT 355
>gi|388519351|gb|AFK47737.1| unknown [Medicago truncatula]
Length = 359
Score = 319 bits (817), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 163/345 (47%), Positives = 212/345 (61%), Gaps = 14/345 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
S I + L+ + S RS E ++ +E+W+ +H + Y EK R IFK
Sbjct: 3 SITITSLLFFSLITLSLAMDTSMRSNEE--VMTMYEEWLVKHHKVYNGLGEKDQRFEIFK 60
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ- 126
NL +I++ N + N TYK+G N+F+D TNEE+R +Y G + + +Y
Sbjct: 61 DNLGFIDEHNAQ-NYTYKVGLNKFADTTNEEYRNMYLGTKNDAKRNVMKIKITTGHRYAF 119
Query: 127 NVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
N D +P +DWR KGAV HIKDQG CGSCWAFS +A VE I +I GKL+ LSEQ+LVD
Sbjct: 120 NSGDRLPVHVDWRSKGAVAHIKDQGSCGSCWAFSTIATVEAINKIVTGKLVSLSEQELVD 179
Query: 186 CSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
C N GC+GGLMD AFE+I+EN G+ TE DYPY+ EG CD ++ A +I YED+
Sbjct: 180 CDRAFNEGCNGGLMDYAFEFIVENGGIDTEQDYPYKGFEGRCDPTRKNAKVVSIDGYEDV 239
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
P +E AL +AV +QPVSV ++A GRA Y+SGV CG N DHGV VVG+G EN
Sbjct: 240 PAYNENALKKAVFHQPVSVAIEAGGRALQLYQSGVFTGRCGTNLDHGVVVVGYGF---EN 296
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
G YWL++NSWG WGE GY ++ R+ G CGIA ASYPV
Sbjct: 297 GVDYWLVRNSWGTNWGEDGYFKLERNVKKINTGKCGIAMQASYPV 341
>gi|326502440|dbj|BAJ95283.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 164/318 (51%), Positives = 217/318 (68%), Gaps = 13/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ ++V +HE+WMA+HGRTY +E EKA RL +F+ N + I+ N + T++L TN F+DL
Sbjct: 37 DSAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADL 96
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTDVPTSIDWREKGAVTHIKDQGQC 152
T+EEFRA TG RP + + S F+Y+N + D S+DWR GAVT +KDQG C
Sbjct: 97 TDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSC 156
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKG 210
G CWAFSAVAAVEG+T+I G+L+ LSEQQLVDC D+ GC+GGLMD AFEY+I G
Sbjct: 157 GCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGG 216
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
L TE+ YPYR +G+C + A AA+I YED+P +E AL+ AV++QPVSV ++
Sbjct: 217 LTTESSYPYRGTDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDS 273
Query: 271 AFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI-- 327
F FY SGVL CG +H + VG+GTA + G KYW++KNSWG +WGE GY+RI
Sbjct: 274 VFRFYDSGVLGGSGCGTELNHAITAVGYGTASD--GTKYWIMKNSWGGSWGEGGYVRIRR 331
Query: 328 -LRDAGLCGIATAASYPV 344
+R G+CG+A ASYPV
Sbjct: 332 GVRGEGVCGLAQLASYPV 349
>gi|297816028|ref|XP_002875897.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
gi|297321735|gb|EFH52156.1| hypothetical protein ARALYDRAFT_347926 [Arabidopsis lyrata subsp.
lyrata]
Length = 361
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 158/347 (45%), Positives = 223/347 (64%), Gaps = 15/347 (4%)
Query: 9 FIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
+I +F ++IL C E + + +++W + H + E+ R N+F+
Sbjct: 5 LLIFLFSLVILETACGFDYEDKEIESEEGLSKLYDRWRSHHS-VPRSLHEREKRFNVFRH 63
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFK 124
N+ ++ +NK+ NR+YKL N+F+DLT EF+ YTG ++R + R S+ +
Sbjct: 64 NVMHVHNSNKK-NRSYKLKLNKFADLTIHEFKNAYTGSKIKHHRMLQGPKR-GSKQFMYD 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
++NV+ +P+S+DWR+KGAVT IK+QG+CGSCWAFS VAAVEGI +I KL+ LSEQ+LV
Sbjct: 122 HENVSKLPSSVDWRKKGAVTEIKNQGKCGSCWAFSTVAAVEGINKIKTNKLVSLSEQELV 181
Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC T+ N GC+GGLM+ AFE+I +N G+ TE YPY +G CD K+ V TI +E+
Sbjct: 182 DCDTNQNEGCNGGLMEIAFEFIKKNGGITTEDSYPYEGIDGKCDASKDNGVLVTIDGHEN 241
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P+ DE ALL+AV+NQPVSV +DA F FY GV DCG +HGVA VG+G+ +
Sbjct: 242 VPENDENALLKAVANQPVSVAIDAGSSDFQFYSEGVFTGDCGTELNHGVATVGYGS---Q 298
Query: 304 NGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVAI 346
G KYW+++NSWG WGE GYI+I R G CGIA ASYP+ +
Sbjct: 299 GGKKYWIVRNSWGTEWGEGGYIKIERGIDEPEGRCGIAMEASYPIKL 345
>gi|118120|sp|P25249.1|CYSP1_HORVU RecName: Full=Cysteine proteinase EP-B 1; Flags: Precursor
gi|1146116|gb|AAA85035.1| cysteine proteinase EPB1 precursor [Hordeum vulgare subsp. vulgare]
Length = 371
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 164/321 (51%), Positives = 205/321 (63%), Gaps = 17/321 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ + +E+W + H R + EK R FK N +I NK G+ Y+L N F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 95 TNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQ--NVTDVPTSIDWREKGAVTHIKDQGQ 151
EFRA + G R P+ + S P F Y NV+D+P S+DWR+KGAVT +KDQG+
Sbjct: 98 DQAEFRATFVGDLRRDTPA--KPPSVPG-FMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKG 210
CGSCWAFS V +VEGI I G L+ LSEQ+L+DC T DN GC GGLMD AFEYI N G
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 211 LATEADYPYRHEEGTCD---NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
L TEA YPYR GTC+ + V I ++D+P E+ L +AV+NQPVSV V+A
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
SG+AF FY GV DCG DHGVAVVG+G AE+ G YW +KNSWG +WGE GYIR+
Sbjct: 275 SGKAFMFYSEGVFTGDCGTELDHGVAVVGYGVAED--GKAYWTVKNSWGPSWGEQGYIRV 332
Query: 328 LRDA----GLCGIATAASYPV 344
+D+ GLCGIA ASYPV
Sbjct: 333 EKDSGASGGLCGIAMEASYPV 353
>gi|242092702|ref|XP_002436841.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
gi|241915064|gb|EER88208.1| hypothetical protein SORBIDRAFT_10g009840 [Sorghum bicolor]
Length = 328
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 160/342 (46%), Positives = 216/342 (63%), Gaps = 32/342 (9%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ I+ L C + + + + ++V +HEQWM Q+ R YKD EKA R +FK N+++
Sbjct: 8 ILAILGLAFFCGAALAARDLNDDSAMVARHEQWMVQYSRVYKDTTEKARRFEVFKANVKF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVP-SVSRQSSRPSTFKYQNVT- 129
IE N GNR + LG N+F+DLTN+EFRA T +P P VS + F+Y+NV+
Sbjct: 68 IESFNAGGNRKFWLGVNQFADLTNDEFRATKTNKGFKPSPVKVS------TGFRYENVSV 121
Query: 130 -DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P +IDWR KGAVT IKDQGQC EGI +I+ GKLI LSEQ+LVDC
Sbjct: 122 DALPATIDWRTKGAVTPIKDQGQC------------EGIVKISTGKLISLSEQELVDCDV 169
Query: 189 --DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
++ GC GGLMD AF++II+N GL TE+ YPY +G C + AAT+ +ED+P
Sbjct: 170 HGEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNS--AATVKGFEDVPA 227
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
DE AL++AV+NQPVSV VD F FY GV+ CG + DHG+A +G+G + +G
Sbjct: 228 NDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGT 285
Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
KYWL+KNSWG TWGE+GY+R+ +D G+CG+A SYP
Sbjct: 286 KYWLLKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 327
>gi|109119897|dbj|BAE96008.1| cysteine proteinase [Triticum aestivum]
Length = 377
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 162/322 (50%), Positives = 206/322 (63%), Gaps = 15/322 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ + +E+W H R + EK R FK N+ +I NK G+R Y+L N F D+
Sbjct: 39 EEALWDLYERWQTAH-RVPRHHAEKHRRFGTFKSNVHFIHSHNKRGDRPYRLRLNRFGDM 97
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPST--FKYQ--NVTDVPTSIDWREKGAVTHIKDQG 150
+ EFRA + G ++ PS F Y NV+D+P S+DWR+KGAVT +K+QG
Sbjct: 98 SQAEFRATFAGSRVSDRRRDGPATPPSVPGFMYAAVNVSDLPRSVDWRQKGAVTGVKNQG 157
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENK 209
+CGSCWAFS V +VEGI I GKL+ LSEQ+L+DC T DN GC GGLMD AFEYI +N
Sbjct: 158 KCGSCWAFSTVVSVEGINAIRTGKLVSLSEQELIDCDTADNDGCEGGLMDNAFEYIKKNG 217
Query: 210 GLATEADYPYRHEEGTCDN---QKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVD 266
GL TEA YPYR GTC K + I ++D+P E+AL +AV+NQPVSV +D
Sbjct: 218 GLTTEAAYPYRAANGTCKAAKVAKSSPMVVHIDGHQDVPANSEEALAKAVANQPVSVGID 277
Query: 267 ASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
ASG+AF FY GV +CG DHGVAVVG+G AE+ G YW +KNSWG +WGE GYIR
Sbjct: 278 ASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAED--GKAYWTVKNSWGPSWGEKGYIR 335
Query: 327 ILRDA----GLCGIATAASYPV 344
+ +D+ GLCGIA ASY V
Sbjct: 336 VEKDSGAEGGLCGIAMEASYAV 357
>gi|363814535|ref|NP_001242660.1| uncharacterized protein LOC100807362 precursor [Glycine max]
gi|255636658|gb|ACU18666.1| unknown [Glycine max]
Length = 367
Score = 318 bits (816), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 164/354 (46%), Positives = 222/354 (62%), Gaps = 22/354 (6%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEK 59
+ +I +F ++ + ++S H + ++ +E+W+ +HG+ Y EK
Sbjct: 10 TILIVLFTVLAVSSALDMSIISYDRSHADKSGWKSDEEVMSIYEEWLVKHGKVYNAVEEK 69
Query: 60 AMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR 119
R IFK NL +IE+ N NRTYK+G N FSDL+NEE+R+ Y G + PS R +R
Sbjct: 70 EKRFQIFKDNLNFIEEHNAV-NRTYKVGLNRFSDLSNEEYRSKYLG-TKIDPS--RMMAR 125
Query: 120 PSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
PS V D +P S+DWR++GAV +K+Q +C CWAFSA+AAVEGI +I G L L
Sbjct: 126 PSRRYSPRVADNLPESVDWRKEGAVVRVKNQSECEGCWAFSAIAAVEGINKIVTGNLTAL 185
Query: 179 SEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
SEQ+L+DC T N GCSGGL+D AFE+II N G+ TE DYP++ +G CD K A A T
Sbjct: 186 SEQELLDCDRTVNAGCSGGLVDYAFEFIINNGGIDTEEDYPFQGADGICDQYKINARAVT 245
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I YE +P DE AL +AV+NQPVSV ++A G+ F Y+SG+ CG + DHGV VG+
Sbjct: 246 IDGYERVPAYDELALKKAVANQPVSVAIEAYGKEFQLYESGIFTGTCGTSIDHGVTAVGY 305
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPVAI 346
GT ENG YW++KNSWGE WGE+GY+ + R+ AG CGIA YP+ I
Sbjct: 306 GT---ENGIDYWIVKNSWGENWGEAGYVGMERNIAEDTAGKCGIAILTLYPIKI 356
>gi|242072390|ref|XP_002446131.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
gi|241937314|gb|EES10459.1| hypothetical protein SORBIDRAFT_06g002140 [Sorghum bicolor]
Length = 328
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 159/341 (46%), Positives = 221/341 (64%), Gaps = 32/341 (9%)
Query: 16 IIILVITCAS---QVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
++ ++ CAS V++ R + + ++VE+HE WM ++GR YKD EKA R +FK N+ +
Sbjct: 7 FLLAILGCASLCSSVLAARELSDAAMVERHENWMVEYGRVYKDAAEKARRFQVFKDNVAF 66
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-FKYQN--VT 129
+E N N + LG N+F+DLT EEF+A G+ V P+T FKY+N V+
Sbjct: 67 VESFNTNKNNKFWLGVNQFADLTTEEFKA-NKGFKPTAEKV------PTTGFKYENLSVS 119
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+PT++DWR KGAVT IK+QGQC AA+EGI +++ G LI LSEQ+LVDC T
Sbjct: 120 ALPTAVDWRTKGAVTPIKNQGQC---------AAMEGIVKLSTGNLISLSEQELVDCDTH 170
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
+ GC GG MD AFE++I+N GLATE++YPY+ +G C + AATI +ED+P
Sbjct: 171 SMDEGCEGGWMDSAFEFVIKNGGLATESNYPYKAVDGKCKGGSKS--AATIKGHEDVPVN 228
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
+E AL++AV+NQPVSV VDAS R F Y GV+ CG DHG+A +G+G E +G K
Sbjct: 229 NEAALMKAVANQPVSVAVDASDRTFMLYSGGVMTGSCGTELDHGIAAIGYGM--ESDGTK 286
Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
YW++KNSWG TWGE G++R+ +D G+CG+A SYP
Sbjct: 287 YWILKNSWGTTWGEKGFLRMEKDITDKRGMCGLAMKPSYPT 327
>gi|356563155|ref|XP_003549830.1| PREDICTED: vignain-like [Glycine max]
Length = 361
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 162/349 (46%), Positives = 226/349 (64%), Gaps = 13/349 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAM 61
++ +K F + + ++L + + + E + + +E+W + H T L EK
Sbjct: 1 MEVKKVFFVALSFALVLRVAESFEFNEKDLESEEGLWDLYERWRSHH--TVSRSLDEKHN 58
Query: 62 RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
R N+FK N+ ++ +NK ++ YKL N F+D+TN EFR++Y G + R + R +
Sbjct: 59 RFNVFKGNVMHVHSSNKM-DKPYKLKLNRFADMTNHEFRSIYAGSKVNHHRMFRGTPRGN 117
Query: 122 -TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
TF YQNV VP+S+DWR+KGAVT +KDQGQCGSCWAFS + AVEGI QI KL+ LSE
Sbjct: 118 GTFMYQNVDRVPSSVDWRKKGAVTDVKDQGQCGSCWAFSTIVAVEGINQIKTHKLVPLSE 177
Query: 181 QQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
Q+LVDC +T N GC+GGLM+ AFE+ I+ G+ T ++YPY ++GTCD K A +I
Sbjct: 178 QELVDCDTTQNQGCNGGLMESAFEF-IKQYGITTASNYPYEAKDGTCDASKVNEPAVSID 236
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
+E++P +E ALL+AV++QPVSV ++A G F FY GV +CG DHGVA+VG+GT
Sbjct: 237 GHENVPVNNEAALLKAVAHQPVSVAIEAGGIDFQFYSEGVFTGNCGTALDHGVAIVGYGT 296
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
++ G KYW +KNSWG WGE GYIR+ R GLCGIA ASYP+
Sbjct: 297 TQD--GTKYWTVKNSWGSEWGEKGYIRMKRSISVKKGLCGIAMEASYPI 343
>gi|297845064|ref|XP_002890413.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
gi|297336255|gb|EFH66672.1| hypothetical protein ARALYDRAFT_472321 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 155/314 (49%), Positives = 217/314 (69%), Gaps = 13/314 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E W++ + Y+ EK +R +FK NL++I++ NK+ ++Y LG NEF+DL++E
Sbjct: 47 LIELFENWISNFEKAYETVEEKLLRFEVFKDNLKHIDETNKK-VKSYWLGLNEFADLSHE 105
Query: 98 EFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
EF+ +Y G + V R R + F Y++V VP S+DWR+KGAV +K+QG CGSCW
Sbjct: 106 EFKKMYLGLKTDI--VRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
AFS VAAVEGI +I G L LSEQ+L+DC T N+GC+GGLMD AFEYI++N GL E
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEE 223
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
DYPY EEGTC+ QK+++ TI ++D+P DE++LL+A+++QP+SV +DASGR F FY
Sbjct: 224 DYPYSMEEGTCEMQKDESETVTIDGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFY 283
Query: 276 KS-GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA--- 331
V + CG + DHGVA VG+G+++ G+ Y ++KNSWG WGE GYIR+ R+
Sbjct: 284 SGVSVFDGRCGVDLDHGVAAVGYGSSK---GSDYIIVKNSWGPKWGEKGYIRLKRNTGKP 340
Query: 332 -GLCGIATAASYPV 344
GLCGI AS+P
Sbjct: 341 EGLCGINKMASFPT 354
>gi|255568345|ref|XP_002525147.1| cysteine protease, putative [Ricinus communis]
gi|223535606|gb|EEF37274.1| cysteine protease, putative [Ricinus communis]
Length = 347
Score = 318 bits (815), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 154/310 (49%), Positives = 206/310 (66%), Gaps = 15/310 (4%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
++++W+ Q+GR Y + E +R I+ N+++IE N + N ++KL N+F+DLTN+EF
Sbjct: 45 RYDKWLEQYGRKYDTKDEYLLRFGIYHSNIQFIEYINSQ-NLSFKLTDNKFADLTNDEFN 103
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
++Y GY + R + ++N TD+P ++DWRE GAVT IKDQGQCGSCWAFSA
Sbjct: 104 SIYLGY-----QIRSYKRRNLSHMHENSTDLPDAVDWRENGAVTPIKDQGQCGSCWAFSA 158
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYP 218
VAAVEGI +I G L+ LSEQ+LVDC DN GC+GG M+KAF +I GL TE DYP
Sbjct: 159 VAAVEGINKIKTGNLVSLSEQELVDCDVNGDNKGCNGGFMEKAFTFIKSIGGLTTENDYP 218
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
Y+ +G+C+ K A I YE +P +E +L AVS QPVSV +DASG F Y G
Sbjct: 219 YKGTDGSCEKAKTDNHAVIIGGYETVPANNENSLKVAVSKQPVSVAIDASGYEFQLYSEG 278
Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLC 334
V + CG +HGV +VG+G + NG KYWL+KNSWG+ WGESGYIR+ RD+ G+C
Sbjct: 279 VFSGYCGIQLNHGVTIVGYG---DNNGQKYWLVKNSWGKGWGESGYIRMKRDSSDTKGMC 335
Query: 335 GIATAASYPV 344
GIA SYP+
Sbjct: 336 GIAMEPSYPI 345
>gi|297852302|ref|XP_002894032.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
gi|297339874|gb|EFH70291.1| F2G19.31/F2G19.31 [Arabidopsis lyrata subsp. lyrata]
Length = 455
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 158/322 (49%), Positives = 214/322 (66%), Gaps = 16/322 (4%)
Query: 30 GRSMHEPSIVEKHEQWMAQHGRTYKDE--LEKAMRLNIFKQNLEYIEKANKEGNRTYKLG 87
GRS E ++ +E W+ +HG+ +EK R IFK NL +I+ NK+ N +Y+LG
Sbjct: 33 GRSDAE--VMSIYEAWLVKHGKAQNQNSLVEKDRRFEIFKDNLRFIDDHNKK-NLSYRLG 89
Query: 88 TNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
F+DLTN+E+R+ Y G R S R ++ + ++P SIDWR+KGAV +K
Sbjct: 90 LTRFADLTNDEYRSKYLGAKMEKKGERRTSQR---YEARVGDELPESIDWRKKGAVAEVK 146
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYII 206
DQG CGSCWAFS + AVEGI QI G LI LSEQ+LVDC T N GC+GGLMD AFE+II
Sbjct: 147 DQGSCGSCWAFSTIGAVEGINQIVTGDLITLSEQELVDCDTSYNEGCNGGLMDYAFEFII 206
Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVD 266
+N G+ T+ DYPY+ +GTCD ++ A TI YED+P E++L +AV++QPVSV ++
Sbjct: 207 KNGGIDTDKDYPYKGVDGTCDQIRKNAKVVTIDSYEDVPTYSEESLKKAVAHQPVSVAIE 266
Query: 267 ASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
A GRAF Y SG+ + CG DHGV VG+GT ENG YW+++NSWG++WGESGY++
Sbjct: 267 AGGRAFQLYDSGIFDGTCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGKSWGESGYLK 323
Query: 327 ILRD----AGLCGIATAASYPV 344
+ R+ +G CGIA SYP+
Sbjct: 324 MARNIASSSGKCGIAIEPSYPI 345
>gi|116781957|gb|ABK22314.1| unknown [Picea sitchensis]
Length = 369
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 158/317 (49%), Positives = 207/317 (65%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYK-DELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ ++ W QH + D E A R IFK+N++YI+ NK+ + YKLG N+F+D
Sbjct: 39 EKSLRSLYDNWALQHRSSRSLDSEEHAERFEIFKENVKYIDSVNKK-DSPYKLGLNKFAD 97
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
L+NEEF+A+Y G + + + +F YQN +P SIDWR+KGAV +K+QG CG
Sbjct: 98 LSNEEFKAIYMGTKMDLRG--DREVQSGSFMYQNSEPLPASIDWRQKGAVAAVKNQGHCG 155
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
SCWAFS VA+VEGI IT G L+ LSEQQLVDCST+N GC+GGLMD AF+YII N G+ T
Sbjct: 156 SCWAFSTVASVEGINYITTGNLVSLSEQQLVDCSTENSGCNGGLMDTAFQYIINNGGIVT 215
Query: 214 EADYPYRHEEGTCDNQK--EKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
E +YPY E C + K + I +ED+P +EQAL +AV++QPVSV ++ASG+
Sbjct: 216 EDNYPYTAEATECSSTKINSQTTRVVIDGFEDVPANNEQALKEAVAHQPVSVAIEASGQD 275
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD- 330
F FY +GV CG DHGV VG+GT+ E G YW+++NSWG WGE GYIR+ +
Sbjct: 276 FQFYSTGVFTGKCGTALDHGVVAVGYGTSPE--GINYWIVRNSWGPKWGEEGYIRMQQGI 333
Query: 331 ---AGLCGIATAASYPV 344
G CGIA ASYP
Sbjct: 334 EAAEGKCGIAMQASYPT 350
>gi|118124|sp|P25250.1|CYSP2_HORVU RecName: Full=Cysteine proteinase EP-B 2; Flags: Precursor
gi|1146118|gb|AAA85036.1| cysteine proteinase EPB2 precursor [Hordeum vulgare subsp. vulgare]
Length = 373
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 164/321 (51%), Positives = 205/321 (63%), Gaps = 17/321 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ + +E+W + H R + EK R FK N +I NK G+ Y+L N F D+
Sbjct: 39 EEALWDLYERWQSAH-RVRRHHAEKHRRFGTFKSNAHFIHSHNKRGDHPYRLHLNRFGDM 97
Query: 95 TNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQ--NVTDVPTSIDWREKGAVTHIKDQGQ 151
EFRA + G R PS + S P F Y NV+D+P S+DWR+KGAVT +KDQG+
Sbjct: 98 DQAEFRATFVGDLRRDTPS--KPPSVPG-FMYAALNVSDLPPSVDWRQKGAVTGVKDQGK 154
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKG 210
CGSCWAFS V +VEGI I G L+ LSEQ+L+DC T DN GC GGLMD AFEYI N G
Sbjct: 155 CGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCDTADNDGCQGGLMDNAFEYIKNNGG 214
Query: 211 LATEADYPYRHEEGTCD---NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
L TEA YPYR GTC+ + V I ++D+P E+ L +AV+NQPVSV V+A
Sbjct: 215 LITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQDVPANSEEDLARAVANQPVSVAVEA 274
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
SG+AF FY GV +CG DHGVAVVG+G AE+ G YW +KNSWG +WGE GYIR+
Sbjct: 275 SGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAED--GKAYWTVKNSWGPSWGEQGYIRV 332
Query: 328 LRDA----GLCGIATAASYPV 344
+D+ GLCGIA ASYPV
Sbjct: 333 EKDSGASGGLCGIAMEASYPV 353
>gi|225456820|ref|XP_002278323.1| PREDICTED: vignain [Vitis vinifera]
Length = 360
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 161/317 (50%), Positives = 208/317 (65%), Gaps = 12/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S+ +E+W + H T L EK R N+FK+N+ ++ + NK+ + YKL N+F+D
Sbjct: 31 EESLWNLYERWRSHH--TVSRSLDEKHKRFNVFKENVNFVHEFNKK-DEPYKLKLNKFAD 87
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
+TN EFR+ Y G + R S + +F Y+ V VP S+DWR+KGAVT IKDQGQC
Sbjct: 88 MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 147
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS V AVEGI I KL+ LSEQ+LVDC T +N GC+GGLM AFE+I E G+
Sbjct: 148 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGI 207
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE YPY E+GTCD K + +I +E +P +E ALL+A +NQP+SV +DA G A
Sbjct: 208 TTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSA 267
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-- 329
F FY GV CG + DHGVA+VG+GT + G KYW++KNSWG WGE+GYIR+ R
Sbjct: 268 FQFYSEGVFAGRCGTDLDHGVAIVGYGTTLD--GTKYWIVKNSWGTDWGENGYIRMKRGI 325
Query: 330 --DAGLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 326 SAKEGLCGIAVEASYPI 342
>gi|302759380|ref|XP_002963113.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
gi|300169974|gb|EFJ36576.1| hypothetical protein SELMODRAFT_270344 [Selaginella moellendorffii]
Length = 479
Score = 318 bits (814), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 165/319 (51%), Positives = 210/319 (65%), Gaps = 26/319 (8%)
Query: 43 EQWMAQHGRTYKDEL--------EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ WM QHG++Y + EKA R IFK NL +I N E N+ Y LG N F+DL
Sbjct: 58 DSWMLQHGKSYAENALSGDSQAGEKATRYGIFKDNLRFIHGEN-EKNQGYFLGLNAFADL 116
Query: 95 TNEEFRALYTG--YNRPVPSVSRQSSRPSTFKYQNV--TDVPTSIDWREKGAVTHIKDQG 150
TNEEFRA G ++R SR+ + F+Y +V D+P SIDWREKGAV +KDQG
Sbjct: 117 TNEEFRAQRHGGRFDR-----SRERTSYEEFRYGSVQLKDLPDSIDWREKGAVVGVKDQG 171
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENK 209
CGSCWAFSAVAA+EG+ ++ G+L+ LSEQ+LVDC ++ GC+GGLMD AF ++I+N
Sbjct: 172 SCGSCWAFSAVAAIEGVNKLATGELVSLSEQELVDCDKGEDEGCNGGLMDYAFGFVIKNG 231
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
GL TEADYPY+ CD K A TI YED+P DE ALL+AV++QPVSV +DA G
Sbjct: 232 GLDTEADYPYKGYGTRCDRSKMNAKVVTIDGYEDVPVNDETALLKAVAHQPVSVAIDAGG 291
Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
+ FY+SG+ CG + DHGV VG+G +E+G YW+IKNSWG WGE GYI++ R
Sbjct: 292 SSMQFYRSGIFTGRCGTDLDHGVTNVGYG---KEDGKAYWIIKNSWGSNWGEKGYIKMAR 348
Query: 330 D----AGLCGIATAASYPV 344
+ AGLCGI ASYP
Sbjct: 349 NTGLAAGLCGINMEASYPT 367
>gi|242077600|ref|XP_002448736.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
gi|241939919|gb|EES13064.1| hypothetical protein SORBIDRAFT_06g032320 [Sorghum bicolor]
Length = 467
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 152/318 (47%), Positives = 212/318 (66%), Gaps = 13/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGTNEFS 92
E + +E W+ +HGR + L E R +F NL +++ N + G ++LG N+F+
Sbjct: 49 EAEVRAMYELWLVEHGRRVSNVLGEHDSRFRVFWDNLRFVDAHNERAGEHGFRLGMNQFA 108
Query: 93 DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
DLTN+EFRA Y G +P+ ++ +++ ++P S+DWREKGAV +K+QGQC
Sbjct: 109 DLTNDEFRAAYLGAR--IPAARSGNAVGEMYRHDGAEELPESVDWREKGAVAPVKNQGQC 166
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
GSCWAFSAV++VE I QI G+++ LSEQ+LV+CSTD N GC+GGLMD AF +II+N G
Sbjct: 167 GSCWAFSAVSSVESINQIVTGEMVTLSEQELVECSTDGGNSGCNGGLMDAAFNFIIKNGG 226
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
+ TE DYPY+ +G CD + A +I +ED+P+ DE++L +AV++QPVSV ++A GR
Sbjct: 227 IDTEDDYPYKAVDGKCDINRRNAKVVSIDAFEDVPENDEKSLQKAVAHQPVSVAIEAGGR 286
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
F YKSGV + C N DHGV VG+GT ENG YW+++NSWG WGE+GYIR+ R+
Sbjct: 287 QFQLYKSGVFSGSCTTNLDHGVVAVGYGT---ENGKDYWIVRNSWGPKWGEAGYIRMERN 343
Query: 331 ----AGLCGIATAASYPV 344
G CGIA ASYP
Sbjct: 344 INATTGKCGIAMMASYPT 361
>gi|22093636|dbj|BAC06931.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|50510021|dbj|BAD30633.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 352
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 161/343 (46%), Positives = 225/343 (65%), Gaps = 15/343 (4%)
Query: 13 MFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
M ++LV+ ++ +M ++ +H++WMA+HGRTYKD EKA R +FK N+
Sbjct: 11 MAASLLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANV 70
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ I+++N GN+ Y+L TN F+DLT+ EF A+YTGYN P+ + ++ +T + + D
Sbjct: 71 DLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYN---PANTMYAAANATTRLSSEDD 127
Query: 131 -VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
P +DWR++GAVT +K+Q CG CWAFS VAAVEGI QIT G+L+ LSEQQL+DC+ D
Sbjct: 128 QQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCA-D 186
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD---NQKEKAVAATISKYEDLPK 246
N GC+GG +D AF+Y+ + G+ TEA Y Y+ +G C + VAATIS Y+ +
Sbjct: 187 NGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 246
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGT-AEEEN 304
DE +L AV++QPVSV ++ SG F Y SGV AD CG DH VAVVG+G A+
Sbjct: 247 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 306
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
G YW+IKNSWG TWG+ GY+++ +D G CG+A A SYPV
Sbjct: 307 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 349
>gi|2224810|emb|CAB09698.1| cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 163/318 (51%), Positives = 216/318 (67%), Gaps = 13/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ ++V +HE+WMA+HGRTY +E EKA RL +F+ N + I+ N + T++L TN F+DL
Sbjct: 37 DAAMVSRHEKWMAEHGRTYANEEEKARRLEVFRANAKLIDSFNSAEDSTHRLATNRFADL 96
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN--VTDVPTSIDWREKGAVTHIKDQGQC 152
T+EEFRA TG RP + + S F+Y+N + D S+DWR GAVT +KDQG C
Sbjct: 97 TDEEFRAARTGLRRPPAAAAGAGSGAGGFRYENFSLADAAGSMDWRAMGAVTGVKDQGSC 156
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKG 210
G CWAFSAVAAVEG+T+I G+L+ LSEQQLVDC D+ GC+GGLMD AFEY+I G
Sbjct: 157 GCCWAFSAVAAVEGLTKIRTGRLVSLSEQQLVDCDVYGDDEGCAGGLMDNAFEYMINRGG 216
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
L TE+ YPYR +G+C + A AA+I YED+P +E AL+ AV++QPVSV ++
Sbjct: 217 LTTESSYPYRGTDGSC---RRSASAASIRGYEDVPANNEAALMAAVAHQPVSVAINGGDS 273
Query: 271 AFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI-- 327
F FY SGVL CG +H + G+GTA + G KYW++KNSWG +WGE GY+RI
Sbjct: 274 VFRFYDSGVLGGSGCGTELNHAITAAGYGTASD--GTKYWIMKNSWGGSWGEGGYVRIRR 331
Query: 328 -LRDAGLCGIATAASYPV 344
+R G+CG+A ASYPV
Sbjct: 332 GVRGEGVCGLAQLASYPV 349
>gi|50355615|dbj|BAD29956.1| cysteine protease [Daucus carota]
Length = 423
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 154/305 (50%), Positives = 208/305 (68%), Gaps = 12/305 (3%)
Query: 46 MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG 105
+ +H + Y K R IFK NL +I++ NK N+++KLG N+F+DL+NEE+++++ G
Sbjct: 11 LVKHHKNYNALGAKEKRFEIFKDNLRFIDEHNKGVNQSFKLGLNKFADLSNEEYKSMFLG 70
Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVE 165
R V R+ FKY ++P S+DWREKGAV +KDQGQCGSCWAFS VAAVE
Sbjct: 71 -GRMV--RDRKGFESDRFKYGVGDELPQSVDWREKGAVAPVKDQGQCGSCWAFSTVAAVE 127
Query: 166 GITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEG 224
GI QI G LI LSEQ+LVDC N GC+GG MD AFE+I++N G+ TE DYPY+ +G
Sbjct: 128 GINQIATGDLISLSEQELVDCDKGFNQGCNGGFMDYAFEFIVKNGGIDTEDDYPYKGVDG 187
Query: 225 TCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADC 284
CD ++ A TI+ +ED+P+ DE++L +AV++QPVSV ++A GRAF Y+SG+ N C
Sbjct: 188 QCDQNRKNAKVVTINGFEDVPQNDEKSLKKAVAHQPVSVAIEAGGRAFQLYESGIFNGLC 247
Query: 285 GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-----DAGLCGIATA 339
G + DHGV VG+GT E+G YW+++NSWG WGE+GYIR+ R + G CGIA
Sbjct: 248 GTDLDHGVVAVGYGT---EDGKDYWIVRNSWGPNWGENGYIRLERNVASTNTGKCGIAMQ 304
Query: 340 ASYPV 344
SYP
Sbjct: 305 PSYPT 309
>gi|351721126|ref|NP_001237199.1| cysteine proteinase precursor [Glycine max]
gi|31559530|dbj|BAC77523.1| cysteine proteinase [Glycine max]
gi|31559532|dbj|BAC77524.1| cysteine proteinase [Glycine max]
Length = 362
Score = 317 bits (813), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 160/345 (46%), Positives = 218/345 (63%), Gaps = 20/345 (5%)
Query: 14 FVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQH--GRTYKDELEKAMRLNI 65
F+ ++L ++ V + H E S+ + +E+W + H R+ D K R N+
Sbjct: 6 FLWVVLSLSLVLGVANSFDFHDKDLESEESLWDLYERWRSHHTVSRSLGD---KHKRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFK 124
FK N+ ++ NK ++ YKL N+F+D+TN EFR+ Y G + R R + TF
Sbjct: 63 FKANMMHVHNTNKM-DKPYKLKLNKFADMTNHEFRSTYAGSKVNHHRMFRDMPRGNGTFM 121
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
Y+ V VP S+DWR+KGAVT +KDQG CGSCWAFS V AVEGI QI KL+ LSEQ+LV
Sbjct: 122 YEKVGSVPASVDWRKKGAVTDVKDQGHCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELV 181
Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC T+ N GC+GGLM+ AF++I + G+ TE+ YPY ++GTCD K +A +I +E+
Sbjct: 182 DCDTEENAGCNGGLMESAFQFIKQKGGITTESYYPYTAQDGTCDASKANDLAVSIDGHEN 241
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P DE ALL+AV+NQPVSV +DA G F FY GV DC +HGVA+VG+G +
Sbjct: 242 VPGNDENALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGDCSTELNHGVAIVGYGATVD- 300
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
G YW+++NSWG WGE GYIR+ R+ GLCGIA ASYP+
Sbjct: 301 -GTSYWIVRNSWGPEWGELGYIRMQRNISKKEGLCGIAMLASYPI 344
>gi|218198967|gb|EEC81394.1| hypothetical protein OsI_24614 [Oryza sativa Indica Group]
Length = 342
Score = 317 bits (813), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 161/343 (46%), Positives = 225/343 (65%), Gaps = 15/343 (4%)
Query: 13 MFVIIILVITCASQVVSGRSM--HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
M ++LV+ ++ +M ++ +H++WMA+HGRTYKD EKA R +FK N+
Sbjct: 1 MAASLLLVVAGGLSTMAKVTMASRAGTMEARHDKWMAEHGRTYKDAAEKARRFRVFKANV 60
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ I+++N GN+ Y+L TN F+DLT+ EF A+YTGYN P+ + ++ +T + + D
Sbjct: 61 DLIDRSNAAGNKRYRLATNRFTDLTDAEFAAMYTGYN---PANTMYAAANATTRLSSEDD 117
Query: 131 -VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
P +DWR++GAVT +K+Q CG CWAFS VAAVEGI QIT G+L+ LSEQQL+DC+ D
Sbjct: 118 QQPAEVDWRQQGAVTGVKNQRSCGCCWAFSTVAAVEGIHQITTGELVSLSEQQLLDCA-D 176
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD---NQKEKAVAATISKYEDLPK 246
N GC+GG +D AF+Y+ + G+ TEA Y Y+ +G C + VAATIS Y+ +
Sbjct: 177 NGGCTGGSLDNAFQYMANSGGVTTEAAYAYQGAQGACQFDASSSASGVAATISGYQRVNP 236
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD-CGNNCDHGVAVVGFGT-AEEEN 304
DE +L AV++QPVSV ++ SG F Y SGV AD CG DH VAVVG+G A+
Sbjct: 237 NDEGSLAAAVASQPVSVAIEGSGAMFRHYGSGVFTADSCGTKLDHAVAVVGYGAEADGSG 296
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
G YW+IKNSWG TWG+ GY+++ +D G CG+A A SYPV
Sbjct: 297 GGGYWIIKNSWGTTWGDGGYMKLEKDVGSQGACGVAMAPSYPV 339
>gi|374530932|gb|AEP83812.2| cysteine endopeptidase EP8 [Secale cereale x Triticum durum]
Length = 364
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 154/289 (53%), Positives = 201/289 (69%), Gaps = 10/289 (3%)
Query: 62 RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQSSRP 120
R N+FK+N YI + NK+ +R ++L N+F+D+T +EFR Y G R S+S
Sbjct: 62 RFNVFKENARYIHEGNKK-DRPFRLALNKFADMTTDEFRRTYAGSRVRHHLSLSGGRRGD 120
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+F+Y + ++P ++DWR+KGAVT IKDQGQCGSCWAFS + AVEGI +I GKL+ LSE
Sbjct: 121 GSFRYGDADNLPPAVDWRQKGAVTAIKDQGQCGSCWAFSTIVAVEGINKIRTGKLVSLSE 180
Query: 181 QQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
Q+L+DC + +N GC GGLMD AF++I +N G+ TE++YPY+ E+G+CD KEKA A TI
Sbjct: 181 QELMDCDNVNNQGCDGGLMDYAFQFIHKN-GITTESNYPYQGEQGSCDLAKEKAHAVTID 239
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
YED+P DE AL +AV+ QPVSV +DASG F FY GV +C + DHGVA VG+GT
Sbjct: 240 GYEDVPANDESALQKAVAGQPVSVAIDASGNDFQFYSEGVFTGECSTDLDHGVAAVGYGT 299
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+ G KYW++KNSWGE WGE GYIR+ R G CGIA ASYP
Sbjct: 300 TRD--GTKYWIVKNSWGEDWGEKGYIRMQRGVSQAEGQCGIAMQASYPT 346
>gi|182375363|gb|ACB87490.1| mucunain [Mucuna pruriens]
Length = 422
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 158/313 (50%), Positives = 204/313 (65%), Gaps = 17/313 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+EQW+ +HG+ Y EK R +IFK NL +I+ N + NRTYKLG N F+DLTNEE+RA
Sbjct: 4 YEQWLVKHGKAYNALGEKDKRFDIFKDNLRFIDDHNAD-NRTYKLGLNRFADLTNEEYRA 62
Query: 102 LYTGY----NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
Y G NR QS+R + + ++P S+DWR + AV +KDQG CGSCWA
Sbjct: 63 RYLGTRIDPNRRFVKTKTQSNR---YAPRVGDNLPESVDWRNESAVLPVKDQGNCGSCWA 119
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLMD A+E+II N G+ +E D
Sbjct: 120 FSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAYEFIINNGGIDSEED 179
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPYR +GTCD ++ A TI YED+P DE AL +AV+NQPVSV ++ GR F Y
Sbjct: 180 YPYRAVDGTCDQYRKNAKVVTIDSYEDVPANDELALKKAVANQPVSVAIEGGGREFQLYV 239
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----A 331
SGV CG DHGV VG+G+ + G YW+++NSWG +WGE GY+R+ R+ +
Sbjct: 240 SGVFTGRCGTALDHGVVAVGYGSVK---GHDYWIVRNSWGASWGEEGYVRLERNLAKSRS 296
Query: 332 GLCGIATAASYPV 344
G CGIA SYP+
Sbjct: 297 GKCGIAIEPSYPI 309
>gi|224102377|ref|XP_002312656.1| predicted protein [Populus trichocarpa]
gi|222852476|gb|EEE90023.1| predicted protein [Populus trichocarpa]
Length = 358
Score = 317 bits (812), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 154/345 (44%), Positives = 219/345 (63%), Gaps = 9/345 (2%)
Query: 7 KSFIIPMF-VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNI 65
+ I+ +F V+++ + + E + + +E+W + H + EK R N+
Sbjct: 4 RKVILAVFSVVLVFRLADSFDYTEEDLASEERLRDLYERWRSHH-TVSRSLAEKQERFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
FK+NL++I K N + +R YKL N F+D+TN EF Y G V R + + +
Sbjct: 63 FKENLKHIHKVNHK-DRPYKLKLNSFADMTNHEFLQHYGGSKVSHYRVLRGQRQGTGSMH 121
Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
++ + +P+S+DWR+ GAVT IKDQG+CGSCWAFS VAAVEGI +I G+LI LSEQ+LVD
Sbjct: 122 EDTSKLPSSVDWRKNGAVTGIKDQGKCGSCWAFSTVAAVEGINKIKTGELISLSEQELVD 181
Query: 186 CSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
C +DNHGC+GGLM+ AF +I + GL +E YPYR +E CD+ K + I YE +P
Sbjct: 182 CDSDNHGCNGGLMEDAFNFIKQIGGLTSENTYPYRAKEEPCDSNKMNSPVVNIDGYEMVP 241
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+ DE AL++AV+NQPV++ +DA G+ FY + DCG +HGVA+VG+GT ++ G
Sbjct: 242 ENDENALMKAVANQPVAIAMDAGGKDLQFYSEAIFTGDCGTELNHGVALVGYGTTQD--G 299
Query: 306 AKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVAI 346
KYW++KNSWG WGE GYIR+ R + GLCGI ASYPV +
Sbjct: 300 TKYWIVKNSWGTDWGEKGYIRMQRGIDAEEGLCGITMEASYPVKL 344
>gi|1169186|sp|P43156.1|CYSP_HEMSP RecName: Full=Thiol protease SEN102; Flags: Precursor
gi|396568|emb|CAA52425.1| thiol-protease [Hemerocallis hybrid cultivar]
Length = 360
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 160/349 (45%), Positives = 220/349 (63%), Gaps = 23/349 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEK------HEQWMAQHGRTYKDELEKAMRLNI 65
P F+ + LV + E + + +E+W H +D EK R N+
Sbjct: 4 PKFIALALVALSFLSIAQSIPFTEKDLASEDSLWNLYEKWRTHH-TVARDLDEKNRRFNV 62
Query: 66 FKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRPS 121
FK+N+++I + N++ + YKL N+F D+TN+EFR+ Y G ++R + + +
Sbjct: 63 FKENVKFIHEFNQKKDAPYKLALNKFGDMTNQEFRSKYAGSKIQHHRSQRGIQKNTG--- 119
Query: 122 TFKYQNVTDVPT-SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+F Y+NV +P SIDWR KGAVT +KDQGQCGSCWAFS +A+VEGI QI G+L+ LSE
Sbjct: 120 SFMYENVGSLPAASIDWRAKGAVTGVKDQGQCGSCWAFSTIASVEGINQIKTGELVSLSE 179
Query: 181 QQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
Q+LVDC T N GC+GGLMD AFE+I +N G+ TE YPY ++GTC + + +I
Sbjct: 180 QELVDCDTSYNEGCNGGLMDYAFEFIQKN-GITTEDSYPYAEQDGTCASNLLNSPVVSID 238
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
++D+P +E AL+QAV+NQP+SV ++ASG F FY GV CG DHGVA+VG+G
Sbjct: 239 GHQDVPANNENALMQAVANQPISVSIEASGYGFQFYSEGVFTGRCGTELDHGVAIVGYGA 298
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
+ G KYW++KNSWGE WGESGYIR+ R G CGIA ASYP+
Sbjct: 299 TRD--GTKYWIVKNSWGEEWGESGYIRMQRGISDKRGKCGIAMEASYPI 345
>gi|89274062|dbj|BAE80740.1| cysteine proteinase [Platycodon grandiflorus]
Length = 462
Score = 317 bits (812), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 156/313 (49%), Positives = 204/313 (65%), Gaps = 12/313 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ +E W+ +HG++Y EK R IFK NL +I++ N E N +YK+G N F+DLTNE
Sbjct: 46 VMAMYESWLVKHGKSYNALGEKEKRFQIFKDNLRFIDEHNAEENLSYKVGLNRFADLTNE 105
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
E+R+ Y G + P +S+ S + + +P S+DWR KGAV IKDQG CGSCWA
Sbjct: 106 EYRSTYLG-AKSKPKLSKVKS--DRYAPRVGDSLPESVDWRAKGAVAPIKDQGSCGSCWA 162
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS V AVEGI QI G+LI LSEQ+LVDC N GC GGLMD FE+II N G+ T+ D
Sbjct: 163 FSTVNAVEGINQIVTGELITLSEQELVDCDKSYNEGCDGGLMDYGFEFIINNGGIDTDKD 222
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY + CD ++ A TI YED+P +E+AL +AV++QPVSV ++ GRAF FY
Sbjct: 223 YPYLGRDARCDQYRKNAKVVTIDSYEDVPVNNEEALKKAVASQPVSVGIEGGGRAFQFYD 282
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----A 331
SG+ CG DHGV VVG+GT E G YW+++NSWG +WGE+GYIR+ R+
Sbjct: 283 SGIFTGKCGTALDHGVNVVGYGT---EKGKDYWIVRNSWGSSWGEAGYIRMERNLAGTSV 339
Query: 332 GLCGIATAASYPV 344
G CGIA SYP+
Sbjct: 340 GKCGIAMEPSYPL 352
>gi|18402225|ref|NP_566633.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
gi|11994461|dbj|BAB02463.1| cysteine proteinase [Arabidopsis thaliana]
gi|17065298|gb|AAL32803.1| cysteine proteinase [Arabidopsis thaliana]
gi|20260004|gb|AAM13349.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642713|gb|AEE76234.1| Granulin repeat cysteine protease family protein [Arabidopsis
thaliana]
Length = 452
Score = 317 bits (811), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 153/343 (44%), Positives = 219/343 (63%), Gaps = 12/343 (3%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
+ + +F ++++ ++ S + + +E +E+W+ ++ + Y EK R IFK
Sbjct: 9 TLALLIFSVLLISLSLGSVTATETTRNEAEARRMYERWLVENRKNYNGLGEKERRFEIFK 68
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL+++E+ + NRTY++G F+DLTN+EFRA+Y R +R + + Y+
Sbjct: 69 DNLKFVEEHSSIPNRTYEVGLTRFADLTNDEFRAIYL---RSKMERTRVPVKGEKYLYKV 125
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P +IDWR KGAV +KDQG CGSCWAFSA+ AVEGI QI G+LI LSEQ+LVDC
Sbjct: 126 GDSLPDAIDWRAKGAVNPVKDQGSCGSCWAFSAIGAVEGINQIKTGELISLSEQELVDCD 185
Query: 188 TD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEE-GTCDNQKEKAVAATISKYEDLP 245
T N GC GGLMD AF++IIEN G+ TE DYPY + C++ K+ TI YED+P
Sbjct: 186 TSYNDGCGGGLMDYAFKFIIENGGIDTEEDYPYIATDVNVCNSDKKNTRVVTIDGYEDVP 245
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+ DE++L +A++NQP+SV ++A GRAF Y SGV CG + DHGV VG+G+ E G
Sbjct: 246 QNDEKSLKKALANQPISVAIEAGGRAFQLYTSGVFTGTCGTSLDHGVVAVGYGS---EGG 302
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
YW+++NSWG WGESGY ++ R+ +G CG+A ASYP
Sbjct: 303 QDYWIVRNSWGSNWGESGYFKLERNIKESSGKCGVAMMASYPT 345
>gi|194352752|emb|CAQ00104.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 351
Score = 317 bits (811), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 167/350 (47%), Positives = 228/350 (65%), Gaps = 22/350 (6%)
Query: 11 IPMFVIIILVITCASQ----VVSGRSMHEPS----IVEKHEQWMAQHGRTYKDELEKAMR 62
+ + V+++ V C ++ + G S + S +VE E+W+A+H + Y EK R
Sbjct: 5 LSVAVLLLCVGACVARNSDFSIVGYSEEDLSSHDRLVELFEKWLAKHQKAYASFEEKLHR 64
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
+FK NL+ I++ N+E +Y LG NEF+DLT++EF+ Y G + + S +
Sbjct: 65 FEVFKDNLKLIDEINRE-VTSYWLGLNEFADLTHDEFKTTYLG----LSPPPARRSSSRS 119
Query: 123 FKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
F+Y+NV D+P ++DWR+KGAVT +K+QGQCGSCWAFS VAAVEGI I G L LSE
Sbjct: 120 FRYENVAAHDLPKAVDWRKKGAVTDVKNQGQCGSCWAFSTVAAVEGINAIVTGNLTALSE 179
Query: 181 QQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC-DNQKEKAVAATI 238
Q+L+DCS D N GC+GG+MD AF YI + GL TE YPY EEG+C D +K ++ A +I
Sbjct: 180 QELIDCSVDGNSGCNGGMMDYAFSYIASSGGLHTEEAYPYLMEEGSCGDGKKSESEAVSI 239
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
S YED+P DEQAL++A+++QPVSV ++ASGR F FY GV + CG DHGVA VG+G
Sbjct: 240 SGYEDVPTKDEQALIKALAHQPVSVAIEASGRHFQFYSGGVFDGPCGAQLDHGVAAVGYG 299
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+ ++ G Y ++KNSWG WGE GYIR+ R GLCGI ASYP
Sbjct: 300 S-DKGKGHDYIIVKNSWGGKWGEKGYIRMKRGTGKSEGLCGINKMASYPT 348
>gi|30141021|dbj|BAC75924.1| cysteine protease-2 [Helianthus annuus]
Length = 362
Score = 316 bits (810), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 158/317 (49%), Positives = 209/317 (65%), Gaps = 13/317 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ + +E+W +H + + EK R N+FK N+ ++ + NK ++ YKL N+F+D+
Sbjct: 33 EDNLWDMYERW--RH-KVATNHGEKLRRFNVFKSNVLHVHETNKM-DKPYKLKLNKFADM 88
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRP--STFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
TN EFR++Y G S Q R TF Y NV VPTS+DWR+KGAV +KDQGQC
Sbjct: 89 TNHEFRSVYAGSKIHHHDRSLQGDRSGSKTFMYANVESVPTSVDWRKKGAVAPVKDQGQC 148
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS VAAVEGI +I +L+ LSEQ+LVDC T +N GC+GGLMD AF++I + GL
Sbjct: 149 GSCWAFSTVAAVEGINKIKTNELVSLSEQELVDCDTLENQGCNGGLMDLAFDFIKKTGGL 208
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
E YPY E+G CD+ K + +I +ED+PK DEQ+L++AV+NQPV+V +DA
Sbjct: 209 TREDAYPYAAEDGKCDSNKMNSPVVSIDGHEDVPKNDEQSLMKAVANQPVAVAIDAGSSD 268
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-- 329
F FY GV CG DHGVA VG+GT + G KYW+++NSWG WGE GYIR+ R
Sbjct: 269 FQFYSEGVFTGKCGTQLDHGVAAVGYGTTLD--GTKYWIVRNSWGSEWGEKGYIRMERGI 326
Query: 330 --DAGLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 327 SDKRGLCGIAMEASYPI 343
>gi|1173630|gb|AAB37233.1| cysteine proteinase [Phalaenopsis sp. SM9108]
Length = 359
Score = 316 bits (810), Expect = 9e-84, Method: Compositional matrix adjust.
Identities = 163/350 (46%), Positives = 227/350 (64%), Gaps = 21/350 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMH------EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
+ +F +I++ AS + + E S+ +E+W + H + +D EK R N
Sbjct: 1 MKLFSLILVASFLASVAATAIDIADKDLETEDSLWNLYERWRSHHTVS-RDLDEKQKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSSRP 120
+FK+N YI NK + YKL N+F+DLTN EFR+ Y G ++R + SR+
Sbjct: 60 VFKENPRYIHDFNKRKDIPYKLRLNKFADLTNHEFRSTYAGSRINHHRSLRG-SRRGGAT 118
Query: 121 STFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
++F YQ++ +P SIDWR+KGAVT +KDQGQCGSCWAFS VAAVEGI QI KL+ L
Sbjct: 119 NSFMYQSLDSRSLPASIDWRQKGAVTAVKDQGQCGSCWAFSTVAAVEGINQIKTKKLLSL 178
Query: 179 SEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
SEQ+L+DC TD N+GC+GGLMD AF++I +N G+++EA+YPY E+ C +K+ V +
Sbjct: 179 SEQELIDCDTDENNGCNGGLMDYAFDFIKKNGGISSEAEYPYAAEDSYCATEKKSHV-VS 237
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I +ED+P DE +LL+AV+NQPVS+ ++ASG F FY GV G DHGVA+VG+
Sbjct: 238 IDGHEDVPANDEDSLLKAVANQPVSIAIEASGYDFQFYSEGVFTGRSGTELDHGVAIVGY 297
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDAG---LCGIATAASYPV 344
G ++ G KYW+++NSWG WGE GYIRI + LCG+A ASYP+
Sbjct: 298 GKTQQ--GTKYWIVRNSWGAEWGEKGYIRISAASDSKRLCGLAMEASYPI 345
>gi|296081395|emb|CBI16828.3| unnamed protein product [Vitis vinifera]
Length = 359
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 157/348 (45%), Positives = 223/348 (64%), Gaps = 10/348 (2%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+K EK ++ + ++++ + + E S+ + +E+W + H +D EK R
Sbjct: 1 MKMEKVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRSYH-TVSRDLEEKNKR 59
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
N+FK+N +++ K N+ ++ YKL N+F+D+TN EFR+ Y G + R R +
Sbjct: 60 FNVFKENTKHVHKVNQM-DKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTG 118
Query: 123 -FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
F ++ T +P S+DWR+KGAVT IKDQG+CGSCWAFS V VEGI QI +L+ LSEQ
Sbjct: 119 GFMHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQ 178
Query: 182 QLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
QL+DC +D+HGC+GGLM+ AFE+I +N G+ TE +YPY+ ++ CD K A TI
Sbjct: 179 QLIDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDG 238
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
+E +P DE+AL++AV++QPVSV +DA G FY GV + +CG DHGVA+VG+GT
Sbjct: 239 HESVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTT 298
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
+ G KYW++KNSWG WGE GYIR+ R G CGIA ASYPV
Sbjct: 299 LD--GTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPV 344
>gi|148907299|gb|ABR16787.1| unknown [Picea sitchensis]
Length = 372
Score = 316 bits (809), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 156/312 (50%), Positives = 207/312 (66%), Gaps = 12/312 (3%)
Query: 42 HEQWMAQHGRTYK-DELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+++W QH T D E A R IFK+N+++I+ NK+ + YKLG N+F+DL+NEEF+
Sbjct: 45 YDKWALQHRSTRSLDSDEHARRFEIFKENVKHIDSVNKK-DGPYKLGLNKFADLSNEEFK 103
Query: 101 ALY--TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
A++ T + + +F YQN +P SIDWR+KGAVT +K+QGQCGSCWAF
Sbjct: 104 AMHMTTKMEKHKSLRGDRGVESGSFMYQNSKRLPASIDWRKKGAVTPVKNQGQCGSCWAF 163
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
S +A+VEGI I GKL+ LSEQQLVDCS +N GC+GGLMD AF+YII+N G+ TE +YP
Sbjct: 164 STIASVEGINYIKTGKLVSLSEQQLVDCSKENAGCNGGLMDNAFQYIIDNGGIVTEDEYP 223
Query: 219 YRHEEGTCDNQK--EKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
Y E G C K K++A I +ED+P +E AL +AV++QPVS+ ++ASG F FY
Sbjct: 224 YTAEAGECSTTKIESKSIATIIDGFEDVPANNEGALKKAVAHQPVSIAIEASGHDFQFYS 283
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAG 332
+GV CG DHGV VVG+G + E G YW+++NSWG WGE GYIR+ R G
Sbjct: 284 TGVFTGKCGTELDHGVVVVGYGKSPE--GINYWIVRNSWGPEWGEQGYIRMQRGIEATEG 341
Query: 333 LCGIATAASYPV 344
CGI+ ASYP
Sbjct: 342 KCGISMQASYPT 353
>gi|18141283|gb|AAL60579.1|AF454957_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 460
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 157/321 (48%), Positives = 213/321 (66%), Gaps = 21/321 (6%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDE----LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
+ + +E WM +HG+ + EK R IFK NL +I++ N + N +YKLG
Sbjct: 42 DAEVARIYEAWMEKHGKKAQSNGLVGEEKDQRFEIFKDNLRFIDEHNNK-NLSYKLGLTR 100
Query: 91 FSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTD-VPTSIDWREKGAVTHIKD 148
F+DLTNEE+R++Y G + S++ ++ +YQ V D +P S+DWR++GAV +KD
Sbjct: 101 FADLTNEEYRSIYLG------AKSKKRVLKTSDRYQPRVGDAIPDSVDWRKEGAVAAVKD 154
Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIE 207
QG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T N GC+GGLMD AFE+II+
Sbjct: 155 QGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSYNQGCNGGLMDYAFEFIIK 214
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
N G+ TE DYPY+ +G CD ++ A TI YED+P+ +E AL + ++NQP+SV ++A
Sbjct: 215 NGGIDTEEDYPYKAADGRCDQTRKNAKVVTIDAYEDVPENNEAALKKTLANQPISVAIEA 274
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
GRAF Y SGV + CG DHGV VG+GT ENG YW+++NSWG +WGESGYI++
Sbjct: 275 GGRAFQLYSSGVFDGICGTELDHGVVAVGYGT---ENGKDYWIVRNSWGGSWGESGYIKM 331
Query: 328 LRD----AGLCGIATAASYPV 344
R+ G CGIA ASYP+
Sbjct: 332 ARNIAEPTGKCGIAMEASYPI 352
>gi|297602242|ref|NP_001052232.2| Os04g0203500 [Oryza sativa Japonica Group]
gi|255675217|dbj|BAF14146.2| Os04g0203500 [Oryza sativa Japonica Group]
Length = 336
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 158/342 (46%), Positives = 224/342 (65%), Gaps = 22/342 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR YKD+ EKA R +FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR+ T +PS +R P+ F+ +NV
Sbjct: 68 IESFN-AGNHKFWLGVNQFADLTNDEFRSTKTNKGF-IPSTTRV---PTGFRNENVNIDA 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
+P ++DWR KG VT IKDQGQCG CWAFSAVAA+EGI +++ GKLI S + + + +
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAMEGIVKLSTGKLISHSLNKSL-LTVMS 181
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA---ATISKYEDLPKG 247
GC GGLMD AF++II+N GL TE++YPY + K K+V+ A+I YED+P
Sbjct: 182 MGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVD-----DKFKSVSNSVASIKGYEDVPAN 236
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
+E AL++AV+NQPVSV VD F FYK GV+ CG + DHG+ +G+G A + G K
Sbjct: 237 NEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASD--GTK 294
Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
YWL+KNSWG TWGE+G++R+ +D G+CG+A SYP A
Sbjct: 295 YWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 336
>gi|115461667|ref|NP_001054433.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|14719319|gb|AAK73137.1|AC079022_10 putative cysteine proteinase [Oryza sativa]
gi|33151125|gb|AAP97431.1| cysteine protease CP1 [Oryza sativa]
gi|52353572|gb|AAU44138.1| cysteine proteinase CP1 [Oryza sativa Japonica Group]
gi|113577984|dbj|BAF16347.1| Os05g0108600 [Oryza sativa Japonica Group]
gi|125550541|gb|EAY96250.1| hypothetical protein OsI_18148 [Oryza sativa Indica Group]
Length = 358
Score = 315 bits (808), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 161/314 (51%), Positives = 206/314 (65%), Gaps = 12/314 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E+W+A++ + Y EK R +FK NL +I+ NK+ +Y LG NEF+DLT++
Sbjct: 47 LIELFEKWVAKYRKAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGLNEFADLTHD 105
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSC 155
EF+A Y G P + + F+Y +++ VP +DWR+K AVT +K+QGQCGSC
Sbjct: 106 EFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEVKNQGQCGSC 165
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
WAFS VAAVEGI I G L LSEQ+L+DCSTD N+GC+GGLMD AF YI GL TE
Sbjct: 166 WAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYIASTGGLRTE 225
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
YPY EEG CD K AV TIS YED+P DEQAL++A+++QPVSV ++ASGR F F
Sbjct: 226 EAYPYAMEEGDCDEGKGAAV-VTISGYEDVPANDEQALVKALAHQPVSVAIEASGRHFQF 284
Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA--- 331
Y GV + CG DHGV VG+GT++ G Y ++KNSWG WGE GYIR+ R
Sbjct: 285 YSGGVFDGPCGEQLDHGVTAVGYGTSK---GQDYIIVKNSWGPHWGEKGYIRMKRGTGKG 341
Query: 332 -GLCGIATAASYPV 344
GLCGI ASYP
Sbjct: 342 EGLCGINKMASYPT 355
>gi|57282619|emb|CAE54307.1| cysteine proteinase [Gossypium hirsutum]
Length = 372
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 153/309 (49%), Positives = 210/309 (67%), Gaps = 9/309 (2%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
++ W+ QHG+ Y E+ R IFK NL +I++ N N TYKLG N+F+DLTN+E+RA
Sbjct: 46 YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYRA 105
Query: 102 LYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ G +S PS+ + ++ ++P S++WR+ GAV+ +KDQG CGSCWAFSA
Sbjct: 106 KFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVNWRDHGAVSRVKDQGSCGSCWAFSA 165
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
+AAVEGI +I G+LI LSEQ+LVDC + GC+GGLMD AF++II+N G+ TE DYPY
Sbjct: 166 IAAVEGINKIVSGELISLSEQELVDCDRSYDAGCNGGLMDYAFQFIIDNGGIDTEKDYPY 225
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
CD K+ A +I YED+P +E AL +AV++QPVS+ ++A GRAF Y+SGV
Sbjct: 226 LGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALKKAVAHQPVSIAIEAGGRAFQLYESGV 284
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCG 335
N +CG DHGV VG+G+ ++NG YW+++NSWG WGE+GYIR+ R + G CG
Sbjct: 285 FNGECGLALDHGVVAVGYGS--DDNGQDYWIVRNSWGGNWGENGYIRMERNINANTGKCG 342
Query: 336 IATAASYPV 344
IA ASYPV
Sbjct: 343 IAMEASYPV 351
>gi|595986|gb|AAA79915.1| cysteine proteinase, partial [Dianthus caryophyllus]
Length = 427
Score = 315 bits (807), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 153/311 (49%), Positives = 209/311 (67%), Gaps = 14/311 (4%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK----ANKEGNRTYKLGTNEFSDLTNEE 98
+ W+ +H + Y EK R IF+ NLE+I++ N G ++LG N+F+DLTN+E
Sbjct: 6 QSWLVKHRKNYNALGEKEKRFAIFRDNLEFIDQHNNNNNGGGGGEFELGLNKFADLTNDE 65
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
FR +Y G RP + S +S R + + ++P S+DWR+KGAV+H+KDQGQCGSCWAF
Sbjct: 66 FRRIYFGVKRPEKAESVKSDR---YAVKEGDELPESVDWRKKGAVSHVKDQGQCGSCWAF 122
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADY 217
SA+ AVEGI +I G LI LSEQ+LVDC T N GC GGLMD AF +II N G+ T+ DY
Sbjct: 123 SAIGAVEGINKIVTGDLITLSEQELVDCDTSYNSGCDGGLMDYAFRFIINNGGIDTDKDY 182
Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
PY+ +G+CD+ ++ A TI ED+P +E+AL +AV++QPV + ++A GR F YKS
Sbjct: 183 PYKATDGSCDSNRKNAKVVTIDGLEDVPANNEKALQKAVAHQPVRLAIEAGGRDFQLYKS 242
Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGL 333
GV CG + DHGV VG+GT ++ G YW+++NSWG+ WGE GYIR+ R+ +G
Sbjct: 243 GVFTGSCGTSLDHGVVAVGYGTTDD--GKDYWIVRNSWGDDWGEDGYIRMERNTESKSGK 300
Query: 334 CGIATAASYPV 344
CGIA SYPV
Sbjct: 301 CGIAIEPSYPV 311
>gi|357437715|ref|XP_003589133.1| Cysteine proteinase [Medicago truncatula]
gi|87240770|gb|ABD32628.1| Granulin; Peptidase C1A, papain [Medicago truncatula]
gi|355478181|gb|AES59384.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 315 bits (806), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 165/355 (46%), Positives = 222/355 (62%), Gaps = 17/355 (4%)
Query: 1 MVLKFEKSFIIPMFV---IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL 57
M++ SF + + + II T + S R+ E ++ +E+W+ +HG++Y
Sbjct: 13 MIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNGLG 70
Query: 58 EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQ 116
EK R IFK NL++I++ N N TY+LG F+DLTNEE+R+ + G P + +
Sbjct: 71 EKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKL 129
Query: 117 SSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
S V D +P S+DWR++GAV +KDQ CGSCWAFSA+AAVEGI +I G L
Sbjct: 130 GGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDL 189
Query: 176 IELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
I LSEQ+LVDC T N GC+GGLMD AFE+II N G+ +E DYPY+ +G CD ++ A
Sbjct: 190 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAK 249
Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
TI YED+P DE AL +AV+NQP++V V+ GR F Y+ GV CG DHGVA
Sbjct: 250 VVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAA 309
Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
VG+GT ENG YW+++NSWG +WGE GYIR+ R+ AG CGIA SYP+
Sbjct: 310 VGYGT---ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|30141025|dbj|BAC75926.1| cysteine protease-4 [Helianthus annuus]
Length = 352
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 156/345 (45%), Positives = 220/345 (63%), Gaps = 20/345 (5%)
Query: 13 MFVIIILVITCASQV----VSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMRLN 64
+F++ + V+ C++ + G + + + + K E W+A+H + Y+ EK R
Sbjct: 12 LFLVFVSVLACSALANEFSILGYAPEDLTSIHKVIHLFESWLAKHSKIYESLDEKLHRFE 71
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
IF NL++I+ NK+ + Y LG NEF+DLT+EEF+ + G +P R+ F
Sbjct: 72 IFMDNLKHIDDTNKKVS-NYWLGLNEFADLTHEEFKNKFLGLKGELPE--RKDESIEEFS 128
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
Y++ D+P S+DWR+KGAV +K+QGQCGSCWAFS VAAVEGI QI G L LSEQ+L+
Sbjct: 129 YRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTMLSEQELI 188
Query: 185 DCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC T N+GC+GGLMD AF Y++ + GL E +YPY EGTCD +K+ + TIS Y D
Sbjct: 189 DCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSETVTISGYHD 247
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P+ +E + L+A++NQP+SV ++ASGR F FY GV + CG DHGVA VG+GT +
Sbjct: 248 VPRNNEDSFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAAVGYGTTK-- 305
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G Y +++NSWG WGE GYIR+ R G+CG+ ASYP
Sbjct: 306 -GLDYVIVRNSWGPKWGEKGYIRMKRKTGKPHGMCGLYMMASYPT 349
>gi|255635584|gb|ACU18142.1| unknown [Glycine max]
Length = 345
Score = 315 bits (806), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 157/311 (50%), Positives = 211/311 (67%), Gaps = 12/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R IFK NL++I++ NK + Y LG NEF+DL+++
Sbjct: 43 LIELFESWMSKHGKIYQSIEEKLLRFEIFKDNLKHIDERNKVVS-NYWLGLNEFADLSHQ 101
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G SR+ P F Y++V ++P S+DWR+KGAV +K+QG CGSCWA
Sbjct: 102 EFKNKYLGLK---VDYSRRRESPEEFTYKDV-ELPKSVDWRKKGAVAPVKNQGSCGSCWA 157
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T ++GC+GGLMD AF +I+EN GL E D
Sbjct: 158 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYSNGCNGGLMDYAFSFIVENGGLHKEED 217
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTC+ KE+ TIS Y D+P+ +EQ+LL+A++NQ +SV ++ASGR F FY
Sbjct: 218 YPYIMEEGTCEMTKEETEVVTISGYHDVPQNNEQSLLKALANQSLSVAIEASGRDFQFYS 277
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI---LRDAGL 333
GV + CG++ DHGVA VG+GTA+ G Y ++KNSWG WGE GYIR+ L G
Sbjct: 278 GGVFDGHCGSDLDHGVAAVGYGTAK---GVDYIIVKNSWGSKWGEKGYIRMRGTLETRGN 334
Query: 334 CGIATAASYPV 344
ASYP+
Sbjct: 335 LRYLQMASYPL 345
>gi|351726339|ref|NP_001237379.1| cysteine proteinase precursor [Glycine max]
gi|31559526|dbj|BAC77521.1| cysteine proteinase [Glycine max]
gi|31559528|dbj|BAC77522.1| cysteine proteinase [Glycine max]
Length = 362
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 158/321 (49%), Positives = 207/321 (64%), Gaps = 20/321 (6%)
Query: 35 EPSIVEKHEQWMAQH--GRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
E S + +E+W + H R+ D K R N+FK N+ ++ NK ++ YKL N+F+
Sbjct: 33 EESFWDLYERWRSHHTVSRSLGD---KHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFA 88
Query: 93 DLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
D+TN EFR+ Y G ++R R + TF Y+ V VP S+DWR+ GAVT +KD
Sbjct: 89 DMTNHEFRSTYAGSKVNHHRMFQGTPRGNG---TFMYEKVGSVPPSVDWRKNGAVTGVKD 145
Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIE 207
QGQCGSCWAFS V AVEGI QI KL+ LSEQ+LVDC T N GC+GGLM+ AFE+I +
Sbjct: 146 QGQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQ 205
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
G+ TE++YPY ++GTCD K +A +I +E++P DE ALL+AV+NQPVSV +DA
Sbjct: 206 KGGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDA 265
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
G F FY GV DC +HGVA+VG+GT + G YW ++NSWG WGE GYIR+
Sbjct: 266 GGSDFQFYSEGVFTGDCSTELNHGVAIVGYGTTVD--GTNYWTVRNSWGPEWGEQGYIRM 323
Query: 328 LRD----AGLCGIATAASYPV 344
R GLCGIA ASYP+
Sbjct: 324 QRSISKKEGLCGIAMMASYPI 344
>gi|357437719|ref|XP_003589135.1| Cysteine proteinase [Medicago truncatula]
gi|355478183|gb|AES59386.1| Cysteine proteinase [Medicago truncatula]
Length = 457
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 165/355 (46%), Positives = 222/355 (62%), Gaps = 17/355 (4%)
Query: 1 MVLKFEKSFIIPMFV---IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL 57
M++ SF + + + II T + S R+ E ++ +E+W+ +HG++Y
Sbjct: 13 MIVLIISSFTVSLALDMSIISYDKTHPDKSTSKRTNKE--VLTMYEEWLVKHGKSYNGLG 70
Query: 58 EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQ 116
EK R IFK NL++I++ N N TY+LG F+DLTNEE+R+ + G P + +
Sbjct: 71 EKDKRFEIFKDNLKFIDEHNGL-NSTYRLGLTRFADLTNEEYRSKFLGTKIDPNRRMKKL 129
Query: 117 SSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
S V D +P S+DWR++GAV +KDQ CGSCWAFSA+AAVEGI +I G L
Sbjct: 130 GGSKSNRYAPRVGDKLPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDL 189
Query: 176 IELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
I LSEQ+LVDC T N GC+GGLMD AFE+II N G+ +E DYPY+ +G CD ++ A
Sbjct: 190 ISLSEQELVDCDTSYNEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAK 249
Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
TI YED+P DE AL +AV+NQP++V V+ GR F Y+ GV CG DHGVA
Sbjct: 250 VVTIDDYEDVPAYDELALQKAVANQPIAVAVEGGGREFQLYEYGVFTGRCGTALDHGVAA 309
Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
VG+GT ENG YW+++NSWG +WGE GYIR+ R+ AG CGIA SYP+
Sbjct: 310 VGYGT---ENGKDYWIVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 361
>gi|356515080|ref|XP_003526229.1| PREDICTED: vignain-like [Glycine max]
Length = 284
Score = 314 bits (805), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 155/288 (53%), Positives = 198/288 (68%), Gaps = 17/288 (5%)
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF---RALYTGYNRPVPSVSRQSSRPSTF 123
K+N+ YIE N N+ YKLG N+F+DLT+EEF R + G+ R ++R +TF
Sbjct: 5 KENVNYIEAFNNAANKPYKLGINQFADLTSEEFIVPRNRFNGHMR------FSNTRTTTF 58
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
KY+NVT +P SIDWR+KGAVT IK+QG CG CWAFSA+AA EGI +I+ GKL+ LSEQ++
Sbjct: 59 KYENVTVLPDSIDWRQKGAVTPIKNQGSCGCCWAFSAIAATEGIHKISTGKLVSLSEQEV 118
Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
VDC T +HGC GG MD AF++II+N G+ TEA YPY+ +G C+ ++E A TI+ Y
Sbjct: 119 VDCDTKGTDHGCEGGYMDGAFKFIIQNHGINTEASYPYKGVDGKCNIKEEAVHATTITGY 178
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
ED+P +E+AL +AV+NQPVSV +DA G F FYKSG+ CG DHGV VG+G E
Sbjct: 179 EDVPINNEKALQKAVANQPVSVAIDARGADFQFYKSGIFTGSCGTELDHGVTAVGYG--E 236
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
G KYWL+KNSWG WGE GY + R G+CGIA ASYP A
Sbjct: 237 NNEGTKYWLVKNSWGTEWGEEGYTMMQRGVKAVEGICGIAMLASYPTA 284
>gi|2224808|emb|CAB09697.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
gi|326502180|dbj|BAK06781.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 314 bits (804), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 158/320 (49%), Positives = 213/320 (66%), Gaps = 16/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYK----DELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
E S+ +E+W + + + + D E+ R N+FK+N Y+ + NK +R ++L N+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEER--RFNVFKENARYVHEGNKR-DRPFRLALNK 90
Query: 91 FSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
F+D+T +EFR Y G R S+S F+Y + ++P ++DWR+KGAVT IKDQ
Sbjct: 91 FADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYADADNLPPAVDWRQKGAVTAIKDQ 150
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIEN 208
GQCGSCWAFS + AVEGI +I GKL+ LSEQ+L+DC + +N GC GGLMD AF++I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCEGGLMDYAFQFIQKN 210
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
G+ TE++YPY+ E+G+CD KE A A TI YED+P DE AL +AV+ QPVSV +DAS
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269
Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
G+ F FY GV +C + DHGVA VG+G +G KYW++KNSWGE WGE GYIR+
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGAT--RDGTKYWIVKNSWGEDWGEKGYIRMQ 327
Query: 329 RDA----GLCGIATAASYPV 344
R GLCGIA ASYP
Sbjct: 328 RGVSQTEGLCGIAMQASYPT 347
>gi|226533314|ref|NP_001150119.1| xylem cysteine proteinase 2 [Zea mays]
gi|195636886|gb|ACG37911.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|223946183|gb|ACN27175.1| unknown [Zea mays]
gi|413951209|gb|AFW83858.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 385
Score = 314 bits (804), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 165/338 (48%), Positives = 210/338 (62%), Gaps = 29/338 (8%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
S HE S+ E E+W+++H R Y EK R +FK NL +I++ N++ + +Y LG NEF
Sbjct: 50 SSHE-SLAELFERWLSRHRRAYASLEEKLRRFQVFKDNLHHIDETNRKVS-SYWLGLNEF 107
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV------TDVPTSIDWREKGAVTH 145
+DLT++EF+A Y G V + + +P S+DWR KGAVT
Sbjct: 108 ADLTHDEFKATYLGLRSSVGDGGSGIDDDDEPEEEEGYEGVDGASLPKSVDWRSKGAVTG 167
Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEY 204
+K+QGQCGSCWAFS VAAVEGI QI G L LSEQ+L+DC TD N+GC+GGLMD AF Y
Sbjct: 168 VKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELIDCDTDGNNGCNGGLMDYAFSY 227
Query: 205 IIENKGLATEADYPYRHEEGTC--------------DNQKEKAVAATISKYEDLPKGDEQ 250
I N GL TE YPY EEGTC ++ + A TIS YED+P+ +EQ
Sbjct: 228 IAHNGGLHTEEAYPYLMEEGTCQRSSSSEKKWPGSSEDANDDAAVVTISGYEDVPRNNEQ 287
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
ALL+A++ QPVSV ++ASGR F FY GV + CG DHGVA VG+GTA + G Y +
Sbjct: 288 ALLKALAQQPVSVAIEASGRNFQFYSGGVFDGPCGTQLDHGVAAVGYGTAAK--GHDYII 345
Query: 311 IKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+KNSWG +WGE GYIR+ R GLCGI ASYP
Sbjct: 346 VKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMASYPT 383
>gi|357465603|ref|XP_003603086.1| Cysteine proteinase [Medicago truncatula]
gi|355492134|gb|AES73337.1| Cysteine proteinase [Medicago truncatula]
Length = 474
Score = 313 bits (803), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 161/357 (45%), Positives = 222/357 (62%), Gaps = 28/357 (7%)
Query: 12 PMFVIIILVITCAS------QVVSGRSMH--------EPSIVEKHEQWMAQHGRTYK--D 55
PM VI+I+ + ++S H + + +E+W +HG+ D
Sbjct: 9 PMLVILIVFTLFTATFALDMSIISYDKTHSDKSSRRSDKEVKNIYEEWRVKHGKLNNNID 68
Query: 56 ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSV- 113
EK R IFK NL++I++ N E NRTYK+G N F+DL+NEE+R+ Y G P+ +
Sbjct: 69 GSEKDKRFEIFKDNLKFIDEHNAE-NRTYKVGLNRFADLSNEEYRSRYLGTKIDPIGMMM 127
Query: 114 SRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRG 173
+R +R + + +P S+DWR +GAV +KDQG CGSCWAFS +AAVEGI +I G
Sbjct: 128 ARTKTRSNRYAPSVGDKLPKSVDWRSQGAVVQVKDQGSCGSCWAFSTIAAVEGINKIVTG 187
Query: 174 KLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
+L+ LSEQ+LVDC T N GC GGLM+ AFE+II N G+ ++ DYPYR +G CD K+
Sbjct: 188 ELVSLSEQELVDCDRTVNAGCDGGLMEYAFEFIINNGGIDSDEDYPYRGVDGKCDQYKKN 247
Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
A +I YE +P DE AL +AV+NQP+SV ++A GR F Y SG+ CG DHGV
Sbjct: 248 ARVVSIDDYEQVPAYDELALKKAVANQPISVAIEAGGREFQLYVSGIFTGKCGTALDHGV 307
Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
VG+GT ENG YW+++NSWG++WGESGY+R+ R+ AG CGI +SYP+
Sbjct: 308 TAVGYGT---ENGVDYWIVRNSWGKSWGESGYVRMERNLAASVAGKCGIVMQSSYPI 361
>gi|172052260|gb|ACB70409.1| cysteine protease [Nicotiana tabacum]
Length = 361
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 160/315 (50%), Positives = 205/315 (65%), Gaps = 18/315 (5%)
Query: 40 EKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
E +E+W + H T L EK R N+FK N+ Y+ NK+ ++ YKL N+F+D+TN E
Sbjct: 36 ELYERWRSHH--TVSRSLDEKDKRFNVFKANVHYVHNFNKK-DKPYKLKLNKFADMTNHE 92
Query: 99 FRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
FR Y G ++R SR + TF Y + VP ++DWR+KGAVT +KDQG+CGS
Sbjct: 93 FRHHYAGSKIKHHRTFLGASRANG---TFMYAHEDSVPPTVDWRKKGAVTPVKDQGKCGS 149
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
CWAFS V AVEGI QI +L+ LSEQ+LVDC T N GC+GGLMD AFE+I + G+ T
Sbjct: 150 CWAFSTVVAVEGINQIKTNELVSLSEQELVDCDTSQNQGCNGGLMDMAFEFIKKKGGINT 209
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
E +YPY E G CD QK + +I +ED+P DE +LL+AV+NQPVSV + ASG F
Sbjct: 210 EENYPYMAEGGECDIQKRNSPVVSIDGHEDVPPNDEGSLLKAVANQPVSVAIQASGSDFQ 269
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR---- 329
FY GV DCG DHGVA+VG+GT + KYW++KNSWG WGE GYIR+ R
Sbjct: 270 FYSEGVFTGDCGTELDHGVAIVGYGTTLDR--TKYWIVKNSWGPEWGEKGYIRMQREIDA 327
Query: 330 DAGLCGIATAASYPV 344
+ GLCGIA SYP+
Sbjct: 328 EEGLCGIAMQPSYPI 342
>gi|168006315|ref|XP_001755855.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162693174|gb|EDQ79528.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 454
Score = 313 bits (803), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 157/319 (49%), Positives = 212/319 (66%), Gaps = 16/319 (5%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E + E+ W +HG+ Y E A R ++K NLEYI++ + E NR+Y LG +F+D
Sbjct: 38 NERLLSEQFGAWAHKHGKVYSSLEEHAHRYMVWKDNLEYIQR-HSEKNRSYWLGLTKFAD 96
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
+TN+EFR YTG S++S R + F+Y + ++ P S+DWR+KGAVT +KDQG CG
Sbjct: 97 ITNDEFRRQYTGTR---IDRSKRSKRKTGFRYAD-SEAPESVDWRKKGAVTTVKDQGSCG 152
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
SCWAFSA+ +VEGI I G+ + LSEQ+LVDC + N GC+GGLMD AF++I+EN G+
Sbjct: 153 SCWAFSAIGSVEGINAIRTGEAVSLSEQELVDCDLEYNQGCNGGLMDYAFDFILENGGID 212
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYPY+ +G CDN K+ A TI YED+P+ DE+AL +AV+ QPVSV ++A GR F
Sbjct: 213 TENDYPYKGLDGRCDNNKKNAHVVTIDGYEDVPENDEEALKKAVAGQPVSVAIEAGGRDF 272
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
Y GV +CG + DHGV VG+G+ E YW++KNSWGE WGESGY+R+ R+
Sbjct: 273 QLYSGGVFTGECGTDLDHGVLAVGYGS---EGSLDYWIVKNSWGEYWGESGYLRMQRNIK 329
Query: 332 ------GLCGIATAASYPV 344
GLCGI SY V
Sbjct: 330 DSNHQFGLCGINIEPSYAV 348
>gi|5777889|emb|CAB53515.1| cysteine protease [Solanum tuberosum]
Length = 466
Score = 313 bits (802), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 152/350 (43%), Positives = 213/350 (60%), Gaps = 16/350 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK--------HEQWMAQHGRTYKDELEK 59
+ I + +++I ++ +S S E I + +E W+ +HG++Y EK
Sbjct: 7 TLTISLLLMLIFSTLSSASDMSIISYDETHIHHRSDDEVSALYESWLIEHGKSYNALGEK 66
Query: 60 AMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR 119
R IFK NL+YI++ N N++YKLG +F+DLTNEE+R++Y G ++
Sbjct: 67 DKRFQIFKDNLKYIDEQNSVPNQSYKLGLTKFADLTNEEYRSIYLGTKSSGDRRKLSKNK 126
Query: 120 PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
+ + +P S+DWR+KG + +KDQG CGSCWAFSAVAA+E I I G LI LS
Sbjct: 127 SDRYLPKVGDSLPESVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLS 186
Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
EQ+LVDC N GC GGLMD AFE++I N G+ TE DYPY+ CD ++ A I
Sbjct: 187 EQELVDCDKSYNEGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKI 246
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
YED+P +E+AL +AV++QPVS+ ++A GR YKSG+ CG DHGV G+G
Sbjct: 247 DSYEDVPVNNEKALQKAVAHQPVSIAIEAGGRDLQHYKSGIFTGKCGTAVDHGVVAAGYG 306
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
+ ENG YW+++NSWG WGE GY+R+ R+ +GLCG+AT SYPV
Sbjct: 307 S---ENGMDYWIVRNSWGAKWGEKGYLRVQRNVASSSGLCGLATEPSYPV 353
>gi|21070926|gb|AAM34401.1|AF377947_7 putative cysteine proteinase [Oryza sativa Japonica Group]
gi|31712050|gb|AAP68356.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|40538988|gb|AAR87245.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|108711126|gb|ABF98921.1| Papain family cysteine protease containing protein, expressed
[Oryza sativa Japonica Group]
gi|125545747|gb|EAY91886.1| hypothetical protein OsI_13535 [Oryza sativa Indica Group]
Length = 350
Score = 313 bits (802), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 164/354 (46%), Positives = 219/354 (61%), Gaps = 26/354 (7%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIV----------EKHEQWMAQHGRTYKDELEKA 60
+ FV+ +LV++ A+ + GR + +HE+WMA+HG+TYKDE EKA
Sbjct: 3 LSTFVLAVLVMSGAAAL--GRELAGDGAAAAAAADVAMASRHEKWMAKHGKTYKDEEEKA 60
Query: 61 MRLNIFKQNLEYIEKAN----KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
RL +F+ N + I+ N K+G ++L TN F+DLT++EFRA TGY RP P+
Sbjct: 61 RRLEVFRANAKLIDSFNAAAEKDGGGGHRLATNRFADLTDDEFRAARTGYQRP-PAAVAG 119
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
+ ++ ++ P S+DWR GAVT +KDQG CG CWAFSAVAAVEG+ +I G+L+
Sbjct: 120 AGGGFLYENFSLAAAPQSMDWRAMGAVTGVKDQGSCGCCWAFSAVAAVEGLAKIRTGQLV 179
Query: 177 ELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
LSEQ+LVDC ++ GC GGLMD AF+YI GLA E+ YPYR +
Sbjct: 180 SLSEQELVDCDVRGEDQGCEGGLMDTAFQYIARRGGLAAESSYPYRGVD-GACRAAAGRA 238
Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVA 293
AA+I ++D+P DE AL+ AV+ QPVSV ++ +G F FY GVL A CG +H V
Sbjct: 239 AASIRGFQDVPSNDEGALMAAVARQPVSVAINGAGYVFRFYDRGVLGGAGCGTELNHAVT 298
Query: 294 VVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
VG+GTA + G YWL+KNSWG +WGE GY+RI R G CGIA ASYPV
Sbjct: 299 AVGYGTASDGTG--YWLMKNSWGASWGEGGYVRIRRGVGREGACGIAQMASYPV 350
>gi|355344587|gb|AER60490.1| cysteine proteases [Gossypium hirsutum]
Length = 371
Score = 313 bits (802), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 151/309 (48%), Positives = 208/309 (67%), Gaps = 9/309 (2%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
++ W+ QHG+ Y E+ R IFK NL +I++ N N TYKLG N+F+DLTN+E+RA
Sbjct: 45 YKSWVIQHGKAYNGIGEEEKRFEIFKDNLRFIDEHNSNNNTTYKLGLNKFADLTNQEYRA 104
Query: 102 LYTGYNRPVPSVSRQSSRPST-FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ G +S PS+ + ++ ++P S+DWR+ GAV+ +KDQG CGSCWAFS
Sbjct: 105 KFLGTRTDPRRRLMKSKIPSSRYAHRAGDNLPDSVDWRDHGAVSPVKDQGSCGSCWAFST 164
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
+A VEGI +I G+L+ LSEQ+LVDC + GC+GGLMD AF++I++N G+ TE DYPY
Sbjct: 165 IATVEGINKIVSGELVSLSEQELVDCDRSYDAGCNGGLMDYAFQFIMDNGGIDTEKDYPY 224
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
CD K+ A +I YED+P +E AL +AV++QPVS+ ++A GRAF Y+SGV
Sbjct: 225 LGFNNQCDPTKKNAKVVSIDGYEDVPN-NENALKKAVAHQPVSIAIEAGGRAFQLYESGV 283
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCG 335
N +CG DHGV VG+GT ++NG YW+++NSWG WGE+GYIR+ R + G CG
Sbjct: 284 FNGECGLALDHGVVAVGYGT--DDNGQDYWIVRNSWGSNWGENGYIRMERNINANTGKCG 341
Query: 336 IATAASYPV 344
IA ASYPV
Sbjct: 342 IAMEASYPV 350
>gi|413951605|gb|AFW84254.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 423
Score = 313 bits (802), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 156/350 (44%), Positives = 210/350 (60%), Gaps = 23/350 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIV------EKHEQWMAQHGRTYKDELEKAMRLNIF 66
+ V ++ V + A ++ E + + +E+W H R ++ EK R F
Sbjct: 53 LLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTF 111
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST---- 122
K+N+ +I NK G+R Y+L N F D+ EEFR+ + + + + RQ S +
Sbjct: 112 KENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFA--DSRINDLRRQDSPAARAGAV 169
Query: 123 --FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
F Y + D P S+DWR++GAVT +KDQG CGSCWAFS V AVEGI I G L LSE
Sbjct: 170 PGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLASLSE 229
Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK---AVAAT 237
Q+L+DC TD +GC GGLM+ AFE+I G+ TEA YPYR GTCD + + V
Sbjct: 230 QELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGVVVV 289
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I ++ +P G E AL +AV++QPVSV VDA G+AF FY GV DCG + DHGVA VG+
Sbjct: 290 IDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 349
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
G ++ G YW++KNSWG +WGE GYIR+ R A GLCGIA AS+P+
Sbjct: 350 GVGDD--GTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 397
>gi|413951606|gb|AFW84255.1| hypothetical protein ZEAMMB73_933931 [Zea mays]
Length = 379
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 156/350 (44%), Positives = 210/350 (60%), Gaps = 23/350 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIV------EKHEQWMAQHGRTYKDELEKAMRLNIF 66
+ V ++ V + A ++ E + + +E+W H R ++ EK R F
Sbjct: 9 LLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTF 67
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST---- 122
K+N+ +I NK G+R Y+L N F D+ EEFR+ + + + + RQ S +
Sbjct: 68 KENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFA--DSRINDLRRQDSPAARAGAV 125
Query: 123 --FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
F Y + D P S+DWR++GAVT +KDQG CGSCWAFS V AVEGI I G L LSE
Sbjct: 126 PGFMYDSAADPPRSVDWRQEGAVTGVKDQGHCGSCWAFSTVVAVEGINAIRTGSLASLSE 185
Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK---AVAAT 237
Q+L+DC TD +GC GGLM+ AFE+I G+ TEA YPYR GTCD + + V
Sbjct: 186 QELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGVVVV 245
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I ++ +P G E AL +AV++QPVSV VDA G+AF FY GV DCG + DHGVA VG+
Sbjct: 246 IDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 305
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
G ++ G YW++KNSWG +WGE GYIR+ R A GLCGIA AS+P+
Sbjct: 306 GVGDD--GTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353
>gi|359473128|ref|XP_002285397.2| PREDICTED: vignain-like [Vitis vinifera]
Length = 357
Score = 313 bits (802), Expect = 8e-83, Method: Compositional matrix adjust.
Identities = 156/346 (45%), Positives = 221/346 (63%), Gaps = 10/346 (2%)
Query: 5 FEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
EK ++ + ++++ + + E S+ + +E+W + H +D EK R N
Sbjct: 1 MEKVILVALSLVLVFGLAESFDFDEKDLASEESLWDLYERWRSYH-TVSRDLEEKNKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-F 123
+FK+N +++ K N+ ++ YKL N+F+D+TN EFR+ Y G + R R + F
Sbjct: 60 VFKENTKHVHKVNQM-DKPYKLKLNKFADMTNHEFRSSYGGSKVKHYRMLRGDRRGTGGF 118
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
++ T +P S+DWR+KGAVT IKDQG+CGSCWAFS V VEGI QI +L+ LSEQQL
Sbjct: 119 MHEKTTYLPPSVDWRKKGAVTGIKDQGKCGSCWAFSTVVGVEGINQIKTKELLSLSEQQL 178
Query: 184 VDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
+DC +D+HGC+GGLM+ AFE+I +N G+ TE +YPY+ ++ CD K A TI +E
Sbjct: 179 IDCDRSDDHGCNGGLMESAFEFIKKNGGITTENNYPYKAKDERCDMLKMNAPVVTIDGHE 238
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
+P DE+AL++AV++QPVSV +DA G FY GV + +CG DHGVA+VG+GT +
Sbjct: 239 SVPVNDERALMKAVAHQPVSVAIDAGGSDLQFYSEGVFDGECGTELDHGVAIVGYGTTLD 298
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
G KYW++KNSWG WGE GYIR+ R G CGIA ASYPV
Sbjct: 299 --GTKYWIVKNSWGAEWGEKGYIRMARGIQAAEGQCGIAMEASYPV 342
>gi|297819566|ref|XP_002877666.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
gi|297323504|gb|EFH53925.1| hypothetical protein ARALYDRAFT_906213 [Arabidopsis lyrata subsp.
lyrata]
Length = 304
Score = 313 bits (801), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 162/321 (50%), Positives = 216/321 (67%), Gaps = 33/321 (10%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
+ E S +EKHEQWM++ R Y D+ EK R IFK+NL+++E N N TYKL N+FS
Sbjct: 9 LFEASAIEKHEQWMSRFNRVYSDDSEKTSRFEIFKKNLKFVESFNMNTNNTYKLDVNKFS 68
Query: 93 DLTNEEFRALYTGYNRPVP-SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
DLT+EEF+A Y G VP ++ S + +F+Y+NV++ S+DWR +GAVT +KDQGQ
Sbjct: 69 DLTDEEFQARYMGL---VPEGMTGDSQKTVSFRYENVSETGESMDWRLEGAVTPVKDQGQ 125
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENK 209
CG CWAF+AVAAVEG+T+I G+L+ LSEQQLVDCST +N GC GGL A++YI EN+
Sbjct: 126 CGCCWAFAAVAAVEGVTKIANGELVSLSEQQLVDCSTANNNMGCDGGLALTAYDYIKENQ 185
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
G+ +E +YPY+ + TC + AATIS YE +PK DE+ALL+AVS
Sbjct: 186 GITSEENYPYQAVQQTC--KSTDPAAATISGYEAVPKDDEEALLKAVS------------ 231
Query: 270 RAFHFYKSGVLNAD-CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
+ G+ + CG + H V +VG+GT+EE G KYWL+KNSWGE+WGE+GY+RI
Sbjct: 232 ------QHGIFEDEYCGTDSHHAVTIVGYGTSEE--GIKYWLLKNSWGESWGENGYMRIK 283
Query: 329 RDA----GLCGIATAASYPVA 345
RD G+CG+A A YPVA
Sbjct: 284 RDVDEPQGMCGLAHRAYYPVA 304
>gi|414591545|tpg|DAA42116.1| TPA: hypothetical protein ZEAMMB73_388689 [Zea mays]
Length = 384
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 153/324 (47%), Positives = 212/324 (65%), Gaps = 16/324 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTY----KDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
E S+ +E+W + + R D+ ++A R N+FK+N Y+ +AN++ R ++L N+
Sbjct: 34 EESLRALYERWRSHYHRVSPRDGDDKQQQARRFNVFKENARYVHEANRKDGRPFRLALNK 93
Query: 91 FSDLTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKY-QNVTDVPTSIDWREKGAVTH 145
F+D+T +EFR Y G ++R +R + + T++P ++DWR +GAVT
Sbjct: 94 FADMTTDEFRRTYAGSRTRHHRAQLGEARSFAHAQHGRGGSGTTNLPPAVDWRLRGAVTG 153
Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEY 204
+KDQGQCGSCWAFSA+AAVEG+ +I GKL+ LSEQ+LVDC DN GC GGLMD AF+Y
Sbjct: 154 VKDQGQCGSCWAFSAIAAVEGVNKIMTGKLVSLSEQELVDCDDVDNQGCDGGLMDYAFQY 213
Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
I N G+ TE++YPY E+ +C+ KE++ TI YED+P +E AL +AV++QPV+V
Sbjct: 214 IQRNGGVTTESNYPYLAEQRSCNKAKERSHDVTIDGYEDVPANNEDALQKAVASQPVAVA 273
Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
++ASG+ F FY GV CG + DHGVA VG+GT + G KYW +KNSWGE WGE GY
Sbjct: 274 IEASGQDFQFYSEGVFTGSCGTDLDHGVAAVGYGTTGD--GTKYWTVKNSWGEDWGERGY 331
Query: 325 IRILRDA----GLCGIATAASYPV 344
IR+ R GLCGIA SYP
Sbjct: 332 IRMQRGVPDSRGLCGIAMEPSYPT 355
>gi|357446975|ref|XP_003593763.1| Cysteine proteinase [Medicago truncatula]
gi|355482811|gb|AES64014.1| Cysteine proteinase [Medicago truncatula]
Length = 350
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 162/331 (48%), Positives = 210/331 (63%), Gaps = 9/331 (2%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
IIL+ CA +S R++ E S+VE H+QWM ++ RTY + E R IFK+NLEYIE
Sbjct: 9 IILLWACAYPTMS-RTLTESSVVEAHQQWMMKYERTYTNSSEMEKRKKIFKENLEYIENF 67
Query: 77 NKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
N GN++YKLG N +SDLT+EEF A +TG+ + +S R + DVPT+ D
Sbjct: 68 NNVGNKSYKLGLNRYSDLTSEEFIASHTGF-KVSDQLSDSKMRSVAIPFNLNDDVPTNFD 126
Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG 196
WREKG VT +K+Q QCG CWAF+AVAAVEGI +I G LI LSEQQLVDC + GC GG
Sbjct: 127 WREKGVVTDVKNQRQCGCCWAFTAVAAVEGIVKIKNGNLISLSEQQLVDCDRQSSGCGGG 186
Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
AF+ II+++G+ E DYPY+ + + AA I+ Y +P DEQ LL+AV
Sbjct: 187 DFVLAFDSIIKSRGIVKEDDYPYKANDVQTCQLGQIPGAAQINGYFKVPANDEQQLLRAV 246
Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
QPVSV + S FH Y GV CG +H V ++G+G +E G KYWLIKNSWG
Sbjct: 247 LQQPVSVAISTS-YDFHHYMGGVYEGSCGPKLNHAVTIIGYGVSEA--GKKYWLIKNSWG 303
Query: 317 ETWGESGYIRILRDA----GLCGIATAASYP 343
ETWGE GY+++LR++ G C IA A+YP
Sbjct: 304 ETWGEKGYMKVLRESSATGGQCSIAVHAAYP 334
>gi|22759715|dbj|BAC10906.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 312 bits (800), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 158/354 (44%), Positives = 226/354 (63%), Gaps = 17/354 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQV-VSGRSMHEPSIVEK----HEQWMAQHGRTYKD 55
+ +K+ ++ +FV I+ A + + G + + + + K E W+ +H + Y+
Sbjct: 3 FIFSSKKTSLLFLFVSILACSALAHEFSILGYAPEDLTSIHKVIHLFESWLVKHSKFYES 62
Query: 56 ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
EK R IF NL++I++ NK+ + Y LG NEF+DLT+EEF+ + G+ +
Sbjct: 63 LDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEFADLTHEEFKHKFLGFKGELAERKD 121
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
+SS+ F Y++ D+P S+DWR+KGAV +K+QGQCGSCWAFS VAAVEGI QI G L
Sbjct: 122 ESSKE--FGYRDFVDLPKSVDWRKKGAVAPVKNQGQCGSCWAFSTVAAVEGINQIVTGNL 179
Query: 176 IELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV 234
LSEQ+L+DC T N+GC+GGLMD AF Y++ + GL E +YPY EGTCD +K+ +
Sbjct: 180 TMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-GLHKEEEYPYIMSEGTCDEKKDVSE 238
Query: 235 AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAV 294
TIS Y D+P+ DE + L+A++NQP+SV ++ASGR F FY GV + CG DHGVA
Sbjct: 239 KVTISGYHDVPRNDEASFLKALANQPISVAIEASGRDFQFYSGGVFDGHCGTELDHGVAA 298
Query: 295 VGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
VG+GT + G Y +++NSWG WGE GYIR+ R + G+CG+ ASYP
Sbjct: 299 VGYGTTK---GLDYVIVRNSWGPKWGEKGYIRMKRGSGKPHGMCGLYMMASYPT 349
>gi|359359213|gb|AEV41117.1| putative oryzain beta chain precursor [Oryza officinalis]
Length = 465
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 149/310 (48%), Positives = 209/310 (67%), Gaps = 14/310 (4%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGTNEFSDLTNEEFR 100
++ W+A++GR+Y E R +F NL + + N + + ++LG N F+DLTNEEFR
Sbjct: 54 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 113
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
A + G V R + +++ V ++P S+DWREKGAV +K+QGQCGSCWAFSA
Sbjct: 114 ATFLG----AKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 169
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
V+ VE I Q+ G++I LSEQ+LV+CST+ N GC+GGLMD AF++II+N G+ TE DYP
Sbjct: 170 VSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYP 229
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
Y+ +G CD +E A +I +ED+P+ DE++L +AV++QPVSV ++A GR F Y SG
Sbjct: 230 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 289
Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLC 334
V + CG + DHGV VG+GT +NG YW+++NSWG WGESGY+R+ R+ G C
Sbjct: 290 VFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKC 346
Query: 335 GIATAASYPV 344
GIA ASYP
Sbjct: 347 GIAMMASYPT 356
>gi|359359066|gb|AEV40973.1| putative oryzain beta chain precursor [Oryza punctata]
Length = 461
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 147/311 (47%), Positives = 211/311 (67%), Gaps = 15/311 (4%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLTNEEF 99
++ W+A++GR+Y E+ R +F NL++++ N + ++LG N F+DLTN+EF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
R+ + G V R + +++ V ++P S+DWREKGAV +K+QGQCGSCWAFS
Sbjct: 109 RSTFLG----AKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFS 164
Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADY 217
AV+ VE I Q+ G++I LSEQ+LV+CST+ N GC+GGLMD AF++II+N G+ TE DY
Sbjct: 165 AVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDY 224
Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
PY+ +G CD +E A +I +ED+P+ DE++L +AV++QPVSV ++A GR F Y S
Sbjct: 225 PYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHS 284
Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGL 333
GV + CG + DHGV VG+GT +NG YW+++NSWG WGESGY+R+ R+ G
Sbjct: 285 GVFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINATTGK 341
Query: 334 CGIATAASYPV 344
CGIA ASYP
Sbjct: 342 CGIAMMASYPT 352
>gi|32396020|gb|AAP41847.1| senescence-associated cysteine protease [Anthurium andraeanum]
Length = 460
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 161/328 (49%), Positives = 215/328 (65%), Gaps = 16/328 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK-EGNRTYK 85
+V+ R+ E ++ +E W+ +G+ Y EK R IF NL YI+ N+ E N +Y
Sbjct: 25 IVAERTEEEVRLL--YEGWLVGNGKAYNLLGEKERRFEIFWDNLRYIDDHNRAENNHSYT 82
Query: 86 LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR-PSTFK--YQNVTDVPTSIDWREKGA 142
LG F+DLTNEE+R+ Y G +P R+++R P + N D+P +DWREKGA
Sbjct: 83 LGLTRFADLTNEEYRSTYLGV-KPGQVRPRRANRAPGRGRDLSANGDDLPQKVDWREKGA 141
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V IKDQG CGSCWAFS VAAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 142 VAPIKDQGGCGSCWAFSTVAAVEGINQIVTGDLIVLSEQELVDCDTAYNEGCNGGLMDYA 201
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F++II N G+ TE DYPY+ +G CD ++ A +I YED+ + DE AL AV++QPV
Sbjct: 202 FQFIISNGGIDTEEDYPYKERDGLCDPNRKNAKVVSIDSYEDVLENDEHALKTAVAHQPV 261
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV ++ GR+F YKSG+ + CG + DHGV VG+GT E+G YW+++NSWG++WGE
Sbjct: 262 SVAIEGGGRSFQLYKSGIFDGRCGIDLDHGVVAVGYGT---ESGKDYWIVRNSWGKSWGE 318
Query: 322 SGYIRILRD-----AGLCGIATAASYPV 344
+GYIR+ R+ +G CGIA SYP+
Sbjct: 319 AGYIRMERNLPSSSSGKCGIAIEPSYPI 346
>gi|357126406|ref|XP_003564878.1| PREDICTED: cysteine proteinase EP-B 1-like [Brachypodium
distachyon]
Length = 377
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 155/325 (47%), Positives = 209/325 (64%), Gaps = 19/325 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT--------YKL 86
E ++ E + +W + H + EK R FK N+ +I N N T Y+L
Sbjct: 35 EEALWELYTRWQSAHRLPPQHHAEKHRRFGTFKSNVLFIHAHNTRLNDTSTNNNGPSYRL 94
Query: 87 GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
N F D+ EFR+ + G P+ +R + F Y V D+P ++DWR+KGAVT +
Sbjct: 95 RLNRFGDMDQAEFRSTFAG---PLHRHTRPAQSIPGFIYDTVKDIPQAVDWRQKGAVTGV 151
Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEY 204
KDQG+CGSCWAFSAVA+VEG+ I G L+ LSEQ+L+DC T D++GC GGLM+ AFE+
Sbjct: 152 KDQGKCGSCWAFSAVASVEGLNAIRTGSLVSLSEQELIDCDTGGDDNGCQGGLMESAFEF 211
Query: 205 IIENKG-LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
I + G LATEA YPY GTC+ + +V+ I ++ +P G+E+AL +AV++QPVSV
Sbjct: 212 IAHSAGGLATEAAYPYHASNGTCNANRGSSVSVRIDGHQSVPAGNEEALAKAVAHQPVSV 271
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
+DA G+AF FY GV DCG+ DHGVAVVG+G AEE+ G +YW++KNSWG WGE G
Sbjct: 272 AIDAGGQAFQFYSEGVFTGDCGSELDHGVAVVGYGVAEED-GKEYWIVKNSWGPGWGEHG 330
Query: 324 YIRILRDA----GLCGIATAASYPV 344
Y+R+ RD+ GLCGIA ASYPV
Sbjct: 331 YVRMQRDSGVDGGLCGIAMEASYPV 355
>gi|226503129|ref|NP_001149806.1| LOC100283433 precursor [Zea mays]
gi|195634783|gb|ACG36860.1| xylem cysteine proteinase 2 precursor [Zea mays]
gi|219884977|gb|ACL52863.1| unknown [Zea mays]
Length = 377
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 163/317 (51%), Positives = 210/317 (66%), Gaps = 20/317 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V E+W+A++ + Y EK R +FK NL +I++AN++ +Y LG N F+DLT++
Sbjct: 68 LVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLTHD 127
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV----PTSIDWREKGAVTHIKDQGQCG 153
EF+A Y G S R F+Y V D P S+DWR+KGAVT +K+QGQCG
Sbjct: 128 EFKATYLGLLPKRTSGGR-------FRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCG 180
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
SCWAFS VAAVEGI QI G L LSEQQLVDCSTD N+GCSGG+MD AF +I GL
Sbjct: 181 SCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGAGLR 240
Query: 213 TEADYPYRHEEGTCDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
+E YPY EEG CD++ ++ V TIS YED+P DEQAL++A+++QPVSV ++ASGR
Sbjct: 241 SEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRH 300
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
F FY GV + CG+ DHGVA VG+G+++ G Y ++KNSWG WGE GYIR+ R
Sbjct: 301 FQFYSGGVFDGPCGSELDHGVAAVGYGSSK---GQDYIIVKNSWGTHWGEKGYIRMKRGT 357
Query: 332 ----GLCGIATAASYPV 344
GLCGI ASYP
Sbjct: 358 GKPEGLCGINKMASYPT 374
>gi|2511693|emb|CAB17076.1| cysteine proteinase precursor [Phaseolus vulgaris]
Length = 455
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 162/347 (46%), Positives = 213/347 (61%), Gaps = 21/347 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEK--------HEQWMAQHGRTYKDELEKAMRLN 64
+F + L ++S + H+ + +E+W+ +HG+ Y EK R
Sbjct: 3 LFALFALSSALDMSIISYDNAHQDKATWRTDEEVNSLYEEWLVKHGKLYNALGEKDKRFQ 62
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
IFK NL +I++ N E NRTYKLG N F+DLTNEE+RA Y G + R PS
Sbjct: 63 IFKDNLRFIDQQNAE-NRTYKLGLNRFADLTNEEYRARYLG--TKIDPNRRLGRTPSNRY 119
Query: 125 YQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
V + +P S+DWR++GAV +KDQ CGSCWAFSA+ AVEGI +I G LI LSEQ+L
Sbjct: 120 APRVGETLPDSVDWRKEGAVVPVKDQASCGSCWAFSAIGAVEGINKIVTGDLISLSEQEL 179
Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
VDC T N GC+GGLMD AFE+II+N G+ +E DYPY+ +G CD ++ A +I YE
Sbjct: 180 VDCDTGYNMGCNGGLMDYAFEFIIKNGGIDSEEDYPYKGVDGRCDEYRKNAKVVSIDGYE 239
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
D+ DE AL +AV+NQPVSV V+ GR F Y SGV CG DHGV VG+GT
Sbjct: 240 DVNTYDELALKKAVANQPVSVAVEGGGREFQLYSSGVFTGRCGTALDHGVVAVGYGT--- 296
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
+NG +W+++NSWG WGE GYIR+ R+ +G CGIA SYP+
Sbjct: 297 DNGHDFWIVRNSWGADWGEEGYIRLERNLGNSRSGKCGIAIEPSYPI 343
>gi|414589857|tpg|DAA40428.1| TPA: Vignain [Zea mays]
Length = 377
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 157/328 (47%), Positives = 204/328 (62%), Gaps = 22/328 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E+ EQWM +HGR Y D EK RL ++++N+E +E N GN Y+L N+F+DLTNE
Sbjct: 50 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN-GYRLADNKFADLTNE 108
Query: 98 EFRALYTGYNRPVPSV-SRQSSRPST--------FKYQNVTDVPTSIDWREKGAVTHIKD 148
EFRA G+ RP + S+ PST Q +D+P S+DWREKGAV +K
Sbjct: 109 EFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVKS 168
Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIEN 208
QG CGSCWAFSAVAA+EGI QI GKL+ LSEQ+LVDC T GC+GG M AFE++++N
Sbjct: 169 QGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVMKN 228
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
+GL TE +YPY+ G C K K A +IS Y ++ E LL+A + QPVSV VDA
Sbjct: 229 RGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVDAG 288
Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN--------GAKYWLIKNSWGETWG 320
+ Y GV C +HGV VVG+G + + G KYW++KNSWG WG
Sbjct: 289 SFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPEWG 348
Query: 321 ESGYIRILRDA----GLCGIATAASYPV 344
++GYI + R+A GLCGIA SYPV
Sbjct: 349 DAGYILMQREASVASGLCGIAMLPSYPV 376
>gi|226507844|ref|NP_001148894.1| LOC100282514 precursor [Zea mays]
gi|194703250|gb|ACF85709.1| unknown [Zea mays]
gi|195622994|gb|ACG33327.1| vignain precursor [Zea mays]
Length = 356
Score = 311 bits (798), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 157/328 (47%), Positives = 204/328 (62%), Gaps = 22/328 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E+ EQWM +HGR Y D EK RL ++++N+E +E N GN Y+L N+F+DLTNE
Sbjct: 29 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVELVETFNSMGN-GYRLADNKFADLTNE 87
Query: 98 EFRALYTGYNRPVPSV-SRQSSRPST--------FKYQNVTDVPTSIDWREKGAVTHIKD 148
EFRA G+ RP + S+ PST Q +D+P S+DWREKGAV +K
Sbjct: 88 EFRAKMLGFGRPRSGGGAGHSTAPSTVACIGSGLMGRQGYSDLPKSVDWREKGAVAPVKS 147
Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIEN 208
QG CGSCWAFSAVAA+EGI QI GKL+ LSEQ+LVDC T GC+GG M AFE++++N
Sbjct: 148 QGDCGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVMKN 207
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
+GL TE +YPY+ G C K K A +IS Y ++ E LL+A + QPVSV VDA
Sbjct: 208 RGLTTERNYPYQGLNGACQTPKLKESAVSISGYMNVTPSSEPDLLRAAAAQPVSVAVDAG 267
Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN--------GAKYWLIKNSWGETWG 320
+ Y GV C +HGV VVG+G + + G KYW++KNSWG WG
Sbjct: 268 SFVWQLYGGGVFTGPCTAELNHGVTVVGYGETQGDTDGDGSGVPGKKYWIVKNSWGPEWG 327
Query: 321 ESGYIRILRDA----GLCGIATAASYPV 344
++GYI + R+A GLCGIA SYPV
Sbjct: 328 DAGYILMQREASVASGLCGIAMLPSYPV 355
>gi|37780049|gb|AAP32197.1| cysteine protease 10 [Trifolium repens]
Length = 272
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 151/271 (55%), Positives = 191/271 (70%), Gaps = 11/271 (4%)
Query: 81 NRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
N+ YKLG N+F+DLTNEEF+A N+ + R +TFKY+N + +P+++DWR+K
Sbjct: 7 NKLYKLGINKFADLTNEEFKA---SRNKFKGHMCSSIIRTTTFKYENASAIPSTVDWRKK 63
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLM 198
GAVT +K+QGQCGSCWAFSAVAA EGI Q++ GKL+ LSEQ+L+DC T + GC GGLM
Sbjct: 64 GAVTPVKNQGQCGSCWAFSAVAATEGIHQLSTGKLVSLSEQELIDCDTKGVDQGCEGGLM 123
Query: 199 DKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
D AF++II+N GL+TE YPY +GTC+ + A TI+ YED+P +E AL +AV+N
Sbjct: 124 DDAFKFIIQNHGLSTEVQYPYEGVDGTCNTNEASIHAVTITGYEDVPANNELALQKAVAN 183
Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
QP+SV +DASG F FY SGV CG DHGV VG+G + G KYWL+KNSWG
Sbjct: 184 QPISVAIDASGSDFQFYNSGVFTGSCGTELDHGVTAVGYGVGND--GTKYWLVKNSWGAD 241
Query: 319 WGESGYIRILR--DA--GLCGIATAASYPVA 345
WGE GYIR+ R DA GLCGIA ASYP A
Sbjct: 242 WGEEGYIRMQRGIDAAEGLCGIAMQASYPTA 272
>gi|3980198|emb|CAA46863.1| thiolprotease [Pisum sativum]
Length = 464
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 164/363 (45%), Positives = 224/363 (61%), Gaps = 30/363 (8%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRT 52
M+ K FI F + + + C ++S H ++ +E+W+ +HG+
Sbjct: 1 MLSKLTILFITLTFTLSLALDMC---IISYDKTHPDKSTPRTNDQVLTMYEEWLVKHGKN 57
Query: 53 YKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGY----NR 108
Y EK R IFK NL +I++ N + N +++LG N F+DLTNEE+R + G NR
Sbjct: 58 YNALGEKEKRFEIFKDNLGFIDEHNSK-NLSFRLGLNRFADLTNEEYRTRFLGTRINPNR 116
Query: 109 PVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGI 167
V+ Q++R +T V D +P S+DWR++GAV +KDQG CGSCWAFSA+AAVEG+
Sbjct: 117 RNRKVNSQTNRYAT----RVGDKLPESVDWRKEGAVVGVKDQGSCGSCWAFSAIAAVEGV 172
Query: 168 TQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC 226
++ G LI LSEQ+LVDC T N GC+GGLMD AFE+II L E DYPYR +G C
Sbjct: 173 NKLATGDLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINMVALTPEEDYPYRAIDGRC 232
Query: 227 DNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGN 286
D ++ A +I +YED+P DE AL +AV+NQ ++V V+ GR F Y SGV CG
Sbjct: 233 DQNRKNAKVVSIDQYEDVPAYDEGALKKAVANQVIAVAVEGGGREFQLYDSGVFTGRCGT 292
Query: 287 NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCGIATAAS 341
DHGVA VG+GT ENG YW+++NSWG +WGE+GYIR+ R+ +G CGIA S
Sbjct: 293 ALDHGVAAVGYGT---ENGKDYWIVRNSWGGSWGEAGYIRLERNLATSKSGKCGIAIEPS 349
Query: 342 YPV 344
YP+
Sbjct: 350 YPI 352
>gi|115441717|ref|NP_001045138.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|5761329|dbj|BAA83473.1| cysteine endopeptidase [Oryza sativa]
gi|20804884|dbj|BAB92565.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|56785107|dbj|BAD82745.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113534669|dbj|BAF07052.1| Os01g0907600 [Oryza sativa Japonica Group]
gi|119395242|gb|ABL74582.1| cysteine endopeptidase [Oryza sativa Japonica Group]
gi|125528777|gb|EAY76891.1| hypothetical protein OsI_04850 [Oryza sativa Indica Group]
gi|125573036|gb|EAZ14551.1| hypothetical protein OsJ_04473 [Oryza sativa Japonica Group]
Length = 371
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 157/330 (47%), Positives = 208/330 (63%), Gaps = 11/330 (3%)
Query: 23 CASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN 81
CA+ R + + ++ + +E+W H + EK R FK N+ YI + NK G
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRGG 84
Query: 82 RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREK 140
R Y+L N F D+ EEFRA + G + ++ P F Y+ V D+P ++DWR K
Sbjct: 85 RGYRLRLNRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRK 144
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMD 199
GAVT +KDQG+CGSCWAFS V +VEGI I G+L+ LSEQ+L+DC T DN GC GGLM+
Sbjct: 145 GAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLME 204
Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDN-QKEKAVAATISKYEDLPKGDEQALLQAVSN 258
AFEYI + G+ TE+ YPYR GTCD + +A I ++++P E AL +AV+N
Sbjct: 205 NAFEYIKHSGGITTESAYPYRAANGTCDAVRARRAPLVVIDGHQNVPANSEAALAKAVAN 264
Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
QPVSV +DA ++F FY GV DCG + DHGVAVVG+G E +G +YW++KNSWG
Sbjct: 265 QPVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYG--ETNDGTEYWIVKNSWGTA 322
Query: 319 WGESGYIRILRDA----GLCGIATAASYPV 344
WGE GYIR+ RD+ GLCGIA ASYPV
Sbjct: 323 WGEGGYIRMQRDSGYDGGLCGIAMEASYPV 352
>gi|413942348|gb|AFW74997.1| Xylem cysteine proteinase 2 [Zea mays]
Length = 391
Score = 311 bits (797), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 163/317 (51%), Positives = 210/317 (66%), Gaps = 20/317 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V E+W+A++ + Y EK R +FK NL +I++AN++ +Y LG N F+DLT++
Sbjct: 82 LVRLFEEWVAKYRKAYGSFEEKLRRFEVFKDNLHHIDEANRKEVTSYWLGLNAFADLTHD 141
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV----PTSIDWREKGAVTHIKDQGQCG 153
EF+A Y G S R F+Y V D P S+DWR+KGAVT +K+QGQCG
Sbjct: 142 EFKATYLGLLPKRTSGGR-------FRYGGVGDGGDEVPASVDWRKKGAVTEVKNQGQCG 194
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
SCWAFS VAAVEGI QI G L LSEQQLVDCSTD N+GCSGG+MD AF +I GL
Sbjct: 195 SCWAFSTVAAVEGINQIVTGNLTSLSEQQLVDCSTDGNNGCSGGVMDNAFSFIATGAGLR 254
Query: 213 TEADYPYRHEEGTCDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
+E YPY EEG CD++ ++ V TIS YED+P DEQAL++A+++QPVSV ++ASGR
Sbjct: 255 SEEAYPYLMEEGDCDDRARDGEVLVTISGYEDVPANDEQALVKALAHQPVSVAIEASGRH 314
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
F FY GV + CG+ DHGVA VG+G+++ G Y ++KNSWG WGE GYIR+ R
Sbjct: 315 FQFYSGGVFDGPCGSELDHGVAAVGYGSSK---GQDYIIVKNSWGTHWGEKGYIRMKRGT 371
Query: 332 ----GLCGIATAASYPV 344
GLCGI ASYP
Sbjct: 372 GKPEGLCGINKMASYPT 388
>gi|242066206|ref|XP_002454392.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
gi|241934223|gb|EES07368.1| hypothetical protein SORBIDRAFT_04g029960 [Sorghum bicolor]
Length = 356
Score = 311 bits (796), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 156/348 (44%), Positives = 223/348 (64%), Gaps = 18/348 (5%)
Query: 11 IPMFVIIILVITCASQ-----VVSGRSMHEPSI----VEKHEQWMAQHGRTYKDELEKAM 61
+P+ V+ + C++ V G S + ++ V + W +H + Y EK
Sbjct: 5 LPVLVLFLAFAACSASHHRDPSVVGYSQEDLALPNRLVNLFKSWSVKHRKIYVSPKEKLK 64
Query: 62 RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
R IFKQNL +I + N++ N +Y LG N+F+D+T+EEF+A + G + + + Q+ P+
Sbjct: 65 RYGIFKQNLMHIAETNRK-NGSYWLGLNQFADITHEEFKANHLGLKQGLSRMGAQTRTPT 123
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
TF+Y ++P S+DWR KGAVT +K+QG+CGSCWAFS+VAAVEGI QI GKL+ LSEQ
Sbjct: 124 TFRYAAAANLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVSLSEQ 183
Query: 182 QLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
+L+DC T +HGC GGLMD AF YI+ ++G+ E DYPY EEG C ++ A TI+
Sbjct: 184 ELMDCDTMLDHGCEGGLMDFAFAYIMGSQGIHAEDDYPYLMEEGYCKEKQPYANVVTITG 243
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
YED+P+ E +LL+A+++QPVSV + A R F FYK GV + C + DH + VG+G++
Sbjct: 244 YEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYKGGVFDGSCSDELDHALTAVGYGSS 303
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRIL----RDAGLCGIATAASYPV 344
+N Y +KNSWG+ WGE GY+RI + G+CGI T ASYPV
Sbjct: 304 YGQN---YITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPV 348
>gi|118127|sp|P25251.1|CYSP4_BRANA RecName: Full=Cysteine proteinase COT44; Flags: Precursor
Length = 328
Score = 311 bits (796), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 157/315 (49%), Positives = 207/315 (65%), Gaps = 19/315 (6%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
+W +HG++ + ++ R NIFK NL +I+ N+ N TYKLG F++LTN+E
Sbjct: 6 RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDE 65
Query: 99 FRALYTG-YNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGS 154
+R+LY G PV +++ ++ KY NV +VP ++DWR+KGAV IKDQG CGS
Sbjct: 66 YRSLYLGARTEPVRRITK--AKNVNMKYSAAVNVDEVPVTVDWRQKGAVNAIKDQGTCGS 123
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
CWAFS AAVEGI +I G+L+ LSEQ+LVDC N GC+GGLMD AF++I++N GL T
Sbjct: 124 CWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNT 183
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
E DYPY G C++ + + TI YED+P DE AL +AVS QPVSV +DA GRAF
Sbjct: 184 EKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQ 243
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
Y+SG+ CG N DH V VG+G+ ENG YW+++NSWG WGE GYIR+ R+
Sbjct: 244 HYQSGIFTGKCGTNMDHAVVAVGYGS---ENGVDYWIVRNSWGTRWGEDGYIRMERNVAS 300
Query: 331 -AGLCGIATAASYPV 344
+G CGIA ASYPV
Sbjct: 301 KSGKCGIAIEASYPV 315
>gi|121308860|dbj|BAF43527.1| cysteine proteinase [Zinnia elegans]
Length = 352
Score = 310 bits (795), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 152/318 (47%), Positives = 212/318 (66%), Gaps = 14/318 (4%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
S+H+ ++ E W+ +H + Y+ EK R IF NL++I++ NK+ + Y LG NEF
Sbjct: 41 SIHK--VIHLFESWLVKHSKFYESLDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEF 97
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+DLT+EEF+ + G+ + +SS+ F Y++ D+P S+DWR+KGAV +K+QGQ
Sbjct: 98 ADLTHEEFKHKFLGFKGELAERKDESSK--EFGYRDFVDLPKSVDWRKKGAVAPVKNQGQ 155
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
CG+CWAFS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF Y++ + G
Sbjct: 156 CGNCWAFSTVAAVEGINQIVTGNLTMLSEQELIDCDTTFNNGCNGGLMDYAFAYVMRS-G 214
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
L E +YPY EGTCD +K+ + TIS Y D+P+ DE + L+A++NQP+SV ++ASGR
Sbjct: 215 LHKEEEYPYIMSEGTCDEKKDVSEKVTISGYHDVPRNDEASFLKALANQPISVAIEASGR 274
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
F FY GV + CG DHGVA VG+GT + G Y +++NSWG WGE GYIR+ R
Sbjct: 275 DFQFYSGGVFDGHCGTELDHGVAAVGYGTTK---GLDYVIVRNSWGPKWGEKGYIRMKRG 331
Query: 331 A----GLCGIATAASYPV 344
+ G+CG+ ASYP
Sbjct: 332 SGKPHGMCGLYMMASYPT 349
>gi|242092704|ref|XP_002436842.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
gi|241915065|gb|EER88209.1| hypothetical protein SORBIDRAFT_10g009850 [Sorghum bicolor]
Length = 296
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 152/316 (48%), Positives = 203/316 (64%), Gaps = 30/316 (9%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V +HEQWM Q+ R YKD EKA R +FK N+++IE N GNR + LG N+F+DLTN+
Sbjct: 1 MVARHEQWMVQYSRVYKDATEKAQRFEVFKSNVKFIESFNAGGNRKFWLGVNQFADLTND 60
Query: 98 EFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGS 154
EFRA T +P P P+ F+Y+N++ +P +IDWR KGAVT IKDQGQC
Sbjct: 61 EFRATKTNKGFKPSP-----VKVPTGFRYENISVDALPATIDWRTKGAVTPIKDQGQC-- 113
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLA 212
EGI +I+ GKLI LSEQ+LVDC ++ GC GGLMD AF++II+ GL
Sbjct: 114 ----------EGIVKISTGKLISLSEQELVDCDVHGEDQGCEGGLMDDAFKFIIKKGGLT 163
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE+ YPY +G C + AT+ +ED+P DE +L++AV+NQPVSV VD F
Sbjct: 164 TESSYPYTAADGKCKSGSNS--VATVKGFEDVPANDEASLMKAVANQPVSVAVDGGDMTF 221
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
FY GV+ CG + DHG+A +G+G + +G KYWL+KNSWG TWGE+GY+R+ +D
Sbjct: 222 QFYSGGVMTGSCGTDLDHGIAAIGYG--QTSDGTKYWLLKNSWGTTWGENGYLRMEKDIS 279
Query: 332 ---GLCGIATAASYPV 344
G+CG+A SYP
Sbjct: 280 DKRGMCGLAMEPSYPT 295
>gi|109939734|sp|P25776.2|ORYA_ORYSJ RecName: Full=Oryzain alpha chain; Flags: Precursor
gi|78192122|gb|ABB30151.1| oryzain alpha [Oryza sativa Japonica Group]
Length = 458
Score = 310 bits (795), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 158/327 (48%), Positives = 208/327 (63%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG++Y E+ R F+ NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 84 YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R Y G N+P R+ + + +P S+DWR KGA
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V IKDQG CGSCWAFSA+AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F++II N G+ TE DYPY+ ++ CD ++ A TI YED+ E +L +AV+NQPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 260
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV ++A GRAF Y SG+ CG DHGVA VG+GT ENG YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317
Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
SGY+R+ R+ +G CGIA SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344
>gi|242055323|ref|XP_002456807.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
gi|241928782|gb|EES01927.1| hypothetical protein SORBIDRAFT_03g043220 [Sorghum bicolor]
Length = 369
Score = 310 bits (795), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 154/352 (43%), Positives = 216/352 (61%), Gaps = 14/352 (3%)
Query: 2 VLKFEKSFIIPMFVIIILVITCASQVVSGRSM-HEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ + K+ ++ V + V C + R + + ++ + +E+W H EK
Sbjct: 1 MAQLAKTLLLVALVAMSAVELCRAIEFDERDLASDEALWDLYERWQTHHHVHRHHG-EKG 59
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R FK+N+ +I NK G+R Y+L N F D+ EEFR+ + + + + R S
Sbjct: 60 RRFGTFKENVRFIHAHNKRGDRPYRLSLNRFGDMGREEFRSTFA--DSRINDLRRAESPA 117
Query: 121 ST----FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
+ F Y VTD+P S+DWR++GAVT +KDQG CGSCWAFS V +VEGI I G L+
Sbjct: 118 APAVPGFMYDGVTDLPPSVDWRKEGAVTAVKDQGHCGSCWAFSTVVSVEGINAIRTGSLV 177
Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDN-QKEKAVA 235
LSEQ+L+DC TD +GC GGLM+ AFE+I G+ TE+ YPYR GTCD+ + +
Sbjct: 178 SLSEQELIDCDTDENGCQGGLMENAFEFIKSYGGVTTESAYPYRASNGTCDSVRSRRGQI 237
Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
+I ++ +P G E AL +AV+NQPVSV +DA G+AF FY GV DCG + DHGVA V
Sbjct: 238 VSIDGHQMVPTGSEDALAKAVANQPVSVAIDAGGQAFQFYSEGVFTGDCGTDLDHGVAAV 297
Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
G+G +++ G YW++KNSWG +WGE GYIR+ R A GLCGIA AS+P+
Sbjct: 298 GYGVSDD--GTAYWIVKNSWGPSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 347
>gi|414870137|tpg|DAA48694.1| TPA: vignain [Zea mays]
Length = 484
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 159/313 (50%), Positives = 198/313 (63%), Gaps = 16/313 (5%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+W +H +D +KA R N+FK N+ I + N+ + YKL N F D+T +EFR
Sbjct: 156 YERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFRR 213
Query: 102 LYTGYNRPVPSVSR-----QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
Y G + R S+ S+F Y + DVP S+DWR+KGAVT +KDQGQCGSCW
Sbjct: 214 HYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCW 273
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
AFS +AAVEGI I L LSEQQLVDC T N GC+GGLMD AF+YI ++ G+A E
Sbjct: 274 AFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAED 333
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
YPYR + +C +K A TI YED+P DE AL +AV++QPVSV ++ASG F FY
Sbjct: 334 AYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFY 391
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---- 331
GV + CG DHGVA VG+G +G KYWL+KNSWG WGE GYIR+ RD
Sbjct: 392 SEGVFSGRCGTELDHGVAAVGYGVT--ADGTKYWLVKNSWGPEWGEKGYIRMARDVAAKE 449
Query: 332 GLCGIATAASYPV 344
G CGIA ASYPV
Sbjct: 450 GHCGIAMEASYPV 462
>gi|222629675|gb|EEE61807.1| hypothetical protein OsJ_16426 [Oryza sativa Japonica Group]
Length = 459
Score = 310 bits (794), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 158/327 (48%), Positives = 208/327 (63%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG++Y E+ R F+ NL YI++ N G +
Sbjct: 26 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 85
Query: 84 YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R Y G N+P R+ + + +P S+DWR KGA
Sbjct: 86 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 141
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V IKDQG CGSCWAFSA+AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 142 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 201
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F++II N G+ TE DYPY+ ++ CD ++ A TI YED+ E +L +AV+NQPV
Sbjct: 202 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 261
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV ++A GRAF Y SG+ CG DHGVA VG+GT ENG YW+++NSWG++WGE
Sbjct: 262 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 318
Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
SGY+R+ R+ +G CGIA SYP+
Sbjct: 319 SGYVRMERNIKASSGKCGIAVEPSYPL 345
>gi|2224812|emb|CAB09699.1| cysteine endopeptidase EP-A [Hordeum vulgare subsp. vulgare]
Length = 365
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 158/320 (49%), Positives = 212/320 (66%), Gaps = 16/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYK----DELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
E S+ +E+W + + + + D E+ R N+FKQN Y+ + NK + ++L N+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAEER--RFNVFKQNARYVHEGNKR-DMPFRLALNK 90
Query: 91 FSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
F+D+T +EFR Y G R S+S F+Y + ++P ++DWR+KGAVT IKDQ
Sbjct: 91 FADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIEN 208
GQCGSCWAFS + AVEGI +I GKL+ LSEQ+L+DC + +N GC GGLMD AF++I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
G+ TE++YPY+ E+G+CD KE A A TI YED+P DE AL +AV+ QPVSV +DAS
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269
Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
G+ F FY GV +C + DHGVA VG+G + G KYW++KNSWGE WGE GYIR+
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRD--GTKYWIVKNSWGEDWGEKGYIRMQ 327
Query: 329 RDA----GLCGIATAASYPV 344
R GLCGIA ASYP
Sbjct: 328 RGVSQTEGLCGIAMQASYPT 347
>gi|4100157|gb|AAD10337.1| cysteine proteinase precursor [Hordeum vulgare]
Length = 365
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 158/320 (49%), Positives = 212/320 (66%), Gaps = 16/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYK----DELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
E S+ +E+W + + + + D E+ R N+FKQN Y+ + NK + ++L N+
Sbjct: 34 EESLRGLYERWRSHYTVSRRGLGADAGER--RFNVFKQNARYVHEGNKR-DMPFRLALNK 90
Query: 91 FSDLTNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
F+D+T +EFR Y G R S+S F+Y + ++P ++DWR+KGAVT IKDQ
Sbjct: 91 FADMTTDEFRRTYAGSRVRHHLSLSGGRRGDGGFRYGDADNLPPAVDWRQKGAVTAIKDQ 150
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIEN 208
GQCGSCWAFS + AVEGI +I GKL+ LSEQ+L+DC + +N GC GGLMD AF++I +N
Sbjct: 151 GQCGSCWAFSTIVAVEGINKIRTGKLVSLSEQELMDCDNVNNQGCDGGLMDYAFQFIQKN 210
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
G+ TE++YPY+ E+G+CD KE A A TI YED+P DE AL +AV+ QPVSV +DAS
Sbjct: 211 -GITTESNYPYQGEQGSCDQAKENAQAVTIDGYEDVPANDESALQKAVAGQPVSVAIDAS 269
Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
G+ F FY GV +C + DHGVA VG+G + G KYW++KNSWGE WGE GYIR+
Sbjct: 270 GQDFQFYSEGVFTGECSTDLDHGVAAVGYGATRD--GTKYWIVKNSWGEDWGEKGYIRMQ 327
Query: 329 RDA----GLCGIATAASYPV 344
R GLCGIA ASYP
Sbjct: 328 RGVSQTEGLCGIAMQASYPT 347
>gi|3451077|emb|CAA20473.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|7269200|emb|CAB79307.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 355
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 154/348 (44%), Positives = 226/348 (64%), Gaps = 27/348 (7%)
Query: 13 MFVIIILVITCASQVVS--------GRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRL 63
+F++I+ V++ S + RS E + + WM++HG+TY + L EK R
Sbjct: 12 LFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFI--FQMWMSKHGKTYTNALGEKERRF 69
Query: 64 NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
FK NL +I++ N + N +Y+LG F+DLT +E+R L+ G +P +Q + ++
Sbjct: 70 QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPGSPKP-----KQRNLKTSR 123
Query: 124 KYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
+Y + +P S+DWR++GAV+ IKDQG C SCWAFS VAAVEG+ +I G+LI LSEQ
Sbjct: 124 RYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQ 183
Query: 182 QLVDCSTDNHGCSG-GLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
+LVDC+ N+GC G GLMD AF+++I N GL +E DYPY+ +G+C+ ++ + TI
Sbjct: 184 ELVDCNLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQVHLLVITIDS 243
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
YED+P DE +L +AV++QPVSV VD + F Y+S + N CG N DH + +VG+G+
Sbjct: 244 YEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGS- 302
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
ENG YW+++NSWG TWG++GYI+I R+ GLCGIA ASYP+
Sbjct: 303 --ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 348
>gi|297802228|ref|XP_002868998.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
gi|297314834|gb|EFH45257.1| cysteine proteinase [Arabidopsis lyrata subsp. lyrata]
Length = 375
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 160/316 (50%), Positives = 206/316 (65%), Gaps = 20/316 (6%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGTNEFSDLTNEE 98
QW A HG+T + ++ R NIFK NL +I+ N K N TYKLG +F+DLTNEE
Sbjct: 51 QWSADHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEKNKNATYKLGLTKFTDLTNEE 110
Query: 99 FRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTD---VPTSIDWREKGAVTHIKDQGQCGS 154
+R+LY G PV +++ ++ KY D VP ++DWR KGAV IKDQG CGS
Sbjct: 111 YRSLYLGARTEPVRRIAK--AKNVNQKYSAAVDGKEVPETVDWRLKGAVNPIKDQGTCGS 168
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
CWAFS AAVEGI +I G+LI LSEQ+LVDC N GC+GGLMD AF++I++N GL T
Sbjct: 169 CWAFSTAAAVEGINKIVTGELISLSEQELVDCDNSYNQGCNGGLMDYAFQFIMKNGGLKT 228
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
E DYPYR G C++ + A +I YED+P DE AL +A+S QPVSV ++A GR F
Sbjct: 229 EKDYPYRGFGGKCNSFLKNAKVVSIDGYEDVPTKDETALKRAISLQPVSVAIEAGGRIFQ 288
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
Y++G+ +CG N DH V VG+G+ ENG YW+++NSWG WGE GYIR+ R+
Sbjct: 289 HYQTGIFTGNCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAS 345
Query: 331 --AGLCGIATAASYPV 344
+G CGIA ASYPV
Sbjct: 346 SKSGKCGIAVEASYPV 361
>gi|255646088|gb|ACU23531.1| unknown [Glycine max]
Length = 362
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 158/320 (49%), Positives = 206/320 (64%), Gaps = 18/320 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
E S + +E+W + RT L +K R N+FK N+ ++ NK ++ YKL N+F+D
Sbjct: 33 EESFWDLYERWRSY--RTVSRSLGDKHKRFNVFKANVMHVHNTNKM-DKPYKLKLNKFAD 89
Query: 94 LTNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
+TN EFR+ Y G ++R R + TF Y+ V VP S DWR+ GAVT +KDQ
Sbjct: 90 MTNHEFRSTYAGSKVNHHRMFQGTPRGNG---TFMYEKVGSVPPSADWRKNGAVTGVKDQ 146
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIEN 208
GQCGSCWAFS V AVEGI QI KL+ LSEQ+LVDC T N GC+GGLM+ AFE+I +
Sbjct: 147 GQCGSCWAFSTVVAVEGINQIKTNKLVSLSEQELVDCDTKKNAGCNGGLMESAFEFIKQK 206
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
G+ TE++YPY ++GTCD K +A +I +E++P DE ALL+AV+NQPVSV +DA
Sbjct: 207 GGITTESNYPYTAQDGTCDASKANDLAVSIDGHENVPANDENALLKAVANQPVSVAIDAG 266
Query: 269 GRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR-- 326
G F FY GV DC +HGVA+VG+GT + G YW ++NSWG WGE GYIR
Sbjct: 267 GFDFQFYFEGVFTGDCSTELNHGVAIVGYGTTVD--GTNYWTVRNSWGPEWGEQGYIRMQ 324
Query: 327 --ILRDAGLCGIATAASYPV 344
I + GLCGIA ASYP+
Sbjct: 325 RSIFKKEGLCGIAMMASYPI 344
>gi|226504984|ref|NP_001151293.1| cysteine protease 1 precursor [Zea mays]
gi|195645596|gb|ACG42266.1| cysteine protease 1 precursor [Zea mays]
Length = 340
Score = 310 bits (794), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 162/342 (47%), Positives = 222/342 (64%), Gaps = 18/342 (5%)
Query: 14 FVIIILVI----TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
F++ +LV+ C + + ++ +HE+WMA+HGR YKDE EKA RL +F+ N
Sbjct: 6 FLLAVLVVGSAVLCTAAAPRALAAAAAAMASRHEKWMAEHGRAYKDEAEKARRLEVFRAN 65
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-- 127
E I+ N G +++L TN F+DLT +EFRA TG RP P+ S + R F+Y+N
Sbjct: 66 AELIDSFNAAGTHSHRLATNRFADLTVQEFRAARTGL-RPRPAPSAGAGR---FRYENFS 121
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC- 186
+ D S+DWR GAVT +KDQG G CWAFSAVAAVEG+ +I G+L+ LSEQ+LVDC
Sbjct: 122 LADAAQSVDWRAMGAVTGVKDQGASGCCWAFSAVAAVEGLNKIRTGRLVSLSEQELVDCD 181
Query: 187 -STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
S + GC GGLMD AF+++ GLA+E+ YPY+ +G C A AA+I +ED+P
Sbjct: 182 VSGVDQGCDGGLMDNAFQFVARRGGLASESGYPYQCRDGPC-RSSAAAAAASIRGHEDVP 240
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+ +E AL AV++QPVSV ++ AF FY SGVL CG + +H + VG+GTA + G
Sbjct: 241 RNNEAALAAAVAHQPVSVAINGEDMAFRFYDSGVLGGACGTDLNHAITAVGYGTAAD--G 298
Query: 306 AKYWLIKNSWGETWGESGYIRI---LRDAGLCGIATAASYPV 344
+YWL+KNSWG +WGE GY+RI +R G+CG+A SYPV
Sbjct: 299 TRYWLMKNSWGASWGEGGYVRIRRGVRGEGVCGLAKLPSYPV 340
>gi|449525012|ref|XP_004169515.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 459
Score = 310 bits (793), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 161/348 (46%), Positives = 223/348 (64%), Gaps = 16/348 (4%)
Query: 5 FEKSFIIPMFVIIILVITCAS--QVVSGRSMHEPSIVEKHEQWMAQHGRTYKD-ELEKAM 61
F+ S I+ + + + ++ AS ++ R+ E ++ ++QW A+HG+ + + E
Sbjct: 4 FQSSPIMALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPEN 61
Query: 62 RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
R +IFK NL++I++ N + N Y+LG N F+DLTNEE+R+ Y G S SR++ +
Sbjct: 62 RFHIFKDNLKFIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLG--GKFASGSRRNRTSN 118
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
+ + D+P SIDWR KGAV +KDQG CGSCWAFS VA+VE I QI G LI LSEQ
Sbjct: 119 RYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQ 178
Query: 182 QLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
+LVDC N GC+GGLMD AFE+IIEN GL TE DYPY + +C K+ A I
Sbjct: 179 ELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNAKVVAIDS 238
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
YED+P +E+AL +AVS Q VSV ++ GR+F Y+SG+ CG + DHGV VVG+G+
Sbjct: 239 YEDVPVNNEKALQKAVSKQVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGYGS- 297
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
E G YW+++NSWG +WGESGY+++ R+ GLCGIA SYP
Sbjct: 298 --EGGVDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 343
>gi|218195711|gb|EEC78138.1| hypothetical protein OsI_17694 [Oryza sativa Indica Group]
Length = 458
Score = 310 bits (793), Expect = 8e-82, Method: Compositional matrix adjust.
Identities = 158/327 (48%), Positives = 207/327 (63%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG+ Y E+ R F+ NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 84 YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R Y G N+P R+ + + +P S+DWR KGA
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V IKDQG CGSCWAFSA+AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F++II N G+ TE DYPY+ ++ CD ++ A TI YED+ E +L +AV+NQPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 260
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV ++A GRAF Y SG+ CG DHGVA VG+GT ENG YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317
Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
SGY+R+ R+ +G CGIA SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344
>gi|242032709|ref|XP_002463749.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
gi|241917603|gb|EER90747.1| hypothetical protein SORBIDRAFT_01g005350 [Sorghum bicolor]
Length = 381
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 152/320 (47%), Positives = 206/320 (64%), Gaps = 12/320 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSD 93
+ ++ + +E+W H R ++ EK R FK+N+ +I NK G+R +Y+L N F D
Sbjct: 39 DEALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENVRFIHAHNKRGDRPSYRLRLNRFGD 97
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPST----FKYQNVTDVPTSIDWREKGAVTHIKDQ 149
+ EEFR+ + R+SS +T F Y + TDVP S+DWR+ GAVT +K+Q
Sbjct: 98 MGPEEFRSTFADSRINDLRRYRESSPAATAVPGFMYDDATDVPRSVDWRQHGAVTAVKNQ 157
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENK 209
G+CGSCWAFS V AVEGI I G L+ LSEQ+LVDC T +GC GGLM+ AF++I
Sbjct: 158 GRCGSCWAFSTVVAVEGINAIRTGSLVSLSEQELVDCDTAENGCQGGLMENAFDFIKSYG 217
Query: 210 GLATEADYPYRHEEGTCDNQKEK--AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
G+ TE+ YPYR GTCD + + V +I ++ +P G E AL +AV+ QPVSV +DA
Sbjct: 218 GITTESAYPYRASNGTCDGMRARRGRVHVSIDGHQMVPTGSEDALAKAVARQPVSVAIDA 277
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
G+AF FY GV DCG + DHGVAVVG+G ++ + G YW++KNSWG +WGE GYIR+
Sbjct: 278 GGQAFQFYSEGVFTGDCGTDLDHGVAVVGYGVSDVD-GTPYWIVKNSWGPSWGEGGYIRM 336
Query: 328 LRDA---GLCGIATAASYPV 344
R A GLCGIA AS+P+
Sbjct: 337 QRGAGNGGLCGIAMEASFPI 356
>gi|226507950|ref|NP_001151278.1| LOC100284911 precursor [Zea mays]
gi|195645488|gb|ACG42212.1| vignain precursor [Zea mays]
Length = 376
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 159/319 (49%), Positives = 202/319 (63%), Gaps = 15/319 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ +E+W +H +D +KA R N+FK N+ I + N+ + YKL N F D+
Sbjct: 42 EEALWALYERWRGRHALA-RDLGDKARRFNVFKANVRLIHEFNRR-DEPYKLRLNRFGDM 99
Query: 95 TNEEFRALYTG----YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG 150
T +EFR Y G ++R + SS ++F Y + DVP S+DWR+KGAVT +KDQG
Sbjct: 100 TADEFRRHYAGSRVAHHRMFRGDRQGSSASASFMYADARDVPASVDWRQKGAVTDVKDQG 159
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENK 209
QCGSCWAFS +AAVEGI I L LSEQQLVDC T N GC+GGLMD AF+YI ++
Sbjct: 160 QCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAKHG 219
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
G+A E YPYR + +C +K A TI YED+P DE AL +AV++QPVSV ++ASG
Sbjct: 220 GVAAEDAYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEASG 277
Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
F FY GV + CG DHGV VG+G +G KYWL+KNSWG WGE GYIR+ R
Sbjct: 278 SHFQFYSEGVFSGRCGTELDHGVTAVGYGVT--ADGTKYWLVKNSWGPEWGEKGYIRMAR 335
Query: 330 DA----GLCGIATAASYPV 344
D G CGIA ASYPV
Sbjct: 336 DVAAKEGHCGIAMEASYPV 354
>gi|46395939|sp|Q94B08.2|GCP1_ARATH RecName: Full=Germination-specific cysteine protease 1; Flags:
Precursor
gi|4006883|emb|CAB16767.1| cysteine proteinase [Arabidopsis thaliana]
gi|7270637|emb|CAB80354.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 158/315 (50%), Positives = 206/315 (65%), Gaps = 18/315 (5%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
QW A+HG+T + ++ R NIFK NL +I+ N++ N TYKLG +F+DLTN+E
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNEDNKNATYKLGLTKFTDLTNDE 110
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
+R LY G R P+ ++ KY N +VP ++DWR+KGAV IKDQG CGSC
Sbjct: 111 YRKLYLGA-RTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSC 169
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
WAFS AAVEGI +I G+LI LSEQ+LVDC N GC+GGLMD AF++I++N GL TE
Sbjct: 170 WAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTE 229
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
DYPYR G C++ + + +I YED+P DE AL +A+S QPVSV ++A GR F
Sbjct: 230 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQH 289
Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---- 330
Y+SG+ CG N DH V VG+G+ ENG YW+++NSWG WGE GYIR+ R+
Sbjct: 290 YQSGIFTGSCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346
Query: 331 -AGLCGIATAASYPV 344
+G CGIA ASYPV
Sbjct: 347 KSGKCGIAVEASYPV 361
>gi|414879123|tpg|DAA56254.1| TPA: hypothetical protein ZEAMMB73_708930 [Zea mays]
Length = 368
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 146/315 (46%), Positives = 205/315 (65%), Gaps = 12/315 (3%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
++ + +E+W H R ++ EK R FK+N +I NK G+R Y+L N F D+
Sbjct: 37 ALWDLYERWQTHH-RVHRHHGEKGRRFGTFKENARFIHAHNKRGDRPYRLRLNRFGDMGR 95
Query: 97 EEFRALYTGYNRPVPSVSRQ-SSRPST--FKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
EEFR+ + + + + R+ ++ P+ F Y + TD+P S+DWR+KGAVT +K+QG+CG
Sbjct: 96 EEFRSGFA--DSRINDLRREPTAAPAVPGFMYDDATDLPRSVDWRQKGAVTAVKNQGRCG 153
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
SCWAFS V AVEGI I G L+ LSEQ+L+DC TD +GC GGLM+ AFE+I + G+ T
Sbjct: 154 SCWAFSTVVAVEGINAIRTGSLVSLSEQELIDCDTDENGCQGGLMENAFEFIKSHGGITT 213
Query: 214 EADYPYRHEEGTCDNQK-EKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
E+ YPY GTCD + + I ++ +P G E AL +AV++QPVSV +DA G+A
Sbjct: 214 ESAYPYHASNGTCDGARARRGRVVAIDGHQAVPAGSEDALAKAVAHQPVSVAIDAGGQAL 273
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR--- 329
FY GV DCG + DHGVA VG+G +++ G YW++KNSWG +WGE GYIR+ R
Sbjct: 274 QFYSEGVFTGDCGTDLDHGVAAVGYGVSDD--GTPYWIVKNSWGPSWGEGGYIRMQRGTG 331
Query: 330 DAGLCGIATAASYPV 344
+ GLCGIA AS+P+
Sbjct: 332 NGGLCGIAMEASFPI 346
>gi|226506492|ref|NP_001140873.1| uncharacterized protein LOC100272949 precursor [Zea mays]
gi|194701540|gb|ACF84854.1| unknown [Zea mays]
Length = 379
Score = 309 bits (792), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 155/350 (44%), Positives = 209/350 (59%), Gaps = 23/350 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIV------EKHEQWMAQHGRTYKDELEKAMRLNIF 66
+ V ++ V + A ++ E + + +E+W H R ++ EK R F
Sbjct: 9 LLVALVFVSSAAVELCRAIDFDERDLASDEALWDLYERWQTHH-RVHRHHGEKGRRFGTF 67
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST---- 122
K+N+ +I NK G+R Y+L N F D+ EEFR+ + + + + RQ S +
Sbjct: 68 KENVRFIHAHNKRGDRPYRLRLNRFGDMGREEFRSTFA--DSRINDLRRQDSPAARAGAV 125
Query: 123 --FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
F Y + D P S+DWR++GAVT +K QG CGSCWAFS V AVEGI I G L LSE
Sbjct: 126 PGFMYDSAADPPRSVDWRQEGAVTGVKVQGHCGSCWAFSTVVAVEGINAIRTGSLASLSE 185
Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK---AVAAT 237
Q+L+DC TD +GC GGLM+ AFE+I G+ TEA YPYR GTCD + + V
Sbjct: 186 QELIDCDTDENGCQGGLMENAFEFIKSFGGITTEAAYPYRASNGTCDGDRARRGGGVVVV 245
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I ++ +P G E AL +AV++QPVSV VDA G+AF FY GV DCG + DHGVA VG+
Sbjct: 246 IDGHQMVPAGSEDALAKAVAHQPVSVAVDAGGQAFQFYSEGVFTGDCGTDLDHGVAAVGY 305
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
G ++ G YW++KNSWG +WGE GYIR+ R A GLCGIA AS+P+
Sbjct: 306 GVGDD--GTPYWIVKNSWGTSWGEGGYIRMQRGAGNGGLCGIAMEASFPI 353
>gi|357156854|ref|XP_003577598.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 368
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 149/293 (50%), Positives = 200/293 (68%), Gaps = 9/293 (3%)
Query: 58 EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYN-RPVPSVSRQ 116
+ A R N+FK+N++YI +ANK+ +R ++L N+F+D+T +E R Y G R ++S
Sbjct: 64 DPARRFNVFKENVKYIHEANKK-DRPFRLALNKFADMTTDELRHSYAGSRVRHHRALSGG 122
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
F Y + ++P ++DWREKGAVT IKDQGQCGSCWAFS +AAVE I +I GKL+
Sbjct: 123 RRAQGNFTYSDAENLPPAVDWREKGAVTGIKDQGQCGSCWAFSTIAAVESINKIRTGKLV 182
Query: 177 ELSEQQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
LSEQ+L+DC + ++ GC GGLMD AF++I +N G+ +EA+YPY+ ++ TCD KE
Sbjct: 183 SLSEQELMDCDNVNDQGCDGGLMDYAFQFIQKNGGVTSEANYPYQGQQNTCDQAKENTHD 242
Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
I YED+P DE AL +AV+ QPVSV ++ASG+ F FY GV C + DHGVA V
Sbjct: 243 VAIDGYEDVPANDESALQKAVAYQPVSVAIEASGQDFQFYSEGVFTGQCTTDLDHGVAAV 302
Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G+GTA + G KYW++KNSWG WGE GYIR+ R GLCGIA ASYP+
Sbjct: 303 GYGTARD--GTKYWIVKNSWGLDWGEKGYIRMQRGVSQAEGLCGIAMQASYPI 353
>gi|125540888|gb|EAY87283.1| hypothetical protein OsI_08685 [Oryza sativa Indica Group]
Length = 357
Score = 309 bits (791), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 156/356 (43%), Positives = 222/356 (62%), Gaps = 33/356 (9%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ--------------WMAQHGRTYKDE 56
+ M +++ + C++ S H+PS+V ++ W +H + Y
Sbjct: 5 LSMLFLLLGFVACSATA----SHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASP 60
Query: 57 LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
EK R IFK+NL +I + N+ N +Y LG N F+D+ +EEF+A Y G P ++R+
Sbjct: 61 KEKVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK---PGLARR 116
Query: 117 SSRP---STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRG 173
++P +TF+Y N ++P ++DWR+KGAVT +K+QG+CGSCWAFS VAAVEGI QI G
Sbjct: 117 DAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTG 176
Query: 174 KLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
KL+ LSEQ+L+DC +T NHGC GGLMD AF YI+ N+G+ TE DYPY EEG C ++
Sbjct: 177 KLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPH 236
Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
+ TI+ YED+P+ E +LL+A+++QPVSV + A R F FYK G+ + +CG DH +
Sbjct: 237 SKVITITGYEDVPENSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHAL 296
Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
VG+G+ G Y ++KNSWG+ WGE GY RI R G+C I ASYP
Sbjct: 297 TAVGYGSYY---GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYPT 349
>gi|146215978|gb|ABQ10191.1| actinidin Act1c [Actinidia eriantha]
Length = 368
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 154/347 (44%), Positives = 216/347 (62%), Gaps = 14/347 (4%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+ KSF+ + ++ + + + R+ E + +E W+ +HG++Y E+ R
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLALDAKRTNDE--VKAMYESWLIKHGKSYNSLGERERR 58
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
IFK+ L +I++ N + +R+YK+G N+F+DLTNEEFR+ Y G+ R S ++ +
Sbjct: 59 FEIFKETLRFIDEHNADTSRSYKVGLNQFADLTNEEFRSTYLGFTRG----SNKTKVSNR 114
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
++ + +P +DWR +GAV IK+QGQCGSCWAFSA+AAVEGI +I G LI LSEQ+
Sbjct: 115 YEPRVGQVLPDYVDWRSEGAVVDIKNQGQCGSCWAFSAIAAVEGINKIVTGNLISLSEQE 174
Query: 183 LVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
LVDC + GC GG M FE+II N G+ TE +YPY +EG CD + TI
Sbjct: 175 LVDCGRTQSTKGCDGGYMTDGFEFIINNGGINTEENYPYTAQEGQCDLNLQNEKYVTIDN 234
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
YE++P +E AL AV+ QPVSV ++++G AF Y SG+ CG DH V +VG+GT
Sbjct: 235 YENVPYYNEWALQTAVAYQPVSVALESAGDAFQHYSSGIFTGPCGTATDHAVTIVGYGT- 293
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
E G YW++KNSW TWGE GY+RILR+ AG CGIAT SYPV
Sbjct: 294 --EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 338
>gi|186516984|ref|NP_195406.2| cysteine proteinase1 [Arabidopsis thaliana]
gi|15290508|gb|AAK92229.1| cysteine proteinase [Arabidopsis thaliana]
gi|332661313|gb|AEE86713.1| cysteine proteinase1 [Arabidopsis thaliana]
Length = 376
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 158/315 (50%), Positives = 205/315 (65%), Gaps = 18/315 (5%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
QW A+HG+T + ++ R NIFK NL +I+ N+ N TYKLG +F+DLTN+E
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDE 110
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
+R LY G R P+ ++ KY N +VP ++DWR+KGAV IKDQG CGSC
Sbjct: 111 YRKLYLGA-RTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSC 169
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
WAFS AAVEGI +I G+LI LSEQ+LVDC N GC+GGLMD AF++I++N GL TE
Sbjct: 170 WAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTE 229
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
DYPYR G C++ + + +I YED+P DE AL +A+S QPVSV ++A GR F
Sbjct: 230 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVSVAIEAGGRIFQH 289
Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---- 330
Y+SG+ CG N DH V VG+G+ ENG YW+++NSWG WGE GYIR+ R+
Sbjct: 290 YQSGIFTGSCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346
Query: 331 -AGLCGIATAASYPV 344
+G CGIA ASYPV
Sbjct: 347 KSGKCGIAVEASYPV 361
>gi|168058022|ref|XP_001781010.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667567|gb|EDQ54194.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 457
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 165/334 (49%), Positives = 211/334 (63%), Gaps = 22/334 (6%)
Query: 27 VVSGRSMHEPSIVEKHE-------QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE 79
V +G + P+ V K + W +HG+ Y E+A R ++K NLEYI++ + E
Sbjct: 23 VANGDVIRMPTDVGKDQLLAGQFAAWAHKHGKVYSAAEERAHRFLVWKDNLEYIQR-HSE 81
Query: 80 GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST--FKYQNVTDVPTSIDW 137
N +Y LG +F+DLTNEEFR YTG R S + R +T F+Y N ++ P SIDW
Sbjct: 82 KNLSYWLGLTKFADLTNEEFRRQYTG-TRIDRSRRLKKGRNATGSFRYAN-SEAPKSIDW 139
Query: 138 REKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGG 196
REKGAVT +KDQG CGSCWAFSAV +VEGI I G I LS Q+LVDC N GC+GG
Sbjct: 140 REKGAVTSVKDQGSCGSCWAFSAVGSVEGINAIRTGDAISLSVQELVDCDKKYNQGCNGG 199
Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
LMD AF+++I+N G+ TE DYPY+ +G CD K A TI YED+P+ DE+AL +AV
Sbjct: 200 LMDYAFDFVIQNGGIDTEKDYPYQGYDGRCDVNKMNARVVTIDSYEDVPENDEEALKKAV 259
Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
+ QPVSV ++A GR F Y GV CG + DHGV VG+G+ E G YW++KNSWG
Sbjct: 260 AGQPVSVAIEAGGRDFQLYSGGVFTGRCGTDLDHGVLAVGYGS---EKGLDYWIVKNSWG 316
Query: 317 ETWGESGYIRILRDA------GLCGIATAASYPV 344
E WGESGY+R+ R+ GLCGI SY V
Sbjct: 317 EYWGESGYLRMQRNLKDDNGYGLCGINIEPSYAV 350
>gi|115448287|ref|NP_001047923.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|42408029|dbj|BAD09165.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113537454|dbj|BAF09837.1| Os02g0715000 [Oryza sativa Japonica Group]
gi|215737450|dbj|BAG96580.1| unnamed protein product [Oryza sativa Japonica Group]
gi|215765786|dbj|BAG87483.1| unnamed protein product [Oryza sativa Japonica Group]
gi|222623551|gb|EEE57683.1| hypothetical protein OsJ_08138 [Oryza sativa Japonica Group]
Length = 366
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 156/356 (43%), Positives = 221/356 (62%), Gaps = 33/356 (9%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ--------------WMAQHGRTYKDE 56
+ M +++ + C++ S H+PS+V ++ W +H + Y
Sbjct: 14 LSMLFLLLGFVACSATA----SHHDPSVVGYSQEDLALPNKLVGLFTSWSVKHSKIYASP 69
Query: 57 LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
EK R IFK+NL +I + N+ N +Y LG N F+D+ +EEF+A Y G P ++R+
Sbjct: 70 KEKVKRYEIFKRNLRHIVETNRR-NGSYWLGLNHFADIAHEEFKASYLGLK---PGLARR 125
Query: 117 SSRP---STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRG 173
++P +TF+Y N ++P ++DWR+KGAVT +K+QG+CGSCWAFS VAAVEGI QI G
Sbjct: 126 DAQPHGSTTFRYANAVNLPWAVDWRKKGAVTPVKNQGECGSCWAFSTVAAVEGINQIVTG 185
Query: 174 KLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
KL+ LSEQ+L+DC +T NHGC GGLMD AF YI+ N+G+ TE DYPY EEG C ++
Sbjct: 186 KLVSLSEQELMDCDNTFNHGCRGGLMDFAFAYIMGNQGIYTEEDYPYLMEEGYCREKQPH 245
Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
+ TI+ YED+P E +LL+A+++QPVSV + A R F FYK G+ + +CG DH +
Sbjct: 246 SKVITITGYEDVPANSETSLLKALAHQPVSVGIAAGSRDFQFYKGGIFDGECGIQPDHAL 305
Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
VG+G+ G Y ++KNSWG+ WGE GY RI R G+C I ASYP
Sbjct: 306 TAVGYGSYY---GQDYIIMKNSWGKNWGEQGYFRIRRGTGKPEGVCDIYKIASYPT 358
>gi|255032|gb|AAB23155.1| COT44=cysteine proteinase homolog [Brassica napus, seedling, rapid
cycling base population CrGC5, Peptide, 328 aa]
Length = 328
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 156/315 (49%), Positives = 206/315 (65%), Gaps = 19/315 (6%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
+W +HG++ + ++ R NIFK NL +I+ N+ N TYKLG F++LTN+E
Sbjct: 6 RWSLEHGKSNSNSNGIINQQDERFNIFKDNLRFIDLHNENNKNATYKLGLTIFANLTNDE 65
Query: 99 FRALYTG-YNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGS 154
+R+LY G PV +++ ++ KY N +VP ++DWR+KGAV IKDQG CGS
Sbjct: 66 YRSLYLGARTEPVRRITK--AKNVNMKYSAAVNDVEVPVTVDWRQKGAVNAIKDQGTCGS 123
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
CWAFS AAVEGI +I G+L+ LSEQ+LVDC N GC+GGLMD AF++I++N GL T
Sbjct: 124 CWAFSTAAAVEGINKIVTGELVSLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNT 183
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
E DYPY G C++ + + TI YED+P DE AL +AVS QPVSV +DA GRAF
Sbjct: 184 EKDYPYHGTNGKCNSLLKNSRVVTIDGYEDVPSKDETALKRAVSYQPVSVAIDAGGRAFQ 243
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
Y+SG+ CG N DH V VG+G+ ENG YW+++NSWG WGE GYIR+ R+
Sbjct: 244 HYQSGIFTGKCGTNMDHAVVAVGYGS---ENGVDYWIVRNSWGTRWGEDGYIRMERNVAS 300
Query: 331 -AGLCGIATAASYPV 344
+G CGIA ASYPV
Sbjct: 301 KSGKCGIAIEASYPV 315
>gi|242074728|ref|XP_002447300.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
gi|241938483|gb|EES11628.1| hypothetical protein SORBIDRAFT_06g032360 [Sorghum bicolor]
Length = 471
Score = 308 bits (790), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 153/326 (46%), Positives = 208/326 (63%), Gaps = 11/326 (3%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKAN-KEGNRTY 84
V G + E + +EQWMA+HG+ + L E R F NL +++ N + G R Y
Sbjct: 37 VGGGMARTEAQVRAMYEQWMARHGKAASNALGEHDRRFRAFWDNLRFVDAHNARAGARGY 96
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
+LG N F+DLTN EFRA Y + + + ++ +++ V +P +DWR+KGAV
Sbjct: 97 RLGINRFADLTNAEFRAAY--LSAGARNGTATAATGERYRHDGVEALPEFVDWRQKGAVA 154
Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAF 202
+K+QGQCGSCWAFSAV AVEGI QI G+L+ LSEQ+LVDCS + N GC GG+MD AF
Sbjct: 155 PVKNQGQCGSCWAFSAVGAVEGINQIVTGELVTLSEQELVDCSKNGQNGGCDGGMMDDAF 214
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
+I+ N G+ T+ DYPY +G CD K +I +E +P+ DE++L +AV++QPV+
Sbjct: 215 AFIVGNGGIDTDKDYPYTARDGKCDVAKRSRHVVSIDGFEGVPRNDEKSLQKAVAHQPVA 274
Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
V ++A GR F Y+SGV CG + DHGV VG+GT E + G YWL++NSWG WGE
Sbjct: 275 VAIEAGGREFQLYQSGVFTGRCGTSLDHGVVAVGYGT-EADGGRDYWLVRNSWGADWGEG 333
Query: 323 GYIRILRD----AGLCGIATAASYPV 344
GYIR+ R+ AG CGIA ASYPV
Sbjct: 334 GYIRMERNVGARAGKCGIAMEASYPV 359
>gi|224081756|ref|XP_002306486.1| predicted protein [Populus trichocarpa]
gi|222855935|gb|EEE93482.1| predicted protein [Populus trichocarpa]
Length = 352
Score = 308 bits (789), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 160/310 (51%), Positives = 208/310 (67%), Gaps = 11/310 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
++ W+A+HG+ Y E+A R IFK NL +I++ N + N TYK+G +F+DLTNEE+RA
Sbjct: 4 YKWWLAKHGKAYNGLGEEAERFEIFKNNLRFIDEHNSQ-NHTYKVGLTKFADLTNEEYRA 62
Query: 102 LYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
++ G +S PS + ++ +P S+DWR KGAV IKDQG CGSCWAFS
Sbjct: 63 MFLGTRSDAKRRLMKSKSPSERYAFKAGDKLPESVDWRAKGAVNPIKDQGSCGSCWAFST 122
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPY 219
VAAVEGI QI G+LI LSEQ+LVDC T N GC+GGLMD AF++II N GL TE DYPY
Sbjct: 123 VAAVEGINQIVTGELISLSEQELVDCDRTYNAGCNGGLMDYAFQFIINNGGLDTEKDYPY 182
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
++ CD K K A +I +ED+ DE+AL +AV++QPVSV ++ASG A FY+SGV
Sbjct: 183 VGDDDKCDKDKMKTKAVSIDGFEDVLPYDEKALQKAVAHQPVSVAIEASGMALQFYQSGV 242
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLC 334
+CG DHGV VVG+ + ENG YWL++NSWG WGE GYI++ R+ G C
Sbjct: 243 FTGECGTALDHGVVVVGYAS---ENGLDYWLVRNSWGTEWGEHGYIKMQRNVGDTYTGRC 299
Query: 335 GIATAASYPV 344
GIA +SYPV
Sbjct: 300 GIAMESSYPV 309
>gi|115468686|ref|NP_001057942.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|55296512|dbj|BAD68726.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113595982|dbj|BAF19856.1| Os06g0582600 [Oryza sativa Japonica Group]
gi|215695236|dbj|BAG90427.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 357
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 167/360 (46%), Positives = 225/360 (62%), Gaps = 31/360 (8%)
Query: 8 SFIIPMFVIIILVITCASQVV--------SGRSMHEPSIVEKHEQWMAQHGRTYKDELEK 59
SF + ++II++ C + +V + + ++ E++E+W A HGRTYKD LEK
Sbjct: 7 SFSLAAILLIIIMYCCPTGLVEAARKGPAAAGGGDDSAMRERYEKWAADHGRTYKDSLEK 66
Query: 60 AMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS 118
A R +F+ N +I+ N G + + +L TN+F+DLTNEEF A Y G P +
Sbjct: 67 ARRFEVFRTNALFIDSFNAAGGKKSPRLTTNKFADLTNEEF-AEYYGRPFSTPVIG---- 121
Query: 119 RPSTFKYQNV--TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
S F Y NV +DVP +I+WR++GAVT +K+Q C SCWAFSAVAAVEGI QI L+
Sbjct: 122 -GSGFMYGNVRTSDVPANINWRDRGAVTQVKNQKDCASCWAFSAVAAVEGIHQIRSHNLV 180
Query: 177 ELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEE-GTCDNQKEKA 233
LS QQL+DCST +NHGC+ G MD+AF YI N G+A E+DYPY GTC K
Sbjct: 181 ALSTQQLLDCSTGRNNHGCNRGDMDEAFRYITSNGGIAAESDYPYEDRALGTC-RASGKP 239
Query: 234 VAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL----NADCGNNCD 289
VAA+I ++ +P +E ALL AV++QPVSV +D G+ F+ SGV N C + +
Sbjct: 240 VAASIRGFQYVPPNNETALLLAVAHQPVSVALDGVGKVSQFFSSGVFGAMQNETCTTDLN 299
Query: 290 HGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
H + VG+GT +E+G KYWL+KNSWG WGE GY++I RD GLCG+A SYPVA
Sbjct: 300 HAMTAVGYGT--DEHGTKYWLMKNSWGTDWGEGGYMKIARDVASNTGLCGLAMQPSYPVA 357
>gi|168057475|ref|XP_001780740.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162667829|gb|EDQ54449.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 463
Score = 307 bits (787), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 159/346 (45%), Positives = 218/346 (63%), Gaps = 26/346 (7%)
Query: 16 IIILVITCASQVVSGRS----------MH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
+++LVI Q +GR+ +H + +I++ QW+ H R Y+ EK R
Sbjct: 12 LVLLVIAIGQQADAGRANAIVDYEGNQLHSDDAILDVFHQWLETHSRVYRSLSEKHHRFQ 71
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
IFK+N YI NK+ ++Y LG N+FSDLT++EFRA Y G +PV +RQ + + F
Sbjct: 72 IFKENFLYIHAHNKQ-QKSYWLGLNKFSDLTHQEFRAQYLG-TKPV---NRQR-KEANFM 125
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
Y++V P +DWR KGAVT +KDQG CGSCWAFSAV +VEG+ I G+L+ LSEQ+LV
Sbjct: 126 YEDVEAEP-KVDWRLKGAVTDVKDQGACGSCWAFSAVGSVEGVNAIKTGELVSLSEQELV 184
Query: 185 DCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
DC N GC+GGLMD AFE+II+N G+ TE DYPY+ +G CD + + I Y+D
Sbjct: 185 DCDRKQNQGCNGGLMDYAFEFIIKNGGIDTEKDYPYKARDGRCDEGRRNSKVVVIDDYQD 244
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P E AL++A++ PVSV ++A GR F Y+ GV CG+ DHGV VG+GT ++
Sbjct: 245 VPTQSESALMKALTKNPVSVAIEAGGRDFQHYQGGVFTGPCGSELDHGVLAVGYGT--DD 302
Query: 304 NGAKYWLIKNSWGETWGESGYIRILR-----DAGLCGIATAASYPV 344
+G YW++KNSWG WGE GYIR+ R G CGI AS+P+
Sbjct: 303 DGVNYWIVKNSWGPGWGEKGYIRMERFGSDSTDGKCGINIEASFPI 348
>gi|218181|dbj|BAA14402.1| oryzain alpha precursor [Oryza sativa Japonica Group]
Length = 458
Score = 307 bits (786), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 157/327 (48%), Positives = 206/327 (62%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG++Y E+ R F+ NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 84 YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R Y G N+P R+ + + +P S+DWR KGA
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V IKDQG CGSCWAFSA+AAVE I QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEDINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F++II N G+ TE DYPY+ ++ CD ++ A TI YED+ E +L +AV NQPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVRNQPV 260
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV ++A GRAF Y SG+ CG DHGVA VG+GT ENG YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317
Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
SGY+R+ R+ +G CGIA SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344
>gi|242055753|ref|XP_002457022.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
gi|241928997|gb|EES02142.1| hypothetical protein SORBIDRAFT_03g047290 [Sorghum bicolor]
Length = 378
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 170/340 (50%), Positives = 215/340 (63%), Gaps = 31/340 (9%)
Query: 32 SMHEPSIVEKHEQWMAQHGR-TYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNE 90
S HE S+ E E+W+++H + Y EK R +FK NL +I++ N++ + +Y LG NE
Sbjct: 39 SSHE-SLAELFERWLSRHRKGAYASLEEKLRRFEVFKDNLHHIDETNRKVS-SYWLGLNE 96
Query: 91 FSDLTNEEFRALYTGYNRPVPSVS-----------------RQSSRPSTFKYQNV--TDV 131
F+DLT++EF+A Y G + SS F+Y+ V +
Sbjct: 97 FADLTHDEFKATYLGLSPSGGGGDVVHMHHDDDDEEPEEEGSSSSSSFRFRYEGVDAARL 156
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-N 190
P S+DWR KGAVT +K+QGQCGSCWAFS VAAVEGI QI G L LSEQ+LVDC TD N
Sbjct: 157 PKSVDWRSKGAVTGVKNQGQCGSCWAFSTVAAVEGINQIVTGNLTALSEQELVDCDTDGN 216
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
+GC+GGLMD AF YI N GL TE YPY EEGTC AV TIS YED+P+ +EQ
Sbjct: 217 NGCNGGLMDYAFSYIAHNGGLHTEEAYPYLMEEGTCSRGSSAAV-VTISGYEDVPRNNEQ 275
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG---AK 307
ALL+A+++QPVSV ++ASGR FY GV + CG DHGVA VG+GTA ++NG A
Sbjct: 276 ALLKALAHQPVSVAIEASGRNLQFYSGGVFDGPCGTQLDHGVAAVGYGTAGKDNGHVVAD 335
Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
Y ++KNSWG +WGE GYIR+ R GLCGI SYP
Sbjct: 336 YIIVKNSWGPSWGEKGYIRMRRGTGKRQGLCGINKMPSYP 375
>gi|110737959|dbj|BAF00916.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 307 bits (786), Expect = 6e-81, Method: Compositional matrix adjust.
Identities = 157/315 (49%), Positives = 204/315 (64%), Gaps = 18/315 (5%)
Query: 44 QWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLTNEE 98
QW A+HG+T + ++ R NIFK NL +I+ N+ N TYKLG +F+DLTN+E
Sbjct: 51 QWSAEHGKTNNNNNGIINDQDKRFNIFKDNLRFIDLHNENNKNATYKLGLTKFTDLTNDE 110
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
+R LY G R P+ ++ KY N +VP ++DWR+KGAV IKDQG CGSC
Sbjct: 111 YRKLYLGA-RTEPARRIAKAKNVNQKYSAAVNGKEVPETVDWRQKGAVNPIKDQGTCGSC 169
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATE 214
WAFS AAVEGI +I G+LI LSEQ+LVDC N GC+GGLMD AF++I++N GL TE
Sbjct: 170 WAFSTTAAVEGINKIVTGELISLSEQELVDCDKSYNQGCNGGLMDYAFQFIMKNGGLNTE 229
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHF 274
DYPYR G C++ + + +I YED+P DE AL +A+S QPV V ++A GR F
Sbjct: 230 KDYPYRGFGGKCNSFLKNSRVVSIDGYEDVPTKDETALKKAISYQPVRVAIEAGGRIFQH 289
Query: 275 YKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---- 330
Y+SG+ CG N DH V VG+G+ ENG YW+++NSWG WGE GYIR+ R+
Sbjct: 290 YQSGIFTGSCGTNLDHAVVAVGYGS---ENGVDYWIVRNSWGPRWGEEGYIRMERNLAAS 346
Query: 331 -AGLCGIATAASYPV 344
+G CGIA ASYPV
Sbjct: 347 KSGKCGIAVEASYPV 361
>gi|255563136|ref|XP_002522572.1| cysteine protease, putative [Ricinus communis]
gi|223538263|gb|EEF39872.1| cysteine protease, putative [Ricinus communis]
Length = 340
Score = 306 bits (785), Expect = 8e-81, Method: Compositional matrix adjust.
Identities = 167/342 (48%), Positives = 225/342 (65%), Gaps = 21/342 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ +++++++T SQ + + E ++ EKHEQWMA+HGRTY+D+ EK R +IFK+NL++
Sbjct: 9 LAIVLMILVTWVSQAMPRPLIDEDAVAEKHEQWMARHGRTYQDDEEKERRFHIFKKNLKH 68
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRP--VPS--VSRQSSRPSTFKYQNV 128
IE N NRTYKLG N F+DLT+EEF A YTGY P +P+ ++ ++++ S Y+
Sbjct: 69 IENFNNAFNRTYKLGLNHFADLTDEEFLATYTGYKMPKVLPTANITTKTTQSSDVLYE-- 126
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+VP SIDWR +G VT +K+QG+CG CWAFSA AAVEGI G + LS QQL+DC
Sbjct: 127 ANVPESIDWRTRGVVTPVKNQGRCGCCWAFSAAAAVEGII----GNGVSLSAQQLLDCVP 182
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
D++GC+GG MD AF YII+N+GLA+ YPY+ C + AA IS Y D+ D
Sbjct: 183 DSNGCNGGFMDNAFRYIIQNQGLASATYYPYQLMREMC---RPSNNAARISGYVDVTPAD 239
Query: 249 EQALLQAVSNQPVSVCVDASGRA-FHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENGA 306
E+ L AV+ QPVS VDA+ F +Y G+ DCG+ H + +VG+GT+ E G
Sbjct: 240 EETLKSAVARQPVSAAVDATSELNFKYYGGGIFPPQDCGSTLTHAITIVGYGTSAE--GT 297
Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
KYWLIKNSWGE WGE GY+R+ RD G CGIA ASYP
Sbjct: 298 KYWLIKNSWGEGWGEGGYMRLQRDVGSYGGACGIALRASYPT 339
>gi|255538788|ref|XP_002510459.1| cysteine protease, putative [Ricinus communis]
gi|223551160|gb|EEF52646.1| cysteine protease, putative [Ricinus communis]
Length = 422
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/342 (44%), Positives = 215/342 (62%), Gaps = 14/342 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
+F+ +F+I +L + I + E W +HG+TY + +K R IF+
Sbjct: 2 NFLSALFLITLLFF----NLSISSFSSSSDISKLFESWTKEHGKTYTSKEDKLYRFKIFE 57
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
+N E+++K N +GN +Y L N F+DLT+ EF+A G + S S + SR + +
Sbjct: 58 ENYEFVKKHNSQGNSSYTLSLNAFADLTHHEFKASRLGLS--AFSTSGKLSRRNFPLHDF 115
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
V DVP SIDWR+KGAV+ +KDQG CG+CW+FSA A+EGI +I G L+ LSEQ+LVDC
Sbjct: 116 VGDVPISIDWRKKGAVSQVKDQGNCGACWSFSATGAIEGINKIVTGSLVSLSEQELVDCD 175
Query: 188 TD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N+GC GGLMD A++++IEN G+ TE DYPY+ E TC+ +K K TI Y D+P+
Sbjct: 176 RSYNNGCEGGLMDYAYQFVIENNGIDTEEDYPYQAREKTCNKEKLKRHVVTIDGYTDVPQ 235
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
+E+ LL+AV+ QPVSV + S RAF Y G+ C + DH V +VG+G+ ENG
Sbjct: 236 NNEKELLKAVAAQPVSVGICGSERAFQLYSKGIFTGPCSTSLDHAVLIVGYGS---ENGV 292
Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
YW++KNSWG WG +GY+ +LR++ GLCGI AS+PV
Sbjct: 293 DYWIVKNSWGTHWGINGYMYMLRNSGNSQGLCGINMLASFPV 334
>gi|242081867|ref|XP_002445702.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
gi|241942052|gb|EES15197.1| hypothetical protein SORBIDRAFT_07g024430 [Sorghum bicolor]
Length = 372
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 158/317 (49%), Positives = 199/317 (62%), Gaps = 13/317 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ +E+W +H +D +KA R N+FK+N+ I N+ + YKL N F D+
Sbjct: 40 EEALWALYERWRGRHA-VARDLGDKARRFNVFKENVRLIHDFNQR-DEPYKLRLNRFGDM 97
Query: 95 TNEEFRALYTGYNRPVPSVSR--QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
T +EFR Y G + R + S+F Y D+PTS+DWR+KGAVT +KDQGQC
Sbjct: 98 TADEFRRHYAGSRVAHHRMFRGDRQGSASSFMYAGARDLPTSVDWRQKGAVTDVKDQGQC 157
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS +AAVEGI I L LSEQQLVDC T N GC GGLMD AF+YI ++ G+
Sbjct: 158 GSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKGNAGCDGGLMDYAFQYIAKHGGV 217
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
A E YPY+ + +C +K A A TI YED+P DE AL +AV++QPVSV ++ASG
Sbjct: 218 AAEDAYPYKARQASC--KKSPAPAVTIDGYEDVPANDESALKKAVAHQPVSVAIEASGSH 275
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
F FY GV CG DHGV VG+G A +G KYW++KNSWG WGE GYIR+ RD
Sbjct: 276 FQFYSEGVFAGRCGTELDHGVTAVGYGVA--ADGTKYWVVKNSWGPEWGEKGYIRMARDV 333
Query: 332 ----GLCGIATAASYPV 344
G CGIA ASYPV
Sbjct: 334 AAKEGHCGIAMEASYPV 350
>gi|3688528|emb|CAA06243.1| pre-pro-TPE4A protein [Pisum sativum]
Length = 360
Score = 306 bits (784), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 159/316 (50%), Positives = 212/316 (67%), Gaps = 11/316 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ + +E+W + H T + EK R N+FK N+ ++ NK ++ YKL N+F+D+
Sbjct: 33 EKSLWDLYERWRSHHTVT-RSLDEKHNRFNVFKANVMHVHNTNKL-DKPYKLKLNKFADM 90
Query: 95 TNEEFRALYTGYNRPVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
TN EFR +Y + R S+ TF Y+NV +VP+SIDWR+KGAVT +KDQGQCG
Sbjct: 91 TNYEFRRIYADSKVSHHRMFRGMSNENGTFMYENVKNVPSSIDWRKKGAVTDVKDQGQCG 150
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGLA 212
SCWAFS + AVEGI QI KL+ LSEQ+LVDC T N GC+GGLM+ AFE+I +N G+
Sbjct: 151 SCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTGGNEGCNGGLMEYAFEFIKQN-GIT 209
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE++YPY ++GTCD +KE +I YE++P +E ALL+A + QPVSV +DA G F
Sbjct: 210 TESNYPYAAKDGTCDLKKEDKAEVSIDGYENVPINNEAALLKAAAKQPVSVAIDAGGYNF 269
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR--- 329
FY GV + CG + +HGVAVVG+G ++ KYW++KNSWG WGE GYIR+ R
Sbjct: 270 QFYSEGVFSGHCGTDLNHGVAVVGYGVTQDR--TKYWIVKNSWGSEWGEQGYIRMQRGIS 327
Query: 330 -DAGLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 328 HKEGLCGIAMEASYPI 343
>gi|13432122|sp|P80884.2|ANAN_ANACO RecName: Full=Ananain; Flags: Precursor
gi|2623956|emb|CAA05487.1| Ananain precursor [Ananas comosus]
Length = 345
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 150/339 (44%), Positives = 224/339 (66%), Gaps = 20/339 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+F+ + L + AS S S EPS ++++ E+WMA++GR YKD EK +R IFK N+
Sbjct: 8 VFLFLFLCVMWASP--SAASCDEPSDPMMKQFEEWMAEYGRVYKDNDEKMLRFQIFKNNV 65
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+IE N +Y LG N+F+D+TN EF A YTG + P+ ++ R+ +F +++
Sbjct: 66 NHIETFNNRNGNSYTLGINQFTDMTNNEFVAQYTGLSLPL-NIKREPV--VSFDDVDISS 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
VP SIDWR+ GAVT +K+QG+CGSCWAF+++A VE I +I RG L+ LSEQQ++DC+ +
Sbjct: 123 VPQSIDWRDSGAVTSVKNQGRCGSCWAFASIATVESIYKIKRGNLVSLSEQQVLDCAV-S 181
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV--AATISKYEDLPKGD 248
+GC GG ++KA+ +II NKG+A+ A YPY+ +GTC K V +A I++Y + + +
Sbjct: 182 YGCKGGWINKAYSFIISNKGVASAAIYPYKAAKGTC---KTNGVPNSAYITRYTYVQRNN 238
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E+ ++ AVSNQP++ +DASG F YK GV CG +H + ++G+G ++ +G K+
Sbjct: 239 ERNMMYAVSNQPIAAALDASGN-FQHYKRGVFTGPCGTRLNHAIVIIGYG--QDSSGKKF 295
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
W+++NSWG WGE GYIR+ RD GLCGIA YP
Sbjct: 296 WIVRNSWGAGWGEGGYIRLARDVSSSFGLCGIAMDPLYP 334
>gi|18141281|gb|AAL60578.1|AF454956_1 senescence-associated cysteine protease [Brassica oleracea]
Length = 445
Score = 305 bits (782), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/324 (46%), Positives = 209/324 (64%), Gaps = 14/324 (4%)
Query: 28 VSGRSMHE-PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
V+ ++ H P V+ E+W+ ++ + Y EK R IF NL+++++ N N++Y+L
Sbjct: 22 VTAKADHRNPEEVKMFERWLVENHKNYNGLGEKDKRFEIFMDNLKFVQEHNSVPNQSYEL 81
Query: 87 GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
G F+DLTNEEFRA+Y R +R S + + + +P +DWR KGAV +
Sbjct: 82 GLTRFADLTNEEFRAIYL---RSKMERTRDSVKSERYLHNVGDKLPDEVDWRAKGAVVPV 138
Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYI 205
KDQG CGSCWAFSA+ AVEGI QI G+L+ LSEQ+LVDC T N+GC GGLMD AF++I
Sbjct: 139 KDQGSCGSCWAFSAIGAVEGINQIKTGELVSLSEQELVDCDTSYNNGCGGGLMDYAFQFI 198
Query: 206 IENKGLATEADYPYR-HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
I N G+ TE DYPY ++ C+ K+ TI YED+P+ +E +L +A++NQP+SV
Sbjct: 199 ISNGGIDTEEDYPYTATDDNICNTDKKNTRVVTIDGYEDVPE-NENSLKKALANQPISVA 257
Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
++A GR F YKSGV CG DHGV VG+GT+E G YW+I+NSWG WGESGY
Sbjct: 258 IEAGGRGFQLYKSGVFTGTCGTALDHGVVAVGYGTSE---GQDYWIIRNSWGSNWGESGY 314
Query: 325 IRILRD----AGLCGIATAASYPV 344
I++ R+ +G CG+A ASYP
Sbjct: 315 IKLQRNIKDSSGKCGVAMMASYPT 338
>gi|42567068|ref|NP_567686.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|332659371|gb|AEE84771.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 356
Score = 305 bits (782), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 154/349 (44%), Positives = 226/349 (64%), Gaps = 28/349 (8%)
Query: 13 MFVIIILVITCASQVVS--------GRSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMRL 63
+F++I+ V++ S + RS E + + WM++HG+TY + L EK R
Sbjct: 12 LFLLIVFVLSAPSSAMDLPATSGGHNRSNEEVEFI--FQMWMSKHGKTYTNALGEKERRF 69
Query: 64 NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
FK NL +I++ N + N +Y+LG F+DLT +E+R L+ G +P +Q + ++
Sbjct: 70 QNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRDLFPGSPKP-----KQRNLKTSR 123
Query: 124 KYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
+Y + +P S+DWR++GAV+ IKDQG C SCWAFS VAAVEG+ +I G+LI LSEQ
Sbjct: 124 RYVPLAGDQLPESVDWRQEGAVSEIKDQGTCNSCWAFSTVAAVEGLNKIVTGELISLSEQ 183
Query: 182 QLVDCSTDNHGCSG-GLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA-VAATIS 239
+LVDC+ N+GC G GLMD AF+++I N GL +E DYPY+ +G+C+ ++ + TI
Sbjct: 184 ELVDCNLVNNGCYGSGLMDTAFQFLINNNGLDSEKDYPYQGTQGSCNRKQSTSNKVITID 243
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
YED+P DE +L +AV++QPVSV VD + F Y+S + N CG N DH + +VG+G+
Sbjct: 244 SYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSCIYNGPCGTNLDHALVIVGYGS 303
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
ENG YW+++NSWG TWG++GYI+I R+ GLCGIA ASYP+
Sbjct: 304 ---ENGQDYWIVRNSWGTTWGDAGYIKIARNFEDPKGLCGIAMLASYPI 349
>gi|222629922|gb|EEE62054.1| hypothetical protein OsJ_16838 [Oryza sativa Japonica Group]
Length = 336
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 163/323 (50%), Positives = 205/323 (63%), Gaps = 20/323 (6%)
Query: 29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGT 88
+GR+ E SIV + + Y EK R +FK NL +I+ NK+ +Y LG
Sbjct: 24 AGRNGGEFSIV--------GYRKAYASFEEKVRRFEVFKDNLNHIDDINKK-VTSYWLGL 74
Query: 89 NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHI 146
NEF+DLT++EF+A Y G P + + F+Y +++ VP +DWR+K AVT +
Sbjct: 75 NEFADLTHDEFKATYLGLTPPPTRSNSKHYSSEEFRYGKMSNGEVPKEMDWRKKNAVTEV 134
Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYI 205
K+QGQCGSCWAFS VAAVEGI I G L LSEQ+L+DCSTD N+GC+GGLMD AF YI
Sbjct: 135 KNQGQCGSCWAFSTVAAVEGINAIVTGNLTSLSEQELIDCSTDGNNGCNGGLMDYAFSYI 194
Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCV 265
GL TE YPY EEG CD K AV TIS YED+P DEQAL++A+++QPVSV +
Sbjct: 195 ASTGGLRTEEAYPYAMEEGDCDEGKGAAV-VTISGYEDVPANDEQALVKALAHQPVSVAI 253
Query: 266 DASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
+ASGR F FY GV + CG DHGV VG+GT++ G Y ++KNSWG WGE GYI
Sbjct: 254 EASGRHFQFYSGGVFDGPCGEQLDHGVTAVGYGTSK---GQDYIIVKNSWGPHWGEKGYI 310
Query: 326 RILRDA----GLCGIATAASYPV 344
R+ R GLCGI ASYP
Sbjct: 311 RMKRGTGKGEGLCGINKMASYPT 333
>gi|225446523|ref|XP_002275891.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2 [Vitis vinifera]
Length = 358
Score = 305 bits (780), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 151/348 (43%), Positives = 214/348 (61%), Gaps = 20/348 (5%)
Query: 9 FIIPMFVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMR 62
F ++ ++++ T + HEP + +++E+W+ QHGR YK+ E
Sbjct: 6 FCRNVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRH 65
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
I++ N+ +I N + N ++ L N+F+D+TNEE++ALY G S QSS
Sbjct: 66 FGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSS---- 120
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
FK + +P S+DWR+ GAVT +++QG+CGSCWAFS VAAVEGI +I GKL+ LSEQ+
Sbjct: 121 FKRERSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQE 180
Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
L+DC D N GC+GG M AF++I +N G+ T +YPY E+G C+ K IS
Sbjct: 181 LLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISG 240
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
YE +P +E+ L AV+ QPVSV +DA G F Y G+ N CG +H V V+G+G
Sbjct: 241 YETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG-- 298
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
E+NG KYWL+KNSWG WGE+GY R++RD+ G+CGIA ASYP+
Sbjct: 299 -EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 345
>gi|302143380|emb|CBI21941.3| unnamed protein product [Vitis vinifera]
Length = 354
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 151/348 (43%), Positives = 214/348 (61%), Gaps = 20/348 (5%)
Query: 9 FIIPMFVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMR 62
F ++ ++++ T + HEP + +++E+W+ QHGR YK+ E
Sbjct: 2 FCRNVYFALLIMWTVGVSWSAFSEEHEPMESEMSDMEKRYERWLVQHGRRYKNRDEWQRH 61
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
I++ N+ +I N + N ++ L N+F+D+TNEE++ALY G S QSS
Sbjct: 62 FGIYQSNVRFINYINAQ-NFSFTLTDNQFADMTNEEYKALYMGLGTSETSRKNQSS---- 116
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
FK + +P S+DWR+ GAVT +++QG+CGSCWAFS VAAVEGI +I GKL+ LSEQ+
Sbjct: 117 FKRERSKVLPISVDWRKMGAVTPVRNQGECGSCWAFSTVAAVEGINKIRTGKLVSLSEQE 176
Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
L+DC D N GC+GG M AF++I +N G+ T +YPY E+G C+ K IS
Sbjct: 177 LLDCDIDSGNEGCNGGYMVNAFKFIKQNGGITTARNYPYIGEQGICNKDKAANHVVKISG 236
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
YE +P +E+ L AV+ QPVSV +DA G F Y G+ N CG +H V V+G+G
Sbjct: 237 YETVPPNNEKILQAAVAKQPVSVAIDAGGYEFQLYSKGIFNGFCGKQLNHAVTVIGYG-- 294
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
E+NG KYWL+KNSWG WGE+GY R++RD+ G+CGIA ASYP+
Sbjct: 295 -EDNGKKYWLVKNSWGTGWGEAGYARMIRDSRDDEGICGIAMEASYPI 341
>gi|146215980|gb|ABQ10192.1| actinidin Act2a [Actinidia deliciosa]
Length = 378
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 155/339 (45%), Positives = 206/339 (60%), Gaps = 17/339 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F +L+++ A + + ++ +E W+ +HG++Y EK MR IFK+NL
Sbjct: 13 LFFSTLLILSSAIDIENSVQRTNDQVMAMYESWLVEHGKSYNSLDEKEMRFEIFKENLRI 72
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNR-PVPSVSRQSSRPSTFKYQNVTD- 130
I+ N + NR+Y LG N F+DLT+EE+R+ Y G R P VS Q V D
Sbjct: 73 IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKRGPKTDVSNQ-------YMPKVGDA 125
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P +DWR GAV +K+QG C SCWAFSAVAAVEGI +I G LI LSEQ+LVDC
Sbjct: 126 LPDYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQELVDCGRTQ 185
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
GC+ GLM AF++II N G+ TE +YPY ++G C+ + TI Y+++P +
Sbjct: 186 ITKGCNRGLMTDAFKFIINNGGINTENNYPYTAKDGQCNLSLKNQKYVTIDSYKNVPSNN 245
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL +AV+ QPVSV V++ G F Y SG+ CG DHGV +VG+GT E G Y
Sbjct: 246 EMALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGSCGTAVDHGVTIVGYGT---ERGMDY 302
Query: 309 WLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
W++KNSWG WGESGYIRI R+ AG CGIA SYPV
Sbjct: 303 WIVKNSWGTNWGESGYIRIQRNIGGAGKCGIAKMPSYPV 341
>gi|115477767|ref|NP_001062479.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|42407937|dbj|BAD09076.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|113624448|dbj|BAF24393.1| Os08g0556900 [Oryza sativa Japonica Group]
gi|125562525|gb|EAZ07973.1| hypothetical protein OsI_30231 [Oryza sativa Indica Group]
gi|215701458|dbj|BAG92882.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 385
Score = 304 bits (778), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 157/317 (49%), Positives = 197/317 (62%), Gaps = 11/317 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ E +E+W QH R +D EKA R N+FK N+ I + N+ + YKL N F D+
Sbjct: 41 EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98
Query: 95 TNEEFRALYTGYNRPVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
T +EFR Y + R + R S F Y D+P ++DWREKGAV +KDQGQCG
Sbjct: 99 TADEFRRAYASSRVSHHRMFRGRGERRSGFMYAGARDLPAAVDWREKGAVGAVKDQGQCG 158
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
SCWAFS +AAVEGI I L LSEQQLVDC T N GC GGLMD AF+YI ++ G+
Sbjct: 159 SCWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGV 218
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
A + YPYR + +C + + A TI YED+P E AL +AV+NQPVSV ++A G
Sbjct: 219 AASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSH 278
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
F FY GV CG DHGVA VG+GT + G KYW+++NSWG WGE GYIR+ RD
Sbjct: 279 FQFYSEGVFAGKCGTELDHGVAAVGYGTTVD--GTKYWIVRNSWGADWGEKGYIRMKRDV 336
Query: 332 ----GLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 337 SAKEGLCGIAMEASYPI 353
>gi|38345906|emb|CAE04498.2| OSJNBb0059K02.8 [Oryza sativa Japonica Group]
Length = 458
Score = 303 bits (777), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 156/327 (47%), Positives = 206/327 (62%), Gaps = 16/327 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG++Y E+ R F+ NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKSYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 84 YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R Y G N+P R+ + + +P S+DWR KGA
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V IKDQ GSCWAFSA+AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 141 VAEIKDQEVAGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F++II N G+ TE DYPY+ ++ CD ++ A TI YED+ E +L +AV+NQPV
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRKNAKVVTIDSYEDVTPNSETSLQKAVANQPV 260
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV ++A GRAF Y SG+ CG DHGVA VG+GT ENG YW+++NSWG++WGE
Sbjct: 261 SVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYWIVRNSWGKSWGE 317
Query: 322 SGYIRILRD----AGLCGIATAASYPV 344
SGY+R+ R+ +G CGIA SYP+
Sbjct: 318 SGYVRMERNIKASSGKCGIAVEPSYPL 344
>gi|600111|emb|CAA84378.1| cysteine proteinase [Vicia sativa]
Length = 359
Score = 303 bits (775), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 162/351 (46%), Positives = 223/351 (63%), Gaps = 18/351 (5%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
++ +K I + + +I + E S+ +E+W + H T ++ EK R
Sbjct: 1 MEMKKLLFISLSLALIFTVANTFDFNEHDLESEKSLWNLYERWRSHHTVT-RNLDEKHNR 59
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT----GYNRPVPSVSRQSS 118
N+FK N+ ++ NK ++ YKL N+F D+TN EFR +Y ++R +S ++
Sbjct: 60 FNVFKANVMHVHNTNKL-DKPYKLKLNKFGDMTNYEFRRIYADSKISHHRMFRGMSHENG 118
Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
TF Y+N DVP+SIDWR KGAVT +KDQGQCGSCWAFS +AAVEGI QI KL+ L
Sbjct: 119 ---TFMYENAVDVPSSIDWRNKGAVTGVKDQGQCGSCWAFSTIAAVEGINQIKTQKLVSL 175
Query: 179 SEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
SEQQLVDC T +N GC+GGLM+ AFE+I +N G+ TE++YPY ++GTCD +KE A +
Sbjct: 176 SEQQLVDCDTEENEGCNGGLMEYAFEFIKQN-GITTESNYPYAAKDGTCDVEKEDK-AVS 233
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I +E++P +E ALL+A + QPVSV +DA G F FY GV C + +HGVA+VG+
Sbjct: 234 IDGHENVPINNEAALLKAAAKQPVSVAIDAGGYNFQFYSEGVFTGHCDTDLNHGVAIVGY 293
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
G ++ KYW++KNSWG WGE GYIR+ R GLCGIA ASYP+
Sbjct: 294 GVTQDR--TKYWIMKNSWGSEWGEQGYIRMQRGISSREGLCGIAMEASYPI 342
>gi|297603535|ref|NP_001054211.2| Os04g0670200 [Oryza sativa Japonica Group]
gi|109939735|sp|P25777.2|ORYB_ORYSJ RecName: Full=Oryzain beta chain; Flags: Precursor
gi|32488398|emb|CAE02823.1| OSJNBa0043A12.28 [Oryza sativa Japonica Group]
gi|90399163|emb|CAJ86092.1| H0818H01.14 [Oryza sativa Indica Group]
gi|125550169|gb|EAY95991.1| hypothetical protein OsI_17862 [Oryza sativa Indica Group]
gi|215766596|dbj|BAG98700.1| unnamed protein product [Oryza sativa Japonica Group]
gi|255675868|dbj|BAF16125.2| Os04g0670200 [Oryza sativa Japonica Group]
Length = 466
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 208/313 (66%), Gaps = 17/313 (5%)
Query: 42 HEQWMAQHGRTYKDEL--EKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNE 97
++ W+A++G + L E R +F NL++++ N + ++LG N F+DLTNE
Sbjct: 52 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNE 111
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EFRA + G R + +++ V ++P S+DWREKGAV +K+QGQCGSCWA
Sbjct: 112 EFRATFLG----AKVAERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 167
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEA 215
FSAV+ VE I Q+ G++I LSEQ+LV+CST+ N GC+GGLMD AF++II+N G+ TE
Sbjct: 168 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMDDAFDFIIKNGGIDTED 227
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
DYPY+ +G CD +E A +I +ED+P+ DE++L +AV++QPVSV ++A GR F Y
Sbjct: 228 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 287
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----A 331
SGV + CG + DHGV VG+GT +NG YW+++NSWG WGESGY+R+ R+
Sbjct: 288 HSGVFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTT 344
Query: 332 GLCGIATAASYPV 344
G CGIA ASYP
Sbjct: 345 GKCGIAMMASYPT 357
>gi|2463586|dbj|BAA22545.1| FB22 precursor [Ananas comosus]
Length = 340
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 210/339 (61%), Gaps = 20/339 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F+ + L + AS + R ++++ E+WMA++GR YKD EK R IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPV-----PSVSRQSSRPSTFKYQN 127
IE N +Y LG N+F+D+TN EF YTG + P+ P VS F N
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVTQYTGVSLPLNFKREPVVS--------FDDVN 119
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
++ V SIDWR+ GAVT +KDQ CGSCWAFSA+A VEGI +I G L+ LSEQ+++DC+
Sbjct: 120 ISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCA 179
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC GG +D A+++II N G+A+EADYPY+ EG C +A I+ Y +
Sbjct: 180 VSN-GCDGGFVDNAYDFIISNNGVASEADYPYQAYEGDC-TANSWPNSAYITGYSYVRSN 237
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
DE ++ AV NQP++ +DASG F +Y GV + CG + +H + ++G+G ++ +G +
Sbjct: 238 DESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTQ 295
Query: 308 YWLIKNSWGETWGESGYIRILR---DAGLCGIATAASYP 343
YW++KNSWG +WGE GY+R+ R +GLCGIA YP
Sbjct: 296 YWIVKNSWGSSWGERGYVRMARGVSSSGLCGIAMDPLYP 334
>gi|38345188|emb|CAE03344.2| OSJNBb0005B05.11 [Oryza sativa Japonica Group]
gi|125589403|gb|EAZ29753.1| hypothetical protein OsJ_13812 [Oryza sativa Japonica Group]
Length = 323
Score = 303 bits (775), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 155/344 (45%), Positives = 217/344 (63%), Gaps = 39/344 (11%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F I+ + C++ + + + ++ +HE+WMAQ+GR YKD+ EKA R +FK N+ +
Sbjct: 8 LFAILGCLCLCSAVLAARELSDDAAMAARHERWMAQYGRMYKDDAEKARRFEVFKANVAF 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--D 130
IE N GN + LG N+F+DLTN+EFR+ T +PS +R P+ F+ +NV
Sbjct: 68 IESFNA-GNHKFWLGVNQFADLTNDEFRSTKTNKGF-IPSTTRV---PTGFRNENVNIDA 122
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+P ++DWR KG VT IKDQGQCG CWAFSAVAA+E +LVDC
Sbjct: 123 LPATMDWRTKGVVTPIKDQGQCGCCWAFSAVAAME----------------ELVDCDVHG 166
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA---ATISKYEDLP 245
++ GC GGLMD AF++II+N GL TE++YPY + K K+V+ A+I YED+P
Sbjct: 167 EDQGCEGGLMDDAFKFIIKNGGLTTESNYPYAAVD-----DKFKSVSNSVASIKGYEDVP 221
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+E AL++AV+NQPVSV VD F FYK GV+ CG + DHG+ +G+G A + G
Sbjct: 222 ANNEAALMKAVANQPVSVAVDGGDMTFQFYKGGVMTGSCGTDLDHGIVAIGYGKASD--G 279
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPVA 345
KYWL+KNSWG TWGE+G++R+ +D G+CG+A SYP A
Sbjct: 280 TKYWLLKNSWGMTWGENGFLRMEKDISDKRGMCGLAMEPSYPTA 323
>gi|312282059|dbj|BAJ33895.1| unnamed protein product [Thellungiella halophila]
Length = 379
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 197/307 (64%), Gaps = 9/307 (2%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
E W+ +HG+ Y EK RL IFK NL +I N E N Y+LG N F+DL+ E++ +
Sbjct: 65 ESWIVKHGKVYDSVAEKERRLTIFKDNLRFITNRNSE-NLGYRLGLNRFADLSLHEYKEI 123
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
G + P S +K +P S+DWR +GAVT +KDQG C SCWAFS V
Sbjct: 124 CHGADPKPPRNHVFMSSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVG 183
Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
AVEG+ +I G+L+ LSEQ L++C+ +N+GC GG ++ A+E+I+ N GL T+ DYPY+
Sbjct: 184 AVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIVSNGGLGTDNDYPYKAV 243
Query: 223 EGTCDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN 281
G CD + KE I YE+LP DE AL++AV++QPV+ +D+S R F Y+SGV +
Sbjct: 244 NGACDGRLKENIKNVMIDGYENLPANDELALMKAVAHQPVTAVIDSSSREFQLYESGVFD 303
Query: 282 ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIA 337
CG N +HGV VVG+GT ENG YW+++NSWG TWGE+GY+++ R+ GLCGIA
Sbjct: 304 GRCGTNLNHGVVVVGYGT---ENGRNYWIVRNSWGNTWGEAGYMKMARNIANPRGLCGIA 360
Query: 338 TAASYPV 344
SYP+
Sbjct: 361 MRVSYPL 367
>gi|357143305|ref|XP_003572875.1| PREDICTED: xylem cysteine proteinase 1-like [Brachypodium
distachyon]
Length = 473
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 204/312 (65%), Gaps = 12/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V+ W +H + Y EK R +FKQNL++I + N+ N +Y LG N+F+D+ +E
Sbjct: 44 LVDLFSSWSVKHSKIYVSPEEKVKRYEVFKQNLKHIVETNRR-NGSYWLGLNQFADVAHE 102
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF++ Y G + +R P+ F+Y+N ++P S+DWR+KGAVT +K+QG+CGSCWA
Sbjct: 103 EFKSTYLGLKTGMDGPARA---PTAFRYENSVNLPWSVDWRKKGAVTPVKNQGECGSCWA 159
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI GKL LSEQ+L+DC T +HGC GG MD AF YI+ N G+ T+ D
Sbjct: 160 FSTVAAVEGINQIATGKLESLSEQELMDCDTTFDHGCGGGFMDFAFAYIMGNLGIHTDDD 219
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEG C ++ ++ TIS YED+P+ E +LL+A+++QP+SV + A + F FYK
Sbjct: 220 YPYLMEEGYCKEKQPQSKVVTISGYEDVPENSEVSLLKALAHQPISVGIAAGSKDFQFYK 279
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV CG DH + VG+G++ +G Y ++KNSWG++WGE GY RI R G
Sbjct: 280 RGVFEGSCGTELDHALTAVGYGSS---DGQDYIIMKNSWGKSWGEQGYFRIKRGTGKPEG 336
Query: 333 LCGIATAASYPV 344
+C I + ASYP
Sbjct: 337 VCSIYSMASYPT 348
>gi|302763831|ref|XP_002965337.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
gi|300167570|gb|EFJ34175.1| hypothetical protein SELMODRAFT_230602 [Selaginella moellendorffii]
Length = 343
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 152/343 (44%), Positives = 217/343 (63%), Gaps = 23/343 (6%)
Query: 15 VIIILVITCASQVV--------SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
+I+LV+ A+ GR++ I E W A+HG++Y +LEKA RL IF
Sbjct: 9 TLILLVVVGATPFAIARPAALEDGRALE---IKNMFEDWAAKHGKSYSSDLEKARRLMIF 65
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG-YNRPVPSVSRQSSRPSTFKY 125
L YIEK N + N T+ LG N+FSDLTN EFRA++ G + RP Q P+ +
Sbjct: 66 SDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRP----RYQDRLPAEDED 121
Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
+V+ +PTS+DWR+KGAVT IKDQG CGSCWAFSA+A++E + +L+ LSEQQL+D
Sbjct: 122 VDVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMD 181
Query: 186 CSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV--AATISKYED 243
C T + GC GGLM+ AF+++++N G+ TEA YPY G+C+ K + A I+ ++
Sbjct: 182 CDTVDAGCDGGLMETAFKFVVKNGGVTTEASYPYTGSVGSCNANKVAIINKVAEITGFKV 241
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+ + AL++AVS PV+V + S F YKSG+L+ CG++ DHGV ++G+GT E
Sbjct: 242 VTEDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGQCGDSLDHGVLLIGYGT---E 298
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAASYPV 344
G YW+IKNSWG +WGE G+++I R G+CG+ +SYP
Sbjct: 299 GGMPYWIIKNSWGTSWGEDGFMKIERKDGDGICGMNGDSSYPT 341
>gi|2342494|dbj|BAA21848.1| bromelain [Ananas comosus]
gi|2463582|dbj|BAA22543.1| FB31 precursor [Ananas comosus]
Length = 352
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 145/335 (43%), Positives = 212/335 (63%), Gaps = 11/335 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F+ + L + AS + R ++++ E+WMA++GR YKD EK R IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDV 131
IE N +Y LG N+F+D+TN EF A YTG +RP+ + + +F N++ V
Sbjct: 68 IETFNNRNGNSYTLGINKFTDMTNNEFVAQYTGGISRPL---NIEKEPVVSFDDVNISAV 124
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH 191
SIDWR+ GAVT +KDQ CGSCWAFSA+A VEGI +I G L+ LSEQ+++DC+ N
Sbjct: 125 GQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATVEGIYKIVTGYLVSLSEQEVLDCAVSN- 183
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GG +D A+++II N G+A+EADYPY+ +G C +A I+ Y + DE +
Sbjct: 184 GCDGGFVDNAYDFIISNNGVASEADYPYQAYQGDCAANSWPN-SAYITGYSYVRSNDESS 242
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
+ AV NQP++ +DASG F +Y GV + CG + +H + ++G+G ++ +G +YW++
Sbjct: 243 MKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTQYWIV 300
Query: 312 KNSWGETWGESGYIRILR---DAGLCGIATAASYP 343
KNSWG +WGE GYIR+ R +GLCGIA YP
Sbjct: 301 KNSWGSSWGERGYIRMARGVSSSGLCGIAMDPLYP 335
>gi|30685308|ref|NP_566634.2| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315949|sp|Q9LT77.1|CPR1_ARATH RecName: Full=Probable cysteine proteinase At3g19400; Flags:
Precursor
gi|11994462|dbj|BAB02464.1| cysteine proteinase [Arabidopsis thaliana]
gi|332642715|gb|AEE76236.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 362
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 148/318 (46%), Positives = 208/318 (65%), Gaps = 14/318 (4%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E + +EQW+ ++ + Y EK R IFK NL+++++ N +RT+++G F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
LTNEEFRA+Y R ++ S + + Y+ +P +DWR GAV +KDQG CG
Sbjct: 96 LTNEEFRAIYL---RKKMERTKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
SCWAFSAV AVEGI QIT G+LI LSEQ+LVDC N GC GG+M+ AFE+I++N G+
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 212 ATEADYPYR-HEEGTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
T+ DYPY ++ G C+ K TI YED+P+ DE++L +AV++QPVSV ++AS
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
+AF YKSGV+ CG + DHGV VVG+G+ +G YW+I+NSWG WG+SGY+++ R
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGST---SGEDYWIIRNSWGLNWGDSGYVKLQR 329
Query: 330 DA----GLCGIATAASYP 343
+ G CGIA SYP
Sbjct: 330 NIDDPFGKCGIAMMPSYP 347
>gi|90399361|emb|CAJ86180.1| H0212B02.7 [Oryza sativa Indica Group]
Length = 470
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 158/339 (46%), Positives = 207/339 (61%), Gaps = 28/339 (8%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + +W A+HG+ Y E+ R F+ NL YI++ N G +
Sbjct: 25 IVSYGERSEEEARRLYAEWKAEHGKNYNAVGEEERRYAAFRDNLRYIDEHNAAADAGVHS 84
Query: 84 YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTNEE+R Y G N+P R+ + + +P S+DWR KGA
Sbjct: 85 FRLGLNRFADLTNEEYRDTYLGLRNKP----RRERKVSDRYLAADNEALPESVDWRTKGA 140
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V IKDQG CGSCWAFSA+AAVEGI QI G LI LSEQ+LVDC T N GC+GGLMD A
Sbjct: 141 VAEIKDQGGCGSCWAFSAIAAVEGINQIVTGDLISLSEQELVDCDTSYNEGCNGGLMDYA 200
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQK------------EKAVAATISKYEDLPKGDE 249
F++II N G+ TE DYPY+ ++ CD + + A TI YED+ E
Sbjct: 201 FDFIINNGGIDTEDDYPYKGKDERCDVNRVSFVFFAPLVFQKNAKVVTIDSYEDVTPNSE 260
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
+L +AV+NQPVSV ++A GRAF Y SG+ CG DHGVA VG+GT ENG YW
Sbjct: 261 TSLQKAVANQPVSVAIEAGGRAFQLYSSGIFTGKCGTALDHGVAAVGYGT---ENGKDYW 317
Query: 310 LIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
+++NSWG++WGESGY+R+ R+ +G CGIA SYP+
Sbjct: 318 IVRNSWGKSWGESGYVRMERNIKASSGKCGIAVEPSYPL 356
>gi|2463584|dbj|BAA22544.1| FBSB precursor [Ananas comosus]
Length = 356
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 223/340 (65%), Gaps = 21/340 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+F+ + L + AS S S EPS ++++ E+WM ++GR YKD EK R IFK N+
Sbjct: 8 VFLFLFLCVMWASP--SAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT-GYNRPVPSVSRQSSRPSTFKYQNVT 129
+IE N +Y LG N+F+D+TN EF A YT G +RP+ ++ R+ +F +++
Sbjct: 66 NHIETFNSRNENSYTLGINQFTDMTNNEFIAQYTGGISRPL-NIEREPV--VSFDDVDIS 122
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
VP SIDWR+ GAVT +K+Q CG+CWAF+A+A VE I +I +G L LSEQQ++DC+
Sbjct: 123 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCA-K 181
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV--AATISKYEDLPKG 247
+GC GG +AFE+II NKG+A+ A YPY+ +GTC K V +A I+ Y +P+
Sbjct: 182 GYGCKGGWEFRAFEFIISNKGVASGAIYPYKAAKGTC---KTNGVPNSAYITGYARVPRN 238
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
+E +++ AVS QP++V VDA+ F +YKSGV N CG + +H V +G+G ++ NG K
Sbjct: 239 NESSMMYAVSKQPITVAVDANAN-FQYYKSGVFNGPCGTSLNHAVTAIGYG--QDSNGKK 295
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YW++KNSWG WGE+GYIR+ RD +G+CGIA + YP
Sbjct: 296 YWIVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|26452046|dbj|BAC43113.1| putative cysteine proteinase RD21A precursor [Arabidopsis thaliana]
Length = 362
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 148/318 (46%), Positives = 208/318 (65%), Gaps = 14/318 (4%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E + +EQW+ ++ + Y EK R IFK NL+++++ N +RT+++G F+D
Sbjct: 36 NETEVRLMYEQWLVENRKNYNGLGEKERRFKIFKDNLKFVDEHNSVPDRTFEVGLTRFAD 95
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
LTNEEFRA+Y R ++ S + + Y+ +P +DWR GAV +KDQG CG
Sbjct: 96 LTNEEFRAIYL---RKKMERNKDSVKTERYLYKEGDVLPDEVDWRANGAVVSVKDQGNCG 152
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
SCWAFSAV AVEGI QIT G+LI LSEQ+LVDC N GC GG+M+ AFE+I++N G+
Sbjct: 153 SCWAFSAVGAVEGINQITTGELISLSEQELVDCDRGFVNAGCDGGIMNYAFEFIMKNGGI 212
Query: 212 ATEADYPYR-HEEGTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
T+ DYPY ++ G C+ K TI YED+P+ DE++L +AV++QPVSV ++AS
Sbjct: 213 ETDQDYPYNANDLGLCNADKNNNTRVVTIDGYEDVPRDDEKSLKKAVAHQPVSVAIEASS 272
Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
+AF YKSGV+ CG + DHGV VVG+G+ +G YW+I+NSWG WG+SGY+++ R
Sbjct: 273 QAFQLYKSGVMTGTCGISLDHGVVVVGYGST---SGEDYWIIRNSWGLNWGDSGYVKLQR 329
Query: 330 DA----GLCGIATAASYP 343
+ G CGIA SYP
Sbjct: 330 NIDDPFGKCGIAMMPSYP 347
>gi|57118009|gb|AAW34136.1| cysteine protease gp3a [Zingiber officinale]
Length = 475
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 161/346 (46%), Positives = 220/346 (63%), Gaps = 18/346 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++P I+ L A +GRS E I+ +++W +H D+ RL +FK+N
Sbjct: 23 VVPPLDILTLSKQ-AWAAPAGRSDEEVRII--YQEWRVKHRPAENDQYVGDYRLEVFKEN 79
Query: 70 LEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
L ++++ N +R Y+LG N F+DLTNEE+RA + R + + R +S + +Y+
Sbjct: 80 LRFVDEHNAAADRGEHAYRLGMNRFADLTNEEYRARFL---RDLSRLGRSTSGEISNQYR 136
Query: 127 -NVTDV-PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
DV P SIDWREKGAV +K+QG+CGSCWAF+A+AAVEGI QI G LI LSEQQLV
Sbjct: 137 LREGDVLPDSIDWREKGAVVAVKNQGRCGSCWAFAAIAAVEGINQIVTGDLISLSEQQLV 196
Query: 185 DCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
DCST N+GC GG +AF+YII N G+ +E YPY GTC+ KE A +I Y ++
Sbjct: 197 DCSTRNYGCEGGWPYRAFQYIINNGGVNSEEHYPYTGTNGTCNTTKENAHVVSIDSYRNV 256
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
P DE++L +A +NQP+SV +DASGR F Y SG+ C + +HGV VVG+GT EN
Sbjct: 257 PSNDEKSLQKAAANQPISVGIDASGRNFQLYHSGIFTGSCNTSLNHGVTVVGYGT---EN 313
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVAI 346
G YW++KNSWGE WG SGYI + R+ +G CGIA + SYP+ +
Sbjct: 314 GNDYWIVKNSWGENWGNSGYILMERNIAESSGKCGIAISPSYPIKV 359
>gi|302790828|ref|XP_002977181.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
gi|300155157|gb|EFJ21790.1| hypothetical protein SELMODRAFT_106402 [Selaginella moellendorffii]
Length = 337
Score = 301 bits (770), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 215/341 (63%), Gaps = 21/341 (6%)
Query: 15 VIIILVITCASQVV--------SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
+I+LV+ A+ GR++ I E W A+HG++Y + EKA RL IF
Sbjct: 5 TLILLVVVGATPFAIARPAALEDGRALE---IKNMFEDWAAKHGKSYSSDWEKARRLMIF 61
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG-YNRPVPSVSRQSSRPSTFKY 125
L YIEK N + N T+ LG N+FSDLTN EFRA++ G + RP Q P+ +
Sbjct: 62 SDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAMHVGKFKRP----RYQDRLPAEDED 117
Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
+V+ +PTS+DWR+KGAVT IKDQG CGSCWAFSA+A++E + +L+ LSEQQL+D
Sbjct: 118 VDVSSLPTSLDWRQKGAVTPIKDQGDCGSCWAFSAIASIESAHFLATKELVSLSEQQLMD 177
Query: 186 CSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
C T + GC GGLM+ AF+++++N G+ TEA YPY G+C+ K K A I+ ++ +
Sbjct: 178 CDTVDAGCDGGLMETAFKFVVKNGGVTTEAAYPYTGSVGSCNANKAKNKVAEITGFKVVT 237
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+ AL++AVS PV+V + S F YKSG+L+ C ++ DHGV ++G+GT E G
Sbjct: 238 EDSADALMKAVSKTPVTVSICGSDENFQNYKSGILSGKCDDSLDHGVLLIGYGT---EGG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAASYPV 344
YW+IKNSWG +WGE G+++I R G+CG+ +SYP
Sbjct: 295 MPYWIIKNSWGTSWGEDGFMKIERKDGDGMCGMNGDSSYPT 335
>gi|168017893|ref|XP_001761481.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162687165|gb|EDQ73549.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 471
Score = 300 bits (769), Expect = 5e-79, Method: Compositional matrix adjust.
Identities = 153/319 (47%), Positives = 204/319 (63%), Gaps = 17/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
+ +++ QW+ +H R Y EK R IFK NL YI NK+ ++Y LG N+FSDL
Sbjct: 45 DDGMLDVFHQWLERHSRVYHSLSEKQRRFQIFKDNLHYIHNHNKQ-EKSYWLGLNKFSDL 103
Query: 95 TNEEFRALYTGYNRPVPSVS--RQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
T++EFRALY G RP R R F Y++V +DWR+KGAV+ +KDQG C
Sbjct: 104 THDEFRALYLGI-RPAGRAHGLRNGDR---FIYEDVV-AEEMVDWRKKGAVSDVKDQGSC 158
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGL 211
GSCWAFSA+ +VEG+ I G+LI LSEQ+LVDC N GC+GGLMD AF++II+N G+
Sbjct: 159 GSCWAFSAIGSVEGVNAIVTGELISLSEQELVDCDRGQNQGCNGGLMDYAFDFIIKNGGI 218
Query: 212 ATEADYPYRHEEGTCDN-QKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
TE DYPY+ +G CD +KE + I Y+D+P E +LL+AVS PVSV ++A GR
Sbjct: 219 DTEEDYPYKATDGQCDEARKETSKVVVIDDYQDVPTKSESSLLKAVSKNPVSVAIEAGGR 278
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR- 329
F Y+ GV CG + DHGV VG+GT +++G YW++KNSWG +WGE GYIR+ R
Sbjct: 279 DFQHYQGGVFTGPCGTDLDHGVLAVGYGT--DDDGVNYWIVKNSWGPSWGEKGYIRMERM 336
Query: 330 ----DAGLCGIATAASYPV 344
+G CGI S+P+
Sbjct: 337 GSNSTSGKCGINIEPSFPI 355
>gi|297809385|ref|XP_002872576.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
gi|297318413|gb|EFH48835.1| hypothetical protein ARALYDRAFT_489965 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 300 bits (768), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 143/307 (46%), Positives = 201/307 (65%), Gaps = 9/307 (2%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
E WM +HG+ Y+ EK RL IF+ NL +I N E N +Y+LG N F+DL+ E+ +
Sbjct: 57 ESWMVKHGKVYESVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYAQI 115
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
G + P + + +K + +P S+DWR +GAVT +KDQGQC SCWAFS V
Sbjct: 116 CHGADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGQCRSCWAFSTVG 175
Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
AVEG+ +I G+L+ LSEQ L++C+ +N+GC GG ++ A+E+I+ N GL T+ DYPY+
Sbjct: 176 AVEGLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKAL 235
Query: 223 EGTCDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN 281
G C+++ KE I YE+LP DE AL++AV++QPV+ VD+S R F Y SGV +
Sbjct: 236 NGVCNDRLKENNKNVMIDGYENLPANDESALMKAVAHQPVTAVVDSSSREFQLYASGVFD 295
Query: 282 ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIA 337
CG N +HGV VVG+GT ENG YW+++NS G TWGE+GY+++ R+ GLCGIA
Sbjct: 296 GTCGTNLNHGVVVVGYGT---ENGRDYWIVRNSRGNTWGEAGYMKMARNIANPRGLCGIA 352
Query: 338 TAASYPV 344
ASYP+
Sbjct: 353 MRASYPL 359
>gi|218183|dbj|BAA14403.1| oryzain beta precursor [Oryza sativa Japonica Group]
Length = 471
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 146/313 (46%), Positives = 207/313 (66%), Gaps = 17/313 (5%)
Query: 42 HEQWMAQHGRTYKDEL--EKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNE 97
++ W+A++G + L E R +F NL++++ N + ++LG N F+DLTNE
Sbjct: 51 YDLWLAENGGGSPNALGGEHERRFLVFWDNLKFVDAHNARADEGGGFRLGMNRFADLTNE 110
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EFRA + G R + +++ V ++P S+DWREKGAV +K+QGQCGSCWA
Sbjct: 111 EFRATFLG----AKVAERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWA 166
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEA 215
FSAV+ VE I Q+ G++I LSEQ+LV+CST+ N GC+GGLM AF++II+N G+ TE
Sbjct: 167 FSAVSTVESINQLVTGEMITLSEQELVECSTNGQNSGCNGGLMADAFDFIIKNGGIDTED 226
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
DYPY+ +G CD +E A +I +ED+P+ DE++L +AV++QPVSV ++A GR F Y
Sbjct: 227 DYPYKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLY 286
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----A 331
SGV + CG + DHGV VG+GT +NG YW+++NSWG WGESGY+R+ R+
Sbjct: 287 HSGVFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTT 343
Query: 332 GLCGIATAASYPV 344
G CGIA ASYP
Sbjct: 344 GKCGIAMMASYPT 356
>gi|1514953|dbj|BAA11170.1| cysteine proteinase [Oryza sativa (japonica cultivar-group)]
Length = 368
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 203/329 (61%), Gaps = 12/329 (3%)
Query: 23 CASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN 81
CA+ R + + ++ + +E+W H + EK R FK N+ YI + NK
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84
Query: 82 RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREK 140
L N F D+ EEFRA + G + ++ P F Y+ V D+P ++DWR K
Sbjct: 85 GYAPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRK 142
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMD 199
GAVT +KDQG+CGSCWAFS V +VEGI I G+L+ LSEQ+L+DC T DN GC GGLM+
Sbjct: 143 GAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLME 202
Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ 259
AFEYI + G+ TE+ YPYR GTCD + + I ++++P E AL +AV+NQ
Sbjct: 203 NAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQ 262
Query: 260 PVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
PVSV +DA ++F FY GV DCG + DHGVAVVG+G E +G +YW++KNSWG W
Sbjct: 263 PVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYG--ETNDGTEYWIVKNSWGTAW 320
Query: 320 GESGYIRILRDA----GLCGIATAASYPV 344
GE GYIR+ RD+ GLCGIA ASYPV
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349
>gi|297816030|ref|XP_002875898.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
gi|297321736|gb|EFH52157.1| hypothetical protein ARALYDRAFT_485194 [Arabidopsis lyrata subsp.
lyrata]
Length = 363
Score = 300 bits (767), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 156/346 (45%), Positives = 211/346 (60%), Gaps = 17/346 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
I+L C Q G E ++ + +E+W H T + E R N+F+
Sbjct: 3 LFFIVLSFLCLLQASKGFDFDEKELETEENVWKLYERWRDHHSVT-RASHEALKRFNVFR 61
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-FKYQ 126
N+ ++ + NK+ N+ YKL N F+D+T+ EFR+ Y G N + R R S F Y+
Sbjct: 62 HNVLHVHRTNKK-NKPYKLKVNRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGFMYE 120
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
NVT VP+S+DWREKGAVT +K+Q CGSCWAFS VAAVEGI +I KL+ LSEQ+LVDC
Sbjct: 121 NVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQELVDC 180
Query: 187 ST-DNHGCSGGLMDKAFEYIIENKGLATEADYPY-RHEEGTCDNQKEKAVAATISKYEDL 244
T +N GC+GGLM+ AFE+I N G+ TE YPY ++ C + TI +E +
Sbjct: 181 DTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSNDVQFCRAKSIDGETVTIDGHEHV 240
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN 304
P+ DE+ALL+AV++QPVSV +DA F Y GV +CG +HGV +VG+G E +N
Sbjct: 241 PENDEEALLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYG--ETKN 298
Query: 305 GAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVAI 346
G KYW+++NSWG WGE GY+RI R + G CGIA ASYP +
Sbjct: 299 GTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKV 344
>gi|30141023|dbj|BAC75925.1| cysteine protease-3 [Helianthus annuus]
Length = 348
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 146/348 (41%), Positives = 212/348 (60%), Gaps = 13/348 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
++ K F++ + + + + + + S+ + +E+W +QH + + EK R
Sbjct: 1 MECNKVFVLSISLALFIGVVNCIDFTEKDLATDKSLWDLYERWGSQHMVSRAPD-EKKKR 59
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVP--SVSRQSSRP 120
N+FK N+ +I + N+ G + YKL NEF+D+TN EF+A G++ + + + R
Sbjct: 60 FNVFKYNVNHINRVNQLG-KPYKLKLNEFADMTNHEFKA---GFDSKILHFRMLKGKRRQ 115
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ F + TD P SIDWR GAV IK+QG+CGSCWAFS + VEGI +I +L+ LSE
Sbjct: 116 TPFTHAKTTDPPPSIDWRTNGAVNPIKNQGRCGSCWAFSTIVGVEGINKIKTNQLVSLSE 175
Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
Q+LVDC TD GC+GGLM+ +E+I E G+ TE YPY G CD K + I
Sbjct: 176 QELVDCETDCEGCNGGLMENGYEFIKETGGVTTEQIYPYFARNGRCDISKRNSPVVKIDG 235
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
+E++P DE A+L+AV+NQPVS+ +DA G F FY GV N CG +HGVA+VG+GT
Sbjct: 236 FENVPANDESAMLRAVANQPVSIAIDAGGLNFQFYSQGVFNGACGTELNHGVAIVGYGTT 295
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
++ G YW+++NSWG WGE GY+R+ R GLCG+A ASYP+
Sbjct: 296 QD--GTNYWIVRNSWGTGWGEQGYVRMQRGVNVPEGLCGLAMDASYPI 341
>gi|357115272|ref|XP_003559414.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 360
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 158/328 (48%), Positives = 203/328 (61%), Gaps = 25/328 (7%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-------TYKLGTN 89
++ +HE WMA+HGRTY D EKA RL IF+ N E I+ N + + +++L TN
Sbjct: 38 AMASRHESWMAEHGRTYADAEEKARRLEIFRANAERIDSFNSKADAAAGESVDSHRLATN 97
Query: 90 EFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT---DVPTSIDWREKGAVTHI 146
F+DLT+EEFRA TG RP F+Y+N + D S+DWR GAVT +
Sbjct: 98 RFADLTDEEFRAARTGLRRPAAVAGAVGG---GFRYENFSLQADAAGSMDWRAMGAVTGV 154
Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEY 204
KDQG CG CWAFSAVAA+EG+T+I G+L+ LSEQQLVDC D+ GC GGLMD AF+Y
Sbjct: 155 KDQGSCGCCWAFSAVAAMEGLTKIRTGRLVSLSEQQLVDCDVYGDDQGCEGGLMDNAFQY 214
Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
I GLA+E+ YPY E+G AA+I +ED+P +E AL+ AV++QPVSV
Sbjct: 215 ISRQGGLASESAYPYSGEDGGSCRSGRAQPAASIRGHEDVPANNEGALMAAVAHQPVSVA 274
Query: 265 VDASGRAFHFYKSGVLNADCGNNC-----DHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
++ F FY GVL A C DH + VG+G A +G YWL+KNSWG W
Sbjct: 275 INGGDYVFRFYDRGVLGAGGNGGCESTELDHAITAVGYGMA--GDGTGYWLMKNSWGSGW 332
Query: 320 GESGYIRILRDA---GLCGIATAASYPV 344
GESGY+RI R + G+CG+A ASYPV
Sbjct: 333 GESGYVRIRRGSRGEGVCGLAKLASYPV 360
>gi|195637152|gb|ACG38044.1| vignain precursor [Zea mays]
Length = 377
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 152/291 (52%), Positives = 188/291 (64%), Gaps = 14/291 (4%)
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG----YNRPVPSVSRQSS 118
N+FK N+ I + N+ + YKL N F D+T +EFR Y G ++R + SS
Sbjct: 70 FNVFKANVRLIHEFNRR-DEPYKLRLNRFGDMTADEFRRHYAGSRVAHHRMFRGDRQGSS 128
Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
++F Y + DVP S+DWR+KGAVT +KDQGQCGSCWAFS +AAVEGI I L L
Sbjct: 129 ASASFMYADARDVPASVDWRQKGAVTDVKDQGQCGSCWAFSTIAAVEGINAIKTKNLTSL 188
Query: 179 SEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
SEQQLVDC T N GC+GGLMD AF+YI ++ G+A E YPYR + +C +K A T
Sbjct: 189 SEQQLVDCDTKANAGCNGGLMDYAFQYIAKHGGVAAEDAYPYRARQASC--KKSPAPVVT 246
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I YED+P DE AL +AV++QPVSV ++ASG F FY GV + CG DHGVA VG+
Sbjct: 247 IDGYEDVPANDESALKKAVAHQPVSVAIEASGSHFQFYSEGVFSGRCGTELDHGVAAVGY 306
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G + G KYWL+KNSWG WGE GYIR+ RD G CGIA ASYPV
Sbjct: 307 GVTAD--GTKYWLVKNSWGPEWGEKGYIRMARDVAAKEGHCGIAMEASYPV 355
>gi|18413507|ref|NP_567377.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315953|sp|Q9SUS9.1|CPR4_ARATH RecName: Full=Probable cysteine proteinase At4g11320; Flags:
Precursor
gi|5596478|emb|CAB51416.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267831|emb|CAB81233.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|14334764|gb|AAK59560.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|15293257|gb|AAK93739.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332657596|gb|AEE82996.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 371
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 152/364 (41%), Positives = 222/364 (60%), Gaps = 27/364 (7%)
Query: 3 LKFEKSFIIPMFVIIILVITCAS----QVVSGRSMH-------------EPSIVEKHEQW 45
+ + KS ++ +F++ +++ +CA+ VVS H + E W
Sbjct: 1 MGYAKSAML-IFLLALVIASCATAMDMSVVSSNDNHHVTAGPGRRQGIFDAEATLMFESW 59
Query: 46 MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG 105
M +HG+ Y EK RL IF+ NL +I N E N +Y+LG N F+DL+ E+ + G
Sbjct: 60 MVKHGKVYDSVAEKERRLTIFEDNLRFITNRNAE-NLSYRLGLNRFADLSLHEYGEICHG 118
Query: 106 YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVE 165
+ P + + +K + +P S+DWR +GAVT +KDQG C SCWAFS V AVE
Sbjct: 119 ADPRPPRNHVFMTSSNRYKTSDGDVLPKSVDWRNEGAVTEVKDQGLCRSCWAFSTVGAVE 178
Query: 166 GITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGT 225
G+ +I G+L+ LSEQ L++C+ +N+GC GG ++ A+E+I+ N GL T+ DYPY+ G
Sbjct: 179 GLNKIVTGELVTLSEQDLINCNKENNGCGGGKVETAYEFIMNNGGLGTDNDYPYKALNGV 238
Query: 226 CDNQ-KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADC 284
C+ + KE I YE+LP DE AL++AV++QPV+ VD+S R F Y+SGV + C
Sbjct: 239 CEGRLKEDNKNVMIDGYENLPANDEAALMKAVAHQPVTAVVDSSSREFQLYESGVFDGTC 298
Query: 285 GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAA 340
G N +HGV VVG+GT ENG YW++KNS G+TWGE+GY+++ R+ GLCGIA A
Sbjct: 299 GTNLNHGVVVVGYGT---ENGRDYWIVKNSRGDTWGEAGYMKMARNIANPRGLCGIAMRA 355
Query: 341 SYPV 344
SYP+
Sbjct: 356 SYPL 359
>gi|146215984|gb|ABQ10194.1| actinidin Act2c [Actinidia arguta]
Length = 378
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 206/338 (60%), Gaps = 15/338 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F +L+++ A +V+ + + +E W+ + G++Y EK MR IFK NL
Sbjct: 13 LFFSTLLILSSALDIVNSAQRTNDQVRDMYESWLVEQGKSYNSLDEKEMRFEIFKDNLRI 72
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV- 131
I+ N + NR++ LG N F+DLT+EE+R+ Y G+ S ++ S V DV
Sbjct: 73 IDDHNADANRSFSLGLNRFADLTDEEYRSTYLGFK------SGPKAKVSNRYVPKVGDVL 126
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STD 189
P +DWR GAV +K+QG C SCWAFSAVAAVEGI +I G L+ LSEQ+LVDC +
Sbjct: 127 PNYVDWRTVGAVVGVKNQGLCSSCWAFSAVAAVEGINKIMTGNLLSLSEQELVDCGRTQS 186
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
GC+ G M AF++II N G+ TE +YPY ++G C+ + TI YE++P +E
Sbjct: 187 TRGCNRGYMTDAFQFIINNGGINTEDNYPYTAQDGQCNRYLQNQKYVTIDDYENVPSNNE 246
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
AL AV++QPVSV +++ G F Y SG+ CG DHGV +VG+GT E G YW
Sbjct: 247 WALQNAVAHQPVSVGLESEGGKFKLYTSGIFTQYCGTAIDHGVTIVGYGT---ERGLDYW 303
Query: 310 LIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
++KNSWG WGE+GYIRI R+ AG CGIA ASYPV
Sbjct: 304 IVKNSWGTNWGENGYIRIQRNIGGAGKCGIARMASYPV 341
>gi|146215986|gb|ABQ10195.1| actinidin Act2d [Actinidia eriantha]
Length = 381
Score = 299 bits (765), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 150/346 (43%), Positives = 204/346 (58%), Gaps = 31/346 (8%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F +L+++ A + + ++ +E W+ + G++Y EK MR IFK+NL
Sbjct: 15 LFFSTLLILSSALDIKNSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRI 74
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGY---------NRPVPSVSRQSSRPSTF 123
I+ N + NR+Y LG N F+DLT+EE+R+ Y G+ NR VP V
Sbjct: 75 IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGFKSGPKAKVSNRYVPKVG--------- 125
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
+P +DWR GAV +KDQG C SCWAFSAVAAVEGI +I G LI LSEQ+L
Sbjct: 126 -----VVLPNYVDWRTVGAVVGVKDQGLCSSCWAFSAVAAVEGINKIVTGNLISLSEQEL 180
Query: 184 VDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
VDC + GC+ G M+ AF++II+N G+ TE +YPY ++G CD ++ TI Y
Sbjct: 181 VDCGRTQRTRGCNRGYMNDAFQFIIDNGGINTEDNYPYTAQDGQCDWYRKNQRYVTIDNY 240
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
E LP +E L AV+ QP++V +++ G F Y SG+ CG DHGV +VG+GT
Sbjct: 241 EQLPANNEWVLQNAVAYQPITVGLESEGGKFKLYTSGIYTGYCGTAIDHGVTIVGYGT-- 298
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
E G YW++KNSWG WGE+GYIRI R+ AG CGIA SYPV
Sbjct: 299 -ERGLDYWIVKNSWGTNWGENGYIRIQRNIGGAGKCGIAMVPSYPV 343
>gi|4426617|gb|AAD20453.1| cysteine endopeptidase precursor [Oryza sativa]
Length = 368
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 153/329 (46%), Positives = 203/329 (61%), Gaps = 12/329 (3%)
Query: 23 CASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN 81
CA+ R + + ++ + +E+W H + EK R FK N+ YI + NK
Sbjct: 26 CAAIPFDERDLESDEALWDLYERWQEHH-HVPRHHGEKHRRFGAFKDNVRYIHEHNKRAP 84
Query: 82 RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREK 140
L N F D+ EEFRA + G + ++ P F Y+ V D+P ++DWR K
Sbjct: 85 GYPPL--NRFGDMGREEFRATFAGSHANDLRRDGLAAPPLPGFMYEGVRDLPRAVDWRRK 142
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMD 199
GAVT +KDQG+CGSCWAFS V +VEGI I G+L+ LSEQ+L+DC T DN GC GGLM+
Sbjct: 143 GAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGRLVSLSEQELIDCDTADNSGCQGGLME 202
Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ 259
AFEYI + G+ TE+ YPYR GTCD + + I ++++P E AL +AV+NQ
Sbjct: 203 NAFEYIKHSGGITTESAYPYRAANGTCDAVRARGGLVVIDGHQNVPANSEAALAKAVANQ 262
Query: 260 PVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
PVSV +DA ++F FY GV DCG + DHGVAVVG+G E +G +YW++KNSWG W
Sbjct: 263 PVSVAIDAGDQSFQFYSDGVFAGDCGTDLDHGVAVVGYG--ETNDGTEYWIVKNSWGTAW 320
Query: 320 GESGYIRILRDA----GLCGIATAASYPV 344
GE GYIR+ RD+ GLCGIA ASYPV
Sbjct: 321 GEGGYIRMQRDSGYDGGLCGIAMEASYPV 349
>gi|413953666|gb|AFW86315.1| hypothetical protein ZEAMMB73_539008 [Zea mays]
Length = 314
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 152/340 (44%), Positives = 209/340 (61%), Gaps = 42/340 (12%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+ I+ C + + + + ++V +HEQWMAQ+ R YKD EKA R
Sbjct: 8 ILAILGFAFFCGAALAARDLSDDSAMVARHEQWMAQYSRVYKDASEKARRF--------- 58
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
+F+DLTN EFR++ T N+ S + + + F+Y+NV+
Sbjct: 59 -----------------KFADLTNHEFRSVKT--NKGFKSSNMKI--LTGFRYENVSADA 97
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+PT+IDWR KG VT IKDQGQCG C AFSAVAA EGI +I+ GKL+ L++Q+LVDC
Sbjct: 98 LPTTIDWRTKGVVTPIKDQGQCGCCSAFSAVAATEGIVKISTGKLVSLADQELVDCDVHG 157
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
++ GC GGLMD AF++II+N GL TE+ YPY +G C++ AATI YED+P D
Sbjct: 158 EDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCNSGSNS--AATIKGYEDVPAND 215
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E AL++A++NQPVSV VD F FY GV+ CG + DHG+A +G+G + +G KY
Sbjct: 216 EAALMKAMANQPVSVAVDGGDMTFRFYSGGVMTGSCGTDLDHGIAAIGYG--KTSDGTKY 273
Query: 309 WLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
WL+KNSWG TWGE+GY+R+ +D G+CG+A SYP
Sbjct: 274 WLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 313
>gi|317106666|dbj|BAJ53169.1| JHL18I08.3 [Jatropha curcas]
Length = 441
Score = 299 bits (765), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 143/311 (45%), Positives = 197/311 (63%), Gaps = 9/311 (2%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I E W QHG+TY + EK RL +F+ N +++ + N +GN +Y L N F+DLT+
Sbjct: 26 IAHLFETWCQQHGKTYASQEEKLFRLKVFQDNYDFVTEHNSQGNSSYTLSLNAFADLTHH 85
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+A G + S S R + V DVP S+DWR+ GAVT +KDQG CG+CW+
Sbjct: 86 EFKASRLGLS-SAASASLNVDRSNRQIPDFVADVPASVDWRKNGAVTQVKDQGNCGACWS 144
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FSA A+EGI +I G L+ LSEQ+LVDC N+GC GG+MD AF+++I+N G+ TE D
Sbjct: 145 FSATGAIEGINKIVTGSLVSLSEQELVDCDKSYNNGCEGGIMDYAFQFVIDNHGIDTEED 204
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ + +C+ +K K TI Y D+P+ +E+ LL+AV+NQPVSV + S RAF Y
Sbjct: 205 YPYQGRDRSCNKEKLKRHVVTIDGYVDVPQNNEKELLKAVANQPVSVGICGSERAFQLYS 264
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
G+ C + DH V +VG+G+ ENG YW++KNSWG WG GY+ + R++ G
Sbjct: 265 KGIFTGPCSTSLDHAVLIVGYGS---ENGVDYWIVKNSWGSYWGMDGYMHMQRNSGSSRG 321
Query: 333 LCGIATAASYP 343
LCGI ASYP
Sbjct: 322 LCGINMLASYP 332
>gi|357162587|ref|XP_003579458.1| PREDICTED: oryzain beta chain-like [Brachypodium distachyon]
Length = 470
Score = 298 bits (764), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 144/310 (46%), Positives = 201/310 (64%), Gaps = 13/310 (4%)
Query: 45 WMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
W A+HG + L E+ R F NL +++ N G ++LG N F+DLTN+EFR
Sbjct: 55 WRAEHGSGNSNSLGEEERRFRAFWDNLRFVDAHNARAAAGEEGFRLGMNRFADLTNDEFR 114
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
A Y G S ++ +++ V ++P ++DWREKGAV +K+QGQCGSCWAFSA
Sbjct: 115 AAYLGVKGAGQRRSARAGVGERYRHDGVEELPEAVDWREKGAVAPVKNQGQCGSCWAFSA 174
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
V+AVE I Q+ G+L+ LSEQ+LV+C + ++GC+GGLMD AF++II N G+ TE DYP
Sbjct: 175 VSAVESINQLVTGELVTLSEQELVECDINGQSNGCNGGLMDDAFDFIINNGGIDTEDDYP 234
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
Y+ +G CD + A +I +ED+P+ DE++L +AV++QPVSV ++A GR F Y SG
Sbjct: 235 YKALDGKCDINRRNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 294
Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLC 334
V CG DHGV VG+GT ENG YW+++NSWG WGE+GY+R+ R+ G C
Sbjct: 295 VFTGRCGTELDHGVVAVGYGT---ENGKDYWIVRNSWGPKWGEAGYLRMERNINATTGKC 351
Query: 335 GIATAASYPV 344
GIA +SYP
Sbjct: 352 GIAMMSSYPT 361
>gi|15984|emb|CAA34486.1| unnamed protein product [Actinidia deliciosa]
Length = 380
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 217/351 (61%), Gaps = 18/351 (5%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y G+ S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ ++ + +P+ +DWR GAV IK QG+CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRFGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
Q+L+DC + + GC+GG + F++II N G+ TE +YPY ++G C D Q EK V
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYV-- 234
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
TI YE++P +E AL AV+ QPVSV +DA+G AF Y SG+ CG DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVG 294
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
+GT E G YW++KNSW TWGE GY+RILR+ AG CGIAT SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|146215976|gb|ABQ10190.1| actinidin Act1b [Actinidia arguta]
Length = 380
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 149/337 (44%), Positives = 208/337 (61%), Gaps = 12/337 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F +LV++ A + + +E W+ ++G++Y E R IFK+ L +
Sbjct: 13 LFFSTLLVLSLAFNAKNLTKRTNDELKAMYESWLTKYGKSYNSLGEWERRFEIFKETLRF 72
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
I++ N + NR+Y++G N+F+D TNEEF++ Y G+ S S + + ++ + +P
Sbjct: 73 IDEHNADTNRSYRVGLNQFADQTNEEFQSTYLGFT----SGSNKMKVSNRYEPRVGQVLP 128
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STDN 190
+DWR GAV IK QGQCGSCWAFSA+A VEGI +I G LI LSEQ+LVDC + +
Sbjct: 129 DYVDWRSAGAVVDIKSQGQCGSCWAFSAIATVEGINKIVTGDLISLSEQELVDCGRTQNT 188
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC GG + F++II N G+ TEA+YPY E+G C+ + A+I YE++P +E
Sbjct: 189 RGCDGGSITDGFQFIINNGGINTEANYPYTAEDGQCNLDLQNEKYASIDTYENVPYNNEW 248
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
AL AV+ QPVSV ++A+G AF Y SG+ CG DH V +VG+GT E G YW+
Sbjct: 249 ALQTAVAYQPVSVALEAAGDAFQHYSSGIFTGPCGTAVDHAVTIVGYGT---EGGIDYWI 305
Query: 311 IKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
+KNSW TWGE GYIRILR+ AG CGIAT SYPV
Sbjct: 306 VKNSWDTTWGEEGYIRILRNVGGAGTCGIATKPSYPV 342
>gi|193806686|sp|A5HII1.1|ACTN_ACTDE RecName: Full=Actinidain; Short=Actinidin; AltName: Full=Allergen
Act d 1; AltName: Allergen=Act d 1; Flags: Precursor
gi|146215974|gb|ABQ10189.1| actinidin Act1a [Actinidia deliciosa]
Length = 380
Score = 298 bits (763), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 217/351 (61%), Gaps = 18/351 (5%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y G+ S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ ++ + +P+ +DWR GAV IK QG+CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
Q+L+DC + + GC+GG + F++II N G+ TE +YPY ++G C D Q EK V
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNLDLQNEKYV-- 234
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
TI YE++P +E AL AV+ QPVSV +DA+G AF Y SG+ CG DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVG 294
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
+GT E G YW++KNSW TWGE GY+RILR+ AG CGIAT SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|312451836|gb|ADQ85985.1| actinidin [Actinidia chinensis]
Length = 380
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 217/351 (61%), Gaps = 18/351 (5%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNTKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y G+ S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ ++ + +P+ +DWR GAV IK QG+CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
Q+L+DC + + GC+GG + F++II N G+ TE +YPY ++G C D Q EK V
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYV-- 234
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
TI YE++P +E AL AV+ QPVSV +DA+G AF Y SG+ CG DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVG 294
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
+GT E G YW++KNSW TWGE GY+RILR+ AG CGIAT SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|413938554|gb|AFW73105.1| hypothetical protein ZEAMMB73_931917 [Zea mays]
Length = 361
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 153/355 (43%), Positives = 216/355 (60%), Gaps = 26/355 (7%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ-----------WMAQHGRTYKDELEKA 60
P + + V+ A S +PS+V ++ W +HG+ Y EK
Sbjct: 3 PKLAVAVFVLFLAFAACSANHHRDPSVVGYSQEDLALPSSLFRSWSVKHGKLYASPTEKL 62
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR- 119
R IFKQNL +I + N++ N +Y LG N+F+D+ +EEF+A Y G R +P +R
Sbjct: 63 ERYEIFKQNLMHIAETNRK-NGSYWLGLNQFADVAHEEFKASYLGLKRALPRAGAPQTRT 121
Query: 120 PSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIE 177
P+ F+Y +P S+DWR KGAVT +K+QG+CGSCWAFS+VAAVEGI QI GKL+
Sbjct: 122 PTAFRYAAAAAGSLPWSVDWRYKGAVTPVKNQGKCGSCWAFSSVAAVEGINQIVTGKLVS 181
Query: 178 LSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
LSEQ+LVDC T +HGC GG MD AF Y++ ++G+ E DYPY EEG C ++ +
Sbjct: 182 LSEQELVDCDTTLDHGCEGGTMDLAFAYMMGSQGIHAEDDYPYLMEEGYCKEKQPCVLGI 241
Query: 237 T---ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVA 293
T ++ +ED+P+ E +LL+A+++QPVSV + A R F FY+ GV + C DH +
Sbjct: 242 TEQDLTGFEDVPENSEISLLKALAHQPVSVGIAAGSRDFQFYRGGVFDGACSVELDHALT 301
Query: 294 VVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL----RDAGLCGIATAASYPV 344
VG+G++ +N Y +KNSWG+ WGE GY+RI + G+CGI T ASYPV
Sbjct: 302 AVGYGSSYGQN---YITMKNSWGKNWGEQGYVRIKMGTGKPEGVCGIYTMASYPV 353
>gi|2144501|pir||TAGB actinidain (EC 3.4.22.14) precursor - kiwi fruit
gi|166317|gb|AAA32629.1| actinidin [Actinidia deliciosa]
Length = 380
Score = 298 bits (762), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 148/349 (42%), Positives = 215/349 (61%), Gaps = 14/349 (4%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y G+ S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ ++ + +P+ +DWR GAV IK QG+CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
Q+L+DC + + GC+GG + F++II N G+ TE +YPY ++G C+ + + TI
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVELQNEKYVTI 236
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
YE++P +E AL AV+ QPVSV +DA+G AF Y SG+ CG DH V +VG+G
Sbjct: 237 DTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVGYG 296
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
T E G YW++KNSW TWGE GY+RILR+ AG CGIAT SYPV
Sbjct: 297 T---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|18413505|ref|NP_567376.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315954|sp|Q9SUT0.1|CPR3_ARATH RecName: Full=Probable cysteine proteinase At4g11310; Flags:
Precursor
gi|5596477|emb|CAB51415.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|7267830|emb|CAB81232.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|332657595|gb|AEE82995.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 298 bits (762), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 153/351 (43%), Positives = 220/351 (62%), Gaps = 27/351 (7%)
Query: 13 MFVIIILVITCAS----QVVS---GRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKA 60
+ ++ +++ +CA+ VVS +H E S++ E WM +HG+ Y EK
Sbjct: 10 ILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLI--FESWMVKHGKVYGSVAEKE 67
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
RL IF+ NL +I N E N +Y+LG F+DL+ E++ + G + P
Sbjct: 68 RRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGADPRPPR--NHVFMT 124
Query: 121 STFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
S+ +Y+ D +P S+DWR +GAVT +KDQG C SCWAFS V AVEG+ +I G+L+ L
Sbjct: 125 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 184
Query: 179 SEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ-KEKAVAAT 237
SEQ L++C+ +N+GC GG ++ A+E+I++N GL T+ DYPY+ G CD + KE
Sbjct: 185 SEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVM 244
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I YE+LP DE AL++AV++QPV+ +D+S R F Y+SGV + CG N +HGV VVG+
Sbjct: 245 IDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGY 304
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
GT ENG YWL+KNS G TWGE+GY+++ R+ GLCGIA ASYP+
Sbjct: 305 GT---ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 352
>gi|357507505|ref|XP_003624041.1| Cysteine proteinase [Medicago truncatula]
gi|355499056|gb|AES80259.1| Cysteine proteinase [Medicago truncatula]
Length = 342
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 155/315 (49%), Positives = 206/315 (65%), Gaps = 21/315 (6%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
S+ E+ E W ++G YKD E+ IFK N+ YI+ N GN+ YKL N F D
Sbjct: 37 SLSERFEYWKTKYGVVYKDVAEQKKHFQIFKHNVAYIDYFNAAGNKPYKLAINRFVDKPI 96
Query: 97 EEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
E+ G+ R + ++ +TFKY+NVTD+P ++DWR++GAVT IK+QG+CGSCW
Sbjct: 97 EDSD---DGFER-----TTTTTPTTTFKYENVTDIPATVDWRKRGAVTPIKNQGKCGSCW 148
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATE 214
AFSAVAA+EGI +IT G L+ LSEQQLVDC S GC G M AF++I+EN G+ATE
Sbjct: 149 AFSAVAAIEGIQKITSGNLVSLSEQQLVDCDRSGRTKGCDNGNMINAFKFILENGGIATE 208
Query: 215 ADYPY-RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
A+YPY R +GTC K+ + I YE++P E +LL+AV+NQPVSV +D G F
Sbjct: 209 ANYPYKRVVKGTC---KKVSHKVQIKSYEEVPSNSEDSLLKAVANQPVSVGIDMRG-MFK 264
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--- 330
FY SG+ +CG +H + +VG+GT+++ G KYWL+KNSW + WGE GYIRI RD
Sbjct: 265 FYSSGIFTGECGTKPNHALTIVGYGTSKD--GIKYWLVKNSWSKRWGEKGYIRIKRDIDA 322
Query: 331 -AGLCGIATAASYPV 344
GLCGIA SYP+
Sbjct: 323 KEGLCGIAMKPSYPI 337
>gi|20260334|gb|AAM13065.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
gi|23197782|gb|AAN15418.1| drought-inducible cysteine proteinase RD21A precursor-like protein
[Arabidopsis thaliana]
Length = 357
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 154/351 (43%), Positives = 221/351 (62%), Gaps = 27/351 (7%)
Query: 13 MFVIIILVITCAS----QVVS---GRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKA 60
+ ++ +++ +CA+ VVS +H E S++ E WM +HG+ Y EK
Sbjct: 3 ILLVAMVIASCATAIDMSVVSYDDNNRLHSVFDAEASLI--FESWMVKHGKVYGSVAEKE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
RL IF+ NL +I N E N +Y+LG F+DL+ E++ + G + P P
Sbjct: 61 RRLTIFEDNLRFINNRNAE-NLSYRLGLTGFADLSLHEYKEVCHGAD-PRPP-RNHVFMT 117
Query: 121 STFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
S+ +Y+ D +P S+DWR +GAVT +KDQG C SCWAFS V AVEG+ +I G+L+ L
Sbjct: 118 SSDRYKTSADDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVTGELVTL 177
Query: 179 SEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ-KEKAVAAT 237
SEQ L++C+ +N+GC GG ++ A+E+I++N GL T+ DYPY+ G CD + KE
Sbjct: 178 SEQDLINCNKENNGCGGGKLETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKENNKNVM 237
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I YE+LP DE AL++AV++QPV+ +D+S R F Y+SGV + CG N +HGV VVG+
Sbjct: 238 IDGYENLPANDESALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHGVVVVGY 297
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
GT ENG YWL+KNS G TWGE+GY+++ R+ GLCGIA ASYP+
Sbjct: 298 GT---ENGRDYWLVKNSRGITWGEAGYMKMARNIANPRGLCGIAMRASYPL 345
>gi|641905|gb|AAC49406.1| cysteine proteinase [Zinnia violacea]
Length = 342
Score = 297 bits (761), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 144/302 (47%), Positives = 203/302 (67%), Gaps = 10/302 (3%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
S+H+ ++ E + +H + Y+ EK R IF NL++I++ NK+ + Y LG NEF
Sbjct: 41 SIHK--VIHLFESSLVKHSKIYESFDEKLHRFEIFMDNLKHIDETNKKVS-NYWLGLNEF 97
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+DLT+EEF+ + G+ + R+ F+Y++ D+P S+DWR+KGAV+ +K+QGQ
Sbjct: 98 ADLTHEEFKNKFLGFKGEL--AERKDESIEQFRYRDFVDLPKSVDWRKKGAVSPVKNQGQ 155
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
CGSCWAFS VAAVEGI QI G L LSEQ+L+DC T N+GC+GGLMD AF Y+ N G
Sbjct: 156 CGSCWAFSTVAAVEGINQIVTGNLTVLSEQELIDCDTTFNNGCNGGLMDYAFAYVTRN-G 214
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
L E +YPY EGTCD +++ + TIS Y D+P+ +E + L+A++NQP+SV ++ASGR
Sbjct: 215 LHKEEEYPYIMSEGTCDEKRDASEKVTISGYHDVPRNNEDSFLKALANQPISVAIEASGR 274
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
F FY GV + CG DHGVA VG+GT++ G Y +++NSWG WGE GYIR+ R+
Sbjct: 275 DFQFYSGGVFDGHCGTELDHGVAAVGYGTSK---GLDYVIVRNSWGPKWGEKGYIRMKRN 331
Query: 331 AG 332
G
Sbjct: 332 TG 333
>gi|57118007|gb|AAW34135.1| cysteine protease gp2b [Zingiber officinale]
Length = 379
Score = 297 bits (760), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 156/336 (46%), Positives = 212/336 (63%), Gaps = 18/336 (5%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
+L ++ V RS E ++ + +W A++ K RL +FK+NL++++K N
Sbjct: 29 VLTLSKQGGAVPVRSDEEVRML--YLEWRAKNHPAEKYLDLNEYRLEVFKENLQFVDKHN 86
Query: 78 KEGNR---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVP 132
+R T++LG N F+DLTNEE+R T + R + R +S S ++ + D+P
Sbjct: 87 AAADRGEHTFRLGMNRFADLTNEEYR---TRFLRDFSRLRRSASGKISSRYRLREGDDLP 143
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
SIDWREKGAV +K+QG CGSCWAFS VAAVEGI QI G LI LSEQQLVDC+T NHG
Sbjct: 144 DSIDWREKGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTANHG 203
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GG M+ AF++I+ N G+ +E YPYR + G C N A +I YE++P +EQ+L
Sbjct: 204 CRGGWMNPAFQFIVNNGGINSEETYPYRGQNGIC-NSTVNAPVVSIDSYENVPSHNEQSL 262
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+AV+NQPVSV +DA+GR F Y+SG+ C + +H + VVG+GT EN Y +K
Sbjct: 263 QKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGT---ENDKDYRTVK 319
Query: 313 NSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
NSWG+ WGESGYIR+ R+ G CGI ASYPV
Sbjct: 320 NSWGKNWGESGYIRVERNIGNPNGKCGITRFASYPV 355
>gi|18391078|ref|NP_563855.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
gi|110741821|dbj|BAE98853.1| papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
gi|111074448|gb|ABH04597.1| At1g09850 [Arabidopsis thaliana]
gi|332190386|gb|AEE28507.1| xylem bark cysteine peptidase 3 [Arabidopsis thaliana]
Length = 437
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 145/312 (46%), Positives = 197/312 (63%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I E + W +HG+TY E E+ R+ IFK N +++ + N N TY L N F+DLT+
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+A G + PSV S S VP S+DWR+KGAVT++KDQG CG+CW+
Sbjct: 88 EFKASRLGLSVSAPSVIMASKGQS---LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FSA A+EGI QI G LI LSEQ+L+DC N GC+GGLMD AFE++I+N G+ TE D
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 204
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ +GTC K K TI Y + DE+AL++AV+ QPVSV + S RAF Y
Sbjct: 205 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 264
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
SG+ + C + DH V +VG+G+ +NG YW++KNSWG++WG G++ + R+ G
Sbjct: 265 SGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321
Query: 333 LCGIATAASYPV 344
+CGI ASYP+
Sbjct: 322 VCGINMLASYPI 333
>gi|255546708|ref|XP_002514413.1| cysteine protease, putative [Ricinus communis]
gi|223546510|gb|EEF48009.1| cysteine protease, putative [Ricinus communis]
Length = 324
Score = 296 bits (759), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 152/318 (47%), Positives = 201/318 (63%), Gaps = 40/318 (12%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEF 91
SMH+ + E E WM++HG+TY+ EK RL +FK NL +I++ N++ TY L NEF
Sbjct: 39 SMHK--LTELFESWMSKHGKTYESIEEKLHRLEVFKDNLMHIDRRNRDVT-TYWLALNEF 95
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+DL++EEF++ R EKGAV +K+QG
Sbjct: 96 ADLSHEEFKSKLAQIRR-----------------------------LEKGAVAPVKNQGS 126
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
CGSCWAFS VAAVEGI QI G L LSEQ+L+DC T N GC+GGLMD AF+YI+ N G
Sbjct: 127 CGSCWAFSTVAAVEGINQIVTGNLTSLSEQELIDCDTSFNSGCNGGLMDYAFDYIVNNGG 186
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
L E DYPY EEGTCD ++E+ TIS Y D+P+ +E++LL+A+++QP+S+ ++ASGR
Sbjct: 187 LHKEEDYPYLMEEGTCDEKREEMEVVTISGYHDVPENNEESLLKALAHQPLSIAIEASGR 246
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
F FY GV N CG + DHGVA VG+G+++ G Y ++KNSWG WGE GYIR+ R+
Sbjct: 247 DFQFYGRGVFNGPCGTDLDHGVAAVGYGSSK---GLDYIIVKNSWGPKWGEKGYIRMKRN 303
Query: 331 A----GLCGIATAASYPV 344
GLCGI ASYP
Sbjct: 304 TGKPEGLCGINKMASYPT 321
>gi|57118011|gb|AAW34137.1| cysteine protease gp3b [Zingiber officinale]
Length = 466
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 156/325 (48%), Positives = 212/325 (65%), Gaps = 17/325 (5%)
Query: 29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR---TYK 85
+GRS E I+ +++W A+H D+ RL +FK+NL ++++ N +R Y+
Sbjct: 32 AGRSDEEVRII--YQEWRAKHRPAENDQYVGDYRLEVFKENLRFVDEHNAAADRGEHAYR 89
Query: 86 LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTDV-PTSIDWREKGAV 143
LG N F+DLTNEE+RA + R + + R +S + +Y+ DV P SIDWREKGAV
Sbjct: 90 LGMNRFADLTNEEYRARFL---RDLSRLGRSTSGEISNQYRLREGDVLPDSIDWREKGAV 146
Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFE 203
+K QG+CGSCWAF+A+A VEGI QI G LI LSEQQLVDCST NHGC GG +AF+
Sbjct: 147 VAVKSQGRCGSCWAFAAIATVEGINQIVTGDLISLSEQQLVDCSTRNHGCEGGWPYRAFQ 206
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
YII N G+ +E YPY GTC+ K A +I Y ++P DE++L +AV+NQP+SV
Sbjct: 207 YIINNGGVNSEEHYPYTGTNGTCNTTKGNAHVVSIDSYRNVPSNDEKSLQKAVANQPISV 266
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
++ASGR F Y SG+ C + +HGV VVG+GT NG YW++KNSWGE+WG+SG
Sbjct: 267 GINASGRNFQLYHSGIFTGSCNTSLNHGVTVVGYGTV---NGNDYWIVKNSWGESWGDSG 323
Query: 324 YIRILRD----AGLCGIATAASYPV 344
YI + R+ +G CGIA + SYP+
Sbjct: 324 YILMERNIAESSGKCGIAISPSYPI 348
>gi|3377948|emb|CAA08860.1| cysteine proteinase precursor, AN8 [Ananas comosus]
Length = 356
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 152/340 (44%), Positives = 222/340 (65%), Gaps = 21/340 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+F+ + L + AS S S EPS ++++ E+WM ++GR YKD EK R IFK N+
Sbjct: 8 VFLFLFLCVMWASP--SAASADEPSDPMMKRFEEWMVEYGRVYKDNDEKMRRFQIFKNNV 65
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQNVT 129
+IE N +Y LG N+F+D+TN EF A YTG +RP+ ++ R+ +F +++
Sbjct: 66 NHIETFNSRNKDSYTLGINQFTDMTNNEFVAQYTGGISRPL-NIEREPV--VSFDDVDIS 122
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
VP SIDWR+ GAVT +K+Q CG+CWAF+A+A VE I +I +G L LSEQQ++DC+
Sbjct: 123 AVPQSIDWRDYGAVTSVKNQNPCGACWAFAAIATVESIYKIKKGILEPLSEQQVLDCA-K 181
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAV--AATISKYEDLPKG 247
+GC GG +AFE+II NKG+A+ A YPY+ +GTC K V +A I+ Y +P+
Sbjct: 182 GYGCKGGWEFRAFEFIISNKGVASVAIYPYKAAKGTC---KTNGVPNSAYITGYARVPRN 238
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
+E +++ AVS QP++V VDA+ + +Y SGV N CG + +H V +G+G ++ NG K
Sbjct: 239 NESSMMYAVSKQPITVAVDANANS-QYYNSGVFNGPCGTSLNHAVTAIGYG--QDSNGKK 295
Query: 308 YWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
YW++KNSWG WGE+GYIR+ RD +G+CGIA + YP
Sbjct: 296 YWIVKNSWGARWGEAGYIRMARDVSSSSGICGIAIDSLYP 335
>gi|359359118|gb|AEV41024.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 493
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 148/343 (43%), Positives = 211/343 (61%), Gaps = 47/343 (13%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLTNEEF 99
++ W+A++GR+Y E+ R +F NL++++ N + ++LG N F+DLTN+EF
Sbjct: 49 YDLWLAENGRSYNALGERERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLTNDEF 108
Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC------- 152
RA + G V R + +++ V ++P S+DWREKGAV +K+QGQC
Sbjct: 109 RATFLG----AKFVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCVDRIIVW 164
Query: 153 -------------------------GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
GSCWAFSAV+ VE I Q+ G++I LSEQ+LV+CS
Sbjct: 165 NSMVRIYVVDAGCMLENPLMGLTVQGSCWAFSAVSTVESINQLVTGEMITLSEQELVECS 224
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
T+ N GC+GGLMD AF++II+N G+ TE DYPY+ +G CD +E A +I +ED+P
Sbjct: 225 TNGQNSGCNGGLMDDAFDFIIKNGGIDTEDDYPYKAVDGKCDINRENAKVVSIDGFEDVP 284
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+ DE++L +AV++QPVSV ++A GR F Y SGV + CG + DHGV VG+GT +NG
Sbjct: 285 QNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSGVFSGRCGTSLDHGVVAVGYGT---DNG 341
Query: 306 AKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
YW+++NSWG WGESGY+R+ R+ G CGIA ASYP
Sbjct: 342 KDYWIVRNSWGPKWGESGYVRMERNINATTGKCGIAMMASYPT 384
>gi|146215990|gb|ABQ10197.1| actinidin Act4a [Actinidia eriantha]
Length = 385
Score = 296 bits (759), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 205/312 (65%), Gaps = 12/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ E W+ ++G++Y EK R IFK NL ++++ N + NR+YK+G N+FSDLT+
Sbjct: 44 VIAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTDA 103
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
E+ ++Y G + R ++ ++ + +P S+DWR+KGAV +K+QG CGSCW
Sbjct: 104 EYSSIYLGTKFNI----RMTNVSDRYEPRVGDQLPDSVDWRKKGAVLGVKNQGNCGSCWT 159
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEA 215
F+++AAVEGI +I G LI LSEQ++VDC N+GC+GG + A+++II N G+ TEA
Sbjct: 160 FASIAAVEGINKIVTGNLISLSEQEIVDCQRKYPNNGCNGGTLSGAYQFIINNGGINTEA 219
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
+YPY +G CD K+ TI +YE++P +E+AL +AV+ QPVSV + ++ AF Y
Sbjct: 220 NYPYTGRDGVCDQNKKNKKYVTIDRYENVPSNNEKALQKAVAFQPVSVVIASNSTAFKSY 279
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AG 332
KSG+ N CG DHGV +VG+GT E G YW+++NSWG WGESGY+R+ R+ +G
Sbjct: 280 KSGIFNGPCGPRIDHGVTIVGYGT---EGGKDYWIVRNSWGPNWGESGYVRMQRNVGGSG 336
Query: 333 LCGIATAASYPV 344
C IA A YPV
Sbjct: 337 KCFIARAPVYPV 348
>gi|111073717|dbj|BAF02547.1| triticain beta [Triticum aestivum]
Length = 472
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 147/321 (45%), Positives = 205/321 (63%), Gaps = 28/321 (8%)
Query: 42 HEQWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ W+A++G E+ R F NL +++ N G Y+LG N F+DL
Sbjct: 53 YDLWLAENGGGSSPNANSIPERERRFRAFWDNLNFVDAHNARAAAGEEGYRLGMNRFADL 112
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPST-----FKYQNVTDVPTSIDWREKGAVTHIKDQ 149
TN+EFRA Y G V Q +RP +++ ++P ++DWREKGAV +K+Q
Sbjct: 113 TNDEFRAAYLG-------VKAQRARPGRMVGERYRHDGAEELPEAVDWREKGAVAPVKNQ 165
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH--GCSGGLMDKAFEYIIE 207
GQCGSCWAFSAV+ VE I QI G+++ LSEQ+LV+C T+ GC+GGLMD AFE+II+
Sbjct: 166 GQCGSCWAFSAVSTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIK 225
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
N G+ TE DYPY+ +G CD ++ A +I +ED+P+ DE++L +AV++QPVSV ++A
Sbjct: 226 NGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 285
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
GR F Y SGV + CG DHGV VG+GT ENG YW+++NSWG WGESGY+R+
Sbjct: 286 GGREFQLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGESGYLRM 342
Query: 328 LRD----AGLCGIATAASYPV 344
R+ +G CGIA +SYP
Sbjct: 343 ERNINVTSGKCGIAMMSSYPT 363
>gi|45738078|gb|AAS75836.1| fastuosain precursor [Bromelia fastuosa]
Length = 324
Score = 296 bits (759), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 143/316 (45%), Positives = 209/316 (66%), Gaps = 14/316 (4%)
Query: 35 EPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
EPS ++E+ E+WMA++GR Y D EK R IFK N+ +IE N +Y LG N+F+
Sbjct: 1 EPSDPMMERFEEWMAEYGRVYNDNAEKMRRFQIFKNNVNHIETFNNRSGNSYTLGVNQFT 60
Query: 93 DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
D+TN EF A YTG + P+ ++ R +F +++ VP SIDWR+ GAVT +K+QG C
Sbjct: 61 DMTNNEFLARYTGASLPL-NIERDPV--VSFDDVDISAVPQSIDWRDYGAVTSVKNQGSC 117
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
GSCWAFSA+A VEGI +I G LI LSEQ+++DC+ ++GC GG ++KA+++II N G+
Sbjct: 118 GSCWAFSAIATVEGIYKIKAGNLISLSEQEVLDCAL-SYGCDGGWVNKAYDFIISNNGVT 176
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
+ A+ PY+ +G C N + A I+ Y + +E++++ AV+NQP++ +DA G F
Sbjct: 177 SFANLPYKGYKGPC-NHNDLPNKAYITGYTYVQSNNERSMMIAVANQPIAALIDAGGD-F 234
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
+YKSGV CG + +H + V+G+G + +G KYW++KNSWG +WGE GYIR+ RD
Sbjct: 235 QYYKSGVFTGSCGTSLNHAITVIGYG--QTSSGTKYWIVKNSWGTSWGERGYIRMARDVS 292
Query: 332 ---GLCGIATAASYPV 344
GLCGIA A +P
Sbjct: 293 SPYGLCGIAMAPLFPT 308
>gi|224085750|ref|XP_002307688.1| predicted protein [Populus trichocarpa]
gi|222857137|gb|EEE94684.1| predicted protein [Populus trichocarpa]
Length = 436
Score = 296 bits (758), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 196/313 (62%), Gaps = 24/313 (7%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
E W +HG++Y + E++ RL +F+ N +++ K N +GN +Y L N F+DLT+ EF+
Sbjct: 30 ETWCKEHGKSYTSQEERSHRLKVFEDNYDFVTKHNSKGNSSYSLALNAFADLTHHEFKTS 89
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQN------VTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
G S+ P ++N V D+P SIDWR KG VT++KDQG CG+CW
Sbjct: 90 RLGL----------SAAPLNLAHRNLEITGVVGDIPASIDWRNKGVVTNVKDQGSCGACW 139
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
+FSA A+EGI +I G L+ LSEQ+L++C N GC GGLMD AF+++I N G+ TE
Sbjct: 140 SFSATGAIEGINKIVTGSLVSLSEQELIECDKSYNDGCGGGLMDYAFQFVINNHGIDTEE 199
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
DYPYR +GTC+ + K TI KY D+P+ +E+ LLQAV+ QPVSV + S RAF Y
Sbjct: 200 DYPYRARDGTCNKDRMKRRVVTIDKYVDVPENNEKQLLQAVAAQPVSVGICGSERAFQMY 259
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---- 331
G+ C + DH V +VG+G+ ENG YW++KNSWG WG GY+ + R++
Sbjct: 260 SKGIFTGPCSTSLDHAVLIVGYGS---ENGVDYWIVKNSWGTGWGMRGYMHMQRNSGNSQ 316
Query: 332 GLCGIATAASYPV 344
G+CGI ASYPV
Sbjct: 317 GVCGINMLASYPV 329
>gi|75277440|sp|O23791.1|BROM1_ANACO RecName: Full=Fruit bromelain; AltName: Allergen=Ana c 2; Flags:
Precursor
gi|2342496|dbj|BAA21849.1| bromelain [Ananas comosus]
Length = 351
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 138/335 (41%), Positives = 218/335 (65%), Gaps = 12/335 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F+ + L AS + R ++++ E+WMA++GR YKD+ EK R IFK N+++
Sbjct: 8 VFLFLFLCAMWASPSAASRDEPNDPMMKRFEEWMAEYGRVYKDDDEKMRRFQIFKNNVKH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N +Y LG N+F+D+T EF A YTG + P+ ++ R+ +F N++ VP
Sbjct: 68 IETFNSRNENSYTLGINQFTDMTKSEFVAQYTGVSLPL-NIEREPV--VSFDDVNISAVP 124
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
SIDWR+ GAV +K+Q CGSCW+F+A+A VEGI +I G L+ LSEQ+++DC+ ++G
Sbjct: 125 QSIDWRDYGAVNEVKNQNPCGSCWSFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV-SYG 183
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GG ++KA+++II N G+ TE +YPY +GTC N +A I+ Y + + DE+++
Sbjct: 184 CKGGWVNKAYDFIISNNGVTTEENYPYLAYQGTC-NANSFPNSAYITGYSYVRRNDERSM 242
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
+ AVSNQP++ +DAS F +Y GV + CG + +H + ++G+G ++ +G KYW+++
Sbjct: 243 MYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTKYWIVR 299
Query: 313 NSWGETWGESGYIRILR----DAGLCGIATAASYP 343
NSWG +WGE GY+R+ R +G+CGIA A +P
Sbjct: 300 NSWGSSWGEGGYVRMARGVSSSSGVCGIAMAPLFP 334
>gi|302763837|ref|XP_002965340.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
gi|302790566|ref|XP_002977050.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300155026|gb|EFJ21659.1| hypothetical protein SELMODRAFT_232903 [Selaginella moellendorffii]
gi|300167573|gb|EFJ34178.1| hypothetical protein SELMODRAFT_143126 [Selaginella moellendorffii]
Length = 300
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 146/304 (48%), Positives = 199/304 (65%), Gaps = 10/304 (3%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
E W A+HG++Y + EKA RL IF L YIEK N N T+ LG N+FSDLTN EFRA
Sbjct: 3 EGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
Y G +P Q RP+ +V+ +PTS+DWR++GAVT IKDQGQCGSCWAFSA+A
Sbjct: 63 YVGKFKPP---RYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
++E + +L+ LSEQQL+DC T + GC GG + AF++++EN G+ TE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179
Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
G+C+ K K V I+ Y+D+ K AL++AVS PV+V + S + F Y+SG+L+
Sbjct: 180 AGSCNANKNKVVE--ITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAA 340
C N+ DH V V+G+GT E G YW+IKNSWG +WGE G++RI ++ G+CG+ +
Sbjct: 238 HCSNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGEDGFMRIKKEDGEGMCGMNGQS 294
Query: 341 SYPV 344
SYP
Sbjct: 295 SYPT 298
>gi|359359166|gb|AEV41071.1| putative oryzain beta chain precursor [Oryza minuta]
Length = 464
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 144/310 (46%), Positives = 203/310 (65%), Gaps = 14/310 (4%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN-KEGNRTYKLGTNEFSDLTNEEFR 100
++ W+A++GR+Y E R +F NL + + N + + ++LG N F+DLTNEEFR
Sbjct: 53 YDLWLAENGRSYNALGEHERRFRVFWDNLRFADAHNARADDHGFRLGMNRFADLTNEEFR 112
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
A + G V R + +++ V ++P S+DWREKGAV +K+QGQCGSCWAFSA
Sbjct: 113 ATFLG----AKVVERSRAAGERYRHDGVEELPESVDWREKGAVAPVKNQGQCGSCWAFSA 168
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG--LMDKAFEYIIENKGLATEADYP 218
V+ VE I Q+ G++I LSEQ+LV+CST+ LMD AF++II+N G+ TE DYP
Sbjct: 169 VSTVESINQLVTGEMITLSEQELVECSTNGQNGGCNGGLMDDAFDFIIKNGGIDTEDDYP 228
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
Y+ +G CD +E A +I +ED+P+ DE++L +AV++QPVSV ++A GR F Y SG
Sbjct: 229 YKAVDGKCDINRENAKVVSIDGFEDVPQNDEKSLQKAVAHQPVSVAIEAGGREFQLYHSG 288
Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLC 334
V + CG + DHGV VG+GT +NG YW+++NSWG WGESGY+R+ R+ G C
Sbjct: 289 VFSGRCGTSLDHGVVAVGYGT---DNGKDYWIVRNSWGPKWGESGYVRMERNINVTTGKC 345
Query: 335 GIATAASYPV 344
GIA ASYP
Sbjct: 346 GIAMMASYPT 355
>gi|357154164|ref|XP_003576692.1| PREDICTED: vignain-like [Brachypodium distachyon]
Length = 427
Score = 296 bits (757), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 150/312 (48%), Positives = 192/312 (61%), Gaps = 11/312 (3%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+ EQWM +HGR Y + EK R ++K+NL IE+ N G Y L N+F+DLTNEEFR
Sbjct: 118 RFEQWMGKHGRAYANGGEKQRRFEVYKENLALIEEFNS-GGHGYTLTDNKFADLTNEEFR 176
Query: 101 ALYTGYNRPVPSVSRQSSRPSTF----KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
A G P R++ S N TD+P +DWR+KGAV +K+QG CGSCW
Sbjct: 177 AKMLGGLGADPDRRRRARHASNALELPGNDNSTDLPKDVDWRKKGAVVEVKNQGSCGSCW 236
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEAD 216
AFSAVAA+EG+ QI GKL+ LSEQ+LVDC + GC+GG M AFE+++ N GL TEA
Sbjct: 237 AFSAVAAMEGLNQIKNGKLVSLSEQELVDCDAEAVGCAGGFMSWAFEFVMANHGLTTEAS 296
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ G C K + +I+ Y ++ E LL+ + QPVSV VDA G F Y
Sbjct: 297 YPYKGINGACQTAKLNESSVSITGYVNVTVNSEAELLKVAAVQPVSVAVDAGGFLFQLYA 356
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
GV + C +HGV VVG+G E + KYW++KNSWG WGE+GY+ + RDA G
Sbjct: 357 GGVFSGPCTAQINHGVTVVGYG--ETDKAEKYWIVKNSWGPEWGEAGYMLMQRDAGVPTG 414
Query: 333 LCGIATAASYPV 344
LCGIA ASYPV
Sbjct: 415 LCGIAMLASYPV 426
>gi|18408616|ref|NP_566901.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|75313880|sp|Q9STL5.1|CEP3_ARATH RecName: Full=KDEL-tailed cysteine endopeptidase CEP3; Flags:
Precursor
gi|4678353|emb|CAB41163.1| cysteine endopeptidase precursor-like protein [Arabidopsis
thaliana]
gi|26453052|dbj|BAC43602.1| putative cysteine endopeptidase precursor [Arabidopsis thaliana]
gi|332644885|gb|AEE78406.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 364
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 153/349 (43%), Positives = 211/349 (60%), Gaps = 17/349 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEP------SIVEKHEQWMAQHGRTYKDELEKAMRLN 64
+ +F I+++ Q G E ++ + +E+W H + + E R N
Sbjct: 1 MKLFFIVLISFLSLLQASKGFDFDEKELETEENVWKLYERWRGHHSVS-RASHEAIKRFN 59
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST-F 123
+F+ N+ ++ + NK+ N+ YKL N F+D+T+ EFR+ Y G N + R R S F
Sbjct: 60 VFRHNVLHVHRTNKK-NKPYKLKINRFADITHHEFRSSYAGSNVKHHRMLRGPKRGSGGF 118
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
Y+NVT VP+S+DWREKGAVT +K+Q CGSCWAFS VAAVEGI +I KL+ LSEQ+L
Sbjct: 119 MYENVTRVPSSVDWREKGAVTEVKNQQDCGSCWAFSTVAAVEGINKIRTNKLVSLSEQEL 178
Query: 184 VDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEE-GTCDNQKEKAVAATISKY 241
VDC T +N GC+GGLM+ AFE+I N G+ TE YPY + C TI +
Sbjct: 179 VDCDTEENQGCAGGLMEPAFEFIKNNGGIKTEETYPYDSSDVQFCRANSIGGETVTIDGH 238
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
E +P+ DE+ LL+AV++QPVSV +DA F Y GV +CG +HGV +VG+G E
Sbjct: 239 EHVPENDEEELLKAVAHQPVSVAIDAGSSDFQLYSEGVFIGECGTQLNHGVVIVGYG--E 296
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVAI 346
+NG KYW+++NSWG WGE GY+RI R + G CGIA ASYP +
Sbjct: 297 TKNGTKYWIVRNSWGPEWGEGGYVRIERGISENEGRCGIAMEASYPTKL 345
>gi|14600257|gb|AAK71314.1|AF388175_1 papain-like cysteine peptidase XBCP3 [Arabidopsis thaliana]
Length = 437
Score = 295 bits (756), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 144/312 (46%), Positives = 196/312 (62%), Gaps = 11/312 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I E + W +HG+TY E E+ R+ IFK N +++ + N N TY L N F+DLT+
Sbjct: 28 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 87
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+A G + PSV S S VP S+DWR+KGAVT++KDQG CG+CW+
Sbjct: 88 EFKASRLGLSVSAPSVIMASKGQS---LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 144
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FSA A+EGI QI G LI LSEQ+L+DC N GC+GGLMD AFE++I+N G+ TE D
Sbjct: 145 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 204
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ +GTC K K TI Y + DE+AL++AV+ QPVSV + S RAF Y
Sbjct: 205 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 264
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
G+ + C + DH V +VG+G+ +NG YW++KNSWG++WG G++ + R+ G
Sbjct: 265 RGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQRNTENSDG 321
Query: 333 LCGIATAASYPV 344
+CGI ASYP+
Sbjct: 322 VCGINMLASYPI 333
>gi|302142276|emb|CBI19479.3| unnamed protein product [Vitis vinifera]
Length = 388
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 150/309 (48%), Positives = 193/309 (62%), Gaps = 43/309 (13%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E W+A+HG++Y EK R IFK NL +I++ N E NRTYK+
Sbjct: 4 YEAWLAKHGKSYNALGEKERRFQIFKDNLRFIDEHNAE-NRTYKI--------------- 47
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
S R + ++ +P S+DWR+KGAV +KDQG CGSCWAFS +
Sbjct: 48 ---------------SDR---YAFRVGDSLPESVDWRKKGAVVEVKDQGSCGSCWAFSTI 89
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
AAVEGI +I G LI LSEQ+LVDC T N GC+GGLMD AFE+II N G+ +E DYPY+
Sbjct: 90 AAVEGINKIVTGGLISLSEQELVDCDTSYNEGCNGGLMDYAFEFIINNGGIDSEEDYPYK 149
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
+G CD ++ A TI YED+P+ DE++L +AV+NQPVSV ++A GR F Y+SG+
Sbjct: 150 ASDGRCDQYRKNAKVVTIDGYEDVPENDEKSLEKAVANQPVSVAIEAGGREFQLYQSGIF 209
Query: 281 NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-----AGLCG 335
CG DHGV VG+GT ENG YW++KNSWG +WGE GYIR+ RD G CG
Sbjct: 210 TGRCGTALDHGVTAVGYGT---ENGVDYWIVKNSWGASWGEEGYIRMERDLATSATGKCG 266
Query: 336 IATAASYPV 344
IA ASYP+
Sbjct: 267 IAMEASYPI 275
>gi|302790570|ref|XP_002977052.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
gi|300155028|gb|EFJ21661.1| hypothetical protein SELMODRAFT_268054 [Selaginella moellendorffii]
Length = 300
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 146/304 (48%), Positives = 198/304 (65%), Gaps = 10/304 (3%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
E W A+HG++Y + EKA RL IF L YIEK N N T+ LG N+FSDLTN EFRA
Sbjct: 3 EGWAAKHGKSYSSDWEKARRLMIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
Y G +P Q RP+ +V+ +PTS+DWR++GAVT IKDQGQCGSCWAFSA+A
Sbjct: 63 YVGKFKPP---RYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
++E + +L+ LSEQQL+DC T + GC GG + AF++++EN G+ TE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179
Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
G+C+ K K V I+ Y+D+ K AL++AVS PV+V + S + F Y+SG+L+
Sbjct: 180 AGSCNANKNKVVE--ITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAA 340
C N+ DH V V+G+GT E G YW+IKNSWG +WGE G++RI + G+CG+ +
Sbjct: 238 HCSNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGEDGFMRIKKKDGEGMCGMNGQS 294
Query: 341 SYPV 344
SYP
Sbjct: 295 SYPT 298
>gi|242094000|ref|XP_002437490.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
gi|241915713|gb|EER88857.1| hypothetical protein SORBIDRAFT_10g028000 [Sorghum bicolor]
Length = 372
Score = 295 bits (755), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 146/310 (47%), Positives = 192/310 (61%), Gaps = 12/310 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEE 98
+E W ++HG + + +RL +F+ NL YI+ N E G T++LG F+DLT EE
Sbjct: 52 YEAWKSEHGHGHGSD--DRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPFADLTLEE 109
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
+R G+ SR S S D+P +IDWRE GAVT +K+Q QCG CWAF
Sbjct: 110 YRGRALGFRARRGGASRVGSGSSYRPRPRGGDLPDAIDWRELGAVTGVKNQEQCGGCWAF 169
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
SAVAA+EGI +I G L+ LSEQ+++DC T + GC+GG M AF+++I N G+ TEADYP
Sbjct: 170 SAVAAIEGINEIVTGNLVSLSEQEIIDCDTQDGGCNGGEMQNAFQFVINNGGIDTEADYP 229
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
Y + CD + TI + + +E AL +AV+NQPVSV +DASGR F Y SG
Sbjct: 230 YLGTDAACDANRVNERVVTIDGFVSVATENETALQEAVANQPVSVAIDASGRKFQHYTSG 289
Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLC 334
+ N CG DHGV VG+G+ ENG YW++KNSW +WGE+GYIRI R+ G C
Sbjct: 290 IFNGPCGTQLDHGVTAVGYGS---ENGKDYWIVKNSWSSSWGEAGYIRIRRNVAAATGKC 346
Query: 335 GIATAASYPV 344
GIA ASYPV
Sbjct: 347 GIAMDASYPV 356
>gi|334904467|gb|AEH26024.1| cysteine peptidase [Ananas comosus]
Length = 352
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 139/336 (41%), Positives = 218/336 (64%), Gaps = 13/336 (3%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F+ + L + AS + R ++++ E+WMA++GR YKD EK R IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDV 131
IE N +Y LG N+F+D+T EF A YTG +RP+ ++ R+ +F N++ V
Sbjct: 68 IETFNSHNGNSYTLGINQFTDMTKSEFVAQYTGGISRPL-NIEREPV--VSFDDVNISAV 124
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH 191
P SIDWR+ GAV +K+Q CGSCWAF+A+A VEGI +I G L+ LSEQ+++DC+ ++
Sbjct: 125 PQSIDWRDYGAVNEVKNQNPCGSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV-SY 183
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GG ++KA+++II N G+ TE +YPY+ +GTC N +A I+ Y + + DE++
Sbjct: 184 GCKGGWVNKAYDFIISNNGVTTEENYPYQAYQGTC-NANSFPNSAYITGYSYVRRNDERS 242
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
++ AVSNQP++ +DAS F +Y GV + CG + +H + ++G+G ++ +G KYW++
Sbjct: 243 MMYAVSNQPIAALIDAS-ENFQYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTKYWIV 299
Query: 312 KNSWGETWGESGYIRILR----DAGLCGIATAASYP 343
+NSWG +WGE GY+R+ R +G CGIA + +P
Sbjct: 300 RNSWGSSWGEGGYVRMARGVSSSSGACGIAMSPLFP 335
>gi|302790836|ref|XP_002977185.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
gi|300155161|gb|EFJ21794.1| hypothetical protein SELMODRAFT_106228 [Selaginella moellendorffii]
Length = 299
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 145/304 (47%), Positives = 199/304 (65%), Gaps = 10/304 (3%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
E W A+HG++Y + EKA RL IF L YIEK N + N T+ LG N+FSDLTN EFRA
Sbjct: 3 EDWAAKHGKSYSSDSEKARRLMIFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
Y G S Q RP+ +V+ +PTS+DWR++GAVT IKDQGQCGSCWAFSA+A
Sbjct: 63 YVG---KFKSPRYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
++E + +L+ LSEQQL+DC T + GC GG + AF++++EN G+ TE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPEDAFKFVVENGGVTTEEAYPYTGF 179
Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
G+C+ K K V I+ Y+D+ K AL++AVS PV+V + S + F Y+SG+L+
Sbjct: 180 AGSCNANKNKVVE--ITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAA 340
C N+ DH V V+G+GT E G YW+IKNSWG +WGE+G+++I + G+CG+ +
Sbjct: 238 QCSNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGENGFMKIKKKDGEGMCGMNGQS 294
Query: 341 SYPV 344
SYP
Sbjct: 295 SYPT 298
>gi|190358935|sp|P00785.4|ACTN_ACTCH RecName: Full=Actinidain; Short=Actinidin; AltName: Allergen=Act c
1; Flags: Precursor
gi|12744965|gb|AAK06862.1|AF343446_1 actinidin protease [Actinidia chinensis]
Length = 380
Score = 294 bits (753), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 152/351 (43%), Positives = 216/351 (61%), Gaps = 18/351 (5%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y + S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLRFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ ++ + +P+ +DWR GAV IK QG+CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
Q+L+DC + + GC+GG + F++II N G+ TE +YPY ++G C D Q EK V
Sbjct: 177 QELIDCGRTQNTRGCNGGYITDGFQFIINNGGINTEENYPYTAQDGECNVDLQNEKYV-- 234
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
TI YE++P +E AL AV+ QPVSV +DA+G AF Y SG+ CG DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAVDHAVTIVG 294
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
+GT E G YW++KNSW TWGE GY+RILR+ AG CGIAT SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|297809383|ref|XP_002872575.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
gi|297318412|gb|EFH48834.1| hypothetical protein ARALYDRAFT_911472 [Arabidopsis lyrata subsp.
lyrata]
Length = 371
Score = 294 bits (753), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 152/357 (42%), Positives = 218/357 (61%), Gaps = 31/357 (8%)
Query: 13 MFVIIILVIT-CAS----QVVSGRSMH---------------EPSIVEKHEQWMAQHGRT 52
+ +++ +VIT CA+ VVS + H E S++ + WM +HG+
Sbjct: 9 LILLVAMVITSCATAMDMSVVSSNNNHHLTTSPGRLHSGFDAEASLI--FDSWMVKHGKV 66
Query: 53 YKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS 112
Y EK RL IF+ NL +I N E N +Y+LG +F+DL+ E+ + G + P
Sbjct: 67 YGSVAEKERRLTIFEDNLRFISNRNAE-NLSYRLGLTQFADLSLHEYGEVCHGADPRPPR 125
Query: 113 VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITR 172
+ +K +P S+DWR +GAVT +KDQG C SCWAFS V AVEG+ +I
Sbjct: 126 NHVFMTSSDRYKTSAGDVLPKSVDWRNEGAVTEVKDQGHCRSCWAFSTVGAVEGLNKIVT 185
Query: 173 GKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ-KE 231
G+L+ LSEQ L++C+ +N+GC GG ++ A+E+I++N GL T+ DYPY+ G CD + KE
Sbjct: 186 GELVTLSEQDLINCNKENNGCGGGKVETAYEFIMKNGGLGTDNDYPYKAVNGVCDGRLKE 245
Query: 232 KAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHG 291
I +E+LP DE AL++AV++QPV+ +D+S R F Y+SGV + CG N +HG
Sbjct: 246 NNKNVMIDGFENLPANDEFALMKAVAHQPVTAVIDSSSREFQLYESGVFDGSCGTNLNHG 305
Query: 292 VAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
V VVG+GT ENG YWL+KNS G TWGE+GY+++ R+ GLCGIA ASYP+
Sbjct: 306 VVVVGYGT---ENGRDYWLVKNSRGNTWGEAGYMKMARNIANPRGLCGIAMRASYPL 359
>gi|160858205|dbj|BAF93840.1| triticain beta 2 [Triticum aestivum]
Length = 469
Score = 294 bits (753), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 145/321 (45%), Positives = 207/321 (64%), Gaps = 28/321 (8%)
Query: 42 HEQWMAQHGR-TYKDE---LEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ W+A+HG +Y + E+ R F NL +++ N G ++L N F+DL
Sbjct: 50 YDLWLAEHGGGSYPNANSIPERERRFRAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 109
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPST-----FKYQNVTDVPTSIDWREKGAVTHIKDQ 149
TN+EFRA Y G V Q +RP +++ ++P ++DWREKGAV +K+Q
Sbjct: 110 TNDEFRAAYLG-------VKGQRARPGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQ 162
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH--GCSGGLMDKAFEYIIE 207
GQCGSCWAFSA++ VE I QI G+++ LSEQ+LV+C T+ GC+GGLMD AFE+II+
Sbjct: 163 GQCGSCWAFSAISTVESINQIVTGEMVTLSEQELVECDTNGQSSGCNGGLMDDAFEFIIK 222
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
N G+ TE DYPY+ +G CD ++ A +I +ED+P+ DE++L +AV++QPVSV ++A
Sbjct: 223 NGGIDTEDDYPYKAIDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHQPVSVAIEA 282
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
GR F Y SGV + CG DHGV VG+GT ENG YW+++NSWG WGE+GY+R+
Sbjct: 283 GGREFQLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRM 339
Query: 328 LRD----AGLCGIATAASYPV 344
R+ +G CGIA +SYP
Sbjct: 340 ERNINVTSGKCGIAMMSSYPT 360
>gi|312451845|gb|ADQ85986.1| actinidin [Actinidia chinensis]
Length = 380
Score = 294 bits (752), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 152/351 (43%), Positives = 215/351 (61%), Gaps = 18/351 (5%)
Query: 3 LKFEKSFIIP--MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ KSF+ +F +L+++ A + + +E W+ ++G++Y E
Sbjct: 1 MGLPKSFVSMSLLFFSTLLILSLAFNAKNLTQRTNDEVKAMYESWLIKYGKSYNSLGEWE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
R IFK+ L +I++ N + NR+YK+G N+F+DLT+EEFR+ Y G+ S S ++
Sbjct: 61 RRFEIFKETLRFIDEHNADTNRSYKVGLNQFADLTDEEFRSTYLGFT----SGSNKTKVS 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ ++ + +P+ +DWR GAV IK QG+CG CWAFSA+A VEGI +I G LI LSE
Sbjct: 117 NRYEPRVGQVLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSE 176
Query: 181 QQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAA 236
Q+L+DC + + GC+G + F +II N G+ TE +YPY ++G C D Q EK V
Sbjct: 177 QELIDCGRTQNTRGCNGSYITDGFPFIINNGGINTEENYPYTAQDGECNVDLQNEKYV-- 234
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
TI YE++P +E AL AV+ QPVSV +DA+G AF Y SG+ CG DH V +VG
Sbjct: 235 TIDTYENVPYNNEWALQTAVTYQPVSVALDAAGDAFKQYSSGIFTGPCGTAIDHAVTIVG 294
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
+GT E G YW++KNSW TWGE GY+RILR+ AG CGIAT SYPV
Sbjct: 295 YGT---EGGIDYWIVKNSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 342
>gi|302763109|ref|XP_002964976.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|302763113|ref|XP_002964978.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
gi|300167209|gb|EFJ33814.1| hypothetical protein SELMODRAFT_83176 [Selaginella moellendorffii]
gi|300167211|gb|EFJ33816.1| hypothetical protein SELMODRAFT_83554 [Selaginella moellendorffii]
Length = 300
Score = 293 bits (751), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 144/304 (47%), Positives = 198/304 (65%), Gaps = 10/304 (3%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
E W A+H ++Y + EKA RL +F L YIEK N + N T+ LG N+FSDLTN EFRA
Sbjct: 3 EDWAAKHDKSYSSDWEKARRLMVFSDTLAYIEKHNAQPNTTFTLGLNKFSDLTNAEFRAN 62
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
Y G +P Q RP+ +V+ +PTS+DWR++GAVT IKDQGQCGSCWAFSA+A
Sbjct: 63 YVGKFKPP---RYQDRRPAKDVDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIA 119
Query: 163 AVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
++E + +L+ LSEQQL+DC T + GC GG D AF++++EN G+ TE YPY
Sbjct: 120 SIESAHFLATKELVSLSEQQLIDCDTVDQGCQGGFPDDAFKFVVENGGVTTEEAYPYTGF 179
Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
G+C+ K K V I+ Y+D+ K AL++AVS PV+V + S + F Y+SG+L+
Sbjct: 180 AGSCNTNKNKVVE--ITGYKDVTKDSADALMKAVSKTPVTVGICGSDQNFQNYRSGILSG 237
Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD--AGLCGIATAA 340
C N+ DH V V+G+GT E G YW+IKNSWG +WGE G+++I + G+CG+ +
Sbjct: 238 QCCNSRDHAVLVIGYGT---EGGMPYWIIKNSWGTSWGEDGFMKIKKKDGEGMCGMNGQS 294
Query: 341 SYPV 344
SYP
Sbjct: 295 SYPT 298
>gi|57118005|gb|AAW34134.1| cysteine protease gp2a [Zingiber officinale]
Length = 381
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 146/292 (50%), Positives = 194/292 (66%), Gaps = 16/292 (5%)
Query: 62 RLNIFKQNLEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS 118
RL +FK+NL+++++ N +R T+ LG N F+DLTNEE+R T + R + R +S
Sbjct: 73 RLEVFKENLQFVDEHNAAADRGEHTFLLGMNRFADLTNEEYR---TRFLRDFSRLRRSAS 129
Query: 119 R--PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
S ++ + D+P SIDWRE GAV +K+QG CGSCWAFS VAAVEGI QI G LI
Sbjct: 130 GKISSRYRLREGDDLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLI 189
Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
LSEQQLVDC+T NHGC GG M+ AF++I+ N G+ +E YPYR + G C N A
Sbjct: 190 SLSEQQLVDCTTANHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQNGIC-NSTVNAPVV 248
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
+I YE++P +EQ+L +AV+NQPVSV +DA+GR F Y+SG+ C + +H + VVG
Sbjct: 249 SIDSYENVPSHNEQSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVG 308
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+GT EN +W++KNSWG+ WGESGYIR R+ G CGI ASYPV
Sbjct: 309 YGT---ENDKDFWIVKNSWGKNWGESGYIRAERNIENPNGKCGITRFASYPV 357
>gi|146215982|gb|ABQ10193.1| actinidin Act2b [Actinidia eriantha]
Length = 378
Score = 293 bits (751), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 148/338 (43%), Positives = 204/338 (60%), Gaps = 15/338 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F +L+++ A + + ++ +E W+ + G++Y EK MR IFK+NL
Sbjct: 13 LFFSTLLILSLALDIENSVQRTNDQVMAMYESWLVEQGKSYNSLDEKEMRFEIFKENLRI 72
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNR-PVPSVSRQSSRPSTFKYQNVTDV 131
I+ N + NR+Y LG N F+DLT+EE+R+ Y G P VS + + + +
Sbjct: 73 IDDHNADANRSYSLGLNRFADLTDEEYRSTYLGLKMGPKTDVSNE------YMPKVGEAL 126
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STD 189
P +DWR GAV +K+QG C SCWAFSAV AVEGI +I G LI LSEQ+LVDC +
Sbjct: 127 PDYVDWRTVGAVVGVKNQGLCSSCWAFSAVTAVEGINKIVTGNLISLSEQELVDCGRTQR 186
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
GC+ GLM AF++II N G+ TE +YPY ++G C+ + TI Y+++P +E
Sbjct: 187 TKGCNRGLMTDAFQFIINNGGINTEDNYPYTAKDGQCNLSLKNQKYVTIDNYKNVPSNNE 246
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
AL +AV+ QPVSV V++ G F Y SG+ CG DHGV +VG+GT E G YW
Sbjct: 247 MALKKAVAYQPVSVGVESEGGKFKLYTSGIFTGFCGTAVDHGVTIVGYGT---ERGMDYW 303
Query: 310 LIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
++KNSWG WGE+GYIRI R+ AG CGIA SYPV
Sbjct: 304 IVKNSWGTNWGENGYIRIQRNIGGAGKCGIARMPSYPV 341
>gi|242049716|ref|XP_002462602.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
gi|241925979|gb|EER99123.1| hypothetical protein SORBIDRAFT_02g028840 [Sorghum bicolor]
Length = 384
Score = 293 bits (750), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 153/358 (42%), Positives = 204/358 (56%), Gaps = 53/358 (14%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E+ EQWM +HGR Y D EK RL ++++N+ +E N N Y+L N+F+DLTNE
Sbjct: 28 MLERFEQWMGRHGRLYADAGEKQRRLEVYRRNVALVETFNSMSNGGYRLADNKFADLTNE 87
Query: 98 EFRALYTGYNRPVP--SVSRQSSRPSTF---------KYQNVTDVPTSIDWREKGAVTHI 146
EFRA G+ RP P + ++ P T +Y + ++P S+DWREKGAV +
Sbjct: 88 EFRAKMLGFGRPPPHGRATGHTTTPGTVACIGSGLGRRYSD--ELPKSVDWREKGAVAPV 145
Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYII 206
K+QG+CGSCWAFSAVAA+EGI QI GKL+ LSEQ+LVDC T GC+GG M AFE+++
Sbjct: 146 KNQGECGSCWAFSAVAAIEGINQIKNGKLVSLSEQELVDCDTKAIGCAGGYMSWAFEFVM 205
Query: 207 ENKGLATEADYPYRHE----------------------------EGTCDNQKEKAVAATI 238
N GL TE +YPY+ G C K K A +I
Sbjct: 206 NNSGLTTERNYPYQGTYAHGNRKTHALPFDCTKGSSTCDSRAGMNGACQTPKLKESAVSI 265
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
S Y ++ E LL+A + QPVSV VDA + Y GV C + +HGV VVG+G
Sbjct: 266 SGYVNVTASSEPDLLRAAAAQPVSVAVDAGSFVWQLYGGGVFTGPCTADLNHGVTVVGYG 325
Query: 299 TAEEEN--------GAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+ + G KYW++KNSWG WG++GYI + R+A GLCGIA SYPV
Sbjct: 326 ETQRDTDGDGTGVPGQKYWIVKNSWGPEWGDAGYILMQREASVASGLCGIALLPSYPV 383
>gi|357166364|ref|XP_003580686.1| PREDICTED: oryzain alpha chain-like [Brachypodium distachyon]
Length = 360
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 153/325 (47%), Positives = 202/325 (62%), Gaps = 15/325 (4%)
Query: 29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYK 85
SG+ E + +W AQHG +E E R F+ NL YI++ N G +++
Sbjct: 30 SGQIRSEEETRRMYAEWTAQHGSPITNEEEG--RYEAFRDNLRYIDEHNAAADAGIHSFR 87
Query: 86 LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
LG N F+ LTNEE+RA Y G +V + ++ + +P S+DWREKGAV
Sbjct: 88 LGLNRFAGLTNEEYRAAYLGLRLRSGAVGDLRKPSARYEAADGEALPESVDWREKGAVGK 147
Query: 146 IKDQGQ-CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFE 203
+KDQG+ CGS WAFSA+AAVE I QI G+LI LSEQ+L+DC T N GC GGLMD AFE
Sbjct: 148 VKDQGRSCGSAWAFSAIAAVESINQIVTGELISLSEQELMDCDTSYNAGCDGGLMDDAFE 207
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
+II N G+ T+ DYPY+ +CD K A TI YEDL + +E++L +AVSNQPVSV
Sbjct: 208 FIISNGGIDTDEDYPYKARNDSCDANKRNRKAVTIDDYEDL-RMNEKSLQKAVSNQPVSV 266
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
++A GR F YKSG+ CG + DH +VG+G+ ENG YW++K S+G +WGESG
Sbjct: 267 AIEAGGRDFQLYKSGIFTGTCGTDLDHATTIVGYGS---ENGTDYWIVKESYGTSWGESG 323
Query: 324 YIRILRD----AGLCGIATAASYPV 344
Y R+ R+ +G CGIA SYPV
Sbjct: 324 YARMERNIKETSGKCGIAMLPSYPV 348
>gi|384253406|gb|EIE26881.1| hypothetical protein COCSUDRAFT_21961 [Coccomyxa subellipsoidea
C-169]
Length = 481
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 146/306 (47%), Positives = 198/306 (64%), Gaps = 11/306 (3%)
Query: 45 WMAQHGRTYKDELEKAMR-LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALY 103
W+ + YKD +E+ R +++ NLE++ N E + T+KLG F+DLT++E+R
Sbjct: 51 WVEHLQKAYKDNVEEYERKFSVWLDNLEFVHSHN-EKDSTFKLGLTNFADLTHDEYRQHA 109
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
GY + + + + F+Y + + P SIDWR+KGAVT +K+Q QCGSCWAFS +
Sbjct: 110 LGYRPELKGTGLGTGKSTGFQYADY-EAPPSIDWRKKGAVTDVKNQQQCGSCWAFSTTGS 168
Query: 164 VEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
VEG I G+L+ LSEQ+LVDC T +HGC GGLMD AF +II N G+ TE DY Y+ +
Sbjct: 169 VEGANAIYSGELVSLSEQELVDCDVTQDHGCHGGLMDFAFSFIIRNGGIDTEKDYKYKAQ 228
Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
+G C+ KEK TI YED+P DE AL +A +NQP+SV ++A R F Y GV +A
Sbjct: 229 DGVCNIAKEKRHVVTIDSYEDVPPNDESALKKAAANQPISVAIEADQREFQLYAGGVFDA 288
Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIAT 338
CG DHGV VVG+G+ +NG YW++KNSWG+ WG+SGYIR+ R AG CGIA
Sbjct: 289 PCGTALDHGVLVVGYGS---DNGTDYWIVKNSWGDFWGDSGYIRLARGISNSAGQCGIAM 345
Query: 339 AASYPV 344
ASYP+
Sbjct: 346 QASYPI 351
>gi|242094002|ref|XP_002437491.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
gi|241915714|gb|EER88858.1| hypothetical protein SORBIDRAFT_10g028010 [Sorghum bicolor]
Length = 397
Score = 293 bits (750), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 150/328 (45%), Positives = 206/328 (62%), Gaps = 28/328 (8%)
Query: 42 HEQWMAQHGRTYKD----ELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
+E W ++HGR + E +RL +F+ NL YI+ N E G T++LG F+DL
Sbjct: 54 YEAWKSKHGRPRGNCDMAGDEDRLRLEVFRDNLRYIDAHNAEADAGLHTFRLGLTPFADL 113
Query: 95 TNEEFRALYTGY---NRPVPSVSRQSSRPSTFKYQN----------VTDVPTSIDWREKG 141
T EE+R G+ +R PS +SR + ++ D+P +IDWR+ G
Sbjct: 114 TLEEYRGRALGFRARHRGGPSARAAASRVGSGGTRSHHRRPRPRPRCGDLPDAIDWRQLG 173
Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKA 201
AVT +K+Q QCG CWAFSAVAA+EGI I G L+ LSEQ+++DC T + GC+GG M+ A
Sbjct: 174 AVTDVKNQEQCGGCWAFSAVAAIEGINAIVTGNLVSLSEQEIIDCDTQDSGCNGGQMENA 233
Query: 202 FEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
F+++I+N G+ +EADYP+ +GTCD N+ A I + ++ +E AL +AV+ QP
Sbjct: 234 FQFVIDNGGIDSEADYPFIATDGTCDANKANDEKVAAIDGFVEVASNNETALQEAVAIQP 293
Query: 261 VSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWG 320
VSV +DA GRAF Y SG+ N CG N DHGV VVG+G+ ENG YW++KNSW ++WG
Sbjct: 294 VSVAIDAGGRAFQHYSSGIFNGPCGTNLDHGVTVVGYGS---ENGKAYWIVKNSWSDSWG 350
Query: 321 ESGYIRILRD----AGLCGIATAASYPV 344
E+GYIRI R+ G CGIA ASYPV
Sbjct: 351 EAGYIRIRRNVFLPVGKCGIAMDASYPV 378
>gi|255544115|ref|XP_002513120.1| cysteine protease, putative [Ricinus communis]
gi|223548131|gb|EEF49623.1| cysteine protease, putative [Ricinus communis]
Length = 362
Score = 293 bits (750), Expect = 9e-77, Method: Compositional matrix adjust.
Identities = 146/279 (52%), Positives = 193/279 (69%), Gaps = 9/279 (3%)
Query: 3 LKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMR 62
+ F + +I F + + SQ ++ R++ E S+ E+HEQWMA + R YKD EK MR
Sbjct: 1 MVFTEPYICITFALFFSIGAWTSQCMA-RTLQEASMYERHEQWMASYARVYKDANEKQMR 59
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
IFK+N++ I+ N E +++YKL N+F+DLTNEEF++L G+ + S ++
Sbjct: 60 YKIFKENVQRIDSFNSESDKSYKLAVNQFADLTNEEFKSLRNGFKGHMCS-----AQAGH 114
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F+Y+NVT VP SIDWR+KGAVT IK+QGQCGSCWAFSAVAAVEGIT+I GKLI LSEQ+
Sbjct: 115 FRYENVTAVPASIDWRKKGAVTQIKEQGQCGSCWAFSAVAAVEGITEIKTGKLISLSEQE 174
Query: 183 LVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
LVDC T ++ GC GGLMD AF++ IE GLA+EA YPY + TC ++E +A I+
Sbjct: 175 LVDCDTNSEDQGCQGGLMDDAFKF-IEQHGLASEATYPYDAADSTCKTKEEAKPSAKITG 233
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
YED+P DE AL AV+NQPVSV +DA G F FY SG+
Sbjct: 234 YEDVPANDEAALKNAVANQPVSVAIDAGGFEFQFYSSGI 272
>gi|218202389|gb|EEC84816.1| hypothetical protein OsI_31898 [Oryza sativa Indica Group]
Length = 350
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 150/324 (46%), Positives = 197/324 (60%), Gaps = 19/324 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ EQWM +HGR Y D EK R ++++N+E +E N N YKL N+F+DLTNE
Sbjct: 28 MLDRFEQWMIRHGRAYTDSGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 86
Query: 98 EFRALYTGYNRP---VPSVSRQSSRPSTFKYQNVTDV-PTSIDWREKGAVTHIKDQGQCG 153
EFRA G+ RP +P +S S ++ D+ P S+DWR+KGAV +K+QG CG
Sbjct: 87 EFRAKMLGF-RPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCG 145
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
SCWAFSAVAA+EGI QI G+L+ LSEQ+LVDC + GC GG M AFE+++ N GL T
Sbjct: 146 SCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLTT 205
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
EA YPY G C K A I+ Y ++ E L +A + QPVSV VD F
Sbjct: 206 EASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQ 265
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEEN--------GAKYWLIKNSWGETWGESGYI 325
Y SGV C + +HGV VVG+G +E + G KYW++KNSWG WG++GYI
Sbjct: 266 LYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYI 325
Query: 326 RILRD-----AGLCGIATAASYPV 344
+ RD +GLCGIA SYPV
Sbjct: 326 LMQRDVAGLASGLCGIALLPSYPV 349
>gi|296090463|emb|CBI40282.3| unnamed protein product [Vitis vinifera]
Length = 386
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 147/309 (47%), Positives = 189/309 (61%), Gaps = 43/309 (13%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E W+ +HG++Y E+ R IFK NL +IE+ N NRTYK+G
Sbjct: 4 YEAWLVKHGKSYNALGERERRFEIFKDNLRFIEEHNAV-NRTYKVG-------------- 48
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
+ ++ D+P S+DWREKGAV +KDQG CGSCWAFS +
Sbjct: 49 -------------------DRYSFRAGEDLPESVDWREKGAVVPVKDQGNCGSCWAFSTI 89
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
AAVEGI QI G LI LSEQ+LVDC N GC+GGLMD AFE+II N G+ +E DYPYR
Sbjct: 90 AAVEGINQIATGDLISLSEQELVDCDKSYNQGCNGGLMDYAFEFIINNGGIDSEEDYPYR 149
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
+ TCD ++ A +I YED+P+ DE++L +AV+NQPVSV ++A GRAF Y+SGV
Sbjct: 150 AADTTCDPNRKNARVVSIDGYEDVPQNDERSLKKAVANQPVSVAIEAGGRAFQLYQSGVF 209
Query: 281 NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-----DAGLCG 335
CG DHGV VG+GT EN YW+++NSWG WGESGYI++ R + G CG
Sbjct: 210 TGQCGTQLDHGVVAVGYGT---ENSVDYWIVRNSWGPNWGESGYIKLERNLAGTETGKCG 266
Query: 336 IATAASYPV 344
IA SYP+
Sbjct: 267 IAIEPSYPI 275
>gi|356517368|ref|XP_003527359.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 161/341 (47%), Positives = 213/341 (62%), Gaps = 29/341 (8%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
F +++ + A QV R++ + S+ E+HEQ M ++G+ YKD ++ FK+N+ YI
Sbjct: 12 FAMLLCMAFLAFQVTC-RTLQDASMXERHEQRMTRYGKVYKDPPKRX-----FKENVNYI 65
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N N+ YK G N+F+ NR + R +TFK++NVT P+
Sbjct: 66 EACNNAANKPYKRGINQFAPR------------NRFKGHMCSSIIRITTFKFENVTATPS 113
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
++D R+KGAVT IKDQGQCG CWAFSAVAA EGI ++ GKLI LSEQ+LVDC T +
Sbjct: 114 TVDCRQKGAVTPIKDQGQCGCCWAFSAVAATEGIHALSAGKLISLSEQELVDCDTKGVDX 173
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYP-YRHEEGTCDNQKEKAVAAT-ISKYEDLPKGDE 249
GC GGLMD AF++II+N GL + P Y +G C+ + AAT I+ YED+P +E
Sbjct: 174 GCEGGLMDDAFKFIIQNHGLKHXSQLPLYMGVDGKCNANEAAKNAATIITGYEDVPANNE 233
Query: 250 QALLQ-AVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
+A LQ AV+N PVS +DASG F FYKSGV CG DHGV VG+G +++ G +Y
Sbjct: 234 KAHLQKAVANNPVSEAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD--GTEY 291
Query: 309 WLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
WL+KNSWG WGE GYIR+ R + LCGIA ASYP A
Sbjct: 292 WLVKNSWGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332
>gi|115479933|ref|NP_001063560.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|113631793|dbj|BAF25474.1| Os09g0497500 [Oryza sativa Japonica Group]
gi|215704298|dbj|BAG93138.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 349
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 150/324 (46%), Positives = 197/324 (60%), Gaps = 19/324 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ EQWM +HGR Y D EK R ++++N+E +E N N YKL N+F+DLTNE
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85
Query: 98 EFRALYTGYNRP---VPSVSRQSSRPSTFKYQNVTDV-PTSIDWREKGAVTHIKDQGQCG 153
EFRA G+ RP +P +S S ++ D+ P S+DWR+KGAV +K+QG CG
Sbjct: 86 EFRAKMLGF-RPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRKKGAVVEVKNQGDCG 144
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
SCWAFSAVAA+EGI QI G+L+ LSEQ+LVDC + GC GG M AFE+++ N GL T
Sbjct: 145 SCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLTT 204
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
EA YPY G C K A I+ Y ++ E L +A + QPVSV VD F
Sbjct: 205 EASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMFQ 264
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEEN--------GAKYWLIKNSWGETWGESGYI 325
Y SGV C + +HGV VVG+G +E + G KYW++KNSWG WG++GYI
Sbjct: 265 LYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGYI 324
Query: 326 RILRD-----AGLCGIATAASYPV 344
+ RD +GLCGIA SYPV
Sbjct: 325 LMQRDVAGLASGLCGIALLPSYPV 348
>gi|3377950|emb|CAA08861.1| cysteine proteinase precursor, AN11 [Ananas comosus]
Length = 357
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 218/337 (64%), Gaps = 14/337 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
+F+ + L + AS + R ++++ E+WMA++GR YKD EK R IFK N+ +
Sbjct: 8 VFLFLFLCVMWASPSAASRDEPSDPMMKRFEEWMAEYGRVYKDNDEKMRRFQIFKNNVNH 67
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N +Y LG N+F+D+TN EF A YTG + P+ ++ R+ +F +++ VP
Sbjct: 68 IETFNSRNGNSYTLGINQFTDMTNNEFVAQYTGVSLPL-NIEREPV--VSFDDVDISAVP 124
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG 192
SIDWR GAVT +K+ CGSCWAF+A+A VE I +I RG LI LSEQQ++DC+ ++G
Sbjct: 125 QSIDWRNYGAVTSVKNHIPCGSCWAFAAIATVESIYKIKRGYLISLSEQQVLDCAV-SYG 183
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYR--HEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
C GG ++KA+++II NKG+A+ A YPY+ +GTC +A I+ Y + +E+
Sbjct: 184 CDGGWVNKAYDFIISNKGVASAAIYPYKASQGQGTCRINGVPN-SAYITGYTRVQSNNER 242
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
+++ AVSNQP++ ++ASG F YK GV + CG + +H + ++G+G ++ +G K+W+
Sbjct: 243 SMMYAVSNQPIAASIEASGD-FQHYKRGVFSGPCGTSLNHAITIIGYG--QDSSGKKFWI 299
Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
++NSWG +WGE GYIR+ RD +GLCGIA YP
Sbjct: 300 VRNSWGASWGERGYIRMARDVSSSSGLCGIAIRPLYP 336
>gi|297799636|ref|XP_002867702.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
gi|297313538|gb|EFH43961.1| hypothetical protein ARALYDRAFT_329301 [Arabidopsis lyrata subsp.
lyrata]
Length = 357
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 206/310 (66%), Gaps = 16/310 (5%)
Query: 43 EQWMAQHGRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+ WM++HG+TY + L EK R FK NL +I++ N + N +Y+LG F+DLT +E+R
Sbjct: 49 QMWMSKHGKTYTNALGEKERRFQNFKDNLRFIDQHNAK-NLSYQLGLTRFADLTVQEYRD 107
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
L+ G +P R S R + + +P S+DWR +GAV+ IKDQG C SCWAFS V
Sbjct: 108 LFPGSPKPKQRNLRISRR---YVPLDGDQLPESVDWRNEGAVSAIKDQGTCNSCWAFSTV 164
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTDNHGCSG-GLMDKAFEYIIENKGLATEADYPYR 220
AAVEGI +I G+L+ LSEQ+LVDC+ N+GC G G MD AF+++I N GL ++ DYPY+
Sbjct: 165 AAVEGINKIVTGELVSLSEQELVDCNLVNNGCYGSGTMDAAFQFLINNGGLDSDTDYPYQ 224
Query: 221 HEEGTCDNQKEKAVAA--TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
+G C N+KE TI YED+P DE +L +AV++QPVSV VD + F Y+SG
Sbjct: 225 GSQGYC-NRKESTSNKIITIDSYEDVPANDEISLQKAVAHQPVSVGVDKKSQEFMLYRSG 283
Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLC 334
+ N CG + DH + +VG+G+ ENG YW+++NSWG TWG++GY ++ R+ +G+C
Sbjct: 284 IYNGPCGTDLDHALVIVGYGS---ENGQDYWIVRNSWGTTWGDAGYAKMARNFEYPSGVC 340
Query: 335 GIATAASYPV 344
GIA ASYPV
Sbjct: 341 GIAMLASYPV 350
>gi|146215992|gb|ABQ10198.1| actinidin Act4b [Actinidia eriantha]
Length = 379
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 141/312 (45%), Positives = 206/312 (66%), Gaps = 13/312 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++ E W+ ++G++Y EK R IFK NL ++++ N + NR+YK+G N+FSDLT E
Sbjct: 44 VMAMFESWLVEYGKSYNALGEKERRFEIFKDNLRFVDEHNADVNRSYKVGLNQFSDLTLE 103
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
E+ ++Y G + R ++ ++ + +P SIDWR+KGAV +K+QG CGSCW
Sbjct: 104 EYSSIYLGTKFDM----RMTNVSDRYEPRVGDQLPNSIDWRKKGAVLGVKNQGNCGSCWT 159
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKGLATEA 215
F+ +AAVE I QI G LI LSEQQ+VDC + N+GC GG A+++II+N G+ TEA
Sbjct: 160 FAPIAAVEAINQIVTGNLISLSEQQIVDCQRKSPNNGCKGGSRAGAYQFIIDNGGINTEA 219
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
+YPY+ ++G CD QK + TI +YE++P+ +E+AL +AVSNQ VSV + ++ F Y
Sbjct: 220 NYPYKAQDGECDEQKNQKY-VTIDRYENVPRKNEKALQKAVSNQLVSVGIASNSSEFKAY 278
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR---DAG 332
KSG+ CG DH V +VG+GT E G YW+++NSWG WGE+GY+R+ R +AG
Sbjct: 279 KSGIFTGPCGAKIDHAVTIVGYGT---EGGMDYWIVRNSWGSNWGENGYVRMQRNVGNAG 335
Query: 333 LCGIATAASYPV 344
C IAT+ +YPV
Sbjct: 336 TCFIATSPNYPV 347
>gi|1174171|gb|AAB41816.1| NTH1 [Pisum sativum]
Length = 367
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 146/345 (42%), Positives = 215/345 (62%), Gaps = 14/345 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
S + + + ++ ++ + + SGRS E ++ +E+W+ +H + Y EK R IFK
Sbjct: 3 SILYSLILFGLITLSLSLDMSSGRSNKE--VMTMYEKWLVKHQKVYYGLGEKNQRFQIFK 60
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL +I++ N N +Y++G NEFSD+TN+E+R Y ++ +S +K +
Sbjct: 61 DNLIFIDEHNAP-NHSYRVGLNEFSDITNKEYRDTYLSRWSNNNIKNKITSVRYAYKAGH 119
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P S+DWR GA+T IK+QG CG+CWAFSAVAAVE I +I G L+ LSEQ+LVDC
Sbjct: 120 NNKLPVSVDWR--GALTPIKNQGSCGACWAFSAVAAVEAINKIVTGSLVSLSEQELVDCD 177
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
T N GC+GG A+ +I+EN GL ++ DYPY + TC+ K+ +I+ Y+++ +
Sbjct: 178 RTKNKGCNGGNQVNAYRFIVENGGLDSQIDYPYLGRQSTCNQAKKNTKVVSINGYKNVQR 237
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
E AL++AV+NQPVSV ++A G+ F Y+SGV CG + DH V VVG+G+ ENG
Sbjct: 238 NSESALMEAVANQPVSVGIEAYGKDFQLYQSGVFTGSCGTSLDHAVVVVGYGS---ENGK 294
Query: 307 KYWLIKNSWGETWGESGYIRILR-----DAGLCGIATAASYPVAI 346
YWL+KNSWG WGE GY++I R + G CGIA A+YP +
Sbjct: 295 DYWLVKNSWGTNWGERGYLKIERNLKNTNTGKCGIAMDATYPTKL 339
>gi|297843430|ref|XP_002889596.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
gi|297335438|gb|EFH65855.1| hypothetical protein ARALYDRAFT_887827 [Arabidopsis lyrata subsp.
lyrata]
Length = 343
Score = 291 bits (746), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 154/340 (45%), Positives = 211/340 (62%), Gaps = 18/340 (5%)
Query: 14 FVIIILVITCASQVVS-GRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
V++I + AS++ S S+++P ++ ++ E+W+ H + Y E +R I++ N+
Sbjct: 12 LVVLICFVLIASKLCSVNSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQSNV 71
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ I+ N + +KL N F+D+TN EF+A + G N + ++ RP NV
Sbjct: 72 QLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQ-RPVCDPAGNV-- 127
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--ST 188
P ++DWR +GAVT I++QG+CG CWAFSAVAA+EGI +I G L+ LSEQQL+DC T
Sbjct: 128 -PDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDVGT 186
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GCSGGLM+ AFE+I N GL TE DYPY EGTCD +K K TI Y+ + + +
Sbjct: 187 YNKGCSGGLMETAFEFIKSNGGLTTETDYPYTGIEGTCDQEKAKNKVVTIQGYQKVAQ-N 245
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E +L A + QPVSV +DA G F Y SGV + CG N +HGV VVG+G E KY
Sbjct: 246 EASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTSYCGTNLNHGVTVVGYGV---EGDQKY 302
Query: 309 WLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
W++KNSWG WGE GYIR+ R D G CGIA ASYP+
Sbjct: 303 WIVKNSWGTGWGEEGYIRMERGISEDTGKCGIAMLASYPL 342
>gi|357507617|ref|XP_003624097.1| Cysteine protease [Medicago truncatula]
gi|355499112|gb|AES80315.1| Cysteine protease [Medicago truncatula]
Length = 340
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 148/340 (43%), Positives = 215/340 (63%), Gaps = 21/340 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
+I + ++I ++ + +S+ ++ E+++ W ++ YKD+ E+ + IFK N
Sbjct: 10 LINILIVIWVMFPSNQNQENDQSL---TLSERYKHWKIKYRVIYKDDAEEEKHIQIFKHN 66
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ YI+ N GN++YKL N F+DL E G+ + + + S FKY+N+T
Sbjct: 67 VAYIDSFNAAGNKSYKLTINRFADLPTEPSD---DGFKKR----KLEPTTSSLFKYKNIT 119
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
D+P ++DWR++GAVT +K+Q +CGSCWAFSAV A+EGI QIT G L+ LSEQ+LVD
Sbjct: 120 DIPAAVDWRKRGAVTPVKNQRECGSCWAFSAVGALEGIQQITSGNLVSLSEQELVDRVRS 179
Query: 190 N--HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N +GC+GG + AFE+++EN G+ATEA YPYR +G +N K+ + I YE +P+
Sbjct: 180 NWTNGCNGGYLIDAFEFVLENGGIATEASYPYRGVKG--NNSKKVSRQVQIKSYEQVPRN 237
Query: 248 DEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK 307
E +LL+ V+NQPVSV +D SG FY SG+ +CG +H V +VG+GT+ + G K
Sbjct: 238 SEDSLLKVVANQPVSVGIDISG-MIRFYSSGIFTGECGTKPNHAVIIVGYGTSND--GTK 294
Query: 308 YWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
YWL+KNSWG WGE YIR+ RD GLCGI ASYP
Sbjct: 295 YWLVKNSWGIRWGEKRYIRMKRDIDAKEGLCGIPMDASYP 334
>gi|2160175|gb|AAB60738.1| Strong similarity to Dianthus cysteine proteinase (gb|U17135)
[Arabidopsis thaliana]
Length = 416
Score = 291 bits (745), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 197/319 (61%), Gaps = 18/319 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I E + W +HG+TY E E+ R+ IFK N +++ + N N TY L N F+DLT+
Sbjct: 26 ISELFDDWCQKHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHH 85
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+A G + PSV S S VP S+DWR+KGAVT++KDQG CG+CW+
Sbjct: 86 EFKASRLGLSVSAPSVIMASKGQS---LGGSVKVPDSVDWRKKGAVTNVKDQGSCGACWS 142
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FSA A+EGI QI G LI LSEQ+L+DC N GC+GGLMD AFE++I+N G+ TE D
Sbjct: 143 FSATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKD 202
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ +GTC K K TI Y + DE+AL++AV+ QPVSV + S RAF Y
Sbjct: 203 YPYQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALMEAVAAQPVSVGICGSERAFQLYS 262
Query: 277 S-------GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
S G+ + C + DH V +VG+G+ +NG YW++KNSWG++WG G++ + R
Sbjct: 263 SKFYLLMQGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQR 319
Query: 330 DA----GLCGIATAASYPV 344
+ G+CGI ASYP+
Sbjct: 320 NTENSDGVCGINMLASYPI 338
>gi|18390634|ref|NP_563764.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|8844131|gb|AAF80223.1|AC025290_12 Contains similarity to a cysteine endopeptidase 1 from Phaseolus
vulgaris gb|U52970 and is a member of the papain
cysteine protease family PF|00112 [Arabidopsis thaliana]
gi|332189848|gb|AEE27969.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 343
Score = 291 bits (744), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 153/342 (44%), Positives = 209/342 (61%), Gaps = 17/342 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEP--SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
+ + V+I V+ + S+++P ++ ++ E+W+ H + Y E +R I++
Sbjct: 10 LTLAVLICFVLIASKLCSVDSSVYDPHKTLKQRFEKWLKTHSKLYGGRDEWMLRFGIYQS 69
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
N++ I+ N + +KL N F+D+TN EF+A + G N + ++ RP NV
Sbjct: 70 NVQLIDYINSL-HLPFKLTDNRFADMTNSEFKAHFLGLNTSSLRLHKKQ-RPVCDPAGNV 127
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-- 186
P ++DWR +GAVT I++QG+CG CWAFSAVAA+EGI +I G L+ LSEQQL+DC
Sbjct: 128 ---PDAVDWRTQGAVTPIRNQGKCGGCWAFSAVAAIEGINKIKTGNLVSLSEQQLIDCDV 184
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
T N GCSGGLM+ AFE+I N GLATE DYPY EGTCD +K K TI Y+ + +
Sbjct: 185 GTYNKGCSGGLMETAFEFIKTNGGLATETDYPYTGIEGTCDQEKSKNKVVTIQGYQKVAQ 244
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
+E +L A + QPVSV +DA G F Y SGV CG N +HGV VVG+G E
Sbjct: 245 -NEASLQIAAAQQPVSVGIDAGGFIFQLYSSGVFTNYCGTNLNHGVTVVGYGV---EGDQ 300
Query: 307 KYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
KYW++KNSWG WGE GYIR+ R D G CGIA ASYP+
Sbjct: 301 KYWIVKNSWGTGWGEEGYIRMERGVSEDTGKCGIAMMASYPL 342
>gi|558563|emb|CAA57538.1| cysteine proteinase [Cicer arietinum]
Length = 325
Score = 290 bits (743), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 148/309 (47%), Positives = 190/309 (61%), Gaps = 11/309 (3%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+W+ +H + Y EK R IFK NL +I++ N + N +YK+G N+F+D+ NEE+R
Sbjct: 4 YEKWLVKHQKMYNGLGEKDTRFQIFKDNLRFIDEHNAQ-NYSYKVGLNKFADINNEEYRD 62
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
+Y G ++ N V +DWR KGAVTHIKDQG CGSCWAFS +
Sbjct: 63 MYLGTKSDAKRRVMKTKITGHRITYNSVIVTVKVDWRLKGAVTHIKDQGSCGSCWAFSTI 122
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
A VE I +I GK + LSEQ+LVDC N GC+GGLMD AFE+II N G+ T+ DYPY
Sbjct: 123 ATVEAINKIVTGKFVSLSEQELVDCDRAFNEGCNGGLMDYAFEFIIRNGGIDTDQDYPYN 182
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL 280
E CD K+ A +I YED+P AL +AV++QPVSV + GRA Y+SGV
Sbjct: 183 GFERKCDPTKKNAKVVSIDGYEDVPS-YMNALKKAVAHQPVSVAIAGLGRALQLYQSGVF 241
Query: 281 NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL-RDAG----LCG 335
CG + DHGV VVG+G+ ENG YWL++NSWG WGE GY +I R+ CG
Sbjct: 242 TGKCGTDLDHGVVVVGYGS---ENGVDYWLVRNSWGTNWGEDGYFKIASRNVKSLYRKCG 298
Query: 336 IATAASYPV 344
IA ASYPV
Sbjct: 299 IAMEASYPV 307
>gi|341850671|gb|AEK97329.1| chromoplast senescence-associated protein 12 [Brassica rapa var.
parachinensis]
Length = 260
Score = 290 bits (743), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 138/260 (53%), Positives = 187/260 (71%), Gaps = 8/260 (3%)
Query: 90 EFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIK 147
+F+++TN+EFR++YTGY S+ ++ ++F+YQNV+ +P ++DWR+KGAVT IK
Sbjct: 1 QFAEITNDEFRSMYTGYKGDSVLSSQSQTKSTSFRYQNVSSGALPIAVDWRKKGAVTPIK 60
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIE 207
+QG CG CWAFSAVAA+EG TQI +GKLI LSEQQLVDC T++ GCSGGL+D AFE+I+
Sbjct: 61 NQGSCGCCWAFSAVAAIEGATQIKKGKLISLSEQQLVDCDTNDFGCSGGLIDTAFEHIMA 120
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
GL TE++YPY+ E+ TC + AA+I+ YED+P DE AL++AV++QPVSV ++
Sbjct: 121 TGGLTTESNYPYKGEDATCKIKSTXPSAASITGYEDVPVNDENALMKAVAHQPVSVGIEG 180
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
G F FY SGV +C DH V VG+ ++ G+KYW+IKNSWG WGE GY+RI
Sbjct: 181 GGFDFQFYSSGVFTGECTTYLDHAVTAVGY--SQSSAGSKYWIIKNSWGTKWGEGGYMRI 238
Query: 328 LRDA----GLCGIATAASYP 343
+D GLCG+A ASYP
Sbjct: 239 KKDIKDKEGLCGLAMKASYP 258
>gi|167521499|ref|XP_001745088.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776702|gb|EDQ90321.1| predicted protein [Monosiga brevicollis MX1]
Length = 294
Score = 290 bits (743), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 152/308 (49%), Positives = 194/308 (62%), Gaps = 27/308 (8%)
Query: 47 AQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALY 103
+ + ++Y+ E +A RL F+ NLE+I K N E G +Y +G NEF+DLT +EF ALY
Sbjct: 3 SDYSKSYESEAVEAKRLAAFEANLEFINKHNAEHAQGLHSYTVGVNEFADLTIDEFMALY 62
Query: 104 --TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
+ +NR +P P+T + S+DWR KGAVT IK+QGQCGSCW+FS
Sbjct: 63 VPSKFNRTMPY--NTVYLPATSE--------DSVDWRTKGAVTPIKNQGQCGSCWSFSTT 112
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
+ EG I G L+ LSEQQLVDCS N GC+GGLMD AF+YII NKGL TE DYPY
Sbjct: 113 GSTEGAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDDAFKYIISNKGLDTEEDYPY 172
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
++GTC+ +KE AATIS Y D+PK +E L AV+ PVSV ++A F YKSGV
Sbjct: 173 TAQDGTCNKEKEAKHAATISSYSDVPKNNEDQLAAAVAKGPVSVAIEADQSGFQLYKSGV 232
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGI 336
+ +CG N DHGV VVG+ YW++KNSWG TWG GYI + R +G+CGI
Sbjct: 233 FDGNCGTNLDHGVLVVGYTD-------DYWIVKNSWGTTWGVEGYINMKRGVSASGICGI 285
Query: 337 ATAASYPV 344
A SYP+
Sbjct: 286 AMQPSYPI 293
>gi|302781881|ref|XP_002972714.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
gi|300159315|gb|EFJ25935.1| hypothetical protein SELMODRAFT_98707 [Selaginella moellendorffii]
Length = 446
Score = 289 bits (740), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 142/293 (48%), Positives = 189/293 (64%), Gaps = 16/293 (5%)
Query: 62 RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGY-----NRPVPSVSRQ 116
R FK+N YIE+ N+ G +Y+LG N+FSDLT+EEFR + G + PV + R
Sbjct: 34 RFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRD 93
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
S F QNV D+P S+DWR+ GAVT KDQG CG CWAF+ A+EGI QI G+L+
Sbjct: 94 SDIEEGF--QNV-DLPASVDWRKHGAVTAPKDQGSCGGCWAFATTGAIEGINQIVTGQLM 150
Query: 177 ELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
LSEQ+L+DC + GC GGLM+ A+++I+EN GL TE DYPY E C+ +K +
Sbjct: 151 SLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDYPYHASESHCNMKKLNSRV 210
Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
I YE +P GDEQALL+AV+ QPVSV ++ + + F Y SGV CG +HGV +V
Sbjct: 211 VAIDGYEAIPDGDEQALLRAVAKQPVSVAIEGASKDFQHYASGVFTGHCGEEINHGVLIV 270
Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
G+GT E+G YW++KNSW TWG+ G++++ R+ GLC I T ASYPV
Sbjct: 271 GYGT---EDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCSINTLASYPV 320
>gi|386648114|gb|AFJ15104.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 323
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 196/311 (63%), Gaps = 14/311 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V E W ++ + YK+ EK R IFK NL YI++ NK+ N +Y LG NEF+DLT++
Sbjct: 18 LVRLFESWTLENDKIYKNIDEKIYRFEIFKDNLMYIDETNKK-NSSYWLGLNEFADLTHD 76
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+A Y G ++ QS F Y++V D P SIDWR+KGAVT +K+Q CGSCWA
Sbjct: 77 EFKAKYVGSLGEDSTIIEQSD-DEEFPYKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWA 135
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADY 217
FS VA VEGI +I GKLI LSEQ+L+DC +HGC GG + +Y+ +N G+ TE +Y
Sbjct: 136 FSTVATVEGINKIVTGKLISLSEQELLDCDRRSHGCKGGYQTTSLQYVADN-GVHTEKEY 194
Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
PY ++G C + +K I+ Y+ +P +E +L+QA++NQPVSV V++ GRAF FYK
Sbjct: 195 PYEKKQGKCRAKDKKGSKVKITGYKRVPANNEVSLIQAIANQPVSVVVESKGRAFQFYKG 254
Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GL 333
G+ CG DH V VG+ G Y LIKNSWG WGE GYIRI R + G
Sbjct: 255 GIFEGPCGTKVDHAVTAVGY-------GKNYILIKNSWGPKWGEKGYIRIKRASGKSKGT 307
Query: 334 CGIATAASYPV 344
CG+ +++ +P
Sbjct: 308 CGVYSSSYFPT 318
>gi|356543010|ref|XP_003539956.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 306
Score = 289 bits (739), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 154/313 (49%), Positives = 195/313 (62%), Gaps = 21/313 (6%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+ E+W+ Q+ R YKD+ E +R I++ NLEYIE N + +Y L N+F+DLTNEEF
Sbjct: 4 RFERWLKQNDRXYKDKEEWEVRFGIYQANLEYIECKNSQ-EXSYNLTDNKFADLTNEEFV 62
Query: 101 ALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
+ Y G+ R +P F Y D+P S DWR++GAV+ IKDQG CGSCWAFS
Sbjct: 63 SPYLGFGTRFLPHTG--------FMYHEHEDLPESKDWRKEGAVSDIKDQGNCGSCWAFS 114
Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADY 217
AVAAVEGI +I GKL+ LSEQ+ DC + N GC GGLMD AF +I +N GL T DY
Sbjct: 115 AVAAVEGINKIKSGKLVSLSEQEFRDCDVEDGNQGCEGGLMDTAFAFIKKNGGLTTSKDY 174
Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL--LQAVSNQPVSVCVDASGRAFHFY 275
PY +GTC+ +K AA IS + +P DE L A +NQ SV +DA G AF Y
Sbjct: 175 PYEGVDGTCNKEKALHHAANISGHVKVPANDEAMLKAKAAAANQXESVAIDAGGHAFQLY 234
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----A 331
GV + CG +HGV +VG+G + KYW++KNSWG WGESGYIR+ RD A
Sbjct: 235 LKGVFSGICGKQLNHGVTIVGYGKGTSD---KYWIVKNSWGADWGESGYIRMKRDAFDKA 291
Query: 332 GLCGIATAASYPV 344
G CGIA ASYP+
Sbjct: 292 GTCGIAMQASYPL 304
>gi|326430490|gb|EGD76060.1| cysteine proteinase [Salpingoeca sp. ATCC 50818]
Length = 448
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 154/346 (44%), Positives = 209/346 (60%), Gaps = 33/346 (9%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKH-----EQWMAQHGRTYKDELEKAMRLNIFKQN 69
+++ LV+ CA + G +M EP + + + + + + Y+ E+A R ++F QN
Sbjct: 1 MMLKLVLVCA---LVGAAMAEPLSLTVNKGRLFDAFKTKFNKVYESAEEEARRFSVFSQN 57
Query: 70 LEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
+++I + N E R T+ + N+F+DLTNEE+R LY RP P+ R +
Sbjct: 58 IDFINRHNAEAARGVHTHTVDVNQFADLTNEEYRQLYL---RPYPTELLGRERQEVW--- 111
Query: 127 NVTDVPT--SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
D P S+DWR+KGAVT IK+QGQCGSCW+FS +VEG I G L+ LSEQQLV
Sbjct: 112 --LDGPNAGSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVEGAHAIATGNLVSLSEQQLV 169
Query: 185 DCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
DCS N GC+GGLMD AF+YII N GL TE DYPY +G CD KE A +IS Y+
Sbjct: 170 DCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARDGVCDKSKESKHAVSISGYK 229
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
D+P+ +E L AV PVSV ++A ++F Y SGV + CG N DHGV VVG+ +
Sbjct: 230 DVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGPCGTNLDHGVLVVGYTS--- 286
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILR---DAGLCGIATAASYPVA 345
YW++KNSWG +WG+ GYI + R AG+CGIA SYP+A
Sbjct: 287 ----DYWIVKNSWGASWGDQGYIMMKRGVSSAGICGIAMQPSYPIA 328
>gi|357129125|ref|XP_003566217.1| PREDICTED: thiol protease SEN102-like [Brachypodium distachyon]
Length = 380
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 156/325 (48%), Positives = 200/325 (61%), Gaps = 20/325 (6%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ +E+W A+H +D EK+ R N+F++N + + N + YKL N F+DL
Sbjct: 42 EESLWALYERWRARH-TVSRDLAEKSRRFNVFRENARLVHEFNLRRDAPYKLRLNRFADL 100
Query: 95 TNEEFRALYTGYN---------RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
T++EFR Y R + + S+F + +PTS+DWREKGAVT
Sbjct: 101 TSDEFRRSYASSRVSHHRMFKPRAANNNDDDDDKGSSFTHGGA--LPTSVDWREKGAVTG 158
Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEY 204
+KDQGQCGSCWAFS +AAVEGI I L LSEQQLVDC T N GC GGLMD AF Y
Sbjct: 159 VKDQGQCGSCWAFSTIAAVEGINAIRTNNLTSLSEQQLVDCDTKTNAGCDGGLMDDAFSY 218
Query: 205 IIENKGLATEADYPYR-HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
I ++ G+A E YPYR + +C+++K A +I YED+P+ DE AL +AV+ QPV+V
Sbjct: 219 IAKHGGVAAEKSYPYRARQSSSCNSKKAAAAVVSIDGYEDVPRNDETALKKAVAAQPVAV 278
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
++A G F FY GV CG DHGVA VG+G +G KYW++KNSWGE WGE G
Sbjct: 279 AIEAGGSHFQFYSEGVFAGKCGTELDHGVAAVGYGVT--VDGTKYWIVKNSWGEEWGEKG 336
Query: 324 YIRILRDA----GLCGIATAASYPV 344
YIR+ RD GLCGIA ASYPV
Sbjct: 337 YIRMKRDVADKEGLCGIAMEASYPV 361
>gi|146215988|gb|ABQ10196.1| actinidin Act3a [Actinidia eriantha]
Length = 380
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 146/350 (41%), Positives = 210/350 (60%), Gaps = 23/350 (6%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
M L F +F+I F I +++ R+ E ++ +E W+ ++G++Y E+
Sbjct: 10 MSLLFFSTFLIFSFAI-------DAKISPLRTNDE--VMALYESWLVKYGKSYNSLGERE 60
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP 120
MR+ IFK+NL +I++ N + NR+Y +G N+F+DLT+EE+R+ Y G+ + S P
Sbjct: 61 MRIEIFKENLRFIDEHNADPNRSYTVGLNQFADLTDEEYRSTYLGFKSSLKSKVSNRYMP 120
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ +P +DWR GAV +K+QG C SCWAF+ +A VE I QI G LI LSE
Sbjct: 121 QVGEV-----LPDYVDWRTTGAVVDVKNQGLCSSCWAFATIATVESINQIITGDLISLSE 175
Query: 181 QQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
Q+LVDC+ N GC GG MD A+E+II N G+ TE +YPY ++ CD K+ TI
Sbjct: 176 QELVDCNRTPINEGCKGGFMDDAYEFIINNGGINTEENYPYIGQDDQCDEPKKNQNYVTI 235
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGF 297
YE +P DE A+ +AV+ QPVSV +DA F FY+SG+ CG +H V ++G+
Sbjct: 236 DSYEQVPPNDELAMKRAVAYQPVSVAIDAYCLGFRFYQSGIFTGGSCGTTLNHAVTIIGY 295
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
GT ENG YW++KNS+G WGESGY ++ R+ G CGIA+ YPV
Sbjct: 296 GT---ENGIDYWIVKNSYGTQWGESGYGKVQRNVGGEGRCGIASYPFYPV 342
>gi|2463588|dbj|BAA22546.1| FB1035 precursor [Ananas comosus]
Length = 324
Score = 288 bits (738), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 135/315 (42%), Positives = 211/315 (66%), Gaps = 14/315 (4%)
Query: 35 EPS--IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
EP+ ++++ E+WMA++GR YKD EK R IFK N+++IE N +Y LG N+F+
Sbjct: 1 EPNDPMMKRFEEWMAEYGRIYKDNDEKMRRFQIFKNNVKHIETFNSRNGNSYTLGINQFT 60
Query: 93 DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
D+T EF A YTG + P+ ++ R+ +F N++ VP SIDWR+ GAV +K+Q C
Sbjct: 61 DMTKSEFVAQYTGVSLPL-NIEREPV--VSFDDVNISAVPQSIDWRDYGAVNEVKNQNPC 117
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
GSCWAF+A+A VEGI +I G L+ LSEQ+++DC+ ++GC GG ++KA+++II N G+
Sbjct: 118 GSCWAFAAIATVEGIYKIKTGYLVSLSEQEVLDCAV-SYGCKGGWVNKAYDFIISNNGVT 176
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE +YPY+ +GTC N +A I+ Y + + DE++++ AVSNQP++ +DAS F
Sbjct: 177 TEENYPYQAYQGTC-NANSFPNSAYITGYSYVRRNDERSMMYAVSNQPIAALIDAS-ENF 234
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR--- 329
+Y GV + CG + +H + ++G+G ++ +G KYW+++NSWG +WGE GY+R+ R
Sbjct: 235 QYYNGGVFSGPCGTSLNHAITIIGYG--QDSSGTKYWIVRNSWGSSWGEGGYVRMARGVS 292
Query: 330 -DAGLCGIATAASYP 343
+G CGIA + +P
Sbjct: 293 SSSGACGIAMSPLFP 307
>gi|356515062|ref|XP_003526220.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 337
Score = 288 bits (737), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 153/343 (44%), Positives = 210/343 (61%), Gaps = 18/343 (5%)
Query: 10 IIPMFVIIILVITCA-SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQ 68
I+ V I V C+ S+ S+ HE+WMAQHG+ YKD EK L IF+
Sbjct: 6 ILKFLVAFIEVDACSLSESCCSHSL-------SHEKWMAQHGKVYKDAAEKERCLQIFEN 58
Query: 69 NLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
N+E+IE + G++++ L TN+F+DL +EEF+AL T ++ S+ ++ + F+Y NV
Sbjct: 59 NMEFIESFDVCGDKSFNLSTNQFADLHDEEFKALLTNGHKKEHSL--WTTTETLFRYDNV 116
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFS-AVAAVEGITQITRGKLIELSEQQLVD-C 186
T +P S+DWR++G VT IKDQG+C SCWAFS VA +EG+ QI +L+ LSEQ+LVD
Sbjct: 117 TKIPASMDWRKRGVVTPIKDQGKCLSCWAFSLCVATIEGLHQIITSELVPLSEQELVDFV 176
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
++ GC G ++ AF++I + + +E YPY+ TC +KE A I Y+ +P
Sbjct: 177 KGESEGCYGDYVEDAFKFITKKGRIESETHYPYKGVNNTCKVKKETHGVAQIKGYKKVPS 236
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
E ALL+AV+NQ VSV V+A AF FY SG+ CG + DH VA+ +G E +G
Sbjct: 237 KSENALLKAVANQLVSVSVEARDSAFQFYSSGIFTGKCGTDTDHRVALASYG--ESGDGT 294
Query: 307 KYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
KYWL KNSWG WGE GYIRI D GLCGIA YP+A
Sbjct: 295 KYWLAKNSWGTEWGEKGYIRIKXDIPAKEGLCGIAKYPYYPIA 337
>gi|449447027|ref|XP_004141271.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 458
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 158/351 (45%), Positives = 220/351 (62%), Gaps = 23/351 (6%)
Query: 5 FEKSFIIPMFVIIILVITCAS--QVVSGRSMHEPSIVEKHEQWMAQHGRTYKD-ELEKAM 61
F+ S I+ + + + ++ AS ++ R+ E ++ ++QW A+HG+ + + E
Sbjct: 4 FQSSPIMALLFFLFIALSAASPSSIIPQRTDDE--VMALYDQWRAKHGKLHNNLGAEPEN 61
Query: 62 RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
R +IFK NL++I++ N + N Y+LG N F+DLTNEE+R+ Y G S SR++ +
Sbjct: 62 RFHIFKDNLKFIDEINAQ-NLPYRLGLNVFADLTNEEYRSRYLG--GKFASGSRRNRTSN 118
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
+ + D+P SIDWR KGAV +KDQG CGSCWAFS VA+VE I QI G LI LSEQ
Sbjct: 119 RYLPRLGDDLPDSIDWRAKGAVAPVKDQGSCGSCWAFSTVASVEAINQIVTGDLIALSEQ 178
Query: 182 QLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
+LVDC N GC+GGLMD AFE+IIEN GL TE DYPY + +C K+ A I
Sbjct: 179 ELVDCDRSYNEGCNGGLMDYAFEFIIENGGLDTEEDYPYYGFDSSCIQYKKNA----IDG 234
Query: 241 YEDLPKGDEQALLQA---VSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
YED+P +E+AL +A VSV ++ GR+F Y+SG+ CG + DHGV VVG+
Sbjct: 235 YEDVPVNNEKALQKAVSKQVVSVVSVAIEGGGRSFQLYQSGIFTGRCGTDLDHGVNVVGY 294
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
G+ E G YW+++NSWG +WGESGY+++ R+ GLCGIA SYP
Sbjct: 295 GS---EGGVDYWIVRNSWGGSWGESGYVKMQRNIASPTGLCGIAMEPSYPT 342
>gi|194320502|gb|ACF48469.1| cathepsin L [Triatoma brasiliensis]
Length = 330
Score = 288 bits (736), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 154/345 (44%), Positives = 215/345 (62%), Gaps = 30/345 (8%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWM---AQHGRTYKDELEKAMRLNIFKQ 68
P+ V + ++ S PS E+W A HG+TYK++ E+ R+ IF
Sbjct: 3 PLLVAVAIIAL---------SYAHPSFDIYPEEWHVFKAMHGKTYKNQFEEMFRMKIFMD 53
Query: 69 NLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
N + IE N ++G +YK+ N F DL EF+AL G+ +S + R +
Sbjct: 54 NKKKIEAHNAKYEQGEVSYKMMMNHFGDLMVHEFKALMNGF-----KMSPDTKRNGELYF 108
Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
+ +++P ++DWR+KGAVT +KDQGQCGSCW+FSA ++EG + GKL+ LSEQ LVD
Sbjct: 109 PSNSNLPKTVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQVFLKTGKLVSLSEQNLVD 168
Query: 186 CSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
CST N+GC GGLMD+AF+Y+ +NKG+ TEA YPY E TC +K K V T + D
Sbjct: 169 CSTSYGNNGCEGGLMDQAFQYVSDNKGIDTEASYPYEARENTCRFKKNK-VGGTDKGHVD 227
Query: 244 LPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTA 300
+P GDE+AL A++ P+SV +DA+ +F FY GV N +C + + DHGV VG+GT
Sbjct: 228 IPAGDEKALQNALATVGPISVAIDANHGSFQFYSKGVYNEPNCSSYDLDHGVLAVGYGT- 286
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
ENG YWL+KNSWG +WGE+GYI+I R+ + CGIA+ ASYP+
Sbjct: 287 --ENGQDYWLVKNSWGPSWGENGYIKIARNHSNHCGIASMASYPL 329
>gi|302812789|ref|XP_002988081.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
gi|300144187|gb|EFJ10873.1| hypothetical protein SELMODRAFT_183539 [Selaginella moellendorffii]
Length = 425
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 141/293 (48%), Positives = 189/293 (64%), Gaps = 16/293 (5%)
Query: 62 RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGY-----NRPVPSVSRQ 116
R FK+N YIE+ N+ G +Y+LG N+FSDLT+EEFR + G + PV + R
Sbjct: 34 RFETFKENFRYIEEHNRAGKHSYRLGLNQFSDLTSEEFRQRFLGLRPDLIDSPVLKMPRD 93
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
S F QNV D+P S+DWR+ GAVT KDQG CG CWAF+ A+EGI QI G+L+
Sbjct: 94 SDIEEGF--QNV-DLPASVDWRQHGAVTAPKDQGSCGGCWAFATTGAIEGINQIVTGQLV 150
Query: 177 ELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
LSEQ+L+DC + GC GGLM+ A+++I+EN GL TE DYPY E C+ +K +
Sbjct: 151 SLSEQELIDCDKKADKGCDGGLMENAYQFIVENGGLDTETDYPYHASESHCNMKKLNSRV 210
Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
I Y+ +P+GDEQALL AV+ QPVSV ++ + + F Y SGV CG +HGV +V
Sbjct: 211 VAIDGYKAIPEGDEQALLLAVAKQPVSVAIEGASKDFQHYASGVFTGHCGEEINHGVLIV 270
Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
G+GT E+G YW++KNSW TWG+ G++++ R+ GLC I T ASYPV
Sbjct: 271 GYGT---EDGLDYWIVKNSWAATWGDGGFVKMQRNTGKRGGLCSINTLASYPV 320
>gi|194352756|emb|CAQ00106.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 476
Score = 287 bits (734), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 198/316 (62%), Gaps = 16/316 (5%)
Query: 42 HEQWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ W+A+HG ++ R + F NL +++ N G ++L N F+DL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
TN+EFRA Y G +++ ++P ++DWREKGAV +K+QGQCGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGDRYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH--GCSGGLMDKAFEYIIENKGLA 212
CWAFSAV+ VE I QI G+++ LSEQ+LV+C + GC+GGLMD AFE+II+N G+
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYPY+ +G CD ++ A +I +ED+P+ DE++L +AV++ PVSV ++A GR F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
Y SGV + CG DHGV VG+GT ENG YW+++NSWG WGE+GY+R+ R+
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRMERNIN 348
Query: 331 --AGLCGIATAASYPV 344
+G CGIA +SYP
Sbjct: 349 VTSGKCGIAMMSSYPT 364
>gi|326507362|dbj|BAK03074.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 473
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 198/316 (62%), Gaps = 16/316 (5%)
Query: 42 HEQWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ W+A+HG ++ R + F NL +++ N G ++L N F+DL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
TN+EFRA Y G +++ ++P ++DWREKGAV +K+QGQCGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH--GCSGGLMDKAFEYIIENKGLA 212
CWAFSAV+ VE I QI G+++ LSEQ+LV+C + GC+GGLMD AFE+II+N G+
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYPY+ +G CD ++ A +I +ED+P+ DE++L +AV++ PVSV ++A GR F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
Y SGV + CG DHGV VG+GT ENG YW+++NSWG WGE+GY+R+ R+
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRMERNIN 348
Query: 331 --AGLCGIATAASYPV 344
+G CGIA +SYP
Sbjct: 349 VTSGKCGIAMMSSYPT 364
>gi|260516654|gb|ACX43954.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516656|gb|ACX43955.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516658|gb|ACX43956.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516660|gb|ACX43957.1| cysteine protease 1 [Brachiaria hybrid cultivar]
gi|260516662|gb|ACX43958.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516664|gb|ACX43959.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516666|gb|ACX43960.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516668|gb|ACX43961.1| cysteine protease 2 [Brachiaria hybrid cultivar]
gi|260516670|gb|ACX43962.1| cysteine protease 2 [Brachiaria hybrid cultivar]
Length = 338
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 154/305 (50%), Positives = 192/305 (62%), Gaps = 17/305 (5%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT 104
+M Q+ + Y E + R N FK N+E I N N +Y +G NEF+DL+ EEF+ Y
Sbjct: 45 FMKQYSKAY-SHAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
GY V R+ +R + +Q V PTSIDWR AVT IKDQGQCGSCWAFSA ++
Sbjct: 104 GYKH----VEREFARSNNL-HQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSI 158
Query: 165 EGITQITRGK--LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
EG + +GK L LSEQQLVDCST N GC+GGLMD AFEYII NKG+ E+ YPY+
Sbjct: 159 EG-AWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYK 217
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV 279
G C K V TIS Y+D+ GDE +LL AV PVSV ++A F FY SGV
Sbjct: 218 GVGGLCQKSCTKVV--TISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGV 275
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATA 339
+ CG+N DHGV VG+GT ++ YW++KNSWG +WGESGYIR++R+ CGIA
Sbjct: 276 FSGTCGHNLDHGVLAVGYGTTGSQD---YWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQ 332
Query: 340 ASYPV 344
SYP
Sbjct: 333 PSYPT 337
>gi|204307508|gb|ACI00280.1| triticain beta 2 [Hordeum vulgare]
Length = 473
Score = 287 bits (734), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 138/316 (43%), Positives = 198/316 (62%), Gaps = 16/316 (5%)
Query: 42 HEQWMAQHGRTYKDEL----EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ W+A+HG ++ R + F NL +++ N G ++L N F+DL
Sbjct: 52 YDLWLAEHGGGSSPNANSIADRERRFSAFWDNLRFVDAHNARAAAGEEGFRLAMNRFADL 111
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
TN+EFRA Y G +++ ++P ++DWREKGAV +K+QGQCGS
Sbjct: 112 TNDEFRAAYLGVKGAAERNRAGRVVGERYRHDGAEELPEAVDWREKGAVAPVKNQGQCGS 171
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH--GCSGGLMDKAFEYIIENKGLA 212
CWAFSAV+ VE I QI G+++ LSEQ+LV+C + GC+GGLMD AFE+II+N G+
Sbjct: 172 CWAFSAVSTVESINQIVTGEMVTLSEQELVECDINGQSSGCNGGLMDDAFEFIIKNGGID 231
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYPY+ +G CD ++ A +I +ED+P+ DE++L +AV++ PVSV ++A GR F
Sbjct: 232 TEDDYPYKAVDGRCDVLRKNAKVVSIDGFEDVPENDEKSLQKAVAHHPVSVAIEAGGREF 291
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
Y SGV + CG DHGV VG+GT ENG YW+++NSWG WGE+GY+R+ R+
Sbjct: 292 QLYHSGVFSGRCGTQLDHGVVAVGYGT---ENGKDYWIVRNSWGPNWGEAGYLRMERNIN 348
Query: 331 --AGLCGIATAASYPV 344
+G CGIA +SYP
Sbjct: 349 VTSGKCGIAMMSSYPT 364
>gi|156142226|gb|ABU51882.1| ervatamin-C precursor [Tabernaemontana divaricata]
Length = 365
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 141/310 (45%), Positives = 195/310 (62%), Gaps = 14/310 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+ E +E W+A+H + Y +E R IFK NL++I++ N E N TYK+G ++DLTNE
Sbjct: 41 VKEIYELWLAKHDKVYSGLVEYEKRFEIFKDNLKFIDEHNSE-NHTYKMGLTPYTDLTNE 99
Query: 98 EFRALYTG-YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
EF+A+Y G + + + R + + Y+ ++P IDWR+KGAVT +K+QG+CGSCW
Sbjct: 100 EFQAIYLGTRSDTIHRLKRTINISERYAYEAGDNLPEQIDWRKKGAVTPVKNQGKCGSCW 159
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEAD 216
AFS V+ VE I QI G LI LSEQQLVDC+ NHGC GG A++YII+N G+ TEA+
Sbjct: 160 AFSTVSTVESINQIRTGNLISLSEQQLVDCNKKNHGCKGGAFVYAYQYIIDNGGIDTEAN 219
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ +G C K+ I Y+ +P +E AL +AV++QP V +DAS + F YK
Sbjct: 220 YPYKAVQGPCRAAKK---VVRIDGYKGVPHCNENALKKAVASQPSVVAIDASSKQFQHYK 276
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR--DAGLC 334
SG+ + CG +HGV +VG+ YW+++NSWG WGE GYIR+ R GLC
Sbjct: 277 SGIFSGPCGTKLNHGVVIVGYWK-------DYWIVRNSWGRYWGEQGYIRMKRVGGCGLC 329
Query: 335 GIATAASYPV 344
GIA YP
Sbjct: 330 GIARLPYYPT 339
>gi|260516678|gb|ACX43965.1| cysteine protease 1 [Brachiaria hybrid cultivar]
Length = 338
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 153/305 (50%), Positives = 192/305 (62%), Gaps = 17/305 (5%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT 104
+M Q+ + Y E + R N FK N+E I N N +Y +G NEF+DL+ EEF+ Y
Sbjct: 45 FMKQYSKAYS-HAEFSSRFNQFKANVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
GY V R+ +R + +Q V PTSIDWR AVT IKDQGQCGSCWAFSA ++
Sbjct: 104 GYKH----VEREFARSNNL-HQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSI 158
Query: 165 EGITQITRGK--LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
EG + +GK L LSEQQLVDCST + GC+GGLMD AFEYII NKG+ E+ YPY+
Sbjct: 159 EG-AWVLQGKHTLTSLSEQQLVDCSTSYGDAGCNGGLMDYAFEYIIANKGICAESAYPYK 217
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV 279
G C K V TIS Y+D+ GDE +LL AV PVSV ++A F FY SGV
Sbjct: 218 GVGGLCQKSCTKVV--TISGYKDVASGDEASLLNAVGTVGPVSVAIEADQAGFQFYSSGV 275
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATA 339
+ CG+N DHGV VG+GT ++ YW++KNSWG +WGESGYIR++R+ CGIA
Sbjct: 276 FSGTCGHNLDHGVLAVGYGTTGSQD---YWIVKNSWGTSWGESGYIRMIRNKNQCGIAIQ 332
Query: 340 ASYPV 344
SYP
Sbjct: 333 PSYPT 337
>gi|449469929|ref|XP_004152671.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
gi|449529596|ref|XP_004171784.1| PREDICTED: cysteine proteinase RD21a-like [Cucumis sativus]
Length = 431
Score = 285 bits (729), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 139/315 (44%), Positives = 193/315 (61%), Gaps = 20/315 (6%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
++ E E W +HG++Y EK RL +F N E++ N N +Y L N ++DLT+
Sbjct: 24 NVSELFEIWCTEHGKSYSSAEEKLYRLGVFADNYEFVTHHNNLDNSSYTLSLNSYADLTH 83
Query: 97 EEFRALYTGYNRPV----PSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
EF+ G++ + P + ++ S P DVP S+DWR+KGAVT +KDQG C
Sbjct: 84 HEFKVSRLGFSPALRNFRPVLPQEPSLPR--------DVPDSLDWRKKGAVTAVKDQGSC 135
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGL 211
G+CW+FSA A+EGI QI G LI LSEQ+L+DC N GC GGLMD A++++I N G+
Sbjct: 136 GACWSFSATGAMEGINQIMTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYQFVISNHGI 195
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE DYPY+ +G+C K + TI Y D+P DE LLQAV+ QPVSV + S RA
Sbjct: 196 DTENDYPYQARDGSCRKDKLQRNVVTIDGYADIPSNDEGKLLQAVAAQPVSVGICGSERA 255
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
F Y G+ + C + DH V +VG+G+ ENG YW++KNSWG++WG GY+ + R++
Sbjct: 256 FQLYSKGIFSGPCSTSLDHAVLIVGYGS---ENGVDYWIVKNSWGKSWGMDGYMHMQRNS 312
Query: 332 ----GLCGIATAASY 342
G+CGI ASY
Sbjct: 313 GNSEGVCGINKLASY 327
>gi|8886940|gb|AAF80626.1|AC069251_19 F2D10.37 [Arabidopsis thaliana]
Length = 315
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 134/266 (50%), Positives = 191/266 (71%), Gaps = 5/266 (1%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E W++ + Y+ EK +R +FK NL++I++ NK+G ++Y LG NEF+DL++E
Sbjct: 47 LIELFENWISNFEKAYETVEEKFLRFEVFKDNLKHIDETNKKG-KSYWLGLNEFADLSHE 105
Query: 98 EFRALYTGYNRPVPSVSRQSSRP-STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
EF+ +Y G + V R R + F Y++V VP S+DWR+KGAV +K+QG CGSCW
Sbjct: 106 EFKKMYLGLKTDI--VRRDEERSYAEFAYRDVEAVPKSVDWRKKGAVAEVKNQGSCGSCW 163
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEA 215
AFS VAAVEGI +I G L LSEQ+L+DC T N+GC+GGLMD AFEYI++N GL E
Sbjct: 164 AFSTVAAVEGINKIVTGNLTTLSEQELIDCDTTYNNGCNGGLMDYAFEYIVKNGGLRKEE 223
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
DYPY EEGTC+ QK+++ TI+ ++D+P DE++LL+A+++QP+SV +DASGR F FY
Sbjct: 224 DYPYSMEEGTCEMQKDESETVTINGHQDVPTNDEKSLLKALAHQPLSVAIDASGREFQFY 283
Query: 276 KSGVLNADCGNNCDHGVAVVGFGTAE 301
GV + CG + DHGVA VG+G+++
Sbjct: 284 SGGVFDGRCGVDLDHGVAAVGYGSSK 309
>gi|343978787|gb|AEM76722.1| cathepsin L-like proteinase [Triatoma brasiliensis]
Length = 330
Score = 285 bits (728), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 152/345 (44%), Positives = 218/345 (63%), Gaps = 29/345 (8%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ + + ++ ++CA++ + E+ E + HG+ YK++ E+ R IF N
Sbjct: 3 VLLVAVAVIAVSCANRFYNINP-------EEWETFKVVHGKNYKNQFEEMFRRKIFMNNK 55
Query: 71 EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS--RPSTFKY 125
+ IE N ++G +YK+ N F DL + E +AL G+ + P+ R+ PS K
Sbjct: 56 KRIEAHNAKYEQGEVSYKMKMNHFGDLMSHEIKALMNGF-KMTPNTKREGKIYFPSNDK- 113
Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
+P S+DWR+KGAVT +KDQGQCGSCW+FSA ++EG + +GKL+ LSEQ L+D
Sbjct: 114 -----LPKSVDWRQKGAVTPVKDQGQCGSCWSFSATGSLEGQIFLKKGKLVSLSEQNLMD 168
Query: 186 CSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
CS + N+GC GGLMDKAF+Y+ +NKG+ TE+ YPY + C +K+K V T Y D
Sbjct: 169 CSKEYGNNGCEGGLMDKAFQYVSDNKGIDTESSYPYEARDYACRFKKDK-VGGTDKGYVD 227
Query: 244 LPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNAD-CGN-NCDHGVAVVGFGTA 300
+P+GDE+AL A++ P+SV +DAS +FHFY GV N C + + DHGV VG+GT
Sbjct: 228 IPEGDEKALQNALATVGPISVAIDASHESFHFYSEGVYNEPYCSSYDLDHGVLAVGYGT- 286
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
ENG YWL+KNSWG +WGESGYI+I R+ + CGIA+ ASYP+
Sbjct: 287 --ENGQDYWLVKNSWGPSWGESGYIKIARNHSNHCGIASMASYPI 329
>gi|115461226|ref|NP_001054213.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|62510688|sp|Q7XR52.2|CYSP1_ORYSJ RecName: Full=Cysteine protease 1; AltName: Full=OsCP1; Flags:
Precursor
gi|38345300|emb|CAE02828.2| OSJNBa0043A12.33 [Oryza sativa Japonica Group]
gi|113565784|dbj|BAF16127.1| Os04g0670500 [Oryza sativa Japonica Group]
gi|215741575|dbj|BAG98070.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 490
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 142/296 (47%), Positives = 188/296 (63%), Gaps = 14/296 (4%)
Query: 58 EKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
E R +F NL++++ N + ++LG N F+DLTN EFRA Y G R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG----TTPAGR 139
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKDQGQCGSCWAFSAVAAVEGITQITRGK 174
+++ V +P S+DWR+KGAV +K+QGQCGSCWAFSAVAAVEGI +I G+
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 175 LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
L+ LSEQ+LV+C+ + N GC+GG+MD AF +I N GL TE DYPY +G C+ K
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
+I +ED+P+ DE +L +AV++QPVSV +DA GR F Y SGV CG N DHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
VG+GT + GA YW ++NSWG WGE+GYIR+ R+ G CGIA ASYP+
Sbjct: 320 VAVGYGT-DAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|2351107|dbj|BAA21929.1| bromelain [Ananas comosus]
Length = 312
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 138/302 (45%), Positives = 195/302 (64%), Gaps = 11/302 (3%)
Query: 46 MAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG 105
MA++GR YKD EK R IFK N+ +IE N +Y LG N+F+D+TN EF A YTG
Sbjct: 1 MAEYGRVYKDNDEKMRRFQIFKNNVNHIETFNNRNGNSYTLGINKFTDMTNNEFVAQYTG 60
Query: 106 -YNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
+RP+ + + +F N++ V SIDWR+ GAVT +KDQ CGSCWAFSA+A V
Sbjct: 61 GISRPL---NIEKEPVVSFDDVNISAVGQSIDWRDYGAVTEVKDQNPCGSCWAFSAIATV 117
Query: 165 EGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEG 224
EGI +I G L+ LSEQ+++DC+ N GC GG +D A+++II N G+A+EADYPY+ +G
Sbjct: 118 EGIYKIVTGYLVSLSEQEVLDCAVSN-GCDGGFVDNAYDFIISNNGVASEADYPYQAYQG 176
Query: 225 TCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADC 284
C +A I+ Y + DE ++ AV NQP++ +DASG F +Y GV + C
Sbjct: 177 DCA-ANSWPNSAYITGYSYVRSNDESSMKYAVWNQPIAAAIDASGDNFQYYNGGVFSGPC 235
Query: 285 GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR---DAGLCGIATAAS 341
G + +H + ++G+G ++ +G +YW++KNSWG +WGE GYIR+ R +GLCGIA
Sbjct: 236 GTSLNHAITIIGYG--QDSSGTQYWIVKNSWGSSWGERGYIRMARGVSSSGLCGIAMDPL 293
Query: 342 YP 343
YP
Sbjct: 294 YP 295
>gi|297843784|ref|XP_002889773.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
gi|297335615|gb|EFH66032.1| hypothetical protein ARALYDRAFT_471096 [Arabidopsis lyrata subsp.
lyrata]
Length = 439
Score = 284 bits (727), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 142/312 (45%), Positives = 194/312 (62%), Gaps = 13/312 (4%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
E + W +HG+TY E E+ R+ IFK N +++ + N N TY L N F+DLT+ EF
Sbjct: 30 ELFDDWCQRHGKTYGSEEERQQRIQIFKDNHDFVTQHNLITNATYSLSLNAFADLTHHEF 89
Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
+A G + S+ S S VP S+DWR+KGAVT++KDQG CG+CW+FS
Sbjct: 90 KASRLGLSVSASSLIMASKGQS---LGGNAKVPDSVDWRKKGAVTNVKDQGSCGACWSFS 146
Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYP 218
A A+EGI QI G LI LSEQ+L+DC N GC+GGLMD AFE++I+N G+ TE DYP
Sbjct: 147 ATGAMEGINQIVTGDLISLSEQELIDCDKSYNAGCNGGLMDYAFEFVIKNHGIDTEKDYP 206
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK-- 276
Y+ +GTC K K TI Y + DE+AL +AV+ QPVSV + S RAF Y
Sbjct: 207 YQERDGTCKKDKLKQKVVTIDSYAGVKSNDEKALREAVAAQPVSVGICGSERAFQLYSRV 266
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----G 332
SG+ + C + DH V +VG+G+ +NG YW++KNSWG++WG G++ + R+ G
Sbjct: 267 SGIFSGPCSTSLDHAVLIVGYGS---QNGVDYWIVKNSWGKSWGMDGFMHMQRNTGNSEG 323
Query: 333 LCGIATAASYPV 344
+CGI ASYP+
Sbjct: 324 ICGINMLASYPI 335
>gi|386648112|gb|AFJ15103.1| mexicain-like cystein protease, partial [Jacaratia mexicana]
Length = 348
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 146/350 (41%), Positives = 209/350 (59%), Gaps = 19/350 (5%)
Query: 2 VLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDEL 57
+ K + +I+ + ++ A + G S + + E+ E WM +H R Y +
Sbjct: 4 ICSISKLIFVATCLIVHVGLSSADFSIVGYSQDDLTSTERLIRLFESWMLKHDRVYNNIE 63
Query: 58 EKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS 117
EK R IFK NL YI++ NK+ N +Y LG NEF DLT++EF+ Y G + V+ +
Sbjct: 64 EKIHRFEIFKDNLMYIDETNKK-NNSYWLGLNEFVDLTHDEFKEKYVG-SIGEDFVTIEQ 121
Query: 118 SRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIE 177
S F Y++V D P SIDWR+KGAVT +K CGSCWAFS VA VEGI +I GKLI
Sbjct: 122 SNDEEFPYKHVVDYPESIDWRDKGAVTPVKPN-PCGSCWAFSTVATVEGINKIVTGKLIS 180
Query: 178 LSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAAT 237
LSEQ+L+DC +HGC GG + +Y+++N G+ TE +YPY ++G C +++K
Sbjct: 181 LSEQELLDCDRRSHGCKGGYQTTSLQYVVDN-GVHTEKEYPYEKKQGKCRAKEKKGTKVQ 239
Query: 238 ISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGF 297
I+ Y+ +P DE +L+QA++NQPVSV +++ GRAF YK G+ N CG DH V +G+
Sbjct: 240 ITGYKRVPANDEISLIQAIANQPVSVLLESKGRAFQLYKGGIFNGPCGTKLDHAVTAIGY 299
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
G Y LIKNSWG WGE GY++I R + G CG+ ++ +P
Sbjct: 300 GKT-------YILIKNSWGPNWGEKGYLKIKRASGKSEGTCGVYKSSYFP 342
>gi|90265242|emb|CAH67695.1| H0624F09.3 [Oryza sativa Indica Group]
Length = 494
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 142/296 (47%), Positives = 188/296 (63%), Gaps = 14/296 (4%)
Query: 58 EKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
E R +F NL++++ N + ++LG N F+DLTN EFRA Y G R
Sbjct: 84 EHERRFRVFWDNLKFVDAHNARADERGGFRLGMNRFADLTNGEFRATYLG----TTPAGR 139
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKDQGQCGSCWAFSAVAAVEGITQITRGK 174
+++ V +P S+DWR+KGAV +K+QGQCGSCWAFSAVAAVEGI +I G+
Sbjct: 140 GRRVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGSCWAFSAVAAVEGINKIVTGE 199
Query: 175 LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
L+ LSEQ+LV+C+ + N GC+GG+MD AF +I N GL TE DYPY +G C+ K
Sbjct: 200 LVSLSEQELVECARNGQNSGCNGGIMDDAFAFIARNGGLDTEEDYPYTAMDGKCNLAKRS 259
Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
+I +ED+P+ DE +L +AV++QPVSV +DA GR F Y SGV CG N DHGV
Sbjct: 260 RKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREFQLYDSGVFTGRCGTNLDHGV 319
Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
VG+GT + GA YW ++NSWG WGE+GYIR+ R+ G CGIA ASYP+
Sbjct: 320 VAVGYGT-DAATGAAYWTVRNSWGPDWGENGYIRMERNVTARTGKCGIAMMASYPI 374
>gi|194352762|emb|CAQ00109.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326517250|dbj|BAJ99991.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 367
Score = 284 bits (726), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 159/318 (50%), Positives = 199/318 (62%), Gaps = 13/318 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ +E+W QH +D EKA R N+F++N+ I + N+ G+ YKL N F D+
Sbjct: 40 EDSLWALYERWREQH-TVARDLGEKARRFNVFRENVRLIHEFNR-GDAPYKLRLNRFGDM 97
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKY---QNVTDVPTSIDWREKGAVTHIKDQGQ 151
T +EFR Y + F + +V DVP S+DWR+KGAVT +KDQGQ
Sbjct: 98 TADEFRRAYASSRVSHHRMFSLKEGGGGFMHGSAASVRDVPPSVDWRQKGAVTAVKDQGQ 157
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKG 210
CGSCWAFS +AAVEGI I L LSEQQLVDC T N GC+GGLMD AF+YI ++ G
Sbjct: 158 CGSCWAFSTIAAVEGINAIRSKNLTSLSEQQLVDCDTKSNAGCNGGLMDYAFQYIAKHGG 217
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
+A E YPY+ + + N+K AV TI YED+P DE AL +AV+ QPV+V ++ASG
Sbjct: 218 VAAEDAYPYKARQASSCNKKPSAV-VTIDGYEDVPANDETALKKAVAAQPVAVAIEASGS 276
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
F FY GV CG DHGVA VG+GT + G KYW++KNSWG WGE GYIR+ RD
Sbjct: 277 HFQFYSEGVFAGKCGTELDHGVAAVGYGTTVD--GTKYWIVKNSWGPEWGEKGYIRMKRD 334
Query: 331 A----GLCGIATAASYPV 344
GLCGIA ASYPV
Sbjct: 335 VKDKEGLCGIAMEASYPV 352
>gi|162459488|ref|NP_001105571.1| maize insect resistance1 precursor [Zea mays]
gi|5731354|gb|AAB70820.2| cysteine protease Mir1 [Zea mays]
Length = 398
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 147/345 (42%), Positives = 205/345 (59%), Gaps = 28/345 (8%)
Query: 22 TCASQVVSGRSMHEPSIVEKHEQWMAQHGR--------------TYKDELEKAMRLNIFK 67
T ++V + + + +E W ++HGR ++E ++ +RL +F+
Sbjct: 34 TTTTRVPAPAERADEEVRRMYEAWKSKHGRGGSSNDDCDMAPGDDEQEEEDRRLRLEVFR 93
Query: 68 QNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
NL YI+ N E G T++LG F+DLT EE+R G+ + + +
Sbjct: 94 DNLRYIDAHNAEADAGLHTFRLGLTPFADLTLEEYRGRVLGFRARGRRSGARYGSGYSVR 153
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
D+P +IDWR+ GAVT +KDQ QCG CWAFSAVAA+EG+ I G L+ LSEQ+++
Sbjct: 154 G---GDLPDAIDWRQLGAVTEVKDQQQCGGCWAFSAVAAIEGVNAIATGNLVSLSEQEII 210
Query: 185 DCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK-AVAATISKYED 243
DC + GC GG M+ AF ++I N G+ TEADYP+ +GTCD KEK ATI +
Sbjct: 211 DCDAQDSGCDGGQMENAFRFVIGNGGIDTEADYPFIGTDGTCDASKEKNEKVATIDGLVE 270
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+ +E AL +AV+ QPVSV +DASGRAF Y SG+ N CG + DHGV VG+G+ E
Sbjct: 271 VASNNETALQEAVAIQPVSVAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGS---E 327
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
+G YW++KNSW +WGE+GYIR+ R+ G CGIA ASYPV
Sbjct: 328 SGKDYWIVKNSWSASWGEAGYIRMRRNVPRPTGKCGIAMDASYPV 372
>gi|357124027|ref|XP_003563708.1| PREDICTED: germination-specific cysteine protease 1-like
[Brachypodium distachyon]
Length = 334
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 153/330 (46%), Positives = 205/330 (62%), Gaps = 27/330 (8%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE----GNRTYKLGTNE 90
+ ++ E++E+WMA+ GRTYKD EKA R +FK N +I+ N G KL TN+
Sbjct: 13 DKAMRERYEKWMAEQGRTYKDSTEKARRFEVFKSNAHFIDSHNAATGPGGKSRPKLTTNK 72
Query: 91 FSDLTNEEFRALY-TGYN---RPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVT 144
F+DLT +EFR +Y TG+ RP V+ + FK+ V+ DVP SIDWR +GAVT
Sbjct: 73 FADLTEDEFRNIYVTGHRVNYRPTSLVT-----DTVFKFGAVSLSDVPPSIDWRARGAVT 127
Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFE 203
+KDQ C CWAFS+ AAVEGI QIT G + LS QQLVDCS N C G +DKA+E
Sbjct: 128 SVKDQHLCACCWAFSSAAAVEGIHQITTGNQVSLSVQQLVDCSNAANEKCKAGEIDKAYE 187
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
YI + GL + DYPY GTC ++AV A IS ++ +P +E ALL AV++QPVSV
Sbjct: 188 YIARSGGLVADQDYPYEGHSGTCRVYGKQAV-ARISGFQYVPARNETALLLAVAHQPVSV 246
Query: 264 CVDASGRAFHFYKSGVLNA---DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWG 320
+D RA +G+ + C N +H + +VG+GT +E+G +YWL+KNSWG WG
Sbjct: 247 ALDGLSRALQHIGTGIFGSAGEPCTTNLNHAMTIVGYGT--DEHGTRYWLMKNSWGSDWG 304
Query: 321 ESGYIRILRDA-----GLCGIATAASYPVA 345
+ GY++ RD G+CG+A ASYPVA
Sbjct: 305 DKGYVKFARDVASEINGVCGLALEASYPVA 334
>gi|390337642|ref|XP_780653.3| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 284 bits (726), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 154/338 (45%), Positives = 214/338 (63%), Gaps = 15/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
++ ++LV C VVS SM E +W +HG+ Y + E+A R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFTDFDEDWNEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
K N + G+ TY LG N+F+DL NEEF A+ TG+ V S+ + + NV +
Sbjct: 60 IKHNLKYDLGHFTYDLGINQFTDLQNEEFVAMMTGFR--VSGTSKAAKGSTFLPPNNVGE 117
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
+P ++DWR KG VT +KDQGQCGSCWAFS +VEG GKL+ LSEQ LVDCS +
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSVEGQHFKATGKLVSLSEQNLVDCSGRD 177
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC GG MD+AF+YII+ G+ TEA YPY+ +G C + K+ V AT++ Y D+ G E+
Sbjct: 178 AGCDGGFMDRAFQYIIDAGGIDTEASYPYKAVDGKC-HFKKANVGATVTGYTDVTSGSEK 236
Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNC-DHGVAVVGFGTAEEENGAK 307
AL +AV++ P+SV +DAS +F YKSGV N C + DHGV VG+GT+ + G
Sbjct: 237 ALQKAVAHVGPISVAIDASHMSFQHYKSGVYNEPGCDSTVLDHGVLAVGYGTSSD--GTD 294
Query: 308 YWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YW++KNSW ETWG +GY+ + R+ CGIAT ASYP+
Sbjct: 295 YWIVKNSWAETWGMNGYVWMSRNKDNQCGIATNASYPL 332
>gi|359359120|gb|AEV41026.1| putative cysteine protease [Oryza minuta]
Length = 464
Score = 283 bits (725), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 145/316 (45%), Positives = 201/316 (63%), Gaps = 18/316 (5%)
Query: 42 HEQWMAQH---GRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
++ W+A+H G ++ + E R +F NL++++ N + ++LG N F+DLT
Sbjct: 66 YDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADEHGGFRLGMNRFADLT 125
Query: 96 NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV-THIKDQGQCGS 154
N+EFRA Y G R +++ V +P S+DWR+KGAV + +K+QGQCGS
Sbjct: 126 NDEFRAAYLG----TTPAGRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGS 181
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAAVEGI +I G+L+ LSEQ+LV+C+ + N GC+GG+MD AF +I N GL
Sbjct: 182 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNRGNSGCNGGIMDDAFAFITRNGGLD 241
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYPY +G CD K+ +I +ED+P+ DE +L +AV++QPVSV +DA GR F
Sbjct: 242 TEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 301
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
Y SGV CG + DHGV VG+GT + G YW ++NSWG WGE+GYIR+ R+
Sbjct: 302 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 360
Query: 331 --AGLCGIATAASYPV 344
G CGIA ASYP+
Sbjct: 361 ARTGKCGIAMMASYPI 376
>gi|359359168|gb|AEV41073.1| putative cysteine protease [Oryza minuta]
Length = 499
Score = 283 bits (724), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 200/316 (63%), Gaps = 18/316 (5%)
Query: 42 HEQWMAQH---GRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
++ W+A+H G ++ + E R +F NL++++ N + ++LG N F+DLT
Sbjct: 65 YDLWVARHRHGGGSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLT 124
Query: 96 NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKDQGQCGS 154
N+EFRA Y G R +++ V +P S+DWR+KGAV +K+QGQCGS
Sbjct: 125 NDEFRAAYLG----TTPAGRGRHVGEAYRHDGVEALPDSVDWRDKGAVVAPVKNQGQCGS 180
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAAVEGI +I G+L+ LSEQ+LV+C+ + N GC+GG+MD AF +I N GL
Sbjct: 181 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLD 240
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYPY +G C+ K+ +I +ED+P+ DE +L +AV++QPVSV +DA GR F
Sbjct: 241 TEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 300
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
Y SGV CG + DHGV VG+GT + G YW ++NSWG WGE+GYIR+ R+
Sbjct: 301 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 359
Query: 331 --AGLCGIATAASYPV 344
G CGIA ASYP+
Sbjct: 360 ARTGKCGIAMMASYPI 375
>gi|225458143|ref|XP_002280937.1| PREDICTED: cysteine proteinase RD21a [Vitis vinifera]
gi|302142569|emb|CBI19772.3| unnamed protein product [Vitis vinifera]
Length = 436
Score = 283 bits (724), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 144/336 (42%), Positives = 198/336 (58%), Gaps = 18/336 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
++ + IL++ S V S + E W Q+G+TY E EKA RL +F++N +
Sbjct: 5 LWAVSILILAVHSSVSEASST-----ADLFEAWCEQYGKTYSSEEEKASRLKVFEENHAF 59
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
+ + N N +Y L N F+DLT+ EF+A G+ S R S S VP
Sbjct: 60 VTQHNSMANASYTLALNAFADLTHHEFKASRLGF-----SPGRAQSIRSVGTPVQELHVP 114
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NH 191
++DWR+ GAVT +KDQG CG CW+FS A+EGI +I G L+ LSEQ+LVDC N
Sbjct: 115 PAVDWRKSGAVTGVKDQGNCGGCWSFSTTGAIEGINKIVTGSLVSLSEQELVDCDRSYNS 174
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLMD A++++I+N+G+ +EADYPY + C+ +K K TI Y D+P DE+
Sbjct: 175 GCEGGLMDYAYQFVIKNQGIDSEADYPYVGMDKPCNKEKLKKHIVTIDGYTDIPPNDEKQ 234
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
LLQ V+ QPVSV + S + F Y GV C + DH V +VG+GT E+G +W++
Sbjct: 235 LLQVVAKQPVSVGICGSEKTFQLYSKGVYTGPCSSTLDHAVLIVGYGT---EDGVDFWIV 291
Query: 312 KNSWGETWGESGYIRILRD----AGLCGIATAASYP 343
KNSWGE WG GYI +LR+ G+CGI ASYP
Sbjct: 292 KNSWGEHWGMRGYIHMLRNNGTAEGICGINMLASYP 327
>gi|359359215|gb|AEV41119.1| putative cysteine protease [Oryza officinalis]
Length = 499
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 144/316 (45%), Positives = 200/316 (63%), Gaps = 18/316 (5%)
Query: 42 HEQWMAQH---GRTYKDEL-EKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
++ W+A+H G ++ + E R +F NL++++ N + ++LG N F+DLT
Sbjct: 65 YDLWVARHRHGGDSHNGLVGEYERRFRVFWDNLKFVDAHNARADEHGGFRLGMNRFADLT 124
Query: 96 NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH-IKDQGQCGS 154
N+EFRA Y G R +++ V +P S+DWR+KGAV +K+QGQCGS
Sbjct: 125 NDEFRAAYLG----TTPAGRGRHVGEAYRHDGVEVLPDSVDWRDKGAVVAPVKNQGQCGS 180
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAAVEGI +I G+L+ LSEQ+LV+C+ + N GC+GG+MD AF +I N GL
Sbjct: 181 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGANSGCNGGMMDDAFAFIARNGGLD 240
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYPY +G C+ K+ +I +ED+P+ DE +L +AV++QPVSV +DA GR F
Sbjct: 241 TEEDYPYTAMDGKCNLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 300
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
Y SGV CG + DHGV VG+GT + G YW ++NSWG WGE+GYIR+ R+
Sbjct: 301 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 359
Query: 331 --AGLCGIATAASYPV 344
G CGIA ASYP+
Sbjct: 360 ARTGKCGIAMMASYPI 375
>gi|297733654|emb|CBI14901.3| unnamed protein product [Vitis vinifera]
Length = 273
Score = 282 bits (722), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 139/257 (54%), Positives = 172/257 (66%), Gaps = 8/257 (3%)
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
+TN EFR+ Y G + R S + +F Y+ V VP S+DWR+KGAVT IKDQGQC
Sbjct: 1 MTNHEFRSTYAGSKVNHHRMFRGSQHAAGSFMYEKVKSVPPSVDWRKKGAVTPIKDQGQC 60
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGL 211
GSCWAFS V AVEGI I KL+ LSEQ+LVDC T +N GC+GGLM AFE+I E G+
Sbjct: 61 GSCWAFSTVVAVEGINHIKTNKLVSLSEQELVDCDTSENQGCNGGLMGYAFEFIKEKGGI 120
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TE YPY E+GTCD K + +I +E +P +E ALL+A +NQP+SV +DA G A
Sbjct: 121 TTEQSYPYTAEDGTCDVSKVNSPVVSIDGHETVPPNNEDALLKAAANQPISVAIDAGGSA 180
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR-- 329
F FY GV CG + DHGVA+VG+GT + G KYW++KNSWG WGE+GYIR+ R
Sbjct: 181 FQFYSEGVFAGRCGTDLDHGVAIVGYGTTLD--GTKYWIVKNSWGTDWGENGYIRMKRGI 238
Query: 330 --DAGLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 239 SAKEGLCGIAVEASYPI 255
>gi|390368662|ref|XP_780781.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 154/338 (45%), Positives = 211/338 (62%), Gaps = 15/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
++ ++LV C VVS SM E QW +HG+ Y + E+A R I+++NL+ +
Sbjct: 3 YLSVLLVAVC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
K N + G+ TY LG N+F+DL NEEF A+ TG+ V S+ + + NV
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLQNEEFVAMMTGFR--VNGTSKAAKGSTFLPSNNVDK 117
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
+P ++DWR KG VT +KDQGQCGSCWAFSA ++EG GKL+ LSEQ LVDCS N
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQQFKKTGKLVSLSEQNLVDCSYRN 177
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
+GC GG MD+AF+YII+ G+ TEA Y YR +G C +K V AT++ Y D+ G E+
Sbjct: 178 YGCHGGFMDRAFQYIIDAGGIDTEATYSYRAVDGNCHFKKAN-VGATVTGYTDVTSGSEK 236
Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAK 307
AL +AV++ P+SV +DAS + F FYKSGV N C H V VVG+GT + G
Sbjct: 237 ALQKAVAHIGPISVAIDASHKFFKFYKSGVYNEPGCSTTRLGHAVLVVGYGTTSD--GTD 294
Query: 308 YWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YW++KNSW +TWG +GY+ + R+ CGIA+ ASYP+
Sbjct: 295 YWIVKNSWAKTWGMNGYLWMSRNKDNQCGIASEASYPM 332
>gi|357446979|ref|XP_003593765.1| Cysteine proteinase [Medicago truncatula]
gi|355482813|gb|AES64016.1| Cysteine proteinase [Medicago truncatula]
Length = 364
Score = 282 bits (721), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 152/293 (51%), Positives = 194/293 (66%), Gaps = 11/293 (3%)
Query: 56 ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
ELEK R IFK NLEYIE N GN++YKLG N++SDLT++EF A +TG + +S
Sbjct: 78 ELEKRKR--IFKNNLEYIENFNNAGNKSYKLGLNQYSDLTSDEFLASHTGL-KVSKQLSS 134
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
R + + DVPT+ DWR++GAVT +KDQG CG CWAFS VAAVEG +I G+L
Sbjct: 135 SKMRSAAVPFNLNDDVPTNFDWRQQGAVTDVKDQGSCGCCWAFSVVAAVEGAVKINTGEL 194
Query: 176 IELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
I LSEQQLVDC N GC GG MD AF+YII+ KG+ +EADYPY+ TC +
Sbjct: 195 ISLSEQQLVDCDERNSGCHGGNMDSAFKYIIQ-KGIVSEADYPYQEGSQTCQLNDQMKFE 253
Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
A I+ + D+P DEQ LLQAV+ QPVSV ++ G F Y V + CG + +H V V
Sbjct: 254 AQITNFIDVPANDEQQLLQAVAQQPVSVGIEV-GDEFQHYMGDVYSGTCGQSMNHAVTAV 312
Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G+G +E+ G KYWLIKNSWG+ WGE GY+++LR++ G CGIA ASYP+
Sbjct: 313 GYGVSED--GTKYWLIKNSWGKGWGEEGYMKLLRESGEPGGQCGIAAHASYPI 363
>gi|356517398|ref|XP_003527374.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 157/343 (45%), Positives = 211/343 (61%), Gaps = 32/343 (9%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
F +++ + A QV R++ + S+ E+HEQ M ++ + YKD E F N+ YI
Sbjct: 12 FAMLLCMAFLAFQVTC-RTLQDASMYERHEQRMTRYSKVYKDPPES------FXGNVNYI 64
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N ++ YK G N+F R + G+ + R +TFK++NVT P+
Sbjct: 65 EACNNAADKPYKXGINQFPP------RNRFKGH------MCSSIIRITTFKFENVTATPS 112
Query: 134 SIDWREKGAVTH--IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS-EQQLVDCSTD- 189
++D R+KGAVT +KDQGQCG WA SAVAA EGI + GKLI LS E +LVDC T
Sbjct: 113 TVDCRQKGAVTPYTVKDQGQCGCFWALSAVAATEGIHALXAGKLILLSXEPELVDCDTKG 172
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYEDLPKG 247
+ GC GGL D AF++II+N GL TEA+YPY+ +G C+ N+ +K A I+ Y+D+P
Sbjct: 173 VDQGCEGGLTDDAFKFIIQNHGLNTEANYPYKGVDGKCNANEADKNAATIITGYDDVPAN 232
Query: 248 DEQALLQ-AVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
+E+A LQ AV+N PVSV +DASG F FYKSGV CG DHGV VG+G +++ G
Sbjct: 233 NEKAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD--GT 290
Query: 307 KYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
+YWL+KNS G WGE GYIR+ R + LCGIA ASYP A
Sbjct: 291 EYWLVKNSRGPEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 333
>gi|944916|gb|AAA74430.1| cysteine proteinase [Mesembryanthemum crystallinum]
Length = 367
Score = 281 bits (719), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 150/318 (47%), Positives = 204/318 (64%), Gaps = 20/318 (6%)
Query: 35 EPSIVEKHEQWMAQH--GRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFS 92
+ ++ + +E+W + + R++ EK R ++FK+N++YI + NK ++ YKL N+F
Sbjct: 37 DETLWDLYERWRSVYTSARSFG---EKQNRFHVFKENVKYINEVNKM-DKPYKLRLNQFG 92
Query: 93 DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
DLT EF Y N + +R S F Y+NV +VP SIDWR KGAVT +K+QG+C
Sbjct: 93 DLTPSEFARTYA--NSKIIEGTRNES--GGFMYENV-EVPRSIDWRVKGAVTPVKNQGRC 147
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
G CWAFSA AAVEGI QIT G+LI LSEQQL+DC T N GC GG M +AFEYI + G+
Sbjct: 148 GGCWAFSAAAAVEGINQITTGQLISLSEQQLIDCDTQNSGCRGGTMGRAFEYIKQRGGIT 207
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA---SG 269
+EA+YPY+ + G C N + +I Y ++ + E A+L+ +++QPVSV VDA S
Sbjct: 208 SEANYPYKAQAGMCKNNLIQRPTVSIDGYYNIRR-SEDAVLKILAHQPVSVAVDATTWSS 266
Query: 270 RAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
+ FY GV CG +HGV VG+GT + G YW+IKNSWGETWGE GY+R+LR
Sbjct: 267 LDWMFYFQGVFTGPCGTKLNHGVTAVGYGTTND--GYDYWIIKNSWGETWGERGYMRMLR 324
Query: 330 DA---GLCGIATAASYPV 344
GLCGIA AS+P+
Sbjct: 325 GVSPYGLCGIAMQASFPI 342
>gi|440793751|gb|ELR14926.1| Cysteine proteinase 5, putative [Acanthamoeba castellanii str.
Neff]
Length = 326
Score = 281 bits (718), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 150/338 (44%), Positives = 200/338 (59%), Gaps = 17/338 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M +L + A V S ++ + WM +H ++Y +E E R N++++N Y
Sbjct: 1 MRTTTLLALCVALFVASTFAVSHDPLTGVFADWMQEHQKSYANE-EFVYRWNVWRENYLY 59
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE N + N+++ L N+F DLTN EF L+ G + ++S +P
Sbjct: 60 IEAHNHQ-NKSFHLAMNKFGDLTNAEFNKLFKGLSITADQAKQESDIAP------APGLP 112
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--N 190
DWR+KGAVTH+K+QGQCGSCW+FS + EG + G+L LSEQ LVDCST N
Sbjct: 113 ADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKHGRLTSLSEQNLVDCSTSYGN 172
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
HGC+GGLMD AFEYII NKG+ TE YPY +GTC K+ + +S Y ++P G+E
Sbjct: 173 HGCNGGLMDYAFEYIIRNKGIDTEESYPYHASQGTCRYNKQHSGGELVS-YTNVPSGNEG 231
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLN--ADCGNNCDHGVAVVGFGTAEEENGAKY 308
ALL AV+ QP SV +DAS +F FYK GV + A + DHGV VG+G +G Y
Sbjct: 232 ALLNAVATQPTSVAIDASHSSFQFYKGGVYDEPACSSSRLDHGVLAVGWGV---RDGKDY 288
Query: 309 WLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPVA 345
WL+KNSWG WG SGYI + R+ CGIATAAS+P A
Sbjct: 289 WLVKNSWGADWGLSGYIEMSRNKHNQCGIATAASHPHA 326
>gi|390337645|ref|XP_001199228.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 333
Score = 280 bits (717), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 153/338 (45%), Positives = 214/338 (63%), Gaps = 15/338 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
++ ++LV C VVS SM E ++W +HG+ Y + E+A R I+++NL+ +
Sbjct: 3 YLSVLLVAVC---VVSSLSMSFTDFDEDWKEWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ N + G+ TY LG N+F+DL N+EF A+ TG+ V S+ + + NV
Sbjct: 60 IRHNLKYDLGHFTYDLGMNQFADLQNKEFVAMMTGFR--VNGTSKAAKGSTFLPPNNVGK 117
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
+P ++DWR KG VT +KDQGQCGSCWAFSA ++EG GKL+ LSEQ LVDCS N
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSDKN 177
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
+GC+GGLMD+AF+YII+ G+ TE YPY +G C + K V AT++ Y D+ G E+
Sbjct: 178 YGCNGGLMDRAFQYIIDAGGIDTEESYPYIAMDGNC-HFKTANVGATVTGYTDVTSGSEK 236
Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAK 307
AL +AV++ P+SV +DAS +F Y+SGV N C + DHGV VG+GT + G
Sbjct: 237 ALQKAVAHIGPISVAIDASHFSFQLYQSGVYNEPGCSSTLLDHGVLAVGYGTTID--GTD 294
Query: 308 YWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YW++KNSW ETWG +GYI + R+ CGIAT ASYP+
Sbjct: 295 YWIVKNSWAETWGMNGYIWMSRNKDNQCGIATQASYPL 332
>gi|413943290|gb|AFW75939.1| maize insect resistance1 [Zea mays]
Length = 435
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 144/326 (44%), Positives = 197/326 (60%), Gaps = 26/326 (7%)
Query: 42 HEQWMAQHGR-------------TYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYK 85
+E W ++HGR + E ++ +RL +F+ NL YI+K N E G T++
Sbjct: 84 YEAWKSKHGRGGSSNDDCDMAPGDDEQEEDRRLRLEVFRDNLRYIDKHNAEADAGLHTFR 143
Query: 86 LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAV 143
LG F+DLT +E+R G+ + ++ + +P +IDWR+ GAV
Sbjct: 144 LGLTPFADLTLDEYRGRVLGFRARARRSGARYGHGHGYRARPRGGDLLPDAIDWRQLGAV 203
Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFE 203
T +KDQ QCG CWAFSAVAA+EGI I G L+ LSEQ+++DC + GC GG M+ AF
Sbjct: 204 TEVKDQQQCGGCWAFSAVAAIEGINAIATGNLVSLSEQEIIDCDAQDSGCDGGQMENAFR 263
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKE-KAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
++I N G+ TEADYP+ +GTCD KE ATI ++ +E AL +AV+ QPVS
Sbjct: 264 FVIGNGGIDTEADYPFIGTDGTCDASKENNEKVATIDGLVEVASNNETALQEAVAIQPVS 323
Query: 263 VCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
V +DASGRAF Y SG+ N CG + DHGV VG+G+ E+G YW++KNSW +WGE+
Sbjct: 324 VAIDASGRAFQHYSSGIFNGPCGTSLDHGVTAVGYGS---ESGKDYWIVKNSWSASWGEA 380
Query: 323 GYIRILRD----AGLCGIATAASYPV 344
GYIR+ R+ G CGIA ASYPV
Sbjct: 381 GYIRMRRNVPRPTGKCGIAMDASYPV 406
>gi|147769019|emb|CAN62459.1| hypothetical protein VITISV_015168 [Vitis vinifera]
Length = 246
Score = 280 bits (716), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 144/271 (53%), Positives = 182/271 (67%), Gaps = 32/271 (11%)
Query: 81 NRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
+++YKL NEF+DLTNEEF T NR + S+ ++FKY+NVT VP++ DWR+K
Sbjct: 2 DKSYKLSINEFADLTNEEFG---TSRNRFKAHIC--STEATSFKYENVTAVPSTXDWRKK 56
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLM 198
GAVT IKDQGQCGSCWAFSAVAA+EGITQ++ GKLI LSEQ+LVDC T ++ GC G
Sbjct: 57 GAVTPIKDQGQCGSCWAFSAVAAMEGITQLSTGKLISLSEQELVDCDTSGEDQGCXG--- 113
Query: 199 DKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
A+YPY +GTC+ +K AA I+ YED+P +E+AL +AV++
Sbjct: 114 ----------------ANYPYAGTDGTCNRKKAAHPAAKINGYEDVPANNEKALQKAVAH 157
Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
QP++V +DA G F FY SGV CG DHGV VG+GT+++ G KYWL+KNSWG
Sbjct: 158 QPIAVAIDAGGXEFQFYSSGVFTGQCGTELDHGVXAVGYGTSDD--GMKYWLVKNSWGTG 215
Query: 319 WGESGYIRILRDA----GLCGIATAASYPVA 345
WGE GYIR+ RD GLCGIA ASYP A
Sbjct: 216 WGEEGYIRMQRDVTAKEGLCGIAMQASYPTA 246
>gi|224116884|ref|XP_002317418.1| predicted protein [Populus trichocarpa]
gi|222860483|gb|EEE98030.1| predicted protein [Populus trichocarpa]
Length = 503
Score = 280 bits (715), Expect = 8e-73, Method: Compositional matrix adjust.
Identities = 148/335 (44%), Positives = 202/335 (60%), Gaps = 18/335 (5%)
Query: 19 LVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI-EKAN 77
+V+ S++VS E SI+E +QW +H + Y+ E R FK+NL+YI EKA
Sbjct: 32 IVVNDFSELVS-----EESIIEIFQQWRDRHQKVYEHAAESEKRYRNFKRNLKYIIEKAG 86
Query: 78 KE-GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
K+ + +G N+F+DL+NEEF+ LY + ++ R ++R + D P+S+D
Sbjct: 87 KKTAALGHSVGLNKFADLSNEEFKELYLSKVKKPINIKRSTARDWRQRNLQTCDAPSSLD 146
Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG 196
WR+KG VT +KDQG CGSCW+FS A+EGI I G LI LSEQ+LVDC T N+GC GG
Sbjct: 147 WRKKGVVTAVKDQGDCGSCWSFSTTGAIEGINAIVTGDLISLSEQELVDCDTTNYGCEGG 206
Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
MD AFE++I N G+ TEA+YPY +GTC+ KE+ +I Y D+ + D ALL A
Sbjct: 207 YMDYAFEWVINNGGIDTEANYPYTGVDGTCNTTKEEIKVVSIDGYTDVDETD-SALLCAT 265
Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCG---NNCDHGVAVVGFGTAEEENGAKYWLIKN 313
QP+SV +D S F Y G+ + DC N+ DH V +VG+G+ ENG YW++KN
Sbjct: 266 VQQPISVGMDGSALDFQLYTGGIYDGDCSDDPNDIDHAVLIVGYGS---ENGEDYWIVKN 322
Query: 314 SWGETWGESGYIRILRDA----GLCGIATAASYPV 344
SWG WG GY I R+ G+C I ASYP
Sbjct: 323 SWGTEWGMEGYFYIKRNTDLPYGVCAINAEASYPT 357
>gi|72008176|ref|XP_780713.1| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 335
Score = 279 bits (713), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 153/340 (45%), Positives = 212/340 (62%), Gaps = 17/340 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
++ ++LV C VVS SM E QW +HG+ Y + E+A R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFTDFDEDWNQWKNEHGKRYLSDEEEASRKLIWEKNLDIV 59
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
K N + G+ TY LG N+F+DL NEEF A+ TG+ V S+ + + N+ +
Sbjct: 60 IKHNLKYDLGHFTYALGMNQFADLKNEEFVAMMTGFR--VNGTSKAAKGSTFLPSNNIGE 117
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P ++DWR KG VT +KDQGQCGSCWAFS ++EG GKL+ LSEQ LVDCS
Sbjct: 118 LPKTVDWRTKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKATGKLVSLSEQNLVDCSGKE 177
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC GGLMD+AF+YII+ G+ TE YPY+ +G C + K+ + AT++ Y D+
Sbjct: 178 GNEGCDGGLMDQAFQYIIKAGGIDTEESYPYKAVDGEC-HFKKANIGATVTGYTDVTSDS 236
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENG 305
E AL +AV++ P+SV +DAS +F YKSGV N DC + DHGV VG+GT + G
Sbjct: 237 ETALQKAVAHIGPISVAIDASHMSFQLYKSGVYNEPDCSSTLLDHGVLAVGYGTTSD--G 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YW++KNSW ETWG +GY+ + R+ CGIAT ASYP+
Sbjct: 295 TDYWIVKNSWAETWGMNGYLWMSRNKDNQCGIATQASYPL 334
>gi|356542171|ref|XP_003539543.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP2-like [Glycine max]
Length = 342
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 151/342 (44%), Positives = 204/342 (59%), Gaps = 20/342 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEK--HEQWMAQHGRTYKDELEKAMRLNIFK 67
II + V+ L IT ++ S V + +E W+ ++G+ Y+++ E R I++
Sbjct: 10 IINLLVLCNLWITASACPAKHNDNSSDSEVMRMRYESWLKKYGQKYRNKDEWEFRFEIYR 69
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+++IE N + N +YKL N+F DLTNEEFR +Y Y +P +S + F YQ
Sbjct: 70 ANVQFIEVYNSQ-NYSYKLMDNKFVDLTNEEFRRMYLVY-QP------RSHLQTRFMYQK 121
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
D+P IDWR +GAVT IKDQG CGSCW+FSAVA VE I +I GKL+ LSEQQL+DC
Sbjct: 122 HGDLPKRIDWRTRGAVTXIKDQGHCGSCWSFSAVATVEDINKIKTGKLVSLSEQQLIDCD 181
Query: 188 --TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N GC+GG M+ F +I + GL T+ +YPY+ +G + K + A I YE+LP
Sbjct: 182 NRNGNEGCNGGHME-TFTFITKRGGLTTDKNYPYQGSDGDXNKAKVRNHAVAICGYENLP 240
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+E L AV++QP SV DA G AF Y G + CG + +H + +VG+G EENG
Sbjct: 241 AHNENMLKAAVAHQPASVATDAGGYAFQLYSKGTFSGSCGKDLNHRMTIVGYG---EENG 297
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
KYWL+KNSW G SGYIR+ RD G CG A ASYP
Sbjct: 298 EKYWLVKNSWANDXGVSGYIRMKRDPKDKDGTCGTAMEASYP 339
>gi|18396952|ref|NP_564322.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|332192922|gb|AEE31043.1| cysteine proteinase-like protein [Arabidopsis thaliana]
Length = 334
Score = 278 bits (712), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 145/329 (44%), Positives = 207/329 (62%), Gaps = 25/329 (7%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
SQ +++E SIV+ H+QWM Q R YKDE EK MRL +FK+NL++IE N GN++Y
Sbjct: 21 SQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSY 80
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT---SIDWREKG 141
LG NEF+D EEF A +TG V S+S ++ + N++D+ S DWR++G
Sbjct: 81 TLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEG 140
Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDK 200
AVT +K QG C +T+I+ L+ LSEQQL+DC + N GC+GG ++
Sbjct: 141 AVTPVKYQGAC-------------RLTKISGKNLLTLSEQQLIDCDIEKNGGCNGGEFEE 187
Query: 201 AFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
AF+YII+N G++ E +YPY+ ++ +C +A I ++ +P +E+ALL+AV QP
Sbjct: 188 AFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQMVPSHNERALLEAVRRQP 247
Query: 261 VSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
VSV +DA +F YK GV DCG + +H V +VG+GT +G YW++KNSWGE+W
Sbjct: 248 VSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTM---SGLNYWVLKNSWGESW 304
Query: 320 GESGYIRILRDA----GLCGIATAASYPV 344
GE+GY+RI RD G+CGIA A+YPV
Sbjct: 305 GENGYMRIRRDVEWPQGMCGIAQVAAYPV 333
>gi|238007404|gb|ACR34737.1| unknown [Zea mays]
gi|413943289|gb|AFW75938.1| cysteine proteinase Mir2 [Zea mays]
Length = 484
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 148/325 (45%), Positives = 200/325 (61%), Gaps = 29/325 (8%)
Query: 42 HEQWMAQH----------GRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGT 88
+E+W ++H G E + A RL +F+ NL YI+ N E G ++LG
Sbjct: 53 YEEWRSEHDAGPRRGATGGSLGPGEDDDARRLEVFRYNLRYIDAHNAEADAGLHGFRLGL 112
Query: 89 NEFSDLTNEEFRA--LYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVT 144
F+DLT EE+RA L R +V SR +Y + +P ++DWRE+GAV
Sbjct: 113 TRFADLTLEEYRARLLLGSRGRNGTAVGVVGSR----RYLPLAGEQLPDAVDWRERGAVA 168
Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-DNHGCSGGLMDKAFE 203
+KDQGQCG+CWAFSAVAAVEGI +I G LI LSEQ+L+DC + GC GGLMD AF
Sbjct: 169 EVKDQGQCGACWAFSAVAAVEGINKIVTGSLISLSEQELIDCDKFQDQGCDGGLMDNAFV 228
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
++I+N G+ TEADYP+ +GTCD + + +I +E +P E+AL +AV++QPVS
Sbjct: 229 FMIKNGGIDTEADYPFTGHDGTCDLKLKNTRVVSIDSFERVPINYERALQKAVAHQPVSA 288
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
++AS RAF Y SG+ + CG DHGV VVG+G+ E G YW++KNSWG WGE+G
Sbjct: 289 SIEASRRAFQLYSSGIFDGRCGTYLDHGVTVVGYGS---EGGKDYWIVKNSWGTQWGEAG 345
Query: 324 YIRILRD----AGLCGIATAASYPV 344
Y+R+ R+ AG CGIA YPV
Sbjct: 346 YVRMARNVRVRAGKCGIAMEPLYPV 370
>gi|113120269|gb|ABI30274.1| VS-B, partial [Vasconcellea stipulata]
Length = 341
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 144/329 (43%), Positives = 197/329 (59%), Gaps = 23/329 (6%)
Query: 17 IILVITCASQ---------VVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMRL 63
+I V+TC S + G S + + +E E WM +H + YK EK R
Sbjct: 10 LIFVVTCLSLHLGLSSADFSIVGYSQDDLTSIESSIRLFESWMLKHDKVYKTIDEKIYRF 69
Query: 64 NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
FK NL YI++ NK+ N +Y LG NEF+DLT++EF+ Y G + P S+ + S F
Sbjct: 70 ETFKDNLMYIDETNKK-NNSYWLGLNEFADLTHDEFKEKYVG-SIPEDSMIIEQSDDVEF 127
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
++V D P SIDWR+KGAVT +K+Q CGSCWAFS VA VEGI +I G LI LSEQ+L
Sbjct: 128 PNKHVVDYPESIDWRQKGAVTPVKNQNPCGSCWAFSTVATVEGINKIVTGNLISLSEQEL 187
Query: 184 VDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
+DC +HGC GG + +Y+++N G+ TE +YPY ++G C + +K + I+ Y+
Sbjct: 188 LDCDRRSHGCKGGYQTTSLKYVVDN-GVHTEKEYPYEKKQGNCRAKNKKGLKVYINGYKR 246
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P DE +L++ +S QPVSV V++ GR F FYK GV CG DH V VG+
Sbjct: 247 VPSNDEISLIKTISIQPVSVLVESKGRPFQFYKGGVFGGPCGTKLDHAVTAVGY------ 300
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG 332
G Y LIKNSWG WG+ GYI+I R +G
Sbjct: 301 -GKDYILIKNSWGPKWGDKGYIKIKRASG 328
>gi|357130490|ref|XP_003566881.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 350
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 144/328 (43%), Positives = 191/328 (58%), Gaps = 11/328 (3%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
+++LV T V+ + ++ +HEQWMA+ GR Y D EKA R +F N Y++
Sbjct: 14 LLVLVATAVFHAVAAQGEAGLTVAARHEQWMAKFGRVYTDANEKARRQAVFGANARYVDA 73
Query: 76 ANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
N+ GNRTY LG NEFSDLT+ EF + GY P + S+ Y ++P S
Sbjct: 74 VNRAGNRTYTLGLNEFSDLTDNEFAKTHLGYREFRPETA-NISKGVDPGYGLAGNIPKSF 132
Query: 136 DWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSG 195
DWR KGAVT +K QG CG CWAF+AVAA EG+ +I +G LI +SEQQ++DC+T N+ C G
Sbjct: 133 DWRTKGAVTEVKSQGGCGCCWAFAAVAATEGLVKIAKGTLISMSEQQVLDCTTGNNTCKG 192
Query: 196 GLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP-KGDEQALLQ 254
G M+ A Y+ + GL TE DY Y E+G C A ++ E +P G+E L +
Sbjct: 193 GYMNDALSYVFASGGLQTEEDYEYNAEKGACRRDVTPNPATSVGHAEYMPLDGNEFLLQK 252
Query: 255 AVSNQPVSVCVDASGRAFHFYKSGVLNA--DCGNNCDHGVAVVGFGTAEEENGAK--YWL 310
V+ QPV V V+A G F Y GV CG N DH VVG+G A+ G K YWL
Sbjct: 253 LVARQPVVVAVEAYGTDFKNYGGGVFTGSPSCGQNLDHFFTVVGYGFAD---GGKQMYWL 309
Query: 311 IKNSWGETWGESGYIRILRDAGL--CGI 336
+KN WG +WGESGY+RI R + CG+
Sbjct: 310 VKNQWGTSWGESGYMRIARGSSARNCGM 337
>gi|53791858|dbj|BAD53944.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 335
Score = 278 bits (710), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 148/343 (43%), Positives = 207/343 (60%), Gaps = 20/343 (5%)
Query: 13 MFVIIILVITC-ASQVVSGRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
M I++LV T A Q ++ + + + ++ E+WMA+ G+TYK EK R IF
Sbjct: 1 MTSIVLLVCTLMALQAMAASAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIF 60
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
+ N+ +I + +G N+F+DLTN+EF A YTG P P +++ RP +
Sbjct: 61 RDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHP---KEAPRPVDPIW- 116
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
P IDWR +GAVT +KDQG CGSCWAF+AVAA+EG+T+I G+L LSEQ+LVDC
Sbjct: 117 ----TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDC 172
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYEDLP 245
T+++GC GG D+AFE + G+ E+DY Y +G C + AA+I Y +P
Sbjct: 173 DTNSNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVP 232
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
DE+ L AV+ QPV+V +DASG AF FYKSGV CG + +H V +VG+ + +G
Sbjct: 233 PNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY-CQDGASG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
KYWL KNSWG+TWG+ GYI + +D G CG+A + YP
Sbjct: 292 KKYWLAKNSWGKTWGQQGYILLEKDIVQPHGTCGLAVSPFYPT 334
>gi|356509992|ref|XP_003523725.1| PREDICTED: oryzain alpha chain-like [Glycine max]
Length = 439
Score = 278 bits (710), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 138/315 (43%), Positives = 197/315 (62%), Gaps = 18/315 (5%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGN-----RTYKLGTNEFSDL 94
E E+W +H +TY E EK RL +F+ N ++ + N+ N +Y L N F+DL
Sbjct: 31 ELFEKWCKEHSKTYSSEEEKLYRLKVFEDNYAFVAQHNQNANNNNNNSSYTLSLNAFADL 90
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
T+ EF+ G +P + RP + +++ +P+ IDWR+ GAVT +KDQ CG+
Sbjct: 91 THHEFKTTRLG----LPLTLLRFKRPQNQQSRDLLHIPSQIDWRQSGAVTPVKDQASCGA 146
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLAT 213
CWAFSA A+EGI +I G L+ LSEQ+L+DC T N GC GGLMD A++++I+NKG+ T
Sbjct: 147 CWAFSATGAIEGINKIVTGSLVSLSEQELIDCDTSYNSGCGGGLMDFAYQFVIDNKGIDT 206
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
E DYPY+ + +C K K A TI Y D+P +E+ +L+AV++QPVSV + S R F
Sbjct: 207 EDDYPYQARQRSCSKDKLKRRAVTIEDYVDVPPSEEE-ILKAVASQPVSVGICGSEREFQ 265
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
Y G+ C DH V +VG+G+ ENG YW++KNSWG+ WG +GYI ++R++
Sbjct: 266 LYSKGIFTGPCSTFLDHAVLIVGYGS---ENGVDYWIVKNSWGKYWGMNGYIHMIRNSGN 322
Query: 332 --GLCGIATAASYPV 344
G+CGI T ASYPV
Sbjct: 323 SKGICGINTLASYPV 337
>gi|218202077|gb|EEC84504.1| hypothetical protein OsI_31195 [Oryza sativa Indica Group]
Length = 362
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 189/316 (59%), Gaps = 14/316 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ W H R+Y E R +++++N E+I+ N G+ TY+L NEF+DLT E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYRLAENEFADLTEE 106
Query: 98 EFRALYTGY---NRPVPS---VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ-G 150
EF A YTGY + PV + ++F Y+ DVP S+DWR +GAV K Q
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTS 164
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKG 210
C SCWAF A +E + I GKL+ LSEQQLVDC + + GC+ G +A+++++EN G
Sbjct: 165 TCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGG 224
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
L TEADYPY G C+ K AA I+ + +P +E AL AV+ QPV+V ++ G
Sbjct: 225 LTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GS 283
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
FYK GV CG H V VVG+GT + +GAKYW IKNSWG++WGE GYIRILRD
Sbjct: 284 GMQFYKGGVYTGPCGTRLAHAVTVVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRD 342
Query: 331 A---GLCGIATAASYP 343
GLCG+ +YP
Sbjct: 343 VGGPGLCGVTLDIAYP 358
>gi|9502426|gb|AAF88125.1|AC021043_18 Putative cysteine proteinase [Arabidopsis thaliana]
Length = 365
Score = 277 bits (709), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 148/347 (42%), Positives = 213/347 (61%), Gaps = 30/347 (8%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
SQ +++E SIV+ H+QWM Q R YKDE EK MRL +FK+NL++IE N GN++Y
Sbjct: 21 SQARPHVTLNEQSIVDYHQQWMTQFSRVYKDESEKEMRLKVFKKNLKFIENFNNMGNQSY 80
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT---SIDWREKG 141
LG NEF+D EEF A +TG V S+S ++ + N++D+ S DWR++G
Sbjct: 81 TLGVNEFTDWKTEEFLATHTGLRVNVTSLSELFNKTKPSRNWNMSDIDMEDESKDWRDEG 140
Query: 142 AVTHIKDQGQCGSCWA------------FSAVAAV------EGITQITRGKLIELSEQQL 183
AVT +K QG C ++ + V EG+T+I+ L+ LSEQQL
Sbjct: 141 AVTPVKYQGACPEFPTKQIRRNSLVGKQYTKLLGVLSDWGDEGLTKISGKNLLTLSEQQL 200
Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
+DC + N GC+GG ++AF+YII+N G++ E +YPY+ ++ +C +A I ++
Sbjct: 201 IDCDIEKNGGCNGGEFEEAFKYIIKNGGVSLETEYPYQVKKESCRANARRAPHTQIRGFQ 260
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAE 301
+P +E+ALL+AV QPVSV +DA +F YK GV DCG + +H V +VG+GT
Sbjct: 261 MVPSHNERALLEAVRRQPVSVLIDARADSFGHYKGGVYAGLDCGTDVNHAVTIVGYGTM- 319
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+G YW++KNSWGE+WGE+GY+RI RD G+CGIA A+YPV
Sbjct: 320 --SGLNYWVLKNSWGESWGENGYMRIRRDVEWPQGMCGIAQVAAYPV 364
>gi|356517384|ref|XP_003527367.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 332
Score = 277 bits (708), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 154/341 (45%), Positives = 210/341 (61%), Gaps = 29/341 (8%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
F +++ + A QV R++ + S+ E H Q M ++ + KD + +FK+N+ YI
Sbjct: 12 FAMLLSMAFLAFQVTC-RTLQDASMYESHGQRMTRYSKVDKDPPDX-----VFKENVNYI 65
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N ++ YK N+F+ + + G+ + R +TFK++NVT P+
Sbjct: 66 EACNNAADKPYKRDINQFAP------KKRFKGH------MCSSIIRITTFKFENVTATPS 113
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL-SEQQLVDCSTD--N 190
++D R+K AVT IKDQGQCG WA SAVAA EGI + GKLI L SEQ+LVDC T +
Sbjct: 114 TVDCRQKVAVTPIKDQGQCGCFWALSAVAATEGIHALXAGKLILLSSEQELVDCDTKGVD 173
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDN-QKEKAVAATISKYEDLPKGDE 249
C GGLMD AF++II+N GL TEA+YPY+ +G C+ + +K A I+ YED+P +E
Sbjct: 174 QDCQGGLMDDAFKFIIQNHGLNTEANYPYKGVDGKCNAYEADKNAATIITGYEDVPANNE 233
Query: 250 QALLQ-AVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
+A LQ AV+N PVSV +DASG F FYKSGV CG DHGV VG+G +++ G +Y
Sbjct: 234 KAHLQKAVANNPVSVAIDASGSDFQFYKSGVFTGSCGTELDHGVTAVGYGVSDD--GTEY 291
Query: 309 WLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPVA 345
WL+KNS G WGE GYIR+ R + LCGIA ASYP A
Sbjct: 292 WLVKNSRGTEWGEEGYIRMQRGVDSEEALCGIAVQASYPSA 332
>gi|226505708|ref|NP_001141813.1| uncharacterized protein LOC100273952 precursor [Zea mays]
gi|194706024|gb|ACF87096.1| unknown [Zea mays]
gi|413945958|gb|AFW78607.1| hypothetical protein ZEAMMB73_489507 [Zea mays]
Length = 460
Score = 277 bits (708), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 139/327 (42%), Positives = 191/327 (58%), Gaps = 22/327 (6%)
Query: 36 PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR------------- 82
P+I + + W A+HG+ Y E+A RL +F N ++ N
Sbjct: 30 PAIEAQFDAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARAGANAAGGGGGGAAPP 89
Query: 83 TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
+Y L N F+DLT+EEFRA G P ++ R + P + VP ++DWR+ GA
Sbjct: 90 SYTLALNAFADLTHEEFRAARLGRIAPGAAL-RSRAAPVYWGLGGGAAVPDALDWRKSGA 148
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
VT +KDQG CG+CW+FSA A+EGI +I G L+ LSEQ+L+DC N GC GGLMD A
Sbjct: 149 VTKVKDQGSCGACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYA 208
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
++++I+N G+ TE DYPYR +GTC+ K K TI Y D+P E LLQAV+ QPV
Sbjct: 209 YKFVIKNGGIDTEEDYPYREADGTCNKNKLKKRVVTIDGYTDVPSNKEDLLLQAVAQQPV 268
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
SV + S RAF Y G+ + C + DH V +VG+G+ E G YW++KNSWGE+WG
Sbjct: 269 SVGICGSARAFQLYYQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGESWGM 325
Query: 322 SGYIRILRDA----GLCGIATAASYPV 344
GY+ + R+ G+CGI AS+P
Sbjct: 326 KGYMHMHRNTGDSKGVCGINMMASFPT 352
>gi|47169030|pdb|1S4V|A Chain A, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
gi|47169031|pdb|1S4V|B Chain B, The 2.0 A Crystal Structure Of The Kdel-Tailed Cysteine
Endopeptidase Functioning In Programmed Cell Death Of
Ricinus Communis Endosperm
Length = 229
Score = 277 bits (708), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 132/219 (60%), Positives = 158/219 (72%), Gaps = 7/219 (3%)
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
VP S+DWR+KGAVT +KDQGQCGSCWAFS + AVEGI QI KL+ LSEQ+LVDC TD
Sbjct: 2 VPASVDWRKKGAVTSVKDQGQCGSCWAFSTIVAVEGINQIKTNKLVSLSEQELVDCDTDQ 61
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N GC+GGLMD AFE+I + G+ TEA+YPY +GTCD KE A A +I +E++P+ DE
Sbjct: 62 NQGCNGGLMDYAFEFIKQRGGITTEANYPYEAYDGTCDVSKENAPAVSIDGHENVPENDE 121
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
ALL+AV+NQPVSV +DA G F FY GV CG DHGVA+VG+GT + G KYW
Sbjct: 122 NALLKAVANQPVSVAIDAGGSDFQFYSEGVFTGSCGTELDHGVAIVGYGTTID--GTKYW 179
Query: 310 LIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
+KNSWG WGE GYIR+ R GLCGIA ASYP+
Sbjct: 180 TVKNSWGPEWGEKGYIRMERGISDKEGLCGIAMEASYPI 218
>gi|115478933|ref|NP_001063060.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|113631293|dbj|BAF24974.1| Os09g0381400 [Oryza sativa Japonica Group]
gi|215678649|dbj|BAG92304.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218202075|gb|EEC84502.1| hypothetical protein OsI_31193 [Oryza sativa Indica Group]
Length = 362
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 189/316 (59%), Gaps = 14/316 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ W H R+Y E R +++++N E+I+ N G+ TY+L NEF+DLT E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 106
Query: 98 EFRALYTGY---NRPVPS---VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ-G 150
EF A YTGY + PV + ++F Y+ DVP S+DWR +GAV K Q
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTS 164
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKG 210
C SCWAF A +E + I GKL+ LSEQQLVDC + + GC+ G +A+++++EN G
Sbjct: 165 TCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGG 224
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
L TEADYPY G C+ K AA I+ + +P +E AL AV+ QPV+V ++ G
Sbjct: 225 LTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GS 283
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
FYK GV CG H V VVG+GT + +GAKYW IKNSWG++WGE GYIRILRD
Sbjct: 284 GMQFYKGGVYTGPCGTRLAHAVTVVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRD 342
Query: 331 A---GLCGIATAASYP 343
GLCG+ +YP
Sbjct: 343 VGGPGLCGVTLDIAYP 358
>gi|395535909|ref|XP_003769963.1| PREDICTED: cathepsin S [Sarcophilus harrisii]
Length = 347
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 149/344 (43%), Positives = 213/344 (61%), Gaps = 21/344 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ M V+I + + CAS R H+P + E W +G+ Y+++ ++ R I+++N
Sbjct: 13 LLRMKVVIWMFLACASTTAYLR--HDPMLDNHWELWKKTYGKQYEEQNQEVTRRLIWEKN 70
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
L+++ N E G +Y L N SD+T+EE +L + P Q SR +T++
Sbjct: 71 LKFVTLHNLEHSMGLHSYDLSMNHLSDMTSEEVASLMSSLRIP-----NQWSRNTTYRLN 125
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+ +P S+DWR+KG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDC
Sbjct: 126 SNQKLPDSVDWRDKGCVTEVKYQGTCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDC 185
Query: 187 STD----NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
ST+ NHGC+GG M +AF+YII+N G+ ++A YPY+ ++G C AAT S+Y
Sbjct: 186 STNEKYENHGCNGGCMTEAFQYIIDNNGIDSDASYPYKAKDGKCQYNPANR-AATCSRYT 244
Query: 243 DLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTA 300
+LP G E AL +AV+N+ PVSV +DAS +F YKSGV + C N +HGV V G+G
Sbjct: 245 ELPYGSEDALKEAVANKGPVSVGIDASLPSFFLYKSGVYYDPSCTQNVNHGVLVTGYGNL 304
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
+ G YWL+KNSWG ++G+ GYIRI R+ G CGIA SYP
Sbjct: 305 D---GKDYWLVKNSWGLSFGDKGYIRIARNRGNHCGIANFPSYP 345
>gi|255522980|gb|ACU12382.1| RE21773p [Drosophila melanogaster]
Length = 375
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 149/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+DE E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 59 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 118
Query: 95 TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+ EFR L G+N + R +S + TF +P S+DWR KGAVT +KDQG
Sbjct: 119 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 178
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST N+GC+GGLMD AF YI +N
Sbjct: 179 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 238
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ TE YPY + +C K V AT + D+P+GDE+ + +AV+ PVSV +DAS
Sbjct: 239 GIDTEKSYPYEAIDDSCHFNK-GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 297
Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F FY GV N C N DHGV VVGFGT +E+G YWL+KNSWG TWG+ G+I+
Sbjct: 298 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 355
Query: 327 ILRDA-GLCGIATAASYPV 344
+LR+ CGIA+A+SYP+
Sbjct: 356 MLRNKENQCGIASASSYPL 374
>gi|42573181|ref|NP_974687.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
gi|332661102|gb|AEE86502.1| Xylem cysteine proteinase 1 [Arabidopsis thaliana]
Length = 288
Score = 276 bits (707), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 135/245 (55%), Positives = 176/245 (71%), Gaps = 5/245 (2%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++H + YK EK R +F++NL +I++ N E N +Y LG NEF+DLT+E
Sbjct: 47 LLELFESWMSEHSKAYKSVEEKVHRFEVFRENLMHIDQRNNEIN-SYWLGLNEFADLTHE 105
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G +P S RQ S + F+Y+++TD+P S+DWR+KGAV +KDQGQCGSCWA
Sbjct: 106 EFKGRYLGLAKPQFSRKRQPS--ANFRYRDITDLPKSVDWRKKGAVAPVKDQGQCGSCWA 163
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QIT G L LSEQ+L+DC T N GC+GGLMD AF+YII GL E D
Sbjct: 164 FSTVAAVEGINQITTGNLSSLSEQELIDCDTTFNSGCNGGLMDYAFQYIISTGGLHKEDD 223
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEG C QKE TIS YED+P+ D+++L++A+++QPVSV ++ASGR F FYK
Sbjct: 224 YPYLMEEGICQEQKEDVERVTISGYEDVPENDDESLVKALAHQPVSVAIEASGRDFQFYK 283
Query: 277 SGVLN 281
GV N
Sbjct: 284 -GVYN 287
>gi|194352760|emb|CAQ00108.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
gi|326510977|dbj|BAJ91836.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326523875|dbj|BAJ96948.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326528631|dbj|BAJ97337.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 368
Score = 276 bits (707), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 140/333 (42%), Positives = 200/333 (60%), Gaps = 29/333 (8%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR--TYKLGTNEFSDLT 95
+ ++ +W A+H RTY E+ RL ++ +N+ YIE N + TY+LG ++DLT
Sbjct: 38 MAQRFRRWKAEHSRTYATPEEERHRLRVYARNMRYIEATNGDAGAGLTYELGETAYTDLT 97
Query: 96 NEEFRALYTGYNRPVPSVSRQSSRPSTF------------------KYQNVT-DVPTSID 136
++EF A+YT +R P P T Y N + P S+D
Sbjct: 98 SDEFTAMYT--SRAPPLSDDDDDLPMTMITTRAGPVAAAGGGGWLQVYVNESAGAPASVD 155
Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG 196
WRE+GAVT +K+QGQCGSCWAFS VA +EGI QI GKL LSEQ+LVDC +HGC+GG
Sbjct: 156 WRERGAVTAVKNQGQCGSCWAFSTVAVIEGIHQIKTGKLASLSEQELVDCDKLDHGCNGG 215
Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
+ +A ++I N G+ ++ DYPY ++ TCD +K AA+IS ++ + E +L AV
Sbjct: 216 VSYRALQWITSNGGITSQDDYPYTAKDDTCDTKKLSHHAASISGFQRVATRSELSLTNAV 275
Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
+ QPV+V ++A G F Y++GV N CG +HGV VVG+G +E G YW++KNSWG
Sbjct: 276 AMQPVAVSIEAGGANFQHYRNGVYNGPCGTRLNHGVTVVGYG-EDEVTGESYWIVKNSWG 334
Query: 317 ETWGESGYIR-----ILRDAGLCGIATAASYPV 344
E WG++GY+R I + G+CGIA S+P+
Sbjct: 335 EKWGDNGYLRMKKGIIDKPEGICGIAIRPSFPL 367
>gi|24653514|ref|NP_523735.2| cysteine proteinase-1, isoform C [Drosophila melanogaster]
gi|118572624|sp|Q95029.2|CATL_DROME RecName: Full=Cathepsin L; AltName: Full=Cysteine proteinase 1;
Contains: RecName: Full=Cathepsin L heavy chain;
Contains: RecName: Full=Cathepsin L light chain; Flags:
Precursor
gi|21627209|gb|AAM68565.1| cysteine proteinase-1, isoform C [Drosophila melanogaster]
Length = 371
Score = 276 bits (706), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 149/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+DE E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 55 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 114
Query: 95 TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+ EFR L G+N + R +S + TF +P S+DWR KGAVT +KDQG
Sbjct: 115 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 174
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST N+GC+GGLMD AF YI +N
Sbjct: 175 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 234
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ TE YPY + +C K V AT + D+P+GDE+ + +AV+ PVSV +DAS
Sbjct: 235 GIDTEKSYPYEAIDDSCHFNK-GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 293
Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F FY GV N C N DHGV VVGFGT +E+G YWL+KNSWG TWG+ G+I+
Sbjct: 294 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 351
Query: 327 ILRDA-GLCGIATAASYPV 344
+LR+ CGIA+A+SYP+
Sbjct: 352 MLRNKENQCGIASASSYPL 370
>gi|125525815|gb|EAY73929.1| hypothetical protein OsI_01813 [Oryza sativa Indica Group]
Length = 336
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 206/342 (60%), Gaps = 15/342 (4%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
+F++ + ++ L AS + S + ++ E+WMA+ G+TYK EK R IF+
Sbjct: 4 AFLLVVCTLMALQAMAASAYYNNGS-DDGVTMQMFEEWMAKFGKTYKCHGEKEHRFGIFR 62
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+ +I + +G N+F+DLTN+EF A YTG P P +++ RP +
Sbjct: 63 DNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHP---KEAPRPVDPIW-- 117
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
P IDWR +GAVT +KDQG CGSCWAF+AVAA+EG+T+I G+L LSEQ+LVDC
Sbjct: 118 ---TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELVDCD 174
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYEDLPK 246
T+++GC GG D+AFE + G+ E+DY Y +G C + AA+I Y +P
Sbjct: 175 TNSNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAASIGGYRAVPP 234
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
DE+ L AV+ QPV+V +DASG AF FYKSGV CG + +H V +VG+ + +G
Sbjct: 235 NDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY-CQDGASGK 293
Query: 307 KYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
KYW+ KNSWG+TWG+ GYI + +D G CG+A + YP
Sbjct: 294 KYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYPT 335
>gi|49387634|dbj|BAD25828.1| putative cysteine proteinase [Oryza sativa Japonica Group]
gi|49388888|dbj|BAD26098.1| putative cysteine proteinase [Oryza sativa Japonica Group]
Length = 358
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 139/316 (43%), Positives = 189/316 (59%), Gaps = 14/316 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ W H R+Y E R +++++N E+I+ N G+ TY+L NEF+DLT E
Sbjct: 43 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 102
Query: 98 EFRALYTGY---NRPVPS---VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ-G 150
EF A YTGY + PV + ++F Y+ DVP S+DWR +GAV K Q
Sbjct: 103 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTS 160
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKG 210
C SCWAF A +E + I GKL+ LSEQQLVDC + + GC+ G +A+++++EN G
Sbjct: 161 TCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGG 220
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
L TEADYPY G C+ K AA I+ + +P +E AL AV+ QPV+V ++ G
Sbjct: 221 LTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GS 279
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
FYK GV CG H V VVG+GT + +GAKYW IKNSWG++WGE GYIRILRD
Sbjct: 280 GMQFYKGGVYTGPCGTRLAHAVTVVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRD 338
Query: 331 A---GLCGIATAASYP 343
GLCG+ +YP
Sbjct: 339 VGGPGLCGVTLDIAYP 354
>gi|157132324|ref|XP_001655999.1| cathepsin l [Aedes aegypti]
gi|108881694|gb|EAT45919.1| AAEL002833-PA [Aedes aegypti]
Length = 339
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 153/343 (44%), Positives = 210/343 (61%), Gaps = 19/343 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+I+++ A+ VS + + E+ + QH + Y E E+ +RL I+ QN I
Sbjct: 3 ILILLMAFVAAANAVSLYEL----VKEEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSR---PSTFKYQN 127
K N+ G Y+L N+++DL +EEF G+NR S + R P TF
Sbjct: 59 AKHNQRFDLGQEKYRLRVNKYADLLHEEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPA 118
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+VPT++DWR+KGAVT +KDQG CGSCW+FSA A+EG GKL+ LSEQ LVDCS
Sbjct: 119 NVEVPTTVDWRKKGAVTPVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCS 178
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N+GC+GG+MD AF+YI +N G+ TE YPY + TC + KAV AT Y D+P
Sbjct: 179 GKYGNNGCNGGMMDYAFQYIKDNGGIDTEKSYPYEAIDDTC-HFNPKAVGATDKGYVDIP 237
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEE 302
+GDE+AL +A++ PVS+ +DAS +F FY GV C + N DHGV VG+GT+EE
Sbjct: 238 QGDEEALKKALATVGPVSIAIDASHESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEE 297
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G YWL+KNSWG TWG+ GY+++ R+ CG+AT ASYP+
Sbjct: 298 --GEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGVATCASYPL 338
>gi|125570286|gb|EAZ11801.1| hypothetical protein OsJ_01675 [Oryza sativa Japonica Group]
Length = 319
Score = 276 bits (706), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 193/311 (62%), Gaps = 14/311 (4%)
Query: 39 VEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
++ E+WMA+ G+TYK EK R IF+ N+ +I + +G N+F+DLTN+E
Sbjct: 17 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 76
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
F A YTG P P +++ RP + P IDWR +GAVT +KDQG CGSCWAF
Sbjct: 77 FVATYTGAKPPHP---KEAPRPVDPIW-----TPCCIDWRFRGAVTGVKDQGACGSCWAF 128
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
+AVAA+EG+T+I G+L LSEQ+LVDC T+++GC GG D+AFE + G+ E+DY
Sbjct: 129 AAVAAIEGLTKIRTGQLTPLSEQELVDCDTNSNGCGGGHTDRAFELVASKGGITAESDYR 188
Query: 219 YRHEEGTCD-NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
Y +G C + AA+I Y +P DE+ L AV+ QPV+V +DASG AF FYKS
Sbjct: 189 YEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKS 248
Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GL 333
GV CG + +H V +VG+ + +G KYWL KNSWG+TWG+ GYI + +D G
Sbjct: 249 GVFPGPCGASSNHAVTLVGY-CQDGASGKKYWLAKNSWGKTWGQQGYILLEKDIVQPHGT 307
Query: 334 CGIATAASYPV 344
CG+A + YP
Sbjct: 308 CGLAVSPFYPT 318
>gi|24653516|ref|NP_725347.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|24653518|ref|NP_725348.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|1658527|gb|AAB18345.1| cysteine proteinase 1 [Drosophila melanogaster]
gi|2305221|gb|AAB65749.1| cysteine proteinase-1 [Drosophila melanogaster]
gi|7303249|gb|AAF58311.1| cysteine proteinase-1, isoform A [Drosophila melanogaster]
gi|21627210|gb|AAM68566.1| cysteine proteinase-1, isoform B [Drosophila melanogaster]
gi|54650754|gb|AAV36956.1| LP06554p [Drosophila melanogaster]
gi|220951982|gb|ACL88534.1| Cp1-PA [synthetic construct]
Length = 341
Score = 276 bits (705), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 149/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+DE E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 95 TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+ EFR L G+N + R +S + TF +P S+DWR KGAVT +KDQG
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST N+GC+GGLMD AF YI +N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ TE YPY + +C K V AT + D+P+GDE+ + +AV+ PVSV +DAS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNK-GTVGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F FY GV N C N DHGV VVGFGT +E+G YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 321
Query: 327 ILRDA-GLCGIATAASYPV 344
+LR+ CGIA+A+SYP+
Sbjct: 322 MLRNKENQCGIASASSYPL 340
>gi|15290195|dbj|BAB63884.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|125525813|gb|EAY73927.1| hypothetical protein OsI_01811 [Oryza sativa Indica Group]
Length = 342
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 146/345 (42%), Positives = 205/345 (59%), Gaps = 21/345 (6%)
Query: 12 PMFVIIILVITC--ASQVVSGRSMH-----EPSIVEKHEQWMAQHGRTYKDELEKAMRLN 64
PM ++LV+ A Q + + + + ++ E+WMA+ G+TYK EK R
Sbjct: 6 PMASAVLLVVCTLMALQAMGADAYYNNGSDDGVTMQMFEEWMAKFGKTYKCHGEKEHRFG 65
Query: 65 IFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK 124
IF+ N+ +I + +G N+F+DLTN+EF A YTG P P +++ RP
Sbjct: 66 IFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDEFVATYTGAKPPHP---KEAPRPVDPI 122
Query: 125 YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLV 184
+ P IDWR +GAVT +KDQG CGSCWAF+AVAA+EG+T+I G+L LSEQ+LV
Sbjct: 123 W-----TPCCIDWRFRGAVTGVKDQGACGSCWAFAAVAAIEGLTKIRTGQLTPLSEQELV 177
Query: 185 DCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYED 243
DC T+++GC GG D+AFE + G+ E+DY Y +G C + AA I Y
Sbjct: 178 DCDTNSNGCGGGHTDRAFELVASKGGITAESDYRYEGFQGKCRVDDMLFNHAARIGGYRA 237
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P DE+ L AV+ QPV+V +DASG AF FYKSGV CG + +H V +VG+ +
Sbjct: 238 VPPNDERQLATAVARQPVTVYIDASGPAFQFYKSGVFPGPCGASSNHAVTLVGY-CQDGA 296
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
+G KYW+ KNSWG+TWG+ GYI + +D G CG+A + YP
Sbjct: 297 SGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGTCGLAVSPFYPT 341
>gi|320169658|gb|EFW46557.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 324
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 148/308 (48%), Positives = 197/308 (63%), Gaps = 13/308 (4%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
+ W A HG +Y E+ R I++ NL++IEK N EG +YKL N+F+DLT EF A
Sbjct: 23 DSWKATHGVSYATVGEETARRGIYRANLDFIEKHNSEG-HSYKLAVNKFADLTYPEFAAK 81
Query: 103 YTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
Y G + ++ S ST+ + V+ +P S+DWR G VT IKDQGQCGSCW+FS
Sbjct: 82 YLGLRFDATNATK-SFAASTYLPRMVS-LPDSVDWRTAGIVTPIKDQGQCGSCWSFSTTG 139
Query: 163 AVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
+VEG G+L+ LSEQ LVDCS+ N GC+GGLMD+AF+YII N G+ TE+ YPY
Sbjct: 140 SVEGQHARKTGQLVSLSEQNLVDCSSAQGNAGCNGGLMDQAFQYIISNNGIDTESSYPYT 199
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV 279
++GTC V AT++ Y+D+ G E L AV+ P+SV +DAS +F FY SGV
Sbjct: 200 AQDGTCQFNSAN-VGATVASYQDIASGSESDLQNAVATVGPISVAIDASQPSFQFYSSGV 258
Query: 280 LN--ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGI 336
N A + DHGV VG+GT+ + YWL+KNSWG +WG+SGYI + R++ CGI
Sbjct: 259 YNEPACSSSQLDHGVLAVGYGTSGSSD---YWLVKNSWGTSWGQSGYIWMTRNSNNQCGI 315
Query: 337 ATAASYPV 344
ATAASYP+
Sbjct: 316 ATAASYPL 323
>gi|242093994|ref|XP_002437487.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
gi|241915710|gb|EER88854.1| hypothetical protein SORBIDRAFT_10g027980 [Sorghum bicolor]
Length = 341
Score = 275 bits (704), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 143/317 (45%), Positives = 197/317 (62%), Gaps = 36/317 (11%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKA--MRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFS 92
+ + ++ W ++HGR +D + A +RL +F+ NL YI+ N E G T++LG F+
Sbjct: 47 VRQLYKTWKSEHGRP-RDGISVADGLRLKVFRDNLRYIDAHNAEADAGLHTFRLGLTPFT 105
Query: 93 DLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
DLT EEFRA G+ N +P V+ P D+P ++DWR++GAVT +K+Q
Sbjct: 106 DLTLEEFRAHALGFLNSTLPRVASDRYLPRAGD-----DLPDAVDWRQQGAVTGVKNQLD 160
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGL 211
CG CWAFSAVAA+EGI +I LI LSEQ+L+DC T+++GC GG M KAF+++I+N G+
Sbjct: 161 CGGCWAFSAVAAMEGINKIVTNNLISLSEQELIDCDTEDYGCQGGEMQKAFQFVIDNGGI 220
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TEADYP+ GTCD +EK +I YE++P DE+AL +AV+NQP
Sbjct: 221 DTEADYPFIGTNGTCDAIREKRKVVSIDSYENVPTNDEEALQKAVANQP----------- 269
Query: 272 FHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
G+ N CG DHGV VG+G+ +NG +W++KNSWG WGESGYIR+ R+
Sbjct: 270 ------GIFNGPCGFILDHGVTAVGYGS---DNGEDFWIVKNSWGAEWGESGYIRMKRNV 320
Query: 332 ----GLCGIATAASYPV 344
G CGIA ASYPV
Sbjct: 321 LLPMGKCGIAMYASYPV 337
>gi|195056367|ref|XP_001995082.1| GH22826 [Drosophila grimshawi]
gi|193899288|gb|EDV98154.1| GH22826 [Drosophila grimshawi]
Length = 340
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 152/343 (44%), Positives = 207/343 (60%), Gaps = 19/343 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
++ +L + +Q VS + I E+ + + +H + Y+DE E+ RL IF +N I
Sbjct: 4 YIFALLALVAVAQAVSFADV----IKEEWQTFKLEHRKQYQDETEERFRLKIFNENKHKI 59
Query: 74 EKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP---STFKYQN 127
K N+ G ++K+G N+++D+ + EF G+N + R S TF
Sbjct: 60 AKHNQLYAAGEVSFKMGLNKYADMLHHEFHETMNGFNYTLHKQLRASDATFTGVTFISPE 119
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P S+DWR KGAVT +KDQG CGSCWAFS+ A+EG G LI LSEQ LVDCS
Sbjct: 120 HVKLPQSVDWRNKGAVTGVKDQGHCGSCWAFSSTGALEGQHFRKTGTLISLSEQNLVDCS 179
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
T N+GC+GGLMD AF YI +N G+ TE YPY + +C K + AT + D+P
Sbjct: 180 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNK-GTIGATDRGFTDIP 238
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGTAEE 302
+GDE+ L QAV+ PVSV +DAS +F FY +GV + C N DHGV VVG+GT +
Sbjct: 239 QGDEKKLAQAVATIGPVSVAIDASHESFQFYSTGVYDEPQCDPQNLDHGVLVVGYGT--D 296
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILR-DAGLCGIATAASYPV 344
ENG YWL+KNSWG TWG+ G+I++ R D CGIATA+SYP+
Sbjct: 297 ENGKDYWLVKNSWGTTWGDKGFIKMARNDDNQCGIATASSYPL 339
>gi|125525812|gb|EAY73926.1| hypothetical protein OsI_01810 [Oryza sativa Indica Group]
Length = 319
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 193/311 (62%), Gaps = 14/311 (4%)
Query: 39 VEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
++ E+WMA+ G+TYK EK R IF+ N+ +I + +G N+F+DLTN+E
Sbjct: 17 MQMFEEWMAKFGKTYKCHGEKEHRFGIFRDNVHFIRGYKPQVTYDSAVGINQFADLTNDE 76
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
F A YTG P P +++ RP + P IDWR +GAVT +KDQG CGSCWAF
Sbjct: 77 FVATYTGAKPPHP---KEAPRPVDPIW-----TPCCIDWRFRGAVTGVKDQGACGSCWAF 128
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
+AVAA+EG+T+I G+L LSEQ+LVDC T+++GC GG D+AFE + G+ E+DY
Sbjct: 129 AAVAAIEGLTKIRTGQLTPLSEQELVDCDTNSNGCGGGHTDRAFELVASKGGITAESDYR 188
Query: 219 YRHEEGTCD-NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
Y +G C + AA+I Y +P DE+ L AV+ QPV+V +DASG AF FYKS
Sbjct: 189 YEGFQGKCRVDDMLFNHAASIGGYRAVPPNDERQLATAVARQPVTVYIDASGPAFQFYKS 248
Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GL 333
GV CG + +H V +VG+ + +G KYW+ KNSWG+TWG+ GYI + +D G
Sbjct: 249 GVFPGPCGASSNHAVTLVGY-CQDGASGKKYWVAKNSWGKTWGQQGYILLEKDVLQPHGT 307
Query: 334 CGIATAASYPV 344
CG+A + YP
Sbjct: 308 CGLAVSPFYPT 318
>gi|391343119|ref|XP_003745860.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 385
Score = 275 bits (703), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 151/337 (44%), Positives = 211/337 (62%), Gaps = 16/337 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ ++ VI AS + P++ + E + A+H + Y+ E+ MR IF++N ++I
Sbjct: 58 LLAVLAVIGLASALSP-----NPNLNQHWENFKAEHNKKYESFPEELMRRLIFEENHQFI 112
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
E N + + LG N F DLTN+E+R Y GY RP + S+ S S + + + DVP
Sbjct: 113 EDHNSKKEFDFYLGMNHFGDLTNKEYRERYLGYRRPENTPSKASYIFS--RAEKIEDVPD 170
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
IDWR++G VT +K+QGQCGSCWAFSAV ++EG + GKL+ LSEQ LVDCST N
Sbjct: 171 QIDWRDQGFVTPVKNQGQCGSCWAFSAVGSLEGQHFKSTGKLVSLSEQNLVDCSTPEGNS 230
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC+GG MD+AFEY+ +N G+ TE YPY +G+C + K K++ AT+ + D+ +GDE+A
Sbjct: 231 GCNGGWMDQAFEYVKDNHGIDTEDSYPYVGTDGSC-HFKNKSIGATLKGFMDVKEGDEEA 289
Query: 252 LLQAVS-NQPVSVCVDASGRAFHFYKSGVLNAD-CGNN-CDHGVAVVGFGTAEEENGAKY 308
L QAV PVSV +DAS F FY+ GV N C + DHGV VVG+G ++ G +
Sbjct: 290 LRQAVGVAGPVSVAIDASSMLFQFYRGGVYNVPWCSTSELDHGVLVVGYG--KQFQGKDF 347
Query: 309 WLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
W++KNSWG WG GYI + R+ G CGIA+ AS P
Sbjct: 348 WMVKNSWGVGWGIYGYIEMSRNKGNQCGIASKASIPT 384
>gi|170041165|ref|XP_001848344.1| cathepsin l [Culex quinquefasciatus]
gi|167864709|gb|EDS28092.1| cathepsin l [Culex quinquefasciatus]
Length = 340
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 151/318 (47%), Positives = 203/318 (63%), Gaps = 19/318 (5%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTN 96
E+W A QH + Y E E+ +RL I+ QN I K N+ +G ++L N+++DL +
Sbjct: 25 EEWNAYKLQHRKKYDSETEERLRLKIYVQNKHKIAKHNQRFEQGQEKFRLRVNKYTDLLH 84
Query: 97 EEFRALYTGYNR---PVPSVSR-QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
EEF G+NR P + + P T+ +VP ++DWREKGAVT +KDQG C
Sbjct: 85 EEFVQTLNGFNRTNAKKPMLKGVKIDEPVTYIEPANVEVPKTVDWREKGAVTPVKDQGHC 144
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
GSCW+FSA A+EG GKL+ LSEQ LVDCST N+GC+GG+MD AF+YI +N G
Sbjct: 145 GSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGMMDFAFQYIKDNGG 204
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASG 269
+ TE YPY + TC + KAV AT + D+P+GDE+AL++A++ PVSV +DAS
Sbjct: 205 IDTEKAYPYEAIDDTC-HYNPKAVGATDKGFVDIPQGDEKALMKAIATAGPVSVAIDASH 263
Query: 270 RAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
+F FY GV C + N DHGV VG+GT+EE G YWL+KNSWG TWG+ GY+++
Sbjct: 264 ESFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEE--GEDYWLVKNSWGTTWGDQGYVKM 321
Query: 328 LRDA-GLCGIATAASYPV 344
R+ CGIATAASYP+
Sbjct: 322 ARNRDNHCGIATAASYPL 339
>gi|223946391|gb|ACN27279.1| unknown [Zea mays]
Length = 279
Score = 275 bits (702), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 140/261 (53%), Positives = 169/261 (64%), Gaps = 14/261 (5%)
Query: 94 LTNEEFRALYTGYNRPVPSVSR-----QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
+T +EFR Y G + R S+ S+F Y + DVP S+DWR+KGAVT +KD
Sbjct: 1 MTADEFRRHYAGSRVAHHRMFRGDRQGSSASASSFMYADARDVPASVDWRQKGAVTDVKD 60
Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIE 207
QGQCGSCWAFS +AAVEGI I L LSEQQLVDC T N GC+GGLMD AF+YI +
Sbjct: 61 QGQCGSCWAFSTIAAVEGINAIKTKNLTSLSEQQLVDCDTKANAGCNGGLMDYAFQYIAK 120
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
+ G+A E YPYR + +C +K A TI YED+P DE AL +AV++QPVSV ++A
Sbjct: 121 HGGVAAEDAYPYRARQASC--KKSPAPVVTIDGYEDVPANDESALKKAVAHQPVSVAIEA 178
Query: 268 SGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
SG F FY GV + CG DHGVA VG+G +G KYWL+KNSWG WGE GYIR+
Sbjct: 179 SGSHFQFYSEGVFSGRCGTELDHGVAAVGYGVT--ADGTKYWLVKNSWGPEWGEKGYIRM 236
Query: 328 LRDA----GLCGIATAASYPV 344
RD G CGIA ASYPV
Sbjct: 237 ARDVAAKEGHCGIAMEASYPV 257
>gi|255078398|ref|XP_002502779.1| cysteine endopeptidase [Micromonas sp. RCC299]
gi|226518045|gb|ACO64037.1| cysteine endopeptidase [Micromonas sp. RCC299]
Length = 414
Score = 274 bits (701), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 196/320 (61%), Gaps = 13/320 (4%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSD 93
S+ + +W +HG+TY E EK +RL IF N E+++K N E G T+ +G N +D
Sbjct: 63 SLSDLFHEWTQKHGKTYDSEEEKELRLKIFADNHEFVQKHNAEYENGEHTHFVGLNHLAD 122
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
LT +EF+ + GYN + SR ST++Y +VT P IDW GAVT +K+Q QCG
Sbjct: 123 LTKDEFKKML-GYNAAL-RASRAPVDASTWEYADVTP-PEEIDWVASGAVTPVKNQKQCG 179
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
SCWAFS AVEG+ I GKLI LSE++L+ CST+ N GC+GGLMD FE+I+ N+G+
Sbjct: 180 SCWAFSTTGAVEGVNAIKTGKLISLSEEELISCSTNGNMGCNGGLMDNGFEWIVNNRGID 239
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE + Y +E C + A I ++D+P DE +L++AVS QPVSV ++A ++F
Sbjct: 240 TEDGWEYVAKEEKCGFFRRHHRAVAIDGFKDVPSNDEDSLMKAVSQQPVSVAIEADHQSF 299
Query: 273 HFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENGAK-YWLIKNSWGETWGESGYIRILRD 330
Y GV +A DCG DHGV +VG+G + K +W IKNSWG WGE GYIRI +
Sbjct: 300 QLYAGGVYSAKDCGTELDHGVLLVGYGVDPKSTKHKHFWKIKNSWGPAWGEDGYIRIAKG 359
Query: 331 A----GLCGIATAASYPVAI 346
G CG+A SYP +
Sbjct: 360 GSGVEGQCGVAMQPSYPTKL 379
>gi|195583187|ref|XP_002081405.1| GD10995 [Drosophila simulans]
gi|194193414|gb|EDX06990.1| GD10995 [Drosophila simulans]
Length = 341
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 147/319 (46%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+D+ E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 95 TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+ EFR L G+N + R +S + TF +P S+DWR KGAVT +KDQG
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST N+GC+GGLMD AF YI +N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ TE YPY + +C K + AT + D+P+GDE+ + +AV+ PVSV +DAS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNK-GTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F FY GV N C N DHGV VVGFGT +E+G YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGDDYWLVKNSWGTTWGDKGFIK 321
Query: 327 ILRDA-GLCGIATAASYPV 344
+LR+ CGIA+A+SYP+
Sbjct: 322 MLRNKENQCGIASASSYPL 340
>gi|388890776|gb|AFK80364.1| cysteine proteinase 3, partial [Acanthamoeba castellanii]
Length = 329
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 147/338 (43%), Positives = 202/338 (59%), Gaps = 14/338 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M I ILV+ A V S + + +WM + ++Y +E E R N++++N +
Sbjct: 1 MRAITILVLLAAICVASTLATTHDPLTGVFAEWMRDNSKSYSNE-EFVFRWNVWRENQQL 59
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
IE+ N+ N+T L N+F DLTN EF L+ G S +++ + K +
Sbjct: 60 IEEHNRS-NKTSFLAMNKFGDLTNAEFNKLFKGL---AFDYSFHANKAAAEKAVPAPGLS 115
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--N 190
DWR+KGAVTH+K+QGQCGSCW+FS + EG + G+L LSEQ L+DCS N
Sbjct: 116 ADFDWRQKGAVTHVKNQGQCGSCWSFSTTGSTEGANFLKTGRLTSLSEQNLIDCSGSYGN 175
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
+GC+GGLMD AFEYII NKG+ TEA YPY+ + TC + +++ Y D+ GDE
Sbjct: 176 NGCNGGLMDYAFEYIINNKGIDTEASYPYQTAQYTCQYNPANS-GGSLTSYTDVSSGDEN 234
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKY 308
ALL AV+ +P SV +DAS +F FY GV +A DHGV VG+GT E+G Y
Sbjct: 235 ALLNAVATEPTSVAIDASHNSFQFYSGGVYYESACSSTQLDHGVLAVGWGT---EDGQDY 291
Query: 309 WLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPVA 345
WL+KNSWG WG +GYI++ R+ + CGIAT+ASYP A
Sbjct: 292 WLVKNSWGADWGLAGYIKMARNRSNNCGIATSASYPTA 329
>gi|326497561|dbj|BAK05870.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 340
Score = 274 bits (700), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 145/340 (42%), Positives = 205/340 (60%), Gaps = 18/340 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFK 67
+IP+ V++ + A SG + + ++++ QW A H R+Y E+ R +++
Sbjct: 11 VIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYR 70
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
N+EYI+ N+ G TY+LG N+F+DLT EEF A Y G +++ + + +
Sbjct: 71 TNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAG-GHTGSAITTAAEADGSLE--- 126
Query: 128 VTDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
D P S+DWR KGAVT +K+QG QC SCWAFSAVA +E + I GKL+ LSEQQLVDC
Sbjct: 127 -ADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQLVDC 185
Query: 187 STDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
+ GC+ G +AF++I+EN G+ T A YPY+ G C K A TI+ + + K
Sbjct: 186 DKYDGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAAKP---AVTITGHLAVAK 242
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGA 306
+E AL AV+ QP+ V ++ + FYKSGV +A CG H V VG+G + +G
Sbjct: 243 -NELALQSAVARQPIGVAIEVP-ISMQFYKSGVFSAACGIQMSHAVVTVGYGA--DASGL 298
Query: 307 KYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYP 343
KYWL+KNSWG+TWGE+GYIR+ RD GLCGIA +YP
Sbjct: 299 KYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 338
>gi|91992508|gb|ABE72970.1| cathepsin L [Aedes aegypti]
Length = 339
Score = 274 bits (700), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 151/317 (47%), Positives = 200/317 (63%), Gaps = 18/317 (5%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTN 96
E+W A QH + Y E E+ +RL I+ QN I K N+ G Y+L N+++DL +
Sbjct: 25 EEWNAFKLQHRKNYDSETEERIRLKIYVQNKHKIAKHNQRFDLGQEKYRLRVNKYADLLH 84
Query: 97 EEFRALYTGYNRPVPSVSRQSSR---PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
EEF G+NR S + R P TF +VPT++DWR+KGAVT +KDQG CG
Sbjct: 85 EEFVQTVNGFNRTDSKKSLKGVRIEEPVTFIEPANVEVPTTVDWRKKGAVTPVKDQGHCG 144
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
SCW+FSA A+EG GKL+ LSEQ LVDCS N+GC+GG+MD AF+YI +N G+
Sbjct: 145 SCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSGKYGNNGCNGGMMDYAFQYIKDNGGI 204
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGR 270
TE YPY + TC + KAV AT Y D+P+GDE+AL +A++ PVS+ +DAS
Sbjct: 205 DTEKSYPYEAIDDTC-HFNPKAVGATDKGYVDIPQGDEEALKKALATVGPVSIAIDASHE 263
Query: 271 AFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
+F FY GV C + N DHGV VG+GT+EE G YWL+KNSWG TWG+ GY+++
Sbjct: 264 SFQFYSEGVYYEPQCDSENLDHGVLAVGYGTSEE--GEDYWLVKNSWGTTWGDQGYVKMA 321
Query: 329 RDA-GLCGIATAASYPV 344
R+ CG+AT ASYP+
Sbjct: 322 RNHDNHCGVATCASYPL 338
>gi|195484843|ref|XP_002090843.1| GE12574 [Drosophila yakuba]
gi|194176944|gb|EDW90555.1| GE12574 [Drosophila yakuba]
Length = 341
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 147/319 (46%), Positives = 200/319 (62%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+D+ E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 95 TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+ EFR L G+N + R S + TF +P S+DWR KGAVT +KDQG
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRATDDSFKGVTFISPAHVTLPKSVDWRSKGAVTAVKDQGH 144
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST N+GC+GGLMD AF YI +N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ TE YPY + +C K + AT + D+P+GDE+ + +AV+ PVSV +DAS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNK-GTIGATDRGFTDIPQGDEKKMAEAVATVGPVSVAIDAS 263
Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F FY GV N C N DHGV VVGFGT +E+G YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGDDYWLVKNSWGTTWGDKGFIK 321
Query: 327 ILRDA-GLCGIATAASYPV 344
+LR+ CGIA+A+SYP+
Sbjct: 322 MLRNKDNQCGIASASSYPL 340
>gi|125606204|gb|EAZ45240.1| hypothetical protein OsJ_29883 [Oryza sativa Japonica Group]
Length = 350
Score = 273 bits (699), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 147/325 (45%), Positives = 192/325 (59%), Gaps = 20/325 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ EQWM +HGR Y D EK R ++++N+E +E N N YKL N+F+DLTNE
Sbjct: 27 MLDRFEQWMIRHGRAYTDAGEKQRRFEVYRRNVELVETFNSMSN-GYKLADNKFADLTNE 85
Query: 98 EFRALYTGYNRP---VPSVSRQSSRPSTFKYQNVTDV-PTSIDWREKGAVTH-IKDQGQC 152
EFRA G+ RP +P +S S ++ D+ P S+DWR KGAV + K
Sbjct: 86 EFRAKMLGF-RPHVTIPQISNTCSADIAMPGESSDDILPKSVDWRNKGAVINRWKICVDA 144
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
GSCWAFSAVAA+EGI QI G+L+ LSEQ+LVDC + GC GG M AFE+++ N GL
Sbjct: 145 GSCWAFSAVAAIEGINQIKNGELVSLSEQELVDCDDEAVGCGGGYMSWAFEFVVGNHGLT 204
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TEA YPY G C K A I+ Y ++ E L +A + QPVSV VD F
Sbjct: 205 TEASYPYHAANGACQAAKLNQSAVAIAGYRNVTPSSEPDLARAAAAQPVSVAVDGGSFMF 264
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEEN--------GAKYWLIKNSWGETWGESGY 324
Y SGV C + +HGV VVG+G +E + G KYW++KNSWG WG++GY
Sbjct: 265 QLYGSGVYTGPCTADVNHGVTVVGYGESEPKTDGGGAAKGGEKYWIVKNSWGAEWGDAGY 324
Query: 325 IRILRD-----AGLCGIATAASYPV 344
I + RD +GLCGIA SYPV
Sbjct: 325 ILMQRDVAGLASGLCGIALLPSYPV 349
>gi|146152090|gb|ABQ08058.1| cathepsin L [Misgurnus mizolepis]
Length = 337
Score = 273 bits (699), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 151/342 (44%), Positives = 209/342 (61%), Gaps = 16/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M+ + L C S V + S+ + + + EQW HG+ Y E E+ R I+++NL
Sbjct: 1 MWTYLALFTLCLSGVFAAPSL-DKQLDDHWEQWKTWHGKNYH-EKEEGWRRMIWEKNLRK 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I+ N E G TY+LG N F D+ +EEFR + GY + + + S F N
Sbjct: 59 IQFHNLEHSMGIHTYRLGMNHFGDMNHEEFRQVMNGYKHK----TERKFKGSLFMEPNFL 114
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+VP+ +DWREKG VT +KDQG+CGSCWAFS A+EG +GKL+ LSEQ LVDCS
Sbjct: 115 EVPSKLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKQGKLVSLSEQNLVDCSRP 174
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+YI +N GL +E YPY + + K AA + + D+P G
Sbjct: 175 EGNEGCNGGLMDQAFQYIKDNNGLDSEEAYPYLGTDDQPCHYDPKYNAANDTGFVDIPSG 234
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E AL++AV++ PVSV +DA +F FY+SG+ +C + DHGV VVG+G E+
Sbjct: 235 KEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYFEKECSSEELDHGVLVVGYGFEGEDV 294
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+G KYW++KNSW E+WG+ GYI + +D CGIATAASYP+
Sbjct: 295 DGKKYWIVKNSWSESWGDKGYIYMAKDRKNHCGIATAASYPL 336
>gi|195334204|ref|XP_002033774.1| GM21500 [Drosophila sechellia]
gi|194125744|gb|EDW47787.1| GM21500 [Drosophila sechellia]
Length = 341
Score = 273 bits (698), Expect = 8e-71, Method: Compositional matrix adjust.
Identities = 146/319 (45%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+D+ E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRFAEGKVSFKLAVNKYADL 84
Query: 95 TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+ EFR L G+N + R +S + TF +P S+DWR KGAVT +KDQG
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRAADESFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST N+GC+GGLMD AF YI +N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ TE YPY + +C K + AT + D+P+GDE+ + +AV+ PV+V +DAS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNK-GTIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDAS 263
Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F FY GV N C N DHGV VVGFGT +E+G YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGFGT--DESGEDYWLVKNSWGTTWGDKGFIK 321
Query: 327 ILRDA-GLCGIATAASYPV 344
+LR+ CGIA+A+SYP+
Sbjct: 322 MLRNKENQCGIASASSYPL 340
>gi|326490904|dbj|BAJ90119.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 273 bits (698), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 137/316 (43%), Positives = 190/316 (60%), Gaps = 16/316 (5%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK------EGNRTYKLGTNEFSDL 94
+ E W A+HG+ Y E+A RL F +N ++ N G +Y L N F+DL
Sbjct: 38 QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSIDWREKGAVTHIKDQGQCG 153
T++EFRA G P S PS ++ V VP ++DWR+ GAVT +KDQG CG
Sbjct: 98 THDEFRAARLGRLAVGPGPLGAPS-PSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCG 156
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
+CW+FSA A+EGI +IT G L+ LSEQ+L+DC N GC GGLM A++++I+N G+
Sbjct: 157 ACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGID 216
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYP+R +GTC+ K K TI Y+++P E LLQAV+ QP+SV + S RAF
Sbjct: 217 TEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAF 276
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
Y G+ + C + DH V +VG+G+ E G YW++KNSWGE WG GY+ + R+
Sbjct: 277 QLYSQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGERWGMKGYMHMHRNTG 333
Query: 331 --AGLCGIATAASYPV 344
+G+CGI AS+P
Sbjct: 334 SSSGICGINMMASFPT 349
>gi|116794072|gb|ABK26996.1| unknown [Picea sitchensis]
Length = 367
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 142/324 (43%), Positives = 190/324 (58%), Gaps = 21/324 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
+V ++W+ +HG+ Y EKA RL IF+ NL+YI NK N +++LG N+F+DLTNE
Sbjct: 39 LVRLFDRWLGRHGKLYGSHEEKARRLQIFRTNLQYIHAHNKNSNSSFRLGLNKFADLTNE 98
Query: 98 EFRALYTGYNRPVPSVSRQSS------RP----STFKYQNVTDVPTSIDWREKGAVTHIK 147
EF+ Y G N R++ RP + + + +S+DWR+KGAVT +K
Sbjct: 99 EFKTRYFGKNSKQWRDRRRTELEGAELRPVLKQTVGSQSSSCSIASSLDWRKKGAVTGVK 158
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIE 207
DQ QCGSCWAFS A+EG+ I+ GKL+ LSEQ+LV C N+GC GG MD AF ++I+
Sbjct: 159 DQAQCGSCWAFSTTGAIEGVNFISTGKLVSLSEQELVACDATNYGCEGGDMDYAFTWVIQ 218
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
N G+ TE DY Y + TC+ KE +I Y D+ D+ ALL A +QPVSV +D
Sbjct: 219 NGGIDTEKDYSYTGVDSTCNTNKEAKKIVSIDGYTDVSP-DDSALLCAAGSQPVSVGIDG 277
Query: 268 SGRAFHFYKSGVLNADCGNN---CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
S F Y G+ + DC N DH V VVG+ +NG YW++KNSWG WG GY
Sbjct: 278 SAIDFQLYTGGIYDGDCSGNPDDIDHAVLVVGY---SAKNGKDYWIVKNSWGTDWGLEGY 334
Query: 325 IRILRDA----GLCGIATAASYPV 344
ILR+ G+C I ASYP
Sbjct: 335 FYILRNTELPYGVCAINAMASYPT 358
>gi|333069454|gb|AEF13978.1| chymopapain [Carica papaya]
Length = 352
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 139/345 (40%), Positives = 203/345 (58%), Gaps = 14/345 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
K + +II + ++ A G S + + +E+ + WM +H + Y+ EK R
Sbjct: 9 KIIFLATCLIIHMSLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+ Y G + +
Sbjct: 69 FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVG-SVAEDFTGLEHFDNED 126
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F Y++VT+ P SIDWR KGAVT +K+QG CGSCWAFS +A VEG+ +I G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGSCGSCWAFSTIATVEGVNKIVTGNLLELSEQE 186
Query: 183 LVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
LVDC ++HGC GG + +Y+ +N G+ T YPY+ + C + I+ Y+
Sbjct: 187 LVDCDKNSHGCKGGYQTTSLQYVADN-GVHTSKVYPYQAKAMQCRATDKPGPKVKITGYK 245
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
+P E + L A++NQP+SV V+A G+ F YKSGV + CG DH V VG+GT++
Sbjct: 246 RVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSD- 304
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
G Y +IKNSWG WGE GY+R+ R + G CG+ ++ YP
Sbjct: 305 --GKNYIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|307110445|gb|EFN58681.1| hypothetical protein CHLNCDRAFT_56822 [Chlorella variabilis]
Length = 466
Score = 273 bits (697), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 148/349 (42%), Positives = 205/349 (58%), Gaps = 23/349 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHE----------PSIVEKHEQWMAQHGRTYKDELEKAMR 62
M + +L++ C+ V+ E S E + W+ R Y E R
Sbjct: 1 MRLSCVLLVACSCLAVAAGFPFENHRLFIQQAVESPREAFDFWVQTLKRAYASAEEYERR 60
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
+++ NL ++ + N G+ ++ L ++DL+ +E+R+ GYN + + R +
Sbjct: 61 FDVWLDNLRFVHEYNA-GHTSHWLSMGVYADLSQDEYRSKALGYNADLHE--ERPLRAAP 117
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F Y+ T P +DW KGAVT +K+Q CGSCWAFS AVEG + I GKL LSEQ
Sbjct: 118 FLYEG-TVPPKEVDWVAKGAVTPVKNQLLCGSCWAFSTTGAVEGASAIATGKLASLSEQM 176
Query: 183 LVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
LVDC + ++GC GGLMD AFE+I++N G+ TE DYPY EEG C + K + TI Y
Sbjct: 177 LVDCDRERDNGCHGGLMDFAFEFIMKNGGIDTEDDYPYTAEEGMCQDNKMRRHVVTIDDY 236
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
+D+P DE AL++AV+NQPVSV ++A RAF Y GV +A+CG DHGV VVG+GTA
Sbjct: 237 QDVPPNDEHALMKAVANQPVSVAIEADQRAFQLYGGGVFDAECGTALDHGVLVVGYGTA- 295
Query: 302 EENGA---KYWLIKNSWGETWGESGYIRILRDA---GLCGIATAASYPV 344
NG YWL+KNSWG WG+ GYIR+LR+ G CG+A AS+P+
Sbjct: 296 -SNGTHHLPYWLVKNSWGAEWGDKGYIRLLRNLGEEGQCGVAMQASFPI 343
>gi|195124431|ref|XP_002006696.1| GI21205 [Drosophila mojavensis]
gi|193911764|gb|EDW10631.1| GI21205 [Drosophila mojavensis]
Length = 339
Score = 272 bits (696), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 150/340 (44%), Positives = 211/340 (62%), Gaps = 21/340 (6%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
+L + +Q VS + I E+ + +H +TY+DE E+ RL IF +N I K N
Sbjct: 7 LLALVAVAQAVSFADV----IKEEWHTFKLEHRKTYQDETEERFRLKIFNENKHKIAKHN 62
Query: 78 KE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS----TFKYQNVTD 130
+ G T+K+ N+++D+ + EFR G+N + R +S PS TF
Sbjct: 63 QRYATGEVTFKMAVNKYADMLHHEFRETMNGFNYTLHKELR-ASDPSFTGITFISPAHVK 121
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P S+DWREKGAVT +KDQG CGSCWAFS+ A+EG G L+ LSEQ LVDCS
Sbjct: 122 LPKSVDWREKGAVTAVKDQGHCGSCWAFSSTGALEGQHFRKTGTLVSLSEQNLVDCSAKY 181
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC+GGLMD AF YI +N G+ TE YPY + +C K+ +V AT + D+P+G+
Sbjct: 182 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNKD-SVGATDRGFADIPQGN 240
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
E+ + +AV+ PVSV +DAS +F FY G+ N +C + N DHGV VVG+GT +E+G
Sbjct: 241 EKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPECNSQNLDHGVLVVGYGT--DESG 298
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YWL+KNSWG TWG+ G+I++ R+ CGIA+A+SYP+
Sbjct: 299 KDYWLVKNSWGTTWGDKGFIKMARNEDNQCGIASASSYPL 338
>gi|320164780|gb|EFW41679.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 334
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 147/313 (46%), Positives = 193/313 (61%), Gaps = 21/313 (6%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRAL 102
E W G++Y D +E+ R +++ N ++ N G +Y LG N F+DLT+EEF+
Sbjct: 31 EAWKRTFGKSYSDAVEEINRRAVWEANKMLVDAHNGAGIHSYTLGMNIFADLTHEEFKRF 90
Query: 103 YTG----YNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
Y G NRP +S+ STF NV +P S+DWR G VT +KDQGQCGSCW+
Sbjct: 91 YLGTKVDLNRP------RSNFSSTFIPTANVGALPDSVDWRTAGIVTPVKDQGQCGSCWS 144
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEA 215
FS +VEG G+L+ LSEQ LVDCS N GC+GGLMD AF+YII NKG+ TEA
Sbjct: 145 FSTTGSVEGQHARKTGQLVSLSEQNLVDCSKAQGNQGCNGGLMDDAFQYIITNKGIDTEA 204
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHF 274
YPY ++GTC V AT+S ++D+ +G E L AV+ PVSV +DAS +F
Sbjct: 205 SYPYTAKDGTCKFNAAN-VGATLSSFQDITRGSESDLQNAVATVGPVSVAIDASKNSFQL 263
Query: 275 YKSGVLNAD--CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
Y SGV N + DHGV G+GT+ NG YWL+KNSWG +WG++GYI + R+A
Sbjct: 264 YTSGVYNEKKCSSTSLDHGVLAAGYGTS---NGTPYWLVKNSWGSSWGQAGYIWMSRNAN 320
Query: 332 GLCGIATAASYPV 344
CGIAT+ASYP+
Sbjct: 321 NQCGIATSASYPI 333
>gi|194883222|ref|XP_001975702.1| GG20414 [Drosophila erecta]
gi|190658889|gb|EDV56102.1| GG20414 [Drosophila erecta]
Length = 341
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 146/319 (45%), Positives = 201/319 (63%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ + +H + Y+D+ E+ RL IF +N I K N+ EG ++KL N+++DL
Sbjct: 25 VMEEWHTFKLEHRKNYQDDTEERFRLKIFNENKHKIAKHNQRYAEGKVSFKLAVNKYADL 84
Query: 95 TNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+ EFR L G+N + R S + TF +P S+DWR KGAVT +KDQG
Sbjct: 85 LHHEFRQLMNGFNYTLHKQLRSTDDSFKGVTFISPAHVTLPKSVDWRTKGAVTAVKDQGH 144
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFS+ A+EG G L+ LSEQ LVDCST N+GC+GGLMD AF YI +N
Sbjct: 145 CGSCWAFSSTGALEGQHFRKSGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNG 204
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ TE YPY + +C K A+ AT + D+P+GDE+ + +AV+ PV+V +DAS
Sbjct: 205 GIDTEKSYPYEAIDDSCHFNK-GAIGATDRGFTDIPQGDEKKMAEAVATVGPVAVAIDAS 263
Query: 269 GRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F FY GV N C N DHGV VVG+GT +E+G YWL+KNSWG TWG+ G+I+
Sbjct: 264 HESFQFYSEGVYNEPQCDAQNLDHGVLVVGYGT--DESGDDYWLVKNSWGTTWGDKGFIK 321
Query: 327 ILRDA-GLCGIATAASYPV 344
+LR+ CGIA+A+SYP+
Sbjct: 322 MLRNKDNQCGIASASSYPL 340
>gi|115464789|ref|NP_001055994.1| Os05g0508300 [Oryza sativa Japonica Group]
gi|48475189|gb|AAT44258.1| hypothetical protein [Oryza sativa Japonica Group]
gi|113579545|dbj|BAF17908.1| Os05g0508300 [Oryza sativa Japonica Group]
Length = 450
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 136/307 (44%), Positives = 186/307 (60%), Gaps = 9/307 (2%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+ E W A+HGR+Y E+A RL F N ++ A+ +Y L N F+DLT++EFR
Sbjct: 37 QFEAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFR 95
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
A G R P V VP ++DWR+ GAVT +KDQG CG+CW+FSA
Sbjct: 96 AARLGRLAAAGGPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSA 155
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
A+EGI +I G LI LSEQ+L+DC N GC GGLMD A++++++N G+ TEADYPY
Sbjct: 156 TGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPY 215
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
R +GTC+ K K TI Y+D+P +E LLQAV+ QPVSV + S RAF Y G+
Sbjct: 216 RETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGI 275
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCG 335
+ C + DH + +VG+G+ E G YW++KNSWGE+WG GY+ + R+ G+CG
Sbjct: 276 FDGPCPTSLDHAILIVGYGS---EGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCG 332
Query: 336 IATAASY 342
I S+
Sbjct: 333 INQMPSF 339
>gi|156124998|gb|ABU50817.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 151/329 (45%), Positives = 204/329 (62%), Gaps = 24/329 (7%)
Query: 32 SMHEPSIV-------EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GN 81
S H PS++ + EQ+ + GR Y + R +IF+ NL++I + N + G+
Sbjct: 16 SAHIPSMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGD 75
Query: 82 RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
T+ + N F+DL+NEEFRA + GY R ++ S S +V +P ++DW KG
Sbjct: 76 STFSVSVNNFTDLSNEEFRATFNGYRR----LAAVSLADSVHADNDVEALPATVDWTTKG 131
Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMD 199
VT IK+Q QCGSCWAFSAVA++EG + GKL+ LSEQ LVDCS + GCSGG MD
Sbjct: 132 VVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMD 191
Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN- 258
AF+Y+I+N+G+ TEA YPY+ + +C+ K +V ATI + D+ GDE AL AV++
Sbjct: 192 YAFKYVIQNRGIDTEASYPYKAIDESCE-FKRNSVGATIHSFVDVKTGDESALQNAVASI 250
Query: 259 QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
P+SV +DA+ +F FY SGV N DC DHGV VG+GT NGA YW +KNSWG
Sbjct: 251 GPISVAIDAAQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL---NGAPYWKVKNSWG 307
Query: 317 ETWGESGYIRILRDA-GLCGIATAASYPV 344
+WG GYI + R+ CGIAT ASYPV
Sbjct: 308 TSWGRKGYIFMSRNKQNQCGIATKASYPV 336
>gi|242040563|ref|XP_002467676.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
gi|241921530|gb|EER94674.1| hypothetical protein SORBIDRAFT_01g032090 [Sorghum bicolor]
Length = 358
Score = 272 bits (695), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 197/311 (63%), Gaps = 10/311 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ +W A + R+Y E+ R ++++N+E+IE N+ GN TY LG N+F+DLT E
Sbjct: 53 MMDRFLRWQATYNRSYPTAEERQRRFQVYRRNMEHIEATNRAGNLTYTLGENQFADLTEE 112
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCW 156
EF LYT + +P V R + + + +V D PTS+DWR +GAVT IK+QG C SCW
Sbjct: 113 EFLDLYT--MKGMPPVRRDAGKKQQANFSSVVDAPTSVDWRSRGAVTPIKNQGPSCSSCW 170
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEAD 216
AF A +E ITQI GKL+ LSEQ+L+DC + GC+ G ++++I+N GL TEA+
Sbjct: 171 AFVTAATIESITQIRTGKLVSLSEQELIDCDPYDGGCNLGYFVNGYKWVIQNGGLTTEAN 230
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ C+ K AA IS Y LP+G E L QAV+ QPV+ ++ G + FY
Sbjct: 231 YPYQARRYQCNRSKAGQRAARISNYRQLPQG-EAQLQQAVAQQPVAAAIEMGG-SLQFYS 288
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI---LRDAGL 333
GV + CG +H + VVG+G + +G KYWL+KNSWG+TWGE GY+R+ +R GL
Sbjct: 289 GGVWSGQCGTRMNHAITVVGYGA--DSSGVKYWLVKNSWGQTWGERGYLRMRKDVRQGGL 346
Query: 334 CGIATAASYPV 344
CGIA +YP+
Sbjct: 347 CGIALDLAYPI 357
>gi|312381833|gb|EFR27483.1| hypothetical protein AND_05794 [Anopheles darlingi]
Length = 344
Score = 271 bits (694), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 150/322 (46%), Positives = 200/322 (62%), Gaps = 23/322 (7%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTN 96
E+W A QH + Y E E+ +R+ I+ QN I K N+ G ++L N+++DL +
Sbjct: 25 EEWNAFKLQHRKKYDSESEERIRMKIYVQNKHKIAKHNQRYDLGQEKFRLRVNKYADLLH 84
Query: 97 EEFRALYTGYNRPVPSVSRQSSR--------PSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
EEF G+NR + S+ R P T+ DVPT+IDWREKGAVT +KD
Sbjct: 85 EEFVHTLNGFNRSAAAGSKLLGREQLMTIEEPITWIEPANVDVPTTIDWREKGAVTPVKD 144
Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYII 206
QG CGSCW+FSA A+EG GKL+ LSEQ LVDCST N+GC+GGLMD AF+Y+
Sbjct: 145 QGHCGSCWSFSATGALEGQHFRKTGKLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFQYVK 204
Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCV 265
+NKG+ TE YPY + C + KA+ AT + D+P+GDE+AL +A++ PVSV +
Sbjct: 205 DNKGIDTEKAYPYEAIDDEC-HYNPKAIGATDKGFVDIPQGDEKALKKALATVGPVSVAI 263
Query: 266 DASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
DAS +F FY GV C + DHGV VG+GT E+ G YWL+KNSWG TWG+ G
Sbjct: 264 DASHESFQFYSEGVYYEPQCDSEQLDHGVLAVGYGTTED--GEDYWLVKNSWGTTWGDQG 321
Query: 324 YIRILRD-AGLCGIATAASYPV 344
Y+++ R+ CGIAT ASYP+
Sbjct: 322 YVKMARNRENHCGIATTASYPL 343
>gi|156124996|gb|ABU50816.1| Ale o 1 allergen [Aleuroglyphus ovatus]
Length = 337
Score = 271 bits (694), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 150/329 (45%), Positives = 204/329 (62%), Gaps = 24/329 (7%)
Query: 32 SMHEPSIV-------EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GN 81
S H PS++ + EQ+ + GR Y + R +IF+ NL++I + N + G+
Sbjct: 16 SAHIPSMLLTEGELEAQFEQFKSTFGRVYPSPEIELHRKSIFRANLQFILRHNIDYFNGD 75
Query: 82 RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
T+ + N F+DL+NEEFRA + GY R ++ S S +V +P ++DW KG
Sbjct: 76 STFSVSVNNFTDLSNEEFRATFNGYRR----LAAVSLADSVHADNDVEALPATVDWTTKG 131
Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMD 199
VT IK+Q QCGSCWAFSAVA++EG + GKL+ LSEQ LVDCS + GCSGG MD
Sbjct: 132 VVTPIKNQQQCGSCWAFSAVASMEGQHALKTGKLVSLSEQNLVDCSAAEGDMGCSGGWMD 191
Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN- 258
AF+Y+I+N+G+ TEA YPY+ + +C+ K ++ ATI + D+ GDE AL AV++
Sbjct: 192 YAFKYVIQNRGIDTEASYPYKAIDESCEF-KRNSIGATIHSFVDVKTGDESALQNAVASI 250
Query: 259 QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
P+SV +DAS +F FY SGV N DC DHGV VG+GT NG YW +KNSWG
Sbjct: 251 GPISVAIDASQPSFQFYSSGVYNEPDCSTEILDHGVTAVGYGTL---NGVPYWKVKNSWG 307
Query: 317 ETWGESGYIRILRDA-GLCGIATAASYPV 344
+WG+ GYI + R+ CGIAT ASYPV
Sbjct: 308 TSWGQKGYIFMSRNKQNQCGIATKASYPV 336
>gi|154183745|gb|ABS70713.1| cathepsin L-like cysteine proteinase [Dermacentor variabilis]
Length = 333
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 149/341 (43%), Positives = 210/341 (61%), Gaps = 21/341 (6%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ + + CA VV+ + + + E + A H ++Y+ +E+ +R IF +N + +
Sbjct: 1 MLRISLLCAFVVVTTAASSHEILRTQWEAFKATHKKSYQSNMEELLRFKIFSENSLLVAR 60
Query: 76 ANKEGNR---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF---KYQNVT 129
N++ R +YKLG N+F DL EF ++ GY +R + R STF N +
Sbjct: 61 HNEKYARGLVSYKLGMNQFGDLLPHEFARMFNGYRG-----ARTAGRGSTFLPPANVNYS 115
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P S+DWREKGAVT +K+QGQCGSCWAFS ++EG + G L+ LSEQ LVDCS
Sbjct: 116 SLPQSMDWREKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFLKTGVLVSLSEQNLVDCSET 175
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
NHGC GGLMD AF+YI N G+ TE YPY E+G C +K+ V AT + + D+ +G
Sbjct: 176 FGNHGCEGGLMDNAFQYIKANGGIDTEKSYPYEAEDGECRFKKQN-VGATDTGFVDIEQG 234
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEEN 304
E L +AV+ PVSV +DAS +F Y GV + +C + DHGV VVG+G E+
Sbjct: 235 SEDDLKKAVATVGPVSVAIDASHSSFQLYSEGVYDETECSSEQLDHGVLVVGYGV---ED 291
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G KYWL+KNSW E+WG++GYI++ RD CGIA+AASYP+
Sbjct: 292 GKKYWLVKNSWAESWGDNGYIKMSRDKDNQCGIASAASYPL 332
>gi|449450419|ref|XP_004142960.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 141/315 (44%), Positives = 203/315 (64%), Gaps = 10/315 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ + +E+W +H ++ EK R ++FK+N+ ++ N+ ++ YKL N+F+D+
Sbjct: 34 EESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91
Query: 95 TNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
+N EF Y N + + F Y+ TD+P+S+DWRE+GAV +K+QG+CG
Sbjct: 92 SNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDWRERGAVNAVKEQGRCG 151
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
SCWAFS+VAAVEGI +I +L+ LSEQ+L+DC+ N GC+GG M+ AF++I N G+AT
Sbjct: 152 SCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYRNKGCNGGFMEIAFDFIKRNGGIAT 211
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
E YPY G C + + + I YE +P+ +E AL+QAV+NQPVSV +DA+GR F
Sbjct: 212 ENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDAAGRDFQ 270
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
FY GV + CG +HGV +G+GT E+ G YWL++NSWG WGE GY+R+ R
Sbjct: 271 FYSQGVFDGYCGTELNHGVVAIGYGTTED--GTDYWLVRNSWGVGWGEDGYVRMKRGVEQ 328
Query: 332 --GLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 329 AEGLCGIAMEASYPI 343
>gi|16304178|gb|AAL16954.1|AF426414_1 cathepsin L-like cysteine protease precursor [Delia radicum]
Length = 337
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 149/340 (43%), Positives = 207/340 (60%), Gaps = 17/340 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
V+ +L + Q +S + I E+ + + +H + + E+E+ R+ IF +N I
Sbjct: 4 VLALLALVAFVQAIS----YTDVIKEEWQTFKMEHRKNFLSEVEERFRMKIFNENRHKIA 59
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR-QSSRPSTFKYQNVTD 130
K N+ +G ++KLG N++SD+ EF+ GYN + V R Q +
Sbjct: 60 KHNQLYAQGKVSFKLGLNKYSDMLYHEFKETMNGYNHTMRKVLRAQGFSGIIYIPPANVQ 119
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P S+DWR+ GAVT +KDQG CGSCWAFS+ AA+EG G L+ LSEQ LVDCST
Sbjct: 120 IPKSVDWRQHGAVTAVKDQGHCGSCWAFSSTAALEGQHFRKAGVLVSLSEQNLVDCSTKY 179
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC+GGLMD AF YI +N G+ TE YPY + +C K V AT + + D+P+GD
Sbjct: 180 GNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFTKS-GVGATDTGFVDIPQGD 238
Query: 249 EQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENG 305
E+AL++AV+ PVSV +DAS +F Y GV N +C N DHGV VVG+GT ++ G
Sbjct: 239 EEALMKAVATMGPVSVAIDASHESFQLYSEGVYNEPECDAQNLDHGVLVVGYGT--DKTG 296
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YWL+KNSWG TWG+ GYI++ R+ CGIATA+SYP
Sbjct: 297 LDYWLVKNSWGTTWGDQGYIKMARNQDNQCGIATASSYPT 336
>gi|22653679|sp|Q26636.1|CATL_SARPE RecName: Full=Cathepsin L; Contains: RecName: Full=Cathepsin L
heavy chain; Contains: RecName: Full=Cathepsin L light
chain; Flags: Precursor
gi|505140|dbj|BAA03970.1| cathepsin L precursor [Sarcophaga peregrina]
Length = 339
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 146/318 (45%), Positives = 200/318 (62%), Gaps = 14/318 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
I E+ + QH + Y +E+E+ R+ IF +N I K N+ +G +YKLG N+++D+
Sbjct: 24 IKEEWHTYKLQHRKNYANEVEERFRMKIFNENRHKIAKHNQLFAQGKVSYKLGLNKYADM 83
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSR--PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
+ EF+ GYN + + R+ + +T+ VP S+DWRE GAVT +KDQG C
Sbjct: 84 LHHEFKETMNGYNHTLRQLMRERTGLVGATYIPPAHVTVPKSVDWREHGAVTGVKDQGHC 143
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
GSCWAFS+ A+EG G L+ LSEQ LVDCST N+GC+GGLMD AF YI +N G
Sbjct: 144 GSCWAFSSTGALEGQHFRKAGVLVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGG 203
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASG 269
+ TE YPY + +C K + AT + + D+P+GDE+ + +AV+ PVSV +DAS
Sbjct: 204 IDTEKSYPYEGIDDSCHFNK-ATIGATDTGFVDIPEGDEEKMKKAVATMGPVSVAIDASH 262
Query: 270 RAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
+F Y GV N +C N DHGV VVG+GT +E+G YWL+KNSWG TWGE GYI++
Sbjct: 263 ESFQLYSEGVYNEPECDEQNLDHGVLVVGYGT--DESGMDYWLVKNSWGTTWGEQGYIKM 320
Query: 328 LRDA-GLCGIATAASYPV 344
R+ CGIATA+SYP
Sbjct: 321 ARNQNNQCGIATASSYPT 338
>gi|269784818|ref|NP_001161481.1| cathepsin L1 precursor [Gallus gallus]
Length = 353
Score = 271 bits (693), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 151/354 (42%), Positives = 215/354 (60%), Gaps = 15/354 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+ L S + M V + ++ C + + +P + + W + H + Y E E++
Sbjct: 4 LFLARRLSRFVNMNVCLTILSLCLGLAFAAPRV-DPDLDSHWQLWKSWHSKDYH-EREES 61
Query: 61 MRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQS 117
R ++++NL+ IE N + G +YKLG N+F D+T EEFR L GY S +
Sbjct: 62 WRRVVWEKNLKMIELHNLDHSLGKHSYKLGMNQFGDMTAEEFRQLMNGYKH---KKSERK 118
Query: 118 SRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIE 177
R S F + + P S+DWREKG VT +KDQGQCGSCWAFS A+EG GKL+
Sbjct: 119 YRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVS 178
Query: 178 LSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
LSEQ LVDCS N GC+GGLMD+AF+Y+ +N G+ +E YPY ++ K + A
Sbjct: 179 LSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNA 238
Query: 236 ATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGV 292
A + + D+P+G E+AL++AV++ PVSV +DA +F FY+SG+ DC + + DHGV
Sbjct: 239 ANDTGFVDIPQGHERALMKAVASVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGV 298
Query: 293 AVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
VVG+G E+ +G KYW++KNSWGE WG+ GYI + +D CGIATAASYP+
Sbjct: 299 LVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 352
>gi|413953665|gb|AFW86314.1| hypothetical protein ZEAMMB73_546353 [Zea mays]
Length = 233
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 128/233 (54%), Positives = 165/233 (70%), Gaps = 12/233 (5%)
Query: 120 PSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIE 177
P+ F+Y+NV+ +PT+IDWR KGAVT IKDQGQCG CWAFSAVAA EGI +I+ GKL+
Sbjct: 4 PTGFRYENVSADALPTTIDWRTKGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLVS 63
Query: 178 LSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
L+EQ+LVDC ++ GC GGLMD AF++II+N GL TE+ YPY +G C + A
Sbjct: 64 LAEQELVDCDVHDEDQGCEGGLMDDAFKFIIKNGGLTTESSYPYTAADGKCKSGSNS--A 121
Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVV 295
ATI YED+P DE AL++AV+NQPVSV VD F FY GV+ CG + DHG+A +
Sbjct: 122 ATIKGYEDVPANDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAI 181
Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G+G + +G KYWL+KNSWG TWGE+GY+R+ +D G+CG+A SYP
Sbjct: 182 GYG--KTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKRGMCGLAMEPSYPT 232
>gi|326520387|dbj|BAK07452.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 349
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 146/344 (42%), Positives = 203/344 (59%), Gaps = 17/344 (4%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIV--EKHEQWMAQHGRTYKDELEKAMRLNIFK 67
+IP+ V++ + A SG + ++ ++ QW A H R+Y E+ R +++
Sbjct: 11 VIPILVLLTGGLFAAFPAASGGRVDAGDMLMMDRFRQWQATHNRSYLSAEERLRRFEVYR 70
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG--YNRPVPSVSRQSSRPSTFKY 125
N+EYI+ N+ G TY+LG N+F+DLT EEF A Y G + + + S+
Sbjct: 71 TNVEYIDATNRRGGLTYELGENQFADLTGEEFLARYAGGHTGSAITTAAEADGLWSSGGS 130
Query: 126 QNV--TDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
D P S+DWR KGAVT +K+QG QC SCWAFSAVA +E + I GKL+ LSEQQ
Sbjct: 131 DGSLEADPPASVDWRAKGAVTPVKNQGSQCYSCWAFSAVATMESLYFIKTGKLVALSEQQ 190
Query: 183 LVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
LVDC + GC+ G +AF++I+EN G+ T A YPY+ G C K A TI+ +
Sbjct: 191 LVDCDKYDGGCNKGYYHRAFQWIMENGGITTAAQYPYKAVRGACSAAKP---AVTITGHL 247
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
+ K +E AL AV+ QP+ V ++ + FYKSGV +A CG H V VG+G +
Sbjct: 248 AVAK-NELALQSAVARQPIGVAIEVP-ISMQFYKSGVFSAACGIQMSHAVVTVGYGA--D 303
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYP 343
+G KYWL+KNSWG+TWGE+GYIR+ RD GLCGIA +YP
Sbjct: 304 ASGLKYWLVKNSWGQTWGEAGYIRMRRDVGGGGLCGIALDTAYP 347
>gi|66735056|gb|AAY53767.1| cysteine protease [Saprolegnia parasitica]
Length = 523
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 192/311 (61%), Gaps = 12/311 (3%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
K WM + + LE R +F N + IE NK+ + ++ +G NE+S LT +EF+
Sbjct: 27 KFLSWMKKFAVKL-NPLEWVHRFEVFILNDQRIEAHNKDASSSFTMGHNEYSHLTFDEFK 85
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQ-NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
L TG R PS + ++ + N+TDVP +DW E+G VT +K+QG CGSCWAFS
Sbjct: 86 KLRTGL-RVSPSYIQSRAKYALMAPAVNMTDVPNEMDWVEQGGVTPVKNQGMCGSCWAFS 144
Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYP 218
A+EG ++ +L+ +SEQ+LVDC + + GC+GGLMD AF+++ +KGL E DYP
Sbjct: 145 TTGAIEGAAFVSSKQLVSVSEQELVDCDHNGDMGCNGGLMDNAFKWVKTHKGLCKEEDYP 204
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
Y +EGTC +K K V ++ + D+P DEQAL AV+ QPVSV ++A F FYKSG
Sbjct: 205 YHAKEGTCALKKCKPV-TKVTAFHDVPANDEQALKAAVAKQPVSVAIEADQPEFQFYKSG 263
Query: 279 VLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLC 334
V + CG DHGV VVG+G EE G KYW +KNSWG WG+ GYI++ R + G C
Sbjct: 264 VFDKSCGTKLDHGVLVVGYG---EEGGKKYWKVKNSWGADWGDKGYIKLAREFGPETGQC 320
Query: 335 GIATAASYPVA 345
G+A SYP A
Sbjct: 321 GVAMVPSYPTA 331
>gi|299507656|gb|ADJ21807.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 151/342 (44%), Positives = 210/342 (61%), Gaps = 17/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + +L + C S +S S+ +P + E + W + H + Y E E+ R ++++NL+
Sbjct: 1 MLPVAVLAV-CLSAALSAPSL-DPQLDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G TY+LG N F D+T+EEFR + GY R S + + S F N
Sbjct: 58 IELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMNGYKRK----SERKFKGSLFMEPNFL 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+ P S+DWR+ G VT +KDQGQCGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 114 EAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRP 173
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+YI +N+GL +E YPY + + K +A + + D+P G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSG 233
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ PVSV +DA +F FY+SG+ +C + DHGV VVG+G E+
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDV 293
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+G KYW++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 294 DGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|2507252|sp|P14080.2|PAPA2_CARPA RecName: Full=Chymopapain; AltName: Full=Papaya proteinase II;
Short=PPII; Flags: Precursor
gi|1332461|emb|CAA66378.1| chymopapain [Carica papaya]
Length = 352
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 139/345 (40%), Positives = 203/345 (58%), Gaps = 14/345 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
K + +II + ++ A G S + + +E+ + WM +H + Y+ EK R
Sbjct: 9 KIIFLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+ Y G+ +
Sbjct: 69 FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVGF-VAEDFTGLEHFDNED 126
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F Y++VT+ P SIDWR KGAVT +K+QG CGSCWAFS +A VEGI +I G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQE 186
Query: 183 LVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
LVDC ++GC GG + +Y + N G+ T YPY+ ++ C + I+ Y+
Sbjct: 187 LVDCDKHSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYK 245
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
+P E + L A++NQP+SV V+A G+ F YKSGV + CG DH V VG+GT++
Sbjct: 246 RVPSNCETSFLGALANQPLSVLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDG 305
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
+N Y +IKNSWG WGE GY+R+ R + G CG+ ++ YP
Sbjct: 306 KN---YIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|55740406|gb|AAV63979.1| cathepsin L1 precursor [Artemia parthenogenetica]
Length = 338
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 148/343 (43%), Positives = 208/343 (60%), Gaps = 15/343 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ +I L+ Q+ + S+ E H + A H + Y +LE+ +R+ I+ +N
Sbjct: 1 MKQITLIFLLAAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKLRMKIYLENK 59
Query: 71 EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ K N ++G ++Y++ N+F DL + EFR++ GY + SR S + + N
Sbjct: 60 HKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPAN 119
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
V +VP S+DWREKGA+T +KDQGQCGSCWAFS+ A+EG T GKL+ LSEQ L+DCS
Sbjct: 120 V-EVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCS 178
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N GC+GGLMD+AF+YI +NKG+ TE YPY E+G C + A + D+P
Sbjct: 179 GKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDGVC-RYNPRNRGAVDRGFVDIP 237
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSG-VLNADC-GNNCDHGVAVVGFGTAEE 302
G+E L AV+ PVSV +DAS +F FY G C ++ DHGV VVG+G+
Sbjct: 238 SGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGXYYEPSCDSDDLDHGVLVVGYGS--- 294
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+NG YWL+KNSW E WG+ GYI+I R+ CG+ATAASYP+
Sbjct: 295 DNGEDYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337
>gi|357114837|ref|XP_003559200.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 371
Score = 271 bits (692), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 141/325 (43%), Positives = 188/325 (57%), Gaps = 28/325 (8%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ +W A H RTY D E+ R +++ N+EYIE N+ G TY+LG N+F+DLT+E
Sbjct: 55 MLDRFVRWQAAHNRTYGDAEERLRRFQVYRANIEYIEATNRRGGLTYELGENQFADLTSE 114
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV---------------PTSIDWREKGA 142
EF ++Y S R TDV P S DWR KGA
Sbjct: 115 EFLSMYA-------SSYDAGDRADDEAALITTDVAGDGAWSDGDLEALPPPSWDWRAKGA 167
Query: 143 VTHIKDQGQ-CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKA 201
VT K+QG C SCWAF VA +EG+T I GKLI LSEQQLVDC + GC+ G +
Sbjct: 168 VTPPKNQGPTCSSCWAFVTVATIEGLTFIKTGKLISLSEQQLVDCDMYDGGCNTGSYSRG 227
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
F +++EN GL TEA+YPY G C+ K AA I+ +P +E + +AV+ QPV
Sbjct: 228 FRWVLENGGLTTEAEYPYTAARGPCNRAKSAHHAAKITGQGRIPPQNELVMQKAVAGQPV 287
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGE 321
V ++ G FYK+GV + CG N H V VVG+G + +GAKYW++KNSWG+ WGE
Sbjct: 288 GVAIEV-GSGMQFYKTGVYSGPCGTNLAHAVTVVGYGV-DPASGAKYWIVKNSWGQAWGE 345
Query: 322 SGYIRILRDA---GLCGIATAASYP 343
G+IR+ RD GLCGIA +YP
Sbjct: 346 RGFIRMRRDVGGPGLCGIALDVAYP 370
>gi|357133074|ref|XP_003568153.1| PREDICTED: cysteine proteinase RD21a-like [Brachypodium distachyon]
Length = 565
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 186/318 (58%), Gaps = 23/318 (7%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR--------TYKLGTNEFSDL 94
E W A+HG+ Y E+A RL F N ++ N G +Y L N F+DL
Sbjct: 43 EAWCAEHGKAYASPGERAARLAAFADNAAFVAAHNAGGGGAGGSNAAPSYTLALNAFADL 102
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN---VTDVPTSIDWREKGAVTHIKDQGQ 151
T+ EFRA G +V + PS + V VP ++DWR+ GAVT +KDQG
Sbjct: 103 THAEFRAARLGRL----AVGGARAPPSEGGFAGSVGVGAVPEALDWRQSGAVTKVKDQGS 158
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKG 210
CG+CW+FSA A+EGI +I G LI LSEQ+L+DC N GC GGLMD A+ ++I+N G
Sbjct: 159 CGACWSFSATGAIEGINKIKTGSLISLSEQELIDCDRSYNAGCGGGLMDYAYRFVIKNGG 218
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
+ TE DYPYR +GTC+ K K TI Y D+P E +LLQAV+ QP+SV + S R
Sbjct: 219 IDTEDDYPYREADGTCNKNKLKRHVVTIDGYSDVPANKEDSLLQAVAQQPISVGICGSAR 278
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
AF Y G+ + C + DH V +VG+G+ E G YW++KNSWGE WG GY+ + R+
Sbjct: 279 AFQLYSQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGERWGMKGYMHMHRN 335
Query: 331 ----AGLCGIATAASYPV 344
+G+CGI AS+P
Sbjct: 336 TGSSSGICGINMMASFPT 353
>gi|194757786|ref|XP_001961143.1| GF13722 [Drosophila ananassae]
gi|190622441|gb|EDV37965.1| GF13722 [Drosophila ananassae]
Length = 417
Score = 271 bits (692), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 153/356 (42%), Positives = 211/356 (59%), Gaps = 19/356 (5%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKA 60
+V+ F +FI+ + IL A Q + + + +H + Y DE E+
Sbjct: 68 VVMLFVNAFIL----VFILKKRKAYQNLKATEEQPRTSYAATSTHVLEHRKNYLDETEER 123
Query: 61 MRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR-- 115
RL IF +N I K N+ G +YKL N+++D+ + EFR L G+N + R
Sbjct: 124 FRLKIFNENKHKIAKHNQLWASGKVSYKLAVNKYADMLHHEFRQLMNGFNYTLHKELRAA 183
Query: 116 -QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGK 174
+S + TF +P S+DWR+KGAVT +KDQG CGSCWAFS+ A+EG G
Sbjct: 184 DESFKGVTFISPEHVTLPKSVDWRDKGAVTGVKDQGHCGSCWAFSSTGALEGQHYRKSGV 243
Query: 175 LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
L+ LSEQ LVDCST N+GC+GGLMD AF YI +N G+ TE YPY + +C K
Sbjct: 244 LVSLSEQNLVDCSTKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEALDDSCHFNK-G 302
Query: 233 AVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCD 289
+ AT + D+P+G+E+ L +AV+ PVSV +DAS +F FY GV + C N D
Sbjct: 303 TIGATDRGFVDIPQGNEKKLAEAVATIGPVSVAIDASHESFQFYSEGVYVEPACDAQNLD 362
Query: 290 HGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
HGV VVGFGT +E+G YWL+KNSWG TWG+ G+I++LR+ CGIA+A+SYP+
Sbjct: 363 HGVLVVGFGT--DESGQDYWLVKNSWGTTWGDKGFIKMLRNKDNQCGIASASSYPL 416
>gi|112490572|pdb|2FO5|A Chain A, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490573|pdb|2FO5|B Chain B, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490574|pdb|2FO5|C Chain C, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
gi|112490575|pdb|2FO5|D Chain D, Crystal Structure Of Recombinant Barley Cysteine
Endoprotease B Isoform 2 (Ep-B2) In Complex With
Leupeptin
Length = 262
Score = 270 bits (691), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 131/225 (58%), Positives = 159/225 (70%), Gaps = 10/225 (4%)
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
V+D+P S+DWR+KGAVT +KDQG+CGSCWAFS V +VEGI I G L+ LSEQ+L+DC
Sbjct: 1 VSDLPPSVDWRQKGAVTGVKDQGKCGSCWAFSTVVSVEGINAIRTGSLVSLSEQELIDCD 60
Query: 188 T-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD---NQKEKAVAATISKYED 243
T DN GC GGLMD AFEYI N GL TEA YPYR GTC+ + V I ++D
Sbjct: 61 TADNDGCQGGLMDNAFEYIKNNGGLITEAAYPYRAARGTCNVARAAQNSPVVVHIDGHQD 120
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+P E+ L +AV+NQPVSV V+ASG+AF FY GV +CG DHGVAVVG+G AE+
Sbjct: 121 VPANSEEDLARAVANQPVSVAVEASGKAFMFYSEGVFTGECGTELDHGVAVVGYGVAED- 179
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G YW +KNSWG +WGE GYIR+ +D+ GLCGIA ASYPV
Sbjct: 180 -GKAYWTVKNSWGPSWGEQGYIRVEKDSGASGGLCGIAMEASYPV 223
>gi|449513868|ref|XP_002191976.2| PREDICTED: cathepsin L1-like [Taeniopygia guttata]
Length = 443
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 152/353 (43%), Positives = 216/353 (61%), Gaps = 20/353 (5%)
Query: 2 VLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAM 61
+ K +++ +IP + +++ + R +P + + W + H + Y E E+
Sbjct: 100 LRKLQRNQVIP------VTKENSTETLHCRWQVDPELDGHWQLWKSWHRKDYH-EREEGW 152
Query: 62 RLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS 118
R ++++NL+ IE N + G +YKLG N+F D+T EEFR L GY V S +
Sbjct: 153 RRVVWEKNLKMIEIHNLDHALGKHSYKLGMNQFGDMTTEEFRQLMNGY---VHKKSERKY 209
Query: 119 RPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIEL 178
R S F N + P S+DWREKG VT +KDQGQCGSCWAFS A+EG GKL+ L
Sbjct: 210 RGSQFLEPNFLEAPRSVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSL 269
Query: 179 SEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
SEQ LVDCS N GC+GGLMD+AF+Y+ +N G+ +E YPY ++ K + AA
Sbjct: 270 SEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNGGIDSEESYPYTAKDDEDCRYKAEYNAA 329
Query: 237 TISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVA 293
+ + D+P+G E+AL++AV+ PVSV +DA +F FY+SG+ DC + + DHGV
Sbjct: 330 NDTGFVDIPQGHERALMKAVAAVGPVSVAIDAGHSSFQFYQSGIYYEPDCSSEDLDHGVL 389
Query: 294 VVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
VVG+G E+ +G KYW++KNSWGE WG+ GYI + +D CGIATAASYP+
Sbjct: 390 VVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 442
>gi|342305188|dbj|BAK55648.1| cathepsin L [Oplegnathus fasciatus]
Length = 336
Score = 270 bits (691), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 151/342 (44%), Positives = 210/342 (61%), Gaps = 17/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + +L + C S +S S+ +P + E + W + H + Y E E+ R ++++NL+
Sbjct: 1 MLPVAVLAV-CLSAALSAPSL-DPQLDEHWDLWKSWHTKKYH-EKEEGWRRMVWEKNLKK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G TY+LG N F D+T+EEFR + GY R S + + S F N
Sbjct: 58 IELHNLEHSMGEHTYRLGMNHFGDMTHEEFRQIMYGYKRK----SERKFKGSLFMEPNFL 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+ P S+DWR+ G VT +KDQGQCGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 114 EAPRSVDWRDNGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGKLVSLSEQNLVDCSRP 173
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+YI +N+GL +E YPY + + K +A + + D+P G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSANDTGFIDIPSG 233
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ PVSV +DA +F FY+SG+ +C + DHGV VVG+G E+
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFEGEDV 293
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+G KYW++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 294 DGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|359359068|gb|AEV40975.1| putative cysteine protease [Oryza punctata]
Length = 464
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 146/316 (46%), Positives = 203/316 (64%), Gaps = 18/316 (5%)
Query: 42 HEQWMAQH---GRTYKDEL-EKAMRLNIFKQNLEYIE--KANKEGNRTYKLGTNEFSDLT 95
++ W+A+H G ++ + E R +F NL++++ A+ +G+ ++LG N F+DLT
Sbjct: 66 YDLWVARHRHGGGSHNGFVGEYERRFRVFWDNLKFVDAHNAHADGHGGFRLGMNRFADLT 125
Query: 96 NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAV-THIKDQGQCGS 154
N+EFRA Y G R +++ V +P S+DWR+KGAV + +K+QGQCGS
Sbjct: 126 NDEFRAAYLG----TTPAGRGRHVGEMYRHDGVEALPDSVDWRDKGAVVSPVKNQGQCGS 181
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLA 212
CWAFSAVAAVEGI +I G+L+ LSEQ+LV+C+ N GC+GG+MD AF +I N GL
Sbjct: 182 CWAFSAVAAVEGINKIVTGELVSLSEQELVECARNGGNSGCNGGIMDDAFAFITRNGGLD 241
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYPY +G CD K+ +I +ED+P+ DE +L +AV++QPVSV +DA GR F
Sbjct: 242 TEEDYPYTAMDGKCDLAKKSRKVVSIDGFEDVPENDELSLQKAVAHQPVSVAIDAGGREF 301
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
Y SGV CG + DHGV VG+GT + G YW ++NSWG WGE+GYIR+ R+
Sbjct: 302 QLYDSGVFTGRCGTSLDHGVVAVGYGT-DAATGTDYWTVRNSWGPDWGENGYIRMERNVT 360
Query: 331 --AGLCGIATAASYPV 344
G CGIA ASYP+
Sbjct: 361 ARTGKCGIAMMASYPI 376
>gi|302845628|ref|XP_002954352.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
gi|300260282|gb|EFJ44502.1| hypothetical protein VOLCADRAFT_76255 [Volvox carteri f.
nagariensis]
Length = 489
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 146/351 (41%), Positives = 211/351 (60%), Gaps = 24/351 (6%)
Query: 14 FVIIILVITCASQVVSG-----RSMHEPSIVEKH-------EQWMAQHGRTYKDEL-EKA 60
F+I L++ + V + R HE +++ +QWM Q+ + Y +++ E
Sbjct: 5 FLIAALLVAASGGVGAAPELQLREQHEKLLLDAKANPMAAFQQWMMQYTKAYANDIKELE 64
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVS-RQSSR 119
R +++ +NL YI N ++ L N F+DLT +EFR GY+ S R S
Sbjct: 65 TRFSVWLENLNYILAYNAR-TTSHWLHLNAFADLTTDEFRN-RLGYDFKARQASNRLQSS 122
Query: 120 PSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELS 179
P + + +PT IDWR+KGAVT +K+QGQCGSCWAF+ +VEGI I G+L LS
Sbjct: 123 PFIYDNVDANQLPTEIDWRKKGAVTEVKNQGQCGSCWAFATTGSVEGINAIVTGELASLS 182
Query: 180 EQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
EQ+LVDC TD + GCSGGLMD A+++II+N GL TE DYPY E+G C K+ TI
Sbjct: 183 EQELVDCDTDEDRGCSGGLMDYAYQWIIKNGGLDTEDDYPYTAEDGVCVAAKKNRRVVTI 242
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGF 297
Y D+P+ DE AL +A ++QP++V ++A ++F Y GV + CG + +HGV VVG+
Sbjct: 243 DGYVDIPENDEVALKKAAAHQPIAVAIEADAKSFQLYGGGVYDDPTCGTSLNHGVLVVGY 302
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
G ++ + YW++KNSWG WG++GYIR+ A G+CGIA A S+P
Sbjct: 303 G--KDPHFGNYWIVKNSWGPEWGDNGYIRLRMGAEDVQGMCGIAMAPSFPT 351
>gi|320169652|gb|EFW46551.1| cathepsin L2 [Capsaspora owczarzaki ATCC 30864]
Length = 325
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 147/307 (47%), Positives = 185/307 (60%), Gaps = 11/307 (3%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALY 103
+W A H R Y E+A+R I+ NLE I + N G +Y LG NEF DL + EF A Y
Sbjct: 23 EWKALHNRQYASAQEEALRQEIYLSNLELINEHNAAGRHSYTLGMNEFGDLAHHEFAAKY 82
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
G V+ S S+ + +P S+DWR G VT +K+QGQCGSCW+FS +
Sbjct: 83 LGVR--FNGVNATKSFASSTYLPRMVSLPDSVDWRTAGIVTPVKNQGQCGSCWSFSTTGS 140
Query: 164 VEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRH 221
VEG G L+ LSEQ LVDCS+ N GC+GGLMD AFEYII+N G+ TEA YPY
Sbjct: 141 VEGQHARKTGTLVSLSEQNLVDCSSQEGNEGCNGGLMDDAFEYIIKNGGIDTEASYPYTA 200
Query: 222 EEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL 280
GTC + AT++ Y+D+ G E L AV+ PVSV +DAS F FY +GV
Sbjct: 201 TTGTCKFNAAN-IGATVASYQDIITGSESDLQNAVATVGPVSVAIDASHINFQFYFTGVY 259
Query: 281 N-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIA 337
N C DHGV VG+GT+ E G YWL+KNSWG TWG++GYI + R+A CGIA
Sbjct: 260 NEKKCSTTQLDHGVLAVGYGTSTE--GKDYWLVKNSWGATWGKAGYIWMSRNADNQCGIA 317
Query: 338 TAASYPV 344
T+ASYP+
Sbjct: 318 TSASYPL 324
>gi|326494040|dbj|BAJ85482.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 355
Score = 270 bits (690), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 146/318 (45%), Positives = 187/318 (58%), Gaps = 18/318 (5%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEF 99
+HE+WMA++GR Y D EK R +F N +I+ N+ GNRTY LG N FSDLTNEEF
Sbjct: 39 HRHERWMAKYGRVYADAAEKLRRQEVFAANARHIDAVNRAGNRTYTLGLNHFSDLTNEEF 98
Query: 100 RALYTGY-NRPVPSVSR-QSSRPSTFKYQNVTD-----VPTSIDWREKGAVTHIKDQGQC 152
+ GY ++P P R + S P+ NVTD P S+DWR +GAVT +K QG C
Sbjct: 99 AQTHLGYRHQPGPGGLRPEDSSPAAAV--NVTDAQLQSTPDSVDWRARGAVTPVKHQGHC 156
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
GSCWAF+AVAA EG+ QI G LI +SEQQ++DC+ C G ++ A YI + GL
Sbjct: 157 GSCWAFAAVAATEGLVQIATGNLISMSEQQVLDCTGGTSSCKSGYVNAALTYITASGGLQ 216
Query: 213 TEADYPYRHEEGTCDN--QKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
TEA Y Y E+G C + + AA + GDE AL V+ QPV+V V+A
Sbjct: 217 TEAAYAYSAEQGACRSGGASPNSAAAVGVHRSAMLNGDEGALQVLVAGQPVAVAVEAE-P 275
Query: 271 AFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
FH YKSGV + CG H V VVG+G + +G YW++KN WG WGE GY+R+
Sbjct: 276 DFHHYKSGVYVGSPSCGQKLHHAVTVVGYGA--DGDGQGYWVVKNQWGAGWGEVGYMRLT 333
Query: 329 RDAGL--CGIATAASYPV 344
R G CG+AT A YP
Sbjct: 334 RGNGGNNCGMATHAYYPT 351
>gi|194352758|emb|CAQ00107.1| papain-like cysteine proteinase [Hordeum vulgare subsp. vulgare]
Length = 457
Score = 270 bits (690), Expect = 8e-70, Method: Compositional matrix adjust.
Identities = 136/314 (43%), Positives = 189/314 (60%), Gaps = 16/314 (5%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK------EGNRTYKLGTNEFSDL 94
+ E W A+HG+ Y E+A RL F +N ++ N G +Y L N F+DL
Sbjct: 38 QFEAWCAEHGKAYATPGERAARLAAFAENAAFVAAHNDAVASSGPGGPSYTLALNAFADL 97
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSIDWREKGAVTHIKDQGQCG 153
T++EFRA G P S PS ++ V VP ++DWR+ GAVT +KDQG CG
Sbjct: 98 THDEFRAARLGRLAVGPGPLGAPS-PSDGGFEGRVGAVPDALDWRQSGAVTKVKDQGSCG 156
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
+CW+FSA A+EGI +IT G L+ LSEQ+L+DC N GC GGLM A++++I+N G+
Sbjct: 157 ACWSFSATGAMEGINKITTGSLLSLSEQELIDCDRSYNTGCGGGLMTYAYKFVIKNGGID 216
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYP+R +GTC+ K K TI Y+++P E LLQAV+ QP+SV + S RAF
Sbjct: 217 TEDDYPFREADGTCNKNKLKKHVVTIDGYKEVPSSKEDLLLQAVAQQPISVGICGSARAF 276
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-- 330
Y G+ + C + DH V +VG+G+ E G YW++KNSWGE WG GY+ + R+
Sbjct: 277 QLYSQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGERWGMKGYMHMHRNTG 333
Query: 331 --AGLCGIATAASY 342
+G+CGI AS+
Sbjct: 334 SSSGICGINMMASF 347
>gi|432114311|gb|ELK36239.1| Cathepsin S [Myotis davidii]
Length = 340
Score = 270 bits (689), Expect = 9e-70, Method: Compositional matrix adjust.
Identities = 153/348 (43%), Positives = 216/348 (62%), Gaps = 22/348 (6%)
Query: 6 EKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLN 64
E+ + M ++++++ C+S + +H+ ++ H + W +G+ Y +E E+ R
Sbjct: 3 EQQTVQRMKWLLLVLLGCSSAMAQ---LHKDPTLDHHWDLWKKTYGKQYTEENEEVTRRF 59
Query: 65 IFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
I+++NL+Y+ N E G +Y LG N +D+T+EE L + VPS Q R
Sbjct: 60 IWEKNLKYVMLHNLEHSMGMHSYDLGMNHLADMTSEEVMLLMSSLR--VPS---QWQRNV 114
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
TFK +P S+DWR+KG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q
Sbjct: 115 TFKSNPNQKLPDSMDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSVQ 174
Query: 182 QLVDCST---DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
LVDCST N GC+GG M +AF+YII+N G+ +EA YPY+ +G C K AAT
Sbjct: 175 NLVDCSTGKYSNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKCQ-YDVKNRAATC 233
Query: 239 SKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVG 296
SKY +LP G+E+AL +AV+N+ PVSV +DAS +F Y+SGV + C N +HGV VG
Sbjct: 234 SKYVELPFGNEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDKACTLNVNHGVLAVG 293
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
+G NG YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 294 YGNY---NGKDYWLVKNSWGLHFGEQGYIRMARNSGNHCGIASYPSYP 338
>gi|226499806|ref|NP_001151335.1| cysteine protease 1 [Zea mays]
gi|195645896|gb|ACG42416.1| cysteine protease 1 precursor [Zea mays]
Length = 258
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 148/266 (55%), Positives = 184/266 (69%), Gaps = 20/266 (7%)
Query: 89 NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT-----DVPTSIDWREKGAV 143
NEF+D+TN+EF A+YTG RPVP+ ++ + + FKY NVT D ++DWR+KGAV
Sbjct: 4 NEFADMTNDEFMAMYTGL-RPVPAGAK---KMAGFKYGNVTLSDADDDQQTVDWRQKGAV 59
Query: 144 THIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAF 202
T IKDQ QCG CWAF+AVAAVEGI QIT G L+ LSEQQ++DC TD N+GC+GG +D AF
Sbjct: 60 TGIKDQRQCGCCWAFAAVAAVEGIHQITTGNLVSLSEQQVLDCDTDGNNGCNGGYIDNAF 119
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
+YI+ N GLATE YPY + C Q + VAA IS Y+D+P GDE AL AV+NQPVS
Sbjct: 120 QYIVGNGGLATEDAYPYTAAQAMC--QSVQPVAA-ISGYQDVPSGDEAALAAAVANQPVS 176
Query: 263 VCVDASGRAFHFYKSGVLN-ADCGN--NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
V +DA F Y GV+ A C N +H V VG+GTAE+ G YWL+KN WG+ W
Sbjct: 177 VAIDA--HNFQLYGGGVMTAASCSTPPNLNHAVTAVGYGTAED--GTPYWLLKNQWGQNW 232
Query: 320 GESGYIRILRDAGLCGIATAASYPVA 345
GE GY+R+ R A CG+A ASYPVA
Sbjct: 233 GEGGYLRLERGANACGVAQQASYPVA 258
>gi|383849553|ref|XP_003700409.1| PREDICTED: cathepsin L-like [Megachile rotundata]
Length = 343
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 149/345 (43%), Positives = 212/345 (61%), Gaps = 23/345 (6%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWM---AQHGRTYKDELEKAMRLNIFKQNLE 71
+++++VITCA+ V S E +++W+ +H + YK E E+ +R+ I+ +N
Sbjct: 4 ILLLIVITCAA--VQAISFFELV----NQEWINFKMEHKKCYKHEAEERLRMKIYMKNKL 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP--STFKYQ 126
I + N + TY+L N++ D+ N EF+ + GYNR + R P + F
Sbjct: 58 QIAQHNCDYELKKVTYRLKINKYGDMLNHEFKNMLNGYNRTINHTLRNERLPVGAAFIEP 117
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
++P +DWR+ GAVT +KDQG CGSCWAFSA ++EG G L+ LSEQ L+DC
Sbjct: 118 CNVELPKMVDWRKCGAVTEVKDQGHCGSCWAFSATGSLEGQHFRRTGVLVSLSEQNLIDC 177
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N+GC+GGLMD+AF YI +NKGL TE YPY E+ C K + A+ + + D+
Sbjct: 178 SGSYGNNGCNGGLMDQAFSYIKDNKGLDTEKTYPYEGEDDKCRYDKRSSGASDVG-FVDI 236
Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAE 301
P GDEQ L AV+ PVSV +DAS ++F FY G+ +C + N DHGV VVG+GT E
Sbjct: 237 PVGDEQKLKAAVATVGPVSVAIDASHQSFQFYSDGIYFEPECSSTNLDHGVLVVGYGTDE 296
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPVA 345
E G YW++KNSWGE+WGE GYI++ R+ CGIA++ASYP+
Sbjct: 297 E--GRDYWIVKNSWGESWGEKGYIKMARNIDNHCGIASSASYPIV 339
>gi|158300877|ref|XP_001689282.1| AGAP011828-PA [Anopheles gambiae str. PEST]
gi|157013372|gb|EDO63348.1| AGAP011828-PA [Anopheles gambiae str. PEST]
Length = 344
Score = 270 bits (689), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 152/350 (43%), Positives = 209/350 (59%), Gaps = 29/350 (8%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNL 70
F+I+IL A+ +S + + E+W A QH + Y E E+ +R+ I+ QN
Sbjct: 4 FLILILGFVAAANAISIFELVK-------EEWTAFKLQHRKKYDSETEERIRMKIYVQNK 56
Query: 71 EYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVS-------RQSSRP 120
I K N+ G ++L N+++DL +EEF G+NR V + P
Sbjct: 57 HKIAKHNQRYDLGQEKFRLRVNKYADLLHEEFVHTLNGFNRSVSGKGQLLRGELKPIEEP 116
Query: 121 STFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
T+ DVPT++DWR KGAVT +KDQG CGSCW+FSA A+EG GKL+ LSE
Sbjct: 117 VTWIEPANVDVPTAMDWRTKGAVTQVKDQGHCGSCWSFSATGALEGQHFRKTGKLVSLSE 176
Query: 181 QQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
Q LVDCS N+GC+GG+MD AF+YI +NKG+ TE YPY + C + KAV AT
Sbjct: 177 QNLVDCSQKYGNNGCNGGMMDFAFQYIKDNKGIDTEKSYPYEAIDDEC-HYNPKAVGATD 235
Query: 239 SKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVV 295
+ D+P+G+E+AL++A++ PVSV +DAS +F FY GV C + DHGV V
Sbjct: 236 KGFVDIPQGNEKALMKALATVGPVSVAIDASHESFQFYSEGVYYEPQCDSEQLDHGVLAV 295
Query: 296 GFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G+GT E+ G YWL+KNSWG TWG+ GY+++ R+ CGIAT ASYP+
Sbjct: 296 GYGTTED--GEDYWLVKNSWGTTWGDQGYVKMARNRDNHCGIATTASYPL 343
>gi|357130486|ref|XP_003566879.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 354
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 143/321 (44%), Positives = 196/321 (61%), Gaps = 15/321 (4%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
++ +HE+WMA+ GR YKD EKA R +F N +++ N+ GNRTY LG N FSDLT+
Sbjct: 33 TVASRHERWMARFGRAYKDADEKARRQEVFGANARHVDAVNRSGNRTYTLGLNHFSDLTD 92
Query: 97 EEFRALYTGY--NRPVPS----VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG 150
EF + GY ++P P Q +T DVP S+DWR +GAVT IK+Q
Sbjct: 93 HEFLQQHLGYRHHQPGPGGLLRPEDQDMSKATALADYGQDVPDSVDWRAQGAVTEIKNQR 152
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKG 210
CGSCWAF+AVAA EG+ +I G LI +SEQQ++DC+ + C GG ++ A Y+ + G
Sbjct: 153 SCGSCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGGGNTCDGGDINAALRYVAASGG 212
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATI--SKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
L EA Y Y ++G C AA++ +++ L GDE AL + QPV+V ++AS
Sbjct: 213 LQPEAAYAYAAQKGACRGASPANSAASVGGARFARL-GGDEGALRGLAAGQPVAVALEAS 271
Query: 269 GRAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
F YKSGV +A CG +HGV VVG+G AE+++G +YW++KN WG WGE GY+R
Sbjct: 272 EPDFRHYKSGVYAGSASCGRRLNHGVTVVGYG-AEDDSGDEYWVVKNQWGTLWGEKGYMR 330
Query: 327 ILRD--AGL-CGIATAASYPV 344
+ R AG CGIA+ A YP
Sbjct: 331 VARGDVAGANCGIASYAYYPT 351
>gi|222641485|gb|EEE69617.1| hypothetical protein OsJ_29194 [Oryza sativa Japonica Group]
Length = 360
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 134/302 (44%), Positives = 182/302 (60%), Gaps = 11/302 (3%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ W H R+Y E R +++++N E+I+ N G+ TY+L NEF+DLT E
Sbjct: 47 MMDRFRAWQGAHNRSYPSAEEALQRFDVYRRNAEFIDAVNLRGDLTYQLAENEFADLTEE 106
Query: 98 EFRALYTGY---NRPVPS---VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ-G 150
EF A YTGY + PV + ++F Y+ DVP S+DWR +GAV K Q
Sbjct: 107 EFLATYTGYYAGDGPVDDSVITTGAGDVDASFSYR--VDVPASVDWRAQGAVVPPKSQTS 164
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKG 210
C SCWAF A +E + I GKL+ LSEQQLVDC + + GC+ G +A+++++EN G
Sbjct: 165 TCSSCWAFVTAATIESLNMIKTGKLVSLSEQQLVDCDSYDGGCNLGSYGRAYKWVVENGG 224
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
L TEADYPY G C+ K AA I+ + +P +E AL AV+ QPV+V ++ G
Sbjct: 225 LTTEADYPYTARRGPCNRAKSAHHAAKITGFGKVPPRNEAALQAAVARQPVAVAIEV-GS 283
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
FYK GV CG H V VVG+GT + +GAKYW IKNSWG++WGE GYIRILRD
Sbjct: 284 GMQFYKGGVYTGPCGTRLAHAVTVVGYGT-DASSGAKYWTIKNSWGQSWGERGYIRILRD 342
Query: 331 AG 332
G
Sbjct: 343 VG 344
>gi|4469153|emb|CAB38314.1| chymopapain isoform II [Carica papaya]
Length = 352
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 138/345 (40%), Positives = 202/345 (58%), Gaps = 14/345 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
K + +II + ++ A G S + + +E+ + WM +H + Y+ EK R
Sbjct: 9 KIIFLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+ Y G+ +
Sbjct: 69 FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVGF-VAEDFTGLEHFDNED 126
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F Y++VT+ P SIDWR KGAVT +K+QG CGSCWAFS +A VEGI +I G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQE 186
Query: 183 LVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
LVDC ++GC GG + +Y + N G+ T YPY+ ++ C + I+ Y+
Sbjct: 187 LVDCDKHSYGCKGGYQTTSLQY-VANNGVHTSKVYPYQAKQYKCRATDKPGPKVKITGYK 245
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
+P E + L A++NQP+S V+A G+ F YKSGV + CG DH V VG+GT++
Sbjct: 246 RVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDG 305
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
+N Y +IKNSWG WGE GY+R+ R + G CG+ ++ YP
Sbjct: 306 KN---YIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|242088413|ref|XP_002440039.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
gi|241945324|gb|EES18469.1| hypothetical protein SORBIDRAFT_09g024940 [Sorghum bicolor]
Length = 463
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 136/315 (43%), Positives = 186/315 (59%), Gaps = 18/315 (5%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR--------TYKLGTNEFSDL 94
+ W A+HG+ Y E+A RL +F N ++ N N +Y L N F+DL
Sbjct: 42 DAWCAEHGKAYATPEERAARLAVFADNAAFVAAHNARVNAAGGGGAPPSYTLALNAFADL 101
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSIDWREKGAVTHIKDQGQCG 153
T+EEFRA G + R + P + VP ++DWRE GAVT +KDQG CG
Sbjct: 102 THEEFRAARLGRIAAGAAALRSPAAPVYRGLDGGLGAVPDALDWRENGAVTKVKDQGSCG 161
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLA 212
+CW+FSA A+EGI +I G L+ LSEQ+L+DC N GC GGLMD A++++++N G+
Sbjct: 162 ACWSFSATGAMEGINKIKTGSLVSLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGID 221
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TE DYPYR +GTC+ K K TI Y D+P E LLQAV+ QPVSV + S RAF
Sbjct: 222 TEEDYPYREADGTCNKNKLKKRIVTIDGYSDVPSNKEDLLLQAVAQQPVSVGICGSARAF 281
Query: 273 HFY-KSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
Y + G+ + C + DH V +VG+G+ E G YW++KNSWGE+WG GY+ + R+
Sbjct: 282 QLYSQQGIFDGPCPTSLDHAVLIVGYGS---EGGKDYWIVKNSWGESWGMKGYMHMHRNT 338
Query: 332 ----GLCGIATAASY 342
G+CGI AS+
Sbjct: 339 GDSKGVCGINMMASF 353
>gi|351705687|gb|EHB08606.1| Cathepsin S [Heterocephalus glaber]
Length = 331
Score = 269 bits (688), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 147/335 (43%), Positives = 208/335 (62%), Gaps = 19/335 (5%)
Query: 19 LVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
L C + ++G + + +++ H W +G+ Y+++ E+ +R I+++NL+++ N
Sbjct: 4 LAWVCVTCSLAGAQLQQDPMLDYHWHLWKKTYGKHYQEKNEEQVRRLIWEKNLKFVMLHN 63
Query: 78 KE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
E G +Y LG N D+T+EE R+L + P RQ R T+K +P S
Sbjct: 64 LEHSMGMHSYDLGMNHLGDMTSEEVRSLMSSLRVP-----RQWLRNVTYKSDPNQKLPDS 118
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NH 191
+DWREKG VT +K QG CGSCWAFSAV A+EG ++ GKL+ LS Q LVDCST+ N
Sbjct: 119 VDWREKGCVTEVKYQGACGSCWAFSAVGALEGQLKLKTGKLVSLSAQNLVDCSTEKYRNK 178
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GCSGG M +AF+Y+I+N G+ +E YPY+ + C + K AAT S+Y +LP G E+A
Sbjct: 179 GCSGGFMTEAFQYVIDNNGIDSETSYPYKATDEKC-HYDSKNRAATCSRYTELPYGSEEA 237
Query: 252 LLQAVSNQ-PVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYW 309
L +AV+N+ PVSV VDAS +F YK+GV + C N HGV VG+G NG YW
Sbjct: 238 LKEAVANKGPVSVAVDASRPSFFLYKNGVYDDPSCTQNVTHGVLAVGYGNL---NGKDYW 294
Query: 310 LIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
L+KNSWG +G+ GYIR+ R+ G CGIA+ +SYP
Sbjct: 295 LVKNSWGLYFGDQGYIRMARNKGNHCGIASYSSYP 329
>gi|55740402|gb|AAV63977.1| cathepsin L precursor [Artemia franciscana]
Length = 338
Score = 269 bits (687), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 149/343 (43%), Positives = 207/343 (60%), Gaps = 15/343 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ +I L+ Q+ + S+ E H + A H + Y +LE+ R+ I+ +N
Sbjct: 1 MKQITLIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENK 59
Query: 71 EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ K N ++G ++Y++ N+F DL + EFR++ GY + SR S + + N
Sbjct: 60 HKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPAN 119
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
V +VP S+DWREKGA+T +KDQGQCGSCWAFS+ A+EG T GKLI LSEQ L+DCS
Sbjct: 120 V-EVPESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCS 178
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N GC+GGLMD+AF+YI +NKG+ TE YPY E+ C + A + D+P
Sbjct: 179 GKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVC-RYNPRNRGAVDRGFVDIP 237
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAEE 302
G+E L AV+ PVSV +DAS +F FY GV C ++ DHGV VVG+G+
Sbjct: 238 SGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS--- 294
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+NG YWL+KNSW E WG+ GYI+I R+ CG+ATAASYP+
Sbjct: 295 DNGKDYWLVKNSWSEHWGDEGYIKIARNRKNHCGVATAASYPL 337
>gi|410923307|ref|XP_003975123.1| PREDICTED: cathepsin L1-like [Takifugu rubripes]
Length = 336
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 152/343 (44%), Positives = 211/343 (61%), Gaps = 19/343 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
MF +++L + C + +S S+ +P + E W H + Y E E+ R ++++NL+
Sbjct: 1 MFPVVVLAL-CVTAALSAPSL-DPQLDEHWNLWKDWHSKKYH-EKEEGWRRMVWEKNLKK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G TY LG N F D+T+EEFR + GY S++ R S F N
Sbjct: 58 IELHNLEHSMGKHTYSLGMNHFGDMTHEEFRQIMNGYKLK----SQRKLRGSLFMEPNFL 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+ P S+DWR+KG VT +KDQGQCGSCWAFS A+EG G L+ LSEQ LVDCS
Sbjct: 114 EAPRSVDWRDKGYVTPVKDQGQCGSCWAFSTTGAMEGQHFRKTGTLVSLSEQNLVDCSRP 173
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYR-HEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD+AF+YI +N GL +E YPY +EG C + +A + + D+P
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDSEESYPYLGTDEGPC-HYDPSYNSANDTGFVDVPS 232
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
G E+AL++AV++ PVSV +DA +F FY SG+ + +C + DHGV VVG+G ++
Sbjct: 233 GSERALMKAVASVGPVSVAIDAGHESFQFYHSGIYYDKECSSEELDHGVLVVGYGFEGKD 292
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+G KYW++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 293 VDGKKYWIVKNSWSENWGDKGYIYMAKDKKNHCGIATAASYPL 335
>gi|417409876|gb|JAA51427.1| Putative cathepsin s, partial [Desmodus rotundus]
Length = 342
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 149/343 (43%), Positives = 216/343 (62%), Gaps = 22/343 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQN 69
I M ++++++ C+S + +H+ +++H + W +G+ YK++ E+ +R I+++N
Sbjct: 10 IIMKWLVLVLLGCSSAMAQ---LHKDPTLDRHWDLWKKTYGKQYKEKNEEGVRRLIWEKN 66
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
L+++ N E G +Y LG N D+T+EE AL + VPS Q R T+K
Sbjct: 67 LKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVTALMSSLR--VPS---QWQRNVTYKSN 121
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+P S+DWR+KG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDC
Sbjct: 122 PNQKLPDSVDWRDKGCVTDVKYQGSCGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDC 181
Query: 187 ST---DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
S N GC+GG M +AF+YII+N G+ +EA YPY+ +G C K AAT S+Y +
Sbjct: 182 SVGKYSNRGCNGGFMTEAFQYIIDNNGIESEASYPYKAMDGKC-QYDSKYRAATCSRYTE 240
Query: 244 LPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAE 301
LP+ E AL +AV+N+ PVSV +DAS +F Y+SGV + C + +HGV VVG+G
Sbjct: 241 LPEDSEDALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDPACTLHVNHGVLVVGYGNL- 299
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG YWL+KNSWG +G+ GYIR+ R++G CGIA+ ASYP
Sbjct: 300 --NGKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYASYP 340
>gi|357160095|ref|XP_003578656.1| PREDICTED: KDEL-tailed cysteine endopeptidase CEP2-like
[Brachypodium distachyon]
Length = 377
Score = 269 bits (687), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 148/339 (43%), Positives = 200/339 (58%), Gaps = 33/339 (9%)
Query: 35 EPSIVE----KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE--GNRTYKLGT 88
+P+I++ + ++W A+HGR Y E+ RL ++ +N+ YIE AN + TY+LG
Sbjct: 42 DPTILQTMAPRFQRWKAEHGRAYATRDEELRRLRVYARNVRYIEAANGDPAAGLTYQLGE 101
Query: 89 NEFSDLTNEEFRALYTGYNRPVPSVSRQ----------SSRPSTFK------YQNVTDV- 131
++DLT +EF A+YT P P +S ++R Y NV+
Sbjct: 102 TAYTDLTADEFTAMYT---SPSPVLSAHDDEAAGAMMITTRAGAVDAGGQQVYFNVSTAG 158
Query: 132 -PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDN 190
P S+DWR KGAVT +K+QG+CGSCWAFS VA VEGI QI G LI LSEQ+LVDC T +
Sbjct: 159 APASVDWRAKGAVTEVKNQGRCGSCWAFSTVAVVEGIHQIRTGNLISLSEQELVDCDTLD 218
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
+GC GG+ A E+I N G+ATEADYPY ++G C K AA IS + + E
Sbjct: 219 YGCDGGVSYHALEWIASNGGIATEADYPYTGKDGACVANKLPLHAAAISGFARVATRSEP 278
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
+L AV+ QPV+V ++A G F Y GV N CG +HGV VV EE +G KYW+
Sbjct: 279 SLANAVAAQPVAVSIEAGGANFQHYVKGVYNGPCGTRLNHGVTVV-GYGEEEGDGEKYWI 337
Query: 311 IKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
+KNSWG+ WG+ GY R+ +D GLCGIA S+P+
Sbjct: 338 VKNSWGKKWGDGGYFRMKKDVAGKPEGLCGIAIRPSFPL 376
>gi|4731372|gb|AAD28476.1|AF133838_1 papain-like cysteine protease [Sandersonia aurantiaca]
Length = 370
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 123/227 (54%), Positives = 166/227 (73%), Gaps = 8/227 (3%)
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
++Y+ +P S+DWREKGAV IKDQG CGSCWAFS +A+VEGI +I G LI LSEQ+
Sbjct: 33 YRYRAGDALPDSVDWREKGAVVPIKDQGGCGSCWAFSTIASVEGINKIVTGDLISLSEQE 92
Query: 183 LVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
LVDC T N GC+GGLMD AF++II+N G+ TE DYPY ++G CD+ ++ A +I+ Y
Sbjct: 93 LVDCDKTYNDGCNGGLMDYAFQFIIDNGGIDTEKDYPYTEQDGRCDSYRKNAKVVSINSY 152
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
ED+P DEQAL +A ++QP++V +D GR+F Y SG+ CG + DHGV VVG+G+
Sbjct: 153 EDVPVNDEQALKKAAASQPIAVAIDGGGRSFQLYNSGIFTGKCGTSLDHGVTVVGYGS-- 210
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
E+G YW+++NSWGE+WGE GYIR+ R+ +G+CGIA ASYP+
Sbjct: 211 -ESGKDYWIVRNSWGESWGEKGYIRMARNIDSPSGICGIAMEASYPI 256
>gi|226509942|ref|NP_001146834.1| cysteine protease precursor [Zea mays]
gi|159506725|gb|ABW97700.1| cysteine protease [Zea mays]
gi|414867308|tpg|DAA45865.1| TPA: cysteine protease [Zea mays]
Length = 352
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 140/311 (45%), Positives = 193/311 (62%), Gaps = 8/311 (2%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ W A + R+Y E+ R ++++N+E+IE N+ GN TY LG N+F+DLT E
Sbjct: 45 MMDRFLSWQATYNRSYPTAEERQRRFQVYRRNIEHIEATNRAGNLTYTLGENQFADLTEE 104
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCW 156
EF LYT PV + + R + D PTS+DWR KGAVT IK+QG C SCW
Sbjct: 105 EFLDLYTMKGMPVRRDAGKK-RANVSSSAAAVDAPTSVDWRSKGAVTPIKNQGPSCSSCW 163
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEAD 216
AF A +E IT+IT GKL+ LSEQ+L+DC + GC+ G + ++I+N GL TEA+
Sbjct: 164 AFVTAATIESITKITTGKLVSLSEQELIDCDPYDGGCNLGYFVNGYRWVIQNGGLTTEAN 223
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY+ C + AATIS Y LP G+ Q L QAV+ QPV+ ++ G + FY
Sbjct: 224 YPYQARRYACSRSRAAQHAATISDYVQLPAGEGQ-LQQAVAQQPVAAAIEMGG-SLQFYS 281
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGL 333
GV + CG +H + VVG+G A+ +G KYWL+KNSWG++WGE GY+R+ RD GL
Sbjct: 282 GGVFSGQCGTRMNHAITVVGYG-ADSSSGLKYWLVKNSWGQSWGERGYLRMRRDVGRGGL 340
Query: 334 CGIATAASYPV 344
CGIA +YPV
Sbjct: 341 CGIALDLAYPV 351
>gi|221090861|ref|XP_002167224.1| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 268 bits (686), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 153/335 (45%), Positives = 207/335 (61%), Gaps = 24/335 (7%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
+++L +T A ++ P E QW H + Y + E+ +R I+K N I +
Sbjct: 7 LLLLGVTLA------YTIERPVKDESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIRE 60
Query: 76 ANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
N +G + L N+F D+TN EF+A + GY +S + STF N P ++
Sbjct: 61 HNLKGG-DFLLKMNQFGDMTNSEFKA-FNGY------LSHKHVNGSTFLTPNNFVAPDTV 112
Query: 136 DWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGC 193
DWR +G VT +KDQGQCGSCWAFS ++EG GKL+ LSEQ LVDCST N+GC
Sbjct: 113 DWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGC 172
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
+GGLMD AF YI ENKG+ +EA YPY E+G C +K +VAAT + + DLP+G+E L
Sbjct: 173 NGGLMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKK-PSVAATDTGFVDLPEGNENKLK 231
Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWL 310
+AV++ P+SV +DAS +F FY SGV N C + DHGV VVG+GT E+G YWL
Sbjct: 232 EAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYGT---ESGKDYWL 288
Query: 311 IKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+KNSW +WG+ GYI++ R+A CGIAT ASYP+
Sbjct: 289 VKNSWNTSWGDKGYIKMRRNAKNQCGIATKASYPL 323
>gi|330805273|ref|XP_003290609.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
gi|325079248|gb|EGC32857.1| hypothetical protein DICPUDRAFT_92519 [Dictyostelium purpureum]
Length = 333
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 147/307 (47%), Positives = 191/307 (62%), Gaps = 16/307 (5%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT 104
WM +H R Y E E R FK+N+++I K N + + T LG +F+DLTNEE++ Y
Sbjct: 36 WMRKHDRAYSHE-EFTDRYQAFKENMDFIHKWNSQESDTV-LGLTKFADLTNEEYKKHYL 93
Query: 105 GYNRPVP-SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
G V +++ FK+ P SIDWREKGAV+ +KDQGQCGSCW+FS A
Sbjct: 94 GIKVNVKKNLNAAQKGLKFFKFTG----PDSIDWREKGAVSQVKDQGQCGSCWSFSTTGA 149
Query: 164 VEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRH 221
VEG QI G ++ LSEQ LVDCS N GC GGLM AFEYII+N G+ATE+ YPY
Sbjct: 150 VEGAHQIKSGNMVSLSEQNLVDCSGQYGNQGCEGGLMVNAFEYIIDNGGIATESSYPYTA 209
Query: 222 EEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN 281
+G C K A I Y+++P+G+E +L A++ QPVSV +DAS +F Y SGV +
Sbjct: 210 AQGRCKFTKSMN-GANIIGYKEIPQGEEDSLTAALAKQPVSVAIDASHMSFQLYSSGVYD 268
Query: 282 --ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIAT 338
A DHGV VG+GT E G Y++IKNSWG TWG+ GYI + R+A CG+AT
Sbjct: 269 EPACSSEALDHGVLAVGYGTLE---GKDYYIIKNSWGPTWGQDGYIFMSRNAQNQCGVAT 325
Query: 339 AASYPVA 345
ASYP++
Sbjct: 326 MASYPIS 332
>gi|307192137|gb|EFN75465.1| Cathepsin L [Harpegnathos saltator]
Length = 339
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 147/341 (43%), Positives = 208/341 (60%), Gaps = 18/341 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ ++L I A+Q +S ++ + E+ + H + Y ++E++ R+ IF +N I
Sbjct: 5 IFLLLGILAAAQAISFFNL----VTEEWNTFKVTHRKAYDSKIEESFRMKIFMENWHKIA 60
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP--STFKYQNVT 129
N++ +YKLG N++ D+ + EF G+N+ V + R RP S F
Sbjct: 61 LHNQKYELNEVSYKLGMNKYGDMLHHEFINTLNGFNKSVSAQLRAQRRPIGSRFIEPANV 120
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
++P+S+DWR GAVT IKDQG CGSCW+FSA A+EG GKL+ LSEQ L+DCS
Sbjct: 121 EIPSSVDWRTHGAVTPIKDQGHCGSCWSFSATGALEGQHYRITGKLVSLSEQNLIDCSGR 180
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N+GC+GGLMD+AF+YI +N GL TE YPY E C + AT S Y D+P+G
Sbjct: 181 YGNNGCNGGLMDQAFQYIKDNHGLDTEISYPYEAENDKC-RYNPRNNGATDSGYVDIPEG 239
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEEN 304
+E+ L AV+ PVSV +DAS +F FY+ GV C + N DHGV VVG+GT ++N
Sbjct: 240 NEKKLKAAVATIGPVSVAIDASAESFQFYREGVYYEPRCSSENLDHGVLVVGYGT--DDN 297
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YWL+KNSWG TWG+ GYI++ R+ CGIA++ASYP+
Sbjct: 298 DQDYWLVKNSWGVTWGDEGYIKMARNKDNHCGIASSASYPL 338
>gi|125552927|gb|EAY98636.1| hypothetical protein OsI_20560 [Oryza sativa Indica Group]
Length = 449
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 136/307 (44%), Positives = 186/307 (60%), Gaps = 10/307 (3%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+ E W A+HGR+Y E+A RL F N ++ A+ +Y L N F+DLT++EFR
Sbjct: 37 QFEAWCAEHGRSYATPGERAARLAAFADNAAFV-AAHNGAPASYALALNAFADLTHDEFR 95
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
A G R P V VP ++DWR+ GAVT +KDQG CG+CW+FSA
Sbjct: 96 AARLGRLAAA-GPGRDGGAPYLGVDGGVGAVPDAVDWRQSGAVTKVKDQGSCGACWSFSA 154
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
A+EGI +I G LI LSEQ+L+DC N GC GGLMD A++++++N G+ TEADYPY
Sbjct: 155 TGAMEGINKIKTGSLISLSEQELIDCDRSYNSGCGGGLMDYAYKFVVKNGGIDTEADYPY 214
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
R +GTC+ K K TI Y+D+P +E LLQAV+ QPVSV + S RAF Y G+
Sbjct: 215 RETDGTCNKNKLKRRVVTIDGYKDVPANNEDMLLQAVAQQPVSVGICGSARAFQLYSKGI 274
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCG 335
+ C + DH + +VG+G+ E G YW++KNSWGE+WG GY+ + R+ G+CG
Sbjct: 275 FDGPCPTSLDHAILIVGYGS---EGGKDYWIVKNSWGESWGMKGYMYMHRNTGNSNGVCG 331
Query: 336 IATAASY 342
I S+
Sbjct: 332 INQMPSF 338
>gi|344275470|ref|XP_003409535.1| PREDICTED: cathepsin S-like isoform 1 [Loxodonta africana]
Length = 331
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 150/340 (44%), Positives = 209/340 (61%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++ + + C+S + R +P++ + W + + YK++ E+ R I+++NL++
Sbjct: 1 MKWLLWVALVCSSAMA--RLHKDPTLDNHWDLWKKTYSKQYKEKNEEVARRLIWEKNLKF 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y L N D+T+EE +L + VPS Q R TFK
Sbjct: 59 VMLHNLEHSMGMHSYDLSMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNVTFKSNPNQ 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCS +
Sbjct: 114 KLPDSLDWREKGCVTDVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSGE 173
Query: 190 ---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GG M +AF+YII+N G+ +EA YPY+ +G C K AAT SKY +LP
Sbjct: 174 KYSNKGCNGGFMTRAFQYIIDNNGIDSEASYPYKATDGKCQ-YDPKNRAATCSKYTELPY 232
Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
G E AL +AV+N+ PVSV +DAS +F YKSGV + C +N +HGV VVG+G N
Sbjct: 233 GSEDALKEAVANKGPVSVGIDASRPSFFLYKSGVYYDPSCTDNVNHGVLVVGYGNL---N 289
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
G YWL+KNSWG +GE GYIR+ R++G CGIA+ SYP
Sbjct: 290 GKDYWLVKNSWGLNFGEQGYIRMARNSGNHCGIASFPSYP 329
>gi|195429415|ref|XP_002062758.1| GK19626 [Drosophila willistoni]
gi|194158843|gb|EDW73744.1| GK19626 [Drosophila willistoni]
Length = 341
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 143/343 (41%), Positives = 209/343 (60%), Gaps = 19/343 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ +++ + +Q VS + + E+ + +H + Y D E+ R+ IF +N +I
Sbjct: 5 LITLLIALVAMTQAVSYSEL----VREEWNTFKLEHRKNYADSTEETFRMKIFNENKHHI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQN 127
K N+ G +YKL N+++D+ + EFR G+N + R +S TF
Sbjct: 61 AKHNQRYATGEVSYKLALNKYADMLHHEFRETMNGFNYTLHKQLRSTDESFTGVTFISPE 120
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+PT++DWR KGAVT +KDQG CGSCWAFS+ A+EG G L+ LSEQ LVDCS
Sbjct: 121 HVKLPTAVDWRTKGAVTEVKDQGHCGSCWAFSSTGAIEGQHFRKSGTLVSLSEQNLVDCS 180
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
T N+GC+GGLMD AF Y+ +N G+ TE Y Y + +C K ++ AT + D+P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYVKDNGGIDTEKSYAYEGIDDSCHFDK-NSIGATDRGFADIP 239
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEE 302
+G+E+ L QAV+ PVSV +DAS ++F FY GV + +C N DHGV VVG+GT E
Sbjct: 240 QGNEKKLAQAVATIGPVSVAIDASQQSFQFYSEGVYDEPNCSAENLDHGVLVVGYGT--E 297
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
++G+ YWL+KNSWG TWG+ G+I++ R+ CGIA+A+SYP+
Sbjct: 298 KDGSDYWLVKNSWGTTWGDKGFIKMSRNKENQCGIASASSYPL 340
>gi|440799058|gb|ELR20119.1| cysteine proteinase [Acanthamoeba castellanii str. Neff]
Length = 401
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 145/322 (45%), Positives = 191/322 (59%), Gaps = 20/322 (6%)
Query: 36 PSIVEKHEQ-----WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE--GNRTYKLGT 88
P VE EQ WM H ++Y + R I+K N +I NK+ ++ +
Sbjct: 84 PRDVELEEQRAFTEWMRTHRKSYHHD-HFLPRFEIWKTNNRWITHWNKKHANASSFTVAI 142
Query: 89 NEFSDLTNEEFRALYTGYNR-PVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
N+F DLT++EF LY G + P S + RP ++ N +P S DWR+KG V+ +K
Sbjct: 143 NQFGDLTSDEFNRLYNGLHVFSAPKASEKVERPR--QWANTAGIPESGDWRQKGVVSRVK 200
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST---DNHGCSGGLMDKAFEY 204
DQG CGSCWAFS + EGI IT +L+ LSEQ LVDC+T DN+GC+GG MD AF Y
Sbjct: 201 DQGMCGSCWAFSTTGSTEGINAITTSRLVPLSEQNLVDCATAAYDNYGCNGGFMDNAFRY 260
Query: 205 IIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
II+NKG+ +EA YPY +G C + + LPKGDE+ALL A + QP+SV
Sbjct: 261 IIDNKGIDSEASYPYVAADGQCRFNPKTVYGGKGGTLKSLPKGDEKALLVAAARQPISVG 320
Query: 265 VDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
+DA +F FY GV N +C + +HGV +VG+G E G YWL+KNSWG+TWG
Sbjct: 321 IDAGRPSFQFYSKGVYNEPECSSTELNHGVLIVGWGV---ERGQAYWLVKNSWGQTWGMD 377
Query: 323 GYIRILRDA-GLCGIATAASYP 343
GYI++ RD CGIAT ASYP
Sbjct: 378 GYIKMSRDKNNQCGIATLASYP 399
>gi|344953542|gb|AEN28617.1| cathepsin L-like cysteine protease [Epinephelus coioides]
Length = 336
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 148/345 (42%), Positives = 210/345 (60%), Gaps = 20/345 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++P+ V+ + C S +S S+ +P + + E W + H + Y E E+ R ++++N
Sbjct: 1 MLPLAVVAL----CLSAALSAPSL-DPQLDDHWELWKSWHSKKYH-EKEEGWRRMVWEKN 54
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
L+ IE N E G +Y+LG N F D+T+EEFR L GY R + +R S F
Sbjct: 55 LKKIELHNLEHSMGTHSYRLGMNHFGDMTHEEFRQLMNGYKRKAET----KARGSLFLEP 110
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
N + P S+DWR+ G VT +KDQGQCGSCWAFS A+EG GKL+ LSEQ LVDC
Sbjct: 111 NFLEAPKSVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDC 170
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N GC+GGLMD+AF+Y+ +N+GL +E YPY + + + + + D+
Sbjct: 171 SRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPTYNSVNDTGFVDI 230
Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TA 300
P G E+AL++AV+ PVSV +DA +F FY+SG+ +C + DHGV VVG+G
Sbjct: 231 PSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLVVGYGFQG 290
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
E+ +G KYW++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 291 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|355763133|gb|EHH62119.1| hypothetical protein EGM_20318 [Macaca fascicularis]
Length = 331
Score = 268 bits (685), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 210/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M +I +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKQLICVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNAN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M +AF+YII+N G+ ++A YPY+ + C K AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKCQ-YDSKYRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E L + V+N+ PVSV VDAS +F Y+SGV C N +HGV VVG+G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL--- 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|41688064|dbj|BAD08618.1| cathepsin L preproprotein [Cyprinus carpio]
Length = 337
Score = 268 bits (684), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 149/342 (43%), Positives = 203/342 (59%), Gaps = 16/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M V + C S V + ++ + + EQW HG+ Y E E+ R ++++NL+
Sbjct: 1 MRVFLAAFALCLSAVFAAPTL-DKQLDNHWEQWKNWHGKKYH-EKEEGWRRMVWEKNLQK 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G TY+LG N F D+T+EEFR + GY + R S F N
Sbjct: 59 IELHNLEHSMGTHTYRLGMNRFGDMTHEEFRQVMNGYKHK----KERRFRGSLFMEPNFL 114
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+VP S+DWREKG VT +KDQG+CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 115 EVPNSLDWREKGYVTPVKDQGECGSCWAFSTTGAMEGQMFRKTGKLVSLSEQNLVDCSRP 174
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+YI + GL +E YPY + + K AA + + D+P G
Sbjct: 175 EGNEGCNGGLMDQAFQYIKDQNGLDSEESYPYVGTDDQPCHYDPKYSAANDTGFVDIPSG 234
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E AL++A++ PVSV +DA +F FY+SG+ +C + DHGV VG+G E+
Sbjct: 235 KEHALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+G KYW++KNSW E WG+ GY+ + +D CGIATAASYP+
Sbjct: 295 DGKKYWIVKNSWSENWGDKGYVYMAKDRHNHCGIATAASYPL 336
>gi|402856105|ref|XP_003892640.1| PREDICTED: cathepsin S isoform 1 [Papio anubis]
Length = 331
Score = 267 bits (683), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 150/341 (43%), Positives = 210/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 QMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M +AF+YII+N G+ ++A YPY+ + C K AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKCQ-YDSKYRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E L + V+N+ PVSV VDAS +F Y+SGV C N +HGV VVG+G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL--- 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|355558399|gb|EHH15179.1| hypothetical protein EGK_01236 [Macaca mulatta]
gi|380809986|gb|AFE76868.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416071|gb|AFH31249.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416073|gb|AFH31250.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416075|gb|AFH31251.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416077|gb|AFH31252.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
gi|383416079|gb|AFH31253.1| cathepsin S isoform 1 preproprotein [Macaca mulatta]
Length = 331
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 150/341 (43%), Positives = 210/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKQLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNAN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M +AF+YII+N G+ ++A YPY+ + C K AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTRAFQYIIDNNGIDSDASYPYKATDQKCQ-YDSKYRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E L + V+N+ PVSV VDAS +F Y+SGV C N +HGV VVG+G
Sbjct: 232 YGREDVLKEVVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYGVL--- 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|326672302|ref|XP_003199633.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|157423549|gb|AAI53506.1| Im:6910535 [Danio rerio]
Length = 335
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 142/340 (41%), Positives = 208/340 (61%), Gaps = 15/340 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR GY + ++S+ + F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKQD----PNRTSKGALFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GG+MD+AF+Y+ ENKGL +E YPY + + A I+ + D+PKG+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGN 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFG-TAEEENG 305
E AL+ AV+ PVSV +DAS ++ FY+SG+ C + DH V VVG+G + G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+YW++KNSW + WG+ GYI + +D CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|61368403|gb|AAX43172.1| cathepsin S [synthetic construct]
Length = 332
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ + C K AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E L +AV+N+ PVSV VDA +F Y+SGV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|303283194|ref|XP_003060888.1| predicted protein [Micromonas pusilla CCMP1545]
gi|226457239|gb|EEH54538.1| predicted protein [Micromonas pusilla CCMP1545]
Length = 422
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 148/323 (45%), Positives = 205/323 (63%), Gaps = 19/323 (5%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE---KANKEGNRTYKLGTNEFSD 93
+I + ++W+A HG+ Y E+A RL IF N E++ +A+ G +++ L N +D
Sbjct: 65 TIEARFDRWLATHGKAYACPKERAKRLAIFADNAEFVRVHNEAHAAGKKSHWLRLNHLAD 124
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRP----STFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
LT EEF+ + GY+ V +SS P + ++Y +VT P ++DW +GAVT +K+Q
Sbjct: 125 LTREEFKHML-GYDASKKRV--ESSSPPVDAANWEYADVTP-PETMDWVSRGAVTPVKNQ 180
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIE 207
GQCGSCWAFS V AVEG+ + G LI LSEQ+LV C+ N+GC GGLMD FE+I+E
Sbjct: 181 GQCGSCWAFSTVGAVEGVVAVKTGDLISLSEQELVSCAKIGGNNGCKGGLMDNGFEWIVE 240
Query: 208 NKGLATEADYPYRHEEGTCD-NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVD 266
N+G+ E D+ Y ++ C+ +K +A AA+I ++D+P+ DE AL +AVS QPV+V ++
Sbjct: 241 NRGVDDEEDWGYLAKDRRCNWFKKRRAKAASIDGFKDVPRNDEDALKKAVSQQPVAVAIE 300
Query: 267 ASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAK-YWLIKNSWGETWGESGYI 325
A R F Y GV + +CG N DHGV VVG+G E G K YW +KNSWG WGE GYI
Sbjct: 301 ADHREFQLYSGGVFDGECGTNLDHGVLVVGYGYDGESAGHKHYWTVKNSWGAKWGEEGYI 360
Query: 326 RILR----DAGLCGIATAASYPV 344
RI R AG CG+A ASYP
Sbjct: 361 RIARGGMGPAGQCGVAMQASYPT 383
>gi|226499884|ref|NP_001148278.1| thiol protease SEN102 precursor [Zea mays]
gi|195617112|gb|ACG30386.1| thiol protease SEN102 precursor [Zea mays]
Length = 374
Score = 267 bits (683), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 142/348 (40%), Positives = 199/348 (57%), Gaps = 33/348 (9%)
Query: 24 ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR- 82
A + S + S++E+ ++W A + ++Y E+ R ++ +N+ YIE N E
Sbjct: 32 AGDTMGSMSNDDSSMIERFQRWKAAYNKSYATVAEERRRFRVYARNMAYIEATNAEAEAA 91
Query: 83 --TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFK---------------- 124
TY+LG ++DLTN+EF A+YT P++++ + S
Sbjct: 92 GLTYELGETAYTDLTNQEFMAMYT-----APALAQLPADESVITTRAGPVDAVGGAPGQL 146
Query: 125 --YQNVT-DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
Y N++ P S+DWR GAVT +K+QG+CGSCWAFS VA VEGI QI GKL+ LSEQ
Sbjct: 147 PVYVNLSASAPASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQ 206
Query: 182 QLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
+LVDC T + GC GG+ +A +I N G+ TEADYPY C+ K A +I+
Sbjct: 207 ELVDCDTLDDGCDGGISYRALRWIASNGGITTEADYPYTGTTDACNRAKLSHNAVSIAGL 266
Query: 242 EDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAE 301
+ E +L AV+ QPV+V ++A G F YK GV N CG N +HGV VVG+G E
Sbjct: 267 RRVATRSEASLANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQ-E 325
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
G +YW++KNSWG+ WG+ GYIR+ +D GLCGIA SYP+
Sbjct: 326 AAAGDRYWIVKNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|301116794|ref|XP_002906125.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
gi|262107474|gb|EEY65526.1| cysteine protease family C01A, putative [Phytophthora infestans
T30-4]
Length = 535
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 192/307 (62%), Gaps = 13/307 (4%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRALY 103
WM H ++ D LE A RL + N YI + N E T KL NEFS ++ EEF+
Sbjct: 32 WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
TGY P + ++ + + +V VP S+DW++KG VT +K+QG CGSCWAFS A
Sbjct: 92 TGYVMPEGYLEQRLASRVDNLWSDVQ-VPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTGA 150
Query: 164 VEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
VEG ++ GKL+ LSEQ+LVDC + + GC+GGLMD AF +I +N G+ +E DY Y+ +
Sbjct: 151 VEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKAK 210
Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
C + EK V IS ++D+ DE AL AV+ QPVSV ++A +AF FYKSGV N
Sbjct: 211 AQVCRDC-EKVV--KISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267
Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIAT 338
CG DHGV VG+G+ ENG K+W +KNSWG +WGE GYIR+ R+ AG CGIA+
Sbjct: 268 TCGTRLDHGVLAVGYGS---ENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIAS 324
Query: 339 AASYPVA 345
SYP A
Sbjct: 325 VPSYPFA 331
>gi|66270077|gb|AAY43368.1| cysteine protease [Phytophthora infestans]
Length = 510
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 145/307 (47%), Positives = 192/307 (62%), Gaps = 13/307 (4%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRALY 103
WM H ++ D LE A RL + N YI + N E T KL NEFS ++ EEF+
Sbjct: 32 WMKTHSVSFSDALEFAKRLENYIANDMYIMEHNLENAWTGVKLDHNEFSSMSFEEFKFKM 91
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
TGY P + ++ + + +V VP S+DW++KG VT +K+QG CGSCWAFS A
Sbjct: 92 TGYVMPEGYLEQRLASRVDNLWSDVQ-VPDSVDWQDKGGVTPVKNQGMCGSCWAFSTTGA 150
Query: 164 VEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
VEG ++ GKL+ LSEQ+LVDC + + GC+GGLMD AF +I +N G+ +E DY Y+ +
Sbjct: 151 VEGAAFVSSGKLVSLSEQELVDCDHNGDMGCNGGLMDHAFAWIEDNGGICSEDDYEYKAK 210
Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
C + EK V IS ++D+ DE AL AV+ QPVSV ++A +AF FYKSGV N
Sbjct: 211 AQVCRDC-EKVV--KISGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 267
Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIAT 338
CG DHGV VG+G+ ENG K+W +KNSWG +WGE GYIR+ R+ AG CGIA+
Sbjct: 268 TCGTRLDHGVLAVGYGS---ENGQKFWKVKNSWGSSWGEKGYIRLAREENGPAGQCGIAS 324
Query: 339 AASYPVA 345
SYP A
Sbjct: 325 VPSYPFA 331
>gi|29165304|gb|AAO65603.1| cathepsin L precursor [Hydra vulgaris]
Length = 324
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 152/335 (45%), Positives = 206/335 (61%), Gaps = 24/335 (7%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
+++L +T A ++ P E QW H + Y + E+ +R I+K N I +
Sbjct: 7 LLLLGVTLA------YTIERPVKDESWIQWKMYHNKVYSHDGEETVRYTIWKDNERRIRE 60
Query: 76 ANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
N +G + L N+F D+TN EF+A + GY +S + STF N P ++
Sbjct: 61 HNLKGGD-FILKMNQFGDMTNSEFKA-FNGY------LSHKHVNGSTFLTPNNFVAPDTV 112
Query: 136 DWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGC 193
DWR +G VT +KDQGQCGSCWAFS ++EG GKL+ LSEQ LVDCST N+GC
Sbjct: 113 DWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQHFKKTGKLVSLSEQNLVDCSTAYGNNGC 172
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
GGLMD AF YI ENKG+ +EA YPY E+G C +K +VAAT + + D+P+G+E L
Sbjct: 173 DGGLMDNAFTYIKENKGIDSEASYPYTAEDGKCVFKKS-SVAATDTGFVDIPEGNENKLK 231
Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWL 310
+AV++ P+SV +DAS +F FY SGV N C + DHGV VVG+GT E+G YWL
Sbjct: 232 EAVASVGPISVAIDASHESFQFYSSGVYNEPSCSSTELDHGVLVVGYGT---ESGKDYWL 288
Query: 311 IKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+KNSW +WG+ GYI++ R+A CGIAT ASYP+
Sbjct: 289 VKNSWNTSWGDKGYIKMRRNAKNQCGIATKASYPL 323
>gi|195381187|ref|XP_002049336.1| GJ20806 [Drosophila virilis]
gi|194144133|gb|EDW60529.1| GJ20806 [Drosophila virilis]
Length = 339
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 149/343 (43%), Positives = 206/343 (60%), Gaps = 21/343 (6%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+ +L + +Q VS + I E+ + + +H + Y DE E+ RL IF +N I
Sbjct: 4 LFALLALVAVAQAVS----YADVIKEEWQTFKLEHRKNYVDETEERFRLKIFNENKHKIA 59
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS----TFKYQN 127
K N+ G ++K+ N+++D+ + EF G+N + R +S PS TF
Sbjct: 60 KHNQRYASGEVSFKMAVNKYADMLHHEFHTTMNGFNYTLHKQLR-ASDPSFVGVTFISPE 118
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P S+DWR KGAVT +KDQG CGSCWAFS+ A+EG G LI LSEQ LVDCS
Sbjct: 119 HVKIPKSVDWRSKGAVTEVKDQGHCGSCWAFSSTGALEGQHFRKAGTLISLSEQNLVDCS 178
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
T N+GC+GGLMD AF YI +N G+ TE YPY + +C K + AT D+P
Sbjct: 179 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEGIDDSCHFNK-ATIGATDRGSVDIP 237
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGTAEE 302
+GDE+ + +AV+ PVSV +DAS +F FY G+ N C N DHGV VVG+GT +
Sbjct: 238 QGDEKKMAEAVATIGPVSVAIDASHESFQFYSEGIYNEPQCDPQNLDHGVLVVGYGT--D 295
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
E+G YWL+KNSWG TWG+ G+I++ R+A CGIA+A+SYP+
Sbjct: 296 ESGQDYWLVKNSWGTTWGDKGFIKMARNADNQCGIASASSYPL 338
>gi|114559418|ref|XP_001171268.1| PREDICTED: cathepsin S isoform 3 [Pan troglodytes]
gi|397492866|ref|XP_003817341.1| PREDICTED: cathepsin S isoform 1 [Pan paniscus]
gi|410225070|gb|JAA09754.1| cathepsin S [Pan troglodytes]
gi|410251608|gb|JAA13771.1| cathepsin S [Pan troglodytes]
gi|410328325|gb|JAA33109.1| cathepsin S [Pan troglodytes]
gi|410328327|gb|JAA33110.1| cathepsin S [Pan troglodytes]
Length = 331
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ + C K AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKATDQKCQ-YDSKYRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E L +AV+N+ PVSV VDA +F Y+SGV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDALHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|23110962|ref|NP_004070.3| cathepsin S isoform 1 preproprotein [Homo sapiens]
gi|88984046|sp|P25774.3|CATS_HUMAN RecName: Full=Cathepsin S; Flags: Precursor
gi|60816153|gb|AAX36372.1| cathepsin S [synthetic construct]
gi|61358282|gb|AAX41541.1| cathepsin S [synthetic construct]
gi|119573903|gb|EAW53518.1| cathepsin S, isoform CRA_b [Homo sapiens]
gi|119573904|gb|EAW53519.1| cathepsin S, isoform CRA_b [Homo sapiens]
Length = 331
Score = 267 bits (682), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ + C K AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E L +AV+N+ PVSV VDA +F Y+SGV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|395819351|ref|XP_003783057.1| PREDICTED: cathepsin L1-like [Otolemur garnettii]
Length = 333
Score = 267 bits (682), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 151/342 (44%), Positives = 210/342 (61%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M +++IL C + S SM + S+ +W A+H + Y E+ R ++++N++
Sbjct: 1 MNLLLILAAFCVG-ITSATSMFDGSLNAHWYRWKAKHRKLYGMR-EEGWRRAVWEKNMKM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N+E G + + N F D+TNEEFR + G+ +++ + F+ +
Sbjct: 59 IEVHNQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFR------NQKHKKGKVFQEPSFL 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
+VP S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKLI LSEQ LVDCS
Sbjct: 113 EVPKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRP 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC GGLMD AF+YI EN GL +E YPY + +C + E +VA + + D+PK
Sbjct: 173 QGNEGCDGGLMDYAFQYIKENGGLDSEESYPYDAMDESCKYRPEYSVAND-TGFVDIPK- 230
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAE-EE 303
+E+AL++AV+ P+SV +DA +F FYK GV +C +N DHGV VVG+G E E
Sbjct: 231 EEKALMKAVATVGPISVAIDAGHESFQFYKEGVYFEPECSSDNVDHGVLVVGYGYEETES 290
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ K+WL+KNSWGE WG GYI++ +D CGIATAASYP
Sbjct: 291 DNNKFWLVKNSWGEEWGLGGYIKMTKDQKNHCGIATAASYPT 332
>gi|449275508|gb|EMC84350.1| Cathepsin L1, partial [Columba livia]
Length = 319
Score = 266 bits (681), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 146/320 (45%), Positives = 202/320 (63%), Gaps = 14/320 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P + + W + H + Y E E++ R ++++NL+ IE N + G +YKLG N+F
Sbjct: 3 DPELDGHWQLWKSWHNKDYH-EREESWRRVVWEKNLKMIELHNLDHTLGKHSYKLGMNQF 61
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+T EEFR L GY S + R S F + + P S+DWREKG VT +KDQGQ
Sbjct: 62 GDMTTEEFRQLMNGY---AHKKSERKYRGSQFLEPSFLEAPRSVDWREKGYVTPVKDQGQ 118
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFS A+EG GKL+ LSEQ LVDCS N GC+GGLMD+AF+Y+ +N
Sbjct: 119 CGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSRPEGNQGCNGGLMDQAFQYVQDNG 178
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ +E YPY ++ K + AA + + D+P+G E+AL++AV+ PVSV +DA
Sbjct: 179 GIDSEESYPYTAKDDEDCRYKAEYNAANDTGFVDIPQGHERALMKAVAAVGPVSVAIDAG 238
Query: 269 GRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYI 325
+F FY+SG+ DC + + DHGV VVG+G E+ +G KYW++KNSWGE WG+ GYI
Sbjct: 239 HSSFQFYQSGIYYEPDCSSEDLDHGVLVVGYGFEGEDVDGKKYWIVKNSWGEKWGDKGYI 298
Query: 326 RILRD-AGLCGIATAASYPV 344
+ +D CGIATAASYP+
Sbjct: 299 YMAKDRKNHCGIATAASYPL 318
>gi|94448674|emb|CAI91575.1| cathepsin L2 [Lubomirskia baicalensis]
Length = 324
Score = 266 bits (681), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 150/312 (48%), Positives = 195/312 (62%), Gaps = 25/312 (8%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE-GNRTYKLGTNEFSDLTNEEFRALY 103
W A+HG++Y++ E+ +R ++ N +YI++ N+ G Y L N+F DL N EF++LY
Sbjct: 25 WKAEHGKSYRNHKEEMLRHVTWQANKKYIDEHNQHAGVFGYTLKMNQFGDLENSEFKSLY 84
Query: 104 TGYNRPVPSVSRQSSRPSTFK----YQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
GY R S+ P K V D+P S+DW +KG VT +K+QGQCGSCW+FS
Sbjct: 85 NGY--------RMSNAPRKGKPFVPAARVQDLPASVDWSKKGWVTPVKNQGQCGSCWSFS 136
Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADY 217
A ++EG G L+ LSEQ LVDCS NHGC+GGLMD AFEY+I+N G+ TEA Y
Sbjct: 137 ATGSMEGQHFNATGTLMSLSEQNLVDCSAAEGNHGCNGGLMDDAFEYVIKNNGIDTEASY 196
Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYK 276
PYR + TC V ATIS Y D+ K E L AV+ PVSV +DAS +F FY
Sbjct: 197 PYRAVDSTCKFNTAD-VGATISGYVDVTKDSESDLQVAVATIGPVSVAIDASHISFQFYS 255
Query: 277 SGVLNAD--CGNNCDHGVAVVGFGTAEEENGAK-YWLIKNSWGETWGESGYIRILRDA-G 332
SGV + N DHGV VG+GT +G+K YWL+KNSWG +WG SGYI ++R+
Sbjct: 256 SGVYDPLICSSTNLDHGVLAVGYGT----DGSKDYWLVKNSWGASWGMSGYIEMVRNHNN 311
Query: 333 LCGIATAASYPV 344
CGIAT+ASYPV
Sbjct: 312 KCGIATSASYPV 323
>gi|195153545|ref|XP_002017686.1| GL17172 [Drosophila persimilis]
gi|194113482|gb|EDW35525.1| GL17172 [Drosophila persimilis]
Length = 341
Score = 266 bits (681), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 207/343 (60%), Gaps = 19/343 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
++ +L + +Q VS + I E+ + +H + Y+DE E+ RL IF +N I
Sbjct: 5 LILPLLALVAVAQAVS----YAEVIQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60
Query: 74 EKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQN 127
K N+ G ++K+ N+++D+ + EF + G+N + R +S + TF
Sbjct: 61 AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P +DWR KGAVT +KDQG CGSCWAFS+ A+EG G L+ LSEQ LVDCS
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
T N+GC+GGLMD AF YI +N G+ TE YPY + +C K ++ AT + D+P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNK-GSIGATDRGFVDIP 239
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNADC--GNNCDHGVAVVGFGTAEE 302
+G+E+ + +AV+ PV+V +DAS +F FY GV N N DHGV VVGFGT +
Sbjct: 240 QGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGT--D 297
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
E+G YWL+KNSWG TWG+ G+I++LR+ CGIA+A+SYP+
Sbjct: 298 ESGEDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340
>gi|179957|gb|AAC37592.1| cathepsin S [Homo sapiens]
Length = 331
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ + C K AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDLKCQ-YDSKYRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E L +AV+N+ PVSV VDA +F Y+SGV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|297830594|ref|XP_002883179.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
gi|297329019|gb|EFH59438.1| hypothetical protein ARALYDRAFT_318695 [Arabidopsis lyrata subsp.
lyrata]
Length = 308
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 137/310 (44%), Positives = 195/310 (62%), Gaps = 26/310 (8%)
Query: 42 HEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
+E+W+ ++ + Y EK R IFK+NL++I++ N N+T+++G F+DLTN+E
Sbjct: 2 YERWLVENRKNYNGLGEKERRCKIFKENLKFIDEHNSLPNQTFEVGLTRFADLTNDE--- 58
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
P ++ R + Y+ +P IDWR KGAV +KDQG CGSCWAFSAV
Sbjct: 59 ---------PKDFMKADR---YLYKEGDILPDEIDWRAKGAVVPVKDQGNCGSCWAFSAV 106
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
AVEGI QI G+LI LS+Q+L+DC N GC GG+M+ AFE+II N G+ ++ DYPY
Sbjct: 107 GAVEGINQIKTGELISLSDQELIDCDRGFVNAGCEGGVMNYAFEFIINNGGIESDQDYPY 166
Query: 220 RHEE-GTCD-NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKS 277
+ G C+ ++K I YE + + DE++L +AV++QPV V ++AS +AF YKS
Sbjct: 167 TATDLGVCNADKKNNTRVVKIDGYEYVAQNDEKSLKKAVAHQPVGVAIEASSQAFKLYKS 226
Query: 278 GVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GL 333
GV CG DHGV VVG+GT+ +G YW+I+NSWG WGE+GY+++ R+ G
Sbjct: 227 GVFTGTCGIYLDHGVVVVGYGTS---SGEDYWIIRNSWGLNWGENGYVKLQRNIDDSFGK 283
Query: 334 CGIATAASYP 343
CG+A SYP
Sbjct: 284 CGVAMMPSYP 293
>gi|244539471|dbj|BAH82657.1| cysteine protease [Lotus japonicus]
Length = 286
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 133/249 (53%), Positives = 176/249 (70%), Gaps = 6/249 (2%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++E E WM++HG+ Y+ EK +R IFK NL++I++ NK + Y LG NEF+DL++
Sbjct: 4 LIELFESWMSRHGKIYESIEEKLLRFEIFKDNLKHIDETNKVVS-NYWLGLNEFADLSHH 62
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
EF+ Y G S R+SS F Y++V D+P S+DWR+KGAVT+IK+QG CGSCWA
Sbjct: 63 EFKKQYLGLKVDF-STRRESSEE--FTYRDV-DLPKSVDWRKKGAVTNIKNQGSCGSCWA 118
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCS-TDNHGCSGGLMDKAFEYIIENKGLATEAD 216
FS VAAVEGI QI G L LSEQ+L+DC T N GC+GGLMD AF +I+EN GL E D
Sbjct: 119 FSTVAAVEGINQIVTGNLTSLSEQELIDCDRTYNSGCNGGLMDYAFSFIVENGGLHKEDD 178
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
YPY EEGTC+ KE++ TIS Y D+P+ +EQ+LL+A++NQP+SV ++ASGR F FY
Sbjct: 179 YPYIMEEGTCEMSKEESQVVTISGYHDVPQNNEQSLLKALANQPLSVAIEASGRDFQFYS 238
Query: 277 SGVLNADCG 285
GV + CG
Sbjct: 239 GGVFDGHCG 247
>gi|224079085|ref|XP_002305743.1| predicted protein [Populus trichocarpa]
gi|222848707|gb|EEE86254.1| predicted protein [Populus trichocarpa]
Length = 494
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 194/318 (61%), Gaps = 12/318 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI-EKANKEGNRTYKLGTNEFSD 93
+ SI+E +QW +H + YK E R FK+NL+YI EK KE +++G N+F+D
Sbjct: 36 DESIIEIFQQWRDRHQKAYKHAEEAEKRFGNFKRNLKYIIEKTGKETTLRHRVGLNKFAD 95
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
L+NEEF+ LY + + +R + + + D P+S+DWR+KG VT +KDQG CG
Sbjct: 96 LSNEEFKQLYLSKVKKPINKTRIDAEDRSRRNLQSCDAPSSLDWRKKGVVTAVKDQGDCG 155
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
SCW+FS A+EGI I LI LSEQ+LVDC T N+GC GG MD AFE++I N G+ T
Sbjct: 156 SCWSFSTTGAIEGINAIVTSDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVINNGGIDT 215
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
EA+YPY +GTC+ KE+ +I Y+D+ + D ALL A + QP+SV +D S F
Sbjct: 216 EANYPYTGVDGTCNTAKEEIKVVSIDGYKDVDETD-SALLCAAAQQPISVGIDGSAIDFQ 274
Query: 274 FYKSGVL---NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
Y G+ +D ++ DH V +VG+G+ ENG YW++KNSWG +WG GY I R+
Sbjct: 275 LYTGGIYDGDCSDDPDDIDHAVLIVGYGS---ENGEDYWIVKNSWGTSWGIEGYFYIKRN 331
Query: 331 A----GLCGIATAASYPV 344
G+C I ASYP
Sbjct: 332 TDLPYGVCAINAMASYPT 349
>gi|159479072|ref|XP_001697622.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
gi|158274232|gb|EDP00016.1| cysteine endopeptidase [Chlamydomonas reinhardtii]
Length = 469
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 137/311 (44%), Positives = 197/311 (63%), Gaps = 12/311 (3%)
Query: 43 EQWMAQHGRTY-KDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
++W H R+Y D E R ++ +NLEY+ N ++ L N +DL+ E+++
Sbjct: 14 KEWAQTHSRSYVNDVAEFENRFKVWLENLEYVLAYNAR-TTSHWLTLNHLADLSTPEYKS 72
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
G++ V+R + + F+Y++V +P +IDWR+K AV +K+QGQCGSCWAF+
Sbjct: 73 KLLGFDNQA-RVARNKLK-TGFRYEDVDAEALPPAIDWRKKNAVAEVKNQGQCGSCWAFA 130
Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYP 218
+VEGI I G L+ LSEQ+LVDC T+ + GCSGGLMD A+ +II+NKG+ TE DYP
Sbjct: 131 TTGSVEGINAIVTGSLVSLSEQELVDCDTEQDKGCSGGLMDYAYAWIIKNKGINTEEDYP 190
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSG 278
Y +G CD K K TI YED+P+ DE AL +A ++QPV+V ++A ++F Y G
Sbjct: 191 YTAMDGQCDVAKMKRRVVTIDSYEDVPENDEVALKKAAAHQPVAVAIEADAKSFQLYGGG 250
Query: 279 VL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GL 333
V + CG + +HGV VVG+G +G+ YW++KNSWG WG++GYIR+ + GL
Sbjct: 251 VYDDPTCGTSLNHGVLVVGYGKDVTGSGSNYWIVKNSWGAEWGDAGYIRLKMGSTDAEGL 310
Query: 334 CGIATAASYPV 344
CGIA A SYPV
Sbjct: 311 CGIAMAPSYPV 321
>gi|194701748|gb|ACF84958.1| unknown [Zea mays]
gi|414589103|tpg|DAA39674.1| TPA: thiol protease SEN102 [Zea mays]
Length = 374
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 143/338 (42%), Positives = 193/338 (57%), Gaps = 29/338 (8%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR---TYKLGT 88
S + S++E+ ++W A + ++Y E+ R + +N+ YIE N E TY+LG
Sbjct: 40 STDDSSMIERFQRWKAAYNKSYATVAEERRRFRVCARNMAYIEATNAEAEAAGLTYELGE 99
Query: 89 NEFSDLTNEEFRALYTGYNRPVPS---------------VSRQSSRPSTFK-YQNV-TDV 131
++DLTN+EF A+YT P P+ V P Y N+ T
Sbjct: 100 TAYTDLTNQEFMAMYTA---PAPAQLPADESVITTRAGPVDAVGGAPGQLPVYVNLSTSA 156
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH 191
P S+DWR GAVT +K+QG+CGSCWAFS VA VEGI QI GKL+ LSEQ+LVDC T +
Sbjct: 157 PASVDWRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDD 216
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GG+ +A +I N G+ TE DYPY C+ K A +I+ + E +
Sbjct: 217 GCDGGISYRALRWIASNGGITTETDYPYTGTTDACNRAKLSHNAVSIAGLRRVATRSEAS 276
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
L AV+ QPV+V ++A G F YK GV N CG N +HGV VVG+G E G +YW++
Sbjct: 277 LANAVAGQPVAVSIEAGGDNFQHYKKGVYNGPCGTNLNHGVTVVGYGQ-EAAGGDRYWIV 335
Query: 312 KNSWGETWGESGYIRILRDA-----GLCGIATAASYPV 344
KNSWG+ WG+ GYIR+ +D GLCGIA SYP+
Sbjct: 336 KNSWGQGWGDDGYIRMKKDVAGKPEGLCGIAIRPSYPL 373
>gi|332220183|ref|XP_003259237.1| PREDICTED: cathepsin S isoform 1 [Nomascus leucogenys]
Length = 331
Score = 266 bits (681), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKWLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 QILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ + C K AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
E L +AV+N+ PVSV VDAS +F Y+SGV C N +HGV VVG+G +
Sbjct: 232 YSREDVLKEAVANKGPVSVGVDASHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|12803615|gb|AAH02642.1| Cathepsin S [Homo sapiens]
gi|49456313|emb|CAG46477.1| CTSS [Homo sapiens]
gi|60821573|gb|AAX36579.1| cathepsin S [synthetic construct]
gi|189069420|dbj|BAG37086.1| unnamed protein product [Homo sapiens]
gi|261858586|dbj|BAI45815.1| cathepsin S [synthetic construct]
Length = 331
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 WILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ + C K AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E L +AV+N+ PVSV VDA +F Y+SGV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|156739275|ref|NP_001096585.1| cathepsin L1-like precursor [Danio rerio]
gi|156230123|gb|AAI52285.1| MGC174857 protein [Danio rerio]
Length = 335
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 206/340 (60%), Gaps = 15/340 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y ++LE R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDLEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR GY +R S P F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGP-LFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GG+MD+AF+Y+ ENKGL +E YPY + + A I+ + D+P+G+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFG-TAEEENG 305
E AL+ AV+ PVSV +DAS ++ FY+SG+ C + DH V VVG+G + G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+YW++KNSW + WG+ GYI + +D CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|179959|gb|AAA35655.1| cathepsin [Homo sapiens]
gi|248406|gb|AAB22005.1| cathepsin S [Homo sapiens]
Length = 331
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 151/341 (44%), Positives = 211/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKRLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLTSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVTLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ + C K AAT SKY +LP
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E L +AV+N+ PVSV VDA +F Y+SGV C N +HGV VVG+G +
Sbjct: 232 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 NGKEYWLVKNSWGHNFGEEGYIRMARNKGNHCGIASFPSYP 329
>gi|326672297|ref|XP_003199631.1| PREDICTED: cathepsin L1-like [Danio rerio]
Length = 336
Score = 266 bits (681), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 144/341 (42%), Positives = 206/341 (60%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR GY Q+S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHD----PNQTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPH 174
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I+ + D+PKG+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGN 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEEN 304
E AL+ AV+ PVSV +DAS ++ FY+SG+ A + DH V VVG+G +
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G +YW++KNSW + WG+ GYI + +D CGIAT ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335
>gi|125811033|ref|XP_001361727.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
gi|54636904|gb|EAL26307.1| GA25021 [Drosophila pseudoobscura pseudoobscura]
Length = 341
Score = 266 bits (680), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 206/343 (60%), Gaps = 19/343 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
++ +L + +Q VS + I E+ + +H + Y+DE E+ RL IF +N I
Sbjct: 5 LILPLLALVAVAQAVS----YAEVIQEEWHTFKLEHRKNYQDETEERFRLKIFNENKHKI 60
Query: 74 EKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQN 127
K N+ G ++K+ N+++D+ + EF + G+N + R +S + TF
Sbjct: 61 AKHNQLWATGAVSFKMAVNKYADMLHHEFYSTMNGFNYTLHKQLRNADESFKGVTFISPE 120
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P +DWR KGAVT +KDQG CGSCWAFS+ A+EG G L+ LSEQ LVDCS
Sbjct: 121 HVTLPKQVDWRTKGAVTDVKDQGHCGSCWAFSSTGALEGQHYRKSGVLVSLSEQNLVDCS 180
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
T N+GC+GGLMD AF YI +N G+ TE YPY + +C K + AT + D+P
Sbjct: 181 TKYGNNGCNGGLMDNAFRYIKDNGGIDTEKSYPYEAIDDSCHFNK-GTIGATDRGFVDIP 239
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNADC--GNNCDHGVAVVGFGTAEE 302
+G+E+ + +AV+ PV+V +DAS +F FY GV N N DHGV VVGFGT +
Sbjct: 240 QGNEKKMAEAVATIGPVAVAIDASHESFQFYSEGVYNEPACDAQNLDHGVLVVGFGT--D 297
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
E+G YWL+KNSWG TWG+ G+I++LR+ CGIA+A+SYP+
Sbjct: 298 ESGQDYWLVKNSWGTTWGDKGFIKMLRNKENQCGIASASSYPL 340
>gi|151573014|gb|ABS17682.1| cathepsin L-1 [Artemia salina]
Length = 334
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 205/339 (60%), Gaps = 15/339 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+I L+ Q+ + S+ E H + A H + Y +LE+ R+ I+ +N +
Sbjct: 1 TLIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVA 59
Query: 75 KAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
K N ++G ++Y + N+F DL + EFR++ GY + SR S + + NVT V
Sbjct: 60 KHNILYEKGEKSYHVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANVT-V 118
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
P S+DWREKGA+T +KDQGQCGSCWAFS+ A+EG T GKL+ LSEQ L+DCS
Sbjct: 119 PESVDWREKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLVSLSEQNLIDCSGKYG 178
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N GC+GGLMD+AF+YI +NKG+ TE YPY E+ C + A + D+P G+E
Sbjct: 179 NEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVC-RYNPRNRGAVDRGFVDIPSGEE 237
Query: 250 QALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAEEENGA 306
L AV+ PVSV +DAS +F FY GV C ++ DHGV VVG+G+ +NG
Sbjct: 238 DKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS---DNGK 294
Query: 307 KYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
YWL+KNSW E WG+ GYI++ R+ CG+A+AASYP+
Sbjct: 295 DYWLVKNSWSEHWGDEGYIKMARNRKNHCGVASAASYPL 333
>gi|449500383|ref|XP_004161083.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 345
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 140/315 (44%), Positives = 202/315 (64%), Gaps = 10/315 (3%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ + +E+W +H ++ EK R ++FK+N+ ++ N+ ++ YKL N+F+D+
Sbjct: 34 EESLWQLYERW-GKHHTISRNLKEKHKRFSVFKENVNHVFTVNQM-DKPYKLKLNKFADM 91
Query: 95 TNEEFRALYTGYN-RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
+N EF Y N + + F Y+ TD+P+S+D RE+GAV +K+QG+CG
Sbjct: 92 SNYEFVNFYARSNISHYRKLHERRRGAGGFMYEQDTDLPSSVDGRERGAVNAVKEQGRCG 151
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
SCWAFS+VAAVEGI +I +L+ LSEQ+L+DC+ N GC+GG M+ AF++I N G+AT
Sbjct: 152 SCWAFSSVAAVEGINKIKTNQLLSLSEQELLDCNYRNKGCNGGFMEIAFDFIKRNGGIAT 211
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFH 273
E YPY G C + + + I YE +P+ +E AL+QAV+NQPVSV +DA+GR F
Sbjct: 212 ENSYPYHGSRGLCRSSRISSPIVKIDGYESVPE-NEDALMQAVANQPVSVAIDAAGRDFQ 270
Query: 274 FYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-- 331
FY GV + CG +HGV +G+GT E+ G YWL++NSWG WGE GY+R+ R
Sbjct: 271 FYSQGVFDGYCGTELNHGVVAIGYGTTED--GTDYWLVRNSWGVGWGEDGYVRMKRGVEQ 328
Query: 332 --GLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 329 AEGLCGIAMEASYPI 343
>gi|350412176|ref|XP_003489564.1| PREDICTED: cathepsin L-like [Bombus impatiens]
Length = 343
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/348 (42%), Positives = 212/348 (60%), Gaps = 24/348 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ +F+++I+ I +Q +S + + ++ + +H + YK+++E+ R+ IF N
Sbjct: 1 MKLFLLLIVAILATAQAISFFEL----VNQEWTTFKMEHNKVYKNDIEERFRMKIFMDNK 56
Query: 71 EYIEKANKEGNR-----TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP---ST 122
I K N GN +YKL N++ D+ + EF G+N+ + + R P S
Sbjct: 57 HKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIGASF 114
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
+ NV +P ++DWRE GAVT +KDQG CGSCW+FSA A+EG G LI LSEQ
Sbjct: 115 IEPANVV-LPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQN 173
Query: 183 LVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
L+DCS N+GC+GGLMD+AF+YI +NKGL TE YPY E C + A +
Sbjct: 174 LIDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVG- 232
Query: 241 YEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGF 297
Y D+P+G+E+ L AV+ PVSV +DAS ++F FY GV +C + N DHGV VG+
Sbjct: 233 YVDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGY 292
Query: 298 GTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
GT +ENG YWL+KNSWGETWG++GYI++ R+ CGIA+ ASYP+
Sbjct: 293 GT--DENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPL 338
>gi|296228726|ref|XP_002759933.1| PREDICTED: cathepsin S isoform 1 [Callithrix jacchus]
Length = 330
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 149/339 (43%), Positives = 210/339 (61%), Gaps = 19/339 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++ ++ C+S V + + +P++ W +G+ YK++ E+A+R I+++NL++
Sbjct: 1 MKQLVCVLFVCSSAV--AQLLKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLKF 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPNQ 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCS
Sbjct: 114 MLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSEK 173
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GG M +AF+YII+NKG+ +EA YPY+ + C K AAT SKY +LP G
Sbjct: 174 YGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKAMDQKCQ-YDSKYRAATCSKYTELPYG 232
Query: 248 DEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENG 305
E L +AV+N+ PV V VDAS +F Y+SGV + C N +HGV V+G+G + NG
Sbjct: 233 REDVLKEAVANKGPVCVGVDASHSSFFLYRSGVYYDPACTQNVNHGVLVIGYG---DLNG 289
Query: 306 AKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
+YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 290 EEYWLVKNSWGSNFGERGYIRMARNKGNHCGIASYPSYP 328
>gi|449530091|ref|XP_004172030.1| PREDICTED: vignain-like [Cucumis sativus]
Length = 351
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 135/355 (38%), Positives = 217/355 (61%), Gaps = 20/355 (5%)
Query: 2 VLKFEKSFIIPMFVIIILVITCASQVVSGRSMH-EPSIVEKHEQWMAQHGRTYKDELEKA 60
V+KF I+P+ +I L C S + + E S+++ +++W + H R ++ E
Sbjct: 3 VMKF---LIVPLVLIAFLCNICESFELERKDFESEKSLMQLYKRWSSHH-RISRNANEMH 58
Query: 61 MRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTG---YNRPVPS--VSR 115
R +FK N +++ K N G ++ KL N+F+D++++EFR +Y+ Y + + + +
Sbjct: 59 NRFKVFKNNAKHVFKVNLMG-KSLKLKLNQFADMSDDEFRNMYSSNITYYKDLHAKKIEA 117
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
R F Y++ ++P+SIDWR+KGAV IK+QG+CGSCWAF+AVAAVE I QI +L
Sbjct: 118 TGGRIGGFMYEHANNIPSSIDWRKKGAVNAIKNQGRCGSCWAFAAVAAVESIHQIKTNEL 177
Query: 176 IELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVA 235
+ LSE++++DC + GC GG + AFE++++N G+ E +YPY G C + +
Sbjct: 178 VSLSEEEVLDCDYRDGGCRGGFYNSAFEFMMDNDGVTIEDNYPYYEGNGYCRRRGGRNKR 237
Query: 236 ATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVA 293
I YE++P+ +E AL++AV++QPV+V + + G F FY G+ N CG N DH V
Sbjct: 238 VRIDGYENVPRNNEYALMKAVAHQPVAVAIASGGSDFKFYGGGMFTENDFCGFNIDHTVV 297
Query: 294 VVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
VVG+GT E+ YW+I+N +G WG +GY+++ R A G+CG+A +YPV
Sbjct: 298 VVGYGTDED---GDYWIIRNQYGHRWGMNGYMKMQRGAHSPQGVCGMAMQPAYPV 349
>gi|125604306|gb|EAZ43631.1| hypothetical protein OsJ_28254 [Oryza sativa Japonica Group]
Length = 369
Score = 266 bits (680), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 148/316 (46%), Positives = 184/316 (58%), Gaps = 25/316 (7%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E ++ E +E+W QH R +D EKA R N+FK N+ I + N+ + YKL N F D+
Sbjct: 41 EEALWELYERWRGQH-RVARDLGEKARRFNVFKDNVRLIHEFNRR-DEPYKLRLNRFGDM 98
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
T +E Y SSR S + R GAV +KDQGQCGS
Sbjct: 99 TADESAGAYA------------SSRVSHHRMFRGRGEKAQ---RLHGAVGAVKDQGQCGS 143
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
CWAFS +AAVEGI I L LSEQQLVDC T N GC GGLMD AF+YI ++ G+A
Sbjct: 144 CWAFSTIAAVEGINAIRTSNLTALSEQQLVDCDTKTGNAGCDGGLMDNAFQYIAKHGGVA 203
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
+ YPYR + +C + + A TI YED+P E AL +AV+NQPVSV ++A G F
Sbjct: 204 ASSAYPYRARQSSCKSSAASSPAVTIDGYEDVPANSESALKKAVANQPVSVAIEAGGSHF 263
Query: 273 HFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
FY GV CG DHGVA VG+GT + G KYW+++NSWG WGE GYIR+ RD
Sbjct: 264 QFYSEGVFAGKCGTELDHGVAAVGYGTTVD--GTKYWIVRNSWGADWGEKGYIRMKRDVS 321
Query: 332 ---GLCGIATAASYPV 344
GLCGIA ASYP+
Sbjct: 322 AKEGLCGIAMEASYPI 337
>gi|156739289|ref|NP_001096592.1| uncharacterized protein LOC569326 precursor [Danio rerio]
gi|156230119|gb|AAI52283.1| Im:6910535 protein [Danio rerio]
Length = 335
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 140/340 (41%), Positives = 208/340 (61%), Gaps = 15/340 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ +L+ C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLITLCISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR GY + ++S+ + F +
Sbjct: 59 EQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKQD----PNRTSKGALFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GG+MD+AF+Y+ ENKGL +E YPY + + A I+ + D+P+G+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFG-TAEEENG 305
E AL+ AV+ PVSV +DAS ++ FY+SG+ C + DH V VVG+G + G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+YW++KNSW + WG+ GYI + +D CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|225706370|gb|ACO09031.1| Cathepsin L precursor [Osmerus mordax]
Length = 337
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/341 (42%), Positives = 205/341 (60%), Gaps = 17/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+++LV+ + + + R + E + W + H + Y+ E E+ R ++++NL+ I
Sbjct: 3 LYLVVLVLCTGAALAAPR--FDAQFDEHWDLWKSWHSKNYQHEKEEGWRRMVWEKNLKKI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +Y LG N F D+TNEEFR + GY + ++ + S F N +
Sbjct: 61 EMHNLEHSLGKHSYSLGMNHFGDMTNEEFRQVMNGY-----KLQQRKFKGSLFLEPNNME 115
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P +DWRE+G VT +KDQGQCGSCWAFS A+EG KL+ LSEQ LVDCS
Sbjct: 116 APKQVDWREEGYVTPVKDQGQCGSCWAFSTTGAMEGQMFRKTQKLVSLSEQNLVDCSRPE 175
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD+AF+YI +N GL +E YPY + N K + AA + + D+P G
Sbjct: 176 GNEGCNGGLMDQAFQYIQDNSGLDSEEAYPYLGTDDQPCNYKAEFSAANDTGFMDIPSGK 235
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
E AL++A+++ PVSV +DA +F FY+SG+ +C + DHGV VG+G E+ +
Sbjct: 236 EHALMKAIASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDVD 295
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
G KYW++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 296 GKKYWIVKNSWSEKWGDKGYILMAKDRKNHCGIATAASYPL 336
>gi|442754503|gb|JAA69411.1| Putative cathepsin l-like cysteine proteinase b [Ixodes ricinus]
Length = 335
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 144/309 (46%), Positives = 197/309 (63%), Gaps = 17/309 (5%)
Query: 47 AQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALY 103
A+HG++Y E E+ RL I+ +N I K N++ G Y + NEF D+ + EF +
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
G+ R R+ S + + +N+ D +P ++DWR KGAVT +K+QGQCGSCWAFSA
Sbjct: 92 NGFKRNYKDQPREGS--TYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSAT 149
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
++EG G ++ LSEQ LVDCSTD N+GC GGLMD AF+YI NKG+ TE YPY
Sbjct: 150 GSLEGQHFRKSGSMVSLSEQNLVDCSTDFGNNGCEGGLMDNAFKYIRANKGIDTEKSYPY 209
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSG 278
+GTC + K+ V AT S + D+ +G E L +AV+ P+SV +DAS +F FY G
Sbjct: 210 NGTDGTC-HFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268
Query: 279 VLN-ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCG 335
V + +C + + DHGV VVG+GT NG YWL+KNSWG TWG+ GYIR+ R+ CG
Sbjct: 269 VYDEPECDSESLDHGVLVVGYGTL---NGTDYWLVKNSWGTTWGDEGYIRMSRNKKNQCG 325
Query: 336 IATAASYPV 344
IA++ASYP+
Sbjct: 326 IASSASYPL 334
>gi|242048430|ref|XP_002461961.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
gi|241925338|gb|EER98482.1| hypothetical protein SORBIDRAFT_02g011230 [Sorghum bicolor]
Length = 380
Score = 266 bits (679), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 143/333 (42%), Positives = 197/333 (59%), Gaps = 27/333 (8%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR---TYKLGTNEFSDL 94
++E+ ++W A + ++Y E R ++ +N+ YIE N E TY+LG ++DL
Sbjct: 48 MIERFQRWKAAYNKSYATVAEDRRRFLVYARNMAYIEATNAEAEAAGLTYELGETAYTDL 107
Query: 95 TNEEFRALYTGYNRP--VPSVSRQ--------SSRPSTFK-------YQNV-TDVPTSID 136
TN+EF A+YT P +P+ + ++R Y N+ T P S+D
Sbjct: 108 TNQEFMAMYTAAPSPAQLPADEDEDDAAEAVITTRAGPVDAVGQLPVYVNLSTAAPASVD 167
Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG 196
WR GAVT +K+QG+CGSCWAFS VA VEGI QI GKL+ LSEQ+LVDC T + GC GG
Sbjct: 168 WRASGAVTPVKNQGRCGSCWAFSTVAVVEGIYQIRTGKLVSLSEQELVDCDTLDAGCDGG 227
Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
+ +A +I N GL TE DYPY C+ K AA+I+ + E +L AV
Sbjct: 228 ISYRALRWITSNGGLTTEEDYPYTGTTDACNRAKLAHNAASIAGLRRVATRSEASLANAV 287
Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
+ QPV+V ++A G F YK GV N CG + +HGV VVG+G EEE+G KYW+IKNSWG
Sbjct: 288 AGQPVAVSIEAGGDNFQHYKRGVYNGPCGTSLNHGVTVVGYGQ-EEEDGDKYWIIKNSWG 346
Query: 317 ETWGESGYIRILRDA-----GLCGIATAASYPV 344
+WG+ GYI++ +D GLCGIA S+P+
Sbjct: 347 ASWGDGGYIKMRKDVAGKPEGLCGIAIRPSFPL 379
>gi|157644745|gb|ABV59078.1| cathepsin L [Lates calcarifer]
Length = 337
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 146/345 (42%), Positives = 211/345 (61%), Gaps = 19/345 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++P+ V+ + C S +S S+ +P + + + W + H + Y E E+ R ++++N
Sbjct: 1 MLPLAVLAV----CLSAALSAPSL-DPQLDDHWDLWKSWHSKKYH-EKEEGWRRMVWEKN 54
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
L+ IE N E G Y+LG N F D+T+EEFR + GY + + + + S F
Sbjct: 55 LKKIELHNLEHSMGKHPYRLGMNHFGDMTHEEFRQIMNGYKQ---RKTERKFKGSLFMEP 111
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
N + P ++DWR+KG VT +KDQGQCGSCWAFS A+EG GKL+ LSEQ LVDC
Sbjct: 112 NFLEAPRALDWRDKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDC 171
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N GC+GGLMD+AF+Y+ +N+GL +E YPY + + +A + + D+
Sbjct: 172 SRPEGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPNYNSANDTGFVDV 231
Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TA 300
P G E+AL++AV+ PVSV +DA +F FY+SG+ DC + DHGV VVG+G
Sbjct: 232 PSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEELDHGVLVVGYGYEG 291
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
E+ +G KYW++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 292 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 336
>gi|157787177|ref|NP_001099150.1| cathepsin L1-like precursor [Danio rerio]
gi|157422879|gb|AAI53505.1| MGC174152 protein [Danio rerio]
Length = 336
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 143/341 (41%), Positives = 206/341 (60%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR GY Q+S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRHAMNGYKHD----PNQTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I+ + D+P+G+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEEN 304
E AL+ AV+ PVSV +DAS ++ FY+SG+ A + DH V VVG+G +
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G +YW++KNSW + WG+ GYI + +D CGIAT ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 335
>gi|162463334|ref|NP_001104878.1| maize insect resistance2 precursor [Zea mays]
gi|2425064|gb|AAB88262.1| cysteine proteinase Mir2 [Zea mays]
Length = 493
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 138/296 (46%), Positives = 185/296 (62%), Gaps = 21/296 (7%)
Query: 62 RLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNR-----PVPSV 113
RL +F+ NL YI+ N E G ++LG F+DLT EE+RA +R V V
Sbjct: 92 RLEVFRDNLRYIDAHNAEADAGLHGFRLGLTRFADLTLEEYRARLLLGSRGRNGTAVGVV 151
Query: 114 SRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRG 173
R+ P + +P ++DWRE+GAV +KDQGQCG CWAFSAVAAVEGI +I G
Sbjct: 152 GRRRYLPLAGE-----QLPDAVDWRERGAVAEVKDQGQCGGCWAFSAVAAVEGINKIVTG 206
Query: 174 KLIELSEQQLVDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEK 232
LI LSEQ+L+DC + GC GGLMD AF ++I+N G+ TEADYP+ +GTCD + +
Sbjct: 207 SLISLSEQELIDCDKFQDQGCDGGLMDNAFVFMIKNGGIDTEADYPFTGHDGTCDLKLKN 266
Query: 233 AVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGV 292
+I +E +P E+AL +AV++QPVS ++AS RAF Y SG+ + CG DHGV
Sbjct: 267 TRVVSIDSFERVPINYERALQKAVAHQPVSASIEASRRAFQLYSSGIFDGRCGTYLDHGV 326
Query: 293 AVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAGL----CGIATAASYPV 344
VVG+G+ E G YW++KNSWG WGE+GY+R+ R+ + GIA YPV
Sbjct: 327 TVVGYGS---EGGKDYWIVKNSWGTQWGEAGYVRMARNVRVRPPSAGIAMEPLYPV 379
>gi|380236892|emb|CBK52289.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 148/340 (43%), Positives = 200/340 (58%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++LV C V +M EP + + W HG+ Y+ E+E R ++++NL
Sbjct: 9 MLGSLMLVSLC----VGAAAMFEPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLML 64
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I N E G TY+L N DLT EE + + P + R +S F
Sbjct: 65 ITMHNLEASMGLHTYELSMNHMGDLTQEEIMQSFATLSPPT-DIQRAAS---PFAGTTGA 120
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
DVP ++DWREKG VT +K QG CGSCWAFSA A+EG T GKL++LS Q LVDCST
Sbjct: 121 DVPDTMDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTK 180
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
NHGC+GGLM AF+Y+I+N+G+ ++A YPY G C K AA S+Y LP+G
Sbjct: 181 YGNHGCNGGLMHHAFQYVIDNQGIDSDASYPYTGRNGEC-RYNSKFRAANCSQYSFLPEG 239
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENG 305
+E AL +A++N P+SV +DA+ F FY+SGV N +C +HGV VG+GT + G
Sbjct: 240 NEGALKEALANIGPISVAIDATRPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTLD---G 296
Query: 306 AKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
YWL+KNSWG+T+G+ GYIR+ R+ CGIA YP+
Sbjct: 297 QDYWLVKNSWGKTFGDQGYIRMSRNKNDQCGIALYGCYPI 336
>gi|151176971|gb|ABR88030.1| digestive cysteine protease [Dermestes frischii]
Length = 339
Score = 265 bits (678), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 151/347 (43%), Positives = 205/347 (59%), Gaps = 27/347 (7%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQW---MAQHGRTYKDELEKAMRLNIFKQNL 70
F ++ LV +Q VS + + EQW QH + YK + E+ R+ IF +N
Sbjct: 3 FFVLALVFIVGAQAVSFFDLVQ-------EQWGTFKLQHKKQYKSDTEEKFRMKIFMENS 55
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNR----PVPSVSRQSSRPSTF 123
+ K NK G +YKL N+++D+ + EF G+NR P+ S + + +TF
Sbjct: 56 HKVAKXNKLYEMGLVSYKLKINKYADMLHHEFVHTVNGFNRTKNTPLLGTS-EDEQGATF 114
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
P ++DWRE GAVT +KDQG CGSCW+FSA A+EG KL+ LSEQ L
Sbjct: 115 IAPANVKFPENVDWREHGAVTXVKDQGHCGSCWSFSATGALEGQHFRKTNKLVSLSEQNL 174
Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
VDCST N GC+GGLMD AF+Y+ N G+ TEA YPY ++ C + K AT +
Sbjct: 175 VDCSTKFGNDGCNGGLMDNAFKYVKYNHGIDTEASYPYHADDEKC-HYNPKTSGATDRGF 233
Query: 242 EDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG 298
D+P GDE+ L+ AV+ PVSV +DAS +F Y GV + +C + DHGV VVG+G
Sbjct: 234 VDIPTGDEEKLMAAVATVGPVSVAIDASHESFQLYSEGVYYDPECSSEELDHGVLVVGYG 293
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
T +ENG YW++KNSWGE+WGE GYI++ R+ CGIAT ASYP+
Sbjct: 294 T--DENGQDYWIVKNSWGESWGEQGYIKMARNRDNNCGIATQASYPL 338
>gi|161408097|dbj|BAF94152.1| cathepsin L-like cysteine protease 2 [Plautia stali]
Length = 334
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 151/343 (44%), Positives = 210/343 (61%), Gaps = 29/343 (8%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
+LV++C + S + S ++ + H + Y +ELE++ R IF +N + IEK N
Sbjct: 4 LLVLSCLIALGQAVSFFDLS-ADEFTLFKKFHRKEYDNELEESYRKKIFLENKKRIEKHN 62
Query: 78 ---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
K+G ++KL N +D+ E+ +Y G+N+ SS+ + K Q+ T +P +
Sbjct: 63 SRYKQGKVSFKLKLNHLADMLIHEYSDVYLGFNK--------SSKANNNKLQSYTFIPPA 114
Query: 135 -------IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+DWR KGAVT +K+QG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 115 HVTLNKEVDWRTKGAVTPVKNQGHCGSCWAFSTTGALEGQNFRKTGKLVSLSEQNLVDCS 174
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N+GC GGLMD AF+YI EN G+ TE YPY E+ TC +K ++ AT S + D+
Sbjct: 175 GSYGNNGCEGGLMDNAFQYIKENHGIDTEKSYPYEGEDETCRFRK-TSIGATDSGFVDIT 233
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEE 302
+GDE+AL+QAV+ P+SV +DAS ++F FY GV +C + N DHGV VVG+G
Sbjct: 234 QGDEEALMQAVATIGPISVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLVVGYGV--- 290
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
E+ KYWL+KNSWG WG+ GYI++ RD CGIAT ASYP+
Sbjct: 291 EDNQKYWLVKNSWGTQWGDGGYIKMARDQDNNCGIATQASYPL 333
>gi|156739281|ref|NP_001096588.1| cathepsin L1-like precursor [Danio rerio]
gi|166158351|ref|NP_001107526.1| uncharacterized protein LOC100135391 precursor [Xenopus (Silurana)
tropicalis]
gi|326672305|ref|XP_003199634.1| PREDICTED: cathepsin L1-like [Danio rerio]
gi|156230096|gb|AAI52237.1| MGC174155 protein [Danio rerio]
gi|163916362|gb|AAI57707.1| LOC100135391 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 206/340 (60%), Gaps = 15/340 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR GY ++S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD----PNRTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GG+MD+AF+Y+ ENKGL +E YPY + + A I+ + D+P+G+
Sbjct: 175 GNQGCNGGIMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFG-TAEEENG 305
E AL+ AV+ PVSV +DAS ++ FY+SG+ C + DH V VVG+G + G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSRLDHAVLVVGYGYQGADVAG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+YW++KNSW + WG+ GYI + +D CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|4469155|emb|CAB38315.1| chymopapain isoform III [Carica papaya]
Length = 361
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 137/345 (39%), Positives = 201/345 (58%), Gaps = 14/345 (4%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEK----HEQWMAQHGRTYKDELEKAMR 62
K + +II + ++ A G S + + +E+ + WM +H + Y+ EK R
Sbjct: 9 KIIFLATCLIIHMGLSSADFYTVGYSQDDLTSIERLIQLFDSWMLKHNKIYESIDEKIYR 68
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
IF+ NL YI++ NK+ N +Y LG N F+DL+N+EF+ Y G+ +
Sbjct: 69 FEIFRDNLMYIDETNKK-NNSYWLGLNGFADLSNDEFKKKYVGF-VAEDFTGLEHFDNED 126
Query: 123 FKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQ 182
F Y++VT+ P SIDWR KGAVT +K+QG CGSCWAFS +A VEGI +I G L+ELSEQ+
Sbjct: 127 FTYKHVTNYPQSIDWRAKGAVTPVKNQGACGSCWAFSTIATVEGINKIVTGNLLELSEQE 186
Query: 183 LVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
LVDC ++GC GG + +Y + N G+ T YP + ++ C + I+ Y+
Sbjct: 187 LVDCDKHSYGCKGGYQTTSLQY-VANNGVHTSKVYPCQAKQYKCRATDKPGPKVKITGYK 245
Query: 243 DLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEE 302
+P E + L A++NQP+S V+A G+ F YKSGV + CG DH V VG+GT++
Sbjct: 246 RVPSNCETSFLGALANQPLSFLVEAGGKPFQLYKSGVFDGPCGTKLDHAVTAVGYGTSDG 305
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA----GLCGIATAASYP 343
+N Y +IKNSWG WGE GY+R+ R + G CG+ ++ YP
Sbjct: 306 KN---YIIIKNSWGPNWGEKGYMRLKRQSGNSQGTCGVYKSSYYP 347
>gi|340371596|ref|XP_003384331.1| PREDICTED: digestive cysteine proteinase 2-like [Amphimedon
queenslandica]
Length = 327
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 140/334 (41%), Positives = 201/334 (60%), Gaps = 19/334 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
FV ++L+I S V+ E+ W ++G+TY+ E MR I+ QN +Y+
Sbjct: 9 FVAVLLLIGLVSAAVND--------AEEWRLWKGKYGKTYRSIYEDNMRQKIWLQNRDYV 60
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
+ N + +++L NEF+DLT EEF ++Y GY + +R++ +T +P
Sbjct: 61 NEHNSM-DSSFQLEVNEFADLTAEEFSSIYNGYGK---GRNRENHENTTIYRYTGGAIPD 116
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGC 193
S+DWR KG VT +K+Q QCGSCWAFS ++EG GKL+ LSEQ LVDC +HGC
Sbjct: 117 SVDWRTKGLVTPVKNQKQCGSCWAFSTTGSLEGAHAKKTGKLVSLSEQNLVDCDKKDHGC 176
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
GGLM AF+YI ENKG+ TE YPY+ + G C+ +K+ + AT+ ++ + D +AL
Sbjct: 177 QGGLMTTAFKYIEENKGIDTEESYPYKAKNGRCEFKKDD-IGATVERHVSILTTDCEALK 235
Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVLNAD--CGNNCDHGVAVVGFGTAEEENGAKYWL 310
+AV+ P+SV +DAS +F YKSG+ + DHGV VVG+G +E+G +YWL
Sbjct: 236 KAVAEIGPISVAMDASHSSFQLYKSGIYDPKICSSRKLDHGVLVVGYG---KEDGEEYWL 292
Query: 311 IKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
+KNSWG+ WG GY +I LCGI T+A YPV
Sbjct: 293 VKNSWGKNWGMEGYFKIASKKNLCGICTSACYPV 326
>gi|238816977|gb|ACR56863.1| cathepsin L-like cysteine proteinase [Delia coarctata]
Length = 338
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 142/341 (41%), Positives = 210/341 (61%), Gaps = 18/341 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
V+ +L + Q +S + I E+ + + +H + Y E+E+ R+ IF +N I
Sbjct: 4 VLALLALVAFVQAISITDV----IKEEWQTFKMEHRKNYLSEVEERFRMKIFNENRHKIA 59
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
K N+ +G ++KLG N+++D+ + EF+ GYN + R + Y + +V
Sbjct: 60 KHNQLYAQGKVSFKLGLNKYADMLHHEFKETMNGYNHTMRKELRAQEGFNGITYISPANV 119
Query: 132 --PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
P ++DWR+ GAVT +KDQG CGSCW+FS+ ++EG G L+ LSEQ LVDCST
Sbjct: 120 QVPKAVDWRQHGAVTSVKDQGHCGSCWSFSSTGSLEGQHFRKAGVLVSLSEQNLVDCSTK 179
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N+GC+GGLMD AF YI +N G+ TE YPY + +C K V AT + + D+P+G
Sbjct: 180 YGNNGCNGGLMDNAFRYIKDNGGVDTEKSYPYEGIDDSCHFNK-ATVGATDTGFVDIPQG 238
Query: 248 DEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEEN 304
DE+A+++AV+ PV+V +DAS +F Y GV N +C +N DHGV VVG+GT +++
Sbjct: 239 DEEAMMKAVATMGPVAVAIDASNESFQLYSEGVYNDPNCSSDNLDHGVLVVGYGT--DKD 296
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G YWL+KNSWG TWG+ GYI++ R+ CGIATA+S+P
Sbjct: 297 GQDYWLVKNSWGTTWGDQGYIKMARNQDNQCGIATASSFPT 337
>gi|413944252|gb|AFW76901.1| hypothetical protein ZEAMMB73_101481 [Zea mays]
Length = 232
Score = 265 bits (677), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 126/230 (54%), Positives = 161/230 (70%), Gaps = 12/230 (5%)
Query: 123 FKYQNVT--DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
F+Y+NV+ +P +IDWR GAVT IKDQGQCG CWAFSAVAA EGI +I+ GKLI LSE
Sbjct: 6 FRYENVSVDAIPATIDWRTNGAVTPIKDQGQCGCCWAFSAVAATEGIVKISTGKLISLSE 65
Query: 181 QQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATI 238
Q+LVDC ++ GC GGLMD AF++II+N GL TE++YPY +G C + AA I
Sbjct: 66 QELVDCDVYGEDQGCEGGLMDDAFKFIIKNGGLTTESNYPYTAADGKCKSGSNS--AANI 123
Query: 239 SKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFG 298
YED+P DE AL++AV+NQPVSV VD F FY GV+ CG + DHG+A +G+G
Sbjct: 124 KGYEDVPTNDEAALMKAVANQPVSVAVDGGDMTFQFYSGGVMTGSCGTDLDHGIAAIGYG 183
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
+ +G KYWL+KNSWG TWGE+GY+R+ +D G+CG+A SYP
Sbjct: 184 --KTSDGTKYWLMKNSWGTTWGENGYLRMEKDISDKKGMCGLAIEPSYPT 231
>gi|30023547|gb|AAO48766.2| cathepsin L-like cysteine proteinase [Tenebrio molitor]
Length = 337
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 152/344 (44%), Positives = 206/344 (59%), Gaps = 23/344 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNL 70
F+I + + SQ VS + + EQW A H + Y+ E E+ R+ IF +N
Sbjct: 3 FLIFLAICVAGSQAVSFFDLVQ-------EQWGAFKMTHNKQYQSETEERFRMKIFMENS 55
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSV-SRQSSRPSTFKYQ 126
+ K NK +G ++KLG N+++D+ + EF + G+NR + S +S TF
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+P IDWR+KGAVT +KDQGQCGSCW+FSA ++EG GKL+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRQSGKLVSLSEQNLVDC 175
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N+GC+GGLMD AF YI N G+ TE YPY+ E+ C + K K AT Y D+
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC-HYKPKNKGATDRGYVDI 234
Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAE 301
G+E L AV+ PVSV +DAS ++F Y GV DC + DHGV VVG+GT
Sbjct: 235 ESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGT-- 292
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
E++G YWL+KNSWG++WG+ GYI++ R+ CGIAT ASYP+
Sbjct: 293 EDDGTDYWLVKNSWGKSWGDQGYIKMARNRNNNCGIATEASYPL 336
>gi|340727787|ref|XP_003402217.1| PREDICTED: cathepsin L-like [Bombus terrestris]
Length = 343
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 145/347 (41%), Positives = 209/347 (60%), Gaps = 22/347 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ +F+ +I+ + +Q +S + + ++ + +H + YK+++E+ R+ IF N
Sbjct: 1 MKLFLFLIVAVLATAQAISFFEL----VNQEWTTFKMEHNKVYKNDVEERFRMKIFMDNK 56
Query: 71 EYIEKANKEGNR-----TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY 125
I K N GN +YKL N++ D+ + EF G+N+ + + R P +
Sbjct: 57 HKIAKHN--GNYEMKKVSYKLKMNKYGDMLHHEFVNTLNGFNKSINTQLRSERLPIAASF 114
Query: 126 QNVTDV--PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
+V P ++DWRE GAVT +KDQG CGSCW+FSA A+EG G LI LSEQ L
Sbjct: 115 IEPANVVLPKTVDWREHGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGILIPLSEQNL 174
Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
+DCS N+GC+GGLMD+AF+YI +NKGL TE YPY E C + A + Y
Sbjct: 175 IDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEVTYPYEAENDKCRYNAANSGARDVG-Y 233
Query: 242 EDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG 298
D+P+G+E+ L AV+ PVSV +DAS ++F FY GV +C + N DHGV VG+G
Sbjct: 234 VDIPQGNEKKLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSENLDHGVLAVGYG 293
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
T +ENG YWL+KNSWGETWG++GYI++ R+ CGIA+ ASYP+
Sbjct: 294 T--DENGQDYWLVKNSWGETWGDNGYIKMARNKLNHCGIASTASYPL 338
>gi|125564712|gb|EAZ10092.1| hypothetical protein OsI_32402 [Oryza sativa Indica Group]
Length = 382
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 146/378 (38%), Positives = 209/378 (55%), Gaps = 45/378 (11%)
Query: 9 FIIPMFVII--ILVITCAS----QVVSGRSMH---EP---SIVEKHEQWMAQHGRTYKDE 56
F +P +I+ + I C+S +V S + + EP +++E ++W A++ R+Y
Sbjct: 7 FSMPCLLILLGVFFIGCSSGTARRVTSDTAANTDGEPAATTMMEMFQRWKAEYNRSYATP 66
Query: 57 LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR- 115
E+ RL ++ +N+ YIE N Y+LG ++DLTN+EF A+YT P+ S +
Sbjct: 67 EEERRRLRVYARNVRYIEATNAAAGLAYELGETAYTDLTNDEFMAMYTA--PPLRSAADD 124
Query: 116 ------------------QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
+ +P + + P S+DWR GAVT +KDQG+CGSCWA
Sbjct: 125 DDDAATTTIITTRAGPVDEHQQPEVY-FNESAGAPASVDWRASGAVTEVKDQGRCGSCWA 183
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADY 217
FS VA VEGI +I +GKL+ LSEQ+LVDC T + GC GG+ +A E+I N G+ T DY
Sbjct: 184 FSTVAVVEGIQKIKKGKLVSLSEQELVDCDTLDSGCDGGVSYRALEWITANGGITTRDDY 243
Query: 218 PYR-HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYK 276
PY CD K AATI+ + E +L A + QPV+V ++A G F Y+
Sbjct: 244 PYTGAAAAACDRAKLGHHAATIAGLRRVATRSEASLQNAAAAQPVAVSIEAGGDNFQHYR 303
Query: 277 SGVLNADCGNNCDHGVAVVGFGTAE-----EENGAKYWLIKNSWGETWGESGYIRILRDA 331
GV + CG +HGV VVG+G E G KYW+IKNSWG+ WG+ GYI++ +D
Sbjct: 304 KGVYDGPCGTRLNHGVTVVGYGQEEAPVDGSAAGDKYWIIKNSWGKNWGDQGYIKMKKDV 363
Query: 332 -----GLCGIATAASYPV 344
GLCGIA S+P+
Sbjct: 364 AGKPEGLCGIAIRPSFPL 381
>gi|375340657|emb|CBJ56264.1| cathepsin S protein [Dicentrarchus labrax]
Length = 337
Score = 265 bits (677), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 200/340 (58%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++LV C V +M EP + + W HG+ Y+ E+E R ++++NL
Sbjct: 9 MLGSLMLVSLC----VGAAAMFEPKLDAHWKLWKMTHGKKYQTEVEDVSRRELWEKNLML 64
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I N E G TY+L N DLT EE + + P + R +S F
Sbjct: 65 ITMHNLEASMGLHTYELSMNHMGDLTQEEIMQSFATLSPPT-DIQRAAS---PFAGTTGA 120
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
DVP ++DWREKG VT +K QG CGSCWAFSA A+EG T GKL++LS Q LVDCST
Sbjct: 121 DVPDTMDWREKGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTK 180
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
NHGC+GG M +AF+Y+I+N+G+ ++A YPY G C K AA S+Y LP+G
Sbjct: 181 YGNHGCNGGFMHQAFQYVIDNQGIDSDASYPYTGRNGEC-RYNSKFRAANCSQYSFLPEG 239
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENG 305
+E AL +A++N P+SV +DA+ F FY+SGV N +C +HGV VG+GT + G
Sbjct: 240 NEGALKEALANIGPISVAIDATRPTFTFYRSGVYNDPNCSQKVNHGVLAVGYGTLD---G 296
Query: 306 AKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
YWL+KNSWG+T+G+ GYIR+ R+ CGIA YP+
Sbjct: 297 QDYWLVKNSWGKTFGDQGYIRMSRNKNDQCGIALYGCYPI 336
>gi|350583407|ref|XP_003481511.1| PREDICTED: cathepsin S [Sus scrofa]
Length = 331
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 148/341 (43%), Positives = 210/341 (61%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S + +H +++H + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MKCLVWVLLLCSSAMAQ---LHRDPTLDRHWDLWKKTYGKQYKEKNEEVARRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
+ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 TVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMSCVR--VPS---QWPRNVTYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CGSCWAFSAV A+E ++ G+L+ LS Q LVDCST
Sbjct: 113 QKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQVKMKTGRLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M +AF+YII+N G+ +EA YPY+ +G C K AAT S+Y +LP
Sbjct: 173 EKYRNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAVDGKC-KYDSKNRAATCSRYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
DE AL +AV+N+ PVSV +DA +F FY+SGV + C N +HGV VVG+G
Sbjct: 232 FADEYALKEAVANKGPVSVAIDAKHSSFFFYRSGVYYDPSCTQNVNHGVLVVGYGNL--- 288
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYP 343
NG YWL+KNSWG +G+ GYIR+ R++ CGIA SYP
Sbjct: 289 NGKDYWLVKNSWGLNFGDGGYIRMARNSENHCGIANYPSYP 329
>gi|346469447|gb|AEO34568.1| hypothetical protein [Amblyomma maculatum]
Length = 333
Score = 265 bits (676), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 147/341 (43%), Positives = 209/341 (61%), Gaps = 21/341 (6%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ + CA + + + + + E + + H +TYK +E+ +R IF +N +I K
Sbjct: 1 MLRFALLCAIVAAATAATSQEILRTEWEAFKSTHKKTYKSNVEELLRFKIFTENSLFIAK 60
Query: 76 AN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF---KYQNVT 129
N +G +YKLG N+F+DL EF + GY R + R ST+ N +
Sbjct: 61 HNVKYAKGLVSYKLGINQFADLLPHEFVKMMNGYQG-----KRLAGRGSTYLPPANLNDS 115
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
+P ++DWR+KGAVT +KDQGQCGSCWAFS+ ++EG + GKL+ LSEQ LVDCS+
Sbjct: 116 SLPKTVDWRKKGAVTPVKDQGQCGSCWAFSSTGSLEGQHFLKTGKLVSLSEQNLVDCSSA 175
Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD +F YI N G+ TE YPY E+G C +KE V AT + + D+ +G
Sbjct: 176 YGNQGCNGGLMDNSFNYIKANGGIDTEDSYPYEAEDGDCRYKKED-VGATDTGFVDIKEG 234
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEEN 304
E+ L +AV+ PVSV +DAS ++F Y GV + +C + + DHGV VG+G +N
Sbjct: 235 SEKDLQKAVATVGPVSVAIDASQQSFQLYSEGVYDEPNCSSESLDHGVLAVGYGV---KN 291
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G KYWL+KNSW ETWG+ GYI + RD CGIA++ASYP+
Sbjct: 292 GKKYWLVKNSWAETWGQDGYILMSRDKNNQCGIASSASYPL 332
>gi|403302730|ref|XP_003942006.1| PREDICTED: cathepsin S isoform 1 [Saimiri boliviensis boliviensis]
Length = 339
Score = 265 bits (676), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 148/342 (43%), Positives = 208/342 (60%), Gaps = 21/342 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQN 69
I M ++ ++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++N
Sbjct: 8 ITMKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKN 64
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
L+++ N E G +Y LG N D+T+EE +L + P Q R T+K
Sbjct: 65 LKFVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP-----NQWQRNITYKSN 119
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDC
Sbjct: 120 PNQMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDC 179
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N GC+GG M +AF+YII+NKG+ +EA YPY+ + C K AAT SKY +L
Sbjct: 180 SEKYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKCQ-YDSKYRAATCSKYTEL 238
Query: 245 PKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEE 302
P G E L +AV+N+ PV V VDAS +F Y+SGV + C +HGV V+G+G +
Sbjct: 239 PYGREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYG---D 295
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 296 LNGKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 337
>gi|27728675|gb|AAO18731.1| cysteine protease [Gossypium hirsutum]
Length = 389
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 135/322 (41%), Positives = 193/322 (59%), Gaps = 15/322 (4%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI--EKANKEGNR-TYKLGTN 89
+ E ++E +QW +H + Y+ E R FK NL+YI A ++ N+ + +G N
Sbjct: 40 LSEERVLEIFQQWKEKHRKVYRHAEEAEKRFENFKGNLKYILERNAKRKANKWEHHVGLN 99
Query: 90 EFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
+F+D++NEEFR Y + + SR K Q+ D P+S+DWR G VT +KDQ
Sbjct: 100 KFADMSNEEFRKAYLSKVKKPINKGITLSRNMRRKVQSC-DAPSSLDWRNYGVVTAVKDQ 158
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENK 209
G CGSCWAFS+ A+EGI + G LI LSEQ+LV+C T N+GC GG MD AFE++I N
Sbjct: 159 GSCGSCWAFSSTGAMEGINALVTGDLISLSEQELVECDTSNYGCEGGYMDYAFEWVINNG 218
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
G+ +E+DYPY +GTC+ KE+ +I Y+D+ + D ALL AV+ QPVSV +D S
Sbjct: 219 GIDSESDYPYTGVDGTCNTTKEETKVVSIDGYQDVEQSD-SALLCAVAQQPVSVGIDGSA 277
Query: 270 RAFHFYKSGVLNADCG---NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
F Y G+ + C ++ DH V +VG+G+ + E +YW++KNSWG +WG GY
Sbjct: 278 IDFQLYTGGIYDGSCSDDPDDIDHAVLIVGYGSEDSE---EYWIVKNSWGTSWGIDGYFY 334
Query: 327 ILRDA----GLCGIATAASYPV 344
+ RD G+C + ASYP
Sbjct: 335 LKRDTDLPYGVCAVNAMASYPT 356
>gi|357167707|ref|XP_003581294.1| PREDICTED: actinidain-like [Brachypodium distachyon]
Length = 358
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 138/321 (42%), Positives = 189/321 (58%), Gaps = 16/321 (4%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTN 96
++ +HE+WMA+ GR+Y D EKA R +F N +++ N+ GNRTY LG N+FSDLT+
Sbjct: 37 TMASRHERWMARFGRSYTDAGEKARRQEVFGANARHVDAVNRAGNRTYTLGLNQFSDLTD 96
Query: 97 EEFRALYTGYNRPVPS----VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
EF + GY R + + P D+P S+DWR KGAVT IK+Q C
Sbjct: 97 HEFLQQHLGYGRHHGQRGLLLPEEEVMPKATALGYGQDMPYSVDWRAKGAVTEIKNQRSC 156
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
GSCWAF+AVAA EG+ +I G LI +SEQQ++DC+ D C G + A Y++ + GL
Sbjct: 157 GSCWAFAAVAATEGLVKIATGNLISMSEQQVLDCTGDRSSCDSGYISDALRYVVTSGGLQ 216
Query: 213 TEADYPYRHEEGTCDNQ---KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASG 269
EA Y Y ++G C ++ + + A+ + GDE AL + QPV+V V+AS
Sbjct: 217 REAAYAYTGQKGACGSRRFARPNSAASVGGVHMATLNGDEGALQGLAARQPVAVIVEASE 276
Query: 270 RAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGA-KYWLIKNSWGETWGESGYIR 326
F Y SGV +A CG +H + VVG+GT ENGA +YWL+KN WG WGE+GY+R
Sbjct: 277 PDFRHYSSGVYAGSASCGRELNHALTVVGYGT---ENGAGEYWLVKNQWGTWWGENGYMR 333
Query: 327 ILRDAGL---CGIATAASYPV 344
+ R G CGIA+ A YP
Sbjct: 334 VARRNGAGANCGIASVAFYPT 354
>gi|32396018|gb|AAP41846.1| cysteine protease [Anthurium andraeanum]
Length = 502
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 143/319 (44%), Positives = 187/319 (58%), Gaps = 18/319 (5%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYK----LGTNEFSDLT 95
E E+WM +H + Y EKA R F NL ++ K N EG R +G N F+DL+
Sbjct: 49 ELFERWMEKHRKVYAHPGEKARRYANFLSNLAFVRKRNAEGRRAPSSGQGVGMNVFADLS 108
Query: 96 NEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQCG 153
NEEFR +Y+ + + +R + + V D P S+DWR++GAVT +K+QG CG
Sbjct: 109 NEEFREVYSSRVLRKKAAEGRGARRRAGEGRVVAGCDAPASLDWRKRGAVTAVKNQGDCG 168
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLAT 213
SCWAFS+ A+EGI IT G+LI LSEQ+LVDC T N GC GG MD AFE++I N G+ +
Sbjct: 169 SCWAFSSTGAMEGINAITTGELISLSEQELVDCDTTNEGCDGGYMDYAFEWVINNGGIDS 228
Query: 214 EADYPYRHE-EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
EA+YPY + + C+ KE+ +I YED+ E ALL A QPVSV +D S F
Sbjct: 229 EANYPYTGQADSVCNTTKEEIKVVSIDGYEDVAT-SESALLCAAVQQPVSVGIDGSSLDF 287
Query: 273 HFYKSGVLNADCGNN---CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
Y G+ + DC N DH V VVG+G ++ G YW++KNSWG WG GYI I R
Sbjct: 288 QLYAGGIYDGDCSGNPDDIDHAVLVVGYG---QQGGTDYWIVKNSWGTDWGMQGYIYIRR 344
Query: 330 DAGL----CGIATAASYPV 344
+ GL C I ASYP
Sbjct: 345 NTGLPYGVCAIDAMASYPT 363
>gi|330803818|ref|XP_003289899.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
gi|325080010|gb|EGC33584.1| hypothetical protein DICPUDRAFT_154350 [Dictyostelium purpureum]
Length = 326
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 146/344 (42%), Positives = 206/344 (59%), Gaps = 28/344 (8%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++ + L+I C S + R + + WM +H ++Y ++ E R ++F+ N
Sbjct: 3 LVLALIFCFLIINCCS---AARIFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYSVFQDN 58
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-- 127
++ + K N++G+ T LG N +DLTNEEF+ LY G V T+K +
Sbjct: 59 MDIVAKWNQKGSNTI-LGLNVMADLTNEEFKKLYLGTKANV-----------TYKKKTLV 106
Query: 128 -VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
V+ +P S+DWR GAVT +K+QGQCG C+AFS +VEGI +IT +L+ LSEQQ++DC
Sbjct: 107 GVSGLPASVDWRANGAVTAVKNQGQCGGCYAFSTTGSVEGIHEITSQQLVPLSEQQILDC 166
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N+GC GGLM +FEYII GL TEA YPY E G C K K + ATI+ Y+++
Sbjct: 167 SGSEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYTGEVGKCKFNK-KNIGATITGYKNV 225
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEE 302
G E L AV+ QPVSV +DAS +F Y SGV +C + DHGV VG+G+
Sbjct: 226 ESGSESDLQTAVAAQPVSVAIDASQSSFQLYASGVYYEPECSSTQLDHGVLAVGYGS--- 282
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPVA 345
++G YW++KNSWG WGE+G+I + R+ CGIAT AS+P A
Sbjct: 283 QSGQDYWIVKNSWGADWGENGFILMARNKDNNCGIATMASFPTA 326
>gi|119433808|gb|ABL74967.1| cysteine protease [Acanthamoeba castellanii]
Length = 330
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 142/317 (44%), Positives = 191/317 (60%), Gaps = 14/317 (4%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
H+P + WM H ++Y +E E R N++++N +I++ N++ N +Y L N+F D
Sbjct: 23 HDP-LTGVFADWMRTHTKSYSNE-EFVFRWNVWRENYNFIQEENRK-NNSYYLTMNKFGD 79
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
LTN EF +Y G + + +P + DWR+KGAVTH+K+QGQCG
Sbjct: 80 LTNAEFNKVYKG--LAFDYSAHILKAKAATPAAPAPGLPANFDWRQKGAVTHVKNQGQCG 137
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
SCW+FS + EG + RG L+ LSEQ L+DCS N+GC+GGLMD AFEYII NKG+
Sbjct: 138 SCWSFSTTGSTEGANFLKRGTLVSLSEQNLIDCSGSYGNNGCNGGLMDYAFEYIINNKGI 197
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRA 271
TEA YPY + C + +++ Y D+ GDE ALL AV+ +P SV +DAS +
Sbjct: 198 DTEASYPYETAQYNCRYNPANS-GGSLTSYTDVSSGDENALLNAVAIEPTSVAIDASHNS 256
Query: 272 FHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
F FY GV + C + DHGV VG+GT ENG YWL+KNSWG WG GYI++ R
Sbjct: 257 FQFYSGGVYYESSCSSTQLDHGVLAVGWGT---ENGQDYWLVKNSWGADWGLQGYIKMAR 313
Query: 330 DA-GLCGIATAASYPVA 345
+ CGIATAASYP A
Sbjct: 314 NRHNNCGIATAASYPTA 330
>gi|5081735|gb|AAD39513.1|AF147207_1 cathepsin L-like protease precursor [Artemia franciscana]
Length = 338
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 148/343 (43%), Positives = 205/343 (59%), Gaps = 15/343 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ +I L+ Q+ + S+ E H + A H + Y +LE+ R+ I+ +N
Sbjct: 1 MKQITLIFLLGAVLVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENK 59
Query: 71 EYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ K N ++G ++Y++ N+F DL + EFR++ GY + SR S + + N
Sbjct: 60 HKVAKHNILYEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPAN 119
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
V +VP S+DWR KGA+T +KDQGQCGSCWAFS+ A+EG T GKLI LSEQ L+DCS
Sbjct: 120 V-EVPESVDWRVKGAITPVKDQGQCGSCWAFSSTGALEGQTFRKTGKLISLSEQNLIDCS 178
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N GC+GGLMD+AF+YI +NKG+ TE YPY E+ C + A + +P
Sbjct: 179 GKYGNEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDNVC-RYNPRNRGAIDRGFVHIP 237
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAEE 302
G+E L AV+ PVSV +DAS +F FY GV C ++ DHGV VVG+G+
Sbjct: 238 SGEEDKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS--- 294
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+NG YWL+KNSW E WG+ GYI+I R+ CGIATAASYP+
Sbjct: 295 DNGKDYWLVKNSWSEHWGDEGYIKIARNRKNHCGIATAASYPL 337
>gi|151573016|gb|ABS17683.1| cathepsin L-1 [Artemia persimilis]
Length = 334
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 146/339 (43%), Positives = 204/339 (60%), Gaps = 15/339 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+I L+ Q+ + S+ E H + A H + Y +LE+ R+ I+ +N +
Sbjct: 1 TLIFLLGAVFVQLSAALSLTNLLADEWH-LFKATHKKEYPSQLEEKFRMKIYLENKHKVA 59
Query: 75 KAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
K N ++G ++Y++ N+F DL + EFR++ GY + SR S + + NV +V
Sbjct: 60 KHNILFEKGEKSYQVAMNKFGDLLHHEFRSIMNGYQHKKQNSSRAESTFTFMEPANV-EV 118
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
P S+DWREKGA+T +KDQGQCG CWAFS+ A+EG T GKL+ L EQ L+DCS
Sbjct: 119 PESVDWREKGAITPVKDQGQCGPCWAFSSTGALEGQTFRKTGKLVSLREQNLIDCSGKYG 178
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N GC+GGLMD+AF+YI +NKG+ TE YPY E+ C + A + D+P G+E
Sbjct: 179 NEGCNGGLMDQAFQYIKDNKGIDTENTYPYEAEDDVC-RYNPRNRGAVDRGFVDIPSGEE 237
Query: 250 QALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAEEENGA 306
L AV+ PVSV +DAS +F FY GV C ++ DHGV VVG+G+ +NG
Sbjct: 238 DKLKAAVATVGPVSVAIDASHESFQFYSKGVYYEPSCDSDDLDHGVLVVGYGS---DNGK 294
Query: 307 KYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
YWL+KNSW E WG+ GYI+I R+ CG+ATAASYP+
Sbjct: 295 DYWLVKNSWSEHWGDQGYIKIARNRKNHCGVATAASYPL 333
>gi|33112583|gb|AAP94047.1| cathepsin-L-like cysteine peptidase 03 [Tenebrio molitor]
Length = 337
Score = 264 bits (675), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 151/344 (43%), Positives = 206/344 (59%), Gaps = 23/344 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNL 70
F+I + + SQ VS + + EQW A H + Y+ + E+ R+ IF +N
Sbjct: 3 FLIFLAICVAGSQAVSFFDLVQ-------EQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSV-SRQSSRPSTFKYQ 126
+ K NK +G ++KLG N+++D+ + EF + G+NR + S +S TF
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+P IDWR+KGAVT +KDQGQCGSCW+FSA ++EG GKL+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N+GC+GGLMD AF YI N G+ TE YPY+ E+ C + K K AT Y D+
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC-HYKPKNKGATDRGYVDI 234
Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFGTAE 301
G+E L AV+ PVSV +DAS ++F Y GV DC + DHGV VVG+GT
Sbjct: 235 ESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPDCSASQLDHGVLVVGYGT-- 292
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
E++G YWL+KNSWG++WG+ GYI++ R+ CGIAT ASYP+
Sbjct: 293 EDDGTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336
>gi|167526493|ref|XP_001747580.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163774026|gb|EDQ87660.1| predicted protein [Monosiga brevicollis MX1]
Length = 330
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 148/328 (45%), Positives = 194/328 (59%), Gaps = 30/328 (9%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDEL---EKAMRLNIFKQNLEYIEKANKEGN 81
SQ + R++H +++ + HG Y +L E A R ++ NL IE A+ GN
Sbjct: 11 SQFLPRRNLH--LVLKGPTAFRRIHGVFYSSQLGLCEPAFRCHL--ANLRVIE-AHNAGN 65
Query: 82 RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS-IDWREK 140
++ +G +F+DLT EF A V +RP + +T+ P +DWR+K
Sbjct: 66 SSFTMGITQFADLTAAEFSAY-------VKRFPMNVTRPRNEVW--ITEAPLQEVDWRQK 116
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLM 198
AVT IK+QGQCGSCW+FS +VEG I GKL+ LSEQQL+DCST NHGC+GGLM
Sbjct: 117 NAVTEIKNQGQCGSCWSFSTTGSVEGAHAIATGKLVSLSEQQLMDCSTRYGNHGCNGGLM 176
Query: 199 DKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
D AFEY+I N GL TE DYPY E+G C+ +KEK AA I + ++PK E L AVS
Sbjct: 177 DYAFEYVIANGGLDTEEDYPYTAEDGKCNTEKEKKHAAEIHGFRNVPKEHEDQLAAAVSI 236
Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
PVSV ++A F Y SGV + CG + DHGV VVG+ YW++KNSWG++
Sbjct: 237 GPVSVAIEADQAGFQHYTSGVFDGKCGTSLDHGVLVVGYSD-------DYWIVKNSWGKS 289
Query: 319 WGESGYIRILRDA---GLCGIATAASYP 343
WGE GYIR+ R G+CGI ASYP
Sbjct: 290 WGEEGYIRLKRGVDKKGMCGITMQASYP 317
>gi|224116880|ref|XP_002317417.1| predicted protein [Populus trichocarpa]
gi|118488173|gb|ABK95906.1| unknown [Populus trichocarpa]
gi|222860482|gb|EEE98029.1| predicted protein [Populus trichocarpa]
Length = 498
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 138/325 (42%), Positives = 192/325 (59%), Gaps = 20/325 (6%)
Query: 33 MHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN--KEGNRTYKLGTNE 90
+ E I E + W +H + YK E R+ FK+NL+YI + N ++ +K+G N+
Sbjct: 41 LTEEGITEVFKLWKEKHQKVYKHAEEAERRIGNFKRNLKYIIEKNGKRKSGLEHKVGLNK 100
Query: 91 FSDLTNEEFRALY-TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQ 149
F+DL+NEEFR +Y + +P+ ++ R + D P+S+DWR KG VT +KDQ
Sbjct: 101 FADLSNEEFREMYLSKVKKPITIEEKRKHR-----HLQTCDAPSSLDWRNKGVVTAVKDQ 155
Query: 150 GQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC-STDNHGCSGGLMDKAFEYIIEN 208
G CGSCW+FS A+E I I G LI LSEQ+LVDC +T+N+GC GG MD AF+++I N
Sbjct: 156 GDCGSCWSFSTTGAIEAINAIVTGDLISLSEQELVDCDTTNNYGCEGGDMDSAFQWVIGN 215
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
G+ TEADYPY +GTC+ KE+ +I Y D+ D ALL A QP+SV +D S
Sbjct: 216 GGIDTEADYPYTGVDGTCNTAKEEKKVVSIEGYVDVDPSD-SALLCATVQQPISVGMDGS 274
Query: 269 GRAFHFYKSGVLNADCG---NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
F Y G+ + DC N+ DH + +VG+G+ EN YW++KNSWG WG GY
Sbjct: 275 ALDFQLYTGGIYDGDCSGDPNDIDHAILIVGYGS---ENDEDYWIVKNSWGTEWGMEGYF 331
Query: 326 RILRDA----GLCGIATAASYPVAI 346
I R+ G+C I ASYP +
Sbjct: 332 YIRRNTSKPYGVCAINADASYPTKV 356
>gi|219687002|dbj|BAH08632.1| daikon cysteine protease RD21 [Raphanus sativus]
Length = 289
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 123/219 (56%), Positives = 158/219 (72%), Gaps = 8/219 (3%)
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P S+DWR++GAV +KDQG CGSCWAFS + AVEGI +I G LI LSEQ+LVDC T
Sbjct: 3 IPESVDWRKEGAVAAVKDQGSCGSCWAFSTIGAVEGINKIVTGDLISLSEQELVDCDTSY 62
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N GC+GGLMD AFE+II+N G+ TE DYPY+ +G CD ++ A TI YED+P+ +E
Sbjct: 63 NQGCNGGLMDYAFEFIIKNGGIDTEEDYPYKAADGRCDQNRKNAKVVTIDAYEDVPENNE 122
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
AL +A++NQP+SV ++A GRAF Y SGV + CG DHGV VG+GT ENG YW
Sbjct: 123 AALKKALANQPISVAIEAGGRAFQLYSSGVFDGTCGTELDHGVVAVGYGT---ENGKDYW 179
Query: 310 LIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
+++NSWG +WGESGYI++ R+ G CGIA ASYP+
Sbjct: 180 IVRNSWGGSWGESGYIKMARNIAEATGKCGIAMEASYPI 218
>gi|157311713|ref|NP_001098585.1| uncharacterized protein LOC564979 precursor [Danio rerio]
gi|156230121|gb|AAI52284.1| Wu:fa26c03 protein [Danio rerio]
Length = 336
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 207/341 (60%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ +LV C S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLCISAVFAASSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR GY ++S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD----PNRTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGFVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I+ + D+P+G+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPRGN 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEEN 304
E AL+ AV+ PVSV +DAS ++ FY+SG+ A + DH V VVG+G +
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G +YW++KNSW + WG+ GYI + +D CG+AT+ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATSASYPL 335
>gi|334324655|ref|XP_001370975.2| PREDICTED: cathepsin S-like isoform 1 [Monodelphis domestica]
Length = 331
Score = 264 bits (674), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 141/321 (43%), Positives = 198/321 (61%), Gaps = 19/321 (5%)
Query: 33 MHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGT 88
+H +++ H + W HG+ YK + E+ R I+++NL+Y+ N E G +Y L
Sbjct: 18 LHRDPMLDGHWDLWKKTHGKQYKGQNEEIARRLIWEKNLKYVTLHNLEHSMGLHSYDLSM 77
Query: 89 NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
N D+T+EE +L + P Q +R +T++ + +P S+DWREKG VT +K
Sbjct: 78 NHLGDMTSEEVISLMSSLRIP-----NQWNRNTTYRLSSNQKLPDSVDWREKGCVTEVKY 132
Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST---DNHGCSGGLMDKAFEYI 205
QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCST DNHGC+GG M AF+Y+
Sbjct: 133 QGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTDKYDNHGCNGGFMTSAFQYV 192
Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVC 264
I+N G+ ++ YPY+ +G C + AAT SKY +LP G E+AL +AV+N+ PVSV
Sbjct: 193 IDNNGIDSDVSYPYKATDGKC-QYNPASRAATCSKYTELPYGSEEALKEAVANKGPVSVG 251
Query: 265 VDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
+DA +F YKSGV + C +HGV V+G+G + G YWL+KNSWG +G+ G
Sbjct: 252 IDAKTPSFFLYKSGVYYDPSCTQKVNHGVLVIGYGNLD---GQDYWLVKNSWGLHFGDKG 308
Query: 324 YIRILRDAG-LCGIATAASYP 343
Y+RI R+ G CGIA SYP
Sbjct: 309 YVRIARNRGNHCGIANFPSYP 329
>gi|356545112|ref|XP_003540989.1| PREDICTED: LOW QUALITY PROTEIN: KDEL-tailed cysteine endopeptidase
CEP1-like [Glycine max]
Length = 400
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 201/311 (64%), Gaps = 8/311 (2%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
++ +TC Q S +S E E+HE+WMAQ+G+ Y+D E R IFK N+++IE N
Sbjct: 92 LVGVTCGRQCRS-KSRLEACTSERHEKWMAQYGKVYEDAAEMEKRFQIFKNNVQFIESFN 150
Query: 78 KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN-VTDVPTSID 136
G++ + + N+F DL +EEF+AL R V V ++ ++F+Y + VT++P ++D
Sbjct: 151 VAGDKPFNIRINQFPDLHDEEFKALLINGQRKVSGV-ETATEETSFRYGSVVTNIPATMD 209
Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD-CSTDNHGCSG 195
R+KG VT IKDQG GSCWA SAVAA+EGI QIT KL+ LS+Q+LVD ++ GC G
Sbjct: 210 GRKKGVVTPIKDQGIIGSCWALSAVAAIEGIHQITTSKLMFLSKQKLVDSVKGESEGCIG 269
Query: 196 GLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQA 255
G ++ AFE+I++ G+ +E YPY+ C +KE A I YE +P +++ALL+
Sbjct: 270 GYVEDAFEFIVKKGGILSETHYPYKGVN-XCKVEKETHSVAHIKGYEKVPSNNKKALLKV 328
Query: 256 VSNQPVSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNS 314
V+NQPVSV +D AF +Y S + NA +CG++ +H VAVVG+G A + GAKYW +KNS
Sbjct: 329 VANQPVSVYIDVGAHAFKYYSSEIFNARNCGSDPNHVVAVVGYGKALD--GAKYWPVKNS 386
Query: 315 WGETWGESGYI 325
WG WG Y+
Sbjct: 387 WGTEWGGKWYM 397
>gi|281352890|gb|EFB28474.1| hypothetical protein PANDA_008012 [Ailuropoda melanoleuca]
Length = 328
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 144/318 (45%), Positives = 199/318 (62%), Gaps = 18/318 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ W +G+ YK++ E+ R I+++NL+++ N E G +Y LG N
Sbjct: 18 DPALDHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHL 77
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+T+EE +L + VPS Q R T+K + +P S+DWREKG VT +K QG
Sbjct: 78 GDMTSEEVISLMSSLR--VPS---QWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGA 132
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGLMDKAFEYIIEN 208
CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST+ N GC+GG M +AF+YII+N
Sbjct: 133 CGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDN 192
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDA 267
G+ +EA YPY+ +G C K AAT SKY +LP G E L +AV+N+ PVSV +DA
Sbjct: 193 NGIDSEASYPYKATDGKC-RYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDA 251
Query: 268 SGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F Y+SGV + C N +HGV VVG+G NG YWL+KNSWG +G+ GYIR
Sbjct: 252 RHSSFFLYRSGVYYDPSCTQNVNHGVLVVGYGNL---NGKDYWLVKNSWGLNFGDQGYIR 308
Query: 327 ILRDAG-LCGIATAASYP 343
+ R++G CGIA+ SYP
Sbjct: 309 MARNSGNHCGIASYPSYP 326
>gi|301767946|ref|XP_002919405.1| PREDICTED: cathepsin S-like [Ailuropoda melanoleuca]
Length = 340
Score = 264 bits (674), Expect = 6e-68, Method: Compositional matrix adjust.
Identities = 144/318 (45%), Positives = 199/318 (62%), Gaps = 18/318 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ W +G+ YK++ E+ R I+++NL+++ N E G +Y LG N
Sbjct: 30 DPALDHHWNLWKKTYGKQYKEKNEEVARRLIWEKNLKFVTLHNLEHSMGMHSYDLGMNHL 89
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+T+EE +L + VPS Q R T+K + +P S+DWREKG VT +K QG
Sbjct: 90 GDMTSEEVISLMSSLR--VPS---QWPRNVTYKSNSNQKLPDSVDWREKGCVTKVKYQGA 144
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGLMDKAFEYIIEN 208
CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST+ N GC+GG M +AF+YII+N
Sbjct: 145 CGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTEAFQYIIDN 204
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDA 267
G+ +EA YPY+ +G C K AAT SKY +LP G E L +AV+N+ PVSV +DA
Sbjct: 205 NGIDSEASYPYKATDGKC-RYDSKNRAATCSKYTELPSGSEDDLKEAVANKGPVSVAIDA 263
Query: 268 SGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F Y+SGV + C N +HGV VVG+G NG YWL+KNSWG +G+ GYIR
Sbjct: 264 RHSSFFLYRSGVYYDPSCTQNVNHGVLVVGYGNL---NGKDYWLVKNSWGLNFGDQGYIR 320
Query: 327 ILRDAG-LCGIATAASYP 343
+ R++G CGIA+ SYP
Sbjct: 321 MARNSGNHCGIASYPSYP 338
>gi|327263389|ref|XP_003216502.1| PREDICTED: cathepsin L1-like [Anolis carolinensis]
Length = 339
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 207/340 (60%), Gaps = 19/340 (5%)
Query: 20 VITCASQVVSGRSMHEPSI---VEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
V CA + PS+ ++ H Q W H + Y + E+ R I+++NL+ I+
Sbjct: 3 VYLCALALFLEACFAAPSLDSALDDHWQAWKTWHSKKYHQQ-EEGWRRMIWEKNLKMIQL 61
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
N + G +Y+LG N F D+TNEEFR + GY S + + R S F N VP
Sbjct: 62 HNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGYKH---SKTEKKYRGSEFLEPNFLVVP 118
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--N 190
S+DWREKG VT +KDQGQCGSCWAFS ++EG GKL+ LSEQ LVDCS N
Sbjct: 119 KSVDWREKGYVTPVKDQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSRPEGN 178
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC+GGLMD+AFEYI +N G+ +E YPY ++ K + AA + + D+P+G E+
Sbjct: 179 QGCNGGLMDQAFEYIADNGGIDSEESYPYIAKDDEDCLYKSEFNAANDTGFVDVPEGHER 238
Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG--TAEEENG 305
AL++AV+ PVSV +DAS F FY+SG+ + DC + DHGV VVG+G +++N
Sbjct: 239 ALMKAVAAVGPVSVAIDASHSTFQFYESGIYYDPDCSSEELDHGVLVVGYGFEGTDDDNK 298
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
KYW++KNSW + WG+ GYI + +D CGIATAASYP+
Sbjct: 299 KKYWIVKNSWSDKWGDKGYILMAKDRNNHCGIATAASYPL 338
>gi|225709022|gb|ACO10357.1| Cathepsin L precursor [Caligus rogercresseyi]
Length = 332
Score = 263 bits (673), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 146/316 (46%), Positives = 196/316 (62%), Gaps = 16/316 (5%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDL 94
++ E W HG++Y+ +E+ +RL I +N I + N E G +Y + N + DL
Sbjct: 23 VLSDWESWKLTHGKSYESSIEEKLRLKIHMENSLKISRHNAEAINGKHSYYMKMNHYGDL 82
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
+ EF A+ GY V++ S S +NV +PT +DWRE GAVT +K+QGQCGS
Sbjct: 83 LHHEFVAMVNGYEY----VNKTSLGGSFIPSKNVK-LPTHVDWREDGAVTPVKNQGQCGS 137
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
CWAFS+ ++EG T GKLI LSEQ LVDCS N+GC GGLMD AF YI +NKG+
Sbjct: 138 CWAFSSTGSLEGQTFRKTGKLIPLSEQNLVDCSRKYGNNGCEGGLMDFAFTYIRDNKGID 197
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRA 271
TE YPY G C K ++ I + D+ KG E+ LL+AV++ PVSV +DAS +
Sbjct: 198 TEGSYPYEGVGGRCHYDPSKKGSSDIG-FVDVKKGSEEELLKAVASVGPVSVAIDASHMS 256
Query: 272 FHFYKSGV-LNADCG-NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
F FY GV + C N DHGV VVG+GT +E +G YWL+KNSW E WG+ GYI++ R
Sbjct: 257 FQFYSHGVYFESKCSPENLDHGVLVVGYGT-DENSGEDYWLVKNSWSENWGDQGYIKMAR 315
Query: 330 D-AGLCGIATAASYPV 344
+ +CGIA++ASYPV
Sbjct: 316 NKKNMCGIASSASYPV 331
>gi|148224022|ref|NP_001087489.1| cathepsin L2 precursor [Xenopus laevis]
gi|51258284|gb|AAH80004.1| MGC81823 protein [Xenopus laevis]
Length = 335
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 144/342 (42%), Positives = 213/342 (62%), Gaps = 18/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + ++ C + V + + +P++ + W H ++Y + E+ R ++++NL
Sbjct: 1 MALYLVAAALCLTTVFAAPTT-DPALDDHWHLWKNWHKKSYLPK-EEGWRRVLWEKNLRT 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N + G +Y+LG N+F D+TNEEFR L GY +++ + STF N
Sbjct: 59 IEFHNLDHSLGKHSYRLGMNQFGDMTNEEFRQLMNGYK------NQKMIKGSTFLAPNNF 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
+ P ++DWREKG VT +KDQGQCGSCWAFS A+EG GKLI LSEQ LVDCS
Sbjct: 113 EAPKTVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKAGKLISLSEQNLVDCSRA 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+Y+ +N G+ +E YPY ++ + +A + + D+P G
Sbjct: 173 QGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVPSG 232
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+ L++AV++ PVSV VDA ++F FY+SG+ + +C + + DHGV VVG+G E+
Sbjct: 233 SEKDLMKAVASVGPVSVAVDAGHKSFQFYQSGIYYDPECSSEDLDHGVLVVGYGFEGEDV 292
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+G +YW++KNSW E WG +GYI+I +D CGIATAASYP+
Sbjct: 293 DGKRYWIVKNSWSEKWGNNGYIKIAKDRHNHCGIATAASYPL 334
>gi|156399477|ref|XP_001638528.1| predicted protein [Nematostella vectensis]
gi|156225649|gb|EDO46465.1| predicted protein [Nematostella vectensis]
Length = 325
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 151/344 (43%), Positives = 205/344 (59%), Gaps = 27/344 (7%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
K+F+ + V + L+ C S++ R H W HG+TY E E+ +R I+
Sbjct: 2 KAFLACLLVAV-LIAQCFSELSQDRQWHA---------WKDFHGKTYTGE-EEDLRRAIW 50
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
NLE ++K N E N +YKL N F+DLT EF+ + GY + S+ STF
Sbjct: 51 NDNLEIVKKHNAE-NHSYKLDMNHFADLTVTEFKQRFMGYR-----AASNSTGGSTFLPL 104
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+ +P +DWR+KG VT +K+QGQCGSCWAFS+ ++EG GKL+ LSEQ LVDC
Sbjct: 105 SNVQLPAEVDWRDKGFVTAVKNQGQCGSCWAFSSTGSLEGQHFRKTGKLVSLSEQNLVDC 164
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N+GC GGLMD AF+YI N G+ TE YPY +G C + K +V AT++ Y D+
Sbjct: 165 SKKYGNNGCEGGLMDYAFKYIKNNDGIDTEQSYPYTARDGQC-HFKPGSVGATVTGYTDV 223
Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAE 301
+G E L AV+ P+SV +DA +F YK+GV + DC + DHGV VG+G
Sbjct: 224 QRGSEGDLQSAVATVGPISVAIDAGHSSFQLYKTGVYSEPDCSSTQLDHGVLAVGYGA-- 281
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
E+G YWL+KNSWGE WG +GYI++ R+ CGIAT ASYP+
Sbjct: 282 -EDGKDYWLVKNSWGEGWGMNGYIKMSRNKDNQCGIATQASYPL 324
>gi|440906716|gb|ELR56945.1| Cathepsin S, partial [Bos grunniens mutus]
Length = 342
Score = 263 bits (673), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 210/342 (61%), Gaps = 20/342 (5%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
I M ++ ++ C+S + + +P++ + W +G+ YK++ E+ R I+++NL
Sbjct: 10 ITMNWLVWALLLCSSAMA--QVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNL 67
Query: 71 EYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ + N E G +Y+LG N D+T+EE +L + P Q R T+K
Sbjct: 68 KTVTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVP-----SQWPRNVTYKSDP 122
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P S+DWREKG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCS
Sbjct: 123 NQKLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCS 182
Query: 188 T---DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
T N GC+GG M +AF+YII+N G+ +EA YPY+ +G C K AAT S+Y +L
Sbjct: 183 TAKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKC-QYDVKNRAATCSRYIEL 241
Query: 245 PKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEE 302
P G E+AL +AV+N+ PVSV +DAS +F YK+GV + C N +HGV VVG+G +
Sbjct: 242 PFGSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLD- 300
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
G YWL+KNSWG +G+ GYIR+ R++G CGIA+ SYP
Sbjct: 301 --GKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIASYPSYP 340
>gi|391336140|ref|XP_003742440.1| PREDICTED: cathepsin L-like [Metaseiulus occidentalis]
Length = 330
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 146/317 (46%), Positives = 194/317 (61%), Gaps = 17/317 (5%)
Query: 36 PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFSD 93
PS + + H ++Y+D E+ +R IF+ NL IE+ N+ + LG NEF+D
Sbjct: 22 PSAEPHWNAFKSTHLKSYRDGQEELIRRFIFEDNLHTIEEFNRVNASLAGFTLGVNEFAD 81
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG 153
+TN EF + G + + S F+ +V D+P +DW +KG VT +K+QGQCG
Sbjct: 82 MTNTEFSNMLLGLGG-----RNKIAGDSVFESSHVQDLPAEVDWTQKGYVTEVKNQGQCG 136
Query: 154 SCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGL 211
SCWAFS ++EG GKL+ LSEQ LVDCST N GC+GGLMD+AF YI +N G+
Sbjct: 137 SCWAFSTTGSLEGQVFKKTGKLVSLSEQNLVDCSTSEGNQGCNGGLMDQAFTYIKKNGGI 196
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGR 270
TEA YPY +GTC + K V AT+S + D+ GDE AL +AV+ P+SV +DAS
Sbjct: 197 DTEAAYPYTGSDGTCRFLENK-VGATVSGFVDVKSGDENALKEAVATVGPISVAIDASSI 255
Query: 271 AFHFYKSGVLNA-DCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
F FY+ GV N C + DHGV VVG+GT E G YWL+KNSWG +WG GYI+++
Sbjct: 256 FFQFYRGGVYNPWFCSSTELDHGVLVVGYGT---EGGKDYWLVKNSWGSSWGLKGYIKMV 312
Query: 329 RD-AGLCGIATAASYPV 344
R+ CGIAT ASYP
Sbjct: 313 RNKKNRCGIATQASYPT 329
>gi|355681664|gb|AER96818.1| cathepsin S [Mustela putorius furo]
Length = 338
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 147/329 (44%), Positives = 201/329 (61%), Gaps = 18/329 (5%)
Query: 24 ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---G 80
AS + ++P++ W +GR Y+++ E+ R I+++NL+ + N E G
Sbjct: 18 ASSYAVAQVQNDPTLDHHWNLWKKTYGRQYQEKNEEVARRLIWEKNLKSVMLHNLEYSMG 77
Query: 81 NRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
+Y LG N +D+T+EE +L + VPS Q T+K + +P S+DWREK
Sbjct: 78 MHSYDLGMNHLADMTSEEVSSLMSSLR--VPS---QWQANVTYKSNSNQKLPDSVDWREK 132
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGL 197
G VT +K QG CG+CWAFSAV A+E ++ G L+ LS Q LVDCST+ N GC+GG
Sbjct: 133 GCVTEVKYQGACGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTERYGNKGCNGGF 192
Query: 198 MDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVS 257
M KAF+YII+N G+ +E YPY+ +G C K AAT SKY +LP G E AL +AV+
Sbjct: 193 MTKAFQYIIDNNGIDSEVSYPYKAMDGNC-RYDSKHRAATCSKYTELPFGSEDALKEAVA 251
Query: 258 NQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSW 315
N+ PVSV +DA +F YKSGV + C N +HGV VVG+G NG YWL+KNSW
Sbjct: 252 NKGPVSVAIDAKHSSFFLYKSGVYYDPSCTQNVNHGVLVVGYGNL---NGRDYWLVKNSW 308
Query: 316 GETWGESGYIRILRDAG-LCGIATAASYP 343
G +GE GYIR+ R++G CGIA+ SYP
Sbjct: 309 GLNFGEQGYIRMARNSGNHCGIASYPSYP 337
>gi|2961621|gb|AAC05781.1| cathepsin S [Mus musculus]
Length = 340
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 199/319 (62%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 29 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+TNEE +SRQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 89 GDMTNEEISC-----RMGALRISRQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
N G+ +A YPY+ + C + K AAT S+Y LP GDE AL +AV+ + PVSV +D
Sbjct: 204 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262
Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
AS +F FYKSGV + C N +HGV VVG+GT + G YWL+KNSWG +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 319
Query: 326 RILR-DAGLCGIATAASYP 343
R+ R + CGIA+ SYP
Sbjct: 320 RMARNNKNHCGIASYCSYP 338
>gi|6650705|gb|AAF21977.1|AF115280_1 thiolproteinase SmTP1 [Sarcocystis muris]
Length = 394
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 136/309 (44%), Positives = 191/309 (61%), Gaps = 12/309 (3%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALY 103
Q+ H + Y E E+ R IFK NL YI N +G +Y L N+F DLT EEFR Y
Sbjct: 91 QFQRDHNKFYATEEERLKRYAIFKNNLTYIHNHNMQG-YSYVLKMNKFGDLTLEEFRQRY 149
Query: 104 TGYNRP-VPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVA 162
GY +P + + R+ +T + D+PT +DWR++G VT +KDQG CGSCWAFSA
Sbjct: 150 LGYKKPDLRTPPREVD--TTLESVEDNDIPTHVDWRQRGCVTSVKDQGDCGSCWAFSATG 207
Query: 163 AVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
A+EG+ GKL+ LS+QQLVDCS N GC GG M++AFEY++EN G+ + +YPY
Sbjct: 208 AMEGVYCAKTGKLVNLSQQQLVDCSRFLGNQGCDGGRMEEAFEYVVENGGICSGENYPYM 267
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVS-NQPVSVCVDASGRAFHFYKSGV 279
++G C + + +V ATI+ Y +P+ E+++ A++ PVSV + A+ AF FY G+
Sbjct: 268 RKDGVCKSSQCTSV-ATITGYRSVPRRSEKSMKTALALRSPVSVAIQANQAAFQFYYDGI 326
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGI 336
+A CG N DHGV +VG+ +AE YW++KNSWG WG+ GY+ + AG CG+
Sbjct: 327 FDAPCGTNLDHGVLLVGY-SAETAGQGDYWIMKNSWGAAWGKGGYMLMAMHKGPAGQCGV 385
Query: 337 ATAASYPVA 345
S+PVA
Sbjct: 386 LLDGSFPVA 394
>gi|3850787|emb|CAA05360.1| cathepsin S [Mus musculus]
Length = 330
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 146/325 (44%), Positives = 200/325 (61%), Gaps = 19/325 (5%)
Query: 29 SGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYK 85
+G + P++ + W H + YKD+ E+ +R I+++NL++I N E G TY+
Sbjct: 13 NGATAERPTLDHHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQ 72
Query: 86 LGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTH 145
+G N+ D+TNEE P RQS + TF+ + +P ++DWREKG VT
Sbjct: 73 VGMNDMGDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTE 127
Query: 146 IKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKA 201
+K QG CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +A
Sbjct: 128 VKYQGSCGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEA 187
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-P 260
F+YII+N G+ +A YPY+ + C + K AAT S+Y LP GDE AL +AV+ + P
Sbjct: 188 FQYIIDNGGIEADASYPYKAMDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGP 246
Query: 261 VSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
VSV +DAS +F FYKSGV + C N +HGV VVG+GT + G YWL+KNSWG +
Sbjct: 247 VSVGIDASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNF 303
Query: 320 GESGYIRILR-DAGLCGIATAASYP 343
G+ GYIR+ R + CGIA+ SYP
Sbjct: 304 GDQGYIRMARNNKNHCGIASYCSYP 328
>gi|37786769|gb|AAO64471.1| cathepsin L precursor [Fundulus heteroclitus]
Length = 337
Score = 263 bits (672), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 209/342 (61%), Gaps = 16/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + +L + +S V+S S+ +P + + W + H + Y E RL ++++NL+
Sbjct: 1 MLPVAVLTLCLSSAVLSAPSL-DPQLDQHWNLWKSWHSKNYHQREEGWRRL-VWEKNLKK 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +Y+LG N F D+T+EEF+ + GY + + + S F N
Sbjct: 59 IELHNLEHSMGKHSYRLGMNHFGDMTHEEFKQIMNGYKHK----AERKFKGSLFLEPNFL 114
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+ P S+DWREKG VT +KDQG+CGSCWAFS A+EG GKL+ LS Q LV+CS
Sbjct: 115 EAPRSVDWREKGYVTPVKDQGECGSCWAFSTTGALEGQEFTRTGKLVSLSGQNLVECSRP 174
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+Y+ +N+GL +E YPY + + K AA + + D+P G
Sbjct: 175 EGNEGCNGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKFSAANDTGFVDIPSG 234
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
+E+AL++AV++ PVSV +DA +F FY+SG+ +C + DHGV VG+G E+
Sbjct: 235 NERALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFQGEDV 294
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+G K+W++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 295 DGKKFWIVKNSWSENWGDKGYIYMAKDRKNHCGIATAASYPL 336
>gi|443698586|gb|ELT98517.1| hypothetical protein CAPTEDRAFT_128252 [Capitella teleta]
Length = 324
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 149/335 (44%), Positives = 204/335 (60%), Gaps = 21/335 (6%)
Query: 19 LVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK 78
++ C + ++ + + ++ E + H +TY E E MR I++++L I + N
Sbjct: 1 MLACCIAATLASPLVFDEALDEMWTLFKTTHSKTYATEAED-MRRFIWERHLNMINQHNI 59
Query: 79 E---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
E G T+ LG NE+ DLT E+ A+ +GY SV P + VP ++
Sbjct: 60 EADLGKHTFSLGMNEYGDLTQHEYAAM-SGYKMAKSSVGSSFLEPENLQ------VPKTV 112
Query: 136 DWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGC 193
DWREKG VT +K+QGQCGSCWAFS+ ++EG G+L +SEQ LVDCS D N GC
Sbjct: 113 DWREKGYVTPVKNQGQCGSCWAFSSTGSLEGQVFRKTGRLPSISEQNLVDCSRDEGNMGC 172
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
SGGLMD AF YI +N G+ +E YPY +G C +K +V T S + D+P GDE AL
Sbjct: 173 SGGLMDNAFTYIKKNMGIDSEKSYPYEAVDGECRYKKSDSV-TTDSGFVDIPHGDETALR 231
Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWL 310
AV++ PVSV +DAS +F FYK+GV A+C + DHGV VVG+G ENG YWL
Sbjct: 232 TAVASVGPVSVAIDASHTSFQFYKTGVYTEANCSSTQLDHGVLVVGYGV---ENGQDYWL 288
Query: 311 IKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
+KNSWG +WGE+GYI++ R+ G CGIA+ ASYP+
Sbjct: 289 VKNSWGASWGEAGYIKLARNHGNQCGIASQASYPL 323
>gi|291383517|ref|XP_002708299.1| PREDICTED: cathepsin L1 [Oryctolagus cuniculus]
Length = 333
Score = 263 bits (672), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 149/343 (43%), Positives = 203/343 (59%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F++ + AS + + ++ + QW A H R Y E+ R ++++N+
Sbjct: 3 PSFLLAAVCWGIASAIPK----FDQNLDTQWYQWKATHKRLYGLN-EEGWRRAVWEKNMR 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N E G + +G N + D+TNEEFR + G+ + P +Y
Sbjct: 58 MIELHNGEYSQGKHGFTMGMNAYGDMTNEEFRQVMNGFQNQKHKKGKMFRDPLLLQY--- 114
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
P S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKLI LSEQ LVDCS
Sbjct: 115 ---PKSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFQKTGKLISLSEQNLVDCSH 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY +GTC + E +VA + + D+P
Sbjct: 172 PQGNQGCNGGLMDYAFQYVKDNSGLDSEESYPYEGMDGTCKYKPECSVAND-TGFVDIP- 229
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
G E+ALL+AV+ P+S +DA +F FYKSG+ + DC + + DHG+ VVG+G
Sbjct: 230 GHEKALLRAVATVGPISAAIDAGHMSFQFYKSGIYYDPDCSSKDLDHGILVVGYGFEGTN 289
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
N KYWL+KNSWG TWG+ GY++I+RD CGIATAASYP
Sbjct: 290 SNATKYWLVKNSWGTTWGDEGYVKIIRDKDNHCGIATAASYPT 332
>gi|2746723|gb|AAB94925.1| cathepsin S precursor [Mus musculus]
Length = 340
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 199/319 (62%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 29 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+TNEE +SRQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 89 GDMTNEEISC-----RMGALRISRQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
N G+ +A YPY+ + C + K AAT S+Y LP GDE AL +AV+ + PVSV +D
Sbjct: 204 NGGIEADASYPYKAMDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262
Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
AS +F FYKSGV + C N +HGV VVG+GT + G YWL+KNSWG +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 319
Query: 326 RILR-DAGLCGIATAASYP 343
R+ R + CGIA+ SYP
Sbjct: 320 RMARNNKNHCGIASYCSYP 338
>gi|62510453|sp|Q8HY82.1|CATS_SAIBB RecName: Full=Cathepsin S; Flags: Precursor
gi|27497536|gb|AAO13008.1| cathepsin S preproprotein [Saimiri boliviensis]
Length = 330
Score = 263 bits (672), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 207/340 (60%), Gaps = 21/340 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ ++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKQLVCVLFVCSSAVTQ---LHKDPTLDHHWNLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + P Q R T+K
Sbjct: 58 FVMLHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLRVP-----NQWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCS
Sbjct: 113 QMLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSE 172
Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GG M +AF+YII+NKG+ +EA YPY+ + C K AAT SKY +LP
Sbjct: 173 KYGNKGCNGGFMTEAFQYIIDNKGIDSEASYPYKATDQKCQ-YDSKYRAATCSKYTELPY 231
Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
G E L +AV+N+ PV V VDAS +F Y+SGV + C +HGV V+G+G + N
Sbjct: 232 GREDVLKEAVANKGPVCVGVDASHPSFFLYRSGVYYDPACTQKVNHGVLVIGYG---DLN 288
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
G +YWL+KNSWG +GE GYIR+ R+ G CGIA+ SYP
Sbjct: 289 GKEYWLVKNSWGSNFGEQGYIRMARNKGNHCGIASYPSYP 328
>gi|354622947|ref|NP_001002938.2| cathepsin S precursor [Canis lupus familiaris]
Length = 339
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 207/340 (60%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++ L+ C+ V + +P++ W + + YK+E E+ R I+++NL++
Sbjct: 9 MKWLVGLLPLCSYAVA--QVHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKF 66
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y LG N D+T EE +L G R VPS Q R T++ +
Sbjct: 67 VMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISL-MGSLR-VPS---QWQRNVTYRSNSNQ 121
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST+
Sbjct: 122 KLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 181
Query: 190 ---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GG M AF+YII+N G+ +EA YPY+ G C +K AAT SKY +LP
Sbjct: 182 KYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKR-AATCSKYTELPF 240
Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
G E AL +AV+N+ PVSV +DAS +F Y+SGV C N +HGV VVG+G N
Sbjct: 241 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL---N 297
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
G YWL+KNSWG +G+ GYIR+ R++G CGIA+ SYP
Sbjct: 298 GKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 337
>gi|395856029|ref|XP_003800445.1| PREDICTED: cathepsin S [Otolemur garnettii]
Length = 331
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 147/338 (43%), Positives = 205/338 (60%), Gaps = 21/338 (6%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
++ L++ C++ R +P++ W +G+ Y ++ E+ R I+++NL+++
Sbjct: 4 LVWTLLVCCSAMAQLHR---DPALDHHWHLWKKTYGKQYTEKNEETERRLIWEKNLKFVM 60
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
N E G +Y LG N D+T+EE +L T P RQS R T+K +
Sbjct: 61 LHNLEHSMGMHSYDLGMNHLGDMTSEEVVSLMTCLKVP-----RQSQRNVTYKSSPNQKL 115
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
P S+DWREKG VT +K QG CGSCWAFSAV A+E ++T GKL+ LS Q LVDCST+
Sbjct: 116 PDSLDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLTTGKLVSLSAQNLVDCSTEKY 175
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC GG M +AF+YII+N G+ +EA YPY+ + C K AAT SKY +LP G
Sbjct: 176 RNEGCHGGFMTEAFQYIIDNNGIDSEASYPYKAMDEKCQ-YDSKNRAATCSKYTELPFGS 234
Query: 249 EQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGA 306
E+AL +AV+++ PVSV +DAS +F Y+SGV C +HGV VVG+G NG
Sbjct: 235 EEALKEAVASKGPVSVAIDASHSSFFLYRSGVYYEPACTQVVNHGVLVVGYGNL---NGN 291
Query: 307 KYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYP 343
YWL+KNSWG +G+ GYIR+ R+ CGIA+ +SYP
Sbjct: 292 DYWLVKNSWGLYFGDKGYIRMARNRENHCGIASYSSYP 329
>gi|189525870|ref|XP_001923796.1| PREDICTED: cathepsin L1 [Danio rerio]
Length = 335
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 204/340 (60%), Gaps = 15/340 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ +LV S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLYISAVFAAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR GY +R S P F
Sbjct: 59 EQHNFEYSLGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGP-LFMEPKFFA 114
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPH 174
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I+ + D+PKG+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPKGN 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFG-TAEEENG 305
E AL+ AV+ PVSV +DAS ++ FY+SG+ C + DH V VVG+G + G
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACTSQLDHAVLVVGYGYQGADVAG 294
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+YW++KNSW + WG+ GYI + +D CGIAT ASYP+
Sbjct: 295 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGIATMASYPL 334
>gi|38147395|gb|AAR12010.1| cathepsin L-like proteinase [Triatoma infestans]
Length = 328
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 143/314 (45%), Positives = 195/314 (62%), Gaps = 22/314 (7%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTN 96
E+W+A Q G++YK+ E+ R+N++K+N I++ NK G +YKL N F DL
Sbjct: 24 EEWLAFKAQFGKSYKNSFEELFRMNVYKENQRKIDEHNKRYENGEVSYKLKMNHFGDLMQ 83
Query: 97 EEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
EF+AL + ++Q + F+ +P +DWR+KGAVT +KD GQCGSCW
Sbjct: 84 HEFKAL-----NKLKRSAKQQNSGEVFRATG-GKLPAKVDWRQKGAVTPVKDPGQCGSCW 137
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATE 214
AFS+ ++ G + KL+ LSEQQLVDCS + N GC GG+M +AF+YI N G+ TE
Sbjct: 138 AFSSTGSLGGQLFLKNKKLVSLSEQQLVDCSGNYGNDGCDGGIMVQAFQYIKGNGGIDTE 197
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFH 273
YPY E+ C K K+VA T Y D+ +GDE AL +AV+ P+SV +DA +F
Sbjct: 198 GSYPYEAEDDKC-RYKTKSVAGTDKGYVDIAQGDENALKEAVAEIGPISVAIDAGNLSFQ 256
Query: 274 FYKSGVLNAD-CGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
FY G+ + C N DHGV VVG+GT ENG YWL+KNSWG +WGE+GYI+I R+
Sbjct: 257 FYSEGIYDEPFCSNTELDHGVLVVGYGT---ENGQDYWLVKNSWGPSWGENGYIKIARNH 313
Query: 332 -GLCGIATAASYPV 344
CGIA+ ASYP+
Sbjct: 314 NNHCGIASMASYPI 327
>gi|307175095|gb|EFN65237.1| Cathepsin L [Camponotus floridanus]
Length = 372
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 147/332 (44%), Positives = 196/332 (59%), Gaps = 23/332 (6%)
Query: 23 CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR 82
C V +G S H H + YK +E+ R+ IF N I + N++
Sbjct: 55 CCGSVFAGSSCHR-----------THHKKVYKSPIEEGYRMKIFLDNKRKIVEHNRKYEM 103
Query: 83 ---TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWRE 139
YKLG N++ D+ + E G+N+ V +VS + +TF ++P S+DWR+
Sbjct: 104 KEVNYKLGMNKYGDMLHHELINTLNGFNKSV-TVSEEQLIGATFIEPANVELPKSVDWRK 162
Query: 140 KGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGL 197
KGAVT IKDQGQCGSCWAFS+ A+EG G L+ LSEQ L+DCS N+GC+GGL
Sbjct: 163 KGAVTAIKDQGQCGSCWAFSSTGALEGQHFRQSGVLVSLSEQNLIDCSGKYGNNGCNGGL 222
Query: 198 MDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVS 257
MD AF YI ENKGL TE YPY E C + + A+ + + D+P+GDE L AV+
Sbjct: 223 MDYAFRYIKENKGLDTEKSYPYEAENDQCRYNPKNSGASDVG-FVDIPEGDEDKLKAAVA 281
Query: 258 N-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNS 314
P+SV +DAS +FHFY GV +C N DHGV +VG+GT + G YWL+KNS
Sbjct: 282 TIGPISVAIDASHESFHFYSEGVYYEPECSPANLDHGVLIVGYGT-DSGTGEDYWLVKNS 340
Query: 315 WGETWGESGYIRILRDA-GLCGIATAASYPVA 345
WGETWGE GYI++ R+ CGIA++ASYP+
Sbjct: 341 WGETWGEKGYIKMARNKENHCGIASSASYPLV 372
>gi|380014284|ref|XP_003691169.1| PREDICTED: cathepsin L-like [Apis florea]
Length = 345
Score = 263 bits (671), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 148/347 (42%), Positives = 207/347 (59%), Gaps = 24/347 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEK-HEQWMA---QHGRTYKDELEKAMRLNIFKQ 68
M + +IL IT + V H S E +++WM +H + YK ++E+ R+ IF
Sbjct: 1 MKLFLILFITIFATV------HAVSFFELVNQEWMTFKMEHKKAYKSDVEERFRMKIFMD 54
Query: 69 NLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRP--STF 123
N I K N +YKL N++ D+ + EF + G+N+ + + R P ++F
Sbjct: 55 NKHKIAKHNSNYEMKKVSYKLKMNKYGDMLHHEFVNILNGFNKSINTQLRSERMPIGASF 114
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
+P +DWR++GAVT +KDQG CGSCW+FSA A+EG G L+ LSEQ L
Sbjct: 115 IEPANVALPKKVDWRKEGAVTPVKDQGHCGSCWSFSATGALEGQHFRRTGVLVSLSEQNL 174
Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
+DCS N+GC+GGLMD+AF+YI +NKGL TEA YPY E C + A + Y
Sbjct: 175 IDCSGKYGNNGCNGGLMDQAFQYIKDNKGLDTEASYPYEAENDKCRYNPANSGAIDVG-Y 233
Query: 242 EDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG 298
D+P G+E+ L AV+ PVSV +DAS ++F FY GV +C + DHGV V+G+G
Sbjct: 234 IDIPTGNEKLLKAAVATIGPVSVAIDASHQSFQFYSEGVYYEPECSSEELDHGVLVIGYG 293
Query: 299 TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
T ENG YWL+KNSWGETWG +GYI++ R+ CGIA++ASYP+
Sbjct: 294 T--NENGEDYWLVKNSWGETWGNNGYIKMARNKLNHCGIASSASYPL 338
>gi|348687948|gb|EGZ27762.1| papain-like cysteine protease C1 [Phytophthora sojae]
Length = 533
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 138/307 (44%), Positives = 193/307 (62%), Gaps = 13/307 (4%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRALY 103
WM+ HG T+ D LE A RL + N YI + N E T KLG N FS ++ +EF+
Sbjct: 31 WMSAHGVTFSDALEFARRLENYIANDMYILEHNAENAWTGVKLGHNAFSHMSFDEFKFKM 90
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
TG P + ++ + + +V +VP+++DW +KG VT +K+QG CGSCWAFS A
Sbjct: 91 TGLVLPEGYLEQRLASRVDGLWSDV-EVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTGA 149
Query: 164 VEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
VEG T ++ GKL+ LSEQ+LVDC + + GC+GGLMD AF++I ++ G+ +E DY Y+ +
Sbjct: 150 VEGATFVSSGKLLSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKAK 209
Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
C ++ ++ ++D+ DE AL AV+ QPVSV ++A +AF FYKSGV N
Sbjct: 210 AQVC---RKCDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266
Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIAT 338
CG DHGV VG+G +NG K+W +KNSWG +WGE GYIR+ R+ AG CGIA+
Sbjct: 267 TCGTRLDHGVLAVGYGN---DNGQKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIAS 323
Query: 339 AASYPVA 345
SYP A
Sbjct: 324 VPSYPFA 330
>gi|62510452|sp|Q8HY81.1|CATS_CANFA RecName: Full=Cathepsin S; Flags: Precursor
gi|27497538|gb|AAO13009.1| cathepsin S preproprotein [Canis lupus familiaris]
Length = 331
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 151/340 (44%), Positives = 207/340 (60%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++ L+ C+ V + +P++ W + + YK+E E+ R I+++NL++
Sbjct: 1 MKWLVGLLPLCSYAVA--QVHKDPTLDHHWNLWKKTYSKQYKEENEEVARRLIWEKNLKF 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y LG N D+T EE +L G R VPS Q R T++ +
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTGEEVISL-MGSLR-VPS---QWQRNVTYRSNSNQ 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST+
Sbjct: 114 KLPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 190 ---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GG M AF+YII+N G+ +EA YPY+ G C +K AAT SKY +LP
Sbjct: 174 KYGNKGCNGGFMTTAFQYIIDNNGIDSEASYPYKAMNGKCRYDSKKR-AATCSKYTELPF 232
Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
G E AL +AV+N+ PVSV +DAS +F Y+SGV C N +HGV VVG+G N
Sbjct: 233 GSEDALKEAVANKGPVSVAIDASHYSFFLYRSGVYYEPSCTQNVNHGVLVVGYGNL---N 289
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
G YWL+KNSWG +G+ GYIR+ R++G CGIA+ SYP
Sbjct: 290 GKDYWLVKNSWGLNFGDQGYIRMARNSGNHCGIASYPSYP 329
>gi|196002275|ref|XP_002111005.1| expressed hypothetical protein [Trichoplax adhaerens]
gi|190586956|gb|EDV27009.1| expressed hypothetical protein [Trichoplax adhaerens]
Length = 325
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 148/338 (43%), Positives = 203/338 (60%), Gaps = 20/338 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + + + C + VVS + +PS E W + HG+ Y ++ E R +F QN++
Sbjct: 1 MKTLSVFLAICLA-VVSAIPLKDPSW----EAWKSFHGKKYHNQGEDDFRHYVFLQNIKT 55
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
I N + T+K+ NEFSDLT +EF Y GY S+ + +++PSTF T++P
Sbjct: 56 IAAHNAKS--TFKMAINEFSDLTRKEFVKTYNGYRL---SMKKSTNKPSTFMAPLNTNMP 110
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DN 190
T +DWR++G VT IK+QG+CGSCWAFS ++EG GKL+ LSEQ L+DCS N
Sbjct: 111 TEVDWRKEGYVTPIKNQGRCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLIDCSAAEGN 170
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC GG MD AFEYI N G+ TEA YPY + C +K A + Y D+ + E
Sbjct: 171 DGCGGGFMDDAFEYIKLNNGIDTEASYPYEGRDDICRYKKTNK-GAIDTGYMDIKQYSED 229
Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNC-DHGVAVVGFGTAEEENGAK 307
L AV+ P+SV +DAS ++FH Y +GV + +C DHGV VVG+GT ENG
Sbjct: 230 DLKAAVATVGPISVAIDASHKSFHMYHTGVYHEPECSQTVLDHGVLVVGYGT---ENGED 286
Query: 308 YWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
YWL+KNSWG WG +GYI++ R+ + CGIAT ASYP+
Sbjct: 287 YWLVKNSWGTDWGMNGYIKMSRNRSNNCGIATNASYPL 324
>gi|156397875|ref|XP_001637915.1| predicted protein [Nematostella vectensis]
gi|156225031|gb|EDO45852.1| predicted protein [Nematostella vectensis]
Length = 331
Score = 262 bits (670), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 148/338 (43%), Positives = 199/338 (58%), Gaps = 14/338 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M +I V+ C S ++ M EP + W + HG+ Y ++ E+ MR I++ NL+
Sbjct: 1 MEAVIFAVLLCISSALAMPPM-EPLQDPNWKAWKSFHGKEYPNKNEETMRNFIWQNNLKK 59
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVP 132
I N EG ++KL N D+T+ E G + + + +TF V
Sbjct: 60 IVTHN-EGKHSFKLAMNHLGDMTSLEISQTLLGLK--LKKHAESQPKGATFLPPANVKVV 116
Query: 133 TSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--N 190
SIDWR KG VT +K+QGQCGSCWAFS A+EG GKL+ LSEQ LVDCS N
Sbjct: 117 DSIDWRSKGYVTPVKNQGQCGSCWAFSTTGALEGQHFRKTGKLVSLSEQNLVDCSGKYGN 176
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
+GC GGLMD AF+YI EN G+ TE YPY ++G C K A+ A + + D+P GDE
Sbjct: 177 NGCEGGLMDNAFQYIKENGGIDTEKSYPYLAKDGVCHYNKS-AIGAKDTGFVDIPTGDEN 235
Query: 251 ALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNN-CDHGVAVVGFGTAEEENGAK 307
AL QA+++ P+S+ +DAS FHFY GV + DC + DHGV VG+GT ++G
Sbjct: 236 ALQQALASVGPISIAIDASQSTFHFYHQGVYDDPDCSSTRLDHGVLAVGYGT---DDGKD 292
Query: 308 YWLIKNSWGETWGESGYIRILR-DAGLCGIATAASYPV 344
YWL+KNSWG +WGE GYI+I R D CG+A+ ASYP+
Sbjct: 293 YWLVKNSWGPSWGEEGYIKIARNDHDKCGVASKASYPL 330
>gi|390608645|ref|NP_001254624.1| cathepsin S isoform 1 preproprotein [Mus musculus]
gi|74214026|dbj|BAE29430.1| unnamed protein product [Mus musculus]
Length = 343
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 32 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 91
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+TNEE P RQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 92 GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 146
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 147 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 206
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
N G+ +A YPY+ + C + K AAT S+Y LP GDE AL +AV+ + PVSV +D
Sbjct: 207 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 265
Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
AS +F FYKSGV + C N +HGV VVG+GT + G YWL+KNSWG +G+ GYI
Sbjct: 266 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 322
Query: 326 RILR-DAGLCGIATAASYP 343
R+ R + CGIA+ SYP
Sbjct: 323 RMARNNKNHCGIASYCSYP 341
>gi|22661|emb|CAA49504.1| papaya proteinase omega [Carica papaya]
Length = 367
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 139/352 (39%), Positives = 201/352 (57%), Gaps = 16/352 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ----WMAQHGRTYKDE 56
M+ K + + + + + ++ + G S + + E+ Q WM H + Y++
Sbjct: 3 MIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENV 62
Query: 57 LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
EK R IFK NL YI++ NK+ N +Y+LG NEF+DL+N+EF Y G + + +
Sbjct: 63 DEKLYRFEIFKDNLNYIDETNKK-NNSYRLGLNEFADLSNDEFNEKYVG---SLIDATIE 118
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
S F +++ ++P ++DWR+KGAVT ++ QG CGSCWAFSAVA VEGI +I GKL+
Sbjct: 119 QSYDEEFINEDIVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLV 178
Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
ELSEQ+LVDC +HGC GG A EY+ +N G+ + YPY+ ++GTC ++
Sbjct: 179 ELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIV 237
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
S + +E LL A++ QPVSV V++ GR F YK G+ CG DH V V
Sbjct: 238 KTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAV- 296
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
+ G Y LIKNSWG WGE GYIRI R G+CG+ ++ YP+
Sbjct: 297 --GYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYPI 346
>gi|74178074|dbj|BAE29827.1| unnamed protein product [Mus musculus]
gi|74178231|dbj|BAE29900.1| unnamed protein product [Mus musculus]
gi|74220784|dbj|BAE31361.1| unnamed protein product [Mus musculus]
Length = 326
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 15 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 74
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+TNEE P RQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 75 GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 129
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 130 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 189
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
N G+ +A YPY+ + C + K AAT S+Y LP GDE AL +AV+ + PVSV +D
Sbjct: 190 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 248
Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
AS +F FYKSGV + C N +HGV VVG+GT + G YWL+KNSWG +G+ GYI
Sbjct: 249 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 305
Query: 326 RILR-DAGLCGIATAASYP 343
R+ R + CGIA+ SYP
Sbjct: 306 RMARNNKNHCGIASYCSYP 324
>gi|318037269|ref|NP_001187182.1| cathepsin L precursor [Ictalurus punctatus]
gi|196475596|gb|ACG76367.1| cathepsin L [Ictalurus punctatus]
Length = 336
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 149/342 (43%), Positives = 203/342 (59%), Gaps = 17/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + + ++ C S V + ++ + + +QW H + Y E E+ R ++++NL+
Sbjct: 1 MRLCLAVLAVCLSTVSAAPTV-DRELDGHWQQWKEWHNKDYH-EKEEGWRRMVWEKNLKK 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +Y+L N F D+ +EEFR + GY V + R S F N
Sbjct: 59 IELHNLEHSLGKHSYRLAMNHFGDMPHEEFRQVMNGYKHKVRKI-----RGSLFMEPNFL 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+ P+ +DWREKG VT +KDQGQCGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 114 EAPSKLDWREKGYVTPVKDQGQCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRP 173
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+YI +N GL TE YPY + + AA + + D+P G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDTEKFYPYLGTDDQPCHYDPSYSAANDTGFVDIPSG 233
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E AL++AV+ PVSV +DA +F FY+SG+ ADC + + DHGV VVG+G E
Sbjct: 234 KEHALMKAVTAVGPVSVAIDAGHESFQFYQSGIYYEADCSSEDLDHGVLVVGYGYEGENV 293
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+G KYW++KNSW E WG GYI + +D CGIATAASYP+
Sbjct: 294 DGKKYWIVKNSWSEQWGNKGYIYMAKDRHNHCGIATAASYPL 335
>gi|392306967|ref|NP_067256.3| cathepsin S isoform 2 preproprotein [Mus musculus]
gi|26390492|dbj|BAC25906.1| unnamed protein product [Mus musculus]
gi|148706872|gb|EDL38819.1| cathepsin S [Mus musculus]
Length = 342
Score = 262 bits (670), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 31 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 90
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+TNEE P RQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 91 GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 145
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 146 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 205
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
N G+ +A YPY+ + C + K AAT S+Y LP GDE AL +AV+ + PVSV +D
Sbjct: 206 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 264
Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
AS +F FYKSGV + C N +HGV VVG+GT + G YWL+KNSWG +G+ GYI
Sbjct: 265 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 321
Query: 326 RILR-DAGLCGIATAASYP 343
R+ R + CGIA+ SYP
Sbjct: 322 RMARNNKNHCGIASYCSYP 340
>gi|189525868|ref|XP_001341714.2| PREDICTED: cathepsin L1-like isoform 1 [Danio rerio]
Length = 336
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 140/340 (41%), Positives = 204/340 (60%), Gaps = 15/340 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
++ L++T + V S + + + W +QHG++Y +++E R+ I+++NL IE
Sbjct: 1 MMFALLVTLSISAVFAASSIDIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKIE 59
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
+ N E GN T+K+G N+F D+TNEEFR GY Q+S+ F +
Sbjct: 60 QHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKHD----PNQTSQGPLFMEPSFFAA 115
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 116 PQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQG 175
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I+ + D+P G+E
Sbjct: 176 NQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGNE 235
Query: 250 QALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEENG 305
AL+ AV+ PVSV +DAS ++ FY+SG+ A + DH V VVG+G + G
Sbjct: 236 LALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVAG 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+YW++KNSW + WG+ GYI + +D CG+AT ASYP+
Sbjct: 296 NRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|325303202|tpg|DAA34687.1| TPA_inf: cathepsin L-like cysteine proteinase B [Amblyomma variegatum]
Length = 337
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/344 (42%), Positives = 214/344 (62%), Gaps = 20/344 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++L CA+ + + H+ + + + A HG+ Y+ E E+ RL I+ +N
Sbjct: 1 MRGFVVLCFLCAAMTAAAIT-HQELVGAEWSAFKALHGKEYQSETEEYYRLKIYMENRMM 59
Query: 73 IEKANKE--GNR-TYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS---RPSTFKYQ 126
I + N++ N+ +YKL NE+ D+ + EF + G+ R S RQ S P + +
Sbjct: 60 IARHNEKYANNKVSYKLAMNEYGDMLHHEFVSTRNGFRRDYRSKPRQGSFYIEPEGIEDK 119
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
++ P ++DWR+KGAVT +K+QGQCGSCWAFS ++EG G ++ LSEQ LVDC
Sbjct: 120 HL---PKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKSGDMVSLSEQNLVDC 176
Query: 187 ST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
ST N+GC GGLMD AF+YI N G+ TE YPY +GTC + K+ V AT + + D+
Sbjct: 177 STAFGNNGCEGGLMDNAFKYIKANGGIDTEKSYPYNGTDGTC-HFKKSDVGATDTGFVDI 235
Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAE 301
P+G+E L +AV+ P+SV +DAS ++F FY GV + +C + N DHGV VVG+GT +
Sbjct: 236 PEGNEHLLKKAVATVGPISVAIDASHQSFQFYSQGVYDEPECSSENLDHGVLVVGYGTKD 295
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+++ YWL+KNSWG TWG+ GYI + R+ CGIA++ASYP+
Sbjct: 296 DQD---YWLVKNSWGTTWGDGGYIYMTRNKDNQCGIASSASYPL 336
>gi|33112581|gb|AAP94046.1| cathepsin-L-like cysteine peptidase 02 [Tenebrio molitor]
Length = 337
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 150/344 (43%), Positives = 206/344 (59%), Gaps = 23/344 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNL 70
F+I + + SQ VS + + EQW A H + Y+ + E+ R+ IF +N
Sbjct: 3 FLIFLAICVAGSQAVSFFDLVQ-------EQWGAFKMTHNKQYQSDTEERFRMKIFMENS 55
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSV-SRQSSRPSTFKYQ 126
+ K NK +G ++KLG N+++D+ + EF + G+NR + S +S TF
Sbjct: 56 HTVAKHNKLYAQGLVSFKLGINKYADMLHHEFVQVLNGFNRTKSGLRSGESDDSVTFLPP 115
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+P IDWR+KGAVT +KDQGQCGSCW+FSA ++EG GKL+ LSEQ LVDC
Sbjct: 116 ANVQLPGQIDWRDKGAVTPVKDQGQCGSCWSFSATGSLEGQHFRKSGKLVSLSEQNLVDC 175
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N+GC+GGLMD AF YI N G+ TE YPY+ E+ C + K K AT Y D+
Sbjct: 176 SEKFGNNGCNGGLMDNAFRYIKANGGIDTEQAYPYKAEDEKC-HYKPKNKGATDRGYVDI 234
Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCG-NNCDHGVAVVGFGTAE 301
G+E L AV+ PVSV +DAS ++F Y GV +C + DHGV VVG+GT
Sbjct: 235 ESGNEDKLQSAVATVGPVSVAIDASHQSFQLYSGGVYYEPECSPSQLDHGVLVVGYGT-- 292
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
E++G YWL+KNSWG++WG+ GYI++ R+ CGIAT ASYP+
Sbjct: 293 EDDGTDYWLVKNSWGKSWGDQGYIKMARNRDNNCGIATEASYPL 336
>gi|341940310|sp|O70370.2|CATS_MOUSE RecName: Full=Cathepsin S; Flags: Precursor
Length = 340
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 29 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+TNEE P RQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 89 GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
N G+ +A YPY+ + C + K AAT S+Y LP GDE AL +AV+ + PVSV +D
Sbjct: 204 NGGIEADASYPYKATDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262
Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
AS +F FYKSGV + C N +HGV VVG+GT + G YWL+KNSWG +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 319
Query: 326 RILR-DAGLCGIATAASYP 343
R+ R + CGIA+ SYP
Sbjct: 320 RMARNNKNHCGIASYCSYP 338
>gi|387015022|gb|AFJ49630.1| Cathepsin L1-like [Crotalus adamanteus]
Length = 338
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 212/343 (61%), Gaps = 17/343 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + IL ++ + + +P++ + W + H + Y E E+ R I+++NL+
Sbjct: 1 MIYLCILALSFGASFAA--PGLDPALNDHWLSWKSWHSKKYH-EKEEGWRRMIWEKNLKM 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N + G +Y+LG N F D+TNEEFR + G+ + S S++ + S F N
Sbjct: 58 IELHNLDHSLGKHSYRLGMNHFGDMTNEEFRQVMNGFKQ---SRSQRKYKGSQFLEPNFL 114
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
P S+DWREKG VT +KDQGQCGSCWAFSA A+EG GKL+ LSEQ L+DCS
Sbjct: 115 QAPKSVDWREKGYVTPVKDQGQCGSCWAFSATGALEGQHFRKTGKLVSLSEQNLIDCSGP 174
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+YI +N G+ +E YPY ++ K + +A + + D+P+G
Sbjct: 175 EGNQGCNGGLMDQAFQYIKDNNGIDSEESYPYIGKDDEDCLYKPEYNSANDTGFVDIPEG 234
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNADCGNN--CDHGVAVVGFGT--AEE 302
E+AL++AV+ P+SV +DAS +F FY+SGV N+ DHGV VVG+G ++
Sbjct: 235 RERALMKAVAAVGPISVAIDASHTSFQFYESGVYYEPQCNSEELDHGVLVVGYGYEGTDD 294
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+N +YW++KNSW E WG+ GYI + +D + CGIA+AASYP+
Sbjct: 295 DNKKRYWIVKNSWSEKWGDQGYIHMAKDRSNNCGIASAASYPM 337
>gi|116563690|gb|ABJ99858.1| cathepsin L [Hippoglossus hippoglossus]
Length = 336
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/341 (42%), Positives = 209/341 (61%), Gaps = 17/341 (4%)
Query: 15 VIIILVIT-CASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
++ +LV+T C S V+S + + + E + W + H + Y E E+ R ++++NL+ I
Sbjct: 1 MLPLLVLTACLSSVLSAPVL-DAQLNEHWDLWKSWHSKKYH-EKEEGWRRMVWEKNLQKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +++LG N F D+T+EEFR + GY +++ S F N
Sbjct: 59 ELHNLEHSMGTHSFRLGMNHFGDMTHEEFRQIMNGYKLK----TQRKFTGSLFMEPNFMT 114
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P+++DWREKG VT +KDQGQCGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 115 APSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPE 174
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC GGLMD+AF+Y+ +N+GL +E YPY + + +A + + D+P G
Sbjct: 175 GNEGCGGGLMDQAFQYVTDNQGLDSEDSYPYTGTDDQPCHYDPLYNSANDTGFVDVPSGK 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
E AL++AV++ PVSV +DA +F FY+SG+ +C + DHGV VG+G E++
Sbjct: 235 EHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDKM 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
G K+W++KNSWGE WG+ GYI + +D CGIATAASYP+
Sbjct: 295 GKKFWIVKNSWGEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|12805315|gb|AAH02125.1| Ctss protein [Mus musculus]
Length = 340
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/319 (45%), Positives = 198/319 (62%), Gaps = 19/319 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W H + YKD+ E+ +R I+++NL++I N E G TY++G N+
Sbjct: 29 DPTLDYHWDLWKKTHEKEYKDKNEEEVRRLIWEKNLKFIMIHNLEYSMGMHTYQVGMNDM 88
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+TNEE P RQS + TF+ + +P ++DWREKG VT +K QG
Sbjct: 89 GDMTNEEILCRMGALRIP-----RQSPKTVTFRSYSNRTLPDTVDWREKGCVTEVKYQGS 143
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD----NHGCSGGLMDKAFEYIIE 207
CG+CWAFSAV A+EG ++ GKLI LS Q LVDCS + N GC GG M +AF+YII+
Sbjct: 144 CGACWAFSAVGALEGQLKLKTGKLISLSAQNLVDCSNEEKYGNKGCGGGYMTEAFQYIID 203
Query: 208 NKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVD 266
N G+ +A YPY+ + C + K AAT S+Y LP GDE AL +AV+ + PVSV +D
Sbjct: 204 NGGIEADASYPYKAMDEKC-HYNSKNRAATCSRYIQLPFGDEDALKEAVATKGPVSVGID 262
Query: 267 ASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
AS +F FYKSGV + C N +HGV VVG+GT + G YWL+KNSWG +G+ GYI
Sbjct: 263 ASHSSFFFYKSGVYDDPSCTGNVNHGVLVVGYGTLD---GKDYWLVKNSWGLNFGDQGYI 319
Query: 326 RILR-DAGLCGIATAASYP 343
R+ R + CGIA+ SYP
Sbjct: 320 RMARNNKNHCGIASDCSYP 338
>gi|387914010|gb|AFK10614.1| cathepsin L [Callorhinchus milii]
gi|392873762|gb|AFM85713.1| cathepsin L [Callorhinchus milii]
gi|392877488|gb|AFM87576.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 212/342 (61%), Gaps = 15/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + +++ C + ++ S+ +P + EQW + HG++Y ++ E+ R +++++L
Sbjct: 1 MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +++LG N F D+ NEEFR L GY Q S F N
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH---FLEPNFL 115
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+VP +DWR++G VT +KDQGQCGSCWAFS A+EG G+L+ LSEQ LV+CS
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+Y+ +N G+ +E YPY + T + + AA + + D+P G
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEE- 303
E+AL++A++ PVSV +DA +F FY+SG+ A+C + + DHGV VVG+G + +
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+G KYW++KNSW E WG++GYI + +D CGIATAASYP+
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|392884266|gb|AFM90965.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 262 bits (669), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 212/342 (61%), Gaps = 15/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + +++ C + ++ S+ +P + EQW + HG++Y+ + E R+ +++++L
Sbjct: 1 MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSYEQKEETWRRM-VWEEHLRV 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +++LG N F D+ NEEFR L GY Q S F N
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH---FLEPNFL 115
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+VP +DWR++G VT +KDQGQCGSCWAFS A+EG G+L+ LSEQ LV+CS
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+Y+ +N G+ +E YPY + T + + AA + + D+P G
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEE- 303
E+AL++A++ PVSV +DA +F FY+SG+ A+C + + DHGV VVG+G + +
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+G KYW++KNSW E WG++GYI + +D CGIATAASYP+
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|431896621|gb|ELK06033.1| Cathepsin S [Pteropus alecto]
Length = 331
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 145/340 (42%), Positives = 210/340 (61%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + +++ C++ V + +P++ + W + + Y++++E+ R I+++NL++
Sbjct: 1 MKWLACVLLGCSAAVA--QLQRDPTLDRHWDLWKKTYSKHYREKIEEVARRLIWEKNLKF 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y LG N D+T+EE +L VPS Q R T+K
Sbjct: 59 VMLHNLEHSMGMHSYDLGMNHLGDMTSEEVISLMGSLT--VPS---QWQRNVTYKSNPNQ 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P S+DWR+KG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCST+
Sbjct: 114 KLPDSLDWRDKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTE 173
Query: 190 ---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GG M AF+YII+N G+ +EA YPY+ ++G C K AAT SKY +LP
Sbjct: 174 KYSNKGCNGGFMTSAFQYIIDNNGIDSEASYPYKAQDGKC-QYDSKFRAATCSKYTELPF 232
Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
G E+AL +AV+N+ PVSV +DAS +F Y+SGV + C +HGV VVG+G +
Sbjct: 233 GSEEALKEAVANKGPVSVAIDASHPSFFLYRSGVYYDQSCTLKVNHGVLVVGYGNLD--- 289
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
G YWL+KNSWG +G+ GYIR+ R++G CGIA+ SYP
Sbjct: 290 GKDYWLVKNSWGLNFGDKGYIRMARNSGNHCGIASYPSYP 329
>gi|392881548|gb|AFM89606.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 261 bits (668), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 212/342 (61%), Gaps = 15/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + +++ C + ++ S+ +P + EQW + HG++Y ++ E+ R +++++L
Sbjct: 1 MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +++LG N F D+ NEEFR L GY Q S F N
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH---FLEPNFQ 115
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+VP +DWR++G VT +KDQGQCGSCWAFS A+EG G+L+ LSEQ LV+CS
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+Y+ +N G+ +E YPY + T + + AA + + D+P G
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEE- 303
E+AL++A++ PVSV +DA +F FY+SG+ A+C + + DHGV VVG+G + +
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+G KYW++KNSW E WG++GYI + +D CGIATAASYP+
Sbjct: 296 DGKKYWIVKNSWSEKWGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|330803820|ref|XP_003289900.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
gi|325080011|gb|EGC33585.1| hypothetical protein DICPUDRAFT_80649 [Dictyostelium purpureum]
Length = 328
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 200/342 (58%), Gaps = 22/342 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
II V L++ C S + R + + WM +H ++Y ++ E R IF+ N
Sbjct: 3 IILALVFCFLIVNCIS---AARVFSQKQYQTAFQNWMVKHQKSYTND-EFGSRYTIFQDN 58
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNV 128
++++ K N++G+ T LG N +DLTN+E++ +Y G V +P+ +V
Sbjct: 59 MDFVTKWNQKGSDTI-LGLNSMADLTNQEYQRIYLGTKTTVK-------KPNLIIGVTDV 110
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+ P S+DWR GAVT +K+QGQCG C++FS +VEGI +IT +L+ LSEQQ++DCS
Sbjct: 111 SKAPASVDWRANGAVTAVKNQGQCGGCYSFSTTGSVEGIHEITSKQLVSLSEQQILDCSG 170
Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N+GC GGLM +FEYII GL TEA YPY G C K + ATI+ Y+++
Sbjct: 171 SEGNNGCDGGLMTNSFEYIIAVGGLDTEASYPYEGVVGKCKFNKAN-IGATITGYKNVKS 229
Query: 247 GDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEEN 304
G E L AV+ QPVSV +DAS +F Y SGV A DHGV VG+G+ ++
Sbjct: 230 GSESDLQTAVAAQPVSVAIDASQNSFQLYSSGVYYEPACSSTQLDHGVLAVGYGS---QS 286
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPVA 345
G YW++KNSWG WGE G+I + R+ CGIAT ASYP A
Sbjct: 287 GQDYWIVKNSWGADWGEKGFILMARNKHNNCGIATMASYPTA 328
>gi|291224870|ref|XP_002732425.1| PREDICTED: cathepsin L2-like [Saccoglossus kowalevskii]
Length = 326
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 154/343 (44%), Positives = 205/343 (59%), Gaps = 27/343 (7%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ MF+ + LV A+ S+ + E W +G+ Y + E+A+R I+ NL
Sbjct: 1 MKMFISLALVAMAAA----------TSVNTEWESWKRTYGKEYTQK-EEALRHMIWNVNL 49
Query: 71 EYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ I+ N++ G TY N+F DLTNEE+R L GY + +V S+PSTF +
Sbjct: 50 KMIQMHNEKYMSGKSTYTQNMNQFGDLTNEEYRELMCGYKKSNKTVI---SKPSTFLLPS 106
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
P SIDWR +G VT +KDQG CGSCWAFS+ ++EG T GKL+ LSEQQLVDCS
Sbjct: 107 NYRAPASIDWRTQGYVTDVKDQGACGSCWAFSSTGSLEGQTFKKTGKLVPLSEQQLVDCS 166
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
D N GC GG MD+AF Y I++KG +E YPY + TC K V AT + Y D+P
Sbjct: 167 GDYGNMGCGGGWMDQAFSY-IKDKGEESEDGYPYTGTDDTCVYDASK-VVATDTGYTDIP 224
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGTAEE 302
+ DE AL QAV+ P+SV +DA+ +F FY+SGV + +C N DH V VG+GT+EE
Sbjct: 225 EMDENALQQAVATVGPISVAIDATHSSFQFYESGVYDEPECSQTNLDHAVLAVGYGTSEE 284
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G YW++KNSW WG GYI + R+ CGIA+ ASYPV
Sbjct: 285 --GLDYWIVKNSWSTGWGMQGYIEMSRNKDNQCGIASKASYPV 325
>gi|110349473|gb|ABG73217.1| cathepsin L 1 precursor [Diaprepes abbreviatus]
Length = 322
Score = 261 bits (668), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 148/334 (44%), Positives = 200/334 (59%), Gaps = 22/334 (6%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN 77
+ + C VS + E + K + + +HG+TYK+++E+ R NIFK NL IE+ N
Sbjct: 3 VFIAACLLVAVSATVLEETGV--KFQAFKLKHGKTYKNQVEETARFNIFKDNLRAIEQHN 60
Query: 78 ---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTS 134
++G +YK G N F+D+T EEFRA T S S++ +T VP S
Sbjct: 61 VLYEQGLVSYKKGINRFTDMTQEEFRAFLT------LSSSKKPHFNTTEHVLTGLAVPDS 114
Query: 135 IDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGC 193
IDWR KG VT +KDQG CGSCWAFS + E GKL+ LSEQQLVDCSTD N GC
Sbjct: 115 IDWRTKGQVTGVKDQGNCGSCWAFSVTGSTEAAYYRKAGKLVSLSEQQLVDCSTDINAGC 174
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
+GG +D+ F Y +++KGL E+ YPY+ +G+C K V +S ++ L DE ALL
Sbjct: 175 NGGYLDETFTY-VKSKGLEAESTYPYKGTDGSCKYSASKVVTK-VSGHKSLKSEDENALL 232
Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVLNAD-CG-NNCDHGVAVVGFGTAEEENGAKYWL 310
AV N PVSV +DA+ Y+SG+ D C + +HGV VVG+GT+ NG KYW+
Sbjct: 233 DAVGNVGPVSVAIDAT--YLSSYESGIYEDDWCSPSELNHGVLVVGYGTS---NGKKYWI 287
Query: 311 IKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
+KNSWG ++GESGY R+LR CG+A YP+
Sbjct: 288 VKNSWGGSFGESGYFRLLRGKNECGVAEDTVYPI 321
>gi|225719058|gb|ACO15375.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 326
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 143/310 (46%), Positives = 191/310 (61%), Gaps = 18/310 (5%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
+W A HG+ Y E+++R IF++N I + N+E G TY LG N F DL + EF
Sbjct: 25 KWKATHGKVYNSADEESLRFKIFQENSLMITQHNEEYRQGFHTYILGMNHFGDLLHSEFL 84
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
G+ V S F + VP+ +W KGAVT +KDQG+CGSCWAFSA
Sbjct: 85 ERSNGFQGGV-------SGGDVFTFDTNAPVPSYANWTAKGAVTPVKDQGKCGSCWAFSA 137
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
+VEG + + KL+ LSEQQLVDCS D N GC GGLMD AF+Y I NKG+A E YP
Sbjct: 138 TGSVEGQIFLKKKKLMSLSEQQLVDCSGDEGNLGCGGGLMDNAFKYFIANKGIANEKSYP 197
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y ++ C +K +V ATIS ++D+ DE L AV+N PVSV +DAS F FY+S
Sbjct: 198 YTAKDNDCKYKKSMSV-ATISSFKDVKHKDEDQLKMAVANVGPVSVAIDASSSKFQFYES 256
Query: 278 GV-LNADCGNNC-DHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLC 334
GV + +C + DHGV VG+GT ++++G +WL+KNSW +WG +GYI++ R+ C
Sbjct: 257 GVYYDENCSSEVLDHGVLAVGYGT-DKKSGMDFWLVKNSWAASWGLNGYIKMARNKDNNC 315
Query: 335 GIATAASYPV 344
GIAT ASYP+
Sbjct: 316 GIATMASYPI 325
>gi|33242886|gb|AAQ01147.1| cathepsin [Haplochromis chilotes]
Length = 334
Score = 261 bits (667), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 142/321 (44%), Positives = 190/321 (59%), Gaps = 16/321 (4%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGT 88
+M E ++ E W HG++YK+++E A R ++ NL+ I N E G TY+LG
Sbjct: 21 AMFESTLDAHWELWKKTHGKSYKNDVENAHRRELWGNNLKMITVHNLEASMGLHTYELGM 80
Query: 89 NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
N DLT EE + P + R PS F + + +P ++DWREKG VT +K
Sbjct: 81 NHMGDLTEEEIMQFFASLTPPT-DIQRA---PSPFAGASGSGIPDTMDWREKGCVTKVKM 136
Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYII 206
QG CGSCWAFSA A+EG + GKL++LS Q LVDCS NHGC+GG M +AF+Y+I
Sbjct: 137 QGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVDCSGKYGNHGCNGGFMTRAFQYVI 196
Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCV 265
+N G+ ++A YPY + C + AA S Y+ LP+GDE AL Q ++ P+SV +
Sbjct: 197 DNHGIDSDASYPYIGRDDQC-HYNPATRAANCSSYQFLPEGDENALKQGLATVGPISVAI 255
Query: 266 DASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
DA F FY+SGV N C +HGV VG+GT NG YWL+KNSWG T+G+ GY
Sbjct: 256 DARRPRFSFYRSGVYNDPSCTQKVNHGVLAVGYGTL---NGQDYWLVKNSWGTTFGDQGY 312
Query: 325 IRILRDAG-LCGIATAASYPV 344
IR+ R+ G CGIA YPV
Sbjct: 313 IRMARNTGNQCGIALYPCYPV 333
>gi|302779822|ref|XP_002971686.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
gi|300160818|gb|EFJ27435.1| hypothetical protein SELMODRAFT_16221 [Selaginella moellendorffii]
Length = 214
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 121/215 (56%), Positives = 159/215 (73%), Gaps = 6/215 (2%)
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHG 192
S+DWR+KG VT IKDQG CG+CWAFSA+AAVEG+T ++ G L+ LSEQ+LVDC T N G
Sbjct: 1 SVDWRKKGGVTEIKDQGDCGNCWAFSAIAAVEGLTFLSTGTLVSLSEQELVDCDTTVNQG 60
Query: 193 CSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQAL 252
C GG+MD AF+Y+I N G+ ++++YPYR + G CD K K AATI+ ++ +P E+ L
Sbjct: 61 CDGGMMDYAFQYMIRNGGITSQSNYPYRAQRGACDKDKVKYHAATINGFQAIPPQSEELL 120
Query: 253 LQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIK 312
L+AV+NQPVSV ++A G+ F Y SGV +CG+N DHGVA+VG+GT + G +YWL+K
Sbjct: 121 LRAVANQPVSVAIEAGGQDFQLYSSGVFTGECGSNLDHGVAIVGYGT--DAGGRQYWLVK 178
Query: 313 NSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
NSWG WGESGY+R+ R AG+CGI ASYP
Sbjct: 179 NSWGSGWGESGYVRMERQGPGAGVCGINLDASYPT 213
>gi|115715524|ref|XP_780580.2| PREDICTED: cathepsin L-like [Strongylocentrotus purpuratus]
Length = 334
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 152/341 (44%), Positives = 206/341 (60%), Gaps = 20/341 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
++ ++LV C VVS SM E QW +HG+ Y + E+A R I+++NL+ +
Sbjct: 3 YLSVLLVAAC---VVSSLSMSFIDFDEDWNQWKNEHGKRYLSDEEEASRRLIWQKNLDIV 59
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF-KYQNVT 129
K N + G+ TY LG N+F+DL NEEF +L G+ S +++R STF NV
Sbjct: 60 IKHNLKYDLGHFTYDLGMNQFADLKNEEFVSLMNGFRGN----SSKATRGSTFLPPSNVF 115
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
D+PT +DWR KG VT +K+Q QCGSCWAFSA ++EG GKL+ LSEQ LVDCS
Sbjct: 116 DMPTMVDWRTKGYVTPVKNQLQCGSCWAFSATGSLEGQHFKKTGKLVSLSEQNLVDCSGK 175
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC GGLMD+AF+YI++ G+ TE YPY +G C K + AT + Y D+ G
Sbjct: 176 EGNMGCEGGLMDQAFQYILDVGGIDTEMSYPYTAMDGQCHFNKAN-IGATDTGYTDVTTG 234
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN--ADCGNNCDHGVAVVGFGTAEEEN 304
E AL AV++ P+SV +DAS ++F YKSGV N A DHGV VG+GT+ +
Sbjct: 235 SESALQMAVASVGPISVAIDASHQSFQLYKSGVYNEPACSSTLLDHGVLAVGYGTSSD-- 292
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G Y+ +SWG WG +GY+ + R+ CGIAT ASYP+
Sbjct: 293 GTDYFFFFHSWGAAWGMNGYLWMSRNKDNQCGIATKASYPL 333
>gi|75812934|ref|NP_001028787.1| cathepsin S precursor [Bos taurus]
gi|115503669|sp|P25326.2|CATS_BOVIN RecName: Full=Cathepsin S; Flags: Precursor
gi|74353837|gb|AAI02246.1| Cathepsin S [Bos taurus]
gi|296489535|tpg|DAA31648.1| TPA: cathepsin S precursor [Bos taurus]
Length = 331
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 207/340 (60%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++ ++ C+S + +P++ + W +G+ YK++ E+ R I+++NL+
Sbjct: 1 MNWLVWALLLCSSAMA--HVHRDPTLDHHWDLWKKTYGKQYKEKNEEVARRLIWEKNLKT 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y+LG N D+T+EE +L + P Q R T+K
Sbjct: 59 VTLHNLEHSMGMHSYELGMNHLGDMTSEEVISLMSSLRVP-----SQWPRNVTYKSDPNQ 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
+P S+DWREKG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 114 KLPDSMDWREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTA 173
Query: 189 --DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GG M +AF+YII+N G+ +EA YPY+ +G C K AAT S+Y +LP
Sbjct: 174 KYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDGKC-QYDVKNRAATCSRYIELPF 232
Query: 247 GDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEEN 304
G E+AL +AV+N+ PVSV +DAS +F YK+GV + C N +HGV VVG+G +
Sbjct: 233 GSEEALKEAVANKGPVSVGIDASHSSFFLYKTGVYYDPSCTQNVNHGVLVVGYGNLD--- 289
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
G YWL+KNSWG +G+ GYIR+ R++G CGIA SYP
Sbjct: 290 GKDYWLVKNSWGLHFGDQGYIRMARNSGNHCGIANYPSYP 329
>gi|61661067|gb|AAX51229.1| cathepsin S cysteine protease [Paralichthys olivaceus]
Length = 337
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 149/340 (43%), Positives = 198/340 (58%), Gaps = 21/340 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++LV C V +M + + E W HG+TY +E+E R ++++NL
Sbjct: 10 MLASLLLVSLC----VEAAAMLDVRLDVHWELWKKSHGKTYPNEVEDVRRRELWERNLML 65
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I K N E G +TY L N DLT EE Y P + R P+ F +
Sbjct: 66 ITKHNLEASMGLQTYDLSMNHMGDLTTEEIMQSYATLTPPA-DIQRA---PAPF-VGSGA 120
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
DVP S+DWR +G VT +K QG CGSCWAFSA A+EG T GKL++LS Q LVDCS
Sbjct: 121 DVPVSVDWRLQGCVTSVKMQGSCGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSLK 180
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GG MD+AF+Y+I+NKG+ +EA YPYR + C + AA S+Y LP+G
Sbjct: 181 YGNKGCNGGFMDRAFQYVIDNKGIDSEASYPYRGQLQQC-SYNPSYRAANCSRYSFLPEG 239
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENG 305
DE AL A++ P+SV +DA+ F FY+SGV N C +HGV VG+GT E+G
Sbjct: 240 DEGALKNALATIGPISVAIDATRPTFAFYRSGVYNDPTCTQRVNHGVLAVGYGT---ESG 296
Query: 306 AKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
YWL+KNSWG ++G+ GYIR+ R+ CGIA SYP+
Sbjct: 297 QDYWLVKNSWGTSFGDKGYIRMSRNKNDQCGIALYCSYPI 336
>gi|330805277|ref|XP_003290611.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
gi|325079250|gb|EGC32859.1| hypothetical protein DICPUDRAFT_81345 [Dictyostelium purpureum]
Length = 330
Score = 261 bits (666), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 139/306 (45%), Positives = 188/306 (61%), Gaps = 17/306 (5%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT 104
WM +H R+Y E + FK N+++I N N LG +F+DLTNEE+R +Y
Sbjct: 36 WMKKHDRSYHHH-EFNNKYQAFKDNMDFIHNWNTNKNSKTVLGLTQFADLTNEEYRKIYL 94
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
G V + F + T P SIDWR KGAV+H+KDQGQCGSCW+FS +V
Sbjct: 95 GTKVNV------APEKHNFNMIHFTG-PDSIDWRTKGAVSHVKDQGQCGSCWSFSTTGSV 147
Query: 165 EGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
EG QI G ++ LSEQ LVDCS N+GC GGLM AF++I+ G+ATE YPY
Sbjct: 148 EGAHQIKTGNMVTLSEQNLVDCSGKFGNNGCDGGLMVNAFKFIMSQGGVATEDSYPYNAV 207
Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN- 281
+G C K V A IS Y+++ +G E L A++ QPVS+ +DAS ++F YKSGV +
Sbjct: 208 QGKCKFTKS-MVGANISGYKEITQGSELELQAALTKQPVSIAIDASQQSFQLYKSGVYDE 266
Query: 282 ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATA 339
+C + DHGV VG+GT ENG Y+++KNSW ++WG+ GYI + R+A CG+AT
Sbjct: 267 PECSSYQLDHGVLAVGYGT---ENGKDYYIVKNSWADSWGQDGYIFMSRNAKNQCGVATM 323
Query: 340 ASYPVA 345
ASYP++
Sbjct: 324 ASYPIS 329
>gi|342675481|gb|AEL31666.1| cathepsin L [Cynoglossus semilaevis]
Length = 336
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 146/345 (42%), Positives = 208/345 (60%), Gaps = 20/345 (5%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++P+ ++ + V S V+S S+ + + + E W H + Y E E+ R I+++N
Sbjct: 1 MLPLALLALGV----SAVLSAPSL-DARLSDHWELWKNWHSKKYH-EKEEGWRRMIWEKN 54
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
L IE N E G +Y+LG N F D+T+EEFR + GY R + + + S F
Sbjct: 55 LNKIELHNLEHSMGKHSYRLGMNHFGDMTHEEFRQIMNGYQRK----TERKAIGSLFMEP 110
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
N P+++DWREKG VT +KDQGQCGSCWAFS A+ZG GKL+ LSEQ LVDC
Sbjct: 111 NFMVAPSAVDWREKGYVTPVKDQGQCGSCWAFSTTGALZGQNFRKMGKLVSLSEQNLVDC 170
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N GC GGLMD+AF+Y+ +N+GL +E YPY + + K + + + D+
Sbjct: 171 SRPEGNEGCGGGLMDQAFQYVKDNQGLDSEDSYPYLGTDDQPCHYDPKYNSVNDTGFVDI 230
Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TA 300
P G E AL++AV++ PVSV +DA +F FY+SG+ +C + DHGV VG+G
Sbjct: 231 PSGKEHALMKAVASVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEG 290
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
E+ +G KYW++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 291 EDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 335
>gi|149751225|ref|XP_001490531.1| PREDICTED: cathepsin S-like [Equus caballus]
Length = 332
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 144/318 (45%), Positives = 197/318 (61%), Gaps = 18/318 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ + W +G+ YK++ E+ R I+++NL+++ N E G +Y LG N
Sbjct: 22 DPTLDNHWDLWKKTYGKQYKEKNEEVARRLIWERNLKFVMLHNLEHSMGMHSYDLGMNHL 81
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+T+EE +L + VPS Q R T+K +P S+DWREKG VT +K QG
Sbjct: 82 GDMTSEEVTSLMSSLR--VPS---QWQRNVTYKSNPNEKLPDSLDWREKGCVTEVKYQGS 136
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGLMDKAFEYIIEN 208
CG+CWAFSAV A+E ++ G L+ LS Q LVDCST+ N GC+GG M AF+YII+N
Sbjct: 137 CGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCSTEKYSNKGCNGGFMTAAFQYIIDN 196
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDA 267
G+ ++A YPY+ +G C K AAT SKY +LP G E L +AV+N+ PVSV +DA
Sbjct: 197 NGIDSDASYPYKAMDGKC-RYDSKNRAATCSKYTELPFGSEDDLKEAVANKGPVSVAIDA 255
Query: 268 SGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
S +F YKSGV + C N +HGV VVG+G NG YWL+KNSWG +G+ GYIR
Sbjct: 256 SHPSFFLYKSGVYYDPSCTQNVNHGVLVVGYGNL---NGKDYWLVKNSWGINFGDKGYIR 312
Query: 327 ILRDAG-LCGIATAASYP 343
+ R++G CGIA SYP
Sbjct: 313 MARNSGNHCGIANYCSYP 330
>gi|81542|pir||S02728 actinidain (EC 3.4.22.14) precursor (clone pAC.1) - kiwi fruit
(fragment)
gi|15957|emb|CAA31435.1| actinidin precursor [Actinidia chinensis]
gi|166319|gb|AAA32630.1| actinidin precursor [Actinidia deliciosa]
gi|226542|prf||1601514A actinidin
Length = 302
Score = 260 bits (665), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 133/282 (47%), Positives = 181/282 (64%), Gaps = 16/282 (5%)
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
L +I++ N + NR+YK+G N+F+DLT EEFR+ Y G+ S ++ + ++ +
Sbjct: 1 LRFIDEHNADTNRSYKVGLNQFADLTGEEFRSTYLGFT----GGSNKTKVSNRYEPRVSQ 56
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--S 187
+P+ +DWR GAV IK QG+CG CWAFSA+A VEGI +I G LI LSEQ+L+ C +
Sbjct: 57 VLPSYVDWRSAGAVVDIKSQGECGGCWAFSAIATVEGINKIVTGVLISLSEQELIGCGGT 116
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAATISKYEDLP 245
+ GC+GG + F++II N G+ T +YPY ++G C D Q EK V TI Y ++P
Sbjct: 117 QNTRGCNGGYITDGFQFIINNGGINTGENYPYTAQDGECNLDLQNEKYV--TIDTYGNVP 174
Query: 246 KGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENG 305
+E AL AV+ QPVSV +DA+G AF Y SG+ CG DH V +VG+GT E G
Sbjct: 175 YNNEWALQTAVTYQPVSVALDAAGDAFKHYSSGIFTGPCGTAIDHAVTIVGYGT---EGG 231
Query: 306 AKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYPV 344
YW+++NSW TWGE GY+RILR+ AG CGIAT SYPV
Sbjct: 232 IDYWIVENSWDTTWGEEGYMRILRNVGGAGTCGIATMPSYPV 273
>gi|225718114|gb|ACO14903.1| Cathepsin L precursor [Caligus clemensi]
Length = 336
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 202/340 (59%), Gaps = 19/340 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
++ +LVI + VS + ++ E W HG+TY +E+ +RL I+ +N I
Sbjct: 6 LLLSVLVIASTANAVSFFDV----VLSDWESWKLMHGKTYSSSIEEKLRLKIYMENSLKI 61
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ N E G Y + N + DL + EF A+ GY ++ +S T+
Sbjct: 62 SRHNSEALNGIHPYYMKMNHYGDLLHHEFVAMVNGYQY----ANKTASLGGTYIPNKNIQ 117
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+PT +DWRE+GAVT +K+QGQCGSCW+FSA A+EG GKLI LSEQ LVDCS
Sbjct: 118 LPTHVDWREEGAVTPVKNQGQCGSCWSFSATGALEGQDFRKTGKLISLSEQNLVDCSRKF 177
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC GGLMD AF YI +NKG+ TEA YPY +G C + + I + D+ KG
Sbjct: 178 GNNGCEGGLMDFAFTYIRDNKGIDTEASYPYEGIDGHCHYNPKNKGGSDIG-FVDIKKGS 236
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENG 305
E+ L +AV+ P+SV +DAS +F FY GV + + C + DHGV VVGFGT + +G
Sbjct: 237 EKDLKKAVAGVGPISVAIDASHMSFQFYSHGVYVESKCSSEELDHGVLVVGFGT-DSVSG 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YWL+KNSW E WG+ GYI++ R+ +CGIA++ASYPV
Sbjct: 296 EDYWLVKNSWSEKWGDQGYIKMARNKENMCGIASSASYPV 335
>gi|354473025|ref|XP_003498737.1| PREDICTED: cathepsin S-like [Cricetulus griseus]
Length = 341
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 147/343 (42%), Positives = 208/343 (60%), Gaps = 22/343 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
I ++ + ++ C + + +P++ + W HG+ YK++ E+ R I+++NL
Sbjct: 9 ITRWLFWVPMVCC---LAGDQLQRDPTLDHHWDLWKKFHGKQYKEKNEEEARRLIWEKNL 65
Query: 71 EYIEKANKEGN---RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
+ + N E + +Y LG N D+T+EE G RP+ V Q R ST+K
Sbjct: 66 KLVMLHNLEYSLEMHSYSLGMNHMGDMTSEEV----LGQMRPL-RVPSQRHRNSTYKSNP 120
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+P S+DWREKG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCS
Sbjct: 121 NQKLPDSMDWREKGCVTEVKYQGSCGSCWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCS 180
Query: 188 TD----NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
T+ N GC GG M +AF+YII+N G+ ++A YPY+ C + K+ AAT S+Y +
Sbjct: 181 TEEKYGNKGCDGGFMTRAFQYIIDNGGIDSDASYPYKAVAEKC-HYDSKSRAATCSRYME 239
Query: 244 LPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAE 301
LP GDE+AL +AV+N+ PVSV +DAS +F YKSGV + C N +HGV VVG+G +
Sbjct: 240 LPSGDEEALKEAVANKGPVSVGIDASHPSFFLYKSGVYDEPSCTENVNHGVLVVGYGNLD 299
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILR-DAGLCGIATAASYP 343
G YWL+KNSWG +G+ GYIR+ R + CGIA+ SYP
Sbjct: 300 ---GKDYWLVKNSWGLHFGDQGYIRMARNNKNQCGIASYGSYP 339
>gi|328776427|ref|XP_625135.3| PREDICTED: cathepsin L-like [Apis mellifera]
Length = 351
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 142/318 (44%), Positives = 196/318 (61%), Gaps = 19/318 (5%)
Query: 42 HEQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLT 95
+++WM +H + YK ++E+ R+ IF N I K N +YKL N++ D+
Sbjct: 31 NQEWMTFKMEHKKVYKSDVEERFRMKIFMDNKHKIAKHNSNYEMKKVSYKLKMNKYGDML 90
Query: 96 NEEFRALYTGYNRPVPSVSRQSSRP---STFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
+ EF + G+N+ + + R P S + NV +P +DWR++GAVT +KDQG C
Sbjct: 91 HHEFVNILNGFNKSINTQLRSERLPVGASFIEPANVV-LPKKVDWRKEGAVTPVKDQGHC 149
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
GSCW+FSA A+EG G L+ LSEQ L+DCS N+GC+GGLMD+AF+YI +NKG
Sbjct: 150 GSCWSFSATGALEGQHFRRTGVLVSLSEQNLIDCSGKYGNNGCNGGLMDQAFQYIKDNKG 209
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASG 269
L TEA YPY E C + A + Y D+P GDE+ L AV+ PVSV +DAS
Sbjct: 210 LDTEASYPYEAENDKCRYNPANSGAIDVG-YIDIPTGDEKLLKAAVATIGPVSVAIDASH 268
Query: 270 RAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
++F FY GV +C + DHGV V+G+GT ENG YWL+KNSWGETWG +GYI++
Sbjct: 269 QSFQFYSEGVYYEPECSSEELDHGVLVIGYGT--NENGQDYWLVKNSWGETWGNNGYIKM 326
Query: 328 LRDA-GLCGIATAASYPV 344
R+ CGIA++ASYP+
Sbjct: 327 ARNKLNHCGIASSASYPL 344
>gi|410898132|ref|XP_003962552.1| PREDICTED: cathepsin L-like [Takifugu rubripes]
Length = 335
Score = 260 bits (665), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 146/314 (46%), Positives = 198/314 (63%), Gaps = 23/314 (7%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
W + GR+Y+ E+ R+ I+ N + + N +G ++Y+LG +F+D+ NEE+++
Sbjct: 30 WKLKFGRSYRTPSEEVQRMQIWLNNRKLVLVHNILADQGIKSYRLGMTQFADMDNEEYKS 89
Query: 102 LYT-----GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
L + +N P R+ S + F+ T +PT++DWR+KG VT +KDQ QCGSCW
Sbjct: 90 LISLGCLRAFNTSAP---RRGS--AFFRLAEGTHLPTTVDWRDKGYVTGVKDQKQCGSCW 144
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATE 214
AFSA ++EG GKL+ LSEQQLVDCS D N GC+GGLMD AF+YI EN G+ TE
Sbjct: 145 AFSATGSLEGQNFRKTGKLVSLSEQQLVDCSGDYGNMGCNGGLMDYAFKYIQENGGIDTE 204
Query: 215 ADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFH 273
YPY E+G C + E V A + Y D+ GDE AL +AV+ PVSV +DAS +F
Sbjct: 205 KSYPYEAEDGQCRFKPEN-VGAKCTGYVDVTVGDEDALKEAVATIGPVSVGIDASHSSFQ 263
Query: 274 FYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA 331
Y SGV + DC + + DHGV VG+GT +NG YWL+KNSWG WG+ GYI + R+
Sbjct: 264 LYDSGVYDEQDCSSQDLDHGVLAVGYGT---DNGQDYWLVKNSWGLGWGQEGYIMMSRNK 320
Query: 332 -GLCGIATAASYPV 344
CGIATAASYP+
Sbjct: 321 DNQCGIATAASYPL 334
>gi|334332720|ref|XP_001367595.2| PREDICTED: cathepsin L1-like [Monodelphis domestica]
Length = 333
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 145/340 (42%), Positives = 210/340 (61%), Gaps = 18/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + L C +V+ + ++ + QW AQH RTY E R +++NL+
Sbjct: 1 MNFYLCLASLCLG-LVAATPEFDQTLDSQWHQWKAQHRRTYAAN-EDGWRRATWEKNLKM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +++LG N+F D+T EEF+ + GYN + S++ ++ S ++ +
Sbjct: 59 IEMHNLEYSAGKHSFQLGMNKFGDMTTEEFKQVMNGYNS---NGSQKRTKGSLYREPLLA 115
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P S+DWREKG VT +K+QGQCGSCWAFSA ++EG KL+ LSEQ LVDCST
Sbjct: 116 QLPKSVDWREKGYVTPVKNQGQCGSCWAFSATGSLEGQWFHKTKKLVSLSEQNLVDCSTS 175
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N+GCSGGLMD AFEY+ N G+ TE YPY ++ C + E A ++ + D+P
Sbjct: 176 EGNNGCSGGLMDNAFEYVKNNGGIDTEQAYPYLGQDNECKYRAE-CSGANVTGFVDIPSM 234
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEEEN 304
+E+AL++AV+N P+SV +DA +F FY+SGV C ++ DHGV VVG+G+ ++
Sbjct: 235 NERALMKAVANVGPISVAIDAGNPSFQFYESGVYYEPQCSSSQLDHGVLVVGYGSIGKD- 293
Query: 305 GAKYWLIKNSWGETWGESGYIRILR-DAGLCGIATAASYP 343
+YW++KNSWGE WG+ GY+ + + CGIATAASYP
Sbjct: 294 --EYWIVKNSWGEEWGKKGYVLMAKFRNNHCGIATAASYP 331
>gi|186701255|gb|ACC91281.1| putative cysteine proteinase [Capsella rubella]
Length = 324
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 142/349 (40%), Positives = 210/349 (60%), Gaps = 52/349 (14%)
Query: 10 IIPMFVIIILVITCAS----QVVSG--RSMHEPSIVEKHEQWMAQHGRTYKDEL-EKAMR 62
+I + ++II ++ +S V SG RS E + + WM++HG+TY + L +K R
Sbjct: 9 MITLSLLIIFLLPPSSAMDLSVTSGGLRSNEEVGFI--FQTWMSKHGKTYTNALGDKEQR 66
Query: 63 LNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST 122
FK NL +I++ N + N +Y+LG +F+DLT +E++ L++G RP+ +Q + T
Sbjct: 67 FQNFKDNLRFIDQHNAK-NLSYRLGLTQFADLTVQEYQDLFSG--RPI---QKQKALRVT 120
Query: 123 FKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+Y + + +P S+DWR+KGAV+ IKDQG+C VE I +I G+LI LSE
Sbjct: 121 HRYVPLAEDQLPQSVDWRQKGAVSEIKDQGRC----------TVESINKIVTGELISLSE 170
Query: 181 QQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCD-NQKEKAVAATIS 239
Q+LVDCS DNHGC+GGLMD AF+++I N GL ++DYPY+ +G C+ NQ I
Sbjct: 171 QELVDCSIDNHGCNGGLMDSAFQFLINNNGLEYQSDYPYQAVQGYCNHNQNTSKKVIKID 230
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
YED+P +E +L +AV++QP G+ CG + DH V +VG+GT
Sbjct: 231 GYEDVPANNENSLQKAVAHQP-----------------GIYTGPCGTDLDHAVVIVGYGT 273
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
ENG YW+++NSWG WGE+GY +I R+ G+CGIA ASYP+
Sbjct: 274 ---ENGQDYWIVRNSWGTVWGEAGYAKIARNFENPTGVCGIAMVASYPI 319
>gi|75067394|sp|Q9GKL8.1|CATL1_CERAE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
chain; Contains: RecName: Full=Cathepsin L1 light chain;
Flags: Precursor
gi|11493685|gb|AAG35605.1|AF201700_1 cysteine protease [Chlorocebus aethiops]
Length = 333
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 207/343 (60%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F++ L + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAALCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR + G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY E +C E +V A + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ +KYWL+KNSWGE WG GYI++ +D CGIA+AASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|157093563|gb|ABV22436.1| cysteine proteinase [Oxyrrhis marina]
Length = 329
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 134/316 (42%), Positives = 191/316 (60%), Gaps = 20/316 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
I + E++ A+ G +Y E E+A R +F QN++ I + N +G+ TY LG N+F+DLT E
Sbjct: 15 IDAQWEEFKAKFGESYNGEEEEAERKGVFAQNVQLINEENSKGH-TYTLGVNQFADLTVE 73
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD---VPTSIDWREKGAVTHIKDQGQCGS 154
EF Y G+ +P Q + + ++V + +PTS+DW +GAVT +K+QGQCGS
Sbjct: 74 EFSKTYMGFKKPA-----QKYGDAAYLGRHVYNGEALPTSVDWSSQGAVTPVKNQGQCGS 128
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLA 212
CW+FS ++EG +I+ GKL+ LSEQQ VDC+ N GC+GGLMD AF+Y E L
Sbjct: 129 CWSFSTTGSLEGANEISTGKLVSLSEQQFVDCAGTYGNQGCNGGLMDSAFKY-AEANALC 187
Query: 213 TEADYPYRHEEGTCDNQ--KEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGR 270
TE YPY+ +G+C ++S Y+D+ EQ ++ AV+ QPVS+ ++A
Sbjct: 188 TEQSYPYKGTDGSCQASSCSTGLAKGSVSGYKDVSSDSEQDMMSAVAQQPVSIAIEADKS 247
Query: 271 AFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR- 329
F Y GVL CG + DHGV VG+GT +G YW +KNSWG TWG SGY+ + R
Sbjct: 248 VFQLYSGGVLTGACGASLDHGVLAVGYGTL---SGTDYWKVKNSWGSTWGMSGYVLLQRG 304
Query: 330 --DAGLCGIATAASYP 343
+G CG+ + SYP
Sbjct: 305 KGGSGECGLLSEPSYP 320
>gi|157278115|ref|NP_001098156.1| cathepsin L precursor [Oryzias latipes]
gi|50251128|dbj|BAD27581.1| cathepsin L [Oryzias latipes]
Length = 336
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 139/320 (43%), Positives = 199/320 (62%), Gaps = 15/320 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P + + + W H + Y ++ E RL ++++NL IE N E G +Y+LG N F
Sbjct: 21 DPQLDQHWQLWKGWHSKNYHEKEEGWRRL-VWEKNLRKIELHNLEHSMGKHSYRLGMNHF 79
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+T+EEFR + GY R ++ S F N + P ++DWR+KG VT +KDQGQ
Sbjct: 80 GDMTHEEFRQIMNGYKR----REQRKYSGSLFMEPNFLEAPRAVDWRDKGYVTPVKDQGQ 135
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFS A+EG GKL+ LSEQ LVDCS N GC+GGLMD+AF+Y+ +N+
Sbjct: 136 CGSCWAFSTTGALEGQQFRKTGKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYVKDNQ 195
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
GL +E YPY+ + + A + + D+P G E+AL++AV++ PVSV +DA
Sbjct: 196 GLDSEDFYPYKGTDDQPCQYNAQYSAVNDTGFVDIPSGKERALMKAVASVGPVSVAIDAG 255
Query: 269 GRAFHFYKSGV-LNADCGNN-CDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYI 325
+F FY+SG+ +C ++ DHGV VVG+G E+ +G KYW++KNSW E WG+ G+I
Sbjct: 256 HESFQFYQSGIYFEKECSSDELDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGFI 315
Query: 326 RILRDA-GLCGIATAASYPV 344
+ +D CGIATAASYP+
Sbjct: 316 YMAKDRHNHCGIATAASYPL 335
>gi|380790141|gb|AFE66946.1| cathepsin L1 preproprotein [Macaca mulatta]
gi|384939708|gb|AFI33459.1| cathepsin L1 preproprotein [Macaca mulatta]
Length = 333
Score = 260 bits (664), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 145/343 (42%), Positives = 206/343 (60%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F++ + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR L G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQLMNGFQ------NRKPRKGKVFQEPLF 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY E +C E +V A + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ +KYWL+KNSWGE WG GYI++ +D CGIA+AASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|332375975|gb|AEE63128.1| unknown [Dendroctonus ponderosae]
Length = 338
Score = 260 bits (664), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 146/340 (42%), Positives = 201/340 (59%), Gaps = 16/340 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+++++ I A Q VS + + E+ + QH + Y+ E E+ R+ IF N +
Sbjct: 4 LVLLVTIAVACQAVSFSEL----VQEQWNSFKVQHKKQYESETEERFRMKIFMDNSHKVA 59
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS-TFKYQNVTD 130
K NK +G YKL N++ DL + EF L G+NR + R + S TF D
Sbjct: 60 KHNKLFEQGLYPYKLAMNKYGDLLHHEFVGLLNGFNRTKTYLKRGELQDSITFIEPAHVD 119
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
+P ++DWR++GAVT +KDQG CGSCW+FSA A+EG KL+ LSEQ LVDCS+
Sbjct: 120 IPDTVDWRQEGAVTPVKDQGHCGSCWSFSATGALEGQHFRQTKKLVSLSEQNLVDCSSRF 179
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC+GGLMD AF YI N G+ TEA YPY E+ K AT + D+P GD
Sbjct: 180 GNNGCNGGLMDNAFRYIKNNGGIDTEAAYPYMGEDEKF-RYSAKNRGATDKGFVDIPSGD 238
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNN-CDHGVAVVGFGTAEEENG 305
E L AV+ P+S+ +DAS +F Y +GV + C + DHGV VVG+GT +E+ G
Sbjct: 239 EDKLKAAVATVGPISIAIDASHESFQLYSNGVYSDPTCSSTELDHGVLVVGYGT-DEKTG 297
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YWL+KNSWG+TWG GYI++ R+ CG+AT ASYP+
Sbjct: 298 MDYWLVKNSWGDTWGLDGYIKMARNQDNQCGVATQASYPL 337
>gi|325185016|emb|CCA19507.1| cysteine protease family C01A putative [Albugo laibachii Nc14]
Length = 492
Score = 259 bits (663), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 143/339 (42%), Positives = 189/339 (55%), Gaps = 40/339 (11%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
+I L + A G++ E W+ H T+ D E A RL + N YI
Sbjct: 8 TLIALSLLFAQNRADGKTFKEYE--SDFVSWLKTHHLTFSDAFEYAKRLETYIANDIYIL 65
Query: 75 KANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR---QSSRPSTFKYQNVTDV 131
N + ++KLG N FS LTNEEFR + G+ +++ QS+ S+ +Q + D+
Sbjct: 66 THNLQ-ESSFKLGHNAFSHLTNEEFRQRFNGFKASDDYLTKRLAQSNVASSTNFQYI-DL 123
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-N 190
P S+DW EKGAVT +K+QG CGSCWAFS A+EG T I+ GKL+ LSEQ+LVDC + +
Sbjct: 124 PESVDWVEKGAVTGVKNQGMCGSCWAFSTTGAIEGATFISSGKLVSLSEQELVDCDHNGD 183
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
HGC+GGLMD AF +I E+ G+ +E DY Y H + C + K
Sbjct: 184 HGCNGGLMDHAFSWISEHDGICSEEDYAYIHSQSLCRSCKPVV----------------- 226
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
PV+V +DA R+F FY+SGV N CG DHGV VG+G E+G KYW
Sbjct: 227 --------SPVAVAIDAGDRSFQFYQSGVYNKTCGTQLDHGVLTVGYGV---EDGQKYWK 275
Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPVA 345
+KNSWG +WGE GYIR+ RD +G CGIA SYP A
Sbjct: 276 VKNSWGNSWGEKGYIRLSRDQNGRSGQCGIAMVPSYPTA 314
>gi|326531188|dbj|BAK04945.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 360
Score = 259 bits (663), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 141/350 (40%), Positives = 200/350 (57%), Gaps = 20/350 (5%)
Query: 12 PMFVIIILVITCASQVVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
P+ +++ A+ SGR + + ++++ W A H ++Y+ E+ R +++ N
Sbjct: 10 PVITASTILLAWAAAAASGRGVDVGDMLMMDRFLMWQATHNQSYRSAEERLRRFQVYRDN 69
Query: 70 LEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKY---- 125
+EYIE N+ G+ TY+LG N+F+DLT EEF A +T YN S +T
Sbjct: 70 VEYIETTNRRGDLTYQLGENQFADLTREEFIARFTSYNGDDDRTGDDDSVITTAAVGGGD 129
Query: 126 --------QNVTDVPTSIDWREKGAVTHIKDQGQCGSC-WAFSAVAAVEGITQITRGKLI 176
+V+ P S+DWR KGAV K Q S WAF AVA +E + I GKL+
Sbjct: 130 PDLWSSGGDDVSLDPPSVDWRAKGAVVPPKSQSSSCSSSWAFVAVATIESLHAIKTGKLV 189
Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
LSEQQLVDC + GC+ G +AF ++I+N GL TEA+YPY +GTC++ K A
Sbjct: 190 ALSEQQLVDCDQYDGGCNRGTFRRAFHWVIQNGGLTTEAEYPYTAAQGTCNSAKSDHHVA 249
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
IS + +P +E A+ AV+ QPV+ ++ G FYKSGV + CG +H V VVG
Sbjct: 250 AISGHASVPGSNELAMKHAVATQPVAAAIEL-GSDMQFYKSGVYSGPCGARLEHAVTVVG 308
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRD---AGLCGIATAASYP 343
+G A+E G KYW++KNSWG+TWGE GYIR+ R GLCGI +YP
Sbjct: 309 YG-ADESTGDKYWIVKNSWGQTWGERGYIRMQRKILGPGLCGIMLDVAYP 357
>gi|242093944|ref|XP_002437462.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
gi|241915685|gb|EER88829.1| hypothetical protein SORBIDRAFT_10g027570 [Sorghum bicolor]
Length = 366
Score = 259 bits (663), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 141/331 (42%), Positives = 184/331 (55%), Gaps = 28/331 (8%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E S+ +E+W A H +D EK R ++FK+N I + N +GN TY LG N FSD+
Sbjct: 41 EESLWALYERWCA-HYNMARDHGEKTRRFDLFKENARRIYEHNHQGNATYTLGLNRFSDM 99
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD----------------VPTSIDWR 138
T+EEF G P +S + D P ++DWR
Sbjct: 100 TDEEFNRSPYGGCLTAPRMSDDEIEELHHHHHQQEDDGSFNLTHGSGGGKLGAPPAVDWR 159
Query: 139 EKGAVTHIKDQGQ-CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGL 197
+ AVT +KDQG CGSCWAFSA+AAVEGI I L+ LSEQQLVDC NHGC+GGL
Sbjct: 160 GR-AVTRVKDQGPTCGSCWAFSAIAAVEGINAIRTRNLVPLSEQQLVDCDKLNHGCNGGL 218
Query: 198 MDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVS 257
M AF +++ N+G+ E YPY EG C + V TI Y+ +P+ D AL+ AV+
Sbjct: 219 MTTAFSFVVRNRGVVPEGAYPYMGREGRCKHVMAPPV--TIYGYQRVPRFDANALMNAVA 276
Query: 258 NQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGE 317
QPVSV ++AS F Y+ GV N +CG H VG+G + G +W++KNSWG
Sbjct: 277 AQPVSVAIEASSFEFRHYQGGVFNGNCGGRLGHAATAVGYGA---DAGGPFWIVKNSWGP 333
Query: 318 TWGESGYIRILRDA----GLCGIATAASYPV 344
WGE GY+RI R+ G+CGI T SYPV
Sbjct: 334 GWGEGGYVRISRNTPVRQGVCGILTENSYPV 364
>gi|449683741|ref|XP_002155462.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 324
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 147/340 (43%), Positives = 204/340 (60%), Gaps = 23/340 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ +F ++L+ + ++ P+ + +W H + Y + E+ +R I+K N
Sbjct: 1 MKVFCALLLLGVTLAYII-----ERPTEDDSWIRWKMAHNKAYSHDGEETVRYTIWKDNE 55
Query: 71 EYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
I + N +G + L N+F D+TN EF+ + GY +S + STF N
Sbjct: 56 RRIREHNLQGG-DFLLEMNQFGDMTNNEFKD-FNGY------LSHKHVSGSTFLTPNSFV 107
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST-- 188
P S+DWR +G VT +KDQGQCGSCWAFS ++EG GKL+ LSEQ LVDCST
Sbjct: 108 APDSVDWRNEGYVTPVKDQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCSTAY 167
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC+GGLMD AF YI EN G+ +EA YPY ++G C K VAAT + + D+P GD
Sbjct: 168 GNNGCNGGLMDNAFTYIKENNGIDSEASYPYTAKDGKCAFTKPN-VAATDTGFVDIPSGD 226
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNA-DCGNN-CDHGVAVVGFGTAEEENG 305
E L +AV++ P+SV +DAS +F FY+ GV N C + DHGV VVG+GT E+G
Sbjct: 227 ENKLKEAVASVGPISVAIDASHFSFQFYRKGVYNERKCSSTELDHGVLVVGYGT---ESG 283
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YWL+KNSW +WG+ GYI++ R+A CGIAT ASYP+
Sbjct: 284 KDYWLVKNSWNTSWGDKGYIKMSRNAKNQCGIATNASYPL 323
>gi|18858809|ref|NP_571273.1| cathepsin L, 1 b precursor [Danio rerio]
gi|1752664|emb|CAA69623.1| cathepsin L [Danio rerio]
Length = 336
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 204/341 (59%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ +LV S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLYISAVFAAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR GY Q+S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHD----PNQTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD+AF+Y+ ENKGL +E YPY + + A I+ + D+P G+
Sbjct: 175 GNQGCNGGLMDQAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGN 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEEN 304
E AL+ AV+ PVSV +DAS ++ FY+SG+ A + DH V VVG+G +
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G +YW++KNSW + WG+ GYI + +D CG+AT ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|354549232|gb|AER27707.1| putative cysteine protease [Phytophthora sp. SH-2011]
Length = 533
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 138/307 (44%), Positives = 190/307 (61%), Gaps = 13/307 (4%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT-YKLGTNEFSDLTNEEFRALY 103
WM HG T+ D LE A RL + N YI + N E T LG N FS ++ +EF+
Sbjct: 31 WMGAHGVTFSDALEFARRLENYIVNDMYIMEHNAENAWTGVTLGHNAFSHMSFDEFKFKM 90
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
TG P + ++ + + +V +VP+++DW +KG VT +K+QG CGSCWAFS A
Sbjct: 91 TGLVLPEGYLEQRLASRVDGLWSDV-EVPSAVDWVDKGGVTPVKNQGMCGSCWAFSTTGA 149
Query: 164 VEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
VEG T ++ GKL LSEQ+LVDC + + GC+GGLMD AF++I ++ G+ +E DY Y+ +
Sbjct: 150 VEGATFVSSGKLPSLSEQELVDCDHNGDMGCNGGLMDHAFQWIEDHGGICSEDDYEYKAK 209
Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNA 282
C +E ++ ++D+ DE AL AV+ QPVSV ++A +AF FYKSGV N
Sbjct: 210 AQVC---RECDSVVKVTGFQDVNPQDEHALKVAVAQQPVSVAIEADQKAFQFYKSGVFNL 266
Query: 283 DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD----AGLCGIAT 338
CG DHGV VG+G +NG K+W +KNSWG +WGE GYIR+ R+ AG CGIA+
Sbjct: 267 TCGTRLDHGVLAVGYGN---DNGHKFWKVKNSWGASWGEQGYIRLAREENGPAGQCGIAS 323
Query: 339 AASYPVA 345
SYP A
Sbjct: 324 VPSYPFA 330
>gi|291398027|ref|XP_002715626.1| PREDICTED: cathepsin S [Oryctolagus cuniculus]
Length = 331
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 144/341 (42%), Positives = 205/341 (60%), Gaps = 22/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ ++ C+S V +H ++ H W +G+ YK++ E+A R I+++NL+
Sbjct: 1 MKWLVWALLVCSSTVAQ---LHRDPTLDHHWHLWKKAYGKQYKEKNEEAARRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y +G N +D+T+EE +L + P Q R T+K
Sbjct: 58 FVTLHNLEHSMGMHSYDVGMNHLADMTSEEVVSLMSSLRIP-----HQWPRNVTYKLNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWRE+G VT +K QG CG+CWAFSAV A+E ++ G L+ LS Q LVDCST
Sbjct: 113 QKLPDSVDWRERGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGNLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N GC+GG M +AF+YII+N G+ +EA YPY+ + C + K AAT SKY +LP
Sbjct: 173 TKYGNKGCNGGFMTEAFQYIIDNNGIDSEASYPYKAMDQKC-HYDSKHRAATCSKYTELP 231
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E+AL +AV+N+ PVSV +DAS +F Y+SGV C N +HGV VG+G +
Sbjct: 232 FGSEEALKEAVANKGPVSVAIDASHSSFFLYRSGVYYEPSCTQNVNHGVLAVGYGNLK-- 289
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYP 343
G YWL+KNSWG +GE GYIR+ R++ CGIA SYP
Sbjct: 290 -GKDYWLVKNSWGIHFGEQGYIRMARNSKNHCGIANYPSYP 329
>gi|422001787|dbj|BAM66994.1| germination-specific cysteine protease 1, partial [Raphanus
sativus]
Length = 235
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 122/220 (55%), Positives = 154/220 (70%), Gaps = 9/220 (4%)
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P ++DWR+KGAV IK+QG CGSCWAFS A VEGI +I G+LI LSEQ+LVDC
Sbjct: 4 LPETVDWRQKGAVNAIKNQGTCGSCWAFSTAAVVEGINKIVTGELISLSEQELVDCDKSY 63
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N GC+GGLMD AF++I++N GL TE DYPYR +G C++ + + TI YED+P DE
Sbjct: 64 NQGCNGGLMDYAFQFIMKNGGLNTEQDYPYRGSDGKCNSLLKNSKVVTIDGYEDVPTNDE 123
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
AL +AVS QPVSV +DA GR F Y+SG+ +CG DH V VG+G+ ENG YW
Sbjct: 124 TALKRAVSYQPVSVAIDAGGRVFQHYQSGIFTGECGTKMDHAVVAVGYGS---ENGVDYW 180
Query: 310 LIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
+++NSWG+ WGE GYIRI R+ +G CGIA ASYPV
Sbjct: 181 IVRNSWGQKWGEDGYIRIERNLASSKSGKCGIAIEASYPV 220
>gi|402770499|gb|AFQ98384.1| cathepsin L, partial [Hyalomma anatolicum anatolicum]
Length = 312
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 146/316 (46%), Positives = 196/316 (62%), Gaps = 25/316 (7%)
Query: 44 QWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNE 97
QW A H ++Y+ ++E+ +R IF +N I K N + G +YKLG N+F DL
Sbjct: 6 QWEAFKTTHKKSYQSKMEELLRYKIFTENSLLIAKHNAKYAKGLVSYKLGMNQFGDLLPH 65
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTF-KYQNVTD--VPTSIDWREKGAVTHIKDQGQCGS 154
EF ++ GY+ + R STF NV D +P ++DWR+KGAVT +KDQGQCGS
Sbjct: 66 EFAKMFNGYH------GERKGRGSTFLPPANVNDSSLPKTVDWRKKGAVTPVKDQGQCGS 119
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA 212
CWAFSA ++EG + GKL+ LSEQ L+DCS N GC GGLMD AF+YI N G+
Sbjct: 120 CWAFSATGSLEGQHFLKSGKLVSLSEQNLIDCSGSFGNEGCGGGLMDNAFKYIKANDGID 179
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRA 271
TE YPY +G C +KE V AT + + D+ +G E L +AV+ P+SV +DAS +
Sbjct: 180 TEESYPYEAMDGDCRFKKED-VGATDTGFVDIQQGSEDDLQKAVATVGPISVAIDASHSS 238
Query: 272 FHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
F Y GV + +C + DHGV VG+G +NG KYWL+KNSW ETWG++GYI + R
Sbjct: 239 FQLYSEGVYDEPNCSSEELDHGVLAVGYGV---KNGKKYWLVKNSWAETWGDNGYILMSR 295
Query: 330 DA-GLCGIATAASYPV 344
D CGIA++ASYP+
Sbjct: 296 DKDNQCGIASSASYPL 311
>gi|47086859|ref|NP_997749.1| cathepsin L, 1 a precursor [Danio rerio]
gi|42542930|gb|AAH66490.1| Cathepsin L1, a [Danio rerio]
Length = 337
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 202/342 (59%), Gaps = 16/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M V + C S V + ++ + + + +QW H + Y E+ R I+++NL+
Sbjct: 1 MRVFLAAFTLCLSAVFAAPTL-DQQLNDHWDQWKKWHSKKYH-ATEEGWRRVIWEKNLKK 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G TY+LG N F D+T+EEFR + G+ + R S F N
Sbjct: 59 IEMHNLEHSMGIHTYRLGMNHFGDMTHEEFRQVMNGFKHK----KDRRFRGSLFMEPNFI 114
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+VP +DWREKG VT +KDQG+CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 115 EVPNKLDWREKGYVTPVKDQGECGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSRP 174
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+Y+ + GL +E YPY + + K AA + + D+P G
Sbjct: 175 EGNEGCNGGLMDQAFQYVKDQNGLDSEESYPYLGTDDQPCHFDPKNSAANDTGFVDIPSG 234
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++A++ PVSV +DA +F FY+SG+ +C + DHGV VG+G E+
Sbjct: 235 KERALMKAIAAVGPVSVAIDAGHESFQFYQSGIYYEKECSSEELDHGVLAVGYGFEGEDV 294
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+G KYW++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 295 DGKKYWIVKNSWSENWGDKGYIYMAKDRHNHCGIATAASYPL 336
>gi|2765358|emb|CAA74241.1| cathepsin L [Litopenaeus vannamei]
Length = 325
Score = 259 bits (662), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 195/318 (61%), Gaps = 18/318 (5%)
Query: 36 PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFS 92
PS+ ++ + + A+HGR Y E+ RL++F+QN ++I+ N + G T+ L N+F
Sbjct: 16 PSLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFG 75
Query: 93 DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
D+T+EE A G+ + + RP+ + +P +DWR KGAVT +KDQ QC
Sbjct: 76 DMTSEEIVATMNGF------LGAPTRRPAAVLKADDETLPEKVDWRTKGAVTPVKDQKQC 129
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
GSCWAFS ++EG + GKL+ LSEQ LVDCS N GC GGLMD+AF YI NKG
Sbjct: 130 GSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFRNMGCMGGLMDQAFRYIKANKG 189
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASG 269
+ TE YPY ++G C V AT + Y D+ G E AL +AV+ P+SV +DAS
Sbjct: 190 IDTEDSYPYEAQDGKCRFDASN-VGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQ 248
Query: 270 RAFHFYKSGVLNAD-CGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
FHFY +GV + D C + DHGV VG+G+ +ENG +WL+KNSW +WG+ GYI++
Sbjct: 249 STFHFYHTGVYHDDHCSSTMLDHGVLAVGYGS--DENGGDFWLVKNSWNTSWGDKGYIKM 306
Query: 328 LRDA-GLCGIATAASYPV 344
R+ CGIA+ ASYP+
Sbjct: 307 SRNRNNNCGIASQASYPL 324
>gi|428186189|gb|EKX55040.1| hypothetical protein GUITHDRAFT_63227 [Guillardia theta CCMP2712]
Length = 344
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 143/300 (47%), Positives = 185/300 (61%), Gaps = 16/300 (5%)
Query: 58 EKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVS 114
E +F++NL+ I K N+E G ++Y++G N F+ LT EEF A Y GY
Sbjct: 47 ESTRAFEVFQKNLDMIMKHNEEYNQGLQSYEMGLNGFAHLTFEEFSAQYLGYG-GAEVEQ 105
Query: 115 RQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGK 174
++ R + ++ +++P S+DWREKGAV +K+QG CGSCWAFSAVAA+EG + G+
Sbjct: 106 PKTRRAGKHERKSRSEIPASVDWREKGAVAEVKNQGACGSCWAFSAVAALEGAHFLNSGE 165
Query: 175 LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLA--TEADYPYRHEEGTCDNQK 230
LI LSEQQLVDCS NHGC+GG MD AFEY + N G +E DYPY+ +G C
Sbjct: 166 LISLSEQQLVDCSKKFGNHGCAGGYMDNAFEYWMNNTGHGDDSEKDYPYKGMDGKCKFSA 225
Query: 231 EKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN---ADCGN 286
+ V ATIS Y D+ +G+E LL AV+N PVSV + A G A FY GV N C
Sbjct: 226 D-GVRATISGYNDVKQGNETDLLDAVANVGPVSVAIHA-GAALQFYLRGVFNGVAGTCFG 283
Query: 287 NCDHGVAVVGFGTAEEENGAK--YWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
+HGV VG+GTA G K YW+IKNSWG WGE G++R R LCG+A ASYP+
Sbjct: 284 PLNHGVTAVGYGTASLRFGRKMDYWIIKNSWGMGWGEKGFVRFARGKNLCGVANGASYPL 343
>gi|359483753|ref|XP_002266308.2| PREDICTED: oryzain alpha chain-like [Vitis vinifera]
Length = 501
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 191/319 (59%), Gaps = 14/319 (4%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDL 94
E + E W +H R YK E A R IFK+NL+Y+ + N +G+R + LG N+F+D+
Sbjct: 39 EERVRELFHLWKERHKRVYKHAEETAKRFEIFKENLKYVIERNSKGHR-HTLGMNKFADM 97
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT--DVPTSIDWREKGAVTHIKDQGQC 152
+NEEF+ Y + + R S + + + P+S+DWR+KG VT IKDQG C
Sbjct: 98 SNEEFKEKYLSKIKKPINKKNNYLRRSMQQKKGTASCEAPSSLDWRKKGVVTGIKDQGDC 157
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
GSCWAFS+ A+EGI I G LI LSEQ+LVDC T N+GC GG MD AFE++I N G+
Sbjct: 158 GSCWAFSSTGAMEGINAIVTGDLISLSEQELVDCDTTNYGCEGGYMDYAFEWVISNGGID 217
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
+E+DYPY +GTC+ KE +I Y+D+ + D ALL A NQP+SV +D S F
Sbjct: 218 SESDYPYTGTDGTCNTTKEDTKVVSIDGYKDVDESD-SALLCAAVNQPISVGMDGSALDF 276
Query: 273 HFYKSGVL---NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
Y SG+ +D ++ DH V +VG+G+ + E+ YW+ KNSWG +WG GY I R
Sbjct: 277 QLYTSGIYAGDCSDDPDDIDHAVLIVGYGSEDSED---YWICKNSWGTSWGMEGYFYIKR 333
Query: 330 DAGL----CGIATAASYPV 344
+ L C I ASYP
Sbjct: 334 NTDLPYGECAINAMASYPT 352
>gi|226443040|ref|NP_001140018.1| Cathepsin L1 precursor [Salmo salar]
gi|221221188|gb|ACM09255.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 146/341 (42%), Positives = 203/341 (59%), Gaps = 18/341 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ + +++ C S V + S +E H W H ++Y E E+ R ++++NL+ I
Sbjct: 4 LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKSYH-ESEEGWRRMVWEKNLKKI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +Y+LG N F D+TNEEFR GY + + + + S F N
Sbjct: 61 EMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P ++DWREKG VT +KDQG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPE 176
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD+AF+YI +N GL TE YPY + + K + A + + D+P G
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPSGK 236
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
E A+++AV+ PVSV +DA +F FY+SG+ +C + DHGV VVG+G E+ +
Sbjct: 237 EHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVD 296
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
G KYW++KNSW E WG+ GYI + +D CGIATA+SYP+
Sbjct: 297 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|185135439|ref|NP_001117777.1| procathepsin L precursor [Oncorhynchus mykiss]
gi|14582899|gb|AAK69706.1|AF358668_1 procathepsin L [Oncorhynchus mykiss]
Length = 338
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 147/341 (43%), Positives = 203/341 (59%), Gaps = 18/341 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ + +++ C S V + S +E H W H + Y E E+ R ++++NL+ I
Sbjct: 4 LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKHYH-ESEEGWRRMVWEKNLKKI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +Y+LG N F D+TNEEFR GY + + + + S F N
Sbjct: 61 EIHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P ++DWREKG VT +KDQG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPE 176
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD+AF+YI +N GL TE YPY + + K + AA + + D+P G
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPSGK 236
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
E A+++AV+ PVSV +DA +F FY+SG+ +C + DHGV VVG+G E+ +
Sbjct: 237 EHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVD 296
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
G KYW++KNSW E WG+ GYI + +D CGIATA+SYP+
Sbjct: 297 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|444514070|gb|ELV10520.1| Cathepsin L1 [Tupaia chinensis]
Length = 450
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/337 (43%), Positives = 196/337 (58%), Gaps = 25/337 (7%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
+ L C + S + ++ W + H R Y E+ R ++++N++ IE
Sbjct: 129 LFLAALCLG-IASATPNSDQNLDTSWHHWKSTHRRLYGKN-EEGWRRAVWEKNMKMIEMH 186
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N E G + +G N F D+TNEEFR + G+ +++ F + P
Sbjct: 187 NHEYSNGKHGFTMGMNAFGDMTNEEFRQVMNGFR------NQKQKSGKVFHAPLLLQAPK 240
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNH 191
S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKLI LSEQ LVDCS N
Sbjct: 241 SVDWREKGFVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSRRQGNL 300
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLMD AF+YI +N GL +E YPY+ +GTC + E AVA G E+A
Sbjct: 301 GCQGGLMDNAFQYIKDNGGLDSEESYPYKGMDGTCQYKAEWAVANDT--------GFEKA 352
Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKY 308
L++AV++ P+SV +DA +F FYK G+ DC + N DHGV VVG+G + + KY
Sbjct: 353 LMKAVASVGPISVAIDAGHASFQFYKDGIYYEPDCSSENLDHGVLVVGYGVEKRNSNDKY 412
Query: 309 WLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
WLIKNSWGE WG +GY++I +D CG+A+AASYPV
Sbjct: 413 WLIKNSWGEQWGANGYVKIAKDRNNHCGVASAASYPV 449
>gi|728637|emb|CAA59441.1| cathepsin l [Litopenaeus vannamei]
Length = 326
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/318 (44%), Positives = 195/318 (61%), Gaps = 18/318 (5%)
Query: 36 PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFS 92
PS+ ++ + + A+HGR Y E+ RL++F+QN ++I+ N + G T+ L N+F
Sbjct: 17 PSLRQQWQNFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFG 76
Query: 93 DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
D+T+EE A G+ + + RP+ + +P +DWR KGAVT +KDQ QC
Sbjct: 77 DMTSEEIVATMNGF------LGAPTRRPAAVLKADDETLPEKVDWRTKGAVTPVKDQKQC 130
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKG 210
GSCWAFS ++EG + GKL+ LSEQ LVDCS N GC GGLMD+AF YI NKG
Sbjct: 131 GSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKANKG 190
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASG 269
+ TE YPY ++G C V AT + Y D+ G E AL +AV+ P+SV +DAS
Sbjct: 191 IDTEDSYPYEAQDGKCRFDASN-VGATDTGYVDVEHGSESALKKAVATIGPISVGIDASQ 249
Query: 270 RAFHFYKSGVLNAD-CGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
FHFY +GV + D C + DHGV VG+G+ +ENG +WL+KNSW +WG+ GYI++
Sbjct: 250 STFHFYHTGVYHDDHCSSTMLDHGVLAVGYGS--DENGGDFWLVKNSWNTSWGDKGYIKM 307
Query: 328 LRDA-GLCGIATAASYPV 344
R+ CGIA+ ASYP+
Sbjct: 308 SRNRNNNCGIASQASYPL 325
>gi|413919735|gb|AFW59667.1| hypothetical protein ZEAMMB73_680472 [Zea mays]
Length = 344
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 127/266 (47%), Positives = 168/266 (63%), Gaps = 9/266 (3%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRT 83
+VS E + +WMA HGRTY E+ R +F+ NL Y++ N G +
Sbjct: 31 IVSYGERSEEEARRMYAEWMAAHGRTYNAVGEEERRFEVFRDNLRYVDAHNAAADAGVHS 90
Query: 84 YKLGTNEFSDLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGA 142
++LG N F+DLTN+E+RA Y G +RP R+ + + D+P S+DWR KGA
Sbjct: 91 FRLGLNRFADLTNDEYRATYLGVRSRP----QRERRLGDRYLAGDNEDLPESVDWRAKGA 146
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKA 201
V +KDQG CGSCWAFS +AAVEGI QI G +I LSEQ+LVDC T N GC+GGLMD A
Sbjct: 147 VAEVKDQGSCGSCWAFSTIAAVEGINQIVTGDMISLSEQELVDCDTSYNQGCNGGLMDYA 206
Query: 202 FEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPV 261
FE+II N G+ TE DYPY+ +G CD ++ A TI YED+P E++L +AV+NQP+
Sbjct: 207 FEFIINNGGIDTEEDYPYKGTDGRCDVNRKNAKVVTIDSYEDVPANSEKSLQKAVANQPI 266
Query: 262 SVCVDASGRAFHFYKSGVLNADCGNN 287
SV ++A GRAF Y SG+ CGN+
Sbjct: 267 SVAIEAGGRAFQLYNSGIFTGTCGNS 292
>gi|260516672|gb|ACX43963.1| cysteine protease 3, partial [Brachiaria hybrid cultivar]
Length = 319
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 142/287 (49%), Positives = 179/287 (62%), Gaps = 17/287 (5%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYT 104
+M Q+ + Y E + R N FK ++E I N N +Y +G NEF+DL+ EEF+ Y
Sbjct: 45 FMKQYSKAYS-HAEFSSRFNQFKASVETIRLHNTLANASYTMGLNEFADLSFEEFKGKYF 103
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
G V R+ +R + +Q V PTSIDWR AVT IKDQGQCGSCWAFSA ++
Sbjct: 104 G----CKHVEREFARSNNL-HQEVEAAPTSIDWRTSNAVTPIKDQGQCGSCWAFSATGSI 158
Query: 165 EGITQITRGK--LIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYR 220
EG + +GK L LSEQQLVDCST N GC+GGLMD AFEYII NKG+ E+ YPY+
Sbjct: 159 EG-AWVLQGKHTLTSLSEQQLVDCSTSYGNAGCNGGLMDYAFEYIIANKGICAESAYPYK 217
Query: 221 HEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV 279
G C K V TIS ++D+ GDE + L AV PVSV ++A F FY SGV
Sbjct: 218 GVGGLCQKSCTKVV--TISGHKDVASGDEASSLNAVGTVGPVSVAIEADQAGFQFYSSGV 275
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+ CG+N DHGV VG+GT ++ YW++KNSWG +WGESGYIR
Sbjct: 276 FSGTCGHNLDHGVLAVGYGTTGSQD---YWIVKNSWGTSWGESGYIR 319
>gi|34559455|gb|AAQ75437.1| cathepsin L-like protease [Helicoverpa armigera]
gi|338855117|gb|AEJ31938.1| cathepsin L-like protease [Helicoverpa assulta]
Length = 341
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/319 (43%), Positives = 199/319 (62%), Gaps = 20/319 (6%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTN 96
E+W A +H + Y E+E R+ I+ +N I K N+ +G +YKL N+++D+ +
Sbjct: 25 EEWSAFKLEHSKRYDSEVEDKFRMKIYLENKHRIAKHNQRFEQGAVSYKLRPNKYADMLS 84
Query: 97 EEFRALYTGYNRPV--PSVSR---QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
EF + G+N+ + P + SRP+TF P +DWR+KGAVT +KDQG+
Sbjct: 85 HEFVHVMNGFNKTLKHPKAVHGKGRESRPATFIAPAHVTYPDHVDWRKKGAVTEVKDQGK 144
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENK 209
CGSCWAFS A+EG G L+ LSEQ L+DCS N+GC+GGLMD AF+YI +N
Sbjct: 145 CGSCWAFSTTGALEGQHFRKTGYLVSLSEQNLIDCSAAYGNNGCNGGLMDNAFKYIKDNG 204
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ TE YPY + C + + A + + D+P+GDE+ L+QAV+ PVSV +DAS
Sbjct: 205 GIDTEKAYPYEGVDDKCRYNAKNSGADDVG-FVDIPQGDEEKLMQAVATVGPVSVAIDAS 263
Query: 269 GRAFHFYKSGVL-NADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F FY GV + +C + + DHGV VVG+GT +E G YWL+KNSWG TWG+ GYI+
Sbjct: 264 QESFQFYSDGVYYDENCSSTDLDHGVMVVGYGT--DEQGGDYWLVKNSWGRTWGDLGYIK 321
Query: 327 ILRDA-GLCGIATAASYPV 344
+ R+ CGIA++ASYP+
Sbjct: 322 MARNKNNHCGIASSASYPL 340
>gi|52345644|ref|NP_001004869.1| cathepsin L2 precursor [Xenopus (Silurana) tropicalis]
gi|49522051|gb|AAH74718.1| MGC69486 protein [Xenopus (Silurana) tropicalis]
Length = 335
Score = 259 bits (661), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 146/342 (42%), Positives = 207/342 (60%), Gaps = 18/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + + + C + V + + +P++ W H ++Y + E+ R ++++NL
Sbjct: 1 MALYLGIAAICLTTVFAAPTT-DPALDNHWNLWKNWHKKSYAPK-EEGWRRVLWEKNLRM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G ++ LG N+F D+TNEEFR L GY +++ R STF N
Sbjct: 59 IEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQLMNGYK------NQKKIRGSTFLAPNNF 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
+ P S+DWR+KG VT +KDQGQCGSCWAFS A+EG GK+I LSEQ LVDCS
Sbjct: 113 ESPKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKMISLSEQNLVDCSRA 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+Y+ +N G+ +E YPY ++ + +A + + D+ G
Sbjct: 173 QGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSG 232
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+ L+ AV++ PVSV VDA ++F FYKSG+ +C + + DHGV VVG+G E+E
Sbjct: 233 SEKDLMNAVASVGPVSVAVDAGHQSFQFYKSGIYYEPECSSEDLDHGVLVVGYGFEGEDE 292
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+G KYW++KNSW E WG GYI I +D CGIATAASYP+
Sbjct: 293 DGKKYWIVKNSWSEKWGNDGYIYIAKDRHNHCGIATAASYPL 334
>gi|444519959|gb|ELV12909.1| Cathepsin L1 [Tupaia chinensis]
Length = 333
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 143/341 (41%), Positives = 207/341 (60%), Gaps = 26/341 (7%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
+ L I C + S H+ S+ E+ QW A+HG+ Y E+++R ++++NL+ IE+
Sbjct: 5 LFLTILCLG-IASAAPTHDQSLDEQWNQWTAEHGKVYSTG-EESLRRAVWEKNLKMIEQH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N E G T+ +G N F D+TNE+FR + TG+ +++ ++ F+ +VP
Sbjct: 63 NLEYSQGKHTFTMGMNAFGDMTNEDFRQMMTGFQ------NQKYNKGEVFQPPQPLEVPE 116
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH-- 191
S+DWREKG VT +K+Q +CGSCWAFSA A+EG GKL+ LSEQ LVDCS H
Sbjct: 117 SVDWREKGYVTPVKNQHRCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSQPQHNS 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGL+ KAF+Y+ +N GL +E YPY E TC + AAT++ ++ +P +E+A
Sbjct: 177 GCKGGLVIKAFQYVKDNGGLDSEESYPYEEMESTCRYSPGNS-AATVTGFKHIP-AEEKA 234
Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGVLNADCGNNC-----DHGVAVVGFGTAEE-EN 304
L +AV++ P+SV +DA +F FY G+L+ NC +H V VVG+G +E N
Sbjct: 235 LEKAVASVGPISVAIDAHHHSFQFYTGGILHEP---NCSPKWLNHAVLVVGYGVMQEGSN 291
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YWL+KNSWGE WG GYI + +D CGIA+ A YP+
Sbjct: 292 NNTYWLVKNSWGERWGVGGYIMMAKDKNNHCGIASDALYPI 332
>gi|307111936|gb|EFN60170.1| hypothetical protein CHLNCDRAFT_59551 [Chlorella variabilis]
Length = 364
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 131/291 (45%), Positives = 179/291 (61%), Gaps = 12/291 (4%)
Query: 62 RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
R NI+ NL + + N + ++ L ++DL+ +E+R+ GYN + ++ R +
Sbjct: 71 RFNIWLDNLRFAHEYNAR-HTSHWLSMGVYADLSQDEYRSKALGYNAHLHK--KRPLRAA 127
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQ 181
F Y+ T P +DW GAVT +KDQ CGSCWAFS AVEG I GKL+ LSEQ
Sbjct: 128 PFLYKG-TVPPEEVDWVAGGAVTPVKDQLLCGSCWAFSTTGAVEGANAIATGKLVSLSEQ 186
Query: 182 QLVDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISK 240
LVDC + + GC GG MD AF++I+ N G+ TE DYPYR E+G C + + + TI
Sbjct: 187 MLVDCDREYDTGCRGGFMDSAFDFIVNNGGIDTEDDYPYRAEDGICQDNRTRRHVVTIDG 246
Query: 241 YEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTA 300
Y+D+P DE AL++AV++QPVSV ++A AF Y GV +A+CG DH V VVG+GTA
Sbjct: 247 YQDVPPNDENALMKAVAHQPVSVAIEADQLAFQLYGGGVFDAECGTALDHAVLVVGYGTA 306
Query: 301 EE-ENGAKYWLIKNSWGETWGESGYIRILRDA------GLCGIATAASYPV 344
+ YWL+KNSWG WGE GYIR+LR+ G CG+A AS+P+
Sbjct: 307 SNGTHNLPYWLVKNSWGAEWGEKGYIRLLRNLGKDAPEGQCGLAMYASFPI 357
>gi|330805275|ref|XP_003290610.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
gi|325079249|gb|EGC32858.1| hypothetical protein DICPUDRAFT_98747 [Dictyostelium purpureum]
Length = 334
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 147/341 (43%), Positives = 199/341 (58%), Gaps = 21/341 (6%)
Query: 11 IPMFVIIILVI----TCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
+ +F+I+ LVI CA+ + ++ S + WM +H + Y E + F
Sbjct: 3 LAVFLIVSLVILSINVCAATNLFSAQTYQTSFL----GWMKKHNKAYHHH-EFNDKYQTF 57
Query: 67 KQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
K N+++I N + + T LG N F+DLTNEE++ Y G + V + Q + ++
Sbjct: 58 KDNMDFIHNWNSKESDTV-LGLNRFADLTNEEYKKTYLGMSINVNLRANQVPM-NGLNFE 115
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
T P+SIDWR+ GAV ++KDQG CGSCWAF+ AVEG QI G ++ SEQ LVDC
Sbjct: 116 RFTG-PSSIDWRQNGAVAYVKDQGHCGSCWAFATTGAVEGAHQIKTGNMVTFSEQHLVDC 174
Query: 187 S--TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N+GC GGLM AF+YII+N G+ATE YPY + C + IS Y+D+
Sbjct: 175 SGRYGNNGCDGGLMTSAFKYIIDNDGIATEEAYPYTATQNRCV-YNTTMLGTAISGYKDV 233
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEE 302
P+G E AL A+S QPV+V +DAS F YKSGV A C + +HGV VG+GT E
Sbjct: 234 PRGSESALTAAISKQPVAVAIDASPITFQLYKSGVYQEATCSSYRLNHGVLAVGYGTLE- 292
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASY 342
G Y+++KNSW ETWG GYI + R+A CGIAT ASY
Sbjct: 293 --GKDYYIVKNSWAETWGNQGYILMARNANNHCGIATMASY 331
>gi|162138968|ref|NP_001104662.1| uncharacterized protein LOC567623 precursor [Danio rerio]
gi|158254065|gb|AAI54241.1| Zgc:174153 protein [Danio rerio]
Length = 336
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 144/342 (42%), Positives = 205/342 (59%), Gaps = 18/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
MF +II + C S V + S+ + + + W +QHG++Y +++E R+ I+++NL
Sbjct: 2 MFALIITL--CISAVFTAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE+ N E GN T+K+G N+F D+TNEEFR GY +R S P F +
Sbjct: 58 IEQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYKH---DPNRTSQGP-LFMEPSFF 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 114 AAPQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRP 173
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD AF+Y+ ENKGL +E YPY + + A + + D+P G
Sbjct: 174 QGNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKSTGFVDIPSG 233
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEE 303
+E AL+ AV+ PVSV +DAS ++ FY+SG+ A + DH V VVG+G +
Sbjct: 234 NEPALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADV 293
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G +YW++KNSW + WG+ GYI + +D CG+AT ASYP+
Sbjct: 294 AGNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|109112057|ref|XP_001086247.1| PREDICTED: cathepsin L1-like isoform 5 [Macaca mulatta]
gi|402897797|ref|XP_003911929.1| PREDICTED: cathepsin L1 [Papio anubis]
Length = 333
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 206/343 (60%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F++ + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR + G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY E +C E +V A + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ +KYWL+KNSWGE WG GYI++ +D CGIA+AASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|157093355|gb|ABV22332.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 140/330 (42%), Positives = 190/330 (57%), Gaps = 14/330 (4%)
Query: 21 ITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEG 80
I A V +G + P + + ++G+ Y E A+R IFK N++ I N
Sbjct: 6 IAAAVLVAAGHEVPPPDYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNAR- 64
Query: 81 NRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
N T+ LG NEF+DLT EE A YTG +P S+ R ST +Y N + +S+DW +
Sbjct: 65 NLTFALGVNEFTDLTQEELAASYTGL-KPA-SLWSGLPRLSTHEY-NGAPLASSVDWTTQ 121
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDK 200
G VT +K+QGQCGSCW+FS A+EG ++ G L+ LSEQQ VDC T + GC+GG MD
Sbjct: 122 GVVTPVKNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSEQQFVDCDTTDSGCNGGWMDN 181
Query: 201 AFEYIIENKGLATEADYPYRHEEGTCD--NQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
AF + +N + TE YPY +GTC+ + + Y D+ EQA++ AV+
Sbjct: 182 AFSFAKKNS-ICTEGSYPYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQ 240
Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
QPVS+ ++A +F Y SGVL A CG DHGV VG+G+ E G YW +KNSWG +
Sbjct: 241 QPVSIAIEADQYSFQLYSSGVLTASCGTRLDHGVLAVGYGS---EAGTDYWKVKNSWGSS 297
Query: 319 WGESGYIRILR---DAGLCG-IATAASYPV 344
WGE GY+R+ R AG CG +A SYPV
Sbjct: 298 WGEQGYVRLQRGKGGAGECGLLAGPPSYPV 327
>gi|297818854|ref|XP_002877310.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
gi|297323148|gb|EFH53569.1| hypothetical protein ARALYDRAFT_484828 [Arabidopsis lyrata subsp.
lyrata]
Length = 376
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/330 (43%), Positives = 200/330 (60%), Gaps = 19/330 (5%)
Query: 27 VVSGRSMH--EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
VV+ H E + +E+W+ +HG+ Y EK R IFK NL++IE+ N + NR+Y
Sbjct: 24 VVTATESHRNEAEVRTIYERWLVEHGKNYNGLGEKERRFKIFKDNLKHIEEHNSDPNRSY 83
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
G N+FSDLT +EF+A Y G S+S + R ++Y+ +P +DWRE+GAV
Sbjct: 84 DRGLNQFSDLTVDEFQASYLGGKIEKKSLSDVAER---YQYKEGDILPDEVDWRERGAVV 140
Query: 145 -HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKA 201
+K QG CGSCWAF+A AVEGI QIT G+L+ LSEQ+L+DC DN GC+GG A
Sbjct: 141 PRVKRQGDCGSCWAFAATGAVEGINQITTGELLSLSEQELIDCDRGKDNFGCAGGGAVWA 200
Query: 202 FEYIIENKGLATEADYPYRHEE-GTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQ 259
FE+I EN G+ T+ DY Y ++ C + K TI+ +E +P DE +L +AVS Q
Sbjct: 201 FEFIKENGGIVTDEDYGYTGDDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVSYQ 260
Query: 260 PVSVCVDASGRAFHFYKSGVLNADCGNNC-DHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
P+SV + A+ + YKSGV C N DH V +VG+GT+ +E YWLI+NSWG
Sbjct: 261 PISVMISAANMS--DYKSGVYKGPCSNLWGDHNVLIVGYGTSSDE--GDYWLIRNSWGPG 316
Query: 319 WGESGYIRILRD----AGLCGIATAASYPV 344
WGE GY+R+ R+ G C +A A YP+
Sbjct: 317 WGEGGYLRLQRNFNEPTGKCAVAVAPVYPI 346
>gi|119389039|pdb|2C0Y|A Chain A, The Crystal Structure Of A Cys25ala Mutant Of Human
Procathepsin S
Length = 315
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 145/318 (45%), Positives = 198/318 (62%), Gaps = 18/318 (5%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
+P++ W +G+ YK++ E+A+R I+++NL+++ N E G +Y LG N
Sbjct: 5 DPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLKFVMLHNLEHSMGMHSYDLGMNHL 64
Query: 92 SDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
D+T+EE +L + VPS Q R T+K +P S+DWREKG VT +K QG
Sbjct: 65 GDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPNRILPDSVDWREKGCVTEVKYQGS 119
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGLMDKAFEYIIEN 208
CG+ WAFSAV A+E ++ GKL+ LS Q LVDCST+ N GC+GG M AF+YII+N
Sbjct: 120 CGAAWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCSTEKYGNKGCNGGFMTTAFQYIIDN 179
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDA 267
KG+ ++A YPY+ + C K AAT SKY +LP G E L +AV+N+ PVSV VDA
Sbjct: 180 KGIDSDASYPYKAMDQKCQ-YDSKYRAATCSKYTELPYGREDVLKEAVANKGPVSVGVDA 238
Query: 268 SGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F Y+SGV C N +HGV VVG+G + NG +YWL+KNSWG +GE GYIR
Sbjct: 239 RHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DLNGKEYWLVKNSWGHNFGEEGYIR 295
Query: 327 ILRDAG-LCGIATAASYP 343
+ R+ G CGIA+ SYP
Sbjct: 296 MARNKGNHCGIASFPSYP 313
>gi|156398078|ref|XP_001638016.1| predicted protein [Nematostella vectensis]
gi|156225133|gb|EDO45953.1| predicted protein [Nematostella vectensis]
Length = 326
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 146/335 (43%), Positives = 202/335 (60%), Gaps = 20/335 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
+++ V+ +S+ S R + V W + HG++Y D E+ R+ I++QNLE I++
Sbjct: 5 LVLCVLVASSRGWSVRFGQDSEWVA----WKSYHGKSYSDVHEERTRMAIWQQNLEKIKR 60
Query: 76 ANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSI 135
N E + +YK+ N DLT +EFR Y G S R +T+ + +P+S+
Sbjct: 61 HNAE-DHSYKMAMNHLGDLTEDEFRYFYLGVRAHHNSTKRG---WATYMPPSNVKIPSSV 116
Query: 136 DWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGC 193
DW +KG VT +K+QGQCGSCWAFS +VEG G L+ LSEQ L+DCS N+GC
Sbjct: 117 DWSQKGYVTGVKNQGQCGSCWAFSTTGSVEGQHFRKTGSLVSLSEQNLIDCSGSYGNNGC 176
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
GGLMD AF YI N G+ TE+ YPY ++G+C + V A ++ Y+D+P+G EQAL
Sbjct: 177 QGGLMDNAFRYIESNGGIDTESSYPYLGQQGSC-HFSSSHVGARVTGYQDIPQGSEQALQ 235
Query: 254 QAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNN-CDHGVAVVGFGTAEEENGAKYWL 310
AV+ PVSV VDAS + FY SGV N C + DHGV V+G+G NG YWL
Sbjct: 236 SAVATVGPVSVAVDAS--QWQFYSSGVYDNPYCSSTQLDHGVLVIGYGNY---NGQDYWL 290
Query: 311 IKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+KNSWG +WG GYI + R+ CGIA++ASYP+
Sbjct: 291 VKNSWGYSWGVEGYIMMSRNKNNQCGIASSASYPL 325
>gi|126681066|gb|ABO26562.1| cathepsin L-like cysteine protease [Ixodes ricinus]
Length = 335
Score = 258 bits (660), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 141/309 (45%), Positives = 194/309 (62%), Gaps = 17/309 (5%)
Query: 47 AQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALY 103
A+HG++Y E E+ RL I+ +N I K N++ G Y + NEF D+ + EF +
Sbjct: 32 AKHGKSYVSETEEVFRLKIYMENRHKIAKHNEKYARGEVPYSMAMNEFGDMLHHEFVSTR 91
Query: 104 TGYNRPVPSVSRQSSRPSTFKYQNVTD--VPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
G+ R R+ S + + +N+ D +P ++DWR KGAVT +K+QGQCGSCWAFSA
Sbjct: 92 NGFKRNYKDQPREGS--TYLEPENIEDFSLPKTVDWRTKGAVTPVKNQGQCGSCWAFSAT 149
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
++EG G ++ LSEQ LV CSTD N+GC GGLMD AF+YI NKG+ TE YPY
Sbjct: 150 GSLEGQHFRKSGSMVSLSEQNLVGCSTDFGNNGCEGGLMDDAFKYIRANKGIDTEKSYPY 209
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSG 278
+GTC + K+ V AT S + D+ +G E L +AV+ P+SV +DAS +F FY G
Sbjct: 210 NGTDGTC-HFKKSTVGATDSGFVDIKEGSETQLKKAVATVGPISVAIDASHESFQFYSDG 268
Query: 279 VLN-ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCG 335
V + +C + + DHGV VVG+GT NG YW +KNSWG TWG+ GYIR+ R+ CG
Sbjct: 269 VYDEPECDSESLDHGVLVVGYGTL---NGTDYWFVKNSWGTTWGDEGYIRMSRNKKNQCG 325
Query: 336 IATAASYPV 344
IA++AS P+
Sbjct: 326 IASSASIPL 334
>gi|390347681|ref|XP_801784.2| PREDICTED: cathepsin L1-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 336
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 140/340 (41%), Positives = 213/340 (62%), Gaps = 19/340 (5%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
F++ I ++ CA+ + P + + +W H ++Y +++ + R ++++N++ I
Sbjct: 6 FLVAIGLVACATAAFVKPT--NPDLDSRWLEWKIAHTKSYTNDMHELERRLVWEENVKMI 63
Query: 74 EKANKEGN---RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
N + + + ++LG NE+ D+ E R+ GY S + + STF +
Sbjct: 64 NMHNLDHSLHKKGFRLGMNEYGDMRLHEVRSTMNGYK----SSNVTKVQGSTFLTPSNIQ 119
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
VP ++DWR KG VT +K+QGQCGSCWAFS ++EG T KL+ LSEQ LVDCS
Sbjct: 120 VPDTVDWRTKGYVTPVKNQGQCGSCWAFSTTGSLEGQTFKKTSKLVSLSEQNLVDCSRTE 179
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC GGLMD+ F+Y+I+N G+ +E YPY E+ TC + K +A ++ + D+ GD
Sbjct: 180 GNMGCEGGLMDQGFQYVIDNHGIDSEDCYPYDAEDETC-HYKASCDSAEVTGFTDVTSGD 238
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENG 305
EQAL++AV++ PVSV +DAS ++F Y+SGV + +C ++ DHGV VVG+GT + G
Sbjct: 239 EQALMEAVASVGPVSVAIDASHQSFQLYESGVYDEPECSSSELDHGVLVVGYGT---DGG 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
YWL+KNSWGETWG SGYI++ R+ + CGIAT+ASYP+
Sbjct: 296 KDYWLVKNSWGETWGLSGYIKMSRNKSNQCGIATSASYPL 335
>gi|355753449|gb|EHH57495.1| Cathepsin L1 [Macaca fascicularis]
Length = 333
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 206/343 (60%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F++ + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR + G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQELLF 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSW 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY E +C E +V A + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ +KYWL+KNSWGE WG GYI++ +D CGIA+AASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|308810026|ref|XP_003082322.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
gi|116060790|emb|CAL57268.1| cysteine protease-1 (ISS) [Ostreococcus tauri]
Length = 430
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 141/337 (41%), Positives = 198/337 (58%), Gaps = 31/337 (9%)
Query: 37 SIVEKHEQWMAQHG--RTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEF 91
++ E+W ++HG R +D E A RL F +N Y+ + N G ++ +G N
Sbjct: 93 ALARHFERWCSEHGLERYLRDTEEYAKRLATFAENAAYVVEHNALYAIGEVSHWVGLNSL 152
Query: 92 SDLTNEEFRALYTGYNRPV-----------PSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
+ T EE+RAL GY + S + ++++Y +V D P +IDW E
Sbjct: 153 AATTREEYRALL-GYKPELRSSGDAEMLEATSTDKVEQYKASWEYASV-DPPEAIDWVEL 210
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDK 200
GAVT K+QGQCGSCWAFS AVEGIT+I G+L+ LSEQ++V CS N GC+GGLMD
Sbjct: 211 GAVTPPKNQGQCGSCWAFSTTGAVEGITKIRTGRLVSLSEQEMVSCSKQNMGCNGGLMDY 270
Query: 201 AFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
AF +I++N G+ +E YPY E C+ K + ATI ++D+P GDE+ L +AVS QP
Sbjct: 271 AFRWIVKNGGIDSEFQYPYSAEALACNRWKLQLHVATIDGFKDVPPGDEKELEKAVSQQP 330
Query: 261 VSVCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFG--------TAEEENGAKYWLI 311
VS+ ++A ++F Y GV ++ +CG+ DHGV VVG+G T + +W +
Sbjct: 331 VSIAIEADTKSFQLYDGGVYDSKECGSQVDHGVLVVGYGFDDTHHNATKHHKRHRHFWKV 390
Query: 312 KNSWGETWGESGYIRILR----DAGLCGIATAASYPV 344
KNSWG TWGE G+IR+ R + G CGI TA SYP
Sbjct: 391 KNSWGGTWGEGGFIRMARRISDETGQCGITTAPSYPT 427
>gi|308321226|gb|ADO27765.1| cathepsin S [Ictalurus furcatus]
Length = 329
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 200/340 (58%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++ + C + V ++ +PS+ W H +TY ELE+ R I+++NL
Sbjct: 1 MLRSLLFTVICGAVV----ALQDPSLDMHWLMWKKNHSKTYTSELEELGRREIWERNLRL 56
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I N E G TY LG N D+T EE ++ G R P+++R R S F
Sbjct: 57 ITVHNLEASLGMHTYDLGMNHMGDMTREEILQMFAG-TRVRPNLTR---RSSPFVASAGI 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
VP S+DWREKG VT +K+QG CGSCWAFSA A+EG + T G++ LS Q LVDCS+
Sbjct: 113 SVPDSVDWREKGYVTEVKNQGSCGSCWAFSAAGALEGQLKRTTGQVKSLSPQNLVDCSSK 172
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GG M +AF+Y+I++ G+ ++ YPY +G C + + AA S Y + +G
Sbjct: 173 YGNKGCNGGFMTQAFQYVIDDGGIDSDEAYPYTAMDGQCRYDQSQR-AANCSSYNYVSEG 231
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENG 305
DE+AL QAV+ P+SV +DA+ F Y SGV + C N +HGV VVG+G+ NG
Sbjct: 232 DEEALKQAVATIGPISVAIDATRPMFILYHSGVYSDPTCTQNVNHGVLVVGYGSL---NG 288
Query: 306 AKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
YWL+KNSWG +G+ GYIRI R+ G +CGIA A YP+
Sbjct: 289 EDYWLVKNSWGTRFGDGGYIRIARNKGNMCGIANYACYPL 328
>gi|1709574|sp|P10056.2|PAPA3_CARPA RecName: Full=Caricain; AltName: Full=Papaya peptidase A; AltName:
Full=Papaya proteinase III; Short=PPIII; AltName:
Full=Papaya proteinase omega; Flags: Precursor
gi|18098|emb|CAA46862.1| proteinase omega [Carica papaya]
Length = 348
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 139/351 (39%), Positives = 198/351 (56%), Gaps = 16/351 (4%)
Query: 1 MVLKFEKSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQ----WMAQHGRTYKDE 56
M+ K + + + + + ++ + G S + + E+ Q WM H + Y++
Sbjct: 3 MIPSISKLLFVAICLFVHMSVSFGDFSIVGYSQDDLTSTERLIQLFNSWMLNHNKFYENV 62
Query: 57 LEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQ 116
EK R IFK NL YI++ NK+ N +Y LG NEF+DL+N+EF Y G + + +
Sbjct: 63 DEKLYRFEIFKDNLNYIDETNKK-NNSYWLGLNEFADLSNDEFNEKYVG---SLIDATIE 118
Query: 117 SSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLI 176
S F ++ ++P ++DWR+KGAVT ++ QG CGSCWAFSAVA VEGI +I GKL+
Sbjct: 119 QSYDEEFINEDTVNLPENVDWRKKGAVTPVRHQGSCGSCWAFSAVATVEGINKIRTGKLV 178
Query: 177 ELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAA 236
ELSEQ+LVDC +HGC GG A EY+ +N G+ + YPY+ ++GTC ++
Sbjct: 179 ELSEQELVDCERRSHGCKGGYPPYALEYVAKN-GIHLRSKYPYKAKQGTCRAKQVGGPIV 237
Query: 237 TISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVG 296
S + +E LL A++ QPVSV V++ GR F YK G+ CG DH V V
Sbjct: 238 KTSGVGRVQPNNEGNLLNAIAKQPVSVVVESKGRPFQLYKGGIFEGPCGTKVDHAVTAV- 296
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILR----DAGLCGIATAASYP 343
+ G Y LIKNSWG WGE GYIRI R G+CG+ ++ YP
Sbjct: 297 --GYGKSGGKGYILIKNSWGTAWGEKGYIRIKRAPGNSPGVCGLYKSSYYP 345
>gi|357130488|ref|XP_003566880.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 356
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 138/320 (43%), Positives = 188/320 (58%), Gaps = 20/320 (6%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+HE+WMA+ GR Y D EKA R +F N Y++ N+ GNRTY LG N+FSDLT++EF
Sbjct: 38 RHEEWMAKFGRVYTDAQEKARRQEVFGANARYVDAVNRAGNRTYTLGLNKFSDLTDDEFV 97
Query: 101 ALYTGYNRPVPSVSRQS----SRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCW 156
+ GY R S+ + Y D+P S+DWR +GAVT +K+QG CG CW
Sbjct: 98 QTHLGYRGHQQGGLRPEEENVSKVAALGYGQA-DMPESVDWRAQGAVTGVKNQGSCGCCW 156
Query: 157 AFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHG------CSGGLMDKAFEYIIENKG 210
AF+AVAA EG+ +I G LI +SEQQ++DC+ + G C GG +D A Y+ ++G
Sbjct: 157 AFAAVAATEGLVKIATGNLISMSEQQVLDCTGQSPGMGNTNTCDGGHIDDALRYVAASRG 216
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP-KGDEQALLQAVSNQPVSVCVDASG 269
L EA Y Y +G C + AA+ + + + +GDE L V+ QP++V V+AS
Sbjct: 217 LQPEAAYAYTGLQGACQSGFTPNSAASFGEPQTVTLQGDEGRLQGLVAGQPIAVSVEAS- 275
Query: 270 RAFHFYKSGVLNA---DCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
F Y SGV A CG +H V VVG+G+A + G +YWL+KN WG +WGE GY+R
Sbjct: 276 DDFRHYMSGVFTAGTSSCGQRLNHAVTVVGYGSA--DGGQEYWLVKNQWGTSWGEGGYMR 333
Query: 327 ILRDAGL--CGIATAASYPV 344
I R G CGI+ A YP
Sbjct: 334 IARGNGAPNCGISAYAYYPT 353
>gi|81294188|gb|AAI08032.1| Cathepsin L, 1 b [Danio rerio]
Length = 336
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 141/341 (41%), Positives = 203/341 (59%), Gaps = 16/341 (4%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ +LV S V + S+ + + + W +QHG++Y +++E R+ I+++NL I
Sbjct: 1 MMFALLVTLYISAVFAAPSI-DIQLDDHWNSWKSQHGKSYHEDVEVGRRM-IWEENLRKI 58
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E+ N E GN T+K+G N+F D+TNEEFR GY Q+S+ F +
Sbjct: 59 EQHNFEYSYGNHTFKMGMNQFGDMTNEEFRQAMNGYTHD----PNQTSQGPLFMEPSFFA 114
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P +DWR++G VT +KDQ QCGSCW+FS+ A+EG GKLI +SEQ LVDCS
Sbjct: 115 APQQVDWRQRGYVTPVKDQKQCGSCWSFSSTGALEGQLFRKTGKLISMSEQNLVDCSRPQ 174
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD AF+Y+ ENKGL +E YPY + + A I+ + D+P G+
Sbjct: 175 GNQGCNGGLMDLAFQYVKENKGLDSEQSYPYLARDDLPCRYDPRFNVAKITGFVDIPSGN 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL--NADCGNNCDHGVAVVGFG-TAEEEN 304
E AL+ AV+ PVSV +DAS ++ FY+SG+ A + DH V VVG+G +
Sbjct: 235 ELALMNAVAAVGPVSVAIDASHQSLQFYQSGIYYERACSSSRLDHAVLVVGYGYQGADVA 294
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G +YW++KNSW + WG+ GYI + +D CG+AT ASYP+
Sbjct: 295 GNRYWIVKNSWSDKWGDKGYIYMAKDKNNHCGVATKASYPL 335
>gi|238481789|gb|ACR43934.1| cathepsin L-like cysteine proteinase [Haliotis diversicolor
supertexta]
Length = 347
Score = 258 bits (659), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 149/307 (48%), Positives = 195/307 (63%), Gaps = 18/307 (5%)
Query: 48 QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYT 104
QHGR Y+ E+ R IFKQNL+YIE+ NK+ G ++Y LG N+F+D+ NEEFR +Y
Sbjct: 48 QHGRLYEKHEEEEERFEIFKQNLQYIEEHNKKFSLGQKSYYLGINQFADMKNEEFR-MYN 106
Query: 105 GYNRPVP-SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAA 163
G R S Q S T +Y P +DWR+KG VT +K+QGQCGSCW+FS +
Sbjct: 107 GLRRDYNYSREVQCSNHLTPEY---LVAPDEVDWRKKGYVTAVKNQGQCGSCWSFSTTGS 163
Query: 164 VEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRH 221
+EG GKL+ LSEQQLVDCS N GC+GGLMD+AFEYII N G+ TE +YPY
Sbjct: 164 LEGQHFHKSGKLVSLSEQQLVDCSGKFGNEGCNGGLMDQAFEYIITNGGIETEEEYPYDA 223
Query: 222 EEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL 280
+ C +K + VAAT S D+ GDE L +V+ PVS+ +DAS ++F Y GV
Sbjct: 224 RQERCHFKKSE-VAATASGCVDVKSGDETDLKNSVAEVGPVSIAIDASHQSFQLYSGGVY 282
Query: 281 N-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIA 337
+ C + DHGV VVG+GT ++G YWL+KNSWG TWG GY+++ R+ CG+A
Sbjct: 283 DEPKCSSTELDHGVLVVGYGT---DDGQDYWLVKNSWGTTWGLEGYVKMSRNQDNQCGVA 339
Query: 338 TAASYPV 344
T ASYP+
Sbjct: 340 TQASYPL 346
>gi|405966498|gb|EKC31776.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 192/310 (61%), Gaps = 20/310 (6%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
++ HG+ Y E E+A R I++ NL+YIEK N G+ ++ LG NE+ D+TNEEFR+
Sbjct: 30 YLKAHGKQYGAE-EEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEEFRS 88
Query: 102 LYTGYNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
GY + +SR S + N+ D+P ++DWR KG VT IK+QGQCGSCW+FSA
Sbjct: 89 TMNGYK-----MRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
++EG T GKL LSEQ LVDCS NHGC GGLMD AF+YI +N G+ TE+ YP
Sbjct: 144 TGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNSGIDTESSYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y + G C V AT S + D+ E L AV+ P+SV +DAS +F Y+S
Sbjct: 204 YEAKNGKCRFNAAN-VGATDSGFTDIKSKSESDLQSAVATVGPISVAIDASHMSFQLYRS 262
Query: 278 GVLNA-DCG-NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLC 334
GV + C DHGV VG+GT E+G YWL+KNSWGE+WG+ GYI + R+ C
Sbjct: 263 GVYHEFFCSETRLDHGVLAVGYGT---ESGKDYWLVKNSWGESWGQKGYIMMSRNKRNNC 319
Query: 335 GIATAASYPV 344
GIAT+ASYP
Sbjct: 320 GIATSASYPT 329
>gi|21483184|gb|AAF86584.1| cathepsin L cysteine protease [Haemonchus contortus]
Length = 355
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 144/304 (47%), Positives = 189/304 (62%), Gaps = 14/304 (4%)
Query: 50 GRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGY 106
G++Y+ E E + F +N+ +IE+ NKE G +T+++G NE +DL ++R L GY
Sbjct: 56 GKSYEPEEENDY-MEAFVKNVIHIEEHNKEHRLGRKTFEMGLNEIADLPFSQYRKL-NGY 113
Query: 107 NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEG 166
S + F +P S+DWRE+G VT +K+QG CGSCWAFS+ A+EG
Sbjct: 114 RMRRQFGDSMQSNGTKFLVPFNVQIPESVDWREEGLVTPVKNQGMCGSCWAFSSTGALEG 173
Query: 167 ITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEG 224
GKL+ LSEQ LVDCST NHGC+GGLMD AFEYI EN G+ TE YPY E
Sbjct: 174 QHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENHGVDTEDSYPYVGRET 233
Query: 225 TCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNA 282
C + K V A + DLP+GDE+AL +AV+ Q P+S+ +DA R+F YK GV +
Sbjct: 234 KC-HFKRNTVGADDKGFVDLPEGDEEALKKAVATQGPISIAIDAGHRSFQLYKKGVYFDE 292
Query: 283 DCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAA 340
+C + DHGV +VG+GT E YWL+KNSWG TWGE GYIRI R+ CG+AT A
Sbjct: 293 ECSSEELDHGVLLVGYGTDPE--AGDYWLVKNSWGPTWGEKGYIRIARNRNNHCGVATKA 350
Query: 341 SYPV 344
SYP+
Sbjct: 351 SYPL 354
>gi|358345461|ref|XP_003636796.1| Cysteine proteinase [Medicago truncatula]
gi|355502731|gb|AES83934.1| Cysteine proteinase [Medicago truncatula]
Length = 475
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 136/319 (42%), Positives = 183/319 (57%), Gaps = 31/319 (9%)
Query: 35 EPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT--YKLGTNEFS 92
E +VE +QW +H + Y E A+RL FK+NL+YI + N N + LG N F+
Sbjct: 44 EEQVVELFQQWKKEHQKFYIHPEEAALRLENFKRNLKYIVERNAMRNSPVGHHLGLNRFA 103
Query: 93 DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
D++NEEF+ + K ++ D P S+DWR+KG VT +KDQG C
Sbjct: 104 DMSNEEFKNKFIS------------------KVESCDDAPYSLDWRKKGVVTGVKDQGNC 145
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLA 212
GSCW+FS+ A+EG+ I G LI LSEQ+LVDC T N GC GG MD AFE++I N G+
Sbjct: 146 GSCWSFSSTGAIEGVNAIVTGDLISLSEQELVDCDTTNDGCEGGYMDYAFEWVINNGGID 205
Query: 213 TEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAF 272
TEADYPY GTC+ KE+ TI Y D+ + D AL A QP+SV +D S F
Sbjct: 206 TEADYPYIGVGGTCNVTKEETKVVTIDGYTDVTQSD-SALFCATVKQPISVGIDGSTLDF 264
Query: 273 HFYKSGVLNADCGNN---CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILR 329
Y G+ + DC +N DH V +VG+G+ + YW++KNSWG +WG G+I I R
Sbjct: 265 QLYTGGIYDGDCSSNPDDIDHAVLIVGYGS---DGNQDYWIVKNSWGTSWGIEGFIYIRR 321
Query: 330 DA----GLCGIATAASYPV 344
+ G+C I AS+P
Sbjct: 322 NTNLKYGVCAINYMASFPT 340
>gi|405958751|gb|EKC24845.1| Cathepsin L [Crassostrea gigas]
Length = 330
Score = 258 bits (658), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 147/310 (47%), Positives = 192/310 (61%), Gaps = 20/310 (6%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
++ HG+ Y E E+A R I++ NL+YIEK N G+ ++ LG NE+ D+TNEEFR+
Sbjct: 30 YLKAHGKQYGAE-EEARRRVIWEGNLDYIEKHNLAADRGDYSFWLGMNEYGDMTNEEFRS 88
Query: 102 LYTGYNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
GY + +SR S + N+ D+P ++DWR KG VT IK+QGQCGSCW+FSA
Sbjct: 89 TMNGY-----KMRNGTSRGSLYLPPSNIGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
++EG T GKL LSEQ LVDCS NHGC GGLMD AF+YI +N G+ TE+ YP
Sbjct: 144 TGSLEGQTFKKTGKLPSLSEQNLVDCSQKQGNHGCQGGLMDDAFQYIKDNNGIDTESSYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y + G C V AT S + D+ E L AV+ P++V +DAS +F YKS
Sbjct: 204 YEAKNGKCRFNAAN-VGATDSGFTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKS 262
Query: 278 GVLNA-DCG-NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLC 334
GV + C DHGV VG+GT E+G YWL+KNSWGE+WG+ GYI + R+ C
Sbjct: 263 GVYHEFFCSETRLDHGVLAVGYGT---ESGKDYWLVKNSWGESWGQKGYIMMSRNKRNNC 319
Query: 335 GIATAASYPV 344
GIAT+ASYP
Sbjct: 320 GIATSASYPT 329
>gi|355567871|gb|EHH24212.1| Cathepsin L1 [Macaca mulatta]
Length = 333
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 206/343 (60%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F++ + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR + G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY E +C E +V A + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEEAYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ +KYWL+KNSWGE WG GYI++ +D CGIA+AASYP
Sbjct: 290 SDNSKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|330434686|gb|AEC22811.1| cathepsin L [Macrobrachium nipponense]
Length = 342
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 140/322 (43%), Positives = 199/322 (61%), Gaps = 20/322 (6%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ E + +H + Y+ + E+ R+ IF +N + I NK G++TYKLG N++ D+
Sbjct: 25 VMEEWESFKFEHSKKYESDTEETFRMKIFAENKQKIAAHNKLYHTGSKTYKLGMNKYGDM 84
Query: 95 TNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD------VPTSIDWREKGAVTHIKD 148
+ EF + G+ +++R F+ + + +P S+DWREKGAVT +KD
Sbjct: 85 LHHEFVNMMNGFRANTSGAGYKANR--GFQGAHFVEPPEDVVMPKSVDWREKGAVTEVKD 142
Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYII 206
QG CGSCWAFSA A+EG G L+ LSEQ LVDCS+ N+GC+GGLMD AF+YI
Sbjct: 143 QGSCGSCWAFSATGALEGQHYRQTGDLVSLSEQNLVDCSSKFGNNGCNGGLMDNAFQYIK 202
Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCV 265
N G+ TE YPY E+ C A A + D+ +G+E AL +A++ PVSV +
Sbjct: 203 VNGGIDTEKSYPYEAEDEPCRYNPANA-GADDRGFVDVREGNENALKKAIATIGPVSVAI 261
Query: 266 DASGRAFHFYKSGVL-NADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
DAS +F FY+ GV + DC N DHGV VG+GT E+ G YWL+KNSW ++WG+ G
Sbjct: 262 DASQDSFQFYQHGVYSDPDCSAENLDHGVLAVGYGTTED--GQDYWLVKNSWSKSWGDQG 319
Query: 324 YIRILRDA-GLCGIATAASYPV 344
YI+I R+ +CGIA+AASYP+
Sbjct: 320 YIKIARNQNNMCGIASAASYPL 341
>gi|84660246|emb|CAI43320.1| cathepsin L [Lubomirskia baicalensis]
gi|85677150|emb|CAI46307.1| cathepsin L [Lubomirskia baicalensis]
Length = 327
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 142/313 (45%), Positives = 194/313 (61%), Gaps = 16/313 (5%)
Query: 40 EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR-TYKLGTNEFSDLTNEE 98
E+ E W +HG+ Y + E+ R I++ N +Y+++ N + + +G N+F+DL + E
Sbjct: 20 EEWESWKKEHGKVYNSDREELTRHIIWQANRKYVDEHNAHAEKFGFTVGMNQFADLESSE 79
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
F LY GYN PS+ + S+ + K V D+PTS+DWR KG VT IK+QGQCGSCWAF
Sbjct: 80 FGRLYNGYNNK-PSMKKAQSKVFSTK---VGDLPTSVDWRTKGFVTAIKNQGQCGSCWAF 135
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEAD 216
SAVA +EG G L+ LSEQ LVDCST N GC+GGLMD AF+Y+I+N G+ TEA
Sbjct: 136 SAVAGLEGQHFNATGTLVSLSEQNLVDCSTAEGNQGCNGGLMDNAFQYVIKNGGIDTEAS 195
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYED-LP-KGDEQALLQAVSNQPVSVCVDASGRAFHF 274
YPY+ + C V +T S + D LP K + + P+SV +DAS +F
Sbjct: 196 YPYKAVDQKCKFNAAN-VGSTCSGFSDILPHKSEAALQVAVAVVGPISVAIDASHTSFQL 254
Query: 275 YKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
YKSGV +A + DHGV VG+ + +G YW++KNSWG TWG++GYI + R+
Sbjct: 255 YKSGVYSESACSQTSLDHGVTAVGY---DSSSGVAYWIVKNSWGTTWGQAGYIWMSRNKN 311
Query: 332 GLCGIATAASYPV 344
CGIATAASYP+
Sbjct: 312 NQCGIATAASYPI 324
>gi|357122137|ref|XP_003562772.1| PREDICTED: fruit bromelain-like [Brachypodium distachyon]
Length = 358
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 144/332 (43%), Positives = 199/332 (59%), Gaps = 35/332 (10%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNE 97
++++ + A + RTY E+ R ++++N++YIE N+ G+ TY+LG N+F+DLT +
Sbjct: 36 MMDRFRAFQATYNRTYASPEERLRRFEVYRRNVDYIEAMNRRGDLTYELGENQFADLTVQ 95
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV---------------------PTSID 136
EFRA+YT +P+ R SRP ++ + + PTS+D
Sbjct: 96 EFRAMYT-----MPA--RVDSRPDAWRRRQMITTLAGPVTEDGGSYYSDAWEEAGPTSVD 148
Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGG 196
WR KGAVT +KDQG CG CWAF+ VA +EG+ +I G+L+ LSEQ+LVDC + GC GG
Sbjct: 149 WRSKGAVTPVKDQGGCGCCWAFATVATIEGLHKIKTGQLVSLSEQELVDCDDADDGCGGG 208
Query: 197 LMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAV 256
L + A E++ N GL TEA+YPY + G CD K AA I+ + + E L +AV
Sbjct: 209 LPEIAMEWVAHNGGLTTEANYPYTGKAGKCDRGKASNHAAKIAAAQMVRANSEAELERAV 268
Query: 257 SNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWG 316
+ QPV+V ++A + FYKSGV + C DH V VVG+G + G KYW+IKNSW
Sbjct: 269 ARQPVAVAINAPD-SLMFYKSGVYSGPCTAEFDHAVTVVGYGA--DNKGHKYWIIKNSWA 325
Query: 317 ETWGESGYIRILRDA----GLCGIATAASYPV 344
ETWGE GY R+ R GLCGIAT ASYPV
Sbjct: 326 ETWGEKGYGRMQRGVAAKEGLCGIATHASYPV 357
>gi|288548566|gb|ADC52431.1| cathepsin L2 cysteine protease [Pinctada fucata]
Length = 330
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 142/306 (46%), Positives = 191/306 (62%), Gaps = 18/306 (5%)
Query: 48 QHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYT 104
Q+ + Y++E E RL +++ NL++I N G T+ +G NE+ D+TNEEF
Sbjct: 33 QYNKLYQNEEEARRRL-VWESNLDFITLHNLAADRGEHTFWVGMNEYGDMTNEEFTKTMN 91
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
GY ++ S+ P N+ D+P ++DWR KG VT IK+QGQCGSCW+FSA ++
Sbjct: 92 GYRMR----NKTSNAPVFMPPNNMGDLPDTVDWRPKGYVTPIKNQGQCGSCWSFSATGSL 147
Query: 165 EGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
EG T GKL+ LSEQ LVDCS NHGC GGLMD AF YI N G+ TEA YPY+
Sbjct: 148 EGQTFKKTGKLVSLSEQNLVDCSKKQGNHGCEGGLMDDAFTYIKANNGIDTEASYPYKAR 207
Query: 223 EGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL- 280
+G C+ K V AT + + D+ DE+AL QAV+ P+SV +DAS +F Y++GV
Sbjct: 208 DGKCE-FKSADVGATDTGFVDIKTKDEEALKQAVATVGPISVAIDASHMSFQLYRTGVYH 266
Query: 281 NADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIAT 338
+ C DHGV VG+GT E+ YWL+KNSWGE+WG+ GYI++ R+ CGIAT
Sbjct: 267 DWFCSQTKLDHGVLAVGYGT---EDSKDYWLVKNSWGESWGQKGYIQMSRNRRNNCGIAT 323
Query: 339 AASYPV 344
+ASYP
Sbjct: 324 SASYPT 329
>gi|297663703|ref|XP_002810310.1| PREDICTED: LOW QUALITY PROTEIN: cathepsin S [Pongo abelii]
Length = 330
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 148/341 (43%), Positives = 209/341 (61%), Gaps = 23/341 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
M ++ +++ C+S V +H+ ++ H W +G+ YK++ E+A+R I+++NL+
Sbjct: 1 MKQLVCVLLVCSSAVAQ---LHKDPTLDHHWHLWKKTYGKQYKEKNEEAVRRLIWEKNLK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
++ N E G +Y LG N D+T+EE +L + VPS Q R T+K
Sbjct: 58 FVMIHNLEHSMGMHSYDLGMNHLGDMTSEEVMSLMSSLR--VPS---QWQRNITYKSNPN 112
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
+P S+DWREKG VT +K QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCST
Sbjct: 113 RILPDSVDWREKGCVTEVKYQGSCGACWAFSAVGALEAQLKLKTGKLVSLSAQNLVDCST 172
Query: 189 D---NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
+ N GC+GG M AF+YII+NKG+ ++A YPY+ + K + AAT SKY D
Sbjct: 173 EKYGNKGCNGGFMTTAFQYIIDNKGIDSDASYPYKAMVKCQYDSKYR--AATCSKYTDFX 230
Query: 246 KGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEE 303
G E L +AV+N+ PVSV VDA +F Y+SGV C N +HGV VVG+G +
Sbjct: 231 YGREDVLKEAVANKGPVSVGVDARHPSFFLYRSGVYYEPSCTQNVNHGVLVVGYG---DL 287
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
NG +YWL+KNSWG +GE GYIR+ R+ G CGIA+ S+P
Sbjct: 288 NGKEYWLVKNSWGRNFGEEGYIRMARNKGNHCGIASFPSFP 328
>gi|217072410|gb|ACJ84565.1| unknown [Medicago truncatula]
Length = 328
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 124/220 (56%), Positives = 155/220 (70%), Gaps = 9/220 (4%)
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P S+DWR++GAV +KDQ CGSCWAFSA+AAVEGI +I G LI LSEQ+LVDC T
Sbjct: 24 LPESVDWRKEGAVVGVKDQASCGSCWAFSAIAAVEGINKIVTGDLISLSEQELVDCDTSY 83
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N GC+GGLMD AFE+II N G+ +E DYPY+ +G CD ++ A TI YED+P DE
Sbjct: 84 NEGCNGGLMDYAFEFIISNGGIDSEDDYPYKAVDGRCDQNRKNAKVVTIDDYEDVPAYDE 143
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
AL +AV+NQP++V V+ GR F Y+ GVL CG DHGVA VG+GT ENG YW
Sbjct: 144 LALQKAVANQPIAVAVEGGGREFQLYEYGVLTGRCGTALDHGVAAVGYGT---ENGKDYW 200
Query: 310 LIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
+++NSWG +WGE GYIR+ R+ AG CGIA SYP+
Sbjct: 201 IVRNSWGGSWGEQGYIRLERNLASSRAGKCGIAIEPSYPI 240
>gi|71897043|ref|NP_001026516.1| cathepsin S precursor [Gallus gallus]
gi|53126701|emb|CAG30977.1| hypothetical protein RCJMB04_1f23 [Gallus gallus]
Length = 328
Score = 258 bits (658), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 147/339 (43%), Positives = 201/339 (59%), Gaps = 27/339 (7%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M V++ LV V G +P++ + + W HG+ Y+ + E+ R +++NL
Sbjct: 7 MAVLVTLV------AVMGHP--DPTLDQHWQLWKKAHGKEYRHQAEEGQRRATWEKNLRL 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
+ N E G +Y+LG N D+T+E+ AL TG VP Q+S Y+
Sbjct: 59 VMLHNLEHSLGLHSYQLGMNHMGDMTSEDVAALLTGLR--VPYGHNQTS-----TYRRRG 111
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
P ++DWREKG VT +K+QG CG+CWAFSAV A+E ++ GKL+ LS Q LVDCS
Sbjct: 112 GAPDAMDWREKGCVTEVKNQGACGACWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSMM 171
Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC GG M +AF+YII+N G+ +E YPY + GTC AAT SKY +LP
Sbjct: 172 YGNKGCGGGFMTRAFQYIIDNNGIDSEESYPYMAQNGTC-QYNVSTRAATCSKYVELPYA 230
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENG 305
DE AL AV+N PVSV +DA+ F Y+SGV + C +HGV VVG+GT E++
Sbjct: 231 DEAALKDAVANVGPVSVAIDATQPTFFLYRSGVYDDPRCTQEVNHGVLVVGYGTLNEKD- 289
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYP 343
+WL+KNSWGE +G+ GYIR+ R+ A CGIA+ ASYP
Sbjct: 290 --FWLVKNSWGERFGDGGYIRMSRNHANHCGIASYASYP 326
>gi|313118768|gb|ADR32296.1| C14 cysteine protease [Solanum demissum]
gi|313118770|gb|ADR32297.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 119/218 (54%), Positives = 150/218 (68%), Gaps = 8/218 (3%)
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-N 190
P S+DWR+KG + +KDQG CGSCWAFSAVAA+E I I G LI LSEQ+LVDC N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGNLISLSEQELVDCDKSYN 61
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC GGLMD AFE++I N G+ TE DYPY+ G CD ++ A TI YED+P +E+
Sbjct: 62 EGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNGVCDQYRKNAKVVTIDSYEDVPVNNEK 121
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
AL +AV++QPVS+ ++A GR F YKSG+ CG DHGV V G+GT ENG YW+
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVVAGYGT---ENGMDYWI 178
Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
++NSWG WGE GY+R+ R+ +GLCG+A SYPV
Sbjct: 179 VRNSWGAKWGEKGYLRVQRNVASSSGLCGLAIEPSYPV 216
>gi|17062058|gb|AAL34984.1|AF320565_1 cathepsine L-like cysteine protease [Rhodnius prolixus]
Length = 316
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 142/323 (43%), Positives = 194/323 (60%), Gaps = 25/323 (7%)
Query: 36 PSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTN 89
PS ++W+A HG+ Y+++ E+ R+ +F N + I++ N + G +YK+ N
Sbjct: 4 PSFDIDPQEWLAFKAMHGKNYRNQFEEIFRMKVFIDNKKKIDEHNAKYELGEASYKMKMN 63
Query: 90 EFSDLTNEEFRALYTGYNRPVPSVSRQSS--RPSTFKYQNVTDVPTSIDWREKGAVTHIK 147
DL EF+AL G+ + P+ R PS ++P S+DWR++GAVT +K
Sbjct: 64 HLGDLMVHEFKALMNGFKK-TPNAERNGKIYVPSN------ENLPKSVDWRQRGAVTPVK 116
Query: 148 DQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYI 205
DQG CGSCW+FSA ++EG + G+L+ LSEQ LVDCS N GC GGLM++AF+Y+
Sbjct: 117 DQGHCGSCWSFSATGSLEGQLFLKTGRLVSLSEQNLVDCSKTYGNSGCEGGLMNQAFQYV 176
Query: 206 IENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVC 264
+NKG+ TEA YPY E C KE V T Y D+ + E+ L AV+ P+SV
Sbjct: 177 RDNKGIDTEASYPYEARENNC-RFKEDKVGGTDKGYVDILEASEKDLQSAVATVGPISVR 235
Query: 265 VDASGRAFHFYKSGVLNAD-CG-NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGES 322
+DAS +F FY GV C + DHGV VG+GT ENG YWL+KNSWG +WGES
Sbjct: 236 IDASHESFQFYSEGVYKEQYCSPSQLDHGVLTVGYGT---ENGQDYWLVKNSWGPSWGES 292
Query: 323 GYIRILRD-AGLCGIATAASYPV 344
GYI+I R+ CGIA+ ASYPV
Sbjct: 293 GYIKIARNHKNHCGIASMASYPV 315
>gi|332260024|ref|XP_003279085.1| PREDICTED: cathepsin L1 isoform 3 [Nomascus leucogenys]
gi|441593306|ref|XP_004087072.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
gi|441593309|ref|XP_004087073.1| PREDICTED: cathepsin L1 [Nomascus leucogenys]
Length = 333
Score = 257 bits (657), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 204/342 (59%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M +IL C + S + S+ + +W A H R Y E+ R ++++N++
Sbjct: 1 MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE+ N +EG ++ + N F D+T+EEFR + G+ P + P +
Sbjct: 59 IEQHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQNRKPRKGKVFQEPLFY------ 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSGP 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD AF+Y+ +N GL +E YPY E +C + +V A + + D+PK
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSV-ANDTGFVDIPK- 230
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ P+SV VDA ++F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 QEKALMKAVATVGPISVAVDAGHQSFQFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ KYWL+KNSWGE WG GYI++ +D CGIA+AASYP
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|21489677|gb|AAM55195.1|AF412313_1 cathepsin L cysteine protease [Haemonchus contortus]
gi|21483192|gb|AAL14224.1| cathepsin L [Haemonchus contortus]
Length = 354
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 145/313 (46%), Positives = 193/313 (61%), Gaps = 14/313 (4%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNE 97
K + + G++Y+ + E + F +N+ +IE+ NKE G +T+++G NE +DL
Sbjct: 46 KWDDYKETFGKSYEPDEENDY-MEAFVKNVIHIEEHNKEHRLGRKTFEMGLNEIADLPFS 104
Query: 98 EFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWA 157
++R L GY S + F +P S+DWRE+G VT +K+QG CGSCWA
Sbjct: 105 QYRKL-NGYRMRRQFGDSLQSNGTKFLVPFNVQIPESVDWREEGLVTPVKNQGMCGSCWA 163
Query: 158 FSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEA 215
FS+ A+EG GKL+ LSEQ LVDCST NHGC+GGLMD AFEYI EN G+ TE
Sbjct: 164 FSSTGALEGQHARATGKLVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKENHGVDTED 223
Query: 216 DYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHF 274
YPY E C + K AV A + DLP+GDE+AL +AV+ Q P+S+ +DA R+F
Sbjct: 224 SYPYVGRETKC-HFKRNAVGADDKGFVDLPEGDEEALKKAVATQGPISIAIDAGHRSFQL 282
Query: 275 YKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA- 331
YK GV + +C + DHGV +VG+GT E YWL+KNSWG TWGE GYIRI R+
Sbjct: 283 YKKGVYFDEECSSEELDHGVLLVGYGTDPE--AGDYWLVKNSWGPTWGEKGYIRIARNRN 340
Query: 332 GLCGIATAASYPV 344
CG+AT ASYP+
Sbjct: 341 NHCGVATKASYPL 353
>gi|121543825|gb|ABM55577.1| putative cathepsin L-like protease [Maconellicoccus hirsutus]
Length = 341
Score = 257 bits (656), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 147/345 (42%), Positives = 197/345 (57%), Gaps = 16/345 (4%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNL 70
+ F + V+ S V+ S ++ I E+ E + Q + Y E+E+ R+ +F N
Sbjct: 1 MKAFAFLCCVLIYHSNSVTAVSFNDL-IAEEWELFKTQFSKAYNTEIEEKFRMKVFMDNK 59
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
I + NK G +Y+L N F DL + EF GY + V+ TF
Sbjct: 60 HKIARHNKLFQNGEVSYELEMNHFGDLLHHEFVKTVNGYRHSLRRVTGDEIDSVTFIPAY 119
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
VP S+DWR +GAVT +K+QGQCGSCWAFS ++EG +L LSEQ L+DCS
Sbjct: 120 NVTVPDSVDWRTEGAVTEVKNQGQCGSCWAFSTTGSLEGQHFRNTKQLTSLSEQNLIDCS 179
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N+GCSGGLMD AF YI NKG+ TE YPY + C K + AT + D+P
Sbjct: 180 GKYGNNGCSGGLMDNAFAYIKSNKGIDTEQSYPYEGIDDKC-RYKPQESGATDKGFVDIP 238
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN---NCDHGVAVVGFGTA 300
+GDE+ L AV+ P+SV +DAS ++F FYK GV + CGN + DHGV VG+GT
Sbjct: 239 QGDEEKLKLAVATVGPISVAIDASHQSFQFYKKGVYYDKGCGNGEEDLDHGVLAVGYGT- 297
Query: 301 EEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
ENG YWL+KNSWG+ WG GYI++ R+ CGIAT+ASYP+
Sbjct: 298 --ENGKDYWLVKNSWGKRWGLDGYIKMARNKHNHCGIATSASYPL 340
>gi|348565223|ref|XP_003468403.1| PREDICTED: cathepsin L1-like [Cavia porcellus]
Length = 333
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 201/344 (58%), Gaps = 21/344 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
+ P FV+ L + +VS + ++ + +QW A HGR Y E+ R ++++N
Sbjct: 1 MTPSFVLAALCLG----IVSALPKLDQTLDAQWDQWKAAHGRLYGLN-EEGWRRAVWEKN 55
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
L IE N E G ++ LG N F D+TNEEFR + G+ + P +
Sbjct: 56 LRMIELHNGEYSQGRHSFTLGMNHFGDMTNEEFRQVMNGFQHQKHKTGKMYQEPLLLQ-- 113
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+P S+DWREKG VT +K+QGQCGSCWAFSA ++EG G L+ LSEQ LVDC
Sbjct: 114 ----LPKSVDWREKGYVTEVKNQGQCGSCWAFSATGSLEGQMFHKTGNLVSLSEQNLVDC 169
Query: 187 S--TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N GC+GGLMD AF+Y+ +NKGL E YPY ++G C + E + AA + + D+
Sbjct: 170 SRPQGNQGCNGGLMDFAFQYVKDNKGLEAEKSYPYVGKDGECKYKPELS-AANDTGFVDV 228
Query: 245 PKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEE 302
P+ ++ + P+SV +DA ++F FYK G+ + C + + +HGV +VG+GT
Sbjct: 229 PQREKVVQKALATVGPLSVAIDAGLQSFQFYKEGIYYDPGCSSRDLNHGVLLVGYGTDAS 288
Query: 303 ENG-AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
E G YWLIKNSWG TWG GY++I R+ CG+ATAASYP+
Sbjct: 289 ETGKGDYWLIKNSWGTTWGADGYVKIARNRNNHCGVATAASYPL 332
>gi|432936690|ref|XP_004082231.1| PREDICTED: cathepsin L-like [Oryzias latipes]
Length = 334
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 145/315 (46%), Positives = 194/315 (61%), Gaps = 25/315 (7%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
W + GRTY E+A R + N + + N +G ++Y+LG F+D+ NEE++
Sbjct: 29 WRLKFGRTYSSPTEEAQRRQTWLNNRKLVLVHNILADQGIKSYRLGMTYFADMENEEYKR 88
Query: 102 LYT-----GYNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
L + +N +P R STF + D+P ++DWR+KG VT +KDQ QCGSC
Sbjct: 89 LISQGCLGSFNASLPR------RGSTFFRLPENKDLPAAVDWRDKGYVTDVKDQKQCGSC 142
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLAT 213
WAFSA ++EG T GKL+ LSEQQLVDCS D N GC GGLMD AF YI G+ T
Sbjct: 143 WAFSATGSLEGQTFRKTGKLVSLSEQQLVDCSGDYGNMGCGGGLMDDAFRYIQATGGIDT 202
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAF 272
E YPY E+G C K AV AT + Y D+ GDE AL +AV+ P+SV +DAS +F
Sbjct: 203 EESYPYEAEDGEC-RYKPDAVGATCTGYVDVSSGDEDALQEAVATIGPISVGIDASHISF 261
Query: 273 HFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
Y+SG+ + C ++ DHGV VG+G+ ENG YWL+KNSWG TWG+ GYI++ ++
Sbjct: 262 QLYESGLYDEPQCSSSELDHGVLAVGYGS---ENGQDYWLVKNSWGLTWGDQGYIKMSKN 318
Query: 331 -AGLCGIATAASYPV 344
+ CGIATAASYP+
Sbjct: 319 KSNQCGIATAASYPL 333
>gi|149755226|ref|XP_001494409.1| PREDICTED: cathepsin L1-like [Equus caballus]
Length = 334
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 143/338 (42%), Positives = 202/338 (59%), Gaps = 19/338 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
+ L C + S +PS+ + QW A H R Y E+ R ++++N+ IE
Sbjct: 5 LFLAALCLG-IASAAPKLDPSLDAQWYQWKATHRRLYGVN-EEGWRRAVWEKNMRMIELH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N+E G + + N F D+TNEEFR + G+ +++ + F +VP
Sbjct: 63 NQEYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGRVFLEPLFLEVPK 116
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNH 191
++DWREKG VT +K+QG CGSCWAFSA A+EG GKL+ LSEQ LVDCS N
Sbjct: 117 TVDWREKGYVTPVKNQGPCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAEGNQ 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC+GGLMD AF+Y+ +N GL +E YPY +EG N K + AA + Y D+P+ E+A
Sbjct: 177 GCNGGLMDNAFQYVKDNGGLDSEESYPYLAKEGNNCNYKPEYSAANDTGYVDIPQ-KEKA 235
Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEENGAK 307
L++AV+ P+SV +DA +F FYKSG+ + DC + + DHGV VVG+G + N K
Sbjct: 236 LMKAVATVGPISVAIDAGHESFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGRDSNNNK 295
Query: 308 YWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+W++KNSWG WG +GY+++ +D CGIATAASYP
Sbjct: 296 FWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 333
>gi|348514005|ref|XP_003444531.1| PREDICTED: cathepsin L1-like [Oreochromis niloticus]
Length = 338
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 146/334 (43%), Positives = 206/334 (61%), Gaps = 24/334 (7%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GN 81
S V+S + +P + E W + H + Y E E+ R ++++NL+ IE N + G
Sbjct: 14 SSVLSAPHL-DPQLDEHWNLWKSWHTKKYH-EKEEGWRRMVWEKNLKKIELHNLDHSMGK 71
Query: 82 RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKG 141
TY+LG N F D+TNEEFR L GY + + + S F N + P S+DWR+KG
Sbjct: 72 HTYRLGMNHFGDMTNEEFRQLMNGYKHK----AERKVKGSLFLEPNFLEAPRSLDWRDKG 127
Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMD 199
VT +KDQGQCGSCWAFSA A+EG GK+++LSEQ LV+CS N GC+GGLMD
Sbjct: 128 YVTPVKDQGQCGSCWAFSATGALEGQQFRKTGKMVQLSEQNLVECSRPEGNEGCNGGLMD 187
Query: 200 KAFEYIIENKGLATEADYPYRHEEGTCDNQK----EKAVAATISKYEDLPKGDEQALLQA 255
+AF+Y+ +N+GL +E YPY GT D+QK + A + + D+ G E AL++A
Sbjct: 188 QAFQYVKDNQGLDSEESYPYL---GT-DDQKCHYDPRYNAVNDTGFVDIKSGSEHALMKA 243
Query: 256 VSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLI 311
V+ P+SV +DA +F FY+SG+ +C + DHGV +VG+G E+ +G KYW++
Sbjct: 244 VTAVGPISVAIDAGHESFQFYQSGIYYEPECSSEELDHGVLLVGYGFEGEDVDGKKYWIV 303
Query: 312 KNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
KNSW E WG+ GY+ + +D CGIATAASYP+
Sbjct: 304 KNSWSEKWGDKGYVYMAKDRQNHCGIATAASYPL 337
>gi|309380130|gb|ADO65978.1| cathepsin L [Eriocheir sinensis]
gi|309380134|gb|ADO65980.1| cathepsin L [Eriocheir sinensis]
Length = 325
Score = 257 bits (656), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 139/312 (44%), Positives = 194/312 (62%), Gaps = 20/312 (6%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEF 99
+Q+ A++G+ Y+ E + R ++++QN E+I N++ G ++ L N+F D+T EE
Sbjct: 23 QQFKARYGKQYRSTKEDSYRQSVYEQNQEFINSHNEQYENGLVSFTLAMNQFGDMTTEEI 82
Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQGQCGSCWAF 158
A G+ +S P YQ + D +P ++DWR+KGAVT +KDQ CGSCWAF
Sbjct: 83 NAAMNGF------LSAGKKVPRGTMYQPLVDELPDTVDWRDKGAVTPVKDQKACGSCWAF 136
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEAD 216
SA ++EG ++ GKL+ LSEQ LVDCS N GC GGLMD AF YI +N G+ TE
Sbjct: 137 SATGSLEGQHFLSTGKLVSLSEQNLVDCSDKYGNFGCGGGLMDNAFRYIKDNNGIDTEES 196
Query: 217 YPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFY 275
YPY + G C + V AT+S Y D+ G E L +AV+ + PVSV +DAS FHFY
Sbjct: 197 YPYEAKNGPCRFNSDN-VGATLSSYVDIQHGSEDDLQKAVAEKGPVSVAIDASTSTFHFY 255
Query: 276 KSGV-LNADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-G 332
G+ + C ++ DHGV VG+GT ++ + YWL+KNSW ETWG+SGYI++ R+
Sbjct: 256 SRGIYYDEKCSSSFLDHGVLAVGYGT---DDSSDYWLVKNSWNETWGDSGYIKMSRNRNN 312
Query: 333 LCGIATAASYPV 344
CGIA+ ASYPV
Sbjct: 313 NCGIASQASYPV 324
>gi|392873948|gb|AFM85806.1| cathepsin L [Callorhinchus milii]
Length = 338
Score = 256 bits (655), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 141/342 (41%), Positives = 211/342 (61%), Gaps = 15/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + +++ C + ++ S+ +P + EQW + HG++Y ++ E+ R +++++L
Sbjct: 1 MRLPFVVLSLCLAGGLAAPSL-DPGLDTHWEQWKSWHGKSY-EQKEETWRRMVWEKHLRV 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +++LG N F D+ NEEFR L GY Q S F N
Sbjct: 59 IEIHNLEHSLGKHSFRLGMNHFGDMPNEEFRQLMNGYKYKQTHKKLQGSH---FLEPNFL 115
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+VP +DWR++G VT +KDQGQCGSCWAFS A+EG G+L+ LSEQ LV+CS
Sbjct: 116 EVPKHVDWRDEGYVTPVKDQGQCGSCWAFSTTGALEGQHFRRTGQLVSLSEQNLVECSKP 175
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+Y+ +N G+ +E YPY + T + + AA + + D+P G
Sbjct: 176 EGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYVGTDDTPCHYNPQYNAANDTGFVDIPSG 235
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEE- 303
E+AL++A++ PVSV +DA +F FY+SG+ A+C + + DHGV VVG+G + +
Sbjct: 236 KERALMKAIAAVGPVSVAIDAGHTSFQFYQSGIYFEAECSSTDLDHGVLVVGYGVEKRDT 295
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+G KYW++KNSW E G++GYI + +D CGIATAASYP+
Sbjct: 296 DGKKYWIVKNSWSEKLGQNGYILMAKDKDNHCGIATAASYPL 337
>gi|296082368|emb|CBI21373.3| unnamed protein product [Vitis vinifera]
Length = 245
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 122/220 (55%), Positives = 157/220 (71%), Gaps = 9/220 (4%)
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P S+DWRE GAV +KDQ CGSCWAFS VAAVEGI QI G+LI LSEQ+LVDC T+
Sbjct: 6 LPESVDWRETGAVNPVKDQRSCGSCWAFSTVAAVEGINQIVTGELISLSEQELVDCDTEY 65
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
+ GC+GGLMD AF++II+N GL TE DYPY +G C+ + + +I YED+P DE
Sbjct: 66 DMGCNGGLMDYAFDFIIKNGGLDTEKDYPYTGFDGECNLSGKSSKVVSIDGYEDVPPFDE 125
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
+AL +AV++QPVSV V+A GRA Y SG+ +CG DHG+ VG+GT ENG YW
Sbjct: 126 KALQKAVAHQPVSVAVEAGGRALQLYVSGIFTGECGTALDHGIVAVGYGT---ENGTDYW 182
Query: 310 LIKNSWGETWGESGYIRILRD-----AGLCGIATAASYPV 344
+++NSWG +WGE+GYIR+ R+ +G CGIA ASYP+
Sbjct: 183 IVRNSWGSSWGENGYIRMERNMADAFSGKCGIAMEASYPI 222
>gi|223646726|gb|ACN10121.1| Cathepsin L1 precursor [Salmo salar]
gi|223672581|gb|ACN12472.1| Cathepsin L1 precursor [Salmo salar]
Length = 338
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 145/341 (42%), Positives = 202/341 (59%), Gaps = 18/341 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ + +++ C S V + S +E H W H ++Y E E+ R ++++NL+ I
Sbjct: 4 LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKSYH-ESEEGWRRMVWEKNLKKI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +Y+LG N F D+TNEEFR GY + + + + S F N
Sbjct: 61 EMHNLEHTMGKHSYRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P ++DWREKG VT +KDQG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQQFRKTGKLVSLSEQNLVDCSRPE 176
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD+AF+YI +N GL TE YPY + + K + A + + D+P G
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSGANETGFVDIPSGK 236
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
E A+++AV+ PVSV +DA +F FY+ G+ +C + DHGV VVG+G E+ +
Sbjct: 237 EHAMMKAVAAVGPVSVAIDAGHESFQFYEFGIYYEKECSSEELDHGVLVVGYGFEGEDVD 296
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
G KYW++KNSW E WG+ GYI + +D CGIATA+SYP+
Sbjct: 297 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|426216524|ref|XP_004002512.1| PREDICTED: cathepsin S isoform 1 [Ovis aries]
Length = 331
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 145/333 (43%), Positives = 206/333 (61%), Gaps = 20/333 (6%)
Query: 20 VITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE 79
++ C+S + + +P++ + W +G+ Y+++ E+ R I+++NL+ + N E
Sbjct: 8 LLLCSSAMA--QVHRDPTLDHHWDLWKKTYGKQYEEKNEEVARRLIWEKNLKTVMLHNLE 65
Query: 80 ---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSID 136
G +Y+LG N D+T+EE + + VPS Q R T+K +P S+D
Sbjct: 66 HSMGMHSYELGMNHLGDMTSEEVISSMSSLR--VPS---QWPRNVTYKSSPNQKLPDSLD 120
Query: 137 WREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST---DNHGC 193
WREKG VT +K QG CGSCWAFSAV A+E ++ GKL+ LS Q LVDCST N GC
Sbjct: 121 WREKGCVTEVKYQGACGSCWAFSAVGALEAQVKLKTGKLVSLSAQNLVDCSTVKYGNKGC 180
Query: 194 SGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALL 253
+GG M +AF+YII+N G+ +EA YPY+ +G C K AAT S+Y +LP G E+AL
Sbjct: 181 NGGFMTEAFQYIIDNNGIDSEASYPYKAMDGRCQ-YDVKNRAATCSRYIELPFGSEEALK 239
Query: 254 QAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGNNCDHGVAVVGFGTAEEENGAKYWLI 311
+AV+N+ PVSV +DA +F YK+GV + C N +HGV VVG+G+ NG YWL+
Sbjct: 240 EAVANKGPVSVGIDAKQTSFFLYKTGVYYDPSCTQNVNHGVLVVGYGSL---NGKDYWLV 296
Query: 312 KNSWGETWGESGYIRILRDAG-LCGIATAASYP 343
KNSWG +G+ GYIR+ R++G CGIA SYP
Sbjct: 297 KNSWGLNFGDQGYIRMARNSGNHCGIANFPSYP 329
>gi|402770503|gb|AFQ98386.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 256 bits (655), Expect = 8e-66, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 205/340 (60%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF ++ G++ +R++ + NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSTFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P +DWR+KGAVT +KDQGQCGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGRHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC GGLM+ AF+YI EN G+ TE YPY +G C +KE V AT + Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKENDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
E L +AV+ P+SV +DAS +F Y GV + +C + + DHGV VVG+G + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|157093357|gb|ABV22333.1| cysteine protease 1 [Noctiluca scintillans]
Length = 338
Score = 256 bits (655), Expect = 9e-66, Method: Compositional matrix adjust.
Identities = 138/324 (42%), Positives = 188/324 (58%), Gaps = 14/324 (4%)
Query: 27 VVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKL 86
V +G + P + + ++G+ Y E A+R IFK N++ I N N T+ L
Sbjct: 12 VAAGHEVPPPDYMMMFNNFKTKYGKVYNGINEDAVRFGIFKANVDIIYATNAR-NLTFAL 70
Query: 87 GTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHI 146
G NEF+DLT EEF A YTG +P S+ R ST +Y N + +S+DW +G VT +
Sbjct: 71 GVNEFTDLTQEEFAASYTGL-KPA-SLWSGLPRLSTHEY-NGAPLASSVDWTTQGVVTPV 127
Query: 147 KDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYII 206
K+QGQCGSCW+FS A+EG ++ G L+ LSEQQ DC T + GC+GG MD AF +
Sbjct: 128 KNQGQCGSCWSFSTTGALEGAWALSTGNLVSLSEQQFEDCDTTDSGCNGGWMDNAFSFAK 187
Query: 207 ENKGLATEADYPYRHEEGTCD--NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVC 264
+N + TE YPY +GTC+ + + Y D+ EQA++ AV+ QPVS+
Sbjct: 188 KNS-ICTEGSYPYTATDGTCNLSGCQVGIPQGGVVGYTDVSTDSEQAMMSAVAQQPVSIA 246
Query: 265 VDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
++A +F Y SGVL A CG DHGV VG+G+ E G YW +KNSWG +WGE GY
Sbjct: 247 IEADQYSFQLYSSGVLTASCGTRLDHGVLAVGYGS---EAGTDYWKVKNSWGSSWGEQGY 303
Query: 325 IRILR---DAGLCG-IATAASYPV 344
+R+ R AG CG +A SYPV
Sbjct: 304 VRLQRGKGGAGECGLLAGPPSYPV 327
>gi|356515116|ref|XP_003526247.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 333
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 145/340 (42%), Positives = 188/340 (55%), Gaps = 48/340 (14%)
Query: 41 KHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFR 100
+ ++W+ +G Y+D+ E +R I++ N+EYI K +Y L N+F+DLTNEEF
Sbjct: 4 RFDRWLKXNGXNYEDKEEWEIRFVIYQANVEYI-GCKKSQKNSYNLTDNKFADLTNEEFV 62
Query: 101 ALYTGY-NRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCG------ 153
+ Y G+ R +P + FKY ++P S DWR++GAVT IKDQG CG
Sbjct: 63 STYLGFATRLIPH--------TRFKYHEHGNLPXSKDWRKEGAVTDIKDQGNCGKHSTWF 114
Query: 154 -----------------------SCWAFSAVAAVEGITQITRGKLIELSEQQLV--DCST 188
S WAFS VAAVE I +I GKL+ LSEQ+LV D +
Sbjct: 115 SPEISHNLRNILTNYNTINFRDISFWAFSVVAAVERINKIKSGKLVSLSEQELVDYDVAN 174
Query: 189 DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC GGLMD F +I +N GL T DYPY +G+C+ +K A IS YE P D
Sbjct: 175 KNQGCEGGLMDTTFAFIKKNGGLTTSKDYPYEGVDGSCNKEKALHHAVNISGYERAPSKD 234
Query: 249 EQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKY 308
E L A +NQP+SV +DA G AF Y GV + CG +HGV +VG+ + KY
Sbjct: 235 EAMLKVAAANQPISVAIDAGGYAFQLYSQGVFSGVCGKKLNHGVTIVGYDKGTFD---KY 291
Query: 309 WLIKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
+KNS G WGESGYIR+ RD AG CGIA ASYP+
Sbjct: 292 RTVKNSXGADWGESGYIRMKRDAFDKAGTCGIAMKASYPL 331
>gi|426219875|ref|XP_004004143.1| PREDICTED: cathepsin L1 [Ovis aries]
Length = 333
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 146/338 (43%), Positives = 201/338 (59%), Gaps = 20/338 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
++L + C + S + S+ + E W A H + Y D E+ R ++K+N++ IE
Sbjct: 5 LLLTVLCLG-IASAAPKFDHSLNTQWELWKAVHRKPY-DLNEEGWRKAVWKKNMKMIELH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N+E G ++ + N F DLT+EEFR + G+ R +++ + F +P
Sbjct: 63 NQEYSQGKHSFSMAMNAFGDLTSEEFRQMMNGFQR------QENKKGKVFHETIFASIPP 116
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
S+DWREKG VT +K+QG+CGSCWAFS A+EG GKL+ LSEQ LVDCS N
Sbjct: 117 SVDWREKGYVTPVKNQGKCGSCWAFSTTGALEGQMFRKTGKLVSLSEQNLVDCSQPEGNR 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLMD AF+Y+++ GL +E YPY GTC N K AA + + DLPK E A
Sbjct: 177 GCHGGLMDNAFQYVLDVGGLDSEESYPYTGLVGTC-NYNPKNSAANETGFVDLPK-QENA 234
Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADC-GNNCDHGVAVVGFG-TAEEENGAK 307
L++AV+ P+SV VDAS +F FYKSG+ C + DHGV VVG+G + + K
Sbjct: 235 LMKAVATLGPISVAVDASNPSFQFYKSGIYYEPKCKSESVDHGVLVVGYGFEGADSDDNK 294
Query: 308 YWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YWL+KNSWG+ WG +GYI++ +D CGIAT ASYP
Sbjct: 295 YWLVKNSWGKHWGINGYIKMAKDQNNHCGIATMASYPT 332
>gi|146216002|gb|ABQ10203.1| cysteine protease Cp5 [Actinidia deliciosa]
Length = 509
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 141/329 (42%), Positives = 193/329 (58%), Gaps = 18/329 (5%)
Query: 30 GRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT--YKLG 87
G S+ E +VE ++W +HG+ YK E + F+ NL Y+ + N E + + +G
Sbjct: 39 GESIAEERVVELFKKWTEKHGKVYKHGQEVEKKFQNFRDNLRYVMEKNGERGASGGHLVG 98
Query: 88 TNEFSDLTNEEFRALYTGYNRPVPSVS-----RQSSRPSTFKYQNVTDVPTSIDWREKGA 142
N+F+D++NEEFR +Y + S R+ + + K D PTS+DWR+ G
Sbjct: 99 LNKFADMSNEEFREVYVSKVKKPTSKRMAIERRRQGKAAAAKAVAACDGPTSLDWRKYGI 158
Query: 143 VTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAF 202
VT +KDQG CGSCWAFS+ A+EGI + G LI LSEQ+LVDC + N GC GG MD AF
Sbjct: 159 VTGVKDQGDCGSCWAFSSTGAIEGINALANGDLISLSEQELVDCDSTNDGCEGGYMDYAF 218
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
E+++ N G+ TE DYPY E+GTC+ KE+ A +I YED+ + +E AL AV QP+S
Sbjct: 219 EWVMSNGGIDTETDYPYTGEDGTCNTTKEETKAVSIDGYEDVAE-EESALFCAVLKQPIS 277
Query: 263 VCVDASGRAFHFYKSGVL---NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
V +D F Y G+ +D ++ DH V VVG+G E+G +YW+IKNSWG W
Sbjct: 278 VGIDGGAIDFQLYTGGIYDGDCSDDPDDIDHAVLVVGYGA---ESGEEYWIIKNSWGTDW 334
Query: 320 GESGYIRILR----DAGLCGIATAASYPV 344
G GY I R D G+C I ASYP
Sbjct: 335 GMKGYAYIKRNTSKDYGVCAINAMASYPT 363
>gi|449673497|ref|XP_002169904.2| PREDICTED: cathepsin L-like [Hydra magnipapillata]
Length = 325
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 146/337 (43%), Positives = 198/337 (58%), Gaps = 22/337 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ + L+ C ++ + + E S W H + Y E E+ +R I+K N+ I
Sbjct: 4 LIFVSLITLCFGYIIE-KPIRESSWY----VWKMAHNKAYSHESEENVRYAIWKDNMNRI 58
Query: 74 EKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
+ N + ++ L N F D+TN EFRA G + + STF + T P
Sbjct: 59 TEYNSK-SKNVILRMNHFGDMTNTEFRAKMNGL------LLHKHQNGSTFLVPSHTAAPD 111
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
++DWR +G VT +K+QGQCGSCWAFS+ A+EG G+L+ LSEQ LVDCSTD N+
Sbjct: 112 AVDWRSEGYVTPVKNQGQCGSCWAFSSTGALEGQHFKKTGRLVSLSEQNLVDCSTDYGNN 171
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC+GGLMD AF YI N G+ TE YPY ++GTC K ++ A + + D+P+GDE A
Sbjct: 172 GCNGGLMDNAFSYIKANGGIDTETGYPYEGQDGTCRYSKS-SIGADDTGFVDIPEGDEDA 230
Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNC-DHGVAVVGFGTAEEENGAKY 308
L QAV+ PVSV +DAS +F FY SGV + C + DHGV VVG+GT +NG Y
Sbjct: 231 LKQAVATVGPVSVAIDASHMSFQFYHSGVYDEPQCSPSALDHGVLVVGYGT---DNGKDY 287
Query: 309 WLIKNSWGETWGESGYIRILR-DAGLCGIATAASYPV 344
WL+KNSWG WG GYI + R + CGIA+ ASYP+
Sbjct: 288 WLVKNSWGTGWGTEGYIYMSRNNQNQCGIASKASYPL 324
>gi|402770507|gb|AFQ98388.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 145/340 (42%), Positives = 204/340 (60%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF ++ GY+ SR+S + NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGYHG-----SRKSGGSTFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P ++DWR+KGAVT +KDQGQCGSCWAFS ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSTTGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC GGLM+ AF+YI N G+ TE YPY +G C +KE V AT + Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGC 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
E L +AV+ P+SV +DAS +F Y GV + +C + + DHGV VVG+G + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|424513619|emb|CCO66241.1| predicted protein [Bathycoccus prasinos]
Length = 396
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 138/333 (41%), Positives = 196/333 (58%), Gaps = 22/333 (6%)
Query: 31 RSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLG 87
R + E I + + W+ ++ + + E+ RL IF +N ++ + N + G ++ +
Sbjct: 61 RVLRESKIEDAFDAWLVKYDKEIANAEERLKRLKIFGENYLFVLEHNAKYVAGKVSHYVE 120
Query: 88 TNEFSDLTNEEFRALYTGYNRPV---PSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
N+F+ T EE+R + G+ + + + S ++Y+ V + P SIDW ++G +T
Sbjct: 121 MNKFAAHTREEYRKML-GFKKSLRRKKDSGEAAKDVSLWEYEGV-EAPESIDWVDEGVIT 178
Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAF 202
K+QG CGSCWAFSA+ AVEGI I GKL+ LSEQ+LV C+ + N GC+GGLMD AF
Sbjct: 179 TPKNQGSCGSCWAFSAIGAVEGINAIRTGKLVSLSEQELVSCAREGGNQGCNGGLMDNAF 238
Query: 203 EYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVS 262
E+I+EN G+ +E Y Y+ C +K A+I + D+P DE AL +AVS QPVS
Sbjct: 239 EWIVENGGVDSEKQYQYKASFDDCKTRKTLLHIASIDGFNDVPSNDETALKKAVSQQPVS 298
Query: 263 VCVDASGRAFHFYKSGVLNA-DCGNNCDHGVAVVGFGTAEEENGA-------KYWLIKNS 314
V ++A R+F Y GV +A DCG DHGV VVG+G + KYW IKNS
Sbjct: 299 VAIEADQRSFQLYGGGVYHAEDCGTQLDHGVLVVGYGIDHNSSNVIIPGATKKYWKIKNS 358
Query: 315 WGETWGESGYIRILRD----AGLCGIATAASYP 343
W E WGE GYIRI RD +G+CG+A ASYP
Sbjct: 359 WSEQWGEGGYIRIARDVESPSGMCGVAEMASYP 391
>gi|402770517|gb|AFQ98393.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 205/340 (60%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF ++ G++ +R++ S NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSSFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P +DWR+KGAVT +KDQGQCGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC GGLM+ AF+YI N G+ TE YPY+ +G C +KE V AT + Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYKAVDGECRFKKED-VGATDTGYVEIKAGS 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
E L +AV+ P+SV +DAS +F Y GV + +C + + DHGV VVG+G + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|310975577|gb|ADP55137.1| cathepsin S [Miichthys miiuy]
Length = 338
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 145/336 (43%), Positives = 194/336 (57%), Gaps = 19/336 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
++L CA +M + + E W HG+TY++ +E R ++++NL I
Sbjct: 13 LLLFSLCAGAA----AMFDSKLDGHWELWKKMHGKTYRNYVEDESRRELWEKNLVLITMH 68
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N E G TYKL N DLT EE + P + R PS F + VP
Sbjct: 69 NLEASMGLHTYKLSMNHMGDLTPEEIMQSFATLTPPT-DIQRA---PSPFAGTSGAAVPD 124
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
++DWREKG VT +K QG CGSCWAFSA A+EG T GKL++LS Q LVDCST NH
Sbjct: 125 TMDWREKGCVTSVKMQGACGSCWAFSAAGALEGQLAKTTGKLVDLSPQNLVDCSTKYGNH 184
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC+GG M KAF+Y+I+N G+ ++A YPY + + K AA S+Y LP+GDE A
Sbjct: 185 GCNGGFMHKAFQYVIDNHGIDSDAAYPYTGRQSQECHYSPKFRAANCSQYSFLPEGDEGA 244
Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYW 309
L QA++ P+SV +DA F FY SGV + C + +HGV VG+GT NG YW
Sbjct: 245 LKQALATIGPISVAIDARRPRFAFYSSGVYDDPSCSQDVNHGVLAVGYGTL---NGQDYW 301
Query: 310 LIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
L+KNSWG+T+G++GYIR+ R+ CGIA YP+
Sbjct: 302 LVKNSWGQTFGDNGYIRMARNKNDQCGIARYGCYPI 337
>gi|356582227|ref|NP_001239115.1| cathepsin L1 precursor [Canis lupus familiaris]
gi|62899810|sp|Q9GL24.1|CATL1_CANFA RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|10185020|emb|CAC08809.1| cathepsin L [Canis lupus familiaris]
Length = 333
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/337 (41%), Positives = 202/337 (59%), Gaps = 18/337 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
+ L C + S + S+ + QW A H R Y E+ R ++++N++ IE
Sbjct: 5 LFLTALCLG-IASAAPKFDQSLNAQWYQWKATHRRLYGMN-EEGWRRAVWEKNMKMIELH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N+E G + + N F D+TNEEFR + G+ +++ + F+ ++P
Sbjct: 63 NREYSQGKHGFTMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGKMFQEPLFAEIPK 116
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNH 191
S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS N
Sbjct: 117 SVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRAQGNE 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC+GGLMD AF Y+ +N GL +E YPY + N K + AA + + DLP+ E+A
Sbjct: 177 GCNGGLMDNAFRYVKDNGGLDSEESYPYLGRDTETCNYKPECSAANDTGFVDLPQ-REKA 235
Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKY 308
L++AV+ P+SV +DA ++F FYKSG+ + DC + + DHGV VVG+G ++ K+
Sbjct: 236 LMKAVATLGPISVAIDAGHQSFQFYKSGIYFDPDCSSKDLDHGVLVVGYGFEGTDSNNKF 295
Query: 309 WLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
W++KNSWG WG +GY+++ +D CGIATAASYP
Sbjct: 296 WIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 332
>gi|402770501|gb|AFQ98385.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 256 bits (654), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 205/340 (60%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF ++ G++ +R++ + NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSTFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P ++DWR+KGAVT +KDQGQCGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC GGLM+ AF+YI N G+ TE YPY +G C +KE V AT + Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
E L +AV+ P+SV +DAS +F Y GV + +C + + DHGV VVG+G + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|383410403|gb|AFH28415.1| cathepsin L1 preproprotein [Macaca mulatta]
Length = 333
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 205/343 (59%), Gaps = 23/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F++ + AS ++ S+ + +W A H R Y E+ R ++++N++
Sbjct: 3 PTFILAAFCLGIASATLT----FNHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
IE N+E G ++ + N F D+T+EEFR + G+ +R+ + F+
Sbjct: 58 MIELHNQEYSQGKHSFTMAMNTFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLF 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 YEAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSG 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+Y+ +N GL +E YPY E +C E +V A + + D+PK
Sbjct: 172 PQGNEGCNGGLMDYAFQYVADNGGLDSEESYPYEATEESCKYNPEYSV-ANDTGFVDIPK 230
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 -QEKALMKAVATVGPISVAIDAGHESFMFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTE 289
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ +KYWL KNSWGE WG GYI++ +D CGIA+AASYP
Sbjct: 290 SDNSKYWLGKNSWGEEWGMGGYIKMAKDRRNHCGIASAASYPT 332
>gi|21593501|gb|AAM65468.1| cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/321 (43%), Positives = 198/321 (61%), Gaps = 17/321 (5%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E ++ +EQW+ ++G+ Y EK R IFK NL+ IE+ N + NR+Y+ G N+FSD
Sbjct: 33 NEGGVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSD 92
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT-HIKDQGQC 152
LT +EF+A Y G S+S + R ++Y+ +P +DWRE+GAV +K QG+C
Sbjct: 93 LTADEFQASYLGGKMEKKSLSDVAER---YQYKEGDVLPDEVDWRERGAVVPRVKRQGEC 149
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC--STDNHGCSGGLMDKAFEYIIENKG 210
GSCWAF+A AVEGI QIT G+L+ LSEQ+L+DC DN GC+GG AFE+I EN G
Sbjct: 150 GSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGG 209
Query: 211 LATEADYPYRHEE-GTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
+ ++ Y Y E+ C + K TI+ +E +P DE +L +AV+ QP+SV + A+
Sbjct: 210 IVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA 269
Query: 269 GRAFHFYKSGVLNADCGNNC-DHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
+ YKSGV C N DH V +VG+GT+ +E YWLI+NSWG WGE GY+R+
Sbjct: 270 NMS--DYKSGVYKGACSNLWGDHNVLIVGYGTSSDE--GDYWLIRNSWGPEWGEGGYLRL 325
Query: 328 LRD----AGLCGIATAASYPV 344
R+ G C +A A YP+
Sbjct: 326 QRNFHEPTGKCAVAVAPVYPI 346
>gi|7523482|dbj|BAA94210.1| putative cysteine protease [Oryza sativa Japonica Group]
gi|10800060|dbj|BAB16480.1| putative cysteine protease [Oryza sativa Japonica Group]
Length = 349
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 136/322 (42%), Positives = 184/322 (57%), Gaps = 27/322 (8%)
Query: 39 VEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
+ E+WMA+ G+ Y EK R +F+ N+ +I L N+F+DLTN+E
Sbjct: 38 TQMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDE 97
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
F + +TG P P + + P +P IDWR KGAVT +KDQG CGSCWAF
Sbjct: 98 FVSTHTGAKPPCPKDAPRGVDP--------IWLPCCIDWRYKGAVTDVKDQGACGSCWAF 149
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
+AVAA+EG+TQI GKL LSEQ+LVDC T + GC+GG D+AFE + G+ E+ Y
Sbjct: 150 AAVAAIEGLTQIRTGKLTPLSEQELVDCDTGSSGCAGGHTDRAFELVAAKGGITAESGYR 209
Query: 219 YRHEEGTCDNQKEKAV---AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
Y G C + + A+ AA I + +P GDE+ L AV+ QPV+ +DASG AF FY
Sbjct: 210 YEGYRGKC--RADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFY 267
Query: 276 KSGVLNADCGN---------NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
SGV CG+ +H V +VG+ + +G KYW+ KNSWG+TWGE GYI
Sbjct: 268 GSGVFPGPCGSGSGAAAAAPTTNHAVTLVGY-CQDGASGKKYWVAKNSWGKTWGEKGYIL 326
Query: 327 ILRDA----GLCGIATAASYPV 344
+ +D G CG+A + YP
Sbjct: 327 LEKDVASPHGTCGVAVSPFYPT 348
>gi|326430491|gb|EGD76061.1| cathepsin [Salpingoeca sp. ATCC 50818]
Length = 381
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 133/276 (48%), Positives = 169/276 (61%), Gaps = 22/276 (7%)
Query: 51 RTYKDELEKAMRLNIFKQNLEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRALYTGYN 107
+ Y+ E+A R IF NL +I + N E R T+ +G N+F+DLTNEE+R LY
Sbjct: 29 KQYESPEEEARRFAIFADNLAFIARHNAEAARGLHTHTVGVNQFADLTNEEYRQLYL--- 85
Query: 108 RPVPSVSRQSSRPSTFKYQNVTDVPT--SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVE 165
RP P+ R + D P S+DWR+KGAVT IK+QGQCGSCW+FS +VE
Sbjct: 86 RPYPTELLGRERQEVW-----LDGPNAGSVDWRQKGAVTPIKNQGQCGSCWSFSTTGSVE 140
Query: 166 GITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEE 223
G I G L+ LSEQQLVDCS N GC+GGLMD AF+YII N GL TE DYPY +
Sbjct: 141 GAHAIATGNLVSLSEQQLVDCSGSFGNQGCNGGLMDNAFKYIISNGGLDTEQDYPYTARD 200
Query: 224 GTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNAD 283
G CD KE A +IS Y+D+P+ +E L AV PVSV ++A ++F Y SGV +
Sbjct: 201 GVCDKSKESKHAVSISGYKDVPQNNEDQLAAAVEKGPVSVAIEADQQSFQMYSSGVFSGP 260
Query: 284 CGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
CG N DHGV VVG+ + YW++KNSWG +W
Sbjct: 261 CGTNLDHGVLVVGYTS-------DYWIVKNSWGASW 289
>gi|426219849|ref|XP_004004130.1| PREDICTED: cathepsin L1 isoform 1 [Ovis aries]
gi|426219851|ref|XP_004004131.1| PREDICTED: cathepsin L1 isoform 2 [Ovis aries]
Length = 334
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 143/343 (41%), Positives = 205/343 (59%), Gaps = 22/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F + +L + V S +P++ QW A H R Y E+ R ++++N +
Sbjct: 3 PSFFLTVLCLG----VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEGWRRAVWEKNKK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
I+ N+E G + + N F D+TNEEFR + G+ +++ + F+ +
Sbjct: 58 IIDLHNQEYSQGKHGFSMAMNAFGDMTNEEFRQVMNGFQ------NQKRKKGKLFREPLL 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
DVP S+DW +KG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 IDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171
Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+YI EN GL +E YPY + + N K + AA + + D+P+
Sbjct: 172 PQGNQGCNGGLMDNAFQYIKENGGLDSEESYPYLATDTSSCNYKPECSAANDTGFVDIPQ 231
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYKSG+ + DC + + DHGV VVG+G +
Sbjct: 232 -REKALMKAVATVGPISVAIDAGHASFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
N K+W++KNSWG WG +GY+++ +D CGIATAASYP
Sbjct: 291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 333
>gi|115438530|ref|NP_001043562.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|11034572|dbj|BAB17096.1| cysteine proteinase-like [Oryza sativa Japonica Group]
gi|113533093|dbj|BAF05476.1| Os01g0613500 [Oryza sativa Japonica Group]
gi|215697766|dbj|BAG91959.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 360
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 190/322 (59%), Gaps = 16/322 (4%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLT 95
S+ +HE+WMA+ GR Y D EKA R+ +F N E ++ AN+ G +RTY LG N+FSDLT
Sbjct: 38 SMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDLT 97
Query: 96 NEEFRALYTGYN-RPVPSVSRQSSRP---STFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
++EF + GY+ P P R R + + TDVP S+DWR +GAVT +K+Q
Sbjct: 98 DDEFAQTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQRS 157
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGL 211
CGSCWAF+AVAA EG+ Q+ G L+ LSEQQ++DC+ + CSGG + A YI + GL
Sbjct: 158 CGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDCTGGANTCSGGDVSAALRYIAASGGL 217
Query: 212 ATEADYPYRHEEGTCD----NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
TEA Y Y ++G C A A +++ L GDE AL + QPV V V+A
Sbjct: 218 QTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARL-YGDEGALQALAAGQPVVVVVEA 276
Query: 268 SGRAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
S F Y+SGV +A CG +H V VV A + G +YWL+KN WG WGE GY+
Sbjct: 277 SEPDFRHYRSGVYAGSAACGRRLNHAVTVV-GYGAAADGGGEYWLVKNQWGTWWGEGGYM 335
Query: 326 RILRD---AGLCGIATAASYPV 344
R+ R G CGIAT A YP
Sbjct: 336 RVARGGAAGGNCGIATYAFYPT 357
>gi|18407678|ref|NP_566867.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|30315950|sp|Q9LXW3.1|CPR2_ARATH RecName: Full=Probable cysteine proteinase At3g43960; Flags:
Precursor
gi|7594557|emb|CAB88124.1| cysteine proteinase-like protein [Arabidopsis thaliana]
gi|26452289|dbj|BAC43231.1| putative cysteine proteinase [Arabidopsis thaliana]
gi|332644328|gb|AEE77849.1| putative cysteine proteinase [Arabidopsis thaliana]
Length = 376
Score = 256 bits (653), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 140/321 (43%), Positives = 198/321 (61%), Gaps = 17/321 (5%)
Query: 34 HEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSD 93
+E ++ +EQW+ ++G+ Y EK R IFK NL+ IE+ N + NR+Y+ G N+FSD
Sbjct: 33 NEGEVLTMYEQWLVENGKNYNGLGEKERRFKIFKDNLKRIEEHNSDPNRSYERGLNKFSD 92
Query: 94 LTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT-HIKDQGQC 152
LT +EF+A Y G S+S + R ++Y+ +P +DWRE+GAV +K QG+C
Sbjct: 93 LTADEFQASYLGGKMEKKSLSDVAER---YQYKEGDVLPDEVDWRERGAVVPRVKRQGEC 149
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKG 210
GSCWAF+A AVEGI QIT G+L+ LSEQ+L+DC DN GC+GG AFE+I EN G
Sbjct: 150 GSCWAFAATGAVEGINQITTGELVSLSEQELIDCDRGNDNFGCAGGGAVWAFEFIKENGG 209
Query: 211 LATEADYPYRHEE-GTCDNQKEKAV-AATISKYEDLPKGDEQALLQAVSNQPVSVCVDAS 268
+ ++ Y Y E+ C + K TI+ +E +P DE +L +AV+ QP+SV + A+
Sbjct: 210 IVSDEVYGYTGEDTAACKAIEMKTTRVVTINGHEVVPVNDEMSLKKAVAYQPISVMISAA 269
Query: 269 GRAFHFYKSGVLNADCGNNC-DHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
+ YKSGV C N DH V +VG+GT+ +E YWLI+NSWG WGE GY+R+
Sbjct: 270 NMS--DYKSGVYKGACSNLWGDHNVLIVGYGTSSDE--GDYWLIRNSWGPEWGEGGYLRL 325
Query: 328 LRD----AGLCGIATAASYPV 344
R+ G C +A A YP+
Sbjct: 326 QRNFHEPTGKCAVAVAPVYPI 346
>gi|158268253|gb|ABW25046.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 156/359 (43%), Positives = 211/359 (58%), Gaps = 33/359 (9%)
Query: 13 MFVIIILVITCASQVVS--GRSMHEPSIVEKHEQWMAQH---------------GRTY-K 54
MF ++ LV+ CAS S S H+ +I + + Q G++Y K
Sbjct: 1 MFRLLSLVLLCASVFASIDSGSRHDHTIRLHRVKSLRQKIDEAFKLWDDYKESFGKSYNK 60
Query: 55 DELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVP 111
DE M F +N+ +I++ N+E G +T+++G N +DL ++R L +R
Sbjct: 61 DEENDYME--AFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKLNGYRHRRNF 118
Query: 112 SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQIT 171
S QS+ NV ++P S+DWR+KG VT +K+QG CGSCWAFSA A+EG
Sbjct: 119 GDSMQSNGTKWLAPFNV-EIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARA 177
Query: 172 RGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ 229
GK++ LSEQ LVDCST NHGC+GGLMD AFEYI +N G+ TE YPY E C +
Sbjct: 178 SGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKC-HF 236
Query: 230 KEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGN- 286
K+K + A + DLP+GDE+AL AV+ Q P+S+ +DA R F YK GV + +C +
Sbjct: 237 KKKDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSE 296
Query: 287 NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
DHGV +VG+GT E YWLIKNSWG WGE GYIRI R+ + CG+AT ASYP+
Sbjct: 297 ELDHGVLLVGYGTDPE--AGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPL 353
>gi|125526835|gb|EAY74949.1| hypothetical protein OsI_02845 [Oryza sativa Indica Group]
Length = 360
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 144/322 (44%), Positives = 190/322 (59%), Gaps = 16/322 (4%)
Query: 37 SIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEG-NRTYKLGTNEFSDLT 95
S+ +HE+WMA+ GR Y D EKA R+ +F N E ++ AN+ G +RTY LG N+FSDLT
Sbjct: 38 SMAARHERWMARFGRAYADAAEKARRMEVFAANAERVDAANRAGGDRTYTLGLNQFSDLT 97
Query: 96 NEEFRALYTGYN-RPVPSVSRQSSRP---STFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
++EF + GY+ P P R R + + TDVP S+DWR +GAVT +K+Q
Sbjct: 98 DDEFARTHLGYSWAPPPPSHRHGHRAENGTAAAAADDTDVPDSVDWRARGAVTEVKNQRS 157
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGL 211
CGSCWAF+AVAA EG+ Q+ G L+ LSEQQ++DC+ + CSGG + A YI + GL
Sbjct: 158 CGSCWAFAAVAATEGLVQLATGNLVSLSEQQVLDCTGGANTCSGGDVSAALRYIAASGGL 217
Query: 212 ATEADYPYRHEEGTCD----NQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDA 267
TEA Y Y ++G C A A +++ L GDE AL + QPV V V+A
Sbjct: 218 QTEAAYAYGGQQGACRAGGFAAPNSAAAVGGARWARL-YGDEGALQALAAGQPVVVVVEA 276
Query: 268 SGRAFHFYKSGVL--NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
S F Y+SGV +A CG +H V VV A + G +YWL+KN WG WGE GY+
Sbjct: 277 SEPDFRHYRSGVYAGSAACGRRLNHAVTVV-GYGAAADGGGEYWLVKNQWGTWWGEGGYM 335
Query: 326 RILRD---AGLCGIATAASYPV 344
R+ R G CGIAT A YP
Sbjct: 336 RVARGGAAGGNCGIATYAFYPT 357
>gi|242020003|ref|XP_002430447.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
gi|212515585|gb|EEB17709.1| Cathepsin L precursor, putative [Pediculus humanus corporis]
Length = 345
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 144/348 (41%), Positives = 211/348 (60%), Gaps = 22/348 (6%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++ + +T S ++ S ++ ++E+ + + A+H + Y +++E+ R+ IF N +
Sbjct: 1 MKILFFIALTVLS--INAVSFYD-LVMEEWQLFKAEHKKNYNNDVEEKFRMKIFMDNKQK 57
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPV-PSVSRQSS-----RPSTF 123
I K N + G YKLG N++SD+ + EF + G+N+ + P R ++ + S F
Sbjct: 58 ITKHNTKYQRGEVGYKLGLNKYSDMLHHEFINTFNGFNKSIIPPHLRSNNGKTHLKGSFF 117
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
+P +DW + GAVT +KDQG CGSCWAFSA A+EG+ L+ LSEQ L
Sbjct: 118 IPPANVKLPKHVDWVKLGAVTPVKDQGHCGSCWAFSATGALEGLHFRKTKVLVSLSEQNL 177
Query: 184 VDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKY 241
+DCST+ N+GC+GGLMD+AF+Y+ N G+ TE YPY C + E + A + Y
Sbjct: 178 IDCSTEEGNNGCNGGLMDQAFQYVRINGGIDTERSYPYEGNNDVCRYEPENS-GAIDTGY 236
Query: 242 EDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN---NCDHGVAVVG 296
D+P GDE AL AV+ PVSV +DAS +F Y SGV +C N + DHGV VVG
Sbjct: 237 TDVPLGDEDALKSAVATVGPVSVAIDASQESFQLYSSGVYFEPNCKNEPESLDHGVLVVG 296
Query: 297 FGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYP 343
+GT +EE YWL+KNSWG++WGE+GYI++ R+A CGIAT S+P
Sbjct: 297 YGT-DEETQQDYWLVKNSWGDSWGENGYIKMARNADNQCGIATQPSFP 343
>gi|225719768|gb|ACO15730.1| Cathepsin L1 precursor [Caligus clemensi]
Length = 338
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 145/341 (42%), Positives = 202/341 (59%), Gaps = 18/341 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQ-WMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
+ + +++ C S V + S +E H W H + Y E+ R ++++NL+ I
Sbjct: 4 LYLAVLVLCVSAVCAAPRF--DSQLEDHWHLWKNWHSKNYHAS-EEGWRRMVWEKNLKKI 60
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N E G +++LG N F D+TNEEFR GY + + + + S F N
Sbjct: 61 EIHNLEHTMGKHSHRLGMNHFGDMTNEEFRQTMNGYKQ----TTERKFKGSLFMEPNYLQ 116
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
P ++DWREKG VT +KDQG CGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 117 APKAVDWREKGYVTPVKDQGSCGSCWAFSTTGAMEGQPFRKTGKLVSLSEQNLVDCSRPE 176
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD+AF+YI +N GL TE YPY + + K + AA + + D+P G
Sbjct: 177 GNEGCNGGLMDQAFQYIQDNAGLDTEESYPYVGTDEDPCHYKPEFSAANETGFVDIPSGK 236
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEEN 304
E A+++AV+ PVSV +DA +F FY+SG+ +C + DHGV VVG+G E+ +
Sbjct: 237 EHAMMKAVAAVGPVSVAIDAGHESFQFYESGIYYEKECSSEELDHGVLVVGYGFEGEDVD 296
Query: 305 GAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
G KYW++KNSW E WG+ GYI + +D CGIATA+SYP+
Sbjct: 297 GKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATASSYPL 337
>gi|346466067|gb|AEO32878.1| hypothetical protein [Amblyomma maculatum]
Length = 358
Score = 256 bits (653), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 144/343 (41%), Positives = 207/343 (60%), Gaps = 24/343 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
FV++ + A+ + H+ + + + A HG+ Y E E+ RL I+ +N I
Sbjct: 27 FVVLGCLFVTAAAIT-----HQELVGAEWSAFKALHGKEYHSETEEYYRLKIYMENRLKI 81
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS---RPSTFKYQN 127
+ N++ +YKL NEF DL + EF + G+ R S R+ S P + ++
Sbjct: 82 ARHNEKYANNKASYKLAMNEFGDLLHHEFVSTRNGFKRNYRSTPREGSFYIEPEGIEDKH 141
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
+ P ++DWR+KGAVT +K+QGQCGSCWAFS ++EG G+++ LSEQ LVDCS
Sbjct: 142 L---PKTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGRMVSLSEQNLVDCS 198
Query: 188 TD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
N+GC GGLMD AF+YI N G+ TE YPY +G C +K V AT + + D+P
Sbjct: 199 GKFGNNGCEGGLMDNAFKYIKANGGIDTELSYPYNGTDGICHFEKSD-VGATDTGFVDIP 257
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEE 302
+G+EQ L +AV+ PVSV +DAS +F FY GV + +C + + DHGV VVG+GT
Sbjct: 258 EGNEQLLKKAVATVGPVSVAIDASHESFQFYSQGVYDEPECSSESLDHGVLVVGYGT--- 314
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
++G YWL+KNSWG TWG+ GYI + R+ CGIA++ASYP+
Sbjct: 315 KDGQDYWLVKNSWGTTWGDDGYIYMTRNKENQCGIASSASYPL 357
>gi|91092014|ref|XP_970644.1| PREDICTED: similar to cathepsin-L-like cysteine peptidase 02
[Tribolium castaneum]
gi|270001249|gb|EEZ97696.1| cathepsin L precursor [Tribolium castaneum]
Length = 337
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 147/344 (42%), Positives = 202/344 (58%), Gaps = 23/344 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNL 70
F++ + + SQ VS + + EQW A H + Y+ E E+ R+ IF +N
Sbjct: 3 FLVFVALCVVGSQAVSFFDLVQ-------EQWGAFKVTHKKQYESETEERFRMKIFMENA 55
Query: 71 EYIEKANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRP-VPSVSRQSSRPSTFKYQ 126
+ K NK +G ++KLG N++SD+ N EF GYNR P S + TF
Sbjct: 56 HKVAKHNKLYAQGLVSFKLGVNKYSDMLNHEFVHTLNGYNRSKTPLRSGELDESITFIPP 115
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
++P IDWR+ GAVT +KDQGQCGSCW+FS ++EG KL+ LSEQ L+DC
Sbjct: 116 ANVELPKQIDWRKLGAVTPVKDQGQCGSCWSFSTTGSLEGQHFRKSKKLVSLSEQNLIDC 175
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N+GC+GGLMD AF YI +N G+ TE YPY+ E+ C + K + AT + D+
Sbjct: 176 SEKYGNNGCNGGLMDNAFRYIKDNGGIDTEQSYPYKAEDEKC-HYKPRNKGATDRGFVDI 234
Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAE 301
GDE+ L AV+ P+SV +DAS F Y GV +C + DHGV VVG+GT
Sbjct: 235 ESGDEEKLKAAVATVGPISVAIDASHPTFQQYSEGVYYEPECSSEQLDHGVLVVGYGT-- 292
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+E+G YWL+KNSWG++WG+ GYI++ R+ CGIAT ASYP+
Sbjct: 293 DEDGNDYWLVKNSWGDSWGDQGYIKMARNRDNNCGIATQASYPL 336
>gi|89272015|emb|CAJ83143.1| cathepsin L2 [Xenopus (Silurana) tropicalis]
Length = 335
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 206/342 (60%), Gaps = 18/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + + + C + V + + +P++ W H ++Y + E+ R ++++NL
Sbjct: 1 MALYLGIAAICLTTVFAAPTT-DPALDNHWNLWKNWHKKSYAPK-EEGWRRVLWEKNLRM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G ++ LG N+F D+TNEEFR L GY +++ R STF N
Sbjct: 59 IEFHNLEHSLGKHSHSLGMNQFGDMTNEEFRQLMNGYK------NQKKIRGSTFLAPNNF 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
+ P S+DWR+KG VT +KDQGQCGSCWAFS A+EG GK+I LSEQ LVDCS
Sbjct: 113 ESPKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRNTGKMISLSEQNLVDCSRA 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+Y+ +N G+ +E YPY ++ + +A + + D+
Sbjct: 173 QGNQGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNYNSANDTGFVDVTSE 232
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+ L+ AV++ PVSV VDA ++F FYKSG+ +C + + DHGV VVG+G E+E
Sbjct: 233 SEKDLMNAVASVGPVSVAVDAGHQSFQFYKSGIYYEPECSSEDLDHGVLVVGYGFEGEDE 292
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+G KYW++KNSW E WG GYI I +D CGIATAASYP+
Sbjct: 293 DGKKYWIVKNSWSEKWGNDGYIYIAKDRHNHCGIATAASYPL 334
>gi|256082975|ref|XP_002577726.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 1471
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 141/312 (45%), Positives = 195/312 (62%), Gaps = 21/312 (6%)
Query: 48 QHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYT 104
Q R Y E+ R IF N + + N +EG TYK+G NEF+D T+ E + L
Sbjct: 66 QFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKMGVNEFTDKTDYELKKL-R 124
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
GY ++ + S TF T +P+ +DWR +GAVT +K+QGQCGSCWAFS A+
Sbjct: 125 GYKVTSGAIRHKGS---TFIRSEHTKLPSKVDWRREGAVTDVKNQGQCGSCWAFSTTGAI 181
Query: 165 EGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
EG +L+ LSEQQLVDCS N+GCSGGLM+ AFEY+ +N+G+ +E YPY
Sbjct: 182 EGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRDNEGIDSEISYPYVSG 241
Query: 223 EGTCDNQ---KEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSG 278
+GT +N+ + A ++ Y ++ +GDE+AL+ AV+ + PVSV ++A +F YKSG
Sbjct: 242 DGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGPVSVAINAGLPSFSMYKSG 301
Query: 279 VL-NADCG---NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
+ + DC + DHGV VVG+G EENG YWLIKNSWGE WGE GYI+I + + +
Sbjct: 302 IYSDTDCEGTLDALDHGVLVVGYG---EENGRSYWLIKNSWGEEWGEKGYIKISKGSHNM 358
Query: 334 CGIATAASYPVA 345
CG+A+AASYP+
Sbjct: 359 CGVASAASYPLV 370
>gi|356545079|ref|XP_003540973.1| PREDICTED: LOW QUALITY PROTEIN: vignain-like [Glycine max]
Length = 330
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 135/301 (44%), Positives = 186/301 (61%), Gaps = 20/301 (6%)
Query: 24 ASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRT 83
ASQV R++ + S+ E+HE+WM+++G+ YKD E+ R IFK+N+ YIE +N +
Sbjct: 5 ASQVTC-RTLQDASMYERHEEWMSRYGKVYKDPREREKRFRIFKENMNYIETSNNVAIKP 63
Query: 84 YKLGTNEFSDLTNEEF---RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREK 140
KL N+F+DL NEEF R ++ G + R SR TF + P +K
Sbjct: 64 XKLVINQFADLNNEEFIAPRNIFKGM-----ILCRFLSRKHTFPF------PYVFLGHKK 112
Query: 141 GAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLM 198
GAVT +KDQG CG CWAF VA+ EGI +T GKLI LSEQ+LVDC T + GC GLM
Sbjct: 113 GAVTPVKDQGHCGFCWAFYDVASTEGILALTAGKLISLSEQELVDCDTKGVDQGCECGLM 172
Query: 199 DKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN 258
D AF++II+N G+ +A+YPY+ +G C+ +E AATI+ ED+P +E+AL + V+N
Sbjct: 173 DDAFKFIIQNHGV-XDANYPYKGVDGKCNANEEANPAATITGXEDVPANNEKALQKVVAN 231
Query: 259 QPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGET 318
QPV V +DA F FYKSGV C +HGV +G+G + + G +YWL+KNS
Sbjct: 232 QPVFVAIDACDSDFQFYKSGVFTGSCETELNHGVTTMGYGVSHD--GTQYWLVKNSXETE 289
Query: 319 W 319
W
Sbjct: 290 W 290
>gi|218187750|gb|EEC70177.1| hypothetical protein OsI_00904 [Oryza sativa Indica Group]
gi|222617983|gb|EEE54115.1| hypothetical protein OsJ_00884 [Oryza sativa Japonica Group]
Length = 327
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 136/322 (42%), Positives = 184/322 (57%), Gaps = 27/322 (8%)
Query: 39 VEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEE 98
+ E+WMA+ G+ Y EK R +F+ N+ +I L N+F+DLTN+E
Sbjct: 16 TQMFEEWMAKFGKKYPCHGEKEYRFGVFRDNVRFIRSYRPPAGYNSALRVNQFADLTNDE 75
Query: 99 FRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAF 158
F + +TG P P + + P +P IDWR KGAVT +KDQG CGSCWAF
Sbjct: 76 FVSTHTGAKPPCPKDAPRGVDPIW--------LPCCIDWRYKGAVTDVKDQGACGSCWAF 127
Query: 159 SAVAAVEGITQITRGKLIELSEQQLVDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
+AVAA+EG+TQI GKL LSEQ+LVDC T + GC+GG D+AFE + G+ E+ Y
Sbjct: 128 AAVAAIEGLTQIRTGKLTPLSEQELVDCDTGSSGCAGGHTDRAFELVAAKGGITAESGYR 187
Query: 219 YRHEEGTCDNQKEKAV---AATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFY 275
Y G C + + A+ AA I + +P GDE+ L AV+ QPV+ +DASG AF FY
Sbjct: 188 YEGYRGKC--RADDALFNHAARIGGHRAVPPGDERQLATAVARQPVTAYIDASGPAFQFY 245
Query: 276 KSGVLNADCGN---------NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
SGV CG+ +H V +VG+ + +G KYW+ KNSWG+TWGE GYI
Sbjct: 246 GSGVFPGPCGSGSGAAAAAPTTNHAVTLVGY-CQDGASGKKYWVAKNSWGKTWGEKGYIL 304
Query: 327 ILRDA----GLCGIATAASYPV 344
+ +D G CG+A + YP
Sbjct: 305 LEKDVASPHGTCGVAVSPFYPT 326
>gi|149617838|ref|XP_001521715.1| PREDICTED: cathepsin L1-like [Ornithorhynchus anatinus]
Length = 338
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 145/342 (42%), Positives = 209/342 (61%), Gaps = 15/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M +++ LV C VS + + + + W H ++Y E E+ R ++++NL+
Sbjct: 1 MNLLVCLVSLCWGLAVSA-PLGDSELDRHWKLWKNWHQKSYH-EAEEGWRRTVWEENLKA 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I+ N E G TY+LG N+F DLTNEEF+ + TG R +R + S F N
Sbjct: 59 IQLHNLEQSLGLHTYRLGMNQFGDLTNEEFQEILTG-ERHFSKGNRING--SAFLEANFV 115
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
VPTS+DWR+ G VT +K+QG CGSCWAFS A+EG G+LI LSEQ LVDCS
Sbjct: 116 QVPTSVDWRDHGYVTPVKNQGHCGSCWAFSTTGALEGQLFRKSGRLISLSEQNLVDCSWQ 175
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC GG++D AF+YI++N+G+ +E YPY ++ K + A ++ + D+P
Sbjct: 176 QGNQGCHGGIVDLAFQYILQNQGIDSEDCYPYTAKDTAQCTFKPECATAPVTGFVDIPPH 235
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ PVSV +DAS +F FY+SG+ + C + + DH V VVG+G E+E
Sbjct: 236 SEEALMKAVATVGPVSVGIDASSTSFRFYQSGIFYDPKCSSESLDHAVLVVGYGYEREDE 295
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
G KYW++KNSWG+ WG+ GY+ + +D G CGIAT ASYP+
Sbjct: 296 AGKKYWIVKNSWGKHWGDRGYVYMSKDRGNHCGIATVASYPL 337
>gi|281203744|gb|EFA77940.1| hypothetical protein PPL_08585 [Polysphondylium pallidum PN500]
Length = 505
Score = 255 bits (652), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 147/375 (39%), Positives = 207/375 (55%), Gaps = 38/375 (10%)
Query: 5 FEKSF---------IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKD 55
FE SF I+ ++ I+L+I + + E + E W+ + + Y D
Sbjct: 135 FESSFRCFSIIFLKIMNRYINILLLIFGLIAISNALLFSEEQYKNEFENWIDRFEKKY-D 193
Query: 56 ELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSR 115
E R +IFK N++++ N + ++T LG N +DLTN E+R Y G ++ +V
Sbjct: 194 VSEFKKRFSIFKSNMDFVHSWNSKNSQTV-LGLNHLADLTNLEYRQFYLGTHKK--AVLG 250
Query: 116 QSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKL 175
Q+V ++DWR+KGAV+ IKDQGQCGSCW+FS +VEG QI G +
Sbjct: 251 TPGNHEVSNLQSVFGDSATVDWRQKGAVSPIKDQGQCGSCWSFSTTGSVEGAHQIKSGNM 310
Query: 176 IELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKA 233
+ELSEQ LVDCST N GC+GGLMD AFEYII N G+ TE+ YPY GT +
Sbjct: 311 VELSEQNLVDCSTSEGNMGCNGGLMDYAFEYIITNNGIDTESSYPYTASSGTTCKYNKAN 370
Query: 234 VAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGN-NCDH 290
ATIS Y+++ G E L AV N PVSV +DAS +F Y G+ +A C + N DH
Sbjct: 371 SGATISSYKNITAGSESDLADAVKNAGPVSVAIDASHNSFQLYSHGIYYDASCSSVNLDH 430
Query: 291 GVAVVGFGTAEEENGAK-------------------YWLIKNSWGETWGESGYIRILRDA 331
GV VVG+G+ ++ ++ YW++KNSWG +WG+ G+I + +D
Sbjct: 431 GVLVVGYGSGTPDSDSRVHKGSQVRVKVPKTDDTKNYWIVKNSWGTSWGDKGFIYMSKDR 490
Query: 332 -GLCGIATAASYPVA 345
CGIA+ ASYP+
Sbjct: 491 DNNCGIASCASYPIV 505
>gi|66823245|ref|XP_644977.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
gi|166201986|sp|P54640.2|CYSP5_DICDI RecName: Full=Cysteine proteinase 5; Flags: Precursor
gi|60473097|gb|EAL71045.1| cysteine proteinase 5 precursor [Dictyostelium discoideum AX4]
Length = 344
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 142/352 (40%), Positives = 200/352 (56%), Gaps = 29/352 (8%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M V+ L + S + + E WM H ++Y E E R NIFK N++Y
Sbjct: 1 MKVLSFLCVLLVSVATAKQQFSELQYRNAFTDWMITHQKSYTSE-EFGARYNIFKANMDY 59
Query: 73 IEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-VSRQSSRPSTFKYQNVTDV 131
+++ N +G+ T LG N F+D+TNEE+R Y G S + Q + T T
Sbjct: 60 VQQWNSKGSETV-LGLNNFADITNEEYRNTYLGTKFDASSLIGTQEEKVFT------TSS 112
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTDNH 191
S DWR +GAVT +K+QGQCG CW+FS + EG ++G+L+ LSEQ L+DCST+N
Sbjct: 113 AASKDWRSEGAVTPVKNQGQCGGCWSFSTTGSTEGAHFQSKGELVSLSEQNLIDCSTENS 172
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLM AFEYII N G+ TE+ YPY+ E G C+ + E + AT+S Y+ + G E +
Sbjct: 173 GCDGGLMTYAFEYIINNNGIDTESSYPYKAENGKCEYKSENS-GATLSSYKTVTAGSESS 231
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGA--- 306
L AV+ PVSV +DAS ++F Y SG+ +C + N DHGV VG+G+ +
Sbjct: 232 LESAVNVNPVSVAIDASHQSFQLYTSGIYYEPECSSENLDHGVLAVGYGSGSGSSSGQSS 291
Query: 307 -------------KYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+YW++KNSWG +WG GYI + R+ CGIA++AS+PV
Sbjct: 292 GQSSGNLSASSSNEYWIVKNSWGTSWGIEGYILMSRNRDNNCGIASSASFPV 343
>gi|311265493|ref|XP_003130681.1| PREDICTED: cathepsin L1-like [Sus scrofa]
Length = 332
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 200/337 (59%), Gaps = 19/337 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
++L C + S H+ S+ +W A H + Y E+ R I+++N++ IE+
Sbjct: 5 LLLAAFCLG-IASAAPRHDHSLDADWYKWKATHRKLYGLN-EEGRRRAIWEKNMKMIERH 62
Query: 77 N---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N ++G ++ + N F D+TNEEFR G+ +++ + F P
Sbjct: 63 NWEHRQGKHSFTMAMNAFGDMTNEEFRKTMNGFQ------NQKHKKGKVFLDAGSALTPH 116
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS--TDNH 191
S+DWREKG VT +K+QG CGSCWAFSA A+EG KLI LSEQ LVDCS N
Sbjct: 117 SVDWREKGYVTAVKNQGHCGSCWAFSATGALEGQMFRKTSKLISLSEQNLVDCSWPEGNE 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC+GGLMD AF+YI +N GL +E YPY ++G+C K ++ AA + Y D+PK E+A
Sbjct: 177 GCNGGLMDNAFQYIKDNGGLDSEESYPYFGKDGSC-KYKPQSSAANDTGYVDIPK-QEKA 234
Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFGTAEEENGAKY 308
L++AV+ P+SV +DAS +F FY +G+ C + + DHGV VVG+G + KY
Sbjct: 235 LMKAVATVGPISVGIDASHESFQFYSTGIYFEPQCSSEDLDHGVLVVGYGVEGAHSNNKY 294
Query: 309 WLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
WL+KNSWG TWG GYI++ +D CGIAT ASYPV
Sbjct: 295 WLVKNSWGNTWGMDGYIKMTKDQNNHCGIATMASYPV 331
>gi|410519429|gb|AFV73398.1| cathepsin L [Haliotis discus hannai]
Length = 326
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 142/309 (45%), Positives = 185/309 (59%), Gaps = 23/309 (7%)
Query: 48 QHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNR---TYKLGTNEFSDLTNEEFRALYT 104
+H + YKD E+A R +F + +EYI++ N E +R ++++G NE++D+ NEEF +
Sbjct: 28 RHNKQYKDNQEEAYRKGVFMKAVEYIQQHNLEADRGVHSFRVGINEYADMPNEEFVRVMN 87
Query: 105 GYNRPVPSVSRQSSRPSTFKYQ---NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
GY Q RP Y NV D+P ++DWR KG VT +K+QGQCGSCWAFS+
Sbjct: 88 GY-------KMQEQRPKAPTYMPPSNVGDLPATVDWRTKGYVTEVKNQGQCGSCWAFSST 140
Query: 162 AAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPY 219
++EG T KLI LSEQ LVDCST+ N GC GGLMD+AF YI N G+ TE YPY
Sbjct: 141 GSLEGQTFKKYNKLISLSEQNLVDCSTEQGNMGCGGGLMDQAFTYIKVNDGIDTETSYPY 200
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSG 278
G C K V A + Y D+ E L AV+ P++V +DAS +F YKSG
Sbjct: 201 EAASGKCRFNKAN-VGANDTGYTDIKSKSESDLQSAVATVGPIAVAIDASHMSFQLYKSG 259
Query: 279 VLN-ADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCG 335
V + C DHGV VG+GT ++G YWL+KNSWG TWG+ GYI + R+ CG
Sbjct: 260 VYHYIFCSQTRLDHGVLAVGYGT---DSGKDYWLVKNSWGATWGQQGYIMMSRNRDNNCG 316
Query: 336 IATAASYPV 344
IAT ASYP
Sbjct: 317 IATQASYPT 325
>gi|402770515|gb|AFQ98392.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 204/340 (60%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF ++ G++ +R++ S NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSSFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P +DWR+KGAVT +KDQGQCGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC GGLM+ AF+YI N G+ TE YPY +G C +KE V AT + Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
E L +AV+ P+SV +DAS +F Y GV + +C + + DHGV VVG+G + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|254746340|emb|CAX16635.1| putative C1A cysteine protease precursor [Manduca sexta]
Length = 342
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 141/319 (44%), Positives = 193/319 (60%), Gaps = 20/319 (6%)
Query: 43 EQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDLTN 96
E+W+A QH + Y E+E R+ I+ +N I K N+ +G +YKLG N+++D+ +
Sbjct: 26 EEWVAFKMQHDKKYDSEVEDRFRMKIYAENKHKIAKHNQLYEQGLVSYKLGPNKYTDMLH 85
Query: 97 EEFRALYTGYNRPVPSVS-----RQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
EF GYNR + R +TF P +DW +KGAVT +KDQG+
Sbjct: 86 HEFIQAMNGYNRTAKHNKGLYGKKHDVRGATFIPPAHVKYPDHVDWTKKGAVTEVKDQGK 145
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFS A+EG G L+ LSEQ L+DCS+ N+GC+GGLMD AF+YI +N
Sbjct: 146 CGSCWAFSTTGALEGQHFRKSGYLVSLSEQNLIDCSSTYGNNGCNGGLMDNAFKYIKDNG 205
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ TE YPY + C + + A + + D+P GDE+ L+QAV+ PVSV +DAS
Sbjct: 206 GIDTEKTYPYEGVDDKCRYNPKNSGAEDVG-FVDIPSGDEEKLMQAVATVGPVSVAIDAS 264
Query: 269 GRAFHFYKSGVL-NADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F FY GV + +C + + DHGV VVG+GT +E G YWL+KNSW TWGE GYI+
Sbjct: 265 QNSFQFYSGGVYYDTECSSTDLDHGVLVVGYGT--DEAGGDYWLVKNSWSRTWGELGYIK 322
Query: 327 ILRDA-GLCGIATAASYPV 344
+ R+ CGIAT ASYP+
Sbjct: 323 MARNRDNHCGIATDASYPL 341
>gi|47522698|ref|NP_999057.1| cathepsin L1 precursor [Sus scrofa]
gi|2499874|sp|Q28944.1|CATL1_PIG RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|1468964|dbj|BAA07140.1| porcine cathepsin L [Sus scrofa]
gi|15027272|emb|CAC44793.1| cathepsin L [Sus scrofa]
Length = 334
Score = 255 bits (651), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 138/311 (44%), Positives = 194/311 (62%), Gaps = 18/311 (5%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
+W A HGR Y E+ R ++++N++ IE N+E G + + N F D+TNEEFR
Sbjct: 31 KWKATHGRLYGMN-EEGWRRAVWEKNMKMIELHNQEYSQGKHGFSMAMNAFGDMTNEEFR 89
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ G+ +++ + F V +VP S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90 QVMNGFQ------NQKHKKGKVFHESLVLEVPKSVDWREKGYVTAVKNQGQCGSCWAFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
A+EG GKL+ LSEQ LVDCS N GC+GGLMD AF+Y+ +N GL TE YP
Sbjct: 144 TGALEGQMFRKTGKLVSLSEQNLVDCSRPQGNQGCNGGLMDNAFQYVKDNGGLDTEESYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y E K + AA + + D+P+ E+AL++AV+ P+SV +DA +F FYKS
Sbjct: 204 YLGRETNSCTYKPECSAANDTGFVDIPQ-REKALMKAVATVGPISVAIDAGHSSFQFYKS 262
Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
G+ + DC + + DHGV VVG+G + N +K+W++KNSWG WG +GY+++ +D
Sbjct: 263 GIYYDPDCSSKDLDHGVLVVGYGFEGTDSNSSKFWIVKNSWGPEWGWNGYVKMAKDQNNH 322
Query: 334 CGIATAASYPV 344
CGI+TAASYP
Sbjct: 323 CGISTAASYPT 333
>gi|158268255|gb|ABW25047.1| cathepsin L-like protease [Strongylus vulgaris]
Length = 354
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 157/359 (43%), Positives = 209/359 (58%), Gaps = 33/359 (9%)
Query: 13 MFVIIILVITCASQVVS----GRSMH----------EPSIVEKHEQW---MAQHGRTY-K 54
MF ++ LV+ CAS S R H I E + W G++Y K
Sbjct: 1 MFRLLSLVLLCASVFASIDSGSRRDHTIRLHRVKSLRQKIDEAFKLWDDYKEAFGKSYNK 60
Query: 55 DELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVP 111
DE M F +N+ +I++ N+E G +T+++G N +DL ++R L +R
Sbjct: 61 DEENDYME--AFVKNVIHIDEHNQEHRLGRKTFEMGLNSIADLPFSQYRKLNGYRHRRNF 118
Query: 112 SVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQIT 171
S QS+ NV ++P S+DWR+KG VT +K+QG CGSCWAFSA A+EG
Sbjct: 119 GDSMQSNGTKWLAPFNV-EIPDSVDWRDKGLVTDVKNQGMCGSCWAFSATGALEGQHARA 177
Query: 172 RGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQ 229
GK++ LSEQ LVDCST NHGC+GGLMD AFEYI +N G+ TE YPY E C +
Sbjct: 178 SGKMVSLSEQNLVDCSTKYGNHGCNGGLMDLAFEYIKDNHGIDTEESYPYVGRETKC-HF 236
Query: 230 KEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGV-LNADCGN- 286
K+K + A + DLP+GDE+AL AV+ Q P+S+ +DA R F YK GV + +C +
Sbjct: 237 KKKDIGAEDKGFVDLPEGDEEALKVAVATQGPISIAIDAGHRTFQLYKKGVYYDEECSSE 296
Query: 287 NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
DHGV +VG+GT E YWLIKNSWG WGE GYIRI R+ + CG+AT ASYP+
Sbjct: 297 ELDHGVLLVGYGTDPE--AGDYWLIKNSWGPGWGEKGYIRIARNRSNHCGVATKASYPL 353
>gi|226821421|gb|ACO82386.1| cathepsin L-like protein [Lutjanus argentimaculatus]
Length = 301
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 138/299 (46%), Positives = 188/299 (62%), Gaps = 14/299 (4%)
Query: 56 ELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS 112
E E+ R ++++NL+ IE N E G +Y+LG N F D+T+EEFR + GY R
Sbjct: 6 EKEEGWRRMVWEKNLKKIEMHNLEHSMGTHSYRLGMNHFGDMTHEEFRQIMNGYKRK--- 62
Query: 113 VSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITR 172
++ S F N + P ++DWR+ G VT +KDQGQCGSCWAFS A+EG
Sbjct: 63 -PQRKFTGSLFMEPNFLEAPRAVDWRDNGYVTPVKDQGQCGSCWAFSTTGALEGQHFRKT 121
Query: 173 GKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQK 230
GKL+ LSEQ LVDCS N GC+GGLMD+AF+YI +N+GL +E YPY + +
Sbjct: 122 GKLVSLSEQNLVDCSRPEGNEGCNGGLMDQAFQYIKDNQGLDSEDSYPYLGTDDQPCHYD 181
Query: 231 EKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-N 287
K +A + + D+P G E+AL++AV+ PVSV +DA +F FY+SG+ DC +
Sbjct: 182 PKYNSANDTGFVDIPSGKERALMKAVAAVGPVSVAIDAGHESFQFYQSGIYYEKDCSSEE 241
Query: 288 CDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
DHGV VVG+G E+ +G KYW++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 242 LDHGVLVVGYGFEGEDVDGKKYWIVKNSWSEKWGDKGYIYMAKDRKNHCGIATAASYPL 300
>gi|33348834|gb|AAQ16117.1| cathepsin L-like cysteine proteinase A [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 332
Score = 255 bits (651), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 145/341 (42%), Positives = 205/341 (60%), Gaps = 22/341 (6%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ L + CA V+ + + + E + H ++Y+ +E+ +R IF +N I K
Sbjct: 1 MLRLSLLCAIVAVTVAANSHEILRTQWEAFKTTHKKSYESHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF-KYQNVTD- 130
N + G +YKLG N+F DL EF ++ GY +++SR STF NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFAKIFNGYR------GQRTSRGSTFMPPANVNDS 114
Query: 131 -VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P+++DWR+KGAVT +KDQGQCGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 115 SLPSTVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKDGELVSLSEQNLVDCSQS 174
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N+GC GGLMD AF+YI N G+ E YPY + C +KE V AT + + D+ G
Sbjct: 175 FGNNGCEGGLMDNAFKYIKANDGIDAEESYPYEAMDDKCRFKKED-VGATDTGFVDIEGG 233
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEEN 304
E L +AV+ P+SV +DA +F Y GV + +C + DHGV VG+G ++
Sbjct: 234 SEDDLKKAVATVGPISVAIDAGHSSFQLYSEGVYDEPECSSEELDHGVLAVGYGV---KD 290
Query: 305 GAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G KYWL+KNSWG +WG++GYI + RD CGIA+AASYP+
Sbjct: 291 GKKYWLVKNSWGGSWGDNGYILMSRDKNNQCGIASAASYPL 331
>gi|357216861|gb|AET71138.1| cysteine peptidase isoform b [Sphenophorus levis]
Length = 324
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 143/345 (41%), Positives = 199/345 (57%), Gaps = 30/345 (8%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
KSFI+ +LV+ ++ ++ H + + +HG+TYK++ E+ R IF
Sbjct: 2 KSFILAS----LLVVAVSATLLKEDGAH-------FQSFKLKHGKTYKNQAEETKRFAIF 50
Query: 67 KQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
++NL IE N K+G +Y G N+F+D+T EF+A+ + PS+ TF
Sbjct: 51 RENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATK----TF 106
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
+ + VP SIDWR + VT IKDQ QCGSCWAF+ V + EG ++ GKL SEQQL
Sbjct: 107 QLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWAFAVVGSTEGAYALSTGKLTRFSEQQL 166
Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
VDC+TD N+GC GG +D F YI N GL E+DYPY +G C + K V +S Y
Sbjct: 167 VDCTTDLNYGCDGGYLDDTFPYIQTN-GLELESDYPYTGYDGYCSYESSK-VVTKVSSYV 224
Query: 243 DLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLNADCGNN--CDHGVAVVGFGT 299
+P +EQALL+AV PV++ ++A F+F SG+++ + DHGV VG+
Sbjct: 225 SVP-ANEQALLEAVGTAGPVAIAINADDLQFYF--SGIIDDKYCDPEYLDHGVLAVGY-- 279
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
+ ENG YWLIKNSWG WGESGY R LR +CG+ A YP+
Sbjct: 280 -DSENGRDYWLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323
>gi|401397136|ref|XP_003879989.1| cathepsin L, related [Neospora caninum Liverpool]
gi|325114397|emb|CBZ49954.1| cathepsin L, related [Neospora caninum Liverpool]
Length = 415
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 138/353 (39%), Positives = 198/353 (56%), Gaps = 29/353 (8%)
Query: 5 FEKSFIIPMFVIIILVITCASQVV-----SGRSMHEPSIVEKHE------QWMAQH---- 49
F+ I + + ++ AS ++ R + PS VE H+ +W +H
Sbjct: 59 FKTRAWIALVAAAVSLLVFASFLIQWQGDDDRGVFPPSPVEDHKTPVNIWEWKEEHFQNA 118
Query: 50 --------GRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRA 101
G++Y E E R IFK NL YI N++G +Y L N F DL+ EEFR
Sbjct: 119 FGSFRATYGKSYATEEETQKRYAIFKNNLAYIHTHNQQG-YSYSLKMNHFGDLSREEFRR 177
Query: 102 LYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAV 161
Y GYN+ S + + +DVP+++DWREKG VT +KDQ CGSCWAFSA
Sbjct: 178 KYLGYNKSRNLKSNNLGVATELLKVSPSDVPSAVDWREKGCVTPVKDQRDCGSCWAFSAT 237
Query: 162 AAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKGLATEADYPY 219
A+EG G+L+ LSEQ+LVDCS N GCSGG M+ AF+Y++++ GL +E YPY
Sbjct: 238 GALEGAHCAKTGELLSLSEQELVDCSLAEGNQGCSGGEMNDAFQYVVDSGGLCSEEGYPY 297
Query: 220 RHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGV 279
+G C +K V TIS ++D+P+ E A+ A+++ PVS+ ++A F FY GV
Sbjct: 298 LARDGECKRACKKVV--TISGFKDVPRKSETAMKAALAHSPVSIAIEADQLPFQFYHEGV 355
Query: 280 LNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDAG 332
+A CG + DHGV +VG+GT ++E +W++KNSWG WG GY+ + G
Sbjct: 356 FDASCGTDLDHGVLLVGYGT-DKETKKDFWIMKNSWGSGWGRDGYMYMAMHKG 407
>gi|255635645|gb|ACU18172.1| unknown [Glycine max]
Length = 355
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 133/345 (38%), Positives = 198/345 (57%), Gaps = 16/345 (4%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMH--------EPSIVEKHEQWMAQHGRTYKDELEKAM 61
I+ +F++ + ++S + H + ++ E+W+ +H + Y EK
Sbjct: 5 IVLLFMVFAVSSALDMSIISHDNAHADRATRRTDDEVMSMFEEWLVKHDKVYNALGEKEK 64
Query: 62 RLNIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPS 121
R IFK NL +I++ N NRTYKLG N F+DLTN E+RA+Y P + + +
Sbjct: 65 RFQIFKNNLRFIDERNSL-NRTYKLGLNVFADLTNAEYRAMYLRTWDDGPRLDLDTPPRN 123
Query: 122 TFKYQNVTDVPTSIDWREKGAVTHIKDQG-QCGSCWAFSAVAAVEGITQITRGKLIELSE 180
+ + +P S+DWR++GAVT +K+QG C SCWAF+AV AVE + +I G LI LSE
Sbjct: 124 RYVPRVGDTIPKSVDWRKEGAVTPVKNQGATCNSCWAFTAVGAVESLVKIKTGDLISLSE 183
Query: 181 QQLVDCST-DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATIS 239
Q++VDC+T + GC GG + + YI +N G++ E DYPYR +EG CD+ K+ A+ TI
Sbjct: 184 QEVVDCTTSSSRGCGGGDIQHGYIYIRKN-GISLEKDYPYRGDEGKCDSNKKNAI-VTID 241
Query: 240 KYEDLPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGT 299
+ +P E+AL Q ++NQPV+V + A F +Y SGV CG +H + +VG+G
Sbjct: 242 GHGWVPTQLEEALKQGIANQPVAVPIPADDYEFQYYTSGVFKGKCGTELNHALLLVGYGA 301
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
E YW+ KNS+ + WGE+GYIRI R C YP+
Sbjct: 302 ---EKDGDYWIAKNSYSDKWGENGYIRIQRKLSTCKFGNGGYYPI 343
>gi|402770511|gb|AFQ98390.1| cathepsin L [Rhipicephalus microplus]
gi|402770513|gb|AFQ98391.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 144/340 (42%), Positives = 204/340 (60%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF ++ G++ +R++ S NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSSFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P +DWR+KGAVT +KDQGQCGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKVVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC GGLM+ AF+YI N G+ TE YPY +G C +KE V AT + Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
E L +AV+ P+SV +DAS +F Y GV + +C + + DHGV VVG+G + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|261289779|ref|XP_002611751.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
gi|229297123|gb|EEN67761.1| hypothetical protein BRAFLDRAFT_284345 [Branchiostoma floridae]
Length = 330
Score = 254 bits (650), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 147/342 (42%), Positives = 205/342 (59%), Gaps = 29/342 (8%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNLEYI 73
+++++ C + + E QW A +H + Y ++ E A RL IF+ NL+ I
Sbjct: 3 LLVLLACVAMATAASLSFES-------QWEAFKIKHDKVYSEKEEYARRL-IFQDNLKTI 54
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
E N+E G +Y LG N+F+D+T+ E+ G ++++ SR +T++Y
Sbjct: 55 ESHNQEADTGKHSYWLGVNQFADMTHAEYLNQVIGGCLITSNLTKTGSR-ATYRYMPNMQ 113
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
V ++DWR+KG VT IKDQGQCGSCWAFS ++EG G L+ LSEQ LVDCS
Sbjct: 114 VNDTVDWRDKGLVTDIKDQGQCGSCWAFSTTGSLEGQHAKATGTLVSLSEQNLVDCSRQE 173
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTC--DNQKEKAVAATISKYEDLPK 246
N GC GG MD+ F+YII+NKG+ TE YPY+ + C DN + AT+S + D+
Sbjct: 174 GNKGCEGGDMDQGFQYIIQNKGIDTEQCYPYKAKNHRCKFDN---SCIGATMSSFTDVTS 230
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNA-DCGNN-CDHGVAVVGFGTAEEE 303
GDE AL QA +N P+SV +DAS ++F FY SGV N +C + DHGV VVG+GT +
Sbjct: 231 GDEDALKQACANIGPISVGIDASHQSFQFYSSGVYNEFECSSTKLDHGVLVVGYGTYGSK 290
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+ YWL+KNSWG WG GYI + R+ CG+AT AS+PV
Sbjct: 291 D---YWLVKNSWGTVWGNEGYIMMSRNKDNQCGVATDASFPV 329
>gi|402770505|gb|AFQ98387.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 254 bits (650), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 143/340 (42%), Positives = 204/340 (60%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF +N I K
Sbjct: 1 MLRLSVLCAIVAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTENSLIIAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF ++ G+ +R++ + NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHRG-----TRKTGGSTFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P ++DWR+KGAVT +KDQGQCGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC GGLM+ AF+YI N G+ TE YPY +G C +KE V AT + Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
E L +AV+ P+SV +DAS +F Y GV + +C + + DHGV VVG+G + G
Sbjct: 235 EVDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|225706086|gb|ACO08889.1| Cathepsin S precursor [Osmerus mordax]
Length = 333
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 136/310 (43%), Positives = 191/310 (61%), Gaps = 16/310 (5%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEF 99
+ W QHG+ YK E+E+ R ++++NL+ I N E G TY LG N D+T EE
Sbjct: 31 QMWKKQHGKNYKTEVEELGRREVWERNLQLISLHNLEASMGMHTYDLGMNHMGDMTEEEI 90
Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
+ P + R+ PS F + T VP ++DWR+KG VT +K+QG CGSCWAFS
Sbjct: 91 LQSFASLKVPA-DLKRE---PSAFVASSGTPVPDTVDWRQKGYVTQVKNQGSCGSCWAFS 146
Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADY 217
+V A+EG T GKL++LS Q LVDCS+ N GC+GG M +AF+Y+I+NKG+ ++ Y
Sbjct: 147 SVGALEGQLMRTTGKLLDLSPQNLVDCSSKYGNKGCNGGFMSEAFQYVIDNKGIDSDTSY 206
Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYK 276
PY+ +GTC + +A ++Y LP+GDE L QAV+ P+SV +DA+ +F ++
Sbjct: 207 PYQGVQGTC-HYNPSYRSANCTRYSFLPEGDETTLKQAVAMIGPISVAIDATRPSFILWR 265
Query: 277 SGVLN-ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLC 334
SGV N C +H V VVG+GT + G YWL+KNSWG +GE+GYIR+ R+ C
Sbjct: 266 SGVYNDLTCTQKINHAVLVVGYGTLD---GQDYWLVKNSWGTRFGENGYIRMSRNRNNQC 322
Query: 335 GIATAASYPV 344
GIA YP+
Sbjct: 323 GIALYGCYPI 332
>gi|18202415|sp|P82474.1|CPGP2_ZINOF RecName: Full=Zingipain-2; AltName: Full=Cysteine proteinase GP-II
gi|6137410|pdb|1CQD|A Chain A, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137411|pdb|1CQD|B Chain B, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137412|pdb|1CQD|C Chain C, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
gi|6137413|pdb|1CQD|D Chain D, The 2.1 Angstrom Structure Of A Cysteine Protease With
Proline Specificity From Ginger Rhizome, Zingiber
Officinale
Length = 221
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 121/219 (55%), Positives = 153/219 (69%), Gaps = 8/219 (3%)
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
D+P SIDWRE GAV +K+QG CGSCWAFS VAAVEGI QI G LI LSEQQLVDC+T
Sbjct: 2 DLPDSIDWRENGAVVPVKNQGGCGSCWAFSTVAAVEGINQIVTGDLISLSEQQLVDCTTA 61
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
NHGC GG M+ AF++I+ N G+ +E YPYR ++G C N A +I YE++P +E
Sbjct: 62 NHGCRGGWMNPAFQFIVNNGGINSEETYPYRGQDGIC-NSTVNAPVVSIDSYENVPSHNE 120
Query: 250 QALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYW 309
Q+L +AV+NQPVSV +DA+GR F Y+SG+ C + +H + VVG+GT EN +W
Sbjct: 121 QSLQKAVANQPVSVTMDAAGRDFQLYRSGIFTGSCNISANHALTVVGYGT---ENDKDFW 177
Query: 310 LIKNSWGETWGESGYIRILRDA----GLCGIATAASYPV 344
++KNSWG+ WGESGYIR R+ G CGI ASYPV
Sbjct: 178 IVKNSWGKNWGESGYIRAERNIENPDGKCGITRFASYPV 216
>gi|226821425|gb|ACO82388.1| cathepsin S [Lutjanus argentimaculatus]
Length = 337
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 140/321 (43%), Positives = 190/321 (59%), Gaps = 16/321 (4%)
Query: 32 SMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGT 88
+M E + + W H + Y++E+E+ R ++++NL I N E G TY+LG
Sbjct: 24 AMFESRLDAHWDLWKKTHEKKYQNEVEEFSRRRLWEKNLMLITMHNLEASMGLHTYELGM 83
Query: 89 NEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKD 148
N D+T EE ++ + P Q + PS F + D+P ++DWREKG VT +K
Sbjct: 84 NHMGDMTPEE---IWQSFATLTPPTDIQRA-PSPFAGSSGADIPDTMDWREKGCVTSVKT 139
Query: 149 QGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYII 206
QG CGSCWAFSAV A+EG GKL++LS Q LVDCST NHGC+GG MD AF+Y+I
Sbjct: 140 QGSCGSCWAFSAVGALEGQLAKKTGKLVDLSPQNLVDCSTKYGNHGCNGGFMDHAFQYVI 199
Query: 207 ENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCV 265
+N+G+ ++A YPY C + AA S Y LP+GDE AL QA++ P+SV +
Sbjct: 200 DNQGIDSDASYPYTGRSDQC-HYNPSYRAANCSSYNFLPEGDEGALKQALATIGPISVAI 258
Query: 266 DASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGY 324
DA+ F FY+SGV N C +HGV VG+GT NG YWL+KNSWG +G+ GY
Sbjct: 259 DATRPRFIFYRSGVYNDPSCSQEVNHGVLAVGYGTL---NGQDYWLVKNSWGTKFGDQGY 315
Query: 325 IRILRDAG-LCGIATAASYPV 344
IR+ R+ CGIA YP+
Sbjct: 316 IRMARNQNDQCGIAMYGCYPI 336
>gi|348525618|ref|XP_003450319.1| PREDICTED: cathepsin S-like [Oreochromis niloticus]
Length = 330
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 139/344 (40%), Positives = 202/344 (58%), Gaps = 24/344 (6%)
Query: 10 IIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQN 69
++P+ +++L + + E S+ + E+W + H R Y E+ +R I+++N
Sbjct: 1 MLPLVCVLLLATSALGR------FDESSLDAQWEEWKSTHRREYNGLGEEGIRRAIWEKN 54
Query: 70 LEYIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQ 126
+ IE N+E G ++++G N D+T+EE TG P+ R T
Sbjct: 55 MRMIEAHNEEAALGIHSFEMGMNHLGDMTSEEVVEKMTGLQIPM-----NQERSFTLAMD 109
Query: 127 NV-TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
++ + +P S+D+R+KG VT +K+QG CGSCWAFSA A+EG + GKL++LS Q LVD
Sbjct: 110 DMPSKIPKSVDYRKKGMVTSVKNQGACGSCWAFSAAGALEGQLAKSTGKLVDLSPQNLVD 169
Query: 186 CSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
CS NHGC+GG M +AF+Y+I+N G+ ++A YPY + C AA S Y+
Sbjct: 170 CSGKYGNHGCNGGFMTRAFQYVIDNHGIDSDASYPYTGRDEQC-RYNPATRAANCSSYQF 228
Query: 244 LPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAE 301
LP+GDE AL QA++ P+SV +DA F FY+SGV N C +HGV VG+G+
Sbjct: 229 LPEGDENALKQALATIGPISVAIDARRPRFSFYRSGVYNDPSCTQEVNHGVLAVGYGSL- 287
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDAG-LCGIATAASYPV 344
NG YWL+KNSWG T+G+ GYIR+ R+ G CGIA A YPV
Sbjct: 288 --NGQDYWLVKNSWGSTFGDQGYIRMARNTGNQCGIALYACYPV 329
>gi|1483570|emb|CAA68066.1| cathepsin l [Litopenaeus vannamei]
Length = 328
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 144/320 (45%), Positives = 192/320 (60%), Gaps = 21/320 (6%)
Query: 36 PSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFS 92
PS+ ++ + A+HGR Y E+ RL++F+QN ++I+ N + G T+ L N+F
Sbjct: 18 PSLRQQWRDFKAEHGRRYASVQEERYRLSVFEQNQQFIDDHNARFENGEVTFTLQMNQFG 77
Query: 93 DLTNEEFRALYTGY-NRPVPSVSRQSSRPSTFKYQNVTD-VPTSIDWREKGAVTHIKDQG 150
D+T+EEF A G+ N P S RP+ + + +P +DWR KGAVT +KDQ
Sbjct: 78 DMTSEEFTATMNGFLNVP-------SRRPTAILRADPDETLPKEVDWRTKGAVTPVKDQK 130
Query: 151 QCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIEN 208
QCGSCWAFS ++EG + GKL+ LSEQ LVDCS N GC GGLMD+AF YI N
Sbjct: 131 QCGSCWAFSTTGSLEGQHFLKDGKLVSLSEQNLVDCSDKFGNMGCMGGLMDQAFRYIKAN 190
Query: 209 KGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDA 267
KG+ TE YPY ++G C V AT + Y D+ G E AL +AV+ P+SV +DA
Sbjct: 191 KGIDTEDSYPYEAQDGKCRFDASN-VGATDTGYVDVEHGSESALKKAVATIGPISVAIDA 249
Query: 268 SGRAFHFYKSGVLNAD-CGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYI 325
S +F FY GV + C + DHGV VG+G E E G YWL+KNSW +WG GYI
Sbjct: 250 SQPSFQFYHDGVYYEEGCSSTMLDHGVLAVGYG--ETEKGEAYWLVKNSWNTSWGNKGYI 307
Query: 326 RILRD-AGLCGIATAASYPV 344
++ RD CGIA+ ASYP+
Sbjct: 308 QMSRDKKNNCGIASQASYPL 327
>gi|217323618|gb|ACK38176.1| midgut cysteine peptidase, partial [Sphenophorus levis]
Length = 324
Score = 254 bits (649), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 142/345 (41%), Positives = 200/345 (57%), Gaps = 30/345 (8%)
Query: 7 KSFIIPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIF 66
KSFI+ +LV+ ++ ++ +H + + +HG+TYK++ E+ R IF
Sbjct: 2 KSFILAS----LLVVAVSATLLKEDGVH-------FQSFKLKHGKTYKNQAEETKRFAIF 50
Query: 67 KQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
++NL IE N K+G +Y G N+F+D+T EF+A+ + PS+ TF
Sbjct: 51 RENLRKIEAHNAEYKQGIHSYTQGINKFADMTRAEFKAMLATQVKTKPSIVATK----TF 106
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
+ + VP SIDWR + VT IKDQ QCGSCW+F+ V + EG ++ GKL SEQQL
Sbjct: 107 QLADGVSVPESIDWRSRNVVTPIKDQAQCGSCWSFAVVGSTEGAYALSTGKLTRFSEQQL 166
Query: 184 VDCSTD-NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYE 242
VDC+TD N+GC GG +D F YI N GL E+DYPY +G+C K V +S Y
Sbjct: 167 VDCTTDLNYGCDGGYLDDTFPYIQTN-GLELESDYPYTGYDGSCSYDSSK-VVTKVSSYV 224
Query: 243 DLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLNADCGNN--CDHGVAVVGFGT 299
+P +EQALL+AV PV++ ++A F+F SG+++ + DHGV VG+ +
Sbjct: 225 SVP-ANEQALLEAVGTAGPVAIAINADDLQFYF--SGIIDDKYCDPEWLDHGVLAVGYNS 281
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDAGLCGIATAASYPV 344
ENG YWLIKNSWG WGESGY R LR +CG+ A YP+
Sbjct: 282 ---ENGLDYWLIKNSWGADWGESGYFRFLRGQNICGVKEDAVYPL 323
>gi|146217394|gb|ABQ10739.1| cathepsin L [Penaeus monodon]
Length = 341
Score = 254 bits (649), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 139/319 (43%), Positives = 200/319 (62%), Gaps = 15/319 (4%)
Query: 38 IVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANK---EGNRTYKLGTNEFSDL 94
++E+ E + +H + Y E+E++ R+ IF +N I NK +G+ TYKL N++ D+
Sbjct: 25 VLEEWEAFKLEHSKKYDSEVEESFRMKIFTENKHKIANHNKGFAQGHHTYKLSMNKYGDM 84
Query: 95 TNEEFRALYTGY--NRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQ 151
+ EF + G+ N + ++ +TF + + +P ++DWR KGAVT IKDQGQ
Sbjct: 85 LHHEFVSTMNGFRGNHTGGYKNNRAYTGATFIEPDDDVQLPKNVDWRTKGAVTPIKDQGQ 144
Query: 152 CGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENK 209
CGSCWAFSA A+EG T G+L+ LSEQ LVDCS N+GC+GGLMD AFEY+ EN
Sbjct: 145 CGSCWAFSATGALEGQTFRKTGQLVSLSEQNLVDCSRKFGNNGCNGGLMDNAFEYVKENG 204
Query: 210 GLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDAS 268
G+ TE YPY E+ C + +A A + D+ +G E AL +AV+ PVSV +DAS
Sbjct: 205 GIDTEESYPYDAEDEKC-HYNPRAAGAEDKGFVDVREGSEHALKKAVATVGPVSVAIDAS 263
Query: 269 GRAFHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIR 326
+F FY GV + +C DHGV VVG+G +++G YWL+KNSWG TWG+ GY++
Sbjct: 264 HESFQFYSHGVYIEPECSPEMLDHGVLVVGYGI--DDDGTDYWLVKNSWGTTWGDQGYVK 321
Query: 327 ILRDA-GLCGIATAASYPV 344
+ R+ CGIA++AS+P+
Sbjct: 322 MARNRDNQCGIASSASFPL 340
>gi|223673161|gb|ACN12762.1| Cathepsin S precursor [Salmo salar]
Length = 330
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 139/340 (40%), Positives = 198/340 (58%), Gaps = 20/340 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M ++L + C + V ++ +P + + + W HG+ Y+ E+E+ R ++++NL+
Sbjct: 2 MLWSLLLAVLCGTAV----ALFDPMLEQHWQMWKKTHGKNYQTEVEELGRREVWERNLQL 57
Query: 73 IEKANKEGN---RTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
I N E + TY LG N D+T EE + P + R+ PS F +
Sbjct: 58 ISLHNLEASMDMHTYDLGMNHMGDMTQEEIAQSFASLLVPA-DLKRE---PSAFAGSSGA 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+P + DWREKG VT +K QG CGSCWAFS+V A+EG T GKLI+LS Q LVDCS+
Sbjct: 114 PIPDTFDWREKGYVTGVKMQGSCGSCWAFSSVGALEGQLMKTTGKLIDLSPQNLVDCSSK 173
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC GG M KAF+Y+I+N+G+A++ YPY+ + C + AA S+Y LP+G
Sbjct: 174 YGNKGCHGGFMTKAFQYVIDNQGIASDQSYPYKGVQQQCIYNPAQR-AANCSRYSFLPEG 232
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNNCDHGVAVVGFGTAEEENG 305
DE L +A++ P+SV +DA+ +F FY+SGV N C +H V VG+GT G
Sbjct: 233 DEGVLKEALATIGPISVGIDATRPSFAFYRSGVYNDPTCTKKTNHAVLAVGYGTL---GG 289
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YWL+KNSWG +WG+ GYIR+ R+ CGIA YPV
Sbjct: 290 QDYWLVKNSWGLSWGDQGYIRMSRNKDNQCGIALYGCYPV 329
>gi|27806673|ref|NP_776457.1| cathepsin L2 precursor [Bos taurus]
gi|1542853|emb|CAA62870.1| cathepsin L [Bos taurus]
Length = 334
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 203/343 (59%), Gaps = 22/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F + +L + V S +P++ QW A H R Y E+ R ++++N +
Sbjct: 3 PSFFLTVLCLG----VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNKK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
I+ N+E G +++ N F D+TNEEFR + G+ +++ + F +
Sbjct: 58 IIDLHNQEYSEGKHAFRMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGKLFHEPLL 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
DVP S+DW +KG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+YI +N GL +E YPY + N K + AA + + D+P+
Sbjct: 172 AQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ 231
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCG-NNCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYKSG+ + DC + DHGV VVG+G +
Sbjct: 232 -REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSCKDLDHGVLVVGYGFEGTD 290
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
N K+W++KNSWG WG +GY+++ +D CGIATAASYP
Sbjct: 291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 333
>gi|357477225|ref|XP_003608898.1| Cysteine proteinase, partial [Medicago truncatula]
gi|355509953|gb|AES91095.1| Cysteine proteinase, partial [Medicago truncatula]
Length = 260
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 130/265 (49%), Positives = 169/265 (63%), Gaps = 33/265 (12%)
Query: 89 NEFSDLTNEEFRALY----TGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVT 144
N+F+D+TN EFR++Y ++R +S + F Y+NV VP+SIDWR+ GAVT
Sbjct: 3 NKFADMTNYEFRSIYADSKVNHHRMFRGMSHDNG---PFMYENVEGVPSSIDWRKIGAVT 59
Query: 145 HIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDKAFE 203
+KDQGQCGSCWAFS + AVEGI QI KL+ LSEQ+LVDC T+ N GC+GGLM+ AFE
Sbjct: 60 GVKDQGQCGSCWAFSTIVAVEGINQIKTQKLVSLSEQELVDCDTEVNQGCNGGLMEYAFE 119
Query: 204 YIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQPVSV 263
+I +N G+ TE +YPY ++GTC+ QKE A +I +E++P +E+ALL+A +NQP+SV
Sbjct: 120 FIKQN-GITTETNYPYAAKDGTCNIQKENKPAVSIDGHENVPANNEKALLKAAANQPISV 178
Query: 264 CVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESG 323
+DA G F FY GV CG +HGV NSWG WGE G
Sbjct: 179 AIDAGGSDFQFYSEGVFTGHCGTELNHGV--------------------NSWGSEWGEQG 218
Query: 324 YIRILR----DAGLCGIATAASYPV 344
YIR+ R GLCGIA ASYP+
Sbjct: 219 YIRMQRAISHKQGLCGIAMEASYPI 243
>gi|37994576|gb|AAH60335.1| Unknown (protein for MGC:68554) [Xenopus laevis]
Length = 335
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 211/342 (61%), Gaps = 18/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M +I+ C + + + + +P++ W H +TY + E+ R ++++NL+
Sbjct: 1 MTPYLIIGAICLTTLYAAPAT-DPALDNHWYSWKDWHKKTYAPK-EEGWRRVLWEKNLKM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N + G +Y+LG N+F D+TNEEF+ L GY +++ R STF N
Sbjct: 59 IEFHNLDHSLGKHSYRLGMNQFGDMTNEEFKQLMNGYK------NQKMIRGSTFLAPNNF 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
+ P S+DWR+KG VT +KDQGQCGSCWAFS A+EG KLI LSEQ LVDCS
Sbjct: 113 EAPKSVDWRKKGYVTPVKDQGQCGSCWAFSTTGALEGQHYRKTSKLISLSEQNLVDCSRA 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+Y+ +N G+ +E YPY ++ + +A + + D+ G
Sbjct: 173 QGNEGCNGGLMDQAFQYVKDNGGIDSEDSYPYTAKDDQECHYDPNNNSANDTGFVDVQSG 232
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+ L++AV++ PVSV +DA ++F FY+SG+ +C + + DHGV VVG+G +E+
Sbjct: 233 CEKDLMKAVASVGPVSVAIDAGHQSFQFYQSGIYYEPECSSEDLDHGVLVVGYGFESEDV 292
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+G KYW++KNSW E WG++GYI I +D CGIATAASYP+
Sbjct: 293 DGKKYWIVKNSWSEKWGDNGYINIAKDRHNHCGIATAASYPL 334
>gi|12847813|dbj|BAB27719.1| unnamed protein product [Mus musculus]
Length = 334
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
QW + H R Y E+ R I+++N+ I+ N E G + + N F D+TNEEFR
Sbjct: 31 QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 89
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ GY R P K +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90 QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
+EG + GKLI LSEQ LVDCS N GC+GGLMD AF+YI EN GL +E YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDYAFQYIKENGGLDSEESYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y ++G+C + E AV A + + D+P+ E+AL++AV+ P+SV +DAS + FY S
Sbjct: 204 YEAKDGSCKYRAEFAV-ANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261
Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
G+ +C + N DHGV +VG+G + N KYWL+KNSWG WG GYI+I +D
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321
Query: 334 CGIATAASYPV 344
CG+ATAASYPV
Sbjct: 322 CGLATAASYPV 332
>gi|293342579|ref|XP_001065885.2| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|293354415|ref|XP_225137.5| PREDICTED: cathepsin L1 [Rattus norvegicus]
gi|149039747|gb|EDL93863.1| rCG24278 [Rattus norvegicus]
Length = 330
Score = 254 bits (648), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 140/336 (41%), Positives = 194/336 (57%), Gaps = 18/336 (5%)
Query: 17 IILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKA 76
I L+ T ++S H+PS E+W +HG+TY E+ + +++ N++ I
Sbjct: 4 IFLLATLCLGMISAAPTHDPSFDTVWEEWKTKHGKTYNTN-EEGQKRAVWENNMKMINLH 62
Query: 77 NKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT 133
N++ G + L N F DLTN EFR L TG+ P + F+ + D+P
Sbjct: 63 NEDYLKGKHGFSLEMNAFGDLTNTEFRELMTGFQSMGPK------ETTIFREPFLGDIPK 116
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
S+DWRE G VT +K+QGQCGSCWAFSAV ++EG GKL+ LSEQ LVDCS N
Sbjct: 117 SLDWREHGYVTPVKNQGQCGSCWAFSAVGSLEGQIFKKTGKLVSLSEQNLVDCSWSYGNL 176
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC+GGLM+ AF+Y+ EN+GL T Y Y ++G C K AA ++ + +P ++
Sbjct: 177 GCNGGLMEFAFQYVKENRGLDTGESYAYEAQDGLC-RYNPKYSAANVTGFVKVPLSEDDL 235
Query: 252 LLQAVSNQPVSVCVDASGRAFHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEEENGAKYW 309
+ S PVSV +D+ ++F FY G+ DC + DH V VVG+G EE +G KYW
Sbjct: 236 MSAVASVGPVSVGIDSHHQSFRFYSGGMYYEPDCSSTEMDHAVLVVGYG--EESDGGKYW 293
Query: 310 LIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
L+KNSWGE WG GYI++ +D CGIAT A YP
Sbjct: 294 LVKNSWGEDWGMDGYIKMAKDQNNNCGIATYAIYPT 329
>gi|109940313|sp|P25975.3|CATL1_BOVIN RecName: Full=Cathepsin L1; Contains: RecName: Full=Cathepsin L1
heavy chain; Contains: RecName: Full=Cathepsin L1 light
chain; Flags: Precursor
gi|74354943|gb|AAI02313.1| CTSL2 protein [Bos taurus]
gi|154425700|gb|AAI51426.1| Cathepsin L2 [Bos taurus]
gi|296484466|tpg|DAA26581.1| TPA: cathepsin L2 precursor [Bos taurus]
gi|440898893|gb|ELR50299.1| Cathepsin L1 [Bos grunniens mutus]
Length = 334
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 142/343 (41%), Positives = 204/343 (59%), Gaps = 22/343 (6%)
Query: 12 PMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLE 71
P F + +L + V S +P++ QW A H R Y E+ R ++++N +
Sbjct: 3 PSFFLTVLCLG----VASAAPKLDPNLDAHWHQWKATHRRLYGMN-EEEWRRAVWEKNKK 57
Query: 72 YIEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNV 128
I+ N+E G +++ N F D+TNEEFR + G+ +++ + F +
Sbjct: 58 IIDLHNQEYSEGKHGFRMAMNAFGDMTNEEFRQVMNGFQ------NQKHKKGKLFHEPLL 111
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS- 187
DVP S+DW +KG VT +K+QGQCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 112 VDVPKSVDWTKKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSR 171
Query: 188 -TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N GC+GGLMD AF+YI +N GL +E YPY + N K + AA + + D+P+
Sbjct: 172 AQGNQGCNGGLMDNAFQYIKDNGGLDSEESYPYLATDTNSCNYKPECSAANDTGFVDIPQ 231
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEE 302
E+AL++AV+ P+SV +DA +F FYKSG+ + DC + + DHGV VVG+G +
Sbjct: 232 -REKALMKAVATVGPISVAIDAGHTSFQFYKSGIYYDPDCSSKDLDHGVLVVGYGFEGTD 290
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
N K+W++KNSWG WG +GY+++ +D CGIATAASYP
Sbjct: 291 SNNNKFWIVKNSWGPEWGWNGYVKMAKDQNNHCGIATAASYPT 333
>gi|74151179|dbj|BAE27712.1| unnamed protein product [Mus musculus]
Length = 334
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
QW + H R Y E+ R I+++N+ I+ N E G + + N F D+TNEEFR
Sbjct: 31 QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 89
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ GY R P K +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90 QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
+EG + GKLI LSEQ LVDCS N GC+GGLMD AF+YI EN GL +E YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y ++G+C + E AV A + + D+P+ E+AL++AV+ P+SV +DAS + FY S
Sbjct: 204 YEAKDGSCKYRAEFAV-ANDTGFVDIPQ-QEEALMKAVATVGPISVAMDASHPSLQFYSS 261
Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
G+ +C + N DHGV +VG+G + N KYWL+KNSWG WG GYI+I +D
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321
Query: 334 CGIATAASYPV 344
CG+ATAASYPV
Sbjct: 322 CGLATAASYPV 332
>gi|33348836|gb|AAQ16118.1| cathepsin L-like cysteine proteinase B [Rhipicephalus
haemaphysaloides haemaphysaloides]
Length = 335
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 143/337 (42%), Positives = 200/337 (59%), Gaps = 19/337 (5%)
Query: 20 VITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE 79
++ C V + H+ + + + A HG+ Y + E+ RL I+ +N I + N++
Sbjct: 5 IVLCCLFVTAAAITHQELVGAEWSAFKALHGKDYASDTEEYYRLKIYMENRLKIARHNEK 64
Query: 80 GNRT---YKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSS---RPSTFKYQNVTDVPT 133
++ YKL NEF DL + EF + G+ R R+ S P F+ +P
Sbjct: 65 YAKSQVSYKLAMNEFGDLLHHEFVSTRNGFKRNYRDSPREGSFFVEPEGFE---DLQLPK 121
Query: 134 SIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NH 191
++DWR+KGAVT +K+QGQCGSCWAFS ++EG KL+ LSEQ LVDCS N+
Sbjct: 122 TVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGPHFRKTRKLVSLSEQNLVDCSRSFGNN 181
Query: 192 GCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQA 251
GC GGLMD AF+YI NKG+ TE YPY +G C + V AT + + D+P+GDE
Sbjct: 182 GCEGGLMDNAFKYIKSNKGIDTEWSYPYNATDGVCHFNRSD-VGATDTGFVDIPEGDENK 240
Query: 252 LLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENGAKY 308
L +AV+ PVSV +DAS +F FY GV + +C + DHGV VVG+GT ++G Y
Sbjct: 241 LKKAVAAVGPVSVAIDASHESFQFYSEGVYDEPECSSEQLDHGVLVVGYGT---KDGQDY 297
Query: 309 WLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
WL+KNSWG TWG+ GYI + R+ CGIA++ASYP+
Sbjct: 298 WLVKNSWGTTWGDEGYIYMTRNKDNQCGIASSASYPL 334
>gi|60827856|gb|AAX36816.1| cathepsin L [synthetic construct]
Length = 334
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 143/344 (41%), Positives = 205/344 (59%), Gaps = 20/344 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M +IL C + S + S+ + +W A H R Y E+ R ++++N++
Sbjct: 1 MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N +EG ++ + N F D+T+EEFR + G+ +R+ + F+
Sbjct: 59 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG G+LI LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD AF+Y+ +N GL +E YPY E +C + +V A + + D+PK
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSV-ANDTGFVDIPK- 230
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ P+SV +DA +F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPVAI 346
+ KYWL+KNSWGE WG GY+++ +D CGIA+AASYP +
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPTVL 334
>gi|6753558|ref|NP_034114.1| cathepsin L1 preproprotein [Mus musculus]
gi|115742|sp|P06797.2|CATL1_MOUSE RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; AltName: Full=p39 cysteine proteinase;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|53047|emb|CAA29470.1| unnamed protein product [Mus musculus]
gi|309186|gb|AAA37445.1| preprocysteine proteinase [Mus musculus]
gi|12832050|dbj|BAB21945.1| unnamed protein product [Mus musculus]
gi|26340196|dbj|BAC33761.1| unnamed protein product [Mus musculus]
gi|45768760|gb|AAH68163.1| Cathepsin L [Mus musculus]
gi|74139700|dbj|BAE31701.1| unnamed protein product [Mus musculus]
gi|74146632|dbj|BAE41323.1| unnamed protein product [Mus musculus]
gi|74151584|dbj|BAE41141.1| unnamed protein product [Mus musculus]
gi|74185397|dbj|BAE30172.1| unnamed protein product [Mus musculus]
gi|74197196|dbj|BAE35143.1| unnamed protein product [Mus musculus]
gi|74203006|dbj|BAE26206.1| unnamed protein product [Mus musculus]
gi|74219606|dbj|BAE29572.1| unnamed protein product [Mus musculus]
gi|148684295|gb|EDL16242.1| cathepsin L [Mus musculus]
Length = 334
Score = 254 bits (648), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
QW + H R Y E+ R I+++N+ I+ N E G + + N F D+TNEEFR
Sbjct: 31 QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 89
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ GY R P K +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90 QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
+EG + GKLI LSEQ LVDCS N GC+GGLMD AF+YI EN GL +E YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y ++G+C + E AV A + + D+P+ E+AL++AV+ P+SV +DAS + FY S
Sbjct: 204 YEAKDGSCKYRAEFAV-ANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261
Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
G+ +C + N DHGV +VG+G + N KYWL+KNSWG WG GYI+I +D
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321
Query: 334 CGIATAASYPV 344
CG+ATAASYPV
Sbjct: 322 CGLATAASYPV 332
>gi|391338876|ref|XP_003743781.1| PREDICTED: cathepsin L-like isoform 4 [Metaseiulus occidentalis]
Length = 336
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 203/340 (59%), Gaps = 21/340 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
F+I+ +++ AS ++ + + + + + H + Y+ +A R IF QN I
Sbjct: 8 FLILAVLVGAASAALTLEQLFDA----EWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 63
Query: 74 EKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ N +G TYKL N+F D+ + EF + G R S ++ ST+
Sbjct: 64 ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR-----SNRTYFGSTWIEPESVS 118
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P S+DWREKGAVT +K+QG CGSCW+FS A+EG G+L+ LSEQ L+DCST
Sbjct: 119 LPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSY 178
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC GGLMD AF YI EN G+ TE YPY ++G C KE + A + + D+P G+
Sbjct: 179 GNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDS-AGRDTGFVDIPSGN 237
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENG 305
E+AL +A++ PVSV +DAS +F FY GV N DC ++ DHGV VG+GT ++ G
Sbjct: 238 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDD--G 295
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
Y++IKNSWGE WG+ GY+ + R++ CG+AT ASYP+
Sbjct: 296 QDYYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 335
>gi|327289213|ref|XP_003229319.1| PREDICTED: cathepsin S-like [Anolis carolinensis]
Length = 333
Score = 253 bits (647), Expect = 6e-65, Method: Compositional matrix adjust.
Identities = 139/318 (43%), Positives = 202/318 (63%), Gaps = 18/318 (5%)
Query: 37 SIVEKH-EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFS 92
S+++ H E W ++ + Y+++ E+ +R I+++NL ++ N E G +Y+LG N
Sbjct: 23 SMLDGHWELWKKKYNKEYQNKEEEGVRRVIWEKNLRFVMLHNLEQSLGLHSYELGMNHLG 82
Query: 93 DLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQC 152
D+T+EE AL TG PV QS + + + P ++DWREKG VT++K+QG C
Sbjct: 83 DMTSEEVTALMTGLKIPVS----QSRNSTLYWARQGASAPDTVDWREKGCVTNVKNQGSC 138
Query: 153 GSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST--DNHGCSGGLMDKAFEYIIENKG 210
GSCWAFSAV A+E ++ G L+ LS Q LVDCS+ NHGC+GG + AF+Y+I N G
Sbjct: 139 GSCWAFSAVGALECQLKLKTGNLVSLSPQNLVDCSSAFGNHGCNGGYISAAFQYVIYNNG 198
Query: 211 LATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASG 269
+ +EA YPY + GTC + AAT S+Y DLP G+E AL AV+N PVSV +DAS
Sbjct: 199 IDSEASYPYTGQSGTC-RYNLQGRAATCSRYVDLPSGNEAALKDAVANFGPVSVAIDASR 257
Query: 270 RAFHFYKSGVL-NADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRI 327
+F ++ GV + C + + +HGV VVG+GT E+G YWL+KNSWG ++G+ GYI+I
Sbjct: 258 PSFFLFRKGVYDDPSCTSAHINHGVLVVGYGT---EDGIDYWLVKNSWGVSFGDQGYIKI 314
Query: 328 LRDA-GLCGIATAASYPV 344
R+ CGIA+ +YP+
Sbjct: 315 ARNHDNRCGIASQCTYPL 332
>gi|391338870|ref|XP_003743778.1| PREDICTED: cathepsin L-like isoform 1 [Metaseiulus occidentalis]
gi|391338872|ref|XP_003743779.1| PREDICTED: cathepsin L-like isoform 2 [Metaseiulus occidentalis]
gi|391338874|ref|XP_003743780.1| PREDICTED: cathepsin L-like isoform 3 [Metaseiulus occidentalis]
Length = 331
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 203/340 (59%), Gaps = 21/340 (6%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
F+I+ +++ AS ++ + + + + + H + Y+ +A R IF QN I
Sbjct: 3 FLILAVLVGAASAALTLEQLFDA----EWQNFKVHHNKKYEGSTVEAFRKKIFLQNTHLI 58
Query: 74 EKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD 130
+ N +G TYKL N+F D+ + EF + G R S ++ ST+
Sbjct: 59 ARHNIKHAKGETTYKLKMNQFGDMLHHEFVSTMNGLLR-----SNRTYFGSTWIEPESVS 113
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P S+DWREKGAVT +K+QG CGSCW+FS A+EG G+L+ LSEQ L+DCST
Sbjct: 114 LPKSVDWREKGAVTPVKNQGHCGSCWSFSTTGALEGQLFRKTGELVSLSEQNLIDCSTSY 173
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC GGLMD AF YI EN G+ TE YPY ++G C KE + A + + D+P G+
Sbjct: 174 GNNGCGGGLMDNAFTYIKENHGIDTEESYPYEGKQGKCRYHKEDS-AGRDTGFVDIPSGN 232
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADC-GNNCDHGVAVVGFGTAEEENG 305
E+AL +A++ PVSV +DAS +F FY GV N DC ++ DHGV VG+GT ++ G
Sbjct: 233 ERALAKALATIGPVSVAIDASHESFQFYHEGVYNPPDCDSHSLDHGVLAVGYGTTDD--G 290
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
Y++IKNSWGE WG+ GY+ + R++ CG+AT ASYP+
Sbjct: 291 QDYYIIKNSWGERWGQEGYVLMARNSKNECGVATQASYPL 330
>gi|6630972|gb|AAF19630.1|AF194426_1 cysteine proteinase precursor [Myxine glutinosa]
Length = 324
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 139/311 (44%), Positives = 195/311 (62%), Gaps = 16/311 (5%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEF 99
E W ++G++Y E+ +R +++ NL+ +++ N +G Y+LG N ++DL NEEF
Sbjct: 20 ESWKGKYGKSYLGRGEEVLRKRVWESNLQIVQQHNVLADQGQANYRLGMNTYADLYNEEF 79
Query: 100 RALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFS 159
AL + + QSS TFK +P+S+DWR +G VT +KDQGQCGSCW+FS
Sbjct: 80 MALKG--SSGILQAKDQSS-TQTFKPLVGVTLPSSVDWRNQGYVTPVKDQGQCGSCWSFS 136
Query: 160 AVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADY 217
A ++EG G L+ LSEQQLVDCS N+GCSGGLM+ A++YI + G+ E+ Y
Sbjct: 137 ATGSLEGQHFAKTGTLVSLSEQQLVDCSWSYGNYGCSGGLMESAYDYIRDAGGVQLESAY 196
Query: 218 PYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYK 276
PY + G C + KAV AT + + +P GDEQ+L+QAV PV+V +DASG F Y+
Sbjct: 197 PYTAQNGRCHFDQSKAV-ATCTGHVAIPSGDEQSLMQAVGTVGPVAVAIDASGYDFQLYE 255
Query: 277 SGVLN-ADC-GNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD-AGL 333
SGV + + C ++ DHGV G+GT E G YWL+KNSWG WG GYI++ R+ +
Sbjct: 256 SGVYDRSRCSSSSLDHGVLAAGYGT---EGGNDYWLVKNSWGPGWGAQGYIKMSRNKSNQ 312
Query: 334 CGIATAASYPV 344
CGIAT A YP+
Sbjct: 313 CGIATMACYPL 323
>gi|262410743|gb|ACY66807.1| cathepsin L [Aphis gossypii]
Length = 341
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 144/342 (42%), Positives = 201/342 (58%), Gaps = 17/342 (4%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
VI++ ++ A VS +++E I E+ + AQ + Y+D E+A R ++ N I
Sbjct: 4 VIVLGLVVFAISSVSSINLNE-VIEEEWSLFKAQFKKIYEDVKEEAFRKKVYLDNKLKIA 62
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPST---FKYQNV 128
+ NK G TY L N F DL E++ + G+ + + + K +NV
Sbjct: 63 RHNKLYETGEETYALEMNHFGDLMQHEYKKMMNGFKPSLAGGDKNFTDDDAVTFLKSENV 122
Query: 129 TDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST 188
VP +IDWR+KG VT +K+QGQCGSCW+FSA ++EG G L+ LSEQ L+DCS
Sbjct: 123 V-VPKAIDWRKKGYVTPVKNQGQCGSCWSFSATGSLEGQHFRKTGVLVSLSEQNLIDCSR 181
Query: 189 D--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPK 246
N+GC GGLMD AF+YI NKGL TE YPY E+ C E + AT + D+P+
Sbjct: 182 KYGNNGCEGGLMDLAFKYIKSNKGLDTEKSYPYEAEDDKCRYNPENS-GATDKGFVDIPE 240
Query: 247 GDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGNN-CDHGVAVVGFGTAEEE 303
GDE AL+ A++ PVS+ +DAS F FYK GV N C + DHGV VG+GT +
Sbjct: 241 GDEDALMHALATVGPVSIAIDASSEKFQFYKKGVFYNPRCSSTELDHGVLAVGYGT--DH 298
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
G YW++KNSWG+TWG+ GYI + R+ CG+A++ASYP+
Sbjct: 299 KGGDYWIVKNSWGKTWGDQGYIMMARNKKNNCGVASSASYPL 340
>gi|261289785|ref|XP_002611754.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
gi|229297126|gb|EEN67764.1| hypothetical protein BRAFLDRAFT_284341 [Branchiostoma floridae]
Length = 327
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 148/346 (42%), Positives = 200/346 (57%), Gaps = 37/346 (10%)
Query: 14 FVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYI 73
F+I++L +T A+ ++ + E + HG+ YK E+ +R IF+ N + I
Sbjct: 3 FLILVLSVTMAT-----------AMDVEWEAFKLTHGKQYKSPDEENVRRAIFRDNNQMI 51
Query: 74 EKANKE---GNRTYKLGTNEFSDLTNEEFRALYTG-----YNRPVPSVSRQSSRPSTFKY 125
++ N+E G R+Y +G N+F DL + E+ L G N PS + S P
Sbjct: 52 KEHNQEAAMGRRSYFMGMNQFGDLAHSEYLELVVGPGLLPLNLSTPSENVFESTPGL--- 108
Query: 126 QNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVD 185
V ++DWR+KGAVT IKDQG CGSCWAFS ++EG + GKL+ LSEQ L+D
Sbjct: 109 ----QVDDTVDWRQKGAVTPIKDQGHCGSCWAFSTTGSLEGQHFMKTGKLVSLSEQNLLD 164
Query: 186 CST--DNHGCSGGLMDKAFEYIIENKGLATEADYPYR-HEEGTCDNQKEKAVAATISKYE 242
CS N GC GGLMD+AF YI N G+ TE YPY +E CD K AT+S Y
Sbjct: 165 CSRRFGNKGCEGGLMDQAFRYIKSNGGIDTEECYPYMAKDEKVCD-YKTSCSGATLSSYT 223
Query: 243 DLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGT 299
D+ DE AL+QAV PVSV +DAS ++ FYKSG+ + +C DHGV VG+G+
Sbjct: 224 DIKAMDEMALMQAVGTVGPVSVAIDASHKSLRFYKSGIYDEPECSRTKLDHGVLAVGYGS 283
Query: 300 AEEENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+ G YWL+KNSWG WG+ GY+++ R+ CGIAT ASYPV
Sbjct: 284 MD---GMDYWLVKNSWGSAWGDMGYVKMTRNKNNQCGIATKASYPV 326
>gi|74149661|dbj|BAE36450.1| unnamed protein product [Mus musculus]
Length = 334
Score = 253 bits (647), Expect = 7e-65, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
QW + H R Y E+ R I+++N+ I+ N E G + + N F D+TNEEFR
Sbjct: 31 QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 89
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ GY R P K +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90 QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
+EG + GKLI LSEQ LVDCS N GC+GGLMD AF+YI EN GL +E YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y ++G+C + E AVA + + D+P+ E+AL++AV+ P+SV +DAS + FY S
Sbjct: 204 YEAKDGSCKYRAEFAVANG-TGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261
Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
G+ +C + N DHGV +VG+G + N KYWL+KNSWG WG GYI+I +D
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321
Query: 334 CGIATAASYPV 344
CG+ATAASYPV
Sbjct: 322 CGLATAASYPV 332
>gi|254674508|dbj|BAH86062.1| cysteine protease [Haemaphysalis longicornis]
Length = 333
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 199/340 (58%), Gaps = 19/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ L C + + + + + E + +QH + Y +E+ +R IF +N + K
Sbjct: 1 MLRLAFLCGCVAAAIAASSQEILRTEWEAFKSQHNKAYSSHVEELLRFKIFTENTLLVAK 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKL N+F DL EF + GY + RP+ N+ D
Sbjct: 61 HNAKYAKGLVSYKLAMNKFGDLLPHEFAKMVNGYR----GKQNKEQRPTFIPPANLNDSS 116
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+PT++DWR+KGAVT +K+QGQCGSCWAFS ++EG GKL+ LSEQ LVDCS D
Sbjct: 117 LPTTVDWRKKGAVTPVKNQGQCGSCWAFSTTGSLEGQHFRKTGKLVSLSEQNLVDCSDDF 176
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N GC+GGLMD F+YI N G+ TE +PY ++G C +K V AT + + D+ +G
Sbjct: 177 GNQGCNGGLMDNGFQYIKANGGIDTEESHPYTAQDGDCKFKKAD-VGATDAGFVDIQQGS 235
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGNN-CDHGVAVVGFGTAEEENG 305
E L +AV+ PVSV +DAS +F Y GV + DC ++ DHGV VG+G +NG
Sbjct: 236 EDDLKKAVATVGPVSVAIDASHGSFQLYSQGVYDEPDCSSSQLDHGVLTVGYGV---KNG 292
Query: 306 AKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
KYWL+KNSWG WG++GYI + RD CGIA++ASYP+
Sbjct: 293 KKYWLVKNSWGGDWGDNGYILMSRDKDNQCGIASSASYPL 332
>gi|4886998|gb|AAD32136.1|AF121837_1 cathepsin L [Mus musculus]
gi|4887000|gb|AAD32137.1|AF121838_1 cathepsin L [Mus musculus]
gi|4887002|gb|AAD32138.1|AF121839_1 cathepsin L [Mus musculus]
gi|200501|gb|AAA39984.1| preprocathepsin L precursor [Mus musculus]
Length = 334
Score = 253 bits (647), Expect = 8e-65, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
QW + H R Y E+ R I+++N+ I+ N E G + + N F D+TNEEFR
Sbjct: 31 QWKSTHRRLYGTN-EEEWRRAIWEKNMRIIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 89
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ GY R P K +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90 QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
+EG + GKLI LSEQ LVDCS N GC+GGLMD AF+YI EN GL +E YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y ++G+C + E AV A + + D+P+ E+AL++AV+ P+SV +DAS + FY S
Sbjct: 204 YEAKDGSCKYRAEFAV-ANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261
Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
G+ +C + N DHGV +VG+G + N KYWL+KNSWG WG GYI+I +D
Sbjct: 262 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 321
Query: 334 CGIATAASYPV 344
CG+ATAASYPV
Sbjct: 322 CGLATAASYPV 332
>gi|156371477|ref|XP_001628790.1| predicted protein [Nematostella vectensis]
gi|156215775|gb|EDO36727.1| predicted protein [Nematostella vectensis]
Length = 330
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 147/343 (42%), Positives = 197/343 (57%), Gaps = 23/343 (6%)
Query: 11 IPMFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQ---HGRTYKDELEKAMRLNIFK 67
+ + V L+ AS V E +QW A H + Y E+ R I++
Sbjct: 1 MKLLVAACLLFAVASGFV-------VKFDEDEQQWQAWKLFHTKKYTTVTEEGARKAIWR 53
Query: 68 QNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQN 127
NL+ I+K N EG+ ++ L N DLT +EFR YTG + +++ + S F +
Sbjct: 54 DNLKKIQKHNAEGH-SFTLAMNHLGDLTQDEFRYFYTGMRSHYSNYTKK--QGSAFLAPS 110
Query: 128 VTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS 187
VP ++DWR++G VT +K+QGQCGSCWAFS ++EG GKL+ LSEQ LVDCS
Sbjct: 111 HVQVPDTVDWRKEGYVTPVKNQGQCGSCWAFSTTGSLEGQNFKKTGKLVSLSEQNLVDCS 170
Query: 188 T--DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLP 245
T N+GC GGLMD AF+YI EN G+ TE YPY C QK + A + + D+
Sbjct: 171 TAYGNNGCQGGLMDYAFKYIKENGGIDTEESYPYEARNDRCRFQKSN-IGAVDTGFVDVT 229
Query: 246 KGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVL-NADCGN-NCDHGVAVVGFGTAEE 302
GDE+AL A P+SV +DA +F FY SGV NA C + + DHGV VVG+GT +
Sbjct: 230 HGDEEALKTAAGTVGPISVAIDAGHMSFQFYHSGVYNNAGCSSTSLDHGVLVVGYGTYQ- 288
Query: 303 ENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
G+ YWL+KNSWGE WG GYI + R+ CG+AT ASYP+
Sbjct: 289 --GSDYWLVKNSWGERWGMEGYIMMSRNKNNQCGVATQASYPL 329
>gi|74200292|dbj|BAE22939.1| unnamed protein product [Mus musculus]
Length = 308
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 144/311 (46%), Positives = 189/311 (60%), Gaps = 19/311 (6%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
QW + H R Y E+ R I+++N+ I+ N E G + + N F D+TNEEFR
Sbjct: 5 QWKSTHRRLYGTN-EEEWRRAIWEKNMRMIQLHNGEYSNGQHGFSMEMNAFGDMTNEEFR 63
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ GY R P K +P S+DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 64 QVVNGYRHQKHKKGRLFQEPLMLK------IPKSVDWREKGCVTPVKNQGQCGSCWAFSA 117
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCS--TDNHGCSGGLMDKAFEYIIENKGLATEADYP 218
+EG + GKLI LSEQ LVDCS N GC+GGLMD AF+YI EN GL +E YP
Sbjct: 118 SGCLEGQMFLKTGKLISLSEQNLVDCSHAQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 177
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y ++G+C + E AV A + + D+P+ E+AL++AV+ P+SV +DAS + FY S
Sbjct: 178 YEAKDGSCKYRAEFAV-ANDTGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 235
Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
G+ +C + N DHGV +VG+G + N KYWL+KNSWG WG GYI+I +D
Sbjct: 236 GIYYEPNCSSKNLDHGVLLVGYGYEGTDSNKNKYWLVKNSWGSEWGMEGYIKIAKDRDNH 295
Query: 334 CGIATAASYPV 344
CG+ATAASYPV
Sbjct: 296 CGLATAASYPV 306
>gi|384941728|gb|AFI34469.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 200/342 (58%), Gaps = 19/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + ++L C + S + ++ K QW A H R Y E+ R ++++N++
Sbjct: 1 MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G + + N F D+TNEEFR + + +++ + F+
Sbjct: 59 IELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFR------NQKLRKGKLFREPLFL 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
D+P S+DWR+KG VT +K+Q QCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GG M+ AF Y+ EN GL +E YPY +G C + E +V A + +E +P G
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRSENSV-ANDTGFEVVPAG 231
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ P+SV +DA +F FYKSG+ DC + N DHGV VVG+G
Sbjct: 232 KEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANS 291
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+ KYWL+KNSWG WG +GY++I +D CGIATAASYP
Sbjct: 292 DNNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPT 333
>gi|402770509|gb|AFQ98389.1| cathepsin L [Rhipicephalus microplus]
Length = 332
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 141/340 (41%), Positives = 205/340 (60%), Gaps = 20/340 (5%)
Query: 16 IIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEK 75
++ L + CA V+ + + + + E + H +TY+ +E+ +R IF ++ I +
Sbjct: 1 MLRLSVLCAIAAVTVAASSQEILRTQWEAFKTTHKKTYQSHMEELLRFKIFTESSLIIAR 60
Query: 76 ANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTD-- 130
N + G +YKLG N+F DL EF ++ G++ +R++ + NV D
Sbjct: 61 HNAKYAKGLVSYKLGMNQFGDLLAHEFARIFNGHHG-----TRKTGGSTFLPPANVNDSS 115
Query: 131 VPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD- 189
+P ++DWR+KGAVT +KDQGQCGSCWAFSA ++EG + G+L+ LSEQ LVDCS
Sbjct: 116 LPKAVDWRKKGAVTPVKDQGQCGSCWAFSATGSLEGQHFLKNGELVSLSEQNLVDCSQSF 175
Query: 190 -NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGD 248
N+GC GGLM+ AF+YI N G+ TE YPY +G C +KE V AT + Y ++ G
Sbjct: 176 GNNGCEGGLMEDAFKYIKANDGIDTEKSYPYEAVDGECRFKKED-VGATDTGYVEIKAGS 234
Query: 249 EQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENG 305
E L +AV+ P+SV +DAS +F Y GV + +C + + DHGV VVG+G + G
Sbjct: 235 EDDLKKAVATVGPISVAIDASHSSFQLYSEGVYDEPECSSEDLDHGVLVVGYGV---KGG 291
Query: 306 AKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
KYWL+KNSW E+WG+ GYI + RD CGIA+ ASYP+
Sbjct: 292 KKYWLVKNSWAESWGDQGYILMSRDNNNQCGIASQASYPL 331
>gi|164420679|ref|NP_001037464.2| fibroinase precursor [Bombyx mori]
gi|40556818|gb|AAR87763.1| fibroinase precursor [Bombyx mori]
Length = 341
Score = 253 bits (646), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 142/344 (41%), Positives = 204/344 (59%), Gaps = 24/344 (6%)
Query: 18 ILVITCASQVVSGRSMHEPSIVEKHEQWMA---QHGRTYKDELEKAMRLNIFKQNLEYIE 74
++++ CA VS + +V+ E+W A QH Y+ E+E R+ I+ ++ I
Sbjct: 4 LVLLLCAVAAVSAVQFFD--LVK--EEWSAFKLQHRLNYESEVEDNFRMKIYAEHKHIIA 59
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPS-----VSRQSSRPSTFKYQ 126
K N++ G +YKLG N++ D+ + EF G+N+ + S R + F
Sbjct: 60 KHNQKYEMGLVSYKLGMNKYGDMLHHEFVKTMNGFNKTAKHNKNLYMKGGSVRGAKFISP 119
Query: 127 NVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDC 186
+P +DWR+ GAVT IKDQG+CGSCW+FS A+EG G L+ LSEQ L+DC
Sbjct: 120 ANVKLPEQVDWRKHGAVTDIKDQGKCGSCWSFSTTGALEGQHFRQSGYLVSLSEQNLIDC 179
Query: 187 STD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDL 244
S N+GC+GGLMD AF+YI +N G+ TE YPY + C + A + + D+
Sbjct: 180 SEQYGNNGCNGGLMDNAFKYIKDNGGIDTEQTYPYEGVDDKCRYNPKNTGAEDVG-FVDI 238
Query: 245 PKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKSGVLNAD--CGNNCDHGVAVVGFGTAE 301
P+GDEQ L++AV+ PVSV +DAS +F Y SGV N + + DHGV VVG+GT
Sbjct: 239 PEGDEQKLMEAVATVGPVSVAIDASHTSFQLYSSGVYNEEECSSTDLDHGVLVVGYGT-- 296
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+E G YWL+KNSWG +WGE GYI+++R+ CGIA++ASYP+
Sbjct: 297 DEQGVDYWLVKNSWGRSWGELGYIKMIRNKNNRCGIASSASYPL 340
>gi|348546019|ref|XP_003460476.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
gi|348546143|ref|XP_003460538.1| PREDICTED: cathepsin L-like [Oreochromis niloticus]
Length = 334
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/315 (45%), Positives = 195/315 (61%), Gaps = 25/315 (7%)
Query: 45 WMAQHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRA 101
W + R+Y E+A R I+ N +++ N +G ++Y+LG F+D+ NEE++
Sbjct: 29 WKLKFERSYHSPSEEAHRRQIWLNNRKFVLVHNILADQGLKSYRLGMTYFADMENEEYKR 88
Query: 102 LYT-----GYNRPVPSVSRQSSRPSTF-KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSC 155
+ + +N +P R STF + TD+P ++DWR+KG VT +KDQ QCGSC
Sbjct: 89 VISQGCLHSFNASLPR------RGSTFFRLPEGTDLPDAVDWRDKGYVTDVKDQKQCGSC 142
Query: 156 WAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLAT 213
WAFSA ++EG G L+ LSEQQLVDCS D N GC GGLMD AF+YI N G+ T
Sbjct: 143 WAFSATGSLEGQHFRKTGTLVSLSEQQLVDCSGDYGNMGCMGGLMDYAFQYIQANGGIDT 202
Query: 214 EADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAF 272
E YPY E G C + + AT + Y ++ +GDE AL +AV+ P+SV +DAS +F
Sbjct: 203 EESYPYEAENGKCRYNPDN-IGATSTGYTEVSQGDEDALKEAVATIGPISVGIDASQMSF 261
Query: 273 HFYKSGVLN-ADCGN-NCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRD 330
FY+SGV N DC + DHGV VG+GT E+G YWL+KNSWG WG+ GYI++ R+
Sbjct: 262 QFYESGVYNEPDCSSLELDHGVLAVGYGT---EDGNDYWLVKNSWGLEWGDKGYIKMSRN 318
Query: 331 -AGLCGIATAASYPV 344
+ CGIATAASYP+
Sbjct: 319 KSNQCGIATAASYPL 333
>gi|297845822|ref|XP_002890792.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
gi|297336634|gb|EFH67051.1| hypothetical protein ARALYDRAFT_473117 [Arabidopsis lyrata subsp.
lyrata]
Length = 322
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 196/329 (59%), Gaps = 37/329 (11%)
Query: 25 SQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKEGNRTY 84
SQ +++E SIV+ H+QWM Q R Y+DE EK MRL +FK+NL++IE N GN++Y
Sbjct: 21 SQARPHVTLNEQSIVDYHQQWMTQFSRVYQDESEKEMRLQVFKKNLKFIENFNNMGNQSY 80
Query: 85 KLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPT---SIDWREKG 141
+G NEF+D T EEF A +TG V ++S + + N++D+ S DWR++G
Sbjct: 81 TVGVNEFTDWTIEEFLATHTGLRVNVTTLSELFNETMPSRNWNISDIDIDDESKDWRDEG 140
Query: 142 AVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-NHGCSGGLMDK 200
AV +K QG C G+T+I+ L+ LSEQQL+DC T+ N GC GG +++
Sbjct: 141 AVIPVKVQGAC-------------GLTKISGKNLLTLSEQQLIDCDTEKNTGCDGGGIEE 187
Query: 201 AFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQP 260
AF+YII+N G++ E +YPY+ ++G+C A I +E +P +E+ALL+AV QP
Sbjct: 188 AFKYIIKNGGVSLETEYPYQVKKGSCRANARSATQTQIRGFEMVPSHNERALLEAVRRQP 247
Query: 261 VSVCVDASGRAFHFYKSGVL-NADCGNNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETW 319
VSV +DA +F YK GV DCG + +H V VG+GT ++W
Sbjct: 248 VSVLIDARADSFKTYKGGVYAGLDCGTDVNHAVTFVGYGTMI---------------QSW 292
Query: 320 GESGYIRILRDA----GLCGIATAASYPV 344
GE+GY+RI RD G+CGIA A+YP+
Sbjct: 293 GENGYMRIRRDVEWPQGMCGIAQVAAYPI 321
>gi|402898110|ref|XP_003912074.1| PREDICTED: cathepsin L2 [Papio anubis]
Length = 334
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 200/342 (58%), Gaps = 19/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + ++L C + S + ++ K QW A H R Y E+ R ++++N++
Sbjct: 1 MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G + + N F D+TNEEFR + + +++ + F+
Sbjct: 59 IELHNGEYSQGKHGFTMAMNAFGDMTNEEFRQVMGCFR------NQKLRKGKLFREPLFL 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
D+P S+DWR+KG VT +K+Q QCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSRP 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GG M+ AF Y+ EN GL +E YPY +G C + E +V A + +E +P G
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSV-ANDTGFEVVPAG 231
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ P+SV +DA +F FYKSG+ DC + N DHGV VVG+G
Sbjct: 232 KEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANS 291
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+ KYWL+KNSWG WG +GY++I +D CGIATAASYP
Sbjct: 292 DNNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPT 333
>gi|313118772|gb|ADR32298.1| C14 cysteine protease [Solanum demissum]
Length = 217
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 117/218 (53%), Positives = 148/218 (67%), Gaps = 8/218 (3%)
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-N 190
P S+DWR+KG + +KDQG CGSCWAFSAVAA+E I I G LI LSEQ+LVDC N
Sbjct: 2 PVSVDWRDKGVLVGVKDQGSCGSCWAFSAVAAMESINAIVTGDLISLSEQELVDCDKSYN 61
Query: 191 HGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQ 250
GC GGLMD AFE++I N G+ TE DYPY+ CD ++ A I YED+P +E+
Sbjct: 62 QGCDGGLMDYAFEFVINNGGIDTEEDYPYKERNDVCDQYRKNAKVVKIDSYEDVPVNNEK 121
Query: 251 ALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEENGAKYWL 310
AL +AV++QPVS+ ++A GR F YKSG+ CG DHGV G+GT ENG YW+
Sbjct: 122 ALQKAVAHQPVSIALEAGGRDFQHYKSGIFTGKCGTAVDHGVVAAGYGT---ENGMDYWI 178
Query: 311 IKNSWGETWGESGYIRILRD----AGLCGIATAASYPV 344
++NSWG WGE GY+R+ R+ +GLCG+AT SYPV
Sbjct: 179 VRNSWGAKWGEKGYLRVQRNIASSSGLCGLATEPSYPV 216
>gi|118429523|gb|ABK91809.1| cathepsin L-like proteinase precursor [Clonorchis sinensis]
Length = 373
Score = 253 bits (646), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 151/344 (43%), Positives = 200/344 (58%), Gaps = 33/344 (9%)
Query: 24 ASQVVSGRSMHEPSIV---------EKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
AS + S SMH ++ + +M + R Y D E R IF N I
Sbjct: 39 ASPLTSLDSMHMQDVIGVDWNFTLSSIWKHFMTTYKRNYIDPSEHERRFKIFANNFVRIS 98
Query: 75 KANK---EGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
K N +G +Y +G NEFSD T+EE + L R + SR S+ T
Sbjct: 99 KHNVRFIQGQVSYTMGINEFSDKTDEELKRLRCF--RGSLNASRDGSKYITI----AAPP 152
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
P+ IDWR KGAVT +K+QG CGSCWAFSA A+EG + G L+ LSEQQLVDCS++
Sbjct: 153 PSEIDWRNKGAVTPVKNQGNCGSCWAFSATGAIEGQNFLATGNLVSLSEQQLVDCSSEYG 212
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEG-----TCDNQKEKAVAATISKYEDL 244
N+ C+GGLMD AF+Y+ ++ G+ TEA YPY E TC ++AV ++ Y DL
Sbjct: 213 NNACNGGLMDNAFKYVKDSNGIDTEASYPYVSGETGDANPTCRFNLKEAV-VRVTGYIDL 271
Query: 245 PKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSGVLNAD--CGNNCDHGVAVVGFGTAE 301
P+G L QAV + P+SV ++A +F YKSGV + D ++ DHGV +VG+G
Sbjct: 272 PRGQVSELKQAVGHYGPISVAINAGLPSFMSYKSGVYSDDQCSSDDLDHGVLLVGYG--- 328
Query: 302 EENGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
EENG YWLIKNSWG WGE+GY++ILRD LCG+A+ ASYP+
Sbjct: 329 EENGIPYWLIKNSWGPHWGENGYVKILRDHNNLCGVASMASYPL 372
>gi|23306947|dbj|BAC16538.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 146/342 (42%), Positives = 202/342 (59%), Gaps = 17/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M+V + + C S V++ S + + + W H + Y E E+ R ++++NL
Sbjct: 1 MYVAAVFTL-CLSAVLAAPSF-DRELDDHWNHWKNFHTKKYH-EKEEGWRRVVWEKNLRK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +Y+LG N F D+T+EEFR + GY + + + S F N
Sbjct: 58 IEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYKHK----AERRVKGSLFMEPNFI 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCST- 188
+ P ID+R+ G T +KDQGQCGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 114 EAPKKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRP 173
Query: 189 -DNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+YI +N GL TE YPY + + K AA + + D+P+G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPYLGTDDQDCHYDPKYSAANDTGFVDIPEG 233
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNN-CDHGVAVVGFG-TAEEE 303
E+AL++AV+ PVSV +DA +F FY SG+ +C + DHGV VVG+G E+
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHESFQFYHSGIYFEKECSSTELDHGVLVVGYGFEGEDV 293
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+G KYW++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 294 DGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPL 335
>gi|34850847|dbj|BAC87861.1| cathepsin L [Engraulis japonicus]
Length = 336
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 146/342 (42%), Positives = 202/342 (59%), Gaps = 17/342 (4%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M+V + + C S V++ S + + + W + H + Y E E+ R ++++NL
Sbjct: 1 MYVAAVFTL-CLSAVLAAPSF-DRELDDHWNHWKSFHTKKYH-EKEEGWRRVVWEKNLRK 57
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G +Y+LG N F D+T+EEFR + GY + + + S F N
Sbjct: 58 IEMHNLEHSMGAHSYRLGMNHFGDMTHEEFRQVMNGYKHK----AERRVKGSLFMEPNFI 113
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD 189
+ P ID+R+ G T +KDQGQCGSCWAFS A+EG GKL+ LSEQ LVDCS
Sbjct: 114 EAPKKIDYRDLGYATPVKDQGQCGSCWAFSTTGAMEGQLFREGGKLVSLSEQNLVDCSRP 173
Query: 190 --NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD+AF+YI +N GL TE YPY + + K AA + + D+P+G
Sbjct: 174 EGNEGCNGGLMDQAFQYIKDNGGLDTEDAYPYLGTDDQDCHYDPKYSAANDTGFVDIPEG 233
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGNN-CDHGVAVVGFG-TAEEE 303
E+AL++AV+ PVSV +DA F FY SG+ +C + DHGV VVG+G E+
Sbjct: 234 KERALMKAVAAVGPVSVAIDAGHECFQFYHSGIYFEKECSSTELDHGVLVVGYGFEGEDV 293
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+G KYW++KNSW E WG+ GYI + +D CGIATAASYP+
Sbjct: 294 DGKKYWIVKNSWSEKWGDEGYIYMAKDRKNHCGIATAASYPL 335
>gi|355567966|gb|EHH24307.1| Cathepsin L2 [Macaca mulatta]
gi|355753494|gb|EHH57540.1| Cathepsin L2 [Macaca fascicularis]
gi|380790509|gb|AFE67130.1| cathepsin L2 preproprotein [Macaca mulatta]
Length = 334
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 200/342 (58%), Gaps = 19/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + ++L C + S + ++ K QW A H R Y E+ R ++++N++
Sbjct: 1 MNLSLVLAAFCLG-IASAVPKFDQNLDTKWYQWKATHRRLYGAS-EEGWRRAVWEKNMKM 58
Query: 73 IEKANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N E G + + N F D+TNEEFR + + +++ + F+
Sbjct: 59 IELHNGEYSQGKHGFAMAMNAFGDMTNEEFRQVMGCFR------NQKLRKGKLFREPLFL 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
D+P S+DWR+KG VT +K+Q QCGSCWAFSA A+EG GKL+ LSEQ LVDCS
Sbjct: 113 DLPKSVDWRKKGYVTPVKNQKQCGSCWAFSATGALEGQMFRKTGKLVSLSEQNLVDCSHP 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GG M+ AF Y+ EN GL +E YPY +G C + E +V A + +E +P G
Sbjct: 173 QGNQGCNGGFMNSAFRYVKENGGLDSEESYPYVAMDGICKYRPENSV-ANDTGFEVVPAG 231
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ P+SV +DA +F FYKSG+ DC + N DHGV VVG+G
Sbjct: 232 KEKALMKAVATVGPISVAMDAGHSSFQFYKSGIYFEPDCSSKNLDHGVLVVGYGFEGANS 291
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
+ KYWL+KNSWG WG +GY++I +D CGIATAASYP
Sbjct: 292 DNNKYWLVKNSWGPEWGSNGYVKIAKDKDNHCGIATAASYPT 333
>gi|4503155|ref|NP_001903.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|22202619|ref|NP_666023.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|384081592|ref|NP_001244900.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|384081594|ref|NP_001244901.1| cathepsin L1 isoform 1 preproprotein [Homo sapiens]
gi|332832229|ref|XP_003312197.1| PREDICTED: cathepsin L1 isoform 2 [Pan troglodytes]
gi|332832233|ref|XP_001137800.2| PREDICTED: cathepsin L1 isoform 1 [Pan troglodytes]
gi|397470218|ref|XP_003806728.1| PREDICTED: cathepsin L1 isoform 1 [Pan paniscus]
gi|397470220|ref|XP_003806729.1| PREDICTED: cathepsin L1 isoform 2 [Pan paniscus]
gi|397470222|ref|XP_003806730.1| PREDICTED: cathepsin L1 isoform 3 [Pan paniscus]
gi|410042824|ref|XP_003951515.1| PREDICTED: cathepsin L1 [Pan troglodytes]
gi|115741|sp|P07711.2|CATL1_HUMAN RecName: Full=Cathepsin L1; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Cathepsin L1 heavy
chain; Contains: RecName: Full=Cathepsin L1 light chain;
Flags: Precursor
gi|29715|emb|CAA30981.1| pro-(cathepsin L) [Homo sapiens]
gi|190418|gb|AAA66974.1| preprocathepsin L precursor [Homo sapiens]
gi|31873292|emb|CAD97637.1| hypothetical protein [Homo sapiens]
gi|48146223|emb|CAG33334.1| CTSL [Homo sapiens]
gi|119583135|gb|EAW62731.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583136|gb|EAW62732.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583137|gb|EAW62733.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583138|gb|EAW62734.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|119583140|gb|EAW62736.1| cathepsin L, isoform CRA_a [Homo sapiens]
gi|208965934|dbj|BAG72981.1| cathepsin L1 [synthetic construct]
gi|410303006|gb|JAA30103.1| cathepsin L1 [Pan troglodytes]
gi|410303008|gb|JAA30104.1| cathepsin L1 [Pan troglodytes]
gi|410303010|gb|JAA30105.1| cathepsin L1 [Pan troglodytes]
Length = 333
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 204/342 (59%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M +IL C + S + S+ + +W A H R Y E+ R ++++N++
Sbjct: 1 MNPTLILAAFCLG-IASATLTFDHSLEAQWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N +EG ++ + N F D+T+EEFR + G+ +R+ + F+
Sbjct: 59 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG G+LI LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGRLISLSEQNLVDCSGP 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD AF+Y+ +N GL +E YPY E +C + +V A + + D+PK
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSV-ANDTGFVDIPK- 230
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ P+SV +DA +F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ KYWL+KNSWGE WG GY+++ +D CGIA+AASYP
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT 332
>gi|290462225|gb|ADD24160.1| Cathepsin L [Lepeophtheirus salmonis]
Length = 334
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 142/339 (41%), Positives = 199/339 (58%), Gaps = 20/339 (5%)
Query: 15 VIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEYIE 74
++++ VI + VS + ++ E W H + Y +E+ +RL IF +N I
Sbjct: 6 ILLLSVIISTASAVSFFDV----VLSDWESWKLTHQKGYDSSVEEKLRLKIFMENSLRIS 61
Query: 75 KANKE---GNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVTDV 131
+ N E G TY + N + DL + EF A+ GY + +++ TF ++
Sbjct: 62 RHNAEAIQGRHTYFMKMNHYGDLLHHEFVAMVNGY-----IYNNKTTLGGTFIPSKNINL 116
Query: 132 PTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD-- 189
P +DWRE+GAVT +K+QGQCGSCW+FSA ++EG GKLI LSEQ LVDCS
Sbjct: 117 PEHVDWREEGAVTPVKNQGQCGSCWSFSATGSLEGQDFRKTGKLISLSEQNLVDCSRKYG 176
Query: 190 NHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDE 249
N+GC GGLMD AF+YI +N G+ TEA YPY +G C + + I + D+ KG E
Sbjct: 177 NNGCEGGLMDYAFKYIQDNNGIDTEASYPYEGIDGHCHYDPKNKGGSDIG-FVDIKKGSE 235
Query: 250 QALLQAVSN-QPVSVCVDASGRAFHFYKSGVLN-ADCG-NNCDHGVAVVGFGTAEEENGA 306
+ L +A++ P+SV +DAS +F FY GV + C N DHGV VG+GT +E G
Sbjct: 236 KDLQKALATVGPISVAIDASHMSFQFYSHGVYSEKKCSPENLDHGVLAVGYGT-DEVTGE 294
Query: 307 KYWLIKNSWGETWGESGYIRILRDA-GLCGIATAASYPV 344
YWL+KNSW E WGE GYI++ R+ +CGIA++ASYPV
Sbjct: 295 DYWLVKNSWSEKWGEDGYIKMARNKDNMCGIASSASYPV 333
>gi|157779038|gb|ABV71063.1| cathepsin L3 precursor [Schistosoma mansoni]
gi|360044915|emb|CCD82463.1| subfamily C1A unassigned peptidase (C01 family) [Schistosoma
mansoni]
Length = 370
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 195/311 (62%), Gaps = 21/311 (6%)
Query: 48 QHGRTYKDELEKAMRLNIFKQNLEYIEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYT 104
Q R Y E+ R IF N + + N +EG TYK+G NEF+D T+ E + L
Sbjct: 66 QFKRAYNGIHEETRRFFIFSANFVKMMEHNHAFQEGKVTYKMGVNEFTDKTDYELKKL-R 124
Query: 105 GYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAV 164
GY ++ + S TF T +P+ +DWR +GAVT +K+QGQCGSCWAFS A+
Sbjct: 125 GYKVTSGAIRHKGS---TFIRSEHTKLPSKVDWRREGAVTDVKNQGQCGSCWAFSTTGAI 181
Query: 165 EGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYPYRHE 222
EG +L+ LSEQQLVDCS N+GCSGGLM+ AFEY+ +N+G+ +E YPY
Sbjct: 182 EGQHYRKTNRLVNLSEQQLVDCSKSYGNNGCSGGLMNSAFEYVRDNEGIDSEISYPYVSG 241
Query: 223 EGTCDNQ---KEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGRAFHFYKSG 278
+GT +N+ + A ++ Y ++ +GDE+AL+ AV+ + PVSV ++A +F YKSG
Sbjct: 242 DGTENNRCLFNASNILAQVTGYVNIHEGDERALMDAVATKGPVSVAINAGLPSFSMYKSG 301
Query: 279 VL-NADCG---NNCDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
+ + DC + DHGV VVG+G EENG YWLIKNSWGE WGE GYI+I + + +
Sbjct: 302 IYSDTDCEGTLDALDHGVLVVGYG---EENGRSYWLIKNSWGEEWGEKGYIKISKGSHNM 358
Query: 334 CGIATAASYPV 344
CG+A+AASYP+
Sbjct: 359 CGVASAASYPL 369
>gi|395740610|ref|XP_002819972.2| PREDICTED: cathepsin L1 [Pongo abelii]
Length = 333
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 143/342 (41%), Positives = 203/342 (59%), Gaps = 20/342 (5%)
Query: 13 MFVIIILVITCASQVVSGRSMHEPSIVEKHEQWMAQHGRTYKDELEKAMRLNIFKQNLEY 72
M + L C + S + S+ + +W A H R Y E+ R ++++N++
Sbjct: 1 MNPTLFLAAFCLG-IASATLTFDHSLEARWTKWKAMHNRLYGMN-EEGWRRAVWEKNMKM 58
Query: 73 IEKAN---KEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTFKYQNVT 129
IE N +EG ++ + N F D+T+EEFR + G+ +R+ + F+
Sbjct: 59 IELHNQEYREGKHSFTMAMNAFGDMTSEEFRQVMNGFQ------NRKPRKGKVFQEPLFY 112
Query: 130 DVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQLVDCS-- 187
+ P S+DWREKG VT +K+QGQCGSCWAFSA A+EG GKLI LSEQ LVDCS
Sbjct: 113 EAPRSVDWREKGYVTPVKNQGQCGSCWAFSATGALEGQMFRKTGKLISLSEQNLVDCSGP 172
Query: 188 TDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKG 247
N GC+GGLMD AF+Y+ +N GL +E YPY E +C + +V A + + D+PK
Sbjct: 173 QGNEGCNGGLMDYAFQYVQDNGGLDSEESYPYEATEESCKYNPKYSV-ANDTGFVDIPK- 230
Query: 248 DEQALLQAVSN-QPVSVCVDASGRAFHFYKSGV-LNADCGN-NCDHGVAVVGFG-TAEEE 303
E+AL++AV+ P+SV +DA +F FYK G+ DC + + DHGV VVG+G + E
Sbjct: 231 QEKALMKAVATVGPISVAIDAGHESFLFYKEGIYFEPDCSSEDMDHGVLVVGYGFESTES 290
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRD-AGLCGIATAASYPV 344
+ KYWL+KNSWGE WG GY+++ +D CGIA+AASYP
Sbjct: 291 DNNKYWLVKNSWGEEWGMGGYVKMAKDRRNHCGIASAASYPT 332
>gi|530734|emb|CAA56914.1| cathepsin l [Nephrops norvegicus]
gi|1582620|prf||2119193A cathepsin L-related Cys protease
Length = 324
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 139/317 (43%), Positives = 192/317 (60%), Gaps = 29/317 (9%)
Query: 43 EQWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEF 99
E++ + GR Y D E+ RLN+F NL+YIE+ NK+ G TY L N+FSDLTN+EF
Sbjct: 21 EEFKGKFGRKYVDLEEERYRLNVFLDNLQYIEEFNKKYESGEVTYNLAINQFSDLTNDEF 80
Query: 100 RALYTGYN-----RPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGS 154
++ GY +PV + + P T T +DWR KG VTH+KDQGQCGS
Sbjct: 81 NSMMKGYKTSLRPKPVAVFTSTDAAPET----------TEVDWRTKGCVTHVKDQGQCGS 130
Query: 155 CWAFSAVAAVEGITQITRGKLIELSEQQLVDCSTD---NHGCSGGLMDKAFEYIIENKGL 211
CWAFSA ++EG + G+L+ L+EQQLVDC+ N GC+GG +++AF+YI N G+
Sbjct: 131 CWAFSATGSLEGQHFLKYGELVSLAEQQLVDCAGGIYYNQGCNGGWVNQAFKYIKANGGI 190
Query: 212 ATEADYPYRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSNQ-PVSVCVDASGR 270
TE+ YPY + TC +VAAT S + + +G E ++ +N P+SV +DA+ R
Sbjct: 191 DTESSYPYEARDNTC-RFNSNSVAATCSGFVSIAQGSESPEVRRTTNTGPISVAIDAAHR 249
Query: 271 AFHFYKSGV-LNADCGNN-CDHGVAVVGFGTAEEENGAKYWLIKNSWGETWGESGYIRIL 328
+F Y SGV C ++ DH V VG+G+ E G +WL+KNSWG +WG +GYI +
Sbjct: 250 SFQSYSSGVYYEPSCSSSQLDHAVLAVGYGS---EGGQDFWLVKNSWGTSWGSAGYINMA 306
Query: 329 RDA-GLCGIATAASYPV 344
R+ CGIAT ASYP
Sbjct: 307 RNRNNNCGIATDASYPT 323
>gi|115743|sp|P07154.2|CATL1_RAT RecName: Full=Cathepsin L1; AltName: Full=Cyclic protein 2;
Short=CP-2; AltName: Full=Major excreted protein;
Short=MEP; Contains: RecName: Full=Procathepsin L;
Contains: RecName: Full=Cathepsin L1 heavy chain;
Contains: RecName: Full=Cathepsin L1 light chain; Flags:
Precursor
gi|38648869|gb|AAH63175.1| Cathepsin L1 [Rattus norvegicus]
gi|149029152|gb|EDL84437.1| cathepsin L, isoform CRA_a [Rattus norvegicus]
gi|386267881|dbj|BAM14518.1| cathepsin L [Rattus norvegicus]
Length = 334
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 141/311 (45%), Positives = 191/311 (61%), Gaps = 19/311 (6%)
Query: 44 QWMAQHGRTYKDELEKAMRLNIFKQNLEYIEKANKE---GNRTYKLGTNEFSDLTNEEFR 100
QW + H R Y E+ R ++++N+ I+ N E G + + N F D+TNEEFR
Sbjct: 31 QWKSTHRRLYGTN-EEEWRRAVWEKNMRMIQLHNGEYSNGKHGFTMEMNAFGDMTNEEFR 89
Query: 101 ALYTGYNRPVPSVSRQSSRPSTFKYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSA 160
+ GY R P + +P ++DWREKG VT +K+QGQCGSCWAFSA
Sbjct: 90 QIVNGYRHQKHKKGRLFQEPLMLQ------IPKTVDWREKGCVTPVKNQGQCGSCWAFSA 143
Query: 161 VAAVEGITQITRGKLIELSEQQLVDCSTD--NHGCSGGLMDKAFEYIIENKGLATEADYP 218
+EG + GKLI LSEQ LVDCS D N GC+GGLMD AF+YI EN GL +E YP
Sbjct: 144 SGCLEGQMFLKTGKLISLSEQNLVDCSHDQGNQGCNGGLMDFAFQYIKENGGLDSEESYP 203
Query: 219 YRHEEGTCDNQKEKAVAATISKYEDLPKGDEQALLQAVSN-QPVSVCVDASGRAFHFYKS 277
Y ++G+C + E AVA + + D+P+ E+AL++AV+ P+SV +DAS + FY S
Sbjct: 204 YEAKDGSCKYRAEYAVAND-TGFVDIPQ-QEKALMKAVATVGPISVAMDASHPSLQFYSS 261
Query: 278 GV-LNADCGN-NCDHGVAVVGFG-TAEEENGAKYWLIKNSWGETWGESGYIRILRDA-GL 333
G+ +C + + DHGV VVG+G + N KYWL+KNSWG+ WG GYI+I +D
Sbjct: 262 GIYYEPNCSSKDLDHGVLVVGYGYEGTDSNKDKYWLVKNSWGKEWGMDGYIKIAKDRNNH 321
Query: 334 CGIATAASYPV 344
CG+ATAASYP+
Sbjct: 322 CGLATAASYPI 332
>gi|302763127|ref|XP_002964985.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
gi|300167218|gb|EFJ33823.1| hypothetical protein SELMODRAFT_406652 [Selaginella moellendorffii]
Length = 320
Score = 253 bits (645), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 136/329 (41%), Positives = 191/329 (58%), Gaps = 31/329 (9%)
Query: 8 SFIIPMFVIIILVITCASQVVSGRSMHEPS----IVEKHEQWMAQHGRTYKDELEKAMRL 63
S +I + +I+++V+ A ++ + E I E W A+HG++Y + EKA R+
Sbjct: 3 SNMIALILILLVVVGAAPFAIARPAALEDDRALEIKNMFEDWAAKHGKSYSSDWEKARRM 62
Query: 64 NIFKQNLEYIEKANKEGNRTYKLGTNEFSDLTNEEFRALYTGYNRPVPSVSRQSSRPSTF 123
IF L YIEK N N T+ LG N+FSDLTN EFRA Y G +P Q RP+
Sbjct: 63 TIFSDTLAYIEKHNALPNTTFTLGLNKFSDLTNAEFRANYVGKFKPP---RYQDRRPAKD 119
Query: 124 KYQNVTDVPTSIDWREKGAVTHIKDQGQCGSCWAFSAVAAVEGITQITRGKLIELSEQQL 183
+V+ +PTS+DWR++GAVT IKDQGQCGSCWAFSA+A++E + +L+ LSEQQL
Sbjct: 120 VDVDVSSLPTSLDWRQEGAVTPIKDQGQCGSCWAFSAIASIESAHFLATNQLVSLSEQQL 179
Query: 184 VDCSTDNHGCSGGLMDKAFEYIIENKGLATEADYPYRHEEGTCDNQKEKAVAATISKYED 243
+DC T + GC E YPY G+C+ K K A I+ +
Sbjct: 180 IDCDTVDEGCQ-------------------EEAYPYTGLAGSCNANKNK--VAEITGFNV 218
Query: 244 LPKGDEQALLQAVSNQPVSVCVDASGRAFHFYKSGVLNADCGNNCDHGVAVVGFGTAEEE 303
+ K AL++AVS PV+V + S + F Y+SG+L+ C N+ DH V V+G+GT E
Sbjct: 219 VTKDKADALMKAVSKTPVTVGICGSDQNFQNYRSGILSGQCCNSRDHVVLVIGYGT---E 275
Query: 304 NGAKYWLIKNSWGETWGESGYIRILRDAG 332
G YW+IKNSWG +WGE G+++I + G
Sbjct: 276 GGMPYWIIKNSWGTSWGEDGFMKIEKKDG 304
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.132 0.398
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,522,330,294
Number of Sequences: 23463169
Number of extensions: 233223774
Number of successful extensions: 619409
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 6734
Number of HSP's successfully gapped in prelim test: 905
Number of HSP's that attempted gapping in prelim test: 586823
Number of HSP's gapped (non-prelim): 10033
length of query: 346
length of database: 8,064,228,071
effective HSP length: 143
effective length of query: 203
effective length of database: 9,003,962,200
effective search space: 1827804326600
effective search space used: 1827804326600
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 77 (34.3 bits)